Does AutoFocus support stop word lists to exclude common words from indexing?
Although Lucene provides functionality for applying stop word lists, AutoFocus does not use it during indexing. Keeping every word in the index is what makes phrase searches possible. AutoFocus does contain stop word lists for about 20 languages, but they serve a different purpose. During indexing, AutoFocus uses a simple heuristic algorithm to determine a list of significant keywords for each document. These keywords serve as a short document summary and can also be used as search suggestions; the suggestions in the Keyword Search facet are derived from them. The stop word lists are applied in this step only, so that common words are never offered as a significant term or search suggestion.
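The split between the two uses can be illustrated with a minimal sketch. This is not AutoFocus code: the stop word list, the function names, and the frequency-based ranking are all assumptions chosen for illustration, and the actual heuristic is not described here. The point is only that the index receives every token, while the keyword step filters against the stop word list.

```python
import re
from collections import Counter

# Hypothetical, tiny stop word list for illustration only;
# AutoFocus ships its own lists for about 20 languages.
STOP_WORDS = {"the", "a", "of", "and", "to", "in", "is", "for"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def index_terms(text):
    """Terms sent to the full-text index: nothing is removed,
    so phrase queries containing common words can still match."""
    return tokenize(text)


def significant_keywords(text, limit=3):
    """Most frequent terms that are not stop words.
    Plain term frequency is an assumed stand-in for the real heuristic."""
    counts = Counter(t for t in tokenize(text) if t not in STOP_WORDS)
    return [term for term, _ in counts.most_common(limit)]


doc = "The index of the search engine stores the index terms for the search"
print(index_terms(doc))           # every token, including "the" and "of"
print(significant_keywords(doc))  # suggestions with stop words filtered out
```

In this sketch "the" is the most frequent token in the document and is indexed like any other word, yet it never appears among the suggested keywords.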