Class Summary |
DiacriticsUtils |
Helper class for Diacritics usage |
ElemComparator |
Helper class for FTTokenBufferStore, used to calculate the top n words |
FTTokenBufferStore |
Storage of Tokens (materialized), Linguistic Tokens and indexes on them |
LinguisticTokenGenerator |
A class for preprocessing documents: tokenization and conversion of XDM tokens into Linguistic Tokens |
NGramExtractor |
Implementation of an NGram Extractor |
PhraseIterator |
Helper iterator (over phrases) |
PhraseToken |
|
SentenceToken |
Helper class for sentence information |
Stemmer |
Stemmer implementing the Porter Stemming Algorithm.
The Stemmer class transforms a word into its root form. |
Thesaurus |
Implementation of a Local Thesaurus |
TrieNode |
Implementation of a Node in a Trie |
WildcardUtils |
Helper class for Wildcard usage.
FIXME: could be done more efficiently using regex, but regex is not available in the CLDC API |
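
As an illustration of the data structure behind the TrieNode entry, the following is a minimal sketch of a trie node, not the package's actual implementation. The names `TrieSketch`, `Node`, `insert`, `contains` and `hasPrefix` are hypothetical, and the sketch assumes standard Java SE collections (`java.util.HashMap`, lambdas) rather than the restricted CLDC API mentioned above.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of a trie; not the TrieNode class listed in this package. */
public class TrieSketch {

    /** One node: child links keyed by character, plus an end-of-word flag. */
    public static final class Node {
        private final Map<Character, Node> children = new HashMap<>();
        private boolean endOfWord;

        /** Inserts a word by walking, or creating, one child per character. */
        public void insert(String word) {
            Node current = this;
            for (int i = 0; i < word.length(); i++) {
                current = current.children.computeIfAbsent(word.charAt(i), c -> new Node());
            }
            current.endOfWord = true;
        }

        /** Returns true only if exactly this word was inserted earlier. */
        public boolean contains(String word) {
            Node node = find(word);
            return node != null && node.endOfWord;
        }

        /** Returns true if at least one inserted word starts with the prefix. */
        public boolean hasPrefix(String prefix) {
            return find(prefix) != null;
        }

        /** Follows the characters of s from this node; null if the path breaks. */
        private Node find(String s) {
            Node current = this;
            for (int i = 0; i < s.length() && current != null; i++) {
                current = current.children.get(s.charAt(i));
            }
            return current;
        }
    }

    public static void main(String[] args) {
        Node root = new Node();
        root.insert("stem");
        root.insert("stemmer");
        System.out.println(root.contains("stem"));  // prints "true"
        System.out.println(root.contains("ste"));   // prints "false"
        System.out.println(root.hasPrefix("ste"));  // prints "true"
    }
}
```

Lookup cost in this sketch depends on the length of the query string, not on the number of stored words, which is the usual reason to index tokens with a trie when prefix queries are needed.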