The TermsComponent is looking at *indexed* terms that have been passed through the analysis chain. So I suspect you're seeing the results of stemming.
WordDelimiterFilterFactory will also break things up, as will other tokenizers/analyzers. If you want your original input you'll need to have a pretty bare-bones analysis chain. Best Erick On Tue, Aug 9, 2011 at 10:20 AM, Royi Ronen <ronen.r...@gmail.com> wrote: > Hi, > I am using the terms component. > Many times an 'e' at the end of the word is missing. > E.g., it gives 'googl' instead of 'google', 'youtub' instead of 'youtube'. > The problem does not exist for some other words ending with 'e'. > Any ideas why it happens? > Royi >