The TermsComponent is looking at *indexed* terms that have
been passed through the analysis chain. So I suspect you're
seeing the results of stemming.

WordDelimiterFilterFactory will also break things up, as will
other tokenizers/analyzers. If you want your original input
you'll need to have a pretty bare-bones analysis chain.

Best
Erick

On Tue, Aug 9, 2011 at 10:20 AM, Royi Ronen <ronen.r...@gmail.com> wrote:
> Hi,
> I am using the terms component.
> Many times an 'e' at the end of the word is missing.
> E.g., it gives 'googl' instead of 'google', 'youtub' instead of 'youtube'.
> The problem does not exist for some other words ending with 'e'.
> Any ideas why it happens?
> Royi
>

Reply via email to