On Sun, May 26, 2013 at 8:16 PM, Jack Krupansky <j...@basetechnology.com> wrote:
> The only comment I was trying to make here is the relationship between the
> RemoveDuplicatesTokenFilterFactory and the KeywordRepeatFilterFactory.
>
> No, stemmed terms are not considered the same text as the original word. By
> definition, they are a new value for the term text.
>
>

I see, for some reason I did not concentrate on this key quote of yours:
"...to remove the tokens that did not produce a stem ..."

Now it makes perfect sense.

Thank you, Jack!


--
Dotan Cohen

http://gibberish.co.il
http://what-is-what.com

Reply via email to