On Sun, May 26, 2013 at 8:16 PM, Jack Krupansky <j...@basetechnology.com> wrote:
> The only comment I was trying to make here is the relationship between the
> RemoveDuplicatesTokenFilterFactory and the KeywordRepeatFilterFactory.
>
> No, stemmed terms are not considered the same text as the original word. By
> definition, they are a new value for the term text.
>
I see. For some reason I did not focus on this key phrase of yours:
"...to remove the tokens that did not produce a stem ..."

Now it makes perfect sense. Thank you, Jack!

--
Dotan Cohen

http://gibberish.co.il
http://what-is-what.com