Re: EdgeNGramTokenFilter, term position?

Chris Hostetter Mon, 17 Sep 2007 14:36:21 -0700

: Should the EdgeNGramFilter use the same term position for the ngrams within a
: single token?


i can see the argument going both ways ... imagine a hypothetical 
CharSplitterTokenFilter that takes replaces each token in the stream with 
one token per character in the orriginal token (ie: "hello" becomes 
h,e,l,l,o) ... should those tokens all have the same position?  the have a 
logical ordered flow to them, so in theory they are sequential ... but 
they did occupy the same "space" in the orriginal token stream.

when in doubt: make it an option



-Hoss

Re: EdgeNGramTokenFilter, term position?

Reply via email to