: Should the EdgeNGramFilter use the same term position for the ngrams within a single token?
i can see the argument going both ways ... imagine a hypothetical CharSplitterTokenFilter that replaces each token in the stream with one token per character in the original token (ie: "hello" becomes h,e,l,l,o) ... should those tokens all have the same position? they have a logical ordered flow to them, so in theory they are sequential ... but they did occupy the same "space" in the original token stream.

when in doubt: make it an option

-Hoss
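
To make the two choices concrete, here is a rough sketch of the hypothetical CharSplitterTokenFilter idea with the position behaviour exposed as an option. It is plain Java and deliberately does not use the Lucene TokenFilter API; the class name `CharSplitterSketch`, the `Tok` holder and the `samePosition` flag are all made up for illustration. It only borrows the Lucene convention that a position increment of 0 stacks a token on the previous token's position, while an increment of 1 moves to the next position.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not a Lucene class: splits each token into one token
// per character and lets the caller choose how positions are assigned.
public class CharSplitterSketch {

    /** One emitted token: its text and its absolute position in the stream. */
    static final class Tok {
        final String term;
        final int position;

        Tok(String term, int position) {
            this.term = term;
            this.position = position;
        }

        @Override
        public String toString() {
            return term + "@" + position;
        }
    }

    /**
     * @param tokens       the original token stream, one string per token
     * @param samePosition true: all characters of a token share that token's
     *                     position (increment 0 after the first character);
     *                     false: every character advances the position by 1
     */
    static List<Tok> split(List<String> tokens, boolean samePosition) {
        List<Tok> out = new ArrayList<>();
        int position = -1; // the first increment of 1 lands on position 0
        for (String token : tokens) {
            // Note: an empty token emits nothing and does not advance the position.
            for (int i = 0; i < token.length(); i++) {
                boolean firstCharOfToken = (i == 0);
                int increment = (firstCharOfToken || !samePosition) ? 1 : 0;
                position += increment;
                // charAt works on UTF-16 units; good enough for a sketch.
                out.add(new Tok(String.valueOf(token.charAt(i)), position));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> input = List.of("hi", "yo");
        System.out.println(split(input, true));  // [h@0, i@0, y@1, o@1]
        System.out.println(split(input, false)); // [h@0, i@1, y@2, o@3]
    }
}
```

With `samePosition` set to true the characters stay in the "space" of the original token, so position-sensitive queries still see "hi" and "yo" as adjacent; with false the characters are strictly sequential, which changes the distances those queries measure.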