EdgeNGramTokenFilter, term position?

Ryan McKinley Sun, 16 Sep 2007 00:08:59 -0700

Should the EdgeNGramFilter use the same term position for the ngramswithin a single token?

As is, the EdgeNGramTokenFilter increments the term position for eachcharacter. In analysis.jsp, with the input "hello", I get:


term position   1       2       3       4       5
term text       h       he      hel     hell    hello
term type       word    word    word    word    word
start,end       0,1     0,2     0,3     0,4     0,5


I would expect something more like what is generated from SOLR-357:

term position   1
term text       hello
                hell
                hel
                he
                h
term type       word
                prefix
                prefix
                prefix
                prefix
start,end       0,5
                0,4
                0,3
                0,2
                0,1

This seems like it would affect slop queries, but I don't reallyunderstand them yet.


thanks
ryan

EdgeNGramTokenFilter, term position?

Reply via email to