Hello all,
The titles of our docs have the form "ABC0001231-This is an important
doc.pdf". I would like to be able to
search for 'important', or '1231', or 'ABC000*', or 'This is an
important doc' in the title field. I looked a the NGramTokenizer and
tried to use it.
In the index it doesn't seem to work, I cannot get any hits. The
analysis tool on the admin pages shows me that the
ngram tokenizing works by highlighting the matches between the indexed
value and a query. I have set the
min and max ngram size to 2 and 6, with side equal to left.
Can anyone recommend a procedure that will allow me to search as stated
above?
I would also like to find out more about how to use the NgramTokenizer,
but have found little in the form of
documentation. Anyone know about any good sources?
Thanks,
Brian