Hello all,

The titles of our docs have the form "ABC0001231-This is an important doc.pdf". I would like to be able to search for 'important', or '1231', or 'ABC000*', or 'This is an important doc' in the title field. I looked at the NGramTokenizer and tried to use it, but it doesn't seem to work against the index: I cannot get any hits. The analysis tool on the admin pages does show the n-gram tokenizing working, since it highlights the matches between the indexed value and a query. I have set the min and max n-gram size to 2 and 6, with side equal to left.
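
To make the configuration concrete, here is a minimal Python sketch (not Solr code) of what leading-edge n-grams of sizes 2 to 6 would look like. It assumes the grams are taken only from the start of the whole title value; the helper name edge_ngrams is made up for illustration. If that assumption holds, terms such as 'important' or '1231' would never be among the indexed grams, which might explain the missing hits.

```python
def edge_ngrams(text, min_size=2, max_size=6):
    """Return the leading-edge n-grams of text, min_size to max_size characters long.

    Hypothetical helper for illustration only; it assumes grams are taken
    from the start of the entire value, which may not match Solr's behavior.
    """
    upper = min(max_size, len(text))
    return [text[:n] for n in range(min_size, upper + 1)]


title = "ABC0001231-This is an important doc.pdf"
grams = edge_ngrams(title)
print(grams)

# Check which of the desired search terms appear among the generated grams.
for term in ["important", "1231", "ABC000"]:
    print(term, term in grams)
```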

Can anyone recommend a procedure that will allow me to search as stated above?

I would also like to find out more about how to use the NGramTokenizer, but I have found little in the way of documentation. Does anyone know of any good sources?

Thanks,

Brian
