Re: Tokenize ShingleFilterFactory results and apply filters to tokens

2015-10-19 Thread Alexandre Rafalovitch
This sounds like an attempt to create an auto-complete using n-grams in text. In which case, Ted Sullivan's writing might be of relevance: http://lucidworks.com/blog/author/tedsullivan/ Regards, Alex. Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.c

Re: Tokenize ShingleFilterFactory results and apply filters to tokens

2015-10-19 Thread Steve Rowe
Hi Vitaliy, I don’t know of any combination of built-in Lucene/Solr analysis components that would do what you want, but there used to be filter called ShingleMatrixFilter that (if I understand both that filter and what you want correctly), would do what you want, following an EdgeNGramFilter:

Re: Tokenize ShingleFilterFactory results and apply filters to tokens

2015-10-17 Thread vitaly bulgakov
/why don't you put EdgeNGramFilter just after ShingleFilter?/ Because it will do Edge Ngrams over a shingle as a string: for "Home Improvement" shingle it will do: Hom, Home, Home , Home I, Home Im, Home Imp .. But I need: ... Hom Imp, Hom Impr .. -- View this message in context:

Re: Tokenize ShingleFilterFactory results and apply filters to tokens

2015-10-15 Thread Koji Sekiguchi
Hi Vitaly, I'm not sure I understand you correctly, why don't you put EdgeNGramFilter just after ShingleFilter? That is: Koji On 2015/10/15 22:47, vitaly bulgakov wrote: I want to rephrase my question I asked in another post. As far as I understand filter ShingleFilterFactory creates shin