Okay so as suggested Shingle works perfectly well for what I need ! Thank you Erick
2011/2/3 openvictor Open <openvic...@gmail.com> > Thank you for these inputs. > > I was silly asking for ngrams because I already knew it. I think I was > tired yesterday... > > Thank you Eric Erickson, once again you gave me a more than useful comment. > Indeed Shingles seems to be the perfect fit for the work I want to do. I > will try to implement that tonight and I will come back to see if it's > working. > > Regards, > Victor > > 2011/2/3 Erick Erickson <erickerick...@gmail.com> > > First, you'll get a lot of insight by defining something simply and looking >> at the analysis page from solr admin. That's a very valuable page. >> >> To your question: >> commongrams are "shingles" that work between stopwords and >> other words. For instance, "this is some text" gets analyzed into >> this, this_is, is, is_some, some text. Note that the stopwords >> are the only things that get combined with the text after. >> >> NGrams form on letters. It's too long to post the whole thing, but >> the above phrase gets analyzed as >> t, h, i, s, th, hi, is, i, s, is, s, o, m, e, so, om, me...... It splits a >> single >> token into grams whereas commongrams essentially combines tokens >> when they're stopwords. >> >> Have you looked at "shingles"? See: >> >> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory >> Best >> Erick >> >> >> On Thu, Feb 3, 2011 at 10:15 AM, openvictor Open <openvic...@gmail.com >> >wrote: >> >> > Thank you, I will do that and hopefuly it will be handy ! >> > >> > But can someone explain me difference between CommonGramFIlterFactory et >> > NGramFilterFactory ? ( Maybe the solution is there) >> > >> > Thank you all, >> > best regards >> > >> > 2011/2/3 Grijesh <pintu.grij...@gmail.com> >> > >> > > >> > > Use analysis.jsp to see what happening at index time and query time >> with >> > > your >> > > input data.You can use highlighting to see if match found. >> > > >> > > ----- >> > > Thanx: >> > > Grijesh >> > > http://lucidimagination.com >> > > -- >> > > View this message in context: >> > > >> > >> http://lucene.472066.n3.nabble.com/Using-terms-and-N-gram-tp2410938p2411244.html >> > > Sent from the Solr - User mailing list archive at Nabble.com. >> > > >> > >> > >