Okay so as suggested Shingle works perfectly well for what I need !
Thank you Erick

2011/2/3 openvictor Open <openvic...@gmail.com>

> Thank you for these inputs.
>
> I was silly asking for ngrams because I already knew it. I think I was
> tired yesterday...
>
> Thank you Eric Erickson, once again you gave me a more than useful comment.
> Indeed Shingles seems to be the perfect fit for the work I want to do. I
> will try to implement that tonight and I will come back to see if it's
> working.
>
> Regards,
> Victor
>
> 2011/2/3 Erick Erickson <erickerick...@gmail.com>
>
> First, you'll get a lot of insight by defining something simply and looking
>> at the analysis page from solr admin. That's a very valuable page.
>>
>> To your question:
>> commongrams are "shingles" that work between stopwords and
>> other words. For instance, "this is some text" gets analyzed into
>> this, this_is, is, is_some, some text. Note that the stopwords
>> are the only things that get combined with the text after.
>>
>> NGrams form on letters. It's too long to post the whole thing, but
>> the above phrase gets analyzed as
>> t, h, i, s, th, hi, is, i, s, is, s, o, m, e, so, om, me...... It splits a
>> single
>> token into grams whereas commongrams essentially combines tokens
>> when they're stopwords.
>>
>> Have you looked at "shingles"? See:
>>
>> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory
>> Best
>> Erick
>>
>>
>> On Thu, Feb 3, 2011 at 10:15 AM, openvictor Open <openvic...@gmail.com
>> >wrote:
>>
>> > Thank you, I will do that and hopefuly it will be handy !
>> >
>> > But can someone explain me difference between CommonGramFIlterFactory et
>> > NGramFilterFactory ? ( Maybe the solution is there)
>> >
>> > Thank you all,
>> > best regards
>> >
>> > 2011/2/3 Grijesh <pintu.grij...@gmail.com>
>> >
>> > >
>> > > Use analysis.jsp to see what happening at index time and query time
>> with
>> > > your
>> > > input data.You can use highlighting to see if match found.
>> > >
>> > > -----
>> > > Thanx:
>> > > Grijesh
>> > > http://lucidimagination.com
>> > > --
>> > > View this message in context:
>> > >
>> >
>> http://lucene.472066.n3.nabble.com/Using-terms-and-N-gram-tp2410938p2411244.html
>> > > Sent from the Solr - User mailing list archive at Nabble.com.
>> > >
>> >
>>
>
>

Reply via email to