StandardTokenizer is tricky.
it does a lot of kooky things that probably made sense when it was
written, you'll not in your output that the "term type" is getting set to
"HOST" Standard Tokenizer has decided that L.I.C looks like a hostname, so
it's not splitting on the periods.
: analys
]
Sent: Tuesday, March 31, 2009 1:50 PM
To: solr-user@lucene.apache.org
Subject: RE: Not getting the proper result.
Did you try creating your indexes again after modifying the schema.xml
The way Solr search is, whatever string you have
Radha C. wrote:
>
>
> Thanks Grant,
>
gt; To: solr-user@lucene.apache.org
> Subject: Re: Not getting the proper result.
>
> The StandardTokenizer splits on punctuation, so L.I.C. is likely becoming
> 'l', 'i', 'c', while LIC -> lic. One helpful tool is the Analysis page on
> the Solr admin:
[mailto:gsing...@apache.org]
Sent: Monday, March 30, 2009 7:46 PM
To: solr-user@lucene.apache.org
Subject: Re: Not getting the proper result.
The StandardTokenizer splits on punctuation, so L.I.C. is likely becoming
'l', 'i', 'c', while LIC -> lic. One helpful
The StandardTokenizer splits on punctuation, so L.I.C. is likely
becoming 'l', 'i', 'c', while LIC -> lic. One helpful tool is the
Analysis page on the Solr admin: http://localhost:8983/solr/admin/analysis.jsp
as it can help you figure out what is going on with analysis on both
the query