Re: Solr Filter for matching non-accented characters to their accented equivalents

Jack Krupansky Sat, 12 May 2012 11:51:53 -0700

Your field needs to use a field type which has a character folding/mappingfilter, such as:

<charFilter class="solr.MappingCharFilterFactory"mapping="mapping-ISOLatin1Accent.txt"/>


Such as in:

<fieldType name="text_char_norm" class="solr.TextField"positionIncrementGap="100" >

 <analyzer>

<charFilter class="solr.MappingCharFilterFactory"mapping="mapping-ISOLatin1Accent.txt"/>

   <tokenizer class="solr.WhitespaceTokenizerFactory"/>
 </analyzer>
</fieldType>

See the example schema.

In older releases of Solr there was an ISO Latin-1 filter and later the"ASCII Folding Filter", but in 3.6 and trunk the aboveMappingCharFilterFactory char filter is used.


-- Jack Krupansky

-----Original Message-----From: Chiniga

Sent: Friday, May 11, 2012 6:41 AM
To: solr-user@lucene.apache.org

Subject: Solr Filter for matching non-accented characters to their accentedequivalents


Hello,

Our company is maintaining a Vietnamese website and here is the problem:

Our keyboards do not contain accented characters... and once we search for a
product, our non-accented-character searches result to nothing. We need Solr
to match our non-accented characters to their accented character
equivalents. For example:

Searching for Tre Trung would contain results with the words Trẻ trung.


Really hope someone can help. Thanks!

--

View this message in context:http://lucene.472066.n3.nabble.com/Solr-Filter-for-matching-non-accented-characters-to-their-accented-equivalents-tp3979562.htmlSent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Filter for matching non-accented characters to their accented equivalents

Reply via email to