Your field needs to use a field type whose analyzer includes a character folding/mapping filter, such as:

<charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>

For example, in a complete field type definition:

<fieldType name="text_char_norm" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  </analyzer>
</fieldType>
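
The example query in the original message ("Tre Trung") also differs from the indexed text ("Trẻ trung") in letter case, which the field type above does not address on its own. The following is only a sketch, assuming case-insensitive matching is wanted as well. The type name text_char_norm_lc and the field name product_name are made up for illustration and do not come from the example schema; solr.LowerCaseFilterFactory is the standard Solr lowercase filter.

```xml
<!-- Hypothetical variant of the field type above: folds accents, then lowercases tokens -->
<fieldType name="text_char_norm_lc" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Char filter runs before tokenization and maps accented characters per the mapping file -->
    <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Lowercase each token so that "Trung" and "trung" match -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

<!-- Hypothetical field that uses the type; the name is an assumption -->
<field name="product_name" type="text_char_norm_lc" indexed="true" stored="true"/>
```

Because a single analyzer element is declared, the same chain applies at both index and query time, so existing documents would need to be reindexed after the field type changes.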

See the example schema.

Older releases of Solr had an ISO Latin-1 filter and, later, the "ASCII Folding Filter", but in 3.6 and trunk the MappingCharFilterFactory char filter shown above is used.

-- Jack Krupansky

-----Original Message-----
From: Chiniga
Sent: Friday, May 11, 2012 6:41 AM
To: solr-user@lucene.apache.org
Subject: Solr Filter for matching non-accented characters to their accented equivalents

Hello,

Our company is maintaining a Vietnamese website and here is the problem:

Our keyboards do not have accented characters, and when we search for a
product, our searches with non-accented characters return nothing. We need Solr
to match our non-accented characters to their accented character
equivalents. For example:

Searching for "Tre Trung" should return results containing the words "Trẻ trung".


Really hope someone can help. Thanks!

--
View this message in context: http://lucene.472066.n3.nabble.com/Solr-Filter-for-matching-non-accented-characters-to-their-accented-equivalents-tp3979562.html
Sent from the Solr - User mailing list archive at Nabble.com.
