Hi all!

I have a problem with Solr, and I hope somebody here can help me :)

I want to search using text without diacritics and have Solr return both
the text with diacritics and the text without them.

For example, if I query "solr index", it should return "solr index", "sôlr
index", "sòlr index", "sólr indèx", ...

I have tried ASCIIFoldingFilter and ISOLatin1AccentFilterFactory, but the
results are not correct :(

My schema config:

<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="0" generateNumberParts="0" catenateWords="0"
            catenateNumbers="0" catenateAll="0" splitOnCaseChange="0"/>
    <filter class="solr.ASCIIFoldingFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.SnowballPorterFilterFactory" language="English"
            protected="protwords.txt"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
            ignoreCase="true" expand="true"/>
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="0" generateNumberParts="0" catenateWords="0"
            catenateNumbers="0" catenateAll="0" splitOnCaseChange="0"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.SnowballPorterFilterFactory" language="English"
            protected="protwords.txt"/>
  </analyzer>
</fieldType>
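
One difference I notice in the config above is that ASCIIFoldingFilterFactory
appears only in the index analyzer. A variant of the query analyzer that folds
query terms the same way might look like the sketch below. Placing the filter
before LowerCaseFilterFactory simply mirrors the index analyzer; this is an
untested assumption on my part, and I do not know whether it is the cause of
my problem.

```xml
<!-- Sketch only (untested): query analyzer with the same folding step
     as the index analyzer. Everything else is copied from the config above. -->
<analyzer type="query">
  <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
          ignoreCase="true" expand="true"/>
  <filter class="solr.WordDelimiterFilterFactory"
          generateWordParts="0" generateNumberParts="0" catenateWords="0"
          catenateNumbers="0" catenateAll="0" splitOnCaseChange="0"/>
  <!-- Assumption: fold diacritics at query time too, so that a query such as
       "sôlr" is reduced to "solr" just like the indexed terms. -->
  <filter class="solr.ASCIIFoldingFilterFactory"/>
  <filter class="solr.LowerCaseFilterFactory"/>
  <filter class="solr.SnowballPorterFilterFactory" language="English"
          protected="protwords.txt"/>
</analyzer>
```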


-- 
View this message in context: 
http://old.nabble.com/Search-wihthout-diacritics-tp27430345p27430345.html
Sent from the Solr - User mailing list archive at Nabble.com.