Hello folks!

Sorry to ask such a basic question but synonyms might be the end of me.. I suspect that there is something fundamentally wrong with the field type I've set up..

<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.TrimFilterFactory"/>
<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>

<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt"/> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/>
<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
</analyzer>
</fieldType>

In synonyms.txt I have a *large* list of synonyms in the following format..

a, b, c d e, f, g => something

I'm having the behavior that searches for a, b, f, and g all work, but the c d e does not. I suspected that was because things were getting split on white space before they were going to the synonym filter, so I moved the synonym filters to be before the tokenizer. Something's still wrong though... any help would be most appreciated!

Thank you for your time!

Matthew Runo
Software Engineer, Zappos.com
[EMAIL PROTECTED] - 702-943-7833

Reply via email to