Hello folks!
Sorry to ask such a basic question but synonyms might be the end of
me.. I suspect that there is something fundamentally wrong with the
field type I've set up..
<fieldType name="text" class="solr.TextField"
positionIncrementGap="100">
<analyzer>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.TrimFilterFactory"/>
<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
ignoreCase="true" expand="true"/>
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt"/>
<filter class="solr.EnglishPorterFilterFactory"
protected="protwords.txt"/>
<filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
generateNumberParts="1" catenateWords="1" catenateNumbers="1"
catenateAll="0" splitOnCaseChange="1"/>
<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
</analyzer>
</fieldType>
In synonyms.txt I have a *large* list of synonyms in the following
format..
a, b, c d e, f, g => something
I'm having the behavior that searches for a, b, f, and g all work, but
the c d e does not. I suspected that was because things were getting
split on white space before they were going to the synonym filter, so
I moved the synonym filters to be before the tokenizer. Something's
still wrong though... any help would be most appreciated!
Thank you for your time!
Matthew Runo
Software Engineer, Zappos.com
[EMAIL PROTECTED] - 702-943-7833