I have the following field type set up in my schema. The idea is to fire phrases of text such as 'fund manager summary' (without the quotes) at it, and for the synonym processing to recognise this, and add the rest of the synonyms (index-time synonym processing with expansion) to the index from my synonym file (example below)
<fieldType name="synonymstring" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.KeywordTokenizerFactory"/> <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> </analyzer> <analyzer type="query"> <tokenizer class="solr.KeywordTokenizerFactory"/> </analyzer> </fieldType> in synonyms.txt..... fund manager summary, fund manager report guide, product guide I run into 2 issues... (1) After analysis of the field in SOLR, I find that both fund manager summay fund manage report are NOT getting picked up by the synonym factory (after processing I just get the source term outputted from the synonym filter) (2) If I analyse guide, I do get product and guide (*2) outputted from the synonym filter factory - but as seperate terms (3 terms in total), I expected it to generate just 1 additional term - i.e. product guide It seems that it is able to pick up a single word and output two (as seperate terms), but it fails to pick up multiple words. Can anyone help? (incidentally when I use this approach on a SOLR text field type it all works fine, but I cant use a SOLR text field type for this as I use this field for facetting. If you wish to view the St. James's Place email disclaimer, please use the link below http://www.sjp.co.uk/portal/internet/SJPemaildisclaimer