On Mon, Oct 12, 2009 at 9:09 PM, Audrey Foo <au...@hotmail.com> wrote:
>
> In my search docs, I have content such as 'powershot' and 'powerShot'.
> I would expect 'powerShot' would be searched as 'power', 'shot' and
> 'powershot', so that results for all these are returned. Instead, only
> results for 'power' and 'shot' are returned.
> Any suggestions?
>
> In the schema, index analyzer:
> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="0"
>   generateNumberParts="0" catenateWords="1" catenateNumbers="1"
>   catenateAll="0"/>
> <filter class="solr.LowerCaseFilterFactory"/>
>
> In the schema, query analyzer:
> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
>   generateNumberParts="1" catenateWords="0" catenateNumbers="0"
>   catenateAll="0" splitOnCaseChange="1"/>
> <filter class="solr.LowerCaseFilterFactory"/>
>

I find your index-time and query-time configuration very strange. Assuming
that you also have a lowercase filter, it seems that at index time the token
"powerShot" will not be emitted as separate parts; it will be indexed only as
the single catenated token "powershot". Then at query time the same input is
split into "power" and "shot", and neither of those matches the indexed
"powershot".

I suggest you start with the configuration given in the example schema.
Otherwise, it would be easier for us to help if you can explain the reasons
behind changing these parameters.

--
Regards,
Shalin Shekhar Mangar.
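
For reference, here is a sketch of the word-delimiter settings as I recall them from the "text" field type in the example schema shipped with Solr around this time. Treat the exact attribute values as an assumption and check them against the schema.xml in your own distribution. The surrounding tokenizer and any other filters are omitted here.

```xml
<!-- Index analyzer (assumed example-schema values): emit the parts AND the
     catenated form, so "powerShot" is indexed as power, shot and powershot. -->
<analyzer type="index">
  <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
          generateNumberParts="1" catenateWords="1" catenateNumbers="1"
          catenateAll="0" splitOnCaseChange="1"/>
  <filter class="solr.LowerCaseFilterFactory"/>
</analyzer>

<!-- Query analyzer (assumed example-schema values): emit only the parts,
     so the query "powerShot" becomes power, shot. -->
<analyzer type="query">
  <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
          generateNumberParts="1" catenateWords="0" catenateNumbers="0"
          catenateAll="0" splitOnCaseChange="1"/>
  <filter class="solr.LowerCaseFilterFactory"/>
</analyzer>
```

With settings like these, a query for 'powerShot' should find documents that contain 'powerShot' or 'power shot', because the parts are present on both sides. As far as I can tell it would still not find a document that contains only the plain word 'powershot', since the query side never produces that catenated token. The analysis page in the Solr admin UI is a quick way to confirm what each side actually emits for a given input.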