Re: capitalization and delimiters

Shalin Shekhar Mangar Wed, 14 Oct 2009 10:59:18 -0700

On Mon, Oct 12, 2009 at 9:09 PM, Audrey Foo <au...@hotmail.com> wrote:


>
> In my search docs, I have content such as 'powershot' and 'powerShot'.
> I would expect 'powerShot' would be searched as 'power', 'shot' and
> 'powershot', so that results for all these are returned. Instead, only
> results for 'power' and 'shot' are returned.
> Any suggestions?
> In the schema, index analyzer:<filter
> class="solr.WordDelimiterFilterFactory" generateWordParts="0"
> generateNumberParts="0" catenateWords="1" catenateNumbers="1"
> catenateAll="0"/><filter class="solr.LowerCaseFilterFactory"/>
> In the schema, query analyzer<filter
> class="solr.WordDelimiterFilterFactory" generateWordParts="1"
> generateNumberParts="1" catenateWords="0" catenateNumbers="0"
> catenateAll="0" splitOnCaseChange="1"/><filter
> class="solr.LowerCaseFilterFactory"/>
>

I find your index-time and query-time configuration very strange. Assuming
that you also have a lowercase filter, it seems that a token "powerShot"
will not be split and indexed as "powershot". Then during query, both
"power" and "shot" will match nothing.

I suggest you start with the configuration given in the example schema.
Else, it'd be easier for us if you can help us understand the reasons behind
changing these parameters.

-- 
Regards,
Shalin Shekhar Mangar.

Re: capitalization and delimiters

Reply via email to