Re: Problem with camelCase but not casing in general

Yonik Seeley Mon, 07 Jan 2008 14:35:52 -0800

On Jan 7, 2008 5:26 PM, Brendan Grainger <[EMAIL PROTECTED]> wrote:
> I think your problem is happening because splitOnCaseChange is 1 in
> your WordDelimiterFilterFactory:
>
> <filter class="solr.WordDelimiterFilterFactory"
> generateWordParts="1" generateNumberParts="1" catenateWords="1"
> catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/>
>
> So "getElementById" is tokenized to:
>
> (get,0,3)
> (Element,3,10)
> (By,10,12)
> (Id,12,14)
> (getElementById,0,14,posIncr=0)
>
> However getelementbyid is tokenized to:
>
> (getelementbyid,0,14)
>
> which wouldn't be a term in the index??


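It would be a term in the index, since both go through the lowercase
filter: with catenateWords="1" the catenated token getElementById is
indexed too, and lowercasing turns it into getelementbyid.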

Anyway, if splitting on capitalization changes is not desired, getting
rid of the WordDelimiterFilter in both the index and query analyzers
is the right thing to do.
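
A minimal sketch of what that could look like in schema.xml. The field
type name "text_nosplit" and the choice of whitespace tokenizer are
assumptions for illustration, since the original schema was not posted;
the point is only that neither analyzer contains a WordDelimiterFilter:

```xml
<!-- Hypothetical field type: name and tokenizer are assumed, not taken from the poster's schema -->
<fieldType name="text_nosplit" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- No WordDelimiterFilterFactory here, so getElementById stays one token -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Documents indexed with the old analyzer would need to be reindexed for
the change to take effect on the index side.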

-Yonik
