Bah... While attempting to duplicate this on our 4.x instance I
realized I was mis-reading the analysis output. In the example I
mentioned it was actually a SynonymFilter in the analysis chain that
was affecting the term position (we have several synonyms for
"telescope").

Regardless, it seems to not be a problem in 4.x.

Thanks,
--jay

On Tue, Oct 23, 2012 at 10:45 AM, Shawn Heisey <s...@elyograg.org> wrote:
> On 10/23/2012 8:16 AM, Jay Luker wrote:
>>
>>  From looking at the analysis debugger I can see that the WDF is
>> getting the term "Telescope,SALT" and correctly splitting on the
>> comma. The problem seems to be that the original term is given the 1st
>> position, e.g.:
>>
>> Pos  Term
>> 1      Southern
>> 2      African
>> 3      Large
>> 4      Telescope,SALT  <-- original term
>> 5      Telescope
>> 6      SALT
>
>
> Jay, I have WDF with preserveOriginal turned on.  I get the following from
> WDF parsing in the analysis page on either 3.5 or 4.1-SNAPSHOT, and the
> analyzer shows that all four of the query words are found in consecutive
> fields.  On the new version, I had to slide a scrollbar to the right to see
> the last term.  Visually they were not in consecutive fields on the new
> version (they were on 3.5), but the position number says otherwise.
>
>
> 1    Southern
> 2    African
> 3    Large
> 4    Telescope,SALT
> 4    Telescope
> 5    SALT
> 5    TelescopeSALT
>
> My full WDF parameters:
> index: {preserveOriginal=1, splitOnCaseChange=1, generateNumberParts=1,
> catenateWords=1, splitOnNumerics=1, stemEnglishPossessive=1,
> luceneMatchVersion=LUCENE_35, generateWordParts=1, catenateAll=0,
> catenateNumbers=1}
> query: {preserveOriginal=1, splitOnCaseChange=1, generateNumberParts=1,
> catenateWords=0, splitOnNumerics=1, stemEnglishPossessive=1,
> luceneMatchVersion=LUCENE_35, generateWordParts=1, catenateAll=0,
> catenateNumbers=0}
>
> I understand from other messages on the mailing list that I should not have
> preserveOriginal on the query side, but I have not yet changed it.
>
> If your position numbers really are what you indicated, you may have found a
> bug.  I have not tried the released 4.0.0 version, I expect to deploy from
> the 4.x branch under development.
>
> Thanks,
> Shawn
>

Reply via email to