That didn't help.  Seems like another case where I should get matches but don't 
and this time it is only for some documents.  Others with similar content do 
match just fine.  The debug output 'explain other' section for a non-matching 
document seems to say the term frequency is 0 for my problematic term, although 
I know it is in the content.  

I ended up making a synonym to do what the analysis stack *should* be doing: 
splitting LaserJet on case changes.  IE putting LaserJet, laser jet in synonyms 
at index time makes this work.  I don't know why though.

Question:  Does this debug output mean it is matching the terms but the term 
frequency vector is returning 0 for the frequency of this term.  IE Does this 
mean the term is in the doc but not in the tf array?

0.0 = no match on required clause (moreWords:"laser jet")
>>
>>    0.0 = weight(moreWords:"laser jet" in 32497), product of:
>>
>>      0.60590804 = queryWeight(moreWords:"laser jet"), product of:
>>
>>        14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>        0.041507367 = queryNorm
>>
>>      0.0 = fieldWeight(moreWords:"laser jet" in 32497), product of:
>>
>>        0.0 = tf(phraseFreq=0.0)
>>
>>        14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>        0.078125 = fieldNorm(field=moreWords, doc=32497)
>>
>>


-----Original Message-----
From: Erick Erickson [mailto:erickerick...@gmail.com] 
Sent: Monday, July 25, 2011 3:28 PM
To: solr-user@lucene.apache.org
Subject: Re: please help explaining debug output

Hmmm, I can't find a convenient 1.4.0 to download, but re-indexing is a good
idea since this seems like it *should* work.

Erick

On Mon, Jul 25, 2011 at 5:32 PM, Robert Petersen <rober...@buy.com> wrote:
> I'm still on solr 1.4.0 and the analysis page looks like they should match, 
> and other products with the same content do in fact match.  I'm reindexing 
> the non-matching ones to rule that out.
>
> -----Original Message-----
> From: Erick Erickson [mailto:erickerick...@gmail.com]
> Sent: Monday, July 25, 2011 1:58 PM
> To: solr-user@lucene.apache.org
> Subject: Re: please help explaining debug output
>
> Hmmm, I'm assuming that moreWords is your default text field, yes?
>
> But it works for me (tm), using 1.4.1. What version of Solr are you on?
>
> Also, take a glance at the admin/analysis page, that might help...
>
> Gotta run
>
> Erick
>
> On Mon, Jul 25, 2011 at 4:52 PM, Robert Petersen <rober...@buy.com> wrote:
>> Sorry, to clarify a search for P1102W matches all three docs but a
>> search for p1102w LaserJet only matches the second two.  Someone asked
>> me a question while I was typing and I got distracted, apologies for any
>> confusion.
>>
>> -----Original Message-----
>> From: Robert Petersen [mailto:rober...@buy.com]
>> Sent: Monday, July 25, 2011 1:42 PM
>> To: solr-user@lucene.apache.org
>> Subject: please help explaining debug output
>>
>> I have three documents with the following product titles in a text field
>> called moreWords with analysis stack matching the solr example text
>> field definition.
>>
>>
>>
>> 1.       HP LaserJet P1102W Monochrome Laser Printer
>> <http://www.buy.com/prod/hp-laserjet-p1102w-monochrome-laser-printer/q/l
>> oc/101/213824965.html>
>>
>> 2.       HP CE285A (85A) Remanufactured Black Toner Cartridge for
>> LaserJet M1212nf, P1102, P1102W Series
>> <http://www.buy.com/prod/hp-ce285a-85a-remanufactured-black-toner-cartri
>> dge-for-laserjet/q/loc/101/217145536.html>
>>
>> 3.       Black HP CE285A Toner Cartridge For LaserJet P1102W, LaserJet
>> M1130, LaserJet M1132, LaserJet M1210
>> <http://www.buy.com/prod/black-hp-ce285a-toner-cartridge-for-laserjet-p1
>> 102w-laserjet-m1130/q/loc/101/222045267.html>
>>
>>
>>
>> A search for P1102W matches (2) and (3), but not (1) above.  Can someone
>> explain the debug output?  It looks like I am getting a non-match on (1)
>> because term frequency is zero?  Am I reading that right?  If so, how
>> could that be? the searched terms are equivalently in all three docs.  I
>> don't get it.
>>
>>
>>
>>
>>
>> <lst name="debug">
>>
>> <str name="rawquerystring">p1102w LaserJet </str>
>>
>> <str name="querystring">p1102w LaserJet </str>
>>
>> <str name="parsedquery">+PhraseQuery(moreWords:"p 1102 w")
>> +PhraseQuery(moreWords:"laser jet")</str>
>>
>> <str name="parsedquery_toString">+moreWords:"p 1102 w" +moreWords:"laser
>> jet"</str>
>>
>> <lst name="explain">
>>
>> <str name="222045267">
>>
>> 3.64852 = (MATCH) sum of:
>>
>>  2.4758534 = weight(moreWords:"p 1102 w" in 6667236), product of:
>>
>>    0.7955347 = queryWeight(moreWords:"p 1102 w"), product of:
>>
>>      19.166107 = idf(moreWords: p=189166 1102=1135 w=445720)
>>
>>      0.041507367 = queryNorm
>>
>>    3.1121879 = fieldWeight(moreWords:"p 1102 w" in 6667236), product
>> of:
>>
>>      1.7320508 = tf(phraseFreq=3.0)
>>
>>      19.166107 = idf(moreWords: p=189166 1102=1135 w=445720)
>>
>>      0.09375 = fieldNorm(field=moreWords, doc=6667236)
>>
>>  1.1726664 = weight(moreWords:"laser jet" in 6667236), product of:
>>
>>    0.60590804 = queryWeight(moreWords:"laser jet"), product of:
>>
>>      14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>      0.041507367 = queryNorm
>>
>>    1.9353869 = fieldWeight(moreWords:"laser jet" in 6667236), product
>> of:
>>
>>      1.4142135 = tf(phraseFreq=2.0)
>>
>>      14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>      0.09375 = fieldNorm(field=moreWords, doc=6667236)
>>
>>
>>
>> </str>
>>
>> <str name="222045265">
>>
>> 2.8656518 = (MATCH) sum of:
>>
>>  1.4294347 = weight(moreWords:"p 1102 w" in 6684158), product of:
>>
>>    0.7955347 = queryWeight(moreWords:"p 1102 w"), product of:
>>
>>      19.166107 = idf(moreWords: p=189166 1102=1135 w=445720)
>>
>>      0.041507367 = queryNorm
>>
>>    1.7968225 = fieldWeight(moreWords:"p 1102 w" in 6684158), product
>> of:
>>
>>      1.0 = tf(phraseFreq=1.0)
>>
>>      19.166107 = idf(moreWords: p=189166 1102=1135 w=445720)
>>
>>      0.09375 = fieldNorm(field=moreWords, doc=6684158)
>>
>>  1.4362172 = weight(moreWords:"laser jet" in 6684158), product of:
>>
>>    0.60590804 = queryWeight(moreWords:"laser jet"), product of:
>>
>>      14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>      0.041507367 = queryNorm
>>
>>    2.3703551 = fieldWeight(moreWords:"laser jet" in 6684158), product
>> of:
>>
>>      1.7320508 = tf(phraseFreq=3.0)
>>
>>      14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>      0.09375 = fieldNorm(field=moreWords, doc=6684158)
>>
>>
>>
>> </str>
>>
>> </lst>
>>
>> <str name="otherQuery">sku:213824965
>>
>> </str>
>>
>> <lst name="explainOther">
>>
>> <str name="213824965">
>>
>> 0.0 = (NON-MATCH) Failure to meet condition(s) of required/prohibited
>> clause(s)
>>
>>  1.1911955 = weight(moreWords:"p 1102 w" in 32497), product of:
>>
>>    0.7955347 = queryWeight(moreWords:"p 1102 w"), product of:
>>
>>      19.166107 = idf(moreWords: p=189166 1102=1135 w=445720)
>>
>>      0.041507367 = queryNorm
>>
>>    1.4973521 = fieldWeight(moreWords:"p 1102 w" in 32497), product of:
>>
>>      1.0 = tf(phraseFreq=1.0)
>>
>>      19.166107 = idf(moreWords: p=189166 1102=1135 w=445720)
>>
>>      0.078125 = fieldNorm(field=moreWords, doc=32497)
>>
>>  0.0 = no match on required clause (moreWords:"laser jet")
>>
>>    0.0 = weight(moreWords:"laser jet" in 32497), product of:
>>
>>      0.60590804 = queryWeight(moreWords:"laser jet"), product of:
>>
>>        14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>        0.041507367 = queryNorm
>>
>>      0.0 = fieldWeight(moreWords:"laser jet" in 32497), product of:
>>
>>        0.0 = tf(phraseFreq=0.0)
>>
>>        14.597603 = idf(moreWords: laser=26731 jet=12685)
>>
>>        0.078125 = fieldNorm(field=moreWords, doc=32497)
>>
>>
>>
>> </str>
>>
>> </lst>
>>
>>
>

Reply via email to