Hmmm. First, highlighting should work here. If you have it configured
to work  on the dc.description field.

As to whether the phrase "management changes" is near enough, I
pretty much guarantee it is. This is where the admin/analysis page can
answer this type of question authoritatively since it's based exactly
on your particular analysis chain.

Best,
Erick

On Tue, Jun 16, 2015 at 8:25 AM, Alistair Young
<alistair.yo...@uhi.ac.uk> wrote:
> yes prolly not a bug. The highlighting is on but nothing is highlighted.
> Perhaps this text is triggering it?
>
> 'consider the impacts of land management changes’
>
> that would seem reasonable. It’s not a direct match so no highlighting
> (the highlighting does work on a direct match) but 'management changes’
> must be near enough ‘manage change’ to trigger a result.
>
> Alistair
>
> --
> mov eax,1
> mov ebx,0
> int 80h
>
>
>
>
> On 16/06/2015 16:18, "Erick Erickson" <erickerick...@gmail.com> wrote:
>
>>I agree with Allesandro the behavior you're describing
>>is _not_ correct at all given your description. So either
>>
>>1> There's something "interesting" about your configuration
>>      that doesn't seem important that you haven't told us,
>>      although what it could be is a mystery to me  too ;)
>>
>>2> it's matching on something else. Note that the
>>     phrase has been stemmed, so something in there
>>     besides management might stem to manag and/or
>>    something other than changes might stem to chang
>>    and the two of _them_ happen to be next to each
>>    other. "are managers changing?" for instance. Or
>>    even something less likely. Perhaps turn on
>>    highlighting and see if it pops out?
>>
>>
>>3> you've uncovered a bug. Although I suspect others
>>    would have reported it and the unit tests would have
>>    barfed all over the place.
>>
>>One other thing you can do. Go to the admin/analysis
>>page and turn on the "verbose" check box. Put
>>management is undergoing many changes
>>in both the query and index boxes. The result (it's
>>kind of hard to read I'll admit) will include the position
>>of each token after all the analysis is done. Phrase
>>queries (without slop) should only be matching adjacent
>>positions. So the question is whether the position info
>>"looks correct"....
>>
>>Best,
>>Erick
>>
>>On Tue, Jun 16, 2015 at 4:40 AM, Alessandro Benedetti
>><benedetti.ale...@gmail.com> wrote:
>>> According to your debug you are using a default Lucene Query Parser.
>>> This surprise me as i would expect with that query a match with
>>>distance 0
>>> between the 2 terms .
>>>
>>> Are you sure nothing else is that field that matches the phrase query ?
>>>
>>> From the documentation
>>>
>>> "Lucene supports finding words are a within a specific distance away.
>>>To do
>>> a proximity search use the tilde, "~", symbol at the end of a Phrase.
>>>For
>>> example to search for a "apache" and "jakarta" within 10 words of each
>>> other in a document use the search:
>>>
>>> "jakarta apache"~10 "
>>>
>>>
>>> Cheers
>>>
>>>
>>> 2015-06-16 11:33 GMT+01:00 Alistair Young <alistair.yo...@uhi.ac.uk>:
>>>
>>>> it¹s a useful behaviour. I¹d just like to understand where it¹s
>>>>deciding
>>>> the document is relevant. debug output is:
>>>>
>>>> <lst name="debug">
>>>>   <str name="rawquerystring">dc.description:"manage change"</str>
>>>>   <str name="querystring">dc.description:"manage change"</str>
>>>>   <str name="parsedquery">PhraseQuery(dc.description:"manag
>>>>chang")</str>
>>>>   <str name="parsedquery_toString">dc.description:"manag chang"</str>
>>>>   <lst name="explain">
>>>>     <str name="tst:test">
>>>> 1.2008798 = (MATCH) weight(dc.description:"manag chang" in 221)
>>>> [DefaultSimilarity], result of:
>>>>   1.2008798 = fieldWeight in 221, product of:
>>>>     1.0 = tf(freq=1.0), with freq of:
>>>>       1.0 = phraseFreq=1.0
>>>>     9.6070385 = idf(), sum of:
>>>>       4.0365543 = idf(docFreq=101, maxDocs=2125)
>>>>       5.5704846 = idf(docFreq=21, maxDocs=2125)
>>>>     0.125 = fieldNorm(doc=221)
>>>> </str>
>>>>   </lst>
>>>>   <str name="QParser">LuceneQParser</str>
>>>>   <lst name="timing">
>>>>     <double name="time">41.0</double>
>>>>     <lst name="prepare">
>>>>       <double name="time">3.0</double>
>>>>       <lst name="query">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="facet">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="mlt">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="highlight">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="stats">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="debug">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>     </lst>
>>>>     <lst name="process">
>>>>       <double name="time">35.0</double>
>>>>       <lst name="query">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="facet">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="mlt">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="highlight">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="stats">
>>>>         <double name="time">0.0</double>
>>>>       </lst>
>>>>       <lst name="debug">
>>>>         <double name="time">35.0</double>
>>>>       </lst>
>>>>     </lst>
>>>>   </lst>
>>>> </lst>
>>>>
>>>>
>>>> thanks,
>>>>
>>>> Alistair
>>>>
>>>> --
>>>> mov eax,1
>>>> mov ebx,0
>>>> int 80h
>>>>
>>>>
>>>>
>>>>
>>>> On 16/06/2015 11:26, "Alessandro Benedetti"
>>>><benedetti.ale...@gmail.com>
>>>> wrote:
>>>>
>>>> >Can you show us how the query is parsed ?
>>>> >You didn't tell us nothing about the query parser you are using.
>>>> >Enable the debugQuery=true will show you how the query is parsed and
>>>>this
>>>> >will be quite useful for us.
>>>> >
>>>> >
>>>> >Cheers
>>>> >
>>>> >2015-06-16 11:22 GMT+01:00 Alistair Young <alistair.yo...@uhi.ac.uk>:
>>>> >
>>>> >> Hiya,
>>>> >>
>>>> >> I've been looking for documentation that would point to where I
>>>>could
>>>> >> modify or explain why 'near neighbours' are returned from a phrase
>>>> >>search.
>>>> >> If I search for:
>>>> >>
>>>> >> "manage change"
>>>> >>
>>>> >> I get back a document that contains "this will help in your
>>>>management
>>>> >>of
>>>> >> <lots more words...> changes". It's relevant but I'd like to
>>>>understand
>>>> >>why
>>>> >> solr is returning it. Is it a combination of fuzzy/slop? The
>>>>distance
>>>> >> between the two variations of the two words in the document is quite
>>>> >>large.
>>>> >>
>>>> >> thanks,
>>>> >>
>>>> >> Alistair
>>>> >>
>>>> >> --
>>>> >> mov eax,1
>>>> >> mov ebx,0
>>>> >> int 80h
>>>> >>
>>>> >
>>>> >
>>>> >
>>>> >--
>>>> >--------------------------
>>>> >
>>>> >Benedetti Alessandro
>>>> >Visiting card : http://about.me/alessandro_benedetti
>>>> >
>>>> >"Tyger, tyger burning bright
>>>> >In the forests of the night,
>>>> >What immortal hand or eye
>>>> >Could frame thy fearful symmetry?"
>>>> >
>>>> >William Blake - Songs of Experience -1794 England
>>>>
>>>>
>>>
>>>
>>> --
>>> --------------------------
>>>
>>> Benedetti Alessandro
>>> Visiting card : http://about.me/alessandro_benedetti
>>>
>>> "Tyger, tyger burning bright
>>> In the forests of the night,
>>> What immortal hand or eye
>>> Could frame thy fearful symmetry?"
>>>
>>> William Blake - Songs of Experience -1794 England
>

Reply via email to