Hmmm, not quite sure what to say. Offsets and positions help, particularly with FastVectorHighlighter, but the highlighting is usually re-analyzed anyway so it _shouldn't_ matter. But what I don't know about highlighting could fill volumes ;)..
Sorry I can't be more help here. Erick On Tue, Feb 24, 2015 at 12:16 AM, Dmitry Kan <solrexp...@gmail.com> wrote: > Erick, > > Our default operator is AND. > > Both queries below parse the same: > > a OR (b c) OR d > a OR (b AND c) OR d > > The parsed query: > > <str name="parsedquery_toString">Contents:a (+Contents:b +Contents:c) > Contents:d</str> > > So this part is consistent with our expectation. > > >>> I'm a bit puzzled by your statement that "c" didn't contribute to the > score. > what I meant was that the term c was not hit by the scorerer: the explain > section does not refer to it. I'm using the made up terms here, but the > concept holds. > > The code suggests that we could benefit from storing term offsets and > positions: > http://grepcode.com/file/repo1.maven.org/maven2/org.apache.solr/solr-core/4.3.1/org/apache/solr/highlight/DefaultSolrHighlighter.java#470 > > Is it correct assumption? > > On Mon, Feb 23, 2015 at 8:29 PM, Erick Erickson <erickerick...@gmail.com> > wrote: > >> Highlighting is such a pain... >> >> what does the parsed query look like? If the default operator is OR, >> then this seems correct as both 'd' and 'c' appear in the doc. So >> I'm a bit puzzled by your statement that "c" didn't contribute to the >> score. >> >> If the parsed query is, indeed >> a +b +c d >> >> then it does look like something with the highlighter. Whether other >> highlighters are better for this case.. no clue ;( >> >> Best, >> Erick >> >> On Mon, Feb 23, 2015 at 9:36 AM, Dmitry Kan <solrexp...@gmail.com> wrote: >> > Erick, >> > >> > nope, we are using std lucene qparser with some customizations, that do >> not >> > affect the boolean query parsing logic. >> > >> > Should we try some other highlighter? >> > >> > On Mon, Feb 23, 2015 at 6:57 PM, Erick Erickson <erickerick...@gmail.com >> > >> > wrote: >> > >> >> Are you using edismax? >> >> >> >> On Mon, Feb 23, 2015 at 3:28 AM, Dmitry Kan <solrexp...@gmail.com> >> wrote: >> >> > Hello! >> >> > >> >> > In solr 4.3.1 there seem to be some inconsistency with the >> highlighting >> >> of >> >> > the boolean query: >> >> > >> >> > a OR (b c) OR d >> >> > >> >> > This returns a proper hit, which shows that only d was included into >> the >> >> > document score calculation. >> >> > >> >> > But the highlighter returns both d and c in <em> tags. >> >> > >> >> > Is this a known issue of the standard highlighter? Can it be >> mitigated? >> >> > >> >> > >> >> > -- >> >> > Dmitry Kan >> >> > Luke Toolbox: http://github.com/DmitryKey/luke >> >> > Blog: http://dmitrykan.blogspot.com >> >> > Twitter: http://twitter.com/dmitrykan >> >> > SemanticAnalyzer: www.semanticanalyzer.info >> >> >> > >> > >> > >> > -- >> > Dmitry Kan >> > Luke Toolbox: http://github.com/DmitryKey/luke >> > Blog: http://dmitrykan.blogspot.com >> > Twitter: http://twitter.com/dmitrykan >> > SemanticAnalyzer: www.semanticanalyzer.info >> > > > > -- > Dmitry Kan > Luke Toolbox: http://github.com/DmitryKey/luke > Blog: http://dmitrykan.blogspot.com > Twitter: http://twitter.com/dmitrykan > SemanticAnalyzer: www.semanticanalyzer.info