The natives have been getting restless since I turned off the highlighter in 
our application quite a few months ago. In an attempt to rectify that, I've 
migrated to Solr 3.1 (it has been quite stable so far) in hopes of using the 
fast vector highlighter. So far I'm not seeing a significant performance hit 
from the highlighter. I was spending many seconds highlighting results in 1.4 
(1.3 was considerably faster), so that's a big win.


But the fragments often start mid-word: the non-highlighted text preceding 
the match gets cut off on the left.

For example, a title field of "Apache MQ Message Queue Setup Notes (Dev)" is 
getting highlighted as "he MQ <em>Message Queue</em> Setup Notes (Dev)".
Some titles highlight fine, some don't. There's no common thread that I can see.

From http://wiki.apache.org/solr/HighlightingParameters, on hl.fragsize:

"The size, in characters, of fragments to consider for highlighting. "0" 
indicates that the whole field value should be used (no fragmenting). This 
parameter accepts per-field overrides."

I thought setting hl.fragsize to 0 might help the situation, but that results in:
SEVERE: java.lang.IllegalArgumentException: fragCharSize(0) is too small. It must be 18 or higher.
        at org.apache.lucene.search.vectorhighlight.SimpleFragListBuilder.createFieldFragList(SimpleFragListBuilder.java:36)
        at org.apache.lucene.search.vectorhighlight.FastVectorHighlighter.getFieldFragList(FastVectorHighlighter.java:167)
        at org.apache.lucene.search.vectorhighlight.FastVectorHighlighter.getBestFragments(FastVectorHighlighter.java:158)
        at org.apache.solr.highlight.DefaultSolrHighlighter.doHighlightingByFastVectorHighlighter(DefaultSolrHighlighter.java:506)
        at org.apache.solr.highlight.DefaultSolrHighlighter.doHighlighting(DefaultSolrHighlighter.java:379)
...

I also tried setting hl.fragsize to a value larger than the maximum possible 
size of the field, but I still get a left-truncated highlight in many cases.
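
One possible explanation, sketched below purely as a guess: if the fragment 
builder starts each fragment a fixed number of characters before the match 
rather than at a word boundary, the title example above falls out exactly. 
The margin of 6 is an assumption (one third of the minimum of 18 from the 
error message); the class and method names are hypothetical and are not part 
of Lucene or Solr.

```java
public class FragmentMarginSketch {
    // ASSUMPTION: a fixed margin of 6 characters before the match.
    // Chosen because it reproduces the observed output; not taken from the Lucene source.
    public static final int ASSUMED_MARGIN = 6;

    // Start offset of a fragment: the match start minus the margin,
    // clamped so it never goes before the end of the previous fragment.
    public static int fragmentStart(int matchStart, int previousFragmentEnd) {
        int candidate = matchStart - ASSUMED_MARGIN;
        return candidate < previousFragmentEnd ? previousFragmentEnd : candidate;
    }

    // Cut a fragment of at most fragCharSize characters around the first
    // occurrence of match. Returns the whole text if match is not found.
    public static String fragment(String text, String match, int fragCharSize) {
        int matchStart = text.indexOf(match);
        if (matchStart < 0) {
            return text;
        }
        int start = fragmentStart(matchStart, 0);
        // Compare against the remaining length to avoid int overflow on huge sizes.
        int end = fragCharSize >= text.length() - start
                ? text.length()
                : start + fragCharSize;
        return text.substring(start, end);
    }

    public static void main(String[] args) {
        String title = "Apache MQ Message Queue Setup Notes (Dev)";
        // "Message Queue" starts at offset 10, so the fragment starts at offset 4.
        System.out.println(fragment(title, "Message Queue", 10000));
        // prints: he MQ Message Queue Setup Notes (Dev)
    }
}
```

If that guess is right, it would also explain why some titles look fine: any 
match within the first 6 characters of the field gets clamped to offset 0, 
and a larger hl.fragsize would only move the right edge of the fragment, not 
the left one.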


hl.fragListBuilder and hl.fragmentsBuilder sound like they may be relevant, but 
I haven't been able to locate any documentation on them.

This is my highlighter setup for the time being:

<str name="hl">true</str>
<str name="hl.useFastVectorHighlighter">true</str>
<str name="hl.snippets">3</str>
<str name="f.title.hl.fragsize">10000</str>
<str name="hl.fragsize">200</str>
<str name="hl.mergeContiguous">true</str>
<str name="hl.simple.pre">&lt;b&gt;</str>
<str name="hl.simple.post">&lt;/b&gt;</str>


Any suggestions?  Thanks.
I'm running revision 1021880 in the lusolr 3_1 branch.


Trey Hyde
th...@centraldesktop.com
Central Desktop, Inc.
Organize, Share, Collaborate
