AW: Highlighting in large text fields

Burkamp, Christian Mon, 25 Jun 2007 14:46:16 -0700

Hi Mike,

Thanks for the quick help. I just added a call to 
Highlighter.setMaxDocBytesToAnalyze() to my local copy of the 
HighlightingUtil.java and it worked all right. It would be great to have the 
limit for the docBytesToAnalyze configurable in solrconfig.xml. (But it's out 
of scope for me to implement this right now).


--Christian

-----Ursprüngliche Nachricht-----
Von: Mike Klaas [mailto:[EMAIL PROTECTED] 
Gesendet: Montag, 25. Juni 2007 19:34
An: solr-user@lucene.apache.org
Betreff: Re: Highlighting in large text fields

On 25-Jun-07, at 4:59 AM, Burkamp, Christian wrote:

> Hi list,
>
> Highlighting does not work for words that are not located near the 
> beginning of a text field.
> In my index the whole text is stored in a text field for highlighting 
> purpose. This field is just stored but not indexed. The maxFieldLength 
> was set to 100000.
> The document content can be retrieved from the index without any 
> problem but for some terms highlighting does not return anything. This 
> is the case for all words from position 9162 on.
> When I try to highlight the whole text (hl.fragsize=0) with some 
> common word as query it returns the highlighted content but just the 
> first
> 9162
> words. The rest is omitted.
>
> Any idea what might be going wrong? 9162 seems not to be a standard 
> limit for IT systems.

The lucene highlighter by default only processes the first 50kB of  
text.  This is probably something that should be made configurable.   
I'll add it to the future features.

-Mike

AW: Highlighting in large text fields

Reply via email to