Hi Mike,

Thanks for the quick help. I just added a call to Highlighter.setMaxDocBytesToAnalyze() to my local copy of HighlightingUtil.java and it worked fine. It would be great if the limit for docBytesToAnalyze were configurable in solrconfig.xml, but implementing that is out of scope for me right now.
--Christian

-----Original Message-----
From: Mike Klaas [mailto:[EMAIL PROTECTED]
Sent: Monday, 25 June 2007 19:34
To: solr-user@lucene.apache.org
Subject: Re: Highlighting in large text fields

On 25-Jun-07, at 4:59 AM, Burkamp, Christian wrote:

> Hi list,
>
> Highlighting does not work for words that are not located near the
> beginning of a text field.
> In my index the whole text is stored in a text field for highlighting
> purposes. This field is just stored but not indexed. The maxFieldLength
> was set to 100000.
> The document content can be retrieved from the index without any
> problem but for some terms highlighting does not return anything. This
> is the case for all words from position 9162 on.
> When I try to highlight the whole text (hl.fragsize=0) with some
> common word as query it returns the highlighted content but just the
> first 9162 words. The rest is omitted.
>
> Any idea what might be going wrong? 9162 seems not to be a standard
> limit for IT systems.

The Lucene highlighter by default only processes the first 50kB of text. This is probably something that should be made configurable. I'll add it to the future features.

-Mike
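
To illustrate the limit described above, here is a minimal, self-contained Java sketch. It does not use Lucene: the class name, the highlight method and the maxCharsToAnalyze parameter are hypothetical, and counting characters instead of bytes is a simplification. It only shows why a hit beyond the analysed prefix is never marked and why the remainder of the text is omitted from the result.

```java
final class TruncatedHighlightSketch {

    // Hypothetical stand-in for a highlighter with an analysis limit.
    // This is NOT Lucene's API; it only mimics the observed behaviour.
    static String highlight(String text, String term, int maxCharsToAnalyze) {
        // Only the first maxCharsToAnalyze characters are ever looked at.
        int limit = Math.min(text.length(), Math.max(0, maxCharsToAnalyze));
        String analyzed = text.substring(0, limit);

        StringBuilder out = new StringBuilder();
        int pos = 0;
        while (!term.isEmpty()) {
            int hit = analyzed.indexOf(term, pos);
            if (hit < 0) {
                break; // no further occurrences inside the analysed prefix
            }
            // Copy the text before the hit, then the hit wrapped in <em> tags.
            out.append(analyzed, pos, hit)
               .append("<em>").append(term).append("</em>");
            pos = hit + term.length();
        }
        // Copy whatever is left of the analysed prefix; text past the limit is dropped.
        out.append(analyzed, pos, analyzed.length());
        return out.toString();
    }

    public static void main(String[] args) {
        String text = "solr near the start, filler filler filler, solr near the end";
        // Small limit: only the first occurrence is seen, the rest is cut off.
        System.out.println(highlight(text, "solr", 20));
        // Limit covering the whole text: both occurrences are highlighted.
        System.out.println(highlight(text, "solr", text.length()));
    }
}
```

Raising the limit so that it covers the whole stored field, as the setMaxDocBytesToAnalyze() call mentioned above does for the real highlighter, corresponds to the second call in main.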