Re: Creating a document blurb when nothing is returned from highlight feature

Mike Klaas Thu, 09 Aug 2007 14:33:03 -0700

On 9-Aug-07, at 2:10 PM, Benjamin Higgins wrote:

Hi all, I'd like to provide a blurb of documents matching a search in
the case when there is no text highlighted. I assumed that perhapsthehighlighter would give me back the first few words in a document ifthisoccurred, but it doesn't. My conundrum is that I'd rather not grabthewhole document body field because some of them are large. Is theresome
way I can request from Lucene the first N words or lines from a field?

The way I deal with this is that I modified the highlighter fragmentscorer to return a positive (but low) score for the first fewfragments of a doc. This will work, but tends not to provide greatsummaries and will definitely still fetch and process the entire doccontents.

The better way to do this is to generate a better general summaryyourself and store it in a separate field; this can be used if nohighlighting is generated (or, capability in Solr to automaticallysubstitute a field in the case of no highlighting would be cool). Imight even implement this if there is sufficient interest :).

Unfortunately, the highlighter does not know (and realy has no way ofknowing) what parts of a doc matched, so it would still have to tryhighlighting first.

Note that you can control the cpu usage for long fields by settinghl.maxAnalyzedChars (will be in the next release).


best,
-Mike

Re: Creating a document blurb when nothing is returned from highlight feature

Reply via email to