I found https://issues.apache.org/jira/browse/SOLR-1397 but there is not much going on there

LUCENE-1522 <https://issues.apache.org/jira/browse/LUCENE-1522>has a lot of fascinating discussion on this topic though


There is a couple of long lived issues in jira for this (I'd like to try to search
them, but I couldn't access jira now).

For FVH, it is needed to be modified at Lucene level to use external data.

koji
Koji - is that really so? It appears to me that would could extend BaseFragmentsBuilder and override

createFragments(IndexReader reader, int docId,
      String fieldName, FieldFragList fieldFragList, int maxNumFragments,
      String[] preTags, String[] postTags, Encoder encoder )

providing a version that retrieves text from some external source rather than from Lucene fields.

It sounds to me like a really useful modification in Lucene core would be to retain match points that have already been computed during scoring so the highlighter doesn't have to attempt to reinvent all that logic! This has all been discussed at length in LUCENE-1522 already, but is there is any recent activity?

My hope is that since (at least in my test) search code seems to spend 80% of its time highlighting, folks will take up this banner and do the plumbing needed to improve it - should lead to huge speed-ups for searching! I'm continuing to read, but not really capable of making a meaningful contribution at this point.

-Mike

Reply via email to