I found https://issues.apache.org/jira/browse/SOLR-1397, but there is not
much going on there.
LUCENE-1522 <https://issues.apache.org/jira/browse/LUCENE-1522> has a lot
of fascinating discussion on this topic, though.
There are a couple of long-lived issues in JIRA for this (I'd like to
search for them, but I can't access JIRA right now).
FVH would need to be modified at the Lucene level to use external
data.
koji
Koji - is that really so? It appears to me that we could extend
BaseFragmentsBuilder and override
createFragments(IndexReader reader, int docId,
String fieldName, FieldFragList fieldFragList, int maxNumFragments,
String[] preTags, String[] postTags, Encoder encoder )
providing a version that retrieves text from some external source rather
than from Lucene fields.
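
To make the idea concrete, here is a minimal, Lucene-free sketch of the core step: given match offsets that are already known, fetch the text from an external source and wrap the matches in pre/post tags. Everything in it is hypothetical and for illustration only (the class ExternalTextHighlightSketch, the Match type, and the IntFunction standing in for the external store). It is not the BaseFragmentsBuilder API, and a real override would also have to deal with FieldFragList, fragment boundaries and the Encoder.

```java
import java.util.List;
import java.util.Map;
import java.util.function.IntFunction;

public class ExternalTextHighlightSketch {

    /** Hypothetical stand-in for one match's character offsets (not a Lucene class). */
    static final class Match {
        final int start; // inclusive
        final int end;   // exclusive
        Match(int start, int end) { this.start = start; this.end = end; }
    }

    /**
     * Looks up the document text in an external source by docId and wraps
     * each match in preTag/postTag. Matches are assumed to be sorted by
     * start offset and non-overlapping.
     */
    static String highlight(IntFunction<String> externalText, int docId,
                            List<Match> matches, String preTag, String postTag) {
        String text = externalText.apply(docId); // text comes from outside the index
        StringBuilder out = new StringBuilder();
        int cursor = 0;
        for (Match m : matches) {
            out.append(text, cursor, m.start);   // unhighlighted text before the match
            out.append(preTag).append(text, m.start, m.end).append(postTag);
            cursor = m.end;
        }
        out.append(text, cursor, text.length()); // remaining tail
        return out.toString();
    }

    public static void main(String[] args) {
        // An in-memory map plays the role of the external text store.
        Map<Integer, String> store = Map.of(7, "fast vector highlighter");
        List<Match> matches = List.of(new Match(5, 11));
        System.out.println(highlight(id -> store.get(id), 7, matches, "<b>", "</b>"));
        // prints: fast <b>vector</b> highlighter
    }
}
```

The point of the sketch is only that the tagging step needs offsets plus some way to obtain the text; where the text lives is an implementation detail behind the lookup function.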
It sounds to me like a really useful modification in Lucene core would
be to retain match points that have already been computed during scoring
so the highlighter doesn't have to attempt to reinvent all that logic!
This has all been discussed at length in LUCENE-1522 already, but is
there any recent activity?
My hope is that since (at least in my test) search code seems to spend
80% of its time highlighting, folks will take up this banner and do the
plumbing needed to improve it, which should lead to huge speed-ups for
searching! I'm continuing to read, but I'm not really capable of making a
meaningful contribution at this point.
-Mike