Martin,
You may want to follow Mark Miller's effort at
https://issues.apache.org/jira/browse/LUCENE-1286 as it develops --
perhaps even help with it. He's developing a Lucene highlighter which
would "run through query terms by using their offsets", making
highlighting large documents much more t
I tried to post it myself but got the address wrong; thanks for re-posting.
The problem we have with highlighting outside of the indexer is that the
systems we use that store co-ords are based on the term string (in one
case) and the specific term offset (in another), both of which break
horribly when t