I'd be very interested in this, as well, if you do it before me and are
willing to share...
A related question I have tried to ask on this list, and have never
really gotten a good answer to, is whether it makes sense to just chuck
the external storage and treat the lucene index as the primary storage
for documents. I have a feeling the answer is no; perhaps because of
increased I/O costs for lucene and solr, but I don't really know. I've
been considering doing some experimentation, but would really love an
expert opinion...
-Mike
On 06/20/2011 08:41 AM, Jamie Johnson wrote:
I am trying to index data where I'm concerned that storing the contents of a
specific field will be a bit of a hog so we are planning to retrieve this
information as needed for highlighting from an external source. I am
looking to extend the default solr highlighting capability to work with
information pulled from this external source and it looks like this is
possible by extending DefaultSolrHighlighter (line 418 to pull a particular
field from external source) for standard highlighting and
BaseFragmentsBuilder (line 99) for FastVectorHighlighter. I could just hard
code this to say if the field name is a specific value look into the
external source, is this the best way to accomplish this? Are there any
other extension points to do what I'm suggesting?