Mike

I would be very interested in the answer to that question too. My hunch is that 
the answer is no too. I have a few text databases that range from 200MB to 
about 60GB with which I could run some tests. I will have some downtime in 
early July and will post results.

From what I can tell the Guardian newspaper is doing just that:

        
http://www.guardian.co.uk/open-platform/blog/what-is-powering-the-content-api
        
http://www.lucidimagination.com/blog/2010/04/29/for-the-guardian-solr-is-the-new-database/

Cheers

François


On Jun 20, 2011, at 9:05 AM, Mike Sokolov wrote:

> I'd be very interested in this, as well, if you do it before me and are 
> willing to share...
> 
> A related question I have tried to ask on this list, and have never really 
> gotten a good answer to, is whether it makes sense to just chuck the external 
> storage and treat the lucene index as the primary storage for documents.  I 
> have a feeling the answer is no; perhaps because of increased I/O costs for 
> lucene and solr, but I don't really know.  I've been considering doing some 
> experimentation, but would really love an expert opinion...
> 
> -Mike
> 
> On 06/20/2011 08:41 AM, Jamie Johnson wrote:
>> I am trying to index data where I'm concerned that storing the contents of a
>> specific field will be a bit of a hog so we are planning to retrieve this
>> information as needed for highlighting from an external source.  I am
>> looking to extend the default solr highlighting capability to work with
>> information pulled from this external source and it looks like this is
>> possible by extending DefaultSolrHighlighter (line 418 to pull a particular
>> field from external source) for standard highlighting and
>> BaseFragmentsBuilder (line 99) for FastVectorHighlighter.  I could just hard
>> code this to say if the field name is a specific value look into the
>> external source, is this the best way to accomplish this?  Are there any
>> other extension points to do what I'm suggesting?
>> 
>>   

Reply via email to