On Jun 12, 2007, at 8:51 AM, Ard Schrijvers wrote:
Is it possible to configure Solr to store the document URI in the Lucene index (the URI is not an XML field, just the document's location)?

Yes. Set the field to be stored and non-indexed; field type "string" is what I use.
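
As a rough sketch, a stored, non-indexed field declaration in schema.xml could look like the fragment below. The field name "uri" is only a placeholder for illustration, and it assumes a "string" field type is already defined in your schema, as it is in the example schema that ships with Solr.

```xml
<!-- Hypothetical field name "uri": returned with search results (stored),
     but not searchable (not indexed). -->
<field name="uri" type="string" indexed="false" stored="true"/>
```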

Or is everybody used to storing the contents of a document in the Lucene index (doesn't this imply a much larger index, though?), so that instead of retrieving the document's content through a separate fetch over HTTP or the filesystem, you just show the result from the stored content field?

This all depends on the needs of your project. It's perfectly fine to store the text outside of the index, and that is really how it has to be done for very large indexes, where as few fields as possible are "stored".

If you're also asking about Solr fetching the remote resource, that is a different story altogether, and no, it does not do that. [With the streaming capability you can feed in a document entirely from a URL, though I haven't experimented with that feature myself yet.]

        Erik
