Re: SOLR query.

Erik Hatcher Thu, 05 Mar 2009 10:15:40 -0800


On Mar 5, 2009, at 1:07 PM, Suryasnat Das wrote:

I have some queries on SOLR for which I need immediate resolution. Fast
help would be greatly appreciated.
a.) We know that fields are also indexed. So can we index some specific fields (like author, id, etc.) first and then do the indexing for the rest of the
fields (like creation date etc.) at a later time?

You have to reindex the entire document in order to add fields to it, but you certainly can do so at any time. In other words, you can't just add fields to an existing document without sending in all the fields you want on that document.
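As a rough illustration of resending the whole document, here is a minimal Python sketch that merges the fields indexed earlier with the new ones and builds a Solr XML <add> message. The function name and the sample field names and values are hypothetical, and it assumes you still have the original field values on hand from your own source data.

```python
import xml.etree.ElementTree as ET


def build_add_message(existing_fields, new_fields):
    """Return a Solr XML <add> message containing ALL fields of one document.

    existing_fields: values sent when the document was first indexed
                     (assumed to be available from the source system).
    new_fields:      fields being added later, e.g. a creation date.
    """
    # Solr replaces the whole document, so old and new fields go together.
    merged = dict(existing_fields)
    merged.update(new_fields)

    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in merged.items():
        field = ET.SubElement(doc, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical document: id and author were indexed first.
    print(build_add_message(
        {"id": "doc1", "author": "A. Writer"},
        {"creation_date": "2009-03-05T00:00:00Z"},
    ))
```

The resulting XML would then be posted to the Solr update handler, followed by a commit.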

b.) SOLR returns the whole text content of a file during a search operation. So how can we extract a portion of the whole content? I mean a snippet of the content containing that search keyword. Sample code would be of great
help.

Use Solr's highlighting capabilities: <http://wiki.apache.org/solr/HighlightingParameters>
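Since sample code was asked for, here is a minimal Python sketch of a search request with highlighting switched on. The hl, hl.fl, hl.snippets and hl.fragsize parameters are the ones described on that wiki page; the base URL, the field name "content" and the function name are assumptions for illustration, and the highlighted field needs to be stored for snippets to come back.

```python
from urllib.parse import urlencode


def highlight_query_url(base_url, keyword, field="content",
                        snippets=1, fragsize=100):
    """Build a Solr select URL that asks for highlighted snippets.

    base_url and field are placeholders; adjust them to your own setup.
    """
    params = [
        ("q", keyword),             # the search keyword
        ("hl", "true"),             # turn highlighting on
        ("hl.fl", field),           # field(s) to take snippets from
        ("hl.snippets", snippets),  # max snippets per field
        ("hl.fragsize", fragsize),  # approximate snippet size in characters
    ]
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    # Assumed local Solr instance; not taken from the original thread.
    print(highlight_query_url("http://localhost:8983/solr/select", "lucene"))
```

The snippets come back in a separate highlighting section of the response, keyed by document id, rather than inside the document fields themselves.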

c.) What is multi core indexing?

Separate Solr/Lucene indexes that are all served from a single instance of Solr.

d.) What is the number of index files that are normally created in an index
operation?

Depends on the number of fields, and how you have the index configuration set. If file handles ever become a problem you can set it to use the compound file format, but in practice I've never seen it be a problem.

What will be the expected number of index files when I index 4
terabytes of file data, and what will be the index size for all the index files? If anybody has worked on such a huge volume of data then some pointers
would be of great help.

The rule of thumb is that a Lucene index is roughly 35% the size of the original text, assuming you are not storing the fields in Lucene, but only indexing them.
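Applied to the 4 terabytes in question, that rule of thumb works out to roughly 1.4 terabytes of index. A small Python sketch of the arithmetic follows; the function name is made up for illustration, and the 35% ratio is only the rough estimate above for indexed, non-stored text, not a guarantee.

```python
def estimate_index_size(original_size, ratio=0.35):
    """Rough index size estimate, in the same unit as original_size.

    ratio=0.35 is the rule of thumb for indexed-but-not-stored text;
    real indexes vary with analysis settings and the data itself.
    """
    if original_size < 0:
        raise ValueError("original_size must be non-negative")
    return original_size * ratio


if __name__ == "__main__":
    size_tb = 4  # terabytes of original text, from the question
    print(f"Estimated index size: about {estimate_index_size(size_tb):.1f} TB")
```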


        Erik
