RE: Solr index - Size and indexing speed

2009-08-29 Thread Fuad Efendi
>I tried to merge the 15 indexes again, and I found out that the new merged >index (without opitmization) size was about 351 GB , but when I optimize it >the size return back to 411 GB, Why? Just as a sample, IOT in Oracle... Ok, just in a kids-lang, what 'optimization' means? It means that Ma

Impact of compressed=true attribute (in schema.xml) on Indexing/Query

2009-08-29 Thread Silent Surfer
Hi, We observed that when we use the setting "compressed=true" the index size is around 0.66 times the actual log file, where as if we do not use any compressed=true setting, the index size is almost as much as 2.6 times. Our sample solr document size is approximately 1000 bytes. In addition to

indexing of documents

2009-08-29 Thread manishkbawne
I am trying to index pdf and other documents. but got this error:- java.lang.ClassCastException: org.apache.solr.handler.extraction.ExtractingRequestHandler cannot be cast to org.apache.solr.request.SolrRequestHandler at org.apache.solr.core.RequestHandlers$1.create(Req

Re: Solr index - Size and indexing speed

2009-08-29 Thread Yonik Seeley
On Tue, Aug 25, 2009 at 3:30 PM, engy.ali wrote: > >  Summary > === > > I had about 120,000 object of total size 71.2 GB, those objects are already > indexed using Lucene. The index size is about 111 GB. > > I tried to use solr 1.4 nightly build to index the same collection. I > divided

Re: Solr index - Size and indexing speed

2009-08-29 Thread Yonik Seeley
On Sat, Aug 29, 2009 at 7:09 AM, engy.ali wrote: > I thought that optimization would decrease or at least be equal to the same > index size before optimization Some index structures like norms are non-sparse. Index one unique field with norms and there is a byte allocated for every document in th

RE: Solr index - Size and indexing speed

2009-08-29 Thread engy.ali
Hi, Thanks for your reply. I will work on your suggestion for using only one solr instance. I tried to merge the 15 indexes again, and I found out that the new merged index (without opitmization) size was about 351 GB , but when I optimize it the size return back to 411 GB, Why? I thought tha