Indexing rates depend heavily on document size (text) and pre-indexing
processing. Other things probably matter, too, like number of fields.

My application is indexing 20X faster than Christian's, because I have
small documents (a few hundred bytes) that are extracted from an RDBMS
and submitted in Solr's XML format.

I am probably seeing something close to the maximum rate at 250 docs/s.
This is on a dual-CPU 3 GHz Xeon, Fedora Core 4, JDK 1.5. A fast RAID
would probably make it go faster, but that is about the only speedup
I can think of.

This has been discussed before, so check the mailing list archives.

wunder

On 2/20/07 2:58 AM, "Burkamp, Christian" <[EMAIL PROTECTED]> wrote:

> I do agree. There's probably no need to go to the index directly.
> My current solr test server has more than 5M documents and a size of about
> 60GB.
> I still index at 13 docs per second and this still includes filtering of the
> documents.
> (If you have your content ready in XML format performance will be even
> better).
> It seems to me that indexing performance does not drop as the index increases.
> Optimizing the index although does take huge amounts of time for large
> indexes.
> 
> --Christian
> 
> -----Ursprüngliche Nachricht-----
> Von: Erik Hatcher [mailto:[EMAIL PROTECTED]
> Gesendet: Dienstag, 20. Februar 2007 11:43
> An: solr-user@lucene.apache.org
> Betreff: Re: solr performance
> 
> 
> You could build your index using Lucene directly and then point a
> Solr instance at it once its built.  My suspicion is that the
> overhead of forming a document as an XML string and posting to Solr
> via HTTP won't be that much different than indexing with Lucene
> directly.
> 
> My largest Solr index is currently at 1.4M and it takes a max of 3ms
> to add a document (according to Solr's console), most of them 1ms.
> My single threaded indexer is indexing around 1000 documents per
> minute, but I think I can get this number even faster by
> parallelizing the indexer.
> 
> I'm curious what rates others are indexing at ???
> 
> Erik
> 
> 
> 
> On Feb 20, 2007, at 2:21 AM, Jack L wrote:
> 
>> Hello,
>> 
>> I have a question about solr's performance of accepting inserts and
>> indexing. If I have 10 million documents that I'd like to index, I
>> suppose it will take some time to submit them to solr. Is there any
>> faster way to do this than through the web interface?
>> 
>> --
>> Best regards,
>> Jack
>> 
>> __________________________________________________
>> Do You Yahoo!?
>> Tired of spam?  Yahoo! Mail has the best spam protection around
>> http://mail.yahoo.com
> 

Reply via email to