I do agree. There's probably no need to go to the index directly.
My current solr test server has more than 5M documents and a size of about 60GB.
I still index at 13 docs per second and this still includes filtering of the 
documents.
(If you have your content ready in XML format performance will be even better).
It seems to me that indexing performance does not drop as the index increases.
Optimizing the index although does take huge amounts of time for large indexes.

--Christian

-----Ursprüngliche Nachricht-----
Von: Erik Hatcher [mailto:[EMAIL PROTECTED] 
Gesendet: Dienstag, 20. Februar 2007 11:43
An: solr-user@lucene.apache.org
Betreff: Re: solr performance


You could build your index using Lucene directly and then point a  
Solr instance at it once its built.  My suspicion is that the  
overhead of forming a document as an XML string and posting to Solr  
via HTTP won't be that much different than indexing with Lucene  
directly.

My largest Solr index is currently at 1.4M and it takes a max of 3ms  
to add a document (according to Solr's console), most of them 1ms.   
My single threaded indexer is indexing around 1000 documents per  
minute, but I think I can get this number even faster by  
parallelizing the indexer.

I'm curious what rates others are indexing at ???

        Erik



On Feb 20, 2007, at 2:21 AM, Jack L wrote:

> Hello,
>
> I have a question about solr's performance of accepting inserts and 
> indexing. If I have 10 million documents that I'd like to index, I 
> suppose it will take some time to submit them to solr. Is there any 
> faster way to do this than through the web interface?
>
> --
> Best regards,
> Jack
>
> __________________________________________________
> Do You Yahoo!?
> Tired of spam?  Yahoo! Mail has the best spam protection around 
> http://mail.yahoo.com

Reply via email to