Re: Indexing performance with solrj vs. direct lucene API

2012-11-29 Thread Mark Bennett
Hi Robert, SolrJ is sending data over a socket so that might explain some of the lag. Are is your SolrJ app and the Solr server running on the same physical machine? I thought Mark M's idea sounded good. One other idea: When initializing SolrJ's connection for normal searching you probably use

Re: Indexing performance with solrj vs. direct lucene API

2012-11-28 Thread Mark Miller
One difference is that Solr will call update rather than add by default. If you are willing to ensure unique id's, you can specify overwrite=false (I think thats the one) and it will use add instead. - Mark On Wed, Nov 28, 2012 at 1:02 PM, Robert Stewart wrote: > I have a project where I am port

Indexing performance with solrj vs. direct lucene API

2012-11-28 Thread Robert Stewart
I have a project where I am porting existing application from direct Lucene API usage to using SOLR and SOLRJ client API. The problem I have is that indexing is 2-5x slower using SOLRJ+SOLR than using direct Lucene API. I am creating batches of documents between 200 and 500 documents per call to