First of all, what I was seeing was different from what I thought I was seeing 
because a few weeks ago I uncommented the <autoCommit> block in the 
solrconfig.xml file and I didn't realize it until yesterday just before I went 
home, so that was controlling the commits more than the add and commit calls 
that I was making. When I commented that block out again, the times for index 
with add(docs, commitWithinMs) and with add(docs) and commit(false, false) were 
very similar. Both of them were about 20 minutes faster (38 minutes instead of 
about an hour) than indexing with <autoCommit> set to commit after every 1,000 
documents or fifteen minutes.

Is this the blog post you are talking about: 
http://www.searchworkings.org/blog/-/blogs/gimme-all-resources-you-have-i-can-use-them!/?
 It seems to be about the right topic.

I am using Solr 3.5. The feature matrix on one of the Lucid Imagination web 
pages says that DocumentWriterPerThread is available in Solr 4.0 and LucidWorks 
2.0. I assume that means LucidWorks Enterprise. Is that right?
Thanks,
Mike

-----Original Message-----
From: Erick Erickson [mailto:erickerick...@gmail.com] 
Sent: Thursday, April 05, 2012 2:45 PM
To: solr-user@lucene.apache.org
Subject: Re: waitFlush and waitSearcher with SolrServer.add(docs, 
commitWithinMs)

Solr version? I suspect your outlier is due to merging segments, if so this 
should have happened quite some time into the run. See Simon Wilnauer's blog 
post on DocumenWriterPerThread (trunk) code.

What commitWithin time are you using?


Best
Erick

On Wed, Apr 4, 2012 at 7:50 PM, Mike O'Leary <tmole...@uw.edu> wrote:
> I am indexing some database contents using add(docs, commitWithinMs), and 
> those add calls are taking over 80% of the time once the database begins 
> returning results. I was wondering if setting waitSearcher to false would 
> speed this up. Many of the calls take 1 to 6 seconds, with one outlier that 
> took over 11 minutes.
> Thanks,
> Mike
>
> -----Original Message-----
> From: Mark Miller [mailto:markrmil...@gmail.com]
> Sent: Wednesday, April 04, 2012 4:15 PM
> To: solr-user@lucene.apache.org
> Subject: Re: waitFlush and waitSearcher with SolrServer.add(docs, 
> commitWithinMs)
>
>
> On Apr 4, 2012, at 6:50 PM, Mike O'Leary wrote:
>
>> If you index a set of documents with SolrJ and use 
>> StreamingUpdateSolrServer.add(Collection<SolrInputDocument> docs, int 
>> commitWithinMs), it will perform a commit within the time specified, and it 
>> seems to use default values for waitFlush and waitSearcher.
>>
>> Is there a place where you can specify different values for waitFlush 
>> and waitSearcher, or if you want to use different values do you have 
>> to call StreamingUpdateSolrServer.add(Collection<SolrInputDocument>
>> docs) and then call StreamingUpdateSolrServer.commit(waitFlush, 
>> waitSearcher) explicitly?
>> Thanks,
>> Mike
>
>
> waitFlush actually does nothing in recent versions of Solr. waitSearcher 
> doesn't seem so important when the commit is not done explicitly by the user 
> or a client.
>
> - Mark Miller
> lucidimagination.com
>
>
>
>
>
>
>
>
>
>
>

Reply via email to