On Tue, Jan 12, 2010 at 2:53 PM, Smith G wrote:
> 4) queuesize parameter of Streaming constructer: What could be the
> rough-value when it comes
> to real time application having a million+ documents to be indexed ? ..
> So what does "queuesize" is exactly for ? , if we can go on
> addin
Hello,
Yeah, to be brief.. I wanted to read documents and update them
simoultaneously with different threads. Main issue I considered is To
call add / commit for " how many " documents, because I can not keep
adding millions of documents one after another to
StreamingUpdateSolrServer by just
On Tue, Jan 12, 2010 at 1:09 PM, Smiley, David W. wrote:
> The beauty of StreamingUpdateSolrServer is that you don't have to worry about
> batch sizes; it streams them all. Just keep calling add() with one document
> and it'll get enqueued. You can pass a collection but there's no performance
The beauty of StreamingUpdateSolrServer is that you don't have to worry about
batch sizes; it streams them all. Just keep calling add() with one document
and it'll get enqueued. You can pass a collection but there's no performance
benefit.
StreamingUpdateSolrServer can be configured to use mu
Hello ,
I am using add() method which receives Collection of
SolrInputDocuments instead of add() which receives a single document.
I am afraid, is sending a group of documents being called as
"batching" in Solr terminology? . If yes, then I am doing it ( by
including additional logic i
On Tue, Jan 12, 2010 at 3:48 AM, Smith G wrote:
> Hello All,
> I am trying to find a better approach ( perfomance wise
> ) to index documents. Document count is approximately a million+.
> First, I thought of writing multiple threads using
> CommonsHttpSolrServer to submit documents.
2) Also, is CommonsHttpSolrServer thread safe?
it is only if you initialize it with the MultiThreadedHttpConnectionManager:
http://hc.apache.org/httpclient-3.x/apidocs/org/apache/commons/httpclient/MultiThreadedHttpConnectionManager.html
Cheers,
Chantal