Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-23 Thread Mark Miller
On Mar 23, 2012, at 12:49 PM, I-Chiang Chen wrote: > Caused by: java.lang.OutOfMemoryError: Map failed Hmm...looks like this is the key info here. - Mark Miller lucidimagination.com

Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-23 Thread I-Chiang Chen
We saw couple distinct errors and all machines in a shard is identical: -On the leader of the shard Mar 21, 2012 1:58:34 AM org.apache.solr.common.SolrException log SEVERE: shard update error StdNode: http://blah.blah.net:8983/solr/master2-slave1/:org.apache.solr.common.SolrException: Map failed a

Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-23 Thread Markus Jelsma
We did some tests too with many millions of documents and auto-commit enabled. It didn't take long for the indexer to stall and in the meantime the number of open files exploded, to over 16k, then 32k. On Friday 23 March 2012 12:20:15 Mark Miller wrote: > What issues? It really shouldn't be a pr

Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-23 Thread Mark Miller
What issues? It really shouldn't be a problem. On Mar 22, 2012, at 11:44 PM, I-Chiang Chen wrote: > At this time we are not leveraging the NRT functionality. This is the > initial data load process where the idea is to just add all 200 millions > records first. Than do a single commit at the e

Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-22 Thread I-Chiang Chen
At this time we are not leveraging the NRT functionality. This is the initial data load process where the idea is to just add all 200 millions records first. Than do a single commit at the end to make them searchable. We actually disabled auto commit at this time. We have tried to leave auto commi

Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-22 Thread Mark Miller
On Mar 21, 2012, at 9:37 PM, I-Chiang Chen wrote: > We are currently experimenting with SolrCloud functionality in Solr 4.0. > The goal is to see if Solr 4.0 trunk with is current state is able to > handle roughly 200million documents. The document size is not big around 40 > fields no more than