But again, why someone has OOM??? I never had... What I discovered is: committing millions docs (in SOLR-1.4) may take several days (although adding docs takes a day) if you have somehow _many_segments_ and bad I/O with <= 2 CPUs; I am using heavy ramBufferSizeMB instead of heavy mergeFactor, and quad cores...
Yes, I am using SolrJ with binary format. 20 minutes to commit millions of docs (including overwrites of existing ones with same uniqueId); I usually have 2 segments (>10 Gb each) -Fuad http://www.casaGURU.com ========= If you're using SolrJ, it's due to improvements there too: 1) binary format by default - no XML parsing 2) not used by default, but try using StreamingUpdateSolrServer -Yonik http://www.lucidimagination.com > Bill in most cases you probably cannot do one large commit as you will > hit OOM. How many documents can be uncommitted is based on the size of > the documents. Committing every document is slow. I have done a commit > every 10,000 mostly. Results may vary. Someone might have a better > answer then me.