But again, why someone has OOM??? I never had...

What I discovered is: committing millions docs (in SOLR-1.4) may take
several days (although adding docs takes a day) if you have somehow
_many_segments_ and bad I/O with <= 2 CPUs; I am using heavy ramBufferSizeMB
instead of heavy mergeFactor, and quad cores...


Yes, I am using SolrJ with binary format. 20 minutes to commit millions of
docs (including overwrites of existing ones with same uniqueId); I usually
have 2 segments (>10 Gb each)
-Fuad
http://www.casaGURU.com
=========


If you're using SolrJ, it's due to improvements there too:
1) binary format by default - no XML parsing
2) not used by default, but try using StreamingUpdateSolrServer

-Yonik
http://www.lucidimagination.com


> Bill in most cases you probably cannot do one large commit as you will 
> hit OOM. How many documents can be uncommitted is based on the size of 
> the documents. Committing every document is slow. I have done a commit 
> every 10,000 mostly. Results may vary. Someone might have a better 
> answer then me.



Reply via email to