RE: frequency of commit when building index from scratch

2009-08-25 Thread Fuad Efendi
But again, why someone has OOM??? I never had... What I discovered is: committing millions docs (in SOLR-1.4) may take several days (although adding docs takes a day) if you have somehow _many_segments_ and bad I/O with <= 2 CPUs; I am using heavy ramBufferSizeMB instead of heavy mergeFactor, and

Re: frequency of commit when building index from scratch

2009-08-25 Thread Yonik Seeley
On Tue, Aug 25, 2009 at 8:37 PM, Lance Norskog wrote: > The latest Solr 1.4 can index 200k records in several minutes, then commit > in a few seconds. I don't know but I'm guessing it is due to Lucene > improvements. It does not use much memory doing this. If you're using SolrJ, it's due to improv

Re: frequency of commit when building index from scratch

2009-08-25 Thread Lance Norskog
, millions of small docs... it takes 20 minutes in > average... why OOM? I see only reduced I/O... > > > -Original Message- > From: Edward Capriolo [mailto:edlinuxg...@gmail.com] > Sent: August-25-09 5:35 PM > To: solr-user@lucene.apache.org > Subject: Re: frequen

RE: frequency of commit when building index from scratch

2009-08-25 Thread Fuad Efendi
I do commit once a day, millions of small docs... it takes 20 minutes in average... why OOM? I see only reduced I/O... -Original Message- From: Edward Capriolo [mailto:edlinuxg...@gmail.com] Sent: August-25-09 5:35 PM To: solr-user@lucene.apache.org Subject: Re: frequency of commit when

Re: frequency of commit when building index from scratch

2009-08-25 Thread Bill Au
That's my gut feeling (start big and go lower if OOM occurs) too. Bill On Tue, Aug 25, 2009 at 5:34 PM, Edward Capriolo wrote: > On Tue, Aug 25, 2009 at 5:29 PM, Bill Au wrote: > > Just curious, how often do folks commit when building their Solr/Lucene > > index from scratch for index with milli

Re: frequency of commit when building index from scratch

2009-08-25 Thread Edward Capriolo
On Tue, Aug 25, 2009 at 5:29 PM, Bill Au wrote: > Just curious, how often do folks commit when building their Solr/Lucene > index from scratch for index with millions of documents?  Should I just wait > and do a single commit at the end after adding all the documents to the > index? > > Bill > Bil