Kalyan,

Are you using SolrJ? If so, use StreamingUpdateSolrServer; it's nice and fast. Alternatively, start multiple indexing threads (match the number of CPU cores on the Solr server) and index from those. Send docs in batches, not one by one. Don't commit or optimize until you are done.
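To make the threads-plus-batches advice concrete, here is a rough sketch of the pattern. It deliberately does not use SolrJ: `DocSink`, `CountingSink`, and `BatchIndexer` are hypothetical names made up for illustration, and documents are plain strings, so it runs without a Solr server. In a real indexer I would expect `add` and `commit` to be backed by your SolrJ client's batch add and commit calls.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical stand-in for a Solr client; this is NOT a SolrJ type.
interface DocSink {
    void add(List<String> batch); // send one batch of documents
    void commit();                // make everything searchable
}

// Sink that only counts calls, so the sketch runs without a Solr server.
class CountingSink implements DocSink {
    final AtomicInteger docs = new AtomicInteger();
    final AtomicInteger batches = new AtomicInteger();
    final AtomicInteger commits = new AtomicInteger();

    public void add(List<String> batch) {
        docs.addAndGet(batch.size());
        batches.incrementAndGet();
    }

    public void commit() {
        commits.incrementAndGet();
    }
}

class BatchIndexer {
    // Splits docs into batches, sends them from a fixed pool of worker
    // threads, and commits exactly once after every batch has been sent.
    static void indexAll(List<String> docs, int batchSize, int threads, DocSink sink) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<?>> pending = new ArrayList<>();
            for (int start = 0; start < docs.size(); start += batchSize) {
                int end = Math.min(start + batchSize, docs.size());
                // Copy the slice so each task owns its batch.
                final List<String> batch = new ArrayList<>(docs.subList(start, end));
                pending.add(pool.submit(() -> sink.add(batch)));
            }
            for (Future<?> f : pending) {
                f.get(); // wait for the batch; rethrows any failure
            }
            sink.commit(); // single commit at the very end
        } catch (Exception e) {
            throw new RuntimeException("indexing failed", e);
        } finally {
            pool.shutdown();
        }
    }
}
```

Something like `BatchIndexer.indexAll(docs, 100, 4, sink)` would send batches of 100 docs from 4 threads. Pick the thread count from the core count of the Solr server, not of the indexing client, and treat the batch size as something to tune by measurement.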
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
> From: "Manepalli, Kalyan" <kalyan.manepa...@orbitz.com>
> To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org>
> Sent: Wednesday, July 1, 2009 3:42:45 PM
> Subject: Tips on speeding the indexing process
>
> Hi,
> I have a very generic question regarding indexing. In my current app, I have about 450,000 docs each doc size around 2k. The total indexing time is around 2hrs.
> Now due to multi language support, the number of documents is increasing to 2.0 million. The total indexing time is exceeding 6 hrs.
> I wanted to know if there are any general tips to speedup the indexing process.
>
> Thanks,
> Kalyan Manepalli