Re: indexing best practices

2010-07-20 Thread Lance Norskog
with using the > "nomerge" merge policy. I hope to have some results to report on our blog > sometime in the next  month or so. > > Tom Burton-West > www.hathitrust.org/blogs > > -Original Message- > From: kenf_nc [mailto:ken.fos...@realestate.com] > Se

RE: indexing best practices

2010-07-19 Thread Burton-West, Tom
che.org Subject: Re: indexing best practices No one has done performance analysis? Or has a link to anywhere where it's been done? basically fastest way to get documents into Solr. So many options available, what's the fastest: 1) file import (xml, csv) vs DIH vs POSTing 2) number

Re: indexing best practices

2010-07-18 Thread Geert-Jan Brits
Have you read: http://wiki.apache.org/lucene-java/ImproveIndexingSpeed http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Scaling-Lucene-and-Solr To be short there are only guidelines (see links) no definitive answers. If you followed the guidelines for improviing indexing sp

Re: indexing best practices

2010-07-18 Thread kenf_nc
No one has done performance analysis? Or has a link to anywhere where it's been done? basically fastest way to get documents into Solr. So many options available, what's the fastest: 1) file import (xml, csv) vs DIH vs POSTing 2) number of concurrent clients 1 vs 10 vs 100 ...is there a di