Re: Using another way instead of DIH

2013-04-26 Thread Shawn Heisey
On 4/25/2013 9:00 AM, xiaoqi wrote: > i using DIH to build index is slow , when it fetch 2 million rows , it will > spend 20 minutes , very slow. If it takes 20 minutes for two million records, I'd say it's working very well. I do six simultaneous MySQL imports of 13 million records each. It ta

RE: Using another way instead of DIH

2013-04-26 Thread Dyer, James
batch the updates so everything can fit in-memory. Doing it this way, the SolrJ code was very straightforward and quick & easy to write. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: xiaoqi [mailto:belivexia...@gmail.com] Sent: Friday, April 26, 2013 5:10

RE: Using another way instead of DIH

2013-04-26 Thread Dyer, James
yes, I misspoke. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: xiaoqi [mailto:belivexia...@gmail.com] Sent: Thursday, April 25, 2013 8:37 PM To: solr-user@lucene.apache.org Subject: RE: Using another way instead of DIH Thanks for help . "data-config.xml

Re: Using another way instead of DIH

2013-04-26 Thread xiaoqi
below is my data-import.xml any suggestion ?

Re: Using another way instead of DIH

2013-04-26 Thread Majirus FANSI
Hi, It simply means the configuration file of your DIH. Cheers On 26 April 2013 03:37, xiaoqi wrote: > Thanks for help . > > "data-config.xml" ? i can not find this file , u mean data-import.xml or > solrconfig.xml ? > > > > > > -- > View this message in context: > http://lucene.472066.n3.nabb

RE: Using another way instead of DIH

2013-04-25 Thread xiaoqi
Thanks for help . "data-config.xml" ? i can not find this file , u mean data-import.xml or solrconfig.xml ? -- View this message in context: http://lucene.472066.n3.nabble.com/Using-another-way-instead-of-DIH-tp4058937p4059067.html Sent from the Solr - User mailing list archive at Nabble.co

RE: Using another way instead of DIH

2013-04-25 Thread Dyer, James
If you post your data-config.xml here, someone might be able to find something you could change to speed things up. If the issue is parallelization, then you could possibly partition your data somehow and then run multiple DIH request handlers at the same time. This might be easier than writin