Re: data import extremely slow

2015-11-07 Thread Yangrui Guo
Thanks for your kind reply. I tried using SqlEntityProcessor and setting batchSize to -1, but didn't get any improvement. It'd be helpful if I could see the data import handler's log. On Saturday, November 7, 2015, Alexandre Rafalovitch wrote: > LoL. Of course I meant SolrJ. I had to misspell the most

Re: data import extremely slow

2015-11-07 Thread Alexandre Rafalovitch
LoL. Of course I meant SolrJ. I had to misspell the most important word of the hundreds I wrote in this thread :-) Thank you Erick for the correction. Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 7 November 2015 at 19:18, Erick Erickson wro

Re: data import extremely slow

2015-11-07 Thread Erick Erickson
Alexandre, did you mean SolrJ? Here's a way to get started https://lucidworks.com/blog/2012/02/14/indexing-with-solrj/ Best, Erick On Sat, Nov 7, 2015 at 2:22 PM, Alexandre Rafalovitch wrote: > Have you thought of just using Solr. Might be faster than troubleshooting > DIH for complex scenarios

Re: data import extremely slow

2015-11-07 Thread Alexandre Rafalovitch
Have you thought of just using Solr. Might be faster than troubleshooting DIH for complex scenarios. On 7 Nov 2015 3:39 pm, "Yangrui Guo" wrote: > I found multiple strange things besides the slowness. I performed count(*) > in MySQL but only one-fifth of the records were imported. Also sometimes

Re: data import extremely slow

2015-11-07 Thread Yangrui Guo
I found multiple strange things besides the slowness. I performed count(*) in MySQL but only one-fifth of the records were imported. Also sometimes dataimporthandler either doesn't import at all or only imports a portion of the table. How can I debug the importer? On Saturday, November 7, 2015, Y

Re: data import extremely slow

2015-11-07 Thread Yangrui Guo
I just realized that not everything was OK. Three child entities were not imported. I had set batchSize to -1, but Solr got stuck again :( On Fri, Nov 6, 2015 at 3:11 PM, Yangrui Guo wrote: > Thanks for the reply. I just removed CacheKeyLookUp and CachedKey and used > WHERE clause instead. Everythi

Re: data import extremely slow

2015-11-06 Thread Yangrui Guo
Thanks for the reply. I just removed CacheKeyLookUp and CachedKey and used WHERE clause instead. Everything works fine now. Yangrui On Friday, November 6, 2015, Shawn Heisey wrote: > On 11/6/2015 10:32 AM, Yangrui Guo wrote: > > > There's a good chance that JDBC is trying to read the entire re

Re: data import extremely slow

2015-11-06 Thread Shawn Heisey
On 11/6/2015 10:32 AM, Yangrui Guo wrote: > http://wiki.apache.org/solr/DataImportHandlerFaq?highlight=%28mysql%29|%28batchsize%29#I.27m_using_DataImportHandler_with_a_MySQL_database._My_table_is_huge_and_DataImportHandler_is_going_out_of_memory._Why_does_DataImportHandler_bring_everything_to_memor

data import extremely slow

2015-11-06 Thread Yangrui Guo
Hi, I'm using Solr's data import handler and MySQL 5.5 to index the IMDb database. However, the data import takes a few minutes to process one document, and there are over 3 million movies. This is going to take forever, yet I can select the rows in MySQL in no time. What am I doing wrong? My data-conf