Thanks Faud,

Have started working optimizing my Database structure, since the tables are 
huge 
in terms of records, optimization is taking time. 

Will update the results when complete.

Regards,
Rohit



________________________________
From: Fuad Efendi <f...@efendi.ca>
To: "Solr-User@Lucene. Org" <solr-user@lucene.apache.org>
Sent: Sun, 5 June, 2011 10:05:22 AM
Subject: Re: URGENT HELP: Improving Solr indexing time

Hi Rohit,

I am currently working on https://issues.apache.org/jira/browse/SOLR-2233
which fixes multithreading issues

How complex is your dataimport schema? SOLR-2233 (multithreading, better
connection handling) improves performance... Especially if SQL is
extremely complex and uses few long-running CachedSqlEntityProcessors and
etc.

Also, check your SQL and indexes, in most cases you can _significantly_
improve performance by simply adding appropriate (for your specific SQL)
indexes. I noticed that even very experienced DBAs sometimes create index
<KEY1, KEY2>, and developer executes query "WHERE KEY2=? ORDER BY KEY1" -
check everything...

Thanks,


-- 
Fuad Efendi
416-993-2060
Tokenizer Inc., Canada
Data Mining, Search Engines
http://www.tokenizer.ca <http://www.tokenizer.ca/>







On 11-06-05 12:09 AM, "Rohit Gupta" <ro...@in-rev.com> wrote:

>No didn't double post, my be it was in my outbox and went out again.
>
>The queries outside solr dont take so long, to return around 500000 rows
>it 
>takes 250 seconds, so I am doing a delta import of around 500,000 rows at
>a 
>time. I have tried turning auto commit  on and things are moving a bit
>faster 
>now. Are there any more tweeking i can do?
>
>Also, planning to move to master-salve model, but am failing to
>understand where 
>to start exactly. 
>
>Regards,
>Rohit
>
>
>
>________________________________
>From: lee carroll <lee.a.carr...@googlemail.com>
>To: solr-user@lucene.apache.org
>Sent: Sun, 5 June, 2011 4:59:44 AM
>Subject: Re: URGENT HELP: Improving Solr indexing time
>
>Rohit - you have double posted maybe - did Otis's answer not help with
>your issue or at least need a response to clarify ?
>
>On 4 June 2011 22:53, Chris Cowan <chrisco...@plus3network.com> wrote:
>> How long does the query against the DB take (outside of Solr)? If
>>that's slow 
>>then it's going to take a while to update the index. You might need to
>>figure a 
>>way to break things up a bit, maybe use a delta import instead of a full
>>import.
>>
>> Chris
>>
>> On Jun 4, 2011, at 6:23 AM, Rohit Gupta wrote:
>>
>>> My Solr server takes very long to update index. The table it hits to
>>>index is
>>> huge with 10Million + records , but even in that case I feel this is
>>>very 
>long
>>> time to index. Below is the snapshot of the /dataimport page
>>>
>>> <str name="status">busy</str>
>>> <str name="importResponse">A command is still running...</str>
>>> <lst name="statusMessages">
>>> <str name="Time Elapsed">1:53:39.664</str>
>>> <str name="Total Requests made to DataSource">16276</str>
>>> <str name="Total Rows Fetched">24237</str>
>>> <str name="Total Documents Processed">16273</str>
>>> <str name="Total Documents Skipped">0</str>
>>> <str name="Full Dump Started">2011-06-04 11:25:26</str>
>>> </lst>
>>>
>>> How can i determine why this is happening and how can I improve this.
>>>During 
>>>all
>>> our test on the local server before the migration we could index 5
>>>million
>>> records in 4-5 hrs, but now its taking too long on the live server.
>>>
>>> Regards,
>>> Rohit
>>
>>

Reply via email to