Hi,
I'm facing a dilemma of choosing the indexing strategies.
My application architecture is 
 - I have a listing table in my DB
 - For each listing, I have 3 calls to a URL Datasource of different system

 I have 200k records

 Time taken to index 25 docs is 1Minute, so for 200k it might take more than
100hrs :-(?

 
 I know there are lot of factors to consider from Network to DB.
I'm looking for different strategies that we could perform index.

 - Can we run multiple data import handlers? one data-config for first 100k
and second one is for another 100k
 - Would it be possible to write java service using SolrJ and perform
multi-threaded calls to Solr to Index?
 - The URL Datasources i'm using is actually resided in MSSQL database of
different system. Could I be able to fasten indexing time if I just could
use JDBCDataSource that calls DB directly instead through API URL data
source?

Is there any other strategies we could use?

Thank you,
 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexing-strategies-tp4116852.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to