ConcurrentUpdateSolrClient vs CloudSolrClient for bulk update to SolrCloud

2016-01-12 Thread Shivaji Dutta
updates. What is the recommended API for updating large amounts of documents with higher throughput rate. Thanks, Shivaji Dutta

Re: Pro and cons of using Solr Cloud vs standard Master Slave Replica

2016-01-13 Thread Shivaji Dutta
- SolrCloud uses zookeeper to manage HA - Zookeeper is a standard for all HA in Apache Hadoop - You have collections which will manage your shards across nodes - SolrJ Client is now fault tolerant with CloudSolrClient This is the way future direction of the product will go. On 1/13/16,

Re: ConcurrentUpdateSolrClient vs CloudSolrClient for bulk update to SolrCloud

2016-01-13 Thread Shivaji Dutta
shards. Not so with CUSC. > >FWIW, >Erick > >On Tue, Jan 12, 2016 at 8:06 PM, Shawn Heisey wrote: >> On 1/12/2016 7:42 PM, Shivaji Dutta wrote: >>> Now since with ConcurrentUdateSolrClient I am able to use a queue and >>>a pool of threads, which makes it more

Re: ConcurrentUpdateSolrClient vs CloudSolrClient for bulk update to SolrCloud

2016-01-14 Thread Shivaji Dutta
orks.com/blog/2015/10/05/really-batch-updates-solr-2/ >I was indexing 11M Wiki articles n about 10 minutes on some tests >recently. Solr can scale that close to linearly with more shards and >enough indexing clients. Is it really worth the effort of using Lucene? > >FWIW, >Erick > &

Solr Query Tuning

2016-01-14 Thread Shivaji Dutta
Team Thanks for all the help before. Current State I am working with a customer that has about a billion documents on 20 shards. The documents are extremely small about 100 characters each. The insert rate is pretty good, but they are trying to fetch the document by using SolrJ SolrQuery Solr