updates.
What is the recommended API for updating large amounts of documents with higher
throughput rate.
Thanks,
Shivaji Dutta
- SolrCloud uses zookeeper to manage HA
- Zookeeper is a standard for all HA in Apache Hadoop
- You have collections which will manage your shards across nodes
- SolrJ Client is now fault tolerant with CloudSolrClient
This is the way future direction of the product will go.
On 1/13/16,
shards. Not so with CUSC.
>
>FWIW,
>Erick
>
>On Tue, Jan 12, 2016 at 8:06 PM, Shawn Heisey wrote:
>> On 1/12/2016 7:42 PM, Shivaji Dutta wrote:
>>> Now since with ConcurrentUdateSolrClient I am able to use a queue and
>>>a pool of threads, which makes it more
orks.com/blog/2015/10/05/really-batch-updates-solr-2/
>I was indexing 11M Wiki articles n about 10 minutes on some tests
>recently. Solr can scale that close to linearly with more shards and
>enough indexing clients. Is it really worth the effort of using Lucene?
>
>FWIW,
>Erick
>
&
Team
Thanks for all the help before.
Current State
I am working with a customer that has about a billion documents on 20 shards.
The documents are extremely small about 100 characters each.
The insert rate is pretty good, but they are trying to fetch the document by
using SolrJ SolrQuery
Solr