Re: Reindex single shard on solr

2018-12-15 Thread Mahmoud Almokadem
You're right, Erick. For the Hash.murmurhash3_x86_32 method, I don't know whether I should pass my ID directly or in a specific format like '1874f9aa-4cad-4839-a282-d624fe2c40c6!document_id', so I used a predefined method that gets the shard name directly. createCollection method doesn't create a collection phy
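On the question of hashing the bare ID versus the 'prefix!document_id' form, here is a rough, unverified sketch in plain Java with no Solr dependency. It assumes the two-part compositeId scheme works as I understand it: the upper 16 bits of the hash come from the part before the '!', the lower 16 bits from the part after it, and an ID without a '!' is hashed whole with seed 0. The class and method names (CompositeIdHashSketch, murmur3, sliceHash) are hypothetical, the hash is computed over the UTF-8 bytes of the string, and the 'prefix/bits!' and three-level forms are not handled. Solr's own CompositeIdRouter.sliceHash remains the authority; this only illustrates the idea.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical sketch, not Solr code: approximates two-part compositeId hashing.
final class CompositeIdHashSketch {
    private static final int C1 = 0xcc9e2d51;
    private static final int C2 = 0x1b873593;

    /** MurmurHash3 x86 32-bit over the UTF-8 bytes of s, with seed 0. */
    static int murmur3(String s) {
        byte[] data = s.getBytes(StandardCharsets.UTF_8);
        int len = data.length;
        int blocks = len & ~3;          // length rounded down to a multiple of 4
        int h = 0;                      // seed 0 (assumed)

        // Body: mix one little-endian 4-byte block at a time.
        for (int i = 0; i < blocks; i += 4) {
            int k = (data[i] & 0xff)
                  | ((data[i + 1] & 0xff) << 8)
                  | ((data[i + 2] & 0xff) << 16)
                  | ((data[i + 3] & 0xff) << 24);
            k *= C1;
            k = Integer.rotateLeft(k, 15);
            k *= C2;
            h ^= k;
            h = Integer.rotateLeft(h, 13);
            h = h * 5 + 0xe6546b64;
        }

        // Tail: mix the remaining 1 to 3 bytes, if any.
        int rem = len & 3;
        int tail = 0;
        if (rem == 3) tail ^= (data[blocks + 2] & 0xff) << 16;
        if (rem >= 2) tail ^= (data[blocks + 1] & 0xff) << 8;
        if (rem >= 1) {
            tail ^= (data[blocks] & 0xff);
            tail *= C1;
            tail = Integer.rotateLeft(tail, 15);
            tail *= C2;
            h ^= tail;
        }

        // Finalization (avalanche).
        h ^= len;
        h ^= h >>> 16;
        h *= 0x85ebca6b;
        h ^= h >>> 13;
        h *= 0xc2b2ae35;
        h ^= h >>> 16;
        return h;
    }

    /** Hash for a plain ID or a two-part "prefix!rest" ID (assumed 16/16 bit split). */
    static int sliceHash(String id) {
        int sep = id.indexOf('!');
        if (sep < 0) {
            return murmur3(id);         // no prefix: hash the whole ID
        }
        int prefixHash = murmur3(id.substring(0, sep));
        int restHash = murmur3(id.substring(sep + 1));
        return (prefixHash & 0xFFFF0000) | (restHash & 0x0000FFFF);
    }
}
```

If this assumption holds, all documents sharing a prefix share the upper 16 bits of the hash, which is why the format of the ID passed to the hash matters. The result should be checked against what the cluster actually does for a few known documents before relying on it.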

Re: Reindex single shard on solr

2018-12-14 Thread Erick Erickson
Why do you need to create a collection? That's probably just there in the test code to have something to test against. WARNING: I haven't verified this, but it should be something like the following. What you need is the hash range for the shard (slice) you're trying to update, then send each doc
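The range check described here could be sketched roughly as follows. This is a minimal, unverified illustration in plain Java with no Solr dependency. It assumes the shard's hash range is available as the hex string shown in the collection's cluster state (for example "80000000-ffffffff"), with both bounds inclusive and interpreted as signed 32-bit integers. The names ShardRangeSketch, Range, parseRange and includes are hypothetical.

```java
// Hypothetical sketch, not Solr code: parse a shard's hex hash range and test membership.
final class ShardRangeSketch {

    /** Inclusive [min, max] range of signed 32-bit hash values. */
    static final class Range {
        final int min;
        final int max;

        Range(int min, int max) {
            this.min = min;
            this.max = max;
        }

        /** True if the hash falls inside this shard's range. */
        boolean includes(int hash) {
            return hash >= min && hash <= max;
        }
    }

    /** Parses a range such as "80000000-ffffffff" into signed int bounds. */
    static Range parseRange(String text) {
        int dash = text.indexOf('-');
        if (dash < 0) {
            throw new IllegalArgumentException("Expected <hexMin>-<hexMax>: " + text);
        }
        // Parse as long first: values like "ffffffff" overflow Integer.parseInt.
        int min = (int) Long.parseLong(text.substring(0, dash), 16);
        int max = (int) Long.parseLong(text.substring(dash + 1), 16);
        return new Range(min, max);
    }
}
```

With the range of the damaged shard parsed this way, each document's hash can be tested with includes(...) and only the matching documents sent for reindexing.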

Re: Reindex single shard on solr

2018-12-14 Thread Mahmoud Almokadem
Thanks Erick, I got it from TestHashPartitioner.java: https://github.com/apache/lucene-solr/blob/1d85cd783863f75cea133fb9c452302214165a4d/solr/core/src/test/org/apache/solr/cloud/TestHashPartitioner.java Here is some sample code: router = DocRouter.getDocRouter(CompositeIdRouter.NAME); int shardsCou

Re: Reindex single shard on solr

2018-12-14 Thread Mahmoud Almokadem
Thanks Erick. Do you know how to use this method, or do I need to dive into the code? I have document_id as a string uniqueKey and have 12 shards. On Fri, Dec 14, 2018 at 5:58 PM Erick Erickson wrote: > Sure. Of course you have to make sure you use the exact same hashing > algorithm on the . > > See

Re: Reindex single shard on solr

2018-12-14 Thread Erick Erickson
Sure. Of course you have to make sure you use the exact same hashing algorithm on the . See CompositeIdRouter.sliceHash Best, Erick On Fri, Dec 14, 2018 at 3:36 AM Mahmoud Almokadem wrote: > > Hello, > > I've a corruption on some of the shards on my collection and I've a full > dataset on my dat

Reindex single shard on solr

2018-12-14 Thread Mahmoud Almokadem
Hello, I have corruption on some of the shards in my collection and I have a full dataset in my database, and I'm using CompositeId for routing documents. Can I traverse the whole dataset and do something like hashing the document_id to identify that this document belongs to a specific shard to se