SPLITSHARD - data loss of child documents

2020-12-17 Thread Nussbaum, Ronen
Hi Everyone, We're using version 8.6.1 with nested documents. I used the SPLITSHARD API and after it finished successfully, I've noticed the following: 1. Most of child documents are missing - before the split: ~600M, after: 68M 2. Retrieving a document with its children, shows child docum

Re: SPLITSHARD - data loss of child documents

2020-12-17 Thread Mike Drob
I was under the impression that split shard doesn’t work with child documents, if that is missing from the ref guide we should update it On Thu, Dec 17, 2020 at 4:30 AM Nussbaum, Ronen wrote: > Hi Everyone, > > We're using version 8.6.1 with nested documents. > I used the SPLITSHARD API and afte

Re: DIH and UUIDProcessorFactory

2020-12-17 Thread Dmitri Maziuk
On 12/12/2020 4:36 PM, Shawn Heisey wrote: On 12/12/2020 2:30 PM, Dmitri Maziuk wrote: Right, ```Every update request received by Solr is run through a chain of plugins known as Update Request Processors, or URPs.``` The part I'm missing is whether DIH's 'name="/dataimport"' counts as an "Upda

Re: DIH and UUIDProcessorFactory

2020-12-17 Thread Alexandre Rafalovitch
Try with the explicit URP chain too. It may work as well. Regards, Alex. On Thu, 17 Dec 2020 at 16:51, Dmitri Maziuk wrote: > > On 12/12/2020 4:36 PM, Shawn Heisey wrote: > > On 12/12/2020 2:30 PM, Dmitri Maziuk wrote: > >> Right, ```Every update request received by Solr is run through a chai

Re: DIH and UUIDProcessorFactory

2020-12-17 Thread Dmitri Maziuk
On 12/17/2020 4:05 PM, Alexandre Rafalovitch wrote: Try with the explicit URP chain too. It may work as well. Actually in this case we're just making sure uniqueKey is in fact unique in all documents, so default is what we want. For this particular dataset I may at some future point look int