[jira] [Commented] (SOLR-14278) data loss during live shard split if leader dies

Yonik Seeley (Jira) Fri, 28 Feb 2020 14:35:42 -0800


    [ 
https://issues.apache.org/jira/browse/SOLR-14278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17048007#comment-17048007
 ]


Yonik Seeley commented on SOLR-14278:
-------------------------------------

I'm going to keep this issue open since I looped this test for another couple 
days and I do see a occasional failure of missing documents.
The failures happen only a few times a day on average... about .5% of the runs 
I think.

> data loss during live shard split if leader dies
> ------------------------------------------------
>
>                 Key: SOLR-14278
>                 URL: https://issues.apache.org/jira/browse/SOLR-14278
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Yonik Seeley
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> While trying to develop better tests for shared storage (SOLR-13101), I ran 
> across another failure for normal replica types as well (one of the first 
> things I do when a test fails for shared storage is to try and validate that 
> normal NRT replicas succeed.)  The PR I'll open has a test adapted from the 
> one in SOLR-13813 for master.
> Scenario:
>   - indexing is happening during shard split
>   - leader is killed shortly after (before the split has finished) and never 
> brought back up
>   - there are often some missing documents at the end
> While it's possible that the simulated killing of the node in the unit test 
> is imperfect, I haven't reproduced a failure if I comment out the split 
> command and just kill the leader.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14278) data loss during live shard split if leader dies

Reply via email to