morningman opened a new issue #4147:
URL: https://github.com/apache/incubator-doris/issues/4147


   **Describe the bug**
   When a BE is decommissioned, some tablets may end up with a large number of
   replicas generated by clone tasks.
   
   **Why**
   
   An example of how this happens:
   
   1. Tablet X has 3 replicas, on BEs A, B, and C.
   2. C is being decommissioned, so we choose Backend D as the destination for
      the relocated replica.
   3. After relocating, Tablet X has 4 replicas: A, B, C (decommissioned), and
      D (possibly version incomplete).
   4. D may be version incomplete because the clone task ran for a long time
      and a new version was published in the meantime.
   5. At the next tablet check, Tablet X's status is still REPLICA_RELOCATING.
      If we don't choose D as the dest BE for the new relocation, a new
      backend E will be chosen to store the new replica. This repeats on
      every check, so the number of replicas keeps increasing indefinitely.
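
The steps above can be mimicked with a small toy model. This is only an illustrative sketch, not Doris code: the function name `simulate_relocation`, the status strings, and the list of spare backends are all assumptions made for the example. It assumes every clone finishes after a new version has been published, so each new replica ends up version incomplete.

```python
def simulate_relocation(check_rounds):
    """Toy model of the bug: return the replica count after each tablet check."""
    # Hypothetical starting state: C is being decommissioned.
    replicas = {"A": "ok", "B": "ok", "C": "decommissioned"}
    # Hypothetical pool of backends that do not hold a replica of Tablet X yet.
    spare_backends = iter(["D", "E", "F", "G"])
    history = []
    for _ in range(check_rounds):
        # Buggy behavior: always pick a backend that has no replica yet,
        # ignoring the incomplete replica created by the previous round.
        dest = next(spare_backends)
        # Assumption: the clone is slow, a new version is published meanwhile,
        # so the new replica is left version incomplete.
        replicas[dest] = "version_incomplete"
        history.append(len(replicas))
    return history


print(simulate_relocation(3))  # [4, 5, 6]
```

Each round adds one more replica while the tablet never leaves the relocating state, which matches the growth described in step 5.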
   
   So a better solution is to select D as the dest BE again for the clone task.
   This may trigger an incremental clone task, which can finish faster.
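
A minimal sketch of that destination choice is shown below. It is not the actual Doris scheduler logic; the `Replica` class and `choose_dest_backend` function are hypothetical names introduced for illustration. The idea is to prefer a backend that already holds a version-incomplete replica (D in the example above) and to fall back to a fresh backend only when no such replica exists.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    # Hypothetical, simplified view of a replica for this example.
    backend: str
    decommissioned: bool = False
    version_complete: bool = True


def choose_dest_backend(replicas, candidate_backends):
    """Pick the clone destination, reusing an incomplete replica if one exists."""
    # Prefer a live backend that already holds an incomplete replica:
    # cloning there only needs to fetch the missing versions.
    for replica in replicas:
        if not replica.decommissioned and not replica.version_complete:
            return replica.backend
    # Otherwise fall back to the first candidate with no replica of this tablet.
    existing = {replica.backend for replica in replicas}
    for backend in candidate_backends:
        if backend not in existing:
            return backend
    return None


tablet_x = [
    Replica("A"),
    Replica("B"),
    Replica("C", decommissioned=True),
    Replica("D", version_complete=False),
]
print(choose_dest_backend(tablet_x, ["E", "D"]))  # D
```

With this choice the replica count stays at 4 across repeated checks instead of growing by one each time.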
   

