jtao15 opened a new issue #7413:
URL: https://github.com/apache/pinot/issues/7413


   There's a potential race condition issue when we run merge/roll-up task & 
segment upload tasks (e.g. purge, convertToRawIndex, etc) on the same table 
(purge task for example):
   1. Segment A is picked both by merge/roll-up and purge tasks
   2. Merge/roll-up task is finished, and the retention manager tries to clean 
up segment A and the lineage entry before purge task uploading the purged 
segment.
   3. Purge task uploaded the segment, and we have duplicate data for the table.
   
   There are two potential improvements:
   1. Honor segment lineage when scheduling purge tasks to avoid picking 
segments that will be removed.
   2. Existence check + ZK version check when uploading segments by purge tasks.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to