Umeshkumar9414 commented on PR #7075:
URL: https://github.com/apache/hbase/pull/7075#issuecomment-3061080761

   > As I asked above, if we do not change the mechanism that we will only 
schedule a new split worker while the previous one is complete or the region 
server is dead, what is the advantage of using different file names for split 
workers?
   
   @Apache9 In the wal splitting using recovered edits output sink, worker 
first writes the temporary edit files for each regions and then rename it to 
actual recovered edits. If a worker find a existing temporary edit file it 
deletes the temporary edit file and creates a new one. 
   In our case where a zombie worker and a new worker are splitting the wal at 
the same time, we don't have any issue if new worker deletes the temporary 
files but zombie worker can also delete the temporary files being used by new 
worker causing new worker to fail with FileNotFoundException. 
   
   If zombie worker is slow then a new worker can fail multiple times, blocking 
the ServerCrashProcedure for extended period of time, thus reducing 
avaialability. 
   
   Using differenet name ensures that both are not interferring with each other.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@hbase.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to