ndimiduk commented on PR #6370:
URL: https://github.com/apache/hbase/pull/6370#issuecomment-2446258976

   > When we go to restore this cluster, it'll pull the full backup and the 2 
   > incrementals. The full backup will get restored first, so the new table will 
   > have regions A, B, C.
   
   Targeting the original split boundaries means making more work for the 
region servers as the regions load up and then hit their split thresholds. 
Actually, what happens if the region partition points change while the restore 
is happening? Say you restore the full backup, and the first incremental. The 
regions get too big and split. Now the second incremental is being restored, 
but the region partitions have changed.
   
   Instead of targeting the old region partitions, which we know have changed 
(e.g., due to increased data volume), what if the restore process targets the 
region partitions of the most recent incremental backup -- G, H, I, J, K, L, 
from your example? I guess the splitting process is really challenging and slow?
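
To make the trade-off concrete, here is a rough sketch of the bookkeeping, not anything taken from the HBase code. It assumes row keys are plain strings (HBase compares byte arrays) and uses made-up split keys: `h`, `p` for the three regions of the full backup and `d`, `h`, `l`, `p`, `t` for the six regions of the latest incremental. The class and method names are hypothetical. It counts how many target regions a single backed-up file would touch under each choice of boundaries.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

class RestoreSplitSketch {
    /** Index of the region that would hold rowKey, given sorted split keys. */
    static int regionIndexFor(List<String> splitKeys, String rowKey) {
        int pos = Collections.binarySearch(splitKeys, rowKey);
        // A row equal to a split key belongs to the region starting at that key.
        return pos >= 0 ? pos + 1 : -(pos + 1);
    }

    /** Number of target regions touched by a file covering [firstKey, lastKey]. */
    static int regionsSpanned(List<String> splitKeys, String firstKey, String lastKey) {
        return regionIndexFor(splitKeys, lastKey) - regionIndexFor(splitKeys, firstKey) + 1;
    }

    public static void main(String[] args) {
        // Hypothetical boundaries: 3 regions at full-backup time, 6 at the latest incremental.
        List<String> fullBackupSplits = Arrays.asList("h", "p");
        List<String> latestSplits = Arrays.asList("d", "h", "l", "p", "t");

        // A file written by the first region of the full backup, covering rows a..g.
        System.out.println(regionsSpanned(fullBackupSplits, "a", "g")); // prints 1
        System.out.println(regionsSpanned(latestSplits, "a", "g"));     // prints 2
    }
}
```

Under the old boundaries the file lands in one region; under the latest boundaries it straddles two, so it would have to be split at load time. That per-file splitting is the cost I'm guessing at above.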

