[ https://issues.apache.org/jira/browse/HBASE-29310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tak-Lon (Stephen) Wu resolved HBASE-29310. ------------------------------------------ Hadoop Flags: Reviewed Resolution: Fixed > Handle Bulk Load Operations in Continuous Backup and PITR Workflow > ------------------------------------------------------------------ > > Key: HBASE-29310 > URL: https://issues.apache.org/jira/browse/HBASE-29310 > Project: HBase > Issue Type: Task > Components: backup&restore > Reporter: Vinayak Hegde > Assignee: asolomon > Priority: Major > Labels: HBASE-28957, pull-request-available > Fix For: HBASE-28957 > > > When bulk load operations are performed, the resulting files are backed up to > the backup location via the continuous backup process. > However, during Point-In-Time Recovery (PITR), restoring these bulk-loaded > files efficiently can be challenging. > To address this, we propose the following guidelines for handling bulk loads > in the context of continuous backup and PITR: > * When a user performs a bulk load on any table under continuous backup and > PITR, they *must take a full or incremental backup* afterward. An incremental > backup is generally sufficient and faster. > *Required Changes* > # {*}Documentation Update{*}: > ## Add a note in the HBase Backup and Restore documentation explaining this > rule and its importance. > # {*}Logging Suggestion{*}: > ## After a bulk load operation completes, log a message suggesting the user > perform a full or incremental backup. > # {*}PITR Enhancements{*}: > * During PITR, check if any bulk load operation occurred after the last > successful backup. > * If no such backup exists, inform the user and fail the process. > * -If the user chooses to proceed (e.g., using a {{{}{}}}{{{}force{}}} > flag), continue with a warning that the bulk-loaded files will not be part of > the restored table.- -- This message was sent by Atlassian Jira (v8.20.10#820010)