[
https://issues.apache.org/jira/browse/HBASE-29220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vinayak Hegde reassigned HBASE-29220:
-------------------------------------
Assignee: Vinayak Hegde
> Track the Age/Timestamp of the Last Successfully Backed-Up WAL Entry in
> Continuous Backup Replication Endpoint
> --------------------------------------------------------------------------------------------------------------
>
> Key: HBASE-29220
> URL: https://issues.apache.org/jira/browse/HBASE-29220
> Project: HBase
> Issue Type: Task
> Components: backup&restore
> Reporter: Vinayak Hegde
> Assignee: Vinayak Hegde
> Priority: Major
>
> We use HBase’s replication framework for Continuous Backup through
> {{{}ContinuousBackupReplicationEndpoint{}}}. This replicates WAL entries to
> the backup location, which are then used for Point-In-Time Recovery (PITR)
> and Incremental Backup (an optimization technique that collects WALs and
> generates HFiles for faster recovery).
> However, the {{ReplicationEndpoint}} can lag behind in time.
> For example, if replication is one hour behind,
> {{ContinuousBackupReplicationEndpoint}} will currently be writing WAL entries
> that are one hour old. This means that if a user requests a PITR for the
> current time or attempts an incremental backup, they will miss that one hour
> of data.
> To prevent this, we need to ensure that users can only request data that has
> been fully backed up. Therefore, we must track the timestamp of the last
> successfully backed-up WAL entry:
> * For PITR: Users should only be allowed to restore to a point before this
> timestamp.
> * For Incremental Backup: The incremental backup process should store this
> timestamp as the backup time to maintain data consistency.
> This ensures data integrity and prevents users from requesting backups that
> include unprocessed WAL entries.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)