anthonysgro commented on issue #6619:
URL: https://github.com/apache/iceberg/issues/6619#issuecomment-1404528549

   Yes. So here is specifically how it happens:
   
   Creating my table:
   I create my table through an Athena query
   ```
   CREATE TABLE IF NOT EXISTS db.friends (
       id string,
       name string,
       birthday timestamp
   )
   LOCATION s3://iceberg-datalake
   TBLPROPERTIES (
      'table_type' = 'ICEBERG'
   )
   ```
   
   We can insert a couple rows into this table, yada yada.
   
   Then comes AWS Backup. It is a service that is natively integrated into S3. 
I went to the AWS Backup console, navigated to **Protected Resources**, and 
picked my iceberg datalake bucket, and created a recovery point. It takes about 
an hour or so for it to get backed up. 
   
   Once a recovery point becomes available, I deleted my entire 
s3://iceberg-datalake bucket. If I query my iceberg table through Athena, it 
throws an error (as expected, the data is deleted!). But no worries, I restored 
the bucket with a click of a button in AWS Backup, and voila, it recreates 
after a bit of time. I can once again query my iceberg table, as if nothing 
happened. Perfect.
   
   However, some cases where this doesn't work:
   If I run:
   ```
   DROP TABLE db.friends
   ```
   it deletes the table and the S3 data, as expected. AWS tells me iceberg 
tables do that, for some reason. But then, if I recreate the table using the 
above command, and then restore my backup, it throws a strange error, and it 
can't seem to get back to normal.
   
   Is there something with a metadata pointer or something if this occurs? I 
haven't been able to find a way to restore iceberg table data after a DROP 
TABLE command. I would absolutely be interested in hearing more if this S3 
access point integration helps prevent agains this!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to