amogh-jahagirdar commented on issue #9388:
URL: https://github.com/apache/iceberg/issues/9388#issuecomment-1894876445

   > For instance the folder for time_day=2023-12-27 remained so I checked for 
its contents:
   
   >     from the GCP UI there are no rows to display
   >     from the CloudShell interface I have 2 hidden empty objects, each of them 
with size 0 bytes and unnamed, with a creation timestamp equal to the execution 
time of the expire_snapshots procedure that deleted the files in the folder for 
the day (gcloud storage ls -L command).
   
   > Could they be some sort of "leftovers" left behind from the expire_snapshots 
procedure? Or maybe some kind of "soft_deletion" that my organization has in 
place in GCS?
   > If you have any suggestions, they would be really appreciated; in the 
meantime I'll dig deeper into it.
   
   
   Hmm, I don't think there should be "leftovers" in the sense that the "folder" 
remains even after all the objects in it are deleted. I'm less familiar with GCS 
to be honest, so it's probably just a lack of knowledge on my part. I haven't 
seen anything like that in S3. 
   
   > Or maybe some kind of "soft_deletion" that my organization has in place in 
GCS?
   
   I'm guessing that by soft deletion you mean you have versioning enabled, so 
deletes on the objects can be undone by reverting to previous versions. Maybe? 
Again, it's probably just my lack of knowledge of GCS. It may be something 
specific to GCS, but I don't recall seeing anything like that in S3.
   
   Either way, I don't think this is an Iceberg issue, since the relevant files 
are in fact being removed as part of the expire_snapshots procedure. It's just 
surfacing in an unexpected way in GCS. 
   
   I'll close this issue, but if your investigation still suggests that Iceberg 
is doing something wrong, please feel free to reopen it.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

