alessandro-nori opened a new issue, #13528:
URL: https://github.com/apache/iceberg/issues/13528

   ### Feature Request / Improvement
   
   Currently, S3InputStream throws a NotFoundException when getObject is called 
on a location that doesn’t exist. Because BaseMetastoreTableOperations stops 
retrying on that exception, a definitively missing location fails immediately 
instead of being retried.
   
   However, GCSInputStream and AdlsInputStream do not throw the same exception 
for missing locations. As a result, BaseMetastoreTableOperations continues to 
retry the operation multiple times, even though the location will never exist, 
leading to wasted resources and latency.
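
   To illustrate why the exception type matters, here is a minimal sketch of a 
retry loop that gives up as soon as it sees a NotFoundException. Everything in 
it is a simplified stand-in: `RetrySketch`, `attemptsUntilGiveUp`, and the 
nested `NotFoundException` are hypothetical names for illustration, not the 
actual Iceberg retry code, and real code would also back off between attempts.

```java
class RetrySketch {
  /** Hypothetical stand-in for Iceberg's NotFoundException. */
  static class NotFoundException extends RuntimeException {
    NotFoundException(String message) {
      super(message);
    }
  }

  /**
   * Runs the task up to maxAttempts times and returns how many attempts were made.
   * A NotFoundException is treated as definitive and ends the loop immediately;
   * any other RuntimeException is treated as transient and retried.
   */
  static int attemptsUntilGiveUp(Runnable task, int maxAttempts) {
    int attempts = 0;
    while (attempts < maxAttempts) {
      attempts++;
      try {
        task.run();
        return attempts; // success
      } catch (NotFoundException e) {
        return attempts; // definitive failure: retrying cannot help
      } catch (RuntimeException e) {
        // assumed transient: fall through and try again
      }
    }
    return attempts;
  }
}
```

   With this loop, a task that throws NotFoundException is attempted once, 
while a task that reports the same missing object through a generic exception 
uses up every attempt.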
   
   ## Expected behavior
   
   All InputStream implementations (S3InputStream, GCSInputStream, 
AdlsInputStream, etc.) should consistently throw a NotFoundException (or 
equivalent) when accessing a non-existent object location. This allows 
downstream logic (like BaseMetastoreTableOperations) to fail fast and avoid 
unnecessary retries.
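
   One possible shape for this, sketched under assumptions: wrap the call 
that opens the object and translate a "not found" storage error into 
NotFoundException, rethrowing everything else unchanged. The names below 
(`NotFoundTranslationSketch`, `StorageException`, `openOrThrowNotFound`) are 
hypothetical stand-ins and not the GCS or ADLS SDK types; each real stream 
would check its own SDK's exception and status code.

```java
import java.util.function.Supplier;

class NotFoundTranslationSketch {
  /** Hypothetical stand-in for Iceberg's NotFoundException. */
  static class NotFoundException extends RuntimeException {
    NotFoundException(String message, Throwable cause) {
      super(message, cause);
    }
  }

  /** Hypothetical stand-in for a cloud SDK exception that carries an HTTP status. */
  static class StorageException extends RuntimeException {
    final int statusCode;

    StorageException(int statusCode, String message) {
      super(message);
      this.statusCode = statusCode;
    }
  }

  /**
   * Runs the open call and converts a 404 storage error into NotFoundException.
   * Other storage errors are rethrown as-is so callers can still retry them.
   */
  static <T> T openOrThrowNotFound(String location, Supplier<T> open) {
    try {
      return open.get();
    } catch (StorageException e) {
      if (e.statusCode == 404) {
        throw new NotFoundException("Location does not exist: " + location, e);
      }
      throw e;
    }
  }
}
```

   Keeping the original exception as the cause preserves the SDK's error 
details for debugging while still giving callers a single exception type to 
stop retrying on.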
   
   ### Query engine
   
   None
   
   ### Willingness to contribute
   
   - [ ] I can contribute this improvement/feature independently
   - [x] I would be willing to contribute this improvement/feature with 
guidance from the Iceberg community
   - [ ] I cannot contribute this improvement/feature at this time


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

