Re: [I] CachingCatalog does not close FileIO on cache eviction, causing S3FileIO / SDK v2 thread leak in long-running applications [iceberg]

via GitHub Mon, 20 Apr 2026 02:22:29 -0700


boaz-gold commented on issue #15898:
URL: https://github.com/apache/iceberg/issues/15898#issuecomment-4279356253


   @manuzhang - I added production stats to the PR.
    
   Responding on @anoopj two alternatives:                                      
                                                                                
        
                     
   The explicit cleanup method won't help for long-running Thrift servers — 
there's no point in the lifecycle where a cleanup hook is called between table 
evictions. The leak still accumulates.   
                                                                                
                                                                                
                      
   Reference counting inside CachingCatalog sounds like the right approach. 
   The catalog already owns the cache, so it can map each FileIO instance to a 
ref count, increment on load, decrement on eviction, and call close() only when 
the count hits zero. 
   That correctly handles the shared FileIO case without leaking anything.
   
   wdyt ?
                                                                                
                                                                                
                      
   Happy to test any implementation against a live cluster that reproduces the 
crash deterministically within 24 hours. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [I] CachingCatalog does not close FileIO on cache eviction, causing S3FileIO / SDK v2 thread leak in long-running applications [iceberg]

Reply via email to