Re: [PR] Core: add support to add custom schemes via properties in ResolvingFileIO [iceberg]

via GitHub Fri, 08 Mar 2024 22:56:07 -0800


danielcweeks commented on PR #9884:
URL: https://github.com/apache/iceberg/pull/9884#issuecomment-1986761215


   > I don't understand why we have s3 and s3a in ResolvingFileIO, actually why 
do we have ResolvingFileIO ?
   
   This is a good question and somewhat gets to the point.  ResolvingFileIO was 
a way for users to be able to avoid having to lookup and set the FileIO 
implementation manually.  The reason for the different schemes (e.g. `s3a`) is 
that people transitioning from Hive/Hadoop frequently used those legacy 
prefixes and ran into problems with mixing the more common `s3` scheme.  
Eventually this replaced the HadoopFileIO as the recommended FileIO since the 
native paths were more performant and tailored to Iceberg.
   
   So, if you're already using a custom FileIO, you need to set it one way or 
another.  Now we're creating a second way to set it, and you have to set the 
scheme, which is probably already handled by the custom FileIO.  (See the Dell 
EcsFileIO as an example).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Re: [PR] Core: add support to add custom schemes via properties in ResolvingFileIO [iceberg]

Reply via email to