mrcnc commented on PR #11294:
URL: https://github.com/apache/iceberg/pull/11294#issuecomment-2420675717

   > Hi @mrcnc, @RussellSpitzer . To confirm this PR solution to 10127: any 
existing (or new) iceberg tables stored in azure with wasbs + .blob. will be 
interpreted interchangeably with abfss prefix + .dfs. endpoint, regardless of 
whether the table's existing metadata displays wasbs + .blob. vs the new abfss 
+ dfs?
   
   > That being said, does this mean iceberg api supports tables with possibly 
mixed schemes? (i.e. some paths in a table's metadata were written with 'wasbs' 
while others were with 'abfss')?
   
   The path for new files in existing tables is determined by the table's base 
location or 
[LocationProvider](https://iceberg.apache.org/javadoc/latest/org/apache/iceberg/io/LocationProvider.html)
 so tables that have paths using the `wasbs` scheme would still use the same 
scheme for new files.  For new tables, the location will default to the default 
location for the warehouse or namespace but each table can explicitly provide a 
location when it is created.  It seems possible that there could be mixed 
schemes (for example you changed the implementation of a custom 
LocationProvider) so I think it's worthwhile to understand if mixed paths are 
supported but I'm not sure currently.  The intent of this PR was to use the 
ADLSFileIO for wasbs paths by default, instead of delegating these to 
HadoopFileIO as a fallback


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to