tustvold commented on issue #172:
URL: https://github.com/apache/iceberg-rust/issues/172#issuecomment-1911697303

   Thank you both for the responses.
   
   > In iceberg's design, all file ios are hidden under the FileIO interface, 
and the backends, i.e. OpenDAL or object_store are not directly exposed to 
user, so I think we can integrate it without any breaking changes.
   
   Glad to here efforts are being made to keep the IO primitives abstracted and 
pluggable 👍. I would just observe that FileIO appears to mirror filesystem 
APIs, and that this has historically been a pain point in systems that chose 
this path. For example Spark has had a very hard time getting a performant S3 
integration, with proper vectored IO only being added to OSS Spark very 
recently. By contrast the object_store APIs mirror those of the actual stores, 
and are designed to work well with the APIs in arrow-rs, avoiding all the 
complexities of prefetching heuristics and similar.
   
   > discussing replacements
   
   I entirely agree, I guess I was more suggesting that the IO abstraction 
mirror object_store as this is what both the upstream crates use and expect, 
and what the underlying stores provide. If people then wanted additional 
backend support they could plug OpenDAL into this interface?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to