Re: [I] discuss: Split iceberg-rust high level API into a mini engine instead [iceberg-rust]

via GitHub Mon, 03 Mar 2025 03:30:59 -0800


liurenjie1024 commented on issue #1036:
URL: https://github.com/apache/iceberg-rust/issues/1036#issuecomment-2694070892

Thanks @Xuanwo for raising this, I think in general it's a good idea to have
a standalone module for simple data processing, just like what we already have
in
[java](https://github.com/apache/iceberg/blob/e230f5d79d82a50439029db5c73f8b59497b2e9f/data),
e.g. an `iceberg-data` module.

That's to say we have following crates:
```
iceberg
/ \
iceberg-data iceberg-datafusion
```

The `iceberg` crate is similar to the `iceber-core` + `iceberg-api` module
in java, which contains api for manipulating metadata, and it's supposed to be
compute engine independent. Also it's used for planning scan tasks for compute
engines.

The `iceberg-data` crate contains necessary implementation for executing
table scan in local machine, and doesn't depend on any execution engine. We may
add feature of appending data to table, but row level modifications seem
somehow challenging at this time.

The `iceberg-datafusion` crate contains integration with datafusion, which
allows users to execute sql against iceberg, and of course powered by
datafusion.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Re: [I] discuss: Split iceberg-rust high level API into a mini engine instead [iceberg-rust]

Reply via email to