singhpk234 commented on PR #9830:
URL: https://github.com/apache/iceberg/pull/9830#issuecomment-2091715487

   > For (2): We have not discussed incremental refresh plans in the Iceberg 
community, but [there is some relevant work 
here](https://www.slideshare.net/walaa_eldin_moustafa/incremental-view-maintenance-with-coral-dbt-and-iceberg).
 You can review some of the test cases 
[here](https://github.com/linkedin/coral/blob/0d5dd3f300f48e48cd2404a49dbb799d7f4ce190/coral-incremental/src/test/java/com/linkedin/coral/incremental/RelToIncrementalSqlConverterTest.java#L28).
   
   @wmoustafa, Read this today, was wondering if there is something we can 
utilize from CDC (considering iceberg has support for that) perspective ? how 
expensive the refreshes of a PB size tables are and what is the ideal frequency 
of updates in this model, if you can share some datapoints ? rewrite to get 
incremental refresh by computing deltas between the snapshots and then joining 
it with other deltas and having union of those does seems user-friendly though 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to