mattmartin14 commented on PR #1534: URL: https://github.com/apache/iceberg-python/pull/1534#issuecomment-2634290851
@Fokko - as an FYI, i do have working code to just use pyarrow to identify rows to insert; for rows to update, its a little more tricky; would you like me to replace the existing data fusion function on the rows_to_insert part with my pyarrow code? FYI, this is the code I could implement: ```python def get_rows_not_in_target(source_table: pa.Table, target_table: pa.Table, join_cols: list) -> pa.Table: return source_table.join(target_table, keys=join_cols, join_type='left anti') ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org