kevinjqliu commented on issue #1201: URL: https://github.com/apache/iceberg-python/issues/1201#issuecomment-2460351486
hey @francocalvo the `MERGE` operation is not yet support (https://github.com/apache/iceberg-python/issues/402) For write, pyiceberg currently supports `append` and `overwrite`. I think `overwrite` + `overwrite_filter` gets you close to the `MERGE` use case. > but I can't really use that with a DataFrame, I'd have to pass it as a string, right? The writes work with pyarrow tables and dataframe. Im don't think you need to pass as string > And in that case, a IN clause with thousands of IDs would deteriorate performance It depends on the exact logic. But we do some optimizations such as filter pushdowns to speed up reads and writes -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org