Re: [I] PyIceberg Cookbook [iceberg-python]

via GitHub Wed, 06 Nov 2024 09:16:38 -0800


kevinjqliu commented on issue #1201:
URL: 
https://github.com/apache/iceberg-python/issues/1201#issuecomment-2460351486


   hey @francocalvo 
   the `MERGE` operation is not yet supported 
(https://github.com/apache/iceberg-python/issues/402)
   For writes, pyiceberg currently supports `append` and `overwrite`. I think 
`overwrite` + `overwrite_filter` gets you close to the `MERGE` use case. 
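
   A minimal sketch of that `overwrite` + `overwrite_filter` pattern, under some 
assumptions: `table` is a PyIceberg table already loaded from a catalog, 
`new_rows` is a PyArrow table holding the replacement rows, and the key column 
is an integer `id`. The helper names `build_in_filter` and 
`replace_rows_by_key` are hypothetical, not part of PyIceberg. The filter is 
passed as a string here; an expression object such as `In` from 
`pyiceberg.expressions` should work as well.

```python
from typing import Any, Iterable


def build_in_filter(column: str, ids: Iterable[int]) -> str:
    """Build a row-filter string such as "id IN (1, 2, 3)".

    Hypothetical helper: only integer keys are handled, so no quoting
    or escaping of literals is needed.
    """
    unique_ids = sorted(set(ids))
    if not unique_ids:
        raise ValueError("at least one id is required to build an IN filter")
    return f"{column} IN ({', '.join(str(i) for i in unique_ids)})"


def replace_rows_by_key(table: Any, new_rows: Any, key_column: str, ids: Iterable[int]) -> str:
    """Replace the rows whose key is in `ids` with `new_rows`.

    Assumes `table` exposes `overwrite(df, overwrite_filter=...)`, as a
    PyIceberg table does, and that `new_rows` is a PyArrow table whose
    schema matches the table.
    """
    row_filter = build_in_filter(key_column, ids)
    # Rows matching the filter are removed and `new_rows` is written in
    # their place, as part of one overwrite call.
    table.overwrite(new_rows, overwrite_filter=row_filter)
    return row_filter
```

   Note that this is not a true `MERGE`: every existing row that matches the 
filter is replaced, so an id that appears in the filter but not in `new_rows` 
ends up deleted rather than left untouched.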
   
   > but I can't really use that with a DataFrame, I'd have to pass it as a 
string, right? 
   Writes work with pyarrow tables and dataframes. I don't think you need 
to pass it as a string.
   
   > And in that case, an IN clause with thousands of IDs would deteriorate 
performance
   It depends on the exact logic, but we do apply some optimizations, such as 
filter pushdown, to speed up reads and writes.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

