mattmartin14 commented on code in PR #1534: URL: https://github.com/apache/iceberg-python/pull/1534#discussion_r1944794775
########## pyiceberg/table/__init__.py: ########## @@ -1064,6 +1067,78 @@ def name_mapping(self) -> Optional[NameMapping]: """Return the table's field-id NameMapping.""" return self.metadata.name_mapping() + @dataclass(frozen=True) + class UpsertResult: + """Summary the upsert operation""" + rows_updated: int = 0 + rows_inserted: int = 0 + info_msgs: Optional[str] = None + error_msgs: Optional[str] = None + + def upsert(self, df: pa.Table, join_cols: list + , when_matched_update_all: bool = True + , when_not_matched_insert_all: bool = True + ) -> UpsertResult: + """ + Shorthand API for performing an upsert to an iceberg table. Review Comment: thank you for the suggestion. i've updated the context as follows: ```python """ Shorthand API for performing an upsert to an iceberg table. Args: self: the target Iceberg table to execute the upsert on df: The input dataframe to upsert with the table's data. join_cols: The columns to join on. These are essentially analogous to primary keys when_matched_update_all: Bool indicating to update rows that are matched but require an update due to a value in a non-key column changing when_not_matched_insert_all: Bool indicating new rows to be inserted that do not match any existing rows in the table Example Use Cases: Case 1: Both Parameters = True (Full Upsert) Existing row found → Update it New row found → Insert it Case 2: when_matched_update_all = False, when_not_matched_insert_all = True Existing row found → Do nothing (no updates) New row found → Insert it Case 3: when_matched_update_all = True, when_not_matched_insert_all = False Existing row found → Update it New row found → Do nothing (no inserts) Case 4: Both Parameters = False (No Merge Effect) Existing row found → Do nothing New row found → Do nothing (Function effectively does nothing) Returns: a UpsertResult class (contains details of rows updated and inserted) """ ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org