asp437 opened a new issue, #8229: URL: https://github.com/apache/iceberg/issues/8229
### Query engine

Spark

### Question

I use a `MERGE INTO` query to update values in an Iceberg table via Spark, with a condition in the `WHEN MATCHED` clause. I want to collect metrics for such queries to track how many rows were updated, inserted, or deleted. Is there a way to get the number of rows affected by a `MERGE INTO` query?

I tried looking at the snapshot information, but it contains the number of rows in the affected Parquet files. These values can be much higher than the number of rows actually changed: for example, when a query updates half of the rows but, due to the storage configuration, all data files are rewritten. That is a useful metric too, but I also need the number of rows changed according to the query logic, not the physical representation.
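To make the gap between the two numbers concrete, here is a minimal toy model in plain Python. It is not Iceberg's implementation: `DataFile` and `copy_on_write_update` are hypothetical names, and the model only assumes that a copy-on-write update rewrites every data file containing at least one matched row. The summary keys mirror the snapshot summary fields `added-records` and `deleted-records`, with integer values used here for simplicity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    """Hypothetical stand-in for a data file: a name and the row ids it holds."""
    name: str
    row_ids: tuple


def copy_on_write_update(files, matched_ids):
    """Toy model (an assumption, not Iceberg code): every file containing a
    matched row is dropped and rewritten in full."""
    matched = set(matched_ids)
    rewritten = [f for f in files if matched.intersection(f.row_ids)]
    rows_in_rewritten = sum(len(f.row_ids) for f in rewritten)
    summary = {
        "deleted-data-files": len(rewritten),
        "added-data-files": len(rewritten),
        # Physical counts: all rows in the rewritten files, changed or not.
        "deleted-records": rows_in_rewritten,
        "added-records": rows_in_rewritten,
    }
    # Logical count: only the rows the MERGE condition actually matched.
    logically_updated = sum(len(matched.intersection(f.row_ids)) for f in rewritten)
    return summary, logically_updated


# Four files of 100 rows each; the update matches every even row id,
# so half of the rows change but every file is touched.
files = [
    DataFile(f"file-{i}.parquet", tuple(range(i * 100, (i + 1) * 100)))
    for i in range(4)
]
matched_ids = [r for r in range(400) if r % 2 == 0]

summary, updated = copy_on_write_update(files, matched_ids)
print(summary["added-records"], updated)
```

In this sketch the summary reports 400 added records while only 200 rows were logically updated, which is the mismatch described in the question.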
