karuppayya commented on code in PR #11606: URL: https://github.com/apache/iceberg/pull/11606#discussion_r1857170703
########## docs/docs/spark-procedures.md: ########## @@ -936,3 +936,40 @@ as an `UPDATE_AFTER` image, resulting in the following pre/post update images: |-----|--------|--------------| | 3 | Robert | UPDATE_BEFORE| | 3 | Dan | UPDATE_AFTER | + +## Table stats + +### `compute_table_stats` + +This procedure calculates the [Number of Distinct Values (NDV) statistics](https://iceberg.apache.org/puffin-spec/#apache-datasketches-theta-v1-blob-type) for a specific table. Review Comment: the spec is found at [`format/puffin-spec.md`](https://github.com/apache/iceberg/blob/main/format/puffin-spec.md). Do we want to reference a file outside the `docs` older? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org