pvary commented on PR #12629: URL: https://github.com/apache/iceberg/pull/12629#issuecomment-2866569257
Discussed this with @ajantha-bhat offline: - We think that the need to recompute the stats is very-very rare. Ajantha mentioned that maybe when the stats file was deleted accidentally then it could lead to corruption when the stats removal is needed. - Also he mentioned that using the java API the user could remove the stats already: ``` table.refresh(); UpdatePartitionStatistics update = table.updatePartitionStatistics(); table.snapshots().forEach(s -> update.removePartitionStatistics(s.snapshotId())); update.commit(); ``` - Based on this, if there are no immediate needs from the compute engines we could just omit the `computeAndWriteStatsFullRefresh` method. - We can always add the new method to the API if the need rises. What do you think @deniskuzZ, @gaborkaszab? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org