Fokko commented on issue #403: URL: https://github.com/apache/iceberg-python/issues/403#issuecomment-1941430839
The Iceberg metadata does not contain this information to optimize a distinct query :( What would help is a lazy implementation of an Arrow dataset should keep the memory footprint lower. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org