dhruv-pratap opened a new pull request, #6069: URL: https://github.com/apache/iceberg/pull/6069
Taking a first dig at TableScan Plan Files API. The implementation at present evaluates partition filters but does not evaluate row-level filters at the moment as it requires Projections. The API is exposed by the CLI tool and lists out all the data files that would need to be scanned in order to perform the scan plan. <img width="1912" alt="image" src="https://user-images.githubusercontent.com/11399509/198372507-7f4a9b8a-bd3d-44d8-a534-234bacb0bba0.png"> High-level changes done as part of this PR: 1. Add `new_scan()` API to the `Table` interface. 2. Add `TableScan` interface with just `plan_files` abstract method. 3. Add `FileScanTask` model that represents a scan task over the existing `DataFile` model. 4. Add `DataTableScan` class that extends `TableScan` and implements `plan_files` API to return a collection of `FileScanTasks` using the existing `_ManifestEvalVisitor` to perform partition pruning. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org