dhruv-pratap opened a new pull request, #6069:
URL: https://github.com/apache/iceberg/pull/6069

   Taking a first dig at TableScan Plan Files API. The implementation at 
present evaluates partition filters but does not evaluate row-level filters at 
the moment as it requires Projections.
   
   The API is exposed by the CLI tool and lists out all the data files that 
would need to be scanned in order to perform the scan plan.
   
   <img width="1912" alt="image" 
src="https://user-images.githubusercontent.com/11399509/198372507-7f4a9b8a-bd3d-44d8-a534-234bacb0bba0.png";>
   
   High-level changes done as part of this PR:
   1. Add `new_scan()` API to the `Table` interface.
   2. Add `TableScan` interface with just `plan_files` abstract method.
   3. Add `FileScanTask` model that represents a scan task over the existing 
`DataFile` model.
   4. Add `DataTableScan` class that extends `TableScan` and implements 
`plan_files` API to return a collection of `FileScanTasks` using the existing 
`_ManifestEvalVisitor` to perform partition pruning.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to