laskoviymishka commented on PR #1053: URL: https://github.com/apache/iceberg-go/pull/1053#issuecomment-4444904828
Refactored this. `readAllDeletionVectors` now returns `map[string]*dv.RoaringPositionBitmap`, so we no longer materialize DVs into `int64` positions. I also added a new `filterByDeletionVector` pipeline step. It builds a per-batch Arrow boolean keep-mask using `Contains`, then applies `compute.Filter`. The bitmap now threads through `recordBatchesFromTasksAndDeletes → recordsFromTask` as a separate map alongside `perFilePosDeletes`, so the two sources don’t need to share a representation. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
