danielcweeks commented on code in PR #14264:
URL: https://github.com/apache/iceberg/pull/14264#discussion_r2417187369


##########
core/src/main/java/org/apache/iceberg/BaseIncrementalChangelogScan.java:
##########
@@ -71,6 +80,12 @@ protected CloseableIterable<ChangelogScanTask> doPlanFiles(
             .filter(manifest -> 
changelogSnapshotIds.contains(manifest.snapshotId()))
             .toSet();
 
+    // Build delete file index for existing deletes (before the start snapshot)
+    DeleteFileIndex existingDeleteIndex = 
buildExistingDeleteIndex(fromSnapshotIdExclusive);

Review Comment:
   I'm a little confused as to why we need to build the delete index before the 
start index.  If a delete file exists prior to the start snapshot, it would 
only apply to data files that exist prior to selected scan range.  This feels 
like it would never apply to the scan range.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to