Nitin-Kashyap commented on code in PR #27784: URL: https://github.com/apache/doris/pull/27784#discussion_r1414212209
########## fe/fe-core/src/main/java/org/apache/doris/planner/external/FileQueryScanNode.java: ########## @@ -372,7 +386,11 @@ public void createScanRangeLocations() throws UserException { curLocations.getScanRange().getExtScanRange().getFileScanRange().addToRanges(rangeDesc); TScanRangeLocation location = new TScanRangeLocation(); Backend selectedBackend; - if (enableSqlCache) { + int bucketNum = 0; + if (isBucketedHiveTable) { + bucketNum = HiveBucketUtil.getBucketNumberFromPath(fileSplit.getPath().getName()).getAsInt(); + selectedBackend = backendPolicy.getNextConsistentBe(bucketNum); Review Comment: It is to ensure the left side goes to the same BE where the right side hash will send, given that Hive may replicate the same block on different remote nodes. I kept this thinking we can extend this to Colocate Join; by reading left and right buckets on same BE. (I'm prototyping this, shall share soon) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org