kaijchen commented on code in PR #32958:
URL: https://github.com/apache/doris/pull/32958#discussion_r1582610450

##########
fe/fe-common/src/main/java/org/apache/doris/common/Config.java:
##########
@@ -636,6 +636,12 @@ public class Config extends ConfigBase {
             varType = VariableAnnotation.EXPERIMENTAL)
     public static boolean enable_single_replica_load = false;
 
+    @ConfField(mutable = true, masterOnly = true, description = {
+            "对于 DUPLICATE KEY 表启用 shuffle 的最小 tablet 数量",

Review Comment:
   Enabling shuffle has both positive and negative effects.

   Shuffle must be enabled for `UNIQ` and `AGG` tables for data consistency reasons. For `DUP` tables, however, we can make a trade-off:

   - If shuffle is disabled, the load is faster but uses more memory.
   - If shuffle is enabled, the load is slower but uses less memory.

   A load into a table with more buckets is more likely to be memory constrained, so we enable shuffle only for `DUP` tables with at least a certain number of buckets.
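
   The rule described above can be sketched roughly as follows. This is only an illustration, not the code in this PR: the class `ShuffleDecision`, the local `KeysType` enum, and the parameter `minBucketsForDupShuffle` are hypothetical names, and the inclusive `>=` comparison is an assumption based on the config description (which reads, in English, "minimum number of tablets to enable shuffle for DUPLICATE KEY tables").

```java
// Hypothetical sketch of the shuffle decision; none of these names are Doris APIs.
final class ShuffleDecision {
    // Local stand-in for the table key models mentioned in the comment.
    enum KeysType { DUP_KEYS, UNIQUE_KEYS, AGG_KEYS }

    private ShuffleDecision() {}

    /**
     * Decides whether a load should shuffle rows before writing.
     *
     * @param keysType                key model of the target table
     * @param bucketCount             number of buckets (tablets) in the target table
     * @param minBucketsForDupShuffle hypothetical threshold, assumed to be inclusive
     */
    static boolean shouldShuffle(KeysType keysType, int bucketCount, int minBucketsForDupShuffle) {
        // UNIQ and AGG tables always shuffle, for data consistency.
        if (keysType != KeysType.DUP_KEYS) {
            return true;
        }
        // DUP tables trade load speed for memory: shuffle only once the
        // table has enough buckets that memory pressure becomes likely.
        return bucketCount >= minBucketsForDupShuffle;
    }
}
```

   With a threshold of 64, for example, a `DUP` table with 16 buckets would load without shuffle (faster, more memory), while one with 128 buckets would shuffle (slower, less memory).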