kaijchen commented on code in PR #32958:
URL: https://github.com/apache/doris/pull/32958#discussion_r1582610450


##########
fe/fe-common/src/main/java/org/apache/doris/common/Config.java:
##########
@@ -636,6 +636,12 @@ public class Config extends ConfigBase {
             varType = VariableAnnotation.EXPERIMENTAL)
     public static boolean enable_single_replica_load = false;
 
+    @ConfField(mutable = true, masterOnly = true, description = {
+            "对于 DUPLICATE KEY 表启用 shuffle 的最小 tablet 数量",

Review Comment:
   Enabling shuffle has both benefits and costs.
   Shuffle must be enabled for `UNIQ` and `AGG` tables for data consistency reasons.
   For `DUP` tables, however, we can make a trade-off:
   if shuffle is disabled, the load is faster but uses more memory;
   if shuffle is enabled, the load is slower but uses less memory.
   
   Loading into a table with more buckets is more likely to be memory constrained,
   so we enable shuffle only for tables with at least a certain number of buckets.
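
To make the rule concrete, here is a minimal sketch of the decision described above. It is not the code in this PR: `ShuffleDecision`, `KeysType`, `shouldShuffle`, and `minBucketsForDup` are hypothetical names, the threshold value is made up for illustration, and treating the threshold as inclusive is an assumption.

```java
// Hypothetical sketch; none of these names come from the Doris code base.
final class ShuffleDecision {
    /** Table key models mentioned in the review comment. */
    enum KeysType { DUP, UNIQ, AGG }

    /**
     * Decides whether a load should shuffle rows before writing.
     *
     * @param keysType         key model of the target table
     * @param bucketCount      number of buckets in the target table
     * @param minBucketsForDup assumed threshold for DUP tables
     *                         (hypothetical stand-in for the new config value)
     */
    static boolean shouldShuffle(KeysType keysType, int bucketCount, int minBucketsForDup) {
        // UNIQ and AGG tables always shuffle, for data consistency.
        if (keysType != KeysType.DUP) {
            return true;
        }
        // DUP tables: accept a slower load in exchange for lower memory use
        // only when the table has many buckets.
        // Assumption: the threshold is inclusive.
        return bucketCount >= minBucketsForDup;
    }

    public static void main(String[] args) {
        int threshold = 64; // illustrative value, not a Doris default
        System.out.println(shouldShuffle(KeysType.UNIQ, 8, threshold));  // true
        System.out.println(shouldShuffle(KeysType.DUP, 8, threshold));   // false
        System.out.println(shouldShuffle(KeysType.DUP, 128, threshold)); // true
    }
}
```

With this shape, a small `DUP` table keeps the faster non-shuffled load, while a `DUP` table at or above the threshold pays the shuffle cost to stay within memory.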
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

