shounakmk219 commented on code in PR #16571:
URL: https://github.com/apache/pinot/pull/16571#discussion_r2269538877


##########
pinot-controller/src/main/java/org/apache/pinot/controller/helix/core/minion/PinotTaskManager.java:
##########
@@ -238,6 +238,15 @@ public Map<String, String> createTask(String taskType, 
String tableName, @Nullab
         LOGGER.warn("No ad-hoc task generated for task type: {}", taskType);
         continue;
       }
+      int maxNumberOfSubTasks = taskGenerator.getMaxNumSubTasks();

Review Comment:
   Agree with Manish on informing the user in some way that the system is 
throttling the task generation. But as the default limit is `Integer.MAX_VALUE` 
I am not that concerned though. 



##########
pinot-controller/src/main/java/org/apache/pinot/controller/helix/core/minion/PinotTaskManager.java:
##########
@@ -739,6 +748,21 @@ protected TaskSchedulingInfo 
scheduleTask(PinotTaskGenerator taskGenerator, List
         List<PinotTaskConfig> presentTaskConfig =
             minionInstanceTagToTaskConfigs.computeIfAbsent(minionInstanceTag, 
k -> new ArrayList<>());
         taskGenerator.generateTasks(List.of(tableConfig), presentTaskConfig);
+        int maxNumberOfSubTasks = taskGenerator.getMaxNumSubTasks();
+        // choose first maxNumberOfSubTasks tasks to schedule from 
presentTaskConfig
+        if (presentTaskConfig.size() > maxNumberOfSubTasks) {
+          LOGGER.warn("Number of tasks generated for task type: {} for table: 
{} is {}, which is greater than the "
+              + "maximum number of tasks to schedule: {}. Only the first {} 
tasks will be scheduled. This is controlled"
+                  + " by the cluster config maxAllowedSubTasks which is set 
based on controller's performance",
+              taskType, tableName, presentTaskConfig.size(), 
maxNumberOfSubTasks, maxNumberOfSubTasks);
+          presentTaskConfig = new ArrayList<>(presentTaskConfig.subList(0, 
maxNumberOfSubTasks));

Review Comment:
   We can go over `presentTaskConfig` and  replace 
`MinionConstants.TABLE_MAX_NUM_TASKS_KEY` with the `maxNumberOfSubTasks` value 
right?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to