t0mpere opened a new issue, #11282:
URL: https://github.com/apache/pinot/issues/11282

   Hello, I've tried to debug why scheduling `SegmentGenerationAndTarPush` 
Minion jobs take so long to schedule and I've narrowed it down the problem to 
this part of the code. 
   
   
https://github.com/apache/pinot/blob/78308da90debe5b2fc958750a9a11acc3b9a9e8e/pinot-controller/src/main/java/org/apache/pinot/controller/helix/core/minion/PinotHelixTaskResourceManager.java#L297-L309
   
   I'm currently use `POST /tasks/execute` API to schedule the job.
   The culprit seems to be the while loop waiting for the task to get a state. 
I'm not familiar on how helix handles this in the background. Do you think it 
would be possible to avoid looping on `synchronized getTaskState()` and maybe 
implement a callback to get the result of a job scheduling.
   This is a big deal for us since scheduling takes more than ingestion and 
doesn't allow to keep up with new data and scale. 
   It might also be a misconfiguration problem but in this case I will need 
your help to find it.
   
   Current configuration:
   GKE
   version 0.12.1
   GCS for deep storage
   3 ZK - 8 CPU and 18GB ram
   6 Servers - 16CPU and 32 64GB ram 1.45TB SSD
   2 Controllers - 16 CPU and 32GB ram
   2 Brokers - 5 CPU 16.25GB ram
   32 Minions - 2 CPU and 2GB of ram
   
   1M Segments 4TB of data
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to