MeihanLi opened a new issue, #11293:
URL: https://github.com/apache/pinot/issues/11293

   We recently observed that reloading segments,  even reloading a small set of 
segments one by one for a partial upsert table (~90 segment replicas and each 
replica is around 650MB) can cause a significant increase in CPU usage 
(~30-40%). 
   
   Server tenant:
   <img width="1480" alt="image" 
src="https://github.com/apache/pinot/assets/42751784/bdd139b6-7b2a-4be5-bfda-97a8129afe25";>
   
   We have looked into one server and even we only reloaded 11 segments during 
the time, the CPU usage is pretty high.
   Server (# of cores):
   <img width="677" alt="image" 
src="https://github.com/apache/pinot/assets/42751784/a8a16437-3172-4ffc-b038-0a20948d462a";>
   
   We are using default max.parallel.refresh.threads = 1 in 
HelixInstanceDataManagerConfig. It means there is no parallel during segment 
reloading. In other words, the server load mainly comes from refreshing a 
single segment. 
   
   Is there a way to improve the performance? This is blocking our schema 
evolution for such upsert tables and any suggestions or ideas are appreciated. 
Thanks!
   
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to