qq619618919 commented on PR #8026:
URL: https://github.com/apache/hadoop/pull/8026#issuecomment-3394100330

   ### Performance Verification in Production
   We tested this patch in a production YARN cluster and used Arthas to monitor 
RM node event handling performance via:
   ```bash
   monitor -c 5 
org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$NodeEventDispatcher
 handle
   ```
   
   ### Result:
   Before patch (with original YARN-11003 behavior): average NM heartbeat 
handling time ≈ 1.10 ms
   After patch (skip/caching getCapability() when Opportunistic containers 
disabled): average NM heartbeat handling time ≈ 0.09 ms
   This shows over 12× improvement in heartbeat event processing latency, 
reducing RM AsyncDispatcher thread load significantly and improving scheduling 
responsiveness in large clusters.
   
   ### Conclusion:
   The patch removes unnecessary getCapability() calls when the Opportunistic 
container feature is disabled, reducing CPU overhead and improving event queue 
turnover rate.
   This optimization has already proven effective in production with 
substantial gains in RM performance.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to