xiangfu0 commented on PR #14698:
URL: https://github.com/apache/pinot/pull/14698#issuecomment-4139694522

   Updated with commit fc14886275. The partitioned combine path now uses 
thread-local per-partition tables plus parallel final reduction, which removes 
the 8-thread initialization race and improves the partitioned path on the same 
local synthetic workload from 106.341 ms to 77.933 ms at 4 threads (~27% 
faster). On 8 threads the previous implementation reproduced the NPE; the 
current code completes at 59.218 ms, and in the current 3-way comparison the 
partitioned path is 54.288 ms vs 68.945 ms for non-blocking and 150.688 ms for 
default.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to