gortiz commented on PR #15844:
URL: https://github.com/apache/pinot/pull/15844#issuecomment-2894788414

   > In distributed group-by queries that apply a LIMIT without an accompanying 
ORDER BY clause, each server arbitrarily selects a subset of groups, which can 
lead to non-deterministic behavior and inconsistent final aggregations when the 
total number of groups exceeds the limit; this randomness in group selection 
may omit important groups from the final result and compromise query accuracy, 
thereby necessitating a deterministic mechanism that ensures all non-ordering 
group keys are uniformly considered across all servers to guarantee the 
correctness and consistency of the aggregated output.
   
   Why is that an issue? The fact that there is no deterministic order is a 
feature in terms of performance, don't you think?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to