lnbest0707-uber commented on PR #12336:
URL: https://github.com/apache/pinot/pull/12336#issuecomment-1928495920

   > sounds good. so do we have
   > 
   > * 0 : OFFLINE
   > * 1 : ??
   > * expected - 1 : SEGMENTS_WITH_LESS_REPLICAS
   > * expected : HEALTHY
   > 
   > am I correct?
   
   Yes, correct, right now the second (1) case is not treated separately.
   
   > 1. how do we plan to avoid these alerts during say rebalance or server 
upgrade? (it will be one less during operation which might be a noise to 
monitoring system)
   The patch only produces metrics, and alert wise relies on how the admin 
monitoring the system. From my point of view, it could be based on the number 
and duration of SEGMENTS_WITH_LESS_REPLICAS.
   For example, to improve the node replacement operation. The key metrics is 
**how many** segments are running with SEGMENTS_WITH_LESS_REPLICAS for **how 
long**. And then in the real operation, if this metrics runs over the threshold 
we set (by experiment), it could indicate something unusual happened.
   Same for rebalancing and restart, if it goes over some threshold, then 
either something goes wrong or the existing workflow needs to be improved.
   > 2. do we want to address the naming of the "only 1 replica available, 
where configured to have more than 1" ?
   Right now, I do not have enough data to justify if it is critical to tell 
"only 1 replica available". In most of cases I saw, monitoring 
SEGMENTS_WITH_LESS_REPLICAS could uncover most of issues.
   > 3. what if the RF is set to two? do we honor "only 1 replica" or "1 less 
replica" or both?
   If replica is set to 2, then 1 == expected - 1, the 
SEGMENTS_WITH_LESS_REPLICAS in the patch could tell both. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to