J-HowHuang opened a new pull request, #15175:
URL: https://github.com/apache/pinot/pull/15175

   ## Description
   Disk utilization check was added to controller's periodic task in 
https://github.com/apache/pinot/pull/15008. This change was to pause data 
ingestion automatically.
   
   Server can also suffer from the lack of disk space while rebalanced. 
Therefore, there should also be sufficient observability of disk utilization 
under the context of rebalancing.
   
   ## Current plans
   
   - [ ] Add a pre-check item to check if disk utilization exceeds the safe 
threshold for each server
   - [x] Add disk utilization for each server into rebalance result summary
   - [ ] Estimate or compute the actual size net change of the segments 
migration for each server
   - [ ] Expose disk utilization threshold in configuration
   - [ ] Add flag to enable disk utilization in rebalance API (also need to 
check if controller has `controller.enable.resource.utilization.check` 
configured true)
   
   ## Issues and discussions
   
   - Disk utilization change is not obvious if multiple servers mount on the 
same disk
   - Is it reasonable to take the disk utilization values that are fetched by 
the controller's periodic task? 
       - If so, can we rely on the configurations (e.g. 
`controller.resource.utilization.checker.frequency`) or do we need to override 
some?
       - Alternative: make API requests to server instances while doing 
rebalance
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to