Jackie-Jiang commented on issue #11278:
URL: https://github.com/apache/pinot/issues/11278#issuecomment-1682820878

   > To be generic, Pinot does't perfectly support adding new server nodes to 
cater with organic data growth. When a new node join the tenants, which might 
be triggered manually or the horizontal scaling based on the signal of running 
out of disk space, the data rebalancing should also be auto triggered when the 
pinot-controller observe the new node is alive for a while (e.g. 30 mins). With 
this feature, the disk shortage issue can be mitigated regardless of the 
underlying infra (ec2, k8s) taht holds pinots. Otherwise, each company's DevOps 
team need to implement a post processing workflow to trigger the tenant 
rebalancing after spinning up a new node.
   
   We can add a controller periodic task to automatically rebalance all tables 
(rebalance is idempotent, so it is no-op if no instance is changed), but that 
need to be disabled by default. Automatically rebalance works for non 
performance critical use cases, but for performance sensitive use cases (e.g. 
user facing high throughput use case), rebalance will reduce the cluster 
capacity, so should be performed within certain maintenance window and 
triggered manually.
   
   > In addition, pool based segment assignment strategy requires to trigger an 
api in order to assign **new** segments onto new nodes, which is even worse 
than the balanced assignment strategy.
   
   This is intentional, and maybe we should do the same even for balanced 
strategy. For use cases that requires all segments for a partition to be 
colocated, assigning new segments to new nodes will break it. To solve this, we 
should use the approach above by automatically rebalancing it.
   
   > The broker auto rebalancing needs to be considered too. #10181
   
   Can we close #10181 and open a new ticket for it? The title is very 
confusing because instance should not be removed from IS when disabled


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to