Re: Rebalance a cassandra cluster

2017-09-14 Thread Anthony Grasso
As Kurt mentioned, you definitely need to pick a partition key that ensure data is uniformly distributed. If you want to want to redistribute the data in cluster and move tokens around, you could decommission the node with the tokens you want to redistribute and then bootstrap a new node into the

Re: Rebalance a cassandra cluster

2017-09-13 Thread kurt greaves
You should choose a partition key that enables you to have a uniform distribution of partitions amongst the nodes and refrain from having too many wide rows/a small number of wide partitions. If your tokens are already uniformly distributed, recalculating in order to achieve a better data load bala

Re: Rebalance a cassandra cluster

2017-09-13 Thread Akshit Jain
Suppose I have a cassandra cluster with the data that is skewed such that one node have 40% more data than other nodes.Since while creating the cassandra the tokens were distributed uniformly. Now to make the data uniform I have to recalculate the tokens and assign them to nodes in the cluster. The

Re: Rebalance a cassandra cluster

2017-09-12 Thread Hannu Kröger
Hi, you should make sure that token range is evenly distributed if you have a single token configured per node. You can use e.g. this tool to calculate tokens: https://www.geroba.com/cassandra/cassandra-token-calculator/ Also, make sure that none of the partitions in your data model are hotspots

RE: Rebalance a cassandra cluster

2017-09-12 Thread Harika Vangapelli -T (hvangape - AKRAYA INC at Cisco)
Check with nodetool repair. [http://wwwin.cisco.com/c/dam/cec/organizations/gmcc/services-tools/signaturetool/images/logo/logo_gradient.png] Harika Vangapelli Engineer - IT hvang...@cisco.com Tel: Cisco Systems, Inc. United States cisco.com [http://www.cisco.com