Even if the data were absolutely evenly distributed, that won't guarantee
that the hash values of the partition keys used in your client queries
won't collide a result in a hotspot.
Another possibility is that your data is not partitioned well at the
primary key level. Are you using clustering key
Hi,
Did you try to run the following on all your nodes and compare ?
du -sh /*whatever*/cassandra/data/*
Of course if you have unequal snapshots sizes remove them in the above
command (or directly remove them).
This should answer (barely) your question about an eventual even
distribution (/!\ h
Is there a way to find out how data is distributed within column family by each
node? Nodetool provides how data is distributed across nodes that only shows
all the data by node. We are seeing heavy load on one node and I suspect that
partitioning is not distributing data equally. But to prove t