Re: Data Distribution in Table/Column Family

2015-08-27 Thread Jack Krupansky
Even if the data were absolutely evenly distributed, that won't guarantee that the hash values of the partition keys used in your client queries won't collide a result in a hotspot. Another possibility is that your data is not partitioned well at the primary key level. Are you using clustering key

Re: Data Distribution in Table/Column Family

2015-08-27 Thread Alain RODRIGUEZ
Hi, Did you try to run the following on all your nodes and compare ? du -sh /*whatever*/cassandra/data/* Of course if you have unequal snapshots sizes remove them in the above command (or directly remove them). This should answer (barely) your question about an eventual even distribution (/!\ h

Data Distribution in Table/Column Family

2015-08-27 Thread Saladi Naidu
Is there a way to find out how data is distributed within column family by each node? Nodetool provides how data is distributed across nodes that only shows all the data by node. We are seeing heavy load on one node and I suspect that partitioning is not distributing data equally. But to prove t