Re: [Discuss] num_tokens default in Cassandra 4.0

2020-03-09 Thread Paulo Motta
Great investigation, good job guys! > Personally I would have liked to have seen even more iterations. While 14 run iterations gives an indication, the average of randomness is not what is important here. What concerns me is the consequence to imbalances as the cluster grows when you're very unluc

Re: [Discuss] num_tokens default in Cassandra 4.0

2020-03-09 Thread Jon Haddad
There's a lot going on here... hopefully I can respond to everything in a coherent manner. > Perhaps a simple way to avoid this is to update the random allocation algorithm to re-generate tokens when the ranges created do not have a good size distribution? Instead of using random tokens for the f

2020-03-09 4.0 Status

2020-03-09 Thread Jon Meredith
Link to JIRA board: https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=355&projectKey=CASSANDRA It's been a week of toiling on the tasks we need to ship a release -- fixing bugs and flaky tests. We've had 0 new ticket opened against 4.0 since the last status email (6d). https://issue