Re: Uncaught exception on thread CounterMutationStage

2017-07-27 Thread David Salz
Hi Jeff, thanks for the pointers! We upgraded to C* 3.11.0 now and the situation has improved a little bit, the node does not die completely any more, but the WriteTimeoutExceptions persists and still 'freeze' the node for a couple of minutes. > A single node with 20 cores and 256GB of RAM is p

Re: Uncaught exception on thread CounterMutationStage

2017-07-26 Thread Jeff Jirsa
On 2017-07-25 12:49 (-0700), David Salz wrote: > Hi, > > has anyone seen the following exception before? > > Context: > > * Cassandra 3.9, > > * single node (20 Cores / 256 GB RAM) > A single node with 20 cores and 256GB of RAM is probably not going to be the best choice - while it's a g

Uncaught exception on thread CounterMutationStage

2017-07-25 Thread David Salz
Hi, has anyone seen the following exception before? Context: * Cassandra 3.9, * single node (20 Cores / 256 GB RAM) * doing lots of counter mutations * Whenever this exception happens, CPU spikes, node becomes unresponsive for a few minutes. Eventually, the node will "die", i.e. become comple