Hello,

We recently deployed a cluster of 5 Cassandra nodes into production and ran into serious problems with periodic hiccups (an individual node goes down, high CPU, client connection timeouts). It was terrible with 0.5 (one hiccup every 5-10 minutes). Today we upgraded to 0.6.1, and it happens less frequently now (roughly once every 30 minutes), but it is still quite frustrating.

We use ReplicationFactor=3 for all column families. The 5 nodes sit behind haproxy, and the Java clients go through haproxy. The most obvious behavior is this: as soon as one node goes down, the number of connections between haproxy and the Cassandra nodes shoots up to 1000 (normally it is stable at 40, which should be trivial for Cassandra) and stays there for quite a while. Meanwhile the Java clients get all kinds of TimeoutException and keep retrying. Eventually we have to restart haproxy, and then things go back to normal.

Each node has a 5GB max heap, a powerful enough CPU (quad-core), and a software RAID mirror. We are definitely NOT putting much load on the cluster yet, mostly 20-50 concurrent requests to Cassandra, but it is not holding up! Please help; we are on the verge of giving up on Cassandra after 5 days of periodic "outages".

A couple of observations:

- cfstats shows significant read latency on the "system" keyspace, more than 5s (see below)

- RecentReadLatencyMicros and RecentWriteLatencyMicros are very high for StorageProxy, as well as for every column family in JMX: up to 152676.92 and 6950 (they are in ms, right?). In cfstats, however, the latencies are quite small.

- Every second we see 5-10 DigestMismatchException entries in the log:

INFO [pool-1-thread-15857] 2010-04-22 00:37:37,887 StorageProxy.java (line 499) DigestMismatchException: Mismatch for key 1068022523 (d41d8cd98f00b204e9800998ecf8427e vs 0dd4cdaeeb1a334ae133c6955e109629)
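
The two numeric observations above can be sanity-checked with a short Python sketch. It assumes the JMX attributes are reported in microseconds, as the "Micros" suffix in their names suggests, and that the two values quoted are the read and write figures respectively. The helper name micros_to_millis is made up for illustration.

```python
import hashlib


def micros_to_millis(micros: float) -> float:
    """Convert a latency in microseconds to milliseconds."""
    return micros / 1000.0


# JMX values quoted above, assumed to be in microseconds.
recent_read_ms = micros_to_millis(152676.92)
recent_write_ms = micros_to_millis(6950)
print(f"{recent_read_ms:.2f} ms, {recent_write_ms:.2f} ms")  # 152.68 ms, 6.95 ms

# The first digest in the log line, compared with the MD5 of empty input.
logged_digest = "d41d8cd98f00b204e9800998ecf8427e"
empty_digest = hashlib.md5(b"").hexdigest()
print(empty_digest == logged_digest)  # True
```

Under that assumption the JMX figures are roughly 153 ms and 7 ms, not seconds. The first digest in the log line equals the MD5 of empty input, which may mean that one replica returned nothing for that key; that is an inference, not something the log itself states.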

Please advise. Thank you!

Snippets of storage-conf, cfstats, and tpstats are listed below:

<Storage>

  <AutoBootstrap>false</AutoBootstrap>
  <Keyspaces>
      <Keyspace Name="Pandora">
          <ReplicaPlacementStrategy>org.apache.cassandra.locator.RackUnawareStrategy</ReplicaPlacementStrategy>
          <ReplicationFactor>3</ReplicationFactor>
          <EndPointSnitch>org.apache.cassandra.locator.EndPointSnitch</EndPointSnitch>
      </Keyspace>
  </Keyspaces>


  <RpcTimeoutInMillis>10000</RpcTimeoutInMillis>
  <CommitLogRotationThresholdInMB>128</CommitLogRotationThresholdInMB>

  <DiskAccessMode>auto</DiskAccessMode>
  <RowWarningThresholdInMB>512</RowWarningThresholdInMB>
  <SlicedBufferSizeInKB>64</SlicedBufferSizeInKB>
  <FlushDataBufferSizeInMB>32</FlushDataBufferSizeInMB>
  <FlushIndexBufferSizeInMB>8</FlushIndexBufferSizeInMB>
  <ColumnIndexSizeInKB>64</ColumnIndexSizeInKB>
  <MemtableThroughputInMB>64</MemtableThroughputInMB>
  <BinaryMemtableThroughputInMB>256</BinaryMemtableThroughputInMB>
  <MemtableOperationsInMillions>0.3</MemtableOperationsInMillions>
  <MemtableFlushAfterMinutes>60</MemtableFlushAfterMinutes>

  <ConcurrentReads>8</ConcurrentReads>
  <ConcurrentWrites>32</ConcurrentWrites>

  <CommitLogSync>periodic</CommitLogSync>
  <CommitLogSyncPeriodInMS>10000</CommitLogSyncPeriodInMS>
  <!-- <CommitLogSyncBatchWindowInMS>1</CommitLogSyncBatchWindowInMS> -->

  <GCGraceSeconds>864000</GCGraceSeconds>
</Storage>
r...@cdb-006:/glass/sfw/cassandra# bin/nodetool -h localhost cfstats
Keyspace: system
  Read Count: 878
  Read Latency: 5752.042634396355 ms.
  Write Count: 2260398
  Write Latency: 0.014567047926957996 ms.
  Pending Tasks: 0
      Column Family: LocationInfo
      SSTable count: 2
      Space used (live): 3569
      Space used (total): 3569
      Memtable Columns Count: 0
      Memtable Data Size: 0
      Memtable Switch Count: 1
      Read Count: 1
      Read Latency: NaN ms.
      Write Count: 6
      Write Latency: NaN ms.
      Pending Tasks: 0
      Key cache capacity: 2
      Key cache size: 1
      Key cache hit rate: NaN
      Row cache: disabled
      Compacted row minimum size: 0
      Compacted row maximum size: 0
      Compacted row mean size: 0

      Column Family: HintsColumnFamily
      SSTable count: 2
      Space used (live): 70272035
      Space used (total): 70272035
      Memtable Columns Count: 56264
      Memtable Data Size: 486854
      Memtable Switch Count: 21
      Read Count: 877
      Read Latency: 13614.412 ms.
      Write Count: 2260392
      Write Latency: 0.142 ms.
      Pending Tasks: 0
      Key cache capacity: 2
      Key cache size: 2
      Key cache hit rate: 0.25
      Row cache: disabled
      Compacted row minimum size: 78567
      Compacted row maximum size: 39561901
      Compacted row mean size: 27878603

----------------
Keyspace: Titan
  Read Count: 8948702
  Read Latency: 7.949136100185256 ms.
  Write Count: 3393490
  Write Latency: 0.19255415398306758 ms.
  Pending Tasks: 0
      Column Family: FbUser
      SSTable count: 6
      Space used (live): 3675014807
      Space used (total): 3675014807
      Memtable Columns Count: 250055
      Memtable Data Size: 9146339
      Memtable Switch Count: 9
      Read Count: 6591406
      Read Latency: 8.361 ms.
      Write Count: 343030
      Write Latency: 0.078 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 200000
      Key cache hit rate: 0.6341912478864628
      Row cache: disabled
      Compacted row minimum size: 320
      Compacted row maximum size: 586
      Compacted row mean size: 479

      Column Family: Payment
      SSTable count: 3
      Space used (live): 2473
      Space used (total): 2473
      Memtable Columns Count: 0
      Memtable Data Size: 0
      Memtable Switch Count: 0
      Read Count: 27728
      Read Latency: 0.059 ms.
      Write Count: 0
      Write Latency: NaN ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 0
      Key cache hit rate: NaN
      Row cache: disabled
      Compacted row minimum size: 0
      Compacted row maximum size: 0
      Compacted row mean size: 0

      Column Family: Club
      SSTable count: 6
      Space used (live): 363609256
      Space used (total): 477968226
      Memtable Columns Count: 124134
      Memtable Data Size: 38144903
      Memtable Switch Count: 140
      Read Count: 63668
      Read Latency: 29.187 ms.
      Write Count: 1966098
      Write Latency: 0.184 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 63751
      Key cache hit rate: 0.6543073524771221
      Row cache: disabled
      Compacted row minimum size: 274
      Compacted row maximum size: 49784
      Compacted row mean size: 7711

      Column Family: RandomUsers
      SSTable count: 1
      Space used (live): 1329464
      Space used (total): 1329464
      Memtable Columns Count: 170433
      Memtable Data Size: 5525529
      Memtable Switch Count: 8
      Read Count: 58236
      Read Latency: 6.215 ms.
      Write Count: 143339
      Write Latency: 0.575 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 21
      Key cache hit rate: 1.0
      Row cache: disabled
      Compacted row minimum size: 1244196
      Compacted row maximum size: 1327409
      Compacted row mean size: 1287929

      Column Family: User
      SSTable count: 4
      Space used (live): 583812728
      Space used (total): 583812728
      Memtable Columns Count: 248833
      Memtable Data Size: 7909687
      Memtable Switch Count: 12
      Read Count: 2207667
      Read Latency: 8.413 ms.
      Write Count: 941029
      Write Latency: 0.107 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 94940
      Key cache hit rate: 0.9769078238427732
      Row cache: disabled
      Compacted row minimum size: 259
      Compacted row maximum size: 174340
      Compacted row mean size: 3916

----------------
Keyspace: Pandora
  Read Count: 1475530
  Read Latency: 5.249669284257182 ms.
  Write Count: 856550
  Write Latency: 0.16070848053236822 ms.
  Pending Tasks: 0
      Column Family: Folder
      SSTable count: 7
      Space used (live): 942926793
      Space used (total): 942926793
      Memtable Columns Count: 150119
      Memtable Data Size: 814194
      Memtable Switch Count: 21
      Read Count: 604605
      Read Latency: 7.539 ms.
      Write Count: 553015
      Write Latency: 0.173 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 111884
      Key cache hit rate: 0.9770064886992118
      Row cache: disabled
      Compacted row minimum size: 234
      Compacted row maximum size: 1644541
      Compacted row mean size: 1238

      Column Family: Attachment
      SSTable count: 5
      Space used (live): 754692822
      Space used (total): 754692822
      Memtable Columns Count: 548
      Memtable Data Size: 7110
      Memtable Switch Count: 8
      Read Count: 22708
      Read Latency: 6.950 ms.
      Write Count: 29835
      Write Latency: 0.025 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 8299
      Key cache hit rate: 0.017857142857142856
      Row cache: disabled
      Compacted row minimum size: 269
      Compacted row maximum size: 274
      Compacted row mean size: 272

      Column Family: Message
      SSTable count: 7
      Space used (live): 3689251440
      Space used (total): 3689251440
      Memtable Columns Count: 50399
      Memtable Data Size: 1361773
      Memtable Switch Count: 10
      Read Count: 163630
      Read Latency: 16.981 ms.
      Write Count: 257843
      Write Latency: 0.113 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 52427
      Key cache hit rate: 0.5041365046535677
      Row cache: disabled
      Compacted row minimum size: 272
      Compacted row maximum size: 2754
      Compacted row mean size: 752

      Column Family: FolderInfo
      SSTable count: 0
      Space used (live): 0
      Space used (total): 0
      Memtable Columns Count: 0
      Memtable Data Size: 0
      Memtable Switch Count: 0
      Read Count: 8704
      Read Latency: 0.008 ms.
      Write Count: 0
      Write Latency: NaN ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 0
      Key cache hit rate: NaN
      Row cache: disabled
      Compacted row minimum size: 0
      Compacted row maximum size: 0
      Compacted row mean size: 0

      Column Family: StandardByUUID1
      SSTable count: 0
      Space used (live): 0
      Space used (total): 0
      Memtable Columns Count: 0
      Memtable Data Size: 0
      Memtable Switch Count: 0
      Read Count: 8704
      Read Latency: 0.005 ms.
      Write Count: 0
      Write Latency: NaN ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 0
      Key cache hit rate: NaN
      Row cache: disabled
      Compacted row minimum size: 0
      Compacted row maximum size: 0
      Compacted row mean size: 0

      Column Family: User
      SSTable count: 4
      Space used (live): 120121309
      Space used (total): 120121309
      Memtable Columns Count: 1786
      Memtable Data Size: 50657
      Memtable Switch Count: 8
      Read Count: 641067
      Read Latency: 2.728 ms.
      Write Count: 15857
      Write Latency: 0.148 ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 45965
      Key cache hit rate: 0.957460499034818
      Row cache: disabled
      Compacted row minimum size: 271
      Compacted row maximum size: 506
      Compacted row mean size: 499

      Column Family: Standard1
      SSTable count: 0
      Space used (live): 0
      Space used (total): 0
      Memtable Columns Count: 0
      Memtable Data Size: 0
      Memtable Switch Count: 0
      Read Count: 8704
      Read Latency: 0.005 ms.
      Write Count: 0
      Write Latency: NaN ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 0
      Key cache hit rate: NaN
      Row cache: disabled
      Compacted row minimum size: 0
      Compacted row maximum size: 0
      Compacted row mean size: 0

      Column Family: Standard2
      SSTable count: 0
      Space used (live): 0
      Space used (total): 0
      Memtable Columns Count: 0
      Memtable Data Size: 0
      Memtable Switch Count: 0
      Read Count: 8704
      Read Latency: 0.006 ms.
      Write Count: 0
      Write Latency: NaN ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 0
      Key cache hit rate: NaN
      Row cache: disabled
      Compacted row minimum size: 0
      Compacted row maximum size: 0
      Compacted row mean size: 0

      Column Family: Folder1
      SSTable count: 0
      Space used (live): 0
      Space used (total): 0
      Memtable Columns Count: 0
      Memtable Data Size: 0
      Memtable Switch Count: 0
      Read Count: 8704
      Read Latency: 0.005 ms.
      Write Count: 0
      Write Latency: NaN ms.
      Pending Tasks: 0
      Key cache capacity: 200000
      Key cache size: 0
      Key cache hit rate: NaN
      Row cache: disabled
      Compacted row minimum size: 0
      Compacted row maximum size: 0
      Compacted row mean size: 0

----------------
r...@cdb-006:/glass/sfw/cassandra# bin/nodetool -h localhost tpstats
Pool Name                    Active   Pending      Completed
FILEUTILS-DELETE-POOL             0         0             70
STREAM-STAGE                      0         0              0
RESPONSE-STAGE                    0         0       18298968
ROW-READ-STAGE                    1         0       10168985
LB-OPERATIONS                     0         0              0
MESSAGE-DESERIALIZER-POOL         0         0       24960681
GMFD                              0         0          93647
LB-TARGET                         0         0              0
CONSISTENCY-MANAGER               0         0              0
ROW-MUTATION-STAGE                0         0        4280703
MESSAGE-STREAMING-POOL            0         0             12
LOAD-BALANCER-STAGE               0         0              0
FLUSH-SORTER-POOL                 0         0              0
MEMTABLE-POST-FLUSHER             0         0            224
FLUSH-WRITER-POOL                 0         0            224
AE-SERVICE-STAGE                  0         0             22
HINTED-HANDOFF-POOL               1         5             73
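
As a rough illustration of how the tpstats output above can be scanned for backed-up stages, here is a minimal Python sketch. It assumes the four-column layout shown (pool name, active, pending, completed) and copies only three of the rows; the function name pools_with_pending is hypothetical and not part of nodetool.

```python
# A few rows copied from the tpstats output above.
TPSTATS = """\
Pool Name                    Active   Pending      Completed
ROW-READ-STAGE                    1         0       10168985
ROW-MUTATION-STAGE                0         0        4280703
HINTED-HANDOFF-POOL               1         5             73
"""


def pools_with_pending(text):
    """Return (name, active, pending) for every pool with pending tasks."""
    result = []
    for line in text.splitlines()[1:]:  # skip the header row
        parts = line.split()
        if len(parts) != 4:
            continue  # ignore blank or malformed lines
        name = parts[0]
        active, pending = int(parts[1]), int(parts[2])
        if pending > 0:
            result.append((name, active, pending))
    return result


print(pools_with_pending(TPSTATS))  # [('HINTED-HANDOFF-POOL', 1, 5)]
```

In this sample only HINTED-HANDOFF-POOL has pending work.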

Here is some system load data I collected.

top - 00:43:43 up 1 day, 20:15,  2 users,  load average: 2.96, 2.74, 2.37
Tasks: 160 total,   1 running, 159 sleeping,   0 stopped,   0 zombie
Cpu0 : 10.9%us, 2.6%sy, 0.0%ni, 83.2%id, 3.2%wa, 0.0%hi, 0.0%si, 0.0%st
Cpu1 : 9.6%us, 2.2%sy, 0.0%ni, 85.4%id, 2.8%wa, 0.0%hi, 0.0%si, 0.0%st
Cpu2 : 37.1%us, 4.2%sy, 0.0%ni, 23.3%id, 33.8%wa, 1.2%hi, 0.3%si, 0.0%st
Cpu3 : 18.5%us, 3.9%sy, 0.0%ni, 62.6%id, 9.5%wa, 4.1%hi, 1.4%si, 0.0%st
Cpu4 : 11.1%us, 2.6%sy, 0.0%ni, 80.1%id, 3.2%wa, 2.9%hi, 0.2%si, 0.0%st
Cpu5 : 10.1%us, 2.7%sy, 0.0%ni, 84.4%id, 2.7%wa, 0.0%hi, 0.0%si, 0.0%st
Cpu6 : 20.1%us, 2.6%sy, 0.0%ni, 62.6%id, 14.7%wa, 0.0%hi, 0.0%si, 0.0%st
Cpu7 : 18.9%us, 4.4%sy, 0.0%ni, 69.0%id, 7.7%wa, 0.0%hi, 0.1%si, 0.0%st
Mem:   8188108k total,  8135168k used,    52940k free,    12968k buffers
Swap: 62500792k total,   277484k used, 62223308k free,  2381704k cached

  PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM     TIME+ COMMAND
18000 root  20   0 15.6g 6.7g 1.5g S   64 85.4 552:54.13 java
    1 root  20   0 19316  672  412 S    0  0.0   0:03.36 init
r...@cdb-006:/glass/sfw/cassandra# vmstat 5
procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu----
 r  b   swpd   free   buff   cache   si   so    bi    bo   in    cs us sy id wa
 5  1 277984  50180  12888 2382292   84   67   880   127   10     4 17  4 69 10
 3  1 278000  51536  12856 2381316    4   14  2556    37 8660 10714  7  4 82  8
 2  0 277968  52084  12768 2380156    7    2   891   618 6287  8865  6  3 87  3
 3  0 277872  52020  12764 2379412   24    7  1070    22 5492 10111  8  3 84  4
 0  1 277528  50764  12760 2379176    2   10  1530   696 8773 13661  8  4 83  5
