anything on the roadmap to know the status return of executing nodetool maintenance commands

2010-09-20 Thread Dathan Pattishall
I would love the ability to run nodetool repair and get status updates on what is being done and when to what, as well as an estimated time of completion Like Grabbing data from nodes populating memtable cleanups etc. Currently this stuff happens asynchronously from the nodetool point of view (fi

what are ways to keep the SSTable Count down low

2010-09-20 Thread Dathan Pattishall
How do you set the compaction threshold from storage-conf.xml? is this possible? What is the consensus on a basic Key-Value store of setting the compactionthreshold min/max from ./nodetool --host=localhost --port=8181 getcompactionthreshold Current compaction threshold: Min=4, Max=32 to say some

Re: Memtable adjusting impact expectations?

2010-09-14 Thread Dathan Pattishall
mpact on my cluster," the answer is almost always "less impact." > > On Tue, Sep 14, 2010 at 8:33 PM, Dathan Pattishall > wrote: > > Okay from what I gather. When data is written its always written to > memory. > > The flow for our concerns is the data is written to

Memtable adjusting impact expectations?

2010-09-14 Thread Dathan Pattishall
Okay from what I gather. When data is written its always written to memory. The flow for our concerns is the data is written to the commitLog then to the memtable. If any of memtable's 3 tunable thresholds are hit a flush occurs writing the data sorted by key to the SSTABLE still enabling sequenti

Re: Using this I think I found/verified an interesting problem with Cassandra

2010-09-08 Thread Dathan Pattishall
you pay for not > doing random i/o at insert time). > > On Wed, Sep 8, 2010 at 2:08 PM, Dathan Pattishall > wrote: > > This link describe ganglia / cassandra graphing. > > > > http://mysqldba.blogspot.com/2010/09/cassandra-and-ganglia.html > > > > I ran into

Using this I think I found/verified an interesting problem with Cassandra

2010-09-08 Thread Dathan Pattishall
This link describe ganglia / cassandra graphing. http://mysqldba.blogspot.com/2010/09/cassandra-and-ganglia.html I ran into a problem illustrated here. http://www.flickr.com/photos/dathan/4971255111/ This screen shot shows a huge spike of transport exceptions between the hours of 12:15 - to 1:3

Re: Capping the memory limit in cassandra

2010-09-07 Thread Dathan Pattishall
Ah thanks Johnathan. I am using mmap, which is outside the control of the JVM from the looks of it. On Tue, Sep 7, 2010 at 11:15 AM, Jonathan Ellis wrote: > see http://www.mail-archive.com/user@cassandra.apache.org/msg03583.html > > On Tue, Sep 7, 2010 at 12:38 PM, Dathan Pattishall

Capping the memory limit in cassandra

2010-09-07 Thread Dathan Pattishall
For this java process /opt/java/bin/java -ea -Xms1G -*Xmx7G *-XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSParallelRemarkEnabled -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1 -XX:+HeapDumpOnOutOfMemoryError -Dcom.sun.management.jmxremote.port=8181 -Dcom.sun.management.jmxremote.ssl=false -Dcom

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-08-03 Thread Dathan Pattishall
iced: are you running 14 instances of Cassandra on a > single physical machine or are all those java processes something > else? > > On Mon, Jul 26, 2010 at 12:22 PM, Dathan Pattishall > wrote: > > I have 4 nodes on enterprise type hardware (Lots of Ram 12GB, 16 i7 > co

Re: what is the expected result of changing this in storage.conf

2010-08-03 Thread Dathan Pattishall
10 at 7:08 PM, Jonathan Ellis wrote: > You need to find out where your bottleneck is, before you start trying > to mitigate it. Some good first steps are at > http://spyced.blogspot.com/2010/01/linux-performance-basics.html > > On Tue, Aug 3, 2010 at 9:03 PM, Dathan Pattishall >

what is the expected result of changing this in storage.conf

2010-08-03 Thread Dathan Pattishall
Do reads at high concurrency get a boost if I where to raise this value? 8 32 Any benchmarks/reports showing a good sweet spot per CPU core?

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-30 Thread Dathan Pattishall
On Fri, Jul 30, 2010 at 1:05 AM, Peter Schuller wrote: > > Keyspace: TimeFrameClicks > > Read Count: 42686 > > Read Latency: 47.21777100220213 ms. > > Write Count: 18398 > > Write Latency: 0.17457457332318732 ms. > > Is this all traffic across "a few days" as you m

Re: Key Caching

2010-07-27 Thread Dathan Pattishall
woot thnx, lots of knobs to play with! On Tue, Jul 27, 2010 at 12:16 AM, Peter Schuller < peter.schul...@infidyne.com> wrote: > > @Todd, I noticed some new ops in your cassandra.in.sh. Is there any > > documentation on what these ops are, and what they do? > > > > For instance AggressiveOpts, etc

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-27 Thread Dathan Pattishall
e and read to the same disk may help some. > It’s not so much about “Cassandra’s” read rate but what your hardware can > manage. > > > > /Justus > > > > *Från:* Dathan Pattishall [mailto:datha...@gmail.com] > *Skickat:* den 27 juli 2010 08:56 > *Till:* user@c

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
hing but that really depends > > on your data set. > > > > -Anthony > > > > On Mon, Jul 26, 2010 at 12:22:46PM -0700, Dathan Pattishall wrote: > >> I have 4 nodes on enterprise type hardware (Lots of Ram 12GB, 16 i7 > cores, > >> RAID Disks). > >&

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
@Garo htop. On Mon, Jul 26, 2010 at 11:28 PM, Juho Mäkinen wrote: > Off topic, but what was this tool which prints per cpu utilization? > > - Garo > > On Mon, Jul 26, 2010 at 10:22 PM, Dathan Pattishall > wrote: > > But the 16 cores are hardly utilized. Which indic

Re: Key Caching

2010-07-26 Thread Dathan Pattishall
@Todd, I noticed some new ops in your cassandra.in.sh. Is there any documentation on what these ops are, and what they do? For instance AggressiveOpts, etc. On Mon, Jul 26, 2010 at 4:33 PM, B. Todd Burruss wrote: > i run cassandra with a 30gb heap on machines with 48gb total with good > resul

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
Okay so why would the pending be so large? I looked at iostat and the diskload is not bad at all. The service times /etc are all good. If it's not disk based I/O then is it thread thrashing? If so which thread pool should I monitor more closely. MESSAGE-DESERIALIZER-POOL:1 which would indicate t

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
Hpw does one set/get the read-concurrency? Also which read-concurrency, for which pool? According to the threads tab in jconsole read-concurrency could be among different pools. There are row-read-stag threads, lots of write threads, row mutation stage etc. On Mon, Jul 26, 2010 at 12:56 PM, P

what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
I have 4 nodes on enterprise type hardware (Lots of Ram 12GB, 16 i7 cores, RAID Disks). ~# /opt/cassandra/bin/nodetool --host=localhost --port=8181 tpstats Pool NameActive Pending Completed STREAM-STAGE 0 0 0 RESPONSE-STAGE

Re: Re: What is consuming the heap?

2010-07-20 Thread Dathan Pattishall
By off chance on writes are you using ConsistencyLevel::ZERO? On Tue, Jul 20, 2010 at 9:41 PM, 王一锋 wrote: > So the bloom filters reside in memory completely? > > We do have a lot of small values, hundreds of millions of columns in a > columnfamily. > > I count the total size of *-Filter.db f

Re: what causes a cassandra to block and throw a null exception

2010-07-20 Thread Dathan Pattishall
t; -Chris > > On Jul 20, 2010, at 8:59 PM, Dathan Pattishall wrote: > >> Type 'help' or '?' for help. Type 'quit' or 'exit' to quit. >> cassandra> connect cass01/9160 >> cassandra> get TimeFrameClicks.Standard2['test_cassandra_

what causes a cassandra to block and throw a null exception

2010-07-20 Thread Dathan Pattishall
Type 'help' or '?' for help. Type 'quit' or 'exit' to quit. cassandra> connect cass01/9160 cassandra> get TimeFrameClicks.Standard2['test_cassandra_alive'] Exception null The data exists and I can grab the data after I restart all the nodes, but once the cluster runs for a few minutes I cannot gra

Re: how come some nodes will drop nodes from the ring and not others?

2010-07-20 Thread Dathan Pattishall
Original Message----- > From: "Dathan Pattishall" > Sent: Tuesday, July 20, 2010 6:09pm > To: user@cassandra.apache.org > Subject: how come some nodes will drop nodes from the ring and not others? > > dsh is a distributed shell that basically runs the same command o

how come some nodes will drop nodes from the ring and not others?

2010-07-20 Thread Dathan Pattishall
dsh is a distributed shell that basically runs the same command on multiple servers. Notice that cass03 sees all 4 servers, yet the other 3 only sees three servers? Storage-conf.xml is the same among all nodes i.e. 10.129.28.14 10.129.28.20 10.129.28.22 10.129.28.23 c

Re: Ran into an issue where Cassandra Crashed when running out of heap space

2010-07-20 Thread Dathan Pattishall
The storage structure is rather simple. For every 1 key there is 1 column and a timestamp for that column. We don't enable pulling a huge amount of data and all other nodes are up servicing the same request. I suspect there may be another problem with Memory management inside Cassandra. Attac

Ran into an issue where Cassandra Crashed when running out of heap space

2010-07-20 Thread Dathan Pattishall
 INFO [HINTED-HANDOFF-POOL:1] 2010-07-20 15:10:43,721 HintedHandOffManager.java (line 210) Finished hinted handoff of 0 rows to endpoint /10.129.28.23 ERROR [pool-1-thread-37895] 2010-07-20 15:10:51,622 CassandraDaemon.java (line 83) Uncaught exception in thread Thread[pool-1-thread-37895,5,main] j