Re: Estimating write throughput with LeveledCompactionStrategy

2013-02-11 Thread Ивaн Cобoлeв
Yup, we set it to 100M. Currently we have around 1 TB of data per node (getting to level 5 now), and the data pieces are rather large (small tables would flush more often). Yes, you're right, it's slower, thus building mental models is more time effective than experimenting :) Ivan 2013/2/6 Wei Zhu : > I h
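The figures in this message (a 100M sstable size, roughly 1 TB per node, "getting to level 5") can be checked with a little arithmetic. The sketch below is only a rough mental model: it assumes each level holds ten times the previous one and that level N holds about 10^N sstables, which is the commonly described behaviour of leveled compaction and not something stated in the thread. The function names are hypothetical.

```python
def level_capacity_mb(level, sstable_size_mb=100, fanout=10):
    """Approximate capacity of a level >= 1, assuming a fixed fanout per level."""
    return sstable_size_mb * fanout ** level


def deepest_level_needed(data_mb, sstable_size_mb=100, fanout=10):
    """Fill levels from L1 upwards and return the deepest level that holds data.

    Assumption: each level fills completely before the next one is used,
    which real compaction does not strictly guarantee.
    """
    level, remaining = 1, data_mb
    while remaining > level_capacity_mb(level, sstable_size_mb, fanout):
        remaining -= level_capacity_mb(level, sstable_size_mb, fanout)
        level += 1
    return level


if __name__ == "__main__":
    for lvl in range(1, 6):
        print(f"L{lvl}: ~{level_capacity_mb(lvl):,} MB")
    print("deepest level for 1.2 TB:", deepest_level_needed(1_200_000))
```

Under these assumptions L1 through L4 together hold about 1.1 TB, so a node with around 1 TB sits close to the point where L5 starts to fill.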

Spike in latency, one node keeps firing "Interval min > max" errors

2013-02-11 Thread Drew Broadley
Hi there, I have a cluster of three nodes running Cassandra 1.2.0. I received alerts from my monitoring, and then discovered this huge spike in cluster latency: https://dl.dropbox.com/u/3444322/Screen%20Shot%202013-02-12%20at%205.07.49%20PM.png Investigating what is going on, there is no load on

Re: Operation Consideration with Counter Column Families

2013-02-11 Thread Drew Kutcharian
For anyone interested, I came across this video where Sylvain explains how counters are actually implemented in Cassandra. http://vimeo.com/26011102 On Feb 6, 2013, at 8:08 PM, aaron morton wrote: >> Thanks Aaron, so will there only be one "value" for each counter column per >> sstable just

Cassandra 1.2 Atomic Batches and Thrift API

2013-02-11 Thread Drew Kutcharian
Hey Guys, Is the new atomic batch feature in Cassandra 1.2 available via the thrift API? If so, how can I use it? -- Drew

Re: Upgrade to Cassandra 1.2

2013-02-11 Thread Daning Wang
Thanks Aaron. I tried to migrate an existing cluster (ver 1.1.0) to 1.2.1 but failed. - I followed http://www.datastax.com/docs/1.2/install/upgrading, and merged cassandra.yaml with the following parameters: num_tokens: 256 #initial_token: 0 The initial_token is commented out; the current token should be obta

Re: Cassandra 1.1.2 -> 1.1.8 upgrade

2013-02-11 Thread Mike
So the upgrade sstables is recommended as part of the upgrade to 1.1.3 if you are using counter columns. Also, there was a general recommendation (in another response to my question) to run upgrade sstables because of: "upgradesstables always needs to be done between majors. While 1.1.2 -> 1.

Re: Cassandra libraries for Golang

2013-02-11 Thread Ben Hood
Hi Boris, I use this one with Cassandra 1.2+ (you'll need to turn the native port on): https://github.com/titanous/gocql HTH, Ben On Friday, 8 February 2013 at 16:40, Boris Solovyov wrote: > Hi, > > I'm developing Go application. I see there is gossie, which doesn't support > the native b

Re: what addresses to use in EC2 cluster (whenever an instance restarts it gets a new private ip)?

2013-02-11 Thread Andrey Ilinykh
You have to use private IPs, but if an instance dies you have to bootstrap it with the replace token flag. If you use EC2 I'd recommend Netflix's Priam tool. It manages all that stuff, plus you have S3 backup. Andrey On Mon, Feb 11, 2013 at 11:35 AM, Brian Tarbox wrote: > How do I configure my clu

Cassandra benchmark

2013-02-11 Thread Kanwar Sangha
Hi - I am trying to run a benchmark using the Cassandra-stress tool. They have given an example to insert data across 2 nodes - /tools/stress/bin/stress -d 192.168.1.101,192.168.1.102 -n 1000 But when I run this across my 2 node cluster, I see the same keys in both nodes. Replication is not en

Re: Directory structure after upgrading 1.0.8 to 1.2.1

2013-02-11 Thread aaron morton
I think it's a little more subtle than that https://issues.apache.org/jira/browse/CASSANDRA-5242 Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 8/02/2013, at 10:21 PM, "Desimpel, Ignace" wrote: > Yes it are new d

what addresses to use in EC2 cluster (whenever an instance restarts it gets a new private ip)?

2013-02-11 Thread Brian Tarbox
How do I configure my cluster to run in EC2? In my cassandra.yaml I have IP addresses under seed_provider, listen_address and rpc_address. I tried setting up my cluster using just the EC2 private addresses but when one of my instances failed and I restarted it there was a new private address. Su

Re: Why CQL returns data in byte format, while Hive de-serialize and return the data in readable format

2013-02-11 Thread aaron morton
I suggest trying Int32Type and Double http://www.datastax.com/docs/datastax_enterprise2.1/analytics/about_hive#mapping-hive-external-tables-to-cassandra-column-families Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On

Time complexity of cassandra operations

2013-02-11 Thread Tim Wintle
Hi, I've tried searching for this all over the place, but I can't find an answer anywhere... What is the (theoretical) time complexity of basic C* operations? I assume that single lookups are O(log(R/N)) for R rows across N nodes (as SST lookups should be O(log(n)) and there are R/N rows per nod

RE: unbalanced ring

2013-02-11 Thread Stephen.M.Thompson
Aaron, thanks for your feedback. .125 num_tokens: 256 # initial_token: .126 num_tokens: 256 #initial_token: .127 num_tokens: 256 # initial_token: This all looks correct. So when you say to do this with a "clean" setup, what are you asking me to do? Is it enough to blow away /var/lib/cassandr

Re: Cassandra jmx stats ReadCount

2013-02-11 Thread aaron morton
Are you using counters? They require a read before write. Also secondary index CFs require a read before write. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 8/02/2013, at 1:26 PM, Daning Wang wrote: > We have

Re: unbalanced ring

2013-02-11 Thread aaron morton
The tokens are not right, not right at all. Some are too short and some are too tall. More technically they do not appear to be randomly arranged. The tokens for the .125 node all start with -3, the .126 node only has negative tokens and the .127 node mostly has positive tokens. Check that on
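The symptom described here (one node holding only negative tokens, another mostly positive ones) can be checked mechanically. The sketch below assumes the tokens come from a signed 64-bit range, as with Murmur3Partitioner, so a randomly assigned set should be roughly half negative. The tolerance value and function names are hypothetical, and the sample token lists are generated for illustration, not taken from the thread.

```python
import random

# Assumed token range: signed 64-bit, as used by Murmur3Partitioner.
MIN_TOKEN = -2**63
MAX_TOKEN = 2**63 - 1


def negative_fraction(tokens):
    """Return the share of tokens that are below zero."""
    if not tokens:
        raise ValueError("no tokens given")
    return sum(1 for t in tokens if t < 0) / len(tokens)


def looks_skewed(tokens, tolerance=0.15):
    """Flag a node whose negative/positive split is far from 50/50.

    The tolerance is an arbitrary illustrative threshold.
    """
    return abs(negative_fraction(tokens) - 0.5) > tolerance


# Illustrative data only: 256 tokens per node, matching num_tokens: 256.
rng = random.Random(42)
random_tokens = [rng.randint(MIN_TOKEN, MAX_TOKEN) for _ in range(256)]
all_negative = [rng.randint(MIN_TOKEN, -1) for _ in range(256)]

if __name__ == "__main__":
    print("random node skewed?", looks_skewed(random_tokens))
    print("all-negative node skewed?", looks_skewed(all_negative))
```

A sign check like this is a coarse test; it would catch the pattern reported for these nodes but not subtler clustering within one half of the range.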

RuntimeException during leveled compaction

2013-02-11 Thread Andre Sprenger
Hi, I'm running a 6 node Cassandra 1.1.5 cluster on EC2. We switched to leveled compaction a couple of weeks ago, and this has been successful. Some days ago 3 of the nodes started to log the following exception during compaction of a particular column family: ERROR [CompactionExecutor:726] 2013-0

Re: CQL 3 compound row key error

2013-02-11 Thread Shahryar Sedghi
Thanks Aaron. Opened CASSANDRA-5240 On Mon, Feb 11, 2013 at 4:34 AM, aaron morton wrote: > That sounds like a bug, or something that is still under work. Sylvain has > his finger on all things CQL. > > Can you raise a ticket on

Re: Cassandra 1.1.2 -> 1.1.8 upgrade

2013-02-11 Thread Michal Michalski
OK, thanks Aaron. I ask because NEWS.txt is not a big help for versions > 1.1.5 because there's no info on them in it (especially on 1.1.7, which seems to be the most important one in this case, according to DataStax's upgrade instructions) ;-) https://github.com/apache/cassandra/blob

Re: Cassandra 1.1.2 -> 1.1.8 upgrade

2013-02-11 Thread aaron morton
You can always run them. But in some situations repair cannot be used, and in this case new nodes cannot be added. The news.txt file is your friend there. As a general rule when upgrading a cluster I move one node to the new version and let it soak in for an hour or so. Just to catch any crazy

Re: Read-repair working, repair not working?

2013-02-11 Thread aaron morton
> CL.ONE : this is primarily for performance reasons … This makes reasoning about "correct" behaviour a little harder. If there is any way you can run some tests with R + W > N strong consistency I would encourage you to do so. You will then have a baseline of what works. > (say I make 100 requ
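The R + W > N rule mentioned here is easy to state in code. The sketch below treats R and W as the number of replicas that must acknowledge a read and a write, and N as the replication factor; the helper names are hypothetical and it ignores multi-DC details such as LOCAL_QUORUM.

```python
def quorum(replication_factor):
    """Number of replicas in a quorum: a strict majority of the replicas."""
    return replication_factor // 2 + 1


def is_strongly_consistent(read_replicas, write_replicas, replication_factor):
    """True when every read set must overlap every write set (R + W > N)."""
    return read_replicas + write_replicas > replication_factor


if __name__ == "__main__":
    rf = 3
    # CL.ONE for both reads and writes: read and write sets need not overlap.
    print("ONE/ONE:", is_strongly_consistent(1, 1, rf))
    # QUORUM for both: at least one replica is common to any read and any write.
    print("QUORUM/QUORUM:", is_strongly_consistent(quorum(rf), quorum(rf), rf))
```

Reads and writes at CL.ONE fail the check for any replication factor above one, which is why a QUORUM baseline makes the observed behaviour easier to reason about.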

Re: High CPU usage during repair

2013-02-11 Thread Tamar Fraenkel
Thank you very much! Due to monetary limitations I will keep the m1.large for now, but try the throughput modification. Tamar *Tamar Fraenkel * Senior Software Engineer, TOK Media ta...@tok-media.com Tel: +972 2 6409736 Mob: +972 54 8356490 Fax: +972 2 5612956 On

Re: CQL 3 compound row key error

2013-02-11 Thread aaron morton
That sounds like a bug, or something that is still under work. Sylvain has his finger on all things CQL. Can you raise a ticket on https://issues.apache.org/jira/browse/CASSANDRA Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpic

Re: High CPU usage during repair

2013-02-11 Thread aaron morton
> What machine size? > m1.large If you are seeing high CPU, move to an m1.xlarge, that's the sweet spot. > That's normally ok. How many are waiting? > > I have seen 4 this morning That's not really abnormal. The pending task count goes up when a file *may* be eligible for compaction, not wh

Re: Cassandra 1.1.2 -> 1.1.8 upgrade

2013-02-11 Thread Alain RODRIGUEZ
Not sure this will be useful for you, but "nodetool drain" hasn't worked properly for a while. If you are using counters I recommend removing the commit logs after you have drained and stopped the node, before restarting it, to avoid replaying counts. https://issues.apache.org/jira/browse/CASS

Re: Read-repair working, repair not working?

2013-02-11 Thread Brian Fleming
Hi Aaron, Many thanks for your reply - answers below. Cheers, Brian > What CL are you using for reads and writes? > I would first build a test case to ensure correct operation when using strong > consistency. i.e. QUORUM write and read. Because you are using RF 2 per DC I > assume you