Anyway To Query Just The Partition Key?

2013-04-12 Thread Gareth Collins
Hello, If I have a cql3 table like this (I don't have a table with this data - this is just for example): create table ( surname text, city text, country text, event_id timeuuid, data text, PRIMARY KEY ((surname, city, country),event_id)); there is no way of (easily) gett

Re: Any experience of 20 node mini-itx cassandra cluster

2013-04-12 Thread Jabbar Azam
What about using quad core athlon x4 740 3.2 GHz with 8gb of ram and 256gb ssds? I know it will depend on our workload but will be better than a dual core CPU. I think Jabbar Azam On 13 Apr 2013 01:05, "Edward Capriolo" wrote: > Duel core not the greatest you might run into GC issues before

Re: CQL3 And ReversedTypes Question

2013-04-12 Thread Gareth Collins
OK, trying out 1.2.4. The previous issue seems to be fine, but I am experiencing a new one: cqlsh:location> create table test_y (message_id timeuuid, name text, PRIMARY KEY (name,message_id)); cqlsh:location> insert into test_y (message_id,name) VALUES (now(),'foo'); cqlsh:location> insert into te

Re: Any experience of 20 node mini-itx cassandra cluster

2013-04-12 Thread Edward Capriolo
Duel core not the greatest you might run into GC issues before you run out of IO from your ssd devices. Also cassandra has other concurrency settings that are tuned roughly around the number of processors/cores. It is not uncommon to see 4-6 cores of cpu (600 % in top dealing with young gen garbage

Re: A couple of question about replication

2013-04-12 Thread Tyler Hobbs
It seems like you have a few concepts about consistency levels and replication confused. It might be more useful to do some background reading first: - http://www.datastax.com/docs/1.2/cluster_architecture/data_distribution - http://www.datastax.com/docs/1.2/cluster_architecture/about_client_requ

A couple of question about replication

2013-04-12 Thread Ostap Bender
Hi All, I don't have any experience with Cassandra but we are about to evaluate it for one of our project and a few questions had came up, namely:--Let say we have replica set with 9 nodes. There is a write with consistency 1 and replication "majority", so 5 nodes will get the data (sa

Re: unexplained hinted handoff

2013-04-12 Thread Dane Miller
On Fri, Apr 12, 2013 at 1:12 PM, Dane Miller wrote: > I'm seeing hinted handoff kick in on all our nodes during periods of > high activity, but all the nodes seem to be up (according to the logs > and nodetool status). The pattern in the logs is something like this: > > 18:10:45 194 READ messages

unexplained hinted handoff

2013-04-12 Thread Dane Miller
I'm seeing hinted handoff kick in on all our nodes during periods of high activity, but all the nodes seem to be up (according to the logs and nodetool status). The pattern in the logs is something like this: 18:10:45 194 READ messages dropped in last 5000ms 18:11:10 Started hinted handoff for ho

Re: Cassandra Summit 2013

2013-04-12 Thread Brian Tarbox
Jonathan, I'm a bit puzzled. I had planned to attend Cassandra's major conference in the summer but then the NYC* conference was announced. I spoke with DataStax and was told that there was no summer conference this year and that NYC* was all there was. So, I spent my conference time/budget on i

ECCN for Cassandra

2013-04-12 Thread Mark McCraw
Hi All, I was wondering if Cassandra is registered for a ECCN (Export Control Classification Number) with the US Federal Government for export/distribution purposes. It doesn't seem to be in the matrix with other Apache projects (http://www.apache.org/licenses/exports/#matrix), but I didn't know

Re: Any experience of 20 node mini-itx cassandra cluster

2013-04-12 Thread Jabbar Azam
That's my guess. My colleague is still looking at CPU's so I'm hoping he can get quad core CPU's for the servers. Thanks Jabbar Azam On 12 April 2013 16:48, Colin Blower wrote: > If you have not seen it already, checkout the Netflix blog post on their > performance testing of AWS SSD instanc

Re: Any experience of 20 node mini-itx cassandra cluster

2013-04-12 Thread Colin Blower
If you have not seen it already, checkout the Netflix blog post on their performance testing of AWS SSD instances. http://techblog.netflix.com/2012/07/benchmarking-high-performance-io-with.html My guess, based on very little experience, is that you will be CPU bound. On 04/12/2013 03:05 AM, Jabb

Cassandra Summit 2013

2013-04-12 Thread Jonathan Ellis
Hi all, Last year's Summit saw fantastic talks [1] and over 800 attendees. The feedback was enthusiastic; the most commonly requested improvement was to extend it to two days. We're pleased to deliver just that for 2013! This year's Cassandra Summit will be at Fort Mason in San Francisco, Califo

Re: Repair hanges on 1.1.4

2013-04-12 Thread Alexis Rodríguez
Adeel, It may be a problem in the remote node, could you check the system.log? Also you might want to check the rpc_timeout_in_ms in both nodes, maybe an increase in this parameter helps. On Fri, Apr 12, 2013 at 9:17 AM, wrote: > Hi, > > I have started repair on newly added node with -pr a

Re: Adding new nodes to an existing cluster

2013-04-12 Thread Gabriel Ciuloaica
Thanks Marcus, So, at this moment I have to wait until release of version 1.2.5 to be able to extend the cluster ... I haven't seen mention any workaround ... Thanks, Gabi On 4/12/13 11:11 AM, Marcus Eriksson wrote: looks very similar to this: https://issues.apache.org/jira/browse/CASSANDRA-5

Re: Does Memtable resides in Heap?

2013-04-12 Thread Edward Capriolo
This issue describes the design of the arena allocation of memtabes. https://issues.apache.org/jira/browse/CASSANDRA-2252 On Fri, Apr 12, 2013 at 1:35 AM, Viktor Jevdokimov < viktor.jevdoki...@adform.com> wrote: > Memtables resides in heap, write rate impacts GC, more writes - more > frequent an

Re: Broken pipe when variating a lot number of connections

2013-04-12 Thread Edward Capriolo
The YCSB client is not very advanced. Hector, asynax or the native driver will work better. There are a few ycsb forks as each nosql person usually needs to fork ycsb to get the most out of it, check github. On Thu, Apr 11, 2013 at 6:28 PM, Rodrigo Felix < rodrigofelixdealme...@gmail.com> wrote:

Re: Timestamps and CQL

2013-04-12 Thread Edward Capriolo
You are correct. In CQL the timestamps come from the server unless specified. In thrift the user must supply, otherwise it is always 0. On Fri, Apr 12, 2013 at 9:20 AM, Michael Theroux wrote: > Hello, > > We are having an odd sporadic issue that I believe maybe due to time > synchronization. Wi

Re: Astyanax with C* 1.2.3 and VNodes

2013-04-12 Thread Edward Capriolo
Your best bet is to switch to RandomPartitioner. Otherwise you have to patch or wait until astynax catches up. On Fri, Apr 12, 2013 at 9:42 AM, Keith Wright wrote: > Hi all, > > I am trying to use Astyanax 1.56.37 to connect to C* 1.2.3 using > murmur3 and Vnodes and I am consistently getti

Re: CorruptedBlockException

2013-04-12 Thread Alexis Rodríguez
Lanny, We will try that. Thanks a lot On Thu, Apr 11, 2013 at 11:13 PM, Lanny Ripple wrote: > Saw this in earlier versions. Our workaround was disable; drain; snap; > shutdown; delete; link from snap; restart; > > -ljr > > On Apr 11, 2013, at 9:45, wrote: > > I have formulated the followin

Astyanax with C* 1.2.3 and VNodes

2013-04-12 Thread Keith Wright
Hi all, I am trying to use Astyanax 1.56.37 to connect to C* 1.2.3 using murmur3 and Vnodes and I am consistently getting NoAvailableHostsExceptions. I found the following thread which seems to indicate that the issue is due to Cassandra not properly communicating the ring info to Astyanax

Timestamps and CQL

2013-04-12 Thread Michael Theroux
Hello, We are having an odd sporadic issue that I believe maybe due to time synchronization. Without going into details on the issue right now, quick question, from the documentation I see numerous references that Cassandra utilizes timestamps generated by the clients to determine write serial

Repair hanges on 1.1.4

2013-04-12 Thread adeel . akbar
Hi, I have started repair on newly added node with -pr and this nodes exist on another data center. I have 5MB internet connection and configured setstreamthroughput 1. After some time repair goes hang and following meesage found in logs; # /opt/apache-cassandra-1.1.4/bin/nodetool -h loca

Re: running cassandra on 8 GB servers

2013-04-12 Thread Hiller, Dean
Hmmm, what is the recommendation for a 10G network if 1G was 300G to 500GŠI am guessing I can't do 10 times that, correct? But maybe I could squeak out 600G to 1T? Thanks, Dean On 4/11/13 2:26 PM, "aaron morton" wrote: >> The data will be huge, I am estimating 4-6 TB per server. I know this >>

Re: Blobs in CQL?

2013-04-12 Thread Mikhail Mazursky
Hi, let me ask a related question. Is it planned to support passing InputStream or Guava's InputSupplier/ByteSource to BoundStatement to make it possible to take advantage of new native transport's streaming? p.s. the new driver looks cool, but I haven't tried it yet. Mikhail.

Any experience of 20 node mini-itx cassandra cluster

2013-04-12 Thread Jabbar Azam
Hello, I'm going to be building a 20 node cassandra cluster in one datacentre. The spec of the servers will roughly be dual core Celeron CPU, 256 GB SSD, 16GB RAM and two nics. Has anybody done any performance testing with this setup or have any gotcha's I should be aware of wrt to the hardware?

Re: Exception for version 1.1.0

2013-04-12 Thread Winsdom Chen
Hi Aaron, Thanks for your reply! I've checked with release note, the patch has applied in 1.2.3. If upgrade from 1.1.0 to 1.2.3, any data migration or other efforts?

Re: Adding new nodes to an existing cluster

2013-04-12 Thread Marcus Eriksson
looks very similar to this: https://issues.apache.org/jira/browse/CASSANDRA-5418 /Marcus On Fri, Apr 12, 2013 at 9:12 AM, Gabriel Ciuloaica wrote: > Hi, > > From yesterday, I'm trying to add a new node to an existing 3 nodes > Cassandra cluster, running version 1.2.3. Today I have started clea

Re: multiple Datacenter values in PropertyFileSnitch

2013-04-12 Thread Andras Szerdahelyi
I would replicate your different keyspaces to different DCs and scale those appropriately So, for example, HighLoad KS replicates to really-huge-dc, which would have, 10 nodes, LowerLoad KS replicates to smaller-dc with 5 nodes. The idea is , you do not mix your different keyspaces in the same da

Re: is the select result grouped by the value of the partition key?

2013-04-12 Thread Sorin Manolache
On 2013-04-11 22:10, aaron morton wrote: Is it guaranteed that the rows are grouped by the value of the partition key? That is, is it guaranteed that I'll get Your primary key (k1, k2) is considered in type parts (partition_key , grouping_columns). In your case the primary_key is key and the gro

Adding new nodes to an existing cluster

2013-04-12 Thread Gabriel Ciuloaica
Hi, From yesterday, I'm trying to add a new node to an existing 3 nodes Cassandra cluster, running version 1.2.3. Today I have started clean, removed the node that I have tried to add yesterday and try to add another node. Verified both cassandra.yaml and cassandra-topology.properties are id