Re: simple data movement ?

2014-12-17 Thread Ben Bromhead
Just copy the data directory from each prod node to your test node (and relevant configuration files etc). If your IP addresses are different between test and prod, follow https://engineering.eventbrite.com/changing-the-ip-address-of-a-cassandra-node-with-auto_bootstrapfalse/ On 18 December 2014

Re: bootstrapping manually when auto_bootstrap=false ?

2014-12-17 Thread Ben Bromhead
- In Cassandra yaml set auto_bootstrap = false - Boot node - nodetool rebuild Very similar to http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_add_dc_to_cluster_t.html On 18 December 2014 at 14:04, Kevin Burton wrote: > > I’m trying to figure out the best way

Cassandra for Analytics?

2014-12-17 Thread Ajay
Hi, Can Cassandra be used or best fit for Real Time Analytics? I went through couple of benchmark between Cassandra Vs HBase (most of it was done 3 years ago) and it mentioned that Cassandra is designed for intensive writes and Cassandra has higher latency for reads than HBase. In our case, we wil

bootstrapping manually when auto_bootstrap=false ?

2014-12-17 Thread Kevin Burton
I’m trying to figure out the best way to bootstrap our nodes. I *think* I want our nodes to be manually bootstrapped. This way an admin has to explicitly bring up the node in the cluster and I don’t have to worry about a script accidentally provisioning new nodes. The problem is HOW do you do it

Re: Cassandra metrics & Graphite

2014-12-17 Thread Dennis Lovely
Hi Nigel, I'm using Casssandra 2.0.7 and in Cassandra lib directory, I'm dropping in this reporter jar: http://search.maven.org/#artifactdetails|com.yammer.metrics|metrics-graphite|2.2.0|jar My cass metrics are flowing in properly into Graphite 0.9.10. Here's my metrics yaml: graphite: -

simple data movement ?

2014-12-17 Thread Langston, Jim
Hi all, I have set up a test environment with C* 2.1.2, wanting to test our applications against it. I currently have C* 1.2.9 in production and want to use that data for testing. What would be a good approach for simply taking a copy of the production data and moving it into the test env and havi

Re: getting column names

2014-12-17 Thread Adam Holmberg
Stephen, This topic is more appropriate for the python-driver-user list: https://groups.google.com/a/lists.datastax.com/forum/#!forum/python-driver-user Can we pick this up there, with a little further information including your table definition? Adam On Fri, Dec 12, 2014 at 1:04 PM, Stephen Jo

High Bloom Filter FP Ratio

2014-12-17 Thread Chris Hart
Hi, I have create the following table with bloom_filter_fp_chance=0.01: CREATE TABLE logged_event ( time_key bigint, partition_key_randomizer int, resource_uuid timeuuid, event_json text, event_type text, field_error_list map, javascript_timestamp timestamp, javascript_uuid uuid,

Re: Cassandra metrics & Graphite

2014-12-17 Thread Karl Rieb
This seemed to be due to a bug with how metric names are converted to file system paths. os.path.join() is used, but the metric path converts into an absolute path (e.g /org/apache/cassandra). This means you end up doing something like: os.path.join('/opt/graphite/storage/whatever', '/org/apach

Reload/resync system.peers table

2014-12-17 Thread Paulo Ricardo Motta Gomes
Hello, Due to CASSANDRA-6053 there are lots of ghost nodes on the system.peers table, because decommisioned nodes were not properly removed from this table. Is there any automatic way of reloading/resyncing the system.peers table? Or the only way is by removing ghost nodes? Tried to restart the

Cassandra metrics & Graphite

2014-12-17 Thread Nigel LEACH
I'm running Cassandra Cassandra 2.0.11.83 (via DSE 4.6.0), and Graphite 0.9.10. I know a bit about Cassandra, but not much about Graphite. Our Graphite server exposes system metrics, and also those from the example python scripts, successfully. I can see Cassandra metrics hitting the Graphite s

Re: [Consitency on cqlsh command prompt]

2014-12-17 Thread Adam Holmberg
This is something that could easily be improved in cqlsh. I'll get a ticket open today. Adam On Wed, Dec 17, 2014 at 8:38 AM, nitin padalia wrote: > Thanks! Michael. > On Dec 17, 2014 8:02 PM, "Laing, Michael" > wrote: > >> http://datastax.github.io/python-driver/api/cassandra.html >> >> On We

Re: [Consitency on cqlsh command prompt]

2014-12-17 Thread nitin padalia
Thanks! Michael. On Dec 17, 2014 8:02 PM, "Laing, Michael" wrote: > http://datastax.github.io/python-driver/api/cassandra.html > > On Wed, Dec 17, 2014 at 9:27 AM, nitin padalia > wrote: >> >> Thanks! Philip/Ryan, >> Ryan I am using single Datacenter. >> Philip could you point some link where we

Re: [Consitency on cqlsh command prompt]

2014-12-17 Thread Laing, Michael
http://datastax.github.io/python-driver/api/cassandra.html On Wed, Dec 17, 2014 at 9:27 AM, nitin padalia wrote: > > Thanks! Philip/Ryan, > Ryan I am using single Datacenter. > Philip could you point some link where we could see those enums. > -Nitin > On Dec 17, 2014 7:14 PM, "Philip Thompson"

Re: [Consitency on cqlsh command prompt]

2014-12-17 Thread nitin padalia
Thanks! Philip/Ryan, Ryan I am using single Datacenter. Philip could you point some link where we could see those enums. -Nitin On Dec 17, 2014 7:14 PM, "Philip Thompson" wrote: > I believe the problem here is that the consistency level it is showing you > is not the number of nodes that need to

Re: Query strategy with respect to tombstones

2014-12-17 Thread Ryan Svihla
so first limits are good, the unlimited row count of a user can eventually eat you, which I suspect it is here, you maybe better off partitioning your data with some reasonable limits, but this is a bigger domain modeling conversation. Second, tombstone overflowing is typically a canary for a data

Re: [Consitency on cqlsh command prompt]

2014-12-17 Thread Philip Thompson
I believe the problem here is that the consistency level it is showing you is not the number of nodes that need to respond, but the enum value that corresponds to QUORUM internally. If you would like, you can file an improvement request on the Apache Cassandra Jira. On Wed, Dec 17, 2014 at 12:47 A

Re: [Consitency on cqlsh command prompt]

2014-12-17 Thread Ryan Svihla
DO you have 2 data centers? QUORUM is (total_replicas/2)+1 You may want LOCAL_QUORUM On Tue, Dec 16, 2014 at 11:47 PM, nitin padalia wrote: > > Hi, > > When I set Consistency to QUORUM in cqlsh command line. It says > consistency is set to quorum. > > cqlsh:testdb> CONSISTENCY QUORUM ; > Consist

Query strategy with respect to tombstones

2014-12-17 Thread Jens Rantil
Hi, I have a table with composite primary id ((userid), id). Some patterns about my table:  * Each user generally has 0-3000 rows. But there is currently no upper limit.  * Deleting rows for a user is extremely rare, but when done it can be done thousands of rows at a time.  * The absolutely mo

Re: 100% CPU utilization, ParNew and never completing compactions

2014-12-17 Thread Arne Claassen
Ok, tonight we rolled out on the production cluster. This one has 4 nodes and we dropped and recreated the keyspace before re-processing to avoid all possibility of Everything seemed ok, even if the CPU load was pegged and we saw lots of MUTATION dropped message, but after all the reprocessing