Re: Cassandra adding node issue (no UJ status)

2015-09-15 Thread Mark Greene
Hey Rock, I've seen this occur as well. I've come to learn that in some cases, like a network blip, the join can fail. There is usually something in the log to the effect of "Stream failed". When I encounter this issue, I make an attempt to bootstrap the new node again. If that doesn't help, I run

Re: Nodetool repair with Load times 5

2015-08-18 Thread Mark Greene
Hey Jean, Did you try running a nodetool cleanup on all your nodes, perhaps one at a time? On Tue, Aug 18, 2015 at 3:59 AM, Jean Tremblay < jean.tremb...@zen-innovations.com> wrote: > Hi, > > I have a phenomena I cannot explain, and I would like to understand. > > I’m running Cassandra 2.1.8 on

RepairException on C* 2.1.3

2015-04-17 Thread Mark Greene
I'm receiving an exception when I run a repair process via: 'nodetool repair -par ' I'm not sure if this is a bug or not but was curious to know if there was something that can be done to remedy this situation? Full stack trace from the logs: ERROR [ValidationExecutor:3] 2015-04-17 18:16:56,174

Re: Column value not getting updated

2015-03-31 Thread Mark Greene
Hey Saurabh, We're actually preparing for this ourselves and spinning up our own NTP server pool. The public NTP pools have a lot of drift and should not be relied upon for cluster technology that is sensitive to time skew like C*. The folks at Logentries did a great write up about this which we

C* 2.1.3 - Incremental replacement of compacted SSTables

2015-02-21 Thread Mark Greene
I saw in the NEWS.txt that this has been disabled. Does anyone know why that was the case? Is it temporary just for the 2.1.3 release? Thanks, Mark Greene

Re: High Bloom Filter FP Ratio

2014-12-19 Thread Mark Greene
We're seeing similar behavior except our FP ratio is closer to 1.0 (100%). We're using Cassandra 2.1.2. Schema --- CREATE TABLE contacts.contact ( id bigint, property_id int, created_at bigint, updated_at bigint,

Error when dropping keyspaces; One row required, 0 found

2014-12-01 Thread Mark Greene
I'm running Cassandra 2.1.0. I was attempting to drop two keyspaces via cqlsh and encountered an error in the CLI as well as the appearance of losing all my keyspaces. Below is the output from my cqlsh session: $ cqlsh

Re: Dynamic Columns in Cassandra 2.X

2014-06-13 Thread Mark Greene
value as blob or text(as JSON) and take care of the > serialization/deserialization yourself at the client side > > As an alternative you can save custom properties in a map, provided that > their number is not too large. But considering the business case of CRM, I > believe that

Re: Dynamic Columns in Cassandra 2.X

2014-06-13 Thread Mark Greene
is an accepted practice? --> Can you give some details > about your use case ? How would you "alter" a table structure to adapt it > to a new customer ? > > Wouldn't it be better to model your table so that it supports > addition/removal of customer ? > > > >

Re: Dynamic Columns in Cassandra 2.X

2014-06-13 Thread Mark Greene
requirements. > > 3) If I utilize CQL collections, will Cassandra page the entire thing into > the heap? --> Of course. All collections and maps in Cassandra are eagerly > loaded entirely in memory on server side. That's why it is recommended to > limit their cardinality to ~

Dynamic Columns in Cassandra 2.X

2014-06-13 Thread Mark Greene
I'm looking for some best practices w/r/t supporting arbitrary columns. It seems from the docs I've read around CQL that they are supported in some capacity via collections but you can't exceed 64K in size. For my requirements that would cause problems. So my questions are: 1) Is using Thrift a

Re: ec2 tests

2010-05-28 Thread Mark Greene
That should get you started. On Fri, May 28, 2010 at 3:10 AM, Chris Dean wrote: > Mark Greene writes: > > If you give us an objective of the test that will help. Trying to get max > > write throughput? Read throughput? Weak consistency? > > I would like reading to be as fast

Re: ec2 tests

2010-05-27 Thread Mark Greene
If you give us an objective of the test that will help. Trying to get max write throughput? Read throughput? Weak consistency? On Thu, May 27, 2010 at 8:48 PM, Chris Dean wrote: > I'm interested in performing some simple performance tests on EC2. I > was thinking of using py_stress and Cassandr

Re: Why are writes faster than reads?

2010-05-25 Thread Mark Greene
I'm fairly certain the write path hits the commit log first, then the memtable. 2010/5/25 Peter Schüller > > I have seen several off-hand mentions that writes are inherently faster > than > > reads. Why is this so? > > I believe the primary factor people are referring to is that writes > are fas
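
A toy sketch of the ordering described above, assuming a hypothetical `ToyNode` class (this is not Cassandra's code, just an illustration): each write is appended to a commit log first and only then applied to the in-memory memtable, so the memtable can be rebuilt by replaying the log after a crash.

```python
class ToyNode:
    """Hypothetical, simplified model of a write path: commit log first, memtable second."""

    def __init__(self):
        self.commit_log = []   # append-only list standing in for the on-disk log
        self.memtable = {}     # in-memory key -> value map

    def write(self, key, value):
        # 1) Record the mutation sequentially in the commit log.
        self.commit_log.append((key, value))
        # 2) Apply it to the memtable.
        self.memtable[key] = value

    def crash(self):
        # Simulate losing everything held in memory.
        self.memtable = {}

    def replay(self):
        # Rebuild the memtable by re-applying the log in order.
        for key, value in self.commit_log:
            self.memtable[key] = value


node = ToyNode()
node.write("k1", "v1")
node.write("k1", "v2")
node.crash()
node.replay()
```

Both steps in `write` avoid random disk reads in this model, which is one commonly cited reason writes are cheap; the real system also flushes memtables to disk, which is not modeled here.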

Re: Cassandra data loss

2010-05-24 Thread Mark Greene
Ryan King actually has a very nice, short and sweet explanation that cuts through the FUD: http://theryanking.com/entries/2010/04/29/potential-consistency/ On Mon, May 24, 2010 at 12:01 PM, Steve Lihn wrote: > So if I set it up to be strongly consistent, I should have the same level > of consis

Re: delete mutation

2010-05-21 Thread Mark Greene
need to provide a ts for every operation so that if there are > multiple clients updating the same column at the same time, cassandra > can decide who "wins." > > On Fri, May 21, 2010 at 6:55 AM, Mark Greene wrote: > > Is there a particular reason why timestamp is requir
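
A minimal sketch of the "who wins" idea in the quoted reply, assuming a hypothetical `Cell` type and `reconcile` function (names invented for illustration; the real tie-breaking rules are not modeled): the version with the higher client-supplied timestamp wins, and a delete is just another timestamped version.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Cell:
    """Hypothetical column version; value None stands for a deletion marker."""
    value: Optional[str]
    timestamp: int


def reconcile(a: Cell, b: Cell) -> Cell:
    # Higher timestamp wins. On a tie this sketch simply keeps `a`;
    # that tie rule is an assumption made here for simplicity.
    return b if b.timestamp > a.timestamp else a


old_write = Cell("alice@example.com", timestamp=100)
delete = Cell(None, timestamp=150)
late_write = Cell("alice@new.example.com", timestamp=120)

# The delete carries the highest timestamp, so it beats both writes
# regardless of the order in which they are compared.
winner = reconcile(reconcile(old_write, delete), late_write)
```

Because the delete only needs a timestamp newer than the data it removes, the client can supply its current clock value; in this model no prior read of the column is needed.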

delete mutation

2010-05-21 Thread Mark Greene
Is there a particular reason why a timestamp is required to do a deletion? If I'm reading the API docs correctly, this would require a read of the column first, correct? I know there is an issue filed to have a better way to delete via range slices but I wanted to make sure this was the only way to

Re: Problems running Cassandra 0.6.1 on large EC2 instances.

2010-05-17 Thread Mark Greene
B > 127605887595351923798765477786913079296| | > 10.250.XX.XX Up 2.46 MB > 170141183460469231731687303715884105728|-->| > > Any thoughts? > > Best, > > Curt > -- > Curt, ZipZapPlay Inc., www.PlayCrafter.com, > http://apps.facebook.com/happyhabi

Re: Problems running Cassandra 0.6.1 on large EC2 instances.

2010-05-17 Thread Mark Greene
Can you provide us with the current JVM args? Also, what type of workload are you giving the ring (ops/s)? On Mon, May 17, 2010 at 6:39 PM, Curt Bererton wrote: > Hello Cassandra users+experts, > > Hopefully someone will be able to point me in the correct direction. We > have cassandra 0.6.1 wor

Re: replication impact on write throughput

2010-05-12 Thread Mark Greene
o many operations per second whether that be a replicated key or not and you could potentially take that number and multiply it by the number of nodes in the ring to give you your write capacity per second. On Tue, May 11, 2010 at 10:13 PM, Paul Prescod wrote: > On Tue, May 11, 2010 at 5:56 PM
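
A back-of-the-envelope sketch of the arithmetic in this thread, with hypothetical function names and made-up numbers (none of these figures come from a benchmark). The first function is the "per-node rate times node count" estimate; the second is a rough adjustment, assumed here, in which every logical write costs one physical write on each of its replicas.

```python
def naive_write_capacity(nodes: int, per_node_wps: int) -> int:
    """Per-node writes/sec multiplied by the number of nodes in the ring."""
    return nodes * per_node_wps


def replicated_write_capacity(nodes: int, per_node_wps: int, replication_factor: int) -> int:
    """Rough logical writes/sec when each write lands on `replication_factor` nodes."""
    return (nodes * per_node_wps) // replication_factor


# Illustrative inputs only: 10 nodes at 50,000 writes/sec each.
naive = naive_write_capacity(10, 50_000)
with_rf3 = replicated_write_capacity(10, 50_000, 3)
# Replication factor equal to the ring size: every node sees every write,
# so the ring is no faster than a single node under this model.
with_rf_all = replicated_write_capacity(10, 50_000, 10)
```

The model ignores consistency level, which only changes how many replicas the client waits for, not how many eventually perform the write.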

Re: replication impact on write throughput

2010-05-11 Thread Mark Greene
the total capacity is 500k wps. But perhaps I've misunderstood some key concepts. Still a novice myself ;-) On Tue, May 11, 2010 at 3:08 PM, Bill de hOra wrote: > Mark Greene wrote: > >> If you have for example, your replication factor equal to the total amount >> of node

Re: replication impact on write throughput

2010-05-11 Thread Mark Greene
If you have, for example, your replication factor equal to the total number of nodes in the ring, I suspect you will hit a brick wall pretty soon. The biggest impact on your write performance will most likely be the consistency level of your writes. In other words, how many nodes you want to wait f

Re: pagination through slices with deleted keys

2010-05-07 Thread Mark Greene
I like your idea about specifying it at the SP level. On Fri, May 7, 2010 at 8:29 AM, Joost Ouwerkerk wrote: > +1. There is some disagreement on whether or not the API should > return empty columns or skip rows when no data is found. In all of > our use cases, we would prefer skipped rows. And

Re: pagination through slices with deleted keys

2010-05-06 Thread Mark Greene
Hey Ian, I actually just wrote a quick example of how to iterate over a CF that may have tombstones. This may help you out: http://markjgreene.wordpress.com/2010/05/05/iterate-over-entire-cassandra-column-family/ On Thu, May 6, 2010 at 12:17 PM, Ian Kallen wrote: > I read the DistributedDeletes

Re: Cassandra training on May 21 in Palo Alto

2010-05-04 Thread Mark Greene
Jonathan, Awesome! Any plans to offer this training again in the future for those of us who can't make it this time around? -Mark On Tue, May 4, 2010 at 5:07 PM, Jonathan Ellis wrote: > I'll be running a day-long Cassandra training class on Friday, May 21. > I'll cover > > - Installation and

Re: value size, is there a suggested limit?

2010-04-25 Thread Mark Greene
http://wiki.apache.org/cassandra/CassandraLimitations On Sun, Apr 25, 2010 at 4:19 PM, S Ahmed wrote: > Is there a suggested sized maximum that you can set the value of a given > key? > > e.g. could I convert a document to bytes and store it as a value to a key? > if yes, which I presume so, wh

Re: getting cassandra setup on windows 7

2010-04-23 Thread Mark Greene
Try the cassandra-with-fixes.bat file attached to the issue. I had the same issue and that bat file got Cassandra to start. It still throws another error complaining about the log4j.properties. On Fri, Apr 23, 2010

Re: CassandraLimitations

2010-04-21 Thread Mark Greene
Hey Bill, Are you asking if there are limits in the context of a single node or a ring of nodes? On Wed, Apr 21, 2010 at 3:58 PM, Bill de hOra wrote: > http://wiki.apache.org/cassandra/CassandraLimitations has good coverage on > the limits around columns. > > Are there are design (or practical)

Re: Just to be clear, cassandra is web framework agnostic b/c of Thrift?

2010-04-21 Thread Mark Greene
I'll try to test this out tonight. On Wed, Apr 21, 2010 at 1:07 PM, Jonathan Ellis wrote: > There is a patch attached to > https://issues.apache.org/jira/browse/CASSANDRA-948 that needs > volunteers to test. > > On Sun, Apr 18, 2010 at 11:13 PM, Mark Greene wrote: > >

Re: At what point does the cluster get faster than the individual nodes?

2010-04-21 Thread Mark Greene
Right, it's a similar concept to DB sharding, where you spread the write load across different DB servers: you won't necessarily increase the throughput of any one DB server, but rather the collective throughput. On Wed, Apr 21, 2010 at 12:16 PM, Mike Gallamore < mike.e.gallam...@googlemail.com> wrote: > Some

Re: Cassandra tuning for running test on a desktop

2010-04-21 Thread Mark Greene
> > > > On Wed, Apr 21, 2010 at 4:45 PM, Mark Greene wrote: > >> Hit send too early >> >> That being said a lot of people running Cassandra in production are using >> 4-6GB max heaps on 8GB machines, don't know if that helps but hopefully >> gives

Re: Cassandra tuning for running test on a desktop

2010-04-21 Thread Mark Greene
Hit send too early. That being said, a lot of people running Cassandra in production are using 4-6GB max heaps on 8GB machines. I don't know if that helps, but hopefully it gives you some perspective. On Wed, Apr 21, 2010 at 10:39 AM, Mark Greene wrote: > RAM doesn't necessari

Re: Cassandra tuning for running test on a desktop

2010-04-21 Thread Mark Greene
*To:* user@cassandra.apache.org >> *Subject:* Re: Cassandra tuning for running test on a desktop >> >> >> >> I have try 1400M, and Cassandra OOM too. >> >> Is there another solution ? My data isn't very big. >> >> It seems that is the merg

Re: Cassandra tuning for running test on a desktop

2010-04-21 Thread Mark Greene
Try increasing Xmx. 1G is probably not enough for the amount of inserts you are doing. On Wed, Apr 21, 2010 at 8:10 AM, Nicolas Labrot wrote: > Hello, > > For my first message I will first thanks Cassandra contributors for their > great works. > > I have a parameter issue with Cassandra (I ho

Re: Just to be clear, cassandra is web framework agnostic b/c of Thrift?

2010-04-18 Thread Mark Greene
With the 0.6.0 release, the windows cassandra.bat file errors out. There's a bug filed for this already. There's a README or something similar in the install directory, that tells you the basic CLI operations and explains the basic data model. On Sun, Apr 18, 2010 at 11:23 PM, S Ahmed wrote: > I

Re: Forced Failover Test for 0.6.0-RC1

2010-04-10 Thread Mark Greene
assandra/Durability > > On Sat, Apr 10, 2010 at 10:38 AM, Mark Greene wrote: > > Hi, > > I'm testing out failover for 0.6.0-RC1 and seeing varied behavior in > > Cassandra's ability to replay the commit log after a forced failure. > > My test is this: > >

Forced Failover Test for 0.6.0-RC1

2010-04-10 Thread Mark Greene
Hi, I'm testing out failover for 0.6.0-RC1 and seeing varied behavior in Cassandra's ability to replay the commit log after a forced failure. My test is this: 1) Run ./cassandra -f 2) Insert a value through the CLI and immediately force a shutdown of cassandra after I see the "Value inserted" co

Re: Write consistency

2010-04-08 Thread Mark Greene
So unless you retry the write, the previous stale write stays on the other two nodes? Would a read repair fix this eventually? On Thu, Apr 8, 2010 at 11:36 AM, Avinash Lakshman < avinash.laksh...@gmail.com> wrote: > What you're describing is a distributed transaction? Generally strong > consistenc

Separate disks with cloud deployment

2010-03-25 Thread Mark Greene
The FAQ page makes mention of using separate disks for the commit log and data directory. How would one go about achieving this in a cloud deployment such as Rackspace cloud servers or EC2 EBS? Or is it just preferred to use dedicated hardware to get the optimal performance? Thanks In Advance! Be