Re: Best PHP client

2010-04-02 Thread Michael Pearson
> As said by Jonathan, SimpleCassie and Pandra are both based on Thrift. I > think a Cassandra upgrade will require an upgrade of the Thrift classes of high > level clients too. Am I right? This is right, but the impact on these libraries isn't significant unless a Thrift API dependency is quickly depre

Re: get_range_slice leads to java.lang.OutOfMemoryError?

2010-04-02 Thread Gautam Singaraju
Benoit, Thanks, that helped. I should have checked that one out before. --- Gautam On Fri, Apr 2, 2010 at 10:16 AM, Benoit Perroud wrote: > A way to read all the db without having an OOM is to limit the amount > of rows to be returned, and to iterate over the query, the starting > key being the

Bug regarding removing and retrieving entire supercolumn.

2010-04-02 Thread Arash Bazrafshan
Hullo. I think I've found a bug in Cassandra. Do you also think it's a bug? It should be noted that I experience this bug when using Cassandra through Thrift's PHP API (the low-level one generated by Thrift, not one of the high-level clients from the Cassandra wiki). It happens when I insert a supercolumn (with a si

Re: Deployment on AWS

2010-04-02 Thread Benjamin Black
http://github.com/b/cookbooks/tree/master/cassandra/ On Fri, Apr 2, 2010 at 4:49 PM, Masood Mortazavi wrote: > > Is there a ready recipe for deploying a Cassandra cluster in AWS? ... (Seeds > need some "fixed" IP addresses.) > > Regards, > - m. > >

Deployment on AWS

2010-04-02 Thread Masood Mortazavi
Is there a ready recipe for deploying a Cassandra cluster in AWS? ... (Seeds need some "fixed" IP addresses.) Regards, - m.

Heap sudden jump during import

2010-04-02 Thread Weijun Li
I'm running a test to write 30 million columns (700 bytes each) to Cassandra: the process ran smoothly for about 20 million, then the heap usage suddenly jumped from 2GB to 3GB, which is the upper limit of the JVM. From this point Cassandra freezes for a long time (terrible latency, no response to nodetool th

Re: Stalled Bootstrapping Process

2010-04-02 Thread Chris Goffinet
+1 On Fri, Apr 2, 2010 at 3:49 PM, Jonathan Ellis wrote: > Ah, right. That's confusing for everyone. I think the best solution > there is to just get > http://issues.apache.org/jira/browse/CASSANDRA-579 done so it can > start streaming immediately. > > On Fri, Apr 2, 2010 at 5:45 PM, Dan Di Sp

Re: Stalled Bootstrapping Process

2010-04-02 Thread Dan Di Spaltro
I agree. That would have other good side-effects, like minimizing shooting yourself in the foot, for new folks. On Fri, Apr 2, 2010 at 3:49 PM, Jonathan Ellis wrote: > Ah, right.  That's confusing for everyone.  I think the best solution > there is to just get > http://issues.apache.org/jira/bro

Re: Stalled Bootstrapping Process

2010-04-02 Thread Jonathan Ellis
Ah, right. That's confusing for everyone. I think the best solution there is to just get http://issues.apache.org/jira/browse/CASSANDRA-579 done so it can start streaming immediately. On Fri, Apr 2, 2010 at 5:45 PM, Dan Di Spaltro wrote: > It did once it was actually done anti-compacting.  The

Re: Stalled Bootstrapping Process

2010-04-02 Thread Dan Di Spaltro
It did once it was actually done anti-compacting. The biggest question-mark (for us) was, what was happening during the anti-compaction phase. On Fri, Apr 2, 2010 at 3:39 PM, Jonathan Ellis wrote: > Great, glad it worked. > > Sounds like we do have a bug though if the destination node never > sh

Re: Stalled Bootstrapping Process

2010-04-02 Thread Jonathan Ellis
Great, glad it worked. Sounds like we do have a bug though if the destination node never showed anything in Streaming mbean. :( On Fri, Apr 2, 2010 at 5:11 PM, Dan Di Spaltro wrote: > To close the loop on this, the node finished bootstrapping.  The > source node rebooting definitely halted the p

Re: Stalled Bootstrapping Process

2010-04-02 Thread Dan Di Spaltro
To close the loop on this, the node finished bootstrapping. The source node rebooting definitely halted the process. Visibility-wise, watching the anti-compactions is the best way to tell how much progress is being made on the bootstrapping process. The CompactionManager mbean gives you insight

Re: Best PHP client

2010-04-02 Thread David Timothy Strauss
I've written one here as a Drupal module, but it isn't very Drupal-specific: http://bazaar.launchpad.net/~davidstrauss/pressflow/cassandra-votingapi/files/head:/sites/all/modules/cassandra/ I think my interface is a little prettier than SimpleCassie. You can see it in action in the test suite

Re: Best PHP client

2010-04-02 Thread yaw
Peter, I agree with you, but sometimes the choice can be very easy, if a tool offers real advantages... For several weeks, I thought the choice was: low level (Thrift) or high level (Pandra), as described on the Cassandra wiki. Today, I discovered that SimpleCassie exists... (I especi

RE: Slow Responses from 2 of 3 nodes in RC1

2010-04-02 Thread Mark Jones
To further complicate matters: when I read only from cassdb1, I can check about 100/second/thread (40 threads); when I read only from cassdb2, I can check about 50/second/thread (40 threads); when I read only from cassdb3, I can check about 50/second/thread (40 threads). This is with a consist

RE: best practice for migrating data

2010-04-02 Thread Mark Jones
I got the idea for this from: http://wiki.apache.org/cassandra/StorageConfiguration I put my keyspace setup on a webserver, and I pull it into the config like this: storage-conf.xml starts with: http://cassandraconfig /seeds.xml"> http://cassandraconfig /autobootstrap.xml"> http://cassandracon

Slow Responses from 2 of 3 nodes in RC1

2010-04-02 Thread Mark Jones
I have a 3-node Cassandra cluster I'm trying to work with. All three machines are about the same: 6-8GB per machine (fastest machine has 8GB, Java VM limited to 5GB), separate spindle for Cassandra data and commit log. I wrote ~7 million items to Cassandra; now I'm trying to read them back, the o

Re: Best PHP client

2010-04-02 Thread Peter Chang
Questions like "What is the best ___?" that have subjective answers are rarely simple. I've tried various clients and I'd just recommend trying a few out and getting involved in the projects. Some may have advantages in ease of use while others may be more flexible or have features such as connec

Re: Read Performance

2010-04-02 Thread James Golick
Yes. On Fri, Apr 2, 2010 at 10:35 AM, Ryan King wrote: > On Thu, Apr 1, 2010 at 8:37 PM, James Golick > wrote: > > Well, folks, I'm feeling a little stupid right now (adding to the injury > > inflicted by one Mr. Stump :-P). > > So, here's the story. The cache hit rate is up around 97% now. The

Re: Read Performance

2010-04-02 Thread Ryan King
On Thu, Apr 1, 2010 at 8:37 PM, James Golick wrote: > Well, folks, I'm feeling a little stupid right now (adding to the injury > inflicted by one Mr. Stump :-P). > So, here's the story. The cache hit rate is up around 97% now. The ruby code > is down to around 20-25ms to multiget the 20 rows. I di

Re: Best PHP client

2010-04-02 Thread Jonathan Ellis
You need Thrift for any of the others. On Fri, Apr 2, 2010 at 11:48 AM, yaw wrote: > Hi all, > just a (simple ?) question for today : > > What is the best client for Cassandra ? > > - Thrift (need to build compiler first) > - Pandra (seems not updated) > - SimpleCassie  (just discovered it few mi

Best PHP client

2010-04-02 Thread yaw
Hi all, just a (simple?) question for today: What is the best client for Cassandra? - Thrift (need to build the compiler first) - Pandra (seems not to be updated) - SimpleCassie (just discovered it a few minutes ago) Is there any other PHP client? If Cassandra is upgraded (0.5 to 0.6), I suppose they

cms content and numerous sort operations

2010-04-02 Thread S Ahmed
Greetings! Content management systems usually have complex sort operations. How would this be best handled with Cassandra? Is the only way to handle this type of situation to build indexes for each and every sort? Example model: Content: { contentID: { title: "this is a title",

Re: best practice for migrating data

2010-04-02 Thread Gary Dusbabek
Correct. One node at a time, in rolling fashion. As long as the config files are not drastically different (you isolate your changes to the Keyspaces), you will not have a problem. Gary. On Fri, Apr 2, 2010 at 09:54, yaw wrote: > Hi all, > this tells us to stop and start Cassandra ... > I hope

Re: Architecture question

2010-04-02 Thread Jonathan Ellis
On Fri, Apr 2, 2010 at 10:50 AM, Brian Hawkins wrote: > What pieces of data make up the token that determines on what node the data > is placed? The row key. > Specifically are all the columns placed on the same node?  What about super > columns, are they all placed on the same node? Yes and ye
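The answer above (placement is determined by the row key alone, so all columns and super columns of a row land on the same node) can be sketched as follows. This is only an illustration under stated assumptions: the MD5-based token is modeled on a hash partitioner, and the ring layout and node names are hypothetical, not taken from the thread.

```python
import hashlib
from bisect import bisect_left

def token_for(row_key: str) -> int:
    # Assumption: the token is the MD5 digest of the row key read as an integer.
    return int.from_bytes(hashlib.md5(row_key.encode("utf-8")).digest(), "big")

def node_for(row_key: str, ring: list) -> str:
    # ring is a list of (token, node_name) pairs sorted by token.
    # The row belongs to the first node whose token is >= the key's token,
    # wrapping around to the first node at the end of the ring.
    tokens = [token for token, _ in ring]
    index = bisect_left(tokens, token_for(row_key))
    return ring[index % len(ring)][1]

# Hypothetical three-node ring covering the 128-bit MD5 range.
RING = [(2**126, "node-a"), (2**127, "node-b"), (2**128 - 1, "node-c")]

# Column and super column names never enter the calculation, so every
# column stored under the row key "lee" resolves to the same node.
owner = node_for("lee", RING)
```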

Re: best practice for migrating data

2010-04-02 Thread yaw
Hi all, this tells us to stop and start Cassandra... I hope we are not asked to stop the whole cluster... Can we proceed node after node? (But how could the cluster handle different config.xml files?) 2010/4/2 Gary Dusbabek > AJ, > > The wiki gives an outline of how to do this: > http://wiki

Architecture question

2010-04-02 Thread Brian Hawkins
What pieces of data make up the token that determines on what node the data is placed? Specifically are all the columns placed on the same node? What about super columns, are they all placed on the same node? Thanks Brian

Re: Cassandra Hackathon in SF @ Digg - 04/22 6:30pm

2010-04-02 Thread Eric Evans
On Thu, 2010-03-25 at 15:13 -0700, Chris Goffinet wrote: > As promised, here is the official invite to register for the hackathon > in SF. The event starts at 6:30pm on April 22nd. > > > http://cassandrahackathon.eventbrite.com/ It looks like there is also a workshop on Big Data at the Computer

Re: Strange results

2010-04-02 Thread Sokolov Evgeniy
No, we use SimpleCassie (http://code.google.com/p/simpletools-php/wiki/SimpleCassie) as the client API. We just run 8 scripts from 8 consoles at the same time. They connect to the server via host:port 2010/4/2 Jonathan Ellis > Are you using a thrift connection object from more than one thread? > > On Fri,

Re: Strange results

2010-04-02 Thread Jonathan Ellis
Are you using a thrift connection object from more than one thread? On Fri, Apr 2, 2010 at 9:49 AM, Sokolov Evgeniy wrote: > Hello! Can you help me? We test cassandra and got strange results. We > started cassandra on one server with 2GB RAM, limit memory for cassandra to > 512Mb and set 1 millio

Re: Proxy instances?

2010-04-02 Thread Josh
Is the notion here that you'd run all writes/reads through that node and let it decide where to get the data from? I've been working on a C# client library and I've been picking a node at random from the cluster and letting it figure things out. Would a setup like this be better? Keep all the tr

Strange results

2010-04-02 Thread Sokolov Evgeniy
Hello! Can you help me? We are testing Cassandra and got strange results. We started Cassandra on one server with 2GB RAM, limited memory for Cassandra to 512MB, and set 1 million simple records (160 bytes each). First, when we stored data through 8 scripts from another server, we got several exceptions lin

Re: get_range_slice leads to java.lang.OutOfMemoryError?

2010-04-02 Thread Benoit Perroud
A way to read the whole db without hitting an OOM is to limit the number of rows returned and to iterate over the query, with the starting key being the last returned key. Note that when doing it that way, the first key of the next iteration is the same as the last key of the previous iteration. The warning
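The paging approach described above can be sketched like this. It is a minimal illustration, not the Thrift API: `fetch_page(start_key, count)` is a hypothetical stand-in for a range query that returns up to `count` keys in order, starting at `start_key` inclusive, and `make_fake_store` is an in-memory substitute used only to make the sketch runnable.

```python
from typing import Callable, Iterator, List

def iterate_all_keys(fetch_page: Callable[[str, int], List[str]],
                     page_size: int = 100) -> Iterator[str]:
    # Each page after the first repeats one key, so a page size of 1
    # would never make progress.
    if page_size < 2:
        raise ValueError("page_size must be at least 2")
    start = ""   # assumption: an empty start key means "from the beginning"
    last = None  # last key yielded by the previous page
    while True:
        page = fetch_page(start, page_size)
        # The range is inclusive, so drop the key repeated from the last page.
        fresh = page[1:] if last is not None and page and page[0] == last else page
        yield from fresh
        if len(page) < page_size:
            return  # a short page means the range is exhausted
        last = start = page[-1]

def make_fake_store(keys: List[str]) -> Callable[[str, int], List[str]]:
    # In-memory stand-in for the server: keys are kept in sorted order.
    ordered = sorted(keys)
    def fetch_page(start_key: str, count: int) -> List[str]:
        return [k for k in ordered if k >= start_key][:count]
    return fetch_page

all_keys = list(iterate_all_keys(make_fake_store(list("gfedcba")), page_size=3))
```

Only one page is held in memory at a time, which is the point of the technique; the sketch assumes the server returns keys in a stable order between calls.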

get_range_slice leads to java.lang.OutOfMemoryError?

2010-04-02 Thread Gautam Singaraju
I call the get_range_slice method in Java to get the list of all keys in Cassandra db. The db is pretty small, about 1.3GB on disk. I received the following error on the server: "java.lang.OutOfMemoryError: Requested array size exceeds VM limit" I changed the JVM size from 1 GB to 2 GB in cassand

Re: how to store list ?

2010-04-02 Thread gabriele renzi
On Fri, Apr 2, 2010 at 12:46 PM, Shuge Lee wrote: > For example: > user['lee'] = { >     'name': 'lee', >     'age': '21', >     'girls': ['java', 'actionscript', 'python'], > } > how to store above in Apache Cassandra ? check what a SuperColumn is in the wiki -- blog en: http://www.riffraff.inf
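One way to picture the SuperColumn suggestion is as a nested map: row key, then super column name, then subcolumn name and value. The layout below is an assumption made for illustration (the reply only points to the wiki); the super column name "profile" and the convention of storing list items as subcolumn names with empty values are hypothetical choices.

```python
# Row "lee" in a hypothetical super column family, modeled as nested dicts:
# {super_column_name: {subcolumn_name: value}}
user_row = {
    "profile": {"name": "lee", "age": "21"},
    # The list becomes one super column; each item is a subcolumn name
    # with an empty value, since only the names carry information here.
    "girls": {"java": "", "actionscript": "", "python": ""},
}

def list_from_supercolumn(row: dict, super_column: str) -> list:
    # Subcolumns come back ordered by name, so the original insertion
    # order of the list is not preserved.
    return sorted(row[super_column])

girls = list_from_supercolumn(user_row, "girls")
```

If the order of the list matters, the position would have to be encoded in the subcolumn name, which this sketch does not attempt.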

Re: best practice for migrating data

2010-04-02 Thread Gary Dusbabek
AJ, The wiki gives an outline of how to do this: http://wiki.apache.org/cassandra/FAQ#modify_cf_config Gary On Thu, Apr 1, 2010 at 23:18, AJ Chen wrote: > when adding/changing a column to a column family for existing data in > cassandra, what's a good way to do it? > thanks, > -aj-- > AJ Chen,

how to store list ?

2010-04-02 Thread Shuge Lee
For example: user['lee'] = { 'name': 'lee', 'age': '21', 'girls': ['java', 'actionscript', 'python'], } how to store above in Apache Cassandra ? -- Shuge Lee | Lee Li | 李蠡