Re: total disk space used on a node for a CF is too large than expected

2010-07-10 Thread Schubert Zhang
Disk space includes: 1. Live SSTable files (Data, Index, Filter) 2. Garbage (compacted) SSTable files. For each column, except for the value bytes, there are anditional bytes include (2+columnname+1+8) On Sat, Jul 10, 2010 at 2:57 AM, Jonathan Ellis wrote: > you should read the "cassandra disk

Re: Question about hinted handoff

2010-07-10 Thread Schubert Zhang
The answer of Benjamin is very right. On Sun, Jul 11, 2010 at 6:27 AM, Benjamin Black wrote: > You constructed a pathological case and then got confused at the result. > > Consider instead a realistic case: RF=3, CL=QUORUM. Writes should go > to all of A, B, and C. B is down when the write req

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Schubert Zhang
t is ardently discussing @http://news.ycombinator.com/item?id=1502756 Here are my comments: 1. Cassandra is very young! Especially, the design and implementation of local storage and local indexing are junior and not good. 2. Pool read-performance is also due to the poor local storage implementatio

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Colin Clark
Benjamin, Please see below - it sounds like you're taking this a little personally and I'm not sure why. You've made some errors in your reply. Colin +1 315 886 3422 cella +1 701 212 4314 office http://blog.cloudeventprocessing.com http://twitter.com/EventCloudPro

Re: Question about hinted handoff

2010-07-10 Thread Benjamin Black
You constructed a pathological case and then got confused at the result. Consider instead a realistic case: RF=3, CL=QUORUM. Writes should go to all of A, B, and C. B is down when the write request arrives, so does not acknowledge the it. A and C acknowledge the write. Since quorum is achieved

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Benjamin Black
On Sat, Jul 10, 2010 at 12:22 PM, Colin Clark wrote: > > Although I'm a fan of Cassandra, there's no way I'd use it today for my tier > 1 deployments, because I don't have the resources of Facebook, and even > though Cassandra is open source, that doesn't mean I can fix it when it goes > down.  An

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Jason Dixon
On Sun, Jul 11, 2010 at 01:06:31AM +0530, Sumit Datta wrote: > Hello, > I have been a silent spectator in this list for a long while, and > while I like reading much mail traffic, this one I thought I should > reply to. > You know what I see in all this? More "Twitter" and "Facebook" than > "Cassan

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Brett Thomas
Idunno, I think understanding those companies' decisions is extremely relevant for anybody working with cassandra. I really like this thread and hope it keeps going. On Jul 10, 2010 3:38 PM, "Sumit Datta" wrote: Hello, I have been a silent spectator in this list for a long while, and while I lik

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Sumit Datta
Hello, I have been a silent spectator in this list for a long while, and while I like reading much mail traffic, this one I thought I should reply to. You know what I see in all this? More "Twitter" and "Facebook" than "Cassandra". Are we here to discuss them or the software? What I do not see are

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Colin Clark
I'm not aware of anyone classifying what twitter is doing today as 'working.' In fact, I believe that twitter's problems are much larger than just technology but that's a whole different subject. What twitter may have realized is that they don't have the resources of Facebook, that Facebook's

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Ryan King
On Sat, Jul 10, 2010 at 10:33 AM, Marty Greenia wrote: > It almost seems counter-intuitive. For analytics, you'd think they'd want a > database that supports more sophisticated query functionality (sql). Whereas > for everyday tweet storage, something fast and high-throughput (cassandra) > makes s

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Dan Di Spaltro
This sounds more like high-throughput external analytics, aka they will know all the queries consumers will use. This isn't for internal analytics. On Sat, Jul 10, 2010 at 10:33 AM, Marty Greenia wrote: > It almost seems counter-intuitive. For analytics, you'd think they'd want a > database that

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread Marty Greenia
It almost seems counter-intuitive. For analytics, you'd think they'd want a database that supports more sophisticated query functionality (sql). Whereas for everyday tweet storage, something fast and high-throughput (cassandra) makes sense. I'd be curious to here the details as well. On Sat, Jul

Re: Implementing queues in Cassandra

2010-07-10 Thread Edward Ribeiro
Qsandra project aims to use Cassandra as a back-end to ActiveMQ ( http://github.com/ticktock/qsandra ). On Sat, Jul 10, 2010 at 1:34 PM, Dodong Juan wrote: > Hi, >> > > Has anyone tried implementing queues in Cassandra? > >> >>

Re: TechCrunch article on Twitter and Cassandra

2010-07-10 Thread S Ahmed
Nice link. >From what I understood, they are not using it to store tweets but rather will use mysql? I wish they went into more detail as to why... On Sat, Jul 10, 2010 at 1:25 AM, Kochheiser,Todd W - TOK-DITT-1 < twkochhei...@bpa.gov> wrote: > A good read. > > http://techcrunch.com/2010/07/09/

Re: Question about hinted handoff

2010-07-10 Thread Schubert Zhang
3. B node is down during write operation, so return failure message to client, and write a hint to C node. Will write to the coordinator node. On Thu, Jul 8, 2010 at 10:04 PM, ChingShen wrote: > If so, when does hinted handoff work? > > > On Thu, Jul 8, 2010 at 9:55 PM, Anty wrote: > >> >> >

Implementing queues in Cassandra

2010-07-10 Thread Dodong Juan
Hi, Has anyone tried implementing queues in Cassandra?