[VOTE] Release 0.6.0-rc1
The 0.6.0 blockers are now out of the way and things are looking good. I propose the following tag/artifacts for 0.6.0-rc1: SVN Tag: https://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.6.0-rc1 0.6.0-rc1 artifacts: http://people.apache.org/~eevans -- Eric Evans eev...@rackspace.com
Cassandra on top of B-Tree
Hi All, I am a master student in UBC CS dept. I along with one of my lab mates are trying to implement the Cassandra on top of a B-Tree implementation rather than of DHT approach that we have right now. We hope to do benchmarking the two approaches and really want to see which one scales better. In the lab we already have a project (which is not yet completed) on developing a Distributed B-Tree on top of a Sinfonia like system. We would be trying to integrate the Cassandra source with the B-tree preserving the rest of the Cassandra logic. Since we are still in its very early stage of this experiment, thought of getting your expert thoughts and comments on this and we were wondering whether this could be a potential GSoc project as well. Regards, Primal
Re: [VOTE] Release 0.6.0-rc1
+1 On Sun, Mar 28, 2010 at 11:02 AM, Eric Evans wrote: > > The 0.6.0 blockers are now out of the way and things are looking good. I > propose the following tag/artifacts for 0.6.0-rc1: > > SVN Tag: > https://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.6.0-rc1 > 0.6.0-rc1 artifacts: http://people.apache.org/~eevans > > -- > Eric Evans > eev...@rackspace.com > >
Re: Cassandra on top of B-Tree
On 2010-03-28 21:11, Primal Wijesekera wrote: > I am a master student in UBC CS dept. I along with one of my lab mates are > trying to implement the Cassandra on top of a B-Tree implementation rather > than of DHT approach that we have right now. We hope to do benchmarking the > two approaches and really want to see which one scales better. > > In the lab we already have a project (which is not yet completed) on > developing a Distributed B-Tree on top of a Sinfonia like system. We would be > trying to integrate the Cassandra source with the B-tree preserving the rest > of the Cassandra logic. > > Since we are still in its very early stage of this experiment, thought of > getting your expert thoughts and comments on this and we were wondering > whether this could be a potential GSoc project as well. I'm sorry, but it doesn't make much sense to run Cassandra on top of a B-tree. Reorganizing indexes when writing goes against one of Cassandra's primary design goals: streaming writes to disk as efficiently as possible. http://wiki.apache.org/cassandra/FAQ#reads_slower_writes Additionally, there are *so many* other systems that do use B-tree already. Why add it to Cassandra? You may want to look at Project Voldemort, which can already distribute data across servers similarly to Cassandra but (optionally) with B-tree-based storage on each box. MongoDB also supports sharded data with B-tree-based indexes. Finally, HBase is a distributed B-tree. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature
Re: [VOTE] Release 0.6.0-rc1
+1 -Chris On Mar 28, 2010, at 9:02 AM, Eric Evans wrote: > > The 0.6.0 blockers are now out of the way and things are looking good. I > propose the following tag/artifacts for 0.6.0-rc1: > > SVN Tag: > https://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.6.0-rc1 > 0.6.0-rc1 artifacts: http://people.apache.org/~eevans > > -- > Eric Evans > eev...@rackspace.com >
Re: Cassandra on top of B-Tree
It sounds like an interesting project but I don't think it would be a good fit for GSoc because it's just too different to be a good candidate to actually incorporate into Cassandra mainline. (The consistency and availability models are completely different, assuming your system ends up looking something like the one in the Aguilera paper.) On Sun, Mar 28, 2010 at 4:11 PM, Primal Wijesekera wrote: > Hi All, > > I am a master student in UBC CS dept. I along with one of my lab mates are > trying to implement the Cassandra on top of a B-Tree implementation rather > than of DHT approach that we have right now. We hope to do benchmarking the > two approaches and really want to see which one scales better. > > In the lab we already have a project (which is not yet completed) on > developing a Distributed B-Tree on top of a Sinfonia like system. We would be > trying to integrate the Cassandra source with the B-tree preserving the rest > of the Cassandra logic. > > Since we are still in its very early stage of this experiment, thought of > getting your expert thoughts and comments on this and we were wondering > whether this could be a potential GSoc project as well. > Regards, > Primal > > >
Re: Cassandra on top of B-Tree
Here is why I think it is not a good fit for Cassandra (at least top 3 reasons that come to mind): (1) Cassandra strives to make updates very very cheap. With BTree's every update is a read modify write. (2) BTree on rebalance tend to result in a lot of fragmentation leading to poorer performance over time. (3) Every write (update) is a random write while in Cassandra as it is today it will be a sequential one because of a log structured nature. What is your motivation for BTree backed storage? Perhaps we could try to see if it can be provided in the exitsting model itself. Cheers Avinash On Sun, Mar 28, 2010 at 4:48 PM, Jonathan Ellis wrote: > It sounds like an interesting project but I don't think it would be a > good fit for GSoc because it's just too different to be a good > candidate to actually incorporate into Cassandra mainline. (The > consistency and availability models are completely different, assuming > your system ends up looking something like the one in the Aguilera > paper.) > > On Sun, Mar 28, 2010 at 4:11 PM, Primal Wijesekera > wrote: > > Hi All, > > > > I am a master student in UBC CS dept. I along with one of my lab mates > are trying to implement the Cassandra on top of a B-Tree implementation > rather than of DHT approach that we have right now. We hope to do > benchmarking the two approaches and really want to see which one scales > better. > > > > In the lab we already have a project (which is not yet completed) on > developing a Distributed B-Tree on top of a Sinfonia like system. We would > be trying to integrate the Cassandra source with the B-tree preserving the > rest of the Cassandra logic. > > > > Since we are still in its very early stage of this experiment, thought of > getting your expert thoughts and comments on this and we were wondering > whether this could be a potential GSoc project as well. > > Regards, > > Primal > > > > > > >
Re: [VOTE] Release 0.6.0-rc1
+1 Regards, On Sun, Mar 28, 2010 at 2:49 PM, Chris Goffinet wrote: > +1 > > -Chris > > On Mar 28, 2010, at 9:02 AM, Eric Evans wrote: > > > > > The 0.6.0 blockers are now out of the way and things are looking good. I > > propose the following tag/artifacts for 0.6.0-rc1: > > > > SVN Tag: > > https://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.6.0-rc1 > > 0.6.0-rc1 artifacts: http://people.apache.org/~eevans > > > > -- > > Eric Evans > > eev...@rackspace.com > > > >