[VOTE] Release 0.6.0-rc1

2010-03-28 Thread Eric Evans

The 0.6.0 blockers are now out of the way and things are looking good. I
propose the following tag/artifacts for 0.6.0-rc1:

SVN Tag:
https://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.6.0-rc1
0.6.0-rc1 artifacts: http://people.apache.org/~eevans

-- 
Eric Evans
eev...@rackspace.com



Cassandra on top of B-Tree

2010-03-28 Thread Primal Wijesekera
Hi All,

I am a master's student in the UBC CS department. One of my lab mates and I are
trying to implement Cassandra on top of a B-tree rather than the DHT approach
we have right now. We hope to benchmark the two approaches and see which one
scales better.

In the lab we already have a project (not yet completed) to develop a
distributed B-tree on top of a Sinfonia-like system. We would try to integrate
the Cassandra source with the B-tree while preserving the rest of the
Cassandra logic.

Since we are still at a very early stage of this experiment, we thought of
getting your expert thoughts and comments on it. We were also wondering whether
this could be a potential GSoC project.

Regards,
Primal


  

Re: [VOTE] Release 0.6.0-rc1

2010-03-28 Thread Jonathan Ellis
+1



Re: Cassandra on top of B-Tree

2010-03-28 Thread David Strauss
On 2010-03-28 21:11, Primal Wijesekera wrote:
> I am a master's student in the UBC CS department. One of my lab mates and I
> are trying to implement Cassandra on top of a B-tree rather than the DHT
> approach we have right now. We hope to benchmark the two approaches and see
> which one scales better.
> 
> In the lab we already have a project (not yet completed) to develop a
> distributed B-tree on top of a Sinfonia-like system. We would try to
> integrate the Cassandra source with the B-tree while preserving the rest of
> the Cassandra logic.
> 
> Since we are still at a very early stage of this experiment, we thought of
> getting your expert thoughts and comments on it. We were also wondering
> whether this could be a potential GSoC project.

I'm sorry, but it doesn't make much sense to run Cassandra on top of a
B-tree. Reorganizing indexes when writing goes against one of
Cassandra's primary design goals: streaming writes to disk as
efficiently as possible.

http://wiki.apache.org/cassandra/FAQ#reads_slower_writes

Additionally, there are *so many* other systems that already use B-trees.
Why add one to Cassandra?

You may want to look at Project Voldemort, which can already distribute
data across servers similarly to Cassandra but (optionally) with
B-tree-based storage on each box. MongoDB also supports sharded data
with B-tree-based indexes. Finally, HBase is a distributed B-tree.

-- 
David Strauss
   | da...@fourkitchens.com
Four Kitchens
   | http://fourkitchens.com
   | +1 512 454 6659 [office]
   | +1 512 870 8453 [direct]





Re: [VOTE] Release 0.6.0-rc1

2010-03-28 Thread Chris Goffinet
+1

-Chris




Re: Cassandra on top of B-Tree

2010-03-28 Thread Jonathan Ellis
It sounds like an interesting project but I don't think it would be a
good fit for GSoC because it's just too different to be a good
candidate to actually incorporate into Cassandra mainline.  (The
consistency and availability models are completely different, assuming
your system ends up looking something like the one in the Aguilera
paper.)



Re: Cassandra on top of B-Tree

2010-03-28 Thread Avinash Lakshman
Here is why I think it is not a good fit for Cassandra (at least the top 3
reasons that come to mind):

(1) Cassandra strives to make updates very cheap. With B-trees, every
update is a read-modify-write.
(2) B-trees tend to fragment a lot on rebalancing, leading to
poorer performance over time.
(3) Every write (update) is a random write, whereas in Cassandra as it is
today it is a sequential one because of its log-structured nature.
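
As a rough illustration of points (1) and (3), here is a toy Python sketch. It
is not Cassandra code: the class names, the in-memory lists standing in for
disk, and the page_reads/page_writes counters are all invented for this
example. It only contrasts an append-only write path with an update-in-place
write path on a sorted page, which is roughly what a B-tree leaf update
involves.

```python
import bisect


class LogStructuredStore:
    """Toy append-only store: a write never reads existing data."""

    def __init__(self):
        self.log = []  # stands in for a sequential on-disk log

    def put(self, key, value):
        # Always append at the tail, so the write is sequential.
        self.log.append((key, value))

    def get(self, key):
        # The newest entry wins, so scan from the tail backwards.
        for k, v in reversed(self.log):
            if k == key:
                return v
        return None


class SortedPageStore:
    """Toy stand-in for a B-tree leaf: one sorted page updated in place."""

    def __init__(self):
        self.keys = []
        self.values = []
        self.page_reads = 0   # hypothetical counters, for illustration only
        self.page_writes = 0

    def put(self, key, value):
        # Read-modify-write: locate the slot in the page, then rewrite it.
        self.page_reads += 1
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            self.values[i] = value
        else:
            self.keys.insert(i, key)
            self.values.insert(i, value)
        self.page_writes += 1

    def get(self, key):
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            return self.values[i]
        return None
```

In this sketch every put on SortedPageStore costs one page read plus one page
write at a position that depends on the key, while LogStructuredStore only
appends. The price is paid on the read side, where the log has to be searched
for the newest entry.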

What is your motivation for B-tree-backed storage? Perhaps we could try to
see if it can be provided in the existing model itself.

Cheers
Avinash



Re: [VOTE] Release 0.6.0-rc1

2010-03-28 Thread Vijay
+1

Regards,



