Interesting, I came up with the same question just this morning and found:

https://issues.apache.org/jira/browse/CASSANDRA-342
http://mail-archives.apache.org/mod_mbox/incubator-cassandra-dev/200907.mbox/<f5f3a6290907240123y22f065edp1649f7c5c1add...@mail.gmail.com>

These give you some perspective on the thinking behind the implementation.

On Mon, May 17, 2010 at 12:41 PM, Yan Virin <jan.vi...@gmail.com> wrote:
> Hi,
> Can someone explain how this works? As long as I know, there is no execution
> engine in Cassandra alone, so I assume that Hadoop gives the MapReduce
> execution engine which uses Cassandra as the distributed storage? Is data
> locality preserved? How mature this "couple" is? How is the performance of
> this compared to the original Hadoop over HDFS?
>
> Thanks,
>
>
> --
> Jan Virin
> http://www.linkedin.com/in/yanvirin
>