rew.dah...@bronto.com]
> Sent: Monday, June 28, 2010 7:17 AM
> To: user@cassandra.apache.org
> Subject: Re: Map Reduce support
>
> The difference is noticeable but small. I did a test just reading data
> in from Cassandra on our cluster & dumping it to a csv file. Pure map
Drew,
I was wondering if you care to share your map-reduce code
Thanks
Carlos
From: Drew Dahlke [drew.dah...@bronto.com]
Sent: Monday, June 28, 2010 7:17 AM
To: user@cassandra.apache.org
Subject: Re: Map Reduce support
The difference is noticeable but
The difference is noticeable but small. I did a test just reading data
in from Cassandra on our cluster & dumping it to a csv file. Pure map
reduce was going at ~17k records/sec versus ~15k from Pig. There is
overhead to using Pig, but it'll reduce your development time & make
for more readable cod
Thanks for the information Drew and Jonathan.
Is there any difference in performance while using Pig compared to MapReduce
directly on data store ?
I will do the experiments with both of them though in some time.
On Fri, Jun 25, 2010 at 5:46 PM, Drew Dahlke wrote:
> The cassandra column family i
The cassandra column family input format will go over a an entire
column family sending a slice of a row into a mapper at a time. From
there there's a lot you can do. As far as how you aggregate data
together, I'd suggest experimenting with the latest version of Pig
which thankfully supports the ne
does http://wiki.apache.org/cassandra/HadoopSupport help?
On Thu, Jun 24, 2010 at 11:01 AM, Atul Gosain wrote:
> Hi
> What kind of Map Reduce support is provided for Cassandra ?
> Can i get some columns from different rows and then aggregate them up
> together. Its basically aggregation of stat