Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
Yeah I realized that shortly... :) I'm still not able to point the word_count to a live cluster. If i have a single node cluster and the thrift address is the IP of that box, and it has seed value as the IP of itself as well. How do i run the word_count remotely then? Sorry I must be missing so

Re: Iterate through entire data set

2010-04-08 Thread Jonathan Ellis
those aren't shipped with Cassandra. On Thu, Apr 8, 2010 at 3:00 PM, Sonny Heer wrote: > Missing the commons logging and commons httpclient jars.  Must be > using the the wrong jdk? > > On Thu, Apr 8, 2010 at 12:38 PM, Sonny Heer wrote: >> Is there other documentation on how to setup all the pie

Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
Missing the commons logging and commons httpclient jars. Must be using the the wrong jdk? On Thu, Apr 8, 2010 at 12:38 PM, Sonny Heer wrote: > Is there other documentation on how to setup all the pieces? > > Currently I'm simply trying to test the example word_count, but will > likely need to wr

Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
Is there other documentation on how to setup all the pieces? Currently I'm simply trying to test the example word_count, but will likely need to write other map/reduce programs over the cassandra data set. For this test I have one box (ubuntu) where i have moved cass .6 rc1 binary , and started

Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
Okay I moved everything to the ubuntu box: ~/dev/cassandra-0.6.0-rc1/contrib/word_count$ bin/word_count_setup 10/04/08 11:15:10 INFO config.DatabaseDescriptor: Auto DiskAccessMode determined to be standard 10/04/08 11:15:10 WARN config.DatabaseDescriptor: KeysCachedFraction is deprecated: use Keys

Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:77) >>>>        at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:70) >>>>        at >>>> org.apache.cassandra.net.MessagingService.listen(MessagingService.java:138) >>>>        at &g

Re: Iterate through entire data set

2010-04-08 Thread Benjamin Black
   at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:77) >>>        at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:70) >>>        at >>> org.apache.cassandra.net.MessagingService.listen(MessagingService.java:138) >>>        at &

Re: Iterate through entire data set

2010-04-08 Thread Benjamin Black
) >>>        at >>> org.apache.cassandra.net.MessagingService.listen(MessagingService.java:138) >>>        at >>> org.apache.cassandra.service.StorageService.initClient(StorageService.java:289) >>>        at WordCountSetup.main(Unknown Source) >>&g

Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
>        at >> org.apache.cassandra.service.StorageService.initClient(StorageService.java:289) >>        at WordCountSetup.main(Unknown Source) >> >> >> Sorry, I'm a bit new to this... help? >> >> >> On Wed, Apr 7, 2010 at 6:07 PM, Stu Hood wrote: >&g

Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
 at >> org.apache.cassandra.service.StorageService.initClient(StorageService.java:289) >>        at WordCountSetup.main(Unknown Source) >> >> >> Sorry, I'm a bit new to this... help? >> >> >> On Wed, Apr 7, 2010 at 6:07 PM, Stu Hood wrote: >>

Re: Iterate through entire data set

2010-04-08 Thread Benjamin Black
Client(StorageService.java:289) >        at WordCountSetup.main(Unknown Source) > > > Sorry, I'm a bit new to this... help? > > > On Wed, Apr 7, 2010 at 6:07 PM, Stu Hood wrote: >> Please read the README in the contrib/word_count directory. >> >> -Ori

Re: Iterate through entire data set

2010-04-08 Thread Benjamin Black
e) > > > Sorry, I'm a bit new to this... help? > > > On Wed, Apr 7, 2010 at 6:07 PM, Stu Hood wrote: >> Please read the README in the contrib/word_count directory. >> >> -Original Message- >> From: "Sonny Heer" >> Sent: Wednesday, Ap

Re: Iterate through entire data set

2010-04-08 Thread Sonny Heer
Wed, Apr 7, 2010 at 6:07 PM, Stu Hood wrote: > Please read the README in the contrib/word_count directory. > > -Original Message- > From: "Sonny Heer" > Sent: Wednesday, April 7, 2010 6:33pm > To: user@cassandra.apache.org > Subject: Re: Iterate through entir

Re: Iterate through entire data set

2010-04-07 Thread Stu Hood
Please read the README in the contrib/word_count directory. -Original Message- From: "Sonny Heer" Sent: Wednesday, April 7, 2010 6:33pm To: user@cassandra.apache.org Subject: Re: Iterate through entire data set Jon, I've got the word_count.jar and a Hadoop cluster. How

Re: Iterate through entire data set

2010-04-07 Thread Sonny Heer
Jon, I've got the word_count.jar and a Hadoop cluster. How do you usually run this sample? On Wed, Apr 7, 2010 at 3:04 PM, Jonathan Ellis wrote: > Yes > > On Wed, Apr 7, 2010 at 5:01 PM, Sonny Heer wrote: >> These examples work on Cassandra .06 and Hadoop .20.2? >> >> On Wed, Apr 7, 2010 at 2:

Re: Iterate through entire data set

2010-04-07 Thread Jonathan Ellis
Yes On Wed, Apr 7, 2010 at 5:01 PM, Sonny Heer wrote: > These examples work on Cassandra .06 and Hadoop .20.2? > > On Wed, Apr 7, 2010 at 2:49 PM, Jonathan Ellis wrote: >> Look at the READMEs for contrib/word_count and contrib/pig. >> >> On Wed, Apr 7, 2010 at 4:47 PM, Sonny Heer wrote: >>> I n

Re: Iterate through entire data set

2010-04-07 Thread Sonny Heer
These examples work on Cassandra .06 and Hadoop .20.2? On Wed, Apr 7, 2010 at 2:49 PM, Jonathan Ellis wrote: > Look at the READMEs for contrib/word_count and contrib/pig. > > On Wed, Apr 7, 2010 at 4:47 PM, Sonny Heer wrote: >> I need a way to process all of my data set. >> >> A way to process e

Re: Iterate through entire data set

2010-04-07 Thread Jonathan Ellis
Look at the READMEs for contrib/word_count and contrib/pig. On Wed, Apr 7, 2010 at 4:47 PM, Sonny Heer wrote: > I need a way to process all of my data set. > > A way to process every keyspace, CF, row, column, and perform some > operation based on that mapped combination. > > The map bucket would

Iterate through entire data set

2010-04-07 Thread Sonny Heer
I need a way to process all of my data set. A way to process every keyspace, CF, row, column, and perform some operation based on that mapped combination. The map bucket would collect down to column name. Is there a map/reduce program which shows how to go about doing this?