Re: AWS EMR <-> Cassandra

2013-01-16 Thread Marcelo Elias Del Valle
That's good info! Thanks! 2013/1/16 William Oberman > DataStax recommended (forget the reference) to use the ephemeral disks in > RAID0, which is what I've been running for well over a year now in > production. > > In terms of how I'm doing Cassandra/AWS/Hadoop, I started by doing the > split d

Re: AWS EMR <-> Cassandra

2013-01-16 Thread William Oberman
DataStax recommended (forget the reference) to use the ephemeral disks in RAID0, which is what I've been running for well over a year now in production. In terms of how I'm doing Cassandra/AWS/Hadoop, I started by doing the split data center thing (one DC for low latency queries, one DC for hadoop

Re: AWS EMR <-> Cassandra

2013-01-16 Thread Marcelo Elias Del Valle
William, I just saw your message today. I am using Cassandra + Amazon EMR (hadoop 1.0.3) but I am not using PIG as you are. I set my configuration vars in Java, as I have a custom jar file and I am using ColumnFamilyInputFormat. However, if I understood well your problem, the only thing yo

Re: AWS EMR <-> Cassandra

2013-01-04 Thread William Oberman
So I've made it work, but I don't "get it" yet. I have no idea why my DIY server works when I set the environment variables on the machine that kicks off pig ("master"), and in EMR it doesn't. I recompiled ConfigHelper and CassandraStorage with tons of debugging, and in EMR I can see the hadoop C

Re: AWS EMR <-> Cassandra

2013-01-04 Thread William Oberman
On all tasktrackers, I see: java.io.IOException: PIG_OUTPUT_INITIAL_ADDRESS or PIG_INITIAL_ADDRESS environment variable not set at org.apache.cassandra.hadoop.pig.CassandraStorage.setStoreLocation(CassandraStorage.java:821) at org.apache.pig.backend.hadoop.executionengine.mapReduceL

Re: AWS EMR <-> Cassandra

2013-01-03 Thread aaron morton
> Instead, I get an error from CassandraStorage that the initial address isn't > set (on the slave, the master is ok). Can you post the full error ? Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 4/01/2013, at 11:15

AWS EMR <-> Cassandra

2013-01-03 Thread William Oberman
Anyone ever try to read or write directly between EMR <-> Cassandra? I'm running various Cassandra resources in Ec2, so the "physical connection" part is pretty easy using security groups. But, I'm having some configuration issues. I have managed to get Cassandra + Hadoop working in the past usi