That's good info! Thanks!
2013/1/16 William Oberman
> DataStax recommended (forget the reference) to use the ephemeral disks in
> RAID0, which is what I've been running for well over a year now in
> production.
>
> In terms of how I'm doing Cassandra/AWS/Hadoop, I started by doing the
> split d
DataStax recommended (forget the reference) to use the ephemeral disks in
RAID0, which is what I've been running for well over a year now in
production.
In terms of how I'm doing Cassandra/AWS/Hadoop, I started by doing the
split data center thing (one DC for low latency queries, one DC for
hadoop
William,
I just saw your message today. I am using Cassandra + Amazon EMR
(hadoop 1.0.3) but I am not using PIG as you are. I set my configuration
vars in Java, as I have a custom jar file and I am using
ColumnFamilyInputFormat.
However, if I understood well your problem, the only thing yo
So I've made it work, but I don't "get it" yet.
I have no idea why my DIY server works when I set the environment variables
on the machine that kicks off pig ("master"), and in EMR it doesn't. I
recompiled ConfigHelper and CassandraStorage with tons of debugging, and in
EMR I can see the hadoop C
On all tasktrackers, I see:
java.io.IOException: PIG_OUTPUT_INITIAL_ADDRESS or PIG_INITIAL_ADDRESS
environment variable not set
at
org.apache.cassandra.hadoop.pig.CassandraStorage.setStoreLocation(CassandraStorage.java:821)
at
org.apache.pig.backend.hadoop.executionengine.mapReduceL
> Instead, I get an error from CassandraStorage that the initial address isn't
> set (on the slave, the master is ok).
Can you post the full error ?
Cheers
-
Aaron Morton
Freelance Cassandra Developer
New Zealand
@aaronmorton
http://www.thelastpickle.com
On 4/01/2013, at 11:15
Anyone ever try to read or write directly between EMR <-> Cassandra?
I'm running various Cassandra resources in Ec2, so the "physical
connection" part is pretty easy using security groups. But, I'm having
some configuration issues. I have managed to get Cassandra + Hadoop
working in the past usi