Re: Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Thomas Heller wrote: Hey, Is this possible in 0.6.0? (Note: for the next startToken, I was just planning on computing the MD5 digest of the last key directly since I'm accessing Cassandra through Thrift.) Can't speak for 0.6.0 but it works for 0.6.3. Just implemented this in ruby (minus

Re: Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Jonathan Ellis wrote: There have been a number of bug fixes to this since 0.6.0 -- as Thomas said, it works in 0.6.3. (Although there is one related bug scheduled to be fixed in 0.6.4, https://issues.apache.org/jira/browse/CASSANDRA-1042) Ah, this is exactly one of the cases I've been seeing!

Re: Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Hi Jonathan, The code snippet below was from the repository. I mentioned 0.6.0 specifically just to confirm that reading a CF using token-based range queries with the RandomPartitioner should (or shouldn't) also work in that version. I've seen discussions about whether range queries are now s

Reading all rows in a column family in parallel

2010-07-08 Thread Brent N. Chun
Hello, I'm running Cassandra 0.6.0 on a cluster and have an application that needs to read all rows from a column family using the Cassandra Thrift API. Ideally, I'd like to be able to do this by having all nodes in the cluster read in parallel (i.e., each node reads a disjoint set of rows th