Re: RDD partitions per executor in Cassandra Spark Connector

2015-03-03 Thread Pavel Velikhov
Hi, is there a paper or a document where one can read how Spark reads Cassandra data in parallel? And how it writes data back from RDDs? Its a bit hard to have a clear picture in mind. Thank you, Pavel Velikhov > On Mar 3, 2015, at 1:08 AM, Rumph, Frens Jan wrote: > > Hi all, >

Re: best supported spark connector for Cassandra

2015-02-13 Thread Pavel Velikhov
Hi Marcelo, Were you able to use the Spark SQL features of the Cassandra connector? I couldn’t make a .jar that wouldn’t confict with Spark SQL native .jar… So I ended up using only the basic features, cannot use SQL queries. > On Feb 13, 2015, at 7:49 PM, Paulo Ricardo Motta Gomes > wrote

Re: Two problems with Cassandra

2015-02-12 Thread Pavel Velikhov
> On Feb 12, 2015, at 12:37 AM, Robert Coli wrote: > > On Wed, Feb 11, 2015 at 2:22 AM, Pavel Velikhov <mailto:pavel.velik...@gmail.com>> wrote: > 2. While trying to update the full dataset with a simple transformation > (again via python driver), single node and clu

Re: Two problems with Cassandra

2015-02-11 Thread Pavel Velikhov
; Tel: 1649 > www.pythian.com <http://www.pythian.com/> > On Wed, Feb 11, 2015 at 11:22 AM, Pavel Velikhov <mailto:pavel.velik...@gmail.com>> wrote: > Hi, > > I’m using Cassandra to store NLP data, the dataset is not that huge (about > 1TB), but I need to iterate

Two problems with Cassandra

2015-02-11 Thread Pavel Velikhov
expect much heavier traffic to/from Cassandra. Any hints, war stories, etc. very appreciated! Thank you, Pavel Velikhov