Hi, is there a paper or a document where one can read how Spark reads Cassandra
data in parallel? And how it writes data back from RDDs? Its a bit hard to have
a clear picture in mind.
Thank you,
Pavel Velikhov
> On Mar 3, 2015, at 1:08 AM, Rumph, Frens Jan wrote:
>
> Hi all,
>
Hi Marcelo,
Were you able to use the Spark SQL features of the Cassandra connector? I
couldn’t make a .jar that wouldn’t confict with Spark SQL native .jar…
So I ended up using only the basic features, cannot use SQL queries.
> On Feb 13, 2015, at 7:49 PM, Paulo Ricardo Motta Gomes
> wrote
> On Feb 12, 2015, at 12:37 AM, Robert Coli wrote:
>
> On Wed, Feb 11, 2015 at 2:22 AM, Pavel Velikhov <mailto:pavel.velik...@gmail.com>> wrote:
> 2. While trying to update the full dataset with a simple transformation
> (again via python driver), single node and clu
; Tel: 1649
> www.pythian.com <http://www.pythian.com/>
> On Wed, Feb 11, 2015 at 11:22 AM, Pavel Velikhov <mailto:pavel.velik...@gmail.com>> wrote:
> Hi,
>
> I’m using Cassandra to store NLP data, the dataset is not that huge (about
> 1TB), but I need to iterate
expect much heavier traffic
to/from Cassandra. Any hints, war stories, etc. very appreciated!
Thank you,
Pavel Velikhov