Re: Running java-driver in parallel cassandra-driver-core 2.1.5, multithreading wokrs extremely slow.

2015-07-03 Thread Serega Sheypak
Thu, Jul 2, 2015 at 1:39 AM Serega Sheypak > wrote: > >> What is the reason to do that? I understand BatchStatement as a kind of >> atomic insert hack. >> How it can help me to solve concurrency problem? 1 thread with sync >> insert gives me 1K ops/sec. 10 threads g

Re: Running java-driver in parallel cassandra-driver-core 2.1.5, multithreading wokrs extremely slow.

2015-07-02 Thread Serega Sheypak
liseconds 99% <= 0.02 milliseconds 99.9% <= 0.12 milliseconds what should I do to reach better performance when i use several threads? 2015-07-02 10:34 GMT+02:00 Vova Shelgunov : > Did you tried to use BatchStatement? > On Jul 2, 2015 11:00 AM, "Serega She

Running java-driver in parallel cassandra-driver-core 2.1.5, multithreading wokrs extremely slow.

2015-07-02 Thread Serega Sheypak
Hi, I have weird driver behaviour. Can you help me please to find the problem? Problem: I try to insert data using 10 threads. I see that 10 thread starts, they start to insert some data and then they hung. It takes enormous amount of time to insert (seconds for 1K inserts). It runs 1K per second i

Re: com.datastax.driver.core.exceptions.SyntaxError: line 1:37 no viable alternative at character

2015-06-24 Thread Serega Sheypak
omg!!! It was some weird unprinted character. That is why C* driver failed to parse it 2015-06-24 11:35 GMT+02:00 Serega Sheypak : > Sorry, misprint > //composeQuery() => INSERT INTO packets (id, fingerprint, mark) VALUES > (?, ?, ?); > PreparedStatement preparedStatement =

Re: com.datastax.driver.core.exceptions.SyntaxError: line 1:37 no viable alternative at character

2015-06-24 Thread Serega Sheypak
Sorry, misprint //composeQuery() => INSERT INTO packets (id, fingerprint, mark) VALUES (?, ?, ?); PreparedStatement preparedStatement = session.prepare(composeQuery()); //exception happens here! 2015-06-24 11:20 GMT+02:00 Serega Sheypak : > Hi, I'm trying to use bounded query and I

com.datastax.driver.core.exceptions.SyntaxError: line 1:37 no viable alternative at character

2015-06-24 Thread Serega Sheypak
Hi, I'm trying to use bounded query and I get weird error: Here is a query: Bounded query: INSERT INTO packets (id, fingerprint, mark) VALUES (?, ?, ?); Here is a code: PreparedStatement preparedStatement = session.prepare(composeQuery()); //composeQuery returns INSERT INTO packets (id, finge

spark-sql estimates Cassandra table with 3 rows as 8 TB of data, Cassandra 2.1, DSE 4.7

2015-06-17 Thread Serega Sheypak
Hi, spark-sql estimated input for Cassandra table with 3 rows as 8 TB. sometimes it's estimated as -167B. I run it on laptop, I don't have 8 TB space for the data. We use DSE 4.7 with bundled spark and spark-sql-thriftserver Here is the stat for a dummy select foo from bar where bar three rows an

Re: cassandra and spark from cloudera distirbution

2015-04-22 Thread Serega Sheypak
om/en/datastax_enterprise/4.6/datastax_enterprise/spark/sparkTOC.html > On Apr 22, 2015 2:05 PM, "Serega Sheypak" > wrote: > >> What is "embedded" spark? Where can I read about it? >> Right now we just install spark 1.2 built fro hadoop 2.4 and use it to &g

Re: cassandra and spark from cloudera distirbution

2015-04-22 Thread Serega Sheypak
h something else. > > However you could probably read or write from/to DSE / Cassandra from a > cloudera spark cluster using the open source DataStax connector. Are you > looking for a particular feature that is not available in Spark 1.1? > On Apr 22, 2015 1:50 PM, "Serega Sheyp

Re: cassandra and spark from cloudera distirbution

2015-04-22 Thread Serega Sheypak
ces/datastax-enterprise > > Thanks, > Jay > > On Wed, Apr 22, 2015 at 6:41 AM, Serega Sheypak > wrote: > >> Hi, are Cassandra and Spark from Cloudera compatible? >> Where can I find these compatilibity notes? >> > >

cassandra and spark from cloudera distirbution

2015-04-22 Thread Serega Sheypak
Hi, are Cassandra and Spark from Cloudera compatible? Where can I find these compatilibity notes?

Re: COPY command to export a table to CSV file

2015-04-20 Thread Serega Sheypak
hi, what happens if unloader meets blob field? 2015-04-20 23:43 GMT+02:00 Sebastian Estevez : > Try Brian's cassandra-unloader > > > All the best, > > > [image: datastax_logo.png] > > Sebastián Estévez

Re: OrderPreservingPartitioner and compound partition key

2015-04-09 Thread Serega Sheypak
I understand the reason, but If I user OrderPreservingPartitioner and have compound partition key, can I use select using only FIRST component of compound partition key? 2015-04-08 20:43 GMT+02:00 Robert Coli : > On Wed, Apr 8, 2015 at 1:27 AM, Serega Sheypak > wrote: > >

OrderPreservingPartitioner and compound partition key

2015-04-08 Thread Serega Sheypak
Hi imagine I have a table "events" with fields: ymd int user_id uuid ts timestamp attr_1 attr_2 with primary key ((ymd, user_id, ts)) and I set OrderPreservingPartitioner as a partitioner for the table ymd is int representation for the day: 20150410, 20150411, e.t.c. Can I select from table usin

Re: Cassandra vs OS x

2015-04-07 Thread Serega Sheypak
one more good summary: http://superuser.com/questions/845143/any-limitation-for-having-many-files-in-a-directory-in-mac-os-x 2015-04-07 13:49 GMT+02:00 Serega Sheypak : > That is the reason for trying to work with ZFS. Unfortunately, it was > dropped. > And that is the reason pcie inte

Re: Cassandra vs OS x

2015-04-07 Thread Serega Sheypak
That is the reason for trying to work with ZFS. Unfortunately, it was dropped. And that is the reason pcie interface for SSD in my MacBook pro. 2015-04-07 13:46 GMT+02:00 Serega Sheypak : > HFS: > The Catalog File, which stores all the file and directory records in a > single data

Re: Cassandra vs OS x

2015-04-07 Thread Serega Sheypak
B in Mac OS and 8 KB in OS X. Fields in HFS are of fixed size, in HFS Plus the size can vary depending on the actual size of the data they store. 2015-04-07 13:41 GMT+02:00 Serega Sheypak : > It's single-threaded for writing :) > > 2015-04-07 13:13 GMT+02:00 Jean Tremblay <

Re: Cassandra vs OS x

2015-04-07 Thread Serega Sheypak
It's single-threaded for writing :) 2015-04-07 13:13 GMT+02:00 Jean Tremblay : > Hi, > > Why do everyone say that Cassandra should not be used in production on an > Mac OS x? > Why would this not work? > Are there anyone out there using OS x in production? What is your > experience with this? >

Re: Timeseries analysis using Cassandra and partition by date period

2015-04-06 Thread Serega Sheypak
1 hour. Cassandra is set to 3 weeks. > http://kairosdb.github.io/kairosdocs/CassandraSchema.html > > On Mon, Apr 6, 2015 at 3:27 PM, Serega Sheypak > wrote: > >> Thanks, is it a kind of opentsdb? >> >> 2015-04-05 18:28 GMT+02:00 Kevin Burton : >> >>>

Re: Timeseries analysis using Cassandra and partition by date period

2015-04-06 Thread Serega Sheypak
> 0.95 beta2 just shipped yesterday as well so you have good timing. > > https://github.com/kairosdb/kairosdb > > On Sat, Apr 4, 2015 at 11:29 AM, Serega Sheypak > wrote: > >> Okay, so bucketing by day/week/month is a capacity planning stuff and >> actual question

DSE 4.6 with OpsCenter 5.1.1, agent can't start, port 9042 is occupied by DSE

2015-04-05 Thread Serega Sheypak
Hi, getting weird problem when agent to connect to OpsCenter OpsCenter installed on VM with DSE and agent. It's not for production, I have 3 VMs with DSE and OpsCenter for dev/test purposes. The stacktrace from agent log is: vagrant@dsenode03:~$ sudo cat /var/log/datastax-agent/agent.log Starti

Re: Timeseries analysis using Cassandra and partition by date period

2015-04-04 Thread Serega Sheypak
days. > > Yes, all of the rows within a partition are stored on one physical node as > well as the replica nodes. > > -- Jack Krupansky > > On Sat, Apr 4, 2015 at 1:38 PM, Serega Sheypak > wrote: > >> >non-equal relation on a partition key is not supported >

Re: Timeseries analysis using Cassandra and partition by date period

2015-04-04 Thread Serega Sheypak
for the row key. Then you could query > within the partition. The partition key determines which node can satisfy > the query. Designing your partition key judiciously is the key (haha!) to > performant Cassandra applications. > > -- Jack Krupansky > > On Sat, Apr 4, 2015 at 9:33

Re: Timeseries analysis using Cassandra and partition by date period

2015-04-04 Thread Serega Sheypak
the data rate - is it many events per day or just a few > events per week, or over what time period. You need to be careful - you > don't want your Cassandra partitions to be too big (millions of rows) or > too small (just a few or even one row per partition.) > > -- Jack Kru

Timeseries analysis using Cassandra and partition by date period

2015-04-04 Thread Serega Sheypak
Hi, I switched from HBase to Cassandra and try to find problem solution for timeseries analysis on top Cassandra. I have a entity named "Event". "Event" has attributes: user_id - a guy who triggered event event_ts - when even happened event_type - type of event some_other_attr - some other attrs we

Re: Multinode Cassandra and sstableloader

2015-04-02 Thread Serega Sheypak
uot; >> >> I would definitely use this method to do this (I actually did already, >> multiple times). >> >> Up to you, I heard once that there is almost as much way of doing >> operational on Cassandra as the number of operators :). You should go with >> metho

Re: SSTable structure

2015-04-02 Thread Serega Sheypak
al and remote) counter > shards > > - bharat > > On Wed, Apr 1, 2015 at 12:02 AM, Serega Sheypak > wrote: > >> Hi bharat, >> you are talking about Cassandra 1.2.5 Does it fit Cassandra 2.1? >> Were there any significant changes to SSTable format and layou

Re: Testing sstableloader between Cassandra 2.1 DSE and community edition 2.1

2015-04-01 Thread Serega Sheypak
Got it. 2015-04-01 20:39 GMT+02:00 Michael Shuler : > On 04/01/2015 08:10 AM, Serega Sheypak wrote: > >> Sorry >> cluster1 community version is: ii cassandra 2.1.3 >>distributed storage system for structured data >> cluster2 DSE ve

Re: Testing sstableloader between Cassandra 2.1 DSE and community edition 2.1

2015-04-01 Thread Serega Sheypak
Sorry cluster1 community version is: ii cassandra 2.1.3 distributed storage system for structured data cluster2 DSE version is: ii dse-libcassandra4.6.2-1 The DataStax Enterprise package includes a production-certifie 2015-04-01 14:53 GMT+02:00 Serega Sheypak

Testing sstableloader between Cassandra 2.1 DSE and community edition 2.1

2015-04-01 Thread Serega Sheypak
Hi, I have 2 cassandra clusters. cluster1 is datastax community 2.1 cluster2 is datastax DSE I can run sstableloader from cluster1(Community) and stream data to cluster2 (DSE) But I get exception while streaming from cluster2 (DSE) to cluster1 (Community) The expection is: Could not retrieve e

Re: SSTable structure

2015-04-01 Thread Serega Sheypak
Hi bharat, you are talking about Cassandra 1.2.5 Does it fit Cassandra 2.1? Were there any significant changes to SSTable format and layout? Thank you, article is interesting. Hi jacob , HBase does it for example. http://hbase.apache.org/book.html#_hfile_format_2 It would be great to give general

Re: Multinode Cassandra and sstableloader

2015-03-31 Thread Serega Sheypak
> > I never used the tool, but that's what would be "logical" imho. Wait for a > confirmation as I wouldn't to lead you to a failure of any kind. Also, I > don't know if data is also replicated directly with sstableloader or if you > need to repair c2 after l

Multinode Cassandra and sstableloader

2015-03-31 Thread Serega Sheypak
Hi, I have a simple question and can't find related info in docs. I have cluster1 with 3 nodes and cluster2 with 5 nodes. I want to transfer whole keyspace named 'mykeyspace' data from cluster1 to cluster2 using sstableloader. I understand that it's not the best solution, I need it for testing pu