Re: Out of Memory Error While Opening SSTables on Startup

2015-03-19 Thread Jan
Paul Nickerson;  curious, did you get a solution to your problem ?  Regards,Jan/  On Tuesday, February 10, 2015 5:48 PM, Flavien Charlon wrote: I already experienced the same problem (hundreds of thousands of SSTables) with Cassandra 2.1.2. It seems to appear when running an incre

Re: Cassandra Read Timeout

2015-03-19 Thread Jan
Yulian;  Quote :Raw size is aroung 190MB.There are bigger raws with similar structure ( its index raws , which actually stores keys ) and everything is working fine on them, everything is working also fine on this cf but on other raw. Tables data from CFStats ( First table has bigger raws but wor

Re: active queries

2015-03-19 Thread Rahul Bhardwaj
Thanks to all, I will get back to you after looking all. On Fri, Mar 20, 2015 at 2:56 AM, Robert Coli wrote: > On Thu, Mar 19, 2015 at 1:30 PM, Jan wrote: > >> Answer: nodetool tpstats & nodetool cfsstats >> > > For a better view of this : > > https://github.com/hailocab/ctop > > =Rob >

Re: Adding new node to Cassandra cluster is too slow

2015-03-19 Thread Pranay Agarwal
Also, the new nodes (3 of them, in *UJ state*) are showing some data size (~10g). Is there any data loss chances with stopping the cassandra on them? On Thu, Mar 19, 2015 at 6:02 PM, Pranay Agarwal wrote: > Thanks Rob, You are right. I am using ReleaseVersion: 2.1.0 > > What do you mean by point

Re: Adding new node to Cassandra cluster is too slow

2015-03-19 Thread Pranay Agarwal
Thanks Rob, You are right. I am using ReleaseVersion: 2.1.0 What do you mean by point 3? Also, by doing one at a time, does it mean wait till nodetool status of the new node is UN from UJ? On Thu, Mar 19, 2015 at 5:44 PM, Robert Coli wrote: > On Thu, Mar 19, 2015 at 5:32 PM, Pranay Agarwal > w

Re: Adding new node to Cassandra cluster is too slow

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 5:32 PM, Pranay Agarwal wrote: > I have 14 nodes cassandra cluster, each node as around 50gb of data. I > added 3 new nodes to the cluster and I can see the status as *UJ *for the > new nodes. They have been in that for almost a day now and their data size > seems to be sa

Adding new node to Cassandra cluster is too slow

2015-03-19 Thread Pranay Agarwal
Hi, I have 14 nodes cassandra cluster, each node as around 50gb of data. I added 3 new nodes to the cluster and I can see the status as *UJ *for the new nodes. They have been in that for almost a day now and their data size seems to be same as well. There is almost no CPU or disk usage either on t

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 4:56 PM, Rahul Neelakantan wrote: > Wouldn't GC Grace set to 34 days increase the bloat in the DB? > Yes, but as I say in the ticket, my belief is that the fixed cost of repair combined with the fact that it frequently doesn't work at all (hangs forever, etc.) is much mor

Re: best way to measure repair times?

2015-03-19 Thread Rahul Neelakantan
Wouldn't GC Grace set to 34 days increase the bloat in the DB? Rahul > On Mar 19, 2015, at 3:02 PM, Robert Coli wrote: > >> On Thu, Mar 19, 2015 at 10:30 AM, Ian Rose wrote: >> I'd like to (a) monitor how long my repairs are taking, and (b) know when a >> repair is finished so that I can take

Re: Seed Node

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 3:56 PM, jean paul wrote: > Please,i have a question a bout the seed node.. as i read it is the > bootstrap node, each new node joins the seed node that's it? > if it leaves the cluster, how can a new node joins the rest of the group ? > What "a seed" is within Cassandra

Seed Node

2015-03-19 Thread jean paul
Hello All, Please,i have a question a bout the seed node.. as i read it is the bootstrap node, each new node joins the seed node that's it? if it leaves the cluster, how can a new node joins the rest of the group ? Thanks a lot for answer. Best Regards.

Re: active queries

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 1:30 PM, Jan wrote: > Answer: nodetool tpstats & nodetool cfsstats > For a better view of this : https://github.com/hailocab/ctop =Rob

Re: FileNotFoundException

2015-03-19 Thread Jan
HI Batranut; In both errors you described above the files seem to be missing while compaction is running. Without knowing what else is going on your system,  I would presume that this error occurs on this single node only and not your entire cluster.  Some guesses:a)  You may have a disk corrupt

RE: best way to measure repair times?

2015-03-19 Thread Jason Kushmaul | WDA
Ian, In my experience I don’t get any output from repair (2.0.7) that is useful until the keyspace is finished. Perhaps this has been solved but we do something much more painful: We tail the log on the node having repair run on it, watching for the first repair session, and then count each

Re: best way to measure repair times?

2015-03-19 Thread Ian Rose
Thanks Jan, although I'm a bit unsure of the details. It looks like when you run a repair this actually occurs over several "sessions". e.g. in your example above there are 2 different "repair session [...] finished" lines. So does it makes sense that I would want to measure between when I first

Re: Delete columns

2015-03-19 Thread Jan
Benyi ;  have you considered using the TTL option in case your columns are meant to be deleted after a predetermined amount of time ? Its probably the easiest way to get the task accomplished. cheersJan On Friday, February 27, 2015 10:38 AM, Benyi Wang wrote: In C* 2.1.2, is there

Re: active queries

2015-03-19 Thread Jan
HI Rahul;  your question: Can we see active queries on cassandra cluster. Is there any tool? Answer:     nodetool tpstats  &  nodetool  cfsstats  The nodetool tpstats command provides statistics about the number of active, pending, and completed tasks for each stage of Cassandra operations by th

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 1:03 PM, Jan wrote: > to respond to your specific question: > > You could pipe the output of your repair into a file and subsequently > determine the time taken. > By this method, what is the duration of a repair which will never complete? =Rob

Re: best way to measure repair times?

2015-03-19 Thread Jan
Ian;  to respond to your specific question: You could pipe the output of your repair into a file and subsequently determine the time taken.    example: nodetool repair -dc DC1 [2014-07-24 21:59:55,326] Nothing to repair for keyspace 'system' [2014-07-24 21:59:55,617] Starting repair command #2, re

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 12:53 PM, Paulo Motta wrote: > This is only true if you never have hardware failure. Hardware failure > means that > For the record, I hate this formulation for being a little too clever. " This is never true, because we live in a world where hardware fails. " Would be a

Re: best way to measure repair times?

2015-03-19 Thread Paulo Motta
From: http://www.datastax.com/dev/blog/modern-hinted-handoff Repair and the fine print At first glance, it may appear that Hinted Handoff lets you safely get away without needing repair. This is only true if you never have hardware failure. Hardware failure means that 1. We lose “historical” d

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 12:13 PM, Ali Akhtar wrote: > Cassandra doesn't guarantee eventual consistency? > If you run regularly scheduled repair, it does. If you do not run repair, it does not. Hinted handoff, for example, is considered an optimization for repair, and does not assert that it pro

Re: best way to measure repair times?

2015-03-19 Thread Ali Akhtar
Cassandra doesn't guarantee eventual consistency? On Fri, Mar 20, 2015 at 12:04 AM, Robert Coli wrote: > On Thu, Mar 19, 2015 at 10:32 AM, Ali Akhtar wrote: > >> Just wondering - why do you have to trigger the repairs? Is that >> necessary in Cassandra? >> > > Manual repair is the only mechanis

Re: active queries

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 12:11 AM, Rahul Bhardwaj < rahul.bhard...@indiamart.com> wrote: > Can we see active queries on cassandra cluster. Is there any tool? > Not really : https://issues.apache.org/jira/browse/CASSANDRA-5084 =Rob

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 10:32 AM, Ali Akhtar wrote: > Just wondering - why do you have to trigger the repairs? Is that necessary > in Cassandra? > Manual repair is the only mechanism in Cassandra which guarantees consistency. A repair must be run once per gc_grace_seconds in every column family

Re: best way to measure repair times?

2015-03-19 Thread Robert Coli
On Thu, Mar 19, 2015 at 10:30 AM, Ian Rose wrote: > I'd like to (a) monitor how long my repairs are taking, and (b) know when > a repair is finished so that I can take some kind of followup action. > What's the best way to tackle either or both of these? > https://issues.apache.org/jira/browse/C

Re: best way to measure repair times?

2015-03-19 Thread Ali Akhtar
Just wondering - why do you have to trigger the repairs? Is that necessary in Cassandra? (Sorry for the off topic question) On Thu, Mar 19, 2015 at 10:30 PM, Ian Rose wrote: > Howdy - > > I'd like to (a) monitor how long my repairs are taking, and (b) know when > a repair is finished so that I

best way to measure repair times?

2015-03-19 Thread Ian Rose
Howdy - I'd like to (a) monitor how long my repairs are taking, and (b) know when a repair is finished so that I can take some kind of followup action. What's the best way to tackle either or both of these? Some potentially relevant details: - running community apache cassandra (not DSE) - vers

Re: Timeout error in fetching million rows as results using clustering keys

2015-03-19 Thread Jack Krupansky
Content management (large blobs such as images and video) can be done with Cassandra, but it is tricky and great care is needed. As with any Cassandra app, you need to model your data based on how you intend to query and access the data. You can certainly access large amounts of data with Cassandra

Re: Timeout error in fetching million rows as results using clustering keys

2015-03-19 Thread Kai Wang
With your reading path and data model, it doesn't matter how many nodes you have. All data with the same image_caseid is physically located on one node (Well, on RF nodes but only one of those will try to server your query). You are not taking advantage of Cassandra by creating hot spots on both re

active queries

2015-03-19 Thread Rahul Bhardwaj
Hi , Can we see active queries on cassandra cluster. Is there any tool? Please help. Regards: Rahul Bhardwaj -- Follow IndiaMART.com for latest updates on this and more: