Re: RE: Re: Re: High mutation stage in multi dc deployment

2021-07-20 Thread Jeff Jirsa
This is sufficiently atypical that many people aren't going to have enough intuition to figure it out without seeing your metrics / logs / debugging data (e.g. heap dumps). My only guess, and it's a pretty big guess, is that your write timeout is low enough (or network quality bad enough, though

RE: RE: Re: Re: High mutation stage in multi dc deployment

2021-07-20 Thread MyWorld
Kindly help in this regard. What could be the possible reason for load and mutation spike in india data center On 2021/07/20 00:14:56 MyWorld wrote: > Hi Arvinder, > It's a separate cluster. Here max partition size is 32mb. > > On 2021/07/19 23:57:27 Arvinder Dhillon wrote: > > Is this the same cl

RE: Re: Re: High mutation stage in multi dc deployment

2021-07-19 Thread MyWorld
Hi Arvinder, It's a separate cluster. Here max partition size is 32mb. On 2021/07/19 23:57:27 Arvinder Dhillon wrote: > Is this the same cluster with 1G partition size? > > -Arvinder > > On Mon, Jul 19, 2021, 4:51 PM MyWorld wrote: > > > Hi daemeon, > > We have already tuned the TCP settings to i

Re: Re: High mutation stage in multi dc deployment

2021-07-19 Thread Arvinder Dhillon
Is this the same cluster with 1G partition size? -Arvinder On Mon, Jul 19, 2021, 4:51 PM MyWorld wrote: > Hi daemeon, > We have already tuned the TCP settings to improve the bandwidth. Earlier > we had lot of hint and mutation msg drop which were gone after tuning TCP. > Moreover we are writing

RE: Re: High mutation stage in multi dc deployment

2021-07-19 Thread MyWorld
Hi daemeon, We have already tuned the TCP settings to improve the bandwidth. Earlier we had lot of hint and mutation msg drop which were gone after tuning TCP. Moreover we are writing with CL local quorum at US side, so ack is taken from local DC. I m still concern what could be reason of increase

RE: Re: High mutation stage in multi dc deployment

2021-07-19 Thread MyWorld
Hi Patrick, Currently we are using 3.11.6 apache cassandra version. We are performing write with CL local quorum in US side DC. We have 4-5 tables with supplier_details, supplier_prod_details, supplier_rating. We also have an mview attached with rating table. For batching part, I need to check with

Re: High mutation stage in multi dc deployment

2021-07-19 Thread Patrick McFadin
Hi Ashish, Can you give us some information about some of the details? Specifically some indication on the version of Cassandra, data model, consistency levels used, and how you are bulk loading. Is this a batch by any chance? Patrick On Mon, Jul 19, 2021 at 10:38 AM MyWorld wrote: > Hi all, >

Re: High mutation stage in multi dc deployment

2021-07-19 Thread daemeon reiydelle
You may want to think about the latency impacts of a cluster that has one node "far away". This is such a basic design flaw that you need to do some basic learning, and some basic understanding of networking and latency. On Mon, Jul 19, 2021 at 10:38 AM MyWorld wrote: > Hi all, > > Currently

High mutation stage in multi dc deployment

2021-07-19 Thread MyWorld
Hi all, Currently we have a cluster with 2 DC of 3 nodes each. One DC is in GCP-US while other is in GCP-India. Just to add here, configuration of every node accross both DC is same. Cpu-6, Ram-32gb, Heap-8gb We do all our write on US data center. While performing a bulk write on GCP US, we obser

Re: number of racks in a deployment with VMs

2021-02-15 Thread Bowen Song
The reason for the zoho (and my) emails go to the spam box is because the Apache mailing list software is messing around with the DKIM signature and the "From:" address. I have created INFRA-21415 for this. On 15/02/2021 22:36, Kane Wilson wr

Re: number of racks in a deployment with VMs

2021-02-15 Thread Kane Wilson
There are operational advantages to having #racks == RF, however it's by no means mandatory. Having more racks than RF doesn't cause any availability/health/balance problems, it is only disadvantageous in that it makes some cluster maintenance tasks more expensive/unwieldy like repairs and DC migra

number of racks in a deployment with VMs

2021-02-14 Thread onmstester onmstester
Hi, In a article by thelastpickle [1], i noticed: The key here is to configure the cluster so that for a given datacenter the number of racks is the same as the replication factor. When using virtual machines as Cassandra nodes we have to set up the cluster in a way that number of racks is

Re: Deployment

2019-01-12 Thread amit sehas
ch node I meant), but obviously keeping application in same region as that of cassandra cluster would make sense, can't comment much on cloud deployment. Thanks,Rajesh On Sat, Jan 12, 2019 at 8:54 AM amit sehas wrote: I am new to Cassandra, i am wondering how the Cassandra applications

Re: Deployment

2019-01-12 Thread Andy Tolbert
because it would have > bigger implications when stuffs like re-balancing would occur. So, your > application should be unaware where your data lies (in which node I meant), > but obviously keeping application in same region as that of cassandra > cluster would make sense, can't com

Re: Deployment

2019-01-12 Thread amit sehas
lication should be unaware where your data lies (in which node I meant), but obviously keeping application in same region as that of cassandra cluster would make sense, can't comment much on cloud deployment. Thanks,Rajesh On Sat, Jan 12, 2019 at 8:54 AM amit sehas wrote: I am new to Cass

Re: Deployment

2019-01-12 Thread Rajesh Kishore
your application should be unaware where your data lies (in which node I meant), but obviously keeping application in same region as that of cassandra cluster would make sense, can't comment much on cloud deployment. Thanks, Rajesh On Sat, Jan 12, 2019 at 8:54 AM amit sehas wrote: > I a

Re: Deployment

2019-01-11 Thread Goutham reddy
In the cloud world all the Apps and servers are deployed independently. Having said that application has to submit request to post or get the data it’s the Cassandra takes care of routing it internally to get the data. Yes it’s better to have on same cloud for better performance. Hope you got some

Deployment

2019-01-11 Thread amit sehas
I am new to Cassandra, i am wondering how the Cassandra applications are deployed in the cloud. Does Cassandra have a client server architecture and the application is deployed as a 3rd tier that sends over queries to the clients, which then submit them to the Cassandra servers?  Or does the app

Deployment of .net application on production is erroring out

2015-08-28 Thread Asit KAUSHIK
Hi All, Please excuse my limited knowledge . We have an application in .Net and the backend database is Cassandra.We have deployed in our application into production which is behing the Firewall. We have opened the 9042 Port from our Webserver to the cassandra cluster. But again we are getting the

Re: tokens and RF for multiple phases of deployment

2012-06-01 Thread Chong Zhang
RAC1Up Normal 7.07 MB > 0.00% 1 > 10.10.10.2DC1 RAC1Up Normal 288.91 MB > 50.00% 85070591730234615865843651857942052864 > > Thanks, > Chong > > On Thu, May 31, 2012 at 5:48 AM, aaron morton wrote: > >> >> Could you

Re: tokens and RF for multiple phases of deployment

2012-05-31 Thread aaron morton
5865843651857942052864 > > Thanks, > Chong > > On Thu, May 31, 2012 at 5:48 AM, aaron morton wrote: > >> Could you provide some guide on how to assign the tokens in this growing >> deployment phases? > > background > http://www.datastax.com/docs/1.0

Re: tokens and RF for multiple phases of deployment

2012-05-31 Thread Chong Zhang
Chong On Thu, May 31, 2012 at 5:48 AM, aaron morton wrote: > > Could you provide some guide on how to assign the tokens in this growing > deployment phases? > > > background > http://www.datastax.com/docs/1.0/install/cluster_init#calculating-tokens-for-a-multi-data-center-clu

Re: tokens and RF for multiple phases of deployment

2012-05-31 Thread aaron morton
> Could you provide some guide on how to assign the tokens in this growing > deployment phases? background http://www.datastax.com/docs/1.0/install/cluster_init#calculating-tokens-for-a-multi-data-center-cluster Start with tokens for a 4 node cluster. Add the next 4 between between e

tokens and RF for multiple phases of deployment

2012-05-30 Thread Chong Zhang
maintenance. Could you provide some guide on how to assign the tokens in this growing deployment phases? I looked at some docs but not very clear on how to assign tokens on the fail-over case. Also if we use the same RF (3) in both DC, and use EACH_QUORUM for write and LOCAL_QUORUM for read, can the

Remote Hadoop Job Deployment

2012-01-24 Thread Brian O'Neill
FYI... we finally got around to releasing a version of Virgil that includes the ability to deploy jobs to remote Hadoop clusters running against Cassandra Column Families. http://brianoneill.blogspot.com/2012/01/virgil-remote-hadoop-job-deployment-via.html This has enabled an army of people to

Re: advice for EC2 deployment

2011-06-23 Thread Sasha Dolgy
we use a combination of Vyatta & OpenVPN on the nodes that are EC2 and nodes that aren't Ec2works a treat. On Thu, Jun 23, 2011 at 10:23 PM, Sameer Farooqui wrote: > EC2Snitch doesn't currently support multi-Regions in Amazon. > Tickets to track: > https://issues.apache.org/jira/browse/CASSAN

Re: advice for EC2 deployment

2011-06-23 Thread Sameer Farooqui
. > > has anybody got there nodes talking to each other across regions by just > using public-dns? > > I am also looking into open vpn and how to deploy it. > > -- > View this message in context: > http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/advice-for-EC2-d

Re: advice for EC2 deployment

2011-06-23 Thread pankajsoni0126
ss regions by just using public-dns? I am also looking into open vpn and how to deploy it. -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/advice-for-EC2-deployment-tp6294613p6508278.html Sent from the cassandra-u...@incubator.apache.org ma

Re: advice for EC2 deployment

2011-06-23 Thread Sasha Dolgy
> that I >> > posted this "IP address resolution in MultiDC setup". >> > >> > But when it is to get nodes talking to each other on different regions >> > say, >> > us-east and us-west over private IP's of EC2 nodes I am facing pro

Re: advice for EC2 deployment

2011-06-23 Thread pankaj soni
> > But when it is to get nodes talking to each other on different regions > say, > > us-east and us-west over private IP's of EC2 nodes I am facing problems. > > > > I am assuming if Cassandra is built for multi-DC setup it should be > easily > > deployed

Re: advice for EC2 deployment

2011-04-28 Thread aaron morton
If you are not going to be multi-region straight away, but wish to be in the near future I would consider: - 1 region - 2 AZ's, with the same number of nodes - Using the EC2Snitch as is, this will map to 1 cassandra DC and 2 cassandra Racks - Using the NetworkTopology strategy For background s

Re: advice for EC2 deployment

2011-04-27 Thread William Oberman
I think you're right about changing NetworkToplogyStrategy, but the timing isn't working in my favor at this point. I wonder how bad that will really be On Wed, Apr 27, 2011 at 9:35 AM, Sasha Dolgy wrote: > so can you not simply leverage a strategy that replicates data between > "racks" and

Re: advice for EC2 deployment

2011-04-27 Thread Sasha Dolgy
so can you not simply leverage a strategy that replicates data between "racks" and at some point in the future when you move to multi-dc upgrade the replication strategy to maintain the current replication and add in some replication between DC's ... ? i'll go re-read your posts to see if you've a

Re: advice for EC2 deployment

2011-04-27 Thread William Oberman
Oh, and Route53 doesn't do anything automatically, but there is an API to manage the DNS. It's up to you to run a task on instance boot/terminate, or a cron job if you want to do this trick (for now, seems like a solid future feature of Route53). Though, I hear geographical aware Route53 is alrea

Re: advice for EC2 deployment

2011-04-27 Thread William Oberman
I don't think of it as migrating an instance, it's more of a destroy/start with EC2. But, I still think it would be very useful to spin up a set of instances with known hostnames (cassandra1, 2, 3... N) and be able to quickly SSH to them by doing "ssh ec2u...@cassandra1.random.ec2.mydomain.com ".

Re: advice for EC2 deployment

2011-04-27 Thread William Oberman
Thanks Sasha. Fortunately/unfortunately I did realize the default & current behavior of the Ec2Snitch, but my application isn't multi-region capable (yet), so I need to get intra-region redundancy. And having a SingleRegionEc2Snitch that did DC=ec2zone and RACK=??? would be much better for me (fo

Re: advice for EC2 deployment

2011-04-27 Thread Sasha Dolgy
if you migrate the instance, does Route53 automatically re-map all the information to the new ec2 instance? another issue is that cassandra only maintains the IP of the other nodes, and not the hostname (assumed based on output of the nodetool ring) ... which means, if you migrate the instance a

Re: advice for EC2 deployment

2011-04-27 Thread Sasha Dolgy
Hi William, The default behavior of Ec2Snitch is outlined below: http://svn.apache.org/repos/asf/cassandra/trunk/src/java/org/apache/cassandra/locator/Ec2Snitch.java // Split "us-east-1a" or "asia-1a" into "us-east"/"1a" and "asia"/"1a". String azone = new String(b ,"UTF-8");

Re: advice for EC2 deployment

2011-04-27 Thread William Oberman
It's great advice, but I'm still torn. I've never done multi-region work before, and I'd prefer to wait for 0.8 with built-in inter-node security, but I'm otherwise ready to roll (and need to roll) cassandra out sooner than that. Given how well my system held up with a total single AZ failure, I'

Re: advice for EC2 deployment

2011-04-27 Thread William Oberman
nodes I am facing problems. > > > > I am assuming if Cassandra is built for multi-DC setup it should be > easily > > deployed with node1's DC1's public IP listed as seed in all nodes in DC2 > and > > to gain idea about network topology? I have hit

Re: advice for EC2 deployment

2011-04-27 Thread Sasha Dolgy
> to gain idea about network topology? I have hit a dud for deployment in such > scenario. > > Or is it there any way possible to use Private IP's for such a scenario in > EC2, as Public Ip are less secure and costly?

Re: advice for EC2 deployment

2011-04-27 Thread pankajsoni0126
I am assuming if Cassandra is built for multi-DC setup it should be easily deployed with node1's DC1's public IP listed as seed in all nodes in DC2 and to gain idea about network topology? I have hit a dud for deployment in such scenario. Or is it there any way possible to use Private IP

Re: advice for EC2 deployment

2011-04-27 Thread pankajsoni0126
I am assuming if Cassandra is built for multi-DC setup it should be easily deployed with node1's DC1's public IP listed as seed in all nodes in DC2 and to gain idea about network topology? I have hit a dud for deployment in such scenario. Or is it there any way possible to use Private IP

Re: advice for EC2 deployment

2011-04-27 Thread aaron morton
Using the EC2Snitch you could have one AZ in us-east-1 and one Az in us-west-1, treat each AZ as a single rack and each region as a DC. The network topology is rack aware so will prefer request that go to the same rack (not much of an issue when you have only one rack). If possible I would use

Re: advice for EC2 deployment

2011-04-26 Thread William Oberman
I see what you're saying. I was able to control write latency on mysql using insert vs insert delayed (what I feel is MySQLs poor man's eventual consistency option) + the fact that replication was a background asynchronous process. In terms of read latency, I was able to do up to a few hundred we

Re: advice for EC2 deployment

2011-04-26 Thread aaron morton
One difference between Cassandra and MySQL replication may be when the network IO happens. Was the MySQL replication synchronous on transaction commit ? I was only aware that it had async replication, which means the client is not exposed to the network latency. In cassandra the network latency

Re: advice for EC2 deployment

2011-04-26 Thread William Oberman
Thanks Aaron! Unless no one on this list uses EC2, there were a few minor troubles end of last week through the weekend which taught me a lot about obscure failure modes in various applications I use :-) My original post was trying to be more redundant than fast, which has been by overall goal fr

Re: advice for EC2 deployment

2011-04-25 Thread aaron morton
For background see this article: http://www.datastax.com/dev/blog/deploying-cassandra-across-multiple-data-centers And this recent discussion http://www.mail-archive.com/user@cassandra.apache.org/msg12502.html Issues that may be a concern: - lots of cross AZ latency in us-east, e.g. LOCAL_QUORUM

advice for EC2 deployment

2011-04-21 Thread William Oberman
Hi, My service is not yet ready to be fully multi-DC, due to how some of my legacy MySQL stuff works. But, I wanted to get cassandra going ASAP and work towards multi-DC. I have two main cassandra use cases: one where I can handle eventual consistency (and all of the writes/reads are currently O

Re: Multi-DC Deployment

2011-04-21 Thread Peter Schuller
> Again, for a lot of services, it is fully acceptable, and a lot better, to > return an almost complete (or maybe even complete, but no verified by > quorum) result than no result at all. +1, except maybe "a lot" depending on how one chooses to define that. There are definitely cases where suffic

Re: Multi-DC Deployment

2011-04-21 Thread Peter Schuller
> Cassandra doesn't "replicate sstable corruptions". It detects corrupt > data and only replicates good data. This is incorrect. Depending on the nature of the corruption it may spread to other nodes. Checksumming (done right) would be a great addition to alleiate this. Yes, there is code that tri

Re: Multi-DC Deployment

2011-04-20 Thread Terje Marthinussen
Sure, the update queue could just as well replicate problems, but the queue would be a lot simpler than cassandra and it would not modify already acknowledged data like like for instance compaction or read-repair/hint deliveries may. There is a fair bit of re-writing/re-assemblying of data even tho

Re: Multi-DC Deployment

2011-04-20 Thread Adrian Cockcroft
Queues replicate bad data just as well as anything else. The biggest source of bad data is broken app code... You will still need to implement a reconciliation/repair checker, as queues have their own failure modes when they get backed up. We have also looked at using queues to bounce data between

Re: Multi-DC Deployment

2011-04-20 Thread Terje Marthinussen
Assuming that you generally put an API on top of this, delivering to two or more systems then boils down to a message queue issue or some similar mechanism which handles secure delivery of messages. Maybe not trivial, but there are many products that can help you with this, and it is a lot easier t

Re: Multi-DC Deployment

2011-04-20 Thread Adrian Cockcroft
Hi Terje, If you feed data to two rings, you will get inconsistency drift as an update to one succeeds and to the other fails from time to time. You would have to build your own read repair. This all starts to look like "I don't trust Cassandra code to work, so I will write my own buggy one off ve

Re: Multi-DC Deployment

2011-04-19 Thread Terje Marthinussen
If you have RF=3 in both datacenters, it could be discussed if there is a point to use the built in replication in Cassandra at all vs. feeding the data to both datacenters and get 2 100% isolated cassandra instances that cannot replicate sstable corruptions between each others My point is rea

Re: Multi-DC Deployment

2011-04-19 Thread Adrian Cockcroft
If you want to use local quorum for a distributed setup, it doesn't make sense to have less than RF=3 local and remote. Three copies at both ends will give you high availability. Only one copy of the data is sent over the wide area link (with recent versions). There is no need to use mirrored or R

Re: Multi-DC Deployment

2011-04-18 Thread Terje Marthinussen
Hum... Seems like it could be an idea in a case like this with a mode where result is always returned (if possible), but where a flay saying if the consistency level was met, or to what level it was met (number of nodes answering for instance).? Terje On Tue, Apr 19, 2011 at 1:13 AM, Jonathan El

Re: Multi-DC Deployment

2011-04-18 Thread Jonathan Ellis
They will timeout until failure detector realizes the DC1 nodes are down (~10 seconds). After that they will immediately return UnavailableException until DC1 comes back up. On Mon, Apr 18, 2011 at 10:43 AM, Baskar Duraikannu wrote: > We are planning to deploy Cassandra on two data centers.   Let

Multi-DC Deployment

2011-04-18 Thread Baskar Duraikannu
We are planning to deploy Cassandra on two data centers. Let us say that we went with three replicas with 2 being in one data center and last replica in 2nd Data center. What will happen to Quorum Reads and Writes when DC1 goes down (2 of 3 replicas are unreachable)? Will they timeout? R

Re: Cassandra 2 DC deployment

2011-04-15 Thread Peter Schuller
> You are right about the automatic fallback to ONE. Its quite possible, if 2 > nodes die for some reason I will have the same problem. So probably the right > thing to do would be to read/write at ONE only when we lose a DC by changing > some manual configuration. Since we shouldn't be losing D

RE: Cassandra 2 DC deployment

2011-04-13 Thread Nair, Rajesh
user@cassandra.apache.org Subject: Re: Cassandra 2 DC deployment > When the down data center comes back up, the Quorum reads will result in a > read-repair, so you will get valid data.   Besides that, hinted handoff will > take care of getting data replicated to a previously down node. *Even

Re: Cassandra 2 DC deployment

2011-04-12 Thread Peter Schuller
> When the down data center comes back up, the Quorum reads will result in a > read-repair, so you will get valid data.   Besides that, hinted handoff will > take care of getting data replicated to a previously down node. *Eventually* though, but yes. I.e., there would be no expectation to insta

Re: Cassandra 2 DC deployment

2011-04-12 Thread Narendra Sharma
I think this is reasonable assuming you have enough backhaul to perform reads across DC if read requests hit DC2 (with one copy of data) or one replica from DC1 is down. Moreover, since you clearly stated that you would prefer availability over consistency, you should be prepared for stale reads :

Re: Cassandra 2 DC deployment

2011-04-12 Thread Jonathan Colby
When the down data center comes back up, the Quorum reads will result in a read-repair, so you will get valid data. Besides that, hinted handoff will take care of getting data replicated to a previously down node. You're example is a little unrealistic because you could theoretically have a D

Cassandra 2 DC deployment

2011-04-12 Thread Raj N
Hi experts, We are planning to deploy Cassandra in 2 datacenters. Let assume there are 3 nodes, RF=3, 2 nodes in 1 DC and 1 node in 2nd DC. Under normal operations, we would read and write at QUORUM. What we want to do though is if we lose a datacenter which has 2 nodes, DC1 in this case, we w

Re: Cassandra deployment options on Terremark

2011-02-22 Thread Jonathan Ellis
Good thread on this recently: http://www.mail-archive.com/user@cassandra.apache.org/msg09020.html On Tue, Feb 22, 2011 at 10:38 PM, Sanjay Sharma wrote: > Hi All, > I am looking for Cassandra deployment options on Terremark eCloud. > Terremark uses NAS drive so that every machine inst

Cassandra deployment options on Terremark

2011-02-22 Thread Sanjay Sharma
Hi All, I am looking for Cassandra deployment options on Terremark eCloud. Terremark uses NAS drive so that every machine instance gets allocated disk storage from this NAS, so not sure how the NAS replication model fits in with Cassandra replication model. The deployment size would be around 4

Re: Few questions regarding cassandra deployment on windows

2010-09-10 Thread Gary Dusbabek
On Thu, Sep 9, 2010 at 22:23, kannan chandrasekaran wrote: > > Thanks for the replies My comments in Bold... > Kannan > > > From: Gary Dusbabek > To: user@cassandra.apache.org > Sent: Thu, September 9, 2010 5:43:31 AM > Subject: Re: Few questions regarding cass

Re: Few questions regarding cassandra deployment on windows

2010-09-09 Thread kannan chandrasekaran
Thanks for the replies My comments in Bold... Kannan From: Gary Dusbabek To: user@cassandra.apache.org Sent: Thu, September 9, 2010 5:43:31 AM Subject: Re: Few questions regarding cassandra deployment on windows On Tue, Sep 7, 2010 at 17:42, kannan chandrasekaran wrote: > Hi All, >

Re: Few questions regarding cassandra deployment on windows

2010-09-09 Thread Gary Dusbabek
On Tue, Sep 7, 2010 at 17:42, kannan chandrasekaran wrote: > Hi All, > > We are currently considering Cassandra for our application. > > Platform: > * a single-node cluster. > * windows '08 > * 64-bit jvm > > For the sake of brevity let, > Cassandra service =  a single node cassandra server runnin

Re: Few questions regarding cassandra deployment on windows

2010-09-08 Thread kannan chandrasekaran
n for this limitation ? If yes, Can someone please help me understand the reason... Thanks Kannan From: Courtney To: user@cassandra.apache.org Sent: Tue, September 7, 2010 5:31:46 PM Subject: Re: Few questions regarding cassandra deployment on windows I ha

Re: Few questions regarding cassandra deployment on windows

2010-09-07 Thread Courtney
s yes, he did it, so ... From: kannan chandrasekaran Sent: Wednesday, September 08, 2010 1:20 AM To: user@cassandra.apache.org Subject: Re: Few questions regarding cassandra deployment on windows Can you please elaborate on why you think Cassandra would not be suitable for this ? Main reason

Re: Few questions regarding cassandra deployment on windows

2010-09-07 Thread kannan chandrasekaran
: Tue, September 7, 2010 4:38:41 PM Subject: Re: Few questions regarding cassandra deployment on windows This does not sound like a good application for Cassandra at all. Why are you using it? On Tue, Sep 7, 2010 at 3:42 PM, kannan chandrasekaran wrote: > Hi All, > > We are currently considering

Re: Few questions regarding cassandra deployment on windows

2010-09-07 Thread Benjamin Black
This does not sound like a good application for Cassandra at all. Why are you using it? On Tue, Sep 7, 2010 at 3:42 PM, kannan chandrasekaran wrote: > Hi All, > > We are currently considering Cassandra for our application. > > Platform: > * a single-node cluster. > * windows '08 > * 64-bit jvm >

Few questions regarding cassandra deployment on windows

2010-09-07 Thread kannan chandrasekaran
Hi All, We are currently considering Cassandra for our application. Platform: * a single-node cluster. * windows '08 * 64-bit jvm For the sake of brevity let, Cassandra service = a single node cassandra server running as an embedded service inside a JVM My use cases: 1) Start with a sc

Re: Deployment on AWS

2010-04-04 Thread Krishna Sankar
Dan, AFAIK, AZ gives you infrastructure redundancy but not necessarily geographical dispersion. Regions are meant for that (as well as other characteristics). An interesting blog on this topic http://alestic.com/2009/07/ec2-availability-zones Cheers On 4/4/10 Sun Apr 4, 10, "Dan Di Spalt

Re: Deployment on AWS

2010-04-04 Thread Benjamin Black
Not guaranteed within the same region. On Sun, Apr 4, 2010 at 6:48 PM, Dan Di Spaltro wrote: > A little off-topic, but is an availability zone in a separate physical > datacenter? >

Re: Deployment on AWS

2010-04-04 Thread Michael Russo
On 2010-04-04, at 10:18 PM, Masood Mortazavi wrote: > > (My question remains. I'm interested in seed configuration practice/recipe > when deploying on AWS. In the scenario, assume Cassandra sits behind some > other part of the service -- say, web container -- that are then exposed > publicly. C

Re: Deployment on AWS

2010-04-04 Thread Masood Mortazavi
See here: http://docs.amazonwebservices.com/AWSEC2/latest/UserGuide/index.html?concepts-regions-availability-zones.html (My question remains. I'm interested in seed configuration practice/recipe when deploying on AWS. In the scenario, assume Cassandra sits behind some other part of the service --

Re: Deployment on AWS

2010-04-04 Thread Dan Di Spaltro
A little off-topic, but is an availability zone in a separate physical datacenter? On Sat, Apr 3, 2010 at 5:08 PM, Benjamin Black wrote: > Right, you determine AZ by looking at the metadata.  us-east-1a is a > different AZ from us-east-1b.  You can't infer anything beyond that, > either with the

Re: Deployment on AWS and replication strategies

2010-04-04 Thread Mike Gallamore
Pluggable placement: that is cool. It wasn't something that was obvious to me that was available from the documentation I read. I thought maybe the the rackaware and rackunaware were hard coded in somewhere. I'm not a java developer so I haven't looked at the code much. That said I'll take a lo

Re: Deployment on AWS and replication strategies

2010-04-04 Thread Benjamin Black
On Sat, Apr 3, 2010 at 8:23 PM, Mike Gallamore wrote: >> > I didn't mean a real time determination, more of if the nodes aren't > identical. For example if you have a cluster made up of a bunch of EC2 light > instances and decide to add a large instance, it would be nice if the new > node would

Re: Deployment on AWS and replication strategies

2010-04-03 Thread Mike Gallamore
Hi Benjamin, Thanks for the reply. On 2010-04-03, at 8:12 PM, Benjamin Black wrote: > On Sat, Apr 3, 2010 at 3:41 PM, Mike Gallamore > wrote: >> >> Useful things that nodes could advertise: >> >> data-centre they are in, > > This is what the snitches do. Cool. > >> performance info: mem, CPU

Re: Deployment on AWS and replication strategies

2010-04-03 Thread Benjamin Black
On Sat, Apr 3, 2010 at 3:41 PM, Mike Gallamore wrote: > > Useful things that nodes could advertise: > > data-centre they are in, This is what the snitches do. > performance info: mem, CPU etc (these could be used to more intelligently > decide how to partition the data that the new node gets fo

Re: Deployment on AWS

2010-04-03 Thread Benjamin Black
Right, you determine AZ by looking at the metadata. us-east-1a is a different AZ from us-east-1b. You can't infer anything beyond that, either with the AWS API or guesses about IP addressing. My EC2 snitch recipe builds a config file for the property snitch that treats AZs like racks (just break

Re: Deployment on AWS and replication strategies

2010-04-03 Thread Mike Gallamore
Hi everyone, At my work we are in the early stages of moving our data which lives on EC2 machines from a Flare/memcache system to Cassandra so your chat has been interesting to me. I realize that this might complicate things and make things less "simple" but would it be useful for the nodes th

Re: Deployment on AWS

2010-04-03 Thread Joe Stump
On Apr 3, 2010, at 2:54 PM, Benjamin Black wrote: > I'm pretty familiar with EC2, hence the question. I don't believe any > patches are required to do these things. Regardless, as I noted in > that ticket, you definitely do NOT need AWS credentials to determine > your availability zone. It is

Re: Deployment on AWS

2010-04-03 Thread Benjamin Black
I'm pretty familiar with EC2, hence the question. I don't believe any patches are required to do these things. Regardless, as I noted in that ticket, you definitely do NOT need AWS credentials to determine your availability zone. It is available through the metadata web server for each instance

Re: Deployment on AWS

2010-04-03 Thread Joe Stump
On Apr 3, 2010, at 1:53 PM, Benjamin Black wrote: > What specific features are you looking for to operate on EC2? It seemed people weren't looking for features, but tools to help with the management. The two things we've created that people might be interested in are: 1. An EC2-specific rack-a

Re: Deployment on AWS

2010-04-03 Thread Benjamin Black
What specific features are you looking for to operate on EC2? b On Sat, Apr 3, 2010 at 1:37 PM, Lenin Gali wrote: > We are looking to take advantage of this as Well. Please let us know when it > is ready. > > Lenin > > On Sat, Apr 3, 2010 at 11:32 AM, Peter Chang wrote: >> >> Woot. Ver much lo

Re: Deployment on AWS

2010-04-03 Thread Lenin Gali
We are looking to take advantage of this as Well. Please let us know when it is ready. Lenin On Sat, Apr 3, 2010 at 11:32 AM, Peter Chang wrote: > Woot. Ver much looking forward to this stuff Joe. > > > On Sat, Apr 3, 2010 at 10:14 AM, Joe Stump wrote: > >> >> On Apr 2, 2010, at 4:49 PM, Masoo

Re: Deployment on AWS

2010-04-03 Thread Peter Chang
Woot. Ver much looking forward to this stuff Joe. On Sat, Apr 3, 2010 at 10:14 AM, Joe Stump wrote: > > On Apr 2, 2010, at 4:49 PM, Masood Mortazavi wrote: > > > Is there a ready recipe for deploying a Cassandra cluster in AWS? ... > (Seeds need some "fixed" IP addresses.) > > We have a lot of c

Re: Deployment on AWS

2010-04-03 Thread Joe Stump
On Apr 2, 2010, at 4:49 PM, Masood Mortazavi wrote: > Is there a ready recipe for deploying a Cassandra cluster in AWS? ... (Seeds > need some "fixed" IP addresses.) We have a lot of code around this that we're trying to get released. We have a rack aware strategy for cross-AZ clusters. We als

Re: Deployment on AWS

2010-04-02 Thread Benjamin Black
http://github.com/b/cookbooks/tree/master/cassandra/ On Fri, Apr 2, 2010 at 4:49 PM, Masood Mortazavi wrote: > > Is there a ready recipe for deploying a Cassandra cluster in AWS? ... (Seeds > need some "fixed" IP addresses.) > > Regards, > - m. > >

Deployment on AWS

2010-04-02 Thread Masood Mortazavi
Is there a ready recipe for deploying a Cassandra cluster in AWS? ... (Seeds need some "fixed" IP addresses.) Regards, - m.

Re: Separate disks with cloud deployment

2010-03-25 Thread Jonathan Ellis
as much as sharing the machine with a bunch of other users, so I wouldn't worry too much. On Thu, Mar 25, 2010 at 10:10 AM, Mark Greene wrote: > The FAQ page makes mention of using separate disks for the commit log and > data directory. How would one go about achieving this in a cloud

Re: Separate disks with cloud deployment

2010-03-25 Thread Ethan Rowe
On 03/25/2010 11:18 AM, Ethan Rowe wrote: [snip] I'll defer to the Rackspace folks regarding Rackspace Cloud; it has been I/O on average since you're dealing with a real, local disk. But I don't know about getting a second disk in that environment, though. That should have said "better I/O o

Re: Separate disks with cloud deployment

2010-03-25 Thread Ethan Rowe
On 03/25/2010 11:10 AM, Mark Greene wrote: The FAQ page makes mention of using separate disks for the commit log and data directory. How would one go about achieving this in a cloud deployment such as Rackspace cloud servers or EC2 EBS? Or is it just preferred to use dedicated hardware to get

Separate disks with cloud deployment

2010-03-25 Thread Mark Greene
The FAQ page makes mention of using separate disks for the commit log and data directory. How would one go about achieving this in a cloud deployment such as Rackspace cloud servers or EC2 EBS? Or is it just preferred to use dedicated hardware to get the optimal performance? Thanks In Advance

  1   2   >