Document updates will fail with less than a quorum of ZK nodes, so you won't
be able to index anything while 1 of the 2 servers is down.
It's the one area that always seems counter-intuitive (to me, at any rate):
after all, you have your 2 Solr instances on 1 server, so you have all the
shard data, and logically you should be able to index using just that (and
if you had a single ZK running on that server, it would indeed be fine).
However, ZK needs a 3rd instance running somewhere in order to maintain its
majority rule.
The consensus I've seen tends to be to run a ZK on each of your cloud
servers, and then run some "outside" the cloud on other machines. If you had
a 3rd VM that just ran ZK and nothing else, you could lose any 1 of the 3
machines and still be OK. But if you lose 2, you are in trouble.
-----Original Message-----
From: James Dulin
Sent: Friday, May 31, 2013 10:28 PM
To: solr-user@lucene.apache.org
Subject: RE: 2 VM setup for SOLRCLOUD?
Thanks. When you say updates will fail, do you mean document updates will
fail, or updates to the cluster, like adding a new node? If adding new data
will fail, I will definitely need to figure out a different way to set this
up.
-----Original Message-----
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: Friday, May 31, 2013 4:33 PM
To: solr-user@lucene.apache.org
Subject: Re: 2 VM setup for SOLRCLOUD?
Be really careful here. Zookeeper requires a quorum, which is ((zk nodes) /
2) + 1 (integer division). So the problem here is that if (zk nodes) is 2,
both of them need to be up. If either of them is down, searches will still
work, but updates will fail.
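The quorum rule above can be sketched in a few lines; the class and method
names here are just illustrative, not anything from Solr or ZooKeeper:

```java
// Sketch of ZooKeeper's majority rule: an ensemble stays writable only
// while a strict majority (quorum) of its nodes is up.
public class ZkQuorum {

    // quorum = (ensembleSize / 2) + 1, using integer division
    static int quorum(int ensembleSize) {
        return ensembleSize / 2 + 1;
    }

    // How many node failures the ensemble can tolerate and still
    // keep a quorum.
    static int tolerableFailures(int ensembleSize) {
        return ensembleSize - quorum(ensembleSize);
    }

    public static void main(String[] args) {
        // 2-node ensemble: quorum is 2, so it tolerates zero failures --
        // losing either ZK blocks updates, which is the trap in a 2-VM setup.
        System.out.println(quorum(2) + " node(s) needed, tolerates "
                + tolerableFailures(2) + " failure(s)");
        // 3-node ensemble: quorum is 2, so one machine can die safely.
        System.out.println(quorum(3) + " node(s) needed, tolerates "
                + tolerableFailures(3) + " failure(s)");
    }
}
```

Note that going from 2 to 3 ZK nodes is what buys you the first tolerated
failure; going from 3 to 4 buys you nothing (quorum rises to 3 as well).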
Best
Erick
On Fri, May 31, 2013 at 11:39 AM, James Dulin <jdu...@crelate.com> wrote:
Thanks, I think that the load balancer will be simple enough to set up in
Azure. My only other current concern is having the zookeepers on the same
VMs as Solr. While not ideal, we basically just need simple redundancy, so
my theory is that if VM1 goes down, VM2 will have the shard, node, and
zookeeper to keep everything going smoothly.
-----Original Message-----
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: Friday, May 31, 2013 8:07 AM
To: solr-user@lucene.apache.org
Subject: Re: 2 VM setup for SOLRCLOUD?
Actually, you don't technically _need_ a load balancer; you could hard-code
all requests to the same node and, internally, everything would "just
work". But then you'd be _creating_ a single point of failure if that node
went down, so a fronting LB is usually indicated.
Perhaps the thing you're missing is that Zookeeper is there explicitly for
the purpose of knowing where all the nodes are and what their state is.
Solr communicates with ZK, and any incoming requests (update or query) are
handled appropriately; hence Jason's comment that once a request gets to
any node in the cluster, things are handled automatically.
All that said, if you're using SolrJ and use CloudSolrServer exclusively,
then the load balancer isn't necessary. Internally, CloudSolrServer (the
client) reads the list of accessible nodes from Zookeeper and will be
fault-tolerant and load-balance internally.
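A minimal sketch of that usage, assuming a SolrJ 4.x client, a collection
named "collection1", and a 3-node ZK ensemble; the zk1/zk2/zk3 host names
are placeholders, not anything from the setup discussed above:

```java
import org.apache.solr.client.solrj.impl.CloudSolrServer;
import org.apache.solr.common.SolrInputDocument;

public class CloudClientSketch {
    public static void main(String[] args) throws Exception {
        // The client is pointed at the ZK ensemble, not at any one Solr
        // node, so it learns the live-node list from ZK and routes
        // requests itself -- no external load balancer needed.
        CloudSolrServer server =
                new CloudSolrServer("zk1:2181,zk2:2181,zk3:2181");
        server.setDefaultCollection("collection1");

        SolrInputDocument doc = new SolrInputDocument();
        doc.addField("id", "1");
        server.add(doc);
        server.commit();

        server.shutdown();
    }
}
```

This requires a live cluster to actually run, of course; the point is only
that the constructor takes the ZK connect string rather than a Solr URL.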
Best
Erick
On Thu, May 30, 2013 at 3:51 PM, Jason Hellman
<jhell...@innoventsolutions.com> wrote:
Jamey,
You will need a load balancer on the front end to direct traffic into one
of your SolrCore entry points. Technically, it doesn't matter which one,
though you will find benefits to narrowing traffic to fewer nodes (for
purposes of better cache management).
Internally SolrCloud will round-robin distribute requests to other shards
once a query begins execution. But you do need an entry point externally
to be defined through your load balancer.
Hope this is useful!
Jason
On May 30, 2013, at 12:48 PM, James Dulin <jdu...@crelate.com> wrote:
Working to set up SolrCloud in Windows Azure. I have read over the
SolrCloud wiki, but am a little confused about some of the deployment
options. I am attaching an image for what I am thinking we want to do:
2 VMs that will have 2 shards spanning across them, 4 nodes total across
the two machines, and a zookeeper on each VM. I think this is feasible,
but I am a little confused about how each node knows how to respond to
requests (do I need a load balancer in front, or can we just reference
the "collection", etc.?)
Thanks!
Jamey