Re: Running Solr in a cluster - high availability only

Jack Krupansky Mon, 15 Jul 2013 05:56:00 -0700

* Go with SolrCloud - unless you think you're smarter than Yonik and Mark Miller.

* "Replicas" are used for both query capacity and resilience (HA).

* "Shards" are used for increased index capacity (number of documents) and to reduce query latency (parallel processing of portions of a query).

* You need at least three zookeepers for HA. They need to be external to the cluster in production.

* Load balancing - you need to do your own testing to confirm whether you need it. If so, that is outside of Solr.

* SolrCloud automatically recovers nodes when they come back up.
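
As a rough illustration of the "at least three zookeepers" point: a ZooKeeper ensemble keeps working only while a strict majority of its members is up. The arithmetic can be sketched as below. The function names are made up for this illustration and are not part of Solr or ZooKeeper.

```python
def quorum_size(ensemble_size: int) -> int:
    """Smallest strict majority of an ensemble with this many members."""
    if ensemble_size < 1:
        raise ValueError("ensemble_size must be at least 1")
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size: int) -> int:
    """How many members can be down while a majority is still reachable."""
    return ensemble_size - quorum_size(ensemble_size)


if __name__ == "__main__":
    # Compare small ensemble sizes side by side.
    for n in range(1, 6):
        print(f"{n} zookeeper(s): quorum={quorum_size(n)}, "
              f"can lose {tolerated_failures(n)}")
```

Under this majority rule, one or two zookeepers tolerate no failures at all, three tolerate one, and a fourth adds nothing over three.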


-- Jack Krupansky

-----Original Message-----
From: Mysurf Mail
Sent: Monday, July 15, 2013 8:32 AM
To: solr-user@lucene.apache.org
Subject: Running Solr in a cluster - high availability only

Hi,
I would like to run two Solr instances on different computers as a cluster.
My main interest is high availability - meaning that if one server crashes
or is down, there will always be another one.

(My performance on a single instance is great. I do not need to split the
data across two servers.)

Questions:
1. What is the best practice?
   Is it different than clustering for index splitting? Do I need Shards?
2. Do I need ZooKeeper?
3. Is it a container-based configuration (different for Jetty and Tomcat)?
4. Do I need an external NLB for that?
5. When one computer comes back up after crashing, how does it update its index?
