On 3/24/2017 7:47 AM, vrindavda wrote:
> In my case query rate will be average or say low, 100-120 concurrent
> requests.

That is not a low query rate. A low query rate would be X queries per second, where X is a small single-digit number. If there are 100 *simultaneous* requests, then the query rate is likely at least several hundred per second, which is very high. Handling that many requests per second with an index of the size you have mentioned is almost certainly going to require more than two servers/replicas.

> As per my understanding replica too aid shards in getting result documents,
> correct if I am wrong.

SolrCloud will automatically load balance requests sent to a single server across the cloud, taking advantage of multiple replicas. Depending on what kind of client software is in use, a separate load balancer might still be a good idea, so that the single IP address and port the clients connect to isn't a single point of failure. If you have software that can move the IP address to another machine in the event of a failure, that would probably be enough.

Thanks,
Shawn
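
The step from "100 simultaneous requests" to "several hundred queries per second" can be sketched with Little's Law (throughput = requests in flight / average time per request). This is only a rough illustration: the function name and the latency figures below are assumptions, not measurements from the system discussed in this thread.

```python
def queries_per_second(concurrent_requests: int, avg_latency_seconds: float) -> float:
    """Little's Law: throughput = requests in flight / average time per request."""
    if avg_latency_seconds <= 0:
        raise ValueError("average latency must be positive")
    return concurrent_requests / avg_latency_seconds


# Hypothetical average query latencies; real values depend on the index and hardware.
for latency_ms in (100, 250, 500):
    qps = queries_per_second(100, latency_ms / 1000)
    print(f"{latency_ms} ms average latency: roughly {qps:.0f} queries/second")
```

Under these assumed latencies, 100 requests in flight at all times works out to a few hundred to about a thousand queries per second, which is why sustained concurrency at that level is a high query rate.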