One other fault-tolerance issue is that you'll need at least one replica
per shard. As I understand it, at least *one* machine has to be running
for each shard for the cluster to work.

This doesn't address the shardId issue, but is something to keep in
mind when testing.

Best
Erick

On Wed, Feb 29, 2012 at 2:30 AM, Ranjan Bagchi <ranjan.bag...@gmail.com> wrote:
> Hi,
>
> I'm interested in setting up a solr cluster where each machine [at least
> initially] hosts a separate shard of a big index [too big to sit on the
> machine].  I'm able to put a cloud together by telling it that I have (to
> start out with) 4 nodes, and then starting up nodes on 3 machines pointing
> at the zkInstance.  I'm able to load my sharded data onto each machine
> individually and it seems to work.
>
> My concern is that it's not fault tolerant:  if one of the non-zookeeper
> machines falls over, the whole cluster won't work.  Also, I can't create a
> shard with more data, and have it work within the existing cloud.
>
> I tried using -DshardId=shard5 [on an existing 4-shard cluster], but it
> just started replicating, which doesn't seem right.
>
> Are there ways around this?
>
> Thanks,
> Ranjan Bagchi

Reply via email to