That's a tradeoff for you to make based on your own requirements, but the point is that it is LESS SAFE to run zookeeper on the same machine as a Solr instance.

Also keep in mind that the goal is to have at least THREE zookeeper instances running at any moment, so if you run zookeeper on the same machine as a Solr instance, you will need more than three zookeepeers. Figure three plus the MAXIMUM number of Solr nodes that you expect could be down simultaneously.

Also keep in mind that SolrCloud is about scaling, but the intention is NOT to scale the zookeeper ensemble linearly with the number of Solr nodes. That means you would have to deal with the messiness of sometimes running zookeeper with Solr and sometimes not. So, unless you are running a very small SolrCloud cluster, you are much better off keeping zookeeper off your Solr machines.

The intent is that there will be a relatively small "ensemble" of zookeepers that service a large "army" or "armada" of Solr nodes.

-- Jack Krupansky

-----Original Message----- From: Marcin Rzewucki
Sent: Wednesday, November 21, 2012 5:06 PM
To: solr-user@lucene.apache.org
Subject: Re: SolrCloud and external Zookeeper ensemble

Yes, I meant the same (not -zkRun). However, I was asking if it is safe to
have zookeeper and solr processes running on the same node or better on
different machines?

On 21 November 2012 21:18, Rafał Kuć <r....@solr.pl> wrote:

Hello!

As I told I wouldn't use the Zookeeper that is embedded into Solr, but
rather setup a standalone one.

--
Regards,
 Rafał Kuć
 Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch

> First of all: thank you for your answers. Yes, I meant side by side
> configuration. I think the worst case for ZKs here is to loose two of
them.
> However, I'm going to use 4 availability zones in same region so at > least
> this will reduce the risk of loosing both of them at the same time.
> Regards.

> On 21 November 2012 17:06, Rafał Kuć <r....@solr.pl> wrote:

>> Hello!
>>
>> Zookeeper by itself is not demanding, but if something happens to your
>> nodes that have Solr on it, you'll loose ZooKeeper too if you have
>> them installed side by side. However if you will have 4 Solr nodes and
>> 3 ZK instances you can get them running side by side.
>>
>> --
>> Regards,
>>  Rafał Kuć
>>  Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch -
ElasticSearch
>>
>> > Separate is generally nice because then you can restart Solr nodes
>> > without consideration for ZooKeeper.
>>
>> > Performance-wise, I doubt it's a big deal either way.
>>
>> > - Mark
>>
>> > On Nov 21, 2012, at 8:54 AM, Marcin Rzewucki <mrzewu...@gmail.com>
>> wrote:
>>
>> >> Hi,
>> >>
>> >> I have 4 solr collections, 2-3mn documents per collection, up to >> >> 100K
>> >> updates per collection daily (roughly). I'm going to create
SolrCloud4x
>> on
>> >> Amazon's m1.large instances (7GB mem,2x2.4GHz cpu each). The
question is
>> >> what about zookeeper? It's going to be external ensemble, but is it
>> better
>> >> to use same nodes as solr or dedicated micro instances? Zookeeper
does
>> not
>> >> seem to be resources demanding process, but what would be better in
this
>> >> case ? To keep it inside of solrcloud or separately (micro instances
>> seem
>> >> to be enough here) ?
>> >>
>> >> Thanks in advance.
>> >> Regards.
>>
>>



Reply via email to