Hi,

In such setup, you would definitely want to use zookeeper in conjunction with Solr for at least two things :

a) deploy management / (having cron-based pulls for a 128-node setup could be a nightmare without appropriate sync guarantees, coordination, etc) b) configuration distribution / (having solr boxes registering on zookeeper and receiving dynamical configuration updates, etc - you would definitely want this in such setup)

You could trivially implement stuff like a) on sysadmin-level by wrapping up execution of appropriate solr bin/ scripts inside other pieces of code that schedule their execution by synchronizing on appropriate structures in zookeeper (there are command-line tools for basic zookeeper sync, which might help you get started)

b)-style problems could be solved much more elegant (than on sysadmin- level), by implementing Solr module which would use zookeeper's async api for the general procedures of conf-updates, core swaps, etc. However, as far as I know, no such solution currently exists ("dynamical refresh of Solr conf from zookeeper in runtime") - so I guess if you really want it... you would have to custom-code it (or just solve the problem the bunch-of-scripts-synchronizing-with- zookeeper-copying-files-and-restarting-solr-when-needed - way :) )

.Alek

On Oct 8, 2008, at 5:23 PM, Anoop Bhatti wrote:

Hello,

I am evaluating Solr on a small cluster (~8 nodes) and will likely
deploy it to a cluster of ~128 nodes.  Of course, configuration
management will be a big concern.  Has anyone used Zookeeper for a
Solr cluster?  I'm looking for any documentation, tutorials, etc. on
how to setup Zookeeper for a Solr cluster, and any architecture
recommendations.

Thanks,

Anoop Bhatti
--
Committed to open source technology.

Reply via email to