Jason,

My initial reluctance to indexing directly to both data centers is that we
are doing a lot of bulk loading through CSV handler. We never get just 1
document at a time. It comes in large batch updates. And now we would have
to send the batch updates twice.

That is not to say that we won't go this way. But I am exploring other
solutions as well.


On Tue, Jun 25, 2013 at 11:21 AM, Jason Hellman <
jhell...@innoventsolutions.com> wrote:

> Kevin,
>
> I can imagine this working if you consider your second data center a pure
> slave relationship to your SolrCloud cluster.  I haven't tried it, but I
> don't see why the solrconfig.xml can't identify as a master allowing you to
> call any of your cores in the cluster to replicate out.  That being said,
> this idea doesn't facilitate a SolrCloud cluster in the second data
> center…just a slave that could be a repeater.
>
> You say that sending the data in both directions is not idea, but it works
> and is conceptually very simple.  What is the reasoning behind wanting to
> get away from that approach?
>
> Jason
>
> On Jun 25, 2013, at 10:07 AM, Kevin Osborn <kevin.osb...@cbsi.com> wrote:
>
> > We are going to have two datacenters, each with their own SolrCloud and
> > ZooKeeper quorums. The end result will be that they should be replicas of
> > each other.
> >
> > One method that has been mentioned is that we should add documents to
> each
> > cluster separately. For various reasons, this may not be ideal for us.
> > Instead, we are playing around with the idea of always indexing to one
> > datacenter. And then having that replicate to the other datacenter. And
> > this is where I am having some trouble on how to proceed.
> >
> > The nice thing about SolrCloud is that there is no masters and slaves.
> Each
> > node is equals, has the same configs, etc. But in this case, I want to
> have
> > a node in one datacenter poll for changes in another data center. Before
> > SolrCloud, I would have used slave/master replication. But in the
> SolrCloud
> > world, I am not sure how to configure this setup?
> >
> > Or is there any better ideas on how to use replication to push or pull
> data
> > from one datacenter to another?
> >
> > In my case, NRT is not a requirement. And I will also be dealing with
> about
> > 3 collections and 5 or 6 shards.
> >
> > Thanks.
> >
> > --
> > *KEVIN OSBORN*
> > LEAD SOFTWARE ENGINEER
> > CNET Content Solutions
> > OFFICE 949.399.8714
> > CELL 949.310.4677      SKYPE osbornk
> > 5 Park Plaza, Suite 600, Irvine, CA 92614
> > [image: CNET Content Solutions]
>
>


-- 
*KEVIN OSBORN*
LEAD SOFTWARE ENGINEER
CNET Content Solutions
OFFICE 949.399.8714
CELL 949.310.4677      SKYPE osbornk
5 Park Plaza, Suite 600, Irvine, CA 92614
[image: CNET Content Solutions]

Reply via email to