It's true that SolrCloud is adding some complexity.
But few observations :

SolrCloud has some disadvantages and can't beat the easiness and simpleness
> of
> Master Slave Replica. So I can only encourage to keep Master Slave Replica
> in future versions.


I agree, it can happen situations when you have really simple and not
critical systems.
Anyway old style replication is still used in SolrCloud, so I think it is
going to stay for a while ( until is replaced with something else) .

To answer to Gian :

One of the problem I've found is that I've not found a simple way to backup
> the content of a collection to restore in situation of disaster recovery.
> With simple master / slave scenario we can use the replication handler to
> generate backups that can be easily used to restore content of a core,
> while with SolrCloud is not clear how can we obtain a full backup


To be fair, Disaster recovery is when SolrCloud shines.
If you lose random nodes across your collection, you simply need to fix
them and spin up again .
The system will automatically restore the content to the last version
available ( the tlog first and the  leader ( if the tlog is not enough)
will help the dead node to catch up .
If you lose all the replicas for a shard and you lose the content in disk
of all this replicas ( index and tlog), SolrCloud can't help you.
For this unlikely scenarios a backup is suggested.
You could restore anyway the backup only to one node, and the replicas are
going to catch up .

Probably is just a matter of backupping every shard with standard
> replication handler and then restore each shard after recreating the
> collection


Definitely not, SolrCloud is there to avoid this manual stuff.

Cheers


On 14 January 2016 at 08:58, Gian Maria Ricci - aka Alkampfer <
alkamp...@nablasoft.com> wrote:

> I agree that SolrCloud has not only advantages, I really understand that
> it offers many more features, but it introduces some complexity.
>
> One of the problem I've found is that I've not found a simple way to
> backup the content of a collection to restore in situation of disaster
> recovery. With simple master / slave scenario we can use the replication
> handler to generate backups that can be easily used to restore content of a
> core, while with SolrCloud is not clear how can we obtain a full backup.
> Probably is just a matter of backupping every shard with standard
> replication handler and then restore each shard after recreating the
> collection, but I've not found (probably I need to search better) official
> documentation on backup / restore procedures for SolrCloud.
>
> Thanks.
>
> --
> Gian Maria Ricci
> Cell: +39 320 0136949
>
>
> -----Original Message-----
> From: Bernd Fehling [mailto:bernd.fehl...@uni-bielefeld.de]
> Sent: giovedì 14 gennaio 2016 08:22
> To: solr-user@lucene.apache.org
> Subject: Re: Pro and cons of using Solr Cloud vs standard Master Slave
> Replica
>
> SolrCloud has some disadvantages and can't beat the easiness and
> simpleness of Master Slave Replica. So I can only encourage to keep Master
> Slave Replica in future versions.
>
> Bernd
>
> Am 13.01.2016 um 21:57 schrieb Jack Krupansky:
> > The "Legacy Scaling and Distribution" section of the Solr Reference
> > Guide also gives info elated to so-called master-slave mode:
> > https://cwiki.apache.org/confluence/display/solr/Legacy+Scaling+and+Di
> > stribution
> >
> > Also, although the old master-slave mode is still technically
> > supported in the sense that the code and doc is still there, You won't
> > be able to get the level of community support  here on the mailing
> > list as you can get for SolrCloud.
> >
> > Unless you're simply trying to decide whether to leave an old legacy
> > system as-is with the old distributed mode, nobody should be
> > considered a fresh new distributed Solr deployment with anything other
> than SolrCloud.
> >
> > (Hmmm... have any of the committers considered deprecating the old
> > non-SolrCloud distributed mode features?)
>
> -1
>
> >
> > -- Jack Krupansky
> >
> > On Wed, Jan 13, 2016 at 9:02 AM, Shivaji Dutta
> > <sdu...@hortonworks.com>
> > wrote:
> >
> >> - SolrCloud uses zookeeper to manage HA
> >>         - Zookeeper is a standard for all HA in Apache Hadoop
> >> - You have collections which will manage your shards across nodes
> >> - SolrJ Client is now fault tolerant with CloudSolrClient
> >>
> >> This is the way future direction of the product will go.
> >>
> >>
> >>
> >> On 1/13/16, 5:58 AM, "Gian Maria Ricci - aka Alkampfer"
> >> <alkamp...@nablasoft.com> wrote:
> >>
> >>> Thanks.
> >>>
> >>> --
> >>> Gian Maria Ricci
> >>> Cell: +39 320 0136949
> >>>
> >>>
> >>>
> >>> -----Original Message-----
> >>> From: Shawn Heisey [mailto:apa...@elyograg.org]
> >>> Sent: lunedì 11 gennaio 2016 18:28
> >>> To: solr-user@lucene.apache.org
> >>> Subject: Re: Pro and cons of using Solr Cloud vs standard Master
> >>> Slave Replica
> >>>
> >>> On 1/11/2016 4:28 AM, Gian Maria Ricci - aka Alkampfer wrote:
> >>>> a customer need a comprehensive list of all pro and cons of using
> >>>> standard Master Slave replica VS using Solr Cloud. I¹m interested
> >>>> especially in query performance consideration, because in this
> >>>> specific situation the rate of new documents is really slow, but
> >>>> the amount of data is about 50 millions of document, and the index
> >>>> size on disk for single core is about 30 GB.
> >>>
> >>> The primary advantage to SolrCloud is that SolrCloud handles most of
> >>> the administrative and operational details for you automatically.
> >>>
> >>> SolrCloud is a little more complicated to set up initially, because
> >>> you must worry about Zookeeper as well as Solr, but once it's
> >>> properly set up, there is no single point of failure.
> >>>
> >>>> Such amount of data should be easily handled by a Master Slave
> >>>> replica with a  single core replicated on a certain number of
> >>>> slaves, but we need to evaluate also the option of SolrCloud,
> >>>> especially for fault tolerance.
> >>>>
> >>>
> >>> Once you're beyond initial setup, fault tolerance with SolrCloud is
> >>> much easier than master/slave replication.  Switching a slave to a
> >>> master is possible, but the procedure is somewhat complicated.
> >>> SolrCloud does not
> >>> *have* masters, it is a true cluster.
> >>>
> >>> With master/slave replication, the master handles all indexing, and
> >>> the finished index segments are copied to the slaves via HTTP, and
> >>> the slaves simply need to open them.  SolrCloud does indexing on all
> >>> shard replicas, nearly simultaneously.  Usually this is an
> >>> advantage, not a disadvantage, but in heavy indexing situations
> >>> master/slave replication
> >>> *might* show better performance on the slaves.
> >>>
> >>> Thanks,
> >>> Shawn
> >>>
> >>>
> >>
> >>
> >
>
>


-- 
--------------------------

Benedetti Alessandro
Visiting card : http://about.me/alessandro_benedetti

"Tyger, tyger burning bright
In the forests of the night,
What immortal hand or eye
Could frame thy fearful symmetry?"

William Blake - Songs of Experience -1794 England

Reply via email to