+1 to Jan's "clustered" vs "non clustered".

If we clean up terminology, I suggest we also clarify the meaning and use
of Slice vs Shard vs Leader vs Replica vs Core. Here's my understanding:

I consider Slice == Shard (and would happily drop Slice): a logical concept
of a specific subset of a collection.
A Shard then has one or multiple copies of the data called Replicas (if a
shard has no copy of the data there's an issue). The Leader is one such
Replica. A shard with a replication factor of 1 has a single Replica that
happens to be the Leader. "Replica" does therefore not imply "replication".
A Core is an in memory instantiation of a disk index representing a
Replica. I believe that often the on disk index is referred to as "Core" as
well (I'm not bothered by this, there's no associated confusion IMO).

Overseer is a central place where a fair bit of the cluster management
logic is implemented today (Collection API, Autoscaling, Cluster state
change). It is therefore a cluster manager. Note that a different
implementation of "Clustered Solr" (a.k.a. SolrCloud) can most likely be
done without the need of a central process in addition to the already
centralized storage backend (currently ZooKeeper). In other words, Overseer
is not IMO the defining characteristic of SolrCloud, it is one
implementation choice, and there are others. To keep in mind for clarity
and to guide renaming.

On Fri, Jun 19, 2020 at 3:23 PM j.s. <jsay...@brandeis.edu> wrote:

> hi
>
> solr is very helpful.
>
> On 6/18/20 9:50 PM, Rahul Goswami wrote:
> > So +1 on "slave" being the problematic term IMO, not "master".
>
> but you cannot have a master without a slave, n'est-ce pas?
>
> i think it is better to use the metaphor of copying rather than one of
> hierarchy. language has so many (unintended) consequences ...
>
> good luck!
>

Reply via email to