Re: cores vs. instances vs. zookeeper vs. cloud vs ?

John Blythe Fri, 16 Dec 2016 13:07:51 -0800

ahhhh, there's a lightbulb. your last paragraph cleared up confusion i was
carrying through the responses (and, incidentally, was likely the reason
for confusion on the ZK question you couldn't make sense of). i was
thinking zookeeper was a separate means of handling things from solrcloud,
two entirely different approaches to scaling out. very much helpful to see
how off balance i was on that assumption!


thanks shawn-

-- 
*John Blythe*
Product Manager & Lead Developer

251.605.3071 | j...@curvolabs.com
www.curvolabs.com

58 Adams Ave
Evansville, IN 47713

On Fri, Dec 16, 2016 at 3:31 PM, Shawn Heisey <apa...@elyograg.org> wrote:

> On 12/16/2016 10:30 AM, John Blythe wrote:
> > thanks, erick. this is helpful. a few questions for clarity's sake, but
> > first: nope, not using SolrCloud as of yet.
> >
> >    - if i start using SolrCloud i could have my current multi-core setup
> >    (e.g. "transactions", "opportunities", etc.) exist within the
> appropriate
> >    collection. so instead of dev-transactions i'd have a 'dev'
> collection that
> >    has a 'transactions' core inside of it?
>
> No.  You would not be thinking in terms of cores at all.  When your
> programs talk to SolrCloud, they will only care about collections.
>
> Some terminology clarification: Collections are made up of one or more
> shards.  Shards are made up of one or more replicas.  Each shard replica
> is a core.  One replica for each shard is elected as leader.  If there's
> only one replica, then there's no redundancy, and that replica becomes
> leader.
>
> For the example you gave, you would have dev-transactions and
> prod-transactions collections.  Each of these collections might have
> shard replicas (cores) on completely different machines in the cloud ...
> or they might be on the same machines.  During normal operation, you
> would never access a core directly.  You'd probably only ever do that if
> something went very wrong and you needed to take very unusual steps to
> fix it or figure out what went wrong.
>
> >    - this seems to be the same with ZK, too?
>
> No idea what you're asking here.  Perhaps it should be obvious, but I
> can't figure it out.
>
> >    - i'm totally fine w separate/diff indexing. the demo collection, for
> >    instance, *has* to be separate from production bc the data has been
> >    stitched together from various customers' accounts on prod and
> blinded so
> >    that we have avoid privacy issues and can have all the various goodies
> >    under one demo account rather than separate ones. is the separate
> indexing
> >    happening out of the box w Cloud or something it's even capable of?
>
> Again, I don't really know what you're asking with "separate indexing".
> Different collections are separate from each other, just like cores in
> standalone mode.  Each collection is linked to a configuration in
> Zookeeper, which all of its shard replicas (cores) will use.  You could
> have all your collections pointing to the same config.  Some (or all) of
> them could point to completely different configs, too.
>
> Addressing a later question:  You don't have SolrCloud if you don't have
> zookeeper.  ZK is a requirement.  You don't really interact directly
> with zookeeper when you're using SolrCloud.  It's an administrative
> detail in the *setup* of SolrCloud.
>
> Thanks,
> Shawn
>
>

Re: cores vs. instances vs. zookeeper vs. cloud vs ?

Reply via email to