I think Mark has given this as a presentation a few times, maybe https://www.youtube.com/watch?v=eVK0wLkLw9w
On Thu, May 30, 2024 at 3:03 AM Ilan Ginzburg <ilans...@gmail.com> wrote: > Hi, > > I'm looking for a blog post or documentation giving a good overview of the > architecture of SolrCloud (not the user experience and use of Solr). The > audience would be engineers knowledgeable about distributed systems that > know nothing about SolrCloud (side note: I looked for something similar for > Elasticsearch and didn't find either). > > Documentation that will touch on the index structure > (collection/shard/replica), the way they're materialized as cores, the > coordination done by ZooKeeper (shard leader élections). How these replicas > interact with each other and the update log in steady state and in recovery > scenarios. > The Overseer role (I wrote the detailed overseer doc > https://github.com/apache/solr/blob/main/dev-docs/overseer/overseer.adoc) > and how it interacts with ZooKeeper, the storage of all metadata in > ZooKeeper, the way nodes start, the way cluster state is propagated and > managed (watches for collections with replicas on a node vs all other > collection) etc. > > If anybody knows about a doc that gives a complete overview, I'm very > interested. > Otherwise might end up writing it 🤓 > > Thanks, > Ilan >