Re: [Beowulf] cluster deployment and config management

Joe Landman Tue, 05 Sep 2017 05:21:07 -0700

Good morning ...


On 09/05/2017 01:24 AM, Stu Midgley wrote:

Morning everyone
I am in the process of redeveloping our cluster deployment and configmanagement environment and wondered what others are doing?
First, everything we currently have is basically home-grown.

Nothing wrong with this, if it adequately solves the problem. Many ofthe frameworks people use for these things are highly opinionated, andoften, you'll find their opinions grate on your expectations. At$dayjob-1, I developed our own kit precisely because so many of theother toolkits did little to big things wrong; not simply from anopinion point of view, but actively made specific errors that thedevelopers glossed over as that aspect was unimportant to them ... whilebeing of critical importance to me and my customers at the time.

Our cluster deployment is a system that I've developed over the yearsand is pretty simple - if you know BASH and how pxe booting works. Ithas everything from setting the correct parameters in the bios, zfsram disks for the OS, lustre for state files (usually in /var) - allin the initrd.
We use it to boot cluster nodes, lustre servers, misc servers anddesktops.
We basically treat everything like a cluster.

The most competent baked distro out there for this was (in the past,haven't used it recently) Warewulf. See https://github.com/warewulf/ .Still under active development, and Greg and team do a generally greatjob. Least opinionated distro around, most flexible, and some of thebest tooling.

However... we do have a proliferation of images... and all need to bekept up-to-date and managed. Most of the changes from one image tothe next are config files.

Ahhh ... One of the things we did with our toolchain (it is open source,I've just never pushed it to github) was to completely separate bootingfrom configuration. That is, units booted to an operational statebefore we applied configuration. This was in part due to longexperience with nodes hanging during bootup with incorrectconfigurations. If you minimize the chance for this, your nodes(barring physical device failure) always boot. The only specificopinion we had w.r.t. this system was that the nodes had to be bootablevia PXE, and there fore a working dhcp needed to exist on the network.

Post boot configuration, we drove via a script that downloaded andlaunched other scripts. Since we PXE booted, network addresses werefine. We didn't even enforce final network address determination on PXEstartup.

We looked at the booting process as a state machine. Lower level wasraw hardware, no power. Subsequent levels were bios POST, PXE ofkernel, configuration phase. During configuration phase *everything*was on the table w.r.t. changes. We could (and did) alter networking,using programmatic methods, databases, etc. to determine and configurefinal network configs. Same for disks, and other resources.

Configuration changes could be pushed post boot by updating a script andeither pushing (not normally recommended for clusters of reasonablesize) or triggering a pull/run cycle for that script/dependencies.


This allowed us to update images and configuration asynchronously.

We had to manage images, but this turned out to be generally simple. Iwas in the midst of putting image mappings into a distributed objectstore when the company died. Config store is similarly simple, againusing the same mechanisms, and could be driven entirely programmatically.

Of course, for the chef/puppet/ansible/salt/cloudformation/... people,we could drive their process as well.

We don't have a good config management (which might, hopefully, reducethe number of images we need). We tried puppet, but it seems everyonehates it. Its too complicated? Not the right tool?

Highly opinionated config management is IMO (and yes, I am aware this isredundant humor) generally a bad idea. Config management that gets outof your way until you need it is the right approach. Which is why wenever tried to dictate what config management our users would use. Wesimply handled getting the system up to an operational state, and theycould use ours, theirs, or Frankensteinian kludges.

I was thinking of using git for config files, dumping a list of rpm's,dumping the active services from systemd and somehow munging all thattogether in the initrd. ie. git checkout the server to get configfiles and systemctl enable/start the appropriate services etc.
It started to get complicated.

Any feedback/experiences appreciated.  What works well?  What doesn't?

IMO things that tie together config and booting are problematic atscale. Leads to nearly unmanageable piles of images, as you'veexperienced. Booting to an operational state, and applying all configpost boot (ask me about my fstab replacement some day), makes for a verynice operational solution that scales wonderfully .... you can replicateimages to local image servers if you wish, replicate config servers,load balance the whole thing to whatever scale you need.


Thanks.



--
Dr Stuart Midgley
sdm...@gmail.com <mailto:sdm...@gmail.com>


_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf


--
Joe Landman
e: joe.land...@gmail.com
t: @hpcjoe
w: https://scalability.org
g: https://github.com/joelandman
l: https://www.linkedin.com/in/joelandman

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] cluster deployment and config management

Reply via email to