Walid wrote:
---------- Forwarded message ----------
From: *Walid* <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>>
Date: Aug 30, 2007 9:22 PM
Subject: Re: [Beowulf] Configuration change monitoring
To: "Robert G. Brown" <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>>

Hi Robert

On 8/30/07, *Robert G. Brown * <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>> wrote:

    To amplify Mark's remark a bit -- linux in general already has many
    fairly powerful tools for DOING monitoring, updates, and so on.  For
    example, one can use e.g. yum, kickstart, apt, and more to install a
    "canned" node configuration and keep it up to date.  It is so totally
    automatic that there is basically no need to "check for inconsistencies"
    on a node. Warewulf and several other tools also permit one to have
    rigorous control over node configuration.


Some of the inconsistencies we see is usually when one of the admin choses the easy routes, and does changes manually to a set of nodes, and does not update the installation files (our installations are rpm kickstart based), others are becuase we manage quite a large variants and by mistake or intention configurations files that are not supposed to be on cluster A appears on Cluster B.

    Monitoring tools abound, of course, ranging from things like syslog-ng
    for centralized monitoring/logging of LAN systems activity to nightly
    ... ...
    Is there something else one needs to do?  Well, most cluster admins tend
    to be fairly skilled linux administrators and good at shell script magic
    or even real programming.  So if one has an edge-case need that isn't
    directly met by one of the available tools, it is usually a fairly
simple matter to hack out a script to accomplish it.

That is one reason why they want to push some commercial tools, They want to minimize the amount of customizations that are done locally by the team, as some writes in bash, another writes in Perl, and the other prefers Python, in the long run there is an issue of maintaining the scripts, and knowing which set of scripts belongs to which cluster, and what kind of modifications are needed for new clusters,.etc all of which is documented, but that is another story

More than likely you'll end up with commercial "solutions" that preclude a rapid response to a problem and take your system down. A good admin team communicates and you don't see confusion at the level you are describing.

gerry
--
Gerry Creager -- [EMAIL PROTECTED]
Texas Mesonet -- AATLT, Texas A&M University
Cell: 979.229.5301 Office: 979.862.3982 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to