We do this with a Unix/Linux shell script every minute. Keep a simple log in a plain text file. It notifies via text message if everything isn't perfect. Check the log to see which server didn't respond.
We actually test several paths - the status page of the midtiers gives you an end-to-end status and verifies that at least one combination server and web server works. That gives an availability metric. You can also use the api driver program from the command line to test different servers in a server group and even check each fast and list thread pool. #!/bin/ksh, it's a wonderful thing :-) Doug -- Doug Blair +1 224-558-5462 Sent from my iPhone6+ Auto-corrected typos, misspellings and non-sequiturs are gratefully attributed to Steve Jobs :-) > On Oct 16, 2014, at 2:52 PM, Joe D'Souza <[email protected]> wrote: > > ** > If you are on UNIX or LINUX you might be able to design a cron job that runs > every few minutes to do that using the output of arsignal –v. > > Joe > > From: Action Request System discussion list(ARSList) > [mailto:[email protected]] On Behalf Of Rob Dudley > Sent: Thursday, October 16, 2014 2:31 PM > To: [email protected] > Subject: Re: Open question to BMC on troubleshooting without the WUT > > ** > I really just wish that when users do become "hung", in the load balanced > web/server group environment, and it is one of the servers which is > unresponsive - that there would be some way that the server group would > recognize it and restart the armonitor processes... OR at least place > something in the log files which directs you to WHY the server locked up!! > > On Thu, Oct 16, 2014 at 2:21 PM, Rick Cook <[email protected]> wrote: > ** > Good ideas, Joe! Perhaps the arserver -v (or whatever it is now) that > returns the version would fit the bill - quick return, confirms that the > server is up, and almost zero performance impact. > > Rick Cook > > On Thu, Oct 16, 2014 at 11:19 AM, Joe D'Souza <[email protected]> wrote: > ** > How about issuing one of the command line utilities to each of the servers > such as maybe the arsignal –g that reloads the def cache. If a server is not > responding you would get a error 90. > > I do not remember all the switches to arsignal offhand but there should be > one that has less of an impact to a server than –g if you are worried about > the performance impact it may have to servers that are responding well. > > Or you could try to arcreateentry into a test form. > > Joe > > From: Action Request System discussion list(ARSList) > [mailto:[email protected]] On Behalf Of William Rentfrow > Sent: Thursday, October 16, 2014 2:00 PM > To: [email protected] > Subject: Re: Open question to BMC on troubleshooting without the WUT > > Yes and no. > > I've seen instances where users are locked up but the admin threads work (and > vice versa). So it's not the same test really as connecting as a user. > > William Rentfrow > [email protected] > Office: 715-204-3061 or 701-232-5697x25 > Cell: 715-498-5056 > > From: Action Request System discussion list(ARSList) > [mailto:[email protected]] On Behalf Of Arner, Todd > Sent: Thursday, October 16, 2014 12:56 PM > To: [email protected] > Subject: Re: Open question to BMC on troubleshooting without the WUT > > ** > Could you confirm the server is available by opening the server up in Dev > Studio? Just a thought. > > Todd Arner > > The information contained in this communication may be confidential, is > intended only for the use of the recipient(s) named above, and may be > protected under state or federal law. If the reader of this message is not > the intended recipient, you are hereby notified that any dissemination, > distribution, or copying of this communication, or any of its contents, is > strictly prohibited. If you have received this communication in error, please > forward the communication to [email protected] immediately and destroy or > delete the original message and any copy of it from your computer system. If > you have any questions concerning this message, please contact the sender. > No virus found in this message. > Checked by AVG - www.avg.com > Version: 2014.0.4765 / Virus Database: 4040/8371 - Release Date: 10/11/14 > _ARSlist: "Where the Answers Are" and have been for 20 years_ > _ARSlist: "Where the Answers Are" and have been for 20 years_ _ARSlist: > "Where the Answers Are" and have been for 20 years_ > > _ARSlist: "Where the Answers Are" and have been for 20 years_ > > _ARSlist: "Where the Answers Are" and have been for 20 years_ > _ARSlist: "Where the Answers Are" and have been for 20 years_ _______________________________________________________________________________ UNSUBSCRIBE or access ARSlist Archives at www.arslist.org "Where the Answers Are, and have been for 20 years"

