I am using a modified version of this Nagios check_crm script to monitor my cluster:

http://article.gmane.org/gmane.linux.highavailability.user/21849

It works fairly well by simply parsing the output of crm_mon. However, it does not report the failcount of resources, since those are not available from crm_mon.

I wrote a Ruby script that processes cib.xml directly and retrieves the failcounts. This works well, but I would also like to reproduce the information from crm_mon, namely which services are started and which node each one is running on.

How do I get that information from cib.xml? I looked at crm.dtd, but it didn't help much. I also tried grokking the source of crm_mon.c, but I didn't get very far.

Can anyone provide pointers to code or documentation on how to extract the information that crm_mon displays from cib.xml?

Thanks,
Jesse

_______________________________________________
Pacemaker mailing list
http://list.clusterlabs.org/mailman/listinfo/pacemaker
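
For readers who want to see what the failcount extraction mentioned above might look like, here is a minimal sketch. It is written in Python, not the Ruby script from the post, which is not shown. It assumes that fail counts appear in the CIB status section as nvpair elements named fail-count-<resource> under each node_state's transient_attributes; the exact layout can differ between Pacemaker versions, so check it against your own cib.xml. The sample XML, node names, and resource names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical CIB fragment for illustration only; a real cib.xml has many
# more sections and attributes, and its layout may vary by Pacemaker version.
SAMPLE_CIB = """
<cib>
  <status>
    <node_state id="node1" uname="node1">
      <transient_attributes id="node1">
        <instance_attributes id="status-node1">
          <nvpair id="status-node1-fail-count-apache" name="fail-count-apache" value="3"/>
          <nvpair id="status-node1-probe_complete" name="probe_complete" value="true"/>
        </instance_attributes>
      </transient_attributes>
    </node_state>
    <node_state id="node2" uname="node2">
      <transient_attributes id="node2">
        <instance_attributes id="status-node2">
          <nvpair id="status-node2-fail-count-vip" name="fail-count-vip" value="INFINITY"/>
        </instance_attributes>
      </transient_attributes>
    </node_state>
  </status>
</cib>
"""

# Assumed naming convention for fail-count attributes.
PREFIX = "fail-count-"


def fail_counts(cib_xml):
    """Return {(node_name, resource_name): value} for every fail-count nvpair.

    Values are kept as strings because the count may be "INFINITY".
    """
    root = ET.fromstring(cib_xml.strip())
    counts = {}
    for node in root.iter("node_state"):
        uname = node.get("uname")
        # Only look at transient attributes, not other parts of node_state.
        for attrs in node.findall("transient_attributes"):
            for nvpair in attrs.iter("nvpair"):
                name = nvpair.get("name", "")
                if name.startswith(PREFIX):
                    resource = name[len(PREFIX):]
                    counts[(uname, resource)] = nvpair.get("value")
    return counts


if __name__ == "__main__":
    for (node, resource), value in sorted(fail_counts(SAMPLE_CIB).items()):
        print(f"{resource} on {node}: failcount={value}")
```

To run this against a live cluster instead of the sample string, the XML printed by cibadmin -Q could be passed to fail_counts(), assuming it follows the same layout.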
