I am using a modified version of this Nagios check_crm script
to monitor my cluster:

  http://article.gmane.org/gmane.linux.highavailability.user/21849

It works fairly well by just parsing the output of crm_mon.

However it does not provide the failcount of resources since those
are unavailable from crm_mon.

I wrote a Ruby script to process the cib.xml directly and retrieve
the failcounts. This works well, but I would also like to duplicate
the information from crm_mon, namely which services are started
and on which node they are running.
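
For context, the failcount extraction can be sketched roughly like this.
It is a minimal illustration rather than the exact script, and it assumes
that fail counts show up in the status section of the CIB as nvpair
elements named "fail-count-<resource>" under each node_state element.
The helper name failcounts is made up for this example.

```ruby
require "rexml/document"

# Hypothetical helper: returns { node_name => { resource_id => failcount } }.
# Assumption: fail counts are stored in the CIB status section as
#   <nvpair name="fail-count-RESOURCE" value="N"/>
# somewhere below each <node_state> element.
def failcounts(cib_xml)
  doc = REXML::Document.new(cib_xml)
  result = Hash.new { |hash, key| hash[key] = {} }

  REXML::XPath.each(doc, "//status/node_state") do |node|
    # Prefer the human-readable node name, fall back to the node id.
    node_name = node.attributes["uname"] || node.attributes["id"]

    REXML::XPath.each(node, ".//nvpair") do |pair|
      attr_name = pair.attributes["name"].to_s
      next unless attr_name.start_with?("fail-count-")

      resource = attr_name.sub("fail-count-", "")
      # Keep the raw string, since the value may be non-numeric (e.g. INFINITY).
      result[node_name][resource] = pair.attributes["value"]
    end
  end

  result
end
```

It would be called with the contents of the file, for example
failcounts(File.read(path_to_cib)), where path_to_cib is wherever the
cib.xml lives on the node.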

How do I get that information from the cib.xml? I looked at the 
crm.dtd but it didn't help me too much. I also tried grokking the
source for crm_mon.c but I didn't get too far.

Can anyone provide pointers to code or documentation about how to 
extract the information that crm_mon displays from the cib.xml?

Thanks,
Jesse

_______________________________________________
Pacemaker mailing list
[email protected]
http://list.clusterlabs.org/mailman/listinfo/pacemaker
