HP XC System Software Administration Guide Version 3.1

D HP MCS Monitoring
You can monitor the optional HP Modular Cooling System (MCS) by using the Nagios interface.
During HP XC system installation, you generated an initialization file, /opt/hptc/config/mcs.ini,
which specifies the names and IP addresses of the MCS devices. This file is used in the creation of the
/opt/hptc/nagios/etc/mcs_local.cfg file, which Nagios uses to monitor the MCS devices.
This appendix addresses the following topics:
“Customizing the Configuration for Your Installation” (page 273)
“Regenerating the Nagios MCS Configuration” (page 274)
“Useful Administrative Commands” (page 274)
“MCS Log Files” (page 275)
“Nagios Plug-Ins for MCS” (page 275)
D.1 Customizing the Configuration for Your Installation
You might need to customize the configuration of Nagios to monitor MCS devices for your site. Use the
following procedure to customize and verify the configuration:
1. Enter the following command to verify the status of your MCS devices:
# manage_mcs_status
Current status of automatic MCS drains/poweroffs:
Disnabled
Current MCS unit status:
mcs1: offline
mcs2: offline
2. Only the MCS devices in the online state will be monitored with Nagios. If your MCS devices are
set to offline, as they are in the example in step 1, and the MCS devices are operating correctly, use
the manage_mcs_status command to set their status to online:
# manage_mcs_status --online
mcs1 status set to 'online'
mcs2 status set to 'online'
3. Optionally, use the manage_mcs_status command again to verify that the MCS devices are in the
online state.
4. Examine the /opt/hptc/nagios/etc/nagios_vars.ini file to ensure that the MCS values are
appropriate for your site.
Perform the following steps if you changed the nagios_vars.ini file:
a. Propagate the nagios_vars.ini file to all the nodes in the HP XC system. For more information,
see Chapter 10 (page 129)
b. Restart Nagios. For more information, see “Stopping and Restarting Nagios” (page 110).
5. Examine the /opt/hptc/nagios/etc/templates/mcs_template.cfg file to ensure that the
warning and critical values for the MCS Cluster Monitor service are appropriate for your site.
Specifically, examine the second and third (numeric) parameters on the check_command line to
ensure they, respectively, correctly specify the following:
The number of MCS hosts or services in the HP XC system that must be in a WARNING,
CRITICAL, or UNKNOWN state in order to return a WARNING status.
The number of MCS hosts or services in the HP XC system that must be in the CRITICAL state
in order to return a CRITICAL status.
Perform the following steps If you changed the
/opt/hptc/nagios/etc/templates/mcs_template.cfg file:
D.1 Customizing the Configuration for Your Installation 273