HP XC System Software Administration Guide Version 3.2

D HP MCS Monitoring
You can monitor the optional HP Modular Cooling System (MCS) by using the Nagios interface.
During HP XC system installation, you generated an initialization file,
/opt/hptc/config/mcs.ini, which specifies the names and IP addresses of the MCS devices.
This file is used in the creation of the /opt/hptc/nagios/etc/mcs_local.cfg file, which
Nagios uses to monitor the MCS devices.
This appendix addresses the following topics:
“Customizing the Configuration for Your Installation” (page 311)
“Regenerating the Nagios MCS Configuration” (page 312)
“Useful Administrative Commands” (page 312)
“MCS Log Files” (page 313)
“Nagios Plug-Ins for MCS” (page 313)
D.1 Customizing the Configuration for Your Installation
You might need to customize the configuration of Nagios to monitor MCS devices for your site.
Use the following procedure to customize and verify the configuration:
1. Enter the following command to verify the status of your MCS devices:
# manage_mcs_status
Current status of automatic MCS drains/poweroffs:
Disnabled
Current MCS unit status:
mcs1: offline
mcs2: offline
2. Only the MCS devices in the online state will be monitored with Nagios. If your MCS
devices are set to offline, as they are in the example in step 1, and the MCS devices are
operating correctly, use the manage_mcs_status command to set their status to online:
# manage_mcs_status --online
mcs1 status set to 'online'
mcs2 status set to 'online'
3. Optionally, use the manage_mcs_status command again to verify that the MCS devices
are in the online state.
4. Examine the /opt/hptc/nagios/etc/nagios_vars.ini file to ensure that the MCS
values are appropriate for your site.
Perform the following steps if you changed the nagios_vars.ini file:
a. Propagate the nagios_vars.ini file to all the nodes in the HP XC system. For more
information, see Chapter 11 (page 139)
b. Restart Nagios. For more information, see “Stopping and Restarting Nagios” (page 115).
5. Examine the /opt/hptc/nagios/etc/templates/mcs_template.cfg file to ensure
that the warning and critical values for the MCS Cluster Monitor service are appropriate
for your site.
Specifically, examine the second and third (numeric) parameters on the check_command
line to ensure they, respectively, correctly specify the following:
The number of MCS hosts or services in the HP XC system that must be in a WARNING,
CRITICAL, or UNKNOWN state in order to return a WARNING status.
The number of MCS hosts or services in the HP XC system that must be in the CRITICAL
state in order to return a CRITICAL status.
Perform the following steps If you changed the
/opt/hptc/nagios/etc/templates/mcs_template.cfg file:
D.1 Customizing the Configuration for Your Installation 311