HP Insight Control for Linux 6.0 User Guide

Table 14-2 Services monitored on managed systems (continued)
Function/DescriptionService name
Links to any consolidated log messages that match patterns in the /opt/hptc/
nagios/etc/syslogAlertRules file.
Syslog Alerts
1
Links to any System Event Log messages that match patterns in the /opt/hptc/
nagios/etc/selRules file.
The System Event Log is collected through the management processor, either an
iLO or an IPMI BMC. System Events are hardware-related alerts such as memory
errors, power supply faults, and so on.
System Event Log
1
Displays the system free space in /root, /tmp, /var, and /hptc_cluster. This
data is compared to thresholds defined in the nagios_vars.ini file.
System Free Space
2
Reports static system configuration information for a single system such as server
type, memory, and processors.
Configuration
1
1 This information is collected without agents, thus it is available for any host that is being monitored by Insight
Control for Linux.
2
This service uses mond to collect its data.
14.5 Understanding Nagios alert messages
HP Insight Control for Linux provides several value-added plug-ins that can generate alert
messages based on patterns provided by various data sources, such as syslog and the Hardware
System Event logs.
All these plug-ins use a common syntax to describe patterns and status to report on matches.
The rules that trigger alarms are configured in the following files:
/opt/hptc/nagios/etc/selRules
Contains patterns for alerting on System Event Log messages.
You can modify the selRules file as follows:
Add a rule to this file for a new alert.
Modify the corresponding rule to change an existing alert.
Comment out a rule to remove the corresponding alert.
/opt/hptc/nagios/etc/syslogAlertRules
Contains patterns for alerting on consolidated log entries.
/opt/hptc/nagios/libexec/sensorData.dat
Contains patterns for alerting based on sensor results.
Nagios uses email to send formatted alerts. The following is the default format of a Nagios alert:
Type: PROBLEM
1
State: return code
2
Service: service
3
Host: system
4
Address: IP Address
5
Info: message output
6
Date/Time: date and time
7
Elapsed: time
8
Number: number
9
1
Valid values are PROBLEM or RECOVERY.
2
The Nagios plug-in return code; the values for this code are as follows:
0 OK
166 Using graphical tools to monitor managed systems