HP Insight Management Agents 9.1 Managing ProLiant Servers with Linux HOWTO Whitepaper Abstract This HOWTO provides instructions to help system administrators install, upgrade, and remove Version 8.4.
© Copyright 2011, 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein. Confidential computer software.
Contents 1 Software architecture..................................................................................5 System Health application and Command Line utilities(hp-health) ...................................................5 Health monitor....................................................................................................................7 System temperature monitoring..........................................................................................7 System fan monitoring.....
A Error messages........................................................................................27 B Troubleshooting........................................................................................31 C hp-snmp-agents command lines and arguments............................................
1 Software architecture This section describes the features and architecture of the following HP Linux management software: • HP System Health Application and Command Line Utilities (hp-health) • Insight Management SNMP Agents for HP ProLiant Systems (hp-snmp-agents) • HP System Management Homepage (hpsmh) • Descriptions for HP management consoles for Linux System Health application and Command Line utilities(hp-health) The System Health Application and Command Line Utilities (hp-health) package col
Table 1 hp-health applications (continued) Application Details controller contains an Intelligent Platform Management Interface (IPMI) Version 2.0 Base Management Controller (BMC) that replaces the operating system-based software management functionality provided by the legacy hpasmd application. The hpasmxld application is not supported on ProLiant servers that have Integrated Lights-Out 3 (iLO 3) management controller.
Table 2 Controller, health-daemon, and kernel driver combinations (continued) Lights-Out Controller Kernel Version (uname -r) Red Hat Enterprise Linux Hp-OpenIPMI hpilo module installed? available? daemon iLO 3 NA NA NA Yes hpasmlited /dev/hpilo hpilo iLO 4 NA NA NA No hpasmlited /dev/ipmi0 distro IPMI iLO 4 NA NA NA Yes hpasmlited /dev/hpilo hpilo dev file kernel driver Another source of information includes the following manpages provided with the hp-health package: • hp-health •
If the normal operating range is exceeded for any of these sensors, the Health Monitor does the following: • Displays a message on the console stating the problem • Makes an entry in the system health log and the operating system log Additionally, on some servers, the fans gradually increase to full speed in an attempt to cool the server as the external environment temperature increases.
The Health Monitor does the following: • Displays a message on the console stating the problem • Makes an entry in the system health log This server feature is configured using RBSU. On ProLiant servers that do not support AMP mirroring, an uncorrectable (double bit) memory error causes the operating system to halt abruptly. Logging of the error might not be possible if the error occurs in memory used by the Health Monitor.
Table 3 hplog options Command Description hplog –t Shows the current temperature and the threshold levels of all temperature sensors hplog –f Shows the status of all fans hplog –p Shows the status of all power supplies hplog –v Shows the IML entries on the standard output For more information about these components, see the online documentation by entering: $ man hplog HP Unique Identifier Utility (hpuid) The HP Unique Identifier Utility (hpuid) allows local manipulation of the ProLiant Unique Id
Insight Management SNMP Agents for HP ProLiant (hp-snmp-agents) The ProLiant Insight Management Agents provide proactive notification of server events through the HP Systems Insight Manager console. Alternatively, the ProLiant Insight Management Agents allow the status of the server to be monitored or checked using a standard Web browser.
Table 6 Sub-agents of the Server Agent (continued) Sub-agent Description Monitor. The System Health Agent executable is /opt/hp/hp-snmp-agents/server/bin/cmahealthd. System Rack Agent (cmarackd) System Rack agent gathers data for the Rack MIB for P-Class and also monitors the rack health through system management.
Table 7 Sub-agents of the Storage Agent (continued) Sub-agent Description • ATAPI device information • SATA devices The IDE Agent is located in /opt/hp/hp-snmp-agents/storage/bin/cmaided. The suggested poll_time is 15 seconds (default). The minimum recommended poll_time is 5 seconds. FCA Agent (cmafcad) The FCA Agent gathers data for the FCA MIB.
Performance agent (cmaperfd) The Performance Agent Daemon (cmaperfd) collects CPU, Memory, Disk and Network Interface Controller performance data at periodic intervals and makes the collected data available to the HPSIM. Data Collection agent Data Registries are composed of standard Linux directories and associated files. Each file in the data registry is a logical object containing "n" related data items.
Table 9 ProLiant BL Rack Upgrade Utility parameters Parameter Description -a address1,address2,… This optional parameter considers only enclosures with address1, address2, and so on. The list of addresses must be composed of 16-bit quantities separated by commas. The addresses can be obtained by running –q (see below). No white spaces are allowed in between the commas and the addresses. If a no comma-separated list is given, all possible addresses in the rack are considered.
interface service. The driver enables communication routing of SNMP traffic from the ProLiant Management Agents through the dedicated iLO management NIC. For documentation on Integrated Lights-out which is supported by the iLO management interface driver, visit http://h18013.www1.hp.com/products/servers/management/remotemgmt.html. See the QuickSpecs for each product to determine the servers and the operating systems supported.
2 Manual installation This section describes how to install, upgrade, and remove HP System Health Application and Command Line Utilities (hp-health) and Insight Management SNMP Agents for HP ProLiant Systems (hp-snmp-agents) packages. The latest versions of this software can be downloaded from http://hp.com/go/proliantlinux Prerequisite: Installing package dependencies The software described in this HOWTO is distributed in standard package formats that provide prerequisite information internally.
$ man hp-health NOTE: The version number for the RPM file varies depending on the supported systems and functionality. The distribution refers to the Linux distribution supported by the RPM. The platform refers to the processor architecture the RPM was built to support. The RPM file has a binary compiled for the supported distribution with the default kernel. After the installation process, the health service is configured to automatically start each time your system boots.
Table 10 Uninstall drivers and agents commands Command Description # rpm –e hp-snmp-agents Removes the hp-snmp-agents package from your system # rpm –e hp-ilo Removes the hp-ilo package from your system # rpm –e hp-health Removes the hp-health package from your system # rpm –e hp-OpenIPMI Removes the hp-OpenIPMI package from your system CAUTION: If a service is running when the corresponding package is removed, it is automatically shut down during the removal process.
RPM provides the -U option to upgrade a package. For example, to upgrade hp-health to a newer version you could use the command: # rpm –Uvh hp-health-...
3 Customization This section includes advanced topics on data center customization. Configuration files The ProLiant Management Agents Configuration file /opt/hp/hp-snmp-agents/cma.conf is shared by all HP ProLiant Management Agents. Currently, exclude directives, taint directives, trap interface, trap email notification configuration, and base socket number (used by cmaX) are supported. The agents are capable of sending email notifications in addition to SNMP traps.
# service hp-snmp-agents restart You can also manipulate the /opt/hp/hp-snmp-agents/cma.conf file which contains one or more exclude directives. Any string after the exclude keyword is interpreted as an agent name that should not be started. Examples include: exclude cmahealthd exclude cmastdeqd These two lines exclude two agents from the startup: the Health Agent (cmahealthd) and the Standard Equipment Agent (cmastdeqd).
Traps are configured using the standard SNMP configuration file (snmpd.conf). See the snmpd.conf manual page for the most current configuration information. When the snmpd.conf or snmpd.local.conf configuration files are changed or when the SNMPCONFPATH environment variable is changed, the cmanic daemon must be restarted. If your operating system has an active firewall configuration, external SNMP requests might be rejected by the system, which prevents remote management operation.
Support and other resources Information to collect before contacting HP Be sure to have the following information available before you contact HP: • Software product name • Hardware product model number • Operating system type and version • Applicable error message • Third-party hardware and software • Technical support registration number (if applicable) How to contact HP Use the following methods to contact HP technical support: • In the United States, see the Customer Service / Contact HP Un
HP authorized resellers For the name of the nearest HP authorized reseller, see the following sources: • In the United States, see the HP U.S. service location website: http://www.hp.com/service_locator • In other locations, see the Contact HP worldwide website: http://welcome.hp.com/country/us/en/wwcontact.html Documentation feedback HP welcomes your feedback. To make comments and suggestions about product documentation, send a message to: mailto:docsfeedback@hp.
Replaceable The name of a placeholder that you replace with an actual value. [] In command syntax statements, these characters enclose optional content. {} In command syntax statements, these characters enclose required content. | The character that separates items in a linear list of choices. ... Indicates that the preceding element can be repeated one or more times.
A Error messages Messages logged if an ASR event occurs are listed in Table 14 (page 27). Table 14 Error messages Message Number Details Message 1 Message 2 Message 3 Message 4 NMI-Automatic Server Recovery timer expiration – Hour %d-%d/%d/%d Description This message indicates that the Health Monitor detected an ASR timeout and is attempting to gracefully shut down the Operating System.
stopped, dependent applications like the Rack Firmware Upgrade Utility terminate as well. Table 15 (page 28)lists possible issues. Table 15 cpqriisd messages Message Number Details Message 1 Could not setup server semaphores Could not destroy server semaphores Up sem: Ioctl Failure! Down sem: Ioctl Failure! Get sem: Ioctl Failure! Set sem: Ioctl Failure! Message 2 Description These messages indicate that the synchronization objects called “semaphores”, cannot be set up correctly.
Table 15 cpqriisd messages (continued) Message Number Details with the HP ProLiant Rack Daemon (cmarackd). Restart cmarackd. If the problem persists, contact your HP field service engineer. Message 7 iLO exceeded the number of allotted back offs, is it stuck? Description iLO responds with a “backoff” command indicating a busy state, which is a temporary condition. If this condition lasts too long (5000 tries), the message appears.
Table 15 cpqriisd messages (continued) Message Number Details Description These messages indicate a problem that occurred during initialization of the service.
B Troubleshooting This section describes common problems that might occur during installation and operation of the HP ProLiant Management Software for Linux. Table 16 (page 31) describes issues and workarounds for the hp-health and hp-snmp-agents packages. Any problems reported to HP should include the following files: • /var/log/messages • /var/log/boot.log (for Red Hat Linux distributions) • /var/log/warn (for SuSE LINUX distributions) • /var/log/hp-snmp-agents/cma.
Table 16 Issues and workarounds for the hp-health and hp-snmp-agents packages (continued) Issue Number Details # Don’t log private authentication messages! * . info;mail .none;news .none;authpriv.
Table 17 Known issues with agents Issue Number Details Issue 1 Cannot manage server from Systems Insight Manager, grayed-out utilization button, or missing file system space used information in the mass storage window Workaround To work around this issue, complete the following steps: 1. Check if the network is reachable by pinging the server from the system running Systems Insight Manager 2.
Table 17 Known issues with agents (continued) Issue Number Details Workaround Information about the configuration of the device indicates that a SCSI controller is installed, but no further information is available.
Table 17 Known issues with agents (continued) Issue Number Details Next, generate a trap to localhost using the Linux “snmptrap” utility. The Linux command “snmptrapd –f –Le” should display the trap. Note that the recent versions of snmptrapd will not accept incoming notifications by default. See snmptrapd.conf(5) manpage for information on configuring access control settings to enable incoming notifications.
C hp-snmp-agents command lines and arguments Table 18 (page 36) lists the command lines and Table 19 (page 37) lists the command arguments for hp-snmp-agents.
NOTE: • All agents support –p, –s and –t as startup parameters • Each agent has an associated run level script which is located in /opt/hp/ hp-snmp-agents//etc/. All important settings such as poll time arguments are contained in these individual scripts. Table 19 Command line arguments for hp-snmp-agents Command line argument Description -p poll_time Specifies the number of seconds to wait between data collection intervals.