HP Insight Management Agents 9.2 Managing ProLiant Servers with Linux HOWTO White Paper Abstract This HOWTO provides instructions to help system administrators install, upgrade, and remove Version 8.4.
© Copyright 2002, 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein. Confidential computer software.
Contents 1 Software architecture..................................................................................5 HP System Health Application and Command Line Utilities(hp-health) .............................................5 Health Monitor....................................................................................................................7 System temperature monitoring..........................................................................................7 System fan monitoring........
A Error messages........................................................................................28 B Troubleshooting........................................................................................32 C hp-snmp-agents command lines and arguments............................................
1 Software architecture This section describes the features and architecture of the following HP Linux management software: • HP System Health Application and Command Line Utilities (hp-health) • Insight Management SNMP Agents for HP ProLiant Systems (hp-snmp-agents) • HP System Management Homepage (hpsmh) • Descriptions for HP management consoles for Linux HP System Health Application and Command Line Utilities(hp-health) The System Health Application and Command Line Utilities (hp-health) package
Table 1 hp-health applications (continued) Application Details Management Controller (BMC) that replaces the operating system–based software management functionality provided by the legacy hpasmd application. The hpasmxld application is not supported on ProLiant servers that have Integrated Lights-Out 3 (iLO 3) management controller. The corresponding hp-OpenIPMI package for ProLiant servers is available for download for select distributions at: http://h20000.www2.hp.com/bizsupport/TechSupport/Product.
Table 2 Controller, health-daemon, and kernel driver combinations (continued) Lights-Out controller Red Hat Kernel version Enterprise (uname -r) Linux hpilo hp-OpenIPMI module installed? available? Daemon iLO 3 NA NA NA Yes hpasmlited /dev/hpilo hpilo iLO 4 NA NA NA No hpasmlited /dev/ipmi0 distro IPMI iLO 4 NA NA NA Yes hpasmlited /dev/hpilo hpilo Dev file kernel driver Another source of information includes the following manpages provided with the hp-health package: • hp-health •
If the normal operating range is exceeded for any of these sensors, the Health Monitor does the following: • Displays a message on the console stating the problem • Makes an entry in the system health log and the operating system log Additionally, on some servers, the fans gradually increase to full speed in an attempt to cool the server as the external environment temperature increases.
The Health Monitor does the following: • Displays a message on the console stating the problem • Makes an entry in the system health log This server feature is configured via RBSU. On ProLiant servers that do not support AMP mirroring, an uncorrectable (double bit) memory error causes the operating system to halt abruptly. Logging of the error might not be possible if the error occurs in memory that the Health Monitor uses.
Table 3 hplog options Command Description hplog –t Shows the current temperature and the threshold levels of all temperature sensors hplog –f Shows the status of all fans hplog –p Shows the status of all power supplies hplog –v Shows the IML entries on the standard output For more information about these components, see the online documentation by entering: $ man hplog HP Unique Identifier Utility (hpuid) The HP Unique Identifier Utility (hpuid) allows local manipulation of the ProLiant Unique Id
Insight Management SNMP Agents for HP ProLiant Systems (hp-snmp-agents) The ProLiant Insight Management Agents provide proactive notification of server events through the HP Systems Insight Manager (HP SIM) console. Alternatively, the ProLiant Insight Management Agents allow the status of the server to be monitored or checked through a standard Web browser.
Table 6 Sub agents of the Server Agent (continued) Sub-agent Description Monitor. The System Health Agent executable is /opt/hp/hp-snmp-agents/server/bin/cmahealthd. System Rack Agent (cmarackd) The System Rack Agent gathers data for the Rack MIB for p Class and also monitors the rack health through system management.
Table 7 Subagents of the Storage Agent (continued) Sub-agent Description The IDA Agent is located in /opt/hp/hp-snmp-agents/storage/bin/cmaided. The suggested poll_time is 15 seconds (default). The minimum recommended poll_time is 5 seconds. IDE Agent (cmaided) The IDE Agent gathers data for the IDE MIB.
NIC Agent (cmanic) The NIC Agent collects information from network interface controllers at periodic intervals, makes the collected data available to the SNMP agent, and provides SNMP alerts. The NIC Agent gathers data for the NIC MIB from supported NIC device drivers. The data includes: • Physical mapping and configuration data for each network interface. • Network statistics for Ethernet interfaces. Information is provided for HP controllers. Limited information might be provided for third-party NICs.
Usage: cpqblru [-eql?] [-a address1,address2, . . .] [-c chassis1,chassis2,. . .] Table 9: “ProLiant BL Rack Upgrade Utility parameters” describes the parameters for the utility. Table 9 ProLiant BL Rack Upgrade Utility parameters Parameter Description -a address1,address2,… This optional parameter considers only enclosures with address1, address2, and so on. The list of addresses must consists of 16-bit quantities separated by commas. The addresses can be obtained by running –q.
HP ProLiant Channel Interface Device Driver for iLO/iLO 2/iLO 3/iLO 4 (hp-ilo) The HP ProLiant Channel Interface Device Driver for iLO/iLO 2/iLO 3/iLO 4 (hp-ilo) enables iLO data collection and integration with the ProLiant Management Agents and the rack infrastructure interface service. The driver enables communication routing of SNMP traffic from the ProLiant Management Agents through the dedicated iLO management NIC.
For installation information, see the SIM Linux Installation and Configuration Guide available at http://h18013.www1.hp.com/products/servers/management/hpsim/infolibrary.html.
2 Manual installation This section describes how to install, upgrade, and remove HP System Health Application and Command Line Utilities (hp-health) and Insight Management SNMP Agents for HP ProLiant Systems (hp-snmp-agents) packages. The latest versions of this software can be downloaded from http://hp.com/go/proliantlinux. Prerequisite: Installing package dependencies The software described in this HOWTO is distributed in standard package formats that provide prerequisite information internally.
$ man hp-health NOTE: The version number for the RPM file varies depending on the supported systems and functionality. The distribution refers to the Linux distribution supported by the RPM. The platform refers to the processor architecture the RPM was built to support. The RPM file has a binary compiled for the supported distribution with the default kernel. After the installation process, the health service is configured to automatically start each time your system boots.
Table 10 Uninstall drivers and agents commands Command Description # rpm –e hp-snmp-agents Removes the hp-snmp-agents package from your system # rpm –e hp-ilo Removes the hp-ilo package from your system # rpm –e hp-health Removes the hp-health package from your system # rpm –e hp-OpenIPMI Removes the hp-OpenIPMI package from your system CAUTION: If a service is running when the corresponding package is removed, it is automatically shut down during the removal process.
RPM provides the -U option to upgrade a package. For example, to upgrade hp-health to a newer version, you can use the command: # rpm –Uvh hp-health-...
3 Customization This section includes advanced topics on data-center customization. Configuration files The ProLiant Management Agents Configuration file /opt/hp/hp-snmp-agents/cma.conf is shared by all HP ProLiant Management Agents. Currently, exclude directives, taint directives, trap interface, trap email notification configuration, and base socket number (used by cmaX) are supported. The agents can send email notifications in addition to SNMP traps. The trap email entries in /opt/hp/hp-snmp-agents/cma.
# service snmpd restart # service hp-snmp-agents restart You can also manipulate the /opt/hp/hp-snmp-agents/cma.conf file, which contains one or more exclude directives. Any string after the exclude keyword is interpreted as an agent name that should not be started. Examples include: exclude cmahealthd exclude cmastdeqd These two lines exclude two agents from the start-up: the Health Agent (cmahealthd) and the Standard Equipment Agent (cmastdeqd).
Traps are configured via the standard SNMP configuration file (snmpd.conf). For the most current configuration information, see snmpd.conf manual page for the most current configuration information. When the snmpd.conf or snmpd.local.conf configuration files are changed or when the SNMPCONFPATH environment variable is changed, the cmanic daemon must be restarted.
4 Support and other resources Information to collect before contacting HP Be sure to have the following information available before you contact HP: • Software product name • Hardware product model number • Operating system type and version • Applicable error message • Third-party hardware and software • Technical support registration number (if applicable) How to contact HP Use the following methods to contact HP technical support: • In the United States, see the Customer Service / Contact HP
HP authorized resellers For the name of the nearest HP authorized reseller, see the following sources: • In the United States, see the HP U.S. service location website: http://www.hp.com/service_locator • In other locations, see the Contact HP worldwide website: http://welcome.hp.com/country/us/en/wwcontact.html Documentation feedback HP welcomes your feedback. To make comments and suggestions about product documentation, send a message to: docsfeedback@hp.
Term A term or phrase that is defined in the body text of the document, not in a glossary. User input Indicates commands and text that you type exactly as shown. Replaceable The name of a placeholder that you replace with an actual value. [] In command syntax statements, these characters enclose optional content. {} In command syntax statements, these characters enclose required content. | The character that separates items in a linear list of choices. ...
A Error messages Messages logged if an ASR event occurs are listed in Table 14 (page 28). Table 14 Error messages Message Number Details Message 1 Message 2 Message 3 Message 4 NMI-Automatic Server Recovery timer expiration – Hour %d-%d/%d/%d Description This message indicates that the Health Monitor detected an ASR timeout and is attempting to gracefully shut down the Operating System.
stopped, dependent applications like the Rack Firmware Upgrade Utility terminate as well. Table 15 (page 29)lists possible issues. Table 15 cpqriisd messages Message Number Details Message 1 Could not setup server semaphores Could not destroy server semaphores Up sem: Ioctl Failure! Down sem: Ioctl Failure! Get sem: Ioctl Failure! Set sem: Ioctl Failure! Message 2 Description These messages indicate that the synchronization objects called “semaphores”, cannot be set up correctly.
Table 15 cpqriisd messages (continued) Message Number Details with the HP ProLiant Rack Daemon (cmarackd). Restart cmarackd. If the problem persists, contact your HP field service engineer. Message 7 iLO exceeded the number of allotted back offs, is it stuck? Description iLO responds with a “backoff” command indicating a busy state, which is a temporary condition. If this condition lasts too long (5000 tries), the message appears.
Table 15 cpqriisd messages (continued) Message Number Details Description These messages indicate a problem that occurred during initialization of the service.
B Troubleshooting This section describes common problems that might occur during installation and operation of the HP ProLiant Management Software for Linux. Table 16 (page 32) describes issues and workarounds for the hp-health and hp-snmp-agents packages. Any problems reported to HP should include the following files: • /var/log/messages • /var/log/boot.log (for Red Hat Linux distributions) • /var/log/warn (for SuSE LINUX distributions) • /var/log/hp-snmp-agents/cma.
Table 16 Issues and workarounds for the hp-health and hp-snmp-agents packages (continued) Issue Number Details # Don’t log private authentication messages! * . info;mail .none;news .none;authpriv.
Table 17 Known issues with agents Issue Number Details Issue 1 Cannot manage server from Systems Insight Manager, grayed-out utilization button, or missing file system space used information in the mass storage window Workaround To work around this issue, complete the following steps: 1. Check if the network is reachable by pinging the server from the system running Systems Insight Manager 2.
Table 17 Known issues with agents (continued) Issue Number Details Workaround Information about the configuration of the device indicates that a SCSI controller is installed, but no further information is available.
Table 17 Known issues with agents (continued) Issue Number Details Next, generate a trap to localhost using the Linux “snmptrap” utility. The Linux command “snmptrapd –f –Le” should display the trap. Note that the recent versions of snmptrapd will not accept incoming notifications by default. See snmptrapd.conf(5) manpage for information on configuring access control settings to enable incoming notifications.
C hp-snmp-agents command lines and arguments Table 18 (page 37) lists the command lines and Table 19 (page 38) lists the command arguments for hp-snmp-agents.
NOTE: • All agents support –p, –s, and –t as start-up parameters. • Each agent has an associated run level script which is located in /opt/hp/ hp-snmp-agents//etc/. These individual scripts contain all important settings, such as poll-time arguments. Table 19 Command-line arguments for hp-snmp-agents 38 Command line argument Description -p poll_time Specifies the number of seconds to wait between data collection intervals.