Fault Monitoring on Windows Integrity Servers
Page 1
Introduction
Insight Management Agents is a utility for HP Integrity servers that monitors hardware and
software to provide two key pieces of information: 1) component inventory and 2) server
health. This paper describes the agents that monitor server health.
The HP Integrity Insight Management Agents comprise many individual components that
monitor system activity and performance to detect errors and record information in system log
files. In addition, the agents notify system managers of potentially dangerous situations (for
example, escalating memory failures) by sending alerts from the log files and traps. Figure 1
illustrates the general organization and flow of information in the package.
Figure 1 — HP Integrity Insight Management Agents – System Overview
Manageability Firmware
Environmental
Errors
IPMI
System Event Log
(SEL)
System Firmware
SAL_GET_STATE_ERRORS
Corrected errors
passed to OS
System
Errors
Windows
System
Log
Log
Files
SNMP
Traps
SMH
E-Mail
Windows OS
WMI
CMC / CPE
records
Insight Management
Agents
Event
Subsystem
Predictive
Failure Monitor
MCA Monitor
Loop-back
Event
Service
Foundation
Agent
Storage
Agent
NIC
Agent
Server
Agent
Exclusive HP Integrity Server features
The following sections describe each component agent in detail by answering three
questions: 1) what fault conditions does each agent monitor (type of errors); 2) how does the
agent monitor and record errors (methodology); and 3) what data is recorded or distributed to
system managers (output).
Note: This paper covers only the System Network Management Protocol (SNMP)
implementation in the HP Integrity server and the associated support provided by
HP Insight Management Agents.
Agents that support Web-Based Enterprise Management (WBEM) are not covered
here.