Online Diagnostics (EMS and STM) Administrator's Guide September 2012

1 Introduction
This chapter introduces you to the Online Diagnostics software and the tools that it contains. This
chapter addresses the following topics:
“Overview”
“Hardware Monitoring”
“Support Tools Manager”
“OnlineDiag Bundle Media”
Overview
The Online Diagnostics software is a collection of tools that enables you to monitor and test server
hardware. It comprises the Event Monitoring Service (EMS) framework, the EMS Hardware Monitors,
and the Support Tools Manager (STM).
EMS is a framework that supports hardware monitoring. The EMS Hardware Monitors monitor
server hardware and notify users of errors that occur in the monitored devices. STM comprises
support tools that you can use to run tests on hardware resources. Online Diagnostics is supported
only on servers running HP-UX.
Hardware Monitoring
This section addresses the following topics:
“Event Monitoring Service”
“EMS Hardware Monitors”
“Startup Client”
“Hardware Monitoring Request Manager”
“Multiple-View and Non-Multiple-View Monitors”
“Event Tracking Methods”
“Peripheral Status Monitor”
“Architecture”
“Products Supported by EMS Hardware Monitors”
Event Monitoring Service
The Event Monitoring Service (EMS) is a framework that supports hardware monitoring. The EMS
framework supports the EMS Hardware Monitors, the EMS High-Availability (HA) Monitors, and
other monitors, including third-party monitors. Using EMS, you can manage the Peripheral Status
Monitor (PSM) and EMS HA Monitor requests.
The EMS framework provides notification methods, such as sending e-mail notifications to the
root user and communicating with HP OpenView.
EMS Hardware Monitors
EMS Hardware Monitors are daemons that proactively monitor hardware devices such as CPUs,
memory, and hard disks. These monitors work with the EMS framework to detect and report
hardware problems. When a monitor detects an error or abnormal behavior, it generates an
event and reports it to EMS. The event includes details such as a summary of the event, the
cause of the error, and the recommended action. EMS then uses the monitoring request to
determine whether and how to deliver the event.
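The flow described above (a monitor generates an event, and a monitoring request decides whether and how it is delivered) can be illustrated with a minimal Python sketch. This is a conceptual model only, not the EMS implementation or its API. The class names, the field names, the numeric severity level, and the "email" method label are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class HardwareEvent:
    """Illustrative event record; field names are assumptions, not the EMS format."""
    device: str
    summary: str
    cause: str
    recommended_action: str
    severity: int  # assumed numeric level; higher means more severe


@dataclass
class MonitoringRequest:
    """Illustrative request: which events to deliver, and by which method."""
    min_severity: int  # hypothetical filter criterion
    method: str        # hypothetical label, for example "email"
    target: str


def deliver(event: HardwareEvent, request: MonitoringRequest) -> Optional[str]:
    """Describe the delivery, or return None if the request filters the event out."""
    if event.severity < request.min_severity:
        return None  # "whether": the request does not ask for this event
    # "how": the request names the notification method and its target
    return f"{request.method} -> {request.target}: {event.summary}"


event = HardwareEvent(
    device="disk0",
    summary="Disk read errors detected",
    cause="Media degradation",
    recommended_action="Replace the disk",
    severity=3,
)

# A request that accepts this event, and one that filters it out.
print(deliver(event, MonitoringRequest(min_severity=2, method="email", target="root")))
print(deliver(event, MonitoringRequest(min_severity=4, method="email", target="root")))
```

In this sketch the same event is delivered under the first request and suppressed under the second, which mirrors the idea that the request, not the monitor, controls notification.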