HP XC System Software Administration Guide Version 2.1

ManualsBrandsHP ManualsSoftwareHP XC 1 Processor LTU

Monitoring the System

System monitoring can identify situations th at can become problems later. This chapter

discusses the following topics:

• An overview of the m o nitoring to ols (Sec

tion 6.2)

• How to observe system environmental data (Section 6.3)

• How to view system statistics (Section 6.4)

• How to customize Nagios metrics gathe

ring (Section 6.5)

• Organization of t he logging of node events and how to view those events (Section 6.6)

6.1 Monitoring Strategy

The H P XC system monitoring strateg y is built on the Nagios system, the Supermon monitors,

and the syslog-ng logging system. I t is designed in a tiered setup to allow for vary ing levels of

granularity in logging and monito ring information.

For monitoring pu rpo ses, the nodes can be considered as a hierarchy, where local nodes talk

to manage men t hubs (regional nodes), which in turn talk to a central man agem ent console

(global node). Figure 6-1 shows this structure.

Figure 6-1: Tiered Structure for Node Events

Local

Node

Local

Node

Local

Node

Local

Node

Local

Node

Local

Node

Global

Node

Regional

Node

Regional

Node

6.2 Monitoring Tools

The HP XC System Software includes t he shownode metrics command in addition to

standard Linux monitoring commands. The HP XC system also includes the Nagios Web-based

utility fo r s ystem monitoring.

6.2.1 Commands for Monitoring Node Status

The shownode metrics comm and, which can be issued from any node in the HP XC

system, provides the ability t o mon ito r the status of all the nodes in t he system.

The following arguments to the shownode metrics co mmand monitor the node status:

• shownode metrics cpus

• shownode metrics cputotals

• shownode metrics load

• shownode metrics mem

Monitoring the System 6-1