HP XC System Software Administration Guide Version 3.1

/opt/hptc/nagios/libexec/sensorData.dat
Contains patterns for alerting based on sensor results.
Nagios uses e-mail to send formatted alerts. The following is the default format of a Nagios alert:
Type: PROBLEM                      (1)
State: return code                 (2)
Service: service                   (3)
Host: node                         (4)
Address: IP Address                (5)
Info: message output               (6)
Date/Time: date and time stamp     (7)
Elapsed: time stamp                (8)
Number: number                     (9)
(1) The alert type: Problem or Recovery.
(2) The Nagios plug-in return code. The values for this code are:
        0       OK
        1       Warning
        2       Critical
        other   Unknown
(3) The name of the Nagios service description. See the corresponding template file:
    /opt/hptc/nagios/etc/templates/*_template.cfg.
(4) The host name to which the alert applies.
(5) The IP address of the host.
(6) The message output from the plug-in.
    In the following example, [n47] indicates that this data was collected by the Nagios monitor running
    on management hub n47. The message is from the plug-in associated with the Nagios service for the
    system event log, in this case, /opt/hptc/nagios/libexec/check_sel.
    [n47] Power Unit Power Redundancy Redundancy Lost
(7) A date and time stamp indicating when the event that caused the alert occurred.
(8) How long the message waited in the nand queue, that is, how much time elapsed before this message
    was mailed.
(9) The nand sequence number. The nand daemon receives and batches messages generated by Nagios
    and sends them by e-mail.
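The alert fields and return-code values described above can be read programmatically. The following is a minimal Python sketch, not part of HP XC or Nagios. It assumes the alert body arrives as plain "Field: value" lines and that the State field carries the numeric plug-in return code. The function names (state_name, parse_alert, split_hub_prefix) and every value in the sample alert are hypothetical.

```python
import re

# Plug-in return codes as listed above; any other value means Unknown.
STATE_NAMES = {0: "OK", 1: "Warning", 2: "Critical"}


def state_name(code):
    """Map a plug-in return code to its state name."""
    return STATE_NAMES.get(code, "Unknown")


def parse_alert(text):
    """Split 'Field: value' lines into a dict, using the first colon as the separator."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def split_hub_prefix(info):
    """Separate a leading '[node]' monitor tag from the plug-in message."""
    match = re.match(r"\[([^\]]+)\]\s*(.*)", info)
    if match:
        return match.group(1), match.group(2)
    return None, info


# Hypothetical sample alert; all values are made up for illustration.
SAMPLE = """Type: PROBLEM
State: 2
Service: System Event Log
Host: n12
Address: 192.0.2.12
Info: [n47] Power Unit Power Redundancy Redundancy Lost
Date/Time: 2006-10-02 14:05:31
Elapsed: 00:00:12
Number: 7
"""

alert = parse_alert(SAMPLE)
hub, message = split_hub_prefix(alert["Info"])
print(state_name(int(alert["State"])), hub, message)
```

Splitting on the first colon only keeps values that contain colons themselves, such as time stamps, intact.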
8.4.4 System Event Log Monitoring
This section explains the system event log and describes its configuration details.
8.4.4.1 Understanding the System Event Log
Each HP hardware platform supplies an event logging mechanism that captures platform-specific events to
track hardware state and changes. The content of the system event log varies by platform, but it typically
includes events such as the following:
Memory ECC errors
Power supply failures
Voltage problems
Event logs are stored by the firmware and can become full over time. Some platforms require regular
maintenance to clear the logs to avoid losing critical events. In addition, errors that indicate failure or
pending failure of a component need to be brought to the operator's immediate attention.
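To illustrate how event log entries that need operator attention might be separated from routine ones, the following is a minimal Python sketch. The keyword patterns, category names, and sample messages are assumptions made for this example only. They do not reproduce the contents of sensorData.dat or the logic of check_sel.

```python
import re

# Hypothetical patterns, one per event type named in the list above.
PATTERNS = [
    ("memory", re.compile(r"\bECC\b", re.IGNORECASE)),
    ("power", re.compile(r"power (supply|unit)", re.IGNORECASE)),
    ("voltage", re.compile(r"voltage", re.IGNORECASE)),
]


def classify(message):
    """Return the first matching category, or None for entries that match nothing."""
    for category, pattern in PATTERNS:
        if pattern.search(message):
            return category
    return None


# Made-up log messages for illustration only.
samples = [
    "Power Unit Power Redundancy Redundancy Lost",
    "Correctable ECC error on DIMM 3",
    "Fan 2 speed normal",
]
for text in samples:
    print(classify(text), "-", text)
```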