Fault Monitoring on Windows Integrity Servers

Error type [Hardware]    Threshold    Message
hours
Predictive Failure in Memory
Severity=Warning
HelpText=You will receive this message if the memory system
is observing a lot of corrected ECC errors from a DIMM. The
specified DIMM may need to be serviced.
Contact your HP support representative to check the affected
hardware.
[zx2 systems]
Per system: 24 errors in 24 hours
Event ID 5814
Significant numbers of corrected memory errors have been
detected on the memory subsystem
Severity=Warning
HelpText=You will receive this message if the system is
observing a lot of corrected ECC errors in memory. This
could be caused by problems with the system's memory or
by unexpected environmental conditions inside the server.
Contact your HP support representative to check your
memory system.
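
The per-system threshold above (24 corrected errors in 24 hours) can be pictured as a sliding-window count. The following is a minimal sketch, not the monitoring software's actual implementation: the class name `CorrectedErrorWindow` and its `record` method are hypothetical, and it assumes the event fires once the count reaches the limit and that timestamps arrive in increasing order.

```python
from collections import deque
from datetime import datetime, timedelta

# Values taken from the zx2 per-system threshold listed above.
ERROR_LIMIT = 24
WINDOW = timedelta(hours=24)


class CorrectedErrorWindow:
    """Hypothetical helper: counts corrected errors inside a sliding window."""

    def __init__(self, limit=ERROR_LIMIT, window=WINDOW):
        self.limit = limit
        self.window = window
        self._stamps = deque()

    def record(self, when):
        """Record one corrected error; return True if the threshold is reached.

        Assumption: "reached" means count >= limit within the window.
        """
        self._stamps.append(when)
        # Drop errors that have aged out of the window.
        cutoff = when - self.window
        while self._stamps and self._stamps[0] <= cutoff:
            self._stamps.popleft()
        return len(self._stamps) >= self.limit
```

With this sketch, errors arriving every 30 minutes reach the threshold on the 24th error, while errors arriving every 2 hours never do, because at most 12 of them fall inside any 24-hour window.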
Double-chip sparing
[sx2000 systems]
1 in 24 hours
When ACTIVE_ERASURES > 1
Event ID 7120
Double DRAM chip sparing events have been invoked.
Severity=Warning
HelpText=System firmware has detected and corrected
memory errors. Double DRAM chip sparing events have
been invoked to help mitigate this condition. On the next
reboot, system firmware will test the specified memory
components, and may take them offline if the errors persist.
Contact your HP support representative to check the
specified memory.
Fabric error
[sx2000 systems]
Note: The fabric connector types designated below are internal HP hardware designations.
When you contact your HP support representative to resolve the issue, these labels will
help identify the source of the problem.
Fabric connector type:
Moab-skyline
Skyline-moab
Skyline-skyline
Threshold:
alrec_spare_sel = non-zero
Event ID 7121
A platform error was detected by the firmware/hardware,
and corrected by using a spare channel.
Severity=Warning
HelpText=Hardware has detected many link retries on one of
its channels and has switched to a spare one.
Monitor the situation and contact your HP support
representative to check the affected hardware.
Fabric connector type:
moab-moab
Threshold:
alrec_spare_sel = non-zero
Event ID 7127
A platform error was detected by the firmware/hardware,
and corrected by using a spare channel.
Severity=Warning
HelpText=The error occurred between the crossbar chips on
the backplane(s).
Contact your support representative to have the backplane
and / or connections between backplanes checked.
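
As an illustration of how the two fabric entries above relate connector type and threshold to an event ID, here is a small lookup sketch. It is an assumption-laden example, not HP's implementation: the function name `fabric_event_id` and the idea of passing `alrec_spare_sel` as an integer are hypothetical, and only the connector types and event IDs listed above are included.

```python
# Connector-type to event-ID pairs as listed in the entries above.
FABRIC_EVENT_IDS = {
    "moab-skyline": 7121,
    "skyline-moab": 7121,
    "skyline-skyline": 7121,
    "moab-moab": 7127,
}


def fabric_event_id(connector_type, alrec_spare_sel):
    """Return the event ID to expect, or None if no event applies.

    Assumption: alrec_spare_sel is available as an integer, and any
    non-zero value means a spare channel has been selected.
    """
    if alrec_spare_sel == 0:
        return None
    # Normalize case, since the labels appear with mixed capitalization.
    return FABRIC_EVENT_IDS.get(connector_type.strip().lower())
```

For example, `fabric_event_id("Moab-skyline", 1)` yields 7121, and any connector type with `alrec_spare_sel` equal to 0 yields `None`.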
Fabric connector type:
skyline-escalante
Event ID 7128
A platform error was detected by the firmware/hardware,