Fault Monitoring on Windows Integrity Servers
Error type
[Hardware]
Threshold Message
• escalante-skyline
Threshold:
• alrec_spare_sel =
non-zero
and corrected by using a spare channel.
Severity=Warning
HelpText=The error occurred between the cell controller and
the IO controller.
Contact your support representative to have the Cell
Controller to IO Interface checked. Check these FRUs:
Cell-IO cable, IO chassis, IO backplane, cell, or system
backplane
Fabric connector type:
• Moab-skyline
• skyline-moab
• skyline-skyline
Threshold:
• Alrec_err_count >= 2047
Event ID 7129
Multiple platform errors were detected and corrected by the
firmware/hardware.
Severity=Warning
HelpText=The errors occurred between the cell controller and
the backplane.
Contact your support representative to have the Cell
Controller to backplane Interface checked.
Fabric connector type:
• moab-moab
Threshold:
• alrec_err_count >= 2047
Event ID 7130
Multiple platform errors were detected and corrected by the
firmware/hardware.
Severity=Warning
HelpText=The errors occurred between the crossbar chips on
the backplane(s).
Contact your support representative to have the backplane
and / or connections between backplanes checked.
Fabric connector type:
• skyline-escalante
• escalante-skyline
Threshold:
Alrec_err_count >= 2047
Event ID 7131
Multiple platform errors were detected and corrected by the
firmware/hardware.
Severity=Warning
HelpText=The errors occurred between the cell controller and
the IO controller.
Contact your support representative to have the Cell
Controller to IO Interface checked. Check these FRUs:
Cell-IO cable, IO chassis, IO backplane, cell, or system
backplane.
Enhanced Thermal
Management
(ETM) error
[Systems with
Montecito
processors]
70 errors in 24 hours Event ID 7228
Over-Temperature or power condition detected for processor
Severity=Warning
HelpText=The processor temperature or power has exceeded
normal limits. The Enhanced Thermal Management (ETM)
feature of the processor has been employed to reduce the
chip power to allow operation within normal limits. If this
condition persists, this will have an adverse impact on the
performance of this processor.
Check that there are no issues with the cooling, and if the
problem persists contact HP service.
Intel Cache Safe
Technology
Performance error
[Systems with
1 error in 24 hours Event ID 7343
CPU performance degraded due to excessive errors in third
level cache
Page 7