Server Administrator Version 7.
Notes and Cautions
NOTE: A NOTE indicates important information that helps you make better use of your computer.
CAUTION: A CAUTION indicates potential damage to hardware or loss of data if instructions are not followed.

Information in this document is subject to change without notice.
© 2012 Dell Inc. All rights reserved.
Reproduction of these materials in any manner whatsoever without the written permission of Dell Inc. is strictly forbidden.
Contents
1 Introduction
What’s New in this Release
Sample Event Message Text
Viewing Alerts and Event Messages
Viewing Events in Microsoft Windows Server 2008
Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server

Chassis Intrusion Messages
Redundancy Unit Messages
Power Supply Messages
Memory Device Messages
Fan Enclosure Messages
AC Power Cord Messages
Hardware Log Sensor Messages
Processor Sensor Messages
Pluggable Device Messages

Voltage Sensor Events
Fan Sensor Events
Processor Status Events
Power Supply Events
Memory ECC Events
BMC Watchdog Events
Memory Events
1 Introduction
Server Administrator generates event messages stored primarily in the operating system or Server Administrator event logs and sometimes in Simple Network Management Protocol (SNMP) traps. This document describes the event messages that are created by Server Administrator version 7.1 and displayed in the Server Administrator alert log. Server Administrator creates events in response to sensor status changes and other monitored parameters.
What’s New in this Release
None

Messages Not Described in This Guide
This guide describes only event messages logged by Server Administrator and Storage Management that are displayed in the Server Administrator alert log.
Server Administrator generates events based on status changes in the following sensors:
• Temperature Sensor: Helps protect critical components by alerting the systems management console when temperatures become too high inside a chassis; also monitors the temperature in a variety of locations in the chassis and in attached system(s).
• Fan Sensor: Monitors fans in various locations in the chassis and in attached system(s).
• Pluggable Device Sensor: Monitors the addition, removal, or configuration errors for some pluggable devices, such as memory cards.
• Battery Sensor: Monitors the status of one or more batteries in the system.
• SD Card Device Sensor: Monitors instrumented Secure Digital (SD) card devices in the system.

Sample Event Message Text
The following example shows the format of the event messages logged by Server Administrator.
The location of the event log file depends on the operating system you are using.
• On systems running Microsoft Windows operating systems, event messages are logged in the operating system event log and the Server Administrator event log.
NOTE: The Server Administrator event log file is named dcsys32.xml and is located in the install_path\omsa\log directory. The default install_path is C:\Program Files\Dell\SysMgt.
• On systems running the Red Hat Enterprise Linux, SUSE Linux Enterprise Server, Citrix XenServer, and VMware ESX operating systems, you can locate the configuration file in the /opt/dell/srvadmin/etc/srvadmin-deng/ini directory and set the property UnitextLog.enabled=true. Run the /etc/init.d/dataeng restart command to restart the Server Administrator Event Manager service and enable the setting. This also restarts the Server Administrator Data Manager and SNMP services.
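The property change described above can be sketched in Python. This is only an illustration: it assumes the configuration file is a plain list of key=value lines, which this guide does not confirm, and the helper name set_property and the path passed to it are hypothetical. The name of the configuration file is not given in this section, so the caller must supply it.

```python
from pathlib import Path


def set_property(ini_path, key, value):
    """Set key=value in a plain key=value text file.

    Assumption: one property per line, no sections. Returns True when an
    existing line was replaced and False when the property was appended.
    """
    path = Path(ini_path)
    lines = path.read_text().splitlines() if path.exists() else []
    new_line = f"{key}={value}"
    replaced = False
    for i, line in enumerate(lines):
        # Compare only the part before the first "=" with the wanted key.
        if line.split("=", 1)[0].strip() == key:
            lines[i] = new_line
            replaced = True
    if not replaced:
        lines.append(new_line)
    path.write_text("\n".join(lines) + "\n")
    return replaced
```

After a call such as set_property(path_to_config_file, "UnitextLog.enabled", "true"), the /etc/init.d/dataeng restart command named above would still have to be run for the setting to take effect.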
Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server
1 Log in as root.
2 Use a text editor such as vi or emacs to view the file named /var/log/messages.
The following example shows the Red Hat Enterprise Linux and SUSE Linux Enterprise Server message log, /var/log/messages. The text in boldface type indicates the message text.
NOTE: These messages are typically displayed as one long line.
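As an alternative to paging through the file in an editor, the lines of interest can be filtered with a short script. The sketch below assumes that the relevant entries contain the text "Server Administrator"; that marker string is an assumption, not something this section specifies, and the function names are hypothetical.

```python
def server_admin_lines(lines, marker="Server Administrator"):
    """Return the lines that contain `marker`, with trailing newlines removed."""
    return [line.rstrip("\n") for line in lines if marker in line]


def read_matching(path="/var/log/messages", marker="Server Administrator"):
    """Read a log file and keep only the lines that contain `marker`."""
    # errors="replace" keeps the scan going if the log holds undecodable bytes.
    with open(path, errors="replace") as log:
        return server_admin_lines(log, marker)
```

Reading /var/log/messages normally requires root privileges, as in step 1 above.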
Viewing Events in VMware ESX/ESXi
1 Log in to the system running VMware ESX/ESXi with VMware vSphere Client.
2 Click View → Administration → System Logs.
3 Select the Server Log /var/log/messages entry from the drop-down list.

Viewing the Event Information
The event log for each operating system contains some or all of the following information:
• Date: The date the event occurred.
• Time: The local time the event occurred.
• Type: A classification of the event severity: Information, Warning, or Error.
Table 1-2. Event Description Reference (continued) Description Line Item Explanation Additional Details: Specifies additional details available for the hot plug Memory device: DIMM1_A Serial number: FFFF30B1 Specifies information pertaining to the event, for example: Chassis intrusion state: Specifies whether the chassis intrusion state is Open or Closed.
2 Server Management Messages
The following tables list, in numerical order, each event ID and its corresponding description, along with its severity and cause.
NOTE: For corrective actions, see the appropriate documentation.

Server Administrator General Messages
The messages in Table 2-1 indicate that certain alert systems are up and working.

Table 2-1. Server Administrator General Messages

Event ID: 0000
Severity: Information
Cause: User cleared the log from Server Administrator.

Event ID: 1004
Description: Thermal shutdown protection has been initiated
Severity: Error
Cause: This message is generated when a system is configured for thermal shutdown due to an error event. If a temperature sensor reading exceeds the error threshold for which the system is configured, the operating system shuts down and the system powers off.

Event ID: 1008
Description: Systems Management Data Manager Started
Severity: Information
Cause: Systems Management Data Manager services were started.

Event ID: 1009
Description: Systems Management Data Manager Stopped
Severity: Information
Cause: Systems Management Data Manager services were stopped.

Event ID: 1011
Description: RCI table is corrupt
Severity: Error

Event ID: 1012
Description: IPMI Status
Severity: Information
Cause: This message is generated to indicate the Intelligent Platform Management Interface (IPMI) status of the system.
Temperature Sensor Messages
The temperature sensors listed in Table 2-2 help protect critical components by alerting the systems management console when temperatures become too high inside a chassis. The temperature sensor messages use additional variables: sensor location, chassis location, previous state, and temperature sensor value or state.

Table 2-2. Temperature Sensor Messages

Event ID: 1052
Description: Temperature sensor returned to a normal value
  Sensor location:
  Chassis location:
  Previous state was:
Severity: Information
Cause: A temperature sensor on the backplane board, system board, or drive carrier in the specified system returned to a valid range after crossing a failure threshold.

Event ID: 1054
Description: Temperature sensor detected a failure value
Severity: Error
Cause: A temperature sensor on the backplane board, system board, or drive carrier in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and temperature sensor value are provided.
Cooling Device Messages
The cooling device sensors listed in Table 2-3 monitor how well a fan is functioning. Cooling device messages provide status and warning information for fans in a particular chassis.

Table 2-3. Cooling Device Messages

Event ID: 1100
Description: Fan sensor has failed
Severity: Error
Cause: A fan sensor in the specified system is not functioning. The sensor location, chassis location, previous state, and fan sensor value information is provided.

Event ID: 1102
Description: Fan sensor returned to a normal value
Severity: Information
Cause: A fan sensor reading on the specified system returned to a valid range after crossing a warning threshold. The sensor location, chassis location, previous state, and fan sensor value information is provided.

Event ID: 1105
Description: Fan sensor detected a non-recoverable value
Severity: Error
Cause: A fan sensor detected an error from which it cannot recover. The sensor location, chassis location, previous state, and fan sensor value information is provided.
Voltage Sensor Messages
The voltage sensors listed in Table 2-4 monitor the number of volts across critical components. Voltage sensor messages provide status and warning information for voltage sensors in a particular chassis.

Table 2-4. Voltage Sensor Messages

Event ID: 1150
Description: Voltage sensor has failed
Severity: Warning
Cause: A voltage sensor in the specified system failed. The sensor location, chassis location, previous state, and voltage sensor value information is provided.

Event ID: 1152
Description: Voltage sensor returned to a normal value
Severity: Information
Cause: A voltage sensor in the specified system returned to a valid range after crossing a failure threshold. The sensor location, chassis location, previous state, and voltage sensor value information is provided.

Event ID: 1154
Description: Voltage sensor detected a failure value
Severity: Error
Cause: A voltage sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and voltage sensor value information is provided.

Severity: Error
Cause: A voltage sensor in the specified system detected an error from which it cannot recover.
Current Sensor Messages
The current sensors listed in Table 2-5 measure the amount of current (in amperes) that is traversing critical components. Current sensor messages provide status and warning information for current sensors in a particular chassis.

Table 2-5. Current Sensor Messages

Event ID: 1200
Description: Current sensor has failed
Severity: Error
Cause: A current sensor in the specified system failed. The sensor location, chassis location, previous state, and current sensor value are provided.

Event ID: 1201
Description: Current sensor value unknown
Severity: Warning
Cause: A current sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and a nominal current sensor value are provided.

Event ID: 1203
Description: Current sensor detected a warning value
Severity: Warning
Cause: A current sensor in the specified system exceeded its warning threshold. The sensor location, chassis location, previous state, and current sensor value are provided.

Severity: Error
Cause: A current sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and current sensor value are provided.

Event ID: 1205
Description: Current sensor detected a non-recoverable value
Severity: Error
Cause: A current sensor in the specified system detected an error from which it cannot recover. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-6. Chassis Intrusion Messages

Event ID: 1250
Severity: Error
Cause: A chassis intrusion sensor in the specified system failed. The sensor location, chassis location, previous state, and chassis intrusion state are provided.

Severity: Warning
Cause: A chassis intrusion sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and chassis intrusion state are provided.

Event ID: 1253
Severity: Warning
Cause: A chassis intrusion sensor in the specified system detected that a system cover is currently being opened and the system is operating. The sensor location, chassis location, previous state, and chassis intrusion state information is provided.

Severity: Critical
Cause: A chassis intrusion sensor in the specified system detected that the system cover was opened while the system was operating.
Redundancy Unit Messages
Redundancy means that a system chassis has more than one of certain critical components. Fans and power supplies, for example, are so important for preventing damage or disruption of a computer system that a chassis may have “extra” fans or power supplies installed. Redundancy allows a second or nth fan to keep the chassis components at a safe temperature when the primary fan has failed. Redundancy is normal when the intended number of critical components are operating.

Table 2-7. Redundancy Unit Messages (continued)

Event ID: 1302
Description: Redundancy not applicable
Severity: Information
Cause: A redundancy sensor in the specified system detected that a unit was not redundant. The redundancy location, chassis location, previous redundancy state, and the number of devices required for full redundancy information is provided.

Event ID: 1304
Description: Redundancy regained
Severity: Information
Cause: A redundancy sensor in the specified system detected that a “lost” redundancy device has been reconnected or replaced; full redundancy is in effect. The redundancy unit location, chassis location, previous redundancy state, and the number of devices required for full redundancy information is provided.

Event ID: 1306
Description: Redundancy lost
Severity: Error
Cause: A redundancy sensor in the specified system detected that one of the components in the redundant unit has been disconnected, has failed, or is not present. The redundancy unit location, chassis location, previous redundancy state, and the number of devices required for full redundancy are provided.
Power Supply Messages
The power supply sensors monitor how well a power supply is functioning. The power supply messages listed in Table 2-8 provide status and warning information for power supplies present in a particular chassis.

Table 2-8. Power Supply Messages

Event ID: 1350
Severity: Error
Cause: A power supply sensor in the specified system failed.

Event ID: 1351
Severity: Warning
Cause: A power supply sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, power supply type, additional power supply status, and configuration error type information are provided.

Event ID: 1353
Severity: Warning
Cause: A power supply sensor reading in the specified system exceeded a user-definable warning threshold. The sensor location, chassis location, previous state, power supply type, additional power supply status, and configuration error type information are provided.

Severity: Error
Cause: A power supply has been disconnected or has failed.
Memory Device Messages
The memory device messages listed in Table 2-9 provide status and warning information for memory modules present in a particular system. Memory devices determine health status by monitoring the ECC memory correction rate and the type of memory events that have occurred.
NOTE: A critical status does not always indicate a system failure or loss of data. In some instances, the system has exceeded the ECC correction rate.
Fan Enclosure Messages
Some systems are equipped with a protective enclosure for fans. Fan enclosure messages listed in Table 2-10 monitor whether foreign objects are present in an enclosure and how long a fan enclosure is missing from a chassis.

Table 2-10. Fan Enclosure Messages

Event ID: 1450
Severity: Critical / Failure / Error
Cause: The fan enclosure sensor in the specified system failed. The sensor and chassis location information is provided.

Event ID: 1454
Severity: Error
Cause: A fan enclosure has been removed from the specified system for a user-definable length of time. The sensor and chassis location information is provided.

Severity: Error
Cause: A fan enclosure sensor in the specified system detected an error from which it cannot recover. The sensor and chassis location are provided.
AC Power Cord Messages
The AC power cord messages listed in Table 2-11 provide status and warning information for power cords that are part of an AC power switch, if your system supports AC switching.

Table 2-11. AC Power Cord Messages

Event ID: 1500
Severity: Critical / Failure / Error
Cause: An AC power cord sensor in the specified system failed. The AC power cord status cannot be monitored. The sensor and chassis location information is provided.

Event ID: 1503
Description: AC power has been lost
  Sensor location:
  Chassis location:
Severity: Critical / Failure / Error
Cause: Power supply is disrupted to the AC power cord or an AC power cord is not transmitting power, but there is sufficient redundancy to classify this as a warning. The sensor and chassis location information is provided.
Table 2-12. Hardware Log Sensor Messages

Event ID: 1550
Description: Log monitoring has been disabled
  Log type:
Severity: Warning
Cause: A hardware log sensor in the specified system is disabled. The log type information is provided.

Event ID: 1551
Description: Log status is unknown
  Log type:
Severity: Information
Cause: A hardware log sensor in the specified system could not obtain a reading. The log type information is provided.
Processor Sensor Messages
The processor sensors monitor how well a processor is functioning. Processor messages listed in Table 2-13 provide status and warning information for processors in a particular chassis.

Table 2-13. Processor Sensor Messages

Event ID: 1600
Severity: Critical / Failure / Error
Cause: A processor sensor in the specified system is not functioning. The sensor location, chassis location, previous state and processor sensor status information is provided.

Event ID: 1602
Severity: Information
Cause: A processor sensor in the specified system transitioned back to a normal state. The sensor location, chassis location, previous state and processor sensor status are provided.

Event ID: 1604
Severity: Error
Cause: A processor sensor in the specified system is disabled, has a configuration error, or experienced a thermal trip. The sensor location, chassis location, previous state and processor sensor status are provided.

Severity: Error
Cause: A processor sensor in the specified system has failed. The sensor location, chassis location, previous state and processor sensor status are provided.
Pluggable Device Messages
The pluggable device messages listed in Table 2-14 provide status and error information when some devices, such as memory cards, are added or removed.

Table 2-14. Pluggable Device Messages

Event ID: 1650
Description:
  Device location:
Severity: Information
Cause: A pluggable device event message of unknown type was received. The device location, chassis location, and additional event

Event ID: 1652
Severity: Information
Cause: A device was removed from the specified system. The device location, chassis location, and additional event details, if available, are provided.
Battery Sensor Messages
The battery sensors monitor how well a battery is functioning. The battery messages listed in Table 2-15 provide status and warning information for batteries in a particular chassis.
Table 2-15.
Table 2-15. Battery Sensor Messages (continued) Event Description ID Severity 1702 Battery sensor returned to a normal value 1703 Battery sensor detected a warning value Information A battery sensor in the specified system detected that a Sensor Location: back to a normal Chassis Location:
Table 2-15. Battery Sensor Messages (continued)

Event ID: 1705
Severity: Error
Cause: A battery sensor in the specified system could not retrieve a value. The sensor location, chassis location, previous state, and battery sensor status information is provided.
Table 2-16. SD Card Device Messages Event ID Description Severity Cause 1751 SD card device sensor value unknown 1752 SD card device returned to Information An SD card device normal sensor in the specified system detected that Sensor location: back to a normal state.
Event ID: 1753
Description: SD card device detected a warning
Severity: Warning
Cause: An SD card device sensor in the specified system detected a warning condition. The sensor location, chassis location, previous state, and SD card device type information is provided. The SD card state is provided if an SD card is present in the SD card device.

Severity: Error
Cause: An SD card device sensor in the specified system detected an error.

Event ID: 1755
Description: SD card device sensor detected a non-recoverable value
  Sensor location:
  Chassis location:
  Previous state was:
  SD card device type:
  SD card state:
Severity: Error
Cause: An SD card device sensor in the specified system detected an error from which it cannot recover.
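The event IDs shown in the tables of this chapter fall into blocks of 50 per message group (for example, 1052 and 1054 for temperature sensors, 1100 to 1105 for cooling devices). The following Python sketch uses that observation to map an event ID to its message group. The block-of-50 rule is inferred from the IDs that appear in these tables, not stated by the guide, and the names CATEGORY_BY_BASE and category_for are hypothetical. Groups whose IDs do not appear in this section (such as memory devices) are deliberately left out.

```python
# Block start -> message group, taken from the table titles in this chapter.
# Assumption: each group occupies one block of 50 consecutive event IDs.
CATEGORY_BY_BASE = {
    1000: "Server Administrator General",
    1050: "Temperature Sensor",
    1100: "Cooling Device",
    1150: "Voltage Sensor",
    1200: "Current Sensor",
    1250: "Chassis Intrusion",
    1300: "Redundancy Unit",
    1350: "Power Supply",
    1450: "Fan Enclosure",
    1500: "AC Power Cord",
    1550: "Hardware Log Sensor",
    1600: "Processor Sensor",
    1650: "Pluggable Device",
    1700: "Battery Sensor",
    1750: "SD Card Device",
}


def category_for(event_id):
    """Return the message group for an event ID, or "Unknown"."""
    base = (event_id // 50) * 50  # round down to the start of the block
    return CATEGORY_BY_BASE.get(base, "Unknown")
```

For example, category_for(1306) returns "Redundancy Unit", and an ID outside the listed blocks returns "Unknown".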
Chassis Management Controller Messages
The alerts sent by the M1000e Chassis Management Controller (CMC) are organized by severity. That is, the event ID of the CMC trap indicates the severity (informational, warning, critical, or non-recoverable) of the alert. Each CMC alert includes the originating system name, location, and event message text. The alert message text matches the corresponding Chassis Event Log message text that is logged by the sending CMC for that event.
Table 2-17.
3 Storage Management Message Reference
The Server Administrator Storage Management’s alert or event management features let you monitor the health of storage resources such as controllers, enclosures, physical disks, and virtual disks.

Alert Monitoring and Logging
The Storage Management Service performs alert monitoring and logging. By default, the Storage Management service starts when the managed system starts up. If you stop the Storage Management Service, then alert monitoring and logging stops.
Alert Message Format with Substitution Variables
When you view an alert in the Server Administrator alert log, the alert identifies the specific components such as the controller name or the virtual disk name to which the alert applies. In an actual operating environment, a storage system can have many combinations of controllers and disks as well as user-defined names for virtual disks and other components. Each environment is unique in its storage configuration and user-defined names.
NOTE: A, B, C and X, Y, Z in the following examples are variables representing the storage object name or number.

Table 3-2. Message Format with Variables for Each Storage Object

Storage Object: Controller
Message Format: Controller A (Name)
Message Format: Controller A
For example, 2326 A foreign configuration has been detected: Controller 1 (PERC 5/E Adapter)
NOTE: The controller name is not always displayed.

Storage Object: SAS Power Supply
Message Format: Power Supply X Controller A, Connector B, Enclosure C
For example, 2312 A power supply in the enclosure has an AC failure: Power Supply 1, Controller 1, Connector 0, Enclosure 2

Storage Object: SCSI Temperature Probe
Message Format: Temperature Probe X Controller A, Connector B, Target ID C
where C is the SCSI ID number of the EMM managing the temperature probe.
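The message formats in Table 3-2 can be taken apart programmatically. The following Python sketch assumes an alert line has the shape "ID text: Object N, Object N, ...", as in the 2312 and 2326 examples; that overall shape is inferred from those two examples only, and the names ALERT_RE, OBJECT_RE, and parse_alert are hypothetical.

```python
import re

# "<4-digit ID> <message text>: <comma-separated storage objects>"
ALERT_RE = re.compile(r"^(?P<id>\d{4})\s+(?P<text>.*?):\s+(?P<objects>.+)$")
# "<object kind> <number>" with an optional "(Name)" suffix
OBJECT_RE = re.compile(
    r"^(?P<kind>[A-Za-z ]+?)\s+(?P<number>\d+)(?:\s+\((?P<name>.+)\))?$"
)


def parse_alert(message):
    """Split an alert line into its ID, text, and (kind, number, name) tuples.

    Returns None when the line does not match the assumed shape.
    """
    match = ALERT_RE.match(message.strip())
    if match is None:
        return None
    objects = []
    for part in match.group("objects").split(","):
        obj = OBJECT_RE.match(part.strip())
        if obj:
            objects.append(
                (obj.group("kind"), int(obj.group("number")), obj.group("name"))
            )
    return {
        "id": int(match.group("id")),
        "text": match.group("text"),
        "objects": objects,
    }
```

Splitting on commas is a simplification: a user-defined name that itself contains a comma would not be parsed correctly by this sketch.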
Alert Message Change History
The following table describes the changes made to the Storage Management alerts from the previous release of Storage Management to the current release.

Table 3-3. Alert Message Change History

Storage Management 4.2
Product Versions to which changes apply: Storage Management 4.2.0, Server Administrator 7.2.0
New Alerts: 2433, 2434, 2435, 2436, 2437, 2438
Deleted Alerts: None
Modified Alerts: 2359

Storage Management 4.
Table 3-3. Alert Message Change History (continued)

Product Versions to which changes apply: Storage Management 3.4.0, Server Administrator 6.4.0
New Alerts: 2405, 2406, 2407, 2408, 2409, 2410, 2411, 2412, 2413, 2414, 2415, 2416, 2417, 2418
NOTE: The CacheCade feature is available from calendar year 2011.
Deleted Alerts: None
Modified Alerts: None

Storage Management 3.3
Product Versions to which changes apply: Storage Management 3.3.0, Server Administrator 6.3.
To locate an alert, scroll through the following table to find the alert number displayed on the Server Administrator Alert tab or search this file for the alert message text or number. See “Understanding Event Messages” on page 8 for more information on severity levels. For more information regarding alert descriptions and the appropriate corrective actions, see the online help. Table 3-4.
Table 3-4. Storage Management Messages (continued)

Event ID: 2049
Description: Physical disk removed
Severity: Warning / Non-critical
Cause: A physical disk has been removed from the disk group. This alert can also be caused by loose or defective cables or by problems with the enclosure.

Event ID: 2050
Description: Physical disk offline
Severity: Warning / Non-critical
Cause: A physical disk in the disk group is offline. The user may have manually put the physical disk offline.
Action: Perform a rescan. You can also select the offline disk and perform a Make Online operation.
Clear Alert Number: 2158
SNMP Trap Numbers: 903

Event ID: 2052
Description: Physical disk inserted
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert Number: 2065, 2305, 2367
LRA Number: None
SNMP Trap Numbers: 901

Event ID: 2053
Description: Virtual disk created
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 1201

Event ID: 2056
Description: Virtual disk failed
Severity: Critical / Failure / Error
Cause: One or more physical disks included in the virtual disk have failed. If the virtual disk is non-redundant (does not use mirrored or parity data), then the failure of a single physical disk can cause the virtual disk to fail.

Event ID: 2057
Description: Virtual disk degraded
Severity: Warning / Non-critical
Cause 1: This alert message occurs when a physical disk included in a redundant virtual disk fails. Because the virtual disk is redundant (uses mirrored or parity information) and only one physical disk has failed, the virtual disk can be rebuilt.
Cause 2: A physical disk in the disk group has been removed.
Action 2: If a physical disk was removed from the disk group, either replace the disk or restore the original disk. You can identify which disk has been removed by locating the disk that has a red “X” for its status. Perform a rescan after replacing the disk.
Event ID: 2060
Description: Copy of data started from physical disk %2 to physical disk %1.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 1201

Event ID: 2063
Description: Virtual disk reconfiguration started
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2090
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2064
Description: Virtual disk rebuild started
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 1201

Event ID: 2067
Description: Virtual disk check consistency cancelled
Severity: OK / Normal / Informational
Cause: The check consistency operation was cancelled because a physical disk in the array has failed or because a user cancelled the check consistency operation.

Event ID: 2070
Description: Virtual disk initialization cancelled
Severity: OK / Normal / Informational
Cause: The virtual disk initialization was cancelled because a physical disk included in the virtual disk has failed or because a user cancelled the virtual disk initialization.
Clear Alert Number: None
SNMP Trap Numbers: 1201

Severity: OK / Normal / Informational
Cause: The user has cancelled the rebuild operation.

Event ID: 2075
Description: Copy of data completed from physical disk %2 to physical disk %1.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
SNMP Trap Numbers: 1201
Event ID: 2077
Description: Virtual disk format failed.
Severity: Critical / Failure / Error
Cause: A physical disk included in the virtual disk failed.
Action: Replace the failed physical disk. You can identify which physical disk has failed by locating the disk that has a red X for its status. Rebuild the physical disk. When finished, restart the virtual disk format operation.

Event ID: 2079
Description: Virtual disk initialization failed.

Event ID: 2080
Description: Physical disk initialization failed
Severity: Critical / Failure / Error
Cause: The physical disk has failed or is not functioning.
Action: Replace the failed or non-functional disk. You can identify a disk that has failed by locating the disk that has a red “X” for its status. Restart the initialization.
Clear Alert Number: None
SNMP Trap Numbers: 904

Event ID: 2081 (continued)
Software RAID:
• Perform a backup with the Verify option.
• If the file backup fails, try to restore the failed file from a previous backup.
• When the backup with the Verify option is complete without any errors, delete the Virtual Disk.
• Recreate a new Virtual Disk with new drives.
• Restore the data from backup.

Event ID: 2083
Description: Physical disk rebuild failed
Severity: Critical / Failure / Error
Cause: A physical disk included in the virtual disk has failed or is not functioning. A user may also have cancelled the rebuild.
Action: Replace the failed or non-functional disk.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Numbers: 904
Event ID: 2086
Description: Virtual disk format completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Status: Alert 2086 is a clear alert for alert 2059.
SNMP Trap Numbers: 1201
Event ID: 2088
Description: Virtual disk initialization completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Status: Alert 2088 is a clear alert for alerts 2061 and 2136.
SNMP Trap Numbers: 1201
Event ID: 2090
Description: Virtual disk reconfiguration completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Status: Alert 2090 is a clear alert for alert 2063.
SNMP Trap Numbers: 1201
Event ID: 2092
Description: Physical disk rebuild completed
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Status: Alert 2092 is a clear alert for alert 2065.
SNMP Trap Numbers: 901
Event ID: 2094
Description: Predictive Failure reported.
Severity: Warning / Non-critical
Cause: The physical disk is predicted to fail. Many physical disks contain Self-Monitoring, Analysis and Reporting Technology (SMART). When enabled, SMART monitors the health of the disk based on indications such as the number of write operations that have been performed on the disk.
Event ID: 2094 (continued)
If this disk is a hot spare, then unassign the hot spare; perform the Prepare to Remove task on the disk; replace the disk; and assign the new disk as a hot spare.
CAUTION: If this disk is part of a nonredundant disk, back up your data immediately. If the disk fails, you cannot recover the data.

Event ID: 2095
Description: SCSI sense data %1.
Event ID: 2099
Description: Global hot spare unassigned
Severity: OK / Normal / Informational
Cause: A physical disk that was assigned as a hot spare has been unassigned and is no longer functioning as a hot spare. The physical disk may have been unassigned by a user or automatically unassigned by Storage Management. Storage Management unassigns hot spares that have been used to rebuild data.
Description: Temperature exceeded the maximum warning threshold
Severity: Warning / Non-critical
Cause: The physical disk enclosure is too hot. A variety of factors can cause the excessive temperature. For example, a fan may have failed, the thermostat may be set too high, or the room temperature may be too hot.
Action: Check for factors that may cause overheating. For example, verify that the enclosure fan is working.
Event ID: 2101
Description: Temperature dropped below the minimum warning threshold
Severity: Warning / Non-critical
Cause: The physical disk enclosure is too cool.
Action: Check if the thermostat setting is too low and if the room temperature is too cool.
Related Alert Information: Clear Alert Number: 2353
SNMP Trap Numbers: 1053
Event ID: 2102
Description: Temperature exceeded the maximum failure threshold
Severity: Critical / Failure / Error
Cause: The physical disk enclosure is too hot. A variety of factors can cause the excessive temperature. For example, a fan may have failed, the thermostat may be set too high, or the room temperature may be too hot.
Action: Check for factors that may cause overheating. For example, verify that the enclosure fan is working.
Event ID: 2103
Description: Temperature dropped below the minimum failure threshold
Severity: Critical / Failure / Error
Cause: The physical disk enclosure is too cool.
Action: Check if the thermostat setting is too low and if the room temperature is too cool.
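The four enclosure temperature alerts above differ only in which threshold is crossed and in which direction. The following Python sketch illustrates that relationship. It is an illustration only: the Thresholds class, the classify_temperature function, the threshold values, and the "within thresholds" text are hypothetical and are not part of Server Administrator.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical enclosure thresholds in degrees Celsius."""
    min_failure: float
    min_warning: float
    max_warning: float
    max_failure: float


def classify_temperature(temp_c: float, t: Thresholds) -> tuple[str, str]:
    """Return (severity, description) for an enclosure temperature reading."""
    # Check the failure thresholds before the warning thresholds so that
    # the most severe condition wins.
    if temp_c > t.max_failure:
        return ("Critical / Failure / Error",
                "Temperature exceeded the maximum failure threshold")
    if temp_c > t.max_warning:
        return ("Warning / Non-critical",
                "Temperature exceeded the maximum warning threshold")
    if temp_c < t.min_failure:
        return ("Critical / Failure / Error",
                "Temperature dropped below the minimum failure threshold")
    if temp_c < t.min_warning:
        return ("Warning / Non-critical",
                "Temperature dropped below the minimum warning threshold")
    # Assumed wording; the table does not define text for this case.
    return ("OK / Normal / Informational", "Temperature within thresholds")


# Example thresholds chosen only for illustration.
example = Thresholds(min_failure=5.0, min_warning=10.0,
                     max_warning=40.0, max_failure=50.0)
print(classify_temperature(45.0, example))
```

Failure thresholds are tested first so that a reading beyond both the warning and failure limits is reported once, at the higher severity.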
Event ID: 2106
Description: SMART FPT exceeded
Severity: Warning / Non-critical
Cause: A disk on the specified controller has received a SMART alert (predictive failure) indicating that the disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None.
SNMP Trap Numbers: 903
Event ID: 2107
Description: SMART configuration change
Severity: Critical / Failure / Error
Cause: A disk has received a SMART alert (predictive failure) after a configuration change. The disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None.
SNMP Trap Numbers: 904
Event ID: 2108
Description: SMART warning
Severity: Warning / Non-critical
Cause: A disk has received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None.
SNMP Trap Numbers: 903
Event ID: 2109
Description: SMART warning temperature
Severity: Warning / Non-critical
Cause: A disk has reached an unacceptable temperature and received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action 1: Determine why the physical disk has reached an unacceptable temperature.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: 2070
SNMP Trap Numbers: 903
Event ID: 2109 (continued)
Action 1 (continued): Make sure the enclosure has enough ventilation and that the room temperature is not too hot. See the physical disk enclosure documentation for more diagnostic information.
Action 2: If you cannot identify why the disk has reached an unacceptable temperature, then replace the disk. If the physical disk is a member of a non-redundant virtual disk, then back up the data before replacing the disk.
Event ID: 2110
Description: SMART warning degraded
Severity: Warning / Non-critical
Cause: A disk is degraded and has received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None.
SNMP Trap Numbers: 903
Event ID: 2112
Description: Enclosure was shut down
Severity: Critical / Failure / Error
Cause: The physical disk enclosure is either hotter or cooler than the maximum or minimum allowable temperature range.
Action: Check for factors that may cause overheating or excessive cooling.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None.
SNMP Trap Numbers: 854
Event ID: 2114
Description: A consistency check on a virtual disk has been paused (suspended)
Severity: OK / Normal / Informational
Cause: The check consistency operation on a virtual disk was paused by a user.

Event ID: 2115
Description: A consistency check on a virtual disk has been resumed
Severity: OK / Normal / Informational
Event ID: 2116
Description: A virtual disk and its mirror have been split
Severity: OK / Normal / Informational
Cause: A user has caused a mirrored virtual disk to be split. When a virtual disk is mirrored, its data is copied to another virtual disk in order to maintain redundancy. After being split, both virtual disks retain a copy of the data although the mirror is no longer intact.
Event ID: 2118
Description: Change write policy
Severity: OK / Normal / Informational
Cause: A user has changed the write policy for a virtual disk. This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2120
Description: Enclosure firmware mismatch
Severity: Warning / Non-critical
Cause: The firmware on the EMM is not the same version.
Event ID: 2121
Description: Device returned to normal
Severity: OK / Normal / Informational
Cause: A device that was previously in an error state has returned to a normal state. For example, if an enclosure became too hot and subsequently cooled down, you may receive this alert. This alert is for informational purposes.
Event ID: 2122
Description: Redundancy degraded
Severity: Warning / Non-critical
Cause: One or more of the enclosure components has failed. For example, a fan or power supply may have failed. Although the enclosure is currently operational, the failure of additional components could cause the enclosure to fail.
Related Alert Information: Clear Alert Status: 2124
SNMP Trap Numbers: 1305
Event ID: 2122 (continued)
The controller status displayed on the Health subtab indicates whether a controller has a Failed or Degraded component. See the enclosure documentation for information on replacing enclosure components and for other diagnostic information.
Event ID: 2123
Description: Redundancy lost
Severity: Warning / Non-critical
Cause: A virtual disk or an enclosure has lost data redundancy. In the case of a virtual disk, one or more physical disks included in the virtual disk have failed. Due to the failed physical disk or disks, the virtual disk is no longer maintaining redundant (mirrored or parity) data.
Event ID: 2123 (continued)
The controller status displayed on the Health subtab indicates whether a controller has a Failed or Degraded component. Click the controller that displays a Warning or Failed status. This action displays the controller Health subtab which displays the status of the individual controller components.
Event ID: 2124
Description: Redundancy normal
Severity: OK / Normal / Informational
Cause: Data redundancy has been restored to a virtual disk or an enclosure that previously suffered a loss of redundancy. This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: Alert 2124 is a clear alert for alerts 2122 and 2123.
SNMP Trap Numbers: 1304
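Several entries in this table state that one alert is a clear alert for one or more earlier alerts (for example, 2124 clears 2122 and 2123). A monitoring script could use that relationship to keep a set of outstanding alerts. The Python sketch below shows one way to do this under stated assumptions: the CLEARS mapping contains only the pairs listed in this table, and the apply_event function and the idea of an "active set" are hypothetical, not a Server Administrator feature.

```python
# Clear-alert relationships taken from the table entries above:
# key = clear alert, value = the alerts it clears.
CLEARS = {
    2086: {2059},
    2088: {2061, 2136},
    2090: {2063},
    2092: {2065},
    2124: {2122, 2123},
    2130: {2127},
    2172: {2171},
}

# Every alert ID that some other alert is documented to clear.
CLEARABLE = frozenset().union(*CLEARS.values())


def apply_event(active: set[int], event_id: int) -> set[int]:
    """Return the new set of outstanding alerts after one event arrives."""
    if event_id in CLEARS:
        # A clear alert removes the alerts it is documented to clear.
        return active - CLEARS[event_id]
    if event_id in CLEARABLE:
        # An alert that can later be cleared stays outstanding until then.
        return active | {event_id}
    # Other events do not affect the outstanding set in this sketch.
    return set(active)


outstanding: set[int] = set()
for event in (2122, 2123, 2127, 2124):
    outstanding = apply_event(outstanding, event)
print(sorted(outstanding))
```

The function returns a new set rather than modifying its argument, which keeps each step easy to inspect or log.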
Event ID: 2126
Description: SCSI sense sector reassign
Severity: Warning / Non-critical
Cause: A sector of the physical disk is corrupted and data cannot be maintained on this portion of the disk. This alert is for informational purposes.
Event ID: 2127
Description: Background initialization (BGI) started
Severity: OK / Normal / Informational
Cause: BGI of a virtual disk has started. This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Status: 2130. Related Alert Number: None. LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2128
Description: BGI cancelled
Severity: OK / Normal / Informational
Cause: BGI of a virtual disk has been cancelled.
Event ID: 2130
Description: BGI completed
Severity: OK / Normal / Informational
Cause: BGI of a virtual disk has completed. This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: Alert 2130 is a clear alert for alert 2127.
SNMP Trap Numbers: 1201
Event ID: 2132
Description: Driver version mismatch
Severity: Warning / Non-critical
Cause: The controller driver is not a supported version.
Action: Install a supported version of the driver.
Related Alert Information: Clear Alert Number: None
SNMP Trap Numbers: 753

Event ID: 2135
Description: Array Manager is installed on the system
Severity: Warning / Non-critical
NOTE: This is not supported on Server Administrator version 6.0.1.
Event ID: 2136
Description: Virtual disk initialization
Severity: OK / Normal / Informational
Cause: Virtual disk initialization is in progress. This alert is for informational purposes.
Event ID: 2137
Description: Communication timeout
Severity: Warning / Non-critical
Cause: The controller is unable to communicate with an enclosure. There are several reasons why communication may be lost. For example, there may be a bad or loose cable. An unusual amount of I/O may also interrupt communication with the enclosure.
Event ID: 2137 (continued)
Action: Check for problems with the cables. See the online help for more information on checking the cables. You should also check to see if the enclosure has degraded or failed components. To do so, select the enclosure object in the tree view and click the Health subtab. The Health subtab displays the status of the enclosure components.

Event ID: 2138
Event ID: 2139
Description: Enclosure alarm disabled
Severity: OK / Normal / Informational
Cause: A user has disabled the enclosure alarm.
Action: None
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2140
Description: Dead disk segments restored
Severity: OK / Normal / Informational
Cause: Disk space that was formerly "dead" or inaccessible to a redundant virtual disk has been restored.
Event ID: 2142
Description: Controller rebuild rate has changed
Severity: OK / Normal / Informational
Cause: A user has changed the controller rebuild rate. This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2143
Description: Controller alarm enabled
Severity: OK / Normal / Informational
Cause: A user has enabled the controller alarm.
Event ID: 2145
Description: Controller battery low
Severity: Warning / Non-critical
Cause: The controller battery charge is low.
Action: Recondition the battery. See the online help for more information.
Related Alert Information: Clear Alert: None. Related Alert: None
SNMP Trap Numbers: 1153

Cause: A portion of a physical disk is damaged.
Event ID: 2149
Description: Bad block extended sense error
Severity: Warning / Non-critical
Cause: A portion of a physical disk is damaged.
Action: See the Server Administrator Storage Management online help for more information.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 753

Event ID: 2150
Description: Bad block extended medium error
Severity: Warning / Non-critical
Cause: A portion of a physical disk is damaged.
Event ID: 2153
Description: Enclosure service tag changed
Severity: OK / Normal / Informational
Cause: An enclosure service tag was changed. In most circumstances, this service tag should only be changed by your service provider.
Action: Ensure that the tag was changed under authorized circumstances.
Related Alert Information: Clear Alert: None. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 851
Event ID: 2157
Description: Controller configuration has been reset
Severity: OK / Normal / Informational
Cause: A user has reset the controller configuration. See the online help for more information. This alert is for informational purposes.
Action: None

Event ID: 2158
Description: Physical disk online
Severity: OK / Normal / Informational
Cause: An offline physical disk has been made online. This alert is for informational purposes.
Event ID: 2159
Description: Virtual disk renamed
Severity: OK / Normal / Informational
Cause: A user has renamed a virtual disk. When renaming a virtual disk on a PERC 4/SC, 4/DC, 4e/DC, 4/Di, CERC ATA100/4ch, PERC 5/E, PERC 5/i or SAS 5/iR controller, this alert displays the new virtual disk name.
Event ID: 2161
Description: Dedicated hot spare unassigned
Severity: OK / Normal / Informational
Cause: A physical disk that was assigned as a hot spare has been unassigned and is no longer functioning as a hot spare. The physical disk may have been unassigned by a user or automatically unassigned by Storage Management. Storage Management unassigns hot spares that have been used to rebuild data.
Event ID: 2161 (continued)
Action: Although this alert is provided for informational purposes, you may need to assign a new hot spare to the virtual disk.

Event ID: 2162
Description: Communication regained
Severity: OK / Normal / Informational
Cause: Communication with an enclosure has been restored. This alert is for informational purposes.
Event ID: 2164
Description: See the Readme file for a list of validated controller driver versions
Severity: OK / Normal / Informational
Cause: Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller drivers. This alert is for informational purposes.
Event ID: 2165
Description: The RAID controller firmware and driver validation was not performed. The configuration file cannot be opened.
Severity: Warning / Non-critical
Cause: Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller firmware and drivers. This situation may occur for a variety of reasons.
Event ID: 2166
Description: The RAID controller firmware and driver validation was not performed. The configuration file is out of date, missing the required information, or not properly formatted to complete the comparison.
Severity: Warning / Non-critical
Event ID: 2167
Description: The current kernel version and the non-RAID SCSI driver version are older than the minimum required levels. See readme.txt for a list of validated kernel and driver versions.
Severity: Warning / Non-critical
Cause: The version of the kernel and the driver do not meet the minimum requirements.
Event ID: 2168
Description: The non-RAID SCSI driver version is older than the minimum required level. See readme.txt for the validated driver version.
Severity: Warning / Non-critical
Cause: The version of the driver does not meet the minimum requirements.
Event ID: 2170
Description: The controller battery charge level is normal.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 1151

Event ID: 2171
Description: The controller battery temperature is above normal.
Severity: Warning / Non-critical
Event ID: 2172
Description: The controller battery temperature is normal.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Status: Alert 2172 is a clear alert for alert 2171. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 1151

Event ID: 2173
Description: Unsupported configuration detected.
Event ID: 2174
Description: The controller battery has been removed.
Severity: Warning / Non-critical
Cause: The controller cannot communicate with the battery. The battery may be removed, or the contact point between the controller and the battery may be burnt or corroded.
Action: Replace the battery if it has been removed.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 1153
Event ID: 2176
Description: The controller battery Learn cycle has started.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: 2177. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 1151

Event ID: 2177
Description: The controller battery Learn cycle has completed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Event ID: 2178
Description: The controller battery Learn cycle has timed out.
Severity: Warning / Non-critical
Cause: The controller battery must be fully charged before the Learn cycle can begin. The battery may be unable to maintain a full charge, causing the Learn cycle to time out.
Event ID: 2180
Description: The controller battery Learn cycle starts in %1 days.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the alert log and can vary depending on the situation.
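The substitution variables (%1, %2, and so on) that appear in message descriptions are positional placeholders. The Python sketch below shows how such a template could be expanded. It is an illustration only: the render_message function and the example values are hypothetical, and the sketch does not reproduce how Server Administrator itself builds alert text.

```python
import re


def render_message(template: str, values: list[str]) -> str:
    """Replace %1, %2, ... in a message template with positional values."""

    def substitute(match: re.Match) -> str:
        index = int(match.group(1))
        if 1 <= index <= len(values):
            # %1 refers to the first value, %2 to the second, and so on.
            return str(values[index - 1])
        # Leave the placeholder untouched when no value was supplied.
        return match.group(0)

    return re.sub(r"%(\d+)", substitute, template)


# The day count and the disk names below are made-up example values.
print(render_message(
    "The controller battery Learn cycle starts in %1 days.", ["7"]))
print(render_message(
    "Copy of data completed from physical disk %2 to physical disk %1.",
    ["0:0:1", "0:0:2"]))
```

Because placeholders are looked up by number rather than by order of appearance, a template such as the one for alert 2075 can mention %2 before %1.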
Event ID: 2183
Description: Replace Member Operation failed on physical disk %1 from physical disk %2.
Severity: Critical / Failure / Error
Cause: The physical disk participating in the Replace Member Operation has failed.

Description: Physical disk Replace Member Operation cancelled.
Severity: OK / Normal / Informational
Cause: User cancelled the Replace Member Operation.
Event ID: 2186
Description: The controller cache has been discarded.
Severity: Warning / Non-critical
Cause: The controller has flushed the cache and any data in the cache has been lost. This may happen if the system has memory or battery problems that cause the controller to distrust the cache.
Event ID: 2188
Description: The controller write policy has been changed to Write Through.
Severity: OK / Normal / Informational
Cause: The controller battery is unable to maintain cached data for the required period of time. For example, if the required period of time is 24 hours, the battery is unable to maintain cached data for 24 hours.
Event ID: 2190
Description: The controller has detected a hot-add of an enclosure.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2191
Description: Multiple enclosures are attached to the controller.
Severity: Critical / Failure / Error
Cause: There are too many enclosures attached to the
Event ID: 2192
Description: The virtual disk Check Consistency has made corrections and completed.
Severity: Informational
Cause: The virtual disk Check Consistency has identified errors and has made corrections. For example, the Check Consistency may have encountered a bad disk block and remapped the disk block to restore data consistency. This alert is for informational purposes.
Event ID: 2195
Description: Dedicated hot spare assigned. Physical disk %1
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: 2196. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2196
Description: Dedicated hot spare unassigned.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Event ID: 2198
Description: The physical disk is too small to be used for Replace Member Operation.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None
SNMP Trap Numbers: 903

2199 2200 The virtual disk cache policy has changed.
Event ID: 2201
Description: A global hot spare failed.
Severity: Warning / Non-critical
Cause: The controller is not able to communicate with a disk that is assigned as a dedicated hot spare. The disk may have been removed. There may also be a bad or loose cable.
Action: Check if the disk is healthy and that it has not been removed. Check the cables.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 903
Event ID: 2203
Description: A dedicated hot spare failed.
Severity: Warning / Non-critical
Cause: The controller is unable to communicate with a disk that is assigned as a dedicated hot spare. The disk may have failed or been removed. There may also be a bad or loose cable.
Action: Check if the disk is healthy and that it has not been removed. Check the cables.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 903
Event ID: 2205
Description: A dedicated hot spare has been automatically unassigned.
Severity: OK / Normal / Informational
Cause: The hot spare is no longer required because the virtual disk it was assigned to has been deleted.
Action: None
Related Alert Information: Clear Alert: None. Related Alert Number: 2098, 2161, 2196. LRA Number: None
SNMP Trap Numbers: 901

Event ID: 2206
Description: The only hot spare available is a SATA disk.
Severity: Warning / Non-critical
Event ID: 2207
Description: The only hot spare available is a SAS disk. SAS disks cannot replace SATA disks.
Severity: Warning / Non-critical
Cause: The only physical disk available to be assigned as a hot spare is using SAS technology. The physical disks in the virtual disk are using SATA technology.
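Alert 2207 is raised when the available hot spare uses a different disk technology from the disks in the virtual disk. The Python sketch below expresses that check. It is an illustration only: the hot_spare_compatible function is hypothetical, and treating any technology mismatch as incompatible (not only SAS replacing SATA) is an assumption drawn from the descriptions of alerts 2206 and 2207.

```python
def hot_spare_compatible(spare_technology: str,
                         member_technologies: list[str]) -> bool:
    """Return True when the spare uses the same technology as every member.

    Technology names such as "SAS" and "SATA" are compared without
    regard to case or surrounding whitespace.
    """
    spare = spare_technology.strip().upper()
    # An empty member list is treated as compatible in this sketch.
    return all(member.strip().upper() == spare
               for member in member_technologies)


# A SAS spare offered to a virtual disk made of SATA disks: the mismatch
# corresponds to the condition described for alert 2207.
print(hot_spare_compatible("SAS", ["SATA", "SATA"]))
```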
Event ID: 2211
Description: The physical disk is not supported.
Severity: Warning / Non-critical
Cause: The physical disk may not have a supported version of the firmware or the disk may not be supported by your service provider.
Action: If the disk is supported, update the firmware to a supported version.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 903
Event ID: 2214
Description: Battery charge in progress
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 1151

Event ID: 2215
Description: Battery charge process
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 1151
Event ID: 2218
Description: None of the Controller Property are set.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: You should change at least one controller property and run the command again.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 751

2219 2220 Abort Check Consistency on Error, Replace Member Operation, Auto Replace Member Operation on Predictive Failure and Loadbalance changed. OK / Normal / Informational
Event ID: 2221
Description: Auto Replace Member Operation on Predictive Failure, Abort CC on Error and Loadbalance changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 751

Description: Loadbalance and Auto Replace Member Operation on Predictive Failure changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 751
Event ID: 2225
Description: Abort Check Consistency on Error and Load balance changed.
Severity: OK / Normal / Informational

Event ID: 2226
Description: Load balance changed

Cause: This alert is for informational purposes.
Action: Change at least one controller property and run the command again.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 751
Event ID: 2229
Description: Abort Check Consistency on Error and Auto Replace Member Operation on Predictive Failure changed.
Severity: OK / Normal / Informational

2230 2231 2232 Auto Replace Member Operation on Predictive Failure changed.

Cause: This alert is for informational purposes.
Action: Change at least one controller property and run the command again.
Related Alert Information: Clear Alert: None
SNMP Trap Numbers: 751
Event ID: 2233
Description: The Background initialization (BGI) rate has changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: Clear Alert: None. Related Alert: None. LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2234
Description: The Patrol Read rate has changed.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 751
Event ID: 2237
Description: Abort Check Consistency on Error modified.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: Change at least one controller property and run the command again.
Related Alert Information: Clear Alert: None. Related Alert: None
SNMP Trap Numbers: 751

Event ID: 2238
Description: The controller debug log file has been exported.
Severity: OK / Normal / Informational
Cause: The user has attempted to export the controller debug log.
Event ID: 2241
Description: The Patrol Read mode has changed.
Severity: OK / Normal / Informational
Cause: The controller has changed the patrol read mode. This alert is for informational purposes.
Action: None

Event ID: 2242
Description: The Patrol Read operation has started.
Severity: OK / Normal / Informational
Cause: The controller has started the Patrol Read operation. This alert is for informational purposes.
Event ID: 2244
Description: A virtual disk blink has been initiated.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2245
Description: A virtual disk blink has ceased.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 1201
Event ID: 2247
Description: The controller battery is charging.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2358
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 1151

Event ID: 2248
Description: The controller battery is executing a Learn cycle.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Event ID: 2251
Description: The physical disk blink has initiated.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 901

Event ID: 2252
Description: The physical disk blink has ceased.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 901
Event ID: 2255
Description: The physical disk has been started.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 901
Event ID: 2259
Description: An enclosure blink operation has initiated.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2260
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2260
Description: An enclosure blink has ceased.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
SNMP Trap Numbers: 851
Event ID: 2263
Description: SMART thermal shutdown is disabled.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 101

Event ID: 2264
Description: A device is missing.
Severity: Warning / Non-critical
Cause: The controller cannot communicate with a device. The device may be removed.
Event ID: 2265
Description: A device is in an unknown state.
Severity: Warning / Non-critical
Cause: The controller cannot communicate with a device. The state of the device cannot be determined. There may be a bad or loose cable. The system may also be experiencing problems with the application programming interface (API). There could also be a problem with the driver or firmware.
Event ID: 2266
Description: Controller log file entry: %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Event ID: 2268
Description: %1, Storage Management has lost communication with the controller. An immediate reboot is strongly recommended to avoid further problems.
Severity: Critical / Failure / Error
Cause: Storage Management has lost communication with a controller. This may occur if the controller driver or firmware is experiencing a problem. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert
Event ID: 2270
Description: The physical disk Clear operation failed.
Severity: Critical / Failure / Error
Cause: A Clear task was being performed on a physical disk but the task was interrupted and did not complete successfully. The controller may have lost communication with the disk. The disk may have been removed or the cables may be loose or defective.
Event ID: 2272
Description: Patrol Read found an uncorrectable media error.
Severity: Critical / Failure / Error
Cause: The Patrol Read task has encountered an error that cannot be corrected. There may be a bad disk block that cannot be remapped.
Action: Back up your data. If you are able to back up the data successfully, fully initialize the disk and then restore from backup.
Event ID: 2273
Description: A block on the physical disk has been punctured by the controller.
Severity: Critical / Failure / Error
Cause: The controller encountered an unrecoverable medium error when attempting to read a block on the physical disk and marked that block as invalid.
Event ID: 2276
Description: The dedicated hot spare is too small.
Severity: Warning / Non-critical
Cause: The dedicated hot spare is not large enough to protect all virtual disks that reside on the disk group.
Action: Assign a larger disk as the dedicated hot spare.
Clear Alert: None
SNMP Trap Numbers: 903

Event ID: 2277
Description: The global hot spare is too small.
Severity: Warning / Non-critical
Event ID: 2278
Description: The controller battery charge level is below a normal threshold.
Severity: OK / Normal / Informational
Cause: The battery is discharging. A battery discharge is a normal activity during the battery Learn cycle. The battery Learn cycle recharges the battery. You should receive alert 2179 when the recharge occurs.
Event ID: 2280
Description: A disk media error has been corrected.
Severity: OK / Normal / Informational
Cause: A disk media error was detected while the controller was completing a background task. A bad disk block was identified. The disk block has been remapped.
Action: Consider replacing the disk.
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 1201
Event ID: 2282
Description: Hot spare SMART polling failed.
Severity: Critical / Failure / Error
Cause: The controller firmware attempted a SMART polling on the hot spare but was unable to complete it. The controller has lost communication with the hot spare.
Action: Check the health of the disk assigned as a hot spare. You may need to replace the disk and reassign the hot spare. Make sure the cables are attached securely.
Event ID: 2283
Description: A redundant path is broken.
Severity: Warning / Non-critical
Cause: The controller has two connectors that are connected to the same enclosure. The communication path on one connector has lost connection with the enclosure. The communication path on the other connector is reporting this loss.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2285 A disk media error was corrected during recovery. OK / Normal / Cause: This alert is for Clear Alert: 901 Informational informational purposes. None 2286 2287 Cause and Action Action: None Related Alert: None LRA Number: None A Learn cycle OK / Normal / Cause: This alert is for start is pending Informational informational purposes. while the Action: None battery charges.
Event ID: 2289
Description: Multi-bit ECC error on controller DIMM.
Severity: Critical / Failure / Error
Cause: An error involving multiple bits has been encountered during a read or write operation. The error correction algorithm recalculates parity data during read and write operations. If an error involves only a single bit, it may be possible for the error correction algorithm to correct the error and maintain parity data.
Event ID: 2290
Description: Single-bit ECC error on controller DIMM.
Severity: Warning / Non-critical
Cause: An error involving a single bit has been encountered during a read or write operation. The error correction algorithm has corrected this error.
Action: None
Clear Alert: None
SNMP Trap Numbers: 753

2291 2292 An enclosure management module (EMM) has been discovered.
Event ID: 2293
Description: The EMM has failed.
Severity: Critical / Failure / Error
Cause: The failure may be caused by a loss of power to the EMM. The EMM self test may also have identified a failure. There could also be a firmware problem or a multi-bit error.
Action: Replace the EMM.
Clear Alert: None
Related Alert: None
LRA Number: 2091
SNMP Trap Numbers: 854
Event ID: 2296
Description: An EMM has been inserted.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 951

Event ID: 2297
Description: An EMM has been removed.
Severity: Critical / Failure / Error
Cause: An EMM has been removed.
Action: Reinsert the EMM.

Event ID: 2298
Description: The enclosure has a bad sensor %1.
Severity: Warning / Non-critical
Event ID: 2299
Description: Bad PHY %1
Severity: Critical / Failure / Error
Cause: There is a problem with a physical connection or PHY. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the alert log and can vary depending on the situation.
Event ID: 2300
Description: The enclosure is unstable.
Severity: Critical / Failure / Error
Cause: The controller is not receiving a consistent response from the enclosure. There could be a firmware problem or an invalid cabling configuration. If the cables are too long, they degrade the signal.
Action: Power down all enclosures attached to the system and reboot the system.
Event ID: 2301
Description: The enclosure has a hardware error.
Severity: Critical / Failure / Error
Cause: The enclosure or an enclosure component is in a Failed or Degraded state.
Clear Alert: None
SNMP Trap Numbers: 854

Description: The enclosure is not responding.
Severity: Critical / Failure / Error
Cause: The enclosure or an enclosure component is in a Failed or Degraded state.
Event ID: 2304
Description: An attempt to hot plug an EMM has been detected. This type of hot plug is not supported.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None

Event ID: 2305
Description: The physical disk is too small to be used for a rebuild.
Severity: Warning / Non-critical
Cause: The physical disk is too small to rebuild the data.
Event ID: 2306
Description: Bad block table is 80% full.
Severity: Warning / Non-critical
Cause: The bad block table is used for remapping bad disk blocks. This table fills as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped, and disk errors can no longer be corrected. At this point, data loss can occur. The bad block table is now 80% full.
Event ID: 2307
Description: Bad block table is full. Unable to log block %1
Severity: Critical / Failure / Error
Cause: The bad block table is used for remapping bad disk blocks. This table fills as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped and disk errors can no longer be corrected. At this point, data loss can occur. The %1 indicates a substitution variable.
Event ID: 2309
Description: A physical disk is incompatible.
Severity: Warning / Non-critical
Cause: You have attempted to replace a disk with another disk that is using an incompatible technology. For example, you may have replaced one side of a mirror with a SAS disk when the other side of the mirror is using SATA technology.
Event ID: 2311
Description: The firmware on the EMMs is not the same version. EMM0 %1 EMM1 %2
Severity: Warning / Non-critical
Cause: The firmware on the EMM modules is not the same version. It is required that both modules have the same version of the firmware. This alert may be caused if you attempt to insert an EMM module that has a different firmware version than an existing module.
Event ID: 2313
Description: A power supply in the enclosure has a DC failure.
Severity: Warning / Non-critical
Cause: The power supply has a DC failure.
Action: Replace the power supply.
Clear Alert Number: 2323
Related Alert Number: 2122, 2322
SNMP Trap Numbers: 1003
Event ID: 2315
Description: Diagnostic message %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the utility that ran the diagnostics and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Event ID: 2318
Description: Problems with the battery or the battery charger have been detected. The battery health is poor.
Severity: Warning / Non-critical
Cause: The battery or the battery charger is not functioning properly.
Clear Alert: None
SNMP Trap Numbers: 1153

Event ID: 2319
Description: Single-bit ECC error. The DIMM is degrading.
Severity: Warning / Non-critical
Cause: The DIMM is beginning to malfunction.
Event ID: 2320
Description: Single-bit ECC error. The DIMM is critically degraded.
Severity: Critical / Failure / Error
Cause: The DIMM is malfunctioning. Data loss or data corruption may be imminent.
Action: Replace the DIMM immediately to avoid data loss or data corruption. The DIMM is a part of the controller battery pack.
Clear Alert: None
Related Alert Number: 2321
LRA Number: 2061
SNMP Trap Numbers: 754
Event ID: 2322
Description: The DC power supply is switched off.
Severity: Critical / Failure / Error
Cause: The power supply unit is switched off. Either a user switched off the power supply unit or it is defective.
Action: Check if the power switch is turned off. If it is turned off, turn it on.
Clear Alert Number: 2323
Related Alert: None
LRA Number: 2091
SNMP Trap Numbers: 1004
Event ID: 2324
Description: The AC power supply cable has been removed.
Severity: Critical / Failure / Error
Cause: The power cable may be pulled out or removed. The power cable may also have overheated and become warped and nonfunctional.
Action: Replace the power cable.

Event ID: 2325
Description: The power supply cable has been inserted.
Event ID: 2326
Description: A foreign configuration has been detected.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes. The controller has physical disks that were moved from another controller. These physical disks contain virtual disks that were created on the other controller.
Event ID: 2327
Description: The NVRAM has corrupted data. The controller is reinitializing the NVRAM.
Severity: Warning / Non-critical
Cause: The nonvolatile random access memory (NVRAM) is corrupt. This may occur after a power surge, a battery failure, or for other reasons. The controller is reinitializing the NVRAM.
Event ID: 2329
Description: SAS port report: %1
Severity: Warning / Non-critical
Cause: The text for this alert is generated by the controller and can vary depending on the situation. The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation.
Event ID: 2330
Description: SAS port report: %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Event ID: 2332
Description: A controller hot plug has been detected.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2334
Description: Controller event log: %1
Severity: OK / Normal / Informational
Cause: The %1 indicates a substitution variable.
Event ID: 2335
Description: Controller event log: %1
Severity: Warning / Non-critical
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text is from events in the controller event log that were generated while Storage Management was not running.
Event ID: 2336
Description: Controller event log: %1
Severity: Critical / Failure / Error
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text is from events in the controller event log that were generated while Storage Management was not running. This text can vary depending on the situation.
Event ID: 2337
Description: The controller is unable to recover cached data from the battery backup unit (BBU).
Severity: Critical / Failure / Error
Cause: The controller was unable to recover data from the cache. This may occur when the system is without power for an extended period of time when the battery is discharged.
Event ID: 2340
Description: The BGI completed with uncorrectable errors.
Severity: Critical / Failure / Error
Cause: The BGI task encountered errors that cannot be corrected. The virtual disk contains physical disks that have unusable disk space or disk errors that cannot be corrected.
Event ID: 2342
Description: The Check Consistency found inconsistent parity data. Data redundancy may be lost.
Severity: Warning / Non-critical
Cause: The data on a source disk and the redundant data on a target disk is inconsistent.
Clear Alert: None
SNMP Trap Numbers: 1203

Description: The Check Consistency logging of inconsistent parity data is disabled.
Event ID: 2345
Description: The virtual disk initialization failed.
Severity: Critical / Failure / Error
Cause: The controller cannot communicate with attached devices. A disk may be removed or contain errors. Cables may also be loose or defective.
Action: Verify the health of attached devices. Review the Alert Log for significant events. Make sure the cables are attached securely.
Event ID: 2346
Description: Error occurred: %1
Severity: Warning / Non-critical
Cause: A physical device may have an error. The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the alert log. This text can vary depending on the situation.
Event ID: 2347
Description: The rebuild failed due to errors on the source physical disk.
Severity: Critical / Failure / Error
Hardware RAID:
Cause: You are attempting to rebuild data that resides on a defective disk.
Action: Replace the source disk and restore from backup.
Event ID: 2348
Description: The rebuild failed due to errors on the target physical disk.
Severity: Critical / Failure / Error
Cause: You are attempting to rebuild data on a disk that is defective.
Action: Replace the target disk. If a rebuild does not automatically start after replacing the disk, initiate the Rebuild task. You may need to assign the new disk as a hot spare to initiate the rebuild.
Event ID: 2351
Description: A physical disk is marked as missing.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None.
Clear Alert Number: 2352
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 901

Event ID: 2352
Description: A physical disk that was marked as missing has been replaced.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None.
Event ID: 2354
Description: Enclosure firmware download in progress.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None
Clear Alert Status: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2355
Description: Enclosure firmware download failed.
Severity: Warning / Non-critical
Cause: The system was unable to download firmware to the enclosure.
Event ID: 2355 (continued)
Action: Attempt to download the enclosure firmware again. If problems continue, verify that the controller can communicate with the enclosure. Make sure that the enclosure is powered on. Check the cables. See the Cables Attached Correctly section for more information on checking the cables. Verify the health of the enclosure and its components.
Event ID: 2356
Description: SAS SMP communications error %1
Severity: Critical / Failure / Error
Cause: The text for this alert is generated by the firmware and can vary depending on the situation. The reference to SMP in this text refers to SAS Management Protocol.
Action: There may be a SAS topology error. See the hardware documentation for information on correct SAS topology configurations.
Event ID: 2357
Description: SAS expander error: %1
Severity: Critical / Failure / Error
Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the alert log. This text can vary depending on the situation.
Event ID: 2359
Description: Disk found is not supplied by an authorized hardware provider
Severity: Warning / Non-critical
Cause: The physical disk does not comply with the standards set and is not supported.
Action: Replace the physical disk with a physical disk that is supported.
Clear Alert: None
SNMP Trap Numbers: 903
Event ID: 2362
Description: Physical disk(s) have been removed from a virtual disk. The virtual disk is in Failed state during the next system reboot.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None.
Clear Alert: None
SNMP Trap Numbers: 751

Description: All virtual disks are missing from the controller. This situation was discovered during system startup.
Severity: OK / Normal / Informational
Cause: This alert is for informational purposes.
Action: None.
Event ID: 2367
Description: Rebuild is not possible because mixing of different media type (SSD/HDD) and bus protocols (SATA/SAS) is not supported on the same virtual disk.
Severity: Warning / Non-critical
Cause: The physical disk is using an incompatible technology.

Event ID: 2368
Event ID: 2369
Description: Virtual Disk Redundancy has been degraded.
Severity: OK / Normal / Informational
Cause: A physical disk in a RAID 6 virtual disk has either failed or been removed.
Action: Replace the missing or failed physical disk.
Event ID: 2372
Description: Attempted import of Virtual Disk exceeding the limit supported on the controller.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None.
Clear Alert: None
SNMP Trap Numbers: 751
Event ID: 2376
Description: Attempted import of Virtual Disk with stale physical disk
Severity: OK / Normal / Informational
Cause: User is attempting to import a foreign virtual disk with a stale physical disk. This alert is provided for informational purposes.
Action: None.

Event ID: 2377
Description: Attempted import of an orphan drive
Severity: OK / Normal / Informational
Cause: User is attempting to import an orphan drive.
Event ID: 2380
Description: Foreign configuration has been partially imported. Some configuration failed to import.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None.
Clear Alert: None
SNMP Trap Numbers: 751
Event ID: 2381
Description: Controller preserved cache is recovered.
Severity: OK / Normal / Informational
Cause: This alert is provided for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2382
Description: An unsupported configuration was detected.
Severity: Warning / Non-critical
Event ID: 2384
Description: The Warning level set for the hot spare protection policy is violated for the Virtual Disk.
Severity: Warning / Non-critical
Cause: The number of physical disks you specified for the hot spare protection policy is violated.

Event ID: 2385

Event ID: 2386
Event ID: 2387
Description: A virtual disk bad block medium error is detected.
Severity: Critical / Failure / Error
Cause: Virtual disk bad blocks are due to presence of unrecoverable bad blocks on one or more member physical disks.
Action:
1 Perform a backup of the virtual disk with the Verify option selected.
Event ID: 2387 (continued)
Action (continued):
2 To clear these bad blocks, execute the Clear Virtual Disk Bad Blocks task.
3 Run Patrol Read to ensure no new bad blocks are found.

Event ID: 2388
Description: The Controller Encryption Key is destroyed.
Severity: OK / Normal / Informational
Cause: The Controller Encryption Key is destroyed.
Action: None.
Event ID: 2392
Description: The drive Encryption Key is invalid.
Severity: Warning / Non-critical
Cause: The controller failed to verify the specified Passphrase.
Clear Alert: None
SNMP Trap Numbers: 753

Description: The virtual disk is encrypted.
Severity: OK / Normal / Informational
Cause: The Encrypted virtual disk operation on normal virtual disk (created using Self-encrypting disks only) is successful.
Event ID: 2396
Description: The Check Consistency detected uncorrectable multiple medium errors
Severity: Critical / Failure / Error
Cause: The Check Consistency task detects uncorrectable multiple errors.
Action: Replace the failed physical disk. You can identify the failed disk by locating the disk that has a red “X” for its status. Rebuild the physical disk.
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 1204
Event ID: 2399
Description: The Physical Disk Power status changed from %1 to %2
Severity: OK / Normal / Informational
Cause: The physical disk power status is changed from one state to another. A physical disk can have the following power statuses: spun down, transition, and spun up.
Event ID: 2403
Description: Virtual Disk is available
Severity: OK / Normal / Informational
Cause: The operating system detects the newly created virtual disk.
Action: None
NOTE: This alert also appears when a CacheCade is created but is not available for the operating system (as it is a CacheCade and not a Virtual Disk).
Event ID: 2407
Description: Controller Encryption mode is enabled in LKM
Severity: Informational
Cause: The Local Key Management (LKM) encryption mode is enabled.
Action: None

Event ID: 2411
Description: Controller LKM Encryption key is changed
Severity: Informational
Cause: Using Manage Encryption Key operations, encryption key is changed.
Event ID: 2414
Description: Controller CacheCade is deleted
Severity: Informational
Cause: This alert is provided for informational purposes.
Action: None
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2415
Description: Controller battery is discharging
Severity: Informational
Cause: The battery learn cycle has started.
Event ID: 2417
Description: There is an unrecoverable medium error detected on virtual disk
Severity: Critical / Failure / Error
Cause: Unrecoverable medium error found on one or more member physical disks of a virtual disk.
Action: Perform a backup of the virtual disk with the Verify option selected.
Clear Alert: None
Related Alert: None
LRA Number: None
SNMP Trap Numbers: 1204
Event ID: 2417 (continued)
NOTE: If the unrecoverable medium error has not been corrected, it may be reported again by the system. This error can be fixed by writing data on the affected area or by deleting and recreating the Virtual Disk, as shown in the following procedure.
1 Back up the data.
2 Delete the Virtual Disk.
3 Recreate the Virtual Disk using the same parameters, such as size, RAID level, disks, etc.
Table 3-4. Storage Management Messages (continued)

Event ID: 2426
Description: State change on Physical disk from Non-RAID to READY.
Severity: Informational
Cause: User triggered action.
Action: Configure the drive to be ready using CLI/GUI.
Related Alert Information: Clear Alert: None; Related Alert: None
SNMP Trap Numbers: 901

Description: Drive Prepared for Removal.
Severity: Informational
Cause: User triggered action.
Table 3-4. Storage Management Messages (continued)

Event ID: 2432
Description: The PCIeSSD device was found to be in security locked state. Full initialization has to be done on the security locked drive to recover the drive in usable state.
Severity: Warning
Cause: Last full initialization was stopped for some reason and hence the device is in security locked state.

Event ID: 2433

Event ID: 2434

Event ID: 2435
Table 3-4. Storage Management Messages (continued)

Event ID: 2436
Description: Physical Device is in read-only mode.
Severity: Warning
Cause: User triggered task.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 903

Event ID: 2437
Description: The physical device blink has initiated.
Severity: Informational
Cause: User triggered task.
Action: None
Related Alert Information: Clear Alert: None; Related Alert: None; LRA Number: None
SNMP Trap Numbers: 901

Event ID: 2438
Description: The physical device blink has ceased.
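The severity column of Table 3-4 can drive simple alert triage. The following is a minimal sketch, not part of Server Administrator: the names SEVERITY_RANK, STORAGE_EVENT_SEVERITY, and events_at_or_above are hypothetical, and the mapping contains only the event IDs whose severity is shown in the rows above (the "OK / Normal / Informational" and "Critical / Failure / Error" severities are shortened to "Informational" and "Critical" as an assumption).

```python
# Hypothetical triage helper; not a Server Administrator API.
SEVERITY_RANK = {"Informational": 0, "Warning": 1, "Critical": 2}

# Severities copied from the Table 3-4 rows shown above (shortened labels).
STORAGE_EVENT_SEVERITY = {
    2403: "Informational",
    2407: "Informational",
    2411: "Informational",
    2414: "Informational",
    2415: "Informational",
    2417: "Critical",
    2426: "Informational",
    2432: "Warning",
    2436: "Warning",
    2437: "Informational",
}


def events_at_or_above(event_ids, minimum="Warning"):
    """Return the event IDs whose known severity is at least `minimum`.

    IDs that are not in STORAGE_EVENT_SEVERITY are skipped, because this
    sketch has no severity information for them.
    """
    threshold = SEVERITY_RANK[minimum]
    selected = []
    for event_id in event_ids:
        severity = STORAGE_EVENT_SEVERITY.get(event_id)
        if severity is not None and SEVERITY_RANK[severity] >= threshold:
            selected.append(event_id)
    return selected


if __name__ == "__main__":
    print(events_at_or_above([2403, 2417, 2436, 9999]))
```

Skipping unknown IDs keeps the sketch short; a monitoring script would more likely report them separately so that unlisted alerts are not silently ignored.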
4 System Event Log Messages for IPMI Systems

The tables in this chapter list the system event log (SEL) messages, their severity, and cause.

NOTE: For corrective actions, see the appropriate documentation.

Temperature Sensor Events

The temperature sensor event messages help protect critical components by alerting the systems management console when the temperature rises inside the chassis.
Table 4-1. Temperature Sensor Events (continued)

Event Message: temperature sensor returned to warning state .
Severity: Warning
Cause: Temperature of the backplane board, system board, or the carrier in the specified system returned from critical state to non-critical state.

Event Message: temperature sensor returned to normal state .
Table 4-1. Temperature Sensor Events (continued)

Event Message: The temperature is within range.
Severity: Information
Cause: Temperature of the backplane, system board, system inlet, or the carrier in the specified system returned to a normal operating range.

Voltage Sensor Events

The voltage sensor event messages monitor the number of volts across critical components.
Table 4-2. Voltage Sensor Events (continued)

Event Message: voltage sensor detected a warning .
Severity: Warning
Cause: Voltage of the monitored entity exceeded the warning threshold.

Event Message: voltage sensor returned to normal .
Severity: Information
Cause: The voltage of a previously reported is returned to normal state.

Event Message: The voltage is less than the lower warning threshold.
Fan Sensor Events

The cooling device sensors monitor how well a fan is functioning. These messages provide status warning and failure messages for fans for a particular chassis.

Table 4-3. Fan Sensor Events

Event Message: Fan sensor detected a failure where is the entity that this sensor is monitoring. For example "BMC Back Fan" or "BMC Front Fan.
Severity: Critical
Table 4-3. Fan Sensor Events (continued)

Event Message: redundancy regained
Severity: Information
Cause: The fan specified by may have started functioning again and hence, the redundancy has been regained.

Event Message: Fan RPM is less than the lower warning threshold.
Severity: Warning
Cause: The speed of the specified fan might not provide enough cooling to the system.

Event Message: Fan RPM is less than the lower critical threshold.
Table 4-3. Fan Sensor Events (continued)

Event Message: Fan redundancy is lost.
Severity: Critical
Cause: One or more required fans may have failed or been removed and hence, the redundancy was lost.

Event Message: Fan redundancy is degraded.
Severity: Warning
Cause: One or more fans may have failed or been removed and hence, the redundancy has been degraded.

Processor Status Events

The processor status messages monitor the functionality of the processors in a system.
Table 4-4. Processor Status Events (continued)

Event Message: status processor sensor terminator not present.
Severity: Information
Cause: This event is generated if the terminator is missing on an empty processor slot.

Event Message: presence was deasserted.
Severity: Critical

Event Message: presence was asserted.
Severity: Information
Cause: This event is generated when the earlier processor detection error was corrected.

Event Message: thermal tripped was deasserted.
Table 4-4. Processor Status Events (continued)

Event Message: CPU terminator is present.
Severity: Information
Cause: This event is generated if the terminator is present on a processor slot.

Event Message: CPU terminator is absent.
Severity: Warning
Cause: This event is generated if the terminator is missing on an empty processor slot.

Event Message: CPU is throttled.
Severity: Warning
Cause: This event is generated when the processor slows down to prevent overheating.

Event Message: CPU is absent.
Table 4-5. Power Supply Events (continued)

Event Message: power supply sensor returned to normal state.
Cause: power supply that failed or removed was replaced and the state has returned to normal.

Event Message: PS Redundancy sensor redundancy degraded.
Severity: Information
Cause: Power supply redundancy is degraded if one of the power supply sources is removed or failed.

Event Message: PS Redundancy sensor redundancy lost.
Table 4-5. Power Supply Events (continued)

Event Message: PS 1 Status: Power supply sensor for PS 1, failure was asserted
Severity: Critical
Cause: This event is generated when the power supply has failed.

Event Message: PS 1 Status: Power supply sensor for PS 1, failure was deasserted
Severity: Information
Cause: This event is generated when the power supply has recovered from an earlier failure event.
Table 4-5. Power Supply Events (continued)

Event Message: A predictive failure detected on power supply .
Severity: Warning
Cause: This event is generated when the power supply is about to fail.

Event Message: The power input for power supply is lost.
Severity: Critical
Cause: This event is generated when input power is removed from the power supply.

Event Message: The input power for power supply has been restored.
Severity: Information
Cause: This event is generated if the power supply has been reconnected or replaced.
Table 4-5. Power Supply Events (continued)

Event Message: An over current fault detected on power supply .
Severity: Critical
Cause: The specified power supply detected an over current condition.

Event Message: Fan failure detected on power supply .
Severity: Critical
Cause: The specified power supply fan has failed.

Event Message: Communication has been restored to power supply .
Severity: Information
Cause: This event is generated when the power supply has recovered from an earlier communication problem.
Memory ECC Events

The memory ECC event messages monitor the memory modules in a system. These messages monitor the ECC memory correction rate and the type of memory events that occurred.

Table 4-6. Memory ECC Events

Event Message: ECC error correction detected on Bank # DIMM [A/B].
Severity: Information
Cause: This event is generated when there is a memory error correction on a particular Dual Inline Memory Module (DIMM).

Event Message: ECC uncorrectable error detected on Bank # [DIMM].
BMC Watchdog Events

The BMC watchdog operations are performed when the system hangs or crashes. These messages monitor the status and occurrence of these events in a system.

Table 4-7. BMC Watchdog Events

Event Message: BMC OS Watchdog timer expired.
Severity: Information
Cause: This event is generated when the BMC watchdog timer expires and no action is set.

Event Message: BMC OS Watchdog performed system reboot.
Table 4-7. BMC Watchdog Events (continued)

Event Message: The OS watchdog timer powered cycle the system.
Severity: Critical
Cause: This event is generated when the BMC watchdog detects that the system has crashed (timer expired because no response was received from Host) and the action is set to power cycle.

Event Message: The OS watchdog timer powered off the system.
Table 4-8. Memory Events (continued)

Event Message: Memory Mirrored redundancy degraded.
Severity: Warning
Cause: This event is generated when there is a memory failure in a mirrored memory configuration.

Event Message: Memory Mirrored redundancy lost.
Severity: Critical
Cause: This event is generated when redundancy is lost in a mirrored memory configuration.

Event Message: Memory Mirrored redundancy regained.
Severity: Information
Cause: This event is generated when the redundancy lost or degraded earlier is regained in a mirrored memory configuration.
Table 4-8. Memory Events (continued)

Event Message: Memory mirror is redundant.
Severity: Information
Cause: This event is generated when the memory redundancy mode has changed to mirror redundant.

Event Message: Memory mirror redundancy is lost. Check memory device at location(s) .
Severity: Critical
Cause: This event is generated when redundancy is lost in a mirrored memory configuration.

Event Message: Memory mirror redundancy is degraded. Check memory device at location .
Severity: Warning

Event Message: Memory spare is redundant.
Table 4-9. Hardware Log Sensor Events

Event Message: Log full detected.
Severity: Critical
Cause: This event is generated when the SEL device detects that only one entry can be added to the SEL before it is full.

Event Message: Log cleared.
Severity: Information
Cause: This event is generated when the SEL is cleared.

Drive Events

The drive event messages monitor the health of the drives in a system. These events are generated when there is a fault in the drives indicated.

Table 4-10.
Table 4-10. Drive Events (continued)

Event Message: Drive hot spare was deasserted
Severity: Informational
Cause: This event is generated when the drive is taken out of hot spare.

Event Message: Drive consistency check in progress was asserted
Severity: Warning
Cause: This event is generated when the drive is placed in consistency check.

Event Message: Drive consistency check in progress was deasserted
Severity: Informational
Cause: This event is generated when the consistency check of the drive is completed.

Event Message: Drive
Table 4-10. Drive Events (continued)

Event Message: Fault detected on drive .
Severity: Critical
Cause: This event is generated when the specified drive in the array is faulty.

Intrusion Events

The chassis intrusion messages are a security measure. Chassis intrusion alerts are generated when the system's chassis is opened. Alerts are sent to prevent unauthorized removal of parts from the chassis.

Table 4-11.
Table 4-11. Intrusion Events (continued)

Event Message: The chassis is closed while the power is on.
Severity: Information
Cause: This event is generated when the earlier intrusion has been corrected while the power is on.

Event Message: The chassis is open while the power is off.
Severity: Critical
Cause: This event is generated when the intrusion sensor detects an intrusion while the system is off.

Event Message: The chassis is closed while the power is off.
Table 4-12. BIOS Generated System Events (continued)

Event Message: System Event PCIE Fatal Err.
Severity: Critical
Cause: This error is generated when a fatal error is detected on the PCIE bus.

Event Message: POST Err
Severity: Critical
Cause: This event is generated when an error occurs during system boot. See the system documentation for more information on the error code.

Event Message: POST fatal error # or
Severity: Critical
Cause: This event is generated when a fatal error occurs during system boot.
Table 4-12. BIOS Generated System Events (continued)

Event Message: Memory Add (BANK# DIMM#) presence was asserted
Severity: Information
Cause: This event is generated when memory is added to the system.

Event Message: Memory Removed (BANK# DIMM#) presence was asserted
Severity: Information
Cause: This event is generated when memory is removed from the system.

Event Message: Memory Cfg Err configuration error (BANK# DIMM#) was asserted
Severity: Critical
Cause: This event is generated when memory configuration is incorrect for the system.
Table 4-12. BIOS Generated System Events (continued)

Event Message: USB Over-current transition to non-recoverable
Severity: Critical
Cause: This event is generated when the USB exceeds a predefined current level.

Event Message: Hdwr version err hardware incompatibility (BMC/iDRAC Firmware and CPU mismatch) was asserted
Severity: Critical
Cause: This event is generated when there is a mismatch between the BMC and iDRAC firmware and the processor in use or vice versa.
Table 4-12. BIOS Generated System Events (continued)

Event Message: LinkT/FlexAddr: Link Tuning sensor, device option ROM failed to support link tuning or flex address (Mezz XX) was asserted
Severity: Critical
Cause: This event is generated when the PCI device option ROM for a NIC does not support link tuning or the Flex addressing feature.

Event Message: LinkT/FlexAddr: Link Tuning sensor, failed to program virtual MAC address () was asserted.
Table 4-12. BIOS Generated System Events (continued)

Event Message: A PCI system error was detected on a component at bus device function .
Severity: Critical
Cause: This is generated when the system has crashed and recovered.

Event Message: A PCI system error was detected on a component at slot .
Severity: Critical
Cause: This is generated when the system has crashed and recovered.

Event Message: A bus correctable error was detected on a component at bus device function .
Table 4-12. BIOS Generated System Events (continued)

Event Message: A fatal IO error detected on a component at bus device function .
Severity: Critical
Cause: This error is generated when a fatal IO error is detected.

Event Message: A fatal IO error detected on a component at slot .
Severity: Critical
Cause: This error is generated when a fatal IO error is detected.

Event Message: A non-fatal PCIe error detected on a component at bus device function .
Severity: Warning
Table 4-12. BIOS Generated System Events (continued)

Event Message: Memory device at location is overheating.
Severity: Critical
Cause: This event is generated when system memory reaches critical temperature.

Event Message: An OEM diagnostic event occurred.
Severity: Information
Cause: This event is generated when an OEM event occurs. OEM events can be used by the service team to better understand the cause of the failure.

Event Message: CPU protocol error detected.
Table 4-12. BIOS Generated System Events (continued)

Event Message: A hardware incompatibility detected between BMC/iDRAC firmware and CPU.
Severity: Critical
Cause: This event is generated when there is a mismatch between the BMC and iDRAC firmware and the processor in use or vice versa.

Event Message: A hardware incompatibility was corrected between BMC/iDRAC firmware and CPU.
Severity: Information
Cause: This event is generated when an earlier mismatch between the BMC and iDRAC firmware and the processor is corrected.
POST Code Table

Table 4-13 lists the POST Code errors that are generated when a fatal error occurs during system boot.

Table 4-13. POST Code Errors

Fatal Error Code: 80
Description: No memory detected
Cause: This error code implies that no memory is installed.

Fatal Error Code: 81
Description: Memory detected but is not configurable
Cause: This error code indicates a memory configuration error that could be a result of bad memory, mismatched memory, or a bad socket.

Fatal Error Code: 82
Description: Memory configured but not usable.
Table 4-13. POST Code Errors (continued)

Fatal Error Code: C0
Description: Shutdown test failure
Cause: This error code indicates a shutdown test failure.

Fatal Error Code: C1
Description: POST Memory test failure
Cause: This error code indicates bad memory detection.

Fatal Error Code: C2
Description: RAC configuration failure
Cause: Check screen for the actual error message

Fatal Error Code: C3
Description: CPU configuration failure
Cause: Check screen for the actual error message

Fatal Error Code: C4
Description: Incorrect memory configuration
Cause: Memory population order not correct.
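A script that reads SEL entries can translate a fatal POST code into its Table 4-13 description with a simple lookup. The following is a minimal sketch under stated assumptions: POST_CODE_DESCRIPTIONS and describe_post_code are hypothetical names, and the mapping holds only the codes listed in Table 4-13 above, so any other code is reported as not listed.

```python
# Hypothetical lookup built only from the Table 4-13 rows shown above.
POST_CODE_DESCRIPTIONS = {
    "80": "No memory detected",
    "81": "Memory detected but is not configurable",
    "82": "Memory configured but not usable",
    "C0": "Shutdown test failure",
    "C1": "POST Memory test failure",
    "C2": "RAC configuration failure",
    "C3": "CPU configuration failure",
    "C4": "Incorrect memory configuration",
}


def describe_post_code(code: str) -> str:
    """Return a one-line description for a fatal POST code.

    The code is normalized (whitespace stripped, upper-cased) so that
    "c1" and " C1 " both resolve to the same entry.
    """
    key = code.strip().upper()
    description = POST_CODE_DESCRIPTIONS.get(key)
    if description is None:
        return f"POST code {key}: not listed in this sketch"
    return f"POST code {key}: {description}"


if __name__ == "__main__":
    print(describe_post_code("c1"))
    print(describe_post_code("FF"))
```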
Table 4-14. Operating System Generated Events (continued)

Description: A runtime critical stop occurred.
Severity: Critical
Cause: The operating system encountered a critical error and was stopped abnormally.

Description: An OS graceful stop occurred.
Severity: Information
Cause: The operating system was stopped.

Description: An OS graceful shut-down occurred.
Severity: Information
Cause: The operating system was shut down normally.

Cable Interconnect Events

The cable interconnect messages in Table 4-15 are used for detecting errors in the hardware cabling.

Table 4-15.
Battery Events

Table 4-16. Battery Events

Severity: Critical
Cause: This event is generated when the sensor detects a failed or missing battery.

Severity: Information
Cause: This event is generated when the earlier failed battery was corrected.

Severity: Warning
Cause: This event is generated when the sensor detects a low battery condition.

Severity: Information
Cause: This event is generated when the earlier low battery condition was corrected.
Power And Performance Events

The power and performance events are used to detect degradation in system performance with change in power supply.

Table 4-17. Power And Performance Events

Description: System Board Power Optimized: Performance status sensor for System Board, degraded, was deasserted
Severity: Normal
Cause: This event is generated when system performance was restored.
Table 4-17. Power And Performance Events (continued)

Description: System Board Power Optimized: Performance status sensor for System Board, degraded, user defined power capacity was asserted
Severity: Warning
Cause: This event is generated when a change in power supply degrades system performance.

Description: System Board Power Optimized: Performance status sensor for System Board, degraded, user defined power capacity was deasserted
Severity: Normal
Cause: This event is generated when the system performance is restored.
Table 4-17. Power And Performance Events (continued)

Description: The system performance degraded because of thermal protection.
Severity: Warning
Cause: This event is generated when a change in thermal protection degrades system performance.

Description: The system performance degraded because cooling capacity has changed.
Severity: Warning
Cause: This event is generated when a change in cooling degrades system performance.

Description: The system performance degraded because power capacity has changed.
Severity: Warning
Table 4-17. Power And Performance Events (continued)

Description: The system performance restored
Severity: Information
Cause: This event is generated when system performance was restored.

Entity Presence Events

The entity presence messages are used for detecting different hardware devices.

Table 4-18. Entity Presence Events

Severity: Information
Cause: This event is generated when the device was detected.

Severity: Critical
Cause: This event is generated when the device was not detected.
Miscellaneous

The following table lists events related to compatibility issues and to hardware and software components such as mezzanine cards, sensors, and firmware.

Table 4-19. Miscellaneous Events

Description: System Board Video Riser: Module sensor for System Board, device removed was asserted
Severity: Critical
Cause: This event is generated when the required module is removed.
Table 4-19. Miscellaneous Events (continued)

Description: Hdwar version err: Version Change sensor, hardware incompatibility (BMC firmware and CPU mismatch) was asserted
Severity: Critical
Cause: This event is generated when the CPU and firmware are not compatible.

Description: Link Tuning: Version Change sensor, successful software or F/W change was deasserted
Severity: Warning
Cause: This event is generated when the link tuning setting for proper NIC operation fails to update.
Table 4-19. Miscellaneous Events (continued)

Description: LinkT/FlexAddr: Link Tuning sensor, failed to get link tuning or flex address data from BMC/iDRAC was asserted
Severity: Critical
Cause: This event is generated when link tuning or Flex address information is not obtained from BMC/iDRAC.

Description: The is removed.
Severity: Critical
Cause: This event is generated when the device was removed.

Description: The is inserted.
Severity: Information
Cause: This event is generated when the device was inserted or installed.
Table 4-19. Miscellaneous Events (continued)

Severity: Critical
Cause: This event is generated when TXT Post failed.

Description: SINIT Authenticated Code Module detected an Intel Trusted Execution Technology (TXT) error at boot.
Severity: Critical
Cause: This event is generated when the Authenticated Code Module detected a TXT initialization failure.

Description: Intel Trusted Execution Technology (TXT) is operating correctly.
Severity: Information
Cause: This event is generated when the TXT returned from a previous failure.
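Several SEL messages in this chapter end with "was asserted" or "was deasserted", for example the PS 1 Status messages in Table 4-5. The following is a minimal sketch of how a script might separate the sensor text from the state; SelState and parse_assertion are hypothetical names, and the pattern is an assumption based only on the message wording shown in this chapter, not on a documented message grammar.

```python
import re
from typing import NamedTuple, Optional


class SelState(NamedTuple):
    """Sensor text and whether the condition was asserted (hypothetical type)."""
    subject: str
    asserted: bool


# Matches "<subject> was asserted" or "<subject> was deasserted",
# with an optional trailing period.
_ASSERTION = re.compile(
    r"^(?P<subject>.*?)\s+was\s+(?P<state>asserted|deasserted)\.?\s*$",
    re.IGNORECASE,
)


def parse_assertion(message: str) -> Optional[SelState]:
    """Split a SEL message into its subject and asserted/deasserted state.

    Returns None for messages that do not follow the pattern, such as
    "Fan redundancy is lost."
    """
    match = _ASSERTION.match(message.strip())
    if match is None:
        return None
    return SelState(match.group("subject"), match.group("state").lower() == "asserted")


if __name__ == "__main__":
    print(parse_assertion("PS 1 Status: Power supply sensor for PS 1, failure was deasserted"))
    print(parse_assertion("Fan redundancy is lost."))
```

Returning None for non-matching messages lets the caller fall back to comparing the full message text for events that do not use the asserted/deasserted wording.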