Owners Manual
Table Of Contents
- Dell EMC PowerVault ME4 Series Storage System Owner’s Manual
- Contents
- Storage system hardware
- Troubleshooting and problem solving
- Module removal and replacement
- ESD precautions
- Dealing with hardware faults
- Firmware updates
- Continuous operation during replacement
- Shutting down attached hosts
- Shutting down a controller module
- Verifying component failure
- Customer-replaceable units (CRUs)
- Attach or remove the front bezel of a 2U enclosure
- Replacing a drive carrier module in a 2U enclosure
- Replacing a DDIC in a 5U enclosure
- Replacing a controller module or IOM in a 2U or 5U enclosure
- Replacing a power supply unit (PSU) in a 5U enclosure
- Replacing a fan cooling module (FCM) in a 5U enclosure
- Replacing a power cooling module (PCM) in a 2U enclosure
- Completing the component installation process
- Verifying component operation
- Performing updates in PowerVault Manager after replacing an FC or SAS HBA
- Events and event messages
- Connecting to the CLI port using a serial cable
- Technical specifications
- Standards and regulations
Troubleshooting and problem solving
These procedures are intended to be used only during initial configuration, for the purpose of verifying that hardware setup is
successful. They are not intended to be used as troubleshooting procedures for configured systems using production data and
I/O.
Topics:
• Overview
• Fault isolation methodology
• LEDs
• Troubleshooting 2U enclosures
• Troubleshooting 5U enclosures
• Temperature sensors
• Host I/O
Overview
The enclosure system includes a Storage Enclosure Processor (SEP) and associated monitoring and control logic to enable it
to diagnose problems with the enclosure’s power, cooling, and drive systems. Management interfaces allow for provisioning,
monitoring, and managing the storage system.
NOTE: See Fault isolation methodology on page 30 when conducting system diagnostics.
Fault isolation methodology
Dell EMC PowerVault ME4 Series Storage Systems provide many ways to isolate faults. This section presents the basic
methodology used to locate faults within a storage system, and to identify the pertinent CRUs affected.
Use the PowerVault Manager to configure and provision the system upon completing the hardware installation. As part of this
process, configure and enable event notification so the system will notify you when a problem occurs that is at or above the
configured severity (see the topic about configuring event notification within the Dell EMC PowerVault ME4 Series Storage
System Administrator’s Guide). With event notification configured and enabled, you can follow the recommended actions in the
notification message to resolve the problem, as further discussed in the options presented in the following section.
Fault isolation methodology basic steps
Following is a summary of the basic steps used to perform fault isolation and troubleshooting:
● Gather fault information, including using system LEDs as described in Gather fault information
● Determine where in the system the fault is occurring as described in Determine where the fault is occurring.
● Review event logs as described in Review the event logs.
● If required, isolate the fault to a data path component or configuration as described in Review the event logs .
Options available for performing basic steps
When performing fault isolation and troubleshooting steps, select the option or options that best suit your site environment.
Use of any option (four options are described below) is not mutually exclusive to the use of another option. You can use
the PowerVault Manager to check the health icons/values for the system and its components to ensure that everything is
okay, or to drill down to a problem component. If you discover a problem, either the PowerVault Manager or the CLI provide
recommended-action text online. Options for performing basic steps are listed according to frequency of use:
2
30 Troubleshooting and problem solving