Dell Server PRO Management Pack Version 4.
Notes, Cautions, and Warnings NOTE: A NOTE indicates important information that helps you make better use of your computer. CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the problem. WARNING: A WARNING indicates a potential for property damage, personal injury, or death. Copyright © 2014 Dell Inc. All rights reserved. This product is protected by U.S. and international copyright and intellectual property laws.
Contents 1 Introduction........................................................................................................... 4 What's New in this Release................................................................................................................... 4 Overview................................................................................................................................................ 5 Related Terms...................................................................
Introduction 1 This document is intended for system administrators who use the Dell Server PRO Management Pack (Dell PRO Pack) to monitor Dell systems and take remedial action when an inefficient system is identified. The Dell PRO Pack version 4.
Overview Operations Manager uses PRO-enabled Management Pack to collect and store information on Dell hardware along with a description of their health status. Dell PRO Pack works with Operations Manager and VMM 2012 to manage Dell physical devices and their hosted virtual machines (VMs) using this available health information.
• Generates PRO Tip when the monitored hardware moves to an unhealthy state. • Performs VM live migration with no downtime. For more information, see VM Live Migration. • Overrides Dell PRO Pack default recovery actions. For more information, see Overriding Recovery Actions. • Minimizes downtime by implementing the remedial action provided on PRO Tips.
Table 1. Sequence Number and Events Sequence Number Event 1 Operations Manager agents on the host are enabled to detect the warning, error, or failure alerts that are generated by OMSA. 2 Alert is sent to Operations Manager. 3 Operations Manager console displays active PRO alerts. 4 Operations Manager notifies the alert and the associated PRO Tip ID to VMM. 5 VMM displays a corresponding entry in the PRO Tip window with remedial action.
Using Dell Performance Resource Optimization Pack 2 This chapter suggests steps to use PRO Pack. Planning the Environment for PRO Tips You can plan for enabling the PRO Monitors that are relevant for the environment. By default, all the PRO Monitors are disabled in the Dell PRO Pack. For the list of alerts and the recovery actions, see Alerts and Recovery Actions. Select the alerts that you want to enable.
Alternatively, if you select the Show this window when new PRO Tips are created option in the PRO Tip window, the window automatically opens on the VMM console when a PRO Tip is generated. The PRO Tip window displays information such as source, tip, and state of the PRO Tip in a tabular format. The window also displays description of the problem that triggered the alert, the cause, and the suggested remedial action for recovery.
status. It does not migrate VMs with status such as Stop, Pause, and Saved. This is based on the star rating of the associated servers. After you successfully implement the recovery task, the following changes take place: • The status of PRO Tip changes to Resolved and the PRO Tip entry moves out of the PRO Tip window. • Corresponding alert disappears in the Operations Manager Alert View. • An entry is displayed in the Jobs section on the VMM console.
The difference in quick migration and live migration is that there is a downtime in quick migration whereas; there is no downtime in live migration. NOTE: Windows Server 2008 Hyper-V supports quick migration. Windows Server 2008 R2 Hyper-V supports both quick migration and live migration. Monitoring Using PRO Specific Alerts on Operations Manager You can monitor the physical devices in your network using the Operations Manager console.
Using Health Explorer to Reset Alerts Health Explorer enables you to view and take action on alerts. When you select Dismiss in the PRO Tip window, the alert is removed from it. To manually reset the alert: 1. On the Actions menu, click Health Explorer. 2. Right-click the alert that you want to close. 3. Select Reset Health. The alert disappears from the PRO Tip window. Overriding Recovery Actions PRO Pack 4.0 supports two recovery actions.
NOTE: When you select Enable, Operations Manager performs an auto-implementation for the unit monitor. Since this involves VMM migration, review and set the values accordingly. 8. Select the Enforce option. 9. Click Apply CAUTION: Saving the settings in the default management pack, creates a dependency between PRO Pack and the management pack. When you remove or delete PRO Pack, you must delete the default management pack as well, as it contains default settings for Operations Manager.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action configured, the operating system shuts down and the system powers off. This event may also be initiated on certain systems when a fan enclosure is removed from the system for an extended period of time.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action value information is provided. 1104;5104 Fan sensor detected a failure value Error A fan sensor in the specified system detected the failure of one or more fans. Restrict 1105;5105 Fan sensor detected a nonrecoverable value Error A fan sensor Restrict detected an error from which it cannot recover.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action value information is provided. 1203;5203 Current sensor Warning detected a warning value A current sensor in Restrict the specified system exceeded its warning threshold value. 1204;5204 Current sensor detected a failure value Error A current sensor in Restrict and Migrate the specified system exceeded its failure threshold value.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action system was operating. The sensor location, chassis location, previous state, and chassis intrusion state information is provided. 1305;5305 Redundancy degraded Warning A redundancy Restrict sensor in the specified system detected that one of the components of the redundancy unit has failed but the unit is still redundant.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM 1353;5353 Alert Cause Dell PRO Tip Recommended Remedial Action Power supply Warning detected a warning A power supply sensor reading in the specified system exceeded definable warning threshold. Restrict 1354;5354 Power supply detected a failure Error A power supply has been disconnected or has failed.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action because of an irrecoverable error. 1453;5453 Fan enclosure removed from system Warning A fan enclosure has Restrict been removed from the specified system. The sensor and chassis location information is provided.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action classified as an error. The sensor and chassis location information is provided. 1505;5505 AC power cord sensor in the system failed 1603;5603 Processor sensor Warning detected a warning value A processor sensor Restrict in the specified system is in a throttled state.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action previous state and processor sensor status are provided. 1703;5703 Battery sensor Warning detected a warning value A battery sensor in the specified system detected that a battery is in a predictive failure state. Restrict 1704;5704 Battery sensor detected a failure value Error A battery sensor in the specified system detected that a battery has failed.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 2056 Virtual Disk Failed Critical One or more physical disks included in the virtual disk have failed. Restrict and Migrate 2057 Virtual Disk Degraded Warning Warning This alert message occurs when a physical disk included in a redundant virtual disk fails.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action the excessive temperature. 2103 Temperature dropped below the Minimum Failure Threshold Critical The physical disk enclosure is too cool. Restrict and Migrate 2112 Enclosure shutdown Critical The physical disk enclosure is either hotter or cooler than the maximum or minimum allowable temperature range.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action battery may have been already recharged the maximum number of times. In addition, the battery charger may not be working. 2171 The controller battery temperature is above normal Warning The room Restrict temperature may be too hot. The system fan may also be degraded or failed.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action a disk that is assigned as a dedicated hot spare. 2206 The only hot spare available is a SATA disk. SATA disks cannot replace SAS disks Warning The only physical disk available to be assigned as a hot spare is using SATA technology. Restrict 2207 The only hot spare available is a SAS disk.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause 2268 Storage Management communication Error Critical Storage Restrict and Migrate Management has lost communication with a controller. This may occur if the controller driver or firmware is experiencing a problem. 2272 Patrol Read found an uncorrectable media error Critical The Patrol Read Restrict and Migrate task has encountered an error that cannot be corrected.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action that are connected to the same enclosure. 2289 Multi-bit ECC error Critical on controller DIMM An error involving multiple bits has been encountered during a read or write operation. 2290 Single-bit ECC error on controller DIMM Warning An error involving a Restrict single bit has been encountered during a read or write operation.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action Failed or Degraded state. 2302 The enclosure is not responding Critical The enclosure or an Restrict and Migrate enclosure component is in a Failed or Degraded state. 2306 Bad block table is 80% full Warning The bad block table is the table used for remapping bad disk blocks. This table fills as bad disk blocks are remapped.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 2318 Problems with the battery or the battery charger have been detected. The battery health is poor. Warning The battery or the battery charger is not functioning properly. Restrict 2319 Single-bit ECC Warning error on controller DIMM. The DIMM is degrading. The dual in-line Restrict and Migrate memory module (DIMM) is beginning to malfunction.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action for other reasons. The controller is reinitializing the NVRAM. 2328 The NVRAM has corrupt data Warning The NVRAM has corrupt data. The controller is unable to correct the situation. 2329 SAS port report Warning The text for this Restrict and Migrate alert is generated by the controller and can vary depending on the situation.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity reassigned during a write operation Alert Cause Dell PRO Tip Recommended Remedial Action contains bad disk blocks that could not be reassigned. Data loss may have occurred. 2350 There was an unrecoverable disk media error during the rebuild or recovery operation Critical The rebuild or recovery operation encountered an unrecoverable disk media error. Restrict 2355 Enclosure firmware Warning download failed.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action unrecoverable bad blocks on one or more member physical disks. 2396 The Check Consistency detected uncorrectable multiple medium errors Critical Medium errors in the physical drives. Restrict 2397 The Check Consistency completed with uncorrectable errors Critical Medium errors in the physical drives.
Related Documentation and Resources 3 This chapter gives the details of documents and resources to help you work with the Pro Pack 4.0. Security Considerations Operations Console access privileges are handled internally by Operations Manager. You can setup this using the User Roles option under Administration Security feature on the Operations Manager console. The profile of the role assigned to you determines what actions you can perform and which objects you are able to manage.
Contacting Dell 4 NOTE: If you do not have an active Internet connection, you can find contact information on your purchase invoice, packing slip, bill, or Dell product catalog. Dell provides several online and telephone-based support and service options. Availability varies by country and product, and some services may not be available in your area. To contact Dell for sales, technical support, or customer service issues: 1. Go to dell.com/support. 2. Select your support category. 3.
Accessing documents from Dell support site 5 You can access the required documents in one of the following ways: • Using the following links: – For all Enterprise Systems Management documents — dell.com/softwaresecuritymanuals – For Enterprise Systems Management documents — dell.com/openmanagemanuals – For Remote Enterprise Systems Management documents — dell.com/esmmanuals – For OpenManage Connections Enterprise Systems Management documents — dell.