Users Guide

The RESTORE ALARMS option is a convenient way to restore the default alarm configuration without uninstalling and
reinstalling the product. If any Dell EMC alarm configurations have been changed since installation, the RESTORE ALARMS
option reverts those changes.
NOTE: The events and alarms settings are not enabled after restoring the appliance. You can enable the Events and
Alarms settings again from the Settings tab.
Forecast Memory Page Retire (MPR) in OMIVV
Memory Page Retire (MPR) is a pre-failure function available in the supported PowerEdge hosts. This feature enables the host to
notify the operating system about correctable memory errors that have occurred on a memory page. At present, MPR events
are registered for all OMIVV-managed hosts.
If enough errors occur in a given sector, it can indicate that the DIMM is weakening. This can lead to an
uncorrectable error event and a potential system crash.
OMIVV accumulates the MEM0002 alerts for each DIMM as it receives them from iDRAC. Once the alerts, accumulated across
all the DIMMs in the system, reach the threshold value (14400), OMIVV displays an event on the vCenter Events page.
This monitoring feature is a close approximation of a possible MPR forecast in OMIVV. For more information about calculating
the threshold value setting, see Calculate threshold setting on page 104.
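The accumulation described above can be pictured with a minimal sketch. This is not OMIVV code: the class name, method names, and DIMM identifiers are hypothetical and chosen only for illustration. The sketch assumes that each MEM0002 alert adds to a per-DIMM count and that the sum across all DIMMs is compared with the threshold.

```python
from collections import defaultdict

MPR_FORECAST_THRESHOLD = 14400  # threshold value stated in this guide


class MprForecastCounter:
    """Hypothetical model of the accumulation described above; not OMIVV code."""

    def __init__(self, threshold=MPR_FORECAST_THRESHOLD):
        self.threshold = threshold
        # DIMM identifier -> number of MEM0002 alerts received for that DIMM
        self.per_dimm = defaultdict(int)

    def total(self):
        """Return the alert count accumulated across all DIMMs in the system."""
        return sum(self.per_dimm.values())

    def record_alert(self, dimm_id, count=1):
        """Add alerts for one DIMM; return True once the system-wide total reaches the threshold."""
        self.per_dimm[dimm_id] += count
        return self.total() >= self.threshold


counter = MprForecastCounter()
counter.record_alert("DIMM.Socket.A1", 9000)            # illustrative DIMM identifier
reached = counter.record_alert("DIMM.Socket.B2", 5400)  # illustrative DIMM identifier
print(counter.total(), reached)  # prints: 14400 True
```

In this model no single DIMM reaches 14400 alerts, but the system-wide total does, which is the condition under which the event would be displayed.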
To post the alarm notification, enable the Enable Memory Page Retire alarm for all hosts option on the Events and Alarms
page of OMIVV. For more information, see Configure events and alarms on page 103.
When the threshold for correctable memory errors is reached and the MPR forecast alarm is enabled, the host is moved to
maintenance mode.
NOTE: The MPR feature is not supported for PowerEdge MX hosts managed using a chassis credential profile with unified IP.
Calculate threshold setting
This threshold value (14400) is based on the default page size of 1 MB (the default configuration in ESXi 6.7 and later).
The forecasted MPR event is generated after 60% of the correctable error count is reached. MPR occurs at 96 correctable
errors per 4 KB page; for the 1 MB page size, 60% of the corresponding correctable error count is 14400.
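The arithmetic behind the threshold can be checked with a short worked example. The figures of 96 errors per 4 KB page, 60%, and 14400 come from this guide. The factor of 250 is an assumption: it is the number of 4 KB pages per 1 MB that reproduces 14400 (that is, 1 MB counted as 1000 KB). Counting 1 MB as 1024 KB (256 pages) would give roughly 14746 instead.

```python
ERRORS_PER_4KB_PAGE = 96   # from this guide
FORECAST_PERCENT = 60      # from this guide
PAGES_PER_1MB = 250        # assumption: 1 MB counted as 1000 KB, divided into 4 KB pages

# Correctable errors that correspond to MPR for a 1 MB page.
errors_per_1mb_page = ERRORS_PER_4KB_PAGE * PAGES_PER_1MB

# Forecast threshold: 60% of that count, using integer arithmetic.
threshold = errors_per_1mb_page * FORECAST_PERCENT // 100

print(errors_per_1mb_page, threshold)  # prints: 24000 14400
```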
The count starts when the host is added to a host credential profile. The count is reset when the threshold is reached or
OMIVV is restarted.
NOTE: Whenever OMIVV resets or restarts, the count is reinitialized to zero. This results in a less accurate forecast
of the MPR occurrence event.
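The effect of a restart on the forecast can be illustrated with a small simulation. This is a sketch under stated assumptions, not OMIVV behavior captured from the product: the function name, the batch sizes, and the `restart_after` parameter are hypothetical. It only models the two reset conditions described above.

```python
THRESHOLD = 14400  # threshold value stated in this guide


def events_posted(alert_batches, restart_after=None):
    """Count forecast events for a sequence of alert batches.

    restart_after is a hypothetical batch index after which the appliance restarts.
    """
    count = 0
    posted = 0
    for index, batch in enumerate(alert_batches):
        count += batch
        if count >= THRESHOLD:
            posted += 1
            count = 0  # the count is reset when the threshold is reached
        if restart_after is not None and index == restart_after:
            count = 0  # the count is reinitialized when OMIVV restarts
    return posted


batches = [5000, 5000, 5000]
print(events_posted(batches))                   # prints: 1
print(events_posted(batches, restart_after=1))  # prints: 0
```

With the same 15000 alerts, the run that includes a restart never reaches the threshold, because the alerts received before the restart are no longer counted.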
View chassis events
Steps
1. In vSphere Client, expand Menu, and then select Hosts and Clusters.
2. In the left pane, select an instance of vCenter.
3. In the right pane, click Monitor > Tasks and Events > Events.
4. To view more information, select a specific event.
NOTE: For a PowerEdge MX chassis with MCM configuration, the source of the event is displayed as the lead chassis.
However, the message details include the Service Tag of the member chassis for identification.
NOTE: When a PowerEdge MX host is managed using a chassis credential profile with unified IP, a generic event message
(for example, MM generic critical system alert) is displayed for all the host events. In this case, check the specific
component health changes in OME-Modular.