Technical Whitepaper Installation Guide for OpenManage Enterprise Power Manager A quick start whitepaper and checklist to setup OpenManage Enterprise Power Manager Abstract This technical whitepaper provides guidance for setting up Power Manager for the first-time with details on how to install it on OpenManage Enterprise and setting up Power Manager to kick start power monitoring and management for a data center.
Revisions Revisions Date Description September 2020 Initial release with Dell EMC OpenManage Enterprise 3.5 and Power Manager 1.2 versions.
Terminologies Terminologies 3 Acronym Expansion CMC Chassis Management Controller CPU Central Processing Unit EPR Emergency Power Reduction GUI Graphical User Interface iDRAC Integrated Dell Remote Access Controller I/O Input Output OME OpenManage Enterprise OME-M OpenManage Enterprise Modular QRL Quick Resource Locator RAM Random Access Memory REST Representational State Transfer SSB Small Scale Business VM Virtual Machine PMP OME Power Manager Plugin CUPs Compute Usage pe
Table of contents Table of contents Revisions............................................................................................................................................................................. 2 Acknowledgements ............................................................................................................................................................. 2 Terminologies ...................................................................................................
Executive summary Executive summary This technical whitepaper provides a guidance for setting up Power Manager by providing details on how to install this plugin on OpenManage Enterprise and the configurations required to kick start Power Manager for power saving in a data center.
Introduction 1 Introduction Power Manager is designed to support a data center administrator by providing functionalities to monitor the power, thermal and various other metrics consumption trends for servers and chassis which are part of the data center. This whitepaper provides a guidance on how to set up the appliance starting from installation, to collect and manage data and view data in a concise way.
Installation of Plugin 2 Installation of Plugin This section highlights the beginning of usage of the plugin. There are 2 major ways of installing the Power Manager Plugin – the online and the offline installation methods. 2.1 Online Installation This method is provisioned for customers having internet connectivity and no firewall restrictions on connecting to the internet. Check the following pre-requirements for installing Power Manager: • • • Check for active internet connection.
Installation of Plugin 2.2 Offline Installation This method is provisioned for customers who either have a stringent firewall in place or work completely in a dark-site and have no active connectivity to the internet. Offline method of installation requires the customer to have the plugin files downloaded and maintained in an offline share. OpenManage Enterprise supports NFS/HTTPS/HTTP share types for offline installation of the plugin.
Installation of Plugin 4. To check for a successful connection, after configuring the Update Settings section for offline updates in OpenManage Enterprise, click Application Settings Console and Plugins Update Settings Test Connection. Following are the configurations required in OpenManage Enterprise appliance. • • • Application Settings Consoles and Extensions Update Settings.
License Requirements 3 License Requirements After the plugin is installed, the next step is to import the licenses on devices to make them Power Manager capable devices. The metric collection for the devices begins after adding them to Power Manager (Monitoring List). If the license is already imported, perform a re-inventory of the system to find capable devices for power monitoring and management in Power Manager.
License Requirements License capability with chassis 11 License available on Chassis Power Manager Plugin Capability Chassis models CMC Enterprise license Monitoring Power Policy EPR Throttle PowerEdge VRTX No Yes No Yes PowerEdge VRTX Yes Yes Yes Yes PowerEdge FX2 or PowerEdge FX2s No Yes No Yes PowerEdge FX2 or PowerEdge FX2s Yes Yes Yes Yes PowerEdge M1000e NA Yes Yes Yes PowerEdge MX7000 NA Yes Yes Yes Installation Guide for OpenManage Enterprise Power Manager
Adding Devices and Groups to Monitoring List 4 Adding Devices and Groups to Monitoring List This section provides recommendation about leveraging the custom Static Group feature of OpenManage Enterprise for creation of groups for better power management of devices. This section additionally calls out method for addition of device and groups to the Monitoring List of Power Manager.
Adding Devices and Groups to Monitoring List Device Statistics of Capable and Monitored Devices 4.3 Adding Groups to Monitoring List Power Manager leverages the custom Static Group creation capability of OpenManage Enterprise in order to create the groups. Following are some of the use cases for this feature: • • Addition of empty group to Monitoring List–you can add empty groups to the Monitoring List.
Adding Devices and Groups to Monitoring List Adding a Static Group to the Monitoring List • Addition of Physical Groups to Monitoring List–Power Manager automatically adds any Physical Group created in OpenManage Enterprise, into the Monitoring list along with all its constituent Power Manager Monitoring Capable devices. - 14 There is no option to add or remove Physical Groups manually. To not monitor a physical group in Power Manager, delete the Physical Group.
Adding Devices and Groups to Monitoring List Physical Groups in OpenManage Enterprise Physical Group automatically added to Power Manager’s monitoring list 15 Installation Guide for OpenManage Enterprise Power Manager
View and Analyze Metric Data 5 View and Analyze Metric Data After the devices are added to the Monitoring List, the background task of collecting the metric data for various devices and aggregation of this data for the groups begin. This section highlights the way one can read and analyze the metric data. 5.1 Metric Data for Devices View metric data for devices by navigating to All Devices page of Open Manage Enterprise.
View and Analyze Metric Data CPU Utilization Graph for a Device Fig xx: I/O Utilization for a Device Memory Bandwidth Utilization Graph for a device 17 Installation Guide for OpenManage Enterprise Power Manager
View and Analyze Metric Data System Airflow data for a device 5.2 Metric Data for Groups View metric data for groups by navigating to All Devices page of Open Manage Enterprise. Following are the steps to view the metrics for a group added to the Monitoring List: • In OpenManage Enterprise, click a group name Group Details Metrics and Monitoring History This section lists the Power History, Thermal History and System Airflow History graphs (if constituent devices provide this data).
View and Analyze Metric Data Thermal History Graph for a group 5.3 Analyzing data using Metric Graphs The metric graphs shown for devices and groups added to Monitoring List are a convenient way to identify any outliers. Using the graph and the duration options (6 Hours, 12 Hours, 1 Day, 7 Days, 1 Month, 3 Months, 6 Months and 1 Year) you can identify any sudden spikes in the consumption that might have occurred recently or in the past.
Setting Power and Temperature Threshold 6 Setting Power and Temperature Threshold Another step in the checklist of setting up the plugin is to create Power and Temperature Thresholds for devices and groups. Thresholds provide a set of boundary conditions in order to generate events for immediate attention of a Data Center Administrator to investigate the problem.
Setting Power and Temperature Threshold Setting threshold on a Group 6.2 Event Generation Whenever any of the threshold values are violated, an alert is generated. View the alerts from OpenManage Enterprise page, Alerts Alert Log, under the System Health Category and Metrics Sub-Category. The violations of thresholds are also captured in the Audit Logs and on the dashboard of OpenManage Enterprise Home Page Under Recent Alerts section if the alerts are generated recently.
Setting Power and Temperature Threshold Alerts inside OpenManage Enterprise Alerts Log Section Audit Logs for Threshold Alerts along with their severity of violation Top Offenders list for threshold violators in Groups and Devices—the devices or groups that generate alerts in a large number are added to the Top Offenders under the Overview page in Power Manager. The top 10 violators of threshold for both devices and groups based on Temperature or Power threshold violations are added here.
Setting Power and Temperature Threshold Devices in Top 10 Offenders List 23 Installation Guide for OpenManage Enterprise Power Manager
Setting Static and Temperature-Triggered Policy 7 Setting Static and Temperature-Triggered Policy The Static and Temperature-Triggered policy feature of Power Manager provides you with management capability where you can impose certain power restrictions and apply Emergency Power Restrictions (EPR) when the configured events arise.
Setting Static and Temperature-Triggered Policy Configuring Temperature-Triggered policy: 1. In OpenManage Enterprise, click Power Management Policies Create. 2. In the policy creation wizard, choose the policy type as Temperature-Triggered, provide a name and description for the policy and click Next. 3. Select a group and click Next. 4. Set the temperature value and click Next. 5. Review the policy details and click Finish.
EPR 8 EPR The EPR feature helps reduce power consumption of devices immediately during a power emergency. These sudden power failures can cause a partial or complete center cessation and lead to major losses. In order to avoid these losses a Data Center Administrator can apply EPR to reduce the power consumption of your managed devices. There are two types of EPR options available for devices and groups—Throttle and Shut Down. • Steps to configure EPR on a device: 1. 2. 3. 4.
Reading Reports 9 Reading Reports Power Manager offers multiple reports related to the monitoring and management of data. Reports provide a single space for all this data with all the details intact.
Configuring Dashboards 10 Configuring Dashboards Reports offer an option to view all the data in a single place, however the dashboard offers a more concise and user-friendly method to view important statistics. This data related to Power Manager is rendered on the Power Manager section on the OpenManage dashboard (Home Page) and Power Management Overview page. 10.
Configuring Dashboards Dashlet for Thermal History for Group Dashlet for Top Energy Consumer 29 Installation Guide for OpenManage Enterprise Power Manager
Configuring Dashboards 10.2 Power Management Overview Page The overview page under Power Management section provides the following information: Top 10 Power Offenders–this is a table that shows the statistics of top 10 devices and groups for which the power metric has crossed the set threshold values. This table provides option to filter out the list based on Entity Type (Device or Group) and State (Unknown, Normal, Critical or Warning).
Configuring Dashboards Various dashlets on Power Manager Overview page 31 Installation Guide for OpenManage Enterprise Power Manager
Conclusion 11 Conclusion Power Manager is a powerful tool plugged in to OpenManage Enterprise which comes with a variety of features such as device power, thermal, airflow and CUPs monitoring; threshold and alerting; power capping and EPR; Built-in and custom reports; Power Manager dashboards for Top Offenders of power and temperature for groups and devices; space and power headroom charts to see availability of power and space along a physical infrastructure.
Terminologies Used 12 Terminologies Used Monitoring–this is the metric collection for various devices and groups. These metrics for devices include Maximum, Minimum, Average and Instant Power; Maximum, Minimum, Average and Instant Temperature; Maximum, Minimum and Average CPU Utilization; Maximum, Minimum and Average Memory Bandwidth Utilization; Maximum, Minimum and Average Input Output Utilization; System Airflow and Energy Consumption.
Technical Support 13 Technical Support Dell.com/support is focused on meeting customer needs with proven services and support.