Technical Document Benchmark the Performance, Reliability and Scalability of Dell EMC OpenManage Enterprise Power Manager 2.0 in Data Center Environment Abstract This technical white paper provides information about the best practices that you can follow while using Dell EMC OpenManage Enterprise Power Manager 2.0 to optimize the usage of supported devices in a data center environment.
Revisions Revisions Date Description August 2021 Scalability whitepaper for Power Manager 2.0 release with OpenManage Enterprise version 3.6.1. Acknowledgements This technical white paper is documented by the following members of the Dell EMC Server Engineering Team. Authors: Ashish Singh, Aarsi Kumar, Jimmy Muraleedharan Support: Shruthi Ravoor The information in this publication is provided “as is.” Dell Inc.
Table of contents Table of contents Revisions............................................................................................................................................................................. 2 Acknowledgements ............................................................................................................................................................. 2 Table of contents ...............................................................................................
Table of contents 8.3 9 Test Results for EPR actions ............................................................................................................................22 Alert Thresholds .........................................................................................................................................................23 9.1 Configuration Details and Generic Recommendations ....................................................................................23 9.1.
Executive Summary Executive Summary This technical whitepaper provides an overview about the performance, reliability, and scalability testing of Power Manager version 2.0 deployed on OpenManage Enterprise version 3.6.1.
Introduction 1 Introduction Dell EMC OpenManage Enterprise Power Manager is a plugin to Dell EMC OpenManage Enterprise (OME) console which uses fine-grained instrumentation to provide increased visibility over power consumption, anomalies, and utilization. Also, Power Manager alerts and reports on power and thermal events with servers, chassis, and groups consisting of servers and chassis.
Generic Recommendations 2 Generic Recommendations Power Manager features are similar to OpenManage Enterprise that are dependent on the underlying hardware configurations and the resources allocated to the hosted Virtual Machine. To optimize the setup and task scheduling in Power Manager, prioritize the hardware allocation using the following guidelines: Memory: Memory is a gating factor to a number of tasks which can be run simultaneously.
Generic Recommendations Minimum system configuration requirements in scale environment 8 Benchmark the Performance, Reliability and Scalability of Dell EMC OpenManage Enterprise Power Manager 2.
Test Environment and Methodology 3 Test Environment and Methodology The scalability, performance, reliability, and longevity tests for Power Manager Plugin 2.0 were performed on the following environments: Small Business Customer’s Environment: This environment consists of 1000 managed systems in OpenManage Enterprise and the same set of devices monitored in Power Manager. CONFIG-1 has 1000 devices monitored in Power Manager plugin as part of Individual Devices, Static Groups and Physical Groups.
Scalability and Performance tests for Power Manager 2.0 4 Scalability and Performance tests for Power Manager 2.0 The primary focus of scalability and performance test was to collect and analyze various performance parameters of Power Manager version 2.0 for the following features: • • • • Metric Collection: Metric gathering for Power and Thermal History, CPU Utilization, I/O Utilization, Memory Utilization and System Air Flow History.
Addition of devices and groups in Power Manager 5 Addition of devices and groups in Power Manager 5.1 Configuration Details and Generic Recommendations 5.1.1 Configuration Details Add severs and chassis to Power Manager through Add Device(s) or Add Group(s) options in Plugins > Power Management > Power Manager Devices tab. Following test results include addition and removal of devices and groups in Power Manager for the test environments listed in Table 1 and recording the performance data.
Addition of devices and groups in Power Manager 5.2 Test Results in Scaled infrastructure (Devices and Groups) 5.2.1 Addition of devices and groups in Power Manager 5.2.2 Below mentioned test results are on updating the list of devices and groups in Power Manager after clearing browser cache in Google Chrome version 91.0.4472.164.
Metric Collection 6 Metric Collection Metric details are captured from monitored servers, chassis, PDU’s and virtual machines at regular intervals of 15, 30, or 60 minutes as per the configured Power Manager Settings.
Metric Collection 6.2 Test Results 6.2.1 Small Deployment - CONFIG 1 Metric Collection results for Small Deployment – Config1 Devices in Configuration Power Name Manager Servers Chassis PDUs VMs Metric Interval Average Time Taken CONFIG-1 500 80 20 400 15 min 3 min 1000 CPU Utilization for Small Deployment for 4 hours Memory Utilization for Small Deployment for 4 hours 14 Benchmark the Performance, Reliability and Scalability of Dell EMC OpenManage Enterprise Power Manager 2.
Metric Collection 6.2.
Metric Collection Memory Utilization for Large Deployments (Configurations 2 and 3) for 4 hours 6.2.
Metric Collection CPU Utilization for Large Deployments (Configurations 4 and 5) for 4 hours Memory Utilization for Large Deployments (Configurations 4 and 5) for 4 hours 17 Benchmark the Performance, Reliability and Scalability of Dell EMC OpenManage Enterprise Power Manager 2.
Power Policy (Power Capping) 7 Power Policy (Power Capping) 7.1 Configuration Details and Generic Recommendations 7.1.1 Configuration Details 7.1.2 Recommendations Create and apply power policies for regulating the power consumption of a device or a group of devices. Power Manager supports applying a Power Policy on a single target device or single static or physical group at a time from the policy creation wizard.
Power Policy (Power Capping) 7.3 Test Results for Power Policy Actions Test Results for Power Policy related actions for Heterogeneous Large Business Configuration Devices in Power Manager Tested Devices Action Time Taken 8000 400 Servers + 30 Chassis Create policies 1 m 52 s 8000 400 Servers + 30 Chassis Disable policies 1m9s 8000 400 Servers + 30 Chassis Enable policies 1 m 14 s 8000 400 Servers + 30 Chassis Delete policies 1m7s Process(taskexecutionservice)\CPU Utilization 180 160
Power Policy (Power Capping) Process(taskexecutionservice)\Totalmemory(kb) 3790000 3785000 3780000 3775000 3770000 3765000 3760000 3755000 3750000 3745000 Disable Policy Enable Policy Delete Policy Create Policy Memory Utilization for Power Policy related actions 20 Benchmark the Performance, Reliability and Scalability of Dell EMC OpenManage Enterprise Power Manager 2.
Emergency Power Reduction 8 Emergency Power Reduction Emergency Power Reduction (EPR) is a feature in Power Manager which helps you to reduce power consumption of devices immediately during a power emergency. Apply EPR on Individual Devices using the Apply EPR option that is available in Power Management and Monitoring > Policies and EPR tab. And, apply EPR on Static and Physical Groups using the Apply EPR option that is available in Group Details > Policies and EPR tab.
Emergency Power Reduction 8.3 Test Results for EPR actions Test Results for EPR actions for Heterogeneous Large Business Configuration Devices in Power Tested Devices EPR Action Manager Time Taken 8000 400 Servers Enable EPR Throttle 8 m 30 s 8000 400 Servers Disable EPR Throttle 5 m 42 s 8000 400 Servers Enable EPR Shutdown 8 m 18 s 8000 400 Servers Disable EPR Shutdown 8 m 14 s Process(taskexecutionservice)\CPU Utilization 4000000 3500000 3000000 2500000 2000000 1500000 1000000 50000
Alert Thresholds 9 Alert Thresholds Power Manager leverages the Events and Alerts feature in OpenManage Enterprise and supports different alerts such as Threshold violation alerts, Group membership change alerts, Policy Violation alerts and so on. Power and Temperature Threshold violation alerts are used to notify the administrator when the power consumption and temperature has exceeded certain configured values so that the administrator can take immediate actions. 9.
-07-2021_04:13:10 24-07-2021_04:16:45 24-07-2021_04:20:19 24-07-2021_04:23:53 24-07-2021_04:27:27 24-07-2021_04:31:02 24-07-2021_04:34:36 24-07-2021_04:38:10 24-07-2021_04:41:44 24-07-2021_04:45:18 24-07-2021_04:48:53 24-07-2021_04:52:27 24-07-2021_04:56:02 24-07-2021_04:59:36 24-07-2021_05:03:08 24-07-2021_05:06:43 24-07-2021_05:10:17 24-07-2021_05:13:52 24-07-2021_05:17:26 24-07-2021_05:21:00 24-07-2021_05:24:34 24-07-2021_05:28:08 24-07-2021_05:31:42 24-07-2021_05:35:16 24-07-2021_05:38:51 24-07-2021_0
Importing Physical Groups and Device Association from a CSV file 10 Importing Physical Groups and Device Association from a CSV file Creation of physical group hierarchies for existing devices in a data center can be a time-consuming activity and hence, Power Manager facilitates importing the devices as physical groups through a csv file. 10.1 Configuration Details and Generic Recommendations 10.1.
Importing Physical Groups and Device Association from a CSV file Hierarchy created after Importing Physical Groups and Device Association from CSV file 10.1.2 Recommendations After analyzing the test results, following are the recommendations for utilizing the feature to Import Physical Groups from CSV file in Power Manager. • • • • For better performance, the physical hierarchies and device associations can be split into multiple files and imported as different tasks one after the other.
24-07-2021_21:13:02 24-07-2021_21:18:53 24-07-2021_21:24:45 24-07-2021_21:30:39 24-07-2021_21:36:31 24-07-2021_21:42:22 24-07-2021_21:48:11 24-07-2021_21:54:00 24-07-2021_21:59:47 24-07-2021_22:05:34 24-07-2021_22:11:05 24-07-2021_22:16:57 24-07-2021_22:22:47 24-07-2021_22:28:36 24-07-2021_22:34:23 24-07-2021_22:40:11 24-07-2021_22:45:58 24-07-2021_22:51:46 24-07-2021_22:57:32 24-07-2021_23:03:26 24-07-2021_23:09:00 24-07-2021_23:14:36 24-07-2021_23:20:20 24-07-2021_23:26:09 24-07-2021_23:32:01 24-07-202
Plugin Actions (Install, Disable, Enable, Uninstall) 11 Plugin Actions (Install, Disable, Enable, Uninstall) 11.1 Configuration Details and Generic Recommendations 11.1.1 Configuration Details 11.1.2 Tests are run on a Small Business Environment with 1000 devices in Open Manage Enterprise and the same is monitored in Power Manager as part of 50 static and physical groups and on Enterprise Business Environment with 8000 devices monitored in Power Manager as part of 500 nested groups.
Plugin Actions (Install, Disable, Enable, Uninstall) Uninstall 29 1000 1000 25 1000 1000 2m 12s 8000 8000 500 4000 3000 2m 25s Benchmark the Performance, Reliability and Scalability of Dell EMC OpenManage Enterprise Power Manager 2.
Longevity Test Results 12 Longevity Test Results 12.1 Metric Collection 12.1.1 Configuration Details The primary goal for running these tests is to validate the reliability of Power Manager with OpenManage Enterprise when the appliance is utilized continuously for a defined period. Longevity test was run for a duration of 10 days.
Longevity Test Results • Clear the browser cache to yield better results. The following graphs represent the Total CPU and Total Memory utilization of the appliance during the monitored period of longevity testing. 12.3.1 12.3.2 Disk Usage Disk Usage of all tested configurations (Longevity) Deployment Type Configuration Name Devices in Power Servers Chassis Manager PDU VMs Monitored Groups Disk Usage Small Business CONFIG-1 1000 500 80 20 400 200 38.
Longevity Test Results 12.3.3 Average CPU Utilization and Memory Utilization for Large Deployments Total CPU and RAM Utilization for Metric Collection (Longevity) in CONFIG-4 Total CPU and RAM Utilization for Metric Collection (Longevity) in CONFIG-5 32 Benchmark the Performance, Reliability and Scalability of Dell EMC OpenManage Enterprise Power Manager 2.
Troubleshooting 13 Troubleshooting 13.1 Metric Collection happening at non-regular intervals in a scaled datacenter environment Issue In the Metrics and Monitoring graphs for Individual Devices and Static Groups, the metric collection has happened at non-regular intervals when compared to the configuration set in Power Manager Setting for Metric Gathering Interval.
Troubleshooting 13.2 Association of devices to Physical Groups may fail when trying to create more than 100 groups using csv import method Issue When you import a CSV file with more than 100 groups and devices, the group creation is successful, but the association of devices to the groups may fail, with the following error message on the Jobs page under Physical Group CSV Import Task job: “Unable to complete the action because the value entered for GroupId is invalid.
Conclusion 14 35 Conclusion Dell EMC provides its customers with products that simplify and streamline their IT processes, and freeing administrator’s time to focus on activities that help grow the business. This technical white paper provides comprehensive information about the test environment, recommendations, and test results in the form of neatly arranged infographics to make it easy for reading, scanning, and understanding.
Technical Support and Resources. 15 Technical Support and Resources. • • • 15.1 Related Resources • • • • • 36 Dell.com/support is focused on meeting customer needs with proven services and support. To watch quick and short videos about handling the PowerEdge server components, visit the QRL video website. Storage technical documents and videos provide expertise that helps to ensure customer success on Dell EMC storage platforms.