The present disclosure relates to virtualized data center energy management. More specifically, the present disclosure relates to methods and apparatus for coordinated virtualized data center energy management at cluster-level server on-off (consolidation) management, and server-level DVFS (dynamic voltage and frequency scaling) management.
The annual data center energy consumption in the US is estimated to grow to over 100 billion kWh at a cost of $7.4 billion by 2011. Another trend analysis estimates by 2014, Infrastructure and Energy costs would contribute about 75% while IT would contribute a significantly smaller 25% towards the overall total cost of operating a data center. Increasing the actual amount of computing work completed in the data center relative to the amount of energy used, is an urgent need in the new green IT initiative.
Both server consolidation and server features like voltage and frequency scaling can have a significant effect on overall data center performance per watt. Server consolidation is based on the observation that many enterprise servers do not utilize the available server resources maximally all of the time, and virtualization technologies facilitate consolidation of several physical servers onto a single high end system for higher resource utilization. Modem server power control features like Dynamic voltage and frequency scaling (DVFS) drive the server performance approximating energy-proportional computing by adapting processor operating frequency (voltage) upon run-time workload. The problem to be solved is designing an efficient solution for managing a virtualized data center with reduced energy consumption by utilizing both cluster-level server on-off energy management, and local DVFS control for server-level energy management.
On server level power control, there are independent or cooperative DVFS techniques. On cluster level energy management, there are dynamic server/load consolidation methods. While the solutions at each level may be combined to run a virtualized data center, they may interfere with each other, and a arbitrary combination of two solutions do not necessarily utilize their full capability. No energy management solution is proposed to explicitly manage the two levels in a coordinated approach.
A method for coordinating energy management in a virtualized data center comprising a plurality of physical servers and a plurality of virtual machines, is disclosed herein. The method comprises analyzing in a computer process, status information about the virtualized data center; from the analyzed status information, determining in a computer process, server utilization target settings for server consolidation; and executing the server consolidation in a computer process, according to the determined server utilization target settings.
Also disclosed herein is a system for coordinating energy management in a virtualized data center comprising a plurality of physical servers and a plurality of virtual machines. The system comprises a processor executing instructions for analyzing status information about the virtualized data center; determining server utilization target settings for server consolidation, from the analyzed status information; and executing the server consolidation according to the determined server utilization target settings.
The present disclosure presents a Coordinated Energy Management (COEM) system. The COEM system enables lower energy costs in running a virtualized data center, and allows the enforcement of a probabilistic service level agreement (SLA) defining server overloading probability.
The COEM system 10 coordinates two-level energy management through two processes. In the first process, the system status analysis component 11 decides the server utilization target settings for the server consolidation process, which is executed by the VM placement decision component 12. After that, the VM placement decision component 12 decides the energy-related management target, such as server on-off or consolidation control or DVFS control target setting, for local power control components at the individual physical servers 15.
In the second process, a new VM sizing technology referred to herein as effective sizing, is applied in the VM placement decision, which improves the performance of both server consolidation and server DVFS control.
In block 102, the VM placement decision component 12 performs the server consolidation process. The VM placement decision component 12 uses VM workload, server inventory, and existing VM hosting information, and generates a new VM placement decision by considering migration cost and server utilization targets. The VM placement decision component 12 then notifies: 1) available VM management tools for VM-related execution, such as VM migration (represented by VMa-VMe in
In block 103, the local power control components 13 at the servers 15, take orders on local power management and execute these orders to meet requirements. Specifically, the local power control components 13 at the servers 15 manage the power consumption of many components of the servers 15 including without limitation the CPUs, memories, disks, and network cards. Each local power control component 13 dynamically adjusts a local resource device (e.g., the CPU's power setting through the DVFS interface) so that the resource utilization (e.g., CPU utilization) can be, maintained around a fixed point (e.g., 90%) if energy can be saved with a lower device power setting. Many approaches, such as feedback-loop control based, can be used for implementing one or more of the local power control components 13, and the COEM system 10 has no specification for that implementation. In the COEM system 10, the local power control components 13 are coordinated with a VM consolidation algorithm performed by the VM placement decision component 12 in two ways: at run-time, they are coordinated through the server utilization target setting; and 2) in performance optimization, they both benefit from load variance reduction through the introduction of effective VM sizing, as described further on.
Referring again to
In block 202, the VM placement decision component 12 executes a VM placement algorithm, which determines the order and the rules used for placing the VMs in the physical servers 15 and is based on the effective sizing of the VMs.
and Nij is derived by solving the following equation
where Uk are independent and identically distributed (i.i.d.) random variables with the same distribution as Xi and Cj is the server utilization target of server j. Intuitively, Nij is the number of VMs that can be packed into server j without breaking a probabilistic Service Level Agreement (SLA) when all the VMs have the same workload pattern as VM i. The probabilistic SLA is defined in equation (2), which describes: for a server j in the virtualized data center 14, the probability that its aggregated resource demand (e.g., CPU utilization), caused by the VMs hosted on it, exceeds a server utilization threshold Cj (i.e., the server utilization target setting) at any time will be no higher than a numerical value p, where p is a server overloading probability defined in the probabilistic SLA.
Hence, the resource demand of VM i is calculated in block 301 using three—factors: 1) its resource utilization historical information, which is modeled as a random variable Xi with its distribution information including mean and variance; 2) the server capacity Cj; and 3) a Service Level Agreement (SLA) defining server overloading probability p. The SLA is defined in equation 2 as follows: for a server j in the virtualized data center, the probability that its resource utilization (e.g., CPU utilization), caused by the VMs hosted on it, is greater than a threshold C, at any time will be no higher than a numerical value p.
Equation (2) is solved to find Nij, the number of VMs that can be packed into server j without breaking the probabilistic SLA when all the VMs have the same workload distribution as VM i but are probabilistically independent from each other. Equation (1) is solved based on Nij and on the server capacity Cj, and provides the value for intrinsic demand ES.
In block 302, the correlation-aware demand ES can be represented by the following equation
ES
CA
=Z
(√{square root over (σ2+σ2+)}−√{square root over (σ2+σ2)}) (3)
where σi2 and σ2 are the variances of the random variables Xi and Xj; ρxy is the correlation between Xi and Xj; and Zα denotes the α-percentile for the unit normal distribution (α=1−ρ). For example, if we want the server overflowing probability ρ=0.25%, then α=99.75% and Zα=3.
Therefore, in block 302, the extra resource demand of VM i is calculated, which calculation considers workload correlation between VM i and the other VMs hosted on server j. The extra resource demand is based on three factors: 1) VM i's resource utilization historical information, which is modeled as a random variable Xi with its distribution information including mean and variance; 2) the resource utilization historical information of server j derived from the aggregated workload of all the VMs currently allocated to server j, which is modeled as a random variable Xj with its distribution information including the load variance; and 3) the SLA defining server overloading probability p, also defined for block 301 . Equation (3) is solved by finding the difference between Zα√{square root over ()}, the expected load variation of the aggregated workload on server j considering the correlation between X and Xj, and Zα√{square root over (σi2+σj2)}, the expected load variation of the aggregated workload on server j assuming no correlation between Xi and Xj. The result of equation (3) is referred to herein as the correlation-aware demand ES.
In block 303, the effective size ESij of VM i if hosted on server j is determined by combining the intrinsic demand and the correlation-aware demand:
ES
if
=ES
+ES
In practice, equation (2) may be difficult to solve in block 401 without a simple load model, where the term “load” refers to a VM's resource utilization (e.g., CPU utilization). In one embodiment, equation (2) can be solved by using a normal distribution model for approximation and by calculating two numbers (μi, σi) for each VM i based on its load (i.e., resource utilization history information), where μi is the average load, and a, is the load standard deviation. The closed-form solution for equation (2) under s normal distribution is
If the cluster of servers are heterogeneous thereby preventing a VM's effective size from be calculated before block 403, then blocks 401 and 402 can be modified to base the placement order on the value of μi+Zασi, and the actual effective size can be used in the target server in block 403. To incorporate the server power efficiency factor into this process, the servers 15 in the server list can be sorted in decreasing order by the power efficiency metrics in block 403. Other resource constraints, such as server memory, can be considered in this process by adding them in the block 403 process when judging a server's availability.
If the cluster of servers are heterogeneous thereby preventing a VM's effective size from be calculated before block 503, then blocks 501 and 502 can be modified to base the placement order on the value of μi+Zασi, which is independent of the servers; and the actual effective size in the target server will be calculated for the intrinsic load using equations (1) and (2), and correlation-aware demand using equation (3) in block 503. To incorporate the server power efficiency factor into this process, the servers 15 in the server list can be sorted in decreasing order by the power efficiency metrics in block 503. Other resource constraints, such as server memory, can be considered in this process by adding them in the block 503 process when judging a server's availability.
In block 603, the remaining underloaded servers are sorted in increasing order by server load. In block 604, for each underloaded server j in the order determined in block 603, each VM i on the server is placed on or migrated to the best non-empty and underloaded server k in the physical server list, which has sufficient remaining capacity and yields a minimal correlation-aware demand ES using equation (3). If no such server is available, the searching process is terminated for this server and the process moves on to the next underloaded server. When all the VMs in this server can find a target underloaded server to migrate to, this server is labeled as an empty server and all the VMs are migrated out of this server. The empty server is then shutdown to save energy. Note that VMs could end up with different servers, as block 604 is executed sequentially on VMs one after another.
To have the server power efficiency factor considered in this process, the servers can be subsequently sorted in decreasing order by the power efficiency metrics in the processes of blocks 602 and 603, i.e., more power efficient servers are higher in the order; and servers with the same power efficiency will be sorted in increasing order by the load. Other resource constraints, such as memory, can be considered in this process by adding them in processes of blocks 602 and 603 when judging a server's availability.
The methods of the present disclosure may be performed by an appropriately programmed computer apparatus, the configuration of which is well known in the art. An appropriate computer apparatus may be implemented, for example, using well known computer processors, memory units, storage devices, computer software, and other modules. A block diagram of a non-limiting embodiment of the computer apparatus is shown in
One skilled in the art will recognize that an actual implementation of a computer apparatus executing computer program instructions corresponding to the methods of the present disclosure, can also include other elements as well, and that
While exemplary drawings and specific embodiments have been described and illustrated herein, it is to be understood that that the scope of the present disclosure is not to be limited to the particular embodiments discussed. Thus, the embodiments shall be regarded as illustrative rather than restrictive, and it should be understood that variations may be made in those embodiments by persons skilled in the art without departing from the scope of the present invention as set forth in the claims that follow and their structural and functional equivalents.
This application claims the benefit of U.S. Provisional Application No. 61/294,589, filed Jan. 13, 2010, the entire disclosure of which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
61294589 | Jan 2010 | US |