This disclosure generally pertains to the field of electricity supply for digital electronic computers. More specifically, the focus is on matching the electrical power consumed by a digital electronic computer with the electrical power available to the computer from an external power source.
One important application of the present disclosure is to the management of server loads in data centres.
The article “Towards green data center microgrids by leveraging data center loads in providing frequency regulation” by W. Qi et al., published in 2019 in Scientia Iranica D, 26(6), pp. 3559-3570, proposes a load control strategy for data centre microgrids. In this control strategy, flexibilities of the data centre's computing servers in processing delay-tolerant tasks are leveraged to control the data centre's power consumption and thus regulate the AC power frequency of the data centre microgrid.
The control strategy is explained in section 3 of the article, starting on page 3564, and illustrated by the article's
The sole aim of this known data centre power management scheme is to keep the AC power supply frequency within a corridor around the normal operating frequency (e.g., 60 Hz). This is to ensure the stability and security of the whole microgrid.
A drawback of this scheme is its considerable negative impact on the performance of the data centre's computing servers. Indeed, each time there is a power shortage, whole server clusters are shut down.
In view of the above, it is an object of the present disclosure to provide an improved method of adapting the power consumption of computers, and in particular of servers, in electrical grids to the varying power supply levels in the electrical grid.
According to the present disclosure, this object is achieved with a method of adapting the electrical power consumption of a digital electronic computer, which is supplied with electrical power from a power bus of an external electrical grid, to the instantaneous electrical power available from the power bus, wherein the digital electronic computer has a processing unit, the processing unit being configured for executing computing tasks, and wherein the power bus is either an AC bus providing AC electrical power at an electrical frequency F, or a DC bus providing DC electrical power at an electrical voltage U, the method comprising the following steps: a. measuring the instantaneous electrical frequency F on the power bus in case of an AC power bus, or the instantaneous electrical voltage U on the power bus in case of a DC power bus, thus obtaining a power supply measurement indicative of the instantaneous level of power supply from the power bus, and b. adapting the processing unit's computing task execution based on the obtained power supply measurement to thus maintain, increase, or reduce the computer's electrical power consumption.
In other words, the way the computer's processing unit executes tasks is made dependent on the current power state of the external electrical grid. Depending on the power available from the external electrical grid, the processing unit will perform more or less tasks, execute tasks more slowly, or execute tasks more quickly. Accordingly, the IT load on the electrical grid can be adapted much more precisely and dynamically. In particular, the IT load can be modified in small increments by increasing or reducing the task execution of the processing unit.
In contrast to the control strategy of the above-mentioned article, the method of the present disclosure follows a bottom-up approach. In the case of a data centre grid, according to the present disclosure, each computing server does its own individual and independent load management. In other words, the load controlling task is distributed over all the different computing servers.
According to the present disclosure, it is the individual computer that monitors the health of the external electrical grid and that adapts its consumption accordingly. If there is plenty of power available from the electrical grid, the computer will run its processing unit at maximum capacity. On the other hand, if there is a power shortage on the electrical grid, the computer will throttle its processing unit down.
The following features can be optionally implemented, separately or in combination one with the others:
The present disclosure also relates to a method of managing electrical power consumption in a data centre, the data centre including a plurality of computing servers and a power bus supplying electrical power to the computing servers, wherein the electrical power consumption of each computing server is adapted in accordance with the method as detailed above.
Optionally, in this power consumption managing method, step a is implemented as a central measurement operation, and the obtained power supply measurement is transmitted to each computing server to be used as a basis for the adaptation of the computing server's electrical power consumption.
Preferred embodiments of the present disclosure will now be described in detail with reference to the appended drawings, wherein:
Typically, datacentre 100 is an IT installation, which provides IT services such as data storage, data processing and computing to paying customers.
Datacentre 100 includes a plurality of computing servers 102 and a power bus 104, which supplies electrical power to the computing servers 102. The computing servers 102 handle the different IT services.
Only two servers, labelled “Server 1” and “Server i”, are shown in
The servers 102 are all digital electronic computers. As such, each server 102 comprises a processing unit 106.
Typically, the processing unit 106 is the server's central processing unit or CPU. However, it may also be a graphical processing unit (GPU), or another type of processing unit.
The processing unit 106 executes various computing tasks under the control of a task manager 108, which is also part of the server 102.
The task manager 108 manages the computing tasks executed by the processing unit 106. In particular, the task manager 108 may allocate new tasks to the processing unit 106, may end the execution of tasks by the processing unit 106, or may change the allocation of the processing unit's resources to the various tasks. By doing so, task manager 108 may speed up or slow down the execution of specific tasks. The task manager 108 may be implemented in the server 102 as a hardware and/or software module.
In the present example, each server 102 also comprises a DC power supply unit or PSU 110. The PSU 110 is a DC-to-DC converter, which converts the higher electrical voltage of the DC bus 104 to the lower electrical voltage needed by the server 102. As shown in
An input side 112 of the PSU 110 is electrically connected to the DC power bus 104.
A voltage sensor 114 is arranged at the input side 112 of the PSU 110. Here, the voltage sensor 114 is part of the server 102. However, alternatively, both the PSU 110 and the voltage sensor 114 may be located outside the server 102. In this case, the PSU 110 may be implemented as a separate device between the DC power bus 104 and the server 102.
Each server 102 also includes a monitoring module 116. The monitoring module 116 may be implemented as hardware, for example, in the form of a logic circuit, or as a software module.
The monitoring module 116 receives as an input a voltage measurement Ubus, which is output by the voltage sensor 114.
The output of the monitoring module 116 is a set of load factors LFj, which are sent to the task manager 108. The role of the monitoring module 116 is to constantly monitor the evolution of the voltage Ubus on the DC bus 104 and to tell the task manager 108 via the load factors LFj when and how to throttle down the various tasks carried out by the CPU 106.
In the present example, the electrical power available from the power bus 104 is provided by three different power sources 10, 12 and 14. Power source 10 corresponds to the public AC electricity network. Power source 12 is an electrical battery. Power source 14 is a renewable energy source such as a solar panel array.
As apparent from
As seen from the servers 102, the DC power bus 104 and the power sources 10, 12 and 14 together form an external electrical grid, and the servers 102 are connected thereto.
The electrical systems shown in
With reference to
In the exemplary method illustrated by
The amount of power available from the DC power bus 104 correlates with the electrical voltage Ubus on the power bus. The higher the voltage Ubus, the higher the available power. Conversely, if the bus voltage Ubus is low, then there is not much power available from the DC power bus 104.
A droop control ensures that the voltage Ubus on the DC power bus 104 remains between an upper limit U maximum (Umax) and a lower limit U emergency (Uemer). This voltage corridor is illustrated by the voltage scale at the bottom of
Inside the voltage corridor delimited by Umax and Uemer there are two intermediate voltages U balance (Ubal) and U minimum (Umin). The voltage Ubal is the optimum voltage for the DC power bus 104. At this voltage, there is a balance between the power generated by the power sources 10, 12 and 14 and the power consumed by the data centre 100. The minimum voltage Umin is a lower limit above the critical lower limit Uemer. When the bus voltage Ubus falls below the minimum voltage Umin but is still above the critical voltage Uemer there is a power shortage on the DC power bus 104. This should be avoided, but the data centre 100 can still operate in limited power mode. If the bus voltage Ubus falls below the critical limit Uemer the power shortage becomes a power outage and installations in the data centre 100 will start shutting down uncontrollably. Accordingly, the electrical system shown in
In the embodiment of
In other words, the voltage sensors 114 measure the instantaneous electrical voltage Ubus on the power bus 104 and thus obtain a power supply measurement indicative of the instantaneous level of power supply from the power bus 104. Subsequently, the computing task execution of each processing unit 106 is adapted based on the obtained power supply measurement Ubus to thus maintain, increase, or reduce the servers' electrical power consumption.
More precisely, in each task manager 108, the tasks executed by the associated CPU 106 are ranked according to priority. With reference to
We will now first focus on the task throttling strategy implemented in Server 1. This strategy is illustrated by
The uppermost graph in
The throttling down of the CPU between Umin and Uemer is a combined result of the throttling down of the computing task T1, the backup task T2 and the other tasks T3.
The task throttling is implemented as follows: two voltage thresholds Uthj and Uendj are associated with each task or group of tasks j with the same priority. The first voltage threshold Uthj, called a throttling threshold, is the voltage below which the task or group of tasks j is throttled down. The second lower voltage threshold Uendj, called an ending threshold, is the voltage at which the task or group of tasks j is ended. If the power supply measurement Ubus falls below the throttling threshold Uthj less of the processing unit's resources are allocated to the associated task or group of tasks j to reduce the processing unit's power consumption. Preferably, as can be seen in
If the power supply measurement Ubus falls to the lower ending threshold Uendj the associated task or group of tasks is ended.
For each task or group of tasks j the two thresholds Uthj and Uendj are provided as input parameters to the monitoring module 116 (cf.
For each task or group of tasks j, the corresponding load factor LFj may be calculated with the following two equations:
LF
j=1, if Ubus≤Uthj (1)
LF
j=1−(Uthj−Ubus)/(Uthj−Uendj), if Ubus<Uthj (2)
According to the above equations, the load factor is equal to 1 if the voltage on the DC power bus 104 is greater than or equal to the throttling voltage Uthj. This means that, as long as the bus voltage Ubus is above the throttling voltage of the corresponding task, the task runs at hundred percent.
However, if the voltage Ubus on the DC power bus 104 falls below the throttling voltage Uthj, then the load factor LFj falls below one. According to equation (2) above, the load factor LFj decreases linearly from 1 to 0 as the voltage Ubus on the DC power bus 104 drops from the throttling voltage Uthj to the ending voltage Uendj. The linear rate of decrease of the load factor LFj depends on the difference between the throttling voltage Uthj and the ending voltage Uendj. The closer the throttling voltage Uthj to the ending voltage Uendj, the higher the rate of decrease of the load factor LFj. The value of the throttling voltage Uthj, of the ending voltage Uendj and thus the difference between the two will be chosen in accordance with the priority of the corresponding task or group of tasks j. This is illustrated by
Task T2 with lower priority has the same ending voltage as task T1. However, the difference between the throttling voltage Uth2 and the ending voltage Uend2 is larger so that task T2 will be throttled down earlier than task T1. The other tasks T3 with the lowest priority have an ending voltage Uend3, which is higher than the ending voltage of tasks T1 and T2. Furthermore, the throttling voltage Uth3 of the other tasks T3 is the highest and set at the minimum voltage Umin.
In the just described strategy to control the power consumption of the servers 102 the calculation of the load factors LFj is directly based on the voltage measurements Ubus provided by the voltage sensors 114. However, alternatively, as described in detail below, instead of directly using the voltage measurement for the load factor calculation, one may also first derive a power supply level indicator SOG from the power supply measurement Ubus and then calculate the load factors using the derived power supply level indicator SOG.
As used here, the indicator SOG indicates the instantaneous level of the power supply from the power bus 104. The indicator SOG is a proxy for the state of the external electrical grid, hence the acronym SOG, which stands for State Of the Grid.
The following equation is an example of how the indicator SOG may be calculated:
SOG=(Ubus−Umin)/(Umax−Umin) (3)
According to equation (3), SOG is a linear function of the bus voltage Ubus. Also according to equation (3), the indicator SOG will be a number between 0 and 1 since the bus voltage Ubus can only vary between the upper limit Umax and the lower limit Umin.
In equation (3) above, Umin may be replaced by Uemer in a setup where the bus voltage Ubus may be allowed to drop as low as the emergency voltage Uemer.
Using the above defined SOG indicator has the advantage that the amount of power currently available from the power bus 104 can be directly ascertained from the indicator. Indeed, if the SOG indicator is close to 0 then there is very little power available. If the SOG indicator is close to 1 then there is plenty of power available.
Estimating the available power from the voltage measurement Ubus is less straightforward since it requires knowledge about the operating parameters of the DC bus 104, namely Umax and Umin.
When using the power supply level indicator SOG, the two thresholds for each task or group of tasks j are indicator thresholds and not voltage thresholds. Each task or group of tasks j then has a throttling threshold SOGthj and an ending threshold SOGendj.
Equations (1) and (2) above then become:
LF
j=1, if SOG≥SOGthj (4)
LF
j=1−(SOGthj−SOG)/(SOGthj−SOGendj), if SOG<SOGthj (5)
Task throttling based on the indicator SOG is illustrated by the graphs of
The task throttling strategy shown in
The task throttling strategy of
With reference to
In the data centre 200, the voltage sensors 214, which measure the voltage on the DC bus 204, are in the protection devices 218. Accordingly, in the data centre 200, the bus voltage Ubus is measured centrally at each protection device 218, and the obtained power supply measurements Ubus are transmitted to each computing server 202 to be used as a basis for the adaptation of the computing servers' electrical power consumption.
In the data centre 200, since the voltage measurements are transmitted from a central location 218 to the individual servers 202, it is possible to make the power management even more versatile. For example, it is possible to transmit a virtual voltage value to one of the servers 202 instead of the real bus voltage Ubus. This virtual voltage value may, for example, be equal to Umax and may be transmitted to a high priority server. Accordingly, the server will keep on running at full load, even if the real bus voltage Ubus is low.
In a further variant, each server 202 may have its own associated local voltage sensor (as shown in
The above description and the figures relate to data centres equipped with a DC power bus. However, the present disclosure is equally applicable to data centres with AC power buses. In this case, it is the instantaneous electrical frequency F of the AC bus that is measured instead of the voltage. The measured electrical frequency F is used as a proxy for the electrical power available from the AC power bus and the task execution of the CPUs of the servers is adapted based on the frequency measurements.
Number | Date | Country | Kind |
---|---|---|---|
22170182.4 | Apr 2022 | EP | regional |