The present invention relates generally to the management of the operation of one or more computing systems. More specifically, the present invention relates to the determination of performance and power characteristics of one or more computing systems.
It is often desirable to determine specific data relating to the performance capabilities and power consumption characteristics of a computing system. Such data may be used to maintain an amount of power consumed by the computing system at an acceptable power level, or to maintain the performance of the computing system at an acceptable performance level. However, it is often difficult to simultaneously maintain both the power consumption and the performance of the computing system at acceptable levels. In many cases, this may be accomplished by throttling, e.g., lowering the frequency, of the computing system such that the power consumption stays below the acceptable power level. However, the amount of throttling for reaching a particular power level may vary by 5-10% due to variations in the power consumption characteristics of system components. Thus, appropriate determination of, and control over, the power consumption and performance of the computing system is often difficult to achieve.
The same numbers are used throughout the disclosure and the figures to reference like components and features. Numbers in the 100 series refer to features originally found in
As discussed above, embodiments described herein relate generally to the management of the operation of a computing system. More specifically, embodiments described herein relate to the determination of performance and power characteristics of a computing system. Such performance and power characteristics may be used, for example, to determine appropriate system operating configurations such that the power consumption and performance of the computing system are maintained at appropriate levels.
As used herein, the term “performance” refers to a response of a computing system during and after execution of a particular workload. In various embodiments, a definition of performance for a particular computing system may be defined by a user of the computing system via a user interface. The performance may be defined based on any of a number of different parameters, such as, for example, time of completion, frequency, response time, or frame rate. In various embodiments, the performance is defined based on architectural features of the computing system.
A computing system may operate in any number of system operational states. As used herein, the term “system operational state” refers to a state in which a computing system can run a specific workload by varying configurable parameters. According to embodiments described herein, a computing system may be operated in each system operational state for each of a number of workloads. Power consumption and performance data for each workload may be used to determine a number of system operational performance-power (SOPP) states for each workload. SOPP states may be generally referred to herein as “performance-power states.”
Performance-power states are operational states in which a performance of the computing system is within a predefined performance range and a power consumption of the computing system is within a predefined power range. In addition, each performance-power state is separated from a previous or next performance-power state by a predefined performance and a predefined power consumption, as shown below in Eqs. 1 and 2.
PerformanceSOPP state
PowerSOPP state
In the following description and claims, the terms “coupled” and “connected,” along with their derivatives, may be used. It should be understood that these terms are not intended as synonyms for each other. Rather, in particular embodiments, “connected” may be used to indicate that two or more elements are in direct physical or electrical contact with each other. “Coupled” may mean that two or more elements are in direct physical or electrical contact. However, “coupled” may also mean that two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other.
Some embodiments may be implemented in one or a combination of hardware, firmware, and software. Some embodiments may also be implemented as instructions stored on a machine-readable medium, which may be read and executed by a computing platform to perform the operations described herein. A machine-readable medium may include any mechanism for storing or transmitting information in a form readable by a machine, e.g., a computer. For example, a machine-readable medium may include read only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; or electrical, optical, acoustical or other form of propagated signals, e.g., carrier waves, infrared signals, digital signals, or the interfaces that transmit and/or receive signals, among others.
An embodiment is an implementation or example. Reference in the specification to “an embodiment,” “one embodiment,” “some embodiments,” “various embodiments,” or “other embodiments” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least some embodiments, but not necessarily all embodiments, of the inventions. The various appearances “an embodiment,” “one embodiment,” or “some embodiments” are not necessarily all referring to the same embodiments.
Not all components, features, structures, characteristics, etc. described and illustrated herein need be included in a particular embodiment or embodiments. If the specification states a component, feature, structure, or characteristic “may”, “might”, “can” or “could” be included, for example, that particular component, feature, structure, or characteristic is not required to be included. If the specification or claim refers to “a” or “an” element, that does not mean there is only one of the element. If the specification or claims refer to “an additional” element, that does not preclude there being more than one of the additional element.
It is to be noted that, although some embodiments have been described in reference to particular implementations, other implementations are possible according to some embodiments. Additionally, the arrangement and/or order of circuit elements or other features illustrated in the drawings and/or described herein need not be arranged in the particular way illustrated and described. Many other arrangements are possible according to some embodiments.
In each system shown in a figure, the elements in some cases may each have a same reference number or a different reference number to suggest that the elements represented could be different and/or similar. However, an element may be flexible enough to have different implementations and work with some or all of the systems shown or described herein. The various elements shown in the figures may be the same or different. Which one is referred to as a first element and which is called a second element is arbitrary.
The processor 102 may be connected through a bus 106 to an input/output (I/O) device interface 108 adapted to connect the computing system 100 to one or more I/O devices 110. The I/O devices 110 may include, for example, a keyboard and a pointing device, wherein the pointing device may include a touchpad or a touchscreen, among others. The I/O devices 110 may be built-in components of the computing system 100, or may be devices that are externally connected to the computing system 100.
The processor 102 may also be linked through the bus 106 to a display interface 112 adapted to connect the computing system 100 to a display device 114. The display device 114 may include a display screen that is a built-in component of the computing system 100. The display device 114 may also include a computer monitor, television, or projector, among others, that is externally connected to the computing system 100.
A network interface controller (NIC) 116 may be adapted to connect the computing system 100 through the bus 106 to a network 118. The network 118 may be a wide area network (WAN), local area network (LAN), or the Internet, among others. Through the network 118, the computing system 100 may access electronic text and imaging documents 120. The computing system 100 may also download the electronic text and imaging documents 120 and store the electronic text and imaging documents 120 within a storage device 122 of the computing system 100.
Through the network 118, the computing system 100 may be communicatively coupled to a number of remote computing systems 124. In some embodiments, power and performance data 126 may be downloaded from the remote computing systems 124, and may be stored within the storage device 122 of the computing system 100. In addition, any number of workloads 128, or data relating to the workloads 128, may be downloaded from the remote computing systems 124, and may be stored within the storage device 122 of the computing system 100. Further, power and performance data 126 and any number of workloads 128 relating to the computing system 100 may be stored within the storage device 122 of the computing system 100.
The storage device 122 can include a hard drive, an optical drive, a thumbdrive, an array of drives, or any combinations thereof. The storage device 122 may include a system manager 130 that is configured to perform the techniques for managing the operation of computing systems that are described herein. In various embodiments, the system manager 130 may be used to generate a number of performance-power states 132 relating to any of the remote computing systems 124, as well as the computing system 100. The performance-power states 132 may also be stored within the storage device 122 of the computing system 100.
It is to be understood that the block diagram of
The computing environment 200 may also include a number of computing systems 204, e.g., computing system A 204A, computing system B 204B, and computing system C 204C, among others. The computing systems 204 may be communicatively coupled to the system management agent 202 via a network 206. The network 206 may be, for example, a WAN, LAN, or the Internet.
The computing systems 204 may be any types of computing devices, such as servers, laptop computers, desktop computers, tablet computers, or mobile devices, among others. In some embodiments, the computing systems 204 may be included within a distributed computing environment. The computing systems 204 may be homogeneous, meaning that all of the computing systems 204 include identical hardware and software configurations, or heterogeneous, meaning that any number of the computing systems 204 include disparate hardware or software configurations.
The system management agent 202 may be configured to perform a process for determining performance-power states for each of a number of workloads executed by any number of the computing systems 204. In some embodiments, the system management agent 202 may perform such a process in response to input from a user via a user interface, such as a graphical user interface (GUI). For example, the system management agent 202 may be controlled by input from a datacenter administrator or IT personnel, among others.
As discussed above, the performance of a computing system refers to the system response during and after execution of a particular workload. In various embodiments, the performance of a computing system is quantified according to a specific performance metric. The performance metric may be defined according to input from the user of the system management agent 202. The performance metric may be based on specific parameters. Such parameters may include, for example, a completion time for a particular workload, a responsiveness of the system, a frame rate, a number of transactions completed, a rate of completion, or a rate of communication, e.g., bandwidth, among others. In addition, the performance metric may be based on multiple individual parameters, or may be based on a statistical operation, e.g., a mean, of multiple parameters.
The system management agent 202 may create a number of possible system operational states for one of the computing systems 204 based on the architecture of the computing system 204. For example, any of a number of architectural features of the computing system 204 may be used to generate the system operational states. The system operational states may then be used to determine performance-power states for specific workloads that are executed by the computing system 204.
In various embodiments, architectural features that are used to generate the system operational states may include core frequencies, processor frequencies, non-core frequencies, or memory frequencies, among others. The term “frequency” refers to the number of occurrences of a repeating event per unit time. For example, the frequency, or dock rate, of a central processing unit (CPU) of the computing system 204 may be used to characterize the rate at which the CPU is running. As used herein, the term frequency may refer to a standard frequency, or a combination of voltage and frequency. In some embodiments, core frequencies include frequencies relating to a CPU of the computing system 204. Non-core frequencies, on the other hand, may include frequencies relating to last level caches, interconnects between various processor blocks, or various internal agent or integrated I/O controllers, among others.
Any number of architectural features may be enabled or disabled to aid in the determination of the system operational states. This may be particularly applicable to architectural features that are often used in parallel with one another, such as, for example, multiple execution units, multiple retirement units, queues, or multiple graphics engines, among others. In addition, parameters of certain operations may be varied to aid in the determination of the system operational states. Such parameters may include, for example, a display refresh rate or an image quality, among others. Further, certain architectural features may be “de-featured” to aid in the determination of the system operational states. This may include, for example, executing less instructions at a time, operating a vector processing unit at a lower capacity, or operating I/O communications links at a lower bandwidth.
In some embodiments, each of the computing systems 204 includes a corresponding local system management agent (not shown). In such embodiments, each local system management agent may be configured to perform the process for determining performance-power states for the corresponding computing system 204. The system management agent 202 may act as the central system management agent. The central system management agent 202 may be configured to compile data from the local system management agent on each computing system 204, and to generate performance and power information based on the performance-power states. For example, the central system management agent 202 may use the data from the local system management agents to generate performance-power state tables relating to the performance-power states for each type of workload on each type of computing system 204.
It is to be understood that the block diagram of
In various embodiments, the computing system that is used according to the method 300 is the computing system 100 discussed above with respect to
In various embodiments, the method 300 may be initiated during the initial boot of the computing system using pre-loaded workloads, or may be initiated by an operating system or virtualized operating system on demand. In addition, the method 300 may be initiated in response to a request by the user of the computing system, or a request by users of other computing systems. Further, the method 300 may be performed while the computing system is being built and configured.
The method begins at block 302, at which a number of workloads on the computing system are determined. In addition, a number of parameters corresponding to each workload may be determined. The parameters corresponding to each workload may include a definition of performance for each workload. The definition of performance may be based on specific architectural features of the computing system, as discussed above with respect to
At block 304, a number of performance-power states for each workload, as well as a corresponding performance range and power consumption range for each performance-power state, are determined. Each performance-power state represents a state in which the computing system is capable of running a specific workload within the corresponding performance range and power consumption range. The determination of the performance-power states may be accomplished by operating the computing system in each system operational state for each workload. Then, data relating to each system operational state for each workload may be processed. In various embodiments, the data may include information relating to a performance for each workload in each system operational state, and information relating to a power consumption of the computing system for each workload in each system operational state. The performance-power states may be determined from the data based on the system operational states, as well as the acceptable power consumption range and acceptable performance range for each workload. In various embodiments, each performance-power state is separated from a previous performance-power state and a next performance-power state by a minimum performance and a minimum power consumption.
At block 306, the performance and power consumption of the computing system are managed based on the performance-power states. This may include maintaining the power consumption below a predefined power limit and maintaining the performance above a predefined performance limit. In other embodiments, this includes maintaining the power consumption below a predefined power limit and maintaining the performance at a predetermined performance level. For example, the computing system may be maintained at the predetermined performance level to ensure that the operation of the computing system is compatible with the operation of any number of other computing systems that are communicatively coupled to the computing system.
In some embodiments, the performance-power states are used to determine operating configurations for the computing system such that a balance between performance and energy usage is achieved. The performance-power states may establish a metric of energy cost of performance, which may be used to provide a user of the computing system with a choice between efficient battery usage and high performance. For example, a user may use the performance-power states to maintain a highest performance of the computing system at a given rate of power usage. Further, in some embodiments, the performance-power states are used to maintain the power consumption of the computing system below a specified power consumption limit. This may allow for the maintenance of availability of the computing system during “brown out” conditions.
It is to be understood that the process flow diagram of
In various embodiments, the method 300 may be implemented for multiple computing systems. In such embodiments, the method 300 includes determining a number of workloads corresponding to the computing systems and parameters corresponding to each workload. The method 300 also includes determining a number of performance-power states for each workload and a corresponding performance range and power consumption range for each performance-power state. This may be accomplished by operating each computing system in each system operational state for each workload and processing data relating to each system operational state for each workload. The method 300 further includes managing performance and power consumption of the computing systems based on the performance-power states. This may include, for example, maintaining an overall power consumption of the computing systems below a power limit while maintaining the performance of the computing systems at a uniform performance level. Further, according to such embodiments, the method 300 may be implemented by a datacenter manager that is located outside the computing systems and is communicatively coupled to the computing systems.
The various software components discussed herein may be stored on the tangible, non-transitory computer-readable medium 400, as indicated in
In addition, a performance-power state generation module 410 may be configured to generate a number of performance-power states for each workload based on the power consumption and performance data for each system operational state. A system management module 412 may be configured to control the operation, e.g., the power consumption and performance, of the one or more computing systems based on the performance-power states.
It is to be understood that the block diagram of
A method for managing the operation of a computing system is described herein. The method includes determining a number of workloads on the computing system. The method also includes determining a number of performance-power states for each workload and a corresponding performance range and power consumption range for each performance-power state. The method further includes managing performance and power consumption of the computing system based on the performance-power states.
Parameters corresponding to each workload may be determined by specifying a definition of performance for each workload. The definition of performance may be based on architectural features of the computing system. The definition of performance for each workload may be received from a user via a user interface.
The number of performance-power states for each workload may be determined by running each workload in each system operational state, measuring a performance of the computing system for each workload in each system operational state, and measuring a power consumption of the computing system for each workload in each system operational state. Further, the operation of the computing system may be managed in response to input from a system management agent. Managing the operation of the computing system may include maintaining the power consumption below a predefined power limit and maintaining the performance at a predefined performance level.
Each performance-power state may be separated from a previous performance-power state and a next performance-power state by a minimum power consumption and a minimum performance.
A number of workloads corresponding to a number of computing systems and parameters corresponding to each workload may be determined. A number of performance-power states for each workload and a corresponding performance range and power range for each performance-power state may be determined. The performance and power consumption of the computing systems may be managed based on the performance-power states. This may be performed in response to input from a datacenter manager located outside the computing systems. Managing the performance and the power consumption of the computing systems may include maintaining an overall power consumption of the computing systems below a power limit while maintaining the performance of the computing systems at a uniform performance level.
A computing system including a processor that is adapted to execute stored instructions and a storage device that stores instructions is described herein. The storage device includes processor executable code that, when executed by the processor, is adapted to determine a number of workloads of the computing system. The processor executable code is also adapted to determine a number of performance-power states for each workload, wherein determining a performance-power state includes defining a performance level range and a power level range for the performance-power state. The processor executable code is further adapted to manage performance and power consumption of the computing system based on the performance-power states.
A system operational state is a state in which the computing system is capable of running a specific workload by varying configurable parameters. A performance-power state is a state in which the computing system is capable of running a specific workload within the performance level range and the power level range.
The processor executable code may be executed by the processor during an initial system boot. The processor executable code may be executed by the processor by an operating system on demand. The processor executable code may also be executed by the processor as a response to a request by a user of another computing system.
Each performance-power state may be separated from a previous performance-power state and a next performance-power state by a minimum power consumption and a minimum performance.
At least one non-transitory machine readable medium having instructions stored therein is described herein. In response to being executed on a computing system, the instructions cause the computing system to determine a number of workloads of the computing system. The instructions also cause the computing system to determine a number of performance-power states for each workload and a corresponding performance range and power range for each performance-power state. In addition, the instructions cause the computing system to control performance and power consumption of the computing system based on the performance-power states.
A system operational state is a state in which the computing system is capable of running a specific workload by varying configurable parameters. A performance-power state is a state in which the computing system is capable of running a specific workload within a specified performance level range and a specified power level range.
Parameters corresponding to each workload may be determined by specifying a definition of performance for each workload based on any of a number of architectural features of the computing system. Controlling the performance and the power consumption of the computing system may include maintaining a highest performance of the computing system at a given rate of power usage.
It is to be understood that specifics in the aforementioned examples may be used anywhere in one or more embodiments. For instance, all optional features of the computing system described above may also be implemented with respect to either of the methods or the computer-readable medium described herein. Furthermore, although flow diagrams and/or state diagrams may have been used herein to describe embodiments, the inventions are not limited to those diagrams or to corresponding descriptions herein. For example, flow need not move through each illustrated box or state or in exactly the same order as illustrated and described herein
The inventions are not restricted to the particular details listed herein. Indeed, those skilled in the art having the benefit of this disclosure will appreciate that many other variations from the foregoing description and drawings may be made within the scope of the present inventions. Accordingly, it is the following dams including any amendments thereto that define the scope of the inventions.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2012/037812 | 5/14/2012 | WO | 00 | 5/30/2014 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2013/172816 | 11/21/2013 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
7861068 | Gorbatov et al. | Dec 2010 | B2 |
8060762 | Banginwar et al. | Nov 2011 | B2 |
20070011480 | Banginwar | Jan 2007 | A1 |
20080235364 | Gorbatov | Sep 2008 | A1 |
20090138219 | Bletsch | May 2009 | A1 |
20090254660 | Hanson | Oct 2009 | A1 |
20100262975 | Reysa | Oct 2010 | A1 |
20100281285 | Blanding | Nov 2010 | A1 |
20100299675 | Yuyitung | Nov 2010 | A1 |
20110022868 | Harchol-Balter | Jan 2011 | A1 |
20130111494 | Hyser | May 2013 | A1 |
Number | Date | Country |
---|---|---|
H08503566 | Apr 1996 | JP |
20110038648 | Apr 2011 | KR |
9411801 | May 1994 | WO |
2009156447 | Dec 2009 | WO |
2013172816 | Nov 2013 | WO |
Entry |
---|
International Search Report and Written Opinion received for PCT Patent Application No. PCT/US2012/037812, mailed on Jan. 23, 2013, 10 pages. |
Number | Date | Country | |
---|---|---|---|
20140281647 A1 | Sep 2014 | US |