1. Field of the Disclosure
The present disclosure relates generally to processing systems and, more particularly, to power management in processing systems.
2. Description of the Related Art
Many processing systems such as laptops, tablets, or smart phones can operate in an alternating current (AC) mode, e.g. when the processing device is plugged into a wall outlet, or a direct current (DC) mode, e.g. when the processing device is unplugged and operating on an internal battery supply. Processing systems that are operating in the AC mode are conventionally optimized to enhance performance. For example, a processing unit such as a central processing unit (CPU) or graphics processing unit (GPU) in the processing system may be boosted by increasing the operating frequency or operating voltage of the CPU or GPU. Processing systems that are operating in the DC mode are conventionally optimized to conserve power. For example boosting of the CPU or GPU may be disabled to conserve power and extend battery life because boosting the processing units may drain the battery at a much higher rate than operating the processing unit at lower operating frequencies or operating voltages.
The present disclosure may be better understood, and its numerous features and advantages made apparent to those skilled in the art by referencing the accompanying drawings. The use of the same reference symbols in different drawings indicates similar or identical items.
The operating point of a processing unit can be boosted based upon estimates of power consumption concurrent with performance of a set of one or more tasks by the processing unit. In some cases, boosting a processing unit while it is performing the task(s) may allow the processing unit to complete the set of tasks and enter the idle mode more quickly. This race-to-idle technique may result in a net power savings, especially when the duration of the set of tasks is relatively short. Battery power in a processing system that includes the processing unit may therefore be conserved while also improving the performance of a processing system by the processing unit for the set of tasks. Embodiments of the energy-aware technique described herein allow boosting short-duration workloads with embedded idle periods, while also keeping battery life within bounds. As used herein, the term “boosting” refers to increasing the operating point of a processing unit, e.g., by increasing the operating voltage or operating frequency of the processing unit.
Some embodiments of the processing system may decide to boost a processing unit by comparing the actual battery drain rate to a threshold battery drain rate that is determined by the battery capacity and a target battery life. If the actual battery drain rate is less than the threshold battery drain rate for a predetermined time interval, indicating that the rate of energy consumption (or power consumption) is relatively low and possibly indicating that the processing unit is performing short duration tasks separated by relatively long idle periods, the processing unit can be boosted to allow the processing unit to complete its tasks more quickly. If the actual battery drain rate is larger than or equal to the threshold battery drain rate, indicating that the rate of energy consumption is relatively high and possibly indicating that the processing unit is performing long duration tasks separated by relatively short idle periods, the processing system may bypass boosting of the processing unit to conserve battery power.
Some embodiments of the processing system may base the boost decision on platform-level indicators of energy consumption in the processing system. As used herein, the term “platform-level indicator” refers to a measured parameter that indicates a physical state at one or more locations on or near the platform that supports the processing system. In some embodiments, platform-level indicators may be used to determine battery drain rates in conjunction with or as alternatives to battery status indicators. For example, one platform-level indicator is the skin temperature that is measured at one or more locations proximate to the surface of the case that encloses the processing system. Higher skin temperatures may indicate that the processing unit is performing relatively longer-duration sustained tasks separated by relatively short idle periods and so boosting of the processing unit may be bypassed when the skin temperature exceeds a threshold value.
The processing system 100 includes an input/output engine 120 for handling input or output operations associated with elements of the processing system such as keyboards, mice, displays, printers, external disks, and the like. A graphics processing unit (GPU) 125 is included in the processing system 100 for creating visual images intended for output to a display. Some embodiments of the GPU 125 may include multiple processor cores that are not shown in
The processing system 100 shown in
Power may be supplied to components in the processing device 100 by a DC power supply 110 or an AC power supply 115. The DC power supply 110 may include a battery 112 (as shown in
The processing device 100 may preferentially operate in an AC mode so that the components of the processing device 100 receive power supplied by the AC power supply 115 when available, e.g., when the AC power supply 115 is receiving power from an external source such as a wall outlet. The battery 112 may also be charged by the AC power supply 115. The processing device 100 may transition to a DC mode when power is not available from the AC power supply 115. In the DC mode, the processing device 100 receives power from the DC power supply 110, e.g., from the internal battery 112 or an external battery (not shown) associated with the processing device 100. Techniques for selectively coupling the DC power supply 110 or the AC power supply 115 are known in the art. In the interest of clarity, logic and circuitry for selectively coupling the DC power supply 110 or the AC power supply 115 to the components in the processing device 100 are not shown in
Components of the processing device 100 may be enclosed in a case 140. For example, the components of the processing device 100 may be packaged in a case 140 that is formed of plastic, metal, glass, or a combination thereof. One or more sensors 136, 137, 138 (referred to hereinafter as “the sensors 136-138”) may be positioned proximate different locations on the case 140 and may generate platform-level indicators such as temperatures. Some embodiments of the sensors 136-138 may sense the temperature near the corresponding sensor 136-138 and may be positioned on the case 140 near components that are expected to dissipate a significant amount of heat during operation. For example, the sensor 136 may be positioned on the case 140 near the CPU 105, the sensor 137 may be positioned on the case 140 near the GPU 125, and the sensor 138 may be positioned on the case 140 near the DC power supply 110. The sensors 136-138 may generate signals indicative of the temperatures measured by the sensors 136-138 and these signals may be used by other entities in the processing device 100, as discussed herein.
Processing units in the processing device 100 such as the CPU 105, the GPU 125, or the processor cores 106-109 may be operated in different power management states. Examples of power management states include an active state in which a processing unit is performing tasks or executing instructions, an idle state in which the processing unit is not performing tasks or executing instructions, or a power-gated state in which power supplied to the processing device is gated so that no current is supplied to the processing unit, thereby reducing stand-by and leakage power consumption. The processing device 100 may therefore conserve power by idling one or more processing units when there are no instructions to be executed by the processing unit(s). If the processing unit is idle for a relatively long time, power supplied to the processing unit may then be gated so that no current is supplied to the component, thereby reducing stand-by and leakage power consumption.
Performance of the processing units may also be enhanced by boosting the processing units. As used herein, the term “boosting” refers to increasing the operating point of a processing unit, e.g., by increasing the operating voltage or operating frequency of the processing unit. For example, the CPU 105 may be boosted by increasing the voltage supplied to the CPU 105 by the DC power supply 110 (when operating in the DC mode) or the AC power supply 115 (when operating in the AC mode) from a nominal voltage supplied to the processing unit while it is in the active power management state to a boost voltage that is higher than the nominal voltage. Increasing the operating voltage may also increase the operating frequency of the processing unit.
The processing system 100 shown in
At times T<T1, the actual battery drain rate 205 is less than the threshold drain rate 210, which may be an indication that one or more associated processing units are performing a set of tasks that includes one or more relatively short duration tasks separated by relatively long idle intervals. Energy credits may therefore be accumulated during this time interval, resulting in an increase in the energy credits 215. At T=T1, the energy credit 215 reaches a threshold energy credit 220 that indicates that the processing unit(s) have been performing relatively short duration tasks for a sufficiently long time interval. The boost logic may therefore boost one or more of the processing units by increasing their operating voltage or operating frequency at T=T1.
The boosted processing unit(s) consume energy at a higher rate and the battery drain rate 205 may therefore increase for T>T1. Increased rate of energy consumption (or increased power consumption) may be the result of the boosted processing unit(s) performing a set of tasks that includes one or more relatively long duration tasks separated by relatively short idle intervals. Although the increased rate of energy consumption shown in
At T=T3, the energy credit 215 reaches a threshold energy credit 225 that indicates that the processing unit(s) have been performing relatively long duration tasks that have increased the battery drain rate. The boost logic may therefore de-boost (or exit the boost) one or more of the processing units by decreasing their operating voltage or operating frequency at T=T3. For example, if the boost logic previously boosted the operating frequency of a processing unit from a relatively low value to a relatively high value, the boost logic may de-boost the processing unit by returning its operating frequency to the relatively low value. Some embodiments may use a threshold energy credit 220 that is larger than the threshold energy credit 225 to provide a hysteresis. Some embodiments of the boost logic may also be able to de-boost one or more processing units based on other criteria such as a time limit for operating the processing unit(s) in the boosted mode.
The plot 301 shows the skin temperature 315, which may be determined using a single sensor or a statistical combination of temperatures measured by multiple sensors. For example, the skin temperature 315 may be equal to the temperature sensed by a single sensor deployed on a case of the processing system or an average of temperatures sensed by multiple sensors deployed on the case. The skin temperature 315 varies in response to changes in the battery drain rate, which reflects changes in the platform activity and the rate of energy consumption (or power consumption). Some embodiments of the boost logic may use the skin temperature 315 as a heuristic that is indicative of the rate of energy consumption, which may also be indicative of the duration of the tasks or idle periods. Some embodiments of the boost logic may also use the measured skin temperature 315 to estimate the battery drain rate and thereby estimate the rate of energy consumption. The boost logic may then use the estimated battery drain rate to calculate a heuristic such as the energy credits shown in
At times T<T1, the actual battery drain rate 305 is less than the threshold drain rate 310, which is an indication that one or more processing units may be consuming energy at a relatively low rate, e.g. while performing relatively short duration tasks separated by relatively long idle intervals. Since the processing units are operating at less than capacity, they may be dissipating a small amount of heat (e.g., relative to the cooling capacity of the system) and so the skin temperature 315 may decrease. At T=T1, the skin temperature 315 reaches a threshold skin temperature 320 that indicates that the processing unit(s) may be consuming energy at a relatively low rate, e.g. while performing relatively short duration tasks for a sufficiently long time interval. The boost logic may therefore boost one or more of the processing units by increasing their operating voltage or operating frequency at T=T1.
The boosted processing unit(s) consume energy at a higher rate (i.e., they consume more power) and the battery drain rate 305 may therefore increase for T>T1. The skin temperature 315 may then begin to rise as the boosted processing unit(s) dissipates more heat. The increased rate of energy consumption may be the result of the boosted processing unit(s) performing relatively long duration tasks separated by relatively short idle intervals. Although the increased rate of energy consumption shown in
The skin temperature 315 reaches a threshold skin temperature 325 at T=T2, which indicates that the processing unit(s) may be consuming energy at a relatively high rate, e.g. while performing relatively long duration tasks that have increased the battery drain rate and heat dissipation. The boost logic may therefore de-boost one or more of the processing units by decreasing their operating voltage or operating frequency at T=T2. For example, if the boost logic previously boosted the operating frequency of a processing unit from a relatively low value to a relatively high value, the boost logic may de-boost the processing unit by returning its operating frequency to the relatively low value. Some embodiments may use a threshold skin temperature 325 that is larger than the threshold skin temperature 320 to provide a hysteresis. Some embodiments of the boost logic may also be able to de-boost one or more processing units based on other criteria such as a time limit for operating the processing unit(s) in the boosted mode.
In response to the boost criteria being satisfied (at decision block 410), the boost logic may set a boost level for one or more processing units at block 415. Some embodiments of the boost logic may set the boost level by determining an operating frequency or an operating voltage for the processing units based on an estimated power savings that would result from the race-to-idle created by boosting the processing units. The power savings may be estimated by determining how many platform components of the processing system may be able to enter an idle state by completing a process earlier as a result of the increase in the operating frequency or operating voltage of the boosted processing units. For example, the race-to-idle may allow a memory or a display associated with the process to be released sooner, thereby reducing power consumption. A larger boost (e.g., a larger increase in the operating frequency or operating voltage) may be selected if it results in a power savings that offsets the increased power consumption of the boosted processing units. The boost logic may generate signals that cause the processing unit(s) to be boosted at block 420.
At decision block 425, the boost logic determines whether a boost exit criteria has been satisfied. For example, the boost logic may determine whether accumulated energy credits have fallen below a threshold (as shown in
In some embodiments, the apparatus and techniques described above are implemented in a system comprising one or more integrated circuit (IC) devices (also referred to as integrated circuit packages or microchips), such as the processing device described above with reference to
A computer readable storage medium may include any storage medium, or combination of storage media, accessible by a computer system during use to provide instructions and/or data to the computer system. Such storage media can include, but is not limited to, optical media (e.g., compact disc (CD), digital versatile disc (DVD), Blu-Ray disc), magnetic media (e.g., floppy disc , magnetic tape, or magnetic hard drive), volatile memory (e.g., random access memory (RAM) or cache), non-volatile memory (e.g., read-only memory (ROM) or Flash memory), or microelectromechanical systems (MEMS)-based storage media. The computer readable storage medium may be embedded in the computing system (e.g., system RAM or ROM), fixedly attached to the computing system (e.g., a magnetic hard drive), removably attached to the computing system (e.g., an optical disc or Universal Serial Bus (USB)-based Flash memory), or coupled to the computer system via a wired or wireless network (e.g., network accessible storage (NAS)).
At block 502 a functional specification for the IC device is generated. The functional specification (often referred to as a micro architecture specification (MAS)) may be represented by any of a variety of programming languages or modeling languages, including C, C++, SystemC, Simulink, or MATLAB.
At block 504, the functional specification is used to generate hardware description code representative of the hardware of the IC device. In some embodiments, the hardware description code is represented using at least one Hardware Description Language (HDL), which comprises any of a variety of computer languages, specification languages, or modeling languages for the formal description and design of the circuits of the IC device. The generated HDL code typically represents the operation of the circuits of the IC device, the design and organization of the circuits, and tests to verify correct operation of the IC device through simulation. Examples of HDL include Analog HDL (AHDL), Verilog HDL, SystemVerilog HDL, and VHDL. For IC devices implementing synchronized digital circuits, the hardware descriptor code may include register transfer level (RTL) code to provide an abstract representation of the operations of the synchronous digital circuits. For other types of circuitry, the hardware descriptor code may include behavior-level code to provide an abstract representation of the circuitry's operation. The HDL model represented by the hardware description code typically is subjected to one or more rounds of simulation and debugging to pass design verification.
After verifying the design represented by the hardware description code, at block 506 a synthesis tool is used to synthesize the hardware description code to generate code representing or defining an initial physical implementation of the circuitry of the IC device. In some embodiments, the synthesis tool generates one or more netlists comprising circuit device instances (e.g., gates, transistors, resistors, capacitors, inductors, diodes, etc.) and the nets, or connections, between the circuit device instances. Alternatively, all or a portion of a netlist can be generated manually without the use of a synthesis tool. As with the hardware description code, the netlists may be subjected to one or more test and verification processes before a final set of one or more netlists is generated.
Alternatively, a schematic editor tool can be used to draft a schematic of circuitry of the IC device and a schematic capture tool then may be used to capture the resulting circuit diagram and to generate one or more netlists (stored on a computer readable media) representing the components and connectivity of the circuit diagram. The captured circuit diagram may then be subjected to one or more rounds of simulation for testing and verification.
At block 508, one or more EDA tools use the netlists produced at block 506 to generate code representing the physical layout of the circuitry of the IC device. This process can include, for example, a placement tool using the netlists to determine or fix the location of each element of the circuitry of the IC device. Further, a routing tool builds on the placement process to add and route the wires needed to connect the circuit elements in accordance with the netlist(s). The resulting code represents a three-dimensional model of the IC device. The code may be represented in a database file format, such as, for example, the Graphic Database System II (GDSII) format. Data in this format typically represents geometric shapes, text labels, and other information about the circuit layout in hierarchical form.
At block 510, the physical layout code (e.g., GDSII code) is provided to a manufacturing facility, which uses the physical layout code to configure or otherwise adapt fabrication tools of the manufacturing facility (e.g., through mask works) to fabricate the IC device. That is, the physical layout code may be programmed into one or more computer systems, which may then control, in whole or part, the operation of the tools of the manufacturing facility or the manufacturing operations performed therein.
In some embodiments, certain aspects of the techniques described above may implemented by one or more processors of a processing system executing software. The software comprises one or more sets of executable instructions stored or otherwise tangibly embodied on a non-transitory computer readable storage medium. The software can include the instructions and certain data that, when executed by the one or more processors, manipulate the one or more processors to perform one or more aspects of the techniques described above. The non-transitory computer readable storage medium can include, for example, a magnetic or optical disk storage device, solid state storage devices such as Flash memory, a cache, random access memory (RAM) or other non-volatile memory device or devices, and the like. The executable instructions stored on the non-transitory computer readable storage medium may be in source code, assembly language code, object code, or other instruction format that is interpreted or otherwise executable by one or more processors.
Note that not all of the activities or elements described above in the general description are required, that a portion of a specific activity or device may not be required, and that one or more further activities may be performed, or elements included, in addition to those described. Still further, the order in which activities are listed are not necessarily the order in which they are performed. Also, the concepts have been described with reference to specific embodiments. However, one of ordinary skill in the art appreciates that various modifications and changes can be made without departing from the scope of the present disclosure as set forth in the claims below. Accordingly, the specification and figures are to be regarded in an illustrative rather than a restrictive sense, and all such modifications are intended to be included within the scope of the present disclosure.
Benefits, other advantages, and solutions to problems have been described above with regard to specific embodiments. However, the benefits, advantages, solutions to problems, and any feature(s) that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as a critical, required, or essential feature of any or all the claims. Moreover, the particular embodiments disclosed above are illustrative only, as the disclosed subject matter may be modified and practiced in different but equivalent manners apparent to those skilled in the art having the benefit of the teachings herein. No limitations are intended to the details of construction or design herein shown, other than as described in the claims below. It is therefore evident that the particular embodiments disclosed above may be altered or modified and all such variations are considered within the scope of the disclosed subject matter. Accordingly, the protection sought herein is as set forth in the claims below.