The present application claims priority to Indian Patent Application Serial No. 6690/CHE/2015, filed Dec. 14, 2015 and entitled “DYNAMIC PREDICTIVE WAKE-UP TECHNIQUES,” which is incorporated herein by reference in its entirety.
I. Field of the Disclosure
The technology of the disclosure relates generally to sleep management for integrated circuits.
II. Background
The number and variety of mobile computing devices has increased dramatically over the past two decades. Part of the reason for the expansion in the number of devices is the increased versatility of such devices. In particular, the number and types of functions that are enabled has steadily increased so that what were initially simple mobile phones have evolved into multi-function multimedia devices. The increase in versatility is due, in part, to better battery life. However, in a moderately ironic twist, battery life is negatively impacted as people use the devices for more activities that have been enabled by the increased functionality.
One way that manufacturers have attempted to extend battery life is by putting various components within a mobile computing device into a sleep or low-power mode. Various types of sleep modes exist, and different elements within the mobile computing device may use different sleep modes depending on system parameters such as entry and exit latency, energy overhead, and the like. One particular sleep mode that is used by manufacturers is to place a central processing unit (CPU) in a low-power state after requesting a memory access or initiating a multimedia coding transfer. When the memory access or multimedia coding transfer is complete, the CPU wakes and issues the next instruction and/or begins processing the data from the transfer.
Delays may accrue when the CPU begins to wake after completion of the memory access or multimedia coding transfer. Various techniques such as low-power mode disabling have been implemented to try to make sure the CPU is ready substantially concurrently with the end of the memory access or multimedia coding transfer so as to avoid accrual of delays. However, these methods are not optimal because disabling the low-power mode reduces the power savings and decreases battery life.
Aspects disclosed in the detailed description include dynamic predictive wake-up techniques. In an exemplary aspect, a central processing unit (CPU) may initiate a memory access or other input/output (I/O) transfer such as a multimedia encoding transfer. The CPU may ascertain if a predicted time for the transfer exceeds an amount of time required to enter and exit a low-power mode. If the predicted time is sufficiently large to justify entering the low-power mode, then the CPU enters the low-power mode after the transfer is initiated. An I/O controller may generate the predicted time that the CPU uses to determine whether to enter the low-power mode. The I/O controller may further use the predicted time for the transfer to predict an early wake-up. Specifically, the I/O controller may calculate how long the transfer will take and compare that calculation to a known exit latency associated with the CPU. The calculated value is decremented by the amount of the known exit latency and the I/O controller may generate an early wake command at the decremented value. The CPU receives the early wake command and wakes such that the CPU is awake and ready to process data at conclusion of the transfer. While it is possible to do a static calculation, it should be appreciated that encoding and data transfer speeds may vary during the course of the transfer. Accordingly, the I/O controller may dynamically or iteratively recalculate the time at which the early wake command should be sent. By waking the CPU “just in time” to begin the next transfer, maximal power savings are achieved relative to minimal latency. Thus, performance of the device is improved. While aspects of the present disclosure are particularly well-suited for use with a CPU, other logic elements that initiate I/O transfers and have sleep cycles, such as a digital signal processor (DSP), a graphics processing unit (GPU), a microcontroller, base band processor (BBP) or other processor or sub-system which uses the data of an input/output or is needed to control the next processing, may also benefit from the concepts of the present disclosure.
In this regard in one aspect, a method for promoting efficient waking of a logic element is disclosed. The method includes initiating, at a logic element, an I/O transfer. The method also includes determining a first transfer rate associated with the I/O transfer. The method also includes calculating a first time needed to complete the I/O transfer based on the first transfer rate. The method also includes deducting from the first time an exit latency associated with the logic element to determine an early wake time. The method also includes determining if the first transfer rate has changed. The method also includes updating the early wake time. The method also includes sending a wake command to the logic element at the early wake time.
In another aspect, a computing device is disclosed. The computing device includes an I/O element. The computing device also includes a logic element configured to initiate an I/O transfer with the I/O element. The computing device also includes an I/O controller. The I/O controller is configured to manage the I/O transfer while the logic element enters a low-power mode. The I/O controller is also configured to determine a first transfer rate associated with the I/O transfer. The I/O controller is also configured to calculate a first time needed to complete the I/O transfer based on the first transfer rate. The I/O controller is also configured to deduct from the first time an exit latency associated with the logic element to determine an early wake time. The I/O controller is also configured to determine if the first transfer rate has changed. The I/O controller is also configured to update the early wake time. The I/O controller is also configured to send a wake command to the logic element at the early wake time.
In another aspect, an I/O controller is disclosed. The I/O controller includes a bus interface configured to couple to a bus. The I/O controller also includes circuitry. The circuitry is configured to manage an I/O transfer while an initiating logic element enters a low-power mode. The circuitry is also configured to determine a first transfer rate associated with the I/O transfer. The circuitry is also configured to calculate a first time needed to complete the I/O transfer based on the first transfer rate. The circuitry is also configured to deduct from the first time an exit latency associated with the initiating logic element to determine an early wake time. The circuitry is also configured to determine if the first transfer rate has changed. The circuitry is also configured to update the early wake time. The circuitry is also configured to send a wake command to the initiating logic element at the early wake time.
In another aspect, a method for promoting efficient waking of a logic element is disclosed. The method includes initiating, at a logic element, an I/O transfer. The method also includes determining a first transfer rate associated with the I/O transfer. The method also includes calculating a time completion value representing a time needed to complete the I/O transfer based on the first transfer rate. The method also includes calculating a current transfer rate. The method also includes updating the time completion value to a current time completion value based on the current transfer rate. The method also includes comparing the current time completion value to a known exit latency. The method also includes sending a wake command to the logic element when the current time completion value is less than or equal to the known exit latency.
In another aspect, a method for controlling entry of a logic element into a low-power mode is disclosed. The method includes calculating, with an I/O controller, a first time needed to complete an I/O transfer initiated by a logic element. The method also includes comparing the first time to a sum of an exit latency associated with the logic element and an entry latency associated with the logic element. The method also includes precluding entry into a low-power mode if the first time is less than the sum.
With reference now to the drawing figures, several exemplary aspects of the present disclosure are described. The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any aspect described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects.
Aspects disclosed in the detailed description include dynamic predictive wake-up techniques. In an exemplary aspect, a central processing unit (CPU) may initiate a memory access or other input/output (I/O) transfer such as a multimedia encoding transfer. The CPU may ascertain if a predicted time for the transfer exceeds an amount of time required to enter and exit a low-power mode. If the predicted time is sufficiently large to justify entering the low-power mode, then the CPU enters the low-power mode after the transfer is initiated. An I/O controller may generate the predicted time that the CPU uses to determine whether to enter the low-power mode. The I/O controller may further use the predicted time for the transfer to predict an early wake-up. Specifically, the I/O controller may calculate how long the transfer will take and compare that calculation to a known exit latency associated with the CPU. The calculated value is decremented by the amount of the known exit latency and the I/O controller may generate an early wake command at the decremented value. The CPU receives the early wake command and wakes such that the CPU is awake and ready to process data at conclusion of the transfer. While it is possible to do a static calculation, it should be appreciated that encoding and data transfer speeds may vary during the course of the transfer. Accordingly, the I/O controller may dynamically or iteratively recalculate the time at which the early wake command should be sent. By waking the CPU “just in time” to begin the next transfer, maximal power savings are achieved relative to minimal latency. Thus, performance of the device is improved. While aspects of the present disclosure are particularly well-suited for use with a CPU, other logic elements that initiate I/O transfers and have sleep cycles, such as a digital signal processor (DSP), a graphics processing unit (GPU), a microcontroller, base band processor (BBP) or other processor or sub-system which uses the data of an input/output or is needed to control the next processing, may also benefit from the concepts of the present disclosure.
While the present disclosure uses the word “transfer” to describe the I/O action, it should be appreciated that some such transfers may also be described as transactions without departing from the scope of the present disclosure. For example, an encoding/decoding transfer may also be described as an encoding or decoding transaction. Thus, as used herein, a transaction is a subset of the transfers discussed herein.
Exemplary aspects of the present disclosure deal with controlling whether a logic element, such as a CPU, enters a low-power mode and using a prediction related to a pending I/O transfer to determine when to wake the logic element so as to reduce latency associated with the transfer. In this regard,
In normal operation, the CPU 12 may initiate an I/O transfer such as a memory access (e.g., read or write) to the memory 16 or utilization of the multimedia encoder 14 to encode data. As such, the memory 16 and the multimedia encoder 14 are sometimes referred to herein as I/O elements. It should be appreciated that other I/O controllers and/or other I/O elements (not illustrated) may also be present, and the term I/O element is not limited to memories and multimedia encoders. While the CPU 12 waits for the transfer to conclude, the CPU 12 may be idle. In such instances, it may be appropriate to put the CPU 12 into a low-power mode. Exemplary aspects of the present disclosure initially determine if use of a low-power mode is appropriate, and if a low-power mode is used, exemplary aspects of the present disclosure further allow the CPU 12 to return to normal operation immediately prior to conclusion of the I/O transfer. Such timely return to normal operation reduces latency associated with prior solutions.
In this regard,
In contrast, exemplary aspects of the present disclosure initially determine whether latency will be increased by entering the low-power mode, and if the low-power mode is used, exemplary aspects eliminate the delay waiting for the wake-up to finish. In particular, exemplary aspects of the present disclosure predict when a data transfer is likely to be completed and compare this time prediction to a sum of the entry latency and the exit latency. If the time prediction is shorter than the sum, then entering a low-power mode is actually inefficient because it adds latency to the system and the potential power savings are not realized since the logic element begins exiting the low-power mode as soon as the low-power mode has been achieved. Exemplary aspects further decrement the predicted time by a wake-up latency for the CPU 12 of
In this regard,
With continued reference to
With continued reference to
With continued reference to
Instead of iteratively comparing the pending transfer time to the exit latency (e.g., block 100), exemplary aspects of the present disclosure may calculate a wake-up time by decrementing a calculated transfer time. In this regard,
The dynamic wake-up techniques according to aspects disclosed herein may be provided in or integrated into any processor-based device. Examples, without limitation, include a set top box, an entertainment unit, a navigation device, a communications device, a fixed location data unit, a mobile location data unit, a mobile phone, a cellular phone, a smart phone, a tablet, a phablet, a computer, a portable computer, a desktop computer, a personal digital assistant (PDA), a monitor, a computer monitor, a television, a tuner, a radio, a satellite radio, a music player, a digital music player, a portable music player, a digital video player, a video player, a digital video disc (DVD) player, a portable digital video player, and an automobile.
Those of skill in the art will further appreciate that the various illustrative logical blocks, modules, circuits, and algorithms described in connection with the aspects disclosed herein may be implemented as electronic hardware, instructions stored in memory or in another computer readable medium and executed by a processor or other processing device, or combinations of both. The devices described herein may be employed in any circuit, hardware component, integrated circuit (IC), or IC chip, as examples. Memory disclosed herein may be any type and size of memory and may be configured to store any type of information desired. To clearly illustrate this interchangeability, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. How such functionality is implemented depends upon the particular application, design choices, and/or design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
The various illustrative logical blocks, modules, and circuits described in connection with the aspects disclosed herein may be implemented or performed with a processor, a DSP, an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices (e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration).
The aspects disclosed herein may be embodied in hardware and in instructions that are stored in hardware, and may reside, for example, in Random Access Memory (RAM), flash memory, Read Only Memory (ROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), registers, a hard disk, a removable disk, a CD-ROM, or any other form of computer readable medium known in the art. An exemplary storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a remote station. In the alternative, the processor and the storage medium may reside as discrete components in a remote station, base station, or server.
It is also noted that the operational steps described in any of the exemplary aspects herein are described to provide examples and discussion. The operations described may be performed in numerous different sequences other than the illustrated sequences. Furthermore, operations described in a single operational step may actually be performed in a number of different steps. Additionally, one or more operational steps discussed in the exemplary aspects may be combined. It is to be understood that the operational steps illustrated in the flowchart diagrams may be subject to numerous different modifications as will be readily apparent to one of skill in the art. Those of skill in the art will also understand that information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.
The previous description of the disclosure is provided to enable any person skilled in the art to make or use the disclosure. Various modifications to the disclosure will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other variations without departing from the spirit or scope of the disclosure. Thus, the disclosure is not intended to be limited to the examples and designs described herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Number | Date | Country | Kind |
---|---|---|---|
6690/CHE/2015 | Dec 2015 | IN | national |