In one embodiment, a method includes, for an N-bit system on chip (SoC), calculating a minimum number of bits NP that can simultaneously switch without producing an error across a complete warranty time period of the N-bit SoC. The method also includes carrying out power estimation calculations for the N-bit SoC using the calculated minimum number of bits NP.
In another embodiment, a method includes receiving a number of channels to be employed in a system on chip (SoC), and determining a number of bits N for the SoC based on the number of channels and based on a number of bits per channel. The method also includes receiving warranty information for the SoC. Based on the bits N and the warranty information, a minimum number of bits NP that can simultaneously switch without producing an error across a complete warranty time period of the N-bit SoC is calculated.
In yet another embodiment, a computer-readable storage medium having encoded therein computer-executable instructions for causing a computing system programmed thereby to perform a method is provided. The method includes, for an N-bit system on chip (SoC), calculating a minimum number of bits NP that can simultaneously switch without producing an error across a complete warranty time period of the N-bit SoC. The method also includes carrying out power estimation calculations for the N-bit SoC using the calculated minimum number of bits NP.
This summary is not intended to describe each disclosed embodiment or every implementation of the probability-based optimization of SoC power disclosed herein. Many other novel advantages, features, and relationships will become apparent as this description proceeds. The figures and the description that follow more particularly exemplify illustrative embodiments.
Embodiments of the disclosure relate to increasing a capacity of a SoC (system on chip) without linearly increasing the power for the SoC, thereby saving power. The SoC power or current has two components: 1) a steady state component, which is a substantially constant current drawn to maintain a steady state (e. g., a constant 0 or a constant 1), and 2) a switching current that is drawn when a transition occurs from a 1 to a 0 or a 0 to a 1. If the switching current is x when 1 bit transitions from 0 to 1 (or 1 to 0), that current doubles to 2× when 2 bits simultaneously switch. In essence, the switching current linearly increases with the increase in the number of bits switching simultaneously.
For an N-bit SoC, power calculations are currently carried out based on all of the N bits of the SoC simultaneously switching. However, the disclosure recognizes that, as N increases (e.g., the capacity of the SoC increases), the probability of all N bits simultaneously switching decreases. Accordingly, to save power, embodiments of the disclosure determine SoC power by using a probability-based method. The method includes, for an N-bit SoC, calculating a minimum number of bits NP that simultaneously switch without producing an error across a complete warranty time period of the N-bit SoC. The calculated minimum number of bits NP may be less than N. The method further includes carrying out power estimation calculations for the N-bit SoC using the calculated minimum number of bits NP. Prior to providing details regarding power estimation calculations using the calculated minimum number of bits NP, examples of general SoC power estimation calculations are provided in connection with
It should be noted that the same reference numerals are used in different figures for same or similar elements. It should also be understood that the terminology used herein is for the purpose of describing embodiments, and the terminology is not intended to be limiting. Unless indicated otherwise, ordinal numbers (e.g., first, second, third, etc.) are used to distinguish or identify different elements or steps in a group of elements or steps, and do not supply a serial or numerical limitation on the elements or steps of the embodiments thereof. For example, “first,” “second,” and “third” elements or steps need not necessarily appear in that order, and the embodiments thereof need not necessarily be limited to three elements or steps. It should also be understood that, unless indicated otherwise, any labels such as “left,” “right,” “front,” “back,” “top,” “bottom,” “forward,” “reverse,” “clockwise,” “counter clockwise,” “up,” “down,” or other similar terms such as “upper,” “lower,” “aft,” “fore,” “vertical,” “horizontal,” “proximal,” “distal,” “intermediate” and the like are used for convenience and are not intended to imply, for example, any particular fixed location, orientation, or direction. Instead, such labels are used to reflect, for example, relative location, orientation, or directions. It should also be understood that the singular forms of “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise.
Power estimation may be carried out at an initial stage of the SoC design cycle. Initial power calculation is based on Ohm's Law (e.g., “VI” power calculation), where the total input/output (I/O) current is computed and then multiplied with a nominal I/O supply voltage. As indicated above, the total I/O current is divided into steady state and switching currents. Examples of steady state current calculations are described below with the help of an example Flash I/O interface.
The steady state current on a Flash interface is affected by the following two parameters:
As indicated above, the switching current is the current drawn from the I/O supply at the time of switching (e.g., the current supplied when a transition occurs from a 1 to a 0 or a 0 to a 1).
Consider a Flash I/O domain having “ch” channels with each of the ch channels including “8” single ended data I/Os and “1” differential strobe I/O, with the differential strobe I/O having a switching current that is twice the switching current of the single ended data I/O. Here, the total maximum switching current is
I
SWITCHING
_
MAX
_
TOTAL=(N×ISWITCH_DATA)+(NSTROBE×ISWITCH_STROBE) Equation 4
and the maximum switching power is
P
IOSWITCHING
_
MAX
=V
IO
_
DOMAIN
×I
SWITCHING
_
MAX
_
TOTAL Equation 5
In equations 3, 4 and 5:
An average current may be obtained by averaging instantaneous current values over a given time period. In other words, the average current of a signal may be the area under a waveform with respect to a time axis. The Flash I/O domain is taken as an example to explain average current. A current profile for a worst case of one write cycle is shown in
Equation 7 below is used to calculate the average current of the waveform shown in
The waveform shown in
For a “ch” channel Flash I/O domain, there would be “N” data bits and “ch” strobe bits. The max average current for data I/Os is provided in Equation 12A below.
I
AVERAGE
_
DATA
=N×I
STEADY
_
STATE+[½×Data Rate×(N×ISWITCH_DATA)×TSWITCHING_TIME)] Equation 12A
For “ch” strobe bits, the average current is provided in Equation 12B below.
I
AVERAGE
_
STROBE
=ch×I
STEADY
_
STATE
_
STROBE+[(½×Data Rate×(ch×ISWITCH_STROBE)×TSWITCHING_TIME)] Equation 12B
Total average current is provided in Equation 12C below.
I
AVERAGE
=I
AVERAGE
_
DATA
+I
AVERAGE
_
STROBE Equation 12C
The average power is provided in Equation 13 below.
I
AVERAGE
=I
AVERAGE
×V
IO
_
DOMAIN Equation 13
In equations 6-13:
In order to meet the increasing demand for data storage, the capacity of a SoC may be increased by selecting a higher number of Flash memory channels. As the number of channels increases, the number of data bits increases (e.g., each channel includes 8 data bits and 1 strobe bit), thereby increasing the simultaneous switching bits. The greater the simultaneous switching bits, the more the current. In other words, to cope with the increased Flash memory channels, the average power demand also increases due to more current consumption.
As indicated above, conventional solutions are too pessimistic as they would consider all data bits switching while calculating the maximum average power. Thus, as noted above, a probability-based methodology is implemented, which calculates the minimum number of data bits switching without giving any error across the complete warranty time of the product. To demonstrate why a probability-based method is suitable, an example comparing 8 bit and 64 bit Flash memory systems is discussed below in connection with
A probability of switching of all “N” bits simultaneously is given by Equation 14.
The probability of simultaneous switching of “59” bits out of total “64” bits calculated using Equation 15 is ≈4.13×10−13. In an example memory device having a warranty period of 5 years and a 10 gigabyte (GB) per day data handling capacity, the maximum switching of any one bit in 5 years for a 64 bit data bus is (10×109×8×5×365)÷64=2.28×1012. Since the probability of “59” bits switching is less than the maximum switching possible in a complete lifetime, any switching beyond this could be ignored without introducing any data error. Using the probability theorem, a break-even point is computed where the probability of minimum simultaneously switching bits is computed without compromising data integrity. The minimum simultaneously switching bits could be further lowered in case the SoC has a back to back error correction mechanism.
A data bit has four transition states 0→0, 1→1, 1→0 and 0→1, but only two states 0→1 and 1→0 draw switching current. For the remaining transition states 0→0 and 1→1, switching current is zero (and the steady state current flows). Hence, the probability of switching is defined by Equation 16 below.
At step 606, a binomial probability theorem in Equation 17 below is used to calculate the probability of “R” data bits switching out of the total of “N” data bits.
In case the SoC has an error correction mechanism, an extra error correction bit (ECB) 607 is taken as an input to find a back to back error probability. A decision regarding how many times the original data should be resent by the memory to the host if the data is incorrect is decided based on the ECB value. At step 608, Equation 18 below is used to calculate the probability of R bits switching considering the ECB.
P(R with ECB)[0:N]=[P(R)]ECB Equation 18
A probability of at least “R+1” bits switching simultaneously is the sum of probabilities from “R+1” bits switching to “N” bits switching. At step 610, Equation 19 below is used to calculate the probability of at least “R+1” bits switching.
To calculate a maximum switching of a bit over a life cycle of the product (e.g., the Flash memory), parameters shown in box 611 and included below are employed:
At step 612, Equation 21 below is used to calculate the maximum number of times a data bit can switch over the product's life cycle.
NSwitch_lifetime=NSwitch_one_day×(WP)×365 Equation 21
A number of error bits over the product's life cycle is calculated by multiplying the maximum number of times a data bit can switch over the product's life cycle (calculated in Equation 21) with the probability of at least “R+1” bits switching simultaneously (calculated in Equation 19). At step 614, Equation 22 below is used to calculate the number of error bits over the product's life cycle.
Error Bit [R→0:N−1]=P(≥R+1 with ECB)×NSwitch_lifetime Equation 22
There is an error bit value for every switching bit (R), which is calculated using Equation 22. As the number of simultaneously switching bits increases, the number of error bits decreases. Whenever an error bit number is less than “1” or when “R=N” is reached, the corresponding switching bits “R” is the minimum number of bits for less than one error. The determination of a value for the minimum number of data bits switching for less than one error (NP) is shown in box 615. At step 616, a bit counter “i” is set equal to 1. At step 618, a determination is made as to whether the bit error number (Error[i]) is less than 1. If Error[i] is less than 1 at step 618, then i becomes the value for NP, which is show in box 623. If Error[i] is not less than 1, “i” is incremented by 1 at step 620. At step 622, a determination is made as to whether i is equal to N. If i is not equal to N, control returns to step 618. If i is equal to N, then the value of NP is equal to N.
The obtained value of NP is used to carry out certain electrical current calculations listed in box 625. Additional inputs used for the current calculations are shown in boxes 627 and 629. The electrical specifications of the I/O (Driving Strength) and the memory (ODT) are taken as inputs to calculate the steady state current. The steady state current calculation is based on a total number of data bits (NDATA) and strobe bits (NSTROBE) present in the flash I/O domain. Equation 24 below is used to calculate the steady state current.
As noted above, the minimum data bits switching for less than one bit error (NP) ( ) is calculated in box 615, and that value is taken as an input to calculate the switching current and power. Equations 25 and 26 below are used to calculate the switching current and switching power, respectively.
I
SWITCHING
_
PROBABILITY=(NP×ISWITCH_DATA)+(NSTROBE×ISWITCH_STROBE) Equation 25
I
SWITCHING
_
PROBABILITY
=I
SWITCHING
_
MAX
×V
IO
_
DOMAIN Equation 26
The dynamic current is the sum of the steady state and switching currents. In other words, it is the total current drawn by the I/O at the time of switching. Equations 27 and 28 below are used to calculate dynamic current and dynamic power, respectively.
I
DYNAMIC
=I
STEADY
_
STATE
+I
SWITCHING
_
PROBABILITY Equation 27
I
DYNAMIC
=I
DYNAMIC
×V
IO
_
DOMAIN Equation 28
The switching time (TSWITCHING_TIME) is taken as an input to calculate the average current. The average current calculation is based on the steady state and switching currents. Equations 29 and 30 are used to calculate the average current and average power, respectively.
I
AVERAGE
_
PROB
=I
STEADY
_
STATE+[(½×Data Rate×ISWITCHING_PROBABILITY×TSWITCHING_TIME)] Equation 29
I
AVERAGE
_
PROB
=I
AVERAGE
_
PROB
×V
IO
_
DOMAIN Equation 30
The power calculations are shown in parallelogram 631 of
The computing environment of
Embodiments of the disclosure can be applied in the context of computer systems other than computing environment 700. Other appropriate computer systems include handheld devices, multi-processor systems, various consumer electronic devices, mainframe computers, and the like. Those skilled in the art will also appreciate that embodiments can also be applied within computer systems wherein tasks are performed by remote processing devices that are linked through a communications network (e.g., communication utilizing Internet or web-based software systems). For example, program modules may be located in either local or remote memory storage devices or simultaneously in both local and remote memory storage devices. Similarly, any storage of data associated with embodiments of the disclosure may be accomplished utilizing either local or remote storage devices, or simultaneously utilizing both local and remote storage devices.
Computing device 710 further includes a hard disc drive 724, an external memory device 728, and an optical disc drive 730. External memory device 728 can include an external disc drive or solid state memory that may be attached to computing device 710 through an interface such as Universal Serial Bus interface 734, which is connected to system bus 716. Optical disc drive 730 can illustratively be utilized for reading data from (or writing data to) optical media, such as a CD-ROM disc 731. Hard disc drive 724 and optical disc drive 730 are connected to the system bus 716 by a hard disc drive interface 732 and an optical disc drive interface 736, respectively. The drives and external memory devices and their associated non-transitory computer-readable media provide nonvolatile storage media for the computing device 710 on which computer-executable instructions and computer-readable data structures may be stored. Other types of media that are readable by a computer may also be used in the exemplary operation environment.
A number of program modules may be stored in the drives and RAM 720, including an operating system 738, one or more application programs 740, other program modules 742 and program data 744. In particular, application programs 740 can include code used to carry out probability-related calculations shown in
Input devices including a keyboard 763 and a mouse 765 are connected to system bus 716 through an input/output interface 746 that is coupled to system bus 716. Monitor 748 is connected to the system bus 716 through a video adapter 750 and provides graphical images to users. Other peripheral output devices (e.g., speakers or printers) could also be included but have not been illustrated. In accordance with some embodiments, monitor 748 comprises a touch screen that both displays input and provides locations on the screen where the user is contacting the screen.
The computing device 710 may operate in a network environment utilizing connections to one or more remote computers, such as a remote computer 752. The remote computer 752 may be a server, a router, a peer device, or other common network node. Remote computer 752 may include many or all of the features and elements described in relation to computing device 710, although only a memory storage device 754 has been illustrated in
The computing device 710 is connected to the LAN 756 through a network interface 760. The computing device 710 is also connected to WAN 758 and includes a modem 762 for establishing communications over the WAN 758. The modem 762, which may be internal or external, is connected to the system bus 716 via the I/O interface 746. Modem 762 may be a wired modem or wireless modem that receives and transmits signals through an antenna.
In a networked environment, program modules depicted relative to the computing device 710, or portions thereof, may be stored in the remote memory storage device 754. For example, application programs may be stored utilizing memory storage device 754. In addition, data associated with an application program may illustratively be stored within memory storage device 754. It will be appreciated that the network connections shown in
In general, the present embodiments are operational with numerous general purpose or special purpose computing system environments or configurations. Examples of well-known computing systems, environments, and/or configurations that may be suitable for use with the present embodiments include, but are not limited to, personal computers, server computers, hand-held or laptop devices, PDAs, multiprocessor systems, microprocessor-based systems, set top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, telephony systems, distributed computing environments that include any of the above systems or devices, and the like.
The present embodiments may be described in the general context of computer-executable instructions, such as program modules, stored on one or more computer-readable storage media (e.g., volatile memory components (such as DRAM or SRAM), or non-volatile memory components (such as flash memory or hard drives)) and executed on a computer (e.g., any commercially available computer, including smart phones or other mobile devices that include computing hardware). The term computer-readable storage media does not include communication connections, such as signals and carrier waves. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks. The present embodiments may be designed to be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules are located in both local and remote computer storage media including memory storage devices.
Although the operations in some embodiments are described in a particular sequence, it should be understood that this manner of description encompasses rearrangement. For example, operations described sequentially may in some cases be rearranged or performed to operate in parallel.
It should be noted that any functionality described herein can be performed, at least in part, by one or more hardware logic components, instead of software. For example, and without limitation, illustrative types of hardware logic components that can be used include Field-programmable Gate Arrays (FPGAs), Application-specific Integrated Circuits (ASICs), Application-specific Standard Products (ASSPs), System-on-a-chip systems (SOCs), Complex Programmable Logic Devices (CPLDs), etc.
Furthermore, any of the software-based embodiments (comprising, for example, computer-executable instructions for causing a computer to perform any of the disclosed methods) can be uploaded, downloaded, or remotely accessed through a suitable communication means. Such suitable communication means include, for example, the Internet, the World Wide Web, an intranet, software applications, cable (including fiber optic cable), magnetic communications, electromagnetic communications (including RF, microwave, and infrared communications), electronic communications, or other such communication means.
A software tool developed to verify the methodology shown in
The method described above in connection with
The probability based present solution shows that the minimum number of data bits switching in the product life cycle of the 8-channel controller is 46 bits. As per
The difference in the average current is explained using the probability curve shown in
Advantages of embodiments of the disclosure include the following:
The illustrations of the embodiments described herein are intended to provide a general understanding of the structure of the various embodiments. The illustrations are not intended to serve as a complete description of all of the elements and features of apparatus and systems that utilize the structures or methods described herein. Many other embodiments may be apparent to those of skill in the art upon reviewing the disclosure. Other embodiments may be utilized and derived from the disclosure, such that structural and logical substitutions and changes may be made without departing from the scope of the disclosure. Additionally, the illustrations are merely representational and therefore are not drawn to scale. Certain proportions within the illustrations may be exaggerated, while other proportions may be reduced. Accordingly, the disclosure and the figures are to be regarded as illustrative rather than restrictive.
Although specific embodiments have been illustrated and described herein, it should be appreciated that any subsequent arrangement designed to achieve the same or similar purpose may be substituted for the specific embodiments shown. This disclosure is intended to cover any and all subsequent adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, will be apparent to those of skill in the art upon reviewing the description.
In addition, in the foregoing Detailed Description, various features may be grouped together or described in a single embodiment for the purpose of streamlining the disclosure. This disclosure is not to be interpreted as reflecting an intention that the claimed embodiments employ more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter may be directed to less than all of the features of any of the disclosed embodiments.
The above-disclosed subject matter is to be considered illustrative, and not restrictive, and the appended claims are intended to cover all such modifications, enhancements, and other embodiments, which fall within the true spirit and scope of the present disclosure. Thus, to the maximum extent allowed by law, the scope of the present disclosure is to be determined by the broadest permissible interpretation of the following claims and their equivalents, and shall not be restricted or limited by the foregoing detailed description.