The present invention relates to the general field of shift registers that aid in performing fast calculations based on shifting contents among registers. These types of shift registers are especially useful in signal processor applications.
Shift register arrays are widely used in many signal processing applications such as Finite Impulse Response (FIR) filters and Pipeline Fast Fourier Transforms (FFT) and its inverse Fast Fourier Transforms (IFFT).
Since there is no combinational circuit logic between registers, the shift register array can run at a high speed in conventional integrated circuit designs, for example, a Very Large Scale Integrated Circuit (VLSI) implementation. However, since N shifts are required for the input data to reach the output for each cycle in the shift register array, dynamic power consumption is correlates directly to the number N. Consequently, when N is a large number, the power consumption is also large.
Engineers are keenly aware that power consumption is an important concern in modern VLSI design, which is especially true for integrated circuits used in mobile or portable devices. A low-power design is strongly desirable since these devices are powered by a battery. In such cases, it is justified to trade reasonable hardware cost for lower power consumption. Consequently, the invention is directed to reduce the power consumption in the shift-register array using a low-power register array. The invention provides a Random Access Memory (RAM) technique that leads to low-power dissipation. Since the invention is constructed of registers, the invention can also achieve high throughput.
The invention provides a low-power register array for fast shift calculations. In the exemplary embodiments, a low-power RAM-like register array is utilized to provide the shift operations. The RAM-like register is similar to the shift register array and it can achieve a high throughput required by some applications such as fast FIR and high-speed FFT. However, the invention consumes much less dynamic power than a shift register array as it works like a RAM. Several exemplary architectures for the low-power RAM-like register array are provided.
In the exemplary embodiment, a data register for use in a computer comprises a clock terminal configured to receive a clock signal. A plurality of registers are configured to selectively store data. A data input circuit is coupled to the registers and configured to receive input data and selectively deliver the input data to the registers. A data output circuit is coupled to the data registers and configured to selectively output the output data. A selector is coupled to the data input circuit and the data output circuit, and configured to permit the input data to enter selected registers through the data input circuit and permit selected registers to output data through the data output circuit.
The invention provides an efficient technique for loading the shift registers without a large number of simultaneous serial shifts. The result is a power-efficient device that achieves high performance objectives while minimizing power consumption.
The invention is described with reference to the following figures.
The invention is described with reference to specific apparatus and embodiments. Those skilled in the art will recognize that the description is for illustration and to provide the best mode of practicing the invention.
One exemplary concept of the invention is that a low-power RAM-like register array can be constructed so that only one data is input to the array and one data is output from the array at any given time. Therefore, the N data shifts may be avoided by delivering the input data to a register, whose content will be the output at current clock cycle. Thus, only one register is toggled instead of N registers. This concept helps to significantly reduce power consumption while still providing a fast throughput.
The data input circuit 330 can be constructed in a number of different ways, which are demonstrated below in additional figures. Likewise, while the data output circuit 340 is shown as a multiplexer in all the figures below, there are similar modifications that can be made to that circuit.
The address generator 350A generates an address signal for the demultiplexer 330 so that the input data can be correctly passed to the register, whose content will be output at this cycle. The same address signal goes to the multiplexer 340 since the register accepting the input data will produce the output.
Compared to the shift register architecture in
Additional embodiments are provided to demonstrate further reductions in hardware that can be implemented according to the invention.
This embodiment eliminates the demultiplexer 330A in
Another way to achieve more power saving with a reasonable extra hardware is to use clock gating.
A comparison in term of hardware cost and power saving for the above three architectures are shown in Table 1.
As shown in Table 1, the architectures depicted in
Advantages of the invention are numerous. The invention provides an efficient technique for loading the shift registers without a large number of simultaneous serial shifts. The result is a power-efficient device that achieves high performance objectives while minimizing power consumption.
Having disclosed exemplary embodiments and the best mode, modifications and variations may be made to the disclosed embodiments while remaining within the subject and spirit, of the invention as defined by the following claims.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2006/050415 | 2/8/2006 | WO | 00 | 3/11/2009 |
Number | Date | Country | |
---|---|---|---|
60651434 | Feb 2005 | US |