The present invention relates generally to adder circuits for use in semiconductor integrated circuits and other electronic devices, and more particularly to techniques for generating overflow flags in such adder circuits.
In a conventional n-bit prefix tree adder, the addition of two numbers A and B,
represented in two's complement binary form, can be accomplished by computing:
where c−1 is the primary carry-input signal. The signals designated gj, tj, pj and cj are referred to as generate, transmit, propagate and carry signals, respectively. The resulting sum of A and B is
An overflow occurs, and the resulting sum is invalid, if
OVF=cn−1⊕cn−2=1,
where OVF is an overflow flag.
A gate level model may be used to quantify the computational delay of the
Improved adders which can provide significant reductions in logic depth, computational delay and circuit area relative to conventional adders are disclosed in the above-cited U.S. patent applications Ser. No. 09/291,677, Ser. No. 09/525,644 and Ser. No. 09/569,022. Although these improved adders provide substantial advantages over conventional adders, a need nonetheless remains for further improvements, particularly in terms of reducing the above-described computational delay associated with overflow flag generation.
The invention provides improved techniques for generation of an overflow flag in a prefix tree adder or other type of adder circuit. In accordance with one aspect of the invention, an overflow flag is generated in parallel with generation of at least one of a sum signal and a primary carry-output signal of the adder. As a result, the computational delay associated with overflow flag generation is substantially reduced. Advantageously, this improvement is provided without requiring any increase in the transistor count or circuit area of the adder.
In an illustrative embodiment of the invention, an n-bit adder includes a number of computational stages each associated with one or more bit positions. Particular ones of the computational stages generate a sum output signal in the form of a final sum bit sn−1 of the n-bit adder and a primary carry-output signal cn−1 of the n-bit adder. A flag generation circuit is coupled to at least one of the stages and is operative to generate an overflow flag for the adder substantially in parallel with the generation of the sum output signal sn−1 and the primary carry-output signal cn−1. The flag generation circuit is configured such that it does not require the primary carry-output signal cn−1 to generate the overflow flag for the n-bit adder.
In accordance with another aspect of the invention, the flag generation circuit for an n-bit adder may generate an overflow flag OVF as:
OVF=cn−2{overscore (t)}n−1+gn−1{overscore (c)}n−2,
where cn−2 is an n−2 carry signal of the adder, tn−1 is an n−1 transmit signal of the adder, and gn−1 is an n−1 generate signal of the adder. As a result, the generation of the overflow flag OVF does not require the use of a primary carry-output signal cn−1 of the adder.
In accordance with a further aspect of the invention, the flag generation circuit may include a multiplexer which selects one of a plurality of input signals for propagation to its output as the overflow flag based at least in part on a signal associated with the signal line of at least one of the computational stages. For example, the overflow flag OVF described above can be generated using a 2-to-1 multiplexer which receives as a first input a transmit signal tn−1, receives as a second input a generate signal {overscore (g)}n−1, and receives as a select signal a carry signal cn−2. Depending on the value of the select signal, the multiplexer selects either the transmit signal tn−1 or the generate signal {overscore (g)}n−1 for propagation in inverted form to its output as the overflow flag.
The efficient overflow flag generation techniques of the present invention provide a substantially reduced computational delay relative to the above-described conventional adder architectures. The techniques of the invention are applicable to a wide variety of adders, including prefix tree adders, carry-lookahead adders, carry-skip adders, carry-ripple adders, carry-save adders as well as other types of adders, and to both radix-2 and non-radix-2 implementations of such adders. These and other features and advantages of the present invention will become more apparent from the accompanying drawings and the following detailed description.
The present invention will be illustrated below in conjunction with an example prefix tree adder. It should be understood, however, that the invention is not limited to use with any particular type of adder, but is instead more generally applicable to any type of adder in which it is desirable to significantly improve the computational delay associated with generation of an overflow flag without requiring an increase in transistor count or area for the adder circuit, and thus without substantially increasing the cost or complexity of the adder circuit. For example, although illustrated using a radix-2 prefix tree adder, it will be apparent to those skilled in the art that the disclosed techniques are readily applicable to both radix-2 and non-radix-2 adders, as well as to other types of adders, such as carry-lookahead adders, carry-skip adders, carry-ripple adders and carry-save adders.
In accordance with the invention, the overflow flag OVF is generated in parallel with generation of the sum signal sn−1 and the primary carry-output signal cn−1, using 2-to-1 inverting multiplexer 208 as shown in FIG. 2. More specifically, the following formulation is used to define the overflow flag OVF in the illustrative embodiment:
OVF=cn−1⊕cn−2=cn−2⊕(gn−1+tn−1+tn−1cn−2),
which can be simplified as follows:
OVF=cn−2{overscore (t)}n−1+gn−1{overscore (c)}n−2.
Based on this simplification, the overflow flag OVF does not require the use of the primary carry-output signal cn−1, and thus can be generated in parallel with the output sum signal sn−1 and the primary carry-output signal cn−1, as is shown in FIG. 2.
The 2-to-1 inverting multiplexer 208 receives as a first input the transmit signal tn−1 and as a second input the generate signal {overscore (g)}n−1. The carry signal cn−2 is also applied to the multiplexer 208 as a select signal. Depending on the value of the select signal, the multiplexer 208 selects either the transmit signal tn−1 or the generate signal {overscore (g)}n−1 for propagation in inverted form to its output as the overflow flag OVF. More particularly, if the carry signal cn−2 has a logic “one” value, the transmit signal tn−1 is propagated in inverted form to the output of the multiplexer 208 as the overflow flag OVF, and if the carry signal cn−2 has a logic “zero” value, the generate signal {overscore (g)}n−1 is propagated in inverted form to the output of the multiplexer 208 as the overflow flag OVF. The multiplexer 208 thus implements the simplified equation given above for the overflow flag OVF. Although the multiplexer 208 in this embodiment is an inverting multiplexer, the invention can be implemented using other types of circuitry.
Using the previously-described gate level model, the computation delay associated with generation of the overflow flag OVF in the circuit 200 of
Advantageously, the reduction in the computation time required to generate the overflow flag OVF in the illustrative embodiment is achieved without requiring any significant increase in the transistor count or circuit area of the adder, and thus without increasing adder cost or complexity.
It should be emphasized that the logic circuitry in
As noted previously, the present invention may be implemented in a number of different types of adders. One such adder will now be described in greater detail in conjunction with FIG. 3.
Step 1 (1 Stage):
Calculate
Step 2 (┌log2n┐ Stages):
For k=1 . . . ┌log2n┐ calculate
cj=Gj−2
(Gj−2
Step 3 (1 Stage)
Calculate
sj=pj⊕cj−1℄j0≦j<n,
and
cn−1=G0n−1+T0n−1c−1.
In the set of prefix trees of
In the above description, (Gjj, Tjj)=(gj, tj) and
(Gij,Tij)=(gj,tj)o(gj−1,tj−1)o . . . o(g1,ti) if j≦i,
where o is the fundamental carry operator. The carry cj for each bit position is then given by
cj=G0j+T0jc−1
where c−1 is the primary carry input. If there is no primary carry input, then cj is simply G0j.
The logic depth of an n-bit prefix tree adder configured as shown in
Although static circuits are used in the prefix-tree adder 300 of
The above-described illustrative embodiment of the invention maybe configured to meet the requirements of a variety of different circuit applications, and may be implemented in adder circuits using any desired value of n. Moreover, as previously noted, a variety of other types of adders, including non-radix-2 adders, may also be implemented using the techniques of the present invention.
Adders in accordance with the invention may be used as elements of many different types of circuits, such as, e.g., arithmetic logic units (ALUs), multiply-add units, and comparators. The invention can be incorporated in a wide variety of integrated circuits or other processing devices, including, e.g., microprocessors, digital signal processors (DSPs), microcontrollers, application-specific integrated circuits (ASICs), memory circuits, telecommunications hardware and other types of processing devices.
As a more particular example, the overflow flag generation techniques of the invention can be implemented in a straightforward manner in a DSP such as the DSP16000 from the Microelectronics Group of Lucent Technologies Inc., Allentown, Pa., as described in DSP16000 Digital Signal Processor Core, Information Manual, Lucent Technologies, July 1998, which is incorporated by reference herein. In the DSP 16000, two modules which can be configured to utilize the overflow flag generation techniques of the invention are the ADDSUB and ALU modules, both described in the above-cited Information Manual.
Numerous alternative embodiments of the present invention within the scope of the following claims will be readily apparent to those skilled in the art.
The present application is related to U.S. patent application Ser. No. 09/291,677 filed Apr. 14, 1999 in the name of inventors M. Besz et al. and entitled “Prefix Tree Adder with Efficient Carry Generation,” which issued Mar. 4, 2003 as U.S. Pat. No. 6,529,931, U.S. patent application Ser. No. 09/525,644 filed Mar. 15, 2000 in the name of inventors A. Goldovsky et al. and entitled “Prefix Tree Adder with Efficient Sum Generation,” which issued Mar. 25, 2003 as U.S. Pat. No. 6,539,413, and U.S. patent application Ser. No. 09/569,022 filed May 11, 2000 in the name of inventors A. Goldovsky et al. and entitled “Incorporation of Split-Adder Logic within a Carry-Skip Adder without Additional Propagation Delay,” which issued Jun. 24, 2003 as U.S. Pat. No. 6,584,484, all of which are incorporated by reference herein.
Number | Name | Date | Kind |
---|---|---|---|
4441158 | Kanuma | Apr 1984 | A |
4739503 | Armer et al. | Apr 1988 | A |
5010511 | Hartley et al. | Apr 1991 | A |
5257218 | Poon | Oct 1993 | A |
5260890 | Suzuki | Nov 1993 | A |
5270955 | Bosshart et al. | Dec 1993 | A |
5337269 | McMahan et al. | Aug 1994 | A |
5477480 | Inui | Dec 1995 | A |
5523963 | Hsieh et al. | Jun 1996 | A |
5539685 | Otaguro | Jul 1996 | A |
5581497 | Kumar | Dec 1996 | A |
5636156 | Mikan et al. | Jun 1997 | A |
5907498 | Rim | May 1999 | A |
6529931 | Besz et al. | Mar 2003 | B1 |
6539413 | Goldovsky et al. | Mar 2003 | B1 |
Number | Date | Country | |
---|---|---|---|
20020103842 A1 | Aug 2002 | US |