The present invention relates to an arithmetic unit used for the processor of a computer, and in particular to the configuration of an arithmetic unit that performs calculations for squaring floating-point numbers and an arithmetic method therefor.
The squaring for scientific engineering calculations of values expressed as floating-point numbers is frequently performed using a computer, and the capability of a computer to perform the required calculations is greatly affected by the processing speed of the multiplier provided for the squaring of the floating-point numbers. For this reason, various devices have been devised to improve the processing speeds of multipliers used for squaring floating-point numbers.
An explanation will now be given for the square calculation of a floating-point number using an electronic circuit and a conventional method employed for improving calculation speed.
For the multiplication of a floating-point number, two processes are required: the multiplication of numerical values, and the rounding off of the product that is performed. Usually, the multiplication of numerical values is the process used for conventional devices designed to speed up squaring calculations performed for floating-point numbers.
First, an explanation will be given for the multiplication of eight-bit numbers represented by a (=a7, a6, a5, a4, a3, a2, a1 and a0) and b (=b7, b6, b5, b4, b3 b2, b1 and b0).
For a squaring calculation, two like numbers are multiplied, and when floating-point numbers are multiplied, the most significant bit (MSB) is always “1”. Therefore, the squaring multiplication of the number a, consisting of eight, eight bit numbers, in
aiai=ai (a)
aiaj=ajai (b)
are established.
In equation (a), since like terms are multiplied, an AND gate is not required.
In equation (b), since the product term aiaj corresponds to the product term ajai, it is therefore found that when these two product terms are added at the same position, they need only be collated to form a single product term in order to be inserted in a one level higher position.
Conventionally, there is a well known method for whereby a Wallace tree can be simplified by using the symmetry of the product terms in a squaring multiplier.
In
Then, since the product terms at position s1 are a1a0 and a0a1, equation (b) can be applied for these product terms, and therefore, the product term a1a0, obtained by collating the above product terms, is carried over and entered at one higher position, s2.
At position s2, there are three product terms, a2a0, a1a1 and a0a2. For these product terms, equation (a) can be applied for product term a1a1, and equation (b) can be applied for product terms a2a0 and a0a2. Therefore, at position s2, by applying equation (a) for a1a1, a1 is entered, and a2a0, obtained by applying equation (b) for the terms a2a0 and a0a2, is carried over and entered at position s3.
As a result, the 64 product terms in
For a binary adder for calculating the above product terms, a circuit technique, called a Carry Look Ahead (CLA), is available that uses a combinational circuit to generate a higher carry from a lower carry. This Carry Look Ahead circuit technique can reduce the delay resulting from the addition process performed by the adder.
Furthermore, as is described above, since when floating-point numbers are multiplied the number of effective input bits equals the number of effective output bits, a rounding off process is performed for the addition results obtained for the numerical values.
In
The above described calculation method and rounding off method used for floating-point numbers conform to standard IEEE (Institute of Electrical and Electronics Engineers) 754.
As is described above, various devices have been provided for increasing the processing speed of squaring multipliers for floating-point numbers. But even so, currently, in line with requests that the processing capabilities of computers be improved, even greater increases are being sought for squaring multipliers for floating-point numbers.
It is, therefore, one object of the present invention to provide a squaring multiplier for floating-point numbers for which the number of constituent arithmetic units is reduced by locally compressing the addition of the floating-point numbers (the addition of mantissas), and to provide increased processing speeds.
It is another object of the present invention to increase the processing speeds of squaring multipliers for floating-point numbers by performing in parallel the addition of floating-point numbers and the rounding off process performed for the addition results.
To achieve the above objects, a processor comprises: a register, for holding a predetermined binary variable; and an arithmetic unit, for reading a target variable from the register and for performing various arithmetic operations for the target variable, wherein the arithmetic unit includes a pseudo carry generator, for generating pseudo information concerning a carry in a number equivalent to the predetermined bit count in an arithmetic operation for the target variable, and a combinational circuit, for performing an arithmetic operation for the target variable by using the pseudo information concerning a carry that is generated by the pseudo carry generator.
The pseudo generation of information concerning a carry does not mean that a carry is obtained as a result of the actual numerical calculation, but means only that a look ahead operation is performed and a carry is generated by using the combinational circuit (pseudo carry generator).
For a target bit for a rounding off process during an arithmetic operation, the pseudo carry generator generates the pseudo information concerning a carry.
According to the present invention, a processor comprises: a register, for holding a predetermined binary variable; and an arithmetic unit, for reading a target variable from the register and for performing various arithmetic operations for the target variable, wherein the arithmetic unit includes a pseudo carry generator, for performing look ahead operations for generating carries in a number equivalent to a predetermined bit count in an arithmetic operation performed for the target variable, and a combinational circuit for using the results obtained by the pseudo carry generator to perform an arithmetic operation for the target variable.
According to the present invention, a processor comprises: a register, for holding a predetermined binary variable; and an arithmetic unit, for reading a target variable from the register and for performing various arithmetic operations for the target variable, wherein the arithmetic unit includes a pseudo carry generator, for generating information concerning a carry when a numerical value is calculated for a value equivalent in number to the lower predetermined bit count for the target variable, and a combinational circuit, for calculating a value for a higher bit while taking into account the information concerning the carry.
According to the present invention, a processor comprises: a register, for holding a predetermined binary variable; and an arithmetic unit, for reading a target variable from the register and for performing various arithmetic operations for the target variable, wherein the arithmetic unit includes a first combinational circuit, for obtaining, directly from a target variable, information concerning the location of a round-off bit that is used for a rounding off process performed in conjunction with an arithmetic operation performed for a variable, and a second combinational circuit, for performing the arithmetic operation for the target variable while performing the rounding off process using the information concerning the location of the round-off bit that is obtained by the first combinational circuit.
More specifically, the second combinational circuit performs the arithmetic operation beginning with the lowest digit of the target variable. Further, when the second combinational circuit obtains, from the first combinational circuit, the information concerning the location of the round-off bit, and progresses the calculation up to the location of the round-off bit, the second combinational circuit establishes the value of the round-off bit, and performs the calculation for a higher digit while taking into account the value of the round-off bit.
According to the present invention, a processor comprises: a register, for holding a predetermined binary variable; and an arithmetic unit, for reading a target variable from the register and for performing various arithmetic operations for the target variable, wherein the arithmetic unit includes an MSB look ahead circuit, for employing the target variable to establish, in a look ahead manner, the location of the most significant bit (MSB) of the operation results, and a combinational circuit, for performing a rounding off process based on the location of the most significant bit established by the MSB look ahead circuit.
According to the present invention, a processor comprises: a register, for holding a predetermined binary variable; and an arithmetic unit, for reading a target variable from the register and for performing squaring calculation for the target variable, wherein the arithmetic unit includes an MSB look ahead circuit, for comparing the target variable with the √{square root over (2)} and for employing the comparison results to establish the location of the most significant bit (MSB) of the results obtained for the operation, and a combinational circuit, for performing a rounding off process based on the location of the most significant bit that is established by the MSB look ahead circuit.
According to the present invention, an arithmetic unit that multiplies predetermined binary floating-point numbers comprises: means for reading a target floating-point number from a register for holding floating-point numbers; means for generating information concerning a carry, equivalent in number to a predetermined bit count, for the target floating-point number; and means for adding the mantissa of the target floating-point number while taking into account the information concerning a carry.
The means for generating the information concerning a carry generates information concerning a carry for a value equivalent in number to a predetermined lower bit count in the mantissa of the target floating-point number. The means for performing the addition adds the higher bits of the mantissa while taking into account the information concerning a carry.
According to the present invention, an arithmetic unit that multiples predetermined binary floating-point numbers comprises: means for reading a target floating-point number from a register for holding floating-point numbers; means for performing carry look ahead operations in a number equivalent to a predetermined bit count during the multiplication of the target floating-point number; and means for multiplying the target floating-point number by using the results obtained by the carry look ahead operations.
According to the invention, an arithmetic unit that multiplies predetermined binary floating-point numbers comprises: means for reading a target floating-point number from a register for holding floating-point numbers; means for obtaining, directly from the target floating-point number, information that is used for a rounding off process performed in conjunction with the multiplication of the target floating-point number; and means for adding a mantissa of the target floating-point number while performing the rounding off process using the thus obtained information.
According to the present invention, an arithmetic unit that multiplies predetermined binary floating-point numbers comprises: means for reading a target floating-point number from a register for holding floating-point numbers; and means for obtaining, directly from the mantissa of the target floating-point number, the location of the most significant bit (MSB) of the multiplication results obtained for the target floating-point number.
According to the invention, an arithmetic unit that performs squaring calculations for predetermined binary floating-point numbers comprises: means for reading a target floating-point number from a register for holding floating-point numbers; and means for comparing the mantissa of the target floating-point number with the √{square root over (2)}, and for, based on the comparison results, establishing the location of the most significant bit (MSB) of the operation results.
According to the invention, an arithmetic method, for an arithmetic unit that multiplies predetermined binary floating-point numbers, comprises the steps of: reading the target floating-point numbers from registers for holding floating-point numbers; generating, for a value equivalent in number to the predetermined lower bit count for the mantissas of the target floating-point numbers, information concerning a carry generated when a numerical value is calculated; and calculating the value of a higher bit while taking into account the information concerning the carry.
According to the present invention, an arithmetic method, for an arithmetic unit that multiplies predetermined binary floating-point numbers, comprises the steps of: reading target floating-point numbers from registers for holding floating-point numbers; calculating mantissas for the target floating-point numbers beginning with the lowest digits, and detecting the location of a round-off bit that is used for a rounding off process; establishing the value of the round-off bit when the calculation progresses are completed up to the location of the round-off bit; and calculating a higher digit while taking into account the value of the round-off bit.
The preferred embodiment of the present invention will now be described in detail while referring to the accompanying drawings.
In this invention, to increase the processing speed of a squaring multiplier for floating-point numbers, the following two methods are proposed:
In this embodiment, an arithmetic unit is provided that includes combinational circuits (a pseudo carry generator and an MSB look ahead circuit, which will be described later) that implement these methods.
The arithmetic unit 21 further includes: a pseudo carry generator 21a, which serves as a combinational circuit constituting arithmetic operation means for performing the squaring calculations for a floating-point number, and also as a combinational circuit constituting carry look ahead means for performing a look ahead operation during the addition of the lower bits of the floating-point number; and an MSB look ahead circuit 21b that serves as a combinational circuit constituting MSB look ahead means for performing a look ahead operation to establish the location of the MSB in the addition results of the floating-point number.
The squaring multiplier of the floating-point number for this invention is provided by especially specifying and optimizing the squaring calculation function of the arithmetic unit 21 in
The above described two methods for increasing the processing speed of the squaring multiplier for the floating-point number will now be described in detail.
(1) Method for performing a carry look ahead operation to increase the floating-point number addition speed
During the addition of floating-point numbers, the higher bits of mantissas are employed as effective bits, and the lower bits are used for the rounding off process. Therefore, regardless of the bit values, the only determinations that are required are those to determine whether a carry is generated and whether a bit value of 1 is present.
Thus, so long as, for the addition of numerical values, rather than having to perform actual calculations for a lower bit all that is necessary is for information for the bit (whether a carry is generated and whether a bit value of 1 is present) to be generated by a combinational circuit (the pseudo carry generator 21a), the squaring calculations for a floating-point number can be performed quickly using a simple circuit configuration.
For the constitution of the pseudo carry generator 21a, an explanation will now be given for a rounding off signal and the number of carry signals employed during the squaring calculations performed for a floating-point number.
For the example calculation in
<Position s0>
Since at position s0 a0 is the only term to be added, the addition result is a0 and no carry is generated, and the rounding off result is a0. Therefore, when the carry signal at this digit is defined as Carry0 and the rounding off signal is defined as Round0, the following equations are established.
s0=a0
Carry0=0
Round0=a0
<Position s1>
Since there is no term to be added at position s1, no carry is generated and the rounding off results are stored. Thus, when the carry signal for this digit is defined as Carry1 and the rounding off signal is defined as Round1, the following equations are established.
s1=0
Carry1=0
Round1=a0
<Position s2>
At position s2, two terms a1 and a1a0 are added, and only Carry1 from the lower position need be added to the addition results. Since Carry1 is always 0 based on the above studies for positions s0 and s1, the relationship between the bit pattern and the output of a1a0 conforms to the following truth table.
Since the total of the number of effective terms is equal to or smaller than 2, only one carry signal is generated, and the rounding off result is updated. Therefore, when the carry signal for this digit is defined as Carry2 and the rounding off signal is defined as Round2, the following equations are established.
Carry1=a1a0
Round2=Round1+a1*−a0=a0+a1
<Position s3>
The term a2a0 is added at position s3. The truth table to obtain carry signal Carry3 and rounding off signal Round3 is as follows:
Thus, the following equations are established for the carry signal Carry3 and the rounding off signal Round3 for this digit.
Round3=Round2+−a2*a1*a0+a2*−a1*a0=a0+a1
Carry3=a2*a1*a0
<Position s4>
The terms a2, a3a0 and a2a1 are added, and the truth table to obtain carry signal Carry4 and rounding off signal Round4 is as follows:
Thus, the truth table for Round4 and Carry4 is as follows:
Therefore, the following equations are established for the carry signal Carry4 and the rounding off signal Round4 for this digit.
<Position s5>
Terms a4a0 and a3a1 are added at position s5, and the truth table for obtaining carry signal Carry5 and rounding off signal Round5 is as follows:
Thus, the truth table for Round5 and Carry5 is as follows:
In this case, the pseudo carry generator for Carry5 is generated.
In the above truth table, the terms for which the value of Carry5 is set to “1” or “2” are collected as follows:
From this table, four prime implicants of Carry5 are found: a3a2a1, a4a2a1a0, a4a3a2a0 and a4a3a1a0. To find these prime implicants, Quin-McCluskey's method, which is a well known logical compression method, can be employed. The following relationship is obtained between three of these prime implicants and Carry5.
pt0=a3a2a1
pt1=a4a2a1a0
pt2=a4a3a2a0
pt3=a4a3a1a0
As is apparent from (i), to generate two pseudo carries, pt0, pt1, pt2 and pt3 need only be separated into two groups. For this, arbitrary grouping may be employed, yielding the equations
Carry5a=pt0=a3a2a1 and
Carry5b=pt1+pt2+pt3=a4a0(a3a2+a2a1+a1a3),
for example. Further, Round5 can be generated using the following logical equation.
Round5=a2+a1+a0
These three equations can be employed as proxies for the logic up to position s5 of the squaring calculation circuit.
Position s6>
At position s6, terms a3, a5a0, a4a1 and a3a2 are added, and the truth table for the total number of terms is as follows:
When this truth table is arranged for Carry6 and Round6, the following table is obtained.
This table is further rearranged for Carry6, and the following table is obtained.
When the above table is logically compressed, the prime implicants of Carry6 can be obtained as follows:
The contribution to the output made by each prime implicant is as follows:
When only the prime implicants are considered in the above table, by referring to (vi), (i) and (iii) belong to the same group, while by referring to (vii), (i) and (iii) belong to different groups, so that these groups contradict each other. In order to remove this contradiction, a new term must be created such that the prime implicant is set (the value becomes 1) at (vii), and is not set (the value does not become 1) at (vi). Therefore, a new term, (iii)′=a5a4a3a0, is prepared, wherein (iii)′ is a partial term. At this time, there are three carry signals, as follows:
Carry6a=(i)+(iii)+(iv)+(v)
Carry6b=(ii)
Carry6c=(iii)′
Further, rounding off signal Round6 can be generated using the following logical equation.
Round6=a3+a2+a1+a0
As is described above, there is one case wherein a pseudo carry can not be generated merely only by referring to the inclusive relationship. In this case, a new term, such as (iii)′, is prepared for performing a logical calculation. This new term is not a prime implicant but is a partial term of a specific term.
<Position s7>
At position s7, terms a6a0, a5a1 and a4a2 are added. The truth table for the total number of terms is as follows:
When this truth table is arranged for Carry7 and Round7, the following table is obtained.
When the above table is further arranged for Carry7, the following table is obtained.
When this truth table is logically compressed, the following prime implicants for Carry7 are obtained.
The number of carry signals, and a rounding off signal will now be studied for each of the digits s0 to s7 in
As is shown in
Then, a process is performed to match the number of carries with the number of groups of prime implicants that are set (step 206). When one pseudo carry is obtained after the process at step 205 has been completed, the value of the pseudo carry is uniquely determined. But when two pseudo carries are generated, generally a plurality of values, as pseudo carries, are present and if more than three pseudo carries are present, the values of the pseudo carries tend not to be determined and specific sorting is required. For example, for the digits s0 to s7, the number of carries at s0 and s1 is 0, and this case is not applied for the process. Since the number of carries for s2 and s3 is 1, the value of the pseudo carry is uniquely determined. While when the number of carries for s4 is 2, the number of prime implicants matches the number of carries, and the values of the pseudo carries are uniquely determined. The number of carries for s5 is 2, and a plurality of values are available for the pseudo carry. The number of carries for s6 and s7 is 3, and the values of the pseudo carries can not be determined merely by using the prime implicants. In this case, since there are a plurality of values available for the pseudo carries, at step 206 the number of the pluralities of groups of prime implicants that are set is matched by the number of carries, and two pseudo carries are generated.
When the value of a pseudo carry can not be determined during the process performed from step 202 to step 206, an appropriate prime implicant is separated into partial terms, program control returns to step 202 to repeat the following process, and the value of the pseudo carry is determined (steps 207 and 208). In the above example, since the case for s6 is pertinent, the prime implicant is separated into partial terms, and the values of three pseudo carries are determined.
Pseudo carries can be prepared in the above manner. However, as is described above, for bits at position s6 or higher the value available for a pseudo carry is increased and the process required to uniquely determine this value becomes complicated, and it is therefore not realistic to perform a carry look ahead operation. Thus, in this embodiment, when two pseudo carries that are generated at position s5 are defined as f0=Carry5a and f1=Carry5b and the rounding is defined as r5, these carries and the rounding are substituted into the original equations, so that the squaring calculation for the floating-point number is as shown in
As is shown in
When the calculation in
The actual squaring multiplier is so designed by a CAD that it writes an arithmetic expression in
In this embodiment, the squaring calculation for floating-point numbers is employed. However, the method for performing a carry look ahead operation and increasing the speed at which the floating-point numbers are added can also be employed for another arithmetic operation. That is, according to the method of the invention, when multiple bits are to be added, such as the addition of product terms for multiplication, and when, as a result of the rounding off process, only determinations as to whether a carry is generated and whether a bit of 1 is present are required for several lower bits, regardless of the values of these bits, only information concerning the carry look ahead operation is required and the numerical calculation is eliminated. Therefore, the present invention can be applied for not only squaring calculations, but also for other calculations, such as the multiplication of floating-point numbers, for which the same conditions apply.
In the squaring calculation in this embodiment, i.e., in the squaring calculation for an eight bit floating-point number, the product terms to be added at the higher positions s15, s14, s13 and s12 are constituted only by a6, a5 and a4. Therefore, the calculation of this portion can be simplified.
Since a maximum of two carries are generated at position s11, the following truth table for s12 and Carry12 is obtained while the two carries are defined as Carry11.
The truth table for s13 and Carry13 is obtained as follows by using the obtained Carry12.
Further, by using the obtained Carry13, the truth table for s14 and s15 (=Carry14) is obtained as follows:
When s6, s5, s4 and Carry11 are determined by referring to these relationships, all the pseudo carries at positions higher than s12 can be determined. Further, s14 and s15 can be determined by using the pseudo carries. For example, S14 can be obtained by using the following calculation.
S14≦‘1’ when S(6 downto 5)=“00” or S(6 downto 5)=“11”
When Carry11 is determined in the above manner, S12, S13, S14 and S15 can be established by using a two-gate delay. This is faster than when an adder is located in this portion and these bits are determined beginning with the lowest. However, since the higher bit is a portion along the declining slope of the delay of the Wallace tree, various circuit configurations can be employed. For example, the same improvement in the processing speed can be produced by using a two-step carry skip adder.
When, so far as gate delays are concerned, the results provided by the multiplier of this embodiment up until the output for the Wallace tree at s6 is established are compared with those provided by a conventional multiplier, it is found that a seven-gate delay is provided when the ordinary multiplier it is employed for a squaring calculation, and that even when, as is shown in
It is logically possible that the pseudo carry generator 21a can be prepared for a higher digit. However, six terms are required for Carry6 to constitute the pseudo carry generator using a combinational circuit, or ten terms are required for Carry10, and as number of digits is increased, the advantage of the pseudo carry generator, i.e., the high speed processing, is gradually reduced. In addition, since not only the prime implicant but also a partial term is required to generate a pseudo carry equal to or higher than Carry7, the calculation time is further increased.
When there are many carries, the method employed by an ordinary arithmetic circuit should be used to handle carries, so as to reduce both circuit size and circuit delays. That is, for higher digits, the results provided by the optimized pseudo carry generator matches those provided by an arithmetic circuit that generates a true carry, and no significant advantage is conveyed by the use of a pseudo carry.
(2) Method for performing a look ahead operation and establishing the MSB in results obtained by adding floating-point numbers, and for performing the rounding off process for the addition results in parallel to the addition
During the multiplication of floating-point numbers, a rounding off process is performed for the addition results obtained for mantissas in order to equalize the number of effective bits for input and the number of effective bits for output. In the rounding off process, the location of the MSB of “1” in the addition results must be established to determine, as a round-off bit the digit in the addition results of the floating-point numbers. Therefore, generally, the rounding off process is performed after the floating-point number addition has been completed.
So long as an appropriate combinational circuit (the MSB look ahead circuit 21b) is employed perform a look ahead operation and establish the location of the MSB which has a value of “1”, the location of the round-off bit can also be established based on this location. Therefore, the rounding off process can be performed in parallel with to the addition of the floating-point numbers, and the squaring multiplication of the floating-point number can be performed faster.
As preparation of the explanation of the method of the invention for looking ahead and establishing the location of the MSB, an explanation will now be given for the general method used for performing the rounding off process after the addition of floating-point numbers is completed.
As previously described for the background art, the multiplication of the floating-point numbers, including the ordinary rounding off process based on standard IEEE 754, is performed as is shown in the flowchart in
The squaring calculation performed for the floating-point numbers in
RoundBit=‘0’when(“1000111000111000111001”=“0000000000000000000000”) else ‘1’
Therefore, for the addition result in
RoundBit=‘1’.
When the 23rd bit from the lowest, i.e., the guard bit, has a value of “1”, and when the round-off bit or the 24th bit from the lowest, i.e., ulp (Unit of Least Precision) has a value of “1”, according to standard IEEE 754 the addition result obtained for the ulp bit value and “1” is defined as the rounding off process result (see step 905). For the other cases, the values from the MSB to the 24th bit are defined as the rounding off process result. Since in the addition result in
111000111000111000111001+1=111000111000111000111010.
Furthermore, when a carry is generated as a result of the rounding off process and the MSB is shifted, a value of 1 is added to the exponential (see step 906). However, since in the example in
In this manner, the location of the MSB is detected based on the addition results, and the rounding off is performed along the succeeding process sequence.
A method for performing the look ahead operation for the location of the MSB of “1” will now be described.
During the multiplication of floating-point numbers, the location of the MSB varies depending on whether the multiplication result is equal to or greater than 2. When it is known in advance that the multiplication result is equal to or greater than 2, the look ahead operation can be performed to locate and detect the MSB. Therefore, during the squaring calculation, whether the calculation result is equal to or greater than 2 can be determined by comparing it with the √{square root over (2)} (=21/2) the mantissa of the floating-point number to be squared, and the location of the MSB in the calculation results can be established.
A specific explanation will now be given for a 32 bit single precision type and a 64 bit double precision type that are defined by standard IEEE 754.
The value of the √{square root over (2)} for the single precision type is
√{square root over (2)}=1.0110 1010 0000 1001 1110 011,
and the square thereof is
1.1111 1111 1111 1111 1111 111.
Therefore, when the original number to be calculated is equal to or smaller than the √{square root over (2)}, the square thereof is equal to or smaller than 2.
Similarly, the value of the √{square root over (2)} for the double precision type is
√{square root over (2)}=1.0110 1010 0000 1001 1110 0110 0110 0111 1111 0011 1010 0010 0000 1,
and the square thereof is
1.1111 1111 1111 1111 1111 1111 1111 1111 1111 1111 1111 1111 1111 1.
Therefore, when the original number to be calculated is equal to or smaller than the √{square root over (2)}, the square thereof is equal to or smaller than 2.
As is described above, when both for single precision and double precision the mantissa of the floating-point number to be multiplied is compared with the √{square root over (2)}, the location of the MSB in the multiplication result can be established. Based on the location of the MSB, the exponential, the ulp, the guard bit and the round-off bit in the multiplication results are established.
Assume that the look ahead operation for the MSB is performed for the squaring calculation of 1.01010101010101010101011, which is the above described binary expression for numerical value 4/3.
When the above described single precision √{square root over (2)} having the value 1.0110 1010 0000 1001 1110 011 is compared with 1.01010101010101010101011 (=4/3), the following expression is established.
4/3<√{square root over (2)}
Therefore, without performing the addition of the mantissa, it is determined that (4/3)2 is smaller than 2, and the look ahead operation can be performed and the location of the MSB in the multiplication result established.
When the MSB look ahead operation is employed for the squaring calculation of the floating-point number, the rounding off process is performed in parallel with the addition of the mantissa.
That is, during the addition of the mantissas, the product terms are generated for the individual terms of the mantissas and the Wallace tree is employed for the obtained product terms, and the binary adder performs the addition. During this process, the MSB look ahead circuit 21b compares the mantissa with the √{square root over (2)}, and employs the comparison result to establish the location of the MSB, as well as the locations of the ulp, the guard bit and the round-off bit.
When the addition of the mantissas has progressed up to the digit of the round-off bit that is established by the MSB look ahead circuit 21b and the bit value is determined, the round-off bit is established.
Sequentially the addition employed to obtain a bit higher than the mantissa is performed. Since the round-off bit has already been determined, the rounding off process is terminated at the same time as the addition of the mantissas is terminated.
The circuit configuration of the MSB look ahead circuit 21b will now be described by using the squaring multiplier in FIG. 5.
Assuming A=[1 a6 a5 a4 a3 a2 a1 a0], when A>181,
A×A=1XXXXXXXXXXXXXXX,
and when A<181,
A×A=01XXXXXXXXXXXXXX,
so that the location of the MSB is shifted. Since the variable range of A is 255>A>128, when A is defined as a floating-point number, only whether [a6 a5 a4 a3 a2 a1 a0]>53 is established need be determined, so that the location of the MSB can be established.
In the squaring multiplier in
When [a6 a5 a4 a3 a2 a1 a0]>53 is established, the effective numbers are [s15 s14 s13 s12 s11 s10 s9 s8], the guard bit is s7, and the round-off bit is the OR of s6 and s0. At this time, when
p1=s7 & (s8+(s6+r5))
is added at the digit of s8, the rounding off process is initiated. This process is performed by a combinational circuit 502, which is the rounding off process means of the squaring multiplier in
Further, when [a6 a5 a4 a3 a2 a1 a0]≦53, the effective numbers are [s14 s13 s12 s11 s10 s9 s8 s7], the guard bit is s6 and the round-off bit is the OR of s5 and s0. At this time, when
p0=s6 & (s7+r5)
is added to the digit of s7, the rounding off process is initiated. This process is performed by a combinational circuit 503, which is the rounding off process means of the squaring multiplier in
The above described determination as to whether [a6 a5 a4 a3 a2 a1 a0]>53 is established can be performed satisfactorily quickly, and the calculations for the rounding off process can be performed while the squaring multiplier in
As is described above, according to the present invention, the rounding off process can be hidden by the addition of the mantissas of the floating-point numbers. That is, since the rounding off process is terminated at the same time as the addition at step 901 in
In addition, in this embodiment, while the MSB look ahead circuit 21b is included in the arithmetic unit 21, the adder for performing the rounding off process based on the addition results obtained for the mantissas of the floating-point numbers is not required. As a whole, therefore, the number of gates is reduced, and accordingly, the circuit size is also reduced.
As is described above, according to the invention, since the addition of the floating-point numbers (addition of the mantissas) is logically compressed, the number of arithmetic units that constitute the squaring multiplier for the floating-point numbers can be reduced, and the processing speed can be increased.
Further, according to the invention, since the addition of the floating-point numbers and the rounding off process for the addition results are performed in parallel, the processing speed of the squaring multiplier for the floating-point numbers can be increased.
Number | Date | Country | Kind |
---|---|---|---|
2001-168737 | Jun 2001 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
RE35365 | Colavin | Oct 1996 | E |
6018758 | Griesbach et al. | Jan 2000 | A |
6301598 | Dierke et al. | Oct 2001 | B1 |
6393453 | Purcell | May 2002 | B1 |
6766346 | Amer | Jul 2004 | B2 |
20010018699 | Amer | Aug 2001 | A1 |
Number | Date | Country | |
---|---|---|---|
20030126177 A1 | Jul 2003 | US |