The present invention relates to a divider and method for implementing division of two numbers. More particularly, it relates to a divider that can be implemented with efficient use of logic cells. In addition, the invention can be adapted to carry out division to an arbitrary level of precision.
Digital dividers for carrying out division of two values, generally in binary form, are implemented in various manners for a wide range of applications, such as telecommunications equipment. For example, algorithms that implement frequency lock loops and receivers in telecommunications equipment use digital dividers for division of real values. These values are often represented as quantised integer values. One such representation scheme is known as “fixed point Q15”, which provides a 16 bit presentation of real-world signal values. In Q15, 0 (zero) bits are used to present the value on the left-hand side of the decimal point, fifteen bits are used to present values on the right-hand side of the decimal point, and one bit is used for the sign (+/−). Q15 presentation assumes that real signal values are presented in range of (−1,1), and the quantised values are in steps of 1/32767 (that is, ½15). The algorithms assume that all values that require division are decimal, in the range of −1 to 1. The arithmetic operation of division takes two input values in this range, one a dividend and the other a divisor, and produces a result or quotient. The result is represented in the same Q15 format. Any other level of quantisation can be implemented, where appropriate, although Q15 is a commonly used format.
Some examples of decimal values and the equivalent Q15 value (in decimal) are shown below.
The Q15 value is then converted to a decimal value for input into the divider. Some examples of division using Q15 format are as follows:
The Q15 representation is shown in decimal for clarity, however, a divider operates with Q15 represented numbers in a binary format.
The arithmetic operation of dividing two numbers presents a considerable problem in most DSPs or more generally, embedded system implementations. Most processing units do not have an instruction set to execute division. For example, the TI 5402 DSP, does not have a direct instruction for division. Division is implemented indirectly via the special subtraction instruction. Fast division (parallel implementation) requires many logic cells/gates involving large areas of silicon. That is why the “slow” series implementation algorithms, which use few logic gates, are very popular and mostly used in industry. These are cheaper and take up less room than parallel implementations, due to the lesser number of logic gates required.
For example, in the industry, a standard approach is commonly implemented in an ALTERA™ FPGA. ALTERA™ provides several design solutions to achieve division, however, these do not produce results in Q15 precision. The parallel implementation takes around 1200 logic cells and executes in one cycle. While obviously this is time efficient, the implementation requires a lot of silicon “real estate”. The serial divider proposed by ALTERA™ takes approximately 120 logic cells and executes in 32 cycles. Both solutions present only core division operation. Further logic, comprising approximately 100-200 logic cells, is required for interfacing, sign correction, saturation and presentation in Q15. For example, the serial implementation therefore needs a minimum of 350-400 logic cells, and executes in more than 32 cycles. While using significantly fewer gates than the parallel implementation, the execution time (in cycles) is much longer.
It is an object of the invention to provide a divider and/or division method that is capable of being implemented using fewer logic cells than a typical parallel divider implementation, but can execute the division operation more quickly than a typical serial divider implementation, or at least to provide an alternative to existing divider implementations. Additionally, the divider and/or method may be adapted to provide an arbitrary level of precision in the answer, and utilise input values of arbitrary precision.
In broad terms in one aspect the invention comprises a divider for dividing a dividend by a divisor including: a subtractor for subtracting the divisor from the dividend to produce a result, storage space with a preliminary answer, and a processor for revising the dividend and preliminary answer based on the result, wherein the divider is adapted to reiterate the subtraction and revision multiple times, based on a revised dividend and revised preliminary answer each iteration.
In broad terms in another aspect the invention comprises electronic hardware adapted to divide a dividend by a divisor including one or more electronic components for: a) subtracting the divisor from the dividend to produce a result, b) revising the dividend based on the result, c) revising a preliminary answer based on the result, d) repeating steps a)-c) until the preliminary answer reaches the desired precision to provide an answer to the division, and e) outputting or otherwise utilising the answer.
In broad terms in another aspect the invention comprises a method for dividing a dividend by a divisor including: subtracting the divisor from the dividend to produce a result, revising the dividend and a preliminary answer based on the result, and reiterating the subtraction and revision multiple times, based on a revised dividend and revised prelimi answer each iteration.
Preferred embodiments of the invention will be described with reference to the following Figures of which;
Referring to the drawings it should be appreciated that a divider and associated method according to the invention could be implemented in different forms using a range of hardware technologies for various applications. The following embodiments are given by way of example only. It should also be appreciated that binary computations and the manner in which they are implemented are known to those skilled in this area of technology, and therefore no detailed description is required.
The divider includes Q and Div shift registers 11, 12 for storing a dividend and divisor respectively, in binary form. The divider also includes a D shift register 18 for storing an intermediary or preliminary answer to the division in binary, and an output register 19 for storing the quotient, or final answer in binary. Preferably, these are 16-bit registers for storing binary numbers in Q15 form, although other sized registers can readily be implemented as appropriate, if an alternative precision is required. The Q, Div and D registers 11, 12, 18 are coupled to a clock signal 20 for clocking the input. The Q register 11 receives its input value from a 16-bit multiplexer 15 upon clocking. On the input of the multiplexer 15 are an initial input dividend value 23, a feedback value 14, and a control line 24 for selecting an initial input value (dividend) 23. The Div register 12 receives its input value directly from a divisor input line 22. The Q and Div registers 11, 12 are coupled to a subtractor 13 arranged to, upon loading, subtract the value in the Div register 12 from the value in the Q register 11. The subtractor 13 can be implemented in any suitable manner known to those skilled in the technology.
The subtractor 13 outputs a result and overflow bit, which indicates whether the divisor is bigger than the dividend. In particular, a logical high overflow bit indicates that the divisor is bigger than the dividend, while conversely, a logical low overflow bit indicates that the dividend is bigger than the divisor. The output 14 of the subtractor 13 (that is, the result of the subtraction) is fed back to an input of the 16-bit multiplexer 15 along with the overflow bit, which is fed back to an input via a control line 16. The overflow bit on the control line 16 controls selection and loading of the subtractor output 14 into the Q register 11, via the multiplexer 15. More particularly, when the overflow feedback control line 16 is high, indicating that an overflow took place during subtraction, this instructs the multiplexer 15 to select the output 14 for passing to the Q register 11. When the overflow line 16 is logical low, the subtractor output 14 on the input to the multiplexer 15 is unselected, and the Q register 11 is not loaded.
The overflow bit is also placed on a data line 17, which is coupled to the input of the least significant bit of the D register 18 via an inverter 25. The output line 26 from the shift overflow of the D register 18 is coupled to an input of the least significant bit of the Q register 11. The output 27 of the entire D register 18 is coupled to an input of the output register 19. A sign bit 28 from the dividend input line 23 is also coupled to an input of the output register 19.
A preferred method of conducting a division operation in accordance with the invention will be described with reference to
On the final clock cycle of the subtraction, the overflow bit is placed on the input line 17 to D register 19, and the complement formed by inverter 25. In the same cycle, the overflow bit is also placed on the feed back control line 16 to the multiplexer 15, and the result of the subtraction 14 is fed back to one input of the multiplexer 15. On the next clock cycle, several operations are carried out. The value in the D register 18 is shifted left one bit, effectively performing a multiplication by two. The inverted overflow bit is input to least significant bit of the just shifted D register 18 (effectively adding one to the shifted D register value), and the most significant bit, which is removed from the D register 18 by the shift operation, is placed on the input 26 to the least significant bit of the Q register 11. The D register then contains a preliminary answer to the division. The Q register 11 value is also revised or updated beginning with this clock cycle. More particularly, if no overflow took place during the subtraction, the overflow control line 13 is high and instructs the multiplexer 15 to select the input line 14 with the subtraction result, and pass this result through the output of the multiplexer 15 to the input of the Q register 11. If no overflow took place, the control line 16 is low, and prevents the fed back result 14 being loaded into the Q register 11. Rather, the present value in the Q register 11 is retained. On the next cycle, the value on the input to the Q register 11 is loaded in.
This completes one iteration of the division cycle, and the divider 10 is then set up to carry out another iteration, if required, beginning with left shifting the value in the Q register 11 and loading into the least significant bit, the most significant bit shifted out of the D register 18, as explained above. As mentioned previously, the D register 18 contains an intermediary answer to the division. This can be refined by further iterations of the division cycle, each one revising the preliminary answer in the D register 18, until an answer to the desired precision is obtained. So, for example, to get a Q15, 16-bit precision answer, the divider would carry out sixteen iterations of the method. Any arbitrary level of precision in the answer can be obtained by reiterating the cycle the required number of times. Once the desired precision is reached, the contents of the D register are loaded into the output register 19. The cycle counter 29 receives input indicating the required precision of the answer, and monitors the number of iterations that are carried out by the divider 10. Once the required number of iterations is reached, it instructs the D register 18 to load its current value into the output register 19, and instructs the divider 10 to cease. At this point, additional processing logic 31 is instructed to insert a decimal point at the required location in the output value 19, based on the number of division cycles carried out, and the precision of the input dividend 23 and divisor 22. The additional logic 31 design will be known to those skilled in this technology, and need not be described here. The sign bit 30 is also specified, based upon the sign of the input values 23, 22. The divider 10 enables input values of arbitrary precision to be processed by the divider to obtain a result of arbitrary precision. From here, the result in the output register 19 can be used as required.
The core divider 10 according to the invention can be implemented in approximately 110 logic cells and executes in 15 cycles. A complete solution can be implemented using 202 logic cells, which executes in 16 cycles, and provides full interface, sign correction, saturation, produces the desired result in case of division by zero, and produces the result in desired Q15 fixed point presentation. Obviously the complete solution could be altered if another presentation and precision other than Q15 is required. The divider 10 according to the invention can be ported in any other application where fixed point division is required. The input parameter defines the number of bits used for decimal portion of the value. If a larger accuracy, of for example Q32, is required, implementation can readily be achieved without a significant increase in logic cell requirements.
First of all, the dividend, 0110, is loaded into the Q register 11 and the divisor, 1001, is loaded into the Div register 12. Initially both the D and output registers 18, 19 contain all zeros. This is shown in step 32 of
Div is then subtracted from Q in the subtractor 13, to produce the result 0011 as shown in step 34. There is no overflow as Div is less than Q, so when D register 18 is left shifted, a “1” (inverse of overflow bit) is loaded into the least significant bit of D register 18. This is the same as asking whether the value in the Q register is greater than a value in the Div register so in this step the yes arrow is followed from question box 35 to box 36. Further, as there is no overflow, the result, 0011, is loaded into Q register 11, it is left shifted, and the most significant bit (MSB) of D register, 0, that was shifted out is loaded into the least significant bit (LSB) of Q register 11, resulting in the value 0110. No change takes place to the Div and 0 registers 12, 19. These steps are shown in step 38 of
Div is then subtracted from Q in the subtractor 13, to produce the result 0011 as shown in step 34 of
Again, Div is subtracted from Q to produce the result 0011. There is no overflow so when D register 18 is left shifted, a “1” is loaded into the least significant bit. The result, 0011, is loaded into Q register 11, is left shifted, and the most significant bit of D register, 0, is loaded into the least significant bit of Q register 11, resulting in the value 0110. No change takes place to the Div and 0 registers 12, 19 as shown by step 38 of
Div is subtracted from Q to produce the result 0011. There is overflow as Div is greater than Q, so when D register 18 is left shifted, a “0” (inverse of overflow bit) is loaded into the least significant bit of D register 18 as shown in step 39 of
The output register value when converted from Q4 format to decimal becomes 0.666, the answer to 0.4/0.6. As explained earlier, the division process can be carried out for more (or fewer) cycles if a different precision is required for the answer. Where more than four steps are carried out, the additional circuitry 31 (of
The divider can be implemented in any application where division functionality is required. For example, in telecommunications, it could be utilised in FM modulation/demodulation circuitry and/or in frequency estimation circuitry, such as that described in NZ524537 and/or NZ524369. It will be appreciated that the divider could also be implemented in many other technology fields.
The foregoing describes the invention including preferred forms thereof. Alterations and modifications as will be obvious to those skilled in the art are intended to be incorporated in the scope hereof as defined by the accompanying claims.
Number | Date | Country | Kind |
---|---|---|---|
524378 | Feb 2003 | NZ | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/NZ2004/000036 | 2/24/2004 | WO | 00 | 1/10/2008 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2004/075043 | 9/2/2004 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4594680 | Schomburg et al. | Jun 1986 | A |
4692891 | Yamaoka et al. | Sep 1987 | A |
4891730 | Saddow et al. | Jan 1990 | A |
4891780 | Miyoshi | Jan 1990 | A |
4992969 | Yamahata | Feb 1991 | A |
6115733 | Oberman et al. | Sep 2000 | A |
6282556 | Chehrazi et al. | Aug 2001 | B1 |
20030220958 | Aldridge et al. | Nov 2003 | A1 |
20040167956 | Vihriala | Aug 2004 | A1 |
Number | Date | Country |
---|---|---|
0351242 | Jan 1990 | EP |
WO 9519006 | Jul 1995 | WO |
WO 9745787 | Dec 1997 | WO |
Number | Date | Country | |
---|---|---|---|
20080140744 A1 | Jun 2008 | US |