Method for reducing round-off error in fixed-point arithmetic

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

The invention will be more fully understood by reference to the following detailed description of the invention in conjunction with FIG. 1, which illustrates a method for reducing round-off error in fixed-point arithmetic, according to the presently disclosed invention.

DETAILED DESCRIPTION OF THE INVENTION

When multiplying two numbers A and B in a fixed-point process, existing techniques select the number with the smaller magnitude and scale it up as much as possible. For example, if |A|>|B|, the existing technique would scale up the magnitude of B. To do this, the largest integer l is identified, such that

b=2
^l
·B+ε
₂

where l is a scaling factor, b is the rounded integer after scaling,

$- \frac{1}{2} \leq ɛ_{2} \leq \frac{1}{2},$

and most importantly b and A*b are in the range of └−2^N-1,2^N-1−1┘ (i.e., there is no underflow or overflow). Then,

$A \cdot B = A \cdot \frac{b - ɛ_{2}}{2^{l}} = \frac{A \cdot b}{2^{l}} - \frac{A \cdot ɛ_{2}}{2^{l}}$

whereby the rounding error is

$\frac{A \cdot ɛ_{2}}{2^{l}} .$

The disadvantage of this known technique is that the scaling range is very limited, especially when one of the multipliers has a large magnitude, which leads to large rounding errors.

In contrast, the presently disclosed technique, or process, provides an optimal setting for scaling factors such that the rounding error is minimized. An analytic formula which minimizes the rounding error is now illustrated. Assume a and b are scaled values of A and B, i.e.,

a=2^k·A+ε₁, and

b=2^l·B+ε₂

where k,l are scaling factors (when k>0, it is scaling up; when k<0, it is scaling down), k+l is fixed, a,b are rounded integers after scaling, and

$- \frac{1}{2} \leq ɛ_{1}, ɛ_{2} \leq \frac{1}{2} .$

First,

$A \cdot B = \frac{a - ɛ_{1}}{2^{k}} \cdot \frac{b - ɛ_{2}}{2^{l}}$

$A \cdot B = \frac{a \cdot b}{2^{k + l}} + \frac{- a ɛ_{2} - b ɛ_{1} + ɛ_{1} ɛ_{2}}{2^{k + I}}$

whereby the rounding error is

$\frac{- a ɛ_{2} - b ɛ_{1} + ɛ_{1} ɛ_{2}}{2^{k + I}},$

which is approximately equal to

$\frac{- a ɛ_{2} - b ɛ_{1}}{2^{k + 1}} .$

Since k+l is fixed, rounding error is minimized by minimizing −aε₂−bε₁. As

$\langle - a ɛ_{2} - b ɛ_{1} \rangle \leq \frac{\langle a \rangle + \langle b \rangle}{2}$

and a·b is in the fixed range of 2^k+lA·B, it follows that when |a| and |b| are closer in value, then

$\frac{\langle a \rangle + \langle b \rangle}{2}$

gets smaller, and so does |−aε₂−bε₁|.

Thus, one would choose k and l, such that after scaling, the scaled values |a| and |b| are in the same range └2ⁿ,2ⁿ⁺¹), for some integer n, then the rounding error from a·b is minimized.

With the above theoretical analysis, to derive appropriate settings which minimize the rounding error, an initial scaling factor pair (k₀,l₀) is defined, such that:

1. The input value is scaled up as much as possible (up to the boundary of overflow/underflow); and

2. After scaling, the two values |2^k⁰·A| and |2^l⁰·B| are as close to each other as possible.

Next, the scaling factor pair is finely tuned via increasing or decreasing each value by one such that:

1. There is no overflow/underflow;

2. The value is scaled up as much as possible; and

3. The rounding error is minimized.

The rounding error is computed by directly computing (either addition or multiplication) the values with and without scaling. Usually after three or four fine tunings, the appropriate settings which minimize the rounding error will be derived, i.e., when the rounding error could not be further reduced.

After all appropriate settings which minimize the rounding error for each multiplication are derived, the final output will be normalized to cancel out all scaling factors.

The foregoing method for reducing round-off error in fixed-point arithmetic can be implemented by a wide variety of computing hardware and software, including specially programmed general purpose computing systems, custom-designed computing hardware including application specific integrated circuits (ASICs), etc.

These and other embodiments of the invention illustrated above are intended by way of example and should not be viewed as limiting the scope of the disclosure or of the claims. The actual scope of the invention is to be limited solely by the scope and spirit of the following claims.

Method for reducing round-off error in fixed-point arithmetic

Information

Publication Number

Date Filed

Date Published

Inventors

CPC

US Classifications

International Classifications

Abstract

Description

Claims