1. Field of the Invention
The present invention relates to performing arithmetic operations on interval operands within a computer system. More specifically, the present invention relates to a method and an apparatus for finding the roots of a nonlinear equation of one variable using the interval version of Newton's method.
2. Related Art
Rapid advances in computing technology make it possible to perform trillions of computational operations each second. This tremendous computational speed makes it practical to perform computationally intensive tasks as diverse as predicting the weather and optimizing the design of an aircraft engine. Such computational tasks are typically performed using machine-representable floating-point numbers to approximate values of real numbers. (For example, see the Institute of Electrical and Electronics Engineers (IEEE) standard 754 for binary floating-point numbers.)
In spite of their limitations, floating-point numbers are generally used to perform most computational tasks.
One limitation is that machine-representable floating-point numbers have a fixed-size word length, which limits their accuracy. Note that a floating-point number is typically encoded using a 32, 64 or 128-bit binary number, which means that there are only 232, 264 or 2128 possible symbols that can be used to specify a floating-point number. Hence, most real number values can only be approximated with a corresponding floating-point number. This creates estimation errors that can be magnified through even a few computations, thereby adversely affecting the accuracy of a computation.
A related limitation is that floating-point numbers contain no information about their accuracy. Most measured data values include some amount of error that arises from the measurement process itself. This error can often be quantified as an accuracy parameter, which can subsequently be used to determine the accuracy of a computation. However, floating-point numbers are not designed to keep track of accuracy information, whether from input data measurement errors or machine rounding errors. Hence, it is not possible to determine the accuracy of a computation by merely examining the floating-point number that results from the computation.
Interval arithmetic has been developed to solve the above-described problems. Interval arithmetic represents numbers as intervals specified by a first (left) endpoint and a second (right) endpoint. For example, the interval [a, b], where a<b, is a closed, bounded subset of the real numbers, R, which includes a and b as well as all real numbers between a and b. Arithmetic operations on interval operands (interval arithmetic) are defined so that interval results always contain the entire set of possible values. The result is a mathematical system for rigorously bounding numerical errors from all sources, including measurement data errors, machine rounding errors and their interactions. (Note that the first endpoint normally contains the “infimum”, which is the largest number that is less than or equal to each of a given set of real numbers. Similarly, the second endpoint normally contains the “supremum”, which is the smallest number that is greater than or equal to each of the given set of real numbers.)
One commonly performed computational operation is to find the roots of a nonlinear equation using Newton's method. The interval version of Newton's method works in the following manner. From the mean value theorem,
ƒ(x)−ƒ(x*)=(x−x*)ƒ′(ξ),
where ξ is some generally unknown point between x and x*. If x* is a zero of ƒ, then ƒ(x*)=0 and, from the previous equation,
x*=x−ƒ(x)/ƒ′(ξ).
Let X be an interval containing both x and x*. Since ξ is between x and x*, it follows that ξ∈X Moreover, from basic properties of interval analysis it follows that ƒ′(ξ) ∈ƒ′(X). Hence, x * ∈N(x,X) where
N(x,X)=x−ƒ(x)/ƒ′(X).
Temporarily assume 0 ∉ƒ′(X) so that N(x,X) is a finite interval. Since any zero of ƒ in X is also in N(x,X), the zero is in the intersection X ∩N(x,X). Using this fact, we define an algorithm for finding zero x*. Let X0 be an interval containing x*. For n=0, 1, 2, . . . , define
Xn=m(Xn)
N(xn,Xn)=xn−ƒ(xn)/ƒ′(Xn)
Xn=1=Xn∩N(xn,Xn),
wherein m(X) is the midpoint of the interval X. We call xn the point of expansion for the Newton method. It is not necessary to choose xn to be the midpoint of Xn. The only requirement is that xn∈Xn to assure that x*∈N(xn,Xn). However, it is convenient and efficient to choose xn=m(Xn).
Roots of an interval equation can be intervals rather than points when the equation contains non-degenerate interval constants or parameters. Suppose the interval version of Newton's method to find the roots of a single nonlinear equation has not yet satisfied the user-specified convergence tolerances. Then it is difficult to distinguish between the following three situations:
What is needed is a method and an apparatus for terminating the interval version of Newton's root finding method before iterations lose their practical value in isolating meaningfully distinct interval roots.
One embodiment of the present invention provides a system for finding zeros of a function, ƒ within an interval, X, using the interval version of Newton's method. The system operates by receiving a representation of the interval X. This representation including a first floating-point number, a, representing the left endpoint of X, and a second floating-point number, b, representing the right endpoint of X. Next, the system performs an interval Newton step on X, wherein the point of expansion is the midpoint, x, of the interval X. Note that performing the interval Newton step involves evaluating ƒ(x) to produce an interval result ƒI(x). If ƒI(x) contains zero, the system evaluates ƒ(a) to produce an interval result ƒI(a). It also evaluates ƒ(b) to produce an interval result ƒI(b). The system then evaluates a termination condition for the processing of the current interval X, wherein the termination condition is TRUE if a number of sub-conditions are satisfied, including if ƒI(a) contains zero and if ƒI(b) contains zero. If the termination condition is TRUE, the system terminates the processing of the current interval X, and records X as a final bound.
In one embodiment of the present invention, if ƒI(a) does not contain zero, evaluating ƒ(a) additionally involves performing an interval Newton step wherein the point of expansion is a.
In one embodiment of the present invention, if ƒI(b) does not contain zero, evaluating ƒ(b) additionally involves performing an interval Newton step wherein the point of expansion is b.
In one embodiment of the present invention, if ƒI(x) contains zero and if ƒ′(X) contains zero, the termination condition for processing the current interval X is TRUE if ƒI(a) contains zero, ƒI(b) contains zero, ƒI(x1) contains zero and if ƒI(x2) contains zero. Note that x1 is the midpoint between a and x; and x2 is the midpoint between x and b.
In one embodiment of the present invention, if ƒI(x) contains zero and if ƒ′(X) contains zero, and if either ƒI(x1) or ƒI(x2) does not contain zero, the system additionally splits the interval X in half and applies the interval Newton method to each half separately.
In one embodiment of the present invention, in the case where ƒI(x) does not contain zero and if ƒ′(X) contains zero, if the width of the interval X divided by the magnitude of the interval X is less than a first threshold value, and the magnitude of ƒ(X) is less than a second threshold value, the method further comprises terminating the processing of the current interval X, and recording X as a final bound.
In one embodiment of the present invention, if a given Newton step does not reduce the width of an interval by at least half, the system additionally splits the interval in half and applies the interval Newton method to each half separately.
In one embodiment of the present invention, if an interval Newton step results in two intervals, the system additionally applies the interval Newton method to each of the two intervals separately.
In one embodiment of the present invention, if the result of an interval Newton step is the empty interval, the system returns to process another interval.
In one embodiment of the present invention, if ƒI(a) contains zero and if ƒI(b) contains zero, the system returns to process another interval.
In one embodiment of the present invention, if ƒI(a) does not contain zero or if ƒI(b) does not contain zero, the system performs an interval Newton step wherein the point of expansion is the midpoint of the interval X. If the result of this interval Newton step is the empty interval, the system returns to process another interval.
In one embodiment of the present invention, the system terminates the processing of the current interval X after a predetermined number of iterations.
The following description is presented to enable any person skilled in the art to make and use the invention, and is provided in the context of a particular application and its requirements. Various modifications to the disclosed embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the present invention. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
The data structures and code described in this detailed description are typically stored on a computer readable storage medium, which may be any device or medium that can store code and/or data for use by a computer system. This includes, but is not limited to, magnetic and optical storage devices such as disk drives, magnetic tape, CDs (compact discs) and DVDs (digital versatile discs or digital video discs), and computer instruction signals embodied in a transmission medium (with or without a carrier wave upon which the signals are modulated). For example, the transmission medium may include a communications network, such as the Internet.
Computer System
Processor 102 can include any type of processor, including, but not limited to, a microprocessor, a mainframe computer, a digital signal processor, a personal organizer, a device controller and a computational engine within an appliance. Processor 102 includes an arithmetic unit 104, which is capable of performing computational operations using floating-point numbers.
Processor 102 communicates with storage device 108 through bridge 106 and peripheral bus 110. Storage device 108 can include any type of non-volatile storage device that can be coupled to a computer system. This includes, but is not limited to, magnetic, optical, and magneto-optical storage devices, as well as storage devices based on flash memory and/or battery-backed up memory.
Processor 102 communicates with memory 112 through bridge 106. Memory 112 can include any type of memory that can store code and data for execution by processor 102. As illustrated in
Note that although the present invention is described in the context of computer system 100 illustrated in
Compiling and Using Interval Code
Processor 102 also retrieves interval values 115 from memory 112 and passes these interval values 115 through arithmetic unit 104 to produce results 212. Results 212 can also include interval values.
Note that the term “compilation” as used in this specification is to be construed broadly to include pre-compilation and just-in-time compilation, as well as use of an interpreter that interprets instructions at run-time. Hence, the term “compiler” as used in the specification and the claims refers to pre-compilers, just-in-time compilers and interpreters.
Arithmetic Unit for Intervals
In the embodiment illustrated in
Note that arithmetic unit 104 includes circuitry for performing the interval operations that are outlined in FIG. 5. This circuitry enables the interval operations to be performed efficiently.
However, note that the present invention can also be applied to computing devices that do not include special-purpose hardware for performing interval operations. In such computing devices, compiler 204 converts interval operations into a executable code that can be executed using standard computational hardware that is not specially designed for interval operations.
Interval Operations
The interval X is a closed subset of the extended (including −∞ and +∞) real numbers R* (see line 1 of FIG. 5). Similarly the interval Y also has two endpoints and is a closed subset of the extended real numbers R* (see line 2 of FIG. 5).
Note that an interval is a point or degenerate interval if X=[x, x]. Also note that the left endpoint of an interior interval is always less than or equal to the right endpoint. The set of extended real numbers, R* is the set of real numbers, R, extended with the two ideal points negative infinity and positive infinity:
R*=R ∪{−∞}∪{+∞}.
In the equations that appear in
The addition operation X+Y adds the left endpoint of X to the left endpoint of Y and rounds down to the nearest floating-point number to produce a resulting left endpoint, and adds the right endpoint of X to the right endpoint of Y and rounds up to the nearest floating-point number to produce a resulting right endpoint.
Similarly, the subtraction operation X−Y subtracts the right endpoint of Y from the left endpoint of X and rounds down to produce a resulting left endpoint, and subtracts the left endpoint of Y from the right endpoint of X and rounds up to produce a resulting right endpoint.
The multiplication operation selects the minimum value of four different terms (rounded down) to produce the resulting left endpoint. These terms are: the left endpoint of X multiplied by the left endpoint of Y; the left endpoint of X multiplied by the right endpoint of Y; the right endpoint of X multiplied by the left endpoint of Y; and the right endpoint of X multiplied by the right endpoint of Y. This multiplication operation additionally selects the maximum of the same four terms (rounded up) to produce the resulting right endpoint.
Similarly, the division operation selects the minimum of four different terms (rounded down) to produce the resulting left endpoint. These terms are: the left endpoint of X divided by the left endpoint of Y; the left endpoint of X divided by the right endpoint of Y; the right endpoint of X divided by the left endpoint of Y; and the right endpoint of X divided by the right endpoint of Y. This division operation additionally selects the maximum of the same four terms (rounded up) to produce the resulting right endpoint. For the special case where the interval Y includes zero, X/Y is an exterior interval that is nevertheless contained in the interval R*.
Note that the result of any of these interval operations is the empty interval if either of the intervals, X or Y, are the empty interval. Also note, that in one embodiment of the present invention, extended interval operations never cause undefined outcomes, which are referred to as “exceptions” in the IEEE 754 standard.
Interval Version of Newton's Method
In the following description, the current interval is denoted by X at each step although X changes from step to step. Also, x denotes the midpoint m(X) so x changes as well. Note that except for a few cases described below, the Newton step is generally defined by an expansion about x.
The system starts by placing the initial interval, X0, in a list of intervals to be processed, L (step 602).
If L is empty, the system stops. Otherwise, the system selects the interval X, from L that has been in L for the shortest amount of time, and the system deletes X from L (step 604).
Next, the system determines if the derivative off which is represented as ƒ′, when evaluated over the interval X contains zero. In other words, is 0 ∈ƒ′(X)? (step 606). If not, the system finds roots for the case where ƒ′(X) does not contain zero (step 608). The operations involved in solving for this case are described in more detail with reference to
If ƒ′(X) contains zero at step 606, the system determines whether ƒI(x) contains zero (step 610). Note that ƒI(x) denotes the interval resulting from an evaluation of ƒ(x). If the conditions in both steps 606 and 610 are true, the system finds the roots of ƒ for the case where ƒ′(X) does not contain zero (step 612). The operations involved in solving for this case are described in more detail with reference to
If ƒI(x) does not contain zero at step 610, the system performs tests involving the stopping tolerances εX and εF (step 614). If the width of the interval X(denoted as w(X)) divided by the magnitude of X (denoted as |X|)is less than εX, and if the magnitude of ƒ(X) (denoted as |ƒ(X)|) is less than εF, the system records X as a final bound (step 616) and returns to step 604 to process additional intervals if necessary. Note that the width of the interval X=[a,b] (denoted as w(X)) is simply b−a. Also note that the magnitude of an interval X=[a, b] (denoted by |X|) is the maximum value of |x| for all x ∈ X. Thus, |X|=max(|a|,|b|). Furthermore, note that the stopping tolerances, εX and εF, are used to stop the system from performing further computations when the interval bound on a zero of ƒ is sufficiently narrow.
If at step 614 one or both of the tested quantities is greater than or equal to the corresponding stopping tolerance, the system applies the interval Newton step to the interval X (step 617).
The system then examines the result of the Newton step (step 618). If the result is empty, the system returns to step 604 to process additional intervals, if necessary. If the result is a single interval, the system determines whether the Newton step reduced the interval X by at least half in size (step 620). If so, the system returns to step 606 to continue processing the current interval X. Otherwise, the system splits X into halves, puts one half into the list of intervals to be processed, L, and designates the other half as the current interval X (step 622). The system then returns to step 606 to continue processing the current interval X.
If the result of the Newton step at step 618 is two intervals, the system puts one of the two intervals into the list of intervals to be processed, L, and designates the other interval as the current interval, X (step 624). The system then returns to step 606 to continue processing the current interval X.
Case Where 0 ∉ ƒ′(X)
Note that the relation 0 ∉ ƒ′(X) assures that there is no more than one point or interval zero of ƒ in X. Moreover, if there is a zero of ƒ in X, then it is a simple one. Also note that inclusion isotonicity assures that 0 ∉ ƒ′(X′) for any interval X′ contained in X.
The system starts by initializing a counter variable, n, and two flags, Fa and Fb, to zero (step 702). Note that the system sets Fa=1 when 0 ∈ƒ′(a). Similarly, the system sets Fb=1 when 0 ∈ƒ′(b). Also note that the system cycles through the operations listed below no more than four times. The integer n counts the cycles.
After initializing the variables in step 702, the system tests Fa to see if Fa==1 (step 703). If Fa==1, the system proceeds to step 710 to test flag Fb. On the other hand, if Fa≠1 at step 703, the system evaluates ƒ(a) to produce the resulting interval ƒI(a) (step 704). Next, the system determines if ƒI(a) contains zero (step 706). If so, the system sets Fa=1 (step 707) and proceeds to step 710 to test flag Fb. If at step 706 ƒI(a) does not contain zero, the system applies an interval Newton step wherein the point of expansion is “a” (step 708). If the result of this expansion is the empty interval, the system returns to process the next interval.
Otherwise, the system tests Fb to see if Fb==1 (step 710). If Fb==1, the system proceeds to step 718 to test both flags Fa and Fb. On the other hand, if Fb≠1 at step 710, the system evaluates ƒ(b) to produce the resulting interval ƒI(b) (step 712). Next, the system determines if ƒI(b) contains zero (step 714). If so, the system sets Fb=1 (step 715) and proceeds to step 718 to test both flags Fa and Fb. If at step 714 ƒI(b) does not contain zero, the system applies an interval Newton step wherein the point of expansion is “b” (step 716). If the result of this expansion is the empty interval, the system returns to process the next interval. Let x** be the midpoint of X that satisfies the condition 0 ∈ƒI(x) in step 610. x** is set in step 612. Note that if x** is no longer an element of X, the system uses the midpoint of the current X, m(X), as the point of expansion.
Otherwise, the system tests to see if both Fa==1 and Fb==1 (step 718). If both Fa==1 and Fb==1, the bounds, a and b, on the zero of ƒ are as tight as they are likely to become. Hence, the system returns to process the next interval (step 721).
Otherwise, the system applies an interval Newton step wherein the point of expansion is m(X) (step 720). If the result of this expansion is the empty interval, the system returns to process the next interval. Next, the system increments n (step 722) and tests to see if n<4 (step 724). If n≧4, the system returns to process the next interval because the bounds on the zero are unlikely to be further tightened by additional processing (step 726). Otherwise, if n<4, the system returns to step 703 to repeat the process.
Case Where 0 ∈ƒ′(X)
The system first evaluates ƒ(a) to produce the resulting interval ƒI(a) (step 802). Next, the system determines if ƒI(a) contains zero (step 804). If not, the system applies an interval Newton step wherein the point of expansion is “a” (step 806) and returns to the main program.
On the other hand, if ƒI(a) contains zero, the system evaluates ƒ(b) to produce the resulting interval ƒI(b) (step 808). Next, the system determines if ƒI(b) contains zero (step 810). If not, the system applies an interval Newton step wherein the point of expansion is “b” (step 812) and returns to the main program.
On the other hand, if ƒI(b) contains zero, the system next finds the centers, x1 and x2, of the two halves of the interval X by computing x1=(3a+b)/4 and X2=(a+3b)/4 (step 814). The system then evaluates the functions ƒ at each of the centers, x1 and x2, to produce the resulting intervals ƒI(x1) and ƒI(x2). Next, the system determines if both ƒ(x1) contains zero and ƒ(x2) contains zero (step 816). If so, the computed (interval) value of ƒ contains zero for each of the five equally spaced points, a, x1, m(X), x2 and b. In this case, additional processing is unlikely to narrow the X further. Hence, the system accepts X as a final bound on a zero of ƒ (step 822).
On the other hand, if either ƒI(x1) or ƒI(x2) does not contain zero, the system splits the interval X in half, and applies the interval Newton method to each half separately (step 820).
Note that the system can also make use of higher precision interval arithmetic to help resolve the uncertainty when 0 ∈ƒI(x) and 0 ∈ƒ′(X).
The foregoing descriptions of embodiments of the present invention have been presented for purposes of illustration and description only. They are not intended to be exhaustive or to limit the present invention to the forms disclosed. Accordingly, many modifications and variations will be apparent to practitioners skilled in the art.
For example, in the computational process described above with reference to
In another embodiment of the present invention, if the function, ƒ, has already been evaluated at a right endpoint or a left endpoint of an interval, X, the system saves the result of this evaluation in the list, L, so that the function does not have to be reevaluated at the right endpoint or left endpoint of X.
Hence, the above disclosure is not intended to limit the present invention. The scope of the present invention is defined by the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
5220524 | Hesson | Jun 1993 | A |
5631858 | Narita et al. | May 1997 | A |
20040015830 | Reps | Jan 2004 | A1 |
Number | Date | Country | |
---|---|---|---|
20030033339 A1 | Feb 2003 | US |