The present invention relates to the acceleration or in other words the fast computation of signal and image models and, more particularly, to an improved accelerated predictive-transform (“PT”) modeling method and apparatus for use in signal and image compression, estimation, detection, identification, channel source integrated (“CSI”) coding, control and other related application areas.
Accelerated signal and image modeling methods and devices are well known. Some known traditional acceleration methods are based on transformation signal and image modeling schemes that approximate optimum transformation schemes commonly referred to as the Karhunen-Loeve transform (“KLT”). For example, the discrete cosine transform (“DCT”) is such an acceleration scheme that has found widespread application in diverse areas such as image compression where it is the basis of international standards such as JPEG and MPEG. These standards decompose the images into low dimensional, 8×8, picture element (“pixel”) blocks for their fast transformation and subsequent compression. However, these acceleration schemes are limited to transformation models, and generally do not work as well when applied in more general modeling frameworks that combine both prediction and transformation.
For example, in image compression applications, a methodology integrating prediction and transformation models and using an optimum minimum mean squared error (“MMSE”) criterion has been found to significantly reduce the blocking effects that result from transformation methods such as the DCT that do not use prediction to exploit the correlation that exists between encoded pixel blocks. An image compression method based on this MMSE PT modeling methodology was filed on Oct. 22, 2000 as U.S. patent application Ser. No. 09/696,197 entitled “Super Predictive-Transform Coding” (the '197 application), the disclosure of which is herein incorporated by reference in its entirety. The method of the '197 application has been found to significantly outperform the best image compression techniques available at the present time, including both DCT and wavelet based compressors. The key to the high performance of the aforementioned invention is a MMSE PT signal and image model superstructure that forms the basis of the proposed method and apparatus. The MMSE PT model consists of prediction and transformation matrices that result from the solution of coupled eigensystem and normal design equations.
In addition to image compression, the modeling technique can also be incorporated in other signal and image processing applications such as estimation, detection, identification, channel and source integrated (“CSI”) coding, control and other related areas. See, for instance, Feria, E. H., “Predictive-Transform Estimation”, IEEE Transactions on Signal Processing, November 1991 (the “1991 IEEE Trans. On Signal Processing paper”) and also Feria, E. H., “Decomposed Predictive-Transform Estimation”, IEEE Transactions on Signal Processing, October 1994, the disclosures of both of which are herein incorporated by reference in their entirety, where it was shown that a MMSE PT signal model can be used to generate a new class of estimators that has classical Kalman and Wiener estimators as special cases and that in addition leads to very simple decomposed structures. Two other publications, Guerci, J. R. and Feria, E. H., “On a Least Squares Predictive-Transform Modeling Methodology,” IEEE Transactions on Signal Processing, July 1996, and Guerci, J. R. and Feria, E. H., “Application of a Least Squares Predictive-Transform Modeling Methodology to Space-Time Adaptive Array Processing,” IEEE Transactions on Signal Processing, July 1996, the disclosures of both of which are herein incorporated by reference in their entirety, demonstrate how the MMSE PT signal model forms the basis of a more general adaptive signal modeling strategy that has widespread applications. In Feria, E. H., “Predictive Transform: Signal Coding and Modeling,” Eleventh Triennial World Congress of IFAC, Tallinn (Estonia, USSR), Oxford: Pergamon Press, August 1990, the disclosure of which is herein incorporated by reference in its entirety, the technique was applied to the modeling of control processes or plants.
One problem with traditional methods of prediction and transformation is that they tend to be slow. Additionally, the computational burden of traditional methods is excessive.
These and other deficiencies in the methods for traditional accelerated signal and image modeling are addressed by the present invention.
A first embodiment of the present invention is directed to a method of digital image modeling involving a digital image that has been divided into a plurality of digital image blocks. First, a coder input vector of a plurality of digital image blocks is received. Second, a coefficient vector is approximated in which each coefficient of the coefficient vector corresponds to one of the digital image blocks. The approximating step further comprises the steps of (1) generating a forward transformation matrix that yields a coefficient vector from a product of the coder input vector and (2) decomposing the forward transformation matrix into a product of at least one of a multi-diagonal transformation matrix and a sparse unitary transformation matrix and an energy sequencer unitary transformation matrix.
A second embodiment of the present invention is directed to a method of digital image modeling involving a digital image that has been divided into a plurality of digital image blocks. First, a coefficient vector is received in which each coefficient or coefficient vector corresponds to one of a plurality of digital image blocks. Second, a coder input vector of the digital image blocks is approximated. The approximating step further comprises the steps of (1) generating a backward transformation matrix that yields a coder input vector from a product of the coefficient vector and (2) decomposing the backward transformation matrix into a product of at least one of a multi-diagonal transformation matrix and a sparse unitary transformation matrix and an energy sequencer unitary transformation matrix.
A third embodiment of the present invention is directed to a method of digital image modeling involving a digital image that has been divided into a plurality of digital image blocks. First, a predictive vector is received representing a plurality of digital image blocks immediately adjacent a coder input vector comprising a plurality of digital image blocks of a divided digital image. Second, a coefficient vector is approximated in which each coefficient of the coefficient vector corresponds to one of the digital image blocks of the divided digital image. The approximating step further comprises the steps of (1) generating a forward prediction matrix that yields a coefficient vector from a product of the coder input vector and (2) decomposing the forward prediction matrix into a product of at least one of a multi-diagonal and a sparse unitary transformation matrix and an energy sequencer unitary transformation matrix and a cross variance matrix divided by the coder input vector's average power.
Other objects and features of the present invention will become apparent from the following detailed description considered in conjunction with the accompanying drawings. It is to be understood, however, that the drawings are designed solely for the purpose of illustration and not as a definition of the invention.
The foregoing and other features of the present invention will be more readily apparent from the following detailed description and drawings of illustrative embodiments of the invention wherein like reference numbers refer to similar elements throughout the several views and in which:
a is a schematic block diagram of a traditional MMSE PT forward transformation technique;
b is a schematic block diagram of a forward transformation scheme in accordance with an embodiment of the present invention;
a is a schematic block diagram of a traditional MMSE PT backward transformation technique;
b is a schematic block diagram of a backward transformation scheme in accordance with an embodiment of the present invention;
a is a schematic block diagram of a traditional MMSE PT forward prediction technique; and,
b is a schematic block diagram of a forward prediction scheme in accordance with an embodiment of the present invention.
By way of overview and introduction, the present invention is the acceleration of both the design, on-line and/or off-line, and implementation, on-line, of optimum, i.e., MMSE, prediction and transformation matrices. These matrices are found to inherently arise from the solution of coupled eigensystem and normal, or generalized Wiener-Hopf, equations and lead to a MMSE PT signal or image model. The present invention has application in diverse fields, including but not limited to signal and image compression, estimation, detection, identification, CSI coding and control.
The present invention successfully overcomes the excessive computational burden associated with the evaluation of prediction and transformation matrices from generally high-dimensional coupled eigensystem and normal equations. This is achieved by the decomposition of the original high-dimensional design equations into a set of easily solved low-dimensional design equations. When the decomposed design equations are of very low dimension, say 2×2, analytical solutions are forthcoming resulting in even greater accelerations.
The decomposition of a PT unitary transformation matrix into a product of multi-diagonal and/or sparse unitary matrices in accordance with the present invention leads to an accelerated evaluation of the product of each PT unitary transformation matrix by an arbitrary vector. The representation of a PT prediction matrix by the product of an integer cross covariance matrix and a PT unitary transformation matrix in decomposed form, also leads to an accelerated evaluation of the product of each component of the PT prediction matrix by an arbitrary vector.
Hereinafter our method will be referred to as accelerated predictive-transform (“APT”), with the understanding that the acceleration refers to both the acceleration of the design of the APT transformation and prediction matrices and the acceleration of the product of the APT transformation and prediction matrices by arbitrary and properly dimensioned vectors.
A signal and/or image compression application will be used to first document the origins of the MMSE PT signal model from a unifying MMSE PT source coding scheme initially formulated in Feria, E. H., “Predictive-Transform Coding,” Proceedings of 1986 IEEE NAECON, Dayton, Ohio, May 1986 (the “1986 IEEE NAECON paper”), and then to show how the APT scheme of the current invention arises naturally from the MMSE PT signal model.
The APT scheme in accordance with the present invention will be explained using
The lossy encoder and decoder sections of
The coder input vector x(k+1) 114 of dimension n×1.
The coefficient vector c(k+1) of dimension n×1.
The coder input and coefficient vector estimates x̂(k+1) and ĉ(k+1), each of dimension n×1.
The prediction vector z(k) of dimension m×1.
The predicted coefficient vector c′(k+1) of dimension n×1.
The coefficient error or innovation vector δc(k) of dimension n×1. More specifically, δc(k)=[δc1(k) δc2(k) . . . δcn(k)]t with the variance of each scalar element δcj(k) increasing as the value of j increases from 1 to n. Also, the coefficient error δc(k) is zero mean with uncorrelated elements. A justification of these properties, based on the formulation of MMSE PT coding, may be found in the 1986 IEEE NAECON paper, the disclosure of which is incorporated herein by reference in its entirety.
The truncated coefficient error vector δe(k) of dimension q×1 where q≤n. The q elements of δe(k) are the q most energetic elements of δc(k), i.e., δe(k)=[δcn−q+1(k) δcn−q+2(k) . . . δcn(k)]t.
The scaled and truncated coefficient error vector δf(k) of dimension q×1.
The scaled and truncated coefficient error integer vector δf̂(k) of dimension q×1.
The truncated coefficient error vector estimate δê(k) of dimension q×1.
The coefficient error vector estimate δĉ(k) of dimension n×1.
The bit stream b of dimension 1×B where B denotes the number of bits present in b.
The following nine subsystems may also characterize the lossy encoder and decoder:
A transposed transformation matrix Rt 110 of dimension n×n—where the matrix R is unitary and hence its transpose is the same as its inverse—which multiplies the coder input vector x(k+1) 114 to yield the coefficient vector c(k+1). When the transformation is used in this way it is said that a forward transformation from the spatial domain of x(k+1) to the coefficient domain of c(k+1) has occurred.
A unitary transformation matrix R 420 that multiplies the coefficient vector estimate ĉ(k+1) to yield the coder input vector estimate x̂(k+1). When the transformation is used in this way it is said that a backward transformation from the coefficient domain of ĉ(k+1) to the spatial domain of x̂(k+1) has occurred.
A transposed prediction matrix Pt 530 of dimension n×m that is multiplied by the prediction vector z(k) to yield the predicted coefficient vector c′(k+1). In this case it is said that a forward prediction has occurred from the spatial domain of z(k) to the coefficient domain of c′(k+1).
A dimensionality reduction subsystem 150 that multiplies the n−q less energetic elements of the n-dimensional coefficient error vector δc(k) by zero gains. This multiplication, in turn, results in the q-dimensional truncated coefficient error vector δe(k).
A memory device 160 that temporarily stores recently reconstructed coder input vector estimates {x̂(0) . . . x̂(k)}. These stored vectors are used at each processing stage to construct the prediction vector z(k).
A scaling device with gain 1/Comp 170 responsible for establishing the amount of compression associated with the coder. More specifically, the constant Comp is adjusted to produce the desired amount of compression for the coder.
q scalar quantizers 180 implemented by finding the closest integer vector, δf̂(k), to the scaled and truncated coefficient error δf(k), i.e., δf̂(k)=Round(δf(k)).
A scaling device with gain Comp 172 responsible for generating the truncated coefficient error vector estimate δê(k) from the scaled and truncated coefficient error integer vector δf̂(k).
A dimensionality restoration subsystem 190 that restores the coefficient error estimate δĉ(k) from the truncated coefficient error estimate δê(k) via zero padding.
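By way of illustration only, the dimensionality reduction, scaling, quantization, rescaling and dimensionality restoration subsystems listed above may be sketched in Python with NumPy as follows. The function name lossy_roundtrip and its argument names are hypothetical and are not taken from the figures, and NumPy's round-half-to-even rule is assumed for Round(·), whose tie-breaking behavior the text does not specify.

```python
import numpy as np

def lossy_roundtrip(delta_c, q, comp):
    """Pass a coefficient error vector through the lossy chain described above.

    delta_c : length-n coefficient error, elements ordered by ascending variance
    q       : number of retained (most energetic, i.e. last) elements
    comp    : the compression constant Comp
    Returns the integer vector that would be losslessly encoded and the
    length-n coefficient error estimate recovered at the decoder.
    """
    delta_c = np.asarray(delta_c, dtype=float)
    n = delta_c.shape[0]
    delta_e = delta_c[n - q:]              # dimensionality reduction: keep last q elements
    delta_f = delta_e / comp               # scaling device with gain 1/Comp
    delta_f_hat = np.round(delta_f)        # q scalar quantizers (assumed rounding rule)
    delta_e_hat = comp * delta_f_hat       # scaling device with gain Comp
    delta_c_hat = np.zeros(n)              # dimensionality restoration by zero padding
    delta_c_hat[n - q:] = delta_e_hat
    return delta_f_hat.astype(int), delta_c_hat
```

Larger values of comp map more of the coefficient error onto the same integers, which is how a single constant can set the amount of compression.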
The lossless PT encoder 140 and decoder 142 of
The PT source coder structure of
E[(x(k+1)−x̂(k+1))t(x(k+1)−x̂(k+1))] (1)
with respect to the prediction and transformation matrices P and R and subject to zero mean and uncorrelated coefficient innovations, leads to optimum prediction and transformation matrices, P and R, that are designed off-line (or on-line in adaptive applications) by solving the following coupled eigensystem and normal equations:
KR=RΛ (2)
P=JR (3)
where: a) K is a n×n positive definite error or innovation covariance matrix; b) J is a m×n cross covariance matrix; c) Λ is a n×n diagonal eigenvalue matrix; d) Im is a m×m identity matrix; e) 0m×1 is a zero column vector; f) Ex and Ez are first order expectations of the coder input vector 114 and prediction signals 112 x(k+1) and z(k); g) Exxt, Exzt, Ezxt and Ezzt are second order expectations of x(k+1) and z(k); and h) “Inv(•)” denotes a matrix inversion.
The need for the off-line (or on-line for the adaptive case) evaluation of the optimum prediction and transformation matrices, P and R, from the MMSE PT design equations (2)–(3) can result in a significant computational burden. This problem is well documented in pp. 767–771 of the recent text “Numerical Methods for Engineers”, by Steven C. Chapra and Raymond P. Canale, Fourth Edition, 2002, herein incorporated by reference in its entirety. For example, the “Power Method” described in that text begins the evaluation of each eigenvector/eigenvalue of R with the assumption that the initial value for the considered eigenvector (the one with the highest eigenvalue) is a vector whose elements are all one, and then obtains during each algorithmic iteration an improved result until the desired eigenvector is obtained. A new eigenvector is then obtained using as a foundation the previously obtained eigenvector. Clearly this approach is computationally inefficient, particularly for the case where the dimensionality of the eigensystem is large. The APT invention to be discussed shortly addresses this fundamental problem.
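For reference, a power iteration of the general kind referred to above may be sketched as follows. This is a minimal sketch and not the listing of the cited text: the function name power_method, the Rayleigh-quotient eigenvalue estimate and the stopping tolerance are assumptions made here, and only the dominant eigenpair of a symmetric matrix is computed.

```python
import numpy as np

def power_method(K, tol=1e-12, max_iter=1000):
    """Estimate the dominant eigenvalue/eigenvector of a symmetric matrix K."""
    K = np.asarray(K, dtype=float)
    v = np.ones(K.shape[0])                # initial guess: all elements equal to one
    v /= np.linalg.norm(v)
    lam_prev = 0.0
    for _ in range(max_iter):
        w = K @ v                          # one full matrix-vector product per iteration
        lam = float(v @ w)                 # Rayleigh quotient for the current unit vector
        v = w / np.linalg.norm(w)          # improved, renormalized eigenvector estimate
        if abs(lam - lam_prev) < tol:
            break
        lam_prev = lam
    return lam, v
```

Each iteration costs a full n×n matrix-vector product, and the procedure has to be repeated for every further eigenvector, which is the inefficiency the passage above describes for large n.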
When the PT source coder is fully lossless, i.e., when q=n, Comp=1, and the q scalar quantizers are all replaced with unity gains, it follows that
δĉ(k)=δc(k) (6)
and the lossy decoder of
x(k+1)=Rc(k+1) (7)
c(k+1)=Ptz(k)+δc(k) (8)
where all the variables of the signal model (7) and (8) were defined earlier for
An investigation of
n² multiplications and n(n−1) additions (9)
under the simplifying assumption that m=n when considering the MMSE PT forward prediction product. As with the solution of the coupled eigensystem and normal design equations (2) and (3), the computational burden associated with the above three products (9) becomes excessive when the dimension n of the input signal is large. The APT invention to be discussed shortly addresses this fundamental problem.
E[x_{i,j}] = C and E[x_{i,j} x_{i+v,j+h}] = P_avg ρ^{√(v²+h²)} (10)
where: a) E[xi,j] denotes the mean value of the pixel xi,j; b) C is a constant value; c) v and h denote the vertical and horizontal distances between pixels xi,j and xi+v,j+h; d) E[xi,jxi+v,j+h] denotes the correlation between the pixels xi,j and xi+v,j+h; e) Pavg denotes the average power for any pixel xi,j; and f) ρ denotes the correlation coefficient between any two adjacent pixels either in the vertical direction, i.e., the pixels xi,j and xi+1,j for all i and j, or horizontal direction, i.e., the pixels xi,j and xi,j+1 for all i and j.
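The second order statistics of such an isotropic model may be evaluated with a short routine such as the one sketched below. The sketch takes the exponent of ρ to be the Euclidean distance between the two pixels, which is an assumption of this sketch, and the function and argument names are hypothetical.

```python
import math

def isotropic_correlation(v, h, p_avg, rho):
    """Correlation E[x(i,j) x(i+v,j+h)] under an assumed isotropic model.

    v, h  : vertical and horizontal pixel distances
    p_avg : average pixel power (value at zero distance)
    rho   : correlation coefficient between adjacent pixels
    """
    distance = math.hypot(v, h)      # assumed Euclidean distance between the pixels
    return p_avg * rho ** distance   # depends on distance only, not on direction
```

Because only the distance enters, a pair of pixels separated by (v, h) has the same correlation as a pair separated by (h, v), which is what makes the model isotropic.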
To facilitate the presentation of the APT invention we will assume that monochrome images are being compressed and each image pixel is represented with 8 bits, i.e., the image pixels can have integer values from 0 to 255. In addition, we will assume for the isotropic model of (10) the parameter values C=100, Pavg=11,200, and ρ=0.99, and for the coder input vector 214 and prediction signal 212 geometry the one shown in
Using expressions (11) and (12) in the design equations (2)–(3) the following MMSE PT 7×8 prediction, 8×8 transformation and 8×8 eigenvalue matrices are obtained:
Note from (15) that the eigenvalues associated with the MMSE PT transformation matrix R are in ascending order. This property is consistent with the desire to have the variance of the elements of the coefficient innovation δc(k)=[δc1(k) δc2(k) . . . δcn(k)]t of the PT source coder of
In
c(k+1)=Rtx(k+1), (16)
is replaced in the present invention with a cascade of L+1 computational blocks that as a
when evaluating the expression
c̃(k+1)=R̃tx(k+1) with R̃=H0H1 . . . HL−1E (19)
where: a) the dimension of the input signal x(k+1), n, is assumed to be some positive integer power of 2, i.e.,
n=2^N with N∈{1, 2, . . . } (20)
b) R̃ is a n×n unitary transformation matrix that approximates the MMSE PT transformation R; c) c̃(k+1) is an approximation to the MMSE PT coefficient vector c(k+1); d) Hf for all f=0, 1, . . . , L−1 is a n×n unitary transformation matrix characterized by a sparse structure, with exactly wf nonzero elements for each of its rows and columns, and a wf×wf eigensystem, that may be solved to find all of its nonzero elements; e) wf for all f is a positive integer power of 2 less than or equal to n, i.e.,
wf=2^{qf} with qf a positive integer (21)
and
wf≤n; (22)
f) the sparse structure of Hf leads to a computational burden for the evaluation of expression (19) that is summarized by expressions (17) and (18); g) the number of Hf computational blocks, L, satisfies the constraint
e.g., when wf is given by the stage invariant expression
wf=2^q for all f (24)
it follows from (23) that
L=(log2 n)/q, (25)
furthermore it is noted, after using (25) in expressions (17) and (18), that for the assumed stage invariant expression for wf (24) the computational burden of the APT forward transformation is given by the more compact expressions
2^q n (log2 n)/q multiplications (26)
and
(2^q−1) n (log2 n)/q additions; (27)
and h) E is an energy sequencer unitary transformation matrix of dimension n×n characterized by a sparse structure and composed of exactly n unity elements; the objective of this matrix is to organize the energy of the elements of the coefficient innovation vector δc(k) in an ascending order, as required by
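The operation counts discussed above can be tabulated with a few lines of Python. The sketch below assumes that each sparse stage Hf with wf nonzero elements per row contributes n·wf multiplications and n·(wf−1) additions, that the energy sequencer E contributes no arithmetic since it only reorders elements, and that the stage widths multiply to n; these assumptions are consistent with the compact expressions (26) and (27) but are stated here rather than quoted from the text. The function names are hypothetical.

```python
from math import prod

def apt_burden(n, widths):
    """Multiplications and additions for a cascade of sparse stages.

    n      : dimension of the input vector (a power of two)
    widths : list [w_0, ..., w_{L-1}] of nonzero elements per row of each stage
    """
    if prod(widths) != n:
        # equivalent to requiring that the log2 of the widths sum to log2(n)
        raise ValueError("stage widths must multiply to n")
    mults = n * sum(widths)
    adds = n * sum(w - 1 for w in widths)
    return mults, adds

def direct_burden(n):
    """Operation count for a dense n-by-n matrix-vector product, as in (9)."""
    return n * n, n * (n - 1)
```

For n=8 with three stages of width 2 this gives 48 multiplications and 24 additions, against 64 and 56 for the dense product; the gap widens quickly as n grows.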
The sparse unitary transformation matrix Hf is defined by the following three expressions:
where: a) Sf is a nonnegative integer number given by the following recursive expression
S0=0; (32)
b) the dimension zf of the zf×zf diagonal matrix
is given by the following expression
zf=2−S
c)
for all g is a zfwf×zfwf multi-diagonal, more specifically (2wf−1)-diagonal, matrix consisting of wf2 diagonal sub-matrices,
d)
Hf is a sparse matrix-diagonal matrix consisting of 2f(2wf−1)-diagonal matrices,
e)
Hf consists of
nwf nonzero elements (34)
and
n(n−wf) zero elements; (35)
and f) the nwf nonzero elements of Hf are determined via a wf×wf eigensystem equation to be defined next after an illustrative example is considered.
The sparse structure of Hf is now illustrated using the image compression example of
From the above Hf structures it is noted that only sixteen (16) out of the n²=64 elements of H0, H1 or H2 are nonzero values; this also confirms expression (34), which yields the same number of nonzero values when n=8 and wf=2. Using expressions (26) and (27), it is also noted that the computational burden associated with the APT forward transformation of the present invention shown in
48 multiplications and 24 additions. (39)
Since the corresponding number of multiplications and additions for the traditional forward transformation technique of
The wf×wf eigensystem equation from which all the elements of Hf for all f=0, 1, . . . , L−1 may be designed is given by
hKfhRf=hRfhΛf (40)
where hKf, hRf and hΛf are wf×wf innovation covariance, eigenvector and eigenvalue matrices defined as follows:
where: a) the matrix index h has n/wf possible realizations defined by
h=1+gwfzf, 2+gwfzf, . . . zf+gwfzf (44)
with g having 2S
g=0, 1, . . . , 2S
b) the set of innovation covariance elements
of (41) are found directly from a f-th iteration n×n innovation covariance matrix Kf, to be defined shortly, where
is the i-th row and j-th column element of Kf; c) the set of transformation elements
of (42) for all possible values of h are the same as those of the
diagonal sub-matrices,
see expressions (28)–(30); and d) the set of transformation elements
of (42) and the set of eigenvalues
of (43) result from the solution of the eigensystem design equation (40).
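When wf=2 each eigensystem of the form (40) involves a symmetric 2×2 covariance matrix and can be solved in closed form. The following is a minimal sketch under that assumption; the function name solve_2x2_eigensystem, the ascending ordering of the returned eigenvalues and the sign convention of the eigenvectors are choices made for this sketch and are not fixed by the text.

```python
import math
import numpy as np

def solve_2x2_eigensystem(a, b, d):
    """Closed-form solution of [[a, b], [b, d]] R = R diag(lam).

    Returns (R, lam) with orthonormal eigenvectors as the columns of R and
    the two eigenvalues in ascending order.
    """
    theta = 0.5 * math.atan2(2.0 * b, a - d)    # rotation angle that diagonalizes the matrix
    c, s = math.cos(theta), math.sin(theta)
    mean = 0.5 * (a + d)
    half_gap = 0.5 * math.hypot(a - d, 2.0 * b)
    # (c, s) belongs to the larger eigenvalue and (-s, c) to the smaller one
    R = np.array([[-s, c],
                  [c, s]])
    lam = np.array([mean - half_gap, mean + half_gap])
    return R, lam
```

No iteration is needed, so the design cost of a stage built from 2×2 blocks is a fixed, small number of operations per block.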
Finally, the f-th iteration n×n innovation covariance matrix Kf, from which the set of elements
of (41) are obtained is found recursively from the following expression
K_{f+1}=H_f^t K_f H_f for f=0, 1, . . . , L−1 (46)
where the initial value for Kf, K0, is the same as the innovation covariance K of (2), i.e.,
K0=K. (47)
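The recursion (46)–(47) can be sketched as below. The sketch takes the stage matrices Hf as given n×n arrays and does not construct their sparse structure; the function name run_recursion is hypothetical, and the running product of the stages is returned alongside the final covariance purely for convenience.

```python
import numpy as np

def run_recursion(K0, stages):
    """Apply K_{f+1} = H_f^t K_f H_f for each stage, starting from K_0 = K.

    K0     : n-by-n innovation covariance matrix
    stages : list [H_0, ..., H_{L-1}] of n-by-n unitary stage matrices
    Returns the product H_0 H_1 ... H_{L-1} and the final covariance K_L.
    """
    K = np.asarray(K0, dtype=float)
    product = np.eye(K.shape[0])
    for H in stages:
        K = H.T @ K @ H          # covariance seen after this stage
        product = product @ H    # accumulate H_0 H_1 ... H_f
    return product, K
```

If a stage happens to contain the exact eigenvectors of the covariance it receives, the matrix returned as K_L is diagonal; in general it is only approximately diagonal, and its off-diagonal energy indicates how close the cascade is to the optimum transformation.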
The solution of the eigensystem (40) is now illustrated with our image compression example of
We then proceed to find all the nonzero elements of H0 by solving the following four 2×2 eigensystem design equations:
where, for instance, the innovation covariance matrix of (49) is obtained from (48) and is given by
Using expression (53) in (49) the following eigenvector and eigenvalue matrices are obtained:
The remaining design equations (50)–(52) are then solved in a similar fashion as (49) resulting in the following expression for H0:
Using expressions (48) and (55) for K0 and H0 in the recursive expression (46) the following result is then obtained for K1:
All the nonzero elements of H1 are next found by solving the following four 2×2 eigensystem design equations:
Using K1 of (56) when solving (57)–(60) the following expression is then derived for H1:
The last iteration innovation covariance matrix K2 for our example is then derived by using expressions (56) and (61) for K1 and H1 in the recursive expression (46) to yield
All the nonzero elements of H2 are next found by solving the following four 2×2 eigensystem design equations:
Using K2 of (62) when solving (63)–(66) the following expression is then derived for H2:
As a preamble to the definition of the energy sequencer unitary matrix E, a Preliminary APT forward transformation matrix R̄ is defined as
R̄=H0H1 . . . HL−1. (68)
Multiplying the transpose of R̄ by the input signal x(k+1) one then obtains a Preliminary APT coefficient vector c̃′(k+1), i.e.,
c̃′(k+1)=R̄tx(k+1) (69)
From (19), (68) and (69) it then follows that the relation between the APT coefficient vector c̃(k+1) of (19) and the Preliminary APT coefficient vector c̃′(k+1) of (69) is given by
c̃(k+1)=Etc̃′(k+1). (70)
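The cascade implied by (19) and (68)–(70) may be sketched as follows: the transposed stages are applied one at a time, H0 first, and the energy sequencer then reorders the result. Dense NumPy products are used for brevity, whereas a practical implementation would exploit the sparsity of each stage. The names apt_forward and apt_backward are hypothetical, and the energy sequencer is represented by an index list order such that column j of E is column order[j] of the identity matrix, which is a representation chosen for this sketch.

```python
import numpy as np

def apt_forward(x, stages, order):
    """Compute E^t H_{L-1}^t ... H_1^t H_0^t x stage by stage."""
    y = np.asarray(x, dtype=float)
    for H in stages:              # H_0 is applied first, H_{L-1} last
        y = H.T @ y
    return y[order]               # E^t only reorders the elements

def apt_backward(c, stages, order):
    """Compute H_0 H_1 ... H_{L-1} E c, the inverse of apt_forward."""
    y = np.empty(len(order))
    y[order] = np.asarray(c, dtype=float)   # E undoes the reordering
    for H in reversed(stages):               # H_{L-1} is applied first, H_0 last
        y = H @ y
    return y
```

Because every stage and the sequencer are unitary, the backward routine simply retraces the forward one in reverse order with untransposed stages.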
Next it is noted that the eigenvalues associated with R̄ can be approximated by the main diagonal of the L-th iteration innovation covariance KL. Denoting the Preliminary APT eigenvalue matrix of R̄ by the n×n diagonal matrix Λ̄, it then follows that
diag(Λ̄)≈diag(KL) (71)
Also in the ideal case that occurs when the off-diagonal elements of KL are zero, it can be shown that the eigenvectors of the MMSE PT transformation matrix R and the Preliminary APT transformation matrix R̄ are identical. This in turn implies that the main diagonal of KL will contain all the eigenvalues of the MMSE PT transformation matrix R, except that they will not appear necessarily in ascending order as they appear for the MMSE PT eigenvalue matrix Λ illustrated in (15) for our simple example. The objective of the energy sequencer unitary matrix E is then to rearrange the eigenvectors of the Preliminary APT transformation matrix R̄ such that their corresponding eigenvalues appear in ascending order as required by the PT source coder structure of
Next we illustrate the above methodology for obtaining E with the image example of
R̄=H0H1H2. (72)
Using expressions (55), (61) and (67) for H0, H1 and H2 in (72) we then obtain the following result for R̄:
Next we determine the L-th iteration innovation covariance KL for our example, i.e., K3, by substituting expressions (62) and (67) for K2 and H2 in (46) to yield
Making use of (74) in (71) it then follows that the Preliminary APT eigenvalue matrix for R̄, Λ̄, is given by
A comparison of expressions (14) and (15) for the MMSE PT transformation matrix R and corresponding eigenvalue matrix Λ with expressions (73)–(75) of the Preliminary APT transformation matrix R̄ and eigenvalue matrix Λ̄ reveals the following two results:
The first result is that the main diagonal values of K3 (74) are indeed approximations to the eigenvalues of the corresponding eigenvectors for R̄. It is of interest to note that the off-diagonal elements of K3 tend, in a general sense, to be significantly smaller than its main diagonal values for the example considered here. In addition, it is noted that the magnitudes of the main diagonal elements of K3 approximate the eigenvalues of the MMSE PT transformation matrix R, e.g., the smallest value is 6.08 which is very close to the smallest eigenvalue of R which is 5.853, and the largest value is 101.24 which is even closer, using percentage change as criterion, to the largest eigenvalue of R which is 101.6.
The second result is that the energy sequencer unitary matrix E that will order the elements of the main diagonal of K3 (74), or equivalently Λ̄ (75), in ascending order is given by
Multiplying the Preliminary APT transformation matrix R̄ by the energy sequencer unitary matrix E one then obtains the desired APT transformation matrix
Furthermore we denote the APT eigenvalue matrix of R̃ by the n×n diagonal matrix Λ̃. It can be shown that this matrix is related to the energy sequencer unitary matrix E and the Preliminary APT eigenvalue matrix Λ̄ by the following relation
Λ̃=EtΛ̄E. (78)
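One way to obtain an energy sequencer of the kind described above is to sort the main diagonal of KL and build the corresponding permutation matrix, as in the sketch below. The function name energy_sequencer is hypothetical, and the use of a stable sort to break ties between equal diagonal values is an assumption of this sketch.

```python
import numpy as np

def energy_sequencer(K_L):
    """Build a permutation matrix E that sorts diag(K_L) in ascending order.

    Returns (E, order), where column j of E is column order[j] of the identity,
    so that E^t diag(K_L) E has an ascending main diagonal.
    """
    K_L = np.asarray(K_L, dtype=float)
    order = np.argsort(np.diag(K_L), kind="stable")
    E = np.eye(K_L.shape[0])[:, order]   # exactly n unity elements, one per row and column
    return E, order
```

Multiplying the preliminary transformation matrix on the right by E reorders its columns in the same way, so that the eigenvectors and their approximate eigenvalues stay paired.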
Making use of (75) and (76) in (78) it then follows that
At this juncture it will be shown how the APT transformation matrix (77) and corresponding APT eigenvalue matrix (79) can be further improved while yielding an on-line computational burden very close to that of (39). This is done by using a so-called “Booster Stage” with w0=4 that boosts by a factor of two the number of nonzero elements of the initial computational block H0, see expression (36) for the case where w0=2, and then employing the lower value of wf=2 for the remaining f=1 stage; it should be noted from expression (23) that the number of Hf stages L is now given by two for our simple example, i.e., L=2 since
log2(w0=4)+log2(w1=2)=log2(n=8). (80)
Using the same methodology used to derive expressions (77) and (79) it is first found that H0 and H1 have the following structure:
Note from (81) that the computational block H0 now has four nonzero elements for each of its rows and columns as opposed to two nonzero elements for the previous example which is shown in (36).
The nonzero elements of the new H0 are then found from the following two 4×4 eigensystem equations:
On the other hand, the nonzero elements of H1 are found by solving the following four 2×2 eigensystem design equations:
Solving the innovation covariance recursive equation (46) in conjunction with expressions (81) thru (88) the following results are obtained for the Preliminary APT Transformation R̄=H0H1 and the L-th iteration innovation covariance KL=K2
Next it is noted that the energy sequencer unitary matrix E (70) that will organize the diagonal elements of K2 (90) in ascending order is given by the following expression:
Making use of expressions (89) and (91) in (19) and (78) we then obtain the following expressions for the desired APT transformation matrix R̃ and corresponding APT eigenvalue matrix Λ̃.
Finally the on-line computational burden associated with the Booster scheme is noted from (17) and (18) to be given by
8(4+2)=48 multiplications and 8(3+1)=32 additions (94)
A comparison of expressions (92)–(93) and (77)–(79) with the MMSE PT eigenvector and eigenvalue matrices (14) and (15) reveals a noticeable improvement in the approximations due to the use of our Booster algorithm. It is also noticed from a comparison of the computational burdens (39) and (94) for the two algorithms that the approximation improvements of the Booster algorithm are obtained with a negligible increase in the on-line computational burden, i.e., only eight additional additions are needed for the Booster algorithm. With regard to the design computational burden, off-line and/or on-line, the solution of the two 4×4 eigensystem expressions (83) and (84) for the Booster computational block H0 is not as efficient as the solution of the 2×2 eigensystem equations required by the first algorithm, see for instance expressions (49)–(52); it should be noted, however, that for our image compression application the design computational burden is of no practical concern since the design is done off-line without regard to the design time used.
In
{circumflex over (x)}(k+1)=Rĉ(k+1), (95)
is substituted with L+1 computational blocks that require
in the evaluation of the expression
{tilde over (x)}(k+1)={tilde over (R)}ĉ(k+1) with {tilde over (R)}=H_0 H_1 . . . H_{L−1} E (98)
where all the parameters of
The forward prediction apparatus of the APT invention is depicted in
c′(k+1)=P^t z(k), (99)
is substituted with L+3 computational blocks where: a) {overscore (z)}(k) is an integer vector of dimension m×1 that is obtained by rounding the real prediction vector z(k), i.e.,
{overscore (z)}(k)=Round(z(k)); (100)
b) Pavg is either a real or integer scalar that denotes the average power of the input signal x(k+1), see for instance the isotropic model of (10) which makes use of Pavg in the generation of the second order statistics of the illustrative image example of
{overscore (J)}=Round(PavgJ). (101)
and d) the MMSE PT cross covariance matrix J is approximated in
J≈{overscore (J)}/Pavg. (102)
The validity of the approximation (102) is next demonstrated with our image example of
Using expression (103) and also Pavg=11,200 in (102) it then follows that J is approximated by
A comparison of expressions (12) and (104) reveals that the approximation of J with {overscore (J)}/Pavg is indeed an excellent approximation for the considered example.
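The quality of approximation (102) can also be checked numerically. The sketch below is an illustration only: the entries of `J` are hypothetical values and are not those of expression (12), and NumPy's `rint` (which rounds halves to the nearest even integer) stands in for the Round operation of (101), whose tie-breaking rule is not specified here. Only Pavg=11,200 is taken from the text.

```python
import numpy as np

P_AVG = 11200  # average power used in the text

def quantize_cross_covariance(J, p_avg):
    """Integer matrix J_bar = Round(p_avg * J), as in expression (101)."""
    return np.rint(p_avg * J).astype(np.int64)

# Hypothetical real-valued cross covariance matrix (illustrative values only).
J = np.array([[0.9512, 0.9048],
              [0.9048, 0.8607],
              [0.8607, 0.8187]])

J_bar = quantize_cross_covariance(J, P_AVG)   # integer matrix
J_approx = J_bar / P_AVG                      # approximation (102)

# Rounding moves each entry of p_avg * J by at most 0.5, so each entry of
# J_approx differs from J by at most 0.5 / p_avg.
max_error = np.max(np.abs(J - J_approx))
print(J_bar)
print(max_error, 0.5 / P_AVG)
```

For Pavg=11,200 the worst-case entry error is therefore about 4.5×10⁻⁵, regardless of the particular values of J.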
As summarized in
c′(k+1)={tilde over (P)}^t {overscore (z)}(k) (105)
where
{tilde over (P)}={overscore (J)}{tilde over (R)}/Pavg, (106)
is as follows:
The integer product
{overscore (J)}^t {overscore (z)}(k) (107)
consists of
nm integer multiplications and n(m−1) integer additions. (108)
The ratio
[{overscore (J)}^t {overscore (z)}(k)]/Pavg (109)
can be computed via
n divisions (110)
either real or integer depending on the nature of Pavg.
The real product
E^t H_{L−1}^t . . . H_1^t H_0^t [{overscore (J)}^t {overscore (z)}(k)/Pavg] (111)
is computed by executing
For completeness of presentation the APT forward predictor of expression (106) is evaluated next for the simple example of
A comparison of the MMSE PT forward predictor P of (13) and the APT forward predictor {tilde over (
At this point it is noted that the computational burden associated with the APT forward predictor of
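The three-step evaluation of the APT forward predictor in expressions (105) through (111) can be sketched as follows. This is a minimal illustration under stated assumptions, not the apparatus itself: the function name `apt_forward_predict`, the dimensions m=3 and n=4, and all matrix entries are hypothetical, and the blocks H0 and H1 are filled with dense random values, so the sketch checks only the order of operations and not the savings that sparse computational blocks provide.

```python
import numpy as np

def apt_forward_predict(z, J_bar, p_avg, blocks, E):
    """Evaluate c'(k+1) = P_tilde^t z_bar(k) step by step.

    blocks is the list [H_0, H_1, ..., H_{L-1}]; E is the energy sequencer.
    """
    z_bar = np.rint(z).astype(np.int64)   # (100): rounded prediction vector
    s = J_bar.T @ z_bar                   # (107): integer product
    v = s / p_avg                         # (109): n divisions
    for H in blocks:                      # (111): H_0^t first, H_{L-1}^t last
        v = H.T @ v
    return E.T @ v                        # (111): energy sequencer applied last

# Hypothetical data, for illustration only.
rng = np.random.default_rng(0)
m, n = 3, 4
p_avg = 11200
J_bar = rng.integers(0, p_avg, size=(m, n))   # integer m x n matrix
H0 = rng.standard_normal((n, n))
H1 = rng.standard_normal((n, n))
E = np.eye(n)[:, ::-1]                        # a permutation matrix
z = rng.uniform(0.0, 255.0, size=m)

c_fast = apt_forward_predict(z, J_bar, p_avg, [H0, H1], E)

# Direct evaluation of (105) with P_tilde formed explicitly as in (106).
P_tilde = J_bar @ H0 @ H1 @ E / p_avg
c_direct = P_tilde.T @ np.rint(z)
print(np.allclose(c_fast, c_direct))
```

Both routes give the same coefficient vector; the factored route differs in that the first product involves integers only and the remaining products can exploit whatever sparsity the computational blocks have.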
In some cases it is possible to find analytical solutions for the reduced order APT eigensystem design equation (40). For instance, when wf=2 the APT eigensystem design equation is of dimension 2×2 and it is generally described as follows:
Under the assumption that the K matrix given in (115), i.e.,
is a positive definite innovation covariance matrix, an analytical solution is forthcoming for the corresponding optimum transformation matrix R, i.e.,
and its eigenvalue matrix Λ, i.e.,
The analytical solution of the 2×2 eigensystem (115) is as follows:
For the case where kj,j>ki,i it can be shown that
ri,i=cos(Φ), (119)
ri,j=rj,i=sin(Φ) (120)
rj,j=−cos(Φ) (121)
where
Φ=0.5 arctan(2ki,j/(ki,i−kj,j)) (122)
and for the case where kj,j≦ki,i it can be shown that
ri,i=sin(Φ), (123)
ri,j=rj,i=−cos(Φ), (124)
rj,j=−sin(Φ) (125)
where Φ is as defined in (122).
The eigenvalues of (118) are in turn given by the following analytical expressions:
The above analytical expressions (119)–(125) accelerate the solution of the 2×2 eigensystem (115). This acceleration will be highly desirable in adaptive applications where the signal and/or image model must be found on-line.
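A minimal numerical sketch of expressions (119) through (125) is given next. It assumes that the 2×2 eigensystem has the standard symmetric form KR=RΛ, and that the second case applies whenever kj,j does not exceed ki,i. The function name `analytic_eig_2x2`, the handling of the limit ki,i=kj,j (where the argument of the arctangent in (122) is unbounded), and the test matrices are assumptions made for illustration. The eigenvalues are read off the diagonal of R^t K R rather than from closed-form expressions.

```python
import math
import numpy as np

def analytic_eig_2x2(k_ii, k_ij, k_jj):
    """Return (R, eigenvalues) for K = [[k_ii, k_ij], [k_ij, k_jj]]."""
    d = k_ii - k_jj
    if d != 0.0:
        phi = 0.5 * math.atan(2.0 * k_ij / d)        # expression (122)
    elif k_ij != 0.0:
        phi = math.copysign(math.pi / 4.0, k_ij)     # assumed limit of (122)
    else:
        phi = 0.0                                    # K is already diagonal
    c, s = math.cos(phi), math.sin(phi)
    if k_jj > k_ii:
        R = np.array([[c, s], [s, -c]])              # (119)-(121)
    else:
        R = np.array([[s, -c], [-c, -s]])            # (123)-(125)
    K = np.array([[k_ii, k_ij], [k_ij, k_jj]])
    Lam = R.T @ K @ R                                # diagonal when K R = R Lam
    return R, np.diag(Lam).copy()

# Hypothetical positive definite matrices covering both cases and the limit.
cases = [(2.0, 1.0, 3.0), (3.0, 1.0, 2.0), (3.0, -1.0, 2.0), (2.0, 0.5, 2.0)]
results = []
for k_ii, k_ij, k_jj in cases:
    R, lam = analytic_eig_2x2(k_ii, k_ij, k_jj)
    K = np.array([[k_ii, k_ij], [k_ij, k_jj]])
    results.append((K, R, lam))
    print(lam, np.linalg.eigvalsh(K))
```

In each of these cases the analytical R is orthogonal and returns the eigenvalues in ascending order, in agreement with a general-purpose eigenvalue routine, while requiring only one arctangent, one sine and one cosine evaluation.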
While the invention has been particularly shown and described with reference to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention.
This application claims the benefit of priority from provisional U.S. Patent Application Ser. No. 60/337,787 entitled “Accelerated Predictive-Transform” filed Nov. 7, 2001, the disclosure of which is incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
20030113024 A1 | Jun 2003 | US |
Number | Date | Country | |
---|---|---|---|
60337787 | Nov 2001 | US |