The present invention relates generally to signal processing techniques, and more particularly to interpolating filter banks for use in processing signals which include samples arranged in the form of an arbitrary lattice in any number of dimensions.
Interpolating filter banks are presently used to process sampled signals in a wide variety of applications, such as video signal processing. A signal put through an interpolating filter bank may be in the form of a multidimensional lattice of sample points. For example, in video applications, an input signal put through the filter bank may be a three-dimensional set of pixels. The interpolating filter bank generally downsamples the input signal, thereby selecting a subset of the signal sample points for further processing. The filter bank includes predict filters for providing an estimate of the sample points which were removed as a result of the downsampling. The output of the predict filters may be supplied to other signal processing hardware or software for further processing of the downsampled signal. When this further processing is complete, the filter bank interpolates and upsamples the result to provide an output with the same number of sample points as the original input signal. The interpolating filter bank thus allows a high-resolution sampled signal to be processed at a lower sample rate, and then reconstructs the result to return the processed signal to its original resolution. Such filter banks are useful in numerous digital signal processing applications.
It is generally desirable for a filter bank to provide “perfect reconstruction.” That is, a filter bank configured to downsample, interpolate, and then upsample a given input signal, without any further processing of the downsampled and interpolated signal, should ideally provide perfect reconstruction of the original input signal. Filter banks designed to provide this property are described in, for example, F. Mintzer, “Filters for distortion-free two-band multirate filter banks,” IEEE Trans. Acoust. Speech Signal Process., Vol. 33, pp. 626-630, 1985; M. J. T. Smith and T. P. Barnwell, “Exact reconstruction techniques for tree-structured subband coders,” IEEE Trans. Acoust. Speech Signal Process., Vol. 34, No. 3, pp. 434-441, 1986; P. P. Vaidyanathan, “Theory and design of M-channel maximally decimated quadrature mirror filters with arbitrary M, having perfect reconstruction property,” IEEE Trans. Acoust. Speech Signal Process., Vol. 35, No. 2, pp. 476-492, 1987; and M. Vetterli, “Filter banks allowing perfect reconstruction,” Signal Processing, Vol. 10, pp. 219-244, 1986.
The design of perfect reconstruction filter banks is related to the concept of “wavelets” in mathematical analysis. Wavelets have been defined as translates and dilates of a fixed function, and have been used to both analyze and represent general functions, as described in A. Grossman and J. Morlet, “Decomposition of Hardy functions into square integrable wavelets of constant shape,” SIAM J. Math. Anal., Vol. 15, No. 4, pp. 723-736, 1984, and Y. Meyer, Ondelettes et Opérateurs, I: Ondelettes, II: Opérateurs de Calderón-Zygmund, III: (with R. Coifman), Opérateurs multilinéaires, Paris: Hermann, 1990, English translation of first volume, Wavelets and Operators, Cambridge University Press, 1993. In the late 1980s, the introduction of multiresolution analysis provided a connection between wavelets and the subband filters used in filter banks, as described in S. G. Mallat, “Multiresolution approximations and wavelet orthonormal bases of L2(R),” Trans. Amer. Math. Soc., Vol. 315, No. 1, pp. 69-87, 1989. This led to the first construction of smooth, orthogonal, and compactly supported wavelets as described in I. Daubechies, “Orthonormal bases of compactly supported wavelets,” Comm. Pure Appl. Math., Vol. 41, pp. 909-996, 1988. Many generalizations to biorthogonal or semiorthogonal wavelets followed. Biorthogonality allows the construction of symmetric wavelets and thus linear phase filters as described in A Cohen, I. Daubechies, and J. Feauveau, “Bi-orthogonal bases of compactly supported wavelets,” Comm. Pure Appl. Math., Vol. 45, pp. 485-560, 1992; and M. Vetterli and C. Herley, “Wavelets and filter banks: Theory and design,” IEEE Trans. Acoust. Speech Signal Process., Vol. 40, No. 9, pp. 2207-2232, 1992. For further background information on wavelets and subband filters, see, for example, I. Daubechies, “Ten Lectures on Wavelets,” CBMS-NSF Regional Conf. Series in Appl. Math., Vol. 61, Philadelphia, Pa.: Society for Industrial and Applied Mathematics, 1992; M. Vetterli and J. Kova{hacek over (c)}ević, “Wavelets and Subband Coding,” Prentice Hall, Englewood Cliffs, N.J., 1995; and G. Strang and T. Nguyen, “Wavelets and Filter Banks,” Wellesley, Cambridge, 1996.
An important unsolved problem in filter bank design is how to generalize filter banks to process signals in multiple dimensions. One possible approach is to use tensor products of available one-dimensional solutions, which generally leads to separable filters in the filter bank. However, this approach only provides symmetry around the coordinate axes and rectangular divisions of the frequency spectrum, while in many applications nonrectangular divisions are more desirable and better preserve signal content. Several other approaches, both orthogonal and biorthogonal and using different types of signal lattices, have been proposed in J. Kova{hacek over (c)}ević and M. Vetterli, “Nonseparable multidimensional perfect reconstruction filter banks and wavelet bases for Rn,” IEEE Trans. Inform. Theory, Vol. 38, No. 2, pp. 533-555, 1992; S. D. Riemenschneider and Z. Shen, “Wavelets and pre-wavelets in low dimensions,” J. Approx. Theory, Vol. 71, No. 1, pp. 18-38, 1992; A. Cohen and I. Daubechies, “Non-separable bidimensional wavelet bases,” Rev. Mat. Iberoamericana, Vol. 9, No. 1, pp. 51-137, 1993; and A. Cohen and J.-M. Schlenker, “Compactly supported bidimensional wavelet bases with hexagonal symmetry,” Constr. Approx., Vol. 9, No. 2, pp. 209-236, 1993. However, these approaches are generally limited to dimensions of two or three, because the algebraic conditions which must be solved to determine filter characteristics become increasingly cumbersome in higher dimensions. Although systematic approaches based on the McClellan transform and cascade structures also exist, these and the other approaches noted above all fail to provide adequate techniques for designing filter banks in arbitrary dimensions.
Recently, a new approach to the study of wavelet construction was provided by the so-called “lifting scheme” as described in W. Sweldens, “The lifting scheme: A custom-design construction of biorthogonal wavelets,” Journal of Appl. and Comput. Harmonic Analysis, Vol. 3, No. 2, pp. 186-200, 1996; and W. Sweldens, “The lifting scheme: A construction of second generation wavelets,” <http://cm.bell-labs.com/who/wim>, both of which are incorporated by reference herein. These references show that in one-dimensional, two-channel applications, a filter bank may be designed using two lifting operations, a predict operation and an update operation, where the update filter for implementing the update operation is one half times the adjoint of the predict filter implementing the predict operation. While the original motivation for lifting was to build time-varying perfect reconstruction filter banks or so-called “second generation wavelets,” lifting turned out to have several advantages for classic time-invariant wavelet construction. In the time-invariant case, lifting has many connections to earlier approaches. The basic idea behind lifting is that there is a simple relationship between all filter banks that share the same lowpass or the same highpass filter. This relationship was observed in the above-cited M. Vetterli and C. Herley reference, and is sometimes referred to as the Herley-Vetterli lemma. Also, lifting leads to a particular implementation of the filter bank. This implementation is known as a ladder structure, as described in A. A. M. L. Bruekens and A. W. M. van den Enden, “New networks for perfect inversion and perfect reconstruction,” IEEE J. Selected Areas Commun., Vol. 10, No. 1, 1992. Recently, it was shown that all finite impulse response (FIR) filter banks fit into the lifting framework. See I. A. Shah and A. A. C. Kalker, “On ladder structures and linear phase conditions for multidimensional biorthogonal filter banks,” Preprint, Philips Research Laboratories, Eindhoven, Netherlands, and I. Daubechies and W. Sweldens, “Factoring wavelet and subband transforms into lifting steps,” Technical Report, Bell Laboratories, Lucent Technologies, <http://cm.bell-labs.com/who/wim>. Although the above-cited references indicate that the lifting approach is suitable for processing one-dimensional signals in two-channel filter banks, it has not been apparent whether or how lifting could be used for processing signals of arbitrary dimension, and in filter banks with M channels, where M is greater than two. Moreover, the above-cited references are primarily directed to techniques for breaking down an existing filter bank into the lifting framework, and thus fail to provide general techniques for designing a filter which has a desired response while also fitting within the lifting framework.
A need therefore exists for a general approach to building filter banks with any number of channels that may be used to process signals with any number of dimensions.
The present invention allows lifting techniques, heretofore considered applicable only to one-dimensional signals in two-channel filter banks, to be applied to multidimensional signals and to filter banks having M channels, where M is greater than two. An input signal to be filtered may be in the form of a lattice of sample points of dimension d. A given filter bank for filtering the input signal is built by first computing a set of shifts τi as D−1ti, i=1, 2, . . . M−1, where ti is a set of coset representatives taken from a unit cell of a sublattice representing a rotated portion of the input signal lattice, and D is a dilation matrix having a determinant equal to M. A polynomial interpolation algorithm, such as the well-known de Boor-Ron algorithm, is then applied to determine weights for a set of M−1 predict filters Pi having the shifts τi. A corresponding set of M−1 update filters Ui are then selected as Ui=P*i/M, where P*i is the adjoint of the predict filter Pi. The resulting predict and update filters are arranged in a lifting structure such that each of the predict and update filters are associated with a pair of the M channels of the filter bank. In various embodiments of the invention, M may be greater than or equal to two, while the input signal is in the form of a lattice of sample points having dimension d greater than or equal to two. Alternatively, M may be greater than two, and the input signal may be in the form of a lattice of sample points having dimension d greater than or equal to one.
An input signal applied to the filter bank is downsampled in each of the M channels, and then interpolated using the M−1 predict filters and the M−1 update filters. The filter bank includes a number of delay elements for delaying the input signal in each of the last M−1 of the M channels, such that a delay of ti is applied to the input signal in the ith channel, for i=1, . . . M−1, prior to downsampling the input signal in that channel. A downsampled and interpolated signal may be reconstructed using a second set of M−1 predict filters and a corresponding second set of M−1 update filters, where each of the predict and update filters in the second sets have the same weights as those in the first sets, and are also associated with a pair of the M channels of the filter bank. An upsampler in each of the M channels upsamples portions of the signal in each of the M channels, after processing in the second sets of predict and update filters, and a signal combiner recombines outputs of the M channels to generate an output signal having substantially the same resolution as the input signal.
The invention provides a general lifting-based technique for building filter banks that works in any dimension, for any type of signal lattice and any number of primal and dual vanishing moments. The invention provides the advantages commonly associated with other lifting-based techniques, such as custom design, in-place computation and integer-to-integer transforms, as well as computation speed, which is particularly important in multidimensional applications. The techniques of the present invention may be implemented in hardware, software or combinations thereof, and may be used in video signal processing as well as other any digital signal processing application in which signals are processed by filter banks. These and other features and advantages of the present invention will become more apparent from the accompanying drawings and the following detailed description.
The present invention will be illustrated below in conjunction with exemplary filter banks and signal lattices. It should be understood, however, that the disclosed techniques are suitable for use with a wide variety of other types of signals and in numerous diverse signal processing applications. The term “filter bank” as used herein refers generally to a device for processing a set of samples of a signal. A “lattice” is a representation of signal sample points in one or more dimensions. A “sublattice” is a subset of the sample points of a lattice, and is generally shown in the sampled domain, that is, without other signal sample points which are removed as a result of a downsampling operation. The sublattice typically represents a rotated portion of its corresponding signal lattice. A sublattice is used to identify a particular “neighborhood,” where a neighborhood refers generally to a local subset of sample points in the corresponding sublattice, which is located around a point to be interpolated in a filter bank. A neighborhood is also referred to herein as a “ring.”
The present invention provides general techniques for constructing filter banks which may be used to process signals with any number of dimensions, any lattice representation and any number of primal and dual vanishing moments. The invention provides a generalization to multiple dimensions and multiple channels of the one-dimensional, two-channel lifting technique described in the above-cited W. Sweldens references. As will be described in greater detail below, the invention allows lifting techniques with nonseparable filters to be applied with multidimensional signals and in M-channel applications. More particularly, it will be shown that filter banks constructed in accordance with the invention may be used to process signals in multiple dimensions and M channels, and that a given update filter may be selected as the adjoint of the corresponding predict filter divided by M. The predict filter can be constructed using, for example, a polynomial interpolation algorithm such as the de Boor-Ron algorithm for multivariate polynomial interpolation. The invention provides the advantages commonly associated with other lifting-based techniques, such as custom design, in-place computation and integer-to-integer transforms, as described in more detail in R. Calderbank, I. Daubechies, W. Sweldens, and B.-L. Yeo, “Wavelet transforms that map integers to integers,” Technical Report, Department of Mathematics, Princeton University, 1996, and speed, which is particularly important in multidimensional applications and is described for the one-dimensional case in I. Daubechies and W. Sweldens, “Factoring wavelet and subband transforms into lifting steps,” Technical Report, Bell Laboratories, Lucent Technologies, <http://cm.bell-labs.com/who/wim>.
The notation used herein to describe the invention will now be summarized. A signal x is a sequence of real-valued numbers indexed by the index K:
x={xk∈R|k∈K}∈RK.
In general, K can be either a finite set or an infinite set. The following description will focus on signals defined on a lattice in a d-dimensional Euclidean space such that K=Zd. A sequence is said to be finite if only a finite number of the xk are nonzero. A sequence x is of finite energy, x∈l2=l2(Zd), if
The inner product <x, y> of a pair of sequences x and y of finite energy l2 is defined as:
Linear operators A: l2→l2 are often used. The adjoint or transpose A* of a linear operator A is defined such that
<Ax,y>=<x,A*y> for x,y∈l2
Let π(x) be a multivariate polynomial, with x∈Rd. The notation π(Zd) or simply π will denote the sequence formed by evaluating the polynomial π(x) on the lattice Zd:
π=π(Zd)={π(k)|k∈Zd}
Πn will denote the space of all polynomial sequences of total degree strictly less than n.
The notation for one-dimensional filters will now be described. A linear operator A which is time invariant will be referred to herein as a filter. Its action is then simply convolution with the impulse response sequence {ak|k∈K},
The following description will assume that the impulse response is finite, i.e., A is a finite impulse response (FIR) filter. This implies that the action of a filter on a polynomial sequence is well defined. The Fourier series of the impulse response sequence is
The z-transform of the impulse response is then the Laurent polynomial, with z=eiw:
It should be noted that capital letters are used herein to denote operators as well as the Fourier transform of sequences. The meaning will be clear from the context.
Filters described herein may be characterized as using differentiation with respect to ω. To simplify the notation, it is convenient to define a scaled version of the differentiation operator as
This symbol is also used with the z notation. It is important to keep in mind that the differentiation is with respect to ω and that z is simply a place holder for eiω. For example, zα is simply eiαω and there is no ambiguity even if α is non-integer. This leads to the following:
∇zα=αzα for α∈R.
In addition,
Consequently,
∇nA(zα)=αnA(n)(zα).
In the case of multidimensional filters, where K=Zd, the description given above for the one-dimensional case is modified to utilize a so-called multi-index notation in which an index k∈Zd represents a vector (k1, . . . , kd) where kj∈Z. Similarly, z represents a vector (z1, . . . , zd), n a vector (n1, . . . , nd), and α a vector (α1, . . . , αd). Now kn should be understood as
and similarly zα is
The differentiation operator ∇ is now given by
Similarly,
Given the above, it is still true, as in the one-dimensional case, that
as well as that
∇nA(zα)=αnA(n)(zα). (4)
Note that the above equation is a vector equation. The following description will also use 1 to denote a vector (1, . . . , 1)∈Zd. The size of a multi-index is defined as
so that the degree of the monomial
xn=x1n
is |n|.
The notation for lattices and sublattices will now be described. If D is a d×d matrix with integer coefficients, a sublattice of K=Zd may be found as DZd. As noted above, a sublattice is a representation in the sampled domain of a subset of the sample points in a lattice. The determinant of D is an integer denoted by M, where |D|=M∈Z. Then there are (M−1) distinct cosets each of the form DZd+tj with tj∈Zd and 1<j<M−1. Zd may be written as
where to=0 and the union is disjoint. The choice of the tj can be made unique by choosing D−1tj to be in the unit hypercube [0, 1]d. Given a lattice L=DZd+t, a polynomial on this lattice can be sampled as
π(L)={π(Dk+t)|k∈Zd}
Note that
π(L)∈Rz
and it is thus a sequence indexed by Zd and not by L.
Since the following description will be dealing with upsampling in multiple dimensions, zD is defined where z is a complex vector and D is a matrix. Thus
zD={zd
where di is the ith column vector of matrix D and zdi is as defined in Equation (2).
In some applications, data may be sampled on a more general lattice K=ΓZd, where Γ is an invertible d×d matrix. A typical example is a two-dimensional triangular lattice where
One can now find sublattices by simply premultiplying D×Γ. Given that the illustrative filter construction of the invention relies on polynomial interpolation and that polynomial spaces of fixed degree are invariant under affine transforms, it may be assumed without loss of generality that Γ is the identity matrix. Γ can play a role in choosing neighborhoods for interpolants, as will be described in more detail below.
A multidimensional filter is defined herein as an interpolating filter when hDk=δk. Taking the z-transform of hk and summing it along all the cosets, the only term left on the original lattice is 1, since all others are zero by definition. Then, in the two-channel case the above definition is equivalent to
H(z)=1+zt
These filters are also called halfband filters. In the M-channel case, an interpolating filter can be written as
and these filters are called Mth-band filters. The filters H and {tilde over (H)} are biorthogonal in the case that H(z){tilde over (H)}(1/z) is interpolating.
In the following, a particular class of filters is introduced which may be used to construct filter banks in accordance with the invention. These filters are closely related to polynomial interpolation, and are therefore referred to herein as Neville filters. See, for example, the description of the Neville algorithm in J. Stoer and R. Bulirsch, “Introduction to Numerical Analysis,” New York: Springer Verlag, 1980. It will be shown how Neville filters are related to other existing filter families such as Coiflets as well as Lagrangian halfband or Deslauriers-Dubuc filters. Moreover, filters from filter families other than those noted may be used in place of or in conjunction with Neville filters in interpolating filter banks in accordance with the invention.
A Neville filter applied to a polynomial of a certain degree sampled on a lattice results in that same polynomial, but now sampled on a lattice which is offset by τ with respect to the original lattice. This leads to the following definition. A filter P is a Neville filter of order N with shift τ∈ Rd if
Pπ(Zd)=π(Zd+τ) for π∈ΠN (6)
An important thing to note is that τ need not be an integer vector. The Neville condition of a filter can be related back to its impulse response {pk}. Writing out Equation (6) for a monomial xn yields
Given that polynomial spaces are shift invariant it is sufficient to impose the following condition for l=0:
where the left-hand side according to Equation (3) is equal to P(n)(1). It then follows that a filter P is a Neville filter of order N with shift τ if and only if its impulse response satisfies:
It can then be shown that the adjoint of a Neville filter, that is, the filter obtained by time-reversing its impulse response, is also a Neville filter. In other words, if a filter P is a Neville filter of order N with shift τ, then P* is a Neville filter of order N with shift −τ. It can also be shown that if P is a Neville filter of order N with shift τ, and P′ is a Neville filter of order N′ with shift τ′ then PP′ is a Neville filter of order min(N, N′) and shift τ+τ′. This shows that Neville filters of a fixed order form a group. Finally, If P is a Neville filter of order N with shift τ, then Q, where Q(z)=P(zD), is a Neville filter of order N with shift Dτ.
Building Neville filters in d dimensions with a certain prescribed order N and shift τ involves polynomial interpolation. There are
equations like (7) to satisfy, so one would expect to need f=q filter taps to construct the Neville filter. Given that τ is in the above-defined unit hypercube, the f filter taps should be located around the unit hypercube in a symmetric neighborhood. Finding the Neville filters is then equivalent to solving a q×q linear system. In the one-dimensional case, this system has a Vandermonde matrix and is always invertible. A very easy and fast method to find the solution is the Neville algorithm, described in the above-cited J. Stoer and R. Bulirsch reference. In the multidimensional setting, the situation becomes much more complicated and the linear system is not necessarily solvable. It can be either overdetermined or underdetermined, so to achieve order N one may need either more than q or less than q filter taps. For example, three points in R2 in general define a linear polynomial, unless they lie on a line, in which case they define a quadratic polynomial. Thus the degree of interpolation does not only depend on the number of interpolation points but also on the arrangement of the points. It is not clear a priori how many interpolation points one needs to obtain a certain degree. However, a solution may be obtained using a technique described in C. de Boor and A. Ron, “Computational aspects of polynomial interpolation in several variables,” Math. Comp., Vol. 58, pp. 705-727, 1992 and C. de Boor and A. Ron, “On multivariate polynomial interpolation,” Constr. Approx., Vol. 6, pp. 287-302, 1990, both of which are incorporated by reference herein. This technique involves first fixing a point configuration of f points in Rd and then finding the correct polynomial space and order N to define interpolation, and provides an algorithm to compute the interpolant. The present invention will make use of this de Boor-Ron algorithm to compute Neville filters. The computation will first fix a neighborhood around τ and then compute the order N. If the order is insufficient, the neighborhood will be enlarged.
It should be noted that a number of well-known filters may be characterized as Neville filters. For example, the identity filter, where the impulse response is a Kronecker delta pulse, is a Neville filter of order infinity with shift 0. Similarly, an impulse response zk with k∈Zd is a Neville filter of order infinity with shift k. This illustrates that a Neville filter with any desired shift can be constructed by multiplying a Neville filter with a shift in the unit hypercube by the appropriate power of z. In addition, the Deslauriers-Dubuc interpolating subdivision technique uses filters which can predict the values of a polynomial at the half integers given the polynomial at the integers, as described in G. Deslauriers and S. Dubuc, “Interpolation dyadique,” in “Fractals, dimensions non entières et applications,” pp. 44-55, Paris: Masson, 1987. These filters are thus Neville filters with shift ½. Lagrangian halfband filters, as described in P. Vaidyanathan, “Multirate Systems and Filter Banks,” Englewood Cliffs, N.J.: Prentice Hall, 1992, are interpolating filters with a prescribed number of zeros at −1. Because H(z)=1−H(−z), the filter has the same order of flatness at 0. Their odd polyphase component coincides with a Deslauriers-Dubuc filter. It can be shown that if H(z)=1+z−1 P(z2) and P is a Neville filter of shift ½ then His a Neville filter of the same order and 0. It thus is a Lagrangian halfband filter.
It is also known that the lowpass orthogonal Daubechies filters, as described in I. Daubechies, “Orthonormal bases of compactly supported wavelets,” Comm. Pur Appl. Math., Vol. 41, pp. 909-996, 1988, of order 2 or more satisfy H(2)(1)=(H(1)(1))2. See also W. Sweldens and R. Piessens “Quadrature formulae and asymptotic error expansions for wavelet approximations of smooth functions,” SIAM J. Numer. Anal., Vol. 31, No. 4, pp. 1240-1264, 1994; and R. A. Gopinath and C. S. Burrus, “On the moments of the scaling function,” in Proceedings ISCAS, pp. 963-966, IEEE, 1992. In fact, this is true for any orthogonal lowpass filters having an order of at least 2. Consequently, these filters are Neville filters of order 3 with a shift corresponding to their first moment H(1)(1). As another example, Coiflets are filters H with zero moments up to N, excluding the zeroth moment which is one, H(n)(1)=δn for n<N. They are thus Neville filters with shift zero. Consequently polynomials of degree less than N are eigenfunctions of Coiflets. It can be shown that the autocorrelation of any Neville filter is a Coiflet of the same order. Neville filters are also closely related to the one-point quadrature formula from the above-cited W. Sweldens and R. Piessens reference. Basically, if a Neville filter has order N then the one-point quadrature formula for the corresponding scaling function has degree of accuracy N+1. The ideal Neville filter with shift τ and order infinity is the allpass filter zτ. However, it is not an FIR filter unless τ is an integer vector. One has to be extremely careful in applying such a filter to a polynomial sequence as the summation only converges conditionally. These filters are thus of limited practical use but can be thought of as the limiting case for FIR Neville filters with fixed τ as N goes to infinity.
H=(↓D)H, G=(↓D)G.
where ↑ denotes downsampling. Similar operations are defined for duals H* and G* of H and G, respectively, such that H*=H*(↑D) and G*=G*(↑D), where ↑ denotes upsampling. Ideally, the filter bank 10 should provide perfect reconstruction, such that analysis followed by synthesis gives the identity
H*{tilde over (H)}+G*{tilde over (G)}=1, (9)
and synthesis followed by analysis is the identity as well, or
{tilde over (H)}H*=1, {tilde over (H)}G*=0, {tilde over (G)}H*=0, {tilde over (G)}G*=1.
This implies that H*{tilde over (H)} is a projection operator and the filter bank 10 thus provides splitting in complementary subspaces.
The interaction of filter bank 10 with a polynomial sequence will now be described. Note that since all of the filters have been assumed to be FIR filters, they may be permitted to operate on polynomial sequences. This will lead to the definition of two important characteristics of a filter bank, the number N of primal vanishing moments and the number Ñ of dual vanishing moments. These correspond to the degree of polynomials which are filtered out by the highpass filters in filter bank 10. A filter bank has N primal and Ñ dual vanishing moments if
{tilde over (G)}π=0 for π∈ΠÑ, and Gπ=0 for π∈ΠN.
Using perfect reconstruction, the above implies that H*{tilde over (H)}π=π. If the highpass filter filters out certain polynomials, these polynomials are preserved only in the lowpass branch of the filter bank. Similarly, the primal moment condition implies that π*H*{tilde over (H)}=π*. The number of primal vanishing moments concerns the degree of the moments of an input sequence that are preserved by the lowpass branch, or equivalently the number of zero moments of elements in the highpass branch. As expected, the mean or DC component of the signal appears only in the lowpass branch. In summary, it is desirable for a filter bank such as that shown in
As noted above, several methods have been proposed in the prior art to build filter banks that exhibit the PR, DM and PM properties. Typically, these prior art techniques attempt to satisfy all three conditions at once, which leads to cumbersome algebraic conditions, particularly in high dimensions. An important attribute of the lifting techniques of the present invention is that they permit each of the properties to be satisfied separately. First, every filter bank constructed using lifting automatically has PR, so this property need not be further addressed. In accordance with the invention, a filter bank may be constructed by starting with a trivial filter bank and then using lifting steps to enhance its properties. An embodiment of the invention to be described below uses only two lifting steps. The first one, referred to as “predict,” provides the above-noted DM property, while the second, referred to as “update,” provides the PM property. It will be shown how each of these two lifting steps can be designed separately.
As previously noted, the filter bank 30 automatically provides the PR property for a given input signal. The filter bank 30, which is not time invariant due to the presence of downsampling, becomes time invariant in the polyphase domain. In the first lifting step, the predict filter P is used to predict the odd samples from the even ones. The even samples remain unchanged, while the result of the predict operation applied to the even samples in filter P is subtracted from the odd samples in signal combiner 38 to yield the highpass or wavelet coefficients. The predict filter P should be designed such that if the input sequence is a polynomial sequence, then the prediction of the odd samples is exact, the highpass coefficients are zero, and the DM property is satisfied. In the second lifting step, the update filter U is used to update the even samples based on the previously-computed highpass or wavelet coefficients. The update filter U is therefore designed to satisfy the PM property. The polyphase matrix M can be identified from
The inverse adjoint polyphase matrix is
From the polyphase decomposition, the filter H may be found as
H(z)=He(zD)+zt
It then follows that H is interpolating since He(z)=1. The DM condition now becomes
{tilde over (G)}π(Zd)={tilde over (G)}eπ(DZd)+{tilde over (G)}oπ(DZd+t)=0.
Substituting values from the polyphase matrix of Equation (11) yields:
Pπ(DZd)=π(DZd+t).
Given that this has to hold for all π∈ΠN and that this space is invariant under D, this is equivalent to
Pπ(Zd)=π(Zd+D−1t), for all π∈ΠN. (13)
It can therefore be seen that to satisfy DM, the predict filter P has to be a Neville filter of order Ñ and shift τ=D−1t. Similarly, from Equation (12) the PM condition becomes
Gπ(Zd)=Geπ(DZd)+Goπ(DZd+t)=−U*π(DZd)+(1−U*P*)π(DZd+t)=0
for π∈ΠN. From the foregoing it is known that P is a Neville filter of order Ñ and shift τ. Therefore if N≦Ñ, the PM condition is equivalent to
−U*π(DZd)+π(DZd+t)−U*π(DZd)=0
or
2U*π(DZd)=π(DZd+t).
Thus 2U has to be a Neville filter of order N with shift −τ=−D−1t. The filter U may be selected as ½ times the adjoint of a predict filter with order N. More generally, if Pm is a family of Neville filters of order m and shift τ=D−1t and if N≦Ñ, one can construct a filter bank of the type shown in
As noted above, the present invention allows a lifting technique using nonseparable filters, such as the lifting technique illustrated in FIG. 2 and previously applied only to one-dimensional signals in two-channel filter banks, to be applied to multidimensional signals and to filter banks having M channels, where M is greater than two. The invention will now be illustrated for applications in which two-channel lifting filter banks, such as filter bank 30 of
which is also called the “symmetry” matrix. The coset representative in this example is t1=(1, 0). In the lattice diagram of FIG. 4A and other lattice diagrams to be described herein, the solid circles represent sample points to be processed in the filter bank, while the open circles represent sample points which will be dropped as a result of downsampling. The shaded region of the lattice diagram corresponds to the unit cell of a sublattice of the quincunx lattice of
The number in the second row of TABLE 1 identifies a particular neighborhood in FIG. 4B. The numbers in parentheses after the neighborhood number in the second row of TABLE 1 give the number of points in the corresponding ring, as is apparent from FIG. 4B. The “taps” column specifies the number of taps in the Neville filter P and the “order” column specifies the interpolation order in the de Boor-Ron algorithm.
Using the results of the computations in TABLE 1, an exemplary filter bank may be constructed with Ñ=4 and N=2. According to the above description,
where the update U is
U(z1,z2)=½P2*(z1,z2)=½3(1+z1−1+z2−1+z1−1z2−1)*=½3(1+z1+z2+z1z2).
A two-channel lifting filter bank of the type shown in
{tilde over (H)}(z1,z2)={tilde over (H)}e(zD)+z−t1{tilde over (H)}o(zD)={tilde over (H)}e(z1z2,z1/z2)+z−t1{tilde over (H)}o(z1z2,z1/z2),
{tilde over (G)}(z1,z2)={tilde over (G)}e(z1z2, z1/z2)+z−t1{tilde over (G)}o(z1z2, z1/z2).
Using Equation (11) results in:
{tilde over (H)}(z1, z2)=1−U(z1z2, z1/z2)P(z1z2, z1/z2)+z1−1U(z1z2, z1/z2),
{tilde over (G)}(z1, z2)=1−P(z1z2, z1/z2).
The synthesis/primal filters may be obtained in a similar manner. It can be verified that
∇n{tilde over (G)}(z1, z2)|(0,0)=0 for 0≦n<Ñ=4 and ∇n G(z1, z2)|(0,0)=0 for 0≦n<N=2.
The coset representative in this example is t1=(1, 1, 1). The shaded region in
As in TABLE 1, the number in the second row of TABLE 2 identifies a particular neighborhood in
The above-described quincunx and FCO lattices are special cases of the so-called Dn or checkerboard lattice, described in J. Conway and N. Sloane, “Sphere Packings, Lattices and Groups,” Springer-Verlag, 1992. In the general d-dimensional case, the first neighborhood contains 2d points and the second neighborhood contains d2d points. The corresponding Neville filter weights are given in TABLE 3 below.
As noted above, the invention allows lifting techniques to be applied to M-channel filter banks. In these embodiments, |D|=M and there are M−1 cosets of the form DZd+tj where tj∈Zd. There are M polyphase components which will be designated using the subscripts 0 to M−1, such that
As in the above description, two lifting operations are used for each of the i channels, 1≦i≦M−1: a predict operation provided by a predict filter designated Pi and an update operation provided by an update filter designated Ui.
The polyphase matrix M characterizing the M-channel filter bank 100 of
with an adjoint inverse given by:
The condition for dual vanishing moments (DM) can now be found, in a manner similar to that used in the above-described two-channel case, as
Substituting values from the jth row of the polyphase matrix M yields:
Pjπ(DZd)=π(DZd+tj) for π∈ΠN, j=1, . . . , M−1.
This implies that the predict filter Pj is a Neville filter of order N with shift τj=D−1tj. Similarly, the primal moments condition, using the jth row of M*−1, becomes
Given that predict filter Pj is a Neville filter of order N with shift τj, the above condition, if N≦Ñ, becomes:
MU*jπ(DZd)=π(DZd+tj) for π∈ΠN.
Consequently, MU*j is a Neville filter of order N with shift τj. More generally, if Pjm is a family of Neville filters of order m and shift D−1tj, and if N≦Ñ, a filter bank with N primal vanishing moments and Ñ dual vanishing moments can be constructed by letting the predict filters be PjÑ and choosing the update filters Uj as PjN/M. The same remarks from the two-channel case above regarding the choice of U and the N≦Ñ restriction are also applicable to the M-channel case.
A number of examples will now be provided to illustrate M-channel filter banks in accordance with the invention, where M>2. In a one-dimensional, M-channel embodiment, the coset representatives are ti=i, 0≦i<M, and the corresponding shifts are τi=i/M. Thus Pi is a Neville filter with shift i/M. Given that τi=1−τM-i, the predict filter Pi(z)=zPM-i(1/z). TABLES 4 and 5 below give the predict filters P1 for three-channel and four-channel filter banks in the one-dimensional case. TABLE 4 shows the predict filters P1, while P2(z)=z P1(1/z). TABLE 5 gives the predict filters P1, while P2 corresponds to Deslauriers-Dubuc predict filters, and P3(z)=z P1 (1/z).
The coset representatives taken from the unit cell are:
t0=(0, 0), t1=(1, 0), t2=(0, 1), t3=(1, 1),
with the corresponding shifts τi given by:
τ1=(½, 0), τ2=(0, ½), τ3=(½, ½),
It should be noted that for the predict filters P1 and P3 the neighborhoods in the sampled domain, as shown in
Another example of a filter bank with M=4 channels for processing a signal in the form of a lattice of dimension d=2 will be described in conjunction with
The coset representatives taken from the unit cell of
Another example of a filter bank with M=4 channels for processing a signal in the form of a lattice of dimension d=2 will be described in conjunction with
As in the case of the separable lattice of
An example of a filter bank with M=3 channels for processing a signal in the form of a lattice of dimension d=2 will be described in conjunction with
The coset representatives taken from the unit cell of
An implementation of a filter bank in accordance with the invention will now be described in greater detail. The input signal is a sequence x={xk|k∈K} and the filter bank generates M output sequences: one low pass sequence s={sk|k∈K}, and M−1 high pass sequences di={di;k|k∈K}. Each sample of the output sequences can be identified with a unique sample of the input sequence: the sk correspond to xDk and the di;k correspond to xDk+ti. This correspondence combined with the above-described lifting steps allows for an in-place computation: instead of allocating new memory for the s and di sequences, it is possible to simply overwrite the corresponding elements in the x sequence. Lifting guarantees that a sample which is needed later will not be overwritten. It will be assumed that the impulse responses of the predict filter Pi and the update filter Ui are given by pi;k and ui;k, respectively. The predict and update sequences are non-zero in a finite symmetric neighborhood around the origin N: pi;k≠0 for k∈N⊂K. The filter bank can now be described by the following analysis and synthesis pseudocode:
Analysis
for 1≦i≦M−1:
for 1≦i≦M−1:
for 1≦i≦M−1:
for 1≦i≦M−1:
In order to illustrate how much the lifting process can speed up the operation of a filter bank, consider the polyphase matrix used above to characterize the filter bank 100 of FIG. 7. To estimate the cost of implementing the polyphase matrix in the lifting framework of the present invention compared to a conventional implementation, assume that the computational cost of each predict and update filter is the same and equal to C. The computational cost of a lifting implementation, which corresponds to Equation (14) above describing of the polyphase matrix of filter bank 100, is then 2(M−1)C. The cost of the standard implementation, which corresponds to Equation (15) above, is the cost of the lifting implementation plus the cost associated with the top left element in the matrix of Equation (15):
If the above formula is used to implement a filter corresponding to this element, the cost is about 2(M−1)C. An alternative is to expand the expression of the element, and then apply a corresponding expanded filter. Assuming that the neighborhoods of the predict and update operators are given roughly by balls in d dimensions, the cost of the expanded filter will be 2dC. For applications with low dimensions and a high number of subbands, this alternative implementation may be less costly. Combining the above, it can be seen that the speed-up factor provided by lifting is roughly equivalent to:
If 2d-1>M−1, that is, in cases in which there are either high dimensions or a low number of subbands, the speed-up factor is two. Otherwise, it is lower than two. For low dimensions with a high number of sidebands, the factor becomes insignificant. Separable lattices, where M=2d, represent an intermediate case in which the speed-up factor is about 1.5.
The above-described embodiments of the invention are intended to be illustrative only. Numerous alternative embodiments within the scope of the following claims will be apparent to those skilled in the art.
This application is a division of pending application Ser. No. 09/253,410, filed Feb. 17, 1999 now U.S. Pat. No. 6,249,797, as a division of application Ser. No. 08/902,557, filed Jul. 29, 1997, and now issued as U.S. Pat. No. 6,018,753.
Number | Name | Date | Kind |
---|---|---|---|
4799179 | Masson et al. | Jan 1989 | A |
5436940 | Nguyen | Jul 1995 | A |
5568142 | Velazquez et al. | Oct 1996 | A |
5745187 | Hulyalkar et al. | Apr 1998 | A |
6018753 | Kovacevic et al. | Jan 2000 | A |
Number | Date | Country | |
---|---|---|---|
20010011290 A1 | Aug 2001 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09253410 | Feb 1999 | US |
Child | 09794865 | US | |
Parent | 08902557 | Jul 1997 | US |
Child | 09253410 | US |