The present application relates to digital filters. More particularly, the present application relates to infinite impulse response filters in a parallel-input parallel-output form.
Digital filtering is widely used in digital signal processing (DSP) applications. In DSP, a sequence of digital samples (e.g., numbers represented in binary format) represents an actual analog signal within a system (e.g., a communications system). The samples are taken at a generally fixed rate, referred to as the sample rate. Conversion between the digital samples and analog signals can be performed by analog to digital converters (ADCs) for signals entering the DSP subsystem, and by digital to analog converters (DACs) for signals exiting the DSP subsystem.
In a number of DSP applications, finite impulse filters (FIR) are quite popular. An FIR filter is a feed-forward filter form, in which an impulse input results in a finite time duration filter output sequence. FIR filters are popular, in part, as a number of efficient filter implementation forms are known. Many different design techniques for FIR filters are also available, making FIR filters generally easy to design and integrate into a DSP application. Polyphase FIR filters are also known which can be used to perform sample rate conversions, such as decimating (downsampling) and interpolating (upsampling).
In contrast to an FIR filter, an infinite impulse response filter (IIR) uses a feed-back filter form, in which an impulse input results in an infinite time duration filter output sequence. IIR filters have been less frequently used in DSP due to a number of factors. One factor is that the use of feedback complicates the use of pipelining techniques sometimes used to increase throughput in DSP systems. Design tools and techniques for digital IIR filters have also lagged somewhat behind those available for digital FIR filters. IIR filters can, however, provide advantages over FIR filters in some applications. For example, an IIR filter can provide superior filter characteristics (e.g., ripple, transition bandwith, etc.) to an FIR filter using similar amounts of DSP resources.
In many applications (e.g. software defined radio technology used in communications systems) high digital filter processing rates are desired. Very high sample rates can present a number of challenges to the design of digital filters. For example, even when a low output rate of a filter is desired (e.g., because a high decimation factor is present), many input samples must be stored and processed by the filter. Conversely, an interpolating filter may have a low input sample rate, but be required to produce a high output sample rate. Thus, portions of a digital filter typically operate at either the input sample rate or the output sample rate.
Unfortunately, ADC and DAC speeds can outpace the processing rates achievable in some DSP chips (DSPCs) and field programmable gate arrays (FPGAs). While the processing speed of programmable gate arrays, custom hardware, and processors continues to improve each year, sample rate requirements and available ADC and DAC speeds have also been increasing. Thus, there is often a mismatch between the input or output samples rates desired, and the processing rates which can be achieved using straightforward DSP techniques.
It has been recognized that it would be advantageous to develop a parallel-input parallel-output infinite impulse response filter.
In some embodiments of the invention an infinite impulse response (IIR) filter has a parallel-input and a parallel-output. The IIR filter can include a first finite impulse response filter having a parallel input and a parallel output, a plurality of delays coupled to the first finite impulse response filter, and a plurality of summers coupled in a feed-back arrangement to the plurality of delays and to the first finite impulse response filter so as to form an all-pole filter. The IIR filter can also include a second finite impulse response filter having a parallel input and a parallel output to form an all-zero filter. The first finite impulse response filter and the second finite impulse response filter can be coupled to form an infinite impulse response filter having both poles and zeros.
In some embodiments of the invention a method for performing a parallel-input parallel-output infinite impulse response filter can include accepting a plurality of N input samples in a parallel form and performing, in parallel, a plurality of finite impulse response filtering operations to form a plurality of M first intermediate samples in parallel form. The first intermediate samples can be an all-zero filtered version of the input samples. Another operation can be summing the plurality of first intermediate samples with a plurality of M second intermediate samples, in parallel, to form a plurality of M output samples in parallel form. The second intermediate samples can be obtained by performing, in parallel, a plurality of finite impulse response filtering operations on the plurality of output samples, with appropriate delays, so that the plurality of output samples are an all-pole filtered version of the first intermediate samples.
In some embodiments of the invention a method for performing a parallel-input parallel-output infinite impulse response filter can include accepting a plurality of N input samples in a parallel form and summing the plurality of N input samples with a plurality of N first intermediate samples to form a plurality of N second intermediate samples in parallel form. Another operation can be performing in parallel a plurality of finite impulse response filtering operations on the plurality of N second intermediate samples to form the plurality of N first intermediate samples, in parallel form, and with appropriate delays so that the N second intermediate samples are an all-pole filtered version of the input samples. The method can also include performing in parallel a plurality of finite impulse response filtering operations on the plurality of second intermediate samples to provide a plurality of M output samples, in parallel form, that are an all-zero filtered version of the second intermediate samples.
Additional features and advantages of the invention will be apparent from the detailed description which follows, taken in conjunction with the accompanying drawings, which together illustrate, by way of example, features of the invention; and, wherein:
Reference will now be made to the exemplary embodiments illustrated in the drawings, and specific language will be used herein to describe the same. It will nevertheless be understood that no limitation of the scope of the invention is thereby intended. Alterations and further modifications of the inventive features illustrated herein, and additional applications of the principles of the inventions as illustrated herein, which would occur to one skilled in the relevant art and having possession of this disclosure, are to be considered within the scope of the invention.
The challenges addressed by high sample rates in an FIR filter have been addressed to some extent using parallel processing techniques. For example, the above-mentioned Parent Case describes various implementations of a block parallel FIR filter. The block parallel filter can provide for higher processing throughput in part because samples are input in a parallel form and output samples are also provided in a parallel form.
Parallel input refers to providing more than one sample simultaneously at the input to the filter, in contrast to conventional DSP techniques were one sample at a time is provided as input. Similarly, parallel output refers to providing more than one sample simultaneously at the output of the filter. It should be noted that thus use of parallel-input here is distinct from so-called parallel data, where, for example a single 8-bit sample is provided as 8-binary values on 8 separate lines which are all valid on the same clock edge. In contrast, parallel-input here means that N samples (N≧2) or data words are provided simultaneously. For example, if N=2, and each sample has 8 bits of resolution, a total of 16 data lines can be used (8 for each sample). Of course, various values for N can be used, such as N=2, 3, 4, 8, etc.
By using parallel input and parallel output, input and output from the filter can occur at clock rates lower than the overall sample rate. For example, using a parallelism of N=4, the filter clock rate can be one fourth of the sample rate. This can allow for processing of a relatively high sample rate using circuitry which operates at a lower clock rate. For example, multiplications and other operations in the filter can be performed in parallel using a number of processing components (e.g. multipliers, adders, etc).
In contrast to an FIR filter, however, providing a parallel-input parallel-output form IIR filter is less straightforward because of the feed-back terms. To maintain proper timing relationships of various terms within the filter, various delay elements can be used within the feed-back path.
Operation of the parallel-input parallel-output IIR filter can be described mathematically. In general, an IIR filter can be described by the difference equation:
where x(n) are the input samples (n being the sample number or time index) and y(n) are the output samples. K represents the number of taps in the feed-forward filter, which has coefficients {b(k)} and L represents the number of taps in the feed-back filter, which has coefficients {a(k)}.
The system function, i.e., the z transform of the impulse response h(n), can be defined as
The filter can thus be described by the system function H(z), by the all-zero (feed-forward) and all-pole (feed-back) components, B(z) and A(z), respectively, or by the time-domain feed-forward and feed-back taps, b(k) and a(k), respectively. All of the descriptions are equivalent mathematically.
There are several different arrangements in which the filter can be implemented. For example, we can re-write Equation (2) as:
which is equivalent to the following time-domain convolution equation:
y(n)=x(n)*b(n)−y(n−1)*ã(n), (4)
with impulse response ã(n)={a(1), a(2), . . . , a(L)}. This is shown in block diagram form in
A parallel-input, parallel-output implementation of the filter of
Because the inputs 302 are provided in parallel form, the feed-forward filter 304 can be decomposed as a polyphase filter. In general, the feed-forward filter may have N inputs 302 in parallel and M outputs 310 in parallel. For the case where N=M, the filter can be expressed as:
where Ui(z) is i-th phase output of the feed-forward filter, which is the z-domain of ui(n)=u(Mn−i), and Gi(z), the z-domain representation of gi(n), are the polyphase components of the feed-forward filter B(z). The polyphase components can be obtained from the filter response as will now be described.
In a polyphase filter, the polyphase filter can expanded as the sum of a number of sub-filters which can be referred to as polyphase components. This expansion can be particularly useful in decimation and interpolation operations as discussed further below in this text. First, for the case of an FIR filter, the response can be represented as follows:
or
where Hi(z) are the polyphase components of the filter H(z).
If desired, the input-output relationship can be obtained in the z-domain:
or
in which the delays are combined with the input signal. From an implementation standpoint, delays can be provided by various means, including for example shift registers, memory distributed within an FPGA, memory chips, and the like. As noted above, delays can be provided at the input, or at the output of the filter.
The order (or number of taps) of each of the polyphase sub-filters is K/M where K is the number of taps in the original filter h(n) and M is the expansion factor.
The filter components, Hi(zM), introduced in Equation (4) and are defined via the z-transform as follows:
where h(n) are the FIR filter coefficients. Hi(z) for i=0, 1, . . . , M−1 are the polyphase components and are related to the time domain filter coefficients h(n), n=0, 1, . . . , M−1 by hi(n)=h(Mn+i) or, explicitly stated, by
For the case where upsampling is performed, the filter components are an interpolated version (by interpolation factor of M) of the polyphase filter components (see Equations (13) and (14) below). For example, the non-zero coefficients in Equation (13) are separated by M−1 zeros, which is identified in the z-domain by the zM instead of z.
Some specific examples will now be provided. One implementation of a parallel FIR filter for the case where M=N=6 is depicted in
The filter 400 accepts parallel-input samples x(6n) . . . x(6n−5) 402 which are processed in parallel by a plurality of filter segments 404 or phases. The outputs of the filter segments are summed by a plurality of summers 406 to produce parallel-output samples u(6n) . . . u(6n−5) 408. A set of parallel filtering operations is performed at one-sixth the sample clock rate. Thus, the effect of the z−1 delays is to produce a delay of six sample clocks.
If desired, a change in the sample rate can be produced by increasing or decreasing the number of filter segments (phases). For example, for a decrease in sample rate, N>M, M can be set equal to an integer N/D where D is the decimation factor. For example, D can be set to be an integer 2, 3, 4, etc. For an increase in sample rate, N<M, M can be set equal to an integer N*U where U is the upsampling or interpolation factor. For example, U can be set to be an integer 2, 3, 4, etc.
and the filtering of
In general, the feed-forward filter may use any of these forms or similar arrangements. Accordingly, the feed-forward filter can have N parallel inputs and M parallel outputs, where N=M, N<M, N>M.
Returning to
where Vi(z), the z-transform of vi(n), is the i-th phase at the output 312 of the feed-back filter.
A specific example of a parallel implementation of a feed-back filter for M=2 is shown in
Equations (16) and (17) can be re-written as:
and
This demonstrates that the feed-back filter can be decomposed into two linear operators. The first operator is a filtering operation which follows the same structure as that of the feed-forward filter. The second operator is a parallel implementation of the delay.
Returning to
Because the filters are linear operators, and linear operators are commutative, the order of the filtering can be reversed. Accordingly,
The filter 600 uses an N-input N-output feed-back filter and an N-input M-output feed-forward filter. The arrangement of
Yet another arrangement of a parallel-input parallel-output IIR filter 800 is shown in
Figuring the filter coefficients for the filter 800 is slightly different than for that of
Following the system block diagram in the z-domain, the output, Y(z), can be written as:
where X(z) and Y(z) are the z-transform of the input signal, x(n), and the output signal, y(n), respectively. Thus, the linear system transfer function is described by
This is equivalent to the traditional IIR system if we choose the feed-forward and feed-back filter coefficients to satisfy:
The methods 900, 950 can be implemented using a processor, including a digital signal processor, general purpose processor, or the like in which case computer readable instructions can be provided on a computer readable media to cause the processor to execute the method. Various computer readable media are known in the art, including for example, memory devices such as flash memory, programmable read only memory, mask-programmed read only memory, fuse-programmable memory, magnetic memory, ferro-electric memory, optical memory, and the like.
The methods 900, 950 can also be implemented using digital hardware, including application specific integrated circuits, fully custom integrated circuits, field programmable gate arrays, discrete digital hardware, and the like. Individual operations in digital logic can be performed using multipliers, adders, subtractors, lookup tables, memories, and similar elements. Digital logic may be designed and expressed in a hardware description language such as Verilog™ or VHDL in either behavioral or structural descriptions (sometimes also referred to as soft or hard macros). Hardware description language may provide for bit widths, number of filter segments, number of coefficients, parallelism, decimation/interpolation factors, and other parameters to be configurable parameters which are specified by a designer at synthesis time.
Summarizing and reiterating to some extent, a parallel-input parallel-output infinite impulse response filter has been developed. The filter can be used in a wide variety of signal processing applications. By providing both parallel-input and parallel-output, high throughput can be obtained through the filter without requiring high input and output sample rates. The filter response can include both poles and zeros. The filter is also applicable to upsampling and downsampling sample rate changing using polyphase techniques. Various arrangements of the filter can be used, allowing for high computational efficiency as a function of the sample rate change factor and relative lengths of the pole filter and zero filter. While various particular filter arrangements have been shown for illustration purpose, it will be appreciated that implementation of the FIR filter components can use various arrangements, including for example direct form, transpose form, cascade form, lattice form, frequency sampling, using Fourier transforms, and the like. Accordingly, the invention is not limited to any particular FIR filter form.
It is to be understood that the above-referenced arrangements are illustrative of the application for the principles of the present invention. It will be apparent to those of ordinary skill in the art that numerous modifications can be made without departing from the principles and concepts of the invention as set forth in the claims.
The present application is a continuation in part of commonly-owned co-pending U.S. patent application Ser. No. 10/914,554, filed Aug. 9, 2004, and entitled “Parallel Filter realization for Wideband Programmable Digital Radios” (hereinafter referred to as “Parent Case”) which is hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
4782324 | Underwood | Nov 1988 | A |
4811263 | Hedley et al. | Mar 1989 | A |
4821223 | David | Apr 1989 | A |
5325318 | Harris et al. | Jun 1994 | A |
5432723 | Chen et al. | Jul 1995 | A |
5548543 | Wang | Aug 1996 | A |
5926455 | Allpress | Jul 1999 | A |
6889238 | Johnson | May 2005 | B2 |
20050286619 | Haddadin et al. | Dec 2005 | A1 |
20060031274 | Haddadin et al. | Feb 2006 | A1 |
20070156800 | Sturza et al. | Jul 2007 | A1 |
Entry |
---|
Krukowski at al., “Polyphase IIR Filter Banks for Subband Adaptive Echo Cancellation Applications” IEEE, 2003, 4 pages. |
U.S. Appl. No. 11/540,142, filed Sep. 28, 2006, Nelson. |
Number | Date | Country | |
---|---|---|---|
Parent | 10914554 | Aug 2004 | US |
Child | 12117555 | US |