The invention relates to a downsampling finite impulse response (FIR) filter. In particular, the invention relates to an optimised architecture for a downsampling FIR filter which could be used in, but is not limited to, digital front-end filtering in a mobile telecommunications apparatus.
Downsampling FIR filters are an important component for many digital signal-processing applications.
However, frequently the downsampling process results in inefficient operation due to the performance of redundant operations. Redundant operations may also be performed when the underlying ‘base’ filter has degenerate coefficients. In particular, redundant operations may be performed if the filter coefficients are symmetric.
The performance of redundant operations leads to an unnecessarily large power requirement. This is sub-optimal.
Furthermore it is desirable to reduce the number of components in downsampling FIR filters in order to reduce the apparatus size. However, previous FIR filters have not exploited the downsampling process or the coefficient degeneracy or symmetry in order to minimize the number of components.
The shift register 1 receives an input data stream and provides a data part to the calculation unit 2 every clock cycle. The multipliers each multiply a portion of the data part with a coefficient of the filter. The adder 3 sums the output of the multipliers. Just before the downsampling unit 5, the output of the filter is given by
y
n
=C
1
x
n
+C
2
xd
n-1
+C
3
x
n-2
+C
4
x
n-3
+C
5
x
n-4
+C
6
x
n-5
where xn is the input signal, yn is the signal just before the downsampling unit, and C1, C2, . . . , Cn are the filter coefficients. The downsampling unit 5 decimates every second input which it receives. Thus, the downsampling unit is operable in a data decimation phase, in which received data will be decimated, and in a data preserving phase, in which received data will not be decimated. All of the multiplication and addition operations performed during the data decimation phase are redundant, since the calculated data sent to the downsampling unit during this phase is discarded.
For the filter shown in
The present invention provides an apparatus comprising a calculation unit, a polyphase addition unit and a downsampling unit operable in a data decimation phase and a data preserving phase, wherein the calculation unit is arranged to receive input data and is configured to generate a first data part during the data decimation phase and a second part during the data preserving phase; and the polyphase addition unit is configured to generate a third data part in dependence on said first and second data parts and to output said third data part to the downsampling unit such that the apparatus generates filtered and downsampled data.
The calculation unit may comprise a multiplier and the apparatus may be configured so that the multiplier receives a first multiplier coefficient during the data decimation phase and a second multiplier coefficient during the data preserving phase and the calculation unit may be configured to generate the first and second data parts in dependence on the first and second multiplier coefficients respectively.
The apparatus may further comprise a plurality of multipliers, and each multiplier may serially receive its own sequence of multiplier coefficients and the multiplier coefficients received may be the coefficients of a predetermined linear function.
The plurality of multipliers may alternately receive even and odd coefficients of the predetermined linear function.
The apparatus may be arranged so that the number of different coefficients received by each multiplier is equal to the downsampling factor of the downsampling unit.
The apparatus may further comprise an adder, and the polyphase addition unit may comprise a plurality of polyphase addition sub-units and may be configured to receive data from the calculation unit and output data to the downsampling unit and the downsampling unit may comprise a plurality of downsampling sub-units and the adder may be configured to calculate the sum of the outputs of the downsampling sub-units.
The calculation unit may comprise an adder and a plurality of multipliers and the adder may be configured to calculate the sum of the outputs of the multipliers and to output the sum to the polyphase addition unit.
The calculation unit may be configured to receive an input data part during a time interval and to multiply a portion of the input data part by a predetermined coefficient.
The calculation unit may receive an input data part during a time interval and the apparatus may be configured to calculate a linear function of the input data part.
The calculation unit may comprise an adder, and the received input data part may have a first data portion and a second data portion and the linear function may have first and second degenerate coefficients and the calculation unit may calculate the sum of the product of the first data portion and the first degenerate coefficient and the product of the second data portion and the second degenerate coefficient by summing the first and second data portions in the adder and multiplying the output of the adder by the value of the degenerate coefficient.
The apparatus may further comprise means for receiving a clock signal and the time interval may be a clock cycle.
The apparatus may further comprise a shift register, and the shift register may receive an input data stream and a clock signal and the calculation unit may receive data from the shift register.
The calculation unit may further comprise a data direction unit and the data direction unit may be configured to receive a plurality of data parts and a control signal and to generate output data in dependence on the control signal and on a received data part.
The calculation unit may have a calculation sub-unit and the calculation sub-unit may receive data from the data direction unit.
The data direction unit may be a multiplexer.
The data direction unit may be a re-ordering block.
The re-ordering block may receive a plurality of data parts in a first order and may be configured to output said plurality of data parts in a second order.
The re-ordering block may comprise one or more registers and one or more multiplexers configured to receive the control signal.
The polyphase addition unit may comprise a register and an adder.
The polyphase addition unit may be arranged to calculate and store a cumulative sum of n received inputs, where n is the downsampling factor of the downsampling unit.
The polyphase addition unit may further comprise a multiplexer, and the multiplexer may reset the stored cumulative sum to 0 after the cumulative sum of n inputs has been calculated.
The apparatus may implement a downsampling finite impulse response filter.
According to the invention, there is provided a method comprising generating filtered and downsampled data by receiving input data generating a first data part during a first time interval and a second element during a second time interval and generating an output data stream in dependence on said first and second data parts; and downsampling the output data stream.
The method may further comprise steps of receiving a first multiplier coefficient during a first time interval and a second multiplier coefficient during a second time interval and generating the first and second data parts in dependence on the first and second multiplier coefficients respectively.
The method may further comprise a step of serially receiving a sequence of coefficients.
The method may further comprise steps of receiving an input data part during a time interval and multiplying a portion of the input data part by a predetermined coefficient.
The calculation unit may receive an input data part during a time interval and the method may calculate a linear function of the input data part.
The linear function may have first and second degenerate coefficients and the method may calculate the sum of the products of the first data portion and the first degenerate coefficient and the second data portion and the second degenerate coefficient by adding the first and second data portion and multiplying the result by the value of the degenerate coefficient.
The method may further comprise directing data in dependence on a control signal. The method may further comprise re-ordering data.
The method may implement a downsampling finite impulse response filter.
Embodiments of the present invention will now be described, by way of example only, with reference to the accompanying drawings in which:
Referring to
The shift register 1 receives an input data stream and provides a data part to the multipliers 6. The multipliers 6 each multiply a portion of the data part by a coefficient. Each polyphase addition sub-unit 10 generates an output which is the sum of the input to the polyphase addition sub-unit at a particular clock cycle and the input to the polyphase addition unit at the previous clock cycle. Thus, the polyphase addition sub-units 10 combine the output of the multiplier over two consecutive clock cycles. The output of the each polyphase addition sub-unit is downsampled by a factor of two in a downsampling sub-unit 13. The adder 3 then sums the output of the downsampling sub-units, hence generating the output.
Two of the original coefficients are alternatingly given to the multipliers 6: In a first phase all even ones, and in a second phase all odd ones.
Thus, the structure generates the same output as the structure of
In general the following equation holds for the total summation wordlength:
n+┌log2(m)┐+┌log2(d)┐<m(n+┌log2(d)┐)
Where n is the wordlength of the partial products, m is the number of coefficients divided by the decimation ratio and d is the decimation ratio.
Therefore, compared to previous embodiments, the number of adders and registers in the present embodiment is reduced. This may be advantageous in terms of area power and cost, for example. The minimization can be implemented regardless of how the polyphase addition is achieved.
In the first memory part 24 the multiplexer 28 may receive a first multiplexer input data portion from the output of a register 9 of the shift register 1 and a second multiplexer input data portion from the output of a register 29 of the first memory part 24 of the re-ordering unit 29. The multiplexer 28 outputs either the first multiplexer input data portion or the second multiplexer input data portion to the input of the register 29 in dependence on the control signal. If the control signal is ‘0’ the input multiplexer 28 may output the output of the register 29 to the input of the register 29. If the control signal is a ‘1’ the input multiplexer 28 may output the output of a register 9 of the shift register 1 to the input of the register 29.
In the second memory part 25 the multiplexer 30 may receive a first multiplexer input data portion from the output of a register 9 of the shift register 1 and a second multiplexer input data portion from the output of a register 31 of the second memory part 25 of the re-ordering unit 29. The multiplexer 30 outputs either the first multiplexer input data portion or the second multiplexer input data portion to the input of the register 31 in dependence on the control signal. If the control signal is ‘1’ the input multiplexer 30 may output the output of the register 31 to the input of the register 31. If the control signal is a ‘0’ the input multiplexer 30 may output the output of a register 9 of the shift register 1 to the input of the register 31.
The output multiplexer 27 may output the data stored in either the first memory part 24 or the second memory part 25 to a register 9 of the shift register 1.
Thus, the re-ordering unit may re-order data. In particular, the re-ordering block may re-order data so that any two samples that are received by the re-ordering block are output by the re-ordering block in reverse order.
The adder 32 may receive data from the re-ordering block. Furthermore, the adder, 33, may receive data which has been re-ordered by the re-ordering block. Thus, the adders 32, 33, may receive data in dependence on the control signal. The multipliers 34, 35 may receive data from the adders 32, 33. Thus, the multiplier may receive data in dependence on the control signal. Thus, according to the invention, the structure may comprise calculation sub-units which receive data in dependence on the control signal.
The re-ordering block enables the addition of (delayed) input samples that are to be multiplied with the same coefficient.
The re-ordering block may allow a reduction in the number of multipliers. Furthermore, the re-ordering block may allow the dataflow to be handled in an efficient way.
The addition of (delayed) input samples may also be achieved by using multiplexing circuitry for every element in the later part of the delay chain.
For downsampling factors other than two, the re-ordering may be generalised in such a way that any group of n samples that is taken into the re-ordering block may be output by the re-ordering block in reverse order. For example, for a downsampling factor of 4, the re-ordering block may take in 4 samples (1 2 3 4) and may produce an output of (4 3 2 1)
The above described embodiments and alternatives may be used either singly or in combination to achieve the effects provided by the invention.
Many other modifications and variations will be evident to those skilled in the art, that fall within the scope of the following claims:
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2007/051096 | 3/28/2007 | WO | 00 | 9/28/2009 |