This invention relates to video signal processing and is particularly concerned with non-linear filtering.
It has been found that in a wide variety of video signal processes—including de-interlacing, decoding, enhancement, noise reduction, and standards conversion—considerable advantage can be secured by the use of complex non-linear filters. It has been found in particular that polynomial filters can be very useful. In many applications, quadratic behaviour in the filter is not sufficient and third or higher orders are typically necessary. Where real time operation is required, hardware implementations are usually essential and the hardware costs of such high order polynomial filters are substantial.
It is an object of the present invention to provide improved methods and apparatus in video signal processing which offer third or higher order behaviour in a relatively simple filter architecture.
Accordingly, the present invention consists, in one aspect, in a method of video signal processing, comprising the steps of conducting three linear filtering operations on an input video signal to produce three filtered signals, each linear filtering operation comprising the taking of a weighted sum of pixels; and multiplying together said three filtered signals to produce an output video signal.
Suitably, the weighted sum is taken over pixels of the input video signal defined by a filter aperture and, preferably, all three linear filtering operations have the same filter aperture.
In one embodiment, for at least one linear filtering operation, the taking of a weighted sum of pixels includes the output pixel of the respective linear filtering operation.
In another aspect, the present invention consists in apparatus for video signal processing comprising an input terminal for receiving an input video signal; first, second and third linear filters each connected with the input terminal and arranged to provide an output through taking a weighted sum of pixels; a first multiplier for multiplying together the respective outputs of the first and second filters; and a second multiplier for multiplying together the respective outputs of the first multiplier and the third filter to produce an output video signal.
Advantageously, a filter is interposed between the output of the first multiplier and the second multiplier.
Preferably, the apparatus further comprises a linear filter path connected with the input terminal, and a combiner for combining the outputs of the linear filter path with the output of said second multiplier.
Suitably, a filter is interposed between the output of the second multiplier and said combiner.
The invention will now be described by way of example with reference to the accompanying drawings in which:
The example will be taken of a de-interlacer and, for reasons of clarity, a de-interlacer will be described that utilises only vertical information. It will be understood that horizontal and temporal information could be included in ways which will be immediately evident to the skilled reader.
In
In the non-linear signal path 20, the outputs of two four-point linear filters (h1 and h2) are multiplied together and the product is passed through a two-point linear filter (h4). The output of h4 is then multiplied by the output of a five-point linear filter (h3). The resulting signal is filtered through another two-point linear filter (h5) before being added to the output of the linear path. Although in this case the filter lengths are 4, 5 and 2, larger filters with more taps can be used to give better results. The lengths (or more generally, the sizes) of the filters need not be related and can be made larger or smaller to provide different trade-offs between quality and cost.
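The signal flow just described can be sketched in a few lines of Python. This is an illustrative sketch only, operating on a single vertical column of pixels. The function names, the linear-path filter `h_lin`, the edge handling (border pixels repeated) and the alignment of each aperture to its output pixel are assumptions made for the example, and no coefficient values are implied.

```python
from typing import List, Sequence


def fir(signal: Sequence[float], taps: Sequence[float]) -> List[float]:
    """Weighted sum of pixels over a sliding aperture.

    Assumption: the aperture is roughly centred on the output pixel and
    border pixels are repeated; the source does not specify either choice.
    """
    n = len(signal)
    offset = len(taps) // 2
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(taps):
            j = min(max(i + k - offset, 0), n - 1)  # clamp at the borders
            acc += w * signal[j]
        out.append(acc)
    return out


def nonlinear_path(column, h1, h2, h3, h4, h5) -> List[float]:
    """(h1 * h2) -> h4 -> multiplied by h3 -> h5, as described for path 20."""
    a = fir(column, h1)                      # four-point filter
    b = fir(column, h2)                      # four-point filter
    ab = [x * y for x, y in zip(a, b)]       # first pixel multiplier
    ab_filtered = fir(ab, h4)                # two-point filter
    c = fir(column, h3)                      # five-point filter
    abc = [x * y for x, y in zip(ab_filtered, c)]  # second pixel multiplier
    return fir(abc, h5)                      # two-point filter


def deinterlace_column(column, h_lin, h1, h2, h3, h4, h5) -> List[float]:
    """Add the non-linear path to a linear path (h_lin is a hypothetical name)."""
    linear = fir(column, h_lin)
    nonlinear = nonlinear_path(column, h1, h2, h3, h4, h5)
    return [l + n for l, n in zip(linear, nonlinear)]
```

In this sketch each output sample of the non-linear path costs only two multiplications of one pixel value by another; every other multiplication is of a pixel by a constant tap weight.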
It will be recognised that the arrangement of
The filter coefficients can be selected by ‘training’ the filter on real pictures. In this example of de-interlacing, a still frame is taken and split into fields. A set of coefficients is used to estimate Field 2 from Field 1 and the mean squared error between the estimate of Field 2 and the original Field 2 is measured. A genetic algorithm can then be used to search the multi-dimensional filter space for the set of filter coefficients that gives the lowest mean squared error.
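The training procedure can be illustrated with the following sketch. It splits a frame into fields, measures the mean squared error of an estimate of Field 2, and searches for coefficients that lower that error. The search shown is a simple mutation-and-selection loop, used here as a simplified stand-in for a genetic algorithm (it has no crossover). The estimator `two_tap_estimate` is a hypothetical two-tap vertical interpolator chosen only to keep the example short, and all names and parameter values are assumptions.

```python
import random
from typing import Callable, List, Sequence, Tuple

Frame = List[List[float]]  # rows of pixel values


def split_fields(frame: Frame) -> Tuple[Frame, Frame]:
    """Field 1 takes the even-numbered lines, Field 2 the odd-numbered lines."""
    return frame[0::2], frame[1::2]


def mse(estimate: Frame, reference: Frame) -> float:
    """Mean squared error between two fields of equal size."""
    total, count = 0.0, 0
    for est_row, ref_row in zip(estimate, reference):
        for e, r in zip(est_row, ref_row):
            total += (e - r) ** 2
            count += 1
    return total / count


def two_tap_estimate(field1: Frame, coeffs: Sequence[float]) -> Frame:
    """Hypothetical estimator: weighted sum of the Field 1 lines above and below."""
    w_up, w_down = coeffs
    out = []
    for i, up in enumerate(field1):
        down = field1[min(i + 1, len(field1) - 1)]  # repeat the last line
        out.append([w_up * u + w_down * d for u, d in zip(up, down)])
    return out


def search(frame: Frame,
           estimate: Callable[[Frame, Sequence[float]], Frame],
           n_coeffs: int,
           generations: int = 200,
           population: int = 20,
           sigma: float = 0.1,
           seed: int = 0) -> Tuple[List[float], float]:
    """Mutation-only search for the coefficients with the lowest MSE."""
    rng = random.Random(seed)
    field1, field2 = split_fields(frame)
    best = [0.0] * n_coeffs
    best_cost = mse(estimate(field1, best), field2)
    for _ in range(generations):
        for _ in range(population):
            candidate = [c + rng.gauss(0.0, sigma) for c in best]
            cost = mse(estimate(field1, candidate), field2)
            if cost < best_cost:  # keep only improvements
                best, best_cost = candidate, cost
    return best, best_cost
```

The same `search` function could be pointed at any estimator that maps Field 1 and a flat coefficient list to an estimate of Field 2, including one built from the non-linear architecture.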
If the described non-linear de-interlacer is tested on the EBU/SMPTE test picture “Girl with Toys”, the non-linear path is found to reduce the average mean squared error by approximately 15% with respect to the linear filter. There is also a noticeable reduction in jagging.
A polynomial filter with the same number of input pixel taps produces an almost equivalent reduction in error. However, a major advantage of this new architecture over the polynomial filter can be seen by considering the number of multiplications of pixels; multiplications of pixels by a constant; and additions, that each filter requires. These are shown in Table 1.
It can be seen that the largest reduction is in the multiplication of pixels. This is particularly significant as these are the most expensive to implement.
In summary, the new architecture is able to reduce many of the artefacts associated with traditional linear interpolation whilst being relatively simple to implement.
It should be understood that this invention has been described by way of example only and that a wide variety of modifications are possible without departing from the scope of the invention. Thus, whilst the separation into linear and non-linear paths offers important advantages, such as the option to preserve higher bit accuracy in the linear path, it will not always be appropriate. Similarly, the described use of vertical filters is, as has been explained, merely an example. Horizontal, vertical and temporal filters can be employed and filters can have one, two or three of these dimensions. Whilst Finite Impulse Response (FIR) filters will be important, the invention also encompasses other forms of linear filter such as recursive filters which include the output pixel in the weighted sum. The filters which are to be multiplied together need not be of the same category. However, providing three FIR or transversal filters with the same filter aperture ensures that in the multiplication of the three filtered signals, all possible cross products of input pixels are made available.
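The remark about cross products can be made explicit with a short expansion. The notation is an assumption introduced for illustration: the three FIR filters have tap weights $a_i$, $b_j$ and $c_k$ over a common aperture of $N$ pixels, $x$ is the input and $y$ the product of the three filtered signals.

$$
y(n) = \Big(\sum_{i=0}^{N-1} a_i\, x(n-i)\Big)\Big(\sum_{j=0}^{N-1} b_j\, x(n-j)\Big)\Big(\sum_{k=0}^{N-1} c_k\, x(n-k)\Big)
     = \sum_{i=0}^{N-1}\sum_{j=0}^{N-1}\sum_{k=0}^{N-1} a_i\, b_j\, c_k\; x(n-i)\, x(n-j)\, x(n-k)
$$

Every third-order product of pixels within the aperture therefore appears in the output. The weight on each product is constrained to the form $a_i b_j c_k$, so the structure has $3N$ free coefficients, whereas a general third-order polynomial term would assign an independent coefficient to each distinct product.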
It will be recognised that although de-interlacing has been chosen as an example, filters according to the present invention can be applied to other problems in video processing, including composite to component decoding, enhancement, noise reduction, up and down conversion and standards conversion.
Number | Date | Country | Kind |
---|---|---|---|
9810555.4 | May 1998 | GB | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/GB99/01574 | 5/17/1999 | WO | 00 | 1/16/2001 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO99/60780 | 11/25/1999 | WO | A |