The present invention relates to a method and signal processor for processing auxiliary data of video sequences. In particular, the present invention relates to a pre-processing of auxiliary data of video sequences in order to achieve an improved processing of video sequences, in particular for interpolation purposes.
Motion estimation is employed in an increasing number of applications, in particular, in digital signal processing of modern television receivers. Specifically, modern television receivers perform a frame-rate conversion, especially in form of an up-conversion or motion compensated up-conversion, for increasing the picture quality of the reproduced images. Motion compensated up-conversion is performed, for instance, for video sequences having a field or frame frequency of 50 Hz to higher frequencies like 60 Hz, 66.67 Hz, 75 Hz, 100 Hz etc. While a 50 Hz input signal frequency mainly applies to television signal broadcasts based on PAL or SECAM standard, NTSC based video signals have an input frequency of 60 Hz. A 60 Hz input video signal may be up-converted to higher frequencies like 72 Hz, 80 Hz, 90 Hz, 120 Hz etc.
During up-conversion, intermediate images are to be generated, which reflect the video content at positions in time which are not represented by the 50 Hz or 60 Hz input video sequence. For this purpose, the motion of objects has to be taken into account in order to appropriately reflect the changes between subsequent images caused by the motion of objects. The motion of objects is calculated on a block basis, and motion compensation is performed based on the relative position in time of the newly generated image between the previous and subsequent images.
In order to enable a processing of picture improvement algorithms, a number of characteristic information items of the video sequence to be processed are required. These information items are preferably obtained on a block basis. The characteristic information include data indicating whether a block includes still image data or moving image data, data indicating whether or not the image information of a block stem from motion pictures (film mode), and data indicating the motion phase pattern in case of film mode. These data enable a selection of the appropriate image data for interpolation purposes.
The present invention aims to enable an image processing with improved picture quality based on an enhancement of auxiliary data of a video sequence to be processed.
This is achieved by the features of the independent claims.
According to a first aspect of the present invention, a method for processing auxiliary data of a sequence of video images is provided. The auxiliary information is received in form of a field including an information item for each of the blocks of an image. The received field of auxiliary information is subjected to filtering in order to detect and eliminate an irregularity.
According to a further aspect of the present invention, a signal processor is provided for processing auxiliary data of a sequence of video images. The signal processor receives a field of auxiliary information. Each video image is divided into a plurality of blocks and the field of auxiliary information includes an information item for each of the blocks of an image. The signal processor comprises a filter means for subjecting the received field of auxiliary information to filtering in order to detect and eliminate an irregularity.
It is the particular approach of the present invention to detect abnormal patterns of auxiliary information and to eliminate such patterns therefrom. In this manner, an auxiliary information item reflecting an abnormal behavior compared to its surrounding is eliminated and replaced by a more likely information value. Accordingly, a picture improvement processing is able to apply a smoothened field of auxiliary information as implausible information items are replaced by more plausible ones.
Preferably, the auxiliary information represents characteristic information of the video sequence. By applying the present invention, defective determinations can be removed from the auxiliary data.
According to a preferred embodiment, the auxiliary information indicates whether or not an image block contains motion or still image data. Accordingly, the application of a motion compensated interpolation can be put on a more reliable basis by removing unlikely data items.
Preferably, a single bit is provided for each block in order to indicate motion or still image data.
Preferably, the auxiliary information includes information indicating whether or not an image block contains film mode or video mode data. Most preferably, a single bit is provided therefore. By removing unlikely film mode or video mode indications, an improved motion compensated interpolation result can be achieved.
Preferably, the auxiliary information further indicates an individual motion scheme of a film mode block. In this manner, a picture quality improvement algorithm can accurately take the motion phase pattern of pull down schemes into account during interpolation processing.
Preferably, three bits are provided for the indication of an individual motion scheme. Most preferably, these three bits indicate at least two bit combinations representing a PAL motion phase pattern, five bit combinations which represent NTSC motion phase pattern and a single bit combination representing an image scene change. In this manner, a small number of bits can be used to represent all most likely pull down motion patterns for world-wide applications.
Preferably, the filtering is performed either in row or column direction. In this manner, an irregularity can be efficiently detected by employing only a small computational effort.
Preferably, those auxiliary data items are removed which do not have at least two neighboring data items of a corresponding value in horizontal or vertical direction. According to an alternative embodiment, individual auxiliary data items are removed which do not have at least a single neighboring data item of a corresponding value in horizontal and at least a single neighboring data item of a corresponding value in vertical direction. According to another alternative embodiment, auxiliary data items are removed which do not have at least two corresponding data items at an adjacent position. Accordingly, individual regularities can efficiently be removed from the field of the data items.
Preferably, the removed data item is replaced by the data item of a neighboring block. In this manner, an efficient concealment scheme with low computational and hardware effort can be applied.
Preferably, the detection of an irregularity is performed by comparing a current pattern of block data with pre-stored irregularity patterns. Upon detecting the current pattern to match a pre-stored irregularity pattern, the current pattern is replaced. By providing a plurality of predefined irregularity patterns, possible irregularity configurations can reliably be detected and removed.
Preferably, a replacement pattern is stored in association with a respective irregularity pattern. Consequently, the most appropriate replacement pattern is available upon detecting an irregularity based on a stored regularity pattern.
While an embodiment of low hardware complexity employs patterns of a three data items length, a more sophisticated approach employs a pattern extending in two directions. Such a two dimensional pattern approach enables to detect a plurality of unlikely irregularities with increased efficiency and reliability.
Preferred embodiments of the present invention are the subject matter of the dependent claims.
Other embodiments and advantages of the present invention will become more apparent from the following description of preferred embodiments, in which:
The present invention relates to digital signal processing, especially to signal processing in modern television receivers. Modern television receivers employ up-conversion algorithms in order to increase the reproduced picture quality. For this purpose, intermediate images are to be generated from two subsequent images. For generating an intermediate image, the motion of objects has to be taken into account in order to appropriately adapt the object position to the point of time reflected by the interpolated image.
Motion estimation is performed on a block basis. For this purpose, each received image is divided into a plurality of blocks as illustrated in
xmax=Pixels per line/X
ymax=Pixels per column/Y
The digital signal processing in modern television receivers applies picture improvement algorithms, which make use of auxiliary data reflecting characteristic information of the video sequence to be processed. For this purpose, a still image/motion image indication, a film/video indication and a motion phase indication are preferably included on a block basis into the auxiliary data. These data result from a Block Mode Detection (BMD) processing. The block mode detection is part of a feature for modern media display devices like CRT, TFT or plasma displays. It is the main function of BMD to automatically select the settings for signal processing in order to achieve the best picture quality of the current video data.
The auxiliary information is available for each block of each incoming video field, wherein the individual data items are stored in form of a block matrix. Examples of the individual information retrieved for each block is illustrated in
The motion/still information 30 is one bit wide (Bs) and enables to determine whether or not the current block of the input field relates to a moving or still object. If a still block is indicated, the image data from two subsequent fields can be used for re-interleaving in order to achieve the best picture quality output. Preferably, the sill/motion bit is defined as follows:
0=motion
1=still
A further bit (Bm) is employed in order to indicate film mode or video mode. If the data of the current block stems from film mode, two (A+B) or three (A+B+A) fields relate to the same motion phase. In contrast, in video mode each field relates to a different motion phase. The film/video mode bit (Bm) is preferably defined as follows:
0=video camera
1=motion picture film
In case of motion picture data, a three bit phase information (Bp) is additionally provided. This three bit information (Bp) reflects the motion phase pattern of the current film data.
In contrast to interlaced video signals, motion picture data is composed of complete frames. The most wide spread frame rate of motion picture data is 24 Hz. When transforming motion picture data into an interlaced video sequence for display on a television receiver, the 24 Hz frame rate is converted into an interlaced video sequence by employing a “pull down” technique.
For converting motion picture film into interlaced PAL of a field rate of 50 Hz, a two-two pull down technique is employed. The two-two pull down technique generates two fields out of each film frame. The motion picture film is played at 25 frames per second. Consequently, two succeeding fields contain information originating from the same frame.
When converting motion picture into NTSC having a field rate of 60 Hz, the frame rate of 24 Hz is converted into a 60 Hz field rate employing a three-two pull down technique. This three-two pull down technique generates two video fields from a given motion picture frame and three video fields from the next motion picture frame. As can be learned from the pull down techniques described above the resulting video sequences include pairs or triplets of adjacent fields reflecting an identical motion phase. The pull down techniques employed for converting motion picture frames into video fields in accordance with the PAL or NTSC standard are illustrated in
Motion phases, reflected by the motion phase bits (Bp), are illustrated, by way of example, in
Present picture quality improvement algorithms have to cope with irregular or defective auxiliary information, in particular for the still/motion indication, the film/video mode indication, and/or the motion phase indication. These irregularities result in a respectively impaired picture quality.
The present invention removes such irregularities by applying a filtering to a field of auxiliary information items. For this purpose, the present invention exploits the spatial neighborhood of each auxiliary data item in order to detect irregular data items. An example for removing irregular data items is illustrated in
An example of the application of horizontal filter is illustrated in
The processing for three data items as evaluated above is illustrated in
The mode processing for Bm is similar to the processing for Bs.
The filtering process for the motion phase data items, described in connection with
While the filtering as described by way of example with reference to horizontal filtering in
Reference numeral 900 designates the field of auxiliary data of a complete image. The data items 940, to be stored for processing the vertically adjacent data items 910, 920 and 930, are marked as grey colored blocks in
The above described filtering operations are applicable to all block positions, except the border rows and border columns. In order to appropriately process these blocks, either the vertical or the horizontal filtering operation is disabled, due to a lack of neighbouring data. For the first and the last row, the vertical filtering is disabled and for the first and the last column the horizontal filtering is disabled.
According to a preferred embodiment, a plurality of irregularity patterns are stored in a look-up-table. A pattern to be evaluated is compared to a set of stored irregularity patterns. In case a current pattern matches one of the stored patterns, an irregularity is detected and removed from the field of data items.
Preferably, the recorded irregularity patterns have stored associated replacement patterns. These replacement patterns can be used as an alternative embodiment to the replacement processing described in connection with
The use of a look-up-table further enables to employ two dimensional irregularity patterns, for instance, block patterns of a 4×4 block size. An example thereof is indicated by X6 in
Summarizing, the present invention relates to a pre-processing of auxiliary data of a video sequence in order to enable improved processing results for applying picture improvement algorithms. Irregularities occurring within an auxiliary data field providing data items on a block basis are detected and removed. In particular, a film/video mode indication or a motion/still indication is processed accordingly. The removal of irregularities enables a respective improved image processing, for instance, interpolation processing during up-conversion and interlaced/progressive conversion.
Number | Date | Country | Kind |
---|---|---|---|
04010302.0 | Apr 2004 | EP | regional |