The present invention relates to an improved film mode determination. In particular, the present invention relates to a method for determining improved film mode determinations and to a corresponding film mode detector.
Film mode indications are employed in motion compensated image processing which is used in an increasing number of applications, in particular in digital signal processing of modern television receivers. Specifically, modern television receivers perform a frame-rate conversion, especially in the form of a up-conversion or a motion compensated up-conversion, for increasing the picture quality of the reproduced images. Motion compensated up-conversion is performed, for instance, for video sequences having a field or frame frequency of 50 Hz to higher frequencies like 60 Hz, 66.67 Hz, 75 Hz, 100 Hz, etc. While a 50 Hz input signal frequency mainly applies to television signals broadcast based on PAL or SECAM standards, NTSC based video signals have an input frequency of 60 Hz. A 60 Hz input video signal may be up-converted to higher frequencies like 75 Hz, 80 Hz, 90 Hz, 120 Hz, etc.
During upconversion, intermediate images are to be generated which reflect the video content at temporal positions which are not represented in the 50 Hz or 60 Hz input video sequence. For this purpose, the motion of objects has to be taken into account in order to appropriately reflect the changes between subsequent images caused by the motion of objects. The motion of objects is calculated on a block basis, and motion compensation is performed based on the relative position and time of the newly generated image between the previous and subsequent images.
For motion vector determination, each image is divided into a plurality of blocks. Each block is subjected to motion estimation in order to detect a shift of an object from the previous image.
In contrast to interlaced video signals like PAL or NTSC signals, motion picture data is composed of complete frames. The most widespread frame rate of motion picture data is 24 Hz (24 p). When converting motion picture data into an interlaced video sequence for display on a television receiver (this conversion is called telecine), the 24 Hz frame rate is converted by employing a “pull down” technique.
For converting motion picture film into an interlaced signal according to the PAL standard with a field rate of 50 Hz (50 i), a 2-2 pull down technique is employed. The 2-2 pull down technique generates two fields out of each film frame. The motion picture film is played at 25 frames per second (25 p). Consequently, two succeeding fields contain information originating from the same frame and representing the identical temporal position of the video content, in particular of moving objects. When converting motion picture film into a standard NTSC signal having a field rate of 60 Hz (60 i), the frame rate of 24 Hz is converted into a 60 Hz field rate employing a 3-2 pull down technique. This 3-2 pull down technique generates two video fields from a given motion picture frame and three video fields from the next motion picture frame.
The telecine conversion process for generating interlaced video sequences in accordance with different television standards is illustrated in
The detection of the individual pull down pattern employed is required in order to appropriately perform a picture quality improvement processing, in particular to decide whether or not a film motion compensation is to be employed. A detection of a respective pull down pattern is already known, for instance, from EP-A-0 720 366 and EP-A-1 198 138.
The present invention aims to further improve film mode detection and to provide an improved method of film mode detection and an improved film mode detector.
This is achieved by the features of the independent claims.
According to a first aspect of the present invention, a method for determining film mode indications for a plurality of image areas of a current image is provided. The current image is part of an image sequence. The method receives a film mode indication for current image area and obtains a motion vector for the current image area. Based on the received motion vector, film mode indications of the current image are corrected.
According to a further aspect of the present invention, a film mode detector for determining film mode indications for a plurality of image areas of a current image is provided. The current image is part of an image sequence. The film mode detector comprises input means and extrapolation means. The input means obtains a film mode indication and a motion vector for a current image area. The extrapolation means corrects film mode indications of the current image based on the obtained motion vector.
It is the particular approach of the present invention to improve film mode detection by obtaining film mode indications on a local basis and to extrapolate the film mode indication of a current image area to neighbouring image areas in accordance with a motion vector determined for the current image area. In this manner, the accuracy and reliability of film mode indications around leading edges of moving objects can be increased. The image quality achievable by picture improvement algorithms is accordingly enhanced.
Conventionally, the correct film mode indication is only detected if a moving object covers most of the respective image area. Thus, the correct film mode indication is not detected if the moving object only covers a smaller proportion of an image area. In accordance with the present invention, the film mode indications of image areas including a leading edge of a moving object can be converted into the correct mode.
Further, film mode indications of image areas around a leading edge of a moving object generally do not switch immediately to a newly detected mode due to a delay which is introduce in order to increase the reliability of the film mode indications. However, this is only achieved at the expense of a correct determination of a film mode determination for leading edges of moving objects. This drawback is avoided by employing a film mode indication extrapolation in accordance with the present invention.
Preferably, image areas located between the current image area and an image area pointed to by the motion vector are set to film mode if the film mode indication received for the current image area is film mode. Accordingly, film mode is extrapolated in accordance with the motion vector determined for the current image area.
Preferably, the extrapolation is only performed if the target block, i.e. the block pointed to by the motion vector, is not in film mode. Accordingly, an extrapolation is only performed if the motion vector points to image areas of another mode which is different from that of the current image area.
If the motion vector points from the current image area to a position outside of the current image, the motion vector length is preferably clipped such that the clipped vector only points to a position located within the current image.
Preferably, the images of the image sequence are divided into a plurality of blocks wherein the film mode indications and motion vectors are provided on a block basis, i.e. the image areas correspond to the block structure. Accordingly, the extrapolation can be performed in a simple manner based on an existing image area structure.
Preferably, the motion vector pointing from a current block into a target block is quantized in order to fit into the raster of image blocks. Accordingly, the film mode extrapolation can be implemented in a simple manner.
The image areas to be set to film mode when performing film mode extrapolation, are preferably selected in accordance with a predefined image area pattern, i.e. a pattern which identifies the individual image areas to be corrected. In this manner, those image areas for which the film mode indication needs to be corrected can be determined in a reliable and simple manner.
The predefined pattern is preferably selected from a plurality of prestored patterns in a memory. This selection is performed based on the relative positions of the current image area and the target image area. Accordingly, a pattern to be applied to a current image area can be selected in a fast and simple manner.
Preferably, the prestored patterns provide all possible combinations of relative positions of the current image area and the target image area. The image areas for which the film mode indication is to be corrected can thus be determined in a reliable manner.
According to a preferred embodiment, the image areas to be set to film mode are determined based on an iterative determination starting at the current image area and stepwisely approaching the target image area.
The step size for determining new image areas to be set to film mode is preferably determined based on the motion vector's orientation. Most preferably, the step size is set by dividing the larger vector component by the smaller vector component of the horizontal and vertical vector components.
Preferably, an additional indication is stored in connection with each of the image areas indicating whether or not the film mode indication of a current image area has been corrected. In this manner, an original film mode indication can be distinguished from a corrected film indication in a reliable manner. A further extrapolation of film mode indications can be inhibited when the occurrence of a “corrected” film mode indication is detected. In this manner, a once extrapolated film mode does not serve as a basis for a further film mode extrapolation.
According to a preferred embodiment, image areas between a current image area and a target image area are set to video mode if the current image area is in video mode. In this manner, the film mode indications of a moving object in video mode inserted into an environment in film mode can be accurately determined by extrapolating a video mode accordingly.
Preferably, the video mode is only extrapolated if the target image area is in film mode.
By Preferred embodiments of the present invention are the subject matter of dependent claims.
Other embodiments and advantages of the present invention will become more apparent from the following description of the preferred embodiments, in which:
The present invention relates to digital signal processing, especially to digital signal processing in modern television receivers. Modern television receivers employ up-conversion algorithms in order to increase the reproduced picture quality. For this purpose, intermediate images are to be generated from two subsequent images. For generating an intermediate image, the motion of objects has to be taken into account in order to appropriately adapt the object position to the point of time reflected by the compensated image.
Motion estimation for determining a motion vector and motion compensation are performed on a block basis. For this purpose, each image is divided into a plurality of blocks as illustrated, for example, in
In order to be able to correctly apply motion compensation to an image area, the determination of a film mode indication, i.e. film mode or video mode, for that image area is required. By applying the correct picture quality improvement processing in accordance with the detected film mode indication, image artefacts are avoided.
A video signal processing is particularly required to drive progressive displays and to make use of higher frame rates, in particular for HDTV display devices. The detection of motion picture film converted into interlaced image sequences for television broadcast (further referred to as film mode) is crucial for the signal processing.
For picture improvement processing an interlaced/progressive conversion (I/P) is possible, using an inverse telecine processing, i.e. a re-interleaving of even and odd fields. For image sequences stemming from a 3-2 pull down scheme, the single redundant field from a triplet (the grey colored fields in
More advanced up-conversion algorithms employ a motion vector based interpolation of frames. The output frame rate can be an uneven fraction of the input video rate, for instance, a 60 Hz input signal frequency may be up-converted to a 72 Hz output frequency corresponding to a ratio of 5:6. Accordingly, only every sixth output frame can be generated from a single input field alone, when a continuous motion impression of moving objects is to be maintained.
The film-mode characteristic of an image may be determined on an image basis or, according to an improved approach, be a local characteristic of individual image areas. In particular, television signals are composed of different types of image areas such as no-motion areas (e.g. logo, background), video camera areas (e.g. newsticker, video insertion) and film mode areas (e.g. main movie, PIP). A pull down scheme detection is separately performed for each of these image areas enabling an up-conversion result with improved picture quality.
Film mode detection generally involves a recognition of a pull down pattern. Conventionally, pixel differences are accumulated to a Displaced Frame Difference (DFD) representing the motion between subsequent images. In order to avoid sudden changes in the detected film-mode indication, which would result in an unstable impression to the viewer, detection delays are employed for triggering a switch from a film mode to a video mode and vice versa.
In order to increase the film mode indication accuracy, a film mode detection is performed on a block basis as illustrated, for instance, in
The data obtained for each of the image blocks are illustrated for a single block in
A block based film mode detection and problems arising therefrom are illustrated in
Further, a switching delay, which is introduced in order to avoid a frequent switch between different modes, causes the leading edge of a moving object 10 not to be properly detected being in film mode as shown in image T=1 of
The delay further causes that the trailing edge of the moving object 10 has a trailing image area of film mode blocks (d) although the trailing blocks are not covered by the moving object 10 (see in particular images T=2 and T=3 of
In order to overcome these drawbacks, the present invention employs motion vectors determined for image blocks in order to enable and improve up-conversion processing. The extrapolation of a film mode detection enables to cover leading borders of moving objects. An example of an improved film mode detection in accordance with the present invention is illustrated in
The left hand image in
For this purpose, the film mode detection of the current block is extrapolated as illustrated in
The approach of the present invention to extrapolate film mode indications in accordance with a motion vector 110 will now be described in detail. Each field is divided into a plurality of image areas of blocks as illustrated in
The extrapolation process is started by obtaining the motion vector and source mode for the current block 100 (step S220). If the current block turns out to be film mode in step S230, the motion vector 110 of the current block 100 is quantized in order to fit into the block grid (step S240). If the motion vector points to a position outside of the current image, the motion vector length is clipped in order to point to a respective block at the image border.
After determining the target block 120, i.e. the block to which the motion vector points starting from the current block 100, the mode (target mode) of the target block 120 is determined (step S250). An extrapolation is only performed if the following conditions are met:
Only if it has been determined in step S250 that the target block is in video mode, extrapolation is performed (step S260). Extrapolation is performed by setting each block 130 under the motion vector 110 pointing from the current block 100 to the target block 120 to film mode.
The determination of the blocks to be set to film mode can be implemented by means of a modulo addressing of the current block index. The motion vector component of the horizontal and vertical components having the larger value is considered as primary axis V1, while the smaller motion component is considered to represent a secondary axis V2. The respective signs determine the directions Dir1, Dir2. The step width for determining stepwisely blocks to be set to film mode is calculated based on an integer division of the larger motion component by the smaller motion vector component as indicated below:
It is to be noted that each of these artificially set film mode blocks 130 (in
The source block 100 is not set to artificial mode. The first block set to film mode and having the artificial bit set accordingly is determined in the direction of the sign of the primary axis V1 (Sign(V1)).
The method for iteratively determining the blocks 130 between the source block 100 and the target block 120 is illustrated in
For the method of modulo addressing, the typical loop variables i and j are used. The variable i is used for the primary direction Dir1, whereas j is used for Dir2.
The originally determined source block 100 is in film mode and shall not be set again and marked as artificial. Therefore processing starts in step S320 by adding the sign of Dir1 to the index i. This is the block marked “Start” at position 1,0 in
In step 330 the condition for an increment of the variable j is checked, which is responsible for incrementing the artificial marking position in S340 in the secondary direction Dir2. The condition is true if i equals an even multiple of the value “Step” calculated above. This is marked as “Step=2” in
In step S350 the absolute position of the artificial film block is calculated, by means of adding the current indexes i and j to the absolute position of the source block (Index1/2(Source)). The result is held in the variables k and I indicating the position in the image.
Then the artificial bit and film bit are set in step S360, indicated as 130 in
If the index i of the primary direction Dirn has advanced to a value equal to the vector magnitude of V1, then modulo addressing ends in S370 (“Last Block” in
Accordingly, a number of blocks 130 is determined as illustrated by the gray marked blocks in
The iterative approach for determining the blocks between the current block 100 and the target block 120 has the disadvantage that for some motion vectors, the target block cannot be reached and consequently the target block 120 cannot be approached stepwisely.
According to another preferred embodiment, the artificial mode marking is implemented by employing a look-up-table (LUT) for every possible combination of x/y vector components. Each entry in the look-up-table identifies those blocks which are to be artificially marked. For this purpose, the stored pattern describes which block is to be marked next. This can be implemented based on a binary indication wherein a “0” indicates up/down step and a “1” indicates right/left step. The moving direction is given by the sign of the respective vector component. The example illustrated in
This approach does not allow the marking of blocks in a diagonal manner without having any adjacent blocks in a horizontal or vertical direction. Consequently, the number of blocks marked increases resulting in a better vector path coverage.
The skilled person is aware, that the described approaches for determining those blocks to be artificially set to film mode between a current block and a target block is not limited to the described embodiments and every other approach may be used with the same effect.
The image area is described above to correspond to a block size know from motion estimation. The present invention is not limited to such an image area size for film mode determination and, particularly, for film mode extrapolation. Image areas larger or smaller than a block may be defined. For instance, image areas smaller than a block refine the film mode resolution. A film mode determination and extrapolation may be implemented based on image areas having a size between a whole field and just a single pixel, or even a sub-pixel size.
Further, the film mode extrapolation can be enhanced by an additionally implemented motion vector aided extrapolation of detected video modes of the film mode indication. Under the assumption that a video mode detection for each block can be performed accurately and with high reliability, the motion path of a video mode object does not interfere with that of a film mode object.
Summarizing, the present invention enables an improved film mode determination in particular for border areas of moving objects. This is achieved by a film mode extrapolation. The film mode indication of the current block is extrapolated in accordance with a motion vector determined for the identical block. In this manner, the accuracy of the film mode determinations for the current image can be improved and image processing yielding improved picture quality can be improved accordingly.
Number | Date | Country | Kind |
---|---|---|---|
04010293.1 | Apr 2004 | EP | regional |