This application claims the benefit, under 35 U.S.C. § 119 of European Patent Application 03292464.9, filed October 7, 2003.
The present invention relates to a method and an apparatus for processing video pictures especially for dynamic false contour effect and dithering noise compensation.
The plasma display technology now makes it possible to achieve flat colour panels of large size and with limited depth without any viewing angle constraints. The size of the displays may be much larger than the classical CRT picture tubes would have ever allowed.
Plasma Display Panel (or PDP) utilizes a matrix array of discharge cells, which could only be “on” or “off”. Therefore, unlike a Cathode Ray Tube display or a Liquid Crystal Display in which gray levels are expressed by analog control of the light emission, a PDP controls gray level by a Pulse Width Modulation of each cell. This time-modulation will be integrated by the eye over a period corresponding to the eye time response. The more often a cell is switched on in a given time frame, the higher is its luminance or brightness. Let us assume that we want to dispose of 8 bit luminance levels i.e 255 levels per colour. In that case, each level can be represented by a combination of 8 bits with the following weights:
1-2-4-8-16-32-64-128
To realize such a coding, the frame period can be divided in 8 lighting sub-periods, called subfields, each corresponding to a bit and a brightness level. The number of light pulses for the bit “2” is the double as for the bit “1”; the number of light pulses for the bit “4” is the double as for the bit “2” and so on . . . . With these 8 sub-periods, it is possible through combination to build the 256 gray levels. The eye of the observers will integrate over a frame period these sub-periods to catch the impression of the right gray level. The
The light emission pattern introduces new categories of image-quality degradation corresponding to disturbances of gray levels and colours. These will be defined as “dynamic false contour effect” since it corresponds to disturbances of gray levels and colours in the form of an apparition of coloured edges in the picture when an observation point on the PDP screen moves. Such failures on a picture lead to the impression of strong contours appearing on homogeneous area. The degradation is enhanced when the picture has a smooth gradation, for example like skin, and when the light-emission period exceeds several milliseconds.
When an observation point on the PDP screen moves, the eye will follow this movement. Consequently, it will no more integrate the same cell over a frame (static integration) but it will integrate information coming from different cells located on the movement trajectory and it will mix all these light pulses together, which leads to a faulty signal information.
Basically, the false contour effect occurs when there is a transition from one level to another with a totally different code. The European patent application EP 1 256 924 proposes a code with n subfields which permits to achieve p gray levels, typically p=256, and to select m gray levels, with m<p, among the 2n possible subfields arrangements when working at the encoding or among the p gray levels when working at the video level so that close levels will have close subfields arrangements. The problem is to define what “close codes” means; different definitions can be taken, but most of them will lead to the same results. Otherwise, it is important to keep a maximum of levels in order to keep a good video quality. The minimum of chosen levels should be equal to twice the number of subfields.
As seen previously, the human eye integrates the light emitted by Pulse Width Modulation. So if you consider all video levels encoded with a basic code, the temporal center of gravity of the light generation for a subfield code is not growing with the video level. This is illustrated by the
where sfWi is the subfield weight of ith subfield;
The center of gravity SfCGi of the seven first subfields of the frame of
So, with this definition, the temporal centers of gravity of the 256 video levels for a 11 subfields code with the following weights, 1 2 3 5 8 12 18 27 41 58 80, can be represented as shown in
In this case, for this example, 40 levels (m=40) will be selected among the 256 possible. These 40 levels permit to keep a good video quality (gray-scale portrayal). This is the selection that can be made when working at the video level, since only few levels, typically 256, are available. But when this selection is made at the encoding, there are 2n different subfield arrangements, and so more levels can be selected as seen on the
The main idea of this Gravity Center Coding, called GCC, is to select a certain amount of code words in order to form a good compromise between suppression of false contour effect (very few code words) and suppression of dithering noise (more code words meaning less dithering noise).
The problem is that the whole picture has a different behavior depending on its content. Indeed, in area having smooth gradation like on the skin, it is important to have as many code words as possible to reduce the dithering noise. Furthermore, those areas are mainly based on a continuous gradation of neighboring levels that fits very well to the general concept of GCC as shown on
However, let us analyze now the situation on the border between the forehead and the hairs as presented on the
In the
It is an object of the present invention to disclose a method and a device for processing video pictures enabling to reduce the false contour effects and the dithering noise whatever the content of the pictures.
This is achieved by the solution claimed in independent claims 1 and 10.
The main idea of this invention is to divide the picture to be displayed in areas of at least two types, for example low video gradient areas and high video gradient areas, to allocate a different set of GCC code words to each type of area, the set allocated to a type of area being dedicated to reduce false contours and dithering noise in the area of this type, and to encode the video levels of each area of the picture to be displayed with the allocated set of GCC code words.
In this manner, the reduction of false contour effects and dithering noise in the picture is optimized area by area.
Exemplary embodiments of the invention are illustrated in the drawings and in more detail in the following description.
In the figures:
According to the invention, we use a plurality of sets of GCC code words for coding the picture. A specific set of GCC code words is allocated to each type of area of the picture. For example, a first set is allocated to smooth areas with low video gradient of the picture and a second set is allocated to high video gradient areas of the picture. The values and the number of subfield code words in the sets are chosen to reduce false contours and dithering noise in the corresponding areas.
The first set of GCC code words comprises q different code words corresponding to q different video levels and the second set comprises less code words, for example r code words with r<q<n. This second set is preferably a direct subset of the first set in order to make invisible any change between one coding and another.
The first set is chosen to be a good compromise between dithering noise reduction and false contours reduction. The second set, which is a subset of the first set, is chosen to be more robust against false contours.
Two sets are presented below for the example based on a frame with 11 sub-fields: 12358121827415880
The first set, used for low video level gradient areas, comprises for example the 38 following code words. Their value of center of gravity is indicated on the right side of the following table.
The temporal centers of gravity of these code words are shown on the
The second set, used for high video level gradient areas, comprises the 11 following code words.
The temporal centers of gravity of these code words are shown on the
These 11 code words belong to the first set. In the first set, we have kept 11 code words from the 38 of the first set corresponding to a standard GCC approach. However, these 11 code words are based on the same skeleton in terms of bit structure in order to have absolutely no false contour level.
Let us comment this selection:
Levels 1 and 4 will introduce no false contour between them since the code 1 (1 0 0 0 0 0 0 0 0 0 0) is included in the code 4 (1 0 1 0 0 0 0 0 0 0 0). It is also true for levels 1 and 9 and levels 1 and 17 since both 9 and 17 are starting with 1 0. It is also true for levels 4 and 9 and levels 4 and 17 since both 9 and 17 are starting with 1 0 1, which represents the level 4. In fact, if we compare all these levels 1, 4, 9 and 17, we can observe that they will introduce absolutely no false contour between them. Indeed, if a level M is bigger than level N, then the first bits of level N up to the last bit to 1 of the code of the level N are included in level M as they are.
This rule is also true for levels 37 to 163. The first time this rule is contravened is between the group of levels 1 to 17 and the group of levels 37 to 163. Indeed, in the first group, the second bit is 0 whereas it is 1 in the second group. Then, in case of a transition 17 to 37, a false contour effect of a value 2 (corresponding to the second bit) will appear. This is negligible compared to the amplitude of 37.
It is the same for the transition between the second group (37 to 163) and 242 where the first bit is different and between 242 and 255 where the first and sixth bits are different.
The two sets presented below are two extreme cases, one for the ideal case of smooth area and one for a very strong transition with high video gradient. But it is possible to define more than 2 subsets of GCC coding depending on the gradient level of the picture to be displayed as shown on
Besides the definition of the set and subsets of GCC code words, the main idea of the concept is to analyze the video gradient around the current pixel in order to be able to select the appropriate encoding approach.
Below, you can find a standard filter approaches in order to extract current video gradient values:
The three filters presented above are only example of gradient extraction. The result of such a gradient extraction is shown on the
Many other types of filters can be used. The main idea in our concept is only to extract the value of the local gradient in order to decide which set of code words should be used for encoding the video level of the pixel.
Horizontal gradients are more critical since there are much more horizontal movement than vertical in video sequence. Therefore, it is useful to use gradient extraction filters that have been increased in the horizontal direction. Such filters are still quite cheap in terms of on-chip requirements since only vertical coefficient are expensive (requires line memories). An example of such an extended filter is presented below:
In that case, we will define gradient limits for each coding set so that, if the gradient of the current pixel is inside a certain range, the appropriate encoding set will be used.
A device implementing the invention is presented on
where γ is more or less around 2.2 and MAX represents the highest possible input value. The output signal of this block is preferably more than 12 bits to be able to render correctly low video levels. It is forwarded to a gradient extraction block 2, which is one of the filters presented before. In theory, it is also possible to perform the gradient extraction before the gamma correction. The gradient extraction itself can be simplified by using only the Most Significant Bits (MSB) of the incoming signal (e.g. 6 highest bits). The extracted gradient level is sent to a coding selection block 3, which selects the appropriate GCC coding set to be used. Based on this selected mode, a resealing LUT 4 and a coding LUT 6 are updated. Between them, a dithering block 7 adds more than 4 bits dithering to correctly render the video signal. It should be noticed that the output of the resealing block 4 is p×8 bits where p represents the total amount of GCC code words used (from 40 to 11 in our example). The 8 additional bits are used for dithering purposes in order to have only p levels after dithering for the encoding block.
Number | Date | Country | Kind |
---|---|---|---|
03292464 | Oct 2003 | EP | regional |
Number | Name | Date | Kind |
---|---|---|---|
5598482 | Balasubramanian | Jan 1997 | A |
20030164961 | Daly | Sep 2003 | A1 |
Number | Date | Country |
---|---|---|
EP 0978816 | Feb 2000 | DE |
1 256 924 | Nov 2002 | EP |
1 262 942 | Dec 2002 | EP |
Number | Date | Country | |
---|---|---|---|
20050083343 A1 | Apr 2005 | US |