The invention relates to a method of processing data corresponding to pixels of a sequence of digital images so as to detect a grid corresponding to blocking artefacts, said method comprising a step of high-pass filtering a portion of a digital image, intended to supply at least one card of discontinuity pixels, and a step of detecting blocking artefacts from the at least one card of discontinuity pixels.
The invention also relates to a television receiver comprising a processing device for implementing the data processing method according to the invention.
It notably finds its application in the detection of blocking artefacts within a digital image which has been previously encoded and then decoded in accordance with a block-based encoding technique, for example, the MPEG standard (“Motion Pictures Expert Group”), and in the correction of data comprised in these blocks in order to attenuate the visual artefacts caused by the block-based encoding technique.
The blocking artefacts constitute a crucial problem for the block-based encoding techniques using a discrete transform of the discrete cosine transform DCT type. They appear in the form of block mosaics which are sometimes extremely visible in the decoded image sequences. These artefacts are due to a strong quantization subsequent to the discrete transform, which strong quantization causes strong discontinuities to appear at the borders of the encoding blocks.
The article entitled “Optimal JPEG Decoding” by J. Jung, M. Antonini, M. Barlaud, Proc. Of ICIP '98, vol. 1, pp. 410-414, Chicago, October 1998 describes a data processing method with which blocking artefacts can be detected and corrected. To this end, said method comprises a step of frequency transform of the wavelet transform type, which is applied horizontally and vertically to an image. The result of this transform comprises two sub-images which have high-frequency coefficients. These high-frequency coefficients correspond to blocking artefacts or to natural contours. In accordance with the prior-art method, the high-frequency coefficients corresponding to blocking artefacts are spatially located on a grid of 8×8 pixels and have a value which is smaller than a threshold, a value higher than this threshold corresponding to a natural contour.
However, this method is only capable of effecting a basic modeling of the blocking artefacts, which limits its possibilities of detecting said artefacts. Moreover, it only searches the blocking artefacts in 8×8 pixel grids. The grid may be distorted within the image because of a resampling of the image. This distortion may sometimes be known in advance, as in the case of the 3/4 encoding format where the width of the grid varies in accordance with the 10-11-11 pattern. However, in the majority of cases, this variation is arbitrary because it originates, for example, from a rate transcoding, an image format conversion in a 16/9 television receiver, from a 4/3 format into, for example, a 16/9 format a zoom in a portion of the image, an AD conversion, or even a combination of these different conversions. In this case, the prior-art method only detects blocking artefacts in a grid having a fixed size and position and applies a post-processing step based on this detection, with the risk of a partial or even inefficient correction.
It is an object of the invention to propose a data processing method which is more efficient.
To this end, the data processing method according to the invention is characterized in that the detection step is also intended to detect a second type of elementary blocking artefact from the at least one card of discontinuity pixels.
The invention uses the observations illustrated in
The method according to the invention also takes the second blocking artefact profile into account by virtue of a more powerful analysis. The modeling thus effected takes a possible resampling operation of the image into account, so that the result obtained in the matter of detecting blocking artefacts is improved. The blocking artefacts may also be detected independently in any grid, thus rendering the processing method more efficient both for detecting and for correcting blocking artefacts.
These and other aspects of the invention are apparent from and will be elucidated, by way of non-limitative example, with reference to the embodiment(s) described hereinafter.
In the drawings:
The invention relates to a method of processing a sequence of digital images encoded and decoded in accordance with a block-based encoding technique. In our example, the encoding technique used is the MPEG standard based on the discrete cosine transform DCT, but may alternatively be any other equivalent standard, such as, for example, the H.263 or H.26L standard. It should be noted that this method may also be applied to a fixed image, encoded, for example, in accordance with the JPEG standard. The processing method first relates to the detection of blocking artefacts due to these block-based encoding techniques and subsequently to the ensuing application such as, for example, post-processing techniques or image quality measurements.
In the preferred embodiment, the filtering operation is of the wavelet transform type. The wavelet transform, described with reference to
The result is an approximation image I2 which has a resolution divided by 2 and three detail images E2v, E2h, E2d which give the errors between the original image and the approximate image. The detail images E2h and E2v represent the discontinuities in the horizontal and vertical directions, respectively.
The method also comprises a step of determining the discontinuity corresponding to blocking artefacts BAD (120). Said step is based on forming thresholds and comparisons between a current filtered coefficient and filtered coefficients which are adjacent thereto.
The method of determining blocking artefacts is described below for the detection of vertical blocking artefacts by considering W1V(m,k); it is applicable to the detection of the horizontal blocking artefacts by considering W1H(k,n).
A vertical artefact corresponding to the profile p1 is detected if the following cumulative conditions are satisfied, W1V(m,n) being a coefficient of the sub-sampled image E2v:
S1<|W1V(m,n)<S2
|W1V(m,n)|>A1·|W1V(m,n±1)|
|W1V(m,n±2)|<S3
A vertical artefact corresponding to the profile p2 is detected if the following cumulative conditions are fulfilled:
S1<|W1V(m,n)|<S2 and S1<|W1V(m,n+1)|<S2
|W1V(m,n)|>A1·|W1V(m,n−1)|
|W1V(m,n+1)|>A1·|W1V(m,n+2)|
|W1V(m,n−2)|<S3 and |W1V(m,n+3)|<S3
A1 is a predetermined coefficient equal to 4 in our example. S1 and S2 are first and second predetermined thresholds, the first threshold corresponding to a visibility threshold, the second threshold to the limit from which the pixel with position (m,n) corresponds to a natural contour. They are equal to 2 and 10, respectively, in our example. S3 is a third threshold obtained from the representation in the frequency domain after wavelet transform of the blocking artefact profiles. In our example, it is equal to 1 and serves to make the detection more reliable by controlling the contrast of the discontinuity. It may be particularly advantageous in the case of MPEG4 applications, where access to video data streams and thus to field quantization steps is possible, to vary the thresholds S1 and S2 as a function of said quantization step so as to further improve the efficiency of the processing method. For example, the threshold values are a linear function of the quantization step.
Because of the sub-sampling by 2 of the decomposition in wavelets, the location at the approximate pixel of the blocking artefact is not an easy matter. Indeed, a coefficient of the frequency domain of the first sub-band may be associated with two pixels in the spatial domain. This is why a finer analysis is necessary, taking into account wavelet coefficient signs W1V.
In a particularly advantageous embodiment, the filtering operation is a gradient filtering operation using the filter hp2=[1,−1,−4,8,−4,−1,1]. This filter is applied horizontally and vertically, row by row, to the luminance pixels Y(m,n) of the field of a digital image of the sequence. The result of this filtering operation is preferably constituted by two cards of discontinuity pixels, a horizontal card Eh and a vertical card Ev. As the majority of resampling operations is performed in the horizontal direction, the horizontal card Eh showing the vertical discontinuities may suffice in a first approximation. However, the processing method according to the invention will have an optimal efficiency when it is based on processing the two cards of discontinuity pixels. Other gradient filters are possible such as, for example, the high-pass filter of the wavelet transform hp1 proposed by Antonini et al. The filter hp2 is particularly easy to implement and reliably approximates the filter hp2.
In this case, the step of determining discontinuities corresponding to blocking artefacts comprises a sub-step of detecting natural contours and non-visible artefacts. To this end, coefficient values filtered horizontally Yfh(m,n) and/or vertically Yfv(m,n) must be between the first and second thresholds S1 and S2 so as to be able to correspond to a blocking artefact. The condition is preferably taken for the absolute value of coefficients filtered as follows:
S1<|Yfh(m,n)|<S2 and S1<|Yfv(m,n)|<S2
As an alternative, the following condition is used:
S′1<|Yfh(m,n)|2+|S′1Yfv(m,n)|2<S′2,
in which S′1 and S′2 have the same function as S1 and S2.
The threshold values depend on the filter used. For the filter hp1, we take, for example S′1=0.6 and S′2=400, S1=0.5 and S2=20.
The step of determining the discontinuities corresponding to blocking artefacts comprises a sub-step of detecting blocking artefacts. A vertical artefact corresponding to the profile p1 is detected by scanning the field in a horizontal direction corresponding to the row m if the following condition is satisfied:
|Yfv(m,n)|>|Yfv(m,n+k)| with k=−2, −1, +1, +2.
The border of the block is localized between the pixel of position (m,n) and that of position (m,n+1) if |Y(m,n)−Y(m,n−1)|<|Y(m,n)−Y(m,n+1)| and between the pixel of position (m,n−1) and that of position (m,n) in the opposite case.
An artefact corresponding to profile p2 is detected if the following cumulative conditions are satisfied:
f1·|Yfv(m,n)|<(|Yfv(m,n−1)|+|Yfv(m,n+1)|)
|Yfv(m,n−1)|>f2·|Yfv(m,n−2)|
|Yfv(m,n+1)|>f2·|Yfv(m,n+2)|
with f1=6 and f2=2 in the preferred embodiment.
The border of the block is localized between the pixel of position (m,n−1) and that of position (m,1). The detection of a horizontal artefact corresponding to each profile p1 and p2 is effected in a similar manner by scanning the horizontal card Eh comprising the coefficients Yfh(m,n) filtered in a vertical direction corresponding to the column n. The step of determining the discontinuities which has been described hereinbefore has the advantage that it is particularly easy to implement.
A first application of the data processing method according to the invention is constituted by the MPEG detection, i.e. the detection of a sequence of digital images that have been encoded and subsequently decoded in accordance with the MPEG standard and of its grid of blocking artefacts among analog image sequences. This MPEG detection is effected at the level of a television receiver and is generally followed by a step of post-processing images, intended to correct said blocking artefacts which are present in the grid.
To this end, the processing method also comprises a step of selecting SEL (130) segments in a horizontal row or a vertical row of the field, which segments comprise a number of consecutive discontinuity pixels which is higher than a fourth predetermined threshold S0. Indeed, the isolated discontinuities generally correspond to a supplementary noise, while the blocking artefacts which are due to a coarse quantization of the DCT coefficients generally cause linear faults to appear along the encoding blocks. The value S0 of the predetermined threshold must not be too low so as not to favor the false detections. It must neither be too high so as not to constrain the selection too much by reducing the number of segments of detected elementary artefacts. In practice, the value S0 is fixed at 3 for a field of 288 rows of 720 pixels.
Advantageously, the processing method also comprises a step of searching, within the field, a set of grid rows, a grid row having a density of elementary block effects present in the segments which is substantially larger than that of its neighboring rows. Such a step allows an even further reduction of the risk of false detections.
A second application of the data processing method according to the invention is constituted by post-processing images intended, to correct the blocking artefacts which are present in a grid. Said grid has been determined by the method described previously or is known as, for example, the post-processing operation is effected in an MPEG-4 video decoder. The correction depends on the profile of the detected blocking artefact.
If the blocking artefact corresponds to the profile p1, the correction described with reference to
If the blocking artefact corresponds to the profile p2, the correction must be modified considerably. Indeed, the position of the border of the block must be given more precisely because of the double step of the staircase corresponding to the profile p2, as illustrated in
It is alternatively possible to cause the luminance value of the intermediate pixel to correspond to that of the pixel on the left, or to that of the pixel having the nearest luminance value. In both cases, the positioning of the segments u and v is adapted accordingly so as to apply the correction step.
A third application of the data processing method according to the invention is constituted by measuring the block level of the field from blocking artefacts which are present in the grid so as to determine the quality of the images. The quality measurement may be effected at the level of a television receiver in which the grid has been determined by the method previously described or at the level of an MPEG-4 video decoder, with the grid already being known so as to ensure a given service quality.
The level of the block B of the field f is preferably obtained by summing the amplitudes of the filtered values W1V(mn) corresponding to elementary blocking artefacts, i.e.
wherein δ(x)=1 if x is true and 0 if not, artV and artH comprising the pixels detected as blocking artefacts.
Such a measurement has the advantage that it takes the amplitude W1 of the degradation into account. It also takes into account the position (m,n) of the degradation, while a weighting coefficient K(m,n) as a function of the perception of the human visual system can be introduced. Moreover, this measurement allows determination of a block level for a grid having an arbitrary dimension or even being variable with respect to time.
It is possible to implement the processing method according to the invention by means of a television receiver circuit, said circuit being suitably programmed. A computer program stored in a programming memory may cause the circuit to perform the different operations described hereinbefore with reference to
Any reference sign between parentheses in the present text should not be construed as being limitative. Use of the verb “comprise” and its conjugations does not exclude the presence of elements or steps other than those stated in the claims. Use of the article “a” or “an” preceding an element or step does not exclude the presence of a plurality. of such elements or steps.
Number | Date | Country | Kind |
---|---|---|---|
02/07863 | Jun 2002 | FR | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB03/02835 | 6/16/2003 | WO | 12/16/2004 |