The present invention generally relates to video compression and, more particularly, to a method of coding an input digital video sequence corresponding to an original color image sequence, said method comprising at least the following steps:
The invention also relates to a corresponding encoder, to a method of decoding signals coded by means of said coding method, to a corresponding decoder, and to systems comprising computer readable program codes for implementing said coding and decoding methods.
Data compression systems generally operate on an original data stream by exploiting the redundancies in the data, in order to reduce the size of said data to a compressed format more adapted to a transmission or storing operation. For these data, several color spaces may be used (a color space is completely parametrized with three colors linearly independent), and for instance the red-green-blue (RGB) color space (which is still severely redundant) or the so-called opponent color space, nominally white/black (or WB), red/green (or RG) and blue/yellow (or BY), or, in the video case, the YUV space.
In classical video approaches, the video is often encoded along the three following separate channels: luminance Y, component U of chrominance, component V of chrominance. As it seems difficult, with this classical (Y, U, V) representation scheme, to highly improve the rate/distorsion ratio, it has been proposed in the european patent application no 02290484.1 filed on Feb. 28, 2002, by the applicant (PHFR020014) to change the representation space in order to achieve a higher coding efficiency (for example in order to encode more information with the same bit budget, or less information with far less bits). The coding method described in said document mainly comprises, before the coding step, a pre-processing step, provided for verifying in which color space the input video sequence is and transforming said space into a less redundant one by means of a non linear transformation. However, less information may lead to a lower quality.
It is therefore a first object of the invention to propose another encoding method for the compression of a digital color video sequence, allowing to transform the original color space of said sequence into a less redundant one, by means of a non-linear transformation taking into account the possible lower quality finally obtained.
To this end, the invention relates to a coding method such as defined in the introductory part of the description and which is moreover characterized in that it also comprises, before said converting step, a pre-processing step, provided for determining if the color space of the input video sequence is the YUV color space, where Y is the luminance component and U, V the chrominance components, and transforming said YUV color space into a less redundant color space by means of a non-linear transformation taking into account the possible lower quality finally obtained.
By coding with a greater precision all the relevant part of the information, whereas non-relevant information may be degraded, a better coding efficiency is obtained.
The invention will now be described in a more detailed manner, with reference to the accompanying drawings in which:
Considering that, for a wide range of applications (such as digital movies, high-definition television, transmission or visualization of scientific imagery, . . . ), the ultimate consumer is the human eye, the basic idea of the invention consists in choosing a representation based upon the partition of the visual signals by the early human visual system, i.e. in designing the image codes in such a way that they match the visual capacities of the human observer.
Perceptual studies have already shown that, under standard viewing conditions, human eyes cannot distinguish small luminance variations (from 1 to 5 grey levels). A common approach has then consisted in uniformly compressing the luminance dynamic by using less grey levels, which is for instance illustrated in
According to the invention, it is then proposed to adaptively compress the luminance dynamic. Perceptual tests performed by the applicant show that, for a luminance dynamic including 256 grey levels (from 0 to 255 for example), human eyes are more sensitive to luminance changes inside the luminance range [70;130] than in the range [0;70] or in the range [130;255]. More generally, the applicant has considered that, for a luminance dynamic including N grey levels (from 0 to N-1 for example), a more relevant information is the one located in a central range [A;B] and a less relevant information is located in the side ranges [0;A] and [B;N-1].
In order to exploit this property of a variable perception according to the considered luminance ranges, it is then proposed, given an original luminance range of N grey levels (for example from 0 to N-1 as illustrated in
Practically, several compression modes may be proposed. In the example of
It may also be noticed that, because only M integer values are used after the dynamic compression, once the luminance transformation is performed, more precise values are used for original values inside the central range (between A and B), whereas outside said central range many original values are clustered (as depicted in
An embodiment of a coding device for the implementation of the coding method according to the invention is now described. As shown in
At the decoding side, a decoding device is provided for implementing the above-mentioned inverse transformation and comprises, as shown in
The encoding and decoding devices, (61, 62) and (71, 72) respectively, can be implemented in a variety of ways to perform the functionalities described herein. In one embodiment, they may be embodied as software stored on media and executed by a general purpose or specifically configured computer system, typically including a central processing unit, memory and one or more input/output devices and processors. Alternatively, they may be implemented as a combination of hardware, software or firmware, without excluding that a single item of hardware or software can carry out several functions or that an assembly of items of hardware or software or both carry out a single function. The described methods and devices may be implemented by any type of computer system or other apparatus adapted for carrying out the methods described herein, this computer system including a computer program that, when loaded and executed, controls the computer system such that it carries out the methods described herein.
Alternatively, a specific use computer, containing specialized hardware for carrying out one or more of the functional tasks of the invention, can be utilized. The present invention can also be embedded in a computer program product, which comprises all the features enabling the implementation of the methods and functions described herein, and which—when loaded in a computer system—is able to carry out these methods and functions. Computer program, software program, program, program product, or software, in the present context mean any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following: (a) conversion to another language, code or notation; and/or (b) reproduction in a different material form.
Number | Date | Country | Kind |
---|---|---|---|
02290934.5 | Apr 2002 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB03/01371 | 4/3/2003 | WO |