This application claims priority from European Patent Application No. 16305529.6, entitled “METHOD AND APPARATUS FOR ENCODING/DECODING A HIGH DYNAMIC RANGE PICTURE INTO A CODED BITSTREAM”, filed on May 4, 2016 and European Patent Application No. 16305527.0, entitled “METHOD AND APPARATUS FOR ENCODING/DECODING A HIGH DYNAMIC RANGE PICTURE INTO A CODED BITSTREAM”, filed on May 4, 2016, the contents of which are hereby incorporated by reference in their entirety.
The present disclosure generally relates to picture/video encoding and decoding. Particularly, but not exclusively, the technical field of the present disclosure is related to encoding/decoding of a picture whose pixel values belong to a high-dynamic range.
In the following, a color picture contains several arrays of samples (pixel values) in a specific picture/video format which specifies all information relative to the pixel values of a picture (or a video) and all information which may be used by a display and/or any other device to visualize and/or decode a picture (or video) for example. A color picture comprises at least one component, in the shape of a first array of samples, usually a luma (or luminance) component, and at least one other component, in the shape of at least one other array of samples. Or, equivalently, the same information may also be represented by a set of arrays of color samples (color components), such as the traditional tri-chromatic RGB representation.
A pixel value is represented by a vector of c values, where c is the number of components. Each value of a vector is represented with a number of bits which defines a maximal dynamic range of the pixel values.
Standard-Dynamic-Range pictures (SDR pictures) are color pictures whose luminance values are represented with a limited dynamic, usually measured in powers of two or f-stops. SDR pictures have a dynamic of around 10 f-stops, i.e. a ratio of 1000 between the brightest pixels and the darkest pixels in the linear domain, and are coded with a limited number of bits (most often 8 or 10 in HDTV (High Definition Television systems) and UHDTV (Ultra-High Definition Television systems)) in a non-linear domain, for instance by using the ITU-R BT.709 OETF (Optico-Electrical-Transfer-Function) (Rec. ITU-R BT.709-5, April 2002) or ITU-R BT.2020 OETF (Rec. ITU-R BT.2020-1, June 2014) to reduce the dynamic. This limited non-linear representation does not allow correct rendering of small signal variations, in particular in dark and bright luminance ranges. In High-Dynamic-Range pictures (HDR pictures), the signal dynamic is much higher (up to 20 f-stops, a ratio of one million between the brightest pixels and the darkest pixels) and a new non-linear representation is needed in order to maintain a high accuracy of the signal over its entire range. In HDR pictures, raw data are usually represented in floating-point format (either 32-bit or 16-bit for each component, namely float or half-float), the most popular format being the OpenEXR half-float format (16 bits per RGB component, i.e. 48 bits per pixel), or in integers with a long representation, typically at least 16 bits.
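By way of illustration only, the relationship between f-stops and contrast ratio quoted above can be checked numerically. The sketch below assumes that one f-stop corresponds to a factor of two in linear-light luminance; the function names are hypothetical and are not part of the disclosure.

```python
import math


def stops_to_ratio(stops: float) -> float:
    """Contrast ratio (brightest / darkest) spanned by a number of f-stops.

    Assumes one f-stop is a factor of two in linear-light luminance.
    """
    return 2.0 ** stops


def ratio_to_stops(ratio: float) -> float:
    """Number of f-stops needed to span a given linear contrast ratio."""
    return math.log2(ratio)


sdr_ratio = stops_to_ratio(10)  # 1024.0, i.e. roughly 1000
hdr_ratio = stops_to_ratio(20)  # 1048576.0, i.e. roughly one million
```

The rounded figures of 1000 and one million used in the text are thus the powers of two 2^10 and 2^20.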
A color gamut is a certain complete set of colors. The most common usage refers to a set of colors which can be accurately represented in a given circumstance, such as within a given color space or by a certain output device.
A color gamut is sometimes defined by RGB primaries and a white point provided in the CIE1931 color space chromaticity diagram, as illustrated in
It is common to define primaries in the so-called CIE1931 color space chromaticity diagram. This is a two dimensional diagram (x,y) defining the colors independently of the luminance component. Any color XYZ is then projected in this diagram thanks to the transform: x=X/(X+Y+Z), y=Y/(X+Y+Z).
The z=1-x-y component is also defined but carries no extra information.
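As a minimal sketch, the standard CIE 1931 chromaticity projection can be written as follows. The function name and the error handling for a zero tristimulus sum are illustrative assumptions.

```python
from typing import Tuple


def xyz_to_xy(X: float, Y: float, Z: float) -> Tuple[float, float]:
    """Project CIE XYZ tristimulus values onto the (x, y) chromaticity diagram."""
    total = X + Y + Z
    if total == 0:
        # Pure black has no defined chromaticity (assumption: report it as an error).
        raise ValueError("chromaticity is undefined when X + Y + Z == 0")
    return X / total, Y / total


# Equal-energy stimulus: all three tristimulus values are identical.
x_e, y_e = xyz_to_xy(1.0, 1.0, 1.0)
z_e = 1.0 - x_e - y_e  # carries no extra information
```

Scaling X, Y and Z by a common factor leaves (x, y) unchanged, which is why the diagram is independent of luminance.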
A gamut is defined in this diagram by a triangle whose vertices are the set of (x,y) coordinates of the three primaries RGB. The white point W is another given (x,y) point belonging to the triangle, usually close to the triangle center. For example, W can be defined as the center of the triangle.
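For illustration, whether a chromaticity lies inside a gamut triangle can be tested with signed areas. The sketch below uses the published (x, y) primaries of BT.709 and BT.2020 and the D65 white point; the helper names are hypothetical.

```python
from typing import Dict, Tuple

Point = Tuple[float, float]

# (x, y) chromaticities of the primaries, as published in the ITU-R recommendations.
BT709: Dict[str, Point] = {"R": (0.640, 0.330), "G": (0.300, 0.600), "B": (0.150, 0.060)}
BT2020: Dict[str, Point] = {"R": (0.708, 0.292), "G": (0.170, 0.797), "B": (0.131, 0.046)}
D65: Point = (0.3127, 0.3290)


def _cross(o: Point, a: Point, b: Point) -> float:
    """Signed area term of the triangle (o, a, b)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def in_gamut(xy: Point, primaries: Dict[str, Point]) -> bool:
    """True if xy lies inside or on the triangle spanned by the three primaries."""
    r, g, b = primaries["R"], primaries["G"], primaries["B"]
    d1, d2, d3 = _cross(r, g, xy), _cross(g, b, xy), _cross(b, r, xy)
    has_neg = d1 < 0 or d2 < 0 or d3 < 0
    has_pos = d1 > 0 or d2 > 0 or d3 > 0
    # The point is inside when all three signed areas share the same sign.
    return not (has_neg and has_pos)
```

With these values, the BT.2020 green primary falls outside the BT.709 triangle, whereas the BT.709 green primary falls inside the BT.2020 triangle.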
A color volume is defined by a color space and a dynamic range of the values represented in said color space.
For example, a color gamut is defined by a RGB ITU-R Recommendation BT.2020 color gamut for UHDTV. An older standard, ITU-R Recommendation BT.709, defines a smaller color gamut for HDTV. In SDR, the dynamic range is defined officially up to 100 nits (candela per square meter) for the color volume in which data are coded, although some display technologies may show brighter pixels.
High Dynamic Range pictures (HDR pictures) are color pictures whose luminance values are represented with a HDR dynamic that is higher than the dynamic of a SDR picture.
As explained extensively in “A Review of RGB Color Spaces” by Danny Pascale, a change of representation of a gamut, i.e. a transform that converts the three primaries and the white point from a linear color space to another, can be performed by using a 3×3 matrix in linear RGB color space. Also, a change of color space from XYZ to RGB is performed by a 3×3 matrix. As a consequence, whether the color spaces are RGB or XYZ, a change of gamut can be performed by a 3×3 matrix. For example, a change of gamut representation from BT.2020 linear RGB to BT.709 XYZ can be performed by a 3×3 matrix.
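The following sketch illustrates why a single 3×3 matrix suffices: two successive linear transforms compose into one matrix product. The coefficients are the commonly published values rounded to four decimals (D65 white point), and the helper names are hypothetical.

```python
from typing import List, Sequence

Matrix = List[List[float]]

# Linear BT.2020 RGB -> CIE XYZ (rounded published coefficients).
BT2020_RGB_TO_XYZ: Matrix = [
    [0.6370, 0.1446, 0.1689],
    [0.2627, 0.6780, 0.0593],
    [0.0000, 0.0281, 1.0610],
]
# CIE XYZ -> linear BT.709 RGB (rounded published coefficients).
XYZ_TO_BT709_RGB: Matrix = [
    [3.2406, -1.5372, -0.4986],
    [-0.9689, 1.8758, 0.0415],
    [0.0557, -0.2040, 1.0570],
]


def matmul3(a: Matrix, b: Matrix) -> Matrix:
    """Product a @ b of two 3x3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]


def apply3(m: Matrix, v: Sequence[float]) -> List[float]:
    """Apply a 3x3 matrix to a 3-component colour vector."""
    return [sum(m[i][k] * v[k] for k in range(3)) for i in range(3)]


# One matrix performs the whole BT.2020 RGB -> BT.709 RGB change of gamut.
BT2020_TO_BT709: Matrix = matmul3(XYZ_TO_BT709_RGB, BT2020_RGB_TO_XYZ)
white_709 = apply3(BT2020_TO_BT709, [1.0, 1.0, 1.0])  # stays close to (1, 1, 1)
```

Because both gamuts share the D65 white point, white maps to approximately (1, 1, 1); the small residual comes from the rounded coefficients.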
The HDR dynamic is not yet defined by a standard but one may expect a dynamic range of up to a few thousand nits. For instance, a HDR color volume is defined by a RGB BT.2020 color space and the values represented in said RGB color space belong to a dynamic range from 0 to 4000 nits. Another example of HDR color volume is defined by a RGB BT.2020 color space and the values represented in said RGB color space belong to a dynamic range from 0 to 1000 nits.
Color-grading a picture (or a video) is a process of altering/enhancing the colors of the picture (or the video). Usually, color-grading a picture involves a change of the color volume (color space and/or dynamic range) or a change of the color gamut relative to this picture. Thus, two different color-graded versions of a same picture are versions of this picture whose values are represented in different color volumes (or color gamuts) or versions of the picture whose at least one of their colors has been altered/enhanced according to different color grades. This may involve user interactions.
For example, in cinematographic production, a picture and a video are captured using tri-chromatic cameras into RGB color values composed of 3 components (Red, Green and Blue). The RGB color values depend on the tri-chromatic characteristics (color primaries) of the sensor. A first color-graded version of the captured picture is then obtained in order to get theatrical renders (using a specific theatrical grade). Typically, the values of the first color-graded version of the captured picture are represented according to a standardized YUV format such as BT.2020 which defines parameter values for UHDTV.
The YUV format is typically obtained by applying a non-linear function, the so-called Optico-Electrical Transfer Function (OETF), to the linear RGB components to obtain non-linear components R′G′B′, and then applying a color transform (usually a 3×3 matrix) to the obtained non-linear R′G′B′ components to obtain the three components YUV. The first component Y is a luminance component and the two components U,V are chrominance components.
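As an illustrative sketch of this two-stage conversion, the code below applies the BT.709 OETF and the BT.709 luma and colour-difference coefficients to normalized linear RGB values in [0, 1]. The function names are hypothetical, and quantization to integer code values is deliberately left out.

```python
from typing import Tuple


def bt709_oetf(linear: float) -> float:
    """BT.709 OETF: linear-light value in [0, 1] to non-linear value in [0, 1]."""
    if linear < 0.018:
        return 4.5 * linear
    return 1.099 * linear ** 0.45 - 0.099


def linear_rgb_to_yuv_bt709(r: float, g: float, b: float) -> Tuple[float, float, float]:
    """Linear RGB -> R'G'B' (OETF) -> Y, U, V (3x3 colour transform)."""
    rp, gp, bp = bt709_oetf(r), bt709_oetf(g), bt709_oetf(b)
    y = 0.2126 * rp + 0.7152 * gp + 0.0722 * bp  # luma
    u = (bp - y) / 1.8556                        # blue colour difference (Cb)
    v = (rp - y) / 1.5748                        # red colour difference (Cr)
    return y, u, v


y_w, u_w, v_w = linear_rgb_to_yuv_bt709(1.0, 1.0, 1.0)  # white: luma near 1, chroma near 0
```

The three expressions for y, u and v are linear in R′G′B′, so together they are equivalent to one 3×3 matrix applied after the OETF.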
Then, a Colorist, usually in conjunction with a Director of Photography, performs a control on the color values of the first color-graded version of the captured picture by fine-tuning/tweaking some color values in order to instill an artistic intent.
The known MPEG video coders, such as HEVC standard for example, are not compatible with HDR (High Dynamic Range) video. Furthermore, a lot of displays/terminals are not compatible with the HDR video.
In order to distribute compressed HDR video to a wide variety of displays/terminals and to make it possible to use known video coding tools, such as MPEG video coding standards, an HDR video is distributed as an SDR video representative of the HDR video with a more limited dynamic range, together with a set of parameters allowing an HDR video to be reconstructed from the SDR video. In such a system, the SDR video is compressed using known tools, such as the standard HEVC Main 10 profile.
On the encoding side, the HDR video is first decomposed into an SDR video, such a decomposition delivering a set of parameters suitable to reconstruct at the decoder or at display level an HDR video from the decoded SDR video. Such a set of parameters may be coded with the compressed SDR video, typically in optional syntax messages, such as SEI (Supplemental Enhancement Information) messages for the HEVC standard.
In a step E30, from the input HDR picture and its characteristics (side information), mapping variables are derived. Such a step of mapping parameters derivation delivers a luminance mapping function LUTTM, which makes it possible to map a linear-light luminance value of the HDR picture into an SDR-like luma value.
In a step E31, the luminance signal is then mapped to an SDR luma signal using the luminance mapping variables. That is for each pixel of the input HDR picture, the luminance L is derived from the HDR linear light R, G, B values of the pixel and from the luminance mapping function as:
with A=[A1 A2 A3]T being the conventional 3×3 R′G′B′-to-Y′CbCr conversion matrix (e.g. BT.2020 or BT.709 depending on the colour space), A1, A2, A3 being 1×3 matrices.
The linear-light luminance L is mapped to an SDR-like luma Ypre0, using the luminance mapping function: Ypre0=LUTTM(L).
In a step E32, a conversion of the R, G, B colour to derive the chroma components of the SDR signal is applied. The chroma components Upre0, Vpre0 are built as follows:
A pseudo-gammatization using square-root (close to BT.709 OETF) is applied to the RGB values of the pixel
Then the Upre0 and Vpre0 values are derived as follows
This step results in a gamut shifting, that is changes in colour hue and saturation compared to the input HDR signal. Such gamut shifting is corrected by a step E34 of colour gamut correction.
In step E34, the chroma component values are corrected as follows:
where A2, A3 are made of the second and third lines of coefficients of the conversion matrix from R′G′B′-to-Y′CbCr, and b0 is a pre-processing colour correction LUT (for Look Up Table).
Then, the mapped luma component is corrected as follows:
Ypre1=Ypre0−ν×max(0, a×Upre1+b×Vpre1), where a and b are pre-defined parameters and ν is a control parameter that controls the saturation. The higher the value Y is, the more the picture is saturated.
The HDR picture to SDR picture decomposition results in an output SDR picture with pixels arrays Ypre1Upre1Vpre1.
The HDR reconstruction process is the inverse of the HDR-to-SDR decomposition process.
In a step E40, the values Upost1 and Vpost1 are derived as follows for each pixel (x,y) of the SDR picture:
where midSampleVal is a predefined shifting constant.
In a step E41, the value Ypost1 for the pixel (x,y) of the SDR picture is derived as follows:
Ypost1=SDRy[x][y]+ν×max(0, a×Upost1+b×Vpost1),
where a and b are the same pre-defined parameters and ν is a control parameter that controls the saturation, as in the decomposition process. Therefore, such parameters should be known to the reconstruction module. They may be part of the HDR parameters coded with the compressed SDR picture or are predefined at the decoder.
Such a step may possibly be followed by a clipping to avoid being out of the legacy signal range.
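The luma correction applied at decomposition and its inverse at reconstruction can be sketched as below. The sketch assumes lossless chroma (so that Upost1 and Vpost1 equal Upre1 and Vpre1); the parameter values and the clipping bounds are arbitrary placeholders, not values from the disclosure.

```python
from typing import Optional


def correct_luma(y_pre0: float, u_pre1: float, v_pre1: float,
                 nu: float, a: float, b: float) -> float:
    """Decomposition side: Ypre1 = Ypre0 - nu * max(0, a*Upre1 + b*Vpre1)."""
    return y_pre0 - nu * max(0.0, a * u_pre1 + b * v_pre1)


def restore_luma(sdr_y: float, u_post1: float, v_post1: float,
                 nu: float, a: float, b: float,
                 y_min: Optional[float] = None,
                 y_max: Optional[float] = None) -> float:
    """Reconstruction side: Ypost1 = SDRy + nu * max(0, a*Upost1 + b*Vpost1),
    optionally followed by a clipping to [y_min, y_max]."""
    y = sdr_y + nu * max(0.0, a * u_post1 + b * v_post1)
    if y_min is not None:
        y = max(y_min, y)
    if y_max is not None:
        y = min(y_max, y)
    return y


# Placeholder parameters, chosen only so that the arithmetic is exact.
NU, A, B = 0.5, 0.25, 0.5
y_pre1 = correct_luma(100.0, 8.0, 4.0, NU, A, B)   # 100 - 0.5 * 4 = 98.0
y_post1 = restore_luma(y_pre1, 8.0, 4.0, NU, A, B)  # back to 100.0
```

The round trip is exact only when the same ν, a and b are used on both sides, which is why these parameters have to be known to the reconstruction module.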
In a step E42, colour correction is performed. In step E42, Upost1 and Vpost1 are modified as follows:
where bp is a post-processing colour correction LUT, that depends directly on the pre-processing colour correction LUT b0.
The post-processing colour correction LUT bp can be determined by:
T=k0×Upost1×Vpost1+k1×Upost1×Upost1+k2×Vpost1×Vpost1
where k0, k1, k2 are predefined values depending on the SDR colour gamut. The value S0 is then initialized to 0, and the following applies:
The values R1, G1, B1 are derived as follows.
In a step E44, the RGB values from the HDR picture are then reconstructed from the SDR RGB values. In step E44, the values R2, G2, B2 are derived from R1, G1, B1 as follows:
where invLUT corresponds to the square-root of the inverse look-up-table LUTTM derived from the luma mapping parameters transmitted to the reconstruction module.
And the output samples HDRR, HDRG, HDRB are derived from R2, G2, B2 as follows:
A clipping may be applied to limit the range of the output HDR signal.
The process for deriving the LUT b0 is independent from the content. It applies in the container colour gamut and takes into account the content colour gamut. In order to better control the HDR to SDR decomposition and thus the quality of the resulting SDR picture, the computation of the LUT b0 is performed so as to control the color saturation of the derived SDR signal.
For computing the LUT b0, for each luma value Y, the following steps are applied. The luminance L is generated using the inverse function of LUTTM: L=invLUT[Y]. Then the best b0[Y] for luminance L (and therefore for luma Y) is identified as follows. Values btest in a given pre-defined range are evaluated. For this, a cumulative error err associated to btest is computed as follows:
The output sample YUVSDR as described in the HDR-to-SDR decomposition process is built, with b0=btest from the scaled RGBHDR samples. Then, an error in the Lab color space, errorab, between RGB′sdr samples values reconstructed from the output sample YUVSDR and RGBHDR is computed. And err is updated as follows:
err=err+errorab
The final value b0[Y] corresponds to btest giving the lowest cumulated err value among all the tested btest values.
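The exhaustive search described above can be sketched as three nested loops. In this sketch, `samples_for` and `error_ab` are hypothetical callables standing in for the generation of scaled RGBHDR samples and for the Lab-space error between the reconstructed and original samples; the toy error model at the end is purely illustrative.

```python
from typing import Callable, Dict, Iterable, Sequence


def derive_b0(luma_values: Iterable[int],
              b_candidates: Sequence[float],
              samples_for: Callable[[int], Iterable[float]],
              error_ab: Callable[[float, int, float], float]) -> Dict[int, float]:
    """For each luma Y, keep the candidate b_test with the lowest cumulated error."""
    b0: Dict[int, float] = {}
    for y in luma_values:
        best_b, best_err = None, float("inf")
        for b_test in b_candidates:
            err = 0.0
            for sample in samples_for(y):
                err += error_ab(sample, y, b_test)  # err = err + errorab
            if err < best_err:
                best_b, best_err = b_test, err
        b0[y] = best_b
    return b0


# Toy stand-ins (assumptions): three samples per luma, error minimal at b = Y / 100.
toy_b0 = derive_b0(
    luma_values=[25, 50, 75],
    b_candidates=[0.0, 0.25, 0.5, 0.75, 1.0],
    samples_for=lambda y: [0.5, 1.0, 1.5],
    error_ab=lambda sample, y, b_test: sample * (b_test - y / 100.0) ** 2,
)
```

The cost grows as the number of luma values times the number of candidates times the number of samples, each evaluation involving a full decomposition and reconstruction.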
It can be seen that such a computation of the pre-processing colour correction b0, and thus the post-processing colour correction bp, is complex and is time and resource consuming.
There is thus a need for a new method and apparatus for encoding at least one high dynamic range picture into a coded bitstream with lower complexity, and for a corresponding decoding method and apparatus.
According to an aspect of the present principle, a method for coding at least one high dynamic range picture into a coded bitstream is disclosed. Such a method comprises:
Preferably, said at least one parameter computed from said at least one high dynamic range picture is a saturation skew parameter.
According to this principle, the computation of the pre-processing colour correction function b0 is simplified. On the encoder side, a set of pre-computed post-processing colour correction functions is defined according, for example, to different characteristics of the HDR content. Then, a specific post-processing colour correction function is selected from this set for each HDR picture of the video according to a predetermined criterion. Then, the pre-processing colour correction function b0 is computed from the selected post-processing colour correction function. Complexity is thus reduced on the encoder side.
According to another embodiment, said encoding method further comprises:
According to this embodiment, the decoder does not need to know the predetermined post-processing colour correction function bp_det. A set of pivot points representative of an adjustment function fadj is coded into the coded bitstream. Such pivot points make it possible to reconstruct at the decoder a high dynamic range picture from the coded standard dynamic range picture and a predefined post-processing colour correction function bp_default which is known to the decoder. As an example, such a predefined post-processing colour correction function bp_default could be a post-processing colour correction function that is sent to the decoder for the whole sequence or that is already defined in a compression standard.
According to another embodiment, said parameter for reconstructing said high dynamic range picture from said standard dynamic range picture is an index representative of the selected predetermined post-processing colour correction function bp_det, and said post-processing colour correction function bp corresponds to said selected predetermined post-processing colour correction function bp_det.
According to this embodiment, the set of predetermined post-processing colour correction functions bpset from which bp_det has been selected is known to the decoder. As an example, such a set may be predefined at the decoder. Such an embodiment makes it possible to reduce the decoder complexity since it is not necessary to adjust a predetermined post-processing colour correction function bp_default. Furthermore, the encoder complexity is further reduced since the adjustment function does not need to be computed at the encoder side.
According to a variant, said method further comprises a step of coding into said coded bitstream a set of parameters representative of said set of predetermined post-processing colour correction functions bpset.
According to this embodiment, the set of post-processing colour correction functions bpset is coded into the coded bitstream at a sequence level or a group of pictures level, for example. For example, such a set of post-processing colour correction functions may be transmitted to the decoder after a cut detection or at a Random Access point. This embodiment makes it possible to adapt the set of post-processing colour correction functions bpset according to the characteristics of the video sequence.
A method for decoding at least one high dynamic range picture from a coded bitstream is also disclosed. Said decoding method comprises:
According to one embodiment, said decoding method further comprises a step of decoding from said coded bitstream a set of parameters representative of said set of predetermined post-processing colour correction functions bpset.
Another aspect of the disclosure is an apparatus for coding at least one high dynamic range picture into a coded bitstream. Such a coding apparatus comprises:
Preferably, said at least one parameter computed from said at least one high dynamic range picture is a saturation skew parameter.
Another aspect of the disclosure is an apparatus for decoding at least one high dynamic range picture from a coded bitstream.
According to one embodiment, such a decoding apparatus comprises:
According to another embodiment, such a decoding apparatus further comprises means for decoding from said coded bitstream a set of parameters representative of said set of predetermined post-processing colour correction functions bpset.
Another aspect of the disclosure is a computer program comprising software code instructions for performing any one of the embodiments described in the present disclosure, when the computer program is executed by a processor.
Another aspect of the disclosure is a bitstream representative of at least one coded high dynamic range picture comprising:
According to one embodiment, such a bitstream further comprises coded data representative of said set of parameters representative of said set of predetermined post-processing colour correction functions bpset.
A non-transitory processor readable medium having stored thereon a bitstream is disclosed wherein the bitstream comprises:
The disclosure is described for encoding/decoding a color HDR picture but extends to the encoding/decoding of a sequence of pictures (video) because each color picture of the sequence is sequentially encoded/decoded as described below.
An HDR picture is first input to a module of HDR to SDR decomposition. Such a module performs HDR to SDR decomposition and outputs an SDR picture which is a dynamic reduced version of the input HDR picture.
The output SDR picture is a reshaped version of the input HDR picture such that the hue and perceived saturation are preserved and the visual quality of the SDR picture relative to the HDR picture is increased. The HDR to SDR decomposition module also outputs a set of HDR parameters which are further used for HDR picture reconstruction.
Such a set of HDR parameters comprises at least luma mapping parameters allowing to derive an inverse luma mapping table for converting SDR luma to HDR luminance.
The SDR picture is then input to an encoding module performing picture encoding. Such an encoding module may be for example an HEVC Main 10 coder suitable for encoding video and picture represented on a 10 bit-depth. The encoding module outputs a coded bitstream representative of a compressed version of SDR picture. The HDR parameters are also encoded by the encoding module as part of the coded bitstream. As an example, such HDR parameters may be coded in SEI message (Supplemental Enhancement Information message) of an HEVC Main 10 bitstream.
Such a coded bitstream may then be stored or transmitted over a transmission medium.
The method steps of the encoding system presented here are further described according to various embodiments disclosed herein with
Such a coded bitstream comprises coded data representative of an SDR picture and coded data representative of HDR parameters suitable for reconstructing an HDR picture from a decoded version of the SDR picture compressed in the coded bitstream.
Such a coded bitstream may be stored in a memory or received from a transmission medium.
The coded bitstream is first input to a decoding module performing picture decoding and HDR parameters decoding. The decoding module may be for example a decoder conforming to the HEVC Main 10 profile.
The decoding module outputs a decoded SDR picture and a set of HDR parameters. The decoded SDR picture may be displayed by a legacy SDR display (SDR output). Such an SDR picture may be viewable by an end-user from his legacy SDR display. Thus, the disclosed system is backward compatible with any SDR legacy display.
The decoded SDR picture and HDR parameters are then input to a module for SDR to HDR reconstruction. Such a module reconstructs the HDR picture from the decoded SDR picture using the given HDR parameters. Then, a decoded HDR picture is output and can be displayed by an HDR compatible display (HDR output).
The method steps of the decoding system presented here are further described according to various embodiments disclosed herein with
1—Coding an HDR Picture into a Coded Bitstream:
In step E30, luma mapping parameters are first computed according to step E30 already described with
In an optional step E16, a set of predetermined post-processing colour correction functions bpset is obtained from at least one saturation skew parameter computed from at least said high dynamic range picture.
According to an embodiment of the step E16, the set of predetermined post-processing colour correction functions bpset is obtained from a saturation skew parameter called satskew computed from a high dynamic range picture according to, for example, the method described in relation with
If one satskew value is associated with each predetermined post-processing colour correction function of the set bpset, the derivation of the satskew value from the high dynamic range picture to identify the post-processing colour correction function can be replaced by the direct identification of an index i which identifies the corresponding ith post-processing colour correction function. According to an embodiment of the step E16, the set of predetermined post-processing colour correction functions bpset is obtained from an index.
In a step E1, a first predetermined post-processing colour correction function bp_det is selected among a first set of predetermined post-processing colour correction functions bpset, according to at least one parameter pHDR computed from at least said high dynamic range picture, for instance the saturation skew parameter used in step E16.
More generally, the set of predetermined post-processing colour correction functions bpset comprises pre-computed post-processing correction functions bp(k), with k=0 to Nk, where Nk is the number of post-processing colour correction functions of the first set bpset. Each pre-computed post-processing correction function bp(k) corresponds to given characteristics of HDR content, such as color saturation or hue for example, or corresponds to a value of the parameter satskew. Equivalently, an index, actually related to the satskew value, can be derived to identify the predetermined post-processing colour correction function bp_det. According to an embodiment, the satskew parameter is computed for at least the HDR picture according to the method described in relation with
Each post-processing colour correction function bp(k) of the set bpset is thus associated with a parameter p(k) representative of a color rendering of an SDR picture obtained using the post-processing colour correction function bp(k). Such a representative parameter p(k) could be representative of the hue or saturation level of the picture as an example.
According to an embodiment of the present principle, at least one parameter pHDR is extracted from the HDR picture to code, such as the hue or color saturation level. Such parameters are obtained from an analysis of the HDR picture and are used to select a post-processing colour correction function bp_det among the first post-processing colour correction function set bpset using the corresponding representative parameter p(k) which has been associated with the post-processing colour correction functions bp(k) of the set bpset.
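One possible selection rule is sketched below. It assumes a single scalar parameter per function and a nearest-value criterion; the disclosure only requires that the selection use pHDR and the representative parameters p(k), so the criterion and the function name are assumptions.

```python
from typing import List, Sequence, Tuple


def select_bp_det(p_hdr: float,
                  p_values: Sequence[float],
                  bp_set: Sequence[List[float]]) -> Tuple[int, List[float]]:
    """Return (k, bp(k)) for the function whose parameter p(k) is closest to p_hdr."""
    if not bp_set or len(p_values) != len(bp_set):
        raise ValueError("bp_set and p_values must be non-empty and of equal length")
    k = min(range(len(bp_set)), key=lambda i: abs(p_values[i] - p_hdr))
    return k, bp_set[k]


# Toy set of three one-entry LUTs with their representative saturation levels.
idx, bp_det = select_bp_det(
    p_hdr=0.6,
    p_values=[0.2, 0.5, 0.9],
    bp_set=[[1.0], [2.0], [3.0]],
)
```

The returned index k is the quantity that may be signalled to the decoder in the embodiments where the set bpset is known on both sides.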
In a step E2, a pre-processing colour correction function b0 is determined in a manner known per se (see step E42 above) from the selected post-processing colour correction function bp_det such that, when applied to the luminance component of the HDR picture, luminance of a SDR picture is obtained. The pre-processing colour correction function and the post-processing colour correction function are for instance directly linked by equation eq.1 discussed above. The pre-processing colour correction function b0 is thus determined as follows for each Y value:
b0(Y)=bp_det(Y)×K×√(L(Y))
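The relation above amounts to one multiplication and one square root per luma value, as in the following sketch. The LUTs are represented as plain lists indexed by Y, and the values of K and L(Y) are placeholders.

```python
import math
from typing import List, Sequence


def b0_from_bp(bp_det: Sequence[float],
               luminance: Sequence[float],
               k_const: float) -> List[float]:
    """b0(Y) = bp_det(Y) * K * sqrt(L(Y)) for every luma index Y."""
    if len(bp_det) != len(luminance):
        raise ValueError("bp_det and luminance must have one entry per luma value")
    return [bp * k_const * math.sqrt(lum) for bp, lum in zip(bp_det, luminance)]


# Placeholder inputs: two luma values, K = 2.
b0_lut = b0_from_bp(bp_det=[1.0, 0.5], luminance=[4.0, 16.0], k_const=2.0)
```

Compared with an exhaustive search over candidate values, this derivation is linear in the number of luma values.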
In a step E3, the HDR picture is decomposed into an SDR picture, using the pre-processing colour correction function b0 determined as in step E2 and the luma mapping parameters obtained in step E30. Such a decomposition process is performed in a similar manner as described in relation with
In a step E4, the SDR picture is then coded into a coded bitstream. Any picture or video coding method may be used. For example, an HEVC Main 10 profile encoder may be used.
The luma mapping parameters are also coded into the coded bitstream so as to make it possible to derive the inverse LUT mapping luminance.
In a step E5, at least one parameter for reconstructing said HDR picture from a decoded version of said SDR picture and from a post-processing colour correction function bp_dec is coded into said coded bitstream.
According to an embodiment, such at least one parameter corresponds to a set of pivot points representative of a piecewise linear adjustment function fadj used on the decoding side, to adjust as described below a default post-processing colour correction function bp_default to the post-processing colour correction function bp_det determined at step E1. As this function is piecewise linear, each linear segment between any two neighbouring pivot points can be determined so as to define this adjustment function.
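A piecewise linear function can be evaluated from its pivot points as sketched below. The sketch assumes pivots sorted by strictly increasing x and holds the end values constant outside the pivot range; both are assumptions, as is the function name.

```python
from bisect import bisect_right
from typing import Sequence, Tuple


def eval_piecewise_linear(pivots: Sequence[Tuple[float, float]], x: float) -> float:
    """Evaluate at x the piecewise linear function defined by (x, y) pivot points."""
    xs = [p[0] for p in pivots]
    if x <= xs[0]:
        return pivots[0][1]   # clamp below the first pivot (assumption)
    if x >= xs[-1]:
        return pivots[-1][1]  # clamp above the last pivot (assumption)
    i = bisect_right(xs, x) - 1  # segment [xs[i], xs[i + 1]) containing x
    (x0, y0), (x1, y1) = pivots[i], pivots[i + 1]
    t = (x - x0) / (x1 - x0)
    return y0 + t * (y1 - y0)


# Three pivots over a 10-bit luma range.
PIVOTS = [(0.0, 1.0), (512.0, 2.0), (1023.0, 2.0)]
f_mid = eval_piecewise_linear(PIVOTS, 256.0)  # halfway along the first segment
```

Only the pivot coordinates need to be transmitted; the full function is rebuilt by interpolation for every luma value.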
In a step E6, a second predetermined post-processing colour correction function bp_default is selected among a second set of predetermined post-processing colour correction functions bsetp_default. The second set bsetp_default comprises pre-defined default LUTs bp_default[k], k=1 to N, which are predefined on the decoder side. For instance, one LUT is defined for each triple (container colour gamut, content colour gamut, peak luminance). In step E6, bp_default is selected from this second set according to the HDR picture characteristics (container colour gamut, content colour gamut, peak luminance). Such characteristics are part of the picture parameters and are sent to the decoder in the coded bitstream. On the decoder side, it is thus possible to select the corresponding post-processing colour correction function bp_default.
At a step E7, an adjustment function fadj is determined. Said adjustment function fadj is determined by taking into account said selected predetermined post-processing colour correction function bp_default and said predetermined post-processing colour correction function bp_det selected at step E1. The adjustment function fadj is built to map as much as possible the function bp_default to the selected function bp_det by minimizing the difference between bp_det and bp_dec where bp_dec is set as:
bp_dec[Y]=fadj[Y]×bp_default[Y] (eq.2)
The adjustment function fadj is built so that bp_dec is as close as possible to bp_det for all Y values. In the present embodiment, fadj is determined by minimization of an error based on equation eq.2; however, any type of relationship may be used.
Then, the fadj function is coded in step E5 and transmitted to the decoder side. The function fadj is modeled using pivot points of a piece-wise linear model. In step E5, only the set of pivot points representative of the fadj function is coded. The x and y components of each such pivot point are coded into the coded bitstream, for example as part of the HDR parameters as described in
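As a simplified sketch of this minimization, the unconstrained optimum of eq.2 is the pointwise ratio bp_det[Y]/bp_default[Y], which drives the error to zero wherever bp_default[Y] is non-zero. The fallback value used where bp_default[Y] is zero and the squared-error measure are assumptions; a practical encoder would additionally approximate this ideal function with a small number of pivot points.

```python
from typing import List, Sequence


def ideal_adjustment(bp_det: Sequence[float],
                     bp_default: Sequence[float],
                     eps: float = 1e-12) -> List[float]:
    """Pointwise f_adj[Y] such that f_adj[Y] * bp_default[Y] equals bp_det[Y]."""
    # Where bp_default is (almost) zero no ratio exists; use 1.0 (assumption).
    return [d / f if abs(f) > eps else 1.0 for d, f in zip(bp_det, bp_default)]


def apply_adjustment(f_adj: Sequence[float], bp_default: Sequence[float]) -> List[float]:
    """eq.2: bp_dec[Y] = f_adj[Y] * bp_default[Y]."""
    return [f * b for f, b in zip(f_adj, bp_default)]


def squared_error(bp_det: Sequence[float], bp_dec: Sequence[float]) -> float:
    """Sum of squared differences between target and reconstructed LUTs."""
    return sum((d - c) ** 2 for d, c in zip(bp_det, bp_dec))


BP_DET = [2.0, 3.0, 4.0]
BP_DEFAULT = [1.0, 2.0, 4.0]
f_adj = ideal_adjustment(BP_DET, BP_DEFAULT)
bp_dec = apply_adjustment(f_adj, BP_DEFAULT)
```

Any residual error in a real system therefore comes from the piecewise linear approximation of fadj rather than from eq.2 itself.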
According to a variant, the set of predetermined post-processing colour correction functions bpset comprises a single pre-computed post-processing correction function bp.
According to a variant, the set of predetermined post-processing colour correction functions bpset is obtained for each high dynamic range picture of a video sequence.
In this embodiment, steps E30, E1, E2, E3 and E4 are performed similarly as in the embodiment described with
According to the embodiment described with
According to the present embodiment, in step E5, an index idx is coded into the coded bitstream, for example using a fixed length code. Said index idx is representative of the first predetermined post-processing colour correction function bp_det selected from the first set bpset.
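A fixed length code for the index can be sketched as follows. The bit width, derived here from the number of functions in the set, and the most-significant-bit-first string representation are illustrative assumptions rather than a syntax defined by the disclosure.

```python
def fixed_length_bits(num_functions: int) -> int:
    """Smallest number of bits able to represent indices 0 .. num_functions - 1."""
    if num_functions < 1:
        raise ValueError("the set must contain at least one function")
    return max(1, (num_functions - 1).bit_length())


def encode_index(idx: int, num_functions: int) -> str:
    """Write idx as a fixed-length binary string, most significant bit first."""
    if not 0 <= idx < num_functions:
        raise ValueError("index out of range")
    width = fixed_length_bits(num_functions)
    return format(idx, "0{}b".format(width))


def decode_index(bits: str) -> int:
    """Read back an index written by encode_index."""
    return int(bits, 2)


code = encode_index(5, 8)  # a set of 8 functions needs 3 bits per index
```

With such a code the per-picture overhead is a few bits, compared with a set of pivot point coordinates in the adjustment-function embodiment.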
In this embodiment, such a first set bpset should be known at the decoder. For instance, the first set of post-processing colour correction functions is predefined at the decoder.
Alternatively, in a step E8, such a first set bpset of post-processing colour correction functions is coded into the coded bitstream and transmitted to the decoder. For example, this first set bpset is coded at a sequence level with the video sequence parameters and stored by the decoder during all the decoding process of the pictures of the video.
Each post-processing colour correction function bp(k) of the first set bpset is coded as a one dimension array comprising a number of NbY elements, where NbY represents the number of luma values Y of the SDR picture.
According to a preferred variant of this embodiment, the corresponding representative parameter p(k) associated with a post-processing colour correction function bp(k) is also coded into the coded bitstream. According to this variant, the selected first post-processing colour correction bp_det could be determined at the decoder using a corresponding parameter pHDR which could be sent to the decoder. As an example, such parameter pHDR is coded into the coded bitstream with information at a picture parameter level.
Each function of the set bpset is coded as a one dimension array comprising a number of N elements, where N represents the number of luma values Y of the SDR picture.
According to a first variant, before encoding pictures of a video sequence, Nk post-processing colour correction LUTs bp(k) are pre-computed. Each post-processing colour correction LUT bp(k) is associated with a parameter p(k) representative of a color rendering of an SDR picture (e.g. hue or saturation level) derived using such post-processing colour correction LUT bp(k). These post-processing colour correction LUTs bp(k) are computed using a learning process from a large set of representative HDR pictures.
In a step E60, for each HDR picture of this set, a default post-processing colour correction bpdef LUT is used to generate a default SDR picture. The SDR picture is generated according to the decomposition process described according to
b0
where K is a constant value, L is the linear-light luminance derived from L=invLUTTM[Y], with invLUTTM being the square-root of the inverse function of the LUTTM.
Starting from this default SDR picture, a colorist modifies this default post-processing colour correction LUT bpdef to optimize the color rendering of the SDR picture (hue and/or saturation). The resulting post-processing colour correction LUT bpres is associated with the HDR input picture.
In a step E61, a classification algorithm gathers the HDR pictures presenting common characteristics in a subset. For each subset of HDR pictures, an average of the LUTs bpres associated with the HDR pictures of the subset is computed to obtain one representative post-processing colour correction LUT bp(k) per subset. The classification algorithm results in Nk subsets, each subset k being associated with a representative post-processing colour correction LUT bp(k).
In a step E62, for each subset k, at least one parameter p(k) is extracted from the HDR pictures of the subset, such as saturation level or hue. Such an extracted parameter is representative of the subset. This parameter allows the subsets to be distinguished from one another.
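The averaging part of step E61 can be sketched as follows, assuming the classification has already assigned a subset label to each HDR picture. The function name representative_luts and the list-based LUT representation are hypothetical; the classification algorithm itself is not shown, since the text does not specify it.

```python
from collections import defaultdict


def representative_luts(luts, labels):
    """Average the colorist-tuned LUTs bpres of each subset, entry by entry.

    luts   : list of LUTs, each a list of floats of identical length
    labels : subset index k assigned to each LUT by the classification
    Returns a dict mapping k to the representative LUT bp(k).
    """
    if len(luts) != len(labels):
        raise ValueError("one label is required per LUT")
    groups = defaultdict(list)
    for lut, k in zip(luts, labels):
        groups[k].append(lut)
    reps = {}
    for k, members in groups.items():
        if len({len(m) for m in members}) != 1:
            raise ValueError("all LUTs must have the same number of entries")
        n = len(members)
        # zip(*members) walks the LUTs column by column (one luma value at a time).
        reps[k] = [sum(column) / n for column in zip(*members)]
    return reps
```

The number of keys in the returned dictionary corresponds to Nk, the number of subsets produced by the classification.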
In the coding method described according to one embodiment, in reference to
At the end of the post-processing colour correction LUT computation method described above, the set of post-processing colour correction functions bpset comprises the Nk post-processing colour correction functions computed at step E61.
Then, the computed set of post-processing colour correction functions bpset and the corresponding representative parameter of each subset associated with a post-processing colour correction function bp(k) are input to step E1 for selecting a post-processing colour correction function bp_det as disclosed in
According to a second variant, before encoding pictures of a video sequence, K post-processing colour correction LUTs bp are pre-computed for the whole sequence. For example, K is equal to 3.
In a step E60, pre-processing LUTs b0[k] (k going from 0 to K−1) are first computed according, for example, to the method described in relation with
In the present embodiment, each pre-processing LUT b0[k] is computed for different saturation skew values. For instance, in the case where K is equal to 3, the following saturation skew values are used:
As these pre-processing LUTs are computed for all the pictures of a video sequence, in the present embodiment, the minimization is done with a large number of Tone Mapping parameters. This makes it possible to optimize the accuracy of the computed LUTs compared to the minimization described above, in which a single set of Tone Mapping parameters (that of the current picture) was used.
In a variant, the LUTs may be computed off-line (no real-time limitation). The minimization can then be done with full precision, which also optimizes the accuracy of the LUTs.
In a step E61, for each k from 0 to K−1, the post-processing colour correction function bp[k] is derived from the pre-processing colour correction function b0[k] computed at step E60, using equation (3) disclosed above:
The set of post-processing colour correction functions bpset comprises the K post-processing colour correction functions computed at step E61.
Then, the computed set of post-processing colour correction functions bpset is input to step E1 for selecting a post-processing colour correction function bp_det as described in relation with
According to this embodiment, the pre-processing colour correction functions b0[k] are obtained from a minimization of an error value errorab (expressed in Lab color space), between RGBsdr and RGBhdr.
The process is independent from the content. It applies in the container colour gamut and takes into account the content color gamut. In order to better control the HDR to SDR decomposition and thus the quality of the resulting SDR picture, the computation of a pre-processing colour correction function b0[k] is controlled by a satskew parameter (saturation skew parameter). Thus, the color saturation of the derived SDR signal can be controlled.
For computing the pre-processing colour correction function b0[k], for each luma value Y of an HDR picture, the following steps are applied:
In a step 130, the luminance L is generated using the inverse function of LUTTM: L=invLUT[Y].
In step 140, the best β0[Y] for luminance L (and therefore for luma Y) is identified as follows:
For each luma value Y,
According to an embodiment of the step 1.2.3, an error in the Lab color space is calculated between YUVSDR and RGBHDR as follows:
Convert RGB_hdr to XYZ_hdr (container gamut)
Generate reference Yref_hdr for HDR
a_hdr=500×(f(X_hdr)−f(Y_hdr))
b_hdr=200×(f(Y_hdr)−f(Z_hdr))
a_sdr=500×(f(X_sdr)−f(Y_sdr))
b_sdr=200×(f(Y_sdr)−f(Z_sdr))
error=(a_hdr−a_sdr)²+(b_hdr−b_sdr)²
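The chroma error above can be sketched as follows. The function f is not defined in this passage; the sketch assumes it is the standard CIE Lab companding function and that the XYZ values passed in are already normalised to the reference white. The names lab_f and ab_error are hypothetical, and the conversions from RGB or YUV to XYZ are left out.

```python
def lab_f(t):
    """CIE Lab companding function (assumed to be the f used in the text)."""
    delta = 6.0 / 29.0
    if t > delta ** 3:
        return t ** (1.0 / 3.0)
    return t / (3.0 * delta ** 2) + 4.0 / 29.0


def ab_error(xyz_hdr, xyz_sdr):
    """Squared distance in the (a, b) plane between an HDR and an SDR colour.

    Both arguments are (X, Y, Z) triples, assumed white-normalised.
    """
    x_h, y_h, z_h = xyz_hdr
    x_s, y_s, z_s = xyz_sdr
    a_hdr = 500.0 * (lab_f(x_h) - lab_f(y_h))
    b_hdr = 200.0 * (lab_f(y_h) - lab_f(z_h))
    a_sdr = 500.0 * (lab_f(x_s) - lab_f(y_s))
    b_sdr = 200.0 * (lab_f(y_s) - lab_f(z_s))
    return (a_hdr - a_sdr) ** 2 + (b_hdr - b_sdr) ** 2
```

Because only the a and b terms enter the error, two colours that differ only in lightness yield a small error, which matches the stated aim of controlling hue and saturation rather than luminance.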
The HDR picture is analyzed and hue and saturation histograms are computed for the HDR picture. Then a pre-analysis of such histograms is used to determine the satskew parameter as follows. The higher the saturation is, the more the satskew value will increase. The satskew parameter is then used to select a post-processing colour correction function bp_det among the post-processing colour correction function set bpset.
The satskew parameter value is determined using histograms based on the HDR picture characteristics (saturation, hue, luma). The algorithm is summarized by the following:
Where the sRGB function is:
We define maxRGB as maxRGB=max[R′,B′,G′] and minRGB as minRGB=min [R′,B′,G′]
The saturation S is computed by:
By definition, the saturation lies in [0; 1]. The histogram is computed over the whole picture and consists of, for example, 101 bins of width 0.01.
From the highest to the lowest saturations, we sum the histogram bin counts until we reach 5%, 10% and 20% of the image size. This allows us to define S5P, S10P, S20P as the sets of pixels that are the 5%, 10% and 20% most saturated pixels of the image. These pixels can be characterized by saturation, hue and luminance metrics. The average saturation values
The luminance is averaged on S5P, S10P and S20P to give the meanL5P, meanL10P and meanL20P metrics.
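The accumulation of the saturation histogram from the most saturated bins downward can be sketched as follows. The function names, the rounding rule that maps a saturation value to one of the 101 bins, and the use of whole bins to delimit the sets are assumptions made for this sketch.

```python
def saturation_histogram(saturations, n_bins=101):
    """Histogram of saturation values in [0, 1]; bin i is centred on i * 0.01."""
    hist = [0] * n_bins
    for s in saturations:
        if not 0.0 <= s <= 1.0:
            raise ValueError("saturation must lie in [0, 1]")
        hist[int(round(s * (n_bins - 1)))] += 1
    return hist


def top_percent_bin(hist, percent):
    """Lowest bin b such that bins b..end hold at least percent % of the pixels.

    Bins are summed from the highest saturation downward, as in the text.
    Integer arithmetic avoids rounding issues at exact percentages.
    """
    total = sum(hist)
    if total == 0:
        raise ValueError("empty histogram")
    acc = 0
    for b in range(len(hist) - 1, -1, -1):
        acc += hist[b]
        if acc * 100 >= percent * total:
            return b
    return 0
```

Under these assumptions, the pixels whose saturation falls in bin top_percent_bin(hist, 5) or above approximate the set S5P, and likewise for S10P and S20P with 10 and 20. Because whole bins are taken, a set may hold slightly more pixels than the nominal percentage.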
The luminance is computed by
L=M_1×R+M_2×G+M_3×B
Where, in a 709 container:
The algorithm needs to determine the main color of the most saturated pixels. Therefore, while computing the saturation histogram, hue histograms are also computed.
The hue values represent color through angles. The red colors are around 0°, the green colors are around 120° and the blue colors are around 240°.
The hue value is determined as follows:
Hence three histograms are computed (hue_hist_R for the red colors which contains the hueR values, hue_hist_G for the green colors which contains the hueG values and hue_hist_B for the blue colors which contains the hueB values). Only the defined hue values are considered in the rest of the algorithm.
The hue histograms are indexed by saturation values and also consist of 101 bins of width 0.01. From S5P, S10P and S20P, the following sets are derived:
Determination of the Satskew Value from Metrics
The algorithm is based on the comparison of the previous metrics with some thresholds. An example for three satskew values (5, 10 and 15) is proposed.
We define the thresholds used for a satskew value of ten by:
If (mean luma on the image<5) and (
Or (mean luma on the image<10) and (ratioB20P>0.7)
Or (mean luma on the image<20) and (ratioB20P>0.8)
Or (mean luma on the image<30) and (ratioB20P>0.9))
In a step E9, an SDR picture is decoded from said coded bitstream. For example, when the coded bitstream is conformant with an HEVC Main 10 profile, the coded bitstream is decoded according to the corresponding decoding process.
In step E9, HDR parameters are also decoded from the coded bitstream. The HDR parameters may comprise at least: luma mapping parameters from which a LUT for mapping SDR luma to HDR luma can be derived, and reconstruction parameters such as the v, a, and b parameters used to derive luma values from the decoded luma samples of the SDR picture.
In a step E14, the LUT invLUT for mapping luma values to luminance values is derived from the luma mapping parameters.
According to the present embodiment, in a step E10, an index representative of a predetermined post-processing colour correction function bp_det among a set of predetermined post-processing colour correction functions bpset is decoded.
According to one variant, the set of predetermined post-processing colour correction functions bpset is predefined at the decoder.
In a step E12, the post-processing colour correction function bp_det is selected according to the decoded index idx.
In a step E13, the HDR picture is then reconstructed from the decoded SDR picture using the selected post-processing colour correction function bp_det. Such reconstruction step E13 is performed similarly to the reconstruction process described in
According to another variant, in a step E11, the set of predetermined post-processing colour correction functions bpset is decoded from the coded bitstream.
According to this embodiment, the coded bitstream comprises a set of pivot points representative of an adjustment function fadj used to adjust a post-processing colour correction function bp_default known at the decoder.
In step E9, the SDR picture and HDR parameters are decoded from the coded bitstream.
In step E14, the LUT invLUT for mapping luma values of SDR picture to luminance values of HDR picture is derived from the luma mapping parameters.
According to the present embodiment, in step E10, the pivot points representative of the adjustment function fadj are decoded.
In step E12, a post-processing colour correction function bp_dec is built from the adjustment function fadj and a predetermined post-processing colour correction function bp_default.
According to this embodiment, the post-processing colour correction function bp_default is selected among a set of predetermined post-processing colour correction functions bp_defaultset, wherein the post-processing colour correction functions (LUTs) are predefined at the decoder. For instance, one LUT is defined for each triple (container colour gamut, content colour gamut, peak luminance). The post-processing colour correction function bp_default is identified according to the content characteristics parameters coded at the picture or sequence level in the coded bitstream.
The post-processing colour correction function bp_dec is then built according to equation (eq. 2).
In step E13, the HDR picture is then reconstructed from the decoded SDR picture using the adjusted post-processing colour correction function bp_dec. Such reconstruction step E13 is performed similarly to the reconstruction process described in
On
Device 110 comprises the following elements, which are linked together by a data and address bus 111:
According to a variant, the battery 116 is external to the device. Each of these elements of
RAM 114 comprises, in a register, the program executed by the CPU 112 and uploaded after switch-on of the device 110, input data in a register, intermediate data in different states of the method in a register, and other variables used for the execution of the method in a register.
The implementations described herein may be implemented in, for example, a method or a process, an apparatus, a software program, a data stream, or a signal. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method or a device), the implementation of features discussed may also be implemented in other forms (for example a program). An apparatus may be implemented in, for example, appropriate hardware, software, and firmware. The methods may be implemented in, for example, an apparatus such as, for example, a processor, which refers to processing devices in general, including, for example, a computer, a microprocessor, an integrated circuit, or a programmable logic device. Processors also include communication devices, such as, for example, computers, cell phones, portable/personal digital assistants (“PDAs”), and other devices that facilitate communication of information between end-users.
According to a specific embodiment of encoding or encoder, the HDR color picture is obtained from a source. For example, the source belongs to a set comprising:
According to different embodiments of the decoding or decoder, the HDR decoded picture is sent to a destination; specifically, the destination belongs to a set comprising:
According to different embodiments of encoding or encoder, the coded bitstream is sent to a destination. As an example, the coded bitstream is stored in a local or remote memory, e.g. a video memory (114) or a RAM (114), a hard disk (113). In a variant, the bitstream is sent to a storage interface, e.g. an interface with a mass storage, a flash memory, ROM, an optical disc or a magnetic support and/or transmitted over a communication interface (115), e.g. an interface to a point to point link, a communication bus, a point to multipoint link or a broadcast network.
According to different embodiments of decoding or decoder, the bitstream is obtained from a source. Exemplarily, the bitstream is read from a local memory, e.g. a video memory (114), a RAM (114), a ROM (113), a flash memory (113) or a hard disk (113). In a variant, the bitstream is received from a storage interface, e.g. an interface with a mass storage, a RAM, a ROM, a flash memory, an optical disc or a magnetic support and/or received from a communication interface (115), e.g. an interface to a point to point link, a bus, a point to multipoint link or a broadcast network.
According to different embodiments, device 110 being configured to implement an encoding method described in relation with
According to different embodiments, device 110 being configured to implement a decoding method described in relation with
Implementations of the various processes and features described herein may be embodied in a variety of different equipment or applications. Examples of such equipment include an encoder, a decoder, a post-processor processing output from a decoder, a pre-processor providing input to an encoder, a video coder, a video decoder, a video codec, a web server, a set-top box, a laptop, a personal computer, a cell phone, a PDA, and any other device for processing a picture or a video or other communication devices. As should be clear, the equipment may be mobile and even installed in a mobile vehicle.
Additionally, the methods may be implemented by instructions being performed by a processor, and such instructions (and/or data values produced by an implementation) may be stored on a computer readable storage medium. A computer readable storage medium can take the form of a computer readable program product embodied in one or more computer readable medium(s) and having computer readable program code embodied thereon that is executable by a computer. A computer readable storage medium as used herein is considered a non-transitory storage medium given the inherent capability to store the information therein as well as the inherent capability to provide retrieval of the information therefrom. A computer readable storage medium can be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. It is to be appreciated that the following, while providing more specific examples of computer readable storage mediums to which the present principles can be applied, is merely an illustrative and not exhaustive listing as is readily appreciated by one of ordinary skill in the art: a portable computer diskette; a hard disk; a read-only memory (ROM); an erasable programmable read-only memory (EPROM or Flash memory); a portable compact disc read-only memory (CD-ROM); an optical storage device; a magnetic storage device; or any suitable combination of the foregoing.
The instructions may form an application program tangibly embodied on a processor-readable medium.
Instructions may be, for example, in hardware, firmware, software, or a combination. Instructions may be found in, for example, an operating system, a separate application, or a combination of the two. A processor may be characterized, therefore, as, for example, both a device configured to carry out a process and a device that includes a processor-readable medium (such as a storage device) having instructions for carrying out a process. Further, a processor-readable medium may store, in addition to or in lieu of instructions, data values produced by an implementation.
As will be evident to one of skill in the art, implementations may produce a variety of signals formatted to carry information that may be, for example, stored or transmitted. The information may include, for example, instructions for performing a method, or data produced by one of the described implementations. For example, a signal may be formatted to carry as data the rules for writing or reading the syntax of a described embodiment, or to carry as data the actual syntax-values written by a described embodiment. Such a signal may be formatted, for example, as an electromagnetic wave (for example, using a radio frequency portion of spectrum) or as a baseband signal. The formatting may include, for example, encoding a data stream and modulating a carrier with the encoded data stream. The information that the signal carries may be, for example, analog or digital information. The signal may be transmitted over a variety of different wired or wireless links, as is known. The signal may be stored on a processor-readable medium.
A number of implementations have been described. Nevertheless, it will be understood that various modifications may be made. For example, elements of different implementations may be combined, supplemented, modified, or removed to produce other implementations. Additionally, one of ordinary skill will understand that other structures and processes may be substituted for those disclosed and the resulting implementations will perform at least substantially the same function(s), in at least substantially the same way(s), to achieve at least substantially the same result(s) as the implementations disclosed. Accordingly, these and other implementations are contemplated by this application.
Number | Date | Country | Kind |
---|---|---|---|
16305527.0 | May 2016 | EP | regional |
16305529.6 | May 2016 | EP | regional |