Digital imaging generally involves an image source, such as a camera to capture an image or video of a natural scene, a digital representation of captured source video, and a display for reproducing the video for a viewer. Each stage of a digital imaging system, including source, representation, and display, have imaging parameters, including limitations, for the types of images they can produce, represent, or display. For example, each image source, representation, and display can support a certain dynamic range of brightness, from darkest to brightest, and can support only a certain color gamut. When a source, representation, and display in an image processing system do not have matching parameters, a conversion process may be required.
High-dynamic range (HDR) video has become common in some video systems, including cameras capable of capturing HDR video, HDR digital image representations, and HDR displays. HDR video can be compared to standard dynamic range (SDR) video system. HDR images or videos have a wider dynamic range of luminosity, as compared to traditional SDR images or videos. HDR enables capturing (or representing in a data format, or displaying) scenes, for example, with both a very bright region, such as a bright sun-lite sky, and a dark region, while preserving image details in both bright and dark regions.
Generally, a need for improvement is perceived in systems where imaging parameters such as dynamic range may vary between the stages of an imaging system.
This disclosure provides techniques for adjusting image data for a virtual display. When HDR images or videos are captured and have to be displayed on an SDR display, then dynamic range compression may be needed. Conversely, if SDR images or videos are to be displayed on an HDR display, then dynamic range expansion may be needed. By adjusting video into an intermediate format for a virtual display, consistent viewing experiences may be enabled on a real physical display. Embodiments include determining a tone curve based on imaging parameters of a source representation and a virtual display, and then adjusting source video to an intermediate format with the tone curve. Intermediate format video data can then be formatted for any particular final target display. In some embodiments, tone curves for adjustment of chroma data to the virtual display may be derived from a luma tone curve.
If imaging parameters do not match between the media source 130 and the target display 120, a conversion process may be performed. Many target displays include the ability to accept more than one format of input video, and then convert the accepted formats as necessary to accommodate the target display's native physical imaging parameters. However, format conversion inside the media player may be advantageous for several reasons. The format conversion techniques within a target display may be of poor quality, or a delay may be incurred by the target display when the imaging parameters of a video source change. For example, a source media item may have been generated according to several source representations, which may change at various points during rendering of a media item. When a switch between different representations occurs, a target display, which is designed for real time rendering of media content, may incur a delay, such as up to several seconds, before presenting video with the new imaging parameters. Such a delay while switching between different imaging parameters may cause a poor viewer experience, such as a momentary blank screen or a reset of a viewer's custom settings. In some cases, a target display may not be able to detect changes in source video, requiring a viewer to intervene by manually changing conversion settings.
Format conversion at media player 110 may also be advantageous to enable blending of imaging sources at the media player where the blended image sources have different imaging parameters. For example, media player 110 may have a graphical user interface that is synthetically rendered by the media player and composited into images from media source 130 for a combined presentation at target display 120. Alternately, two separate media sources may be combined with a picture-in-picture effect at the media player. A format conversion at the media player may be necessary to blend or composite images at the media player where the different sources have different imaging parameters.
Media player 110 may convert any video source data to conform to imaging parameters of a hypothetical virtual display for later consumption by target display 120. The imaging parameters of the virtual display may be selected to be a superset of possible imaging parameters of possible media sources, and may be selected to be a superset of possible target displays. For example, a virtual display may be selected to have a dynamic range as great, a color gamut as wide, and a sample precision as high as the largest, widest, and highest of any possible sources and target displays. In one example, source media in 8-bit SDR format may be converted to a virtual display having imaging parameters that match HDR with 10-bit precision.
For example, a source video with a maximum brightness of 150 nits could be displayed on a target display with a brightness of 300 nits. A brute-force mapping, where a source pixel of brightest possible value representing 150 nits is presented on a target display at the target display at 300 nits, would result in brightening the source image. A conversion of the source image with a tone curve that, for example, maps source image brightness non-linearly to a virtual display, may better preserve the look of the original source image. A conversion to a virtual display with one or more tone curves may prevent brightening an image or supersaturating colors. As display technology improves and target displays become capable of presenting even brighter images with larger color gamuts, the potential for distorting source images may grow.
Parameters 204 may include imaging parameters such as a white point and a black point of source video or a virtual display. In some embodiments, parameters may also include data such as the ambient light measured or estimated in the environment around or outside a particular target display. Parameters 204 may also specify the display technology used in a particular target display.
Tone curve basis functions 202.1-202.3 may be predetermined, for example based on perceptual modeling of a human visual system, including models of human perceived change in brightness or color. Parameters 204 may include imaging parameters of the source video and imaging parameters of a virtual display. In some cases, imaging parameters of a virtual display may be determined based on the imaging parameters of a specific known target display or based on a collection of possible target displays. Weights may be assigned in box 206, for example, by selecting a predetermined matrix of weights for basis functions based on, for example, a white point and black point of the source video, and based on the white point and black point of the virtual display. Alternately, weights may be assigned in box 206 as an algorithmic function of parameters 204.
Luma tone curve may be created by combining in 222 the basis functions 202.1-202.3 weighted by 208.1-208.3. In other embodiments, different numbers of basis functions and weights may be used, for example, more or less than three basis functions. The combining operation of 222 may be performed by piecewise addition of the weighted basis functions. In other embodiments, the combining operation of 222 may produce a sum of the weighted basis functions input to 222.
A tone curve, such as the luma tone curve 221 or chroma tone curves 232, may function to map a single input scalar value to an output scalar value. For example, luma tone curve 221 may map the luma value of a source pixel to a luma value in the virtual display domain. Similarly, a first chroma tone curve may map a first chroma value of a source pixel to a first chroma value of the virtual display. A tone curve may be implemented, for example, as a look-up-table (LUT), or may be implemented as an algorithmic function such as a piece-wise linear function.
In some embodiments, the imaging parameters of the virtual display may be determined based on parameters of a particular target display, and the format adjustment in box 214 may include adjustments to pixel data related to a particular target display. For example, adjustments to source pixel data may be made in box 214 that account for a measured or estimated ambient light at a target display, and may include an adjustment based on the display type of the target display. A display type of a target display may include specification of a physical display technology such as CRT, LCD, or OLED. Adjustments to source data based on a particular target display may be made after application of tone curves. Alternately, adjustments to source data based on a particular target display may be made by selecting different weighting of basis functions in box 206, or by modifying the basis functions 202.1-202.3.
Format adjustment in box 218 may involve data packaging changes, such as altering or creating image header information, and inserting metadata. In contrast with box 214, which may change the value or relationship of pixel data such as non-linear mappings of pixel values, box 218 may create or change image metadata, or change the organization of pixel data, without changing pixel values. In some embodiments, data about the image adjustments of box 214 may be added as metadata to the output video 220, for example in the format adjustment of box 218. Such added metadata may include specification of some or all of the parameters 204, the weights 206, the basis functions 202.1-202.3, the luma tone curve 221 and/or the chroma tone curve 232.
In alternate embodiments, a conversion profile for converting source image data may be derived from parameters describing characteristics of a domain of source image data and characteristics of a domain of a target rendering environment. Source image data may be converted according to the conversion profile, and then output to a target display or rendering environment. For example, the conversion profile may be derived from a stored library of basis functions, where contribution of individual basis functions may be weighted according to the characteristics of the source image domain and the target rendering domain.
In other embodiments, adjustments for ambient light and display type may be made by adjusting one or more tone curves before applying the tone curves, in contrast to the adjustments for ambient light and display type depicted in
In an embodiment, the image adjustment techniques described herein may be performed by a central processor of a computer system.
The central processor 510 may read and execute various program instructions stored in the memory 540 that define an operating system 512 of the system 500 and various applications 514.1-514.N. The program instructions may perform image processing according to the techniques described herein. As it executes those program instructions, the central processor 510 may read from the memory 540 the image data received by network interface 530, and it may generate tone curves and adjust the received source images as described hereinabove. The memory 540 may store the program instructions on electrical-, magnetic- and/or optically-based storage media.
The controller 260 of
Several embodiments of the disclosure are specifically illustrated and/or described herein. However, it will be appreciated that modifications and variations of the disclosure are covered by the above teachings and within the purview of the appended claims without departing from the spirit and intended scope of the disclosure.