Color displays made using different technologies, or even those of the same technology, often have different primaries and thus different color gamuts. To ensure consistent rendering of an image across different displays, standard color communication protocols such as those promulgated by the International Color Consortium (ICC)—ICC color profiles—may be used to specify how to convert from an image's “native” color space to the target display's color space. In practice, however, some displays either do not support color management or do not support the full ICC specification. (See ICC specification ICC.1:2010-12 (Profile version 4.3.0.0), which is technically equivalent to ISO 15076-1:2010.)
Wider gamut systems, often newer on the market and designed with an awareness of the need to be compatible with older devices, mostly support some form of color management so that images with an attached color profile can be properly displayed. For still images without a color profile, current prevailing practice is to assume they are to be rendered for the sRGB color space. In this way most of the commonly available images can display properly on newer wide-gamut displays, though not taking advantage of the expanded color gamut.
To actually have colors that take advantage of a wide-gamut display, images need to be rendered for the wider gamut during the image's capture so that saturated colors are not clipped. If an image rendered for a wide-gamut display is shown on an sRGB display without color management however, the image's colors will appear desaturated. Herein lies the difficulty with maintaining backward compatibility, especially during the commercial transition period when there is a mixture of systems on the market (e.g., sRGB and wide-gamut displays), all of which do not support proper color management.
In one embodiment the disclosed technology provides a method to render wide-gamut images correctly on color managed wide-gamut display units and, without modification, on non-color managed display units. The method includes receiving first wide-gamut image data of a scene encoded in a first wide-gamut color space (e.g., the P3 color space) and obtaining a first forward transform to convert the first wide-gamut image data into first narrow-gamut image data, the first narrow-gamut image data having a first narrow-gamut color space (e.g., sRGB), where the first wide-gamut color space (e.g., the ProPhoto RGB color space) is larger than the first narrow-gamut color space (e.g., sRGB). The first forward transform may be applied to the first wide-gamut image data to generate the first narrow-gamut image data. A first recover or reverse transform may be obtained (generated or selected from a number of pre-determined transforms) to convert the first narrow-gamut image data to second wide-gamut image data, where the second wide-gamut image data is also encoded in the first wide-gamut color space. Note, if the reverse transform were “perfect” and there was no clipping during the conversion of the first wide-gamut image data to the first narrow-gamut image data, the first and second wide-gamut image data would be the same (within a round-off or computational error in any actual implementation). A first dual-target image (DTI) may be generated by storing the first narrow-gamut image data in a first data portion of a DTI file and the first reverse transform in a first metadata portion of the first DTI file. In one embodiment the first wide-gamut image data may be used to determine a first wide-gamut size metric which may be used to determine the first forward and reverse transforms. In another embodiment, the first gamut size metric may be stored in the DTI's first metadata portion so as to be available during subsequent display operations. If a display at which a DTI is received does not support color management (regardless of whether it is a wide-gamut or narrow-gamut display), the narrow-gamut image data may be displayed directly. If the display supports color management, the narrow-gamut image data may be transformed into the display's color space for display (e.g., through a profile connection space, PCS). In other embodiments computer instructions designed to cause a programmable display system (e.g., a portable media player or a mobile telephone) to perform the described operations may be implemented.
In the future, if industry adopts the P3 gamut as the display standard and later moves to even wider gamut displays (identified here as “P3+”), the same dual-target scheme described herein may be used during that transition. The dual-target formulation disclosed here is generally applicable when expanding from any gamut to a wider gamut.
This disclosure pertains to systems, methods, and computer readable media for successfully rendering wide-gamut images. In general, techniques are disclosed for displaying wide-gamut images as intended on color-managed wide-gamut display systems while rendering a visually consistent image when rendered on targeted narrow-gamut display systems (regardless of whether the narrow-gamut displays are color-managed). For this reason, an image represented in accordance with this disclosure is referred to as a dual-target image (DTI): one target being the image's original wide-gamut color space, the other target being a specified narrow-gamut color space. In one embodiment, the wide-gamut image may be an image represented in the P3 color space. In another embodiment, the wide-gamut image may be an image represented in the Reference Output Medium Metric (ROMM) RGB color space (also referred to as the ProPhoto RGB color space). The narrow-gamut image may, for example, be the sRGB color space although any color space that is smaller than, and wholly enclosed by, the wider gamut color space may be used. Conversely, the wide-gamut color space may be any color space that is larger than, and wholly encloses, the narrow-gamut color space. The novel representational scheme described herein retains narrow-gamut rendering for those colors in a wide-gamut image that are within the targeted narrow-gamut color space, transitioning to wide-gamut rendering for those colors in the wide-gamut image that are outside the targeted narrow-gamut color space. This approach avoids or minimizes pixel clipping when rendering a wide-gamut image for a narrow-gamut display, while allowing the original wide-gamut pixel values to be recovered when rendering for a wide-gamut display.
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the disclosed concepts. As part of this description, some of this disclosure's drawings represent structures and devices in block diagram form in order to avoid obscuring the novel aspects of the disclosed concepts. In the interest of clarity, not all features of an actual implementation are described. Moreover, the language used in this disclosure has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter, resort to the claims being necessary to determine such inventive subject matter. Reference in this disclosure to “one embodiment” or to “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the disclosed subject matter, and multiple references to “one embodiment” or “an embodiment” should not be understood as necessarily all referring to the same embodiment.
It will be appreciated that in the development of any actual implementation (as in any software and/or hardware development project), numerous decisions must be made to achieve the developers' specific goals (e.g., compliance with system- and business-related constraints), and that these goals may vary from one implementation to another. It will also be appreciated that such development efforts might be complex and time-consuming, but would nevertheless be a routine undertaking for those of ordinary skill in the design and implementation of graphics processing systems having the benefit of this disclosure.
Due to the nature of commonly seen object surface reflectance properties and natural illuminants, most colors in images captured using consumer-grade cameras fall into the sRGB color gamut. Notwithstanding this fact, highly saturated colors such as some flowers, car paint and colorful fabrics, especially when captured under bright illumination, may be outside the sRGB gamut. As a consequence, their color values can be clipped when rendered for an sRGB display. In light of this recognition, the remainder of this disclosure will assume the target narrow-gamut color space is the sRGB color space. This selection, while providing a solution to a current technological problem (the display of wide-gamut images on narrow-gamut display devices), should not be considered limiting. As noted above, the dual-target formulation disclosed here is generally applicable when expanding from any gamut to a wider gamut.
To begin, let A1 represent a color matrix that transforms color values in a wide-gamut color space S0 (e.g., P3) to a smaller-gamut color space S1 (e.g., sRGB). Let:
At={(1−k)A1+kI} EQ. 1
be the color matrix that transforms values from S0 to an image-specific wide-gamut color space St given the image's gamut size k. As used herein, the phrase “image-specific gamut” means a gamut just large enough to include the colors of a specific wide-gamut image as characterized by the image's gamut size metric k (described below). For a pixel with S0 color values [r0, g0, b0], its S1 color values may be given as:
[r1,g1,b1]=(A1[r0,g0,b0]t). EQ. 2
and its St color values as:
[rt,gt,bt]=(At[r0,g0,b0]t). EQ. 3
For colors that are not likely to be out of the S1 gamut, it has been found desirable to keep their S1 color values as much as possible. For color values closer to the S1-S0 gamut boundary (i.e., close to “entering” the S0 gamut), it may be desirable to gradually transition to a St rendering to preserve the color information.
Let ‘y’ represent a weight value that roughly represents the likelihood a pixel having the S0 color [r0, g0, b0] might be clipped if converted to an S1 value. This may be represented formally as:
y=F[r0,g0,b0] EQ. 4
Dual-target rendering [rx, gx, bx] for a pixel having S0 color values [r0, g0, b0] may then be found as a y-weighted combination of S0 and St rendering as follows:
[rx,gx,bx]t=(1−y)A1[r0,g0,b0]t+yAt[r0,g0,b0]t EQ. 5A
=[(1−y)A1+yAt][r0,g0,b0]t EQ. 5B
The function ‘F’ to determine y for a given pixel may take any of a number of different forms. It is not critical to use any particular form as long as it results in a value [rx, gx, bx] that is perceptually close to A1[r0, g0, b0] for most pixels. In one embodiment, F may be specified as a function of a pixel's maximum RGB value ‘m’:
y=F(m) EQ. 6A
and
m=max(r0,g0,b0). EQ. 6B
When F takes on the form shown in
From EQS. 5A and 5B, one may see that no matter how y is determined there will be a trade-off between how similar [rx, gx, bx] can me made equal to A1[r0, g0, b0], and how well [r0, g0, b0] may be recovered from [rx, gx, bx]. The more similar [rx, gx, bx] is to A1[r0, g0, b0], the more likely the values will be clipped thus making it harder to recover [r0, g0, b0]. In addition, the transition to wide-gamut rendering at highlights will inevitably desaturate these highlight colors to some degree, making them less colorful than their S1 rendering if viewed on target narrow-gamut display systems (e.g., sRGB display systems). In response to this problem, a slight darkening of highlight colors at the S1 to St transition may be made to reduce the perceived desaturation and to further minimize the clipping of [rx, gx, bx]:
where ‘d’ represents a darkening factor with (d=0) corresponding to maximum darkening and (d=1) corresponding to no darkening, and ‘m2’ represents the point where the function y=F(m) starts to rise above 0 (e.g.,
Referring to
Based, at least in part, on the gamut size metric's value, a forward transformation may be determined that maps values in the wide-gamut image's color space to the target narrow-gamut color space (block 220). In one embodiment sets of forward look-up tables may be generated (one set for each value of k; each set having a table for each primary R, G and B) that map wide-gamut pixel values [r0, g0, b0] to dual-target gamut pixel values [rx, gx, bx], and which, when applied to wide-gamut image 205 (block 225), generates dual-target image (DTI) data portion 230 of DTI 235. Recovery transform (e.g., reverse look-up tables), that map DTI pixel values [rx, gx, bx] to wide-gamut pixel values [r0, g0, b0] may also be found (block 240). Forward transform in accordance with block 220 and recovery or reverse transform in accordance with block 240 may be generated in a variety of ways, the simplest of which may be to generate the mapping from [r0, g0, b0] to [rx, gx, bx] for densely sampled values of [r0, g0, b0] to generate forward lookup tables using equation 7, and then inverting those tables to get recovery or reverse lookup tables. In another embodiment forward transform determination 215 may initially find color matrices At (EQ. 1) and A1 as described above. Color matrices At and A1 may then be combined to generate wide-gamut-to-DTI gamut color correction matrices (CCMs) or forward lookup tables in accordance with block 220. By way of example, matrix At may be blended with matrix A1 based on per-pixel luma values to create luma-adaptive color lookup tables. Forward lookup tables may be inverted to generate recovery transform or reverse lookup tables (block 240).
There may be cases where the generated [rx, gx, bx] values have to be clipped to range. In such cases the recovery from [rx, gx, bx] values to [r0, g0, b0] values will not be fully accurate. The trade-off between recovery accuracy and color desaturation or darkening in the dual-target rendering is a matter of tuning. Reverse transform in accordance with block 240 may be pre-calculated for different k values and stored for use at capture time. In one embodiment, pre-calculated tables whose k value most closely matches the wide-gamut image's k value may be selected as the recovery transform. In another embodiment, pre-calculated tables whose k values are closest, but greater than and less than the wide-gamut image's k value, may be combined to generate recovery transform (e.g., through interpolation). For example, the combination may be based on a weighted sum of the two pre-calculated transforms where the difference between the pre-calculated transform's k value and the wide-gamut image's k value determines the weighting factor. In general, any means of selecting a reverse transform that permits the wide-gamut image to be restored from the DTI, within an acceptable implementation-specific error, may be used.
Since DTIs may be shared across multiple systems with varying color management capabilities, recovery operations in accordance with this disclosure may be supported by a widely used standard. ICC color profiles support the use of color lookup tables to convert input image RGB values to a device-independent color space; the profile connection space (PCS). One illustrative PCS is the CIE XYZ color space. Another illustrative PCS is the CIE LAB color space. Accordingly, ICC profiles may be used to store the metadata needed to convert DTI RGB values (expressed as [rx, gx, bx] tuples) to wide-gamut display-specific values (expressed as [r0, g0, b0] tuples). Wide-gamut-to-PCS transform tables may be obtained in accordance with the ICC standard (block 245) and applied to the recovery transform determined in accordance with block 240 (block 250) to generate metadata portion 255 of DTI 235.
While not shown explicitly in
In the embodiments described above, both forward and backward transforms are provided by CLUTs. Generation of forward CLUTs are not, however, constrained to the luma-weighted blending scheme outlined earlier. By way of example, the forward CLUTs may be generated using a direct minimization of color errors (e.g., CIE DeltaE color differences) between the DTI and the original image in both the narrow-gamut rendering and the recovered wide-gamut rendering. As such, there is a great deal of freedom in how these tables may be generated to minimize loss of wide gamut information (S0) and to minimize distortion of the sRGB (S1) rendering when compressing the wide-gamut colors into the sRGB range. In one embodiment, these table calculations may be made offline and can be continuously improved if considered necessary.
Referring to
Referring to
Lens assembly 405 may include a single lens or multiple lens, filters, and a physical housing unit (e.g., a barrel). One function of lens assembly 405 is to focus light from scene 415 onto image sensor 410. Image sensor 410 may, for example, be a CCD (charge-coupled device) or CMOS (complementary metal-oxide semiconductor) imager. There may be more than one lens assembly and more than one image sensor. There could also be multiple lens assemblies each focusing light onto a single image sensor (at the same or different times) or different portions of a single image sensor. IPP 420 may process image sensor output (e.g., RAW image data) to yield wide-gamut image 205. More specifically, IPP 420 may perform a number of different tasks one of which can be the conversion of a RAW image into an image represented in a linear color space (e.g., the P3, ROMM or sRGB color spaces). Other operations IPP 420 may perform include, but need not be limited to, black level removal, de-noising, lens shading correction, white balance adjustment, demosaic operations, and the application of local or global tone curves or maps. IPP 420 may comprise a custom designed integrated circuit, a programmable gate-array, a central processing unit, a graphical processing unit, memory, or a combination of these elements (including more than one of any given element). Some functions provided by IPP 420 may be implemented at least in part via software (including firmware). Display element 425 may be used to display text and graphic output as will as receiving user input via user interface 430. For example, display element 425 may be a touch-sensitive display screen. User interface 430 can also take a variety of other forms such as a button, keypad, dial, a click wheel, and keyboard. Processor 435 may be a system-on-chip (SOC) such as those found in mobile devices and include one or more dedicated graphics processing units (GPUs). Processor 435 may be based on reduced instruction-set computer (RISC) or complex instruction-set computer (CISC) architectures or any other suitable architecture and may include one or more processing cores. Graphics hardware 440 may be special purpose computational hardware for processing graphics and/or assisting processor 435 perform computational tasks. In one embodiment, graphics hardware 440 may include one or more programmable GPUs each of which may have one or more cores. Audio circuit 445 may include one or more microphones, one or more speakers and one or more audio codecs. Image processing circuit 450 may aid in the capture of still and video images from image sensor 410 and include at least one video codec. Image processing circuit 450 may work in concert with IPP 420, processor 435 and/or graphics hardware 440. Images, once captured, may be stored in memory 455 and/or storage 460. Memory 455 may include one or more different types of media used by IPP 420, processor 435, graphics hardware 440, audio circuit 445, and image processing circuitry 450 to perform device functions. For example, memory 455 may include memory cache, read-only memory (ROM), and/or random access memory (RAM). Storage 460 may store media (e.g., audio, image and video files), computer program instructions or software, preference information, device profile information, and any other suitable data. Storage 460 may include one more non-transitory storage mediums including, for example, magnetic disks (fixed, floppy, and removable) and tape, optical media such as CD-ROMs and digital video disks (DVDs), and semiconductor memory devices such as Electrically Programmable Read-Only Memory (EPROM), and Electrically Erasable Programmable Read-Only Memory (EEPROM). Device sensors 465 may include, for example, proximity sensor/ambient light sensor, accelerometer and/or gyroscopes. Communication interface 470 may be used to connect device 400 to one or more networks. Illustrative networks include, but are not limited to, a local network such as a USB network, an organization's local area network, and a wide area network such as the Internet. Communication interface 470 may use any suitable technology (e.g., wired or wireless) and protocol (e.g., Transmission Control Protocol (TCP), Internet Protocol (IP), User Datagram Protocol (UDP), Internet Control Message Protocol (ICMP), Hypertext Transfer Protocol (HTTP), Post Office Protocol (POP), File Transfer Protocol (FTP), and Internet Message Access Protocol (IMAP)). Communication link 475 may be a continuous or discontinuous communication path and may be implemented, for example, as a bus, a switched interconnect, or a combination of these technologies.
Referring to
It is to be understood that the above description is intended to be illustrative, and not restrictive. The material has been presented to enable any person skilled in the art to make and use the disclosed subject matter as claimed and is provided in the context of particular embodiments, variations of which will be readily apparent to those skilled in the art (e.g., some of the disclosed embodiments may be used in combination with each other). For example,
Number | Name | Date | Kind |
---|---|---|---|
5373327 | McGee | Dec 1994 | A |
9472162 | Stauder | Oct 2016 | B2 |
9640140 | Shibuya | May 2017 | B2 |
20130194321 | Wan | Aug 2013 | A1 |
20130335439 | Jeong | Dec 2013 | A1 |
20170048561 | Oh | Feb 2017 | A1 |
Number | Date | Country | |
---|---|---|---|
20170098428 A1 | Apr 2017 | US |
Number | Date | Country | |
---|---|---|---|
62237630 | Oct 2015 | US |