The present invention relates to color systems, and more specifically to a wide gamut color system with an increased number of primary colors.
It is generally known in the prior art to provide for an increased color gamut system within a display.
Prior art patent documents include the following:
U.S. Pat. No. 10,222,263 for RGB value calculation device by inventor Yasuyuki Shigezane, filed Feb. 6, 2017 and issued Mar. 5, 2019, is directed to a microcomputer that equally divides the circumference of an RGB circle into 6×n (n is an integer of 1 or more) parts, and calculates an RGB value of each divided color. (255, 0, 0) is stored as a reference RGB value of a reference color in a ROM in the microcomputer. The microcomputer converts the reference RGB value depending on an angular difference of the RGB circle between a designated color whose RGB value is to be found and the reference color, and assumes the converted RGB value as an RGB value of the designated color.
U.S. Pat. No. 9,373,305 for Semiconductor device, image processing system and program by inventor Hiorfumi Kawaguchi, filed May 29, 2015 and issued Jun. 21, 2016, is directed to an image process device including a display panel operable to provide an input interface for receiving an input of an adjustment value of at least a part of color attributes of each vertex of n axes (n is an integer equal to or greater than 3) serving as adjustment axes in an RGB color space, and an adjustment data generation unit operable to calculate the degree of influence indicative of a following index of each of the n-axis vertices, for each of the n axes, on a basis of distance between each of the n-axis vertices and a target point which is an arbitrary lattice point in the RGB color space, and operable to calculate adjusted coordinates of the target point in the RGB color space.
U. S. Publication No. 20130278993 for Color-mixing bi-primary color systems for displays by inventors Heikenfeld, et al., filed Sep. 1, 2011 and published Oct. 24, 2013, is directed to a display pixel. The pixel includes first and second substrates arranged to define a channel. A fluid is located within the channel and includes a first colorant and a second colorant. The first colorant has a first charge and a color. The second colorant has a second charge that is opposite in polarity to the first charge and a color that is complimentary to the color of the first colorant. A first electrode, with a voltage source, is operably coupled to the fluid and configured to moving one or both of the first and second colorants within the fluid and alter at least one spectral property of the pixel.
U.S. Pat. No. 8,599,226 for Device and method of data conversion for wide gamut displays by inventors Ben-Chorin, et al., filed Feb. 13, 2012 and issued Dec. 3, 2013, is directed to a method and system for converting color image data from a, for example, three-dimensional color space format to a format usable by an n-primary display, wherein n is greater than or equal to 3. The system may define a two-dimensional sub-space having a plurality of two-dimensional positions, each position representing a set of n primary color values and a third, scaleable coordinate value for generating an n-primary display input signal. Furthermore, the system may receive a three-dimensional color space input signal including out-of range pixel data not reproducible by a three-primary additive display, and may convert the data to side gamut color image pixel data suitable for driving the wide gamut color display.
U.S. Pat. No. 8,081,835 for Multiprimary color sub-pixel rendering with metameric filtering by inventors Elliott, et al., filed Jul. 13, 2010 and issued Dec. 20, 2011, is directed to systems and methods of rendering image data to multiprimary displays that adjusts image data across metamers as herein disclosed. The metamer filtering may be based upon input image content and may optimize sub-pixel values to improve image rendering accuracy or perception. The optimizations may be made according to many possible desired effects. One embodiment comprises a display system comprising: a display, said display capable of selecting from a set of image data values, said set comprising at least one metamer; an input image data unit; a spatial frequency detection unit, said spatial frequency detection unit extracting a spatial frequency characteristic from said input image data; and a selection unit, said unit selecting image data from said metamer according to said spatial frequency characteristic.
U.S. Pat. No. 7,916,939 for High brightness wide gamut display by inventors Roth, et al., filed Nov. 30, 2009 and issued Mar. 29, 2011, is directed to a device to produce a color image, the device including a color filtering arrangement to produce at least four colors, each color produced by a filter on a color filtering mechanism having a relative segment size, wherein the relative segment sizes of at least two of the primary colors differ.
U.S. Pat. No. 6,769,772 for Six color display apparatus having increased color gamut by inventors Roddy, et al., filed Oct. 11, 2002 and issued Aug. 3, 2004, is directed to a display system for digital color images using six color light sources or two or more multicolor LED arrays or OLEDs to provide an expanded color gamut. Apparatus uses two or more spatial light modulators, which may be cycled between two or more color light sources or LED arrays to provide a six-color display output. Pairing of modulated colors using relative luminance helps to minimize flicker effects.
U.S. Pat. No. 9,035,969 for Method for multiple projector display using a GPU frame buffer by inventors Ivashin, et al., filed Nov. 29, 2012 and issued May 19, 2015, is directed to a primary image transformed into secondary images for projection, via first and second frame buffers and view projection matrixes. To do so, a first image is loaded into the first frame buffer. A calibration data set, including the view projection matrixes, is loaded into an application. The matrixes are operable to divide and transform a primary image into secondary images that can be projected in an overlapping manner onto a projection screen, providing a corrected reconstruction of the primary image. The first image is rendered from the first frame buffer into the second images, by using the application to apply the calibration data set. The second images are loaded into a second frame buffer, which can be coupled to the video projectors.
U.S. Pat. No. 9,307,616 for Method, system and apparatus for dynamically monitoring and calibrating display tiles by inventors Robinson, et al., filed May 15, 2015 and issued Apr. 5, 2016, is directed to a method, system and apparatus for dynamically monitoring and calibrating display tiles. The apparatus comprises: an array of light emitting devices; one or more light emitting devices paired with light emitting devices of the array; one or more sensors configured to detect an optical characteristic and/or an electrical characteristic of the one or more paired light emitting devices; and, circuitry configured to: drive the array; drive each of the one or more further light emitting devices under same conditions as light emitting devices of the array; temporarily drive each of the one or more paired light emitting devices under different conditions from the array; and, adjust driving of the array based on the optical characteristic and/or electrical characteristic of the one or more paired light emitting devices detected at sensor(s) when the one or more paired light emitting devices are driven under the different conditions.
U.S. Pat. No. 8,911,291 for Display system and display method for video wall by inventor Liu, filed Nov. 26, 2012 and issued Dec. 16, 2014, is directed to a display system and a display method for video walls. The display system includes at least one server and a plurality of player devices. Each server renders an image and transmits the image to a network. The player devices are coupled to the at least one server through the network. Each player device receives the image or a part of the image rendered by one of the at least one server, and determines a synchronization time together with at least one of the other player devices. Each player device uses a display of a video wall to simultaneously display the image or the part of the image at the synchronization time.
U.S. Pat. No. 10,079,963 for Display method and display system for video wall by inventors Liu, et al., filed May 12, 2017 and issued Sep. 19, 2018, is directed to a display method and a display system for a video wall. The method is applicable to a display system having a server and multiple player devices. Each of the player devices is connected to the server and a video wall having multiple displays, and each of the player devices corresponds to a different one of the displays and a different one of regions in a video stream. The method includes to receive the video stream from the server by each of the player devices, to send a broadcast command by a master player device among the player devices to other player devices, and to start displaying the corresponding region in a first frame of the video stream on the corresponding display of the video wall by each of the player devices after a preset delay time interval according to the broadcast command.
U.S. Pat. No. 7,535,433 for Dynamic multiple display configuration by inventors Ledebohm, et al., filed May 18, 2006 and issued May 19, 2009, is directed to a system and method for modifying the configuration of one or more graphics adapters and one or more displays without rebooting the system allows a user to quickly transition between different graphics adapter/display configurations. A single display driver interfaces between the operating system and the one or more graphics devices. The display driver reconfigures the one or more graphics devices to change the adapter/display configuration without shutting down or rebooting the system. Unlike a conventional system reboot performed by the operating system, the display driver checks that there are no memory leaks or error conditions during the reconfiguration.
U.S. Pat. No. 10,162,590 for Video wall system and method of making and using same, by inventor Ritter, filed May 4, 2015 and issued Dec. 25, 2018, is directed to a hub which in turn is made of a housing, at least one video input port, at least two video output ports, a digital card enabling communication between a computer and at least one display without a direct physical connection and a processor. The hub is used to make a video wall.
U.S. Pat. No. 9,911,176 for System and method of processing images into sub-image portions for output to a plurality of displays such as a network video wall by inventors Griffin, et al., filed Jan. 12, 2015 and issued Mar. 6, 2018, is directed to a system for improving the flexibility and performance of video walls including a method for using a primary GPU for initial rendering to a GPU frame buffer, copying of this frame buffer to system memory for processing into multiple sub-frames then outputting the sub-frames via multiple secondary graphics controllers. This system enables the video wall server to leverage performance advantages afforded by GPU acceleration and maintaining performance while providing full flexibility of the CPU and system memory to apply the required transformations to the sub-images as well as flexibility in the selection of secondary graphics controllers (including network graphics approaches where the graphics controller is connected over a network) for outputting the multiple sub-images to a plurality of displays. This has applications generally in the field of real-time multiple display graphics processing as well as specific applications in the field of video walls and network video walls. A method and computer readable medium also operate in accordance with the system.
U.S. Pat. No. 10,185,533 for Video wall control system and method by inventors Kim, et al., filed Sep. 24, 2014 and issued Jan. 22, 2019, is directed to a video wall control system for controlling a video wall including a plurality of screens, the video wall control system including: at least one client module controlling the layout of the video wall; a central control module acquiring camera unique identification (UID) and a video stream from a monitoring system, storing the camera UID and the video stream, and controlling the layout of the video wall; a storage module storing the modified video wall layout; a gateway module receiving a layout modification event from the client module or the central control module and load the modified video wall layout from the storage module; and a decoding module loading the camera UID and the video stream from the central control module, receiving the modified video wall layout from the gateway module, and modifying the layout of the video wall based on the received modified video wall layout.
It is an object of this invention to provide an enhancement to the current RGB systems or a replacement for them.
In one embodiment, the present invention includes a system for displaying image data including at least one graphics processing unit (GPU), a display engine, at least one display controller, and a plurality of display devices, wherein the image data includes a luminance and two colorimetric coordinates, and wherein the two colorimetric coordinates are independent from the luminance, wherein the at least one GPU is operable to render the image data for display on the plurality of display devices, thereby creating rendered image data, wherein the rendered image data is transmitted to the display engine, wherein the display engine is operable to apply at least one non-linear transfer function to the luminance, thereby creating a luma, wherein the rendered image data is transmitted to the at least one display controller, wherein the at least one display controller is operable to scale the rendered image data for display on the plurality of display devices, thereby creating image display data, wherein the at least one display controller is operable to transmit the image display data to each of the plurality of display devices, and wherein the plurality of display devices is operable to display the image display data.
In another embodiment, the present invention includes a system for displaying image data including at least one graphics processing unit (GPU), a display engine, at least one display controller, and a plurality of display devices, wherein the image data includes a luminance and two colorimetric coordinates, and wherein the two colorimetric coordinates are independent from the luminance, wherein the at least one GPU is operable to render the image data for display on the plurality of display devices, thereby creating rendered image data, wherein the rendered image data is transmitted to the display engine, wherein the display engine is operable to apply at least one non-linear transfer function to the luminance, thereby creating a luma, wherein the rendered image data is transmitted to the at least one display controller, wherein the at least one display controller is operable to scale the rendered image data for display on the plurality of display devices, thereby creating image display data, wherein the at least one display controller is operable to transmit an image display signal to each of the plurality of display devices, wherein the image display signal includes a portion of the image display data, and wherein the plurality of display devices is operable to display the image display data.
In yet another embodiment, the present invention includes a system for displaying image data including at least one graphics processing unit (GPU), at least one display engine, at least one display controller, and a plurality of display devices, wherein the image data includes a luminance and two colorimetric coordinates, and wherein the two colorimetric coordinates are independent from the luminance, wherein the at least one GPU is operable to render the image data for display on the plurality of display devices, thereby creating rendered image data, wherein the rendered image data is transmitted to the display engine, wherein the display engine is operable to apply at least one non-linear transfer function to the luminance, thereby creating a luma, wherein the rendered image data is transmitted to the at least one display controller, wherein the at least one display controller is operable to scale the rendered image data for display on the plurality of display devices, thereby creating image display data, wherein the at least one display controller is operable to transmit an image display signal to each of the plurality of display devices, wherein the image display signal includes a portion of the image display data, wherein the plurality of display devices is operable to display the image display data, and wherein the image display data includes a plurality of images.
These and other aspects of the present invention will become apparent to those skilled in the art after a reading of the following description of the preferred embodiment when considered with the drawings, as they support the claimed invention.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
The present invention is generally directed to a multi-primary color system.
In one embodiment, the present invention includes a system for displaying image data including at least one graphics processing unit (GPU), a display engine, at least one display controller, and a plurality of display devices, wherein the image data includes a luminance and two colorimetric coordinates, and wherein the two colorimetric coordinates are independent from the luminance, wherein the at least one GPU is operable to render the image data for display on the plurality of display devices, thereby creating rendered image data, wherein the rendered image data is transmitted to the display engine, wherein the display engine is operable to apply at least one non-linear transfer function to the luminance, thereby creating a luma, wherein the rendered image data is transmitted to the at least one display controller, wherein the at least one display controller is operable to scale the rendered image data for display on the plurality of display devices, thereby creating image display data, wherein the at least one display controller is operable to transmit the image display data to each of the plurality of display devices, and wherein the plurality of display devices is operable to display the image display data.
In another embodiment, the present invention includes a system for displaying image data including at least one graphics processing unit (GPU), a display engine, at least one display controller, and a plurality of display devices, wherein the image data includes a luminance and two colorimetric coordinates, and wherein the two colorimetric coordinates are independent from the luminance, wherein the at least one GPU is operable to render the image data for display on the plurality of display devices, thereby creating rendered image data, wherein the rendered image data is transmitted to the display engine, wherein the display engine is operable to apply at least one non-linear transfer function to the luminance, thereby creating a luma, wherein the rendered image data is transmitted to the at least one display controller, wherein the at least one display controller is operable to scale the rendered image data for display on the plurality of display devices, thereby creating image display data, wherein the at least one display controller is operable to transmit an image display signal to each of the plurality of display devices, wherein the image display signal includes a portion of the image display data, and wherein the plurality of display devices is operable to display the image display data.
In yet another embodiment, the present invention includes a system for displaying image data including at least one graphics processing unit (GPU), at least one display engine, at least one display controller, and a plurality of display devices, wherein the image data includes a luminance and two colorimetric coordinates, and wherein the two colorimetric coordinates are independent from the luminance, wherein the at least one GPU is operable to render the image data for display on the plurality of display devices, thereby creating rendered image data, wherein the rendered image data is transmitted to the display engine, wherein the display engine is operable to apply at least one non-linear transfer function to the luminance, thereby creating a luma, wherein the rendered image data is transmitted to the at least one display controller, wherein the at least one display controller is operable to scale the rendered image data for display on the plurality of display devices, thereby creating image display data, wherein the at least one display controller is operable to transmit an image display signal to each of the plurality of display devices, wherein the image display signal includes a portion of the image display data, wherein the plurality of display devices is operable to display the image display data, and wherein the image display data includes a plurality of images.
The present invention relates to color systems. A multitude of color systems are known, but they continue to suffer numerous issues. As imaging technology is moving forward, there has been a significant interest in expanding the range of colors that are replicated on electronic displays. Enhancements to the television system have expanded from the early CCIR 601 standard to ITU-R BT.709-6, to SMPTE RP431-2, and ITU-R BT.2020. Each one has increased the gamut of visible colors by expanding the distance from the reference white point to the position of the Red (R), Green (G), and Blue (B) color primaries (collectively known as “RGB”) in chromaticity space. While this approach works, it has several disadvantages. When implemented in content presentation, issues arise due to the technical methods used to expand the gamut of colors seen (typically using a more-narrow emissive spectrum) can result in increased viewer metameric errors and require increased power due to lower illumination source. These issues increase both capital and operational costs.
With the current available technologies, displays are limited in respect to their range of color and light output. There are many misconceptions regarding how viewers interpret the display output technically versus real-world sensations viewed with the human eye. The reason we see more than just the three emitting primary colors is because the eye combines the spectral wavelengths incident on it into the three bands. Humans interpret the radiant energy (spectrum and amplitude) from a display and process it so that an individual color is perceived. The display does not emit a color or a specific wavelength that directly relates to the sensation of color. It simply radiates energy at the same spectrum which humans sense as light and color. It is the observer who interprets this energy as color.
When the CIE 2° standard observer was established in 1931, common understanding of color sensation was that the eye used red, blue, and green cone receptors (James Maxwell & James Forbes 1855). Later with the Munsell vision model (Munsell 1915), Munsell described the vision system to include three separate components: luminance, hue, and saturation. Using RGB emitters or filters, these three primary colors are the components used to produce images on today's modern electronic displays.
There are three primary physical variables that affect sensation of color. These are the spectral distribution of radiant energy as it is absorbed into the retina, the sensitivity of the eye in relation to the intensity of light landing on the retinal pigment epithelium, and the distribution of cones within the retina. The distribution of cones (e.g., L cones, M cones, and S cones) varies considerably from person to person.
Enhancements in brightness have been accomplished through larger backlights or higher efficiency phosphors. Encoding of higher dynamic ranges is addressed using higher range, more perceptually uniform electro-optical transfer functions to support these enhancements to brightness technology, while wider color gamuts are produced by using narrow bandwidth emissions. Narrower bandwidth emitters result in the viewer experiencing higher color saturation. But there can be a disconnect between how saturation is produced and how it is controlled. What is believed to occur when changing saturation is that increasing color values of a color primary represents an increase to saturation. This is not true, as changing saturation requires the variance of a color primary spectral output as parametric. There are no variable spectrum displays available to date as the technology to do so has not been commercially developed, nor has the new infrastructure required to support this been discussed.
Instead, the method that a display changes for viewer color sensation is by changing color luminance. As data values increase, the color primary gets brighter. Changes to color saturation are accomplished by varying the brightness of all three primaries and taking advantage of the dominant color theory.
Expanding color primaries beyond RGB has been discussed before. There have been numerous designs of multi-primary displays. For example, SHARP has attempted this with their four-color QUATTRON TV systems by adding a yellow color primary and developing an algorithm to drive it. Another four primary color display was proposed by Matthew Brennesholtz which included an additional cyan primary, and a six primary display was described by Yan Xiong, Fei Deng, Shan Xu, and Sufang Gao of the School of Physics and Optoelectric Engineering at the Yangtze University Jingzhou China. In addition, AU OPTRONICS has developed a five primary display technology. SONY has also recently disclosed a camera design featuring RGBCMY (red, green, blue, cyan, magenta, and yellow) and RGBCMYW (red, green, blue cyan, magenta, yellow, and white) sensors.
Actual working displays have been shown publicly as far back as the late 1990's, including samples from Tokyo Polytechnic University, Nagoya City University, and Genoa Technologies. However, all of these systems are exclusive to their displays, and any additional color primary information is limited to the display's internal processing.
Additionally, the Visual Arts System for Archiving and Retrieval of Images (VASARI) project developed a colorimetric scanner system for direct digital imaging of paintings. The system provides more accurate coloring than conventional film, allowing it to replace film photography. Despite the project beginning in 1989, technical developments have continued.
None of the prior art discloses developing additional color primary information outside of the display. Moreover, the system driving the display is often proprietary to the demonstration. In each of these executions, nothing in the workflow is included to acquire or generate additional color primary information. The development of a multi-primary color system is not complete if the only part of the system that supports the added primaries is within the display itself.
Referring now to the drawings in general, the illustrations are for the purpose of describing one or more preferred embodiments of the invention and are not intended to limit the invention thereto.
Additional details about multi-primary systems are available in U.S. Pat. Nos. 10,950,160; 10,950,161; 10,950,162; 10,997,896; 11,011,098; 11,017,708; 11,030,934; 11,037,480; 11,037,481; 11,037,482; 11,043,157; 11,049,431; 11,062,638; 11,062,639; 11,069,279; 11,069,280; and 11,100,838 and U.S. Publication Nos. 20200251039, 20210233454, and 20210209990, each of which is incorporated herein by reference in its entirety.
Traditional displays include three primaries: red, green, and blue. The multi-primary systems of the present invention include at least four primaries. The at least four primaries preferably include at least one red primary, at least one green primary, and/or at least one blue primary. In one embodiment, the at least four primaries include a cyan primary, a magenta primary, and/or a yellow primary. In one embodiment, the at least four primaries include at least one white primary.
In one embodiment, the multi-primary system includes six primaries. In one preferred embodiment, the six primaries include a red (R) primary, a green (G) primary, a blue (B) primary, a cyan (C) primary, a magenta (M) primary, and a yellow (Y) primary, often referred to as “RGBCMY”. However, the systems and methods of the present invention are not restricted to RGBCMY, and alternative primaries are compatible with the present invention.
6P-B
6P-B is a color set that uses the same RGB values that are defined in the ITU-R BT.709-6 television standard. The gamut includes these RGB primary colors and then adds three more color primaries orthogonal to these based on the white point. The white point used in 6P-B is D65 (ISO 11664-2).
In one embodiment, the red primary has a dominant wavelength of 609 nm, the yellow primary has a dominant wavelength of 571 nm, the green primary has a dominant wavelength of 552 nm, the cyan primary has a dominant wavelength of 491 nm, and the blue primary has a dominant wavelength of 465 nm as shown in Table 1. In one embodiment, the dominant wavelength is approximately (e.g., within ±10%) the value listed in the table below. Alternatively, the dominant wavelength is within ±5% of the value listed in the table below. In yet another embodiment, the dominant wavelength is within ±2% of the value listed in the table below.
6P-C
6P-C is based on the same RGB primaries defined in SMPTE RP431-2 projection recommendation. Each gamut includes these RGB primary colors and then adds three more color primaries orthogonal to these based on the white point. The white point used in 6P-B is D65 (ISO 11664-2). Two versions of 6P-C are used. One is optimized for a D60 white point (SMPTE ST2065-1), and the other is optimized for a D65 white point. Additional information about white points is available in ISO 11664-2:2007 “Colorimetry—Part 2: CIE standard illuminants” published in 2007 and “ST 2065-1:2012—SMPTE Standard—Academy Color Encoding Specification (ACES),” in ST 2065-1:2012, pp. 1-23, 17 Apr. 2012, doi: 10.5594/SMPTE.ST2065-1.2012, each of which is incorporated herein by reference in its entirety.
In one embodiment, the red primary has a dominant wavelength of 615 nm, the yellow primary has a dominant wavelength of 570 nm, the green primary has a dominant wavelength of 545 nm, the cyan primary has a dominant wavelength of 493 nm, and the blue primary has a dominant wavelength of 465 nm as shown in Table 2. In one embodiment, the dominant wavelength is approximately (e.g., within ±10%) the value listed in the table below. Alternatively, the dominant wavelength is within ±5% of the value listed in the table below. In yet another embodiment, the dominant wavelength is within ±2% of the value listed in the table below.
In one embodiment, the red primary has a dominant wavelength of 615 nm, the yellow primary has a dominant wavelength of 570 nm, the green primary has a dominant wavelength of 545 nm, the cyan primary has a dominant wavelength of 423 nm, and the blue primary has a dominant wavelength of 465 nm as shown in Table 3. In one embodiment, the dominant wavelength is approximately (e.g., within ±10%) the value listed in the table below. Alternatively, the dominant wavelength is within ±5% of the value listed in the table below. In yet another embodiment, the dominant wavelength is within ±2% of the value listed in the table below.
Super 6P
One of the advantages of ITU-R BT.2020 is that it can include all of the Pointer colors and that increasing primary saturation in a six-color primary design could also do this. Pointer is described in “The Gamut of Real Surface Colors”, M. R. Pointer, Published in Colour Research and Application Volume #5, Issue #3 (1980), which is incorporated herein by reference in its entirety. However, extending the 6P gamut beyond SMPTE RP431-2 (“6P-C”) adds two problems. The first problem is the requirement to narrow the spectrum of the extended primaries. The second problem is the complexity of designing a backwards compatible system using color primaries that are not related to current standards. But in some cases, there is a need to extend the gamut beyond 6P-C and avoid these problems. If the goal is to encompass Pointer's data set, then it is possible to keep most of the 6P-C system and only change the cyan color primary position. In one embodiment, the cyan color primary position is located so that the gamut edge encompasses all of Pointer's data set. In another embodiment, the cyan color primary position is a location that limits maximum saturation. With 6P-C, cyan is positioned as u′=0.096, v′=0.454. In one embodiment of Super 6P, cyan is moved to u′=0.075, v′=0.430 (“Super 6 Pa” (S6 Pa)). Advantageously, this creates a new gamut that covers Pointer's data set almost in its entirety.
Table 4 is a table of values for Super 6 Pa. The definition of x,y are described in ISO 11664-3:2012/CIE S 014 Part 3, which is incorporated herein by reference in its entirety. The definition of u′,v′ are described in ISO 11664-5:2016/CIE S 014 Part 5, which is incorporated herein by reference in its entirety. defines each color primary as dominant color wavelength for RGB and complementary wavelengths CMY.
In an alternative embodiment, the saturation is expanded on the same hue angle as 6P-C as shown in
Table 5 is a table of values for Super 6Pb. The definition of x,y are described in ISO 11664-3:2012/CIE S 014 Part 3 published in 2012, which is incorporated herein by reference in its entirety. The definition of u∝,v′ are described in ISO 11664-5:2016/CIE S 014 Part 5 published in 2016, which is incorporated herein by reference in its entirety. defines each color primary as dominant color wavelength for RGB and complementary wavelengths CMY.
In a preferred embodiment, a matrix is created from XYZ values of each of the primaries. As the XYZ values of the primaries change, the matrix changes. Additional details about the matrix are described below.
Formatting and Transportation of Multi-Primary Signals
The present invention includes three different methods to format video for transport: System 1, System 2, and System 3. System 1 is comprised of an encode and decode system, which can be divided into base encoder and digitation, image data stacking, mapping into the standard data transport, readout, unstack, and finally image decoding. In one embodiment, the basic method of this system is to combine opposing color primaries within the three standard transport channels and identify them by their code value.
System 2 uses a sequential method where three color primaries are passed to the transport format as full bit level image data and inserted as normal. The three additional channels are delayed by one pixel and then placed into the transport instead of the first colors. This is useful in situations where quantizing artifacts is critical to image performance. In one embodiment, this system is comprised of the six primaries (e.g., RGB plus a method to delay the CMY colors for injection), image resolution identification to allow for pixel count synchronization, start of video identification, and RGB Delay.
System 3 utilizes a dual link method where two wires are used. In one embodiment, a first set of three channels (e.g., RGB) are sent to link A and a second set of three channels (e.g., CMY) is sent to link B. Once they arrive at the image destination, they are recombined.
To transport up to six color components (e.g., four, five, or six), System 1, System 2, or System 3 can be used as described. If four color components are used, two of the channels are set to 0. If five color components are used, one of the channels is set to 0. Advantageously, this transportation method works for all primary systems described herein that include up to six color components.
Comparison of Three Systems
Advantageously, System 1 fits within legacy SDI, CTA, and Ethernet transports. Additionally, System 1 has zero latency processing for conversion to an RGB display. However, System 1 is limited to 11-bit words.
System 2 is advantageously operable to transport 6 channels using 16-bit words with no compression. Additionally, System 2 fits within newer SDI, CTA, and Ethernet transport formats. However, System 2 requires double bit rate speed. For example, a 4K image requires a data rate for an 8K RGB image.
In comparison, System 3 is operable to transport up to 6 channels using 16-bit words with compression and at the same data required for a specific resolution. For example, a data rate for an RGB image is the same as for a 6P image using System 3. However, System 3 requires a twin cable connection within the video system.
Nomenclature
In one embodiment, a standard video nomenclature is used to better describe each system.
R describes red data as linear light (e.g., without a non-linear function applied). G describes green data as linear light. B describes blue data as linear light. C describes cyan data as linear light. M describes magenta data as linear light. Yc and/or Y describe yellow data as linear light.
R′ describes red data as non-linear light (e.g., with a non-linear function applied). G′ describes green data as non-linear light. B′ describes blue data as non-linear light. C′ describes cyan data as non-linear light. M′ describes magenta data as non-linear light. Yc′ and/or Y′ describe yellow data as non-linear light.
Y6 describes the luminance sum of RGBCMY data. YRGB describes a System 2 encode that is the linear luminance sum of the RGB data. YCMY describes a System 2 encode that is the linear luminance sum of the CMY data.
CR describes the data value of red after subtracting linear image luminance. CB describes the data value of blue after subtracting linear image luminance. CC describes the data value of cyan after subtracting linear image luminance. CY describes the data value of yellow after subtracting linear image luminance.
Y′RGB describes a System 2 encode that is the nonlinear luminance sum of the RGB data. Y′CMY describes a System 2 encode that is the nonlinear luminance sum of the CMY data. −Y describes the sum of RGB data subtracted from Y6.
C′R describes the data value of red after subtracting nonlinear image luminance. C describes the data value of blue after subtracting nonlinear image luminance. C′C describes the data value of cyan after subtracting nonlinear image luminance. C′Y describes the data value of yellow after subtracting nonlinear image luminance.
B+Y describes a System 1 encode that includes either blue or yellow data. G+M describes a System 1 encode that includes either green or magenta data. R+C describes a System 1 encode that includes either green or magenta data.
CR+CC describes a System 1 encode that includes either color difference data. CB+CY describes a System 1 encode that includes either color difference data.
4:4:4 describes full bandwidth sampling of a color in an RGB system. 4:4:4:4:4:4 describes full sampling of a color in an RGBCMY system. 4:2:2 describes an encode where a full bandwidth luminance channel (Y) is used to carry image detail and the remaining components are half sampled as a Cb Cr encode. 4:2:2:2:2 describes an encode where a full bandwidth luminance channel (Y) is used to carry image detail and the remaining components are half sampled as a Cb Cr Cy Cc encode. 4:2:0 describes a component system similar to 4:2:2, but where Cr and Cb samples alternate per line. 4:2:0:2:0 describes a component system similar to 4:2:2, but where Cr, Cb, Cy, and Cc samples alternate per line.
Constant luminance is the signal process where luminance (Y) values are calculated in linear light. Non-constant luminance is the signal process where luminance (Y) values are calculated in nonlinear light.
Deriving Color Components
When using a color difference method (4:2:2), several components need specific processing so that they can be used in lower frequency transports. These are derived as:
The ratios for Cr, Cb, Cc, and Cy are also valid in linear light calcuations.
Magenta can be calculated as follows:
System 1
In one embodiment, the multi-primary color system is compatible with legacy systems. A backwards-compatible multi-primary color system is defined by a sampling method. In one embodiment, the sampling method is 4:4:4. In one embodiment, the sampling method is 4:2:2. In another embodiment, the sampling method is 4:2:0. In one embodiment of a backwards compatible multi-primary color system, new encode and decode systems are divided into the steps of performing base encoding and digitization, image data stacking, mapping into the standard data transport, readout, unstacking, and image decoding (“System 1”). In one embodiment, System 1 combines opposing color primaries within three standard transport channels and identifies them by their code value. In one embodiment of a backwards-compatible multi-primary color system, the processes are analog processes. In another embodiment of a backwards compatible multi-primary color system, the processes are digital processes.
In one embodiment, the sampling method for a multi-primary color system is a 4:4:4 sampling method. Black and white bits are redefined. In one embodiment, putting black at midlevel within each data word allows the addition of CMY color data.
System 2
System 2A
System 2 sequences on a pixel to pixel basis. However, a quadrature method is also possible (“System 2A”) that is operable to transport six primaries in stereo or twelve primary image information. Each quadrant of the frame contains three color primary data sets. These are combined in the display. A first set of three primaries is displayed in the upper left quadrant, a second set of three primaries is displayed in the upper right quadrant, a third set of primaries is displayed in the lower left quadrant, and a fourth set of primaries is displayed in lower right quadrant. In one embodiment, the first set of three primaries, the second set of three primaries, the third set of three primaries, and the fourth set of three primaries do not contain any overlapping primaries (i.e., twelve different primaries). Alternatively, the first set of three primaries, the second set of three primaries, the third set of three primaries, and the fourth set of three primaries contain overlapping primaries (i.e., at least one primary is contained in more than one set of three primaries). In one embodiment, the first set of three primaries and the third set of three primaries contain the same primaries and the second set of three primaries and the fourth set of three primaries contain the same primaries.
Advantageously, System 2A allows for the ability to display multiple primaries (e.g., 12P and 6P) on a conventional monitor. Additionally, System 2A allows for a simplistic viewing of false color, which is useful in the production process and allows for visualizing relationships between colors. It also allows for display of multiple projectors (e.g., a first projector, a second projector, a third projector, and a fourth projector).
System 3
System 3 is simpler and more straight forward than Systems 1 and 2. The advantage with this system is that adoption is simply to format non-RGB primaries (e.g., CMY) on a second link. In one example, for an SDI design, RGB is sent on a standard SDI stream just as it is currently done. There is no modification to the transport and this link is operable to be sent to any RGB display requiring only the compensation for the luminance difference because the non-RGB (e.g., CMY) components are not included. Data for the non-RGB primaries (e.g., CMY data) is transported in the same manner as RGB data. This data is then combined in the display to make up a 6P image. The downside is that the system requires two wires to move one image. This system is operable to work with most any format including SMPTE ST292, 424, 2082, and 2110. It also is operable to work with dual High-Definition Multimedia Interface (HDMI)/CTA connections. In one embodiment, the system includes at least one transfer function (e.g., OETF, EOTF).
System 4
Color is generally defined by three component data levels (e.g., RGB, YCbCr). A serial data stream must accommodate a word for each color contributor (e.g., R, G, B). Use of more than three primaries requires accommodations to fit this data based on an RGB concept. This is why System 1, System 2, and System 3 use stacking, sequencing, and/or dual links. Multiple words are required to define a single pixel, which is inefficient because not all values are needed.
In a preferred embodiment, color is defined as a colorimetric coordinate. Thus, every color is defined by three words. Serial systems are already based on three color contributors (e.g., RGB). System 4 preferably uses XYZ or Yxy as the three color contributors. System 4 preferably uses two colorimetric coordinates and a luminance or a luma. In one embodiment, System 4 includes, but is not limited to, Yxy, L*a*b*, ICTCP, YCbCr, YUV, Yu′v′, YPbPr, YIQ, and/or XYZ. In a preferred embodiment, System 4 uses color contributors that are independent of a white point and/or a reference white value. Alternatively, System 4 uses color contributors that are not independent of a white point and/or a reference white value (e.g., YCbCr, L*a*b*). In another embodiment, System 4 uses color contributors that require at least one known primaries (e.g., ICTCP). In yet another embodiment, L*C*h or other non-rectangular coordinate systems (e.g., cylindrical, polar) are compatible with the present invention. In one embodiment, a polar system is defined from Yxy by converting x,y to a hue angle (e.g., θ=arctan(y/x)) and a magnitude vector (e.g., r) that is similar to C* in an L*C*h polar system. However, when converting Yxy to a polar system, θ is restricted from 0 to 90 degrees because x and y are always non-negative. In one embodiment, the θ angle is expanded by applying a transform (e.g., an affine transform) to x, y data wherein the x, y values of the white point of the system (e.g., D65) are subtracted from the x, y data such that the x, y data includes negative values. Thus, θ ranges from 0 to 360 degrees and the polar plot of the Yxy data is operable to occupy more than one quadrant.
XYZ has been used in cinema for over 10 years. XYZ needs 16-bit float and 32-bit float encode or a minimum of 12 bits for gamma or log encoded images for better quality. Transport of XYZ must be accomplished using a 4:4:4 sample system. Less than a 4:4:4 sample system causes loss of image detail because Y is used as a coordinate along with X and Z and carries color information, not a value. Further, X and Z are not orthogonal to Y and, therefore, also include luminance information. Advantageously, converting to Yxy or Yu′v′ concentrates the luminance in Y only, leaving two independent and pure chromaticity values. In one embodiment, X, Y, and Z are used to calculate x and y. Alternatively, X, Y, and Z are used to calculate u′ and v′.
However, if Y or an equivalent component is used as a luminance value with two independent colorimetric coordinates (e.g., x and y, u′ and v′, u and v, etc.) used to describe color, then a system using subsampling is possible because of differing visual sensitivity to color and luminance. In one embodiment, I or L* components are used instead of Y, wherein I and/or L* data are created using gamma functions. As a non-limiting example, I is created using a 0.5 gamma function, while L* is created using a ⅓ gamma function. In these embodiments, additional gamma encoding is not applied to the data as part of transport. The system is operable to use any two independent colorimetric coordinates with similar properties to x and y, u′ and v′, and/or u and v. In a preferred embodiment, the two independent colorimetric coordinates are x and y and the system is a Yxy system. In another preferred embodiment, the two colorimetric coordinates are u′ and v′ and the system is a Yu′v′ system. Advantageously, the two independent colorimetric coordinates (e.g., x and y) are independent of a white point. This reduces the complexity of the system when compared to XYZ, which includes a luminance value for all three channels (i.e., X, Y, and Z). Further, this also provides an advantage for subsampling (e.g., 4:2:2, 4:2:0 and 4:1:1). In one embodiment, other systems (e.g., ICTCP and L*a*b*) require a white point in calculations. However, a conversion matrix, e.g., using the white point of [1,1,1] is operable to be used for ICTCP and L*a*b* to remove the white point reference. The white point reference is still operable to then be recaptured as [1,1,1] in XYZ space. In a preferred embodiment, the image data includes a reference to at least one white point.
Current technology uses components derived from the legacy National Television System Committee (NTSC). Encoding described in SMPTE, International Telecommunication Union (ITU), and CTA standards includes methods using subsampling as 4:2:2, 4:2:0, and 4:1:1. Advantageously, this allows for color transportation of more than three primaries, including, but not limited to, at least four primaries, at least five primaries, at least six primaries, at least seven primaries, at least eight primaries, at least nine primaries, at least ten primaries, at least eleven primaries, and/or at least twelve primaries (e.g., through a SMPTE ST292 or an HDMI 1.2 transport).
System 1, System 2, and System 3 use a YCbCr expansion to transport six color primary data sets, and the same transport (e.g., a YCbCr expansion) is operable to accommodate the image information as Yxy where Y is the luminance information and x,y describe CIE 1931 color coordinates in the half sample segments of the data stream (e.g., 4:2:2). Alternatively, x,y are fully sampled (e.g., 4:4:4). In yet another embodiment, the sampling rate is 4:2:0 or 4:1:1. In still another embodiment, the same transport is operable to accommodate the information as luminance and colorimetric coordinates other than x,y. In one embodiment, the same transport is operable to accommodate data set using one channel of luminance data and two channels of colorimetric data. Alternatively, the same transport is operable to accommodate the image information as Yu′v′ with full sampling (e.g., 4:4:4) or partial sampling (e.g., 4:2:2, 4:2:0, 4:1:1). In one embodiment, the same transport is used with full sampling (e.g., XYZ).
Advantageously, there is no need to add more channels, nor is there any need to separate the luminance information from the color components. Further, for example, x,y have no reference to any primaries because x,y are explicit colorimetric positions. In the Yxy space, x and y are chromaticity coordinates such that x and y can be used to define a gamut of visible color. Similarly, in the Yu′v′ space, u′ and v′ are explicit colorimetric positions. It is possible to define a gamut of visible color in other formats (e.g., L*a*b*, ICTCP, YCbCr), but it is not always trivial. To determine if a color is visible in Yxy space, it must be determined if the sum of x and y is greater than or equal to zero. If not, the color is not visible. If the x,y point is within the CIE x,y locus (CIE horseshoe), the color is visible. If not, the color is not visible. The Y value plays a role especially in a display. In one embodiment, the display is operable to reproduce an x,y color within a certain range of Y values, wherein the range is a function of the primaries. Another advantage is that an image can be sent as linear data (e.g., without a non-linear function applied) with a non-linear function (e.g., opto-optical transfer function (OOTF)) added after the image is received, rather than requiring a non-linear function (e.g., OOTF) applied to the signal. This allows for a much simpler encode and decode system. In one embodiment, only Y, L*, or I are altered by a non-linear function. Alternatively, Y, L*, or I are sent linearly (e.g., without a non-linear function applied).
There are many different RGB sets so the matrix used to convert the image data from a set of RGB primaries to XYZ will involve a specific solution given the RGB values:
In an embodiment where the image data is 6P-B data, the following equation is used to convert to XYZ data:
In an embodiment where the image data is 6P-C data with a D60 white point, the following equation is used to convert to XYZ data:
In an embodiment where the image data is 6P-C data with a D65 white point, the following equation is used to convert to XYZ data:
To convert the XYZ data to Yxy data, the following equations are used:
Finally, the XYZ data must converted to the correct standard color space. In an embodiment where the color gamut used is a 6P-B color gamut, the following equations are used:
In an embodiment where the color gamut used is a 6P-C color gamut with a D60 white point, the following equations are used:
In another embodiment where the color used is a 6P-C color gamut with a D65 white point, the following equations are used:
In an embodiment where the color gamut used is an ITU-R BT709.6 color gamut, the matrices are as follows:
In an embodiment where the color gamut used is a SMPTE RP431-2 color gamut, the matrices are as follows:
In an embodiment where the color gamut used is an ITU-R BT.2020/2100 color gamut, the matrices are as follows:
To convert the Yxy data to the XYZ data, the following equations are used:
In one embodiment, the set of image data includes pixel mapping data. In one embodiment, the pixel mapping data includes a subsample of the set of values in a color space. In a preferred embodiment, the color space is a Yxy color space (e.g., 4:2:2). In one embodiment, the pixel mapping data includes an alignment of the set of values in the color space (e.g., Yxy color space, Yu′v′).
Table 6 illustrates mapping to SMPTE ST2110 for 4:2:2 sampling of Yxy data. Table 7 illustrates mapping to SMPTE ST2110 for 4:4:4 linear and non-linear sampling of Yxy data. The present invention is compatible with a plurality of data formats (e.g., Yu′v′) and not restricted to Yxy data.
Advantageously, XYZ is used as the basis of ACES for cinematographers and allows for the use of colors outside of the ITU-R BT.709 and/or the P3 color spaces, encompassing all of the CIE color space. Colorists often work in XYZ, so there is widespread familiarity with XYZ. Further, XYZ is used for other standards (e.g., JPEG 2000, Digital Cinema Initiatives (DCI)), which could be easily adapted for System 4. Additionally, most color spaces use XYZ as the basis for conversion, so the conversions between XYZ and most color spaces are well understood and documented. Many professional displays also have XYZ option as a color reference function.
In one embodiment, the image data converter includes at least one look-up table (LUT). In one embodiment, the at least one look-up table maps out-of-gamut colors to zero. In one embodiment, the at least one look-up table maps out-of-gamut colors to a periphery of visible colors. In one embodiment, an out-of-gamut color is mapped to the periphery along a straight line between the out-of-gamut color in its original location and a white point of the system (e.g., D65). In one embodiment, the luminance and/or luma value is maintained, and only the colorimetric coordinates are affected by the mapping. In one embodiment, gamma transforms and/or scaling are added after mapping. In one embodiment, the mapping is used to convert Yxy to XYZ and back. Alternatively, the mapping is used to convert Y′xy to X′Y′Z′ and back. In one embodiment, a gamma function and/or a scaling is maintained throughout the conversion. As a non-limiting example, a 2.6 gamma function is used to scale x by 0.74 and y by 0.84. Alternatively, the gamma and/or the scaling are removed after conversion.
Transfer Functions
The system design minimizes limitations to use standard transfer functions for both encode and/or decode processes. Current practices used in standards include, but are not limited to, ITU-R BT.1886, ITU-R BT.2020, SMPTE ST274, SMPTE ST296, SMPTE ST2084, and ITU-R BT.2100. These standards are compatible with this system and require no modification.
Encoding and decoding multi-primary (e.g., 6P, RGBC) images is formatted into several different configurations to adapt to image transport frequency limitations. The highest quality transport is obtained by keeping all components as multi-primary (e.g., RGBCMY) components. This uses the highest sampling frequencies and requires the most signal bandwidth. An alternate method is to sum the image details in a luminance channel at full bandwidth and then send the color difference signals at half or quarter sampling (e.g., Y Cr Cb Cc Cy). This allows a similar image to pass through lower bandwidth transports.
An IPT system is a similar idea to the Yxy system with several exceptions. An IPT system or an ICTCP system is still an extension of XYZ and is operable to be derived from RGB and multiprimary (e.g., RGBCMY, RGBC) color coordinates. An IPT color description can be substituted within a 4:4:4 sampling structure, but XYZ has already been established and does not require the same level of calculations. For an ICTCP transport system, similar substitutions can be made. However, both substitution systems are limited in that a non-linear function (e.g., OOTF) is contained in all three components. Although the non-linear function can be removed for IPT or ICTCP, the derivation is still based on a set of RGB primaries with a white point reference. In one embodiment, removing the non-linear function alters the bit depth noise and compressibility.
For transport, simple substitutions can be made using the foundation of what is described with transport of XYZ for the use of IPT in current systems as well as the current standards used for ICTCP.
Transfer functions used in systems 1, 2, and 3 are generally framed around two basic implementations. For images displaying using a standard dynamic range, the transfer functions are defined within two standards. The OETF is defined in ITU-R BT.709-6, table 1, row 1.2. The inverse function, the EOTF, is defined in ITU-R BT.1886. For high dynamic range imaging, the perceptual quantizer (PQ) and hybrid log-gamma (HLG) curves are described in ITU-R BT.2100-2: 2018, table 4.
System 4 is operable to use any of the transfer functions, which can be applied to the Y component. However, to improve compatibility and to simplify conversion between standard transfer functions, a new method has been developed: a ½ gamma function. Advantageously, the ½ gamma function allows for a single calculation from the luminance (e.g., Y) component of the signal (e.g., Yxy signal) to the display. Advantageously, the ½ gamma function is designed for data efficiency, not as an optical transform function. In one embodiment, the ½ gamma function is used instead of a nonlinear function (e.g., OETF or EOTF). In one embodiment, signal input to the ½ gamma function is assumed to be linear and constrained between values of 0 and 1. In one embodiment, the ½ gamma function is optimized for 10-bit transport and/or 12-bit transport. Alternatively, the ½ gamma function is optimized for 14-bit transport and/or 16-bit transport. In an alternative embodiment, the ½ gamma function is optimized for 8-bit transport. A typical implementation applies an inverse of the ½ gamma function, which linearizes the signal. A conversion to a display gamut is then applied.
In one embodiment, for a source n=√{square root over (L)} and for a display L=n2. In another embodiment, a display gamma is calculated as L=n2/λ, where λ is a desired final EOTF. Advantageously, using the ½ gamma function with the display gamma combines the functions into a single step rather than utilizing a two-step conversion process. In one embodiment, at least one tone curve is applied after the ½ gamma function. The ½ gamma function advantageously provides ease to convert to and from linear values. Given that all color and tone mapping has to be done in the linear domain, having a simple to implement conversion is desirable and makes the conversion to and from linear values easier and simpler.
While a ½ gamma is ideal for converting images with 16-bit (e.g., 16-bit float) values to 12-bit (e.g., 12-bit integer) values, for other data sets a ⅓ gamma provides equivalent performance in terms of peak signal-to-noise ratio (PSNR). For high dynamic range (HDR) content, which has a wider luminance dynamic range (e.g., up to 1000 cd/m2), the ⅓ gamma conversion from 16-bit float maintains the same performance as ½ gamma. In one embodiment, an equation for finding an optimum value of gamma is:
In one embodiment, the Minimum Float Value is based on the Institute of Electrical and Electronics Engineers (IEEE) Standard for Floating-Point Arithmetic (IEEE 754) (July 2019), which is incorporated herein by reference in its entirety. In one embodiment, the range of image values is normalized to between 0 and 1. The range of image values is preferably normalized to between 0 and 1 and then the gamma function is applied.
For example, for an HDR system (e.g., with a luminance dynamic range of 1000-4000 cd/m2), the above equation becomes:
Encoder and Decoder
In one embodiment, the multi-primary system includes an encoder operable to accept image data input (e.g., RAW, SDI, HDMI, DisplayPort, ethernet). In one embodiment, the image data input is from a camera, a computer, a processor, a flash memory card, a network (e.g., local area network (LAN)), or any other file storage or transfer medium operable to provide image data input. The encoder is operable to send processed image data (e.g., Yxy, XYZ, Yu′v′) to a decoder (e.g., via wired or wireless communication). The decoder is operable to send formatted image data (e.g., SDI, HDMI, Ethernet, DisplayPort, Yxy, XYZ, Yu′v′, legacy RGB, multi-primary data (e.g., RGBC, RGBCMY, etc.)) to at least one viewing device (e.g., display, monitor, projector) for display (e.g., via wired or wireless communication). In one embodiment, the decoder is operable to send formatted image data to at least two viewing devices simultaneously. In one embodiment, two or more of the at least two viewing devices use different color spaces and/or formats. In one example, the decoder sends formatted image data to a first viewing device in HDMI and a second viewing device in SDI. In another example, the decoder sends formatted image data as multi-primary (e.g., RGBCMY, RGBC) to a first viewing device and as legacy RGB (e.g., Rec. 709) to a second viewing device. In one embodiment, the Ethernet formatted image data is compatible with SMPTE ST2022. Additionally or alternatively, the Ethernet formatted image data is compatible with SMPTE ST2110 and/or any internet protocol (IP)-based transport protocol for image data.
The encoder and the decoder preferably include at least one processor. By way of example, and not limitation, the at least one processor is be a general-purpose microprocessor (e.g., a central processing unit (CPU)), a graphics processing unit (GPU), a microcontroller, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Programmable Logic Device (PLD), a controller, a state machine, gated or transistor logic, discrete hardware components, or any other suitable entity or combinations thereof that can perform calculations, process instructions for execution, and/or other manipulations of information. In one embodiment, one or more of the at least one processor is operable to run predefined programs stored in at least one memory of the encoder and/or the decoder.
The encoder and/or the decoder include hardware, firmware, and/or software. In one embodiment, the encoder and/or the decoder is operable to be inserted into third party software (e.g., via a dynamic-link library (DLL)). In one embodiment, functionality and/or features of the encoder and/or the decoder are combined for efficiency.
The at least one encoder input includes, but is not limited to, an SDI input, an HDMI input, a DisplayPort input, an ethernet input, and/or a SMPTE ST2110 input. The SDI input preferably follows a modified version of SMPTE ST352 payload identification (ID) standard. In one embodiment, the SDI input is SMPTE ST292, SMPTE ST425, and/or SMPTE ST2082. In one embodiment, a video signal from the SDI input is then sent to the encoder equalizer to compensate for cable type and length. In one embodiment, the HDMI input is decoded with a standard HDMI receiver circuit. In one embodiment, the HDMI input is converted to a parallel format. In one embodiment, the HDMI input is defined within the CTA 861 standard. In another embodiment, the at least one encoder input includes image data (e.g., RAW data) from a flash device. The configuration CPU identifies a format on the flash card and/or a file type, and has software operable to read the image data and make it available to the encoder.
In one embodiment, the encoder operations port is operable to connect to an encoder control system (e.g., via a micro universal serial bus (USB) or equivalent). In one embodiment, the encoder control system is operable to control the at least one encoder memory that holds tables for the DeBayer engine, load modifications to the linear converter and/or scaler, select the at least one input, loads a table for the at least one custom encoder LUT, bypass one or more of the at least one custom encoder LUT, bypass the DeBayer engine, add or modify conversion tables for the RGB to XYZ converter, modify the gamma function (e.g., a ½ gamma function), turn the watermark engine on or off, modify a digital watermark for the watermark engine, and/or perform functions for the flash memory player (e.g., play, stop, forward, fast forward, rewind, fast rewind, frame selection).
In one embodiment, the at least one S/P converter is up to n bit for improved processing efficiency. The at least one S/P converter preferably formats the processed image data so that the encoder and/or the decoder is operable to use parallel processing. Advantageously, parallel processing keeps processing fast and minimizes latency.
The at least one encoder formatter is operable to organize the serial stream as a proper format. In a preferred embodiment, the encoder includes a corresponding encoder formatter for each of the at least one encoder output. For example, if the encoder includes at least one HDMI output in the at least one encoder output, the encoder also includes at least one HDMI formatter in the at least one encoder formatter; if the encoder includes at least one SDI output in the at least one encoder output, the encoder also includes at least one SDI formatter in the at least one encoder formatter; if the encoder includes at least one Ethernet output in the at least one encoder output, the encoder also includes at least one Ethernet formatter in the at least one encoder formatter; and so forth.
There is an advantage of inputting a RAW camera image to take advantage of the extended dynamic range and wider color gamut versus using a standard video input. In one embodiment, the DeBayer engine is operable to convert RAW image data into a raster image. In one embodiment, the raster image is a 3-channel image (e.g., RGB). In one embodiment, the DeBayer engine is bypassed for data that is not in a RAW image format. In one embodiment, the DeBayer engine is configured to accommodate at least three primaries (e.g., 3, 4, 5, 6, 7, 8, etc.) in the Bayer or stripe pattern. To handle all of the different DeBayer options, the operations programming port is operable to load a file with code required to adapt a specific Bayer pattern. For images that are not RAW, a bypass path is provided and switched to and from using the encoder configuration CPU. In one embodiment, the encoder is operable to recognize the image data format and select the correct path automatically. Alternatively, the image data format is included in metadata.
The encoder configuration CPU is operable to recognize an input nonlinearity value and provide an inverse value to the linear converter to linearize the image data. The scaler is operable to map out of gamut values into in gamut values.
In one embodiment, the at least one custom encoder LUT is operable to transform an input (e.g., a standard from a manufacturer) to XYZ, Yxy, or Yu′v′. Examples of the input include, but are not limited to, RED Log3G10, ARRI log C, ACEScc, SONY S-Log, CANON Log, PANASONIC V Log, PANAVISION Panalog, and/or BLACK MAGIC CinemaDNG. In one embodiment, the at least one custom encoder LUT is operable to transform the input to an output according to artistic needs. In one embodiment, the encoder does not include the color channel-to-XYZ converter or the XYZ-to-Yxy converter, as this functionality is incorporated into the at least one custom encoder LUT. In one embodiment, the at least one custom encoder LUT is a 65-cube look-up table. The at least one custom encoder LUT is preferably compatible with ACES Common LUT Format (CLF)—A Common File Format for Look-Up Tables S-2014-006, which was published Jul. 22, 2021 and which is incorporated herein by reference in its entirety. In one embodiment, the at least one custom encoder LUT is a multi-column LUT. The at least one custom encoder LUT is preferably operable to be loaded through the operations programming port. If no LUT is required, the encoder configuration CPU is operable to bypass the at least one custom encoder LUT.
In one embodiment, RGB or multi-primary (e.g., RGBCMY, RGBC) data is converted into XYZ data using the color channel-to-XYZ converter. In a preferred embodiment, a white point value for the original video data (e.g., RGB, RGBCMY) is stored in one or more of the at least one encoder memory. The encoder configuration CPU is operable to provide an adaption calculation using the white point value. The XYZ-to-Yxy converter is operable to convert XYZ data to Yxy data. Advantageously, the Yxy image data is segmented into a luminance value and a set of colorimetric values, the relationship between Y and x,y is operable to be manipulated to use lower data rates. Similarly, the XYZ-to-Yu′v′ converter is operable to convert XYZ data to Yu′v′ data, and the conversion is operable to be manipulated to use lower data rates. Any system with a luminance value and a set of colorimetric values is compatible with the present invention. The configuration CPU is operable to set the sample selector to fit one or more of the at least one encoder output. In one embodiment, the sampling selector sets a sampling structure (e.g., 4:4:4, 4:2:2, 4:2:0, 4:1:1). The sampling selector is preferably controlled by the encoder configuration CPU. In a preferred embodiment, the sampling selector also places each component in the correct serial data position as shown in Table 8.
The watermark engine is operable to modify an image from an original image to include a digital watermark. In one embodiment, the digital watermark is outside of the ITU-R BT.2020 color gamut. In one embodiment, the digital watermark is compressed, collapsed, and/or mapped to an edge of the smaller color gamut such that it is not visible and/or not detectable when displayed on a viewing device with a smaller color gamut than ITU-R BT.2020. In another embodiment, the digital watermark is not visible and/or not detectable when displayed on a viewing device with an ITU-R BT.2020 color gamut. In one embodiment, the digital watermark is a watermark image (e.g., logo), alphanumeric text (e.g., unique identification code), and/or a modification of pixels. In one embodiment, the digital watermark is invisible to the naked eye. In a preferred embodiment, the digital watermark is perceptible when decoded by an algorithm. In one embodiment, the algorithm uses an encryption key to decode the digital watermark. In another embodiment, the digital watermark is visible in a non-obtrusive manner (e.g., at the bottom right of the screen). The digital watermark is preferably detectable after size compression, scaling, cropping, and/or screenshots. In yet another embodiment, the digital watermark is an imperceptible change in sound and/or video. In one embodiment, the digital watermark is a pattern (e.g., a random pattern, a fixed pattern) using a luminance difference (e.g., 1 bit luminance difference). In one embodiment, the pattern is operable to change at each frame. The digital watermark is a dynamic digital watermark and/or a static digital watermark. In one embodiment, the dynamic digital watermark works as a full frame rate or a partial frame rate (e.g., half frame rate). The watermark engine is operable to accept commands from the encoder configuration CPU.
In an alternative embodiment, the at least one encoder input already includes a digital watermark when input to the encoder. In one embodiment, a camera includes the digital watermark on an image signal that is input to the encoder as the at least one encoder input.
The at least one encoder output includes, but is not limited to SDI, HDMI, DisplayPort, and/or ethernet. In one embodiment, at least one encoder formatter formats the image data to produce the at least one encoder output. The at least one encoder formatter includes, but is not limited to, an SDI formatter, an SMPTE ST2110, and/or an HDMI formatter. The SDI formatter formats the serial video data into an SDI package as a Yxy output. The SMPTE ST2110 formatter formats the serial video data into an ethernet package as a Yxy output. The HDMI formatter formats the serial video data into an HDMI package as a Yxy output.
In one embodiment, the decoder operations port is operable to connect to a decoder control system (e.g., via a micro universal serial bus (USB) or equivalent). In one embodiment, the decoder control system is operable to select the at least one decoder input, perform functions for the flash memory player (e.g., play, stop, forward, fast forward, rewind, fast rewind, frame selection), turn watermark detection on or off, add or modify the gamma library and/or look-up table selection, add or modify the XYZ-to-RGB library and/or look-up table selection, load data to the at least one custom decoder LUT, select bypass of one or more of the custom decoder LUT, and/or modify the Ethernet SDP. The gamma library preferably takes linear data and applies at least one non-linear function to the linear data. The at least non-linear function includes, but is not limited to, at least one standard gamma (e.g., those used in standard dynamic range (SDR) and high definition range (HDR) formats) and/or at least one custom gamma.
In one embodiment, the output of the gamma library is fed to the XYZ-to-RGB library, where tables are included to map the XYZ data to a standard RGB or YCbCr output format. In another embodiment, the output of the gamma library bypasses the XYZ-to-RGB library. This bypass leaves an output of XYZ data with a gamma applied. The selection of the XYZ-to-RGB library or bypass is determined by the configuration CPU. If the output format selected is YCbCr, then the XYZ-to-RGB library flags which sampling method is desired and provides that selection to the sampling selector. The sampling selector then formats the YCbCr data to a 4:2:2, 4:2:0, or 4:1:1 sampling structure.
In one embodiment, an input to the decoder does not include full pixel sampling (e.g., 4:2:2, 4:2:0, 4:1:1). The at least one sampling converter is operable to take subsampled images and convert the subsampled images to full 4:4:4 sampling. In one embodiment, the 4:4:4 Yxy image data is then converted to XYZ using the at least one Yxy-to-XYZ converter. In another embodiment, the 4:4:4 Yu′v′ image data is then converted to XYZ using the Yu′v′ using the at least one Yu′v′-to-XYZ converter. Image data is then converted from a parallel form to a serial stream.
In one embodiment, the at least one SDI output includes more than one SDI output. Advantageously, this allows for output over multiple links (e.g., System 3). In one embodiment, the at least one SDI output includes a first SDI output and a second SDI output. In one embodiment, the first SDI output is used to transport a first set of color channel data (e.g., RGB) and the second SDI output is used to transport a second set of color channel data (e.g., CMY).
The watermark detection engine detects the digital watermark. In one embodiment, a pattern of the digital watermark is loaded to the decoder using the operations programming port. In one embodiment, the decoder configuration CPU is operable to turn the watermark detection engine on and off. The watermark subtraction engine removes the digital watermark from image data before formatting for display on the at least one viewing device. In one embodiment, the decoder configuration CPU is operable to allow bypass of the watermark subtraction engine, which will leave the digital watermark on an output image. In a preferred embodiment, the decoder requires the digital watermark in the processed image data sent from the encoder to provide the at least one decoder output. Thus, the decoder does not send color channel data to the at least one viewing device if the digital watermark is not present in the processed image data. In an alternate embodiment, the decoder is operable to provide the at least one decoder output without the digital watermark in the processed image data sent from the encoder. If the digital watermark is not present in the processed image data, an image displayed on the at least one viewing device preferably includes a visible watermark.
In one embodiment, output from the watermark subtraction process includes luminance data including a non-linearity (e.g., ½ gamma). Non-linear luminance data (i.e., luma) is converted back to a linear image using the gamma-to-linear converter.
In one embodiment, the at least one custom decoder LUT includes a 9-column LUT. In one embodiment, the 9-column LUT includes 3 columns for a legacy RGB output (e.g., Rec. 709, Rec. 2020, P3) and 6 columns for a 6P multi-primary display (e.g., RGBCMY). Other numbers of columns (e.g., 7 columns) and alternative multi-primary displays (e.g., RGBC) are compatible with the present invention. In one embodiment, the at least one custom decoder LUT (e.g., the 9-column LUT) is operable to produce output values using tetrahedral interpolation. Advantageously, tetrahedral interpolation uses a smaller volume of color space to determine the output values, resulting in more accurate color channel data. In one embodiment, each of the tetrahedrons used in the tetrahedral interpolation includes a neutral diagonal. Advantageously, this embodiment works even with having less than 6 color channels. For example, a 4P output (e.g., RGBC) or a 5P output (e.g., RGBCY) using an FPGA is operable to be produced using tetrahedral interpolation. Further, this embodiment allows for an encoder to produce legacy RGB output in addition to multi-primary output. In an alternative embodiment, the at least one custom decoder LUT is operable to produce output value using cubic interpolation. The at least one custom decoder LUT is preferably operable to accept linear XYZ data. In one embodiment, the at least one custom decoder LUT is a multi-column LUT. The at least one custom decoder LUT is preferably operable to be loaded through the operations programming port. If no LUT is required, the decoder configuration CPU is operable to bypass the at least one custom decoder LUT.
In one embodiment, the at least one custom decoder LUT is operable to be used for streamlined HDMI transport. In one embodiment, the at least one custom decoder LUT is a 3D LUT. In one embodiment, the at least one custom decoder LUT is operable to take in a 3-column input (e.g., RGB, XYZ) and produce an output of greater than three columns (e.g., RGBC, RGBCY, RGBCMY). Advantageously, this system only requires 3 channels of data as the input to the at least one custom decoder LUT. In one embodiment, the at least one custom decoder LUT applies a gamma function and/or a curve to produce a linear output. In another embodiment, the at least one custom decoder LUT is a trimming LUT.
The at least one decoder formatter is operable to organize a serial stream as a proper format for the at least one output. In a preferred embodiment, the decoder includes a corresponding decoder formatter for each of the at least one decoder output. For example, if the decoder includes at least one HDMI output in the at least one decoder output, the decoder also includes at least one HDMI formatter in the at least one decoder formatter; if the decoder includes at least one SDI output in the at least one decoder output, the decoder also includes at least one SDI formatter in the at least one decoder formatter; if the decoder includes at least one Ethernet output in the at least one decoder output, the decoder also includes at least one Ethernet formatter in the at least one decoder formatter; and so forth.
The encoder and/or the decoder are operable to generate, insert, and/or recover metadata related to an image signal. The metadata includes, but is not limited to, a color space (e.g., 6P-B, 6P-C), an image transfer function (e.g., gamma, PQ, HLG, ½ gamma), a peak white value, and/or a signal format (e.g., RGB, Yxy, multi-primary (e.g., RGBCMY, RGBC)). In one embodiment, the metadata is inserted into SDI or ST2110 using ancillary (ANC) data packets. In another embodiment, the metadata is inserted using Vendor Specific InfoFrame (VSIF) data as part of the CTA 861 standard. In one embodiment, the metadata is compatible with SMPTE ST 2110-10:2017, SMPTE ST 2110-20:2017, SMPTE ST 2110-40:2018, SMPTE ST 352:2013, and/or SMPTE ST 352:2011, each of which is incorporated herein by reference in its entirety.
Additional details about the multi-primary system and the display are included in U.S. application Ser. Nos. 17/180,441 and 17/209,959, and U.S. Patent Publication Nos. 20210027693, 20210020094, 20210035487, and 20210043127, each of which is incorporated herein by reference in its entirety.
Display Engine
In one embodiment, the present invention provides a display engine operable to interact with a graphics processing unit (GPU) and provide Yxy, XYZ, YUV, Yu′v′, RGB, YCrCb, and/or ICTCP configured outputs. In one embodiment, the display engine and the GPU are on a video card. Alternatively, the display engine and the GPU are embedded on a motherboard or a central processing unit (CPU) die. The display engine and the GPU are preferably included in and/or connected to at least one viewing device (e.g., display, video game console, smartphone, etc.). Additional information related to GPUs are disclosed in U.S. Pat. Nos. 9,098,323; 9,235,512; 9,263,000; 9,318,073; 9,442,706; 9,477,437; 9,494,994; 9,535,815; 9,740,611; 9,779,473; 9,805,440; 9,880,851; 9,971,959; 9,978,343; 10,032,244; 10,043,232; 10,114,446; 10,185,386; 10,229,471; 10,324,693; 10,331,590; 10,460,417; 10,515,611; 10,521,874; 10,559,057; 10,593,011; 10,600,141; 10,628,909; 10,705,846; 10,713,059; 10,769,746; 10,839,476; 10,867,362; 10,922,779; 10,923,082; 10,963,299; and 10,970,805 and U.S. Patent Publication Nos. 20140270364, 20150145871, 20160180487, 20160350245, 20170178275, 20170371694, 20180121386, 20180314932, 20190034316, 20190213706, 20200098082, 20200183734, 20200279348, 20200294183, 20200301708, 20200310522, 20200379864, and 20210049030, each of which is incorporated herein by reference in its entirety.
In one embodiment, the GPU includes a render engine. In one embodiment, the render engine includes at least one render pipeline (RP), a programmable pixel shader, a programmable vector shader, a vector array processor, a curvature engine, and/or a memory cache. The render engine is operable to interact with a memory controller interface, a command CPU, a host bus (e.g., peripheral component interconnect (PCI), PCI Express (PCIe), accelerated graphics port (AGP)), and/or an adaptive full frame anti-aliasing. The memory controller interface is operable to interact with a display memory (e.g., double data rate (DDR) memory), a pixel cache, the command CPU, the host bus, and a display engine. The command CPU is operable to exchange data with the display engine.
In one embodiment, the video card includes a plurality of video cards linked together to allow scaling of graphics processing. In one embodiment, the plurality of video cards is linked with a PCIe connector. Other connectors are compatible with the plurality of video cards. In one embodiment, each of the plurality of video cards has the same technical specifications. In one embodiment, the API includes methods for scaling the graphics processing, and the command CPU is operable to distribute the graphics processing across the plurality of video cards. The command CPU is operable to scale up the graphics processing as well as scale down the graphics processing based on processing demands and/or power demands of the system.
The display engine is operable to take rendered data from the GPU and convert the rendered data to a format operable to be displayed on at least one viewing device. The display engine includes a raster scaler, at least one video display controller (e.g., XYZ video display controller, RGB video display controller, ICTCP video display controller), a color channel-to-XYZ converter, a linear converter, a scaler and/or limiter, a multi-column LUT with at least three columns (e.g., three-dimensional (3D) LUT (e.g., 1293 LUT)), an XYZ-to-Yxy converter, a non-linear function and/or tone curve applicator (e.g., ½ gamma), a sampling selector, a video bus, and/or at least one output formatter and/or encoder (e.g., ST 2082, ST 2110, DisplayPort, HDMI). In one embodiment, the color channel-to-XYZ converter includes an RGB-to-XYZ converter. Additionally or alternatively, the color channel-to-XYZ converter includes an ICTCP-to-XYZ converter and/or an ACES-to-XYZ converter. The video bus is operable to receive input from a graphics display controller and/or at least one input device (e.g., a cursor, a mouse, a joystick, a keyboard, a videogame controller, etc.).
The video card is operable to connect through any number of lanes provided by hardware on the computer. The video card is operable to communicate through a communication interface including, but not limited to, a PCIe Physical Layer (PHY) interface. In one embodiment, the communication interface is an API supported by the computer (e.g., OpenGL, Direct3D, OpenCL, Vulkan). Image data in the form of vector data or bitmap data is output from the communication interface into the command CPU. The communication interface is operable to notify the command CPU when image data is available. The command CPU opens the bus bidirectional gate and instructs the memory controller interface to transmit the image data to a double data rate (DDR) memory. The memory controller interface is operable to open a path from the DDR memory to allow the image data to pass to the GPU for rendering. After rendering, the image data is channeled back to the DDR for storage pending output processing by the display engine.
After the image data is rendered and stored in the DDR memory, the command CPU instructs the memory controller interface to allow rendered image data to load into the raster scaler. The command CPU loads the raster scaler with framing information. The framing information includes, but is not limited to, a start of file (SOF) identifier, an end of file (EOF) identifier, a pixel count, a pixel order, multi-primary data (e.g., RGBCMY data), and/or a frame rate. In one embodiment, the framing information includes HDMI and/or DisplayPort (e.g., CTA 861 format) information. In one embodiment, Extended Display Identification Data (EDID) is operable to override specifications in the API. The raster scaler provides output as image data formatted as a raster in the same format as the file which being read (e.g., RGB, XYZ, Yxy). In one embodiment, the output of the raster scaler is RGB data, XYZ data, or Yxy data. Alternatively, the output of the raster scaler is Yu′v′ data, ICTCP data, or ACES data.
In one embodiment, the output of the raster scaler is sent to a graphics display controller. In one embodiment, the graphics display controller is operable to provide display information for a graphical user interface (GUI). In one embodiment, the RGB video controller and the XYZ video controller block image data from entering the video bus. Raster data includes, but is not limited to, synchronization data, an SOF, an EOF, a frame rate, a pixel order, multi-primary data (e.g., RGBCMY data), and/or a pixel count. In one embodiment, the raster data is limited to an RGB output that is operable to be transmitted to the at least one output formatter and/or encoder.
For common video display, a separate path is included. The separate path is operable to provide outputs including, but not limited to, SMPTE SDI, Ethernet, DisplayPort, and/or HDMI to the at least one output formatter and/or encoder. The at least one video display controller (e.g., RGB video display controller) is operable to limit and/or optimize video data for streaming and/or compression. In one embodiment, the RGB video display controller and the XYZ video display controller block image data from entering the video bus.
In a preferred embodiment, image data is provided by the raster scaler in the format provided by the file being played (e.g., RGB, multi-primary (e.g., RGBCMY), XYZ, Yxy). In one embodiment, the raster scaler presets the XYZ video display controller as the format provided and contained within the raster size to be displayed. In one embodiment, non-linear information (e.g., OOTF) sent from the API through the command CPU is sent to the linear converter. The linear converter is operable to use the non-linear information. For example, if the image data was authored using an OETF, then an inverse of the OETF is operable to be used by the linear converter, or, if the image information already has an EOTF applied, the inverse of the EOTF is operable to be used by the linear converter. In one embodiment, the linear converter develops an EOTF map to linearize input data (e.g., when EOTF data is available). In one embodiment, the linear converter uses an EOTF when already available. After linear data is loaded and a summation process is developed, the XYZ video display controller passes the image data in its native format (e.g., RGB, multi-primary data (e.g., RGBCMY), XYZ, Yxy), but without a non-linearity applied to the luminance (e.g., Y) component. The color channel-to-XYZ converter is operable to accept a native format (e.g., RGB, multi-primary data (e.g., RGBCMY), XYZ, Yxy) and convert to an XYZ format. In one embodiment, the XYZ format includes at least one chromatic adaptation (e.g., D60 to D65). For RGB, the XYZ video display controller uses data supplied from the command CPU, which obtains color gamut and white point specifications from the API to convert to an XYZ output. For a multi-primary system, a corresponding matrix or a look-up table (LUT) is used to convert from the multi-primary system to XYZ. In one embodiment, the multi-primary system is RGBCMY (e.g., 6P-B, 6P-C, S6 Pa, S6Pb). For a Yxy system, the color channel-to-XYZ converter formats the Yxy data back to XYZ data. In another embodiment, the color channel-to-XYZ converter is bypassed. For example, the color channel-to-XYZ converter is bypassed if there is a requirement to stay within a multi-primary system. Additionally, the color channel-to-XYZ converter is bypassed for XYZ data.
In one embodiment, the input to the scaler and/or limiter is XYZ data or multi-primary data. In one embodiment, the multi-primary data includes, but is not limited to, RGBCMY (e.g., 6P-B, 6P-C, S6 Pa, S6Pb), RGBC, RG1G2B, RGBCW, RGBCY, RG1G2BW, RGBWRWGWB, or R1R2G1G2B1B2. Other multi-primary data formats are compatible with the present invention. The scaler and/or limiter is operable to map out of gamut values (e.g., negative values) to in gamut values (e.g., out of gamut values developed in the process to convert to XYZ). In one embodiment, the scaler and/or limiter uses a gamut mapping algorithm to map out of gamut values to in gamut values.
In one embodiment, the input to the scaler and/or limiter is multi-primary data and all channels are optimized to have values between 0 and 1. For example, if the input is RGBCMY data, all six channels are optimized to have values between 0 and 1. In one embodiment, the output of the scaler and/or limiter is operable to be placed into a three-dimensional (3-D) multi-column LUT. In one embodiment, the 3-D multi-column LUT includes one column for each channel. For example, if the output is RGBCMY data, the 3-D multi-column LUT includes six columns (i.e., one for each channel). Within the application feeding the API, each channel is operable to be selected to balance out the white point and/or shade the image toward one particular color channel. In one embodiment, the 3-D multi-column LUT is bypassed if the output of the scaler and/or limiter is XYZ data. The output of the 3-D multi-column LUT is sent to the XYZ-to-Yxy converter, where a simple summation process is used to make the conversion. In one embodiment, if the video data is RGBCMY, the XYZ-to-Yxy converter process is bypassed.
Because the image data is linear, any tone curve can be added to the luminance (e.g., Y). The advantage to the present invention using, e.g., Yxy data or Yu′v′ data, is that only the luminance needs a tone curve modification. L*a*b* has a ⅓ gamma applied to all three channels. IPT and ICTCP operate with a gamma in all three channels. The tone curve is operable to be added to the luminance (e.g., Y) only, with the colorimetric coordinates (e.g., x and y channels, u′ and v′ channels) remaining linear. The tone curve is operable to be anything (e.g., a non-linear function), including standard values currently used. In one embodiment, the tone curve is an EOTF (e.g., those described for television and/or digital cinema). Additionally or alternatively, the tone curve includes HDR modifications.
In one embodiment, the output is handled through this process as three to six individual components (e.g., three components for Yxy or XYZ, six components for RGBCMY, etc.). Alternative number of primaries and components are compatible with the present invention. However, in some serial formats, this level of payload is too large. In one embodiment, the sampling selector sets a sampling structure (e.g., 4:4:4, 4:2:2, 4:2:0, 4:1:1). In one embodiment, the sampling selector is operable to subsample processed image data. The sampling selector is preferably controlled by the command CPU. In one embodiment, the command CPU gets its information from the API and/or the display EDID. In a preferred embodiment, the sampling selector also places each component in the correct serial data position as shown in Table 8 (supra).
The output of the sampling select is fed to the main video bus, which integrates SOF and EOF information into the image data. It then distributes this to the at least one output formatter and/or encoder. In one embodiment, the output is RGBCMY. In one embodiment, the RGBCMY output is configured as 4:4:4:4:4:4 data. The format to the at least one viewing device includes, but is not limited to, SMPTE ST2082 (e.g., 3, 6, and 12G serial data output), SMPTE ST2110 (e.g., to move through ethernet), and/or CTA 861 (e.g., DisplayPort, HDMI). The video card preferably has the appropriate connectors (e.g., DisplayPort, HDMI) for distribution through any external system (e.g., computer) and connection to at least one viewing device (e.g., monitor, television, etc.). The at least one viewing device includes, but is not limited to, a smartphone, a tablet, a laptop screen, a light emitting diode (LED) display, an organic light emitting diode (OLED) display, a miniLED display, a microLED display, a liquid crystal display (LCD), a quantum dot display, a quantum nano emitting diode (QNED) device, a personal gaming device, a virtual reality (VR) device and/or an augmented reality (AR) device, an LED wall, a wearable display, and at least one projector. In one embodiment, the at least one viewing device is a single viewing device.
Six-Primary Color Encode Using a 4:4:4 Sampling Method
Subjective testing during the development and implementation of the current digital cinema system (DCI Version 1.2) showed that perceptible quantizing artifacts were not noticeable with system bit resolutions higher than 11 bits. Current serial digital transport systems support 12 bits. Remapping six color components to a 12-bit stream is accomplished by lowering the bit limit to 11 bits (values 0 to 2047) for 12-bit serial systems or 9 bits (values 0 to 512) for 10-bit serial systems. This process is accomplished by processing multi-primary (e.g., RGBCMY) video information through a standard Optical Electronic Transfer Function (OETF) (e.g., ITU-R BT.709-6), digitizing the video information as four samples per pixel, and quantizing the video information as 11-bit or 9-bit.
In another embodiment, the multi-primary (e.g., RGBCMY) video information is processed through a standard Optical Optical Transfer Function (OOTF). In yet another embodiment, the multi-primary (e.g., RGBCMY) video information is processed through a Transfer Function (TF) other than OETF or OOTF. TFs consist of two components, a Modulation Transfer Function (MTF) and a Phase Transfer Function (PTF). The MTF is a measure of the ability of an optical system to transfer various levels of detail from object to image. In one embodiment, performance is measured in terms of contrast (degrees of gray), or of modulation, produced for a perfect source of that detail level. The PTF is a measure of the relative phase in the image(s) as a function of frequency. A relative phase change of 180°, for example, indicates that black and white in the image are reversed. This phenomenon occurs when the TF becomes negative.
There are several methods for measuring MTF. In one embodiment, MTF is measured using discrete frequency generation. In one embodiment, MTF is measured using continuous frequency generation. In another embodiment, MTF is measured using image scanning. In another embodiment, MTF is measured using waveform analysis.
In one embodiment, the six-primary color system is for a 12-bit serial system. Current practices normally set black at bit value 0 and white at bit value 4095 for 12-bit video. In order to package six colors into the existing three-serial streams, the bit defining black is moved to bit value 2048. Thus, the new encode has RGB values starting at bit value 2048 for black and bit value 4095 for white and non-RGB primary (e.g., CMY) values starting at bit value 2047 for black and bit value as white. In another embodiment, the six-primary color system is for a 10-bit serial system.
In one embodiment, the OETF process is defined in ITU-R BT.709-6, published in 2015, which is incorporated herein by reference in its entirety. In one embodiment, the OETF process is defined in ITU-R BT.709-5, published in 2002, which is incorporated herein by reference in its entirety. In another embodiment, the OETF process is defined in ITU-R BT.709-4, published in 2000, which is incorporated herein by reference in its entirety. In yet another embodiment, the OETF process is defined in ITU-R BT.709-3, published in 1998, which is incorporated herein by reference in its entirety. In yet another embodiment, the OETF process is defined in ITU-R BT.709-2, published in 1995, which is incorporated herein by reference in its entirety. In yet another embodiment, the OETF process is defined in ITU-R BT.709-1, published in 1993, which is incorporated herein by reference in its entirety.
In one embodiment, the encoder is a non-constant luminance encoder. In another embodiment, the encoder is a constant luminance encoder.
Six-Primary Color Packing/Stacking Using a 4:4:4 Sampling Method
System 2 uses sequential mapping to the standard transport format, so it includes a delay for the non-RGB (e.g., CMY) data. The non-RGB (e.g., CMY) data is recovered in the decoder by delaying the RGB data. Since there is no stacking process, the full bit level video can be transported. For displays that are using optical filtering, this RGB delay could be removed and the process of mapping image data to the correct filter could be eliminated by assuming this delay with placement of the optical filter and the use of sequential filter colors.
Two methods can be used based on the type of optical filter used. Since this system is operating on a horizontal pixel sequence, some vertical compensation is required and pixels are rectangular. This can be either as a line double repeat using the same multi-primary (e.g., RGBCMY) data to fill the following line as shown in
The decode adds a pixel delay to the RGB data to realign the channels to a common pixel timing. EOTF is applied and the output is sent to the next device in the system. Metadata based on the standardized transport format is used to identify the format and image resolution so that the unpacking from the transport can be synchronized.
In one embodiment, the decoding is 4:4:4 decoding. With this method, the six-primary color decoder is in the signal path, where 11-bit values for RGB are arranged above bit value 2048, while non-RGB (e.g., CMY) levels are arranged below bit value 2047 as 11-bit. If the same data set is sent to a display and/or process that is not operable for six-primary color processing, the image data is assumed as black at bit value 0 as a full 12-bit word. Decoding begins by tapping image data prior to the unstacking process.
Six-Primary Color Encode Using a 4:2:2 Sampling Method
In one embodiment, the packing/stacking process is for a six-primary color system using a 4:2:2 sampling method. In order to fit the new six-primary color system into a lower bandwidth serial system, while maintaining backwards compatibility, the standard method of converting from six primaries (e.g., RGBCMY) to a luminance and a set of color difference signals requires the addition of at least one new image designator. In one embodiment, the encoding and/or decoding process is compatible with transport through SMPTE ST 292-0 (2011), SMPTE ST 292-1 (2011, 2012, and/or 2018), SMPTE ST 292-2 (2011), SMPTE ST 2022-1 (2007), SMPTE ST 2022-2 (2007), SMPTE ST 2022-3 (2010), SMPTE ST 2022-4 (2011), SMPTE ST 2022-5 (2012 and/or 2013), SMPTE ST 2022-6 (2012), SMPTE ST 2022-7 (2013), and/or and CTA 861-G (2106), each of which is incorporated herein by reference in its entirety.
In order for the system to package all of the image while supporting both six-primary and legacy displays, an electronic luminance component (Y) must be derived. The first component is: EY
E
Y
′=0.10634E′Red+0.23195EYellow′+0.3576EGreen′+0.19685ECyan′+0.0361EBlue′+0.0712EMagenta′
Critical to getting back to legacy display compatibility, value E−Y′ is described as:
E
−Y
′=E
Y
′−(ECyan′+EYellow′+EMagenta′)
In addition, at least two new color components are disclosed. These are designated as Cc and Cy components. The at least two new color components include a method to compensate for luminance and enable the system to function with older Y Cb Cr infrastructures. In one embodiment, adjustments are made to Cb and Cr in a Y Cb Cr infrastructure since the related level of luminance is operable for division over more components. These new components are as follows:
Within such a system, it is not possible to define magenta as a wavelength. This is because the green vector in CIE 1976 passes into, and beyond, the CIE designated purple line. Magenta is a sum of blue and red. Thus, in one embodiment, magenta is resolved as a calculation, not as optical data. In one embodiment, both the camera side and the monitor side of the system use magenta filters. In this case, if magenta were defined as a wavelength, it would not land at the point described. Instead, magenta would appear as a very deep blue which would include a narrow bandwidth primary, resulting in metameric issues from using narrow spectral components. In one embodiment, magenta as an integer value is resolved using the following equation:
The above equation assists in maintaining the fidelity of a magenta value while minimizing any metameric errors. This is advantageous over prior art, where magenta appears instead as a deep blue instead of the intended primary color value.
Six-Primary Non-Constant Luminance Encode Using a 4:2:2 Sampling Method
In one embodiment, the six-primary color system using a non-constant luminance encode for use with a 4:2:2 sampling method. In one embodiment, the encoding process and/or decoding process is compatible with transport through SMPTE ST 292-0 (2011), SMPTE ST 292-1 (2011, 2012, and/or 2018), SMPTE ST 292-2 (2011), SMPTE ST 2022-1 (2007), SMPTE ST 2022-2 (2007), SMPTE ST 2022-3 (2010), SMPTE ST 2022-4 (2011), SMPTE ST 2022-5 (2012 and/or 2013), SMPTE ST 2022-6 (2012), SMPTE ST 2022-7 (2013), and/or and CTA 861-G (2106), each of which is incorporated herein by reference in its entirety.
Current practices use a non-constant luminance path design, which is found in all the video systems currently deployed.
The output is then subtracted from ER′, EB′, EC′, and EY′ to make the following color difference components:
E
CR
′,E
CB
′,E
CC
′,E
CY′
These components are then half sampled (×2) while EY
Six-Primary Non-Constant Luminance Decode Using a 4:2:2 Sampling Method
In one embodiment, the decoding is 4:2:2 decoding. This decode follows the same principles as the 4:4:4 decoder. However, in 4:2:2 decoding, a luminance channel is used instead of discrete color channels. Here, image data is still taken prior to unstack from the ECB-INT′+ECY-INT′ and ECR-INT′+ECC-INT′ channels. With a 4:2:2 decoder, a new component, called E−Y′, is used to subtract the luminance levels that are present from the CMY channels from the ECB-INT′+ECY-INT′ and ECR-INT′+ECC-INT′ INT components. The resulting output is now the R and B image components of the EOTF process. E−Y′ is also sent to the G matrix to convert the luminance and color difference components to a green output. Thus, R′G ‘B’ is input to the EOTF process and output as GRGB, RRGB, and BRGB. In another embodiment, the decoder is a legacy RGB decoder for non-constant luminance systems.
In one embodiment, the standard is SMPTE ST292. In one embodiment, the standard is SMPTE RP431-2. In one embodiment, the standard is ITU-R BT.2020. In another embodiment, the standard is SMPTE RP431-1. In another embodiment, the standard is ITU-R BT.1886. In another embodiment, the standard is SMPTE ST274. In another embodiment, the standard is SMPTE ST296. In another embodiment, the standard is SMPTE ST2084. In yet another embodiment, the standard is ITU-R BT.2100. In yet another embodiment, the standard is SMPTE ST424. In yet another embodiment, the standard is SMPTE ST425. In yet another embodiment, the standard is SMPTE ST2110.
Six-Primary Constant Luminance Decode Using a 4:2:2 Sampling Method
System 2 operation is using a sequential method of mapping to the standard transport instead of the method in System 1 where pixel data is combined to two color primaries in one data set as an 11-bit word. The advantage of System 1 is that there is no change to the standard transport. The advantage of System 2 is that full bit level video can be transported, but at double the normal data rate.
The difference between the systems is the use of two Y channels in System 2. In one embodiment, YRGB and YCMY are used to define the luminance value for RGB as one group and CMY for the other. Alternative primaries are compatible with the present invention.
The encoder for System 2 takes the formatted color components in the same way as System 1. Two matrices are used to build two luminance channels. YRGB contains the luminance value for the RGB color primaries. YCMY contains the luminance value for the CMY color primaries. A set of delays are used to sequence the proper channel for YRGB, YCMY, and the RBCY channels. Because the RGB and non-RGB (e.g., CMY) components are mapped at different time intervals, there is no requirement for a stacking process, and data is fed directly to the transport format. The development of the separate color difference components is identical to System 1. The Encoder for System 2 takes the formatted color components in the same way as System 1. Two matrices are used to build two luminance channels: YRGB contains the luminance value for the RGB color primaries and YCMY contains the luminance value for the CMY color primaries. This sequences YRGB, CR, and CC channels into the even segments of the standardized transport and YCMY, CB, and CY into the odd numbered segments. Since there is no combining color primary channels, full bit levels can be used limited only by the design of the standardized transport method. In addition, for use in matrix driven displays, there is no change to the input processing and only the method of outputting the correct color is required if the filtering or emissive subpixel is also placed sequentially.
Timing for the sequence is calculated by the source format descriptor which then flags the start of video and sets the pixel timing.
The constant luminance system is not different from the non-constant luminance system in regard to operation. The difference is that the luminance calculation is done as a linear function instead of including the OOTF.
Six-Primary Color System Using a 4:2:0 Sampling System
In one embodiment, the six-primary color system uses a 4:2:0 sampling system. The 4:2:0 format is widely used in H.262/MPEG-2, H.264/MPEG-4 Part 10 and VC-1 compression. The process defined in SMPTE RP2050-1 provides a direct method to convert from a 4:2:2 sample structure to a 4:2:0 structure. When a 4:2:0 video decoder and encoder are connected via a 4:2:2 serial interface, the 4:2:0 data is decoded and converted to 4:2:2 by up-sampling the color difference component. In the 4:2:0 video encoder, the 4:2:2 video data is converted to 4:2:0 video data by down-sampling the color difference component.
There typically exists a color difference mismatch between the 4:2:0 video data from the 4:2:0 video data to be encoded. Several stages of codec concatenation are common through the processing chain. As a result, color difference signal mismatch between 4:2:0 video data input to 4:2:0 video encoder and 4:2:0 video output from 4:2:0 video decoder is accumulated and the degradation becomes visible.
Filtering within a Six-Primary Color System Using a 4:2:0 Sampling Method
When a 4:2:0 video decoder and encoder are connected via a serial interface, 4:2:0 data is decoded and the data is converted to 4:2:2 by up-sampling the color difference component, and then the 4:2:2 video data is mapped onto a serial interface. In the 4:2:0 video encoder, the 4:2:2 video data from the serial interface is converted to 4:2:0 video data by down-sampling the color difference component. At least one set of filter coefficients exists for 4:2:0/4:2:2 up-sampling and 4:2:2/4:2:0 down-sampling. The at least one set of filter coefficients provide minimally degraded 4:2:0 color difference signals in concatenated operations.
Filter Coefficients in a Six-Primary Color System Using a 4:2:0 Sampling Method
In one embodiment, the raster is an RGB raster. In another embodiment, the raster is a RGBCMY raster.
Six-Primary Color System Backwards Compatibility
By designing the color gamut within the saturation levels of standard formats and using inverse color primary positions, it is easy to resolve an RGB image with minimal processing. In one embodiment for six-primary encoding, image data is split across three color channels in a transport system. In one embodiment, the image data is read as six-primary data. In another embodiment, the image data is read as RGB data. By maintaining a standard white point, the axis of modulation for each channel is considered as values describing two colors (e.g., blue and yellow) for a six-primary system or as a single color (e.g., blue) for an RGB system. This is based on where black is referenced. In one embodiment of a six-primary color system, black is decoded at a mid-level value. In an RGB system, the same data stream is used, but black is referenced at bit zero, not a mid-level.
In one embodiment, the RGB values encoded in the 6P stream are based on ITU-R BT.709. In another embodiment, the RGB values encoded are based on SMPTE RP431. Advantageously, these two embodiments require almost no processing to recover values for legacy display.
Two decoding methods are proposed. The first is a preferred method that uses very limited processing, negating any issues with latency. The second is a more straightforward method using a set of matrices at the end of the signal path to conform the 6P image to RGB.
In one embodiment, the decoding is for a 4:4:4 system. In one embodiment, the assumption of black places the correct data with each channel. If the 6P decoder is in the signal path, 11-bit values for RGB are arranged above bit value 2048, while CMY level are arranged below bit value 2047 as 11-bit. However, if this same data set is sent to a display or process that is does not understand 6P processing, then that image data is assumed as black at bit value 0 as a full 12-bit word.
Alternatively, the decoding is for a 4:2:2 system. This decode uses the same principles as the 4:4:4 decoder, but because a luminance channel is used instead of discrete color channels, the processing is modified. Legacy image data is still taken prior to unstack from the ECB-INT′+ECY-INT′ and ECR-INT′+ECC-INT′ channels as shown in
For a constant luminance system, the process is very similar with the exception that green is calculated as linear as shown in
Six-Primary Color System Using a Matrix Output
In one embodiment, the six-primary color system outputs a legacy RGB image. This requires a matrix output to be built at the very end of the signal path.
In an alternative embodiment, the saturation values of the C, M, and Y primaries are not required to be substantially equal to their corollary primary saturation value among the R, G, and B primaries, but are substantially equal in saturation to a primary other than their corollary R, G, or B primary value. For example, the C primary saturation value is not required to be substantially equal in saturation to the R primary saturation value, but rather is substantially equal in saturation to the G primary saturation value and/or the B primary saturation value. In one embodiment, two different color saturations are used, wherein the two different color saturations are based on standardized gamuts already in use.
In one embodiment, substantially inverted hue angles refers to a ±10% angle range from an inverted hue angle (e.g., 180 degrees). In addition, substantially inverted hue angles cover additional percentage differences within the ±10% angle range from an inverted hue angle. For example, substantially inverted hue angles further covers a ±7.5% angle range from an inverted hue angle, a ±5% angle range from an inverted hue angle, a ±2% angle range from an inverted hue angle, a ±1% angle range from an inverted hue angle, and/or a ±0.5% angle range from an inverted hue angle. In a preferred embodiment, the C, M, and Y primaries are placed at inverted hue angles (e.g., 180 degrees) compared to the R, G, and B primaries, respectively.
In one embodiment, the gamut is the ITU-R BT.709-6 gamut. In another embodiment, the gamut is the SMPTE RP431-2 gamut.
The unstack process includes output as six, 11-bit color channels that are separated and delivered to a decoder. To convert an image from a six-primary color system to an RGB image, at least two matrices are used. One matrix is a 3×3 matrix converting a six-primary color system image to XYZ values. A second matrix is a 3×3 matrix for converting from XYZ to the proper RGB color space. In one embodiment, XYZ values represent additive color space values, where XYZ matrices represent additive color space matrices. Additive color space refers to the concept of describing a color by stating the amounts of primaries that, when combined, create light of that color.
When a six-primary display is connected to the six-primary output, each channel will drive each color. When this same output is sent to an RGB display, the non-RGB (e.g., CMY) channels are ignored and only the RGB channels are displayed. An element of operation is that both systems drive from the black area. At this point in the decoder, all are coded as bit value 0 being black and bit value 2047 being peak color luminance. This process can also be reversed in a situation where an RGB source can feed a six-primary display. The six-primary display would then have no information for the non-RGB (e.g., CMY) channels and would display the input in a standard RGB gamut.
The design of this matrix is a modification of the CIE process to convert RGB to XYZ. First, u′y′ values are converted back to CIE 1931 xyz values using the following formulas:
Next, RGBCMY values are mapped to a matrix. The mapping is dependent upon the gamut standard being used. In one embodiment, the gamut is ITU-R BT.709-6. The mapping for RGBCMY values for an ITU-R BT.709-6 (6P-B) gamut are:
In one embodiment, the gamut is SMPTE RP431-2. The mapping for RGBCMY values for a SMPTE RP431-2 (6P-C) gamut are:
Following mapping the RGBCMY values to a matrix, a white point conversion occurs:
For a six-primary color system using an ITU-R BT.709-6 (6P-B) color gamut, the white point is D65:
For a six-primary color system using a SMPTE RP431-2 (6P-C) color gamut, the white point is D60:
Following the white point conversion, a calculation is required for RGB saturation values, SR, SG, and SB. The results from the second operation are inverted and multiplied with the white point XYZ values. In one embodiment, the color gamut used is an ITU-R BT.709-6 color gamut. The values calculate as:
Where
In one embodiment, the color gamut is a SMPTE RP431-2 color gamut. The values calculate as:
Where
Next, a six-primary color-to-XYZ matrix must be calculated. For an embodiment where the color gamut is an ITU-R BT.709-6 color gamut, the calculation is as follows:
Wherein the resulting matrix is multiplied by the SRSGSB matrix:
For an embodiment where the color gamut is a SMPTE RP431-2 color gamut, the calculation is as follows:
Wherein the resulting matrix is multiplied by the SRSGSB matrix:
Finally, the XYZ matrix must converted to the correct standard color space. In an embodiment where the color gamut used is an ITU-R BT709.6 color gamut, the matrices are as follows:
In an embodiment where the color gamut used is a SMPTE RP431-2 color gamut, the matrices are as follows:
Packing a Six-Primary Color System into IcTCP
ICTCP (ITP) is a color representation format specified in the Rec. ITU-R BT.2100 standard that is used as a part of the color image pipeline in video and digital photography systems for high dynamic range (HDR) and wide color gamut (WCG) imagery. The I (intensity) component is a luma component that represents the brightness of the video. CT and CP are blue-yellow (“tritanopia”) and red-green (“protanopia”) chroma components. The format is derived from an associated RGB color space by a coordination transformation that includes two matrix transformations and an intermediate non-linear transfer function, known as a gamma pre-correction. The transformation produces three signals: I, CT, and CP. The ITP transformation can be used with RGB signals derived from either the perceptual quantizer (PQ) or hybrid log-gamma (HLG) nonlinearity functions. The PQ curve is described in ITU-R BT2100-2:2018, Table 4, which is incorporated herein by reference in its entirety.
Output from the OETF is converted to ITP format. The resulting matrix is:
RGBCMY data, based on an ITU-R BT.709-6 color gamut, is converted to an XYZ matrix. The resulting XYZ matrix is converted to an LMS matrix, which is sent to an OETF. Once processed by the OETF, the LMS matrix is converted to an ITP matrix. The resulting ITP matrix is as follows:
In another embodiment, the LMS matrix is sent to an Optical Optical Transfer Function (OOTF). In yet another embodiment, the LMS matrix is sent to a Transfer Function other than OOTF or OETF.
In another embodiment, the RGBCMY data is based on the SMPTE ST431-2 (6P-C) color gamut. The matrices for an embodiment using the SMPTE ST431-2 color gamut are as follows:
The resulting ITP matrix is:
The decode process uses the standard ITP decode process, as the SRSGSB cannot be easily inverted. This makes it difficult to recover the six RGBCMY components from the ITP encode. Therefore, the display is operable to use the standard ICtCp decode process as described in the standards and is limited to just RGB output.
Converting to a Five-Color Multi-Primary Display
In one embodiment, the system is operable to convert image data incorporating five primary colors. In one embodiment, the five primary colors include Red (R), Green (G), Blue (G), Cyan (C), and Yellow (Y), collectively referred to as RGBCY. In another embodiment, the five primary colors include Red (R), Green (G), Blue (B), Cyan (C), and Magenta (M), collectively referred to as RGBCM. In one embodiment, the five primary colors do not include Magenta (M).
In one embodiment, the five primary colors include Red (R), Green (G), Blue (B), Cyan (C), and Orange (O), collectively referred to as RGBCO. RGBCO primaries provide optimal spectral characteristics, transmittance characteristics, and makes use of a D65 white point. See, e.g., Moon-Cheol Kim et al., Wide Color Gamut Five Channel Multi-Primary for HDTV Application, Journal of Imaging Sci. & Tech. Vol. 49, No. 6, November/December 2005, at 594-604, which is hereby incorporated by reference in its entirety.
In one embodiment, a five-primary color model is expressed as F=M·C, where F is equal to a tristimulus color vector, F=(X, Y, Z)T, and C is equal to a linear display control vector, C=(C1, C2, C3, C4, C5)T. Thus, a conversion matrix for the five-primary color model is represented as
Using the above equation and matrix, a gamut volume is calculated for a set of given control vectors on the gamut boundary. The control vectors are converted into CIELAB uniform color space. However, because matrix M is non-square, the matrix inversion requires splitting the color gamut into a specified number of pyramids, with the base of each pyramid representing an outer surface and where the control vectors are calculated using linear equation for each given XYZ triplet present within each pyramid. By separating regions into pyramids, the conversion process is normalized. In one embodiment, a decision tree is created in order to determine which set of primaries are best to define a specified color. In one embodiment, a specified color is defined by multiple sets of primaries. In order to locate each pyramid, 2D chromaticity look-up tables are used, with corresponding pyramid numbers for input chromaticity values in xy or u′v′. Typical methods using pyramids require 1000×1000 address ranges in order to properly search the boundaries of adjacent pyramids with look-up table memory. The system of the present invention uses a combination of parallel processing for adjacent pyramids and at least one algorithm for verifying solutions by checking constraint conditions. In one embodiment, the system uses a parallel computing algorithm. In one embodiment, the system uses a sequential algorithm. In another embodiment, the system uses a brightening image transformation algorithm. In another embodiment, the system uses a darkening image transformation algorithm. In another embodiment, the system uses an inverse sinusoidal contrast transformation algorithm. In another embodiment, the system uses a hyperbolic tangent contrast transformation algorithm. In yet another embodiment, the system uses a sine contrast transformation execution times algorithm. In yet another embodiment, the system uses a linear feature extraction algorithm. In yet another embodiment, the system uses a JPEG2000 encoding algorithm. In yet another embodiment, the system uses a parallelized arithmetic algorithm. In yet another embodiment, the system uses an algorithm other than those previously mentioned. In yet another embodiment, the system uses any combination of the aforementioned algorithms.
Mapping a Six-Primary Color System into Standardized Transport Formats
Each encode and/or decode system fits into existing video serial data streams that have already been established and standardized. This is key to industry acceptance. Encoder and/or decoder designs require little or no modification for a six-primary color system to map to these standard serial formats.
The process for mapping a six-primary color system to a SMPTE ST425 format is the same as mapping to a SMPTE ST424 format. To fit a six-primary color system into a SMPTE ST425/424 stream involves the following substitutions: GINT′+MINT′ is placed in the Green data segments, RINT′+CINT′ is placed in the Red data segments, and BINT′+YINT′ is placed into the Blue data segments.
System 2 requires twice the data rate as System 1, so it is not compatible with SMPTE 424. However, it maps easily into SMPTE ST2082 using a similar mapping sequence. In one example, System 2 is used to have the same data speed defined for 8K imaging to show a 4K image.
In one embodiment, sub-image and data stream mapping occur as shown in SMPTE ST2082. An image is broken into four sub-images, and each sub-image is broken up into two data streams (e.g., sub-image 1 is broken up into data stream 1 and data stream 2). The data streams are put through a multiplexer and then sent to the interface as shown in
In one embodiment, the standard serial format is SMPTE ST292. SMPTE ST292 is an older standard than ST424 and is a single wire format for 1.5 GB video, whereas ST424 is designed for up to 3 GB video. However, while ST292 can identify the payload ID of SMPTE ST352, it is constrained to only accepting an image identified by a hex value, 0h. All other values are ignored. Due to the bandwidth and identifications limitations in ST292, a component video six-primary color system incorporates a full bit level luminance component. To fit a six-primary color system into a SMPTE ST292 stream involves the following substitutions: EY
SMPTE ST292 and ST424 Serial Digital Interface (SDI) formats include payload identification (ID) metadata to help the receiving device identify the proper image parameters. The tables for this need modification by adding at least one flag identifying that the image source is a six-primary color RGB image. Therefore, six-primary color system format additions need to be added. In one embodiment, the standard is the SMPTE ST352 standard.
In another embodiment, the standard serial format is SMPTE ST2082. Where a six-primary color system requires more data, it is not always compatible with SMPTE ST424. However, it maps easily into SMPTE ST2082 using the same mapping sequence. This usage has the same data speed defined for 8K imaging in order to display a 4K image.
In another embodiment, the standard serial format is SMPTE ST2022. Mapping to ST2022 is similar to mapping to ST292 and ST242, but as an ETHERNET format. The output of the stacker is mapped to the media payload based on Real-time Transport Protocol (RTP) 3550, established by the Internet Engineering Task Force (IETF). RTP provides end-to-end network transport functions suitable for applications transmitting real-time data, including, but not limited to, audio, video, and/or simulation data, over multicast or unicast network services. The data transport is augmented by a control protocol (RTCP) to allow monitoring of the data delivery in a manner scalable to large multicast networks, and to provide control and identification functionality. There are no changes needed in the formatting or mapping of the bit packing described in SMPTE ST 2022-6: 2012 (HBRMT), which is incorporated herein by reference in its entirety.
In another embodiment, the standard is SMPTE ST2110. SMPTE ST2110 is a relatively new standard and defines moving video through an Internet system. The standard is based on development from the IETF and is described under RFC3550. Image data is described through “pgroup” construction. Each pgroup consists of an integer number of octets. In one embodiment, a sample definition is RGB or YCbCr and is described in metadata. In one embodiment, the metadata format uses a Session Description Protocol (SDP) format. Thus, pgroup construction is defined for 4:4:4, 4:2:2, and 4:2:0 sampling as 8-bit, 10-bit, 12-bit, and in some cases 16-bit and 16-bit floating point wording. In one embodiment, six-primary color image data is limited to a 10-bit depth. In another embodiment, six-primary color image data is limited to a 12-bit depth. Where more than one sample is used, it is described as a set. For example, 4:4:4 sampling for blue, as a non-linear RGB set, is described as C0′B, C1′B, C2′B, C3′B, and C4′B. The lowest number index being left most within the image. In another embodiment, the method of substitution is the same method used to map six-primary color content into the ST2110 standard.
In another embodiment, the standard is SMPTE ST2110. SMPTE ST2110-20 describes the construction for each pgroup. In one embodiment, six-primary color system content arrives for mapping as non-linear data for the SMPTE ST2110 standard. In another embodiment, six-primary color system content arrives for mapping as linear data for the SMPTE ST2110 standard.
Non-linear RGBCMY image data arrives as: GINT′+MINT′, RINT′+Cint′, and BINT′+YINT′. Component substitution follows what has been described for SMPTE ST424, where GINT′+MINT′ is placed in the Green data segments, RINT′+CINT′ is placed in the Red data segments, and BINT′+YINT′ is placed in the Blue data segments. The sequence described in the standard is shown as R0′, G0′, B0′, R1′, G1′, B1′, etc.
Table 17 summarizes mapping to SMPTE ST2110 for 4:2:2:2:2 and 4:2:0:2:0 sampling for System 1 and Table 18 summaries mapping to SMPTE ST2110 for 4:4:4:4:4:4 sampling (linear and non-linear) for System 1.
Table 19 summarizes mapping to SMPTE ST2110 for 4:2:2:2:2 sampling for System 2 and Table 20 summaries mapping to SMPTE ST2110 for 4:4:4:4:4:4 sampling (linear and non-linear) for System 2.
Session Description Protocol (SDP) Modification for a Six-Primary Color System
SDP is derived from IETF RFC 4566 which sets parameters including, but not limited to, bit depth and sampling parameters. IETF RFC 4566 (2006) is incorporated herein by reference in its entirety. In one embodiment, SDP parameters are contained within the RTP payload. In another embodiment, SDP parameters are contained within the media format and transport protocol. This payload information is transmitted as text. Therefore, modifications for the additional sampling identifiers requires the addition of new parameters for the sampling statement. SDP parameters include, but are not limited to, color channel data, image data, framerate data, a sampling standard, a flag indicator, an active picture size code, a timestamp, a clock frequency, a frame count, a scrambling indicator, and/or a video format indicator. For non-constant luminance imaging, the additional parameters include, but are not limited to, RGBCMY-4:4:4, YBRCY-4:2:2, and YBRCY-4:2:0. For constant luminance signals, the additional parameters include, but are not limited to, CLYBRCY-4:2:2 and CLYBRCY-4:2:0.
Additionally, differentiation is included with the colorimetry identifier in one embodiment. For example, 6PB1 defines 6P with a color gamut limited to ITU-R BT.709 formatted as System 1, 6PB2 defines 6P with a color gamut limited to ITU-R BT.709 formatted as System 2, 6PB3 defines 6P with a color gamut limited to ITU-R BT.709 formatted as System 3, 6PC1 defines 6P with a color gamut limited to SMPTE RP 431-2 formatted as System 1, 6PC2 defines 6P with a color gamut limited to SMPTE RP 431-2 formatted as System 2, 6PC3 defines 6P with a color gamut limited to SMPTE RP 431-2 formatted as System 3, 6PS1 defines 6P with a color gamut as Super 6P formatted as System 1, 6PS2 defines 6P with a color gamut as Super 6P formatted as System 2, and 6PS3 defines 6P with a color gamut as Super 6P formatted as System 3.
Colorimetry can also be defined between a six-primary color system using the ITU-R BT.709-6 standard and the SMPTE ST431-2 standard, or colorimetry can be left defined as is standard for the desired standard. For example, the SDP parameters for a 1920×1080 six-primary color system using the ITU-R BT.709-6 standard with a 10-bit signal as System 1 are as follows: m=video 30000 RTP/AVP 112, a=rtpmap:112 raw/90000, a=fmtp:112, sampling=YBRCY-4:2:2, width=1920, height=1080, exactframerate=30000/1001, depth=10, TCS=SDR, colorimetry=6PB1, PM=2110GPM, SSN=ST2110-20:2017.
In one embodiment, the six-primary color system is integrated with a Consumer Technology Association (CTA) 861-based system. CTA-861 establishes protocols, requirements, and recommendations for the utilization of uncompressed digital interfaces by consumer electronics devices including, but not limited to, digital televisions (DTVs), digital cable, satellite or terrestrial set-top boxes (STBs), and related peripheral devices including, but not limited to, DVD players and/or recorders, and other related Sources or Sinks.
These systems are provided as parallel systems so that video content is parsed across several line pairs. This enables each video component to have its own transition-minimized differential signaling (TMDS) path. TMDS is a technology for transmitting high-speed serial data and is used by the Digital Visual Interface (DVI) and High-Definition Multimedia Interface (HDMI) video interfaces, as well as other digital communication interfaces. TMDS is similar to low-voltage differential signaling (LVDS) in that it uses differential signaling to reduce electromagnetic interference (EMI), enabling faster signal transfers with increased accuracy. In addition, TMDS uses a twisted pair for noise reduction, rather than a coaxial cable that is conventional for carrying video signals. Similar to LVDS, data is transmitted serially over the data link. When transmitting video data, and using HDMI, three TMDS twisted pairs are used to transfer video data.
In such a system, each pixel packet is limited to 8 bits only. For bit depths higher than 8 bits, fragmented packs are used. This arrangement is no different than is already described in the current CTA-861 standard.
Based on CTA extension Version 3, identification of a six-primary color transmission is performed by the sink device (e.g., the monitor). Adding recognition of the additional formats is flagged in the CTA Data Block Extended Tag Codes (byte 3). Since codes 33 and above are reserved, any two bits could be used to identify that the format is RGB, RGBCMY, Y Cb Cr, or Y Cb Cr Cc Cy and/or identify System 1 or System 2. Should byte 3 define a six-primary sampling format, and where the block 5 extension identifies byte 1 as ITU-R BT.709, then logic assigns as 6P-B. However, should byte 4 bit 7 identify colorimetry as DCI-P3, the color gamut is assigned as 6P-C.
In one embodiment, the system alters the Auxiliary Video Information (AVI) Infoframe Data to identify content. AVI Infoframe Data is shown in Table 10 of CTA 861-G. In one embodiment, Y2=1, Y1=0, and Y0=0 identifies content as 6P 4:2:0:2:0. In another embodiment, Y2=1, Y1=0, and Y0=1 identifies content as Y Cr Cb Cc Cy. In yet another embodiment, Y2=1, Y1=1, and Y0=0 identifies content as RGBCMY.
Byte 2 C1=1, C0=1 identifies extended colorimetry in Table 11 of CTA 861-G. Byte 3 EC2, EC1, EC0 identifies additional colorimetry extension valid in Table 13 of CTA 861-G. Table 14 of CTA 861-G reserves additional extensions. In one embodiment, ACE3=1, ACE2=0, ACE1=0, and ACE0=X identifies 6P-B. In one embodiment, ACE3=0, ACE2=1, ACE1=0, and ACE0=X identifies 6P-C. In one embodiment, ACE3=0, ACE2=0, ACE1=1, and ACE0=X identifies System 1. In one embodiment, ACE3=1, ACE2=1, ACE1=0, and ACE0=X identifies System 2.
HDMI sampling systems include Extended Display Identification Data (EDID) metadata. EDID metadata describes the capabilities of a display device to a video source. The data format is defined by a standard published by the Video Electronics Standards Association (VESA). The EDID data structure includes, but is not limited to, manufacturer name and serial number, product type, phosphor or filter type, timings supported by the display, display size, luminance data, and/or pixel mapping data. The EDID data structure is modifiable and modification requires no additional hardware and/or tools.
EDID information is transmitted between the source device and the display through a display data channel (DDC), which is a collection of digital communication protocols created by VESA. With EDID providing the display information and DDC providing the link between the display and the source, the two accompanying standards enable an information exchange between the display and source.
In addition, VESA has assigned extensions for EDID. Such extensions include, but are not limited to, timing extensions (00), additional time data black (CEA EDID Timing Extension (02)), video timing block extensions (VTB-EXT (10)), EDID 2.0 extension (20), display information extension (DI-EXT (40)), localized string extension (LS-EXT (50)), microdisplay interface extension (MI-EXT (60)), display ID extension (70), display transfer characteristics data block (DTCDB (A7, AF, BF)), block map (FO), display device data block (DDDB (FF)), and/or extension defined by monitor manufacturer (FF).
In one embodiment, SDP parameters include data corresponding to a payload identification (ID) and/or EDID information.
Multi-Primary Color System Display
In one embodiment, the display is comprised of a single projector. A single projector six-primary color system requires the addition of a second cross block assembly for the additional colors. One embodiment of a single projector (e.g., single LCD projector) is shown in
In another embodiment, the display is comprised of a dual stack Digital Micromirror Device (DMD) projector system.
In one embodiment, the projectors are phosphor wheel systems. A yellow phosphor wheel spins in time with a DMD imager to output sequential RG. The second projector is designed the same, but uses a cyan phosphor wheel. The output from this projector becomes sequential BG. Combined, the output of both projectors is YRGGCB. Magenta is developed by synchronizing the yellow and cyan wheels to overlap the flashing DMD.
In another embodiment, the display is a single DMD projector solution. A single DMD device is coupled with an RGB diode light source system. In one embodiment, the DMD projector uses LED diodes. In one embodiment, the DMD projector includes CMY diodes. In another embodiment, the DMD projector creates CMY primaries using a double flashing technique.
In yet another embodiment, the display is a direct emissive assembled display. The design for a direct emissive assembled display includes a matrix of color emitters grouped as a six-color system. Individual channel inputs drive each Quantum Dot (QD) element illuminator and/or micro LED element.
Video Wall Display
In one embodiment, the present invention includes a video wall system wherein the display is a video wall. A video wall is useful as a large display, e.g., for viewing image data from a distance, for displaying image data to a crowd, for displaying a large image. In one embodiment, a video wall is a display that utilizes multiple display devices, e.g., multiple screens, multiple monitors, multiple projectors, to display image data. Preferably, a video wall is operable to display a set of image data wherein the set of image data is also viewable on a single display device, e.g., a single monitor. In the embodiment wherein a video wall includes a plurality of monitor displays, each of the plurality of monitor displays is operable to display a portion of the image data, wherein the full image represented by the image data is only visible when looking at the plurality of monitor displays as a whole.
In one embodiment, the video wall is connected to at least one video wall controller wherein the at least one video wall controller is operable to control which portion of the image data is displayed by which display device. In one embodiment, each of the display devices is connected to the at least one video wall controller. Alternatively, the display devices are connected in a daisy chain, wherein a first display devices is connected to the at least one video wall controller and the remainder of the display devices are connected in series to each other. In yet another alternative, the video wall includes a single display device, e.g., a screen, a projector, wherein the at least one video wall controller is operable to scale the image data to fill and fit the dimensions of the single display device. Hardware and software implementations of the at least one video wall controller are compatible with the present invention. In one embodiment, the at least one video wall controller is integrated into a video card wherein the video card includes at least one GPU for graphics processing. Alternatively, the at least one video wall controller is connected to the video card. In one embodiment, the display engine of the present invention is connected to the video card to provide image data. In one embodiment, the video wall system includes a plurality of video cards, wherein the plurality of video cards are linked together in parallel to scale graphics processing. Parallel processing techniques including, but not limited to, time-division, image division, and/or object division are compatible with the present invention.
In one embodiment, the video card includes at least one frame buffer wherein the at least one frame buffer is operable to convert pixel data (e.g., bits stored in a bitmap) to image data for display. The video card is operable to a use graphics library to render image data and fill the at least one frame buffer with rendered image data. In one embodiment, the video card is operable to make a copy of the at least one frame buffer wherein the copy is operable to be split and processed for display on the plurality of displays as described in U.S. Pat. No. 9,911,176, which was filed Jan. 12, 2015 and issued Mar. 6, 2018, and which is incorporated herein by reference in its entirety. In one embodiment, the at least one frame buffer includes a first frame buffer and a second frame buffer. The first frame buffer includes the image data in its entirety, while the second frame buffer includes a portion of the image data for display on one of the plurality of display devices as described in U.S. Pat. No. 9,035,969, which was filed Nov. 29, 2012 and issued May 19, 2015, and which is incorporated herein by reference in its entirety. In one embodiment, the video card includes at least one LUT wherein the at least one LUT enables an expanded color gamut for the image data, e.g., 6P-B, 6P-C for the image data stored in the frame buffer. In one embodiment, the at least one LUT is operable to be modified while the image data is being processed to allow for a broader range of colors.
In one embodiment, the at least one video wall controller is operable to send an image data signal to the video wall. The image data signal includes, but is not limited to, rendered image data, metadata, and/or display data for the video wall. Preferably, the rendered image data is converted by the display engine into a three-coordinate format wherein a first coordinate and a second coordinate are both colorimetric (chroma) and wherein a third coordinate is a luminance or a luma value. As a non-limiting example, the three-coordinate format is Yxy, wherein Y is a luminance coordinate and x and y are orthogonal colorimetric coordinates. Alternatively, a transformation (e.g., a gamma compression) is applied to Y to create luma Y′. In one embodiment, x and y, the colorimetric coordinates, are scaled to increase the range of useful coding values. In a non-limiting example, x-values are divided by 0.74, while y-values are divided by 0.84 to expand the range of x and y. Alternative three-coordinate formats include, but are not limited to, L*a*b*, ICtCp, YCbCr, YUV, Yu′v′, YPbPr, and/or YIQ. Cylindrical coordinate image data (e.g., L*C*h* and other polar transformations of rectangular color coordinate systems) is also compatible with the present invention. The metadata includes, but is not limited to, an image source, an image data format, a color space, a white value, a signal format, transport format data (e.g., standardized transport format data), a test protocol, and/or Session Description Protocol (SDP) parameters. In one embodiment, the image data includes at least one transfer function, e.g., an OETF, an EOTF, an OOTF, a gamma function. Alternatively, the at least one video wall controller is operable to apply at least one transfer function to the image data upon receiving it. In one embodiment, the video wall system is operable to maintain a 12-bit bit depth for the image data. Using the three-coordinate format wherein only the third coordinate is a luminance or a luma value enables subsampling, which results in a reduction in bits. Fewer bits per pixel are needed for the chroma coordinates since the human eye is less sensitive to changes in chroma than changes in luminance. The bit reduction is not possible in other three-coordinate systems such as XYZ wherein luminance is a component in each of the three coordinates X, Y, and Z. In one embodiment, the LUT is compressed. In another embodiment, the image data and/or the LUT are encrypted. In one embodiment, encryption includes at least one key.
The display data includes data used to display the rendered image data on the video wall. In one embodiment, the display data includes cropping and/or scaling data to describe which portion of the rendered image data is displayed on each display device. The at least one video wall controller is operable to split, crop, and/or scale an image for display on the video wall. Alternatively, the display data includes mapping data to map portions of the rendered image data to the plurality of display devices.
In one embodiment, the display data further includes calibration data, e.g., a test pattern, and/or timing data to synchronize the plurality of display devices. In one embodiment, the video wall system includes at least one sensor wherein the at least one sensor is operable to monitor the display of the video wall to ensure that the plurality of display devices is properly synchronized. The at least one sensor is operable to monitor optical data, e.g., at least one color, at least one color coordinate, a brightness, a white point, a color gamut, an image, external light levels (e.g., ambient light). In one embodiment, the video wall system is operable to use computer vision to verify that the image displayed on the video wall matches the image described by the image data. In another embodiment, the at least one sensor is operable to sense the external light levels and communicate with the at least one video wall controller to modify the image data and/or the display to compensate for the external light levels. The ambient lighting in a room is likely to change for different usages of the video wall. The video wall system is operable to adjust such that the intent and content of the image data is still displayed properly regardless of viewing conditions. Alternatively, the at least one sensor is operable to monitor electrical data, e.g., a voltage, a current, a resistance, a power. In yet another embodiment, the at least one sensor is a temperature sensor. Sensor data from the at least one sensor is then compared to expected sensor data to verify the video wall display as described in U.S. Pat. No. 9,307,616, which was filed May 15, 2015 and issued Apr. 5, 2016, and which is incorporated herein by reference in its entirety. In one embodiment, the sensor data is used to monitor aging of the display devices. In one embodiment, the at least one video wall controller adjusts the image signal data based on the sensor data. For example, the at least one video wall controller modifies the brightness of an image displayed by the video wall in order to compensate for nonuniform changes in brightness of each display device in the video wall over time. Alternatively, the at least one video wall controller sends an alert regarding performance of each of the display devices. For example, if one or more display devices in a video wall is out of specification, the at least one video wall controller sends an alert to a remote device (e.g., smartphone, computer).
In one embodiment, each of the plurality of display devices receives a different image data signal. For example, each display device only receives a portion of image data that it displays rather than a full set of image data. The portion of image data is dependent on the location of the display device. In one embodiment, each of the plurality of display devices is operable to modify, recreate, and/or transmit the image data signal. For example, in a daisy chain network, a first display device is operable to modify the image data signal to indicate that the first display device received the image data signal before transmitting the image data signal to a second display device. Alternatively, a display device is operable to recreate and/or transmit a portion of the image data signal. In one embodiment, the at least one video wall controller is operable to create a virtual representation of the full set of image data for display on the video wall as described in International Patent Publication WO2021/0181412, which was filed Oct. 26, 2020 and published Apr. 29, 2021, and which is incorporated herein by reference in its entirety. The virtual representation has a virtual resolution. In one embodiment, the virtual representation is dependent on physical characteristics and/or constraints of the video wall. Alternatively, the virtual representation is agnostic of the plurality of display devices of the video wall. The at least one video wall controller is then operable to partition the image data based on the virtual representation and send a portion of the image data to each display device of the plurality of display devices. In one embodiment, the at least one video wall controller is operable to upscale and/or downscale the portion of the image data to match the resolution of each display device of the plurality of display devices.
In one embodiment, the at least one video wall controller is a single-input, multiple-output (SIMO) controller. Alternatively, the at least one video wall controller is a multiple-input, multiple-output (MIMO) controller. For example, the at least one video wall controller is operable to accept image data as multi-primary data (e.g., RGBCMY data), wherein a first input includes a first portion of the multi-primary data (e.g., RGB data) and wherein a second input includes a second portion of the multi-primary data (e.g., CMY data) as described in System 2 transport of the present invention, and output the image data to a plurality of display devices. Alternatively, the at least one video wall controller is operable to receive multiple inputs from a plurality of image data sources to display on the plurality of display devices. The at least one video wall controller is operable to consolidate the inputs and/or combine the inputs into a single set of image data. The at least one video wall controller is then operable to display the single set of image data on the plurality of display devices. In one embodiment, the at least one video wall controller stretches the image data to fit onto the plurality of display devices. The input to the at least one video wall controller includes, but is not limited to, stored image data, live image data (e.g., streaming video), and/or image data from a web source. The at least one video wall controller is operable to change image sources in real time or near real time. In one embodiment, the video wall system includes a camera wherein the camera is operable to capture image data and wherein the video wall is operable to display the captured image data in real time or near real time.
In one embodiment, the at least one video wall controller is operable to use multi-stream transport (MST). MST is a standard transport format as described in DisplayPort Standard 1.2, which was published Jan. 5, 2010, and which is incorporated herein by reference in its entirety. MST includes multiplexing a plurality of image signals and sending a single image signal to a demultiplexer, wherein the demultiplexer is operable to separate the single image signal into the plurality of image signals. The demultiplexer is then operable to send each of the plurality of image signals to the display devices of the video wall. Display interfaces including, but not limited to, SDI, HDMI, Digital Visual Interface (DVI), DisplayPort (DP), Mobile High Definition Link (MHDL), and internet protocol (IP) interfaces (e.g., as described in SMPTE ST-2110, which was published beginning Nov. 27, 2017 and which is incorporated herein by reference in its entirety), are compatible with the present invention.
In one embodiment, the at least one video wall controller is a server-based video wall controller wherein the at least one server-based video wall controller is operable for network communication with the video card and/or the video wall. In one embodiment, the server-based video wall controller is operable to communicate with the video wall via at least one adapter wherein the at least one adapter is attached to the display devices of the video wall. In one embodiment, the at least one adapter includes software that enables the server-based video wall controller to interface with the display devices. In one embodiment, the at least one adapter is operable for wireless communication, e.g., via a mobile data network, via a local area network. Advantageously, a server-based video wall controller is easier to upgrade and/or modify and eliminates the need for specialized hardware to be installed in the video wall system. Additionally, it is easier to change out server-based video wall controllers in the event that a server-based video wall controller fails. A server-based video wall controller is also operable to reduce redundancy in rendering and/or splitting image data for video wall display. In one embodiment, the server-based video wall controller includes at least one memory map for displaying image data on the video wall. The at least one video wall controller is operable to parse, optimize, and/or scale the image data for each display device in real time or near real time.
In one embodiment, the server-based video wall controller is operable to control a plurality of video walls in different locations, wherein each video wall displays the same image or a different image. In one embodiment, the at least one video wall controller is operable to adjust the image signal data in real time or near real time to accommodate changes in the video wall, e.g., addition of display devices, removal of display devices, display device failures, color space changes. In one embodiment, the at least one video wall controller includes a user interface wherein the user interface is operable to accept user input to control the video wall. In one embodiment, the server-based video wall controller is stored on a cloud-based server. Alternatively, the server-based video wall controller is stored on an edge node. Physical servers and virtual servers are also compatible with the present invention.
In one embodiment, the at least one video wall controller is connected to at least one video extender wherein the at least one video extender is operable to transport image data from the video wall controller to the plurality of display devices. Each of the plurality of display devices is preferably connected to one or more of the at least one video extender, and the at least one video wall controller is operable to determine which portion of the image data to send to which of the at least one video extender. The at least one video extender is operable to be used in an arrangement wherein the at least one video wall controller is located separately from the plurality of display devices, e.g., in a server room. In one embodiment, the at least one video wall controller is wired to the at least one video extender. Alternatively, the at least one video extender and the at least one video wall controller are operable to use wireless communication, e.g., a local area network connection, to transport the image data. In one embodiment, a capture card is operable to record the image data displayed on the video wall. In a preferred embodiment, the capture card is separate from the at least one video wall controller so that the processing power used for capture does not interfere with display on the video wall.
In one embodiment, the display devices include at least one screen, including, but not limited to, LCD screens, LED screens (e.g., perovskite LED screens, nanorod screens, miniLED screens, microLED screens, OLED screens, active matrix OLED (AMOLED) screens), cathode ray tube (CRT) screens, QD screens, and/or projector screens. In one embodiment, the at least one screen includes tiles, monitors, and/or cubes. Non-flat displays (e.g., curved OLED displays, curved Alternatively, the at least one screen is a device including, but not limited to, a computer, a wearable, a mobile device, a smartphone, and/or a tablet. The video wall system is operable to combine display devices of different sizes and/or resolutions. Alternatively, each of the plurality of display devices is identical. If the at least one screen includes bezels, the at least one video wall controller is operable to adjust the image data signal to compensate for the bezels, e.g., by scaling the image data as if the bezels did not exist. Bezel compensation eliminates pixels in an image that would otherwise covered by the bezels in order to create a seamless image. The at least one video wall controller is also operable to remove bezel compensation such that all of the image data is displayed. The at least one video wall controller is operable to compensate for gaps between screens, as well as rectangular and non-rectangular arrangements of screens.
In one embodiment, the plurality of display devices includes a laser phosphor display. In one embodiment, the at least one video wall controller is operable to reallocate bit depth in a display device to enable rearrangement of subpixels. Reallocation of bit depth enables the transport and display of color data in an expanded color gamut (e.g., 6P-B, 6P-C, RGBCMY data). For example, subpixels in an 8K display are repurposed to display an image with 4K resolution but with an expanded color gamut. Alternative display devices of the video wall are described in U.S. Pat. No. 11,030,934, which was filed Oct. 1, 2020 and issued Jun. 8, 2021, and which is incorporated herein by reference in its entirety. In one embodiment, the plurality of display devices includes projectors. The at least one video wall controller is operable to blend the output (e.g., overlap, interpolate) from the projectors to create a seamless image. In one embodiment, the at least one video wall controller includes a synchronization unit wherein the synchronization unit is operable to send a synchronization signal to each of the display devices to ensure that each of the display devices is displaying the same set of image data at a moment in time. A synchronization unit is further detailed in U.S. Pat. No. 8,911,291, which was filed Nov. 26, 2012 and issued Dec. 16, 2014, and which is incorporated herein by reference in its entirety. Alternatively, the at least one video wall controller is operable to send a broadcast command wherein the broadcast command includes time delay data to synchronize the plurality of display devices as described in U.S. Pat. No. 10,079,963, which was filed May 12, 2017 and issued Sep. 18, 2019, and which is incorporated herein by reference in its entirety. In one embodiment, the synchronization signal is an analog signal, e.g., a black burst signal, a tri-level synchronization pulse. Black burst signals for television color standards are also compatible as synchronization signals. In another embodiment, the synchronization signal is a signal for clock synchronization, e.g., as described in Request for Comments (RFC) Network Time Protocol (NTP) v4, which was published in June 2010 and which is incorporated herein by reference, or as described in the Precision Time Protocol (PTP) of IEEE 802.1AS, which was published Mar. 30, 2011 and which is incorporated herein by reference in its entirety. Other synchronization signals sent over IP are also compatible with the present invention.
In one embodiment, the video wall includes at least one electromechanical element, e.g., a microelectromechanical system (MEMS). MEMS devices typically use electronic signals to drive mechanical processes. In one embodiment, the at least one electromechanical element includes at least one integrated circuit (IC), e.g., a microprocessor, a microcontroller. In one embodiment, one or more of the at least one electromechanical element includes at least one sensor. In one embodiment, the at least one electromechanical element is a moving stage including at least one display element (e.g., a light-emitting diode) as described in U.S. Pat. No. 10,754,092, which was filed Jun. 25, 2019 and issued Aug. 25, 2020, and which is incorporated herein by reference in its entirety. Light passing from the at least one display element through at least one lens in front of the moving stage depends on a position of the moving stage relative to the at least one lens. Thus, the moving stage enables the video wall to display multiple sets of image data. In one embodiment, each set of image data is only viewable from a different angle and/or position. In one embodiment, a movement of the moving stage is based on a set path, e.g., a path of a camera filming the video wall. The movement of the moving stage follows the path of the camera such that the camera captures image data that is only visible from positions along the path of the camera. Alternatively, the movement of the moving stage is based on at least one set of image data being displayed. In one embodiment, the moving stage requires real-time or near-real-time rendering of the image data. In one embodiment, the image data is rendered with an expanded color gamut (e.g., 6P-B, 6P-C) and/or at least four primary colors. Advantageously, the expanded color gamut and/or the at least four primary colors enable more color differentiation between pixels, which is helpful when displaying multiple sets of image data. In one embodiment, the use of more than three primaries (RGB) is operable to increase a maximum luminance of the video wall, thus enabling HDR reproduction of the image data.
Video Walls for Light Field Display
In one embodiment, the video wall is a light field display. Light field displays are operable to create a three-dimensional (3D) visualization without the use of a wearable (e.g., red-blue glasses) to consolidate stereoscopic images. A light field defines rays of light passing through a plane in space. By defining the light field at each point in a 3D viewing space and displaying the image data as projected through the light field, the light field display is operable to display the 3D visualization of the image data on a two-dimensional display. In one embodiment, the light field display includes a plurality of holographic elements that appear different from different viewing angles. In one embodiment, each holographic element includes a lens (e.g., a microlens) overlayed over a plurality of pixels. Only one of the plurality of pixels is visible through the lens at a time, and the visible pixel depends on the viewing angle. In one embodiment, the holographic element further includes a blocking element, e.g., a channel, to eliminate unwanted cross-talk of light between holographic elements and/or artifacts from neighboring holographic elements. The change in appearance of each holographic element means that the displayed image as a whole appears different depending on the viewing angle, thus mimicking a three-dimensional object that appears different from different angles. Alternatively, the holographic element includes at least one electromechanical element, e.g., a moving stage. In one embodiment, the at least one electromechanical element is operable to change the appearance of the holographic element to create the 3D visualization.
Video Walls for Virtual Production
In addition to being used as large-scale displays, video walls (e.g., LED walls, LED volumes) are also used in the entertainment industry to replace or supplement real-life set design. For example, a video wall is operable to be used as a green screen. Video walls are also used for virtual production, wherein captured image data is combined with computer-generated imagery (CGI) in real time or near real time. For example, the video wall displays a virtual set that would otherwise be added in post-production. Displaying the virtual set in real time on the video wall is preferable because it means that lighting and coloring of the set as a whole, including real-life people and objects, is more accurate. For example, if the virtual set displayed on the video wall includes bright lights, reflections of the bright lights will appear on people and objects in front of the video wall, making the virtual set seem more realistic. If the virtual set were not displayed in real time (e.g., the background was a green screen), the reflections would not appear and would have to be edited in later. Displaying image data on video walls for virtual production requires real-time rendering. For example, when a camera is filming, the virtual set needs to change as the camera moves to simulate a real, three-dimensional set as viewed from different angles. Additionally, real-time or near real-time color balancing and/or color grading is needed to accommodate demands of film production including combinations of image and/or visual information from the camera, the video wall, and surrounding lighting. Advantageously, the present invention is operable for real-time or near real-time color correction, including gamut adjustments and blending. Using a three-coordinate format wherein the first coordinate is a luma or a luminance and the second and third coordinates are chroma (e.g., Yxy) is advantageous for real-time processing and adjustment of the image data by enabling subsampling without loss of visual information. With a less efficient representation, changes to image data would have to be done in post-production.
The present invention is generally directed to comparing a viewer's sense of awe when observing footage on a 3P display and observing footage on a 4P display.
In one embodiment, the present invention includes a method for evoking an emotional sensation including: viewing footage on a four primary (4P) color display; viewing footage on a three primary (3P) color display; wherein the emotional sensation is the sensation of awe wherein the viewing of the footage on the 4P color display produces a greater sensation of awe than a sensation of awe produced by viewing the footage on the 3P color display; and wherein the sensation of awe is measured using an AWE Experience Scale (AWE-S).
In another embodiment, the present invention includes a system for evoking an emotional sensation including: a four primary (4P) color display operable to display footage with four primary colors being displayed to a viewer; a three primary (3P) color display operable to display footage with three primary colors being displayed to the viewer; wherein the emotional sensation is the sensation of awe; wherein the sensation of awe is measured using an AWE Experience Scale (AWE-S); and the viewer experiencing a greater sensation of awe from viewing footage of the 4P color display than from viewing footage of the 3P color display.
In yet another embodiment, the present invention includes a system for evoking an emotional sensation including: a four primary (4P) color display operable to display footage with four primary colors being displayed to a viewer; a three primary (3P) color display operable to display footage with three primary colors being displayed to the viewer; wherein the emotional sensation is the sensation of awe; wherein the sensation of awe is measured using an AWE Experience Scale (AWE-S); and the viewer experiencing a greater sensation of awe from viewing footage of the 4P color display than from viewing footage of the 3P color display; wherein the four primary colors are Red, Blue, Green, and Cyan; and wherein the three primary colors are Red, Blue, and Green.
None of the prior art discloses an increase in the sense of awe when viewing 4P footage than when viewing 3P footage.
The present invention involves a study, which is reported herein. The study investigated the possible affective and cognitive impact of presenting viewers with video footage processed for a prototype multi-primary (4P) display vs a traditional 3P, RGB display. Specifically, differences in feelings of awe were assessed via a within- and between-subjects experimental design. Participants viewed NASA footage of Earth from the ISS on 3P and 4P displays. Feelings of awe were assessed after each video presentation. Results indicated that the 4P footage inspired greater feelings of awe than the standard 3P footage, indicating that wide-gamut video on multi-primary displays may be more affectively powerful than traditional RGB video.
The notion that colors have an emotional and cognitive impact is commonly held. Various colors are often said to symbolize certain emotions or qualities, and color is a relevant factor in all visual arts. Moreover, the notion that colors have emotional and cognitive meaning has scientific merit. Color has significant meaning for and impact on individuals when used purposively in painting, fashion, interior design, marketing, and filmmaking. Likewise, in media theory, literacy, and production courses, color is considered an effective tool of symbolism and a means of eliciting emotional and even physiological responses. Indeed, many studies have found that people easily associate colors with moods, emotions, and certain values and qualities. What has also been established is that color codes, or ways of interpreting color associations are highly subjective. Several meanings can be ascribed to the same color, although color naming and categorization may be quite similar across cultures, based on, universalities of color naming and categorization.
Many cinematographers have used color to provide contextual cues for interpretation of symbolism or to evoke an emotional response. Once such cinematographer is Vittorio Storaro, ASC, AIC who grounds his use of color in film or digital cinema in the reasoning of the Greek philosophers, who believed that four basic elements create harmony in our lives: water (green), fire (red), earth (ochre), and air (blue). He believes that when our energy is balanced, these colors combine into a form of pure energy, which is white. In an American Cinematographer interview, Storaro stated: “Color is part of the language we speak with film. We use colors to articulate different feelings and moods. It is just like using light and darkness to symbolize the conflict between life and death. I believe the meanings of different colors are universal, but people in different cultures can interpret them in different ways.”
The present invention includes a study that does not intend to investigate how viewers interpret various colors in images, although it is certainly related to that. Rather, the study of present invention investigates the possible affective and cognitive impact of presenting viewers with a wider gamut (range or envelope) of color, which represents a much greater percentage of human color perception than do the current video or digital cinema standards. The present invention proposes to accomplish this by comparing a standard 3-primary RGB (red, green, and blue) display to a 4-primary RGBC (red, green, blue, and cyan) display. For the sake of simplicity, hereinafter the 3-primary RGB display are referred to as “3P,” and the 4-primary RGBC display are referred to as “4P.”
The study associated with the present invention compares the emotional impact on a viewer after watching footage on a 4P display and watching footage on a 3P display. In order to achieve the step of displaying footage to the view, footage that includes four primary colors, the present invention is operable to utilize the systems, methods, and apparatus described in the following documents: U.S. Pat. Nos. 11,651,717, 11,043,157, 11,587,490, 11,495,160, 11,532,261, 11,699,376, 10,607,527, 11,574,580, 11,721,266, 11,482,153, 11,501,419, 11,694,592, 11,682,333, 11,373,575, 11,587,491, 11,189,214, 11,495,161, 11,289,003, 11,315,467, 11,651,718, 11,600,214, 11,189,211, 10,997,896, 11,341,890, 11,631,358, 11,315,466, 11,557,243, 11,488,510, all of which are hereby incorporated by reference in their entirety. In one embodiment, the present application describes use cases for the methods, systems, and apparatuses in the above referenced documents. In one embodiment, the present application describes improvements for the methods, systems, and apparatuses described in the above referenced documents. In one embodiment, the 4P display or the “prototype 4P display,” referenced throughout the present disclosure, is or is enabled by the methods, systems, and/or apparatuses described in the above referenced documents.
The prototype 4P display adds cyan alongside the standard red, green, and blue primaries. In one embodiment, the prototype 4P display is the method, system, and/or apparatuses described in the above referenced documents. Research indicates that certain color receptors in the human visual system are especially sensitive to hues of blue and cyan. As reported in the early 2000s, not only do human eyes sense light using rods and cones, but light is also perceived by what are known as intrinsically photosensitive retinal ganglion cells (ipRGCs), first identified as photoreceptors in mammalian eyes by Berson, Hattar, and colleagues. The operative substance responsible for this photosensitivity is known as melanopsin, a photopigment that has a peak sensitivity of 484 nm. That particular wavelength is solidly in the range identified as cyan according to several sources not specifically mentioned herein. Furthermore, melanopsin is thought to be closely linked to the circadian clock of mammals, helping to govern sleep patterns. In fact, melanopsin excitation has been linked to melatonin suppression in mammals, with the effect being a circadian phase shift. What this may mean for the present invention, then, is that humans are physiologically tuned to respond to cyan wavelengths; such wavelengths may be particularly attractive and stimulating. As such, a 4P display that incorporates cyan is likely to be more pleasing and possibly more emotion-inducing than a 3P system.
Eudaimonic Response to Video Content
We know that viewers respond emotionally to various stimuli delivered via video. Films and television programs, both fictional narratives and nonfiction, have inspired emotional responses since their inception. From the 1960s until the 2000s, the main focus of research on emotional responses to media focused on hedonic enjoyment; the visceral, spontaneous responses of pleasure, humor, suspense, and fear. But scholars then began to recognize that enjoyment encompassed more than the hedonic. Eudaimonic responses, they realized, were an important part of the process. Appreciation, they found, was another dimension of enjoyment of media content; encompassing emotions such as elevation, awe, inspiration, transcendence, and tenderness.
Of all of these, the emotional response of awe seems most relevant for the present invention. We arrive at this conclusion by considering the capabilities of the 4P display system as well as the stimulus material available. The 4P system expands the color gamut toward the added cyan primary, thus the experiment primarily required visuals containing more shades of cyan than current displays reproduce. It is known that viewing vast landscapes, including the Earth from space, often inspires strong emotions, including that of awe. For this reason, the selected stimulus material is NASA video of the planet Earth as seen from the International Space Station (ISS). Some of the cameras NASA uses on the ISS capture more color than current displays can use, and ordinarily these colors are just lost (or “clipped”) in post-production. For the purposes of this study, NASA provided footage in the camera RAW format, which includes all the color; thus, we were able to retain and process more cyan for the 4P system, resulting in views of Earth never before seen on a display.
Awe
Awe is defined as having two affective components: that of a sense of vastness, and a sense of the need for accommodation. For further explication, we turn to Keltner and Haidt, who unpacked exactly what is meant by awe in light of these components. They propose a prototypical conceptualization of awe, meaning that all experiences of awe must involve as sense of vastness and a need for accommodation, but there can be other components that color the experience.
First, awe involves a sense of vastness. This can certainly mean vastness in the sense of sheer physical size. The view from a mountain top can be awe-inspiring. Looking across a vast ocean, or the view of a landscape from an airplane or hot air balloon can also produce awe. Looking up at a grand statue, or an enormous factory floor such as the Boeing plant in Washington state might be awe inspiring. But this sense of vastness can also be felt in the power of figurative vastness, such as the presence of great importance, significance, or power, i.e., “social size.” A political leader can inspire awe because of his or her power and authority; a billionaire can likewise inspire awe because of their influence in economic and social situations. As Yaden and colleagues put it: “vastness can be either perceptual (e.g., seeing the Grand Canyon) or conceptual (e.g., contemplating eternity).” Also necessary to the feeling of awe is the need for accommodation. Accommodation refers to the inability of the mind to fully grasp the experience at hand, forcing the adjustment of mental schema in order to understand what has been experienced. “ . . . Prototypical awe involves a challenge to or negation of mental structures when they fail to make sense of an experience of something vast.”
Considering this, the study involved with the present invention assumes that awe is the proper response to expect upon exposure to the NASA test footage. The view of the Earth from space offers the experience of something both vast in that the Earth is huge relative to human beings; and it contains the entire human race. It is therefore both physically vast and conceptually vast; it has meaning and significance beyond ourselves. The view of Earth from space also requires accommodation; because it is something that few have often seen, and almost no one has ever seen for themselves. Therefore, mental structures must be adjusted to understand the scope and significance of such a vista.
Referring now to the drawings in general, the illustrations are for the purpose of describing one or more preferred embodiments of the invention and are not intended to limit the invention thereto.
Objectives
The aim of the study associated with present invention is to test the assumption that improving the visible gamut of colors, utilizing the methods, systems, and apparatus previously incorporated by reference, in electronic imaging will result in material that is more effective at eliciting emotional responses from audiences. This first test of the idea must necessarily be narrowly focused, both because of the experimental nature of this nascent field, and because of the limited stimulus material available. For that reason, the present invention focuses on the most likely emotional response to the type of footage we possess. Therefore, the present invention propose that: (1) NASA footage of the Earth as seen from the ISS will produce a sense of awe in viewers (referred to as “hypothesis one,” “hypothesis 1,” and “H1”); (2) NASA footage of the Earth as seen from the ISS processed for and shown on the 4P display will produce a greater sense of awe in viewers than the same footage processed for and displayed on the 3P display (referred to as “hypothesis two,” “hypothesis 2,” and “H2”).
Study Design
The present invention involves a study that was designed as a two condition quasi-experiment with a repeated measures crossover design. A prototype 4P (red, green, blue, and cyan) display was compared to a standard 3P (red, green, and blue) display. Specifically, the 4P display presents a color gamut volume that is 135.9 percent of the 3P color gamut volume (ITU BT.709).
Participants signed up for a research session which had four available seats. A condition was then randomly assigned to that session (and thereby the participants of that session) to determine the order of the stimulus videos. In the 3P condition, the 3P version of the video was shown first, the awe assessment was given, then the 4P video was shown, the awe assessment was given, and then the videos were shown side-by-side and the final awe assessment was given. The same procedure was followed for the 4P condition, except that the order of the first two videos was reversed (4P, 3P, side-by-side). This was done so that order effects could be taken into account, and so that some participants' reactions to the 3P video could be captured without their knowledge of the attributes of the 4P video. In the final awe assessment, participants were asked to answer about the 4P video as directly compared to the 3P video.
Participants
The initial sample of participants for this study, which the present invention pertains to, consisted of 134 university students recruited from various courses at a mid-sized research university in the south-central United States. Students were recruited via learning management system (LMS, i.e., Canvas) message and email. Participants were offered extra credit in one or more of their courses as one option among several equally valuable opportunities. They signed up for scheduled sessions using a GOOGLE form. A maximum of four participants were allowed per research session. Fourteen participant cases could not be used due to incomplete responses/missing data. Therefore, the final sample included in the analysis was 120 participants. Participants ranged in age from 18-23 years, with an average age of 19.9. The sample consisted of 61 males (approximately 51%) and 59 females (49%), and was 80% White, 9% Asian, 5% Black, and about 6% other ethnicities. All appropriate International Review Board (IRB) approved consent procedures were followed prior to initiation of any stimulus exposure.
Measuring Awe
Awe was measured using the recently developed Awe Experience Scale or the AWE-S developed by Kaufman D. B. Yaden and colleagues in their 2019 journal for The Journal of Positive Psychology titled “The development of the Awe Experience Scale (AWE-S): a multifactorial measure for a complex emotion found at http://doi.org/10.1080/17439760.2018.1484940 which is incorporated herein by reference in its entirety. The AWE-S measures awe by considering 6 factors including: altered time perception, self-diminishment, connectedness, perceived vastness, physical sensation, and need for accommodation. Each factor of the AWE-S is significantly correlated with the awe items of the modified Differential Emotions Scale (mDES) and Dispositional Positive Emotion Scale (D-PES). The awe sensation is measure by asking participants a serious of questions related to the factors, scaling their response on a scale of one to seven, and averaging the score for each participant.
The complete AWE-S is a 30-item scale that measure 6 factors that comprise the larger variable. The items are measured on a 7-point Likert-type agree-disagree scale. The six factors that make up awe are Time, Self-loss, Connectedness, Vastness, Physiological, and Accommodation. Some example questions include: “I felt that my sense of self was diminished”, “I experienced a sense of oneness with all things”, “I perceived vastness”, “I felt my jaw drop”, and “I tried to understand the magnitude of what I was experiencing.” For the sake of parsimony and time management, this study incorporated only three of the six factors; the three most salient to the type of footage being utilized in the study, namely the Vastness, Need for Accommodation, and Physiological factors. Those three are the attributes most closely associated with original definitions of awe, and seemed to be the most relevant factors to assess in the limited lab time available. For that reason, it should be acknowledged that while we are studying awe as an emotional response, we are actually measuring three of the aspects of awe found by Kaufman D. B. Yaden and colleagues in their 2019 journal.
Stimulus Material and Systems
The stimulus material of the present invention was approximately two minutes and twenty seconds of video footage provide by NASA to the 6P Color Project based at the Baylor Research and Innovation Collaborative (BRIC) in Waco, Texas. The video shows the landscape of the Earth as seen from the vantage point of the International Space Station in orbit above it. It also has several shots of the space station itself, and an astronaut performing spacewalk maneuvers. The 4P footage was processed via the methods, systems, and apparatus described in the documents incorporated by reference above (sometimes referred to as the “algorithms,” the “4P algorithm,” or the “propriety algorithm”) (i.e., a proprietary algorithm in a professional color correction software program).
The stimulus material was processed from original material shot by the astronauts during their space missions. This was raw data that was captured by a cinema-grade camera and includes colors that are outside of the standard gamut (ITU BT.709). The images were processed for lift (high-tones), gamma (mid-tones), and gain (dark-tones) with the tonal qualities being matched between the 4P and the 3P versions. No color adjustments were made. Using the raw footage allows for the additional cyan color to be extracted and processed though our 4P algorithm; making sure that for each comparison (4P and 3P) that the white point was the same. The display on the screen was actually a combination of two projectors for the 4P condition, one which was set to project red, green and blue with cyan subtracted from the image, and an additional projector outputting cyan exclusively. The 3P footage was processed without the proprietary algorithm just as any normal footage would be; and was projected via an unaltered projector of the same model as the two used to produce the 4P image. Each participant saw the video three times; they saw the 4P video once, the 3P video once (in a randomized order) and they saw both videos simultaneously side-by-side.
Results
Hypothesis 1 was a check that the footage was producing the expected sense of awe for all participants across the board. It predicted that viewers of the footage described would feel a sense of awe in response to seeing it. Since awe was measured on a 7-point scale coded from 1 for strong disagreement with the idea the item asks about and 7 for strong agreement with that item, any average value above 3.5 would indicate that viewers felt a sense of awe. To determine this, the measure of awe assessed after viewing the first video in each condition was considered. A one-sample t-test was performed with the test value set at the scale mean of 3.5. The t-test was significant, with an average awe score significantly greater than 3.5 (M=4.76, SD=1.39), t=9.95, P<0.001, Cohen's d=0.908. This result indicates support for H1; on average, participants felt a sense of awe in response to the NASA footage of Earth and the ISS from space. Only 22 of our 120 participants (18.3%) reported a sense of awe at 3.5 or lower on the awe scale in response to the first video in the sequence (either 3P or 4P with no knowledge of the other condition).
Hypothesis 2 predicted that the 4P footage would be more awe-inducing than the standard 3P footage. This hypothesis was tested using a repeated-measures analysis of covariance (RM ANCOVA) procedure with gender entered as a covariate. This test looks for both within-subjects effects as participants compare the 4P and 3P videos, and between subjects effects to consider the possible order effects that may be present due to the sequence of those videos. The first test to consider is Box's test of Equality of Covariance Matrices. This is a test of the assumption that observed covariance matrices are equal across groups. While this type of test can report different outcomes, using an ANCOVA analysis is typically robust enough to identify any violations. In this study, Box's test did indicate a violation of the assumption, Box's M=16.89, F=2.74, p=0.012. The result of this is that Pillai's Trace statistic should be considered in the ensuing multivariate tests as opposed to the more typical Wilks' Lambda.
Results of the multivariate test indicated that there was no significant overall effect for awe, meaning that awe did not vary significantly across participants or conditions, on average. However, since awe was assessed three times for each participant, and each participant saw the same stimulus materials (albeit not in the same order), this result means very little. The same multivariate test indicated that gender was not a significant covariate in the model (i.e., the interaction effect of gender and awe was not significant). But the test did indicate a significant interaction between awe and condition, with a η 2 part of 0.16, indicating that 16% of the variance in awe could be attributed to the condition the participant viewed. Statistical results are reported in Table 1.
The next important part of the analysis is Mauchly's Test of Sphericity, which tests the assumption that the error covariance matrix of the dependent variables is proportional to an identity matrix. In this study, Mauchly's test indicated a violation of the assumption of sphericity Mauchly's W=0.915, χ2=10.28, p=0.006, εGreenhouse-Geisser=0.922, εHuynh-Feldt=0.952. Given these values of Epsilon, we will consider Huynh-Feldt corrections in following tests, as is appropriate when εGreenhouse-Geisser is greater than 0.750. As such, the test of within-subjects effects indicated what the multivariate test first pointed to, that there is a significant interaction effect of awe with condition, Huynh-Feldt F(1.90, 222.74)=13.52, p<0.001, η 2 part=0.104. This result indicates that there were within-subjects differences based on the condition (i.e., the viewing order) to which participants were exposed.
According to these results, those who saw the 3P video first rated it as more awe-inspiring (MAwe=4.55) than the 4P video (MAwe=4.39). However, when they were asked to re-evaluate the 4P video when shown side-by-side with the 3P video, Awe increased significantly (MAwe=4.78). For those who saw the 4P video first, awe was quite high (MAwe=4.95) as compared to the 3P video that they saw next (MAwe=3.92). Awe remained high when the side-by-side videos were shown (MAwe=4.97). Note that the mean values shown in
The next set of analyses looks at between-subjects effects only, comparing the average awe scores across groups. Levene's test of equality of error variances indicated that the assumption was satisfied for awe responses after all videos (see Table 2).
Since the assumption of equality of error variances was satisfied, Wlks' Lambda F test was considered. That test indicated significant differences in awe responses between the three videos, Wilks' λ=0.812, F(2, 116)=13.45, p<0.001, η2part=0.188. Pairwise comparisons confirmed that awe responses to the 3P video were, on average, significantly lower than awe responses to the 4P video and the 4P video when shown alongside the 3P video. The two instances of the 4P video were not significantly different (see Table 3)
As was indicated in the results for H2, the true independent comparison of awe scores after the first video (3P vs 4P) indicated marginally significant differences in awe between the two conditions (see Table 1).
An ANOVA procedure using only the awe scores recorded after the first video was performed to isolate these results. The test shows that the difference in awe in response to the 3P video (M=4.571, SD=1.46) and the 4P video (M=4.98, SD=1.29) did not reach statistical significance, F(1, 119)=2.73, p=0.101, η2part=0.022. This indicates that, without previous knowledge of the depth of the color gamut they were seeing as compared to the other condition, our participants did not differ significantly in their awe reactions. However, it should be noted that the difference between MAwe3P=4.57 and MAwe4P=4.98 was marginally significant and may have reached significance with a larger independent sample.
Not that the means in this test are different than the means produced by the multivariate test. This is because this test is comparing only the MRGB and M4p scores participants gave after seeing the first video according to condition. In the multivariate test, MRGB and M4p are grand means produced by combining awe scores from the first and second videos. This means that in the test represented in Table 3, average awe scores were produced by combining scores from those who saw the RGB video first with those who saw the RGB video second, and so forth for the 4P video.
At the outset of the discussion of these results, at least two important acknowledgements of the study's limitations must be made. First, this study was done as a quasi-experiment with condition (the order the videos were presented) being randomly assigned to groups of participants instead of individuals. This was done to maximize the number of participants we could accommodate in the study. So then, experimental reliability was sacrificed somewhat for convenience and a larger sample. The second limitation to note is that all participants saw all videos. Therefore, some of the results are based on the comparison of mean values that came from the same participants, i.e., the same people who evaluated and reacted to the 3P video also reacted to the 4P video. The most notable effect of this is that our findings, at least when making comparisons of the mean awe scores between videos, can be seen as more conservative than results from a true independent sample. Participants likely suffered from habituation effects such that the strongest reactions would have been experienced toward the first video seen, and the second video, no matter what the condition, would have seemed less impressive or awe inspiring. However, we do not feel this limitation represents a weakness in the study; there were distinct advantages to using a dependent sample and a repeated measures design, such as our ability to test more subjects, and the fact that we were able to record 4P's impact on awe for those who also saw the 3P footage and understood the difference.
The findings show that participants who either 1) saw the 4P footage before the 3P footage or 2) saw the 4P footage beside the 3P footage were more awestruck by the 4P footage. Seeing the footage in 4P prompted a clear emotional reaction. We can say then, with some confidence, that the 4P NASA footage was more awe-inducing than the 3P version of the same footage. However, it seems that in the absence of knowledge about 4P, participants were still awestruck by the footage, even when processed with 3P only. This was not unexpected, because we purposefully chose footage that would spark an emotional response, and our tests illustrate that enhancing the color gamut positively impacts that emotional response when participants compare the two versions of the footage. On average, then, participants felt significantly more awe when viewing the 4P video than when viewing the 3P video (see Table 3).
This research has demonstrated that simply adding cyan as a primary color alongside red, green and blue makes a significant difference in affective reactions to certain footage. In particular, the footage used in this study inspired the emotional response of awe. Awe is considered a eudaimonic response because it is laden with meaning. Specifically, awe includes the sense of vastness, and a need for accommodation. In other words, awe-inspiring material is that which overwhelms us and makes us realize how very small we are in relation to nature or a feat of engineering and requires us to rethink our position in the universe in order to accommodate the greatness and majesty of what we are experiencing. Of course, awe is not the only affective response possible when presented with audiovisual material. Future research will utilize various stimulus material that will surely inspire other hedonic and eudaimonic responses. In some cases, color may have little to do with how those feelings are manipulated, but in others, it surely will, just as it seems to with awe.
It is a hope that this research is a first step in demonstrating that certain technological advances, such as those being made by 6P Color, Inc. are now able to address a concern that industry professionals have had for years. In a recent article in British Cinematographer, renowned Director of Photography, Steven Poster. ASC states: “There is a clear connection between the color of the images we use to the emotional qualities of the story. Early in preparation, along with the production designer and the director, a palette is determined. For instance, the connection between blue light and human emotion has been demonstrated in Japanese and British subways by placing blue LED panels at either end of stations. This has shown the correlation of significantly reducing suicides in these locations. There are many other examples of the color of light effecting human emotion . . . With our limited ability to reproduce specific colors and longer gradations of colors, I often find myself in post reaching for a specific tonal range for emotional effect.”
Future Research
All of this development and human testing is for the purpose of bringing modern video production closer to what the human eye actually sees. As such, this is a field ripe for further study. Astronauts themselves have noted that current display technology is unable to capture the majesty and grandeur of the Earth from space. Astronaut Reid Wiseman worried: “I had looked at pictures on social media and pictures in NASA archives of the earth so many times, I actually started to get worried, what if I get up there and it's just like the pictures?”, but then later noted: “ . . . The scale of looking at a sun refracting through the atmosphere, it blew me away, and no picture capturing it. There's no high enough dynamic range of a photo to capture what the human can see.” In describing the view, astronaut Dr. Tracy Caldwell Dyson said: “There's no words. There's no picture I could take, to do it justice. There's no watercolors that I could put on paper, to come close to the vividness, the ever-changing picture that I see, staring at this planet. Everything from the colors to what changes the atmosphere goes through, depending on where the sun angle is and whether the moon's in the view or not to just how fast we're going over the surface of it, and the way shadows are changing. You could see the same landmass over a period of two weeks, and it looks completely different, because of that. To then looking at the stars and the blackness, a black that nothing here on earth ever can replicate.”
Multi-primary display technology, even now in the current iteration that only adds cyan as a fourth primary, moves us closer to replicating on video what the human eye is capable of seeing. Anecdotal evidence seems to indicate that even this small step can make a big impact. Industry professionals who have visited the 6P Color lab at Baylor University for 3P to 4P demonstrations note: “An amazing new step in taking media to the next, and more realistic, level.”, “An immediate material difference in color clarity, quality, depth and fidelity. Terrific,” “The three-dimensional views of the water were most impressive. I didn't know how much I was really missing until I saw the side-by-side comparison”, “Loved the depths of the ocean colors”, “It made me feel like I was in space”, “Like I was part of the picture, not just watching it.”, “More natural to look at. The 3P side really did not capture my attention as the enhanced picture was far more compelling.”
We have yet, however, to show the technology to astronauts who have been to space themselves. Although it would be a more focused and qualitative study, there would be great value in such an effort to validate the reliability of the 4P (and later versions that add more primaries) to accurately represent the range and depth of color that the human eye can see. As work on the development of this technology progresses, we plan to study each iteration for its affective and cognitive impact and significance. An LED video wall capable of displaying 4P color processed footage is currently being designed and assembled. The next step in this line of research, then, is to perform independent tests of participants who view footage on this new display. Future work will also consider footage that incorporates multiple primaries, even beyond the 6P color that was first conceptualized by the founders of 6P Color, Inc.
With such possibilities on the horizon, future research will explore the relationship between emotional and psychological effects as these relate to color sensation. We hope this can help make further improvements in entertainment, advertising and medical applications.
The server 850 is constructed, configured, and coupled to enable communication over a network 810 with a plurality of computing devices 820, 830, 840. The server 850 includes a processing unit 851 with an operating system 852. The operating system 852 enables the server 850 to communicate through network 810 with the remote, distributed user devices. Database 870 may house an operating system 872, memory 874, and programs 876.
In one embodiment of the invention, the system 800 includes a network 810 for distributed communication via a wireless communication antenna 812 and processing by at least one mobile communication computing device 830. Alternatively, wireless and wired communication and connectivity between devices and components described herein include wireless network communication such as WI-FI, WORLDWIDE INTEROPERABILITY FOR MICROWAVE ACCESS (WIMAX), Radio Frequency (RF) communication including RF identification (RFD)), NEAR FIELD COMMUNICATION (NFC), BLUETOOTH including BLUETOOTH LOW ENERGY (BLE), ZIGBEE, Infrared (IR) communication, cellular communication, satellite communication, Universal Serial Bus (USB), Ethernet communications, communication via fiber-optic cables, coaxial cables, twisted pair cables, and/or any other type of wireless or wired communication. In another embodiment of the invention, the system 800 is a virtualized computing system capable of executing any or all aspects of software and/or application components presented herein on the computing devices 820, 830, 840. In certain aspects, the computer system 800 may be implemented using hardware or a combination of software and hardware, either in a dedicated computing device, or integrated into another entity, or distributed across multiple entities or computing devices.
By way of example, and not limitation, the computing devices 820, 830, 840 are intended to represent various forms of electronic devices including at least a processor and a memory, such as a server, blade server, mainframe, mobile phone, personal digital assistant (PDA), smartphone, desktop computer, notebook computer, tablet computer, workstation, laptop, and other similar computing devices. The components shown here, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the invention described and/or claimed in the present application.
In one embodiment, the computing device 820 includes components such as a processor 860, a system memory 862 having a random access memory (RAM) 864 and a read-only memory (ROM) 866, and a system bus 868 that couples the memory 862 to the processor 860. In another embodiment, the computing device 830 may additionally include components such as a storage device 890 for storing the operating system 892 and one or more application programs 894, a network interface unit 896, and/or an input/output controller 898. Each of the components may be coupled to each other through at least one bus 868. The input/output controller 898 may receive and process input from, or provide output to, a number of other devices 899, including, but not limited to, alphanumeric input devices, mice, electronic styluses, display units, touch screens, signal generation devices (e.g., speakers), or printers.
By way of example, and not limitation, the processor 860 may be a general-purpose microprocessor (e.g., a central processing unit (CPU)), a graphics processing unit (GPU), a microcontroller, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Programmable Logic Device (PLD), a controller, a state machine, gated or transistor logic, discrete hardware components, or any other suitable entity or combinations thereof that can perform calculations, process instructions for execution, and/or other manipulations of information.
In another implementation, shown as 840 in
Also, multiple computing devices may be connected, with each device providing portions of the necessary operations (e.g., a server bank, a group of blade servers, or a multi-processor system). Alternatively, some steps or methods may be performed by circuitry that is specific to a given function.
According to various embodiments, the computer system 800 may operate in a networked environment using logical connections to local and/or remote computing devices 820, 830, 840 through a network 810. A computing device 830 may connect to a network 810 through a network interface unit 896 connected to a bus 868. Computing devices may communicate communication media through wired networks, direct-wired connections or wirelessly, such as acoustic, RF, or infrared, through an antenna 897 in communication with the network antenna 812 and the network interface unit 896, which may include digital signal processing circuitry when necessary. The network interface unit 896 may provide for communications under various modes or protocols.
In one or more exemplary aspects, the instructions may be implemented in hardware, software, firmware, or any combinations thereof. A computer readable medium may provide volatile or non-volatile storage for one or more sets of instructions, such as operating systems, data structures, program modules, applications, or other data embodying any one or more of the methodologies or functions described herein. The computer readable medium may include the memory 862, the processor 860, and/or the storage media 890 and may be a single medium or multiple media (e.g., a centralized or distributed computer system) that store the one or more sets of instructions 900. Non-transitory computer readable media includes all computer readable media, with the sole exception being a transitory, propagating signal per se. The instructions 900 may further be transmitted or received over the network 810 via the network interface unit 896 as communication media, which may include a modulated data signal such as a carrier wave or other transport mechanism and includes any deliver media. The term “modulated data signal” means a signal that has one or more of its characteristics changed or set in a manner as to encode information in the signal.
Storage devices 890 and memory 862 include, but are not limited to, volatile and non-volatile media such as cache, RAM, ROM, EPROM, EEPROM, FLASH memory, or other solid state memory technology, discs (e.g., digital versatile discs (DVD), HD-DVD, BLU-RAY, compact disc (CD), or CD-ROM) or other optical storage; magnetic cassettes, magnetic tape, magnetic disk storage, floppy disks, or other magnetic storage devices; or any other medium that can be used to store the computer readable instructions and which can be accessed by the computer system 800.
In one embodiment, the computer system 800 is within a cloud-based network. In one embodiment, the server 850 is a designated physical server for distributed computing devices 820, 830, and 840. In one embodiment, the server 850 is a cloud-based server platform. In one embodiment, the cloud-based server platform hosts serverless functions for distributed computing devices 820, 830, and 840.
In another embodiment, the computer system 800 is within an edge computing network. The server 850 is an edge server, and the database 870 is an edge database. The edge server 850 and the edge database 870 are part of an edge computing platform. In one embodiment, the edge server 850 and the edge database 870 are designated to distributed computing devices 820, 830, and 840. In one embodiment, the edge server 850 and the edge database 870 are not designated for computing devices 820, 830, and 840. The distributed computing devices 820, 830, and 840 are connected to an edge server in the edge computing network based on proximity, availability, latency, bandwidth, and/or other factors.
It is also contemplated that the computer system 800 may not include all of the components shown in
The above-mentioned examples are provided to serve the purpose of clarifying the aspects of the invention, and it will be apparent to one skilled in the art that they do not serve to limit the scope of the invention. By nature, this invention is highly adjustable, customizable and adaptable. The above-mentioned examples are just some of the many configurations that the mentioned components can take on. All modifications and improvements have been deleted herein for the sake of conciseness and readability but are properly within the scope of the present invention.
This application is a continuation-in-part of U.S. application Ser. No. 18/134,884, filed Apr. 14, 2023, which is a continuation of U.S. application Ser. No. 17/965,410, filed Oct. 13, 2022, which is a continuation of U.S. application Ser. No. 17/670,112, filed Feb. 11, 2022, which is a continuation-in-part of U.S. application Ser. No. 17/516,143, filed Nov. 1, 2021, which is a continuation-in-part of U.S. application Ser. No. 17/338,357, filed Jun. 3, 2021, which is a continuation-in-part of U.S. application Ser. No. 17/225,734, filed Apr. 8, 2021, which is a continuation-in-part of U.S. application Ser. No. 17/076,383, filed Oct. 21, 2020, which is a continuation-in-part of U.S. application Ser. No. 17/009,408, filed Sep. 1, 2020, which is a continuation-in-part of U.S. application Ser. No. 16/887,807, filed May 29, 2020, which is a continuation-in-part of U.S. application Ser. No. 16/860,769, filed Apr. 28, 2020, which is a continuation-in-part of U.S. application Ser. No. 16/853,203, filed Apr. 20, 2020, which is a continuation-in-part of U.S. patent application Ser. No. 16/831,157, filed Mar. 26, 2020, which is a continuation of U.S. patent application Ser. No. 16/659,307, filed Oct. 21, 2019, now U.S. Pat. No. 10,607,527, which is related to and claims priority from U.S. Provisional Patent Application No. 62/876,878, filed Jul. 22, 2019, U.S. Provisional Patent Application No. 62/847,630, filed May 14, 2019, U.S. Provisional Patent Application No. 62/805,705, filed Feb. 14, 2019, and U.S. Provisional Patent Application No. 62/750,673, filed Oct. 25, 2018, each of which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
62876878 | Jul 2019 | US | |
62847630 | May 2019 | US | |
62805705 | Feb 2019 | US | |
62750673 | Oct 2018 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 17965410 | Oct 2022 | US |
Child | 18134884 | US | |
Parent | 17670112 | Feb 2022 | US |
Child | 17965410 | US | |
Parent | 16659307 | Oct 2019 | US |
Child | 16831157 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 18134884 | Apr 2023 | US |
Child | 18448697 | US | |
Parent | 17516143 | Nov 2021 | US |
Child | 17670112 | US | |
Parent | 17338357 | Jun 2021 | US |
Child | 17516143 | US | |
Parent | 17225734 | Apr 2021 | US |
Child | 17338357 | US | |
Parent | 17076383 | Oct 2020 | US |
Child | 17225734 | US | |
Parent | 17009408 | Sep 2020 | US |
Child | 17076383 | US | |
Parent | 16887807 | May 2020 | US |
Child | 17009408 | US | |
Parent | 16860769 | Apr 2020 | US |
Child | 16887807 | US | |
Parent | 16853203 | Apr 2020 | US |
Child | 16860769 | US | |
Parent | 16831157 | Mar 2020 | US |
Child | 16853203 | US |