This application claims the benefit, under 35 U.S.C. §365 of International Application PCT/US05/034891 filed Sep. 29, 2005 which was published in accordance with PCT Article 21(2) on Apr. 13, 2006 in English.
The present invention generally relates to the field of video processing, and more specifically the color correction of video segments.
In the field of post production, as used in the movie or television industries, a person operating a video editor may manipulate the colors of video content in order to allow for a specific effect desired by a director for artistic purposes. For example, video content may be modified by brightening the colors of the video content to portray a “pastel” effect or a darkening of hues to portray a “gritty” color effect.
When the colors of video content are modified, it may be difficult for such changes to be retained from the editing device to the device that will be used to display device content such as a video monitor or a film projector. Additionally, if video content is subjected to multiple devices in an editing workflow, such devices may not have the means of communicating with each other to effectively indicate what changes were made to the colors of video content as different devices have different capabilities of handing color correction.
The problems stated above, as well as other related problems of the prior art, are solved by the present invention.
According to an aspect of the present invention, a method is related for transmitting and using metadata that relates the order of color correction operations to be performed on image data representing a sequence of images and the number of such operations to be performed on such image data.
These and other aspects, features and advantages of the present invention will become apparent from the following detailed description of preferred embodiments, which is to be read in connection with the accompanying drawings.
An exemplary embodiment of the present invention is directed towards a video acquisition/editing workflow that is used to create, edit, and display video content. As shown in
Acquisition device 105 may store acquired visual information on a storage medium such as film, computer memory, flash cards, hard drives, Digital Versatile Disc (DVD), computer discs, digital storage device, archival storage tape, and the like. When such visual information is typically stored in a digital form, the acquisition device subjects the stored visual information to a transformation known as a color space that defines the color attributes of the stored video information. Different types of color spaces are known in the art such as defined by the International Commission on Illumination (CIE 1931L*a*b*), the International Telecommunication Union (ITU REC 601), Red Green Blue (RGB), and the like. These standards, among others in the art, define the color attributes of visual information when stored and such visual information is reproduced (in the form of video or film).
Visual information from acquisition device 105 is meant to be modified by editing device 110 which may take the form of a non-linear editor, computer, and the like. In a preferred embodiment of the present invention, editing device 110 is used for color management where visual information stored from acquisition device 105 is retrieved (scanned, read from a storage device, transmitted over a network connection, and the like) and is modifiable by software such as 2 k DA VINCI and DISCREET LUSTRE 4 k where color modification represent creative choices made by a person such as a colorist or a director.
Optionally, the color modifications represent an automatic operation where visual information is modified as to comport with the features of a device. For example, visual information that was stored by acquisition device 105 may have a color space that is different than used by editing device 110. Hence, the system is configured that the visual information is modified to the color space used for editing device 110 via a look up table, color transformation information, software translation and the like.
It should be noted that although the present invention acquisition/editing workflow system is discussed in regards to acquisition device 105 and editing device 110, it is to be appreciated that other devices may be used in the video editing workflow. For example, acquisition device 105 may represent a video camera used by a home user and editing device 110 may represent a home computer running Digital Video (DV) editing software, and other devices. Additionally, editing device 110 optionally represents a series of devices where each device is used for editing the visual information. For instance, one editing device 110 may be controlled by a director to make some initial color modification choices and a second editing device may be controlled by an operator to color correct such visual information before it is printed to film stock as a sequence of image frames by a rendering device 115.
Rendering device 115 is used for rendering the visual information as for display using a device such as a monitor, film projector, digital light processing (DLP), liquid crystal display, film printer, and the like, or rendering such information as film, CD-ROM, DVD, computer tape, flash card, and the like that is used to eventually render the visual information in a visual form. Preferably, the visual information will be composed as a sequence of moving images that are used to form things such as a video sequence (for videotape, DVD, streaming video, and the like) and images frames on film.
Applicants note that the principles of the invention, as to be explained below, also apply to consumer electronic equipment where a broadcaster or content producer (acting as a proxy for acquisition device 705) provides image data that represents a sequence of images. The broadcaster may specify for a specific type of device (such as for a particular type of display device) that one color correction technique be applied for one type of device (a plasma display device) and that a second color correction be applied for a second type of device (a Cathode Ray Tube display device). Other display devices may be specified, in accordance with the principles of the present invention. The principles of the present invention may also be used to calibrate editing equipment for consistent results when performing color correction operations.
As noted earlier in the application, device devices may utilize different color spaces or have had modifications performed on the colors of the visual information. The application presents a system of generating metadata that represents a framework where color modifications made may be transmitted between different devices and software used and/or provided by vendors. Preferably, this metadata is based off an XML schema associated color correction metadata with a sequence of images. The invention accommodates the premise where the color correction metadata may be temporally associated with a sequence of images where for between two time periods, two different sets color correction metadata are used.
In addition, the invention accommodates the premise where different color correctors may be used for visual data. Different types of color correctors and color correction operations are known in the art such as using a series of three 1-D lookup tables with Red Green and Blue (RGB) output (see
For example, for one time period for a sequence of images, the images are subject to first an RGB based color correction and then a color correction affected by a Lift Gain Gamma color correction operation. For a second time period, a second sequence of images is subjected to a 3-D lookup table and then an RGB based color correction operation. Other color correction processes may be used in accordance with the principles of the present invention.
TABLE 1 represents an exemplary embodiment of description tools used to perform color correction operations on a sequence of images. These operations may be reflected in the metadata that would be supplied with the sequence of images. Preferably, the described aspects of the color correction operators utilize terms defined as part of the Moving Picture Experts Group (MPEG-7) Visual Standard (ISO/IEC 15938-3:2002)
The application of color correctors can be applied anywhere within system 100, where multiple color correctors may take place for a specific device or at different times for a sequence of video images.
Time T2 represents where there is a gap of time between the application of color correctors C20 and C22. As shown, video sequences V20 and V22 are subjected to C20 and C22, respectively. V21 is not subjected to a color correction operation. Time T3 represents the circumstance where a video sequence V30 is subjected to a color corrector C30, as a video sequence V32 is subject to a color corrector C32. It is also apparent that for a video sequence V31, both color correction operations C30 and C32 are applied to the video sequence, as to overlap during their time of application.
TABLE 2 represents an exemplary embodiment of an XML schema used to generate metadata indicating color correction metadata in the form of a color decision list.
TABLE 3 represents an exemplary embodiment of an XML schema used to generate metadata indicating the color correction processed used to color correct a segment of images.
TABLE 4 represents an exemplary embodiment of an XML schema used to generate metadata indicating specifics of a particular color correction operation used to color correct a sequence of images.
A ColorLookup1D color correction operation, referred to above, preferably consists of three 1-D look up table for each color component. Typically, the quantization of the input color space equals the number of entries in the look up table. In this case, no interpolation is required. It is however possible for a 1-D color look up table to contain fewer entries than the input color space. For example, consider an RGB input color space with 6-bit color quantization and 16 entry look up table for each color component. The values of input values can be narrowed down to match the finite number of output values using interpolation techniques as known in the art.
A ColorLookup3D color corrector uses a 3-D color lookup table to perform transformations between color spaces. For example, a 3D lookup table with RGB input and output color spaces takes input RGB triples and maps it via interpolation, to an output R′G′B′ triple, with a potentially unique output value for each RGB input triple. For an exact representation, each combination of color components in an input color space would be mapped to an output color component, using each color component as a lookup table dimension. For example, for a 10-bit RGB input and output color, one would create a 3-D cube with 1024×1024×1024 (or over a billion entries). Hence, to use this type of color correction, it is preferable to use some type of interpolation such as trilinear or tetrahedral interpolation, with some restrictions (see Kasson, Nin, Plouffe, Hafner, “Performing Color Space Conversions with Three Dimensional Interpolation”, Journal of Electronic Imaging, July 1995, Vol. 4(4), for more information)
A ColorLookupPrecomputedMatrix color corrector uses a linear matrix transformation in the form of:
The ColorLookupPrecomputedMatrix operation then computes a list of lookup values for each coefficient in a 3×3 transform matrix. For each output value (R′, G′, or B′), the processing consists in three look up operations (one for each of R, G, and B) followed by two additions. Nine 1D look up tables (L′xx) are required in total:
The ColorFunctionLGG color corrector is done using controls derived from analog video processing, namely, changing the lift (known as black level, pedestal, or set up), gain, and gamma of the input-output transfer characteristic for each color component. The color corrector provides the ability to specify a color correction function rather than enumerating lookup values. Formally, lift, gain, and gamma are assigned in view of a signal color component R.
Lift raises the y-intercept of the input-output transfer characteristic, where
R′=R+L
where Lε[−1,1]
The output value R′ is limited by the normalized output interval [0,1], see
For Gain:
Gain (G′) scales the output value.
R′=GR 0≦G<∞
or equivalently
The output value R′ is limited by the normalized output interval [0.1].
See
For Gamma:
Gamma (γ) is a non-linear correction to a color component based on exponentation.
R′=Ry
where γε(0,1]
The output value R′ is limited by the normalized output interval [0,1].
See
The application of the functions listed above is in the order of Lift, Gain, and then Gamma. Or: R′=[G(R+L)]y
Preferably, for each sequence of images (video segment) should be described by a media time (starting time, instant, and duration) indicating the application (or non-application) of a color corrector. MPEG-7 MDS section 6.4 provides a number of different metadata sets to described media time. The time description is preferably based on a restricted set of lexical expressions of the International Standards Organization (ISO 8601) standard. Additionally, other representations of time for a sequence of images may be used such as Society of Motion Picture Television Engineers (SMPTE) time codes, using values in the format of a MediaTimePoint function, and using values in the format of a MediaTimeDuration function (both functions defined in MPEG-7), and the like.
It should be noted that although the principles of the present invention were described in regards to a sequence of video images, the invention supports the application of color correction over different regions of a sequence of images. Video segments decomposition tools (as described in MPEG-7 MDS ISO/IEC 15938-5 (11.4.9) describe how to decompose a series of video frames into respective space and time values. Alternatively, one may take an image from a sequence of images and decompose such an image into either rectangular or polygonal subregion (see MPEG-7 MDS ISO/IEC 15938-5 (11.4)). A color correction then may be applied to each subregion of each image.
If a specific region moves across frames, the principles of MPEG-7 MDS ISO/IEC 15938-5 (11.4) and MPEG-7 Visual 15938-3 (5.6, 9.4, and 10.3), may apply. By using the MPEG-7 tool moving region, a color correction may be applied to one or more moving regions whose outline is defined by a list of rectangles, ellipses, or an arbitrary polygon. The shape of the region can be rigid or non-rigid (such as polygon vertices that move relative to each other over with time). The motion of the region can be modeled by linear or quadratic time interpolation of the vertices of model parameters (such as two velocity parameters for rigid translation, four parameters for rotation and scaling, six parameters for affine transformations, and the like).
Optionally in step 710, there is temporal information included with the metadata that indicates when specific color correction operation should take place, as explained above. Alternatively, there may be metadata that indicates that a certain region of a sequence of images (generated from the visual information) should be color corrected using a specific technique for a specific length of time. Other sequences of images or image regions may be subject to the same or different color correction operations, in accordance with the principles of the present invention.
Step 715 has rendering device 115 performing a color correction operation in accordance with received metadata. Several different color correction operations may be performed, if the color correction metadata indicates this to be the case. In addition, the metadata may control for how long and/or what parts of the visual information should be affected by a selected color correction operation. In step 720, the color corrected data is rendered as a series of images outputted to a display device, printed on a film printer, or any other type of device capable of rendering visual information (image data) in view of the color correction operations. It is noted that any device in system 100, including acquiring device 105 and editing device 110 may perform the steps indicated above. In addition, the aspect of generating metadata that indicates the number of color correction operations to be performed, the specific color operations to be performed, and the order of such operations may be performed by any device of system 100, by applying the inverse method steps as shown in
It is also to be understood that the present invention may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. Preferably, the present invention is implemented as a combination of hardware and software. Moreover, the software is preferably implemented as an application program tangibly embodied on a program storage device. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units (CPU), a random access memory (RAM), and input/output (I/O) interface(s). The computer platform also includes an operating system and microinstruction code. The various processes and functions described herein may either be part of the microinstruction code or part of the application program (or a combination thereof) that is executed via the operating system. In addition, various other peripheral devices may be connected to the computer platform such as an additional data storage device and a printing device.
Although the illustrative embodiments have been described herein with reference to the accompanying drawings, it is to be understood that the present invention is not limited to those precise embodiments, and that various other changes and modifications may be affected therein by one of ordinary skill in the related art without departing from the scope or spirit of the invention. All such changes and modifications are intended to be included within the scope of the invention as defined by the appended claims.
This application claims the benefit of U.S. Provisional Application Ser. No. 60/614,110 filed Sep. 29, 2004.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2005/034891 | 9/29/2005 | WO | 00 | 3/19/2007 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2006/039357 | 4/13/2006 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
3852808 | Sadashige | Dec 1974 | A |
4962418 | Kamaga | Oct 1990 | A |
5181098 | Guerin et al. | Jan 1993 | A |
5874955 | Rogowitz et al. | Feb 1999 | A |
6536904 | Kunzman | Mar 2003 | B2 |
6844883 | Bakhmutsky | Jan 2005 | B2 |
7545541 | Dalrymple et al. | Jun 2009 | B2 |
20030202121 | Huh et al. | Oct 2003 | A1 |
20030234796 | Bakhmutsky | Dec 2003 | A1 |
20040086265 | Tojo et al. | May 2004 | A1 |
20050071888 | Girouard et al. | Mar 2005 | A1 |
20060088105 | Shen et al. | Apr 2006 | A1 |
20060139462 | Fukasawa et al. | Jun 2006 | A1 |
20060165378 | Noda et al. | Jul 2006 | A1 |
20070092159 | Larkin et al. | Apr 2007 | A1 |
Number | Date | Country |
---|---|---|
1083567 | Mar 2001 | EP |
1321794 | May 1990 | JP |
WO2004002170 | Dec 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20070268411 A1 | Nov 2007 | US |
Number | Date | Country | |
---|---|---|---|
60614110 | Sep 2004 | US |