Reference is made to commonly assigned, co-pending U.S. patent application Ser. No. 12/510,431 filed Jul. 28, 2009, entitled: “Detection of Objects Using Range Information”, by S. Wang, and commonly assigned, co-pending U.S. patent application Ser. No. 12/511,111 filed Jul. 29, 2009, entitled: “Adjusting Perspective and Disparity in Stereoscopic Image Pairs”, by S. Wang, both of which are incorporated herein by reference.
This invention relates to digital image enhancement, and more particularly to a method for adjusting object brightnesses in a digital image using range information.
In many imaging system, it is desirable to optimize exposure and brightness. However, the captured image is often not pleasing, because of exposure and brightness.
A common problem that occurs in many digital imaging systems is that the images are too dark or too light. Often these problems result from errors introduced by the exposure control system in a digital image capture device. Many methods have been developed to improve the brightness of digital images. Such techniques are often referred to as “scene balance algorithms.” Scene balance algorithms can vary widely in their complexity, as well as in the accuracy of their results. They typically involve analyzing the distribution of overall exposure levels and relative color signal levels in an image to determine the appropriate level of brightness and color balance compensation that is needed.
Examples of prior art brightness adjustment methods are described in U.S. Pat. Nos. 4,101,217, 4,707,119, 4,984,013; 4,945,406, 5,016,043, 6,636,646, 6,845,181, 7,129,980 and 7,289,154.
Conventional scene balance algorithms apply global brightness and color balance adjustments to an entire image. Many times only portions of an image may suffer from being too dark or too light. For example, an object in the shadows, or an object that is in the background of a flash image may be too dark, or an image in the foreground of a flash image may be too light. U.S. Pat. Nos. 6,275,605 and 7,043,090 describe methods for adjusting the tone scale of an image so that different brightness adjustments can be applied to different portions of an image. The tone scale adjustments are based on an analysis of the two-dimensional digital images. Sometimes these algorithms suffer from artifacts where the amount of tone scale adjustment is not constant within an object.
Most prior art brightness adjustment methods determine brightness correction values based on an analysis of conventional two-dimensional digital images. US Patent Application Publication No. 2007/0126921 describes an exposure and tone scale adjustment method that uses range information for an image. With this method, the range information is used to determine a weighting function that places additional importance on image regions that are likely to contain important objects. The weighting function is then used to determine an overall exposure adjustment amount for the image. While this method makes it more likely that the main subject of an image is properly exposed, it does not address the problem that different objects in an image may need different exposure adjustments.
The present invention represents a method for adjusting the brightness of objects in a digital image, the method implemented at least in part by a data processing system and comprising:
receiving the digital image representing a scene;
identifying range information associated with the digital image and including distances of in the scene from a reference location;
generating a cluster map based at least upon an analysis of the range information and the digital image, the cluster map grouping pixels of the digital image by their distances from the reference location;
detecting a plurality of objects in the digital image based at least upon an analysis of the cluster map and the digital image;
determining a brightness adjustment amount for each object; and
storing the brightness adjustment amounts in a processor-accessible memory system.
It is an advantage of the present invention that by using range information a plurality of objects can be detected and segmented with improved accuracy. Furthermore, the brightness of each object can be adjusted individually.
It has the additional advantage that the object brightnesses can be adjusted to account for non-uniform lighting of the scene, for example due to fall off associated with flash lighting.
It has the additional advantage that a user interface can be provide user input to independently specify brightness adjustment amounts for each object.
In addition to the embodiments described above, further embodiments will become apparent by reference to the drawings and by study of the following detailed description.
The present invention will be more readily understood from the detailed description of exemplary embodiments presented below considered in conjunction with the attached drawings, of which:
The present invention is inclusive of combinations of the embodiments described herein. References to “a particular embodiment” and the like refer to features that are present in at least one embodiment of the invention. Separate references to “an embodiment” or “particular embodiments” or the like do not necessarily refer to the same embodiment or embodiments; however, such embodiments are not mutually exclusive, unless so indicated or as are readily apparent to one of skill in the art. The use of singular and/or plural in referring to the “method” or “methods” and the like is not limiting.
The phrase, “digital content record”, as used herein, refers to any digital content record, such as a digital still image, a digital audio file, a digital video file, etc.
It should be noted that, unless otherwise explicitly noted or required by context, the word “or” is used in this disclosure in a non-exclusive sense.
The data processing system 10 includes one or more data processing devices that implement the processes of the various embodiments of the present invention, including the example processes of
The data storage system 40 includes one or more processor-accessible memories configured to store information, including the information needed to execute the processes of the various embodiments of the present invention, including the example processes of
The phrase “processor-accessible memory” is intended to include any processor-accessible data storage device, whether volatile or nonvolatile, electronic, magnetic, optical, or otherwise, including but not limited to, registers, floppy disks, hard disks, Compact Discs, DVDs, flash memories, ROMs, and RAMs.
The phrase “communicatively connected” is intended to include any type of connection, whether wired or wireless, between devices, data processors, or programs in which data may be communicated. Further, the phrase “communicatively connected” is intended to include connections between devices or programs within a single data processor, connections between devices or programs located in different data processors, and connections between devices not located in data processors at all. In this regard, although the data storage system 40 is shown separately from the data processing system 10, one skilled in the art will appreciate that the data storage system 40 may be contained completely or partially within the data processing system 10. Further in this regard, although the peripheral system 20 and the user interface system 30 are shown separately from the data processing system 10, one skilled in the art will appreciate that one or both of such systems may be stored completely or partially within the data processing system 10.
The peripheral system 20 may include one or more devices configured to provide digital content records to the data processing system 10. For example, the peripheral system 20 may include digital still cameras, digital video cameras, cellular phones, or other data processors. The data processing system 10, upon receipt of digital content records from a device in the peripheral system 20, may store such digital content records in the data storage system 40.
The user interface system 30 may include a mouse, a keyboard, another computer, or any device or combination of devices from which data is input to the data processing system 10. In this regard, although the peripheral system 20 is shown separately from the user interface system 30, the peripheral system 20 may be included as part of the user interface system 30.
The user interface system 30 also may include a display device, a processor-accessible memory, or any device or combination of devices to which data is output by the data processing system 10. In this regard, if the user interface system 30 includes a processor-accessible memory, such memory may be part of the data storage system 40 even though the user interface system 30 and the data storage system 40 are shown separately in
Range information 105 associated with the digital image 103 is identified in identify range information step 104. The range information 105 includes distances of pixels in the scene from a known reference location. The viewpoint location needs to identified relative to the given range information. Usually, the viewpoint location is the reference location. Range information 105 is preferably presented in the form of a range map provided by a ranging camera which uses visible light, infrared light, laser light or ultrasound to determine distances to pixels in the scene. Alternately, the range map can be provided using stereoscopic image processing techniques that involve capturing images of a scene from multiple viewpoints and determining the range information by evaluating the relative positions of objects in the scene. For cases where the range map has different dimensions (i.e., number of rows and columns) than the digital image 103, the range map is preferably interpolated so that it has the same dimensions
In detect objects step 106, a plurality of objects 107 is detected in the digital image based at least upon an analysis of the range information 105. In determine brightness adjustments step 108, a brightness adjustment amount for each object is determined using an analysis which takes into account each object's distance from the viewpoint, which is the position of the camera. Each pixel in the digital image receives a brightness-parameter a based on its corresponding distance from the viewpoint, as indicated by the interpolated range map:
α(x,y)=f(dis(x,y))
where α(x, y) is the brightness parameter of the pixel in location (x, y) in the digital image, dis(x, y) is the distance of this pixel from the viewpoint, and f( ) is a linear function or a nonlinear function such as a power function, a polynomial function or an exponential function. For example, to correct for brightness differences due to non-uniform lighting in a flash scene, the f( ) function can be designed to compensate for the fact that exposure will fall off according to the well-known “inverse square law” relationship so that distant objects are brightened while closer objects are darkened.
A brightness adjustment amount γ of each object that was detected in detect object step 106 is determined by taking an average of the brightness parameter values α(x, y) for every pixel in that object, as
where γj is the brightness adjustment amount for the jth object and m is the number of pixels in the jth object.
In addition to the brightness-parameter α(x, y) that is based on the range value, additional brightness parameters can also be used. For example, an additional brightness parameter can be based on the location of the pixel within the image (e.g., the brightness of pixels near the edges of the digital image may be increased to account for lens fall-off in the image). Similarly, an additional brightness parameter can be based on local image structure (e.g., the brightness of pixels located at or near image locations having high edge gradient are may be increased).
In another variation of the present invention, the brightness adjustment amounts for the detected objects 107 can be specified by a user using a digital image editing application having a user interface. The user interface would allow the user to provide user input by selecting one of the detected objects 107 and specify a desired brightness adjustment amount, for example using a slide bar. The brightness adjustment amount could be applied to the object 107 and a preview image could be presented to the user showing the new brightness level. The user-specified brightness adjustment amounts can be supplemental to the automatically determined brightness adjustment amounts, or alternatively they can be specified relative to the unadjusted object brightnesses in the original digital image 103.
In optional modify brightness adjustments step 109, the brightness adjustments for each object are modified responsive to other factors. (The dashed lines in
The brightness of each object in the digital image 103 can be adjusted by applying each object's brightness adjustment 110. Methods for modifying the brightness of an image are well-known in the art. In a preferred embodiment of the present invention, when the pixel values of the digital image 103 represent the log of the exposure, then the brightness adjustment 110 can be applied by linearly adding an offset to the pixel values. Similarly, when the pixel values of the digital image 103 are proportional with the exposure, then the brightness adjustment 110 can be applied by nonlinearly scaling the pixel values by a constant multiplier. In either case, the brightness adjustment 110 models the physical process of scaling the amount of light in the scene (e.g. a dimming or brightening of the source illumination). The brightness adjustments 110 can also be applied in other ways. For example, the L* (lightness) or the Y (luma) of an object can be shifted or nonlinearly modified when the digital image 103 is in the CIELAB color space or a YCrCb luma-chroma color space, respectively.
objects=f(clustermap,I)
where the function f( ) is an object segmentation operation applied to the digital image I using the cluster map 203. The digital image I will be the digital image 103. The function f( ) works by identifying pixels in the cluster map 203 having the same distance, then assigning the corresponding pixels in the digital image 103 to a corresponding object 107.
Edges are detected in the digital image using an identify edges step 308. In a preferred embodiment of the present invention, the edges are identified using a gradient operation. The gradient of an image is defined as:
where I(x, y) is the intensity of pixel at location (x, y). The magnitude of the gradient vector is:
G=[Gx2+Gy2]1/2.
Edges are detected in the digital image based on the magnitude of the gradient in each pixel.
Next, filter edges step 310 is used to filter the detected edges to remove insignificant edges and keep the significant edges. Mathematically, the filtering operation can be expressed as:
where e is one of the detected edges, S(e) is the sum of gradient magnitudes of each pixel in the edge e, f is a filter mask and T is the threshold.
The pixel clusters produced by the reduce cluster noise step 306 will typically still have errors in the boundary areas because of the noise in the range map. A refine clusters step 312 is used refine the cluster groups and produce cluster map 203. The boundary of cluster groups are refined by using the significant edges computed in the filter edges step 310. If pixels are outside of the detected significant edges in each cluster group, they will be removed. This will make the boundaries of the cluster groups much more accurate. Next, an average distance, n, is computed in each of the refined cluster group as:
where m is the number of pixels in the cluster group w, dis(i) is the distance of the ith pixel to the viewpoint location. By assigning the average distance to each pixel in the cluster groups, the cluster map is generated.
It is to be understood that the exemplary embodiments are merely illustrative of the present invention and that many variations of the above-described embodiments can be devised by one skilled in the art without departing from the scope of the invention. It is therefore intended that all such variations be included within the scope of the following claims and their equivalents.
Number | Name | Date | Kind |
---|---|---|---|
4101217 | Fergg et al. | Jul 1978 | A |
4707119 | Terashita | Nov 1987 | A |
4945406 | Cok | Jul 1990 | A |
4984013 | Terashita | Jan 1991 | A |
5016043 | Kraft et al. | May 1991 | A |
6243133 | Spaulding et al. | Jun 2001 | B1 |
6252976 | Schildkraut et al. | Jun 2001 | B1 |
6275605 | Gallagher et al. | Aug 2001 | B1 |
6573932 | Adams, Jr. et al. | Jun 2003 | B1 |
6636646 | Gindele | Oct 2003 | B1 |
6845181 | Dupin et al. | Jan 2005 | B2 |
6873743 | Steinberg | Mar 2005 | B2 |
7043090 | Gindele et al. | May 2006 | B2 |
7046400 | Gindele et al. | May 2006 | B2 |
7116838 | Gindele et al. | Oct 2006 | B2 |
7129980 | Ashida | Oct 2006 | B1 |
7158174 | Gindele et al. | Jan 2007 | B2 |
7230538 | Lai et al. | Jun 2007 | B2 |
7289154 | Gindele | Oct 2007 | B2 |
7421149 | Haynes et al. | Sep 2008 | B2 |
7421418 | Nakano | Sep 2008 | B2 |
7526127 | Koide et al. | Apr 2009 | B2 |
7652716 | Qiu et al. | Jan 2010 | B2 |
20020126893 | Held et al. | Sep 2002 | A1 |
20030007687 | Nesterov et al. | Jan 2003 | A1 |
20030044063 | Meckes et al. | Mar 2003 | A1 |
20030044070 | Fuersich et al. | Mar 2003 | A1 |
20030044178 | Oberhardt et al. | Mar 2003 | A1 |
20030223622 | Simon et al. | Dec 2003 | A1 |
20040240749 | Miwa et al. | Dec 2004 | A1 |
20050157204 | Marks | Jul 2005 | A1 |
20070121094 | Gallagher et al. | May 2007 | A1 |
20070126921 | Gallagher et al. | Jun 2007 | A1 |
20070274604 | Schechner et al. | Nov 2007 | A1 |
20080112616 | Koo et al. | May 2008 | A1 |
Number | Date | Country |
---|---|---|
WO 2008102296 | Aug 2008 | WO |
WO 2009072070 | Jun 2009 | WO |
Number | Date | Country | |
---|---|---|---|
20110026051 A1 | Feb 2011 | US |