The disclosed embodiments relate generally to high dynamic range (HDR) imaging, and more particularly, but not exclusively, to a system and method for combining a set of images into a blended HDR image in a time and cost efficient manner.
High dynamic range (HDR) is a set of techniques that are used in imaging and photographing and that can be employed to record greater luminance levels than normal photographic techniques. Traditional cameras with no HDR function take photographs with limited dynamic ranges, resulting in losses of scene details. For example, underexposure usually occurs in shadows and overexposure usually occurs in highlights when taking a non-HDR picture. This is because of the limited dynamic range capability of sensors. Usually sensors, including common charge-coupled device (CCD) sensors and complementary metal oxide semiconductor (CMOS) sensors, can acquire a dynamic range of about 1:1000, or 60 dB, of intensities of brightness. That means the maximum change is about 1000 times of the darkest signal. However, many applications require working in wider dynamic range scenes, such as 1:10000, or 80 dB. HDR imaging techniques compensate the losses of details by capturing multiple photographs at different exposure levels and combining them to produce a single image with a broader tonal range. To facilitate displays of HDR images on devices with lower dynamic range, tone mapping methods are applied to produce images with preserved localized contrasts.
To get multiple photographs for HDR imaging, modern cameras offer an automatic exposure bracketing (AEB) feature with a far greater dynamic rage. Based on this feature, it is easy to achieve a set of photographs with incremental exposure levels from underexposure to overexposure.
To display the multiple photographs taken for HDR imaging, a conventional process is accomplished by using application software running on PC. Nowadays, image signal processor (ISP) inside modern cameras has become much more powerful than before, which motivates vendors to develop faster HDR imaging methods to enable a built-in HDR feature. This trend significantly enhances convenience and efficiency of the photography. Besides, HDR video recording becomes possible if the HDR imaging can be calculated and displayed in real time.
A typical HDR process usually contains three steps: (1) estimation of camera response function; (2) combination of multiple exposures of a set of images for HDR imaging and (3) tone mapping for the combined HDR image. The imaging process of the camera is modelled as a non-linear mapping g(X) from scene radiance Ei and exposure configuration kj to pixel brightness Zi,j, denoted by Equation (1).
g(Zi,j)=ln Ei+ln kj Equation (1)
wherein kj is associated with aperture A (F-number), exposure time t and ISO speed S. Equation (1) is over determined because there are more equations than unknowns and can be solved by the least squares method. Usually g(X) is implemented with a look up table (LUT) from the gray scale to the radiance map.
After solving Equation (1), the combination of multiple exposures can be represented by Equation (2).
where w(X) is a weighting function of the brightness, denoting the weight of Zi,j when recovering a scene radiance. The resulting curves for the typical weighting functions are illustrated in
There are many tone mappers in the literatures for Equation (2). For a normal display, a simple tone mapping operator is denoted in Equation (3).
where
is a scaled luminance and α=0.18.
As can be seen from Equations (1)-(3), the conventional HDR imaging methods usually require large amounts of computational resources. For instance, the conventional methods lead to the uses of a least squares method, which is usually solved by the singular value decomposition or a QR decomposition. In addition, Equation (2) and Equation (3) need pixel-wise exponential and logarithm operations. Therefore, the computational complexity becomes the main issue for built-in HDR features, which complexity makes HDR video impossible.
Moreover, the conventional methods for HDR imaging based on the “Red, Green, Blue” (RGB) color space contain several disadvantages. Firstly, because all RGB channels are correlated to the luminance, the estimation of camera response function has to take under all three channels, and is therefore computationally expensive. Secondly, sampling is difficult to cover the range of gray scales. Sampling bias may decrease the performance of the estimation. Thirdly, chromatic noises in low lights may decrease the performance of the estimation. Finally, the tone mapping may lead to color distortion, such as white unbalancing.
Therefore, there is a need for a system and method for combining a set of images into a blended HDR image in a time and cost efficient manner while preserving the quality for HDR imaging.
This disclosure presents a method and apparatus for high dynamic range imaging. Based on the derivation, this disclosure points out that the recovery of the high dynamic range radiance maps used by the previous methods can be waived. The luminance channel is fast computed by combining different exposures with respective blending curves. It saves a large amount of computational resources compared with the existing methods. Performed on the yellow-luminance (Y), blue-luminance (U) and red-luminance (V) (YUV) color space, the proposed scheme does not require extra post-processing steps to overcome the problems such as chromatic misbalancing and luminance artifacts suffered by previous methods.
In accordance with a first aspect of the subject matter disclosed herein, a method for high dynamic range (HDR) imaging is provided. The method comprises: calculating weights of Y components of a set of images in YUV color space based on a set of lookup tables (LUTs); blending Y components of the set of images with the weights to generate blended Y components; and combining the blended Y components with corresponding UV components to generate a single image in YUV color space.
In accordance with some embodiments, the method further comprises initializing the set of lookup tables (LUTs) based on exposure configurations of the set of images for HDR imaging.
In accordance with some embodiments, the method further comprises exposing the set of images in RGB color space with the different exposure configurations, so each image has a unique exposure configuration; and converting the set of images from RGB color space into YUV color space before said calculating.
In accordance with some embodiments, the method further comprises averaging values of UV components of the set of images in the YUV color space to achieve averaged UV components.
In accordance with some embodiments, the combining comprises combining the blended Y components with the averaged UV components to generate a single HDR image in YUV color space.
In accordance with some embodiments, the method further comprises converting the single HDR image from YUV color space into RGB color space to generate an HDR image in RGB color space.
In accordance with some embodiments of the method, the calculating weights of Y components comprises calculating the weight for the Y component of each image with one function selected from a plurality of functions based on the exposure configuration of the image.
In accordance with some embodiments of the method, the calculating weight of the Y component of each image with one function selected from a plurality of functions based on the exposure configuration of the image comprises: applying to the Y component of a maximum underexposed image with a first modified sigmoid function; applying to the Y component of a normal exposed image with a derivative of the first modified sigmoid function; applying to the Y component of a maximum overexposed image with a second modified sigmoid function.
In accordance with some embodiments of the method, the calculating the weight of the Y component of each image with one function selected from a plurality of functions based on the exposure configuration of the image further comprises applying to the Y component of an underexposed image with a first interpolated function between the first modified sigmoid function and the derivative of the first modified sigmoid function; and applying to the Y components of an overexposed image with a second interpolated function between the derivative of the first modified sigmoid function and the second modified sigmoid function.
In accordance with some embodiments of the method, the applying to the Y component of a maximum underexposed image comprises applying a function of S(x, a) to the Y components of the maximum underexposed image; the applying to the Y component of a normal exposed image comprises applying a function of
to the Y components of a normal exposed image; and the applying to the Y components of a maximum over exposed image comprises applying a function of S(255−x, a) to the Y component.
In accordance with some embodiments of the method, the applying to the Y component of an underexposed image function comprises applying a function of
to the Y components; and applying to the Y component of an overexposed image comprises applying a function of
to the Y component.
In accordance with some embodiments, the factor α is within a range of [0, 1] and the factor β is within a range of [0, 1].
In accordance with some embodiments, the method further comprises smoothing the calculated weights of Y component.
In accordance with some embodiments of the method, the smoothing comprises applying a Gaussian filter with the calculated weights of Y component.
In accordance with some embodiments of the method, the averaging the UV components comprises applying an average calculation
to each UV component of the set of images.
In accordance with a second aspect of the subject matter disclosed herein, an imaging camera with high dynamic range imaging capacity is provided. The camera is provided with an HDR module for performing the method of any of the methods described in any of the above embodiments.
It should be noted that the figures are not drawn to scale and that elements of similar structures or functions are generally represented by like reference numerals for illustrative purposes throughout the figures. It also should be noted that the figures are only intended to facilitate the description of the exemplary embodiments. The figures do not illustrate every aspect of the described embodiments and do not limit the scope of the present disclosure.
The present disclosure sets forth a system and method for fast adaptive blending for high dynamic range imaging. Although generally applicable to any image taking devices that are capable of taking a set of images with differently specified exposures for high dynamic range imaging, the system and method will be shown and described with reference to a camera capable of taking a set of images with incremental exposure levels from underexposure to overexposure for illustrative purpose only.
To illustrate the purpose of the system and method disclosed herein,
Similarly, to take an HDR video, a camera 100 is shown in
Although the configurations in
Same concept applies to sequences of seven or more images. In a sequence of seven images, if the normal exposure value is set to 0EV and the interval constant is selected as 1, the exposure values of the sequence of images can be (−3EV, −2EV, −1EV, 0EV, +1EV, +2EV, 3EV). Similarly, if the interval constant is selected as 2, the sequence would become (−6EV, −4EV, −2EV, 0EV, +2EV, +4EV, 6EV). For the purpose of this disclosure, a sequence of images is equivalent to a set of images.
Now, one manner by which the camera 100 processes HDR images is illustrated with
Based on the weights calculated at 220, the HDR module 114 blends the Y component for each image of the set of images to generate one blended (or composite) Y component, at 206. Then, the HDR module 114, at 208, combines the blended Y components with corresponding UV components of the set of images to produce an HDR image in YUV color space.
Another embodiment of method 200 in
In the processing branch 201A of the method 200, the HDR module 114 selects, at 209, the UV components of the same set of images and, at 210, calculates an average of the values of selected UV components among the images to generate averaged UV components. At 208, the HDR module 114 combines the blended Y component and the averaged UV components to generate a single HDR image in YUV color space.
In another embodiment of the method 200, as shown in
After converting from RGB to YUV color space, the HDR module 114 calculates weight for the Y component for each image selected from the set of images in YUV color space. Similar to what is described in
In the manner discussed above with reference to
The above embodiments of a method 200 are developed based on further deductions of Equation (3) of the existing methods for HDR imaging described in the background session. In one embodiment, the methods 200 of
In this embodiment, the method 200 eliminates the tone mappings, as described for Equation (2) and Equation (3). In traditional HDR methods, those tone mappings are performed after the recovery of the radiance maps, and hence make such a recovery computationally expensive. This disclosure assumes a f(X) meets the property to meet an approximation such that:
Generally, it is reasonable to assume f(g(Zj(x, y))−ln kj)=Zj(x, y)+δj wherein δj is small in normal exposure, positive in under exposure and negative in over exposure. Hence, Equation (4) can be rewritten as Equation (5):
where ∈ is a distortion associated with parameter δj. If we assume the set of exposures is symmetric and the scene 198 covers a large range of gray scales, E(∈)=0 and distortion e can be omitted. Therefore, the HDR imaging can be simplified to a blending of different exposures.
Different from the previous methods described in the background session, the weighting function used in this disclosure takes the exposure configurations associated with the images into account. Mathematically, w (Zj(x,y)) is replaced by w(Zj(x,y), kj). Let
be a modified sigmoid function, the proposed weighting functions are defined in Equation (6A), when there are three images selected as one underexposed, one normal exposed and one overexposed:
where S′(x, a) is the derivative of S(x, a), and a is used to control the shape of curves.
One embodiment of a calculation based upon Equation (6A) is illustrated in
at 225. For the maximum overexposed image, the HDR module 114 applies a second modified sigmoid function, S(255−x, a), at 228. The Y components for each of the images can be blended based on the calculated weights, at 206, to generate a blended Y component in the manner set forth above with reference to
Based on the descriptions set above for Equation (6A), the calculation (6A) can be abstracted into:
An embodiment of the method according to Equation (6B) is illustrated in
Another embodiment of the method 200 includes processing of five images taken for HDR imaging. In this embodiment, the five images are taken with different, and normally symmetric, exposures: two underexposed images, one normal exposed image and two overexposed images, as the five images as described in
where S′(x, a) is the derivative of S(x, a), and a is used to control the shape of curves. α,β are blending factors linearly proportional to the exposure values. Each of the factors α and β is within a range of [0, 1].
An embodiment of one method 220 for calculating the weights to be applied to Y components in accordance with Equation (7A) is shown in
which is an interpolation function of the first modified sigmoid function S(x, a) and the derivative of the first modified sigmoid function
For the overexposed image 106E, at 227, the HDR module 114 applies a second interpolated function
which is an interpolation function of the derivative of the first modified sigmoid function
and the second modified sigmoid function S(255−x, a). The Y components for each of the five images are blended based on the calculated weights at 206 to generate a blended Y component.
The calculation (7A) can be abstracted into:
An embodiment of a method 220 according to (7B) is illustrated in
Even though the embodiments in
For the purpose of this embodiment, the maximum underexposed image, the normal exposed image and the maximum overexposed image can be selected as described with reference to
A process of sets of seven or more images is also available with another embodiment of a method 220. Under an embodiment to process a set of seven images, the exposure configurations and the selections the maximum underexposed image 106A, the normal exposed image 106B and the maximum overexposed image 106C are same as described with reference to the first exemplary embodiment for processing a set of seven images. Under this embodiment, the two underexposed images (106F, 106D) between the maximum underexposed image 106A and the normal exposed image 106B are both selected as underexposed images. The two overexposed images (106E, 106G) between the normal exposed image 106B and the maximum overexposed image are selected as overexposed images.
In accordance with the same embodiment, when applying Equation (7A) for the under exposed images and the overexposed images, a simple exemplary way to determine the factor α is to calculate it with a fraction of each exposure compensation value to a sum of all under exposure compensations except the maximum exposure (or over exposure compensation values except the maximum exposure). E.g. in an exposure sequence of (3EV, −2EV, −1EV, 0EV, +1EV, +2EV, 3EV) corresponding to images 106A, 106F, 106D, 106B, 106E, 106G and 106C, the sum of all under exposure compensations except the maximum exposure 106F, 106D is −3 and the exposure compensation value for 106F is −2; therefore, the factor α is two third ⅔. When calculating weights for 106F, Equation (7A) becomes:
The exposure compensation value for 106D is −1; therefore, the factor α is one third ⅓. When calculating weights for 106F, Equation (7A) becomes:
Same approach applies to the overexposed images 106E and 106G. In this embodiment, Y components of all seven images in the set are blended at 206.
The blending schemes at 206 based on Equations (6A) to (7B) may produce artifacts to the blended Y component, particularly, when the contrast of the scene 198 is particularly high. One embodiment of the method 200 can include operations for smoothing the weights calculated in an effort to reduce the artifacts. The operations for smoothing weights (shown in the
As described above, in order to avoid losses of image qualities and to speed up HDR imaging process, the embodiments of the proposed method processes images in YUV color space. Only Y component is taken to calculate the dynamic range. For UV components, all ordinary combining methods can be used. In one embodiment of the method 200, the UV components are combined with simple average calculations as shown in Equation (8),
Referring back to
A typical example diagram showing weight curves for a maximum underexposed image, a normal exposed image and a maximum overexposed image under one embodiment of the method 200 is illustrated in
The visualized weights of
According to our test results, when evaluated in an ARM Cortex A9 processor, the proposed method is about 10 times faster than the traditional method which requires the recovery of the HDR radiance maps.
This subject matter disclosure could be widely used in the HDR imaging of image and video, including but not limited to cameras and cellphones with cameras. It is not excluded that the subject matter disclosure could be also applied to other wide dynamic range imaging systems.
The described embodiments are susceptible to various modifications and alternative forms, and specific examples thereof have been shown by way of example in the drawings and are herein described in detail. It should be understood, however, that the described embodiments are not to be limited to the particular forms or methods disclosed, but to the contrary, the present disclosure is to cover all modifications, equivalents, and alternatives.
This is a continuation application of International Application No. PCT/CN2014/091940, filed on Nov. 21, 2014, the entire contents of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
5921931 | O'Donnell | Jul 1999 | A |
7433514 | Sloan | Oct 2008 | B2 |
7983502 | Cohen | Jul 2011 | B2 |
8737736 | Liu | May 2014 | B2 |
9598011 | Schachter | Mar 2017 | B2 |
9613408 | Micovic | Apr 2017 | B2 |
9858644 | Cao | Jan 2018 | B2 |
20070014470 | Sloan | Jan 2007 | A1 |
20080273119 | Yang | Nov 2008 | A1 |
20100157078 | Atanassov | Jun 2010 | A1 |
20100183071 | Segall | Jul 2010 | A1 |
20100271512 | Garten | Oct 2010 | A1 |
20110069906 | Park et al. | Mar 2011 | A1 |
20120002082 | Johnson | Jan 2012 | A1 |
20130335596 | Demandolx | Dec 2013 | A1 |
20140152686 | Narasimha et al. | Jun 2014 | A1 |
20140185931 | Aoki | Jul 2014 | A1 |
20160037044 | Motta | Feb 2016 | A1 |
20160080714 | Tsukagoshi | Mar 2016 | A1 |
20160093029 | Micovic | Mar 2016 | A1 |
20180012339 | Puetter | Jan 2018 | A1 |
Number | Date | Country |
---|---|---|
102420944 | Apr 2012 | CN |
102693538 | Sep 2012 | CN |
2004229259 | Aug 2004 | JP |
2007282020 | Oct 2007 | JP |
2012090309 | May 2012 | JP |
2012235390 | Nov 2012 | JP |
2013128212 | Jun 2013 | JP |
2013533706 | Aug 2013 | JP |
2014133340 | Sep 2014 | WO |
Entry |
---|
European Patent Office (EPO) European Search Report for 14906594.8 dated Feb. 8, 2017 8 Pages. |
The World Intellectual Property Organization (WIPO) International Search Report and Written Opinion for PCT/CN2014/091940 dated Aug. 27, 2015 6 pages. |
Reinhard et al., Photographic Tone Reproduction for Digital Images, ACM Transactions on Graphics, Jul. 2002, pp. 267-276, vol. 21, No. 3, ACM, New York. |
Debevec and Malik, Recovering High Dynamic Range Radiance Maps from Photographs, Proc. of the 24th Annual Conference on Computer Graphic and Interactive Techniques, Aug. 3, 1997, pp. 369-378, ACM, New York. |
Number | Date | Country | |
---|---|---|---|
20170154456 A1 | Jun 2017 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2014/091940 | Nov 2014 | US |
Child | 15431832 | US |