Embodiments of the invention relate to a method and system to reduce flicker artifacts in a captured image.
The instantaneous alternating current (AC) power is a continuously varying sinusoidal wave. Therefore, the instantaneous light intensity function of a lamp also varies accordingly and can be modeled as:
I′(t)=I+A|cos [ω(t+θ)]| (1)
where I′ is the observed instantaneous light intensity, I is a constant term respect to time, A|cos [ω(t+θ)]| is a varying term and is due to temporal fluctuations in ambient light , A is a constant, ω=2 πf is the power frequency at either 50 Hz or 60 Hz, t is the time, and θ is the initial phase. The varying term causes artifacts in a captured image. This phenomenon is known as light flicker. I′(t) is a periodic function. Because of the absolute term, its period is 1/(2 f), half of the cosine period.
Light flicker could cause noticeable artifacts during image and video capture. This is because the image pixel value is proportional to the integral of the instantaneous light function:
Y(α,T)=∫α−θα−θ+T[I+A|cos(ω(t+θ))|]dt=∫αα+T[I+A|cos(ωt)|]dt (2)
where α−θ is the starting time or the initial phase, T is the integration time or the camera exposure time, γ is the captured image pixel value. Y(α,T) has a period of τ=½ƒ with respect to T, where τ is the half of the power frequency period. From this equation, it can be seen that the γ value changes as a function of α, except when T is an integer multiple of τ, 1/(2 f). In that case, the γ value stays the same regardless of the initial phase α.
Depending on a camera's shutter mechanism, the light flicker causes various artifacts in the image, if T is not set to an integer number of the period τ. There are two popular types of shutters, namely snapshot and rolling shutter. In the snapshot case, all rows in the image are exposed at the same starting and ending time. If a sequence of video frames or still images is captured in the snapshot mode, each frame is exposed at a different time. Pixel values then vary from frame to frame even if the scene is static, which is easily noticeable during video playback.
In the rolling shutter case, only a number of rows are exposed at any given time. Imagine an exposure window that covers the exposed rows. The width of the window is the number of rows. The window moves one row at a time from the top to the bottom, and then rotates back to the top. At each window location, pixel values in the longest exposed row are read out. The exposure time is nβ, where n is the number of rows in the window and β is the time difference between the read-outs of two consecutive rows. β is usually constant within each image. Because each row is exposed at a different starting time, the pixel values shift from row to row. The net effect is a vertical periodic signal being superimposed on the original image.
When the magnitude of the fluctuation A is small, the bright and dark stripes in
One method to correct for flicker artifacts in a captured image is to set the exposure time T to an integer number of the period, 1/(2 f). However, with this method the allowable exposure time is constrained to a fixed set of values. In the example of 60 Hz power line, they are: 1/120 s, 2/120 s, 3/120 s. Any values between those numbers would not be allowed. Moreover the exposure time cannot be set to be smaller than 1/120 s without avoiding the flicker.
In one embodiment, a method for controlling artifacts in an image due to a presence of flicker in ambient light is provided. The method comprises determining a power frequency for the ambient light; determining a magnitude and phase of flicker fluctuation in the image due to the flicker; and adjusting the image for the flicker fluctuation based on the power frequency, magnitude, and phase.
Other aspects of the invention will be apparent from the detailed description below.
Embodiments of the invention will now be described, by way of example, with reference to the accompanying diagrammatic drawings, in which:
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one skilled in the art, that the invention may be practiced without these specific details. In other instances, structures and devices are shown at block diagram form only in order to avoid obscuring the invention.
Reference in this specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Moreover, various features are described which may be exhibited by some embodiments and not by others. Similarly, various requirements are described which may be requirements for some embodiments but not other embodiments.
Embodiments of the present invention disclose techniques reducing or removing light flicker artifacts in captured images, regardless of whether the images are still images or video images. The techniques are designed for rolling shutter cameras. In one embodiment, there are three steps. The first step is to determine the AC power frequency, e.g. 50 Hz or 60 Hz. The second step is to determine the magnitude and the phase of the flicker fluctuation in captured images, given the power frequency. The third step reduces or removes the flicker artifacts from the images based on the determined power frequency, magnitude, and phase. Advantageously, the exposure time T need not be limited to an integer number of the period, 1/(2 f).
Embodiments of the present invention also cover an image processor that implements the techniques of the present invention. A camera system that includes an image processor that implements the techniques of the present invention is also covered.
Turning now to
The techniques to reduce flicker reduction in captured images are based on the Equation (2). Since Y(α,T) as defined in Equation (2) has period of π/ω with respect to T, in one embodiment it is assumed that 0<T≦π/ω·γ(α) may be evaluated to the following four cases, in one embodiment of the invention:
In on embodiment, Y(α,T) may be evaluated to the following:
In one embodiment 1A and 1B may be combined, and 2A and 2B may be combined as:
It is to be observed that the first term IT is a constant term with respect to time while the second term is the sinusoidal term. Moreover, when T=kπ/ω where k is an integer, both 1 and 2 may be evaluated to:
Equation (3) confirms that when T=kπ/ω, Y(α,T) is a constant function with respect to α, the initial time.
Turning now to
Determination of Power Frequency at Block 22
There are two matches for the power frequency, viz. 50 Hz or 60 Hz. Thus in order to determine the power frequency, in one embodiment differences between images taken at exposure times matched to possible choices (50 Hz or 60 Hz) for the power frequency are compared as follows. First two sets of images under the ambient or AC powered light are taken. Each set consists of two images shot of the same scene. For simplicity of discussion, assume the first images are taken at time 0, while the second images are taken at time α1 and α2, respectively. Let
ω1=2π·60 and ω2=2π·50
The exposure times are given by T1=kπ/ω1 and T2=kπ/ω2, where T1 is the exposure time for the first image and T2 is the exposure time for the second image, and where k is an integer number set to be suitable for the light condition. Further α1 and α2 are selected so that
α1≠kπ/ω1 and α2≠kπ/ω2
Based on the foregoing, either the first and second image in the first set will be identical, or the first and second image in the second set will be identical, depending on whether the power frequency is 50 Hz or 60 Hz. In practice, noise and possible motion would introduce small differences between the two images. Thus, in one embodiment the images are adjusted for motion using a motion compensation technique before comparing the images. However, if the exposure time is matched to the wrong frequency, Equation (3) no longer holds and there should be a significant difference between the two images. In one embodiment, differences calculated from the two image sets are compared. The power frequency that gives the smaller error or difference is selected as the power frequency for the ambient light. It is to be appreciated that the use of 50 Hz and 60 Hz as possible choices of the power frequency of the ambient light is for illustrative purposes only and that techniques disclosed herein are applicable for other frequencies of ambient light.
Determination of Flicker Magnitude and Phase at Block 24
The flicker magnitude and phase, A and α in Equation (2), are difficult to estimate because the flicker term is not directly compared. There is also the signal term IT that differs at each pixel location. In order to remove the signal term, in one embodiment two images are taken. A first image is taken with the exposure set to be a full period of γ, T1=π/ω. For the first image one of the images taken at block 22 may be used. Thereafter, a second image is taken with an exposure of a half period of γ, T2=π/(2ω). It is assumed that the first row of the first image is taken at time 0, and the first row of the second image is taken at time α0. In other words, the second image has an initial phase of α0.
Going back to the previous derivations for Case 1 and Case 2, when T=π/(2ω),
After further derivations, the two cases may be combined to be:
Compared to Equation (3), we obtain:
Z(α) also has a period of π/ω. It is to be noted that the new term Z is independent of the term IT. In other words, Z only depends on the flicker fluctuation and is independent of the actual signal at a pixel location. Therefore, the Z value should be identical for pixels in the same row. If the Z value of any pixel from each row, or the average Z value for each row is taken , a sequence values Z(αn) and αn=nβ+α0 may be obtained, where n is the row number, β is the time difference between the read-outs of two consecutive rows, and α0 is the initial phase for the first row.
In one embodiment a may be estimated by finding the zero crossings of Z. Since
Therefore, by determining the zero crossing and the direction where Z is going, the value of α at that location may be estimated as follows. If Z is zero and going from negative to positive, then:
where k is an integer, n is an integer number of pixel width, u is a fraction of one pixel width. Since Z is periodic, the effective value α0 is limited to 0≦α0<π/ω. That constraint eliminates k from the equation so that α0 can be estimated. Similarly α0 may be estimated if Z is zero and going from positive to negative.
Since equation (5) has a maximum and minimum value of
In one embodiment the maximum values and minimum values in the Z sequence are found, and A is calculated from Equation (6). Of course, this is only an approximation because the Z sequence is a set of discreet samples of the actual Z curve. Another technique for determining A is to use estimated α0 value and recalculate A from Equation (5). In one embodiment, the values at different pixel locations are averaged to get a more accurate estimate. In one embodiment, the results from both of the methods for determining A are combined.
Row Averages
One way to calculate Z values is to calculate Equation (5) at every pixel. To obtain a sequence of Z with respect to α, in one embodiment a Z value from each row is selected. In another embodiment an average of the Z values for each pixel is determined and used as the Z value for that row. Since the subtraction of γ values in Equation (5) is a linear operation, in one embodiment the pixel values in each row are averaged first before calculating the Z values. Using the row averages can save a large amount of memory. Instead of storing Z values for the entire image, only a column of Z values is needed. It can also speed up the processing because Z values at each pixel location need not be calculated. In general, using the row average reduces the noise and gives more accurate estimates.
Average of Z Values over Multiple Periods
Since Z is a periodic function, the entire sequence of Z values need not be analyzed. Thus, in one embodiment, just one period of the Z sequence is calculated. In another embodiment better an average of all the periods is calculated to smooth out the noise. To average over all periods, in one embodiment a conversion is performed to convert the unit of the period in time expressed in seconds to number of rows using the equation:
where β is the shutter time delay between two consecutive rows, and m is the length of the period for the Z sequence in terms of number of rows. For a rolling shutter, the exposure time is usually set in terms of number row delays. For an exposure time set to be half period of the AC power, the following relationship holds:
where n is the number of rows being exposed. It will be observed that m is same as n, the number used to set exposure time.
Smoothness and Noise Removal
In practice, image data is usually noisy. Thus in one embodiment a smoothing or noise removal step may be performed. Typical smoothing methods include Gaussian filtering, rectangular window filtering, median filtering, etc. First, the row averages are filtered along the vertical direction. Then after Step 2b where an average over all the periods is determined, the Z averages are smoothed. Usually the longer the filter length, the smoother is the data, allowing the initial phase α0 to be estimated more accurately. In one embodiment, filters of various lengths are used. Shorter ones are used to estimate A, while longer ones are used to estimate α0.
Estimation of the Zero Crossing Locations
Often, a zero crossing for the Z function may fall in between two pixels in which case, in one embodiment an interpolation operation is performed to obtain the pixel value of the Z function at the zero crossing. The interpolation operation may be a linear interpolation between the two closest pixels that span the zero crossing. For example if the zero crossing p lies in between point p1 and p2 with the corresponding Z values Z1 and Z2, p may be estimated by:
p=p1+|Z1|/(|Z1|+|Z2|)
For each period, there are two zero crossing values, one going from negative to positive and the other from positive to negative. These points are half a period apart. Suppose the two points are labeled as q1 and q2, and q1<q2. Thus, the zero crossing results may be verified by the by the following condition:
where thresh could be set to, for example 5%, for a high confidence in the result. The final value of q1 may be further adjusted by the average of the two:
In one embodiment an adjustment for q2 is performed which is similar to the adjustment for q1
Adjusting the Image for Flicker Fluctuation at Block 24
In one embodiment, having determined the flicker magnitude and the initial phase in accordance with the above techniques, the flicker reduction logic 18 at block 24 adjusts for flicker artifacts on any images taken with an exposure time that is not an integer multiple of the half period of the AC power. Theoretically, the estimated values can be applied to all subsequent frames, if the light condition is not changing and the internal clock is accurate. However, over time error accumulates, and in one embodiment the magnitude and phase are re-estimated periodically or when lighting conditions change.
In one embodiment, adjusting an image for flicker artifacts includes, calculating the flicker varying term in Case 1 and Case 2 for each row and subtracting that term from each pixel in that row. The resulting image may be dimmer than the original image. Thus, in one embodiment a constant may be added to the pixels in each row so that the two images have the same average intensity.