The present invention relates to a method and a device for digital image stabilization for removing unwanted camera movements, called jitter, from an original sequence of images generated by said camera and obtaining a stabilized sequence of images, said original sequence being applied to a stabilization algorithm, in order to remove said jitter.
Such a method and device may be used, for example, in a digital low/mid-end movie camera or in a mobile phone.
As well known, the goal of digital image stabilization methods is to remove unwanted camera movement or jitter, thus providing a more pleasant viewing experience, to produce a sequence that displays requisite camera movement only. Jitter is defined as all the undesired positional fluctuation of the image that is added to the intentional motion of the camera, in particular translation and/or rotation.
Stabilization algorithms are sometimes rigid in their processing. It is difficult for these algorithms to be fully suitable to all kinds of input sequences to be stabilized: for example sequences with a single view point, panoramic shots, sequences where the user is moving forward or backward, etc.
The need is thus felt to have a more flexible method, on the one hand, and optimal with many types of input sequences, on the other hand, and a corresponding device.
It is therefore an object of the invention to propose such a device.
To this end, the invention relates to a device such as defined in the introductory paragraph of the description and which is moreover characterized in that it comprises:
It is also an object of the invention to provide a method that can be carried out in such a device.
To this end, the invention relates to a method such as defined in the introductory paragraph of the description and which is moreover characterized in that the method comprises:
In accordance with the most important feature of the method, intrinsic properties of the sequence are used and evaluated in order to improve the stabilisation efficiency.
In practice, to implement the method, an “in loop strategy” is used. The key tool in the loop according to an important feature of the method is the evaluation of some relevant parameters inherent to the sequence which can influence the perception of the stabilization by the user.
To achieve this goal, an original jittered sequence (or a part of) is first stabilized by a default version of the stabilization algorithm. Then global motion and frequency characteristics of the stabilized sequence are taken into account to adapt the thresholds/parameter values in the algorithm to obtain a better stabilization.
The method according to the invention can be applied to any high/low/mid-end digital cameras as a dynamic video stabilization tool, implemented in a mobile phone, Key-ring, a PC program etc.
Other features of the invention are found in the dependent claims.
Additional objects, features and advantages of the invention will become apparent upon reading the following detailed description and upon reference to the accompanying drawings in which:
In the following description, well-known functions or constructions by the person skilled in the art are not described in detail since they would obscure the invention in unnecessary detail.
To fix the ideas, without limiting the scope of the invention, it will be considered hereafter that jittered images are generated by a movie camera (not shown). Both original and stabilized sequences of images can be stored in memory means, modules 10 and 12.
A module 11 implements a stabilization algorithm and handles also the implementation of stabilization algorithm filters (module 110), which will be detailed hereafter.
The stabilization algorithm module 11 creates a stabilized sequence 12 from the original, i.e. jittered sequence 10 generated by the camera. The images can be divided in large areas of pixels called “macro blocks”. Some relevant sequence parameters coming from the stabilization algorithm module 11, such as global motion vectors, hereafter called “GMVs”, or motion vectors per macro block, hereafter called “BMVs”, are sent to a sequence parameter module 13 and stored therein. Then, they are sent to an evaluation module 15 (output O13) and evaluated (the different circuits of the evaluation module 15 will be detailed in
The analysis performed in the evaluation module 15 leads to an objective measure of the video stabilization quality, by using measurements of different natures (on both original 10 and stabilized 12 image sequences): the well-known parameter called “peak signal-to-noise ratio” or “PSNR” of consecutive images, a frequency analysis of the motion vectors along the sequence and the analysis of the motion parameters of the current image. This operation is performed for each frame and is detailed hereafter. Based on this evaluation, the stabilisation parameters of the algorithm are dynamically adjusted (for each frame or for a group of N frames). The possible adjustments of the motion/jitter filtering part of the stabilization algorithm and the relationship between the evaluated parameters and the interaction on the stabilisation algorithm will be also detailed later.
The evaluation module 15 will now be described with respect to detailed
A module 14 (see
Several measures are combined to create a “Stabilization Quality Metric”, hereafter called “SQM”, using a combination of parameters based on the motion-frequency content of the original sequence 10 and the stabilized sequence of images 12, perceptual considerations and a regular mean “Inter-frame Transformation Fidelity measure”.
More particularly, in accordance with a preferred embodiment of the method of the invention “Objective video Stabilization quality measure system”, “SQM” is measured as follows.
The motion spectrum, based on consecutive “GMVs” values, is divided into two energy bands, one for high frequencies, typically above 1 Hz, and one for low frequencies, below 1 Hz (in that example).
Thus the estimation of the quality of stabilization called Estimation_of_Stabilisation_Quality, is given by the following equation:
Estimation_of_Stabilisation_Quality=(α0+α1*ABelow+α2*AAbove)*ITF+(α0+β1*ABelow+β2*AAbove)*redHF+(γ0+γ1*ABelow+γ2*AAbove)*redLF (1)
where α1 and α2 represent the respective importance of Abelow and Aabove in the way they ponder the Inter-frame Transformation Fidelity (ITF) and α0 represents the overall importance of the ITF as such in the total measure; β1 and β2 represent the respective importance of Abelow and Aabove in the way they ponder the high frequency reduction (redHF) and β0 represents the overall importance of redHF as such in the total measure; and γ1 and γ2 represent the respective importance of Abelow and Aabove in the way they ponder the low frequency reduction (redLF) and γ0 represents the overall importance of redLF as such in the total measure, respectively. These parameters are experimentally determined.
The other parameters are detailed hereafter:
Aabove and Abelow represent the proportion of the energy in both bands (above/below, for example, 1 Hz respectively) compared to the total motion energy; and HF_Energy, and LF_Energy are the absolute levels of high and low frequencies energy.
Said parameters are given by the two following equations:
RedHF and RedLF represent the reduction of motion energy between original 10 and stabilized 12 sequences over the considered two frequency bands. This should be a primary indicator for jitter reduction, particularly over high frequencies.
ITF represents an index, which is a standard “PSNR” computed on the luminance data between two consecutive frames. The final index is averaged over the whole sliding test window (or whole sequence in the case of sequence-wise measurements).
Equation (4) recalls the expression of the “PNSR” between consecutive frames:
where aij and bij are the correspondent pixels of the current and consecutive frames.
The “Inter-frame Transformation Fidelity” or “ITF” measure is calculated from “PSNR” between consecutive images of the sequence (PSNR (Ik, Ik+1)), k being an arbitrary rank.
ITF index is given by equation (5):
where nb_frame represents the number of frames of the tested part of the sequence.
The above recalled parameters are computed in the analysis block 16 (
Now, the stabilization algorithm implemented in the module 11 of
According to a feature of the invention, two first filters are used, called hereafter filters FILTERA and FILTERB, respectively. During a low-pass “GMV” filtering stage, the “GMV” curve is filtered and accumulated through time over n frames. The correction to be applied is the difference between the original accumulated curve AMV(n):
AMV(n)=SUM(k=1 to n)*(GMV(k)) (6)
and a modified accumulated curve AMVmod(n):
AMVmod(n)=SUM(k=1 to n)(filter(GMV(k))) (7).
Two moving average filtering modes are possible.
With filter FILTERA, a moving average filtering at time t is performed. The
“GMV” is replaced by an average of the M previous “GMVs”, thus filtering the high frequency component of the “GMV” as follows:
GMVfiltt=Average(GMVt . . . GMVt−M) (8)
With filter FILTERB, a double pass moving average filtering at time t is performed: The moving average-filtered “GMV”, GMV_filtt, is once again averaged over the same sampling window:
GMV_double_filtt=Average(GMV_filtt . . . GMV_filtt−M) (9).
This doubly filtered “GMV” will provide an even smoother result than a simple moving average filtering.
Then “Motion Vector Integration” provides the basis for two other motion filters, hereafter FILTERC and FILTERD. In this case, the “GMV” is integrated with a damping factor, and the integrated motion vector (“IMV”) thus generated designates the final motion vector correction to be applied to the current input image to construct the stabilized sequence:
IMVt=α*IMVt−1+GMVt (10)
where α is a damping factor <1 (it is chosen between 0 and 1, depending on the degree of stabilization desired).
According to a feature of the method of the invention, an adaptive damping coefficient α is used in such a way that the following equation is satisfied.
IMVt=α(GMVt+GMVt−1)*IMVt−1+GMVt (11)
The damping factor α depends on the sum of the last two “GMVs”. This allows tracking the beginning of intentional motion. The correspondence table between α and (GMVt+GMVt−1) is built as follows:
Two tables are used:
The loop interaction will be now explained in detail.
The default “Motion Filtering” filter used in the stabilization algorithm is FILTERC. Depending on the Evaluation results, the stabilization algorithm can use different motion filters.
First, the stabilized sequence is given a measure or mark, by the evaluation module 15, more precisely by the analysis block module 16 (
Otherwise, the sequence will be processed using another filter: branch “NO”. The signal on a second output of the comparing module 19 is sent back to a filter action analysis module 18, which also receives parameters Aabove and Abelow.
The switching process is as follows:
If Abelow>T1*Aabove then the filters FILTERA or FILTERB are used:
If Aabove>T2, filter FILTERB is used, otherwise, filter FILTERA is used
If Abelow<T1*Aabove then the FILTERC or FILTERD filters are used:
If Aabove>T3, use FILTERC, otherwise, use FILTERD
where T1, T2 and T3 are thresholds which are experimentally determined.
The result of the above computations is sent to the filter control module 110: signal on output O′15 with a predetermined format, called “Send_Filter_Action message”.
Then, the stabilization algorithm module 11 filters the global camera motion taking into account either the value of signal received from output O15, either the value of signal received from output O′15 of the evaluation module 15, and rejects undesirable jitter.
Thus, according to the most important feature of the method of the invention, said rejection is done by using the intrinsic properties of the sequence of images and evaluating them in order to improve the stabilization.
The optimal use of the invention is to realize the loop interaction using a closed loop.
The initialization frame is referenced IF (upper line) and the first stabilized frame is referenced SF1 (lower line). The first window is referenced W1. For any frame of this window, there are not yet enough processed frames to achieve a correct in-loop stabilization control. From the last frame F of the second window, referenced W2 (dotted lines in
Each window comprises the same number of time slots (each time slot corresponding to a time interval between two consecutive frames), i.e. a same number of frames, but shifts with the time, from one generated frame to the next one. For example, window W1 comprises original frames number 1 (IF) to 6, and stabilized frames 1 to 6, window W2 comprises original and stabilized frames number 2 to 7 (F) etc.
The above-described mechanism allows changing in real-time the motion filtering part of the stabilization algorithm implemented in module 11 (
However, it is to be understood that the present invention is not limited to the aforementioned embodiments and variations expressly described, and modifications may be made without departing from the spirit and scope of the invention. There are indeed numerous ways of implementing functions of the method according to the invention by means of items of hardware or software, or both, provided that a single item of hardware or software can carry out several functions. It does not exclude that an assembly of items of hardware or software or both carry out a function, thus forming a single function without modifying the DIS method in accordance with the invention. Said hardware or software items can be implemented in several manners, such as by means of wired electronic circuits or by means of an integrated circuit that is suitable programmed respectively.
Number | Date | Country | Kind |
---|---|---|---|
05300659 | Aug 2005 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2006/052749 | 8/9/2006 | WO | 00 | 7/23/2010 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2007/017840 | 2/15/2007 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5497191 | Yoo et al. | Mar 1996 | A |
5648815 | Toba | Jul 1997 | A |
5973733 | Gove | Oct 1999 | A |
7024052 | Hirase | Apr 2006 | B2 |
7269220 | Alvarez | Sep 2007 | B2 |
7705884 | Pinto et al. | Apr 2010 | B2 |
7880769 | Qi | Feb 2011 | B2 |
8155195 | Regunathan et al. | Apr 2012 | B2 |
20040012720 | Alvarez | Jan 2004 | A1 |
20040012722 | Alvarez | Jan 2004 | A1 |
20050025243 | Sohn et al. | Feb 2005 | A1 |
20050275727 | Lai et al. | Dec 2005 | A1 |
20070002146 | Tico et al. | Jan 2007 | A1 |
Number | Date | Country |
---|---|---|
2330269 | Apr 1999 | GB |
Entry |
---|
Litvin, A; et al “Probabilistic Video Stabilization Using Kalman Filtering and Mosiacking” Proceedings of the SPIE, SPIE, Bellingham VA, US, vol. 5022, Jan. 20-24, 2003, pp. 663-674. |
Yu-Chun, Peng; et al “Digital Image Stabilization and Its Integration With Video Encoder” Consumer Communications and Networking Conference, 2005. CCNC. 2005 Second IEEE Las Vegas, NV, US Jan. 3-6, 2005, pp. 544-549. |
Number | Date | Country | |
---|---|---|---|
20110037861 A1 | Feb 2011 | US |