The invention relates to the stabilization of an image sequence acquired by an observation device of an imaging system such as a video camera, subject to unintentional movements or vibrations. The observation device typically operates at wavelengths in the visible or infrared region.
A first solution relates to the video cameras on board a terrestrial, naval or aerial platform which are in general mounted on an orientable mount, for example of the theodolite type, or else in a sighting instrument having a mirror with two orientable axes. Servocontrols ensure stabilization of the line of sight by utilizing the information from a gyroscope or from rate gyros secured to the camera.
Another solution concerns portable lightweight cameras of the camcorder or binocular type (intensified or infrared) which are oriented and stabilized manually by the observer but remain subject to the residual erratic movements of the latter. Solutions for stabilizing the line of sight of the camera are implemented by associating two gyrometric sensors secured to the casing of the camera with an opto-mechanical deviator component such as a variable-angle prism or lens off-centered by actuators included in the objective of the camera.
In these two cases, one speaks of “a priori” stabilization.
Other solutions call upon image processing for mutual registration of the successive images of a video sequence. In this case, one speaks of “a posteriori” electronic stabilization. An example of a posteriori stabilization is described in patent FR 2 828 315: low-frequency images are extracted from the images acquired and are used to determine by an optical flow procedure the translations and rotations to be applied to the images so as to stabilize them. “A posteriori” electronic stabilization procedures are not usable in the case of images exhibiting a uniform background structure or a repetitive texture (sky, forest or maritime surface for example).
The aim of the invention is to be able to use a low-cost image stabilization method operating whatever the content of the image (contrasted or not, structured or not, etc.).
The principle of the image stabilization method according to the invention is based on combining a priori stabilization and a posteriori stabilization.
More precisely, it involves using the information arising from at least one gyrometric sensor secured to the camera to calculate approximate shifts between the successive images (a priori stabilization) and then calculating fine shifts between these images by an image processing (a posteriori stabilization) which uses these approximate shifts as initialization.
The first approximate stabilization is obtained independently of the background and of the contrast of the objects in the images.
This first stabilization is obtained by determining the attitude of the camera.
More precisely, the subject of the invention is a method of stabilizing the images of a scene, acquired by means of an observation device of an imaging system, comprising a step of digital processing of a stream of successive images. It is chiefly characterized in that it comprises a step of acquiring gyrometric measurements by means of at least one gyrometric sensor secured to the observation device, of using these gyrometric measurements to determine so-called approximate shifts undergone between successive images, and in that the image processing step comprises a sub-step of using the approximate shifts and the image stream acquired to determine so-called fine shifts undergone between successive images.
The advantage of the hybrid motion estimation is thus to allow stabilization of the image sequence (of the image stream) whatever the scene.
Preferably, the image processing step comprises a sub-step of evaluating the fine shifts with a view to choosing between the approximate shifts and the fine shifts, those which have to be applied to the image stream so as to stabilize it.
According to a characteristic of the invention, the image processing step comprises a sub-step of temporal filtering of the shifts.
Advantageously, the step of determining the fine shifts comprises the steps consisting in:
The shift calculations are obtained for example by correlation.
The subject of the invention is also a device for stabilizing the images of a scene comprising an observation device of an imaging system and an element for digital processing of successive images, characterized in that it furthermore comprises at least one gyrometric sensor secured to the observation device and to the image processing element.
According to a characteristic of the invention, the image processing element comprises a microprocessor able to implement calculations of shift between successive images.
According to another characteristic of the invention, it comprises a first programmable logic array positioned between the observation device and the gyrometric sensor or sensors on the one hand and the image processing element on the other hand, this programmable logic array being able to acquire gyrometric measurements from the sensors and an image stream from the observation device and preferably another programmable logic array positioned at the output of the microprocessor and linked to the observation device and able to apply the shifts provided by the microprocessor to the image stream arising from the first programmable logic array.
According to a variant of the invention, these programmable logic arrays are grouped into a single programmable logic array.
These programmable logic arrays are preferably FPGAs.
Still other objects and advantages of the present invention will become readily apparent to those skilled in the art from the following detailed description, wherein the preferred embodiments of the invention are shown and described, simply by way of illustration of the best mode contemplated of carrying out the invention. As will be realized, the invention is capable of other and different embodiments, and its several details are capable of modifications in various obvious aspects, all without departing from the invention. Accordingly, the drawings and description thereof are to be regarded as illustrative in nature, and not as restrictive.
The present invention is illustrated by way of example, and not by limitation, in the figures of the accompanying drawings, wherein elements having the same reference numeral designations represent like elements throughout and wherein:
Across all the figures, the same elements are tagged by the same references.
An exemplary stabilization device according to the invention is represented in
It comprises a video camera 1 and two gyrometric sensors 2 secured to the camera. The image of the camera comprising pixels distributed in rows (x axis) and columns (y axis); there is preferably one sensor for the pixel rows and another for the columns. These sensors are typically MEMS (acronym of the expression “Micro Electro Mechanical System”) embodied on silicon. They are linked to an FPGA 3 (acronym of the expression Field Programmable Gate Array). It is recalled that an FPGA is a user-programmable prediffused array of logic gates, used for a particular function. The FPGA delivers an image stream and a stream of gyrometric measurements; these streams are synchronized in such a way that the shift measured by the rate gyros is referred to the corresponding image.
These information streams are processed by a microprocessor 4. This processing of the streams of images and synchronized gyrometric measurements consists chiefly of a so-called hybrid motion estimation the aim of which is to provide an estimation of the spurious movements between successive images. More precisely, the microprocessor 4 calculates the shifts to be compensated so as to geometrically register the images with respect to one another. The object of the stabilization is to correct the unintentional movements of the camera in 3D space. Perfect correction is achieved by combining a rotation and translations along the three axes (x,y,z). Within the framework of correcting vibration movements of low amplitudes and high frequencies, a correction in terms of translation along the axes of the image (x and y axes) is sufficient.
At output, the microprocessor 4 provides the calculated shifts to another FPGA 5. This FPGA, the function of which is the geometric transformation of the image stream acquired, then produces a stream of stabilized images. According to a variant, a single FPGA is used to carry out the functions of the two FPGAs.
The various steps of the hybrid estimation that are carried out by the microprocessor 4 and illustrated in
This estimation of shifts can be seen as a hierarchical process producing a first approximate estimation by gyrometric measurements which is thereafter refined by image processing optionally several times, according to increasingly fine resolutions. This principle makes it possible notably to retain low calculational complexity for the process whereas a simple correlation would turn out to be very expensive.
The reliability is for example defined as a function of a minimum threshold on the correlation result and of a minimum threshold on the correlation gradient. The advantage of the hybrid motion estimation is thus to allow stabilization of the image sequence (of the image stream) whatever the scene.
The step of the second level of registration (fine registration) will now be described in greater detail in conjunction with
The current image It and the previous image It-1, are first of all dezoomed, that is to say undergo a step in which the resolution is decreased by a factor N. This step comprises for each image It and It-1 two steps:
a low-pass filtering of the image,
a sub-sampling through which only one pixel is preserved out of N pixels along the two axes.
Many implementations are possible for carrying out such a dezoom. In our implementation, a dezoom by a factor 2 is carried out by simple averaging over 2×2 blocks i.e.:
denoting by:
I the full-resolution image (It or It-1)
ID the dezoomed image (IDt will denote the dezoomed image It and IDt-1 the dezoomed image It-1)
i,j the row and column indices of the pixels in the image.
The dezoomed images IDt and IDt-1 are thereafter correlated to provide a so-called under-resolved estimation of shift.
It is possible to apply several procedures to carry out the correlation. One such will now be described.
The correlation consumes a great deal of calculation power. This is why firstly, the correlation is carried out only on a “window” of reduced size and centered respectively in the images IDt and IDt-1; these windows generally correspond to a zone of interest in the images.
Secondly, any function making it possible to quantify the similarity between a reference image IDt and a secondary image IDt-1 is designated as correlation. For our implementation, we have chosen to use a distance function (i.e. dissimilarity function) of Minkowski metric type, but some other may be chosen:
denoting by:
p the order of the distance,
IDt and IDt-1 the two images to be compared,
i and j the row and column indices of the pixels in the image,
N the number of rows of the image and M the number of columns.
In the implementation carried out, we limit ourselves to a metric of order 1 (p=1).
The principle of estimating the shift between two images is as follows.
The correlation surface or distance surface denoted Nd is constructed by displacing the correlation support for the reference image IDt (the support is a window in the window of the reference image), over a given correlation horizon, in relation to the secondary image and by calculating, for each position, the Minkowski distance:
for: −Hy≦u≦Hy and −Hx≦v≦Hx
denoting by:
SDt and SDt-1 the supports centered in the reference and secondary windows of sizes N×M,
i and j the row and column indices in the support,
u and v the row and column indices in the correlation surface,
Hx and Hy the correlation horizons along x and y.
The indices u,v corresponding to the minimum over the distance surface give the estimated shift between the reference image IDt and the secondary image IDt-1.
The correlation horizons Hx and Hy determine the maximum amplitude of the displacements that are estimatable by image processing. The smaller these horizons, the lower the calculation time and the more reliable the motion estimation since the probability of finding a local minimum of the correlation surface decreases (synonymous with poor estimation), on condition that the chosen horizons are greater than the actual displacements. Having an a priori idea (approximate estimation) of the inter-image shift makes it possible to pre-position the secondary correlation support and therefore to reduce the size of the search horizons.
The hierarchical process is as follows:
The rate gyros make it possible to obtain an approximate shift with a precision of the order of 4 to 6 pixels in a small camera field, obtained as the outcome of very low calculational complexity that is much smaller than that of an exhaustive correlation.
Thereafter, the low-resolution correlation, that is to say carried out on the dezoomed images (cf.
Finally, the high-resolution correlation carried out on the images It and It-1 in place of IDt and IDt-1, and by using as initialization the shifts obtained through the low-resolution estimation of a precision of the order of 2 pixels, makes it possible to obtain an estimation of fine shifts, that is to say a precision of the order of a pixel.
This process can be generalized in the following manner. Having obtained the approximate shifts, the following steps are carried out:
The example of
The hierarchical process therefore operates by successive refinements of the motion estimation and is beneficial in terms of calculation time (the low-resolution correlation being of lesser complexity than the high-resolution correlation) and in terms of reliability.
A last step of auto-evaluation of the correlation quality makes it possible to determine the overall reliability of this process, which reliability is used during merging to take the decision as to whether or not to use this motion estimation by image processing.
The reliability of the motion estimation by image processing is estimated through the qualification of the correlation surface. This involves validating that the correlation peak (correlation maximum or distance minimum Lp) is sufficiently marked. Accordingly, the following information is used simultaneously:
height of the peak, for example by analyzing that the difference, in absolute value, between the height of the correlation peak and the average over the correlation surface is greater than a threshold, curvature on the peak, for example by analyzing for each of the x and y directions, that the difference in absolute value between the height of the correlation peak and the average of its two immediate neighbors is greater than a threshold.
If one of the above two criteria is not satisfied, the correlation is invalidated.
It will be readily seen by one of ordinary skill in the art that the present invention fulfils all of the objects set forth above. After reading the foregoing specification, one of ordinary skill in the art will be able to affect various changes, substitutions of equivalents and various aspects of the invention as broadly disclosed herein. It is therefore intended that the protection granted hereon be limited only by definition contained in the appended claims and equivalents thereof.
Number | Date | Country | Kind |
---|---|---|---|
06 05885 | Jun 2006 | FR | national |
The present Application is based on International Application No. PCT/EP2007/056294, filed on Jun. 25, 2007, which in turn corresponds to French Application No. 06 05885 filed on Jun. 29, 2006, and priority is hereby claimed under 35 USC § 119 based on these applications. Each of these applications are hereby incorporated by reference in their entirety into the present application.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP07/56294 | 6/25/2007 | WO | 00 | 12/29/2008 |