The present invention relates primarily to determining an image depth map of a scene.
An image depth map has many applications, including dimensional analysis of an object in the scene or the segmentation of elements in the scene.
The invention also relates to determining the image with extended depth of field of the scene.
The technical areas covered by the invention are numerous: human-computer interaction (HCI), augmented reality (AR), microscopy, entertainment industry, automotive industry, home automation, dimensional verification, security, and video surveillance are some examples.
There are two categories of 3D imaging devices: passive systems, which do not require projecting light onto the scene in order to determine depth, and active systems, which do. Only passive systems are of interest here.
Document WO 2009/019362 relates to an optical system provided with a device for increasing its depth of field.
The optical system has uncorrected longitudinal chromatic aberration, and the color images obtained are degraded by it. Such an optical device focuses the blue, green, and red components of these color images for objects that are respectively "close", "intermediate", and "distant". By inverting the chromatic aberration on these images, the sharp component is used to reconstruct the other, defocused components. The reconstructed color images thus have a depth of field greater than would be captured with an equivalent optical system in which the longitudinal chromatic aberration is corrected.
This method only considers images composed of three channels, red, green, and blue, and therefore only handles objects at three depths: "close", "intermediate", and "distant".
A method for determining the depth map and the image with extended depth of field of a scene is described in the document entitled "Coded aperture pairs for depth from defocus and defocus deblurring" by Zhou, Lin, and Nayar, published in 2011 in the "International Journal of Computer Vision."
This method is one of the techniques for estimating depth from defocus (DFD). It uses two shots of the same scene, with different coded apertures.
The depth map is obtained by analyzing the defocus variation between the two images and minimizing a cost criterion at each point of the image. An image with extended depth of field of the scene is then estimated from the depth map.
The invention aims to solve the problems of the prior art by providing a method for estimating the image depth map of a scene, characterized in that it comprises the steps of:
- capturing a single image of the scene through an optical system having longitudinal chromatic aberration;
- determining a set of spectral images of the scene from the captured image;
- deconvolving the spectral images, for test depths taken in a test depth domain, in order to provide spectral images with extended depth of field;
- analyzing a cost criterion calculated from the captured image and from the spectral images with extended depth of field, in order to provide an estimated depth map of the scene.
With the invention, it is possible to construct the image depth map of a scene from a single captured image, with a passive single-sensor system.
This embodiment allows real-time operation.
In a preferred feature, the step of determining a set of spectral images comprises the steps of:
- converting the captured image into a mosaic image by means of a matrix image sensor on which a color filter array is placed;
- demosaicing the mosaic image in order to produce a set of filtered images;
- spectral estimation in order to determine the spectral images from the filtered images.
In a preferred feature, the step of analyzing a cost criterion comprises:
- reconstructing, for each test depth, a mosaic image from the spectral images with extended depth of field estimated for that depth;
- calculating, at each point of the image, the squared difference between the captured mosaic image and the reconstructed mosaic image;
- selecting, at each point of the image, the test depth that minimizes this difference, in order to form the estimated depth map.
In a preferred feature, the method further comprises a step of estimating a set of spectral images with extended depth of field of the scene, for the estimated depth map provided in the analysis step.
In a preferred feature, estimating the set of spectral images with extended depth of field of the scene comprises selecting, by iterating on the depth, values of spectral images with extended depth of field provided in the deconvolution step for which the considered depth corresponds to the estimated depth map provided in the analysis step.
In one embodiment of the invention, the spectral estimation step and the spectral image deconvolution step are combined into a single step.
In another embodiment of the invention, the demosaicing step, the spectral estimation step, and the spectral image deconvolution step are combined into a single step, and this step uses images from a database of images to calculate a transfer matrix for converting between the space of the mosaic image and the space of the spectral images.
The invention also relates to a device for estimating the image depth map of a scene, characterized in that it comprises:
- an optical system having longitudinal chromatic aberration, for capturing a single image of the scene;
- a spectral image sensor for determining a set of spectral images of the scene from the captured image;
- a deconvolution module for deconvolving the spectral images, for test depths taken in a test depth domain, in order to provide spectral images with extended depth of field;
- a module for analyzing a cost criterion calculated from the captured image and from the spectral images with extended depth of field, in order to provide an estimated depth map of the scene.
The device according to the invention has advantages similar to those mentioned above.
In a preferred feature, the spectral image sensor comprises:
- a matrix image sensor on which a color filter array is placed, converting the captured image into a mosaic image;
- a demosaicing module producing a set of filtered images from the mosaic image;
- a spectral estimation module determining the spectral images from the filtered images.
In a preferred feature, the module for analyzing a cost criterion comprises:
- a module for reconstructing, for each test depth, a mosaic image from the spectral images with extended depth of field estimated for that depth;
- a module for calculating, at each point of the image, the squared difference between the captured mosaic image and the reconstructed mosaic image;
- a module for selecting, at each point of the image, the test depth that minimizes this difference.
In a preferred feature, the device further comprises a module for estimating a set of spectral images with extended depth of field of the scene, for the estimated depth map provided by the analysis module.
In a preferred feature, the module for estimating the set of spectral images with extended depth of field of the scene is adapted to select, by iterating on the depth, values of spectral images with extended depth of field provided by the deconvolution module for which the considered depth corresponds to the estimated depth map provided by the analysis module.
In a preferred feature, the spectral estimation module and the spectral image deconvolution module are combined into a single module.
In another preferred feature, the demosaicing module, the spectral estimation module, and the spectral image deconvolution module are combined into a single module which is adapted to use images from a database of images to calculate a transfer matrix for converting between the space of the mosaic image and the space of the spectral images.
In one particular embodiment, the steps of the method according to the invention are implemented by instructions of a computer program.
The invention therefore also relates to a computer program on an information medium, the program being suitable for execution in a computer, the program comprising instructions adapted for implementing the steps of a method as described above.
This program may use any programming language and be in the form of source code, object code, or an intermediate code between source code and object code, such as in a partially compiled form, or in any other desirable form.
The invention also relates to an information medium readable by computer and comprising computer program instructions adapted to implement the steps of a method as described above.
The information medium may be any entity or device capable of storing the program. For example, the medium may consist of a storage medium such as ROM, for example a CD-ROM or a microelectronic circuit ROM, or a magnetic recording medium such as a diskette or hard drive.
The information medium may also be a transmissible medium such as an electrical or optical signal, which may be conveyed via electrical or optical cable, by radio, or by other means. The program of the invention may in particular be downloaded over an Internet-type network.
Alternatively, the information medium may be an integrated circuit in which the program is embedded, the circuit being adapted to execute or to be used in the execution of the method according to the invention.
Other features and advantages will become apparent upon reading the following description of a preferred embodiment, given by way of non-limiting example and described with reference to the accompanying figures.
According to a preferred embodiment described with reference to the figures, the device for estimating the image depth map of a scene comprises a longitudinal chromatic optical system 1, that is, an optical system having longitudinal chromatic aberration.
Such a system has the characteristic of maintaining the longitudinal chromatic aberration and minimizing the other aberrations.
Longitudinal chromatic aberration is due to the presence, in the optical system, of optical elements whose materials have refractive indices that depend on the vacuum wavelength of the light ray passing through them. The focal length of the optical system 1 therefore depends on the wavelength that traverses it: it increases with the wavelength, and is thus greater for red than for blue.
In other words, considering a given object point and its image through the longitudinal chromatic optical system, the focusing distance and the image defocus depend on the depth of the object point and on the vacuum wavelength considered. A given object point thus has a spectral signature, in terms of image focus, that is specific to the fixed properties of the system and to the distance of the object point from the optical system.
It should be noted that the longitudinal chromatic optical system has no transverse (lateral) chromatic aberration, and therefore no geometric distortion related to spectral variation of the lens. The focus varies only longitudinally, as a function of the wavelength. Thus, a given object point corresponds to the same image point regardless of the wavelength.
When the longitudinal chromatic optical system 1 is used to capture an image of a scene SC, it carries out a spectral modulation of the image according to the depth and wavelength of the object point corresponding to the image point considered.
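The following minimal Python sketch illustrates this behavior under a thin-lens model with Cauchy dispersion; all numerical values (Cauchy coefficients, radii, distances) are arbitrary assumptions chosen for illustration, not values from the patent.

```python
import numpy as np

# Thin-lens sketch of longitudinal chromatic aberration (illustrative values).
# Cauchy dispersion: refractive index decreases with wavelength, so the focal
# length increases with wavelength (greater for red than for blue).
A_COEFF, B_COEFF = 1.500, 5e-15        # Cauchy coefficients (B in m^2), assumed
R1, R2 = 0.05, -0.05                   # lens surface radii of curvature (m)
SENSOR_DIST = 0.0505                   # fixed lens-to-sensor distance (m)
APERTURE = 0.01                        # aperture diameter (m)

def focal_length(wavelength_m):
    """Focal length from the lensmaker's equation with Cauchy dispersion."""
    n = A_COEFF + B_COEFF / wavelength_m**2
    return 1.0 / ((n - 1.0) * (1.0 / R1 - 1.0 / R2))

def blur_diameter(depth_m, wavelength_m):
    """Geometric diameter of the defocus spot for an object point at depth_m."""
    f = focal_length(wavelength_m)
    image_dist = 1.0 / (1.0 / f - 1.0 / depth_m)   # thin-lens conjugate
    return APERTURE * abs(SENSOR_DIST - image_dist) / image_dist

for lam in (450e-9, 550e-9, 650e-9):               # blue, green, red
    print(f"λ={lam*1e9:.0f} nm  f={focal_length(lam)*1000:.2f} mm  "
          f"blur at 1 m: {blur_diameter(1.0, lam)*1e6:.1f} µm")
```

Running the loop shows the defocus spot varying with both wavelength and depth, which is exactly the "spectral signature" of focus that the invention exploits.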
The longitudinal chromatic optical system 1 has an output connected to an input of a spectral image sensor 2 to which it delivers the result of the spectral modulation of the image.
At this input, the spectral image sensor 2 comprises a matrix image sensor 21 on which is placed a color filter array 22. The color filters may be colored resins, but a person skilled in the art may use other known filters such as interference filters or nanostructured filters. The spatial position and the spectral response of each color filter are optimized to facilitate reconstruction of the desired spectral images. The matrix image sensor 21 and the color filter array 22 allow estimating the spectral composition of the scene in a single image capture.
The matrix image sensor 21 and the color filter array 22 output a mosaic image I_CFA^{Z*(x,y)}(x,y).
The actual spectral images of the scene, denoted EDOFS(x,y,λ), are the geometric projections, onto the plane of the sensor 21, of the rays coming from the scene concerned.
Considering a coordinate system of the image sensor 21, the mosaic image I_CFA^{Z*(x,y)}(x,y) can be expressed mathematically as a function of the actual spectral images of the observed scene, the characteristics of the longitudinal chromatic optical system, the characteristics of the color filter array, and an additive noise, according to the formula:

I_CFA^{Z*(x,y)}(x,y) = Σ_λ T_{f(x,y)}(λ) · [ h_{Z*(x,y),λ} * EDOFS(·,·,λ) ](x,y) + N(x,y)

where:
- Z*(x,y) is the actual depth of the object point imaged at position (x,y);
- EDOFS(x,y,λ) denotes the actual spectral images of the scene;
- h_{Z,λ} is the point spread function of the longitudinal chromatic optical system 1 for depth Z and wavelength λ, and * denotes the two-dimensional spatial convolution;
- T_{f(x,y)}(λ) is the spectral transmission of the color filter of the array 22 placed over the pixel (x,y);
- N(x,y) is the additive noise.
The same notations are retained throughout the description.
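A sketch of this forward model follows, assuming Gaussian PSFs whose width stands in for the depth- and wavelength-dependent response h, and random arrays standing in for the spectral images and filter transmissions; none of these stand-ins come from the patent.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

rng = np.random.default_rng(0)
H, W, N_LAMBDA, N_F = 64, 64, 8, 4           # image size, spectral/filter counts

# Hypothetical inputs: sharp spectral images, CFA transmissions, filter layout.
edofs = rng.random((H, W, N_LAMBDA))         # EDOFS(x,y,λ), assumed known here
cfa_T = rng.random((N_F, N_LAMBDA))          # T_f(λ): transmission of filter f
cfa_map = rng.integers(0, N_F, (H, W))       # f(x,y): filter index per pixel

def psf_sigma(depth, lam_idx):
    """Assumed blur width growing with defocus; stands in for h_{Z,λ}."""
    return 0.5 + 0.3 * abs(depth - lam_idx)  # toy model, not the patent's PSF

def simulate_mosaic(spectral, depth):
    """Forward model: blur each spectral plane, apply the CFA, sum over λ."""
    mosaic = np.zeros((H, W))
    for l in range(N_LAMBDA):
        blurred = gaussian_filter(spectral[:, :, l], psf_sigma(depth, l))
        mosaic += cfa_T[cfa_map, l] * blurred    # per-pixel filter transmission
    return mosaic + 0.01 * rng.standard_normal((H, W))  # additive noise N(x,y)

i_cfa = simulate_mosaic(edofs, depth=3.0)
```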
The output of the color filter array 22 is connected to an input of a demosaicing module 23. The color filter array 22 delivers the mosaic image I_CFA^{Z*(x,y)}(x,y) to said module.
The demosaicing module 23 estimates Nf filtered images by means of a filter DM applied to the mosaic image I_CFA^{Z*(x,y)}(x,y), where Nf is the number of color filters in the array 22. The filtered images are at the resolution of the mosaic image I_CFA^{Z*(x,y)}(x,y), in the reference system associated with the image sensor.
A calculation module determines a linear matrix filter DM that is optimal in the least squares sense. The optimal linear matrix filter DM is determined by minimizing the squared difference between reference filtered images and the filtered images estimated by applying the filter DM to the corresponding mosaic image.
The optimal linear matrix filter DM enables transfer from the space of the mosaic images to the space of the corresponding filtered images.
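A minimal sketch of how such a filter DM could be fitted by least squares follows; the random arrays stand in for a training database, and the patch size and layout are assumptions made for illustration.

```python
import numpy as np

# Least-squares fit of a linear demosaicing matrix DM from a training set:
# mosaic patches -> filtered-image values at each patch center.
rng = np.random.default_rng(1)
P, N_F, N_TRAIN = 5, 4, 10_000            # patch side, filter count, samples

mosaic_patches = rng.random((N_TRAIN, P * P))       # vectorized P×P patches
reference = rng.random((N_TRAIN, N_F))              # reference filtered values

# DM minimizes ||mosaic_patches @ DM - reference||^2 in the least squares sense.
DM, *_ = np.linalg.lstsq(mosaic_patches, reference, rcond=None)

# Applying DM to a new mosaic patch yields the N_F filtered-image estimates.
estimate = rng.random((1, P * P)) @ DM              # shape (1, N_F)
```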
Thus, the demosaicing module 23 allows transfer from the mosaic image I_CFA^{Z*(x,y)}(x,y) to the filtered images. The filtered image SOC_{fi}(x,y) estimated for the i-th filter of the array, with i = 1, …, Nf, is given by the formula:

SOC_{fi}(x,y) = SOC_{fi}*(x,y) + N(x,y)

where SOC_{fi}*(x,y) denotes the actual filtered image.
This formula assumes that the estimated filtered images are given by the sum of the actual filtered images (first term) and a noise term (second term). In this case, the estimation errors related to demosaicing are contained in this additional noise term. This term is not strictly identical to the noise term in the equation of the mosaic image (although the notation is the same). However, it is assumed that the demosaicing algorithm used induces, in each filtered image, a noise term whose statistical properties are identical to those of the noise of the mosaic image. It is this preservation of statistical properties which is exploited here.
The demosaicing thus results in a set of Nf filtered images SOC_{fi}(x,y), for i = 1, …, Nf.
The output of the demosaicing module 23 is connected to an input of a spectral estimation module 24. The demosaicing module 23 delivers the Nf filtered images SOC_{fi}(x,y) to module 24.
The spectral estimation module 24 determines Nλ spectral images by means of a linear matrix inversion filter FS applied to the Nf filtered images. The operation of module 24 is described in detail below.
Each spectral image relates to a wavelength sample. The number Nλ of spectral images depends on the range of wavelengths of the spectral imaging system and on the size of a spectral sample.
In one particular embodiment, the number of filters in the filter array is equal to the number of spectral images desired. In this case, the spectral images are the filtered images produced by the demosaicing module. In other words, demosaicing directly provides the spectral images, and spectral estimation is not necessary.
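Returning to the general case, spectral estimation amounts to inverting the filter responses. A sketch follows, assuming known filter spectral responses T and a Tikhonov-regularized inversion; the patent only specifies a linear matrix inversion filter FS, so the regularization is an added assumption.

```python
import numpy as np

rng = np.random.default_rng(2)
N_F, N_LAMBDA = 8, 12        # fewer filters than bands: ill-posed without
                             # regularization; the Tikhonov term stabilizes it

# Hypothetical spectral responses of the N_F filters, sampled at N_λ wavelengths.
T = rng.random((N_F, N_LAMBDA))

# FS is a linear matrix inversion filter: a regularized pseudo-inverse of T.
reg = 1e-3
FS = np.linalg.solve(T.T @ T + reg * np.eye(N_LAMBDA), T.T)   # (N_λ, N_F)

# At each pixel, the N_λ spectral values are FS applied to the N_F filtered values.
filtered_pixel = rng.random(N_F)
spectral_pixel = FS @ filtered_pixel                           # (N_λ,)
```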
The output of the spectral estimation module 24 is connected to an input of a deconvolution module 25. The spectral estimation module 24 delivers the Nλ spectral images to module 25.
Module 25 applies a deconvolution by means of a linear matrix inversion filter C_{Zt}, which depends on a test depth Zt taken in a test depth domain, applied to the Nλ spectral images.
The deconvolution module 25 determines estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ), for each test depth Zt.
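The patent specifies only that C_{Zt} is a linear matrix inversion filter; a Wiener filter is one plausible instance. A sketch under that assumption:

```python
import numpy as np

def wiener_deconvolve(image, psf, snr=100.0):
    """Deconvolve `image` by `psf` (same shape as image, centered at [0,0])."""
    H_f = np.fft.fft2(psf)
    G_f = np.fft.fft2(image)
    W_f = np.conj(H_f) / (np.abs(H_f) ** 2 + 1.0 / snr)   # Wiener inverse
    return np.real(np.fft.ifft2(W_f * G_f))

# For a test depth Zt, each of the N_λ spectral images is deconvolved with the
# PSF of the chromatic optics at (Zt, λ), yielding the stack EDOF̂S_Zt(x,y,λ).
def deconvolve_stack(spectral_stack, psfs_at_zt):
    return np.stack([wiener_deconvolve(spectral_stack[:, :, l], psfs_at_zt[l])
                     for l in range(spectral_stack.shape[2])], axis=-1)
```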
The output from the deconvolution module 25 is connected to an input of a module 26 for estimating the actual depth map of the scene considered. The deconvolution module 25 delivers the estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ) to module 26.
The output of the color filter array 22 is also connected to an input of module 26. The array 22 delivers the mosaic image I_CFA^{Z*(x,y)}(x,y) to module 26.
Module 26 determines an estimation Ẑ(x,y) of the actual depth map Z(x,y) of the scene considered, based on the minimization of a cost criterion calculated from the mosaic image I_CFA^{Z*(x,y)}(x,y) provided by the array 22 and from the Nλ estimated spectral images EDOF̂S_{Zt}(x,y,λ) provided by the deconvolution module 25.
An output of module 26 is connected to an input of the deconvolution module 25, to provide it with the test depth Zt.
The operation of module 26 is detailed below.
The output of the deconvolution module 25 is connected to an input of an adaptive pixel selection module 27. The deconvolution module 25 supplies the Nλ estimated spectral images EDOF̂S_{Zt}(x,y,λ) to module 27.
The output of the module 26 for estimating the actual depth map of the scene is connected to an input of the adaptive pixel selection module 27. Module 26 supplies the estimation Ẑ(x,y) of the actual depth map of the scene to module 27.
The adaptive pixel selection module 27 determines Nλ estimated spectral images with extended depth of field EDOF̂S_{Ẑ(x,y)}(x,y,λ) adapted to the estimated depth map Ẑ(x,y). The operation of module 27 is detailed below.
In a preferred embodiment, the spectral estimation module 24 and the spectral image deconvolution module 25 are combined into a single module.
In this case, a test depth domain is considered. For a given test depth Zt within this domain, the Nλ estimated spectral images EDOF̂S_{Zt}(x,y,λ) are estimated directly from the Nf filtered images SOC_{fi}(x,y).
This embodiment is based on the assumption that the object scene is planar, perpendicular to the optical axis of the system used, meaning that the object depth is independent of the coordinates (x,y).
We thus obtain the following linear matrix representation:
G_i^{Z*} = H_i^{Z*} · S + N

where:
- G_i^{Z*} is the vector of the Nf filtered images, expressed in Fourier space, for the actual depth Z*;
- H_i^{Z*} is the transfer matrix, for the actual depth Z*, combining the responses of the longitudinal chromatic optical system 1 and of the color filter array 22;
- S is the matrix of the Nλ spectral images with extended depth of field;
- N is the noise matrix.
A least-squares estimate of matrix S is then performed in modules 24 and 25 considered together, according to the formula:

Ŝ_{Zt} = (H_i^{Zt†} · H_i^{Zt})^{-1} · H_i^{Zt†} · G_i^{Z*}

where:
- H_i^{Zt} is the transfer matrix for the test depth Zt, and H_i^{Zt†} its conjugate transpose;
- Ŝ_{Zt} is the matrix of the spectral images with extended depth of field estimated for the test depth Zt.
The estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ) are obtained from the estimate Ŝ_{Zt}, by returning to the image space.
In the case where the operations of modules 24 and 25 are carried out simultaneously, the matrix term (H_i^{Zt†} · H_i^{Zt})^{-1} · H_i^{Zt†}, expressed in Fourier space, represents a linear matrix inversion filter equivalent to the filter FS·C_{Zt} applied when modules 24 and 25 operate separately.
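A per-frequency sketch of this combined estimation follows; the regularization term is an added assumption for numerical stability and is not in the source.

```python
import numpy as np

# Joint spectral estimation + deconvolution (modules 24 and 25 together),
# sketched per spatial frequency: at each frequency u, the N_F filtered-image
# Fourier coefficients G[u] relate linearly to the N_λ EDOF spectral
# coefficients S[u] through a transfer matrix H_zt[u].
def estimate_S(G_f, H_zt, reg=1e-3):
    """G_f: (U, N_F) Fourier coefficients; H_zt: (U, N_F, N_LAMBDA)."""
    U, _, n_lam = H_zt.shape
    S_f = np.empty((U, n_lam), dtype=complex)
    for u in range(U):
        H = H_zt[u]
        # Regularized least squares: (H† H + reg·I)^-1 H† G -- the matrix term
        # equivalent to the filter FS·C_Zt of the separate-module embodiment.
        A = H.conj().T @ H + reg * np.eye(n_lam)
        S_f[u] = np.linalg.solve(A, H.conj().T @ G_f[u])
    return S_f
```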
A mosaic image I_CFA^{Zt}(x,y) is reconstructed in module 26 from the estimated spectral images and for the test depth Zt, according to the formula:

I_CFA^{Zt}(x,y) = Σ_λ T_{f(x,y)}(λ) · [ h_{Zt,λ} * EDOF̂S_{Zt}(·,·,λ) ](x,y)
The pixel-to-pixel squared difference between the mosaic image I_CFA^{Z*(x,y)}(x,y) determined by module 22 and the reconstructed mosaic image I_CFA^{Zt}(x,y) defines the cost criterion:

W_{Zt}(x,y) = ( I_CFA^{Z*(x,y)}(x,y) − I_CFA^{Zt}(x,y) )²

For each position (x,y), the estimated depth Ẑ(x,y) is the test depth Zt that minimizes this criterion:

Ẑ(x,y) = argmin_{Zt} W_{Zt}(x,y)
The set of estimated depth values, for all positions (x,y) of the image, gives the estimated depth map Ẑ(x,y).
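A sketch of the depth-map estimation loop, assuming a forward model `simulate_mosaic` like the one sketched earlier and a precomputed EDOF̂S stack per test depth; the local averaging window used to stabilize the per-pixel criterion is an added assumption.

```python
import numpy as np
from scipy.ndimage import uniform_filter

def estimate_depth_map(i_cfa, edofs_per_zt, test_depths, simulate_mosaic, win=7):
    h, w = i_cfa.shape
    cost = np.empty((len(test_depths), h, w))
    for k, zt in enumerate(test_depths):
        i_rec = simulate_mosaic(edofs_per_zt[k], zt)       # reconstructed mosaic
        # Pixel-to-pixel squared difference W_Zt(x,y), averaged over a local
        # window to stabilize the criterion (window size is an assumption).
        cost[k] = uniform_filter((i_cfa - i_rec) ** 2, size=win)
    z_hat_idx = np.argmin(cost, axis=0)                    # per-pixel argmin
    return np.asarray(test_depths)[z_hat_idx]              # Ẑ(x,y)
```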
Module 27 then determines the Nλ estimated spectral images with extended depth of field EDOF̂S_{Ẑ(x,y)}(x,y,λ) by selecting, iterating on the depth Zt, only the values of the estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ) for which the considered depth Zt corresponds to the estimated depth map Ẑ(x,y).
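A sketch of this selection, assuming the EDOF̂S estimates for all test depths are stacked in memory:

```python
import numpy as np

# Adaptive pixel selection (module 27): for each pixel, keep the EDOF̂S values
# computed at the test depth matching the estimated depth map Ẑ(x,y).
def select_pixels(edofs_per_zt, z_hat, test_depths):
    """edofs_per_zt: (N_Z, H, W, N_λ); z_hat: (H, W), values in test_depths."""
    n_z, h, w, n_lam = edofs_per_zt.shape
    out = np.empty((h, w, n_lam))
    for k, zt in enumerate(test_depths):      # iterate on the depth Zt
        mask = (z_hat == zt)
        out[mask] = edofs_per_zt[k][mask]
    return out                                # EDOF̂S_{Ẑ(x,y)}(x,y,λ)
```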
In another preferred embodiment, the demosaicing module 23, the spectral estimation module 24, and the spectral image deconvolution module 25 are combined into a single module.
This embodiment is based on the fact that it is possible to find a law of transfer from the mosaic image space to another space, here the space of spectral images with extended depth of field, in the form of a linear matrix representation.
As represented in the figures, this embodiment uses images from a database of images to calculate a transfer matrix D_{EDOF}^{Zt} for converting directly between the space of the mosaic image and the space of the spectral images with extended depth of field.
This embodiment is based on the assumption that the scene is a plane in the object space, perpendicular to the optical axis of the system.
The transfer matrix D_{EDOF}^{Zt} is determined, for a given test depth Zt, as the linear matrix filter that is optimal in the least squares sense, by minimizing the squared difference between the spectral images with extended depth of field of the database and the spectral images estimated by applying the matrix to the corresponding mosaic images simulated for the test depth Zt, according to the formula:

D_{EDOF}^{Zt} = argmin_D Σ ‖ S_db − D · G_db^{Zt} ‖²

where:
- S_db denotes the spectral images with extended depth of field from the database;
- G_db^{Zt} denotes the corresponding mosaic images simulated, for the test depth Zt, through the model of the longitudinal chromatic optical system 1 and of the color filter array 22.
The transfer matrix D_{EDOF}^{Zt} depends on the test depth Zt considered.
Thus a plurality of transfer matrices D_{EDOF}^{Zt} are calculated, one for each test depth Zt of the test depth domain considered.
The estimated spectral image with extended depth of field for the test depth Zt is obtained by matrix multiplication of the matrix D_{EDOF}^{Zt} and the mosaic image I_CFA^{Z*(x,y)}(x,y):

EDOF̂S_{Zt}(x,y,λ) = D_{EDOF}^{Zt} · I_CFA^{Z*(x,y)}(x,y)
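A sketch of how the matrices D_{EDOF}^{Zt} could be fitted and applied follows; the random arrays stand in for the database, and the patch-based formulation is an assumption made for illustration.

```python
import numpy as np

rng = np.random.default_rng(3)
P, N_LAMBDA, N_TRAIN = 9, 8, 20_000        # patch side, bands, samples (assumed)

# Learning the transfer matrix D_EDOF^Zt from a database: mosaic patches
# simulated at test depth Zt, paired with the database's sharp spectral values.
mosaic_patches = rng.random((N_TRAIN, P * P))     # simulated at depth Zt
spectral_truth = rng.random((N_TRAIN, N_LAMBDA))  # EDOF spectral values (center)

D_zt, *_ = np.linalg.lstsq(mosaic_patches, spectral_truth, rcond=None)

# Application: one matrix multiplication per pixel transfers directly from the
# mosaic-image space to the EDOF spectral-image space for this test depth.
new_patch = rng.random((1, P * P))
edofs_estimate = new_patch @ D_zt                 # EDOF̂S_Zt at the patch center
```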
As in the first embodiment, a mosaic image I_CFA^{Zt}(x,y) is reconstructed in module 26 from the estimated spectral images and for the test depth Zt, according to the same formula as given above.
The pixel-to-pixel squared difference between the mosaic image I_CFA^{Z*(x,y)}(x,y) determined by module 22 and the reconstructed mosaic image I_CFA^{Zt}(x,y) again defines the cost criterion:

W_{Zt}(x,y) = ( I_CFA^{Z*(x,y)}(x,y) − I_CFA^{Zt}(x,y) )²

and, for each position (x,y), the estimated depth Ẑ(x,y) is the test depth Zt that minimizes this criterion.
The set of estimated depth values at each position (x,y) of the image gives the estimated depth map Ẑ(x,y).
Lastly, in module 27, the Nλ estimated spectral images with extended depth of field EDOF̂S_{Ẑ(x,y)}(x,y,λ) are determined by selecting, iterating on the depth Zt, only the values of the estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ) for which the considered depth Zt corresponds to the estimated depth map Ẑ(x,y).
Step E1 is a step of capturing a scene using the device described above. Only one image capture is required in the context of the invention.
The longitudinal chromatic optical system 1 then delivers to the spectral image sensor 2 a spectral flux modulated as a function of the distance and wavelength for each point of the image.
In the next step E2, the image provided by the longitudinal chromatic optical system is received by the spectral image sensor 2. This provides a set of Nλ spectral images. Step E2 is detailed below.
In the next step E3, the Nλ spectral images are deconvolved in order to provide estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ), for each test depth Zt of a test depth domain.
The next step E4 determines an estimate Ẑ(x,y) of the actual depth map Z(x,y) of the scene considered, based on minimization of a cost criterion calculated from the mosaic image I_CFA^{Z*(x,y)}(x,y) provided by array 22 and from the Nλ estimated spectral images EDOF̂S_{Zt}(x,y,λ) provided by step E3.
The next step E5 is an adaptive selection of pixels in order to estimate a set of spectral images with extended depth of field of the scene, for the depth map Ẑ(x,y) estimated in step E4.
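Putting steps E1 to E5 together, the following sketch chains the helper functions from the previous sketches (deconvolve_stack, estimate_depth_map, select_pixels, simulate_mosaic, and the matrices DM and FS); demosaicing is inlined here, and everything remains an illustrative stand-in for the patent's modules rather than their actual implementation.

```python
import numpy as np

def demosaic(i_cfa, DM, p=5):
    """E22: apply the linear demosaicing matrix DM to every p×p patch."""
    h, w = i_cfa.shape
    pad = np.pad(i_cfa, p // 2, mode="reflect")
    patches = np.stack([pad[y:y + p, x:x + p].ravel()
                        for y in range(h) for x in range(w)])
    return (patches @ DM).reshape(h, w, -1)            # (H, W, N_F)

def pipeline(i_cfa, test_depths, psfs_per_zt, DM, FS, simulate_mosaic):
    filtered = demosaic(i_cfa, DM)                     # E22
    spectral = np.einsum("lf,hwf->hwl", FS, filtered)  # E23: (H, W, N_λ)
    edofs_per_zt = [deconvolve_stack(spectral, psfs_per_zt[k])      # E3
                    for k in range(len(test_depths))]
    z_hat = estimate_depth_map(i_cfa, edofs_per_zt, test_depths,
                               simulate_mosaic)        # E4
    return z_hat, select_pixels(np.stack(edofs_per_zt),
                                z_hat, test_depths)    # E5
```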
In step E21, the image provided by the longitudinal chromatic optical system 1 is converted into a mosaic image I_CFA^{Z*(x,y)}(x,y) by the image sensor 21 and the color filter array 22.
In the next step E22, the mosaic image I_CFA^{Z*(x,y)}(x,y) undergoes demosaicing in order to produce Nf filtered images SOC_{fi}(x,y).
The next step E23 is a spectral estimation in order to determine Nλ spectral images from the Nf filtered images SOC_{fi}(x,y).
In a preferred embodiment, spectral estimation step E23 and spectral image deconvolution step E3 are combined into a single step.
In this case, a test depth domain is considered. For a given test depth Zt, the Nλ estimated spectral images EDOF̂S_{Zt}(x,y,λ) are estimated directly.
This estimation is performed based on the Nf filtered images SOC_{fi}(x,y) provided in step E22.
This embodiment is based on the assumption of a planar object scene, perpendicular to the optical axis of the system used, meaning that the object depth is independent of the coordinates (x,y).
We thus obtain the following linear matrix representation:
G_i^{Z*} = H_i^{Z*} · S + N

where:
- G_i^{Z*} is the vector of the Nf filtered images, expressed in Fourier space, for the actual depth Z*;
- H_i^{Z*} is the transfer matrix, for the actual depth Z*, combining the responses of the longitudinal chromatic optical system 1 and of the color filter array 22;
- S is the matrix of the Nλ spectral images with extended depth of field;
- N is the noise matrix.
A least squares estimate of matrix S is then given by the formula:

Ŝ_{Zt} = (H_i^{Zt†} · H_i^{Zt})^{-1} · H_i^{Zt†} · G_i^{Z*}

where:
- H_i^{Zt} is the transfer matrix for the test depth Zt, and H_i^{Zt†} its conjugate transpose;
- Ŝ_{Zt} is the matrix of the spectral images with extended depth of field estimated for the test depth Zt.
The estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ) are obtained from the estimate Ŝ_{Zt}, by returning to the image space.
In step E4, a mosaic image I_CFA^{Zt}(x,y) is reconstructed from the estimated spectral images and for the test depth Zt, according to the formula:

I_CFA^{Zt}(x,y) = Σ_λ T_{f(x,y)}(λ) · [ h_{Zt,λ} * EDOF̂S_{Zt}(·,·,λ) ](x,y)
The pixel-to-pixel squared difference between the mosaic image I_CFA^{Z*(x,y)}(x,y) determined in step E21 and the reconstructed mosaic image I_CFA^{Zt}(x,y) defines the cost criterion:

W_{Zt}(x,y) = ( I_CFA^{Z*(x,y)}(x,y) − I_CFA^{Zt}(x,y) )²

For each position (x,y), the estimated depth Ẑ(x,y) is the test depth Zt that minimizes this criterion.
The set of estimated depth values for each position (x,y) of the image gives the estimated depth map Ẑ(x,y).
Step E5 then determines the Nλ estimated spectral images with extended depth of field EDOF̂S_{Ẑ(x,y)}(x,y,λ) by selecting, iterating on the depth Zt, only the values of the estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ) for which the considered depth Zt corresponds to the estimated depth map Ẑ(x,y).
In another preferred embodiment, the demosaicing step E22, spectral estimation step E23, and spectral image deconvolution step E3 are combined into a single step.
This embodiment is based on the fact that it is possible to find a transfer law to convert from the space of the mosaic image to another space, here the space of spectral images with extended depth of field, in the form of a linear matrix representation.
This embodiment uses images from a database of images to calculate the transfer matrix D_{EDOF}^{Zt} for converting between the space of the mosaic image and the space of the spectral images with extended depth of field.
This embodiment is based on the assumption that the scene is a plane in the object space, perpendicular to the optical axis of the system.
The transfer matrix D_{EDOF}^{Zt} is determined, for a given test depth Zt, as the linear matrix filter that is optimal in the least squares sense, by minimizing the squared difference between the spectral images with extended depth of field of the database and the spectral images estimated by applying the matrix to the corresponding mosaic images simulated for the test depth Zt, according to the formula:

D_{EDOF}^{Zt} = argmin_D Σ ‖ S_db − D · G_db^{Zt} ‖²

where:
- S_db denotes the spectral images with extended depth of field from the database;
- G_db^{Zt} denotes the corresponding mosaic images simulated, for the test depth Zt, through the model of the longitudinal chromatic optical system 1 and of the color filter array 22.
The transfer matrix D_{EDOF}^{Zt} depends on the test depth Zt considered.
Thus a plurality of transfer matrices D_{EDOF}^{Zt} are calculated, one for each test depth Zt of the test depth domain considered.
The estimated spectral image with extended depth of field for the test depth Zt is obtained by matrix multiplication of the matrix D_{EDOF}^{Zt} and the mosaic image I_CFA^{Z*(x,y)}(x,y):

EDOF̂S_{Zt}(x,y,λ) = D_{EDOF}^{Zt} · I_CFA^{Z*(x,y)}(x,y)
As in the first embodiment, in step E4 a mosaic image I_CFA^{Zt}(x,y) is reconstructed in module 26 from the estimated spectral images and for the test depth Zt, according to the same formula as given above.
The pixel-to-pixel squared difference between the mosaic image I_CFA^{Z*(x,y)}(x,y) and the reconstructed mosaic image I_CFA^{Zt}(x,y) again defines the cost criterion W_{Zt}(x,y), whose minimization at each position (x,y) gives the estimated depth map Ẑ(x,y).
Step E5 then determines the Nλ estimated spectral images with extended depth of field EDOF̂S_{Ẑ(x,y)}(x,y,λ) by selecting, iterating on the depth Zt, only the values of the estimated spectral images with extended depth of field EDOF̂S_{Zt}(x,y,λ) for which the considered depth Zt corresponds to the estimated depth map Ẑ(x,y).
[1]: Article entitled “Generalized Assorted Pixel Camera: Post-Capture Control of Resolution, Dynamic Range and Spectrum”, by Yasuma, F.; Mitsunaga, T.; Iso, D. & Nayar, S. K. (2010), in IEEE Transactions on Image Processing;
[2]: Article entitled “Extended depth-of-field using sharpness transport across color channels” by Guichard, F.; Nguyen, H.-P.; Tessieres, R.; Pyanet, M.; Tarchouna, I. & Cao, F. (2009), in ‘Proceedings of the SPIE—The International Society for Optical Engineering’;
[3]: U.S. Pat. No. 7,626,769, “Extended Depth Of Field Imaging System using Chromatic Aberration”, 2009, Datalogic Scanning;
[4]: Patent WO 2009/019362 “Optical System Furnished With a Device For Increasing its Depth Of Field”, DxO Labs;
[5]: Article entitled "Coded Aperture Pairs for Depth from Defocus and Defocus Deblurring" by Zhou, C.; Lin, S. & Nayar, S. (2011), in International Journal of Computer Vision;
[6]: Article entitled "Statistics of spatial cone-excitation ratios in natural scenes" by Nascimento, S. M. C., Ferreira, F., and Foster, D. H. (2002), in Journal of the Optical Society of America A, 19, 1484-1490;
[7]: Article entitled “Color filters including infrared cut-off integrated on CMOS image sensor” by Frey, L.; Parrein, P.; Raby, J.; Pelle, C.; Herault, D.; Marty, M. & Michailos, J. (2011), Opt. Express 19(14), 13073-13080.
Number | Date | Country | Kind
--- | --- | --- | ---
13 61340 | Nov 2013 | FR | national
Filing Document | Filing Date | Country | Kind
--- | --- | --- | ---
PCT/EP2014/074804 | 11/17/2014 | WO | 00
Publishing Document | Publishing Date | Country | Kind
--- | --- | --- | ---
WO2015/075005 | 5/28/2015 | WO | A
Number | Name | Date | Kind
--- | --- | --- | ---
7626769 | Olmstead et al. | Dec 2009 | B2
20110026909 | Liege et al. | Feb 2011 | A1
20130194375 | Michrowski | Aug 2013 | A1
20130278726 | Muhammad | Oct 2013 | A1
20140267608 | Dhome | Sep 2014 | A1
20160007005 | Konieczny | Jan 2016 | A1
20160098858 | Patkar | Apr 2016 | A1
Number | Date | Country
--- | --- | ---
2009019362 | Feb 2009 | WO
Entry
---
Jerome Dias et al: "Estimation de la profondeur par analyse de nettete", Oct. 4, 2010, XP055127127, Retrieved from the Internet: URL: http://www.pole-ora.com/pages/projets/Journee3D/PoleORA—J3D—14102010—Poster—Dias.pdf [retrieved on Jul. 7, 2014].
Pauline Trouve: "Conception conjointe optique/traitement pour un imageur compact a capacite 3D", Doctoral thesis, Dec. 10, 2012, pp. 1-241, XP055126368, Palaiseau, France. Retrieved from the Internet: URL: http://tel.archives-ouvertes.fr/docs/00/79/73/82/PDF/Trouve—2012.pdf [retrieved on Jul. 2, 2014].
Ferreol Soulez et al: "Joint deconvolution and demosaicing", Image Processing (ICIP), 2009 16th IEEE International Conference on, IEEE, Piscataway, NJ, USA, Nov. 7, 2009, pp. 145-148, XP031628815, ISBN: 978-1-4244-5653-6.
Ayan Chakrabarti et al: "Depth and Deblurring from a Spectrally-Varying Depth-of-Field", Oct. 7, 2012, Computer Vision ECCV 2012, Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 648-661, XP047019055, ISBN: 978-3-642-33714-7; the whole document.
Oliver Cossairt et al: "Spectral Focal Sweep: Extended depth of field from chromatic aberrations", Computational Photography (ICCP), 2010 IEEE International Conference on, IEEE, Piscataway, NJ, USA, Mar. 29, 2010, pp. 1-8, XP031763031, ISBN: 978-1-4244-7022-8.
Benjamin Milgrom et al: "Novel approach for extending the depth of field of Barcode decoders by using RGB channels of information", Optics Express, Oct. 27, 2010, pp. 17027-17039, XP055126443, DOI: http://dx.doi.org/10.1364/OE.18.017027. Retrieved from the Internet: URL: http://www.opticsinfobase.org/oe/abstract.cfm?URI=oe-18-16-17027 [retrieved on Jul. 2, 2014].
International Search Report, dated Jan. 23, 2015, from corresponding PCT application.
Number | Date | Country
--- | --- | ---
20160301911 A1 | Oct 2016 | US