1. Field of the Invention
This invention relates generally to plenoptic imaging systems, and more particularly to reconstruction of images and image components in plenoptic imaging systems.
2. Description of the Related Art
The plenoptic imaging system has recently received increased attention. It can be used to recalculate a different focus point or point of view of an object, based on digital processing of the captured plenoptic image. The plenoptic system also finds application in multi-modal imaging, using a multi-modal filter array in the plane of the pupil aperture. Each filter is imaged at the sensor, effectively producing a multiplexed image of the object for each imaging modality at the filter plane. Other applications for plenoptic imaging systems include varying depth of field imaging and high dynamic range imaging.
However, a plenoptic imaging system is not designed to capture a high fidelity image of the object. Rather, the “image” captured by a plenoptic system is intentionally scrambled in a manner that allows the capture of additional lightfield information, but at the expense of image resolution. The loss of spatial resolution is a well-recognized and debilitating drawback of plenoptic imaging systems. A lenslet array at the conventional image plane spreads light on to the sensor. Therefore each image of a given modality (such as each refocused image, or each wavelength image in multispectral imaging, etc.) is restricted to as many pixels as there are lenslets.
Thus there is a need for approaches to combat this loss of spatial resolution.
The present invention overcomes various limitations by providing approaches to enhance the spatial resolution based on captured plenoptic images. In one aspect, the plenoptic imaging process is modeled by a pupil image function (PIF), which relates the object to the plenoptic image formed by that object. Given the PIF, a better resolution estimate of the object can then be calculated from the captured plenoptic image by effectively inverting the PIF operation (although this may be achieved without actually inverting the PIF function).
In one implementation, the plenoptic imaging process is modeled as a vector-matrix operation [Ivf]=[PIFv][Ivo], where [Ivo] is a vector representing the object (of different components in the object), [PIFv] is a matrix representing the PIF and [Ivf] is a vector representing the captured plenoptic image. Often, the dimension of the object vector Ivo is much larger than that of the plenoptic image vector Ivf, especially if the object vector Ivo represents multiple components within the object, for example from different modalities. That means the [PIFv] matrix is a rectangular matrix, for which no inverse matrix exists. However, the object vector can be estimated from the captured plenoptic image vector by solving a linear inverse problem, namely, computing an estimate [I*vo], such that the reconstruction error ∥[PIFv][I*vo]−[Ivf]∥ is minimized. This will be referred to as PIF inversion.
Such linear inverse problems can be formulated and solved in various ways. For example, different distance measures may be used for the error calculation. Constraints on the object vector may be included as part of the minimization process. For example, if the object vector is composed of different components of the object (e.g., red, green and blue components of the object), the various components may have known relations to each other which could be enforced by using constraints in the minimization process. Different optimization techniques may be used to solve the minimization problem. Design choices of the matrix [PIFv] and selection of the optimization solving technique influence the quality of the estimated object vector [I*vo]. In one approach, the individual lenslets in the lenslet array in the plenoptic imaging system are non-centrosymmetric in order to improve the behavior of the inversion problem.
The PIF can be estimated or approximated using different approaches. A wave propagation approach is more accurate and includes diffraction effects. A geometrical optics approach is faster. The PIF may also be obtained by measurement of the optical response at the detector.
In another aspect, a resolution-enhanced plenoptic imaging system includes a conventional plenoptic imaging system coupled to digital signal processing that performs the PIF inversion calculation.
The basic PIF model described above can also be used for purposes besides increasing the resolution of a captured plenoptic image. For example, the same basic principles can be used to assist in the design of these systems, to calibrate or test these systems, or to provide real-time automatic adjustments of these systems.
Other aspects of the invention include methods, devices and systems corresponding to the concepts described above.
The invention has other advantages and features which will be more readily apparent from the following detailed description of the invention and the appended claims, when taken in conjunction with the accompanying drawings, in which:
The figures depict embodiments of the present invention for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles of the invention described herein.
The figures and the following description relate to preferred embodiments by way of illustration only. It should be noted that from the following discussion, alternative embodiments of the structures and methods disclosed herein will be readily recognized as viable alternatives that may be employed without departing from the principles of what is claimed.
Sample Configuration.
The spatial coordinates (ξ, η) will be used at the object plane, (x, y) at the pupil plane for imaging subsystem 1, (u, v) at the plane of the lenslet array, and (t, w) at the sensor plane. The primary lens is a distance z1 from the object, the lenslet array is a distance z2 from the primary lens, and the sensor is a distance z3 from the lenslet array. In
Ignoring the filter module 125 for the moment, in imaging subsystem 1, the object 150 is imaged by the primary lens 110 to produce an image that will be referred to as the “primary image.” This primary lens 110 may be a camera imaging lens, microscope objective lens or any other such imaging system. The lenslet array 120 is placed approximately at the location of the primary image. Each lenslet then images the pupil of the primary lens to the sensor plane. This is imaging subsystem 2, which partially overlaps with imaging subsystem 1. The image created at the sensor array 130 will be referred to as the “plenoptic image” in order to avoid confusion with the “primary image.” The plenoptic image can be divided into an array of subimages, corresponding to each of the lenslets. Note, however, that the subimages are images of the pupil of imaging subsystem 1, and not of the object 150. In
Image Reconstruction.
Even though this loss of spatial resolution is inherent to plenoptic imaging systems, one of our insights is that the plenoptic image can be processed in a way to reconstruct a higher fidelity image of the object. For example, we will show that the plenoptic imaging system can be modeled by
where IfT,W is the intensity at sensor element T,W; IO,M,N is the intensity from object element M,N; and PIFM,NT,W is the plenoptic imaging system response, which we refer to as the pupil image function or PIF. T,W is the discretized version of sensor coordinates (t,w) and M,N is the discretized version of object coordinates (ξ,η).
Eq. 1A can be written more compactly as
[Ivf]=[PIFv][Ivo] (1B)
where the additional “v” indicates that these are based on vectorized images. That is, the two-dimensional plenoptic image If is represented as a one-dimensional vector Ivf. Similarly, the two-dimensional object Io is represented as a one-dimensional vector Ivo. Correspondingly, the four-dimensional PIF is represented as a two-dimensional matrix PIFv.
The PIF can be calculated and approximated in different ways. We provide two examples in the Appendix. Appendix A approximates a PIF based on wave propagation. Appendix B approximates a PIF based on geometrical optics.
Based on Eq. 1B, an estimate [I*vo] of the object can be calculated by solving the following optimization problem:
minIvo∥[PIFv][Ivo]−[Ivf]∥, subject to constraints on [Ivo] (2)
The solution to this problem may be implemented as a closed-form solution or via iterative algorithms, depending on the choice of the norm (or combination of norms) ∥.∥ and the constraints on [Ivo].
The following are some examples. For the problem minIvo∥[PIFv][Ivo]−[Ivf]∥22 with no additional constraints on [Ivo], the solution is the least-squares solution [I*vo]=[PIF*v][Ivf], where [PIF*v] is the pseudo-inverse of [PIFv], and ∥.∥2 is the Euclidean norm. For the problem minIvo∥[PIFv][Ivo]−[Ivf]∥22+∥[Γ][Ivo]∥22 with regularization matrix Γ (Tikhonov regularization problem), the Tikhonov matrix ([PIFv]T[PIFv]−[Γ]T[Γ])−1[PIFv]TIvf is the solution. For the problem minIvo∥[PIFv][Ivo]−[Ivf]∥22 with upper and lower bounds on individual component of [Ivo], a least square data fitting algorithm can be applied to calculate a solution. For the sparsity constraint regularization problem minIvo∥[PIFv][Ivo]−[Ivf]∥22+λ∥[Ivo]∥0, solutions can be obtained via iterative optimization techniques such as Basis Pursuit or Expectation Maximization. For the L1 regularization problem (LASSO) minIvo∥[PIFv][Ivo]−[Ivf]∥22+λ∥[Ivo]∥1 subject to ∥[Ivo]∥1≦ε the solution can be obtained via convex optimization techniques. Other regularization techniques may be used.
Note that the observation vector [Ivf] in Eq. (1B) is in general not the result of a convolution of the system response matrix with the object vector [Ivo] and therefore direct deconvolution techniques such as Wiener filtering, etc. are not directly applicable.
The PIFv matrix may be obtained via simulation during system design, or by good calibration of each point in the object in a physical system, or may be an approximation to this response. The PIFv matrix may also be updated with techniques such as regularization, total variation, etc., to incorporate approximations or errors in the knowledge of the PIFv matrix.
Noise in the sensor may lead to the need for regularization in the inversion process. For a scientific sensor such as used in microscopes we expect less noise. In one experiment, we obtained a reconstruction by using Eq. 2 in an iterative least squares curve fit function in Matlab to estimate [Ivo] from noisy sensor data [Ivf]. We also incorporated a smoothing procedure in every iteration that removes sharp, high frequency variations in the estimates of the object reconstruction. Other variations of this procedure could be used.
In this example, the object is a 2D object. That is, the object distance is the same for all points in the object. As a result, the whole object should be in-focus, and our corresponding reconstructions are also in-focus. However, the approach described above can also be used to obtain reconstruction for a 3D object for which the object plane is defocused. In that case, some planes intersecting the object will be in-focus and some out-of-focus. If we consider a 3D object where different planes of the object may be in-focus or out-of-focus, then we update the PIFv matrix with the response of object points in x-y as well as z depth. That is, the object vector [Ivo] would contain multiple components rather than a single object, each representing the component of the object at a depth z. The PIFv matrix would also be expanded to include the contribution to [Ivf] from each depth component. The PIFv matrix then contains x-y-z calibration and its inversion would give data for a 3D object. Depth information from other sources can be used to reduce the complexity of the inversion of this 3D response matrix.
Conditioning the PIFv Matrix.
Some techniques for calculating the solution of the PIFv inversion problem may not always create useful results, even in the 2D, noise-less case. One common problem is that an erroneous twin-image solution can dominate the reconstruction. The twin-image solution occurs when the reconstructed image is a combination of the object and its twin rotated through 180 degrees.
This could be caused by symmetry in the response for some object points. In a radially symmetric optical system, a given object point almost always has a diametrically opposite object point which gives a flipped version of almost the same optical response (PIF). This means that the coefficients in the PIFv matrix for one object point are almost identical to the coefficients of another object point that is located diametrically opposite in the optical system at the plane of the lenslet. Thus, there is lack of enough diversity or randomness in the coefficients in the PIFv matrix.
This can be addressed by incorporating asymmetry in the response for the object points across a given lenslet. In one approach, we incorporate asymmetry in the shape/extent or amplitude or phase of the lenslets in the lenslet array.
In one approach, the lenslets themselves are not altered. Rather, a filter or mask having amplitude or phase variations are placed in a conjugate object plane in order to implement such an asymmetry/variation in the response.
In a different approach, one could avoid using asymmetric lenslets and instead use advanced image processing algorithms coupled to prior assumptions, such as object statistics or basis representations of objects, etc., to restrict the solution space for the PIF inversion problem. However, this requires making assumptions about the object. These approaches can also be iterative and therefore computationally expensive.
Multiple Object Components
The description above can be used for reconstruction of a single object. In that case, Ivo represents a single version of the object. For example, Ivo might represent a grayscale image of the object, where each value in Iv, is the grayscale at a different pixel within the object. The PIF matrix then represents the contribution of each pixel object to the captured plenoptic image.
However, Ivo and PIFv can also contain different components. For example, Ivo1 might be the component of the object at wavelength band 1, Ivo2 might be the component of the object at wavelength band 2, and so on. The corresponding “component PIFs” could include PIFv1, PIFv2 and so on, where the component PIFv1 represents the contribution of the component Ivo1 to the captured plenoptic image, component PIFv2 represents the contribution of the component Ivo2 to the captured plenoptic image, etc. The basic equation [Ivf]=[PIFv][Ivo] still holds, except that now the matrix [PIFv]=[[PIFv1] . . . [PIFvK]] and the vector [Ivo]=[[Ivo1]T[Ivo2]T . . . [IvoK]T]T. Eqn. (1B) can then be written as
[Ivf]=[[PIFv1][PIFv2] . . . [PIFvK]][[Ivo1]T[Ivo2]T . . . [IvoK]T]T (3)
The components could be based on wavelength, polarization, attenuation, object illumination or depth, for example. Using the notation of Eq. 1A, if Ivo is a single object, then [PIFv] is a TW×MN matrix. If Ivo is constructed from K components, then [PIFv] becomes a TW×KMN matrix.
In systems where a filter module 125 is used, the different components might correspond to the different filter cells. In one example application, the filter cells have different spectral responses designed so that the filter module can be used to detect certain substances based on these spectral responses. The object components Ivok are chosen as the object contributions in each of the spectral regions. The PIF components PIFvk are then calculated as the system response for each of the spectral regions. Given the captured plenoptic image Ivf, the PIF inversion then yields image estimates for each of the spectral components Ivok. The use of filter cells in the plenoptic imaging system facilitates this process since certain regions of the plenoptic image will be more strongly filtered by one spectral response than another. Constraints on the object components may also be used during the PIF inversion. For example, there may be known correlation between the different spectral components. Alternately, if there is an unfiltered object component, it might be equal to the sum of the other spectral components. Note that Eq. 3 is general enough to account for spatial overlap between the different spectral components.
Other Modalities.
The model of Eqs. (1)-(3) can be used in a number of modes. The description above described a situation where the plenoptic imaging system is used in an operational mode (as opposed to a calibration, testing or other mode). A plenoptic image is captured, and the goal is to reconstruct a high quality image of the original object (or of an object component) from the captured plenoptic image. This will be referred to as reconstruction mode.
This general model can be used for different purposes. For example, it may be used as part of the design process, as shown in
In
In a last variation,
In
Two examples of metrics that may be used for object classification are the Fisher discriminant ratio and the Bhattacharyya distance. The Fisher discriminant ratio (FDR) can be used to evaluate the contrast between two object classes. It is defined as
FDR=(
where
The Bhattacharyya distance can be used to estimate classification accuracy based on a Bayes classifier. The Bhattacharyya distance is a theoretical value which is often used in an exponential function to estimate the upper bound of classification error. In one approach, a higher order equation is used to estimate the classification error Pe based on the Bhattacharyya distance as shown below:
Pe=a1−a2DB+a3DB2−a4DB3+a5DB4−a6DB5 (5)
where the coefficients are given by a1=40.219, a2=70.019, a3=63.578, a4=32.766, a5=8.7172, and a6=0.91875. The Bhattacharyya distance can be calculated using
A support vector machine (SVM) can be used to estimate the hyperplane separating two classes that simultaneously minimizes the classification error and maximizes the geometric margin.
Although the detailed description contains many specifics, these should not be construed as limiting the scope of the invention but merely as illustrating different examples and aspects of the invention. It should be appreciated that the scope of the invention includes other embodiments not discussed in detail above. Various other modifications, changes and variations which will be apparent to those skilled in the art may be made in the arrangement, operation and details of the method and apparatus of the present invention disclosed herein without departing from the spirit and scope of the invention as defined in the appended claims. Therefore, the scope of the invention should be determined by the appended claims and their legal equivalents.
In the claims, reference to an element in the singular is not intended to mean “one and only one” unless explicitly stated, but rather is meant to mean “one or more.” In addition, it is not necessary for a device or method to address every problem that is solvable by different embodiments of the invention in order to be encompassed by the claims.
In alternate embodiments, portions of the invention can be implemented in computer hardware, firmware, software, combinations thereof and other types of processing power. Apparatus of the invention can be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a programmable processor; and method steps of the invention can be performed by a programmable processor executing a program of instructions to perform functions of the invention by operating on input data and generating output. The invention can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. Each computer program can be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired; and in any case, the language can be a compiled or interpreted language. Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, a processor will receive instructions and data from a read-only memory and/or a random access memory. Generally, a computer will include one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits) and other forms of hardware.
In alternate embodiments, portions of the invention can be implemented by modules, which can take different physical forms. The term “module” is not meant to be limited to a specific physical form. Depending on the specific application, modules can be implemented as hardware, firmware, software, and/or combinations of these. For example, the signal processing in
Depending on the form of the modules, the “coupling” between modules may also take different forms. Dedicated circuitry can be coupled to each other by hardwiring or by accessing a common register or memory location, for example. Software “coupling” can occur by any number of ways to pass information between software components (or between software and hardware, if that is the case). The term “coupling” is meant to include all of these and is not meant to be limited to a hardwired permanent connection between two components. In addition, there may be intervening elements. For example, when two elements are described as being coupled to each other, this does not imply that the elements are directly coupled to each other nor does it preclude the use of other elements between the two.
This derivation uses the nomenclature shown in
where λ is the wavelength of imaging,
and the magnification from object to image plane is given by M=−z2/z1. Now consider the integral in Eq. (A1) as follows,
Substituting x′=x/λz2 and y′=y/λz2, the above term becomes
where we have defined the term h′1 for convenience as
Eq. (A1) then reduces to
The field of the primary image formed at approximately the plane of the lenslet array is
Ui(u, v)=∫∫dξdηh1(u,v;ξ,η)Uo(ξ,η) (A6)
where U0(ξ,η) is the field of the object. Substituting Eq. (A5) yields
Substituting ξ′=Mξ and η′=Mη in the above equation yields
Now consider propagation from the lenslet array to the sensor. We assume each lenslet has a diameter D2, focal length f2, pupil function given by P2 and there are M×N such lenslets in the array. The amplitude distribution of the field after the lenslet array may be written as
Using Fresnel transformation to propagate this field to the sensor, which is located at a distance z3 from the lenslet array, yields the field of the plenoptic image Uf
Inserting Eq. (A8) into Eq. (A10) yields
Therefore the intensity of this field Uf at the sensor plane is given by
Considering an object creating spatial incoherence in the object field,
Uo(ξ′,η′)U*o({tilde over (ξ)}′,{tilde over (η)}′)=Io(ξ′,η′)δ(ξ′−ξ′,η′−η′) (A13)
Substituting this into Eq. (A12) yields
This equation can be written as
The assumptions related to incoherence allow us to separate the object intensity I0 and pull it outside the magnitude squared in the equation above. The terms within the squared magnitude are all related to parameters of the optical system such as pupils, impulse responses for the main lens and lenslet, etc. If this is considered the system response, we can see that this term performs a separable operation with the object intensity. Specifically, let
This information about object separability and the lack of a conventional convolution with the system parameters is counter-intuitive. However, following on this separability, the final Eq. (A15) for an incoherent plenoptic image can be re-written in vector-matrix form to enable easy simulation. The term
inside the integral refers to a single point in the object. The second term in the integral is represented by Eq. (A16). Thus, it can be seen that this term is a function of the sensor plane coordinates (t, w) as well as the single object point (ξ′, η′). When the entire integral is performed over ξ′, η′ coordinates, we would effectively sum over all the points in an object and the result would be only a function of sensor coordinates, (t, w), thus forming the basis for Eq. (1) of the main text.
This derivation uses the nomenclature shown in
One geometrics optics approach is based on ray trace matrices. The basic ray trace matrix for the plenoptic imaging system of
where S[ ] is the matrix for free space propagation and L[ ] is the matrix for propagation through a lens. Referring to
is propagation through the primary lens 110 and S[z2] is propagation from the primary lens 110 to the lenslet array 120. {tc
is propagation through the lenslet and S[z3] is propagation from the lenslet to the sensor. For convenience, wavelength dependence is not shown in Eq. B1. Note that there is some bookkeeping and coordinate shifting since some of the components are arrays of elements. Specifically, the filter module is an arrangement of filter cells and the lenslet array is an array of lenslets. Thus, it must be determined which cell in the filter module and which lenslet in the array is traversed by a ray before the correct ray matrices can be applied.
Eq. B1 is just one approach to tracing rays. Other ray-tracing approaches can also be used. For example, exact ray tracing based on applying Snell's law at each surface can also be used.
Regardless of the specific ray tracing method, ray tracing capability can be used to estimate the PIF matrix. Recall that each value in the PIF matrix is the contribution of one object point/pixel to one point/pixel in the plenoptic image. That contribution can be estimated by tracing rays.
One approach is based on Monte Carlo methods. A large number of rays is traced from an object point. The distribution of rays in angular space is representative of the energy distribution from the object. The number of rays hitting different points in the plenoptic image is indicative of the contribution from that object point. This is repeated for many object points. In this way, the contribution from each object point to each plenoptic image point (aka, the PIF matrix) can be estimated.
Consider
This application is a continuation of U.S. patent application Ser. No. 13/398,813, “Design and optimization of plenoptic imaging systems,” filed Feb. 16, 2012. The subject matter of all of the foregoing is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
7949252 | Georgiev | May 2011 | B1 |
8228417 | Georgiev | Jul 2012 | B1 |
8315476 | Georgiev | Nov 2012 | B1 |
8345144 | Georgiev | Jan 2013 | B1 |
8811769 | Pitts | Aug 2014 | B1 |
20020008917 | Daniell | Jan 2002 | A1 |
20040189686 | Tanguay, Jr. | Sep 2004 | A1 |
20080266655 | Levoy et al. | Oct 2008 | A1 |
20090295829 | Georgiev et al. | Dec 2009 | A1 |
20100026852 | Ng | Feb 2010 | A1 |
20110122394 | Kawashima | May 2011 | A1 |
20120300091 | Shroff | Nov 2012 | A1 |
20130128068 | Georgiev | May 2013 | A1 |
20130128087 | Georgiev | May 2013 | A1 |
20130216123 | Shroff | Aug 2013 | A1 |
20130216125 | Shroff | Aug 2013 | A1 |
20160099429 | Bruder | Apr 2016 | A1 |
20160119606 | Horikawa | Apr 2016 | A1 |
Number | Date | Country |
---|---|---|
WO 2010103527 | Sep 2010 | WO |
Entry |
---|
Knoblauch, Daniel et al. “Factorization of Correspondence and Camera Error for Unconstrained Dense Correspondence Applications.” ISVC (2009), pp. 1-10. |
McMillan et al. “Plenoptic Modeling: An Image-Based Rendering System,” Annual Conference on Computer Graphics—SIGGRAPH, 1995, pp. 39-46. |
Bishop, T. et al., “Light Field Superresolution,” 2009 IEEE International Conference on Computational Photography (ICCP), 2009, pp. 1-9. |
Donatelli, M. et al., “Anti-Reflective Boundary Conditions and Re-Blurring,” Inverse Problems, 2005, pp. 169-182, vol. 21. |
Fawzy, Y. et al., “In Vivo Assessment and Evaluation of Lung Tissue Morphologic and Physiological Changes from Non-Contact Endoscopic Reflectance Spectroscopy for Improving Lung Cancer Detection,” Journal of Biomedical Optics, Jul./Aug. 2006, pp. 044003-1-044003-12, vol. 11, No. 4. |
Park, J. et al., “View Image Generation in Perspective and Orthographic Projection Geometry Based on Integral Imaging,” Optics Express, Jun. 9, 2008, pp. 8800-8813, vol. 16. |
United States Office Action, U.S. Appl. No. 13/398,813, Jun. 18, 2013, 17 pages. |
Number | Date | Country | |
---|---|---|---|
20140241615 A1 | Aug 2014 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13398813 | Feb 2012 | US |
Child | 14225397 | US |