Technical Field
The embodiments herein relate to imaging, and more particularly to reconstructing images using one or more sensors.
Description of the Related Art
Imaging refers to a class of inverse problems wherein the central objective is to form the image of an object or scene of interest that is typically being sensed by one or more sensors each furnishing complementary but correlated sources of information, and each of which is potentially contaminated by noise and distortion. What distinguishes this from the more general class of inverse and regression problems is that images originating from typical empirical sources (such as natural images) have a special structure that makes it possible to inject informed prior models into the inference process that can enhance the quality of the reconstructed image. Accordingly, a wide variety of analytical and statistical approaches have proliferated over the past several decades in diverse imaging applications such as radar, medical, and astronomical imaging.
In view of the foregoing, an embodiment herein provides a method of reconstructing an image of an object, the method comprising: determining, by a plurality of sensors, a waveform based on the object, wherein the plurality of sensors view the object from a plurality of directions; determining, by a pre-processing module, a plurality of measurements of the object using the waveform, wherein the plurality of measurements are arranged in a vector form; determining, by an option module, a sampling matrix, a dictionary, and a noise factor, wherein the sampling matrix represents a geometric arrangement of the plurality of sensors, and the dictionary is pre-selected by the option module; estimating, by an estimation module, a coefficient vector using the measurements, the sampling matrix, and the noise factor; and reconstructing, by a reconstruction module, the image, using the coefficient vector and the dictionary.
The estimating the coefficient vector may comprise computing, by a first variable module: a first variable, using a pre-selected non-linear factor; and a multi-scale Gaussian tree structure, using a quad-tree decomposition of the image, the sampling matrix, the dictionary, and the measurements. Estimating the coefficient vectors may further comprise: estimating, by a parameter module, a structural parameter based on a parent-child relationship for each node in the tree structure; repeating, by a loop module: the computing of the first variable and the multi-scale Gaussian tree structure, and the estimating of the structural parameter, across the tree structure, until the structural parameter is lower than a first pre-selected threshold. The estimating the coefficient vectors may further comprise: computing, by a second variable module, a second variable based on the first variable, the sampling matrix, a variable selection operator, and a second pre-selected threshold; and computing, by a coefficient module, the coefficient vector based on a Hadamard product of the first variable and the second variable.
Estimating the coefficient vector may comprise: initializing, by the estimation module, x0=|h−1(ΨTy)| and n=0, wherein x0 is an initial approximation of a temporary parameter, h is a nonlinear factor, Ψ is the sampling matrix, y is a vector comprising the measurements, and n is a loop counter; calculating, by the estimation module, a descent direction dn; determining, by the estimation module, a step size λ; and computing, by the estimation module, xn+1=xn+λdn, wherein xn is an nth approximation for the temporary variable and xn+1 is an (n+1)th approximation for the temporary variable.
The method may further comprise: incrementing, by the estimation module, the loop counter n; computing, by the estimation module, x*, by repeating the calculating the descent direction dn, the determining the step size λ, the computing xn+1, and the incrementing the loop counter n until a norm of a steepest descent vector is smaller than a pre-selected threshold, wherein x* is a temporary variable. The method may further comprise: computing, by the estimation module, z*=h(x*); calculating, by the estimation module,
calculating, by the estimation module, u*=L−1R where L=({tilde over (Ψ)}TΣv−1 {tilde over (Ψ)}+ΛL−1Σu−1ΛL−1)ΛR and R={tilde over (Ψ)}TΣv−1 where vεm is the noise factor; and calculating, by the estimation module, c*=z*⊙u* where c*εn is the coefficient vector. Reconstructing the image may further comprise: calculating, by the reconstruction module, I=Φc* where Iεn is the reconstructed image and Φεd×n is the pre-selected dictionary. The plurality of sensors may be independent from each other, wherein the dictionary may comprise at least one class of wavelet dictionaries, and wherein the sampling matrix may comprise a sampling operator determined by a transmitted waveform associated with the measurements and the geometric arrangement of the plurality of sensors.
Another embodiment herein provides an imaging device comprising: a plurality of sensors configured to generate a waveform based on an object; a pre-processor configured to determine a plurality of measurements using the waveform, wherein the plurality of measurements are arranged in a vector form; and a central imaging device comprising: an option module configured to determine a sampling matrix, a pre-selected dictionary, a pre-selected non-linear factor, a first pre-selected threshold, a second pre-selected threshold, and a noise factor, wherein option module determines the sampling matrix using a geometric arrangement of the plurality of sensors; an estimation module configured to estimate a coefficient vector using the plurality of measurements, the sampling matrix, and the pre-selected dictionary; and a reconstruction module configured to reconstruct an image of the object, using the coefficient vector and the pre-selected dictionary.
The estimation module may further comprise: a first variable module configured to: compute a first variable using a pre-selected non-linear factor; and determine a multi-scale Gaussian tree structure using a quad-tree decomposition of the electronic source image model, the sampling matrix, the pre-selected dictionary, and the measurements. The estimation module may further comprise: a parameter module configured to determine a structural parameter using a parent-child relationship for each node in the multi-scale Gaussian tree structure; and a loop module configured to control operation of the first variable module and the parameter module across the multi-scale Gaussian tree structure until the structural parameter is lower than the first pre-selected threshold. The device may further comprise: a second variable module configured to determine a second variable using the first variable, the sampling matrix, the variable selection operator and the second pre-selected threshold; and a coefficient module configured to determine the coefficient vector using a Hadamard product of the first variable and the second variable.
The estimation module may be further configured to: initialize x0=|h−1(ΨTy)| and n=0, wherein x0 is an initial approximation of a temporary parameter, h is a nonlinear factor, Ψ is the sampling matrix, y is a vector comprising the measurements, and n is a loop counter; calculate a descent direction dn; determine a step size λ; and compute xn+1=xn+λdn, wherein xn is an nth approximation for the temporary variable and xn+1 is an (n+1)th approximation for the temporary variable. The estimation module may be further configured to: increment the loop counter n; compute x* by repeating the calculating the descent direction dn, the determining the step size λ, the computing xn+1, and the incrementing the loop counter n until a norm of a steepest descent vector is smaller than a pre-selected threshold. The estimation module may be further configured to: compute z*=h(x*); calculate
calculate u*=L−1R where L=({tilde over (Ψ)}TΣu−1{tilde over (Ψ)}+ΛL−1Σu−1ΛL−1)ΛR and R=ΨTΣv−1 where vεm is the noise factor and calculate c*=z*⊙u* where c*εn is the coefficient vector. The reconstruction module may be further configured to reconstruct the image by calculating I=Φc* where Iεn is the reconstructed image and Φεd×n is the pre-selected dictionary.
Another embodiment provides a non-transitory program storage device readable by computer, tangibly embodying a program of instructions executable by the computer to perform a method for reconstructing an image of an object, the method comprising: determining, by a plurality of sensors, a waveform based on the object, wherein the plurality of sensors view the object from a plurality of directions; determining, by a pre-processing module, a plurality of measurements of the object using the waveform, wherein the plurality of measurements are arranged in a vector form; determining, by an option module, a sampling matrix, a dictionary, and a noise factor, wherein the sampling matrix represents a geometric arrangement of the plurality of sensors, and the dictionary is pre-selected by the option module; estimating, by an estimation module, a coefficient vector using the measurements, the sampling matrix, and the noise factor; and reconstructing, by a reconstruction module, the image, using the coefficient vector and the dictionary.
Estimating the coefficient vector may comprise computing, by a first variable module a first variable, using a pre-selected non-linear factor; and a multi-scale Gaussian tree structure, using a quad-tree decomposition of the image, the sampling matrix, the dictionary, and the measurements. Estimating the coefficient vectors may further comprise: estimating, by a parameter module, a structural parameter based on a parent-child relationship for each node in the tree structure; repeating, by a loop module: the computing of the first variable and the multi-scale Gaussian tree structure, and the estimating of the structural parameter, across the tree structure, until the structural parameter is lower than a first pre-selected threshold. Reconstructing the image may further comprise using global compound Gaussian model as a statistical prior. Reconstructing the image may further comprise using hierarchical Bayesian maximum a posteriori and using global compound Gaussian model as a statistical prior.
These and other aspects of the embodiments herein will be better appreciated and understood when considered in conjunction with the following description and the accompanying drawings. It should be understood, however, that the following descriptions, while indicating preferred embodiments and numerous specific details thereof, are given by way of illustration and not of limitation. Many changes and modifications may be made within the scope of the embodiments herein without departing from the spirit thereof, and the embodiments herein include all such modifications.
The embodiments herein will be better understood from the following detailed description with reference to the drawings, in which:
The embodiments herein and the various features and advantageous details thereof are explained more fully with reference to the non-limiting embodiments that are illustrated in the accompanying drawings and detailed in the following description. Descriptions of well-known components and processing techniques are omitted so as to not unnecessarily obscure the embodiments herein. The examples used herein are intended merely to facilitate an understanding of ways in which the embodiments herein may be practiced and to further enable those of skill in the art to practice the embodiments herein. Accordingly, the examples should not be construed as limiting the scope of the embodiments herein.
The embodiments herein provide a Hierarchical Bayesian-MAP (Maximum a Posteriori) method for solving inverse problems. An embodiment herein reconstructs an image that is viewed from multiple directions by independent sensors, subject to a global Compound Gaussian (CG) prior placed on the signal to be estimated. An embodiment herein approaches this inverse problem in imaging based on a Hierarchical Bayesian-MAP (HB-MAP) formulation. An embodiment herein describes the basic inverse problem of multi-sensor (tomographic) imaging. Given the measurements recorded by the sensors, a problem may be reconstructing the image with a high degree of fidelity. An embodiment uses a Probabilistic Graphical Modeling extension of the CG distribution as a global image prior into a Hierarchical Bayesian inference procedure. In an embodiment, the global CG model incorporated within the HB-MAP inference procedure is used for solving inverse problems and imaging. The FIB-MAP algorithm solves a difficult non-convex optimization problem. The Monte-Carlo simulations, described herein, show that that this approach works for over a broad range of conditions—high and low sparsity cases—whereas Comprehensive Sensing (CS) and sparsity based approaches work only for high-sparsity cases. An embodiment herein provides a way to utilize the global CG model which subsumes the Laplacian—the basis of compressive sensing and many other statistical models—for solving difficult inverse problems such as imaging. An embodiment herein represents a numerical-technological advancement to realize the application of global CG models for imaging in practice. Embodiments herein can be used for radar imaging (SAR, ISAR etc.), medical imaging, and a broad class of inverse problems involving empirically derived data sources.
Referring now to the drawings, and more particularly to
An embodiment herein provides optimum image reconstruction conditioned on the aforementioned pre-processing operations. An effective forward model can be captured by the following linear system model:
y=ΨΦc+v (1)
where:
yεm is the measurements in vector form,
Ψεm×d is the effective measurement matrix,
I=Φcεn is the vectorized image of interest,
Φεd×n is the dictionary in which one may choose to represent the unknown image I.
cεn is the unknown coefficient vector which is the object of the inference methodology,
and
vεm is the effective measurement noise.
An embodiment herein solves equation (1) by estimating the coefficient vector c given the observed measurements y. The dictionary Φ, with respect to which the image is represented, is chosen a priori and can be, for example, but not limited to, any class of wavelet dictionaries. The dictionary can also be a member of a more general class of over-complete dictionaries. Embodiments herein can be applied to any type of sampling operator Ψ, for example, but not limited to, the Radon transform which underlies some imaging applications. In practice, the matrix Ψ may be determined by both the transmitted waveform and the geometric arrangement of the sensors in space and time.
Different approaches may be used to solve for c in Eq. (1). For imaging applications, the traditional approach is the Filtered Back-projection (FBP) methodology (Deans) wherein the underlying dictionary Φ includes Fourier atoms. A variety of Compressive-Sensing (CS)/Sparsity-based-reconstruction approaches including Large-Scale I1-Regularized Least Squares (I1-Is), Orthogonal Matching Pursuit (OMP). Regularized Orthogonal Matching Pursuit (ROMP), Iterative signal recovery from incomplete and inaccurate samples (CoSaMP), and Bayesian Compressive Sensing (BCS) have been applied for this purpose. An embodiment herein encompasses CS and sparse reconstruction approaches and treat them as special cases.
From Eq. (1) an embodiment herein provides a Bayesian-Maximum a posteriori (MAP) estimate of c. The prefix “Bayesian” emphasizes that probabilities in embodiments herein are assigned without a corresponding notion of sampling or frequency. Furthermore an embodiment herein produces covariance estimates which could be used to determine confidence intervals.
The Bayesian-MAP estimate of c is given as follows:
where, in Eq. (4) v˜(0, Σv=1).
The distribution P(c) encapsulates the statistical prior knowledge about the scene structure. For the specific choice of a Laplacian (i.e. P(c)=exp(−λ∥c∥1)), Eq. (2) reduces to the LASSO methodology or the Basis Pursuit methodology:
c*=argminc{∥y−ΨΦc∥22+λ∥c∥1} (5)
The Laplacian distribution is not sufficiently rich for modeling the statistics of wavelet coefficients of natural scenes when sensed by optical or radar sensors. In an embodiment herein, a probabilistic graphical modeling extension of the Compound Gaussian (CG) distribution is used as a candidate for P(c). This extension subsumes distributions such as, for example, but not limited to, the Laplacian, the generalized Gaussian, and the alpha-stable distribution.
The Bayesian-MAP estimation problem, under the CG modeling, lends itself to a Hierarchical Bayes formulation that can yield superior performance to traditional CS methodologies. In Bayesian statistics, a maximum a posteriori probability (MAP) estimate is a mode of the posterior distribution. The MAP can be used to obtain a point estimate of an unobserved quantity on the basis of empirical data. It is closely related to Fisher's method of maximum likelihood (ML), but employs an augmented optimization objective which incorporates a prior distribution over the quantity one wants to estimate.
In Eq. (1), c can be modeled as a random vector that can be decomposed into the following Hadamard product form:
c=z⊙u (6)
such that:
i) u˜(0, Pu)
z=h(x)
x follows a Multi-scale Gaussian Tree structure
ii) u and z are independent random variables
iii) [z2]=1
iv) h is a non-linearity (which ultimately controls the sparse structure of c).
The measurements 174 may be in a vector form. Imaging device 150 may also include options module 165 configured to determine a sampling matrix 173, a pre-selected dictionary 161, and a noise factor 166. Sampling matrix 173 can be based on a geometric arrangement of sensors 184 collecting data of an object 180. Imaging device 150 may further include estimation module 167 configured to estimate coefficient vector 162 based on measurements 174, sampling matrix 173, pre-selected dictionary 161, and noise factor 166.
Imaging device 150 may further include reconstruction module 169 configured to prepare, a reconstructed image 160 from the measurements 174. In an embodiment, reconstruction module 169 uses any of dictionary 161 and estimated coefficient vector 162 to generate the reconstructed image 160. In an embodiment, dictionary 161 may include at least one class of wavelet dictionaries. Sampling matrix 173 may include a sampling operator determined by the transmitted waveform 152 associated with measurements 174 and the geometric arrangement of the independent sensors 184. Reconstructed image 160 may be transmitted by an electrical communication module 179.
In an embodiment, the estimation module 167 may also include the parameter module 215 configured to determine a structural parameter 223 using a parent-child relationship for each node in the tree structure 233. The estimation module 217 may further include a loop module 217 configured to control operation of the first variable module 213 and the parameter module 215 across the tree structure 233 until the structural parameter 223 is lower than the first pre-selected threshold 201. Estimation module 167 may further include a second variable module 219 configured to determine a second variable 234 using the first variable 211, sampling matrix 173, variable selection operator 243, and the second pre-selected threshold 203. Estimation module 167 may also include a coefficient module 221 configured to determine coefficient vector 162 based on a Hadamard product of the first variable 211 and the second variable 234.
In an embodiment, estimation module 167 may be configured to (a) initialize x0=|h−1(ΨT y)| and n=0, where h is the non-linear factor 209, Ψεm×d is the sampling matrix 173, yεm are the measurements 174, and n is a loop counter; (b) calculating a descent direction dn; (c) determining a step size λ; (d) computing xn+1=xn+λdn; (e) incrementing the loop counter n; (t) computing x* by repeating steps (b) through (e) until a norm of a steepest descent vector is smaller than the first pre-selected threshold 201; (g) computing z*=h(x*); (h) calculating
(i) calculating u*=L−1R, where L=({tilde over (Ψ)}TΣv−1{tilde over (Ψ)}+ΛL−1Σu−1ΛL−1)ΛR and R={tilde over (Ψ)}TΣv−1 where vεm is the noise factor 166; and (j) calculating c*=z*⊙u* where c*εn is the coefficient vector 162.
In an embodiment, the reconstruction module 169 is further configured to reconstruct the image by calculating I=Φc* where Iεn is the reconstructed image, and Φεd×n is the pre-selected dictionary 161.
At step 506, method 500 repeats steps 502 and 504 across the tree structure until the parameter is lower than the first pre-selected threshold 201. In an embodiment, the loop module 217 controls operation of the first variable module 213 and the parameter module 215 across the multi-scale Gaussian tree structure 233 until the structural parameter 223 is lower than the first pre-selected threshold 201. At step 508, method 500 computes the second variable 234 using the first variable 211, the sampling matrix 173, a variable selection operator, and the second pre-selected threshold 203. In an embodiment, the second variable module 219 determines the second variable 234, the sampling matrix 173, the variable selection operator, and the second pre-selected threshold 203. At step 510, method 500 computes the coefficient vector 162 using the Hadamard product of the first variable 211 and the second variable 234. In an embodiment, the coefficient module 221 computes the coefficient vector 162.
At step 608, method 600 may also compute z*=h(x*). At step 610, method 600 calculates
At step 612, method 600 calculates u*=L−1R where
L=({tilde over (Ψ)}TΣv−1{tilde over (Ψ)}+ΛL−1Σu−1ΛL−1)ΛR and R={tilde over (Ψ)}TΣv−1 where vεm is the noise factor. At step 614, method 600 calculates c*=z*⊙u* where c*εn is the coefficient vector. In an embodiment, estimation module 167 performs steps 602 through 614 of method 600.
At step 616, method 600 calculates I=Φc* where Iεn is the reconstructed image and Φεd×n is the pre-selected dictionary 161. The pre-selected dictionary 161 can include at least one class of wavelet dictionaries. The sampling matrix 173 can include a sampling operator determined by a transmitted waveform associated with the measurements 174 and a geometric arrangement of the independent sensors. In an embodiment, reconstruction module 169 reconstructs the image at step 616 of method 600.
x(s)=A(s)x(par(s))+B(s)w(s) (7)
P
x(s)=A(s)Px(par(s))A(s)T+Q(s) (8a)
P
x(s,t)=Γ(s,st)Px(st)ΓT(t,st) (8b)
where, s is node 11 of tree 13 and par(s) refers to the parent 15 of node 11: w(s)˜(0,1), Q(s)=B(s)BT (s); the symbol st refers to the closest common ancestor to nodes s and t on a tree; and where Γ is a state transition matrix defined recursively as: Γ(s, s)=1, Γ(s, t)=A(s)Γ(par(s), t). These coarse-to-fine dynamics implicitly define a probabilistic graphical model in which edges between parent-child nodes represent a jointly Gaussian dependency structure.
The statistics of wavelet coefficients can be modeled effectively by the CG model and related observations in sea-clutter and radar images, as shown in Eq. (6). Given Eq. (6) and property (ii), the covariance of a coefficient and its parent is as follows:
cov[c(s),c(par(s))]=[h(x(s))hT(par(x(s)))]⊙cov[u(s)uT(par(s))]
The fact that the wavelet coefficients of images across scales tend to be decorrelated can be enforced by constraining the Gaussian field u to be corresponding white noise process across the multiscale tree. Furthermore, the variance of the wavelet coefficients across scales follows a self-similar structure. These properties can be accommodated by modeling:
u(s)=D(s)(s)(s)˜(0,1) such that: D(s)=2−γm(s)
where, m(s) is the scale corresponding to node s, and γ>0 is a constant. From Properties (i)-(iii) it follows that the variance of c is controlled by the u-field.
In spite of the above decorrelation across scales, the strong non-Gaussian local dependence of wavelet coefficients that is observed in natural images is captured by the tree dependency structure in the x (and correspondingly, via (i), the z) random field. Finally as mentioned above, the sparse structure of the wavelet coefficients is enforced by the non-linearity h.
In using the Graphical CG model for the purposes of solving Eq. (1), simplifying assumptions can be made: (1) A(s)=A·Id and B(s)=B·Id ∀s where, A, Bε and Id denotes the identity matrix of size d, and (2) the tree structure corresponding to x is homogeneous. From these assumptions, Eq. (7) can be written as:
Letting A≡μ and B≡√{square root over (1−μ2)}, Px(s)=Id ∀s. Thus, a single parameter μ controls the inter-scale dependency structure of the wavelet coefficients.
Though many different sparsity-inducing non-linearities h can be incorporated, in an embodiment herein, the following non-linearity can be useful in terms of analytical properties such as smoothness with respect to the sparsity controlling parameter a:
h(x)=√{square root over (exp(x/α))} (10)
Eq. (10) shows that the sparsity level of the signal is inversely proportional to α. Thus two parameters, α and μ, control the properties of the Graphical CG distribution.
Given the CG model in Eq. (6), the pdf (probability density function) of the wavelet field c is given by:
The structure of the CG distribution results from the summation of a continuum of different Gaussians with different scales (variances)—each of which is weighted by the profile Pz(z). Different choices of the weighting profile Pz(z) result in synthesis of different kinds pdfs (each with different kinds of heavy-tailed behavior)—including many of the well-known distributions in statistics, for example, but not limited to, the Laplacian, Generalized Gaussian, and alpha-stable distributions.
A statistical model with prior distribution π(θ) is said to be hierarchical Bayes if it can be decomposed in terms of conditional distributions
π(θ|θ1),π(θ1|θ2), . . . ,π(θt-1|θt1), and marginal π(θt): π(θ)=∫π(θ|θ1)π(θ1|θ2) . . . π(θt-1|θt1)π(θt).dθ1 . . . dθt (12)
The CG distribution inherently has a hierarchical structure in that conditioning on the latent variable z Gaussianizes the wavelet field c. A fully Bayesian non-parametric approach to statistical inference under a hierarchical Bayesian prior model involves drawing samples from the posterior distribution:
P(c,z|y)∝P(c,z)P(y|c)=P(c|z)P(z)P(y|c) (13)
via Markov Chain Monte Carlo (MCMC) and other sampling methods. The intermediate estimation of the latent variable, the z-field, is referred to herein as Type-II estimation, whereas the estimation of the primary parameter of interest c is referred to herein as Type-I estimation.
An embodiment focuses on a particular family of prior distributions; i.e., the Graphical CG model described above. The embodiment performs inference by a sequence of MAP estimates starting with the latent variable:
z*=argmaxz log P(z|y)
c*=argmaxc log P(c|y,z*)
An embodiment solves the optimization problem presented in Eq. (4) wherein a Graphical CG model is placed on the unknown coefficients c. The first step is to estimate the non-Gaussian component z (Type-Ill estimation), followed by the estimation of u (Type-I estimation). With respect to Type II estimation, given that z=h(x), it suffices to estimate the Multi-scale Gaussian Tree random process x. This can be performed by recourse to the Bayesian-MAP strategy:
x*=argmaxx log P(x|y) (14)
argmaxx log P(y|x)+log P(x) (15)
y={tilde over (Ψ)}(h(x)⊙u)+u where, {tilde over (Ψ)}≡ΨΦy=Axu+v (16)
where, Ax={tilde over (Ψ)}Hx
H
x=diag(h(x))
y|x˜(y;0,AxPuAxT+Σv)
x˜(x;0,Px)
where, (w; μ, Σ) denotes a Gaussian with mean j and covariance matrix Σ. Thus Eq. (16) is equivalent to:
x*=argmaxxƒ(x) (17)
where:
ƒ(x)=yT(AxPuAxT+Σv)−1y+log det(AxPuAxT+Σv)+xTPx−1x (18)
In an embodiment, two types of steepest descent approaches are considered: Gradient and Newton based steepest descent. The Gradient descent involves a first-order Taylor series expansion of the cost function Eq. (18) wherein only the Gradient vector of ƒ has to be computed in each iteration. The Newton descent approach, which involves the second order Taylor approximation of ƒ, in addition entails the calculation of the Hessian matrix. For Newton iterations, once the Hessian is calculated via equation, the closest psd (positive semi-definite) approximation to the Hessian is computed and used for calculating the descent direction as described above. Given a matrix X, an approximation to its closest psd can be calculated as follows:
M=QLQ
T (19)
where,
L≡diag(max(real (Λx), 0))
Q≡√{square root over (QxTQx)}
Λx≡a diagonal matrix of eigenvalues of X
Qx≡matrix of eigenvectors of X
Performing the closest psd approximation of the Hessian matrix X=∇2ƒ(xn), which effectively constrains the search direction to an elliptical ball defined by the matrix M, furnishes descent directions that can improve upon the gradient direction.
With respect to Type I estimation, let x* be the solution of Eq. (17) obtained via Type II estimation. Thus, the estimate of the non-Gaussian component of the unknown coefficient c is z*=h(x*). The goal of the Type-I estimation is to estimate the corresponding u field. The Bayesian-MAP strategy is used to generate the optimality criterion to solve:
Eq. (22) reduces to solving the following equation with respect to u:
ΛL({tilde over (Ψ)}TΣv−1{tilde over (Ψ)})ΛRu=ΛL{tilde over (Ψ)}TΣv−1y (23)
where,
ΛL=ΛR=diag(z*)
The sparse structure of z* can allow pruning the number of equations to be solved:
({tilde over (Ψ)}TΣv−1{tilde over (Ψ)}+ΛL−1Σu−1ΛL−1)ΛRu={tilde over (Ψ)}TΣv−1y (24)
=diag(λ[z*]) (25)
λ is a variable selection operator with respect to threshold λ. The threshold λ is determined from the histogram of z*. The resulting Type-1 estimation methodology can be summarized as follows:
1. Let z* be the solution of the Type-II estimation procedure given measurements y, sampling matrix Ψ, and dictionary Φ
2. Calculate =diag(λ[z*]) for a threshold λ (in eq. (25)) and let ΛR=diag(z*)
u*=L
−1
R
where,
L=({tilde over (Ψ)}TΣv−1{tilde over (Ψ)}+ΛL−1Σu−1ΛL−1)ΛR
R=
{tilde over (Ψ)}
TΣv−1
The matrix prunes out the rows corresponding to values of the z that are too small (as determined by threshold λ).
The HB-MAP methodology can be summarized as follows:
1. Initialize parameters
2. Perform Type-II estimation to obtain z*, the MAP estimate of the z field
3. Given z* perform the approximate EM methodology to estimate the optimum parameter μ
4. Iterate between steps (2)-(3) until convergence of μ
5. Perform Type-I estimation to obtain u*
6. Return the optimum estimate of the image: I*=Φc*, where c*=z*⊙u*
In addition to employing the Type-I and Type-II estimation procedures described above, the HB-MAP methodology also employs an approximate EM methodology that enables the refinement of parameter μ that controls the non-Gaussian inter-scale dependencies between the wavelet coefficients. Let μk be the value of μ in the kth iteration, and let xk* be the corresponding Type-II MAP estimate that solves (14). A 2nd order Taylor expansion of the cost function ƒ about xk* can be computed as follows:
ƒ(x)≈ƒ(xk*;μk)+[∇ƒ(xk*;μk)]T(x−xk*)+0.5(x−xk*)T{∇2ƒ(xk*;μk)}−1(x−xk*) (27)
≈0.5(x−xk*)T{∇2ƒ(xk*;μk)}−1(x−xk*) (28)
since ∇ƒ(xk*; μk)=0 by definition of the fact that xk* is a local optimum point, and ƒ(xk*; μk)≈0 by assumption. Thus p(x|y, μk)≈N(x; xk*, ∇2ƒ(xk*; μk)).
The Q-function of the EM methodology is:
and the gradient function of Q is as follows:
The optimum parameter p can be found as follows:
μ*=argminμQ(μ;μk) (32)
where Eq. (32) can be solved by a simple gradient descent by using the gradient expression given in Eq. (31).
The computationally most intensive part of the methodology is the calculation of the Hessian matrix and (to a lesser extent) the Gradient vector that is required in the steepest descent methodology (Type II estimate). Direct implementation of the Hessian matrix and Gradient vector equations without exploiting the sparse structure of the matrices could be computationally expensive due to the presence of massive tensor products resulting from Kronceker operations on high dimensional matrices. The computational bottleneck can be overcome as follows. The gradient vector is given by the following equation:
∇ƒ(x)=2Px−1x+Gx(ΨTΨTr)vx (33)
where,
G
x=∇xHx[{(PuHx)In}+{In(PuHx)}] (34)
v
x=vec((MxT)−1)−[(Mx−1y)(Mx−1y)] (35)
The following is a procedure for calculating the gradient in Eq. (33).
1. for i=1 to n
2. r=Gx(i,:)
3. index=find locations k where r(k)≠0
4. g=0
5. for j=1 to |index|
6. k=index(j), kmod=mod(k, n), ks=(k−kmod)/n
7. v1=Ψ(ks,:) v2=Ψ(kmod,:)
8. g=g+vxTvec(v2v1T)
9. endfor
10. endfor
11. g=g+2Px−1x
The Hessian form can be expressed as follows:
Λ2ƒ(x)=2Px−1x+Σi=14Mi[N1+N2]+Q1+Q2 (36)
where,
M
1=∇2Hx[(PuHx)In
M
2=∇2Hx[In(PuHx)In]({tilde over (Ψ)}T{tilde over (Ψ)}TIn) (38)
M
3
=∇H
x
[I
n
(PuT{tilde over (K)}nm)][In
M
4
=∇H
x(InPuT)({tilde over (K)}nmIn)[In
furthermore,
N
1=vec(Mx−1)In (41)
N
2=(Mx−1y)(Mx−1y)In (42)
=∇xHx[{(PuHx)In}+{In(PuHx)}] (45)
A property that can enable efficient calculation of the above matrix products is to preserve sparsity of the intermediate matrices products calculated from left to right. First, by construction the matrices ΛHx and Λ2Hx are sparse. Given this, the following Hessian procedure can allow the sparsity of the intermediate matrix products to be preserved while producing the desired matrix products (for example, MiNj ∀i, j).
indicates data missing or illegible when filed
where, N is either N1 or N2 (in Eq. (41-42)) and prod( ) refers to specialized matrix products that use algorithmic techniques similar to that in the gradient calculation algorithm.
In terms of flop count, the Gradient calculation takes O(n.S) and the Hessian calculation takes O(n2.S) (where integer S depends on matrix Pu; for a block-diagonal matrix Pu, S is bounded by a small constant) on a single-core CPU (Central Processing Unit). Each of the iterations in the Gradient procedure and the Hessian procedure are independent of each other and thus can be performed in any order. This key property allows for parallel processing to be efficiently exploited in multicore CPU or GPU (Graphical Processing Unit) platforms.
Newton 21, Gradient Descent 23, and Perfect Knowledge 25 are performance results from execution of the embodiments herein. Line 23 corresponds to the Gradient descent iteration and line 21 corresponds to the Newton iteration. The simulation results show that the Gradient descent iteration and the Newton iterations both have the maximum MSE at 2 trials. Line 25 illustrates the Perfect Knowledge HB-MAP 25 corresponding to the case where the z− field is known exactly, as in simulated Monte-Carlo trials. Thus, the Perfect Knowledge HB-MAP procedure only applies the Type-I estimation to estimate the image. The performance of Perfect Knowledge HB-MAP is therefore a lower bound which no other methodology can surpass.
The threshold λ that is used in the Type-I estimation procedure is determined in these experiments by the coordinate of the histogram of the estimated z field below which 0.1 of the samples is contained. Line 27 corresponding to I1-Is, line 29 corresponding to CoSaMP, line 31 corresponding to ROMP, and line 33 corresponding to BCS illustrate performance results from high sparsity-based convex optimization methodologies. Line 35 corresponding to Fourier illustrates performance results from a standard Fourier-based reconstruction. Simulated images have high-sparsity (i.e. α=0.2). The high sparsity case is more typical of natural images. The performance of the embodiments herein exceeds that of other options.
In an exemplary embodiment, the various modules described herein and illustrated in the figures, for example systems and devices illustrated in
Some components of the embodiments herein can include a computer program product configured to include a pre-configured set of instructions stored in non-volatile memory, which when performed, can result in actions as stated in conjunction with the methods described above. For example components of
The embodiments herein may also include tangible and/or non-transitory computer-readable storage media for carrying or having computer executable instructions or data structures stored thereon. Such non-transitory computer readable storage media can be any available media that can be accessed by a special purpose computer, including the functional design of any special purpose processor, module, or circuit as discussed above. By way of example, and not limitation, such non-transitory computer-readable media can include RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store desired program code means in the form of computer executable instructions, data structures, or processor chip design. When information is transferred or provided over a network or another communications connection (either hardwired, wireless, or combination thereof) to a computer, the computer properly views the connection as a computer-readable medium. Thus, any such connection is properly termed a computer-readable medium. Combinations of the above should also be included within the scope of the computer-readable media.
Computer-executable instructions include, for example, instructions and data which cause a special purpose computer or special purpose processing device to perform a certain function or group of functions. Computer-executable instructions also include program modules that are executed by computers in stand-alone or network environments. Generally, program modules include routines, programs, components, data structures, objects, and the functions inherent in the design of special-purpose processors, etc. that perform particular tasks or implement particular abstract data types. Computer executable instructions, associated data structures, and program modules represent examples of the program code means for executing steps of the methods disclosed herein. The particular sequence of such executable instructions or associated data structures represents examples of corresponding acts for implementing the functions described in such steps.
The techniques provided by the embodiments herein may be implemented on an integrated circuit chip (not shown). The chip design is created in a graphical computer programming language, and stored in a computer storage medium (such as a disk, tape, physical hard drive, or virtual hard drive such as in a storage access network). If the designer does not fabricate chips or the photolithographic masks used to fabricate chips, the designer transmits the resulting design by physical means (e.g., by providing a copy of the storage medium storing the design) or electronically (e.g., through the Internet) to such entities, directly or indirectly. The stored design is then converted into the appropriate format (e.g., GDSII) for the fabrication of photolithographic masks, which typically include multiple copies of the chip design in question that are to be formed on a wafer. The photolithographic masks are utilized to define areas of the wafer (and/or the layers thereon) to be etched or otherwise processed.
The resulting integrated circuit chips can be distributed by the fabricator in raw wafer form (that is, as a single wafer that has multiple unpackaged chips), as a bare die, or in a packaged form. In the latter case, the chip is mounted in a single chip package (such us a plastic carrier, with leads that are affixed to a motherboard or other higher level carrier) or in a multichip package (such as a ceramic carrier that has either or both surface interconnections or buried interconnections). In any case, the chip is then integrated with other chips, discrete circuit elements, and/or other signal processing devices as part of either (a) an intermediate product, such as a motherboard, or (b) an end product. The end product can be any product that includes integrated circuit chips, ranging from toys and other low-end applications to advanced computer products having a display, a keyboard or other input device, and a central processor, and may be configured, for example, as a kiosk.
The embodiments herein can include both hardware and software elements. The embodiments that are implemented in software include but are not limited to, firmware, resident software, microcode, etc. Furthermore, the embodiments herein can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can comprise, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium. Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk and an optical disk. Current examples of optical disks include compact disk-read only memory (CD-ROM), compact disk-read/write (CD-R/W) and DVD.
A data processing system suitable for storing and/or executing program code will include at least one processor coupled directly or indirectly to memory elements through a system bus. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
Input/output (I/O) devices (including but not limited to keyboards, displays, pointing devices, etc.) can be coupled to the system either directly or through intervening I/O controllers. Network adapters may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters.
A representative hardware environment for practicing the embodiments herein is depicted in
The foregoing description of the specific embodiments will so fully reveal the general nature of the embodiments herein that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without departing from the generic concept, and, therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments. It is to be understood that the phraseology or terminology employed herein is for the purpose of description and not of limitation. Therefore, while the embodiments herein have been described in terms of preferred embodiments, those skilled in the art will recognize that the embodiments herein can be practiced with modification within the spirit and scope of the appended claims.
The embodiments described herein may be manufactured, used, and/or licensed by or for the United States Government without the payment of royalties thereon.