A field of the invention is computed tomography. The invention is applicable to various computed tomography techniques, including, for example, X-ray Computed Tomography (CT), single photon emission computed tomography (SPECT), and positron emission tomography (PET)
Computed tomography techniques including X-ray Computed Tomography (CT), single photon emission computed tomography (SPECT), positron emission tomography (PET) are well-established imaging modalities. Conventional image reconstruction in such imaging modalities and others has been performed using a method known as filtered backprojection (MP). More recently, iterative image reconstruction methods have been introduced, with the main motivation being x-ray dose reduction. One goal of the iterative techniques is to lower the dose of radiation experience by a subject being imaged. The lower doses and under sampling (when used, e.g., to reduce dose) provide challenges to compute high contrast and clear images. Another goal of iterative techniques, is to provide high-quality reconstruction from data acquired by advanced detector systems such as photon counting detectors, for which FBP may not provide the desired quality. Recent iterative techniques utilized an image fidelity function to encourage fidelity to the measured data, and an edge-preserving regularization function such as total variation or qGGIVERF that encourages smoothness in image areas with little variation, while recovering sharp boundaries to preserver image resolution.
DeMan et al. U.S. Pat. No. 8,971,599 discloses methods for iterative tomographic reconstruction, Three embodiments are discussed that use a ramp filter as a preconditioner. in a first embodiment, the iteration applies a deconvolution filter and reevaluates to see if completion criteria have been satisfied. This first embodiment uses a change of variables (that is, replacing all instances of the image variable x with another variable z, using the equivalence x=Fz for a filter F) . The approach of the first embodiment is highly similar to the classical formulation. of preconditioning. In a second embodiment, a preconditioner is applied to the simultaneous image update steps of an iterative reconstruction, where the preconditioner approximates the inverse Hessian of the cost function being optimized. In a variation, the preconditioner is applied only to the data, fidelity term (called data fit term in the ‘599 patent) and not to the regularization term. This strategy is intended to accommodate edge preserving priors, which are commonly used in current iterative reconstruction methods. However, by applying the preconditioner in this way, it ceases to be a proper preconditioner. Instead, this approach fundamentally alters the minimization process such that the fixed point of the iteration no longer corresponds to the fixed point of the original update sequence. One drawback of the techniques in the ‘599 patent arises in the common case that statistical weighting matrices are used in the update steps. These weighting matrices degrade the approximation of the ramp filter (preconditioner) as an inverse Hessian operation, and therefore degrade the preconditioning effect. In particular, the presence of the weighting matrix W, which often has large dynamic range and therefore causes the Hesssian of the data fidelity term in the cost function to be poorly conditioned, has a detrimental effect on the rate of convergence of the iterative algorithm, ultimately requiring more iterations to achieve a desired accuracy.
Embodiments of the invention include methods and systems for iterative computed tomography. In preferred methods and systems of the invention, an auxiliary variable is added to the reconstruction process, while retaining all variables in the original formulation. Preferred embodiments apply a weighting operator or filter that causes the Hessian with respect to image of the cost function to be well-conditioned. In preferred embodiments, an auxiliary sinogram variable distinct from both a set of actual image measurements and from a set of projections computed based on the image is applied to iteratively update during the statistical iterative image reconstruction, with applied conditions that causes the Hessian with respect to image of the cost function to be well-conditioned:
Unlike the techniques in the ‘599 patent discussed in the background, preferred methods of the invention do not degrade the approximation of the ramp filter as an inverse Hessian operation. Preferred methods instead relate the image to an auxiliary variable without statistical weighting, but using a ramp filter (or other norm weighting), while the statistical weighting can remain in a different term, relating the measured projection data derived from the CT scan data to the auxiliary variable, without using the ramp filter. More generally, preferred embodiments apply a weighting operator or filter that causes the Hessian with respect to image of the cost function to be well-conditioned. The auxiliary variable is added to the reconstruction process, while retaining all variables in the original formulation. The preferred iterative reconstruction methods use both the image variable and the auxiliary variable, and specifically updates to the image variable depend on the auxiliary variable, but not on the measured projection data, and the updates to the image variable do not need to include the statistical weighting. Also unlike the variations of the ‘599 patent that alter the minimization process such that the fixed point of the iteration no longer corresponds to the fixed point of the original update sequence, methods of the invention maintain mathematical equivalence with the solution of the original update process.
Preferred methods and systems provide for iterative reconstruction tomography images including, for example, CT images, SPECT and PET images. Preferred embodiments can reduce the computation needed to produce an image of desired quality using an iterative algorithm. This can be used to obtain results more quickly using the same computing hardware, or to reduce the cost and/or the size and weight and/or the power consumption of the computing hardware needed to produce the results in a desired time. Because iterative reconstruction can be used to reduce the x-ray dose to which the subject is exposed during a CT scan, and/or improve image quality and reduce artifacts, e.g., due to metal implants, the preferred methods and systems can be used to provide the same benefits at a lower cost and/or at a higher patient throughput and smaller delay for reconstruction compared to traditional CT methods and systems.
Those knowledgeable in the art will appreciate that embodiments of the present invention lend themselves well to practice in the form of computer program products. Accordingly, it will be appreciated that embodiments of the present invention may comprise computer program products comprising computer executable instructions stored on a non-transitory computer readable medium that, when executed, cause a computer to undertake methods according to the present invention, or a computer configured to carry out such methods. The executable instructions may comprise computer program language instructions that have been compiled into a machine-readable format. The non-transitory computer-readable medium may comprise, by way of example, a magnetic, optical, signal-based, and/or circuitry medium useful for storing data. The instructions may be downloaded entirely or in part from a networked computer. Also, it will be appreciated that the term “computer” as used herein is intended to broadly refer to any machine capable of reading and executing recorded instructions, it will also be understood that results of methods of the present invention may be displayed on one or more monitors or displays (e.g., as text, graphics, charts, code, etc.), printed on suitable media, stored in appropriate memory or storage, etc.
Preferred embodiments of the invention will now be discussed with respect to the drawings. The drawings may include schematic representations, which will be understood by artisans in view of the general knowledge in the art and the description that follows. Features may be exaggerated in the drawings for emphasis, and features may not be to scale
A preferred iterative reconstruction module applies an iterative reconstruction with an auxiliary variable and a weighting operator or filter that causes the Hessian with respect to image of the cost function to be well-conditioned. In preferred embodiments, the weighting operator or filter is a ramp filter. Generally, the reconstruction method of preferred embodiments is related to iterative reconstruction techniques that use the Augmented Lagrangian. Augmented Lagrangian methods of iterative reconstruction (IR) converge more quickly than techniques that directly minimize the objective function used in the iterative reconstruction, in particular in the case of weighted or complicated discrepancy terms between the acquired sinogram and the sinogram computed based on the image.
Before discussing introductory concepts and the preferred the iterative reconstruction, symbols used in the following description and the in claims will be defined. The following table defines symbols that are used.
f
2 L(u, f, η)
Denote the reconstructed image by the image variable f. Image variable f can represent a 2D array of numbers in the case of 2D imaging, e.g., cross-sectional CT, or a 3D array of numbers in the case of 3D imaging, e.g., conebeam or helical conebeam CT. As known to artisans, such arrays may be organized in various desired forms in computer memory. Because any data array of any dimension can be arranged into a 1D array, or vector, for the purposes of this disclosure, it will be convenient to consider all data arrays, regardless of dimension, as vectors. It will also be noted, as used in the disclosure and the claims, that a vector approximately proportional to the gradient indicates that the vector is not perpendicular to the gradient (not equal to π/2, and preferably less than π/2).
A typical formulation for conventional iterative reconstruction involves minimization of a cost function to generate the reconstructed image f. This minimization problem and cost ftinction can be generally written as
where fopt is the final reconstructed image. The measured sinogram variable g usually corresponds to a 2D or 3D data array. For example, it would correspond to a 2D data array in the case of 2D imaging, or in the case of 3D helical imaging with a line detector when g is indexed by view angle and detector index. It would correspond to a 3D array, for example in the case of 3D conebeam CT imaging with a multirow or area detector. In some applications, such as dual or multi-energy (also called multispectral) 3D CT, the measured sinogram variable g may correspond to a 4 dimensional array. Again, for the purposes of this disclosure, we will consider g to be a vector.
The measured sinogram variable g is generated from CT data by conventional techniques, such as those described by J. Hsieh, “Computed Tomography: Principles, Design, Artifacts, and Recent Advances, Second Edition, SPIE, Wash. USA, 2009, which may include negative logarithm, and various corrections steps for deviations from ideal models. In the case of iterative reconstruction, fewer corrections may be used in generating the measured sinogram data g, and instead some of the non-idealities and even the negative logarithm operation may be included, if desired, in the modeling incorporated into the iterative reconstruction, as described below. The projection operator R (“Projector”), also known as the system matrix, incorporates the model of the data acquisition process. For example, it might account for the line integral measurement, for the shape of the X-ray beam and of the focal spot, for the shape of the voxel or other basis function used to represent the image, for the geometry of the CT scanner, for the shape and properties of the detector, etc. Given an image f, the quantity Rf is the set of measurements (also called “sinogram” or “projections”) computed based on the image f. The function h(Rf,g) is a measure of the discrepancy between the acquired projection data g and the computed projection data Rf generated by the current image estimate off using the projection operator R. Accordingly, the first term in the cost function in (1) is sometimes called a data fidelity term. The adjoint RT of R is a backprojector operator. This backprojection operator may however, be somewhat different from conventional approximate backprojection operators used in filtered back projection algorithms, which may not have an exact adjoint relationship to a desired projector R.
Examples of h include a negative log-likelihood term, such as a Poisson-likelihood, a weighted sum-of-squared deviations, or other nondecreasing and typically convex functional, such as a weighted sum of generalized Huber functions of elements of the difference g−Rf. With such or other choices of h, it too can be used to express physical and/or statistical modeling aspects of the CT data acquisition, which may result in g not exactly corresponding to true projections or sinogram, as described by J. Hsieh, “Computed Tomography: Principles, Design, Artifacts, and Recent Advances, Second Edition, SPIE, Wash. USA, 2009. The step of generating g from the CT data may be even reduced, if desired, to no operations (i.e., using the actual CT data in unmodified form), by appropriate choice of function h (e.g, incorporating desired transformations or corrections into h), although it is usually computationally advantageous to perform at least some of these transformations and corrections in a separate step of generating g.
The Φ(f) term in (1) is a regularization functional that represents prior information about the image f. Common choices for Φ(f) encourage smoothness in the image by penalizing local differences, that is, differences between voxels in a small neighborhood about each voxel. Typically non-quadratic penalties, with notable examples including generalized Huber functions of the local differences, the so-called qGGMRF (random field) penalty, total variation, or other penalties that promote sparsity of the local differences or of some transform of the image, are used to introduce an edge preserving property to the regularization, or capture additional properties of the images of interest.
The iterative reconstruction algorithm involves updates to the algorithm variables to progressively perform the minimization in (1). A stopping criterion is employed to terminate the iterative algorithm. The stopping criterion might include, for example, running for a fixed number of iterations, examining the discrepancy (e.g., some norm of the difference g−Rf) between the measured sinogram g and computed measurements (projections) Rf, or detecting when the maximum absolute value of an element (pixel or voxel, as the case may be) of an update of the image f is below some threshold.
As a specific example, we consider the penalized weighted least squares (PWLS) formulation for h, although the method described here is generally applicable to other functions. The PWLS cost function and associated optimization problem are
The weighted squared norm ∥ν∥Γ2 of a vector ν denotes the quantity νTΓν, for a positive-definite symmetric operator or matrix Γ, where νT denotes the transpose of vector v, or equivalently ∥ν∥Γ2=<Γν, ν>, where <‘,’>denotes the Euclidean inner product between vectors, and Γν is the application of the linear operator Γto ν. (Alternatively, the square of the weighted norm can be implemented as ∥Γ1/2ν∥22=<Γ1/2ν, Γ1/2ν>, where Γ1/2 is a square root of operator Γ, and satisfies (Γ1/2)TΓ1/2=Γ, where AT denotes the adjoint of operator A.) The notation for the norm simplifies to ∥ν∥2 when the weight operator is the identity operator. Here too, as in the rest of this disclosure, vector v may represent a physical multidimensional array. In Equation (2) W is a diagonal matrix that encapsulates the statistical or other modeling information about the data g. For example, the elements on the diagonal of W can be set to the inverse of an estimate of the variance of each measurement gi, as follows
Wii=I0e−g
where I0 denotes the number of incident photons when measured in an air scan, with or without a bow-tie filter as appropriate. Alternatively, W can be set to some other weighting that represents the confidence about particular measurements. Another typical usage is to down-weight rays passing through highly attenuating objects such as metal.
Direct minimization of the cost function in (1) can be challenging, due to the large scale of the problem along with the presence of the weighting matrix W, which often has large dynamic range and therefore causes the Hesssian of the data fidelity term in the cost function to be poorly conditioned, or other aspects of h(Rf−g), which make evaluation of it or of its gradients computationally demanding, and/or require many iterations to approach convergence. A variable splitting technique addresses these issues. The variable splitting technique converts the unconstrained minimization of (1) to the constrained minimization problem:
such that u=Rf
This is in turn converted back to an unconstrained minimization problem using the Augmented Lagrangian (AL) (with completion of the square)
Here u is an auxiliary sinogram-domain variable, representing an approximation to the measurements Rf computed based on the image. Variable η is another auxiliary sinogram variable, serving as a Lagrange multiplier (also known as dual variable). We refer to the first term in Equation (5) as the sinogram discrepancy term, and to the third term as the image discrepancy term. Significantly, the constant μ only affects the rate of convergence of the algorithm, not the final resulting image at the point of complete convergence.
The AL can be used in the Augmented Lagrangian Method (ALM), which involves the iteration
ηn+1=ηn−(un+1−Rfn+1) (7)
which is repeated for increasing values of n starting from some appropriate initial values at n=0, until some convergence criterion is met. Here symbols with superscript n denote the values of the corresponding variable in iteration number n. Typical choices for initial values for the variables are to set f0 as the FBP reconstruction of sinogram data g, set u0=Rf0, and set η0=0.
Variations on the ALM include algorithms that perform the minimization in (6) approximately by block coordinate descent, alternating between minimization with respect to f and u, with the other variable held fixed [J. Eckstein, “Augmented Lagrangian and Alternating Direction Methods for Convex Optimization; A Tutorial and Some Illustrative Computational Results,” RUTCOR Research Report RRR 32-2012, December 2012]. Typically, the most challenging minimization is the one with respect to f, because of the presence of R.
Auxiliary Variable Preferred Method
The present inventors have recognized that the image discrepancy term, i.e., the norm term in (5) can be replaced by a weighted two-norm ∥·∥Γ2 or even semi-norm with a positive non-negative definite operator Γ, which can provide advantages in implementation or in convergence rate. Depending on the choice of weighting operator Γ, existing theoretical convergence analyses for the resulting optimization algorithms may not exactly apply, but empirical results can still be favorable. In preferred embodiments, particular choices of weighting operators provide advantages. Advantageous choices for Γ are those for which the Hessian ∇f2L(u, f, η)=RTΓR+∇f2Φ(f) of the AL with respect to f (which is well-defined when Φ is twice differentiable) or the Hessian RTΓR with respect to f of the image discrepancy term (last term) of the AL, are well-conditioned (e.g., approximately equal or approximately proportional to the identity operator). This will speed up convergence of the minimization with respect to f, and therefore reduce the computation for the entire iterative algorithm.
Example advantageous choices of Γ that improve the conditioning of the composite operator RTΓR are described next. Consider data in the projection domain denoted by q(t, θ) for 2D (cross-sectional) imaging, or by q(t, θ, r) for 3D imaging, where t denotes the detector index along a detector row, θ denotes view (or x-ray source) angle, and r denotes detector row index. Then, one favorable choice for the operator Γ is the ρ operator, which performs a 1-D filtration (in the t variable) along each row in the projection domain. The frequency response of the ρ operator is the so-called ramp filter response |ω| for frequency variable ω. Owing to the length of the impulse response of this filter in the spatial domain, the ρ operator is typically applied in the frequency domain, using the FFT (Fast Fourier Transform), as is done with conventional filtered backprojection reconstruction. An alternative realization of the ρ operator is to use lower order IIR (Infinite Impulse Response) or FIR (Finite Impulse Response) filters applied in the projection domain that approximate the |ω| response at lower computational cost.
This choice Γ=ρ of weight operator in the AL produces the ρ-weighted AL Lρ, given in the following Equation (8)
This leads to the Hessian of the image discrepancy term (the third term) in the ρ-weighted AL (8) being equal to RTρR, which is the identity operation for parallel beam geometries in the continuous variable formulation, or an approximation of the identity operation in the discrete and divergent beam (e.g. fan and cone, with circular or helical) geometries. Thus, minimization of (8) with respect to f is approximately preconditioned by this construction.
Artisans will appreciate that the auxiliary sinogram variable u is distinc from both the measured sinogram g, and from the measurements computed based on the image, Rf.
The utility in this choice of weight can be demonstrated, with the specific example of the PWLS (Penalized Weighted Least Squares) choice for h, and a particular AL-based scheme, although the technique is applicable to other cost function formulations, and other AL-based schemes.
The Alternating Direction Method of Multipliers (ADMM) is a technique that can be interpreted as a particular variant of ALM, involving alternating minimization, which is used to make finding the solution more tractable.
ηn+1=ηn−(un+1−Rfn+1) (11)
Here, the second term in the cost functions of (9) and (10) is called the image discrepancy term, and the first term in the cost function in (10) is called the sinogram discrepancy term.
The preferred updating with p weighting is labelled “ADMM- ρ”. The most challenging step from a computational standpoint is the image update (9) which itself must usually be solved by an iterative method. We call these iterations “inner iterations”, and refer to the iterations (for increasing n) over the steps in (9)-(11) as “outer iterations”. Through the choice of a ρ weighting, the optimization problem has been implicitly preconditioned. For example, approximating a solution to (9) using a conjugate-gradients (CG) method involves calculating the gradient of the cost function in (9) with respect to f, yielding
∇fΦ(f)+μRTρ(Rf−un+ηn) (12)
which contains the term RTρR, which as described before, is a well-conditioned matrix (or operator), approximating an identity operation. Because the convergence rate of CG is affected by the conditioning (more specifically, the clustering of eigenvalues) of the Hessian of the cost function, the convergence of the inner iterations for minimization of (9) will be accelerated by the ρ weighting, resulting in a speedup of the entire process.
Other minimization strategies can be employed in minimizing (9). The ordered subsets (OS) method approximates the gradient calculation of (12) by using only a subset of views, the subsets indexed by in m=1, . . . , M,
∇fΦ(f)+μMRmTρ(Rmf−umn+ηmn) (13)
Each of the M subsets contains
of the original projection angles. The subscript m in (13) is understood to represent the appropriate subset of views in the corresponding variable/operator. Different subsets are used in successive iterations, cycling through them in some desired order. Owing to the dominance of the operator R versus Φ computationally, the calculations in (13) are approximately M times faster than using the full sinogram data in (12).
The introduction of ρ weighting increases somewhat the complexity of the minimization problem in (10), whose solution has the form
un+1=(W+μρ)−1(μρ(Rfn+1+ηn)−Wg) (14)
The operation (W+μρ) is difficult to invert directly, and similar to (9), an iterative process with a few “inner iterations” is also required, using the gradient with respect to u of the cost function in (10), which is given by
Wu+μρ(u−Rfn+1−ηn)−Wg (15)
However, unlike finding the solution of (9), the only operators that need to be applied in the iterative solver used to compute the update in (14) are the diagonal W and the ρ weightings, which are significantly cheaper computationally than the projector R, or backprojector RT, especially if low order projection-domain filter approximations to ρ are used. Finallly, the step in (11) is a simple update and requires negligible computation compared to the other steps.
Alternatively, an additional variable split s=u can be introduced to detangle the application of the W and ρ weightings, allowing for closed-form updates of the auxiliary variables. The ρ-weighted AL formulation in this case is
Variable s is another singoram-domain auxiliary variable, which represents an approximation to the auxiliary sinogram variable u, and which is distinct from all three quantities g, u, and Rf. The ρ-weighted AL with the indicated additional variable split in (16) is useful more generally when h(s, g) is a relatively complicated function of s, for example when h(s, g) comprises a statistical model for g such as a negative log likelihood for a Poisson measurement model in SPECT or PET.
Again, applying the ALM algorithm to the ρ-weighted augmented Lagrangian in (16), the updates for the different variables can be performed one at a time, in sequence. For the ADMM applied to this AL, the update sequence for one iteration consists of three minimization steps, with respect to the variables f, u, and s, respectively, and two updates for the Lagrangian variables ηu, ηs.
ηun+1=ηun−(un+1−Rfn+1) (20)
ηsn+1=ηsn−(sn+1−un+1) (21)
The update for u involves inverting (μ2I+μ1ρ), which is a shift invariant filter and can be efficiently inverted with a pair of FFT applications, or the inverse filter can be approximated by a low order digital filter. The update for s requires, in the PWLS case, inverting (W+μ2I), which is diagonal and has inverse consisting of the inverse of each element along the diagonal. Finally, the two update steps of the Lagrangian variables are each similar to (11) and are similarly simple and require negligible computation compared to the other steps.
A similar additional variable split z=f, using an axillary image variable z that is distinct from f, can be performed in the image domain to detangle the application of the R operator and the regularization calculations Φ, resulting in
Applying the ALM algorithm to the ρ-weighted augmented Lagrangian in (22) yields simpler updates than the original algorithm.
ηun+1=ηun−(un+1−Rfn+1) (26)
ηzn+1=ηzn−(zn+1−fn+1) (27)
The update for variable f involves a quadratic regularization term instead of the typically non-quadratic Φ function. The update for variable z does involve Φ, but does not require the R operator at all, leading to a denoising-like update step, which can be solved efficiently. These additional splits in Equations (16) and (22) can be combined, although they are described separately here for clarity.
The stopping criteria for the iterative method can be modified to use the new variables. For example, the sinogram variable u can be compared to the actual measurements g, or to calculated measurements Rf. Or in the case of using the additional auxiliary sinogram variable s as in (16), it can be compared to g, or u can be compared to Rf. The comparison between variables might involve using a desired function of the variables, such as the function h, or a norm or weighted norm. Additionally, the amount of change in any of these variables in a given iteration can be calculated and compared to a threshold. For example, the values of the sinogram discrepancy term and the image discrepancy term may be compared, or one or both terms may be monitored for sufficiently small change from one iteration to another.
The ADMM algorithm is not the only method by which the AL can be minimized. One such alternative is the alternating minimization algorithm (AMA). AMA minimizes the AL through a sequence of simpler update steps. AMA applied to the ρ-weighted AL in (8) yields the AMA-ρ procedure
ηn+1=ηn−(un+1−Rfn+1) (30)
Here the notation (a, b)ρ=aTρb denotes the ρ-weighted inner product, whereas (a, b)2=aTb denotes the regular (Euclidean) inner product. In (28), we refer to the inner product term as an image discrepancy term, whereas in (29) the image discrepancy term is the third term. This sequence of steps has a simpler form in (28) for the update in the image variable f than in (9): due to the non-quadratic regularizer Φ, the respective updates will require an iterative strategy, but here the backprojector RT need only be applied once during the minimization with respect to f (because ηn remains fixed) whereas in (9) both R and RT are applied in every iteration in minimizing f. Updates to u and η remain comparable to the ADMM algorithm.
Preferred Variations with the Auxiliary Variable Iteration Methods and the ρ-weighted AL
Linearized ADMM-ρ Embodiment
The so-called Linearized ADMM framework (also known as the split inexact Uzawa method) can also benefit from the weighting with ρ. A preferred linearized ADMM (L-ADMM) approach modifies the image update step in (9) by adding an additional penalty (or so-called inertia) term. In the typical formulation of L-ADMM (without the ρ weighting), the image update is
where Q must be positive semi-definite, or positive definite. The choice of Q can make finding the solution to (31) much easier. In particular, the main computational burden in minimizing in (31) is the application of the operator R and its adjoint RT. The operator Q is chosen as
The coefficient δ is must satisfy δ≦1/μ∥RTR∥ so that Q will be at least positive semi-definite. Here the norm applied to the operator is the induced operator norm equal to the largest singular value in the case of a matrix. The minimization problem of (31) then becomes solving
A significant improvement is that the operators R and RT act only on the previous values of each variable and thus only need to be applied once each during the entire minimization in (33) to determine fn+1. In contrast, solving (9) via gradient methods requires application of each operator during each inner iteration of the minimization algorithm for solving (9). A potential limitation of L-ADMM is that typically the entire algorithm requires many more outer iterations for convergence. The objective when using L-ADMM is that the increased number of outer ADMM iterations is offset by the improved speed of each outer iteration, in particular solving the image minimization via (33). The convergence of L-ADMM is improved by picking δ as large as possible, subject to the constraint with respect to the operator R mentioned previously.
Combining L-ADMM with the ρ weighting yields an algorithm with similar structure but significantly improved performance. The choice of operator Q becomes
The minimization problem of (31) is similarly modified by the inclusion of the ρ weighting in both the second term and in Q, yielding the image update equation as a solution to the minimization problem
This improvement concerns the choice of the parameter δ, which now must satisfy δ≦1/(μ∥RTρR∥). The inclusion of the ρ weighting allows for a larger δ, because ∥RTρR∥<<∥RTR∥. In this variation the image f is updated (step 44 in
Extended Weighting Embodiment
In this variation, the ρ weighting operator is augmented with operator F to form aggregate weighting operator Γ=ETρE. The operator E is applied on both sides of ρ to maintain symmetry of the operator. The extended weighting operator E can be constructed to make the composite operator RTETρER become closer to an identity operation, i.e. improve the implicit preconditioning nature of the weighting.
For example, E can be a rebinning operation, converting from fan-beam to parallel-beam data. The use of the ρ operator for parallel beam geometries makes RTρ a more exact analytical inverse of R than when applied to fan-beam geometries. Similarly in 3D, the E operator can rehin from true cone-beam data to cone-parallel data (parallel in-plane, but divergent beam along the axial direction). Applying the ρ operator on this geometry is equivalent to filtering along slanted lines in the original cone-beam domain, which has been shown to be advantageous in FBP reconstructions. Alternatively, E could simply rotate the projection data on the detector panel itself to implement filtering along slanted lines. The adjoint operation ET rotates the projection data back to its original orientation. The operator E can also be adopted when using the square root of the ramp filter
or approximation to a square root of the ramp filter, e.g., in the form
In general, the most effective choice of E, in terms of the approximation of the identity by the composite operator RTETρER, would be to make the composite operator RTETρE approximate the inverse of R. To the extent that an FBP reconstruction algorithm provides a good approximation to the inverse of R, a good choice for E would result in RTETρE behaving like the FBP reconstruction operator.
Alternatively, the weighting operator Γ can be chosen to correspond to filtering along slanted lines or curves in the original cone-beam projection domain (that is, jointly in the variables t, δ, r or a subset thereof, similar to the filtering used in reconstruction procedures for divergent beam CT geometries (e.g. helical coneheam).
The update equations analogous to (9) and (10) replace the ρ weighting by the extended weighting operator Γ=ETρE. A similar change applies to Equations (12)-(14). As before, the corresponding operation (W+μΓ) is hard to invert directly, and similar to the case of ρ weighting, the minimization with respect to u of the cost function corresponding to the one in (10) is performed by an iterative process with a few “inner iterations” using the gradient with respect to u, which takes the form in (14), with ρ replaced by Γ=ETρE. The same comments about applying W and Γ at low computational cost apply, except that the application of Γ=ETρE will now involve rebirming as well.
As before, the need for an iterative algorithm to compute the update of u can be overcome by introducing another variable split. While the split u=s used before can work in this situation as well, the resulting update for u involves inverting (μ2I+μ1Γ)=μ2I+μ1ETρE, which is no longer a simple shift invariant filter. Instead, using the split u=Es retains the linear shift invariant filtering form for the u update, and the update for s involves instead inversion of W+μ2ETE. For most rehinning operators E, the operator ETE will be very well conditioned, and often structured, facilitating inversion or iterative solution.
ηun+1=ηun−(un+1−Esn+1) (39)
ηsn+1=ηsn−(sn+1−Rfn+1) (40)
The extended weighting can be combined with the L-ADMM similarly to the way the ρ weighting was combined with the L-ADMM, obtaining the benefit of both features.
Regularizer-Aware Weighting Embodiment
Another variation on the previous choices of weighting operator Γ, is directed explicitly to improve the conditioning of the Hessian with respect to f ∇f2LΓ(u, f, η)=RTΓR+∇f2Φ(f) of the Γ-weighted AL. Typically, with quadratic regularization, Φ(f)=∥f∥G2, where G is a weighting operator with a high-pass filter characteristics. This is typically true to a good approximation, even with edge-preserving, non-quadratic regularization, for small values of the image gradient. It follows, that to make the Hessian ∇f2LΓ(u, f, η) well-conditioned, or approximate the identity, the weighting Γ can be chosen so that RTΓR has frequency response complementary to that of G. This can be achieved by replacing the RamLak ramp filter ρ in ETρE by an appropriately apodized (frequency-weighted) ramp filter, with similar modifications when using the square root filter.
Filtering in the Image Domain Embodiment
Artisans will appreciate that the composite operator RTΓR is an operation that produces an output image from an input image. Hence, this operation can be performed entirely in the image domain. This can lead to savings, if the operation can be performed at lower cost than a projection R or backprojection RT. Now, with the weighting Γ chosen to one of the forms described in this invention, so that RTΓR is well-conditioned or an approximate identity, RTΓR can be approximated by a low-order and inexpensive digital filtering operation, in the image domain. This will further reduce the cost of the iterative algorithm.
Introduction of ρ through Image Domain Split (left-right) Preconditioner Embodiment
In another preferred variation, instead of incorporating the ρ operator in the sinogram domain, it can instead be introduced into the formulation using an image domain representation. Consider the minimization with respect to f of the augmented Lagrangian without the ρ weighting in (5). It involves the solution of the zero gradient condition expressed by the following equation
∇fΦ(f)+μRTRf+μRT(n−u)=0 (41)
This equation for f is typically ill-conditioned because of the ill-conditioning of the Hessian RT R of the image discrepancy term with respect to f. Denote the image domain representation or approximate representation of the ρ operator as {circumflex over (ρ)}. For example, when ρ corresponds to the ramp filter with 1-D frequency response |ω| in the projection domain, the operator {circumflex over (ρ)} may be chosen as an image domain filter with frequency response √{square root over (ω12+ω22)}, where ω1 and ω2 denote the frequency variables in the transverse plane. More generally, {circumflex over (ρ)} may be chosen as an image domain filter with a high-pass response in the transverse plane. Denote further a square root of tins operator as
Then using {circumflex over (ζ)} as a split (combination of left and right) preconditioner in Eq. (41), yields the following preconditioned version of Eq. (41):
{circumflex over (ζ)}T∇fΦ(f)+μ{circumflex over (ζ)}TRTR{circumflex over (ζ)}x+μ{circumflex over (ζ)}TRT(η−u)=0 where f={circumflex over (ζ)}x (42)
where x is an image-domain auxiliary variable that is related to the image f by the relation
That is, appocanon of the image-domain filer {circumflex over (ζ)} to x produces f. We refer to x as a pre-image. Next, using the identity that holds for a linear operator A: ∇xΦ(Ax)=AT∇fΦ(f) whenever f=Ax, Eq. (42) reduces to
∇xΦ({circumflex over (ζ)}x)+μ{circumflex over (ζ)}TRTR{circumflex over (ζ)}x+μ{circumflex over (ζ)}TRT(η−u)=0 where f={circumflex over (ζ)}x (43)
To obtain this preconditioned form of the gradient with respect to x, the AL in Eq. (5) is modified to the following form
Eq. (43) is made to be well-conditioned for solving for the variable x by choosing the image domain filter {circumflex over (ζ)} so that {circumflex over (ζ)}TRTR{circumflex over (ζ)} is well-conditioned (e.g, approximates or is proportional to the identity operator), for example by using the square root of the image domain ramp filter as described above, or its approximation using an FIR or an IIR image domain filter, or using the FFT. In the case of an IIR or FIR filter, it may be advantageous to use in the definition
a square root operator that is not necessarily Hermitian symmetric (or self adjoint); it only needs to satisfy the equation
This allows the phase of the frequency response of the filter
to be freely chosen, which may enable a computationally cheaper implementation than in the case of a Hermitian symmetric operator with zero phase frequency response.
An alternative favorable choice for {circumflex over (ζ)} is one that makes the Hessian with respect to x of the AL Lρ(x, u, η), which is given by
∇x2Φ({circumflex over (ζ)}x)+μ{circumflex over (ζ)}TRTR{circumflex over (ζ)} (45)
well-conditioned (e.g., approximately equal or proportional to the identity operator).
Because the relation in Eq. (44) between the variables f and x does not appear in the expression for Lρ(x, u, η), the block-coordinate optimization of Lρ(x, u, η) with respect to the variables x, u and η can proceed without reference to the image variable f. The image f can be computed only at the end of the iterations, when the completion criterion has been satisfied, or more frequently if desired, e.g., for monitoring the iterations.
The steps corresponding to the ADMM algorithm in this case, i.e. the block-coordinate optimization of Lρ(x, u, η), are analogous to those in Eq. (9)-(11), and in the PWLS case that the sinogram discrepancy term has the form of a weighted two-norm are given by
In this embodiment the preconditioning operator
advantageously is not directly applied to the auxiliary sinogram variable u, which makes the update steps for u simpler having the closed-form solution,
which only requires multiplication of previously calculated quantities by a diagonal matrix.
The only numerically difficult step is the x image variable update, which will have a similar level of complexity to the f updates of the sinogram-domain ρ operator, e.g. Eq. (9), and can be similarly performed to desired accuracy using an iterative minimization algorithm such as nonlinear conjugate gradients.
Introduction of ρ through image Domain Variable Splitting Embodiment
This is another embodiment with ρ applied in the image domain. The same notation and meaning for {circumflex over (ρ)} and
as above is used. This embodiment introduces into Eq. (4) an additional auxiliary image domain variable x with corresponding image domain constraint relating the new image variable x and the original image variable f with the
operator. The resulting equation is
Following from the non-ρ weighted case in (5), the extended ρ-weighted Augmented Lagrangian for this is written
Focusing specifically on the corresponding ADMM update equations for f and η2 for this Augmented Lagrangian, we have
With the initialization η20=0, these recurrence relations resolve to
allowing the η2 term and update equation to be dropped all together.
The modified set of ADMM update equations for all variables are then
This embodiment shares the benefits of the above image domain embodiment defined by Equations (44) and (46)-(49), particularly the preconditioning operator is not directly applied to the auxiliary sinogram variable u, which makes the update steps for u simpler and they may have closed-form solutions. The update step for f also has a trivial closed-form in this formulation. The only numerically difficult step is the x image variable update, which will have a similar level of complexity to the f updates of the sinogram-domain ρ operator, e.g. (9), and can be similarly performed to desired accuracy using an iterative minimization algorithm such as nonlinear conjugate gradients. However, in this embodiment and Equations (51)-(57) above the quantities f and
are distinct, which can provide additional freedom and improve the speed of convergence.
Additional Variations
The particular examples of using the different variants of weighting operators Γ of this invention with the alternating minimization of the ρ-modified AL in the ADMM-ρ and L-ADMM-ρ algorithms are not meant to be exhaustive. The different variants of weightings and iterative schemes can be combined in various ways. The same weighting operators can be used to provide similar advantages in other alternating minimization schemes known in the art, for example versions studied of Gauss-Seidel and of the so-called diagonal-quadratic approximation method [J. Eckstein, “Augmented Lagrangian and Alternating Direction Methods for Convex Optimization: A Tutorial and Some Illustrative Computational Results,” RUTCOR Research Report RRR 32-2012, December 2012], or DQA, or related methods such as Gradient projection for sparse reconstruction (GPSR) [M. T. Figueiredo, R. D. Nowak, and S. J. Wright, Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems, 2007], Proximal Forward-backward splitting methods (PFBS) [P. L. Combettes and V. R. Wajs, Signal recovery by proximal forward-backward splitting, 2005], and Uzawa-type iterations, or as another set of examples, versions with Nesterov-type or Barzilai-Borwein type acceleration of the inner and/or the outer iterations in versions of the ALM or ADMM or L-ADMM, such as NESTA, FISTA, etc.
Artisans will also appreciate that the description and embodiments of different variations of this invention can be modified and manipulated to other, mathematically equivalent forms, in which not all features previously described would be apparent. For example, auxiliary sinogram variables u and η in Eqns. (8)-(12) could be represented together by a single vector, say v, by stacking the vectors u and η as ν=[uT, ηT]T. Eqns. (8)-(12) could then be rewritten in terms of the single auxiliary sinogram variable ν, with a corresponding change in any computer code or system executing the instructions to perform the operations indicated by these equations, obscuring the actual presence of more than one auxiliary variable. The distinct update steps for u and η in
Simulation Result
A brief example demonstrates the efficacy of the present approach for the PWLS formulation of the cost function. The projection data used is simulated from a thorax phantom, scanned in a 2D fan beam geometry with curved detector panel. The projection data has 983 detectors and 1024 views over a full circular rotation. Poisson noise was added to the simulated data, such that the standard deviation of the noise in the FBP reconstruction was around 20 HU. The baseline iterative algorithm solves (2) using the method of conjugate gradients (CG). The majority of the computation will involve the application of R or RT, so the computational cost of the algorithm will be measured in terms of the number of such operator applications. For example, in CG, each iteration calculates the gradient of (2), which involves application of both R and RT, yielding 2 applications per iteration.
We consider the resulting image after 200 CG iterations to be the converged image. For each approach, we calculate the progress of the algorithm as the distance or root-mean-square error (RMSE) from this converged image.
As
While specific embodiments of the present invention have been shown and described, it should be understood that other modifications, substitutions and alternatives are apparent to one of ordinary skill in the art. Such modifications, substitutions and alternatives can be made without departing from the spirit and scope of the invention, which should be determined from the appended claims.
Various features of the invention are set forth in the appended claims. In the claims, when it is stated that some quantity q is a function of another quantity x, this functional dependence is not exhaustive, that is, quantity q can be also a function of some additional quantities. Thus we say that h(s, g) is a function of s, a function of g, and function of both s and g.
The application claims priority under 35 U.S.C. §119 from prior provisional application Ser. No. 62/015,446, which was filed Jun. 22, 2014.
This invention was made with government support under R44EB014669 awarded by National Institutes of Health, National Center for Advancing Translational Sciences (NCATS) and National Institute of Biomedical Imaging and Bioengineering (NIBIB). The government has certain rights in the invention.
Number | Name | Date | Kind |
---|---|---|---|
6005916 | Johnson | Dec 1999 | A |
8971599 | DeMan | Mar 2015 | B2 |
20110142316 | Wang | Jun 2011 | A1 |
20120155728 | DeMan | Jun 2012 | A1 |
20130343672 | Yu | Dec 2013 | A1 |
20140140599 | Kim | May 2014 | A1 |
20140369580 | Yu | Dec 2014 | A1 |
20140369581 | Fu | Dec 2014 | A1 |
20150287223 | Bresler | Oct 2015 | A1 |
Entry |
---|
Combettes, P.L., et al., “Signal recovery by proximal forward-backward splitting”, Multiscale Model. Simul., vol. 4, Issue 4, (2005), pp. 1168-1200. |
Eckstein, J., “Augmented Lagrangian and Alternating Direction Methods for Convex Optimization: A Tutorial and Some Illustrative Computational Results”, RUTCOR Research Report RRR 32-2012, (Dec. 2012), 35 pages. |
Figueiredo, M.T., et al., “Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems”, IEEE Journal of Selected Topics in Signal Processing, vol. 1, No. 4, (Dec. 2007), pp. 586-597. |
Number | Date | Country | |
---|---|---|---|
62015446 | Jun 2014 | US |