This disclosure is directed to methods for parallel reconstruction of digital images.
Magnetic resonance imaging (MRI) is a medical imaging technique used in radiology to visualize internal structures of the body in detail. As a non-invasive imaging technique, MRI makes use of nuclear magnetic resonance to image nuclei of atoms inside the body. MRI has been used for imaging the brain, muscles, the heart, cancers, etc.
The raw data acquired by an MR scanner are the Fourier coefficients, or the so-called k-space data.
y=F
u
x+n, (1)
where Fu is a given undersampled Fourier transform operator, x denotes the MR image, y is the acquired k-space data, and n denotes the noise introduced in the acquisition. Unlike the full k-space scenario, one cannot directly apply the inverse Fourier transform to the undersampled data acquired in
Parallel imaging has been proven effective for reducing acquisition time. Parallel imaging exploits differences in sensitivities between individual coil elements in a receiver array to reduce the number of gradient encodings required for imaging.
Sensitivity encoding (SENSE) is a technique that can reduce scan time in magnetic resonance imaging (MRI). A SENSE approach utilizes the spatial information related to the coils of a receiver array for reducing conventional Fourier encoding. In principle, SENSE can be applied to any imaging sequence and k-space trajectories, however, it is particularly feasible for Cartesian sampling schemes. In the SENSE approach, one models the relationship between the target image and the acquired k-space data as:
yi=FuSix+ni, (2)
where yi is the undersampled k-space data acquired by the i-th coil, and Si is the coil sensitivity maps, as shown in the last two rows of
To recover x from EQ. (2), one should note that the target x has certain structures, with which one can better reconstruct x from the undersampled data y. This is where sparse learning can play a role. Typically, x can be computed by minimizing the following expression:
where loss(y, Fux) denotes the data fidelity, and φ(x) incorporates prior knowledge about the image to be reconstructed. The efficient optimization of EQ. (3) is key to parallel imaging.
The data fidelity term is typically defined as the squared distance between the acquired data and the prediction: loss(y, Fux)=1/2∥y−FuSix∥22. The term for φ(x) should incorporate the structure in the target image x.
Redundant Haar minimization, which is the minimization of the combination of the data fidelity and the L1 regularization via redundant Haar wavelets, has been proven effective for parallel MRI reconstruction. When minimizing via Nesterov's accelerated gradient algorithm, a key building block is the so-called proximal operator associated with the redundant Haar penalty. Such a proximal operator does not allow a closed form solution, which adds to the difficulty of optimization. A low precision solution of such proximal operator usually leads to slow convergence or no convergence at all. Therefore, it is useful to have an efficient solver for such proximal operator.
Exemplary embodiments of the invention as described herein generally include methods for solving a proximal operator using a restricted least squares regression algorithm, an alternating algorithm for solving the dual system, and a novel and effective warm start technique for fast convergence. An equivalent relationship is established between a restricted least squares regression algorithm according to an embodiment of the invention, and the alternating algorithm according to an embodiment of the invention.
According to an aspect of the invention, there is provided a method for parallel magnetic resonance imaging (MRI) reconstruction of digital images, including providing a set of acquired k-space MR image data v, a redundant Haar wavelet matrix W satisfying WTW=I, wherein I is an identity matrix, a regularization parameter λ≧0, and a counter limit k, initializing a variable z0=Wv, and intermediate quantities p0=q0=0, calculating yi=arg minz 1/2∥z−(pi+zi)∥22+λ∥z∥1 for 0≦i≦k, wherein z denotes values of an MR image sought to be reconstructed, updating pi+1=(pi+zi)−yi, updating zi+1=arg minz 1/2∥z−(qi+zi)∥22+g(z), wherein
and updating qi+1=(qi+yi)−zi+1, wherein x=WTz is a solution of
that specifies a reconstruction of the MR image.
According to a further aspect of the invention,
According to a further aspect of the invention, arg minz 1/2∥z−(qi+zi∥22+g(z)=WWT(qi+zi).
According to another aspect of the invention, there is provided a method for parallel MRI reconstruction of digital images, including providing a set of acquired k-space MR image data v, a redundant Haar wavelet matrix W satisfying WTW=I, wherein I is an identity matrix, a regularization parameter λ≧0, a counter limit k, an initial value of a dual variable p derived by forming a dual to a quadratic program
that represents parallel reconstruction of a magnetic resonance (MR) image, and an initial value of a Lagrange multiplier {tilde over (q)} of the constraint an initial value of z=WWTz, calculating
calculating an intermediate value yi=(Wv−(I−WWT){tilde over (q)}i)−pi+1, updating
calculating qi+1=(I−WWT){tilde over (q)}i+1, and calculating zi+1=(Wv−Pi+1)−qi+1, wherein x=WTz is a solution of
that specifies a reconstruction of the MR image.
According to a further aspect of the invention, 1/2∥(I−WWT){tilde over (q)}i−(Wv−pi)∥22 is minimized by (I−WWT){tilde over (q)}i+1=(I−WWT)(Wv−pi).
According to a further aspect of the invention, if dual variable p and Lagrange
According to a further aspect of the invention, if dual variable p and Lagrange multiplier {tilde over (q)} are both initialized to 0,
wherein S is defined as
According to a further aspect of the invention, if dual variable p and Lagrange multiplier {tilde over (q)} are both initialized to 0,
According to another aspect of the invention, there is provided a non-transitory program storage device readable by a computer, tangibly embodying a program of instructions executed by the computer to perform the method steps for parallel magnetic resonance imaging (MRI) reconstruction of digital images.
Exemplary embodiments of the invention as described herein generally include systems and methods for efficient redundant Haar minimization for parallel MRI reconstruction. Accordingly, while the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that there is no intent to limit the invention to the particular forms disclosed, but on the contrary, the invention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention.
As used herein, the term “image” refers to multi-dimensional data composed of discrete image elements (e.g., pixels for 2-dimensional images and voxels for 3-dimensional images). The image may be, for example, a medical image of a subject collected by computer tomography, magnetic resonance imaging, ultrasound, or any other medical imaging system known to one of skill in the art. The image may also be provided from non-medical contexts, such as, for example, remote sensing systems, electron microscopy, etc. Although an image can be thought of as a function from R3 to R or R7, the methods of the inventions are not limited to such images, and can be applied to images of any dimension, e.g., a 2-dimensional picture or a 3-dimensional volume. For a 2- or 3-dimensional image, the domain of the image is typically a 2- or 3-dimensional rectangular array, wherein each pixel or voxel can be addressed with reference to a set of 2 or 3 mutually orthogonal axes. The terms “digital” and “digitized” as used herein will refer to images or volumes, as appropriate, in a digital or digitized format acquired via a digital acquisition system or via conversion from an analog image.
Parallel MRI reconstruction can be modeled by the following program:
where y denotes acquired k-space MR image data, x denotes intensities of the MR image one seeks to reconstruct, A=FS, F=diag(Fu, Fu, . . . Fu) is a matrix of size cM×cN, Fu is a partial FFT matrix of size M×N, S=[S1, S2, . . . , Sc]H is the (conjugate transpose) coil sensitivity maps (CSM) matrix with S, being a matrix of size N×N, W is a redundant Haar wavelet matrix satisfying WTW=I, and λ is the regularization parameter.
One effective approach for solving EQ. (4) is Nesterov's algorithm, also known as FISTA (Fast Iterative Shrinkage-Thresholding Algorithm). Given a function to be minimized of the form F(x):=f(x)+g(x), the steps of a FISTA with constant stepsize are as follows, where Step k is repeated until convergence:
A feature of this algorithm is that it can achieve a convergence rate of O(1/k2), which is optimal for all first-order black-box algorithms. When applying FISTA, a key building block is the proximal operator associated with the regularization term:
where v is a result of the gradient descent, i.e.,
in the previous algorithm. The efficient solution of such a proximal operator is a key to the resulting algorithm. However, such a proximal operator does not have a closed form solution.
An efficient approach according to an embodiment of the invention for solving EQ. (2) incorporates the relationship WTW=I into EQ. (5):
Let z=Wx, which implicitly induces the following relationship
z=WWTz. (7)
The above equation indicates that z is in the range space of W. Thus, solving EQ. (6) is equivalent to solving
The following theorem justifies the conversion of EQ. (5) into EQ. (8):
Theorem 1 Let z* be the solution to EQ. (8). Then, x*=WTz* is the optimal solution to EQ. (5).
Proof: For all x, let z=Wx. We have x=WTz from WTW=I. It is obvious that the relationship in EQ. (7) holds. As z* is the optimal solution to EQ. (8), which satisfies z*=WWTz* and z*=Wx*, we have
1/2∥z*−Wv∥22+λ∥z*∥1≦1/2∥z−Wv∥22+λ∥z∥1 (9)
Inserting in z*=WWTz*, z=WWTz, z*=Wx* and z=Wx, we have
1/2∥WWTz*−Wv∥22+λ∥Wx*∥1≦1/2∥WWTz−Wv∥22+λ∥Wx∥1 (10)
which together with WTW=I leads to
1/2∥WTz*−v∥22+λ∥Wx*∥1≦1/2∥WTz−v∥22+λ∥Wx∥1. (11)
Incorporating x*=WTz* and x=WTz, we have
1/2∥x*−v∥22+λ∥Wx*∥1≦1/2∥x−v∥22+λ∥Wx∥1, (12)
which indicates that x* is the optimal solution to EQ. (5).
When solving EQ. (4) via FISTA, embodiments of the invention use a “warm” start technique, i.e., solving EQ. (5) with the solution to the previous program as the “warm” start.
According to an embodiment of the invention, EQ. (8) can be solved via a restricted least squares regression algorithm. For convenience of discussion, let
A restricted least squares regression algorithm according to an embodiment of the invention is presented in Algorithm 1, which recursively calls the solver to the proximal operator associated with f(.) and g(.), respectively. Algorithm 1 is as follows, with reference to the steps of the flowchart of
According to an embodiment of the invention, the proximal operator associated with f(.) can be computed by soft shrinkage as:
According to an embodiment of the invention, the proximal operator associated with g(.) can be computed by orthogonal projection as:
Algorithm 1 is guaranteed to converge to the optimal solution of EQ. (8).
According to an embodiment of the invention, an alternating algorithm can solve the dual of EQ. (8).
Making use of the dual norm of the l1 norm, EQ. (8) can be written as:
The objective function in EQ. (17) is convex in z and concave in p, and the constraints are closed convex. Therefore, the von Neuman Lemma that ensures the existence of a solution to the min-max holds, and the min-max can be exchanged to obtain
For the inner minimization of EQ. (18), i.e.,
embodiments introduce a Lagrangian variable {tilde over (q)} for the equality constraint. As the constraint is affine, and the objective function is differentiable and convex, strong duality holds. Thus, EQ. (19) can be written as the following equivalent min-max program:
Thus, embodiments can reformulate EQ. (8) as the following min-max optimization:
For the inner minimization, setting the derivative of z to zero, one has
z=Wv−p−(I−WWT){tilde over (q)}. (22)
Inserting EQ. (22) into EQ. (21), the following dual program can be obtained:
Embodiments of the invention solve the dual program (23) via an alternating algorithm in Algorithm 2. Step 3 and Step 5 are included for convenience of analyzing the relationship with Algorithm 1. Algorithm 2 is guaranteed to converge as the objective function of EQ. (23) is concave in both {tilde over (q)} and p, and the constraint sets are convex and closed. Algorithm 2 is as follows, with references to step numbers in the flowchart of
Algorithm 2 Alternating Algorithm for EQ. (8)
In Step 2, pi+1 can be computed by a simple projection onto to the l∞ ball as follows:
In Step 4, {tilde over (q)}i+1, the solution to the minimization is not unique. However, (I−WWT){tilde over (q)}i+1 is unique, and satisfies
(I−WWT){tilde over (q)}i+1=(I=WWT)(Wv−pi). (25)
To see this, for the objective function in Step 4, set the derivative with regard to {tilde over (q)} to zero, to obtain
(I−WWT)T((I−WWT){tilde over (q)}i+1−(Wv−pi)). (26)
EQ, (25) can be obtained by using the relationship WTW=I.
The equivalence between Algorithm 1 and Algorithm 2, when the inputs p0, {tilde over (q)}0, are both set to zero in Algorithm 2, may be established as follows.
In Algorithm 2, Step 3 can be computed as
where S is the soft shrinkage operator, defined as
Step 6 can be computed as
In Step 2, pi+1 satisfies
It is to be understood that embodiments of the present invention can be implemented in various forms of hardware, software, firmware, special purpose processes, or a combination thereof. In one embodiment, the present invention can be implemented in software as an application program tangible embodied on a computer readable program storage device. The application program can be uploaded to, and executed by, a machine comprising any suitable architecture.
The computer system 61 also includes an operating system and micro instruction code. The various processes and functions described herein can either be part of the micro instruction code or part of the application program (or combination thereof) which is executed via the operating system. In addition, various other peripheral devices can be connected to the computer platform such as an additional data storage device and a printing device.
It is to be further understood that, because some of the constituent system components and method steps depicted in the accompanying figures can be implemented in software, the actual connections between the systems components (or the process steps) may differ depending upon the manner in which the present invention is programmed. Given the teachings of the present invention provided herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations or configurations of the present invention.
While the embodiments of the present invention have been described in detail with reference to exemplary embodiments, those skilled in the art will appreciate that various modifications and substitutions can be made thereto without departing from the spirit and scope of the invention as set forth in the appended claims.
This application claims priority from “Redundant, Efficient Haar Minimization”, U.S. Provisional Application No. 61/654,227 of Liu, et al., filed Jun. 1, 2012, the contents of which are herein incorporated by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
61654227 | Jun 2012 | US |