This disclosure relates to magnetic resonance imaging (MRI), and in particular, to image reconstruction.
One way to generate an image of the interior of an opaque structure makes use of a phenomenon known as nuclear magnetic resonance (NMR). Generally, NMR is a phenomenon by which atoms absorb energy provided from an external magnetic field, and subsequently radiate the absorbed energy as photons. By controlling the magnetic field throughout a region, one can cause the frequency and phase of the emitted photons to vary as a function of the emitting atom's position in the region. Therefore, by measuring the emitted photons' frequency and phase, one can tell where the atoms are present inside the region.
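By way of illustration (this relation is standard NMR physics and is not recited above), the position dependence of the emitted frequency follows from the Larmor relation

$$\omega(\mathbf{r})=\gamma\left(B_0+\mathbf{G}\cdot\mathbf{r}\right),$$

where $\gamma$ is the gyromagnetic ratio of the nucleus, $B_0$ is the static field strength, and $\mathbf{G}$ is an applied field gradient; because the gradient makes the field vary with position $\mathbf{r}$, the precession (and hence emission) frequency $\omega$ encodes position.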
In FIG. 1, a schematic k-space dataset 100 is shown.
An image 102 can be reconstructed from the k-space dataset 100 by various mathematical techniques. These techniques are useful for imaging the internal structure of a human being. In this context, the hydrogen atoms that make up the human's body are caused to undergo nuclear magnetic resonance. In the context of imaging the hydrogen in a human or other animal, this technique is sometimes referred to as magnetic resonance imaging (MRI). Since hydrogen is present in nearly every organ, tissue, fluid, or other part of a human being, MRI typically provides a relatively detailed image of the human being through non-invasive means.
In some MRI contexts, it is desirable to quickly acquire the k-space data necessary to produce an image. For example, when imaging a patient's heart, the image quality is often enhanced if the patient suppresses respiratory-induced motion by holding his breath while the k-space data is acquired. Some patients experience discomfort or difficulty holding their breath for extended periods of time, particularly patients who are in need of cardiac imaging.
One way to reduce the time required to acquire the k-space data is to employ multiple detectors, with each detector configured to detect photons from a different spatial region. This approach is referred to as “parallel” MRI, or pMRI.
In general, in one aspect, reconstructing an image from MRI data provided by an MRI machine includes: selecting a system transformation based on the MRI machine; selecting a subspace based on the system transformation; obtaining a solution vector that solves a minimization problem, the minimization problem being formulated on the subspace based on the MRI data; and displaying the image reconstructed from the solution vector.
Implementations may include one or more of the following features. Reconstructing an image also includes estimating a sensitivity of a receiving coil in the MRI machine, and selecting the system transformation includes selecting the system transformation based on the sensitivity. Selecting the system transformation also includes selecting the system transformation based on a sampling pattern used by the MRI machine. Obtaining a solution vector includes selecting a regularization parameter, and the subspace is selected independently of the regularization parameter. Reconstructing an image also includes selecting a basis of the subspace. The basis is selected using a conjugate-gradient least-squares technique. The basis is selected using a least-squares QR technique. Obtaining a solution vector includes selecting a regularization parameter, and the basis is selected independently of the regularization parameter. The subspace consists of a Krylov subspace. Obtaining a solution vector includes using an LSQR algorithm. Obtaining a solution vector includes using a conjugate-gradient algorithm. Reconstructing an image also includes selecting a regularization parameter corresponding to a substantially vertical portion of an L-curve. Reconstructing an image also includes selecting a first regularization parameter and a second regularization parameter. The first regularization parameter corresponds to a real part of the solution vector, and the second regularization parameter corresponds to an imaginary part of the solution vector. The first regularization parameter and the second regularization parameter are selected as maximum parameters yielding a pre-determined error. The first and second regularization parameters are selected from an interval, the interval being based on a singular value spectrum of the system transformation. The interval consists of numbers between 10^2 and 10^6. The MRI data is acquired by uniformly sampling a k-space. The MRI data is acquired by non-uniformly sampling a k-space. The MRI data is acquired by sampling over an entire k-space. The MRI data is acquired by sampling over half a k-space.
Other aspects include other combinations of the features recited above and other features, expressed as methods, apparatus, systems, program products, and in other ways. Other features and advantages will be apparent from the description and from the claims.
Although only two detector coils 204 are shown in FIG. 2, more than two detector coils 204 can be used.
The computer 206 receives data provided by the MRI machine 200, and includes instructions to perform the image reconstruction described below (see FIG. 4).
FIG. 3a shows a schematic k-space dataset 300 constructed by irregular or non-uniform sampling. Each line 302 represents the amounts and relative phases of photons detected (or “sampled”) at a particular frequency by a detector coil 204. Typically, a relatively high quality image can be constructed from a central region 304 of k-space. Thus, in some approaches, the k-space dataset 300 is sampled along lines 302 that are concentrated in the central region 304. Using non-uniform sampling allows for comparatively greater flexibility in manipulating artifacts inherent in subsampling, and provides easily self-referenced coil sensitivity data. However, some mathematical techniques for image reconstruction rely on uniform sampling; those techniques would therefore be inapplicable to this k-space dataset 300.
By contrast, FIG. 3b shows a schematic k-space dataset constructed by uniform sampling, in which the sampled lines are evenly spaced throughout k-space.
Additionally, k-space datasets may have certain symmetries. For example, in certain cases, k-space datasets tend to have conjugate symmetry. In general, if a k-space dataset possesses such symmetry, then only a portion of the dataset need be acquired in order to recover the entire dataset. In the case of conjugate symmetry, for example, the entire k-space dataset may be reconstructed from a suitable half of the k-space dataset.
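By way of illustration (a standard Fourier fact consistent with, though not recited in, the text above): if the underlying image $\rho(\mathbf{r})$ is real-valued, its k-space data $S(\mathbf{k})=\int\rho(\mathbf{r})\,e^{-i2\pi\mathbf{k}\cdot\mathbf{r}}\,d\mathbf{r}$ satisfies

$$S(-\mathbf{k})=\overline{S(\mathbf{k})},$$

so samples over a suitable half of k-space determine the other half by conjugation.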
The techniques described below do not rely on any particular sampling strategy, and therefore can be used with any k-space dataset. Moreover, the techniques described below can be used on an entire k-space dataset, or half of a k-space dataset.
Theoretical Background
One can represent a k-space dataset as a vector or collection of vectors. Each sampled frequency is represented by a separate vector, whose dimension is equal to the phase resolution of the detector coil 204 that gathers data at that frequency. Each component of the vector represents the complex signal strength detected at the corresponding phase and frequency encoding. As a notational convention, all vectors are assumed to be column vectors, unless otherwise specified.
Reconstructing an image from a k-space dataset is equivalent to finding a vector ρ that solves the minimization problem:
$$\min_{\rho}\ \lVert s-P\rho\rVert_2,\qquad(1)$$
where s is an n-dimensional vector in the k-space dataset, P is an $n\times m$ matrix describing the phase and frequency encoding, along with estimates of the detector coils' sensitivities, ρ is an m-dimensional vector containing the reconstructed image (or a portion thereof), and $\lVert\cdot\rVert_2$ is the L2 norm. The matrix P is sometimes referred to as the system matrix. The dimension of s equals the number of distinct phases that can be detected by each detector coil 204, multiplied by the number of coils employed. The dimension of ρ represents the resolution of the reconstructed image.
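By way of illustration only, the following Python sketch shows one common way a system matrix of this kind can be applied implicitly (per-coil sensitivity weighting, Fourier encoding, and k-space subsampling). The names `forward` and `adjoint` and the array conventions are hypothetical, not recited above; the P contemplated here may be constructed differently.

```python
import numpy as np

def forward(rho, sens, mask):
    """Apply a SENSE-style system matrix P to an image.

    rho  : complex image, shape (ny, nx)
    sens : coil sensitivities, shape (ncoils, ny, nx)
    mask : boolean k-space sampling mask, shape (ny, nx)

    Returns the stacked sampled k-space data s = P rho.
    """
    coil_images = sens * rho[None, :, :]               # weight by coil sensitivity
    kspace = np.fft.fft2(coil_images, axes=(-2, -1))   # phase/frequency encoding
    return kspace[:, mask].ravel()                     # keep only sampled locations

def adjoint(s, sens, mask):
    """Apply P^H to stacked k-space data s."""
    ncoils, ny, nx = sens.shape
    kspace = np.zeros((ncoils, ny, nx), dtype=complex)
    kspace[:, mask] = s.reshape(ncoils, -1)            # re-embed sampled data
    # Adjoint of the unnormalized FFT is (ny*nx) times the inverse FFT.
    coil_images = np.fft.ifft2(kspace, axes=(-2, -1)) * (ny * nx)
    return np.sum(np.conj(sens) * coil_images, axis=0)
```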
Typically (e.g., at high sub-sampling rates), the dimensions of s, ρ, and P are such that the minimization problem (1) is ill-conditioned, and therefore solutions of (1) can be extremely sensitive to error or noise. For example, a solution ρ of (1) may have the property that relatively slight changes in the acquired data result in relatively large changes in the solution. One way to mitigate this sensitivity is to solve a “regularized” minimization problem:
$$\min_{\rho}\left\{\lVert s-P\rho\rVert_2^2+\lambda^2\lVert L\rho\rVert_2^2\right\}.\qquad(2)$$
Here, L is a linear operator that is selected to impose a desired constraint on the solution ρ, and λ is a real or complex scalar. Often, the choice of L depends on how the k-space data is acquired. If data over the full range of k-space is acquired, then typically, L=I, the identity operator.
In some embodiments, data over only half of k-space is acquired. Nevertheless, a full image can be recovered, based on symmetries that typically exist in k-space data, or based on known properties of solutions ρ. If data over only half of k-space is acquired, one can use regularization parameters $\lambda_r$ and $\lambda_i$ and a regularization operator L to separately constrain the real and imaginary parts of the solution. One can then rewrite the minimization problem (2) as

$$\min_{\rho}\left\{\lVert s-P\rho\rVert_2^2+\lambda_r^2\lVert \mathrm{real}\{\rho\}\rVert_2^2+\lambda_i^2\lVert \mathrm{imag}\{\rho\}\rVert_2^2\right\},\qquad(2')$$
where real{ρ} denotes the real portion of ρ, and imag{ρ} denotes the imaginary portion of ρ. In some embodiments, L can be defined by

$$L=\begin{bmatrix}I & 0\\ 0 & cI\end{bmatrix},\qquad(2'')$$

regarded as acting on the real and imaginary portions of ρ stacked as [real{ρ}; imag{ρ}],
where I is an identity operator of dimension equal to that of ρ, and $c=\lambda_i/\lambda_r$. This operator L acts nontrivially on the imaginary part of ρ. An operator $L'=(1/c)L$ can also be used to act on the real part of ρ; in this case, $\lambda_r$ should be replaced by $\lambda_i$ in equation (2′). With the choice of L as in equation (2″), the regularized minimization problem (2′) has two parameters: $\lambda_r$ and c.
In general, other choices of L can be accommodated. If L is invertible, then the techniques described below can be applied by a transformation of variables. For example, one transformation of variables is to introduce the variable $\eta=L\rho$, so that $\rho=L^{-1}\eta$. Reformulating minimization problem (2) or (2′) in terms of η eliminates the appearance of L, and the techniques described below can be used. In some implementations, the system matrix can then be considered to be $PL^{-1}$, as opposed to P.
The regularized minimization problem (2) can be thought of as seeking a solution to the original minimization problem (1), while simultaneously attempting to constrain the solution ρ. In this context, the “regularization parameter” λ (or λr) represents the relative importance of finding a ρ that minimizes (1) and controlling the norm of ρ. For example, if λ is small, then a solution of (2) is close to a solution of (1). If used, the regularization parameter c represents the relative importance of adhering to the assumed symmetries in k-space.
Krylov Subspaces and Orthonormal Bases
Without any additional constraints, solving the minimization problem (2) is computationally intensive, in part due to the dimensions of s and ρ. The Krylov subspace techniques described below provide a way to project the minimization problem (2) to a smaller-dimensional subspace. This results in a more tractable minimization problem. To illustrate this process, we note that the minimization problem (2) is equivalent to solving:
$$(P^HP+\lambda^2L^HL)\rho=P^Hs,\qquad(3)$$
wherein the exponent “H” denotes the Hermitian (conjugate) transpose of a matrix. In what follows, we shall assume L=I, although the case L≠I follows by employing a change of variables, e.g., $\eta=L\rho$, or $\rho=L^{-1}\eta$, to make the following observations applicable. Equation (3) is sometimes referred to as the “normal equations” corresponding to the minimization problem (2).
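As a minimal numerical illustration (not part of the original text) that the normal equations (3) with L=I reproduce the regularized least-squares solution of (2), one can compare a direct solve against an equivalent stacked least-squares solve on a small random complex system:

```python
import numpy as np

rng = np.random.default_rng(0)
n, m, lam = 12, 8, 0.5
P = rng.normal(size=(n, m)) + 1j * rng.normal(size=(n, m))
s = rng.normal(size=n) + 1j * rng.normal(size=n)

# Direct solution of the normal equations (3) with L = I.
rho_normal = np.linalg.solve(P.conj().T @ P + lam**2 * np.eye(m),
                             P.conj().T @ s)

# Stacked least-squares form of (2):  min || [P; lam*I] rho - [s; 0] ||_2
A = np.vstack([P, lam * np.eye(m)])
b = np.concatenate([s, np.zeros(m)])
rho_stacked, *_ = np.linalg.lstsq(A, b, rcond=None)

assert np.allclose(rho_normal, rho_stacked)
```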
In general, given an n-dimensional vector v in a vector space V, and a linear operator T on V, the kth-order Krylov subspace of V generated by v and T is defined to be the linear vector space spanned by v and k−1 successive images of v under T; that is, $K_k(v,T)=\mathrm{span}\{v,\,Tv,\,T^2v,\ldots,T^{k-1}v\}$. Where v and T are understood, the kth-order Krylov subspace is simply denoted $K_k$.
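By way of illustration (a hypothetical helper, not the CGLS or LSQR recurrences described below), an orthonormal basis of $K_k(v,T)$ can be built by modified Gram-Schmidt:

```python
import numpy as np

def krylov_basis(T, v, k):
    """Orthonormal basis of K_k(v, T) = span{v, Tv, ..., T^(k-1) v}
    via modified Gram-Schmidt.  A minimal sketch; it assumes the Krylov
    vectors are linearly independent (no breakdown)."""
    V = np.zeros((len(v), k), dtype=complex)
    V[:, 0] = v / np.linalg.norm(v)
    for j in range(1, k):
        w = T @ V[:, j - 1]
        for i in range(j):                           # orthogonalize against earlier columns
            w -= (V[:, i].conj() @ w) * V[:, i]
        V[:, j] = w / np.linalg.norm(w)
    return V
```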
A Krylov subspace can be constructed based on s and P. Although the $n\times m$ matrix P is generally not a square matrix, the matrix $P^HP$ is always square ($m\times m$). Thus, Krylov subspaces can be formed from the vector $P^Hs$ and the operator $P^HP$. The kth-order Krylov subspace associated with the normal equations (3) (for λ=0) is given by:
$$K_k=\mathrm{span}\{P^Hs,\,(P^HP)(P^Hs),\ldots,(P^HP)^{k-1}(P^Hs)\}.\qquad(4)$$
With the kth Krylov subspace identified, the minimization problems (1) and (2), or the normal equations (3), can be restricted to the kth Krylov subspace. For example, minimization problem (1) restricts to:

$$\min_{\rho_k\in K_k}\ \lVert s-P\rho_k\rVert_2,$$
where $\rho_k$ denotes the solution obtained in the kth Krylov subspace. One class of strategies for solving the restricted problems involves constructing an orthonormal basis of the Krylov subspace $K_k$. As described more fully below, this can be accomplished iteratively; that is, the orthonormal basis for $K_{k-1}$ can be used to construct the orthonormal basis of $K_k$.
Projecting to a Krylov Subspace with a CGLS-Constructed Basis.
One way to construct such an orthonormal basis involves conjugate-gradient least-squares (“CGLS”) techniques. Using these techniques, a matrix $V_k$ with columns $\{v_1,\ldots,v_k\}$ can be constructed such that: the columns of $V_k$ form an orthonormal basis of $K_k$; $\beta v_1=P^Hs$ for some scalar β; and
$$(P^HP)V_k=V_{k+1}\tilde H_k\qquad(5)$$
for a $(k+1)\times k$ tridiagonal matrix $\tilde H_k$. Because $\tilde H_k$ is tridiagonal, equation (5) yields a three-term recurrence that can be used to construct the next column $v_k$ from the previous columns $v_{k-1}$, $v_{k-2}$. Thus, the columns of $V_k$ need not be stored for all k as k varies from iteration to iteration (see Appendix A).
If ρk is a solution of equation (3) restricted to be in a Krylov subspace Kk, then ρk factors as:
$$\rho_k=V_ky_k\qquad(6)$$
for some coefficient vector yk, because the columns of Vk form an orthonormal basis of Kk. Inserting equation (6) into equation (3) and applying equation (5) yields:
$$\left((P^HP)+\lambda^2I\right)V_ky_k=\left(V_{k+1}\tilde H_k+\lambda^2V_k\right)y_k=\beta v_1.\qquad(7)$$
To enforce the condition that the error between the vectors $\rho_k$ and ρ is orthogonal to $K_k$, one multiplies (7) by $V_k^H$ to obtain:
$$(H_k+\lambda^2I)y_k=\beta e_1,\qquad(8)$$
where $H_k$ is the $k\times k$ leading tridiagonal submatrix of $\tilde H_k$, and $e_1=V_k^Hv_1$ is the first canonical unit vector. Equation (8) is referred to as the “projection” of equation (3) onto the kth-order Krylov subspace. Because $H_k$ is tridiagonal, a three-term recurrence relation can be used to obtain solutions to equation (8) using only the solutions and orthonormal basis vectors from the previous three iterations. Obtaining the three-term recurrence relation is explained in Appendix A.
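The following Python sketch (illustrative only; it stores all columns of $V_k$ for clarity, whereas the text notes that only three are needed) mirrors equations (5) through (8): a Lanczos-style recurrence applied to $P^HP$ with starting vector $P^Hs$, followed by a solve of the small projected system. The helper name `projected_solve` is hypothetical.

```python
import numpy as np

def projected_solve(P, s, k, lam):
    """Sketch of equations (5)-(8): Lanczos on P^H P starting from P^H s,
    then solve (H_k + lam^2 I) y_k = beta * e_1 and return rho_k = V_k y_k."""
    A = P.conj().T @ P                       # dense for illustration only
    r = P.conj().T @ s
    beta = np.linalg.norm(r)                 # beta * v_1 = P^H s
    m = P.shape[1]
    V = np.zeros((m, k), dtype=complex)
    H = np.zeros((k, k), dtype=complex)
    V[:, 0] = r / beta
    for j in range(k):
        w = A @ V[:, j]
        if j > 0:
            w -= H[j - 1, j] * V[:, j - 1]   # three-term recurrence
        H[j, j] = V[:, j].conj() @ w
        w -= H[j, j] * V[:, j]
        if j + 1 < k:
            H[j + 1, j] = np.linalg.norm(w)
            H[j, j + 1] = H[j + 1, j]        # Hermitian tridiagonal
            V[:, j + 1] = w / H[j + 1, j]
    rhs = np.zeros(k); rhs[0] = beta         # beta * e_1
    y = np.linalg.solve(H + lam**2 * np.eye(k), rhs)
    return V @ y
```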
Projecting to a Krylov Subspace with LSQR-Constructed Basis.
One can also use least-squares QR (“LSQR”) techniques to construct a basis of the Krylov subspace $K_k$. As in the CGLS case described above, the vectors in this orthonormal basis are also denoted $\{v_1,\ldots,v_k\}$. It can be arranged that if these vectors are placed as columns in a matrix $V_k$, then the matrix $V_k$ satisfies the recurrence relation
$$PV_k=U_{k+1}B_k,\qquad(9)$$
where $B_k$ is a $(k+1)\times k$ lower bidiagonal matrix and $U_{k+1}=[u_1,\ldots,u_{k+1}]$ is a matrix with orthonormal columns such that $u_1=s/\lVert s\rVert_2$. Similarly to the CGLS case, since the matrix $B_k$ is bidiagonal, the columns of $V_k$ (and $U_{k+1}$) can be generated from a two-term recurrence. Thus, the entire matrix $V_k$ (or $U_{k+1}$) is not needed to generate subsequent columns of the matrix as k increases; only the most recent two columns are needed. Not storing the entire matrices $V_k$ or $U_{k+1}$ can save computational resources.
Since $V_k$ describes a basis of $K_k$, the solution to equation (3) above is given by $\rho_k=V_ky_k$ for some appropriate coefficient vector $y_k$. Instead of forming this product explicitly, a solution $\rho_k$ may be constructed by exploiting the bidiagonal structure of $B_k$. In particular, using equation (9) and the relation $\rho_k=V_ky_k$, the minimization underlying equation (3) can be written as

$$\min_{y_k}\ \lVert s-PV_ky_k\rVert_2=\min_{y_k}\ \lVert U_{k+1}\left(\beta e_1-B_ky_k\right)\rVert_2=\min_{y_k}\ \lVert \beta e_1-B_ky_k\rVert_2,\qquad(12)$$
where $e_1$ is the first canonical unit vector of length k+1, $\beta=\lVert s\rVert_2$, and the last equality follows from the fact that the L2-norm is invariant under left multiplication by a matrix with orthonormal columns. Therefore, solving the minimization problem (3) has been reduced to solving the minimization problem (12), which is a problem of smaller dimension.
Using this approach, $\rho_k$ is not formed from the product $V_ky_k$ explicitly; rather, the bidiagonal nature of $B_k$ is exploited to obtain $\rho_k$ from only the most recent iterates. Further, based on reasoning explained in Appendix A, one can also obtain values of the norms $\lVert s-P\rho_k\rVert_2$ and $\lVert\rho_k\rVert_2$ from short-term recurrences. As discussed more fully below, these norms can be used to select regularization parameters.
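As a sketch of the bidiagonalization behind equation (9) (illustrative; the helper name `golub_kahan` is hypothetical), the following Python code builds $U_{k+1}$, $B_k$, and $V_k$ from the coupled recurrences and checks that $PV_k=U_{k+1}B_k$:

```python
import numpy as np

def golub_kahan(P, s, k):
    """Bidiagonalization behind equation (9): P V_k = U_{k+1} B_k,
    with B_k lower bidiagonal and u_1 = s / ||s||_2."""
    n, m = P.shape
    U = np.zeros((n, k + 1), dtype=complex)
    V = np.zeros((m, k), dtype=complex)
    B = np.zeros((k + 1, k))
    beta = np.linalg.norm(s)
    U[:, 0] = s / beta
    w = P.conj().T @ U[:, 0]
    alpha = np.linalg.norm(w)
    V[:, 0] = w / alpha
    for i in range(k):
        B[i, i] = alpha
        w = P @ V[:, i] - alpha * U[:, i]        # beta_{i+1} u_{i+1}
        beta = np.linalg.norm(w)
        U[:, i + 1] = w / beta
        B[i + 1, i] = beta
        if i + 1 < k:
            w = P.conj().T @ U[:, i + 1] - beta * V[:, i]   # alpha_{i+1} v_{i+1}
            alpha = np.linalg.norm(w)
            V[:, i + 1] = w / alpha
    return U, B, V

# Quick check of equation (9) on a random complex system.
rng = np.random.default_rng(1)
P = rng.normal(size=(20, 10)) + 1j * rng.normal(size=(20, 10))
s = rng.normal(size=20) + 1j * rng.normal(size=20)
U, B, V = golub_kahan(P, s, 5)
assert np.allclose(P @ V, U @ B)
```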
Image Reconstruction
Referring to FIG. 4, k-space data is acquired from the MRI machine (step 404). To reconstruct the image, initially, a particular Krylov subspace of order $k_0$ is selected (step 406). With $k=k_0$, a basis $V_k$ according to equation (5) is constructed, for example by using conjugate-gradient least-squares or LSQR techniques (step 408). Using the basis $V_k$, the “projected” image reconstruction minimization problem is formulated as in equation (8).
This minimization problem is solved, for example using the truncated singular value decomposition approach described in connection with equation (10) (step 410). The result is a solution of equation (3) that depends on λ, and possibly c. A particular value of λ (and c, if necessary) is selected (step 412), as described below (see FIGS. 5-7).
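As a runnable illustration (not part of the original disclosure) of the flow of steps 404 through 412, the following Python sketch uses a small random dense system in place of scanner data, and a dense regularized solve in place of the Krylov-projected solve sketched above; all identifiers here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)
n, m = 40, 25
# Small random complex system standing in for the acquired data of step 404;
# real use would build P from coil sensitivities and the sampling pattern.
P = rng.normal(size=(n, m)) + 1j * rng.normal(size=(n, m))
rho_true = rng.normal(size=m) + 1j * rng.normal(size=m)
s = P @ rho_true + 0.01 * (rng.normal(size=n) + 1j * rng.normal(size=n))

def solve_regularized(P, s, lam):
    """Dense stand-in for steps 406-410: min ||s - P rho||^2 + lam^2 ||rho||^2."""
    m = P.shape[1]
    A = np.vstack([P, lam * np.eye(m)])
    b = np.concatenate([s, np.zeros(m)])
    rho, *_ = np.linalg.lstsq(A, b, rcond=None)
    return rho

lams = np.logspace(-3, 1, 30)
residuals = [np.linalg.norm(s - P @ solve_regularized(P, s, lam)) for lam in lams]
# Step 412 selects lam from these residuals (see the L-curve discussion below);
# the solution for the selected lam is then used to form the displayed image.
```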
Selection of Regularization Parameter(s)
Referring to FIG. 5, in some implementations the regularization parameter λ is selected using an L-curve 500. The L-curve 500 is obtained by plotting $\lVert\rho\rVert_2$ against $\lVert s-P\rho\rVert_2$ on a log-log scale, parameterized by λ. The L-curve 500 shown in FIG. 5 is illustrative.
Typically, the L-curve 500 for parallel MRI reconstruction problems has a region 502 in which it is nearly vertical, and an inflection point 504. As λ varies within region 502, ∥s−Pρ∥2 changes relatively slowly. Consequently, the resulting solutions to equation (3) are relatively stable. Thus, for image reconstruction, λ is selected within region 502. Specifically, λ is chosen by partitioning the λ-interval corresponding to the substantially vertical region 502 into subintervals [λn−1, λn], and selecting λ to be the largest λn such that the condition:
$$\left|\log\lVert s-P\rho_{\lambda_n}\rVert_2-\log\lVert s-P\rho_{\lambda_{n-1}}\rVert_2\right|\le C\qquad(13)$$

is satisfied for some threshold C, where $\rho_{\lambda_n}$ denotes the solution of equation (3) computed with $\lambda=\lambda_n$.
In some examples, the logarithmic error threshold is chosen to be 0.01. The point λ=λ+ schematically illustrates the location of a typical value satisfying the condition (13). A value of λ that satisfies the condition (13) may be near the inflection point 504, but need not be coincident with it.
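A minimal sketch of the selection rule of condition (13) (assuming the residual norms have been precomputed for an increasing grid of λ values; the helper name `select_lambda` is hypothetical):

```python
import numpy as np

def select_lambda(lams, residual_norms, C=0.01):
    """Return the largest lambda_n whose logarithmic residual change from
    lambda_{n-1} stays within the threshold C, per condition (13).
    `residual_norms[i]` is ||s - P rho_{lams[i]}||_2, with `lams` increasing."""
    log_r = np.log10(residual_norms)
    best = lams[0]
    for i in range(1, len(lams)):
        if abs(log_r[i] - log_r[i - 1]) <= C:
            best = max(best, lams[i])
    return best
```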
Once a value of λ is fixed, then ρ is an approximate solution of equation (3). This solution may subsequently be used to generate an image according to known techniques. When the iterations have converged to produce an image of sufficient quality, it is displayed (step 416 in FIG. 4).
Referring to FIG. 6, when a first regularization parameter $\lambda_r$ and a second regularization parameter $\lambda_i$ are used, candidate pairs $(\lambda_r,\lambda_i)$ can be regarded as points in a plane whose axes correspond to $\lambda_r$ and $\lambda_i$.
Referring to FIG. 7, the regularization parameters $\lambda_r$ and $\lambda_i$ can be selected as follows. First, a regularization range is identified (step 700).
In some implementations, the maximum value in the regularization range is based on the greatest singular value of the matrix P. For example, the maximum value in the regularization range may be equal to the greatest singular value of P, or may be a scalar multiple of the greatest singular value of P, the scalar multiple being based on the process by which P is determined from the detector coils 204.
The singular values of P often are distributed in two clusters, with each singular value in the first cluster being larger than each singular value in the second cluster. In some implementations, the minimum value in the regularization range is selected based on the greatest singular value in the second cluster. For example, the minimum value in the regularization range may be equal to the greatest singular value in the second cluster, or may be a scalar multiple of the greatest singular value in the second cluster, the scalar multiple being based on the process by which P is determined from the detector coils 204.
An error threshold is identified (step 702). This error threshold is analogous to the value C in equation (13). In some implementations, the logarithmic error threshold is equal to 0.01. In step 704, each axis in the regularization range (e.g., the $\lambda_i$ and $\lambda_r$ axes) is partitioned into intervals 604 (see FIG. 6).
An endpoint of an interval is selected (step 706), for example, the largest partition endpoint on the $\lambda_r$ axis. Starting from the endpoint selected in step 706, the error terms $\lVert s-P\rho\rVert_2$ are computed along a diagonal trajectory 606 (step 708), one of which is shown in FIG. 6. The largest values of $\lambda_r$ and $\lambda_i$ along the trajectory for which the error satisfies the error threshold are recorded (step 710).
It is determined whether interval endpoints exist that have not been used in the above steps (step 712). If such endpoints exist, steps 706-712 are repeated, and several values of $\lambda_r$ and $\lambda_i$ are recorded. Once all the interval endpoints have been used, the maximum $\lambda_r$ and $\lambda_i$ of all those recorded in the various iterations of step 710 are used as regularization parameters.
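By way of illustration, the following hypothetical sketch mirrors steps 706 through 712: it marches along diagonal trajectories through a $(\lambda_r,\lambda_i)$ grid and records the largest parameter values whose error measure stays within the threshold. The callable `error_fn` is assumed to encapsulate the error computation for a given pair (e.g., a criterion analogous to condition (13)).

```python
def select_two_params(error_fn, lr_grid, li_grid, C=0.01):
    """Hypothetical sketch of steps 706-712.  `lr_grid` and `li_grid` are
    the partition endpoints 604 on each axis, in increasing order."""
    best_r, best_i = lr_grid[0], li_grid[0]
    for start in range(len(lr_grid)):          # step 706: choose an endpoint on the lambda_r axis
        steps = min(len(lr_grid) - start, len(li_grid))
        for j in range(steps):                 # step 708: march along a diagonal trajectory 606
            lr, li = lr_grid[start + j], li_grid[j]
            if error_fn(lr, li) <= C:          # step 710: record values meeting the threshold
                best_r = max(best_r, lr)
                best_i = max(best_i, li)
    return best_r, best_i                      # maxima over all trajectories (after step 712)
```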
Other embodiments are within the scope of the following claims.
In this appendix, certain recurrence relations and algorithms are described. The notation used in this appendix follows the MATLAB conventions, and follows that of Gene Golub and Charles Van Loan, “Matrix Computations,” second edition, Johns Hopkins University Press, 1989. To the extent the MATLAB notation and the notation of Matrix Computations are inconsistent, the MATLAB notation governs.
LSQR Recurrence and Algorithm
The matrix recursions

$$PV_k=U_{k+1}B_k,$$
$$P^HU_{k+1}=V_kB_k^H+\alpha_{k+1}v_{k+1}e_{k+1}^T$$

translate into the following matrix-vector (coupled two-term) recursions:

$$\beta_1u_1=s;\qquad \alpha_1v_1=P^Hu_1;$$
$$\beta_{i+1}u_{i+1}=Pv_i-\alpha_iu_i;\qquad \alpha_{i+1}v_{i+1}=P^Hu_{i+1}-\beta_{i+1}v_i,$$

where the $\alpha_i$ and $\beta_i$ indicate scalar normalization constants.
Combining these recursions, for $\rho_k=V_ky_k$ one obtains

$$\lVert s-P\rho_k\rVert_2^2+\lambda^2\lVert\rho_k\rVert_2^2=\left\lVert\begin{bmatrix}B_k\\ \lambda I\end{bmatrix}y_k-\begin{bmatrix}\beta e_1\\ 0\end{bmatrix}\right\rVert_2^2\qquad(1)$$

for any λ. To minimize the functional on the right of equation (1), one can use a QR factorization. However, the stacked matrix on the right is sparse, and because $B_k$ differs from $B_{k-1}$ only in the addition of a new row and column, the factorization at step k requires only a few additional Givens (plane) rotations. For notational convenience in the QR factorization described below, the dependence on λ will be suppressed.
To solve equation (1), the QR factorization is applied to the augmented matrix as follows:

$$Q_k\begin{bmatrix}B_k & \beta e_1\\ \lambda I & 0\end{bmatrix}=\begin{bmatrix}R_k & f_k\\ 0 & \bar\phi_{k+1}\end{bmatrix},$$
where $Q_k$ is an orthonormal matrix, $R_k$ is an upper bidiagonal $k\times k$ matrix, and otherwise the notation follows C. C. Paige and M. A. Saunders, “LSQR: An Algorithm for Sparse Linear Equations and Sparse Least Squares,” ACM Transactions on Mathematical Software, Vol. 8, No. 1, March 1982, pp. 43-71, and Vol. 8, No. 2, June 1982, pp. 195-209. By the relationship of $B_k$ to $B_{k-1}$, the matrix $Q_k$ is a product of $Q_{k-1}$ (augmented by $e_k$ in the last row) and plane rotations that “zero out” the new last row in $B_k$, as well as the extra λ in the last row of the stacked matrix.
This yields $R_ky_k=f_k$, whose solution $y_k$ solves the minimization problem on the right in equation (1). The recursion for $\rho_k$ can be obtained from the fact that $\rho_k=V_ky_k=D_kf_k$. The columns of $D_k$ are obtained from the fact that $R_k^TD_k^T=V_k^T$, and that $R_k^T$ is lower bidiagonal. Thus, solving for the last row of $D_k^T$ depends only on $d_{k-1}$ and $v_k$. This allows $\rho_k$ to be obtained from a linear combination of $\rho_{k-1}$ and $d_k$.
By way of example, in the case k=2, the augmented matrix is transformed through four plane rotations as:
For k=3, the augmented matrix transforms as:
To obtain a recurrence for $\lVert\rho_k\rVert_2$, write

$$\rho_k=V_kR_k^{-1}f_k=V_kz_k,$$

where $z_k$ solves $R_kz_k=f_k$. Because the columns of $V_k$ are orthonormal, $\lVert\rho_k\rVert_2=\lVert z_k\rVert_2$, which can be updated from a short-term recurrence.
For the norm of the residual, $\lVert P\rho_k-s\rVert_2$, note that
However, an estimate of the first term on the right is given by
where the vector $q_k$ differs from the previous $q_{k-1}$ only in the last component. Thus, $\lVert q_k\rVert_2^2$ can be computed from $\lVert q_{k-1}\rVert_2^2+\psi_k^2$.
Algorithm LSQR for Complex λ
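By way of illustration (a simplified stand-in, not the exact listing of this appendix): the following Python sketch builds the Golub-Kahan quantities from the coupled two-term recursions above, then solves the small damped problem of equation (1) with a dense least-squares solver instead of the incremental plane-rotation updates described in the text. For complex λ, only |λ| affects the minimizer of the damped problem. The name `lsqr_damped` is hypothetical.

```python
import numpy as np

def lsqr_damped(P, s, lam, k):
    """Golub-Kahan bidiagonalization followed by a dense solve of
        min || [B_k; |lam|*I] y - [beta*e_1; 0] ||_2,
    returning rho_k = V_k y_k."""
    beta = np.linalg.norm(s)
    u = s / beta
    w = P.conj().T @ u
    alpha = np.linalg.norm(w)
    v = w / alpha
    V, alphas, betas = [v], [alpha], []
    for _ in range(k):
        w = P @ v - alpha * u             # beta_{i+1} u_{i+1} = P v_i - alpha_i u_i
        b = np.linalg.norm(w)
        u = w / b
        betas.append(b)
        w = P.conj().T @ u - b * v        # alpha_{i+1} v_{i+1} = P^H u_{i+1} - beta_{i+1} v_i
        alpha = np.linalg.norm(w)
        v = w / alpha
        alphas.append(alpha)
        V.append(v)
    B = np.zeros((k + 1, k))              # lower bidiagonal B_k
    for j in range(k):
        B[j, j] = alphas[j]
        B[j + 1, j] = betas[j]
    Vk = np.stack(V[:k], axis=1)
    A = np.vstack([B, abs(lam) * np.eye(k)])   # only |lam| matters here
    rhs = np.zeros(2 * k + 1)
    rhs[0] = beta                              # beta * e_1
    y, *_ = np.linalg.lstsq(A, rhs, rcond=None)
    return Vk @ y
```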
Conjugate-Gradient Recurrence and Algorithm
The matrix recurrence relations
$$P^HPV_k=V_{k+1}\tilde H_k=V_kH_k+v_{k+1}g_k^T,$$

where $g_k=[0,\ldots,0,\beta_k]^T=\beta_ke_k$ and $e_k$ is the kth canonical unit vector, can be re-written as

$$(P^HP+\lambda^2I)V_k=V_k(H_k+\lambda^2I)+v_{k+1}g_k^T.$$
Since the matrix $T_k=(H_k+\lambda^2I)$ is Hermitian and tridiagonal, it can be factored as
$$T_k=L_kD_kL_k^H,$$
where $L_k$ is a unit lower bidiagonal matrix with subdiagonal entries $\mu_1,\ldots,\mu_{k-1}$, and $D_k=\mathrm{diag}(d_1,\ldots,d_k)$.
The entries $\mu_i$ and $d_i$ in $L_k$ and $D_k$ depend on λ, but that dependence has been suppressed for notational convenience.
Note that the entries of Lk, Dk differ from the entries in Lk−1, Dk−1 respectively only in the last row and column of each matrix. Thus, the matrix recurrence above can be written as a matrix-vector recurrence by equating like columns, obtaining:
$$\beta_{k-1}v_k=-\overline{\beta_{k-2}}\,v_{k-2}+P^H(Pv_{k-1})-\alpha_{k-1}v_{k-1},$$
where the βi terms are the normalization constants located on the subdiagonal of Tk, their conjugates are on the superdiagonal of Tk, and the αi terms are on the main diagonal of Tk. Therefore, the recursion for the entries of Lk and Dk is given by:
$$\mu_{i-1}=\beta_{i-1}/d_{i-1},$$
$$d_i=\alpha_i-|\mu_{i-1}|^2\,d_{i-1},$$

with initial condition $d_1=\alpha_1$.
Define $C_k=V_kL_k^{-H}$ and let $p_k$ be the solution to $L_kD_kp_k=\beta e_1$. Now
$$\rho_k=V_ky_k=V_kT_k^{-1}\beta e_1=(V_kL_k^{-H})D_k^{-1}L_k^{-1}\beta e_1=C_kp_k.$$
From the definition of Ck, we have
$$L_kC_k^H=V_k^H,$$
so the lower bidiagonal structure of Lk gives the recurrence
$$c_k=v_k-\mu_{k-1}c_{k-1}.$$
Writing $p_k=[\varphi_1,\ldots,\varphi_k]^T$ and using the fact that $L_{k-1}D_{k-1}p_{k-1}=\beta e_1$, we have $p_k=[p_{k-1};\varphi_k]$, so that

$$\varphi_1=\beta/d_1,$$
$$\varphi_k=-(\mu_{k-1}d_{k-1}\varphi_{k-1})/d_k,$$
from which it follows that
$$\rho_k=C_kp_k=C_{k-1}p_{k-1}+\varphi_kc_k=\rho_{k-1}+\varphi_kc_k.$$
Using the relations above, the residual norm of the regularized equations can be written as:
$$\lVert(P^HP+\lambda^2I)\rho_k-P^Hs\rVert_2=|\beta_k|\,|e_k^Ty_k|=|\beta_k|\,|\varphi_k|,$$

since $y_k$ satisfies $L_k^Hy_k=p_k$.
Conjugate Gradient Algorithm (Complex λ)
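By way of illustration (a simplified stand-in, not the exact listing): the following Python sketch implements the recurrences derived above, updating $d_i$, $\varphi_i$, and $c_i$ from short-term recurrences; the Lanczos coefficients $\alpha_i$ computed here include the $|\lambda|^2$ shift. The name `cg_regularized` is hypothetical.

```python
import numpy as np

def cg_regularized(P, s, lam, k):
    """Lanczos three-term recurrence on (P^H P + |lam|^2 I) combined with
    the L_k D_k L_k^H short recurrences, returning rho_k."""
    apply_A = lambda x: P.conj().T @ (P @ x) + (abs(lam) ** 2) * x
    r = P.conj().T @ s
    beta0 = np.linalg.norm(r)              # beta such that beta * v_1 = P^H s
    v = r / beta0
    v_prev = np.zeros_like(v)
    beta_prev = 0.0
    rho = np.zeros_like(v)
    c = np.zeros_like(v)
    d_prev = phi_prev = 0.0
    for i in range(k):
        w = apply_A(v) - beta_prev * v_prev
        alpha = np.real(np.vdot(v, w))     # diagonal entry (real; includes |lam|^2)
        w -= alpha * v
        if i == 0:
            d = alpha                      # d_1 = alpha_1
            phi = beta0 / d                # phi_1 = beta / d_1
            c = v.copy()                   # c_1 = v_1
        else:
            mu = beta_prev / d_prev        # mu_{i-1} = beta_{i-1} / d_{i-1}
            d = alpha - mu**2 * d_prev     # d_i = alpha_i - |mu_{i-1}|^2 d_{i-1}
            phi = -(mu * d_prev * phi_prev) / d
            c = v - mu * c                 # c_i = v_i - mu_{i-1} c_{i-1}
        rho = rho + phi * c                # rho_i = rho_{i-1} + phi_i c_i
        beta = np.linalg.norm(w)           # subdiagonal entry beta_i
        if beta == 0:                      # exact convergence / breakdown
            break
        v_prev, v = v, w / beta
        beta_prev, d_prev, phi_prev = beta, d, phi
    return rho
```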
This invention was made with Government support under Grant No. RRO19703 awarded by the National Institutes of Health. The U.S. Government has certain rights in this invention.