This invention relates to magnetic resonance imaging, and more particularly to quantitatively mapping material intrinsic physical properties using signals collected in magnetic resonance imaging.
Quantitative susceptibility mapping (QSM) in magnetic resonance imaging (MRI) has received increasing clinical and scientific interest. QSM is useful for characterizing and quantifying magnetic chemical compositions such as iron, calcium, and contrast agents including gadolinium and superparamagnetic iron oxide nanoparticles. Tissue composition of these compounds may be altered in various neurological disorders, such as Parkinson's disease, Alzheimer's disease, stroke, multiple sclerosis, hemochromatosis, and tumors, as well as in other diseases throughout the body. Different from diffuse tissue iron, deoxyheme iron in the capillaries and veins reflects tissue metabolic oxygen extraction from circulation, and circulation is vital to all organs and is perturbed in many diseases. QSM is able to unravel novel information related to the magnetic susceptibility (or simply referred to as susceptibility), a physical property of underlying tissue that depends on magnetic chemical compositions. Due to the prevalence of iron and calcium in living organisms and their active involvement in vital cellular functions, QSM is in general very useful for studying the molecular biology of iron/calcium by tracking iron/calcium in the circulation and other body systems, and for studying metabolic activities by using iron and calcium as surrogate markers. QSM can also be used to quantify contrast agent passage in tissue, which can be captured in time-resolved imaging and fitted to physical transport equations for generating quantitative maps of transport parameters. Therefore, accurate mapping of the magnetic susceptibility induced by iron, calcium, and contrast agents will provide tremendous benefit to clinical researchers to probe the structure and function of the human body, and to clinicians to better diagnose various diseases and provide relevant treatment.
QSM reconstruction has been based on minimizing a cost function that fits the proton-experienced tissue magnetic field b(r) (in units of B0, the main field of the MRI scanner) to the unit dipole field d(r) = (3cos²θ − 1)/(4π|r|³), where θ is the angle between r and the B0 direction (z-axis).
Approximating the noise in b as Gaussian, the data fidelity term fitting the measured data to the biophysics model is ∫w(r)|d*χ(r)−b(r)|²dr ≡ ‖d*χ−b‖w². This integral form b = d*χ allows numerical estimation of the susceptibility solution, typically under a regularization associated with prior information. As known from physics, the integral form represents a solution to the governing partial differential equation (PDE). For mathematical analysis of the susceptibility solution, the equivalent PDE can be derived directly from Maxwell's equation with the Lorentz sphere correction, connecting the macroscopic magnetic susceptibility χ to the microscopic field b experienced by protons, the signal generators in MRI:

∇²b(r) = (1/3)(∂²/∂x² + ∂²/∂y² − 2∂²/∂z²)χ(r) [1]
Eq. 1 is a wave equation with respect to susceptibility χ (the z-axis playing the role of time) and a Laplacian equation with respect to field b. In continuous space without any error, b(r) = d*χ(r), i.e., B(k) = D(k)X(k) with D(k) = 1/3 − kz²/k², allows a precise solution free of streaking: X(k) = B(k)/D(k) for D(k) ≠ 0, with values on the magic-angle cone D(k) = 0 determined by continuity.
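The forward dipole model underlying Eq. 1 can be sketched as follows. This is a minimal numpy sketch, assuming an isotropic grid with B0 along z; the function names are introduced here for illustration only:

    import numpy as np

    def dipole_kernel(shape):
        # k-space unit dipole kernel D(k) = 1/3 - kz^2/|k|^2, B0 along z
        kx, ky, kz = np.meshgrid(
            np.fft.fftfreq(shape[0]),
            np.fft.fftfreq(shape[1]),
            np.fft.fftfreq(shape[2]),
            indexing="ij",
        )
        k2 = kx**2 + ky**2 + kz**2
        k2[0, 0, 0] = 1.0  # avoid division by zero at the k-space origin
        D = 1.0 / 3.0 - kz**2 / k2
        D[0, 0, 0] = 0.0   # the uniform (mean) field is not determined by chi
        return D

    def forward_field(chi):
        # tissue field b(r) = (d * chi)(r) in units of B0, via B(k) = D(k) X(k)
        D = dipole_kernel(chi.shape)
        return np.real(np.fft.ifftn(D * np.fft.fftn(chi)))

The precise solution above corresponds to dividing by D(k) rather than multiplying, which is why the zeros of D(k) on the magic-angle cone make the inversion ill-posed in practice.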
Any field data that cannot be fitted to the dipole field is referred to as the incompatible field data v(r), which may come from various sources including noise, digitization error, and anisotropy sources. This incompatible field causes artifacts according to the wave propagation solution:

χa(r) = (p*v)(r), [2]

where p is the wave propagator having large values at magic-angle cones above and below the incompatible source.
The dipole-incompatible part, including noise, discretization error, and anisotropy sources, generates artifacts defined by the wave propagator with the B0 field direction as the time axis. Granular noise causes streaking along the wave propagator cone surface, and contiguous error causes both streaking and shadowing artifacts. The QSM reconstruction task is to minimize these streaking and shadowing artifacts. The streaking can be effectively minimized by penalizing edges that do not coincide with tissue edges, which is typically referred to as a structure prior. The shadowing artifacts can also be suppressed but remain challenging.
The multiecho gradient echo (mGRE) sequence is widely accessible and commonly used to acquire data for QSM. Prior QSM reconstruction has been based on the approach of estimating the magnetic field from all echo signals of the mGRE data before the field-to-magnetic-susceptibility inversion. This estimation of the field from all echoes in the prior art has invariably involved approximations in noise characteristics, violating fidelity to the mGRE data at all echo times and resulting in errors.
In general, magnetic susceptibility affects the complex MRI signal in a nonlinear manner. Further biophysical decomposition of susceptibility contributors, including fat with chemical shift, deoxyheme iron versus other diffuse ferritin iron, and contrast agents, and of tissue transport and interaction phenomena, adds complexity to the nonlinear signal models. The corresponding inverse problems of nonlinear signal models are nonconvex, and their solutions depend critically on initialization.
Implementations of systems and methods for collecting and processing multiecho (including the single echo as a special case) complex MRI signals of a subject, and reconstructing maps of physical properties intrinsic to the subject (e.g., magnetic susceptibility) are described below. In some implementations, MR signal data corresponding to a subject can be transformed into susceptibility-based images that quantitatively depict the structure and/or composition and/or function of the subject. Using this quantitative susceptibility map, one or more susceptibility-based images of the subject can be generated and displayed to a user. The user can then use these images for diagnostic or experimental purposes, for example to investigate the structure and/or composition and/or function of the subject, and/or to diagnose and/or treat various conditions or diseases based, at least in part, on the images. As one or more of the described implementations can result in susceptibility-based images having higher quality and/or accuracy compared to other susceptibility mapping techniques, at least some of these implementations can be used to improve a user's understanding of a subject's structure and/or composition and/or function, and can be used to improve the accuracy of any resulting medical diagnoses or experimental analyses.
Some of the implementations described below for the magnetic dipole inversion, using incorporation of a multiecho-complex-data-based cost function, preconditioning, deep neural networks, chemical-composition-based fat compensation, and enforcement of regional susceptibility contrasts, improve susceptibility map quality, including suppression of artificial signal variation in regions of low susceptibility contrast, and provide accurate susceptibility values with precise fidelity to the acquired multiecho complex MRI data; these are pressing concerns for ensuring quantification accuracy, experimental repeatability, and reproducibility.
In general, one aspect of the invention disclosed here enables an accurate quantification of tissue magnetic susceptibility by the use of receiving multiecho complex magnetic resonance imaging data obtained by a magnetic resonance scanner, wherein the multiecho data at each echo time is complex and comprises magnitude and phase information or real and imaginary information regarding the object; and by the use of estimating a magnetic susceptibility distribution of the object based on the multiecho complex data at each echo time, wherein estimating the magnetic susceptibility distribution of the subject comprises determining a cost function, the cost function comprising at least a data fidelity term involving a comparison between the measured and a susceptibility modeled complex signal at each echo time, wherein the susceptibility modeled complex signal has time dependence in both signal magnitude and phase, and a regularization term associated with structure prior information, and determining a magnetic susceptibility distribution based on minimizing the cost function.
In general, another aspect of the invention disclosed here enables accurately estimating the magnetic susceptibility distribution, including calculating a magnetic field experienced by the tissue in the object based on modeling the multiecho complex data with consideration of noise in both magnitude and phase information at each echo time, and replacing the temporal phase evolution with echo time in multiecho data with a phase factor determined by the calculated magnetic field.
In general, another aspect of the invention disclosed here for quantitative susceptibility mapping introduces a scaling method, wherein the magnetic field is divided first by a factor for estimating the magnetic susceptibility and then the output magnetic susceptibility is multiplied by the factor.
In general, in another aspect of the invention disclosed here for quantitative susceptibility mapping in the body comprised of fat and water, the magnetic field, the water fraction and the fat fraction in a voxel are computed using a deep neural network.
In general, another aspect of the invention disclosed here for estimating the magnetic susceptibility distribution includes identifying regions distributed over the whole image volume that have low susceptibility contrast according to a characteristic feature and adding a second regularization term associated with penalizing susceptibility variations in these regions. This regularization over low contrast regions allows for estimation and reduction of shadowing and/or streaking artifacts in these regions.
In general, another aspect of the invention disclosed here for estimating the magnetic susceptibility distribution includes using a characteristic feature characterized by R2* value. The region of interest is determined by similar R2* properties of contiguous voxels; and the penalty of the second regularization term depends on a representative R2* value in the region of interest.
In general, in another aspect of the invention disclosed here for estimating the magnetic susceptibility distribution, the cost function minimization involves preconditioning.
In general, another aspect of the invention disclosed here for estimating the magnetic susceptibility distribution is realized using a deep neural network.
Implementations of these aspects may include one or more of the following features.
In some implementations, the cerebrospinal fluid in the ventricles in the brain is segmented based on low R2* values according to similar R2* properties of contiguous voxels.
In some implementations, the prior information regarding the magnetic susceptibility distribution is determined based on an L1 norm and structural information of the object estimated from acquired images of the object.
In some implementations, the object comprises a cortex, a putamen, a globus pallidus, a red nucleus, a substantia nigra, or a subthalamic nucleus of a brain, or a liver in an abdomen, and generating one or more images of the object comprises generating one or more images depicting the cortex, the putamen, the globus pallidus, the red nucleus, the substantia nigra, or the subthalamic nucleus of the brain, or the liver in the abdomen.
In some implementations, the object comprises at least one of a multiple sclerosis lesion, a cancerous lesion, a calcified lesion, or a hemorrhage, and generating one or more images of the object comprises generating one or more images depicting at least one of the multiple sclerosis lesion, the cancerous lesion, the calcified lesion, or the hemorrhage.
In some implementations, the operations further comprise quantifying, based on the one or more images, a distribution of contrast agents introduced into the object in the course of a contrast enhanced magnetic resonance imaging procedure. The contrast agent passage through tissue is modeled according to a flow velocity and exchange coefficients. The contrast agent passage is captured in many timeframes of time-resolved imaging and is used to determine velocity and exchange coefficients by solving the inverse problem of the transport equation.
In some implementations, the operations further comprise quantifying, based on the one or more images, a distribution of fat and water fractions. The fat chemical shift effect on signal phase is captured on multiecho magnetic resonance imaging data and is used to generate fat and water decomposition, which is a nonconvex optimization problem that can be solved using a deep neural network without training of labeled data.
In some implementations, the operations further comprise quantifying, based on the one or more images, a distribution of oxygen extraction fraction. Deoxyheme iron has a unique effect on multiecho magnetic resonance imaging data, which is used to separate deoxyheme iron from other diffuse susceptibility sources; this is a nonconvex optimization problem that is solved using a deep neural network without training on labeled data.
In some examples, a method for generating one or more images of an object comprises obtaining complex magnetic resonance data collected by a magnetic resonance scanner, wherein the data is complex and comprises magnitude and phase information regarding the object; estimating a magnetic susceptibility distribution of the object based on the obtained complex magnetic resonance data, wherein estimating the magnetic susceptibility distribution of the subject comprises: determining a cost function, the cost function including a data fidelity term involving modeling a magnetic susceptibility effect on the obtained complex magnetic resonance data, a regularization term involving a first region with a low feature of susceptibility contrasts, and a second region with a feature of susceptibility contrasts different from the feature of susceptibility contrasts of the first region; minimizing the cost function; determining a magnetic susceptibility distribution based on the act of minimizing the cost function; and presenting, on a display device, one or more images of the object generated based on the determined magnetic susceptibility distribution. In some examples, the aforementioned example method may include one or more of the following particulars, singly or in any combination: the regularization term involves a third and/or more regions with features of susceptibility contrasts differing from features of susceptibility contrasts of the first two regions; the feature of susceptibility contrasts of the first region is the lowest; the regularization term involves a penalty on a variation of susceptibility values in a region; the penalty varies with the feature of susceptibility contrasts of a region; the feature of susceptibility contrasts of a region involves decay rates of magnitude signals in voxels of the region; a region is formed by binning a group of voxels sharing a characteristic; the characteristic is a distribution of decay rates; the distribution is a rectangular or Gaussian function; the characteristic is a distribution in space; minimizing the cost function is obtained using a preconditioning method; the regularization term is implicitly formed and estimating the magnetic susceptibility distribution is realized using a deep neural network; the deep neural network is trained with labeled data or unlabeled data, or is untrained; the deep neural network is trained with network weights updated with a test data; modeling a magnetic susceptibility effect on the obtained complex magnetic resonance data involves a comparison between the obtained complex magnetic resonance data and a susceptibility modeled complex signal at each echo time; the susceptibility modeled complex signal includes a phase involving echo time and the magnetic susceptibility dipole convolution; the comparison comprises an Lp norm of the difference between the obtained complex magnetic resonance data and a susceptibility modeled complex signal at each echo time; modeling a magnetic susceptibility effect on the obtained complex magnetic resonance data comprises calculating a magnetic field experienced by tissue in the object from the obtained magnetic resonance data; calculating a phase at each echo time according to the calculated magnetic field and a multiplicative scaling factor; performing a comparison involving the calculated phase with a phase of the magnetic susceptibility dipole convolution at each echo time; and dividing the determined magnetic susceptibility distribution by the scaling factor; the comparison 
involving the calculated phase with a phase of the magnetic susceptibility dipole convolution at each echo time comprises calculating an Lp norm of a weighted difference between an exponential factor of the calculated phase and an exponential factor of the phase of the magnetic susceptibility dipole convolution at each echo time.
In some examples, a method for generating one or more images of an object comprises obtaining multiecho complex magnetic resonance data collected by a magnetic resonance scanner, wherein the multiecho data is complex and comprises magnitude and phase information regarding the object; determining oxygen extraction fraction distribution of the object based on the multiecho complex magnetic resonance data, comprising: determining a cost function, the cost function including a data fidelity term involving a comparison between the obtained multiecho complex magnetic resonance data and a modeled complex signal at each echo time with a consideration of deoxyheme-iron effects on magnitude and phase of the modeled complex signal, and a regularization term; minimizing the cost function; determining an oxygen extraction fraction distribution based on the act of minimizing the cost function; and presenting, on a display device, one or more images of the object generated based on the determined oxygen extraction fraction distribution. In some examples, the aforementioned example method may include one or more of the following particulars, singly or in any combination: the consideration of deoxyheme-iron effects on signal magnitude and phase includes a model of deoxyheme-iron distributed in cylinders and nonblood susceptibility sources distributed diffusely in space for determining the modeled complex signal; the comparison between the obtained multiecho complex magnetic resonance data and the modeled complex signal comprises an Lp norm of the difference of the obtained multiecho complex magnetic resonance data and the modeled complex signal. In some examples, the one or more images of the object include the nonblood susceptibility distribution and/or venous blood volume; wherein the regularization term includes a sparsity of edges in oxygenation and/or venous blood volume; wherein minimizing the cost function includes a dipole deconvolution step and the regularization term includes a sparsity of edges in tissue susceptibility and/or penalties on variations of susceptibility values in regions; wherein the regularization term includes a sparsity of temporal evolution along echo time of the magnitude of the modeled complex signal; wherein minimizing the cost function includes coordinate descent along axes defined by variables of oxygenation and venous blood volume; wherein the regularization term is implicitly formed and determining the oxygen extraction fraction distribution is realized using a deep neural network; wherein the deep neural network is trained with labeled data or unlabeled data, or is untrained; wherein the deep neural network is trained with network weights updated with a test data.
In some examples, a method for generating one or more images of an object comprises obtaining multiple timeframe imaging data about tracer transport through the object; determining tracer concentration at each timeframe from imaging data according to tracer effects on image signal; estimating tracer velocity and exchange coefficient distribution of the object, comprising: determining a cost function, the cost function including a data fidelity term involving a comparison between the determined tracer concentrations and tracer concentrations according to a velocity-exchange model; and minimizing the cost function; and determining a velocity and exchange distribution based on the act of minimizing the cost function; and presenting, on a display device, one or more images of the object generated based on the determined velocity and/or exchange distribution. In some examples, the aforementioned example method may include one or more of the following particulars, singly or in any combination: the multiple timeframe imaging data is obtained using magnetic resonance imaging; the multiple timeframe imaging data is obtained using positron emission tomography; the multiple timeframe imaging data is obtained using computed tomography, or single photon emission tomography; the velocity-exchange model includes a temporal deconvolution involving an exchange coefficient; the comparison between the obtained tracer concentrations and tracer concentrations according to a velocity-exchange model comprises an Lp norm of a difference between the obtained tracer concentrations and tracer concentrations according to a velocity-exchange model; the cost function includes a regularization term imposing a sparsity on velocity and/or exchange coefficient distribution; the regularization term is implicitly formed and determining the velocity and exchange coefficient distribution is realized using a deep neural network; the deep neural network is trained with labeled data or unlabeled data, or is untrained; the deep neural network is trained with network weights updated with a test data.
In some examples, a method for generating one or more images of an object comprises obtaining multiecho complex magnetic resonance imaging data collected by a magnetic resonance scanner, wherein the multiecho data is complex and comprises magnitude and phase information regarding the object; estimating fat and water distribution of the object based on the multiecho complex data, comprising: determining a cost function, the cost function including a data fidelity term involving a comparison between the obtained multiecho complex magnetic resonance data and a complex signal of a fat-water model at each echo time wherein the fat-water model has echo time dependence in both signal magnitude and phase with the fat chemical shift contributing an additional phase; and determining fat and water distribution of the object using a deep neural network without labeled data; and presenting, on a display device, one or more images of the object generated based on the determined fat and water distribution. In some examples, the aforementioned example method may include one or more of the following particulars, singly or in any combination: the comparison between the obtained multiecho complex magnetic resonance data and a complex signal of a fat-water model at each echo time comprises an Lp norm of a difference between the obtained multiecho complex magnetic resonance data and a complex signal of a fat-water model at each echo time; the fat-water model has echo time dependence in signal magnitude with different decay rates for fat and water components; the fat chemical shift contributing an additional phase comprises a sum of contributions from a fat chemical shift spectrum; the deep neural network is trained with unlabeled data; network weights of the trained deep neural network are updated with a test data; the deep neural network is untrained; the deep neural network includes a U-net.
Details of one or more embodiments are set forth in the accompanying drawings and the description below. Other features and advantages will be apparent from the description and drawings, and from the claims.
Like reference symbols in the various drawings indicate like elements.
Implementations of systems and methods for collecting and processing MRI signals of a subject, and reconstructing maps of physical properties intrinsic to the subject (e.g., magnetic susceptibility) are described below. Some of the disclosed implementations overcome limitations in quantitative susceptibility mapping (QSM), where the computation robustness and accuracy and the Bayesian prior information affect QSM image quality and practical usage. In some implementations, to reconstruct maps of a subject's magnetic susceptibility, the computation process of finding the susceptibility distribution that optimally fits available multiecho gradient echo MRI (mGRE) data and tissue structure prior information is made accurate by fitting data at each echo time without compromising the precision in modeling data noise. In some implementations, the computation process of finding the magnetic field from mGRE data is also made accurate by fitting data at each echo time without compromising the precision in modeling data noise. In some implementations, a scaling factor is used to ensure robust initialization of the nonconvex optimization needed to deal with precise signal noise modeling. In some implementations, additional information of regions with low susceptibility contrasts that can be automatically segmented from R2* map is used to improve QSM reconstruction. In some implementations, a deep neural network is used for magnetic field estimation and QSM reconstruction. Implementations of these improvements have made QSM technology robust and accurate.
For multiecho gradient echo (mGRE) magnetic resonance imaging data obtained by a magnetic resonance scanner, the mGRE data at each echo time is complex, and its magnitude and phase information, or real and imaginary information, regarding the object is well characterized as having Gaussian noise in the complex signal domain. This mGRE data should be interpreted generally as multiple echoes with both magnitude and phase evolving with echo time. Gradient echo, spin echo, hybrid echo, and other types of pulse sequences can be used to acquire these multiple echoes. Therefore, these data may be referred to as multiecho complex (MC) data.
QSM prior art typically consists of (1) estimating the magnetic field from multiecho complex signals at all echo times and (2) performing the inversion from magnetic field to magnetic susceptibility. The first step typically focuses on signal phase, ignoring noise effects on signal magnitude; the second step invariably has difficulties with the very complex noise properties of the estimated field. Consequently, both steps contribute errors to QSM reconstruction.
To avoid these two steps and their associated errors, our invention improves upon QSM prior art by estimating a magnetic susceptibility distribution of the object based on the multiecho complex data at each echo time. We construct a cost function comprising a data fidelity term associated with a likelihood function of the Gaussian noise in the multiecho complex data at each echo time. The likelihood function for the multiecho complex data at an echo time is Gaussian, with the same variance for all echoes but the mean depending on the individual signals. The product of the likelihood functions of all echoes forms the resultant likelihood function. The negative log of the likelihood function is the cost function referred to as a data fidelity term, and the maximum likelihood estimation corresponds to the minimal cost function. It follows that the cost function for the QSM reconstruction is a sum of cost functions over all individual echoes. Extending to the Bayesian inference framework, the prior probability is the exponential function of the negative regularization term, the prior probability times the likelihood function determines the posterior probability, and the maximum a posteriori probability (MAP) estimation leads to the minimum of a cost function comprised of the data fidelity term from the likelihood and the regularization term from the prior probability. The data fidelity term involves summing a comparison between the measured signal and the modeled signal at each echo. This comparison can be a difference expressed in an Lp norm (p≥0; for example, p=2 for a Gaussian noise model). The modeled signal involves temporal evolution in both magnitude and phase. As the field-to-susceptibility inverse problem is ill-posed, additional information is needed for QSM reconstruction, in the form of a regularization term associated with structure prior information. The magnetic susceptibility distribution is then determined by minimizing the cost function.
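As a concrete illustration of this per-echo data fidelity, the following minimal sketch (assuming numpy, with hypothetical variable names: S a sequence of complex echo images, tes the echo times in seconds, and field the tissue field in Hz) sums the squared complex residual over echoes, which is the negative log-likelihood up to constants under the Gaussian noise model described above:

    import numpy as np

    def multiecho_data_fidelity(S, tes, m0, R2s, phi0, field):
        # negative log-likelihood (up to constants) under i.i.d. complex
        # Gaussian noise: one squared-residual term per echo time,
        # model_j = m0 exp(-tj R2*) exp(i(phi0 + 2*pi*field*tj))
        cost = 0.0
        for te, S_j in zip(tes, S):
            model = m0 * np.exp(-te * R2s) * np.exp(1j * (phi0 + 2.0 * np.pi * field * te))
            cost += np.sum(np.abs(S_j - model) ** 2)
        return cost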
We extend this approach of going back to the individual echo data to address the errors in the first and second steps of the QSM prior art. The first step is improved by calculating the magnetic field experienced by the tissue in the object based on modeling the multiecho complex data with consideration of noise in both magnitude and phase information at each echo time, which allows precise modeling of noise in both magnitude and phase data. The second step is improved by replacing the temporal phase evolution with echo time in the multiecho complex data with a phase factor determined by the calculated magnetic field, which allows proper estimation of noise weighting in formulating the data fidelity term. This is a nonlinear nonconvex optimization problem, which depends on initialization and may be trapped in local minima.
We devised a scaling method for solving the nonlinear nonconvex optimization problem without concern for initialization trapping. We scale the magnetic field down to be very small, which alleviates initialization problems and allows linearization of phase factors. The magnetic field is first divided by a factor before estimating the magnetic susceptibility. The resulting small susceptibility values can be searched reliably. Then, the output magnetic susceptibility is multiplied by the factor to obtain the properly valued susceptibility distribution of the object.
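A minimal sketch of this scaling strategy follows; the solver handle solve_susceptibility and the factor value are hypothetical placeholders. For small scaled fields the phase factor e^(ix) ≈ 1 + ix, so the scaled problem behaves nearly linearly:

    def scaled_inversion(field, solve_susceptibility, factor=100.0):
        # divide the field by the factor: phase factors become small, so
        # exp(ix) ~ 1 + ix and the search is insensitive to initialization
        chi_small = solve_susceptibility(field / factor)
        # multiply the output by the factor to restore proper values
        return chi_small * factor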
For tissue with fat, we solve the fat water separation problem for obtaining the magnetic field in the presence of phase and magnitude effects from chemical shifts of fat and other compounds. This fat-water separation problem is a nonlinear nonconvex optimization problem that is dependent on initialization. We devised a deep neural network approach with or without training (supervised or unsupervised training) to overcome this initialization problem. The magnetic field, the water fraction and the fat fraction in a voxel can be computed using a deep neural network.
We further improve the use of structure prior information for effective suppression of artifacts in QSM reconstruction. The concept of penalizing susceptibility variation in a region of uniform susceptibility, or zero susceptibility contrast, can be extended to include regions of small susceptibility contrast with graded penalties. A region with a susceptibility contrast slightly larger than zero is assigned a slightly lower penalty in the cost function. In this manner, estimating the magnetic susceptibility distribution includes the steps of identifying a region of interest in a tissue that has a low susceptibility contrast according to a characteristic feature, and adding a second regularization term associated with enforcing the known low susceptibility contrast in that region of interest. We use R2* values as the characteristic feature for identifying regions at various susceptibility contrasts. R2* values can be readily obtained from multiecho complex data. The region of interest at a given susceptibility contrast can be determined by similar R2* properties of contiguous voxels, and the penalty there depends on a representative R2* value in the region of interest.
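The sketch below illustrates one way such graded regions and penalties could be constructed; the R2* bin edges and the penalty weighting 1/(1 + R2*) are illustrative assumptions, not prescribed values:

    import numpy as np

    def low_contrast_regions(R2s, bins=((0.0, 5.0), (5.0, 15.0), (15.0, 25.0))):
        # bin voxels by R2* (1/s) into candidate low-contrast regions;
        # a lower representative R2* receives a stronger uniformity penalty
        masks, weights = [], []
        for lo, hi in bins:
            masks.append((R2s >= lo) & (R2s < hi))
            rep = 0.5 * (lo + hi)            # representative R2* of the bin
            weights.append(1.0 / (1.0 + rep))
        return masks, weights

    def uniformity_penalty(chi, masks, weights):
        # weighted sum of susceptibility variances inside each region
        return sum(w * np.var(chi[m]) for m, w in zip(masks, weights) if m.any())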
We have developed several solver methods to perform the cost function minimization. One solver involves preconditioning that uses small search steps in low-susceptibility regions and large search steps in high-susceptibility regions (in absolute susceptibility value, with water as the zero reference). Another solver for estimating the magnetic susceptibility distribution is realized using a deep neural network (DNN). The DNN can be trained under supervision of labeled data, including data generated from known biophysics models. The DNN can be trained with unlabeled data according to a biophysics model of the signal. The DNN can also be used without training to process a test data set by minimizing a cost function based on the biophysics model of the signal.
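A minimal sketch of such a preconditioner follows, assuming a rough susceptibility estimate chi_rough is available (e.g., from a quick fit); the 0.1 ppm threshold and the value p_high = 30 are illustrative assumptions:

    import numpy as np

    def adaptive_preconditioner(chi_rough, p_high=30.0):
        # right preconditioner P for chi = P*y: ~1 in weak-susceptibility
        # tissue, large in strong-susceptibility regions, so the solver takes
        # correspondingly small or large effective steps
        P = np.ones_like(chi_rough)
        P[np.abs(chi_rough) > 0.1] = p_high  # 0.1 ppm threshold, illustrative
        return P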
We have developed methods to improve MRI signal in regions with large R2* by acquiring zero or ultrashort echo time data immediately after RF excitation. Only a portion of k-space is acquired, and the undersampled data is reconstructed using structure consistency among images of various echo times, as expressed in sparsity or as learned by a deep neural network. This structure consistency is also used to shorten QSM data acquisition by undersampling data acquisition for all echo times.
Accurate QSM improves many applications, including mapping brain QSM with reduced shadowing and streaking artifacts, particularly in brains with large susceptibility contrasts such as from hemorrhages and tumors with angiogenic blood leakage, mapping susceptibility sources in vessel wall plaque for differentiating intraplaque hemorrhage from plaque calcification, and mapping tissue iron in the livers of patients under blood transfusion treatment. Accurate QSM will also improve mapping of the tissue oxygen extraction fraction and the fat-water fraction. Various aspects of our invention are exemplified by the specific embodiments in the following implementations of systems and methods for collecting and processing MRI signals of a subject, and reconstructing maps of physical properties intrinsic to the subject.
In general, one implementation obtains complex magnetic resonance (MR) data collected by a magnetic resonance scanner, wherein the data is complex and comprises magnitude and phase information regarding the object. The magnetic susceptibility distribution of the object is estimated from the obtained complex MR data by determining a cost function. As the magnetic susceptibility affects MRI data through the dipole-convolution-generated magnetic field, the cost function is designed to include a data fidelity term that involves the magnetic susceptibility dipole convolution modeling of the data phase. An advantage in reconstructing the magnetic susceptibility distribution is the reduction of artifacts such as those described in Eq. 2 above, which streak and shadow throughout the whole image volume and can be reduced in low-susceptibility-contrast regions that are distributed throughout the image volume. A feature, such as R2* values, is used to characterize the susceptibility contrast. A first region with the lowest feature of susceptibility contrast is the cerebrospinal fluid distributed in the brain ventricles and sulci. A second region with the second lowest feature of susceptibility contrast is the cortical gray matter adjacent to the cerebrospinal fluid. Additional regions may be considered, such as a third region with the third lowest feature of susceptibility contrast, the white matter adjacent to the cortical gray matter. Accordingly, the cost function is designed to include a regularization term involving two or more regions with the lowest features of susceptibility contrast. The susceptibility is determined by minimizing the cost function. The regularization may include penalties on susceptibility variation in the regions with the lowest features of susceptibility contrast. The regularization may be formulated implicitly through a deep neural network.
In general, an example embodiment provides a method for generating one or more images of an object, the method comprising obtaining complex magnetic resonance data collected by a magnetic resonance scanner, wherein the data is complex and comprises magnitude and phase information regarding the object, estimating a magnetic susceptibility distribution of the object based on the obtained complex magnetic resonance data, wherein estimating the magnetic susceptibility distribution of the subject comprises determining a cost function, the cost function including a data fidelity term involving modeling a magnetic susceptibility effect on complex magnetic resonance data, a regularization term involving a first region with a low feature of susceptibility contrasts, and a second region with a feature of susceptibility contrasts different from the feature of susceptibility contrasts of the first region; minimizing the cost function, determining a magnetic susceptibility distribution based on the act of minimizing the cost function, and presenting, on a display device, one or more images of the object generated based on the determined magnetic susceptibility distribution.
As examples of specific implementations, the regularization term may involve a third and/or more regions with features of susceptibility contrasts differing from features of susceptibility contrasts of the first two regions; the feature of susceptibility contrasts of the first region may be the lowest; the regularization term may involve a penalty on a variation of susceptibility values in a region, wherein the penalty may vary with the feature of susceptibility contrasts of a region; the feature of susceptibility contrasts of a region may involve decay rates of magnitude signals in voxels of the region, such as the R2* decay rate of the measured magnetic resonance signal, or an opposite or inverse of a T2* or T2 weighted image; a region may be formed by binning a group of voxels sharing a characteristic, where the characteristic is a distribution of decay rates wherein the distribution may be a rectangular or Gaussian function, and/or the characteristic may be a distribution in space; minimizing the cost function may be obtained using a preconditioning method; the regularization term may be implicitly formed and estimating the magnetic susceptibility distribution may be realized using a deep neural network, wherein the deep neural network may be trained with labeled data or unlabeled data, or is untrained, or the deep neural network may be trained with network weights updated with a test data; modeling a magnetic susceptibility effect on the obtained complex magnetic resonance data may involve a comparison between the obtained complex magnetic resonance data and a susceptibility modeled complex signal at each echo time, wherein the susceptibility modeled complex signal may include a phase involving echo time and the magnetic susceptibility dipole convolution, and/or the comparison comprises an Lp norm (p≥0, for example, p=2 for Gaussian noise in complex data) of the difference between the obtained complex magnetic resonance data and a susceptibility modeled complex signal at each echo time; modeling a magnetic susceptibility effect on the obtained complex magnetic resonance data may comprise calculating a magnetic field experienced by tissue in the object from the obtained magnetic resonance data, calculating a phase at each echo time according to the calculated magnetic field and a multiplicative scaling factor, performing a comparison involving the calculated phase with a phase of the magnetic susceptibility dipole convolution at each echo time, and dividing the determined magnetic susceptibility distribution by the scaling factor, wherein the comparison involving the calculated phase with a phase of the magnetic susceptibility dipole convolution at each echo time may comprise calculating an Lp norm of a weighted difference between an exponential factor of the calculated phase and an exponential factor of the phase of the magnetic susceptibility dipole convolution at each echo time.
In an example implementation, the data fidelity term is associated with a comparison between the obtained complex magnetic resonance imaging data and a susceptibility modeled complex signal at each echo time wherein the susceptibility modeled complex signal has echo time dependence in both signal magnitude and phase with the signal phase echo time dependence including the magnetic susceptibility dipole convolution.
The magnetic susceptibility distribution may be determined by minimizing the cost function. The minimization procedure may involve the use of preconditioning for uniform convergence over the whole imaging volume. The determination of the susceptibility distribution may be performed using a deep neural network. The data fidelity term may involve a comparison between the measured and a susceptibility modeled complex signal at each echo time, wherein the susceptibility modeled complex signal has echo time dependence in both signal magnitude and phase. An example of the complex signal model is that the signal magnitude decays exponentially with echo time at a decay rate of R2*, and the signal phase evolves linearly with echo time according to the dipole-convolution-generated magnetic field. An example of the comparison is a difference expressed in an Lp norm. The magnetic susceptibility distribution may also be determined by calculating a magnetic field experienced by the tissue in the object based on modeling the multiecho complex data with consideration of noise in both magnitude and phase information at each echo time, replacing the temporal phase evolution with echo time in the measured multiecho complex data with a phase factor determined by the calculated magnetic field, dividing the calculated magnetic field by a factor for estimating the magnetic susceptibility, and then multiplying the final output magnetic susceptibility by the factor.
In general, another implementation for generating one or more images of an object comprises obtaining complex magnetic resonance imaging data collected by a magnetic resonance scanner wherein the data is complex and constitutes magnitude and phase information regarding the object, estimating a magnetic susceptibility distribution of the object based on the complex data by determining a cost function that includes a data fidelity term involving the magnetic susceptibility dipole convolution modeling of the data phase and a regularization term involving penalties on susceptibility variation in at least two regions of low susceptibility contrasts and by determining a magnetic susceptibility distribution based on minimizing the cost function.
In general, another implementation for generating one or more images of an object comprises obtaining multiecho complex magnetic resonance imaging data collected by a magnetic resonance scanner, wherein the multiecho data is complex and constitutes magnitude and phase information regarding the object, and estimating fat and water distribution of the object based on the multiecho complex data by determining a cost function to include a data fidelity term involving a comparison between the measured and a complex signal of a fat-water model at each echo time wherein the fat-water model has echo time dependence in both signal magnitude and phase with the fat chemical shift contributing an additional phase. The cost function is minimized using a deep neural network without training of labeled data.
As examples of specific implementations, the comparison between the obtained multiecho complex magnetic resonance data and a complex signal of a fat-water model at each echo time comprises an Lp norm of a difference between the obtained multiecho complex magnetic resonance data and a complex signal of a fat-water model at each echo time; the fat-water model may have echo time dependence in signal magnitude with different decay rates for fat and water components; the fat chemical shift contributing an additional phase may comprise a sum of contributions from a fat chemical shift spectrum; the deep neural network may be trained with unlabeled data, wherein network weights of the trained deep neural network may be updated with a test data, and/or the deep neural network may be untrained; the deep neural network may include a U-net.
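To make the fat-water signal model concrete, the following sketch (assuming numpy; the six-peak fat spectrum values are illustrative, as published spectra vary, and all function and parameter names are hypothetical) evaluates the complex fat-water model at each echo time, with separate R2* decay rates for water and fat and the fat chemical shift contributing an additional phase summed over the spectrum:

    import numpy as np

    # illustrative six-peak fat spectrum (ppm relative to water, relative
    # amplitudes); published spectra vary
    FAT_PPM = np.array([-3.80, -3.40, -2.60, -1.94, -0.39, 0.60])
    FAT_AMP = np.array([0.087, 0.693, 0.128, 0.004, 0.039, 0.048])

    def fat_water_signal(W, F, R2s_w, R2s_f, field, phi0, tes, hz_per_ppm=127.7):
        # complex fat-water model per echo: separate R2* decays for water and
        # fat, shared field/phi0 phase, and an extra fat phase summed over the
        # chemical shift spectrum (hz_per_ppm ~ gamma*B0, about 127.7 at 3T;
        # sign convention depends on the phase handedness)
        signals = []
        for te in tes:
            fat_factor = np.sum(FAT_AMP * np.exp(2j * np.pi * FAT_PPM * hz_per_ppm * te))
            s = W * np.exp(-te * R2s_w) + F * np.exp(-te * R2s_f) * fat_factor
            signals.append(s * np.exp(1j * (phi0 + 2.0 * np.pi * field * te)))
        return signals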
In general, another implementation for generating one or more images of an object comprises obtaining multiecho complex magnetic resonance imaging data collected by a magnetic resonance scanner, wherein the multiecho data is complex and constitutes magnitude and phase information regarding the object; calculating a magnetic susceptibility distribution using quantitative susceptibility mapping processing of the obtained multiecho complex magnetic resonance imaging data; estimating the oxygen extraction fraction distribution of the object based on the multiecho complex data by determining a cost function that includes a data fidelity term involving a comparison between a magnitude image of the obtained multiecho complex magnetic resonance imaging data and a magnitude image of a modeled complex signal with a consideration of deoxyheme-iron effects at each echo time, and a comparison between the calculated susceptibility and a susceptibility decomposition model including a deoxyheme-iron component; and minimizing the cost function using a deep neural network.
In general, another implementation for generating one or more images of an object comprises obtaining multiecho complex magnetic resonance imaging data collected by a magnetic resonance scanner, wherein the multiecho data is complex and comprises magnitude and phase information regarding the object, calculating a magnetic susceptibility distribution using quantitative susceptibility mapping processing of the obtained multiecho complex magnetic resonance imaging data, determining oxygen extraction fraction distribution of the object based on the multiecho complex data, comprising determining a cost function, the cost function involving a comparison between a magnitude image of obtained multiecho complex magnetic resonance imaging data and a magnitude image of a modeled complex signal with a consideration of deoxyheme-iron effects at each echo time, and a comparison between the calculated susceptibility and a susceptibility decomposition model including a deoxyheme-iron component, and a regularization term on oxygen extraction fraction and venous blood volume, and minimizing the cost function, and presenting, on a display device, one or more images of the object generated based on the determined oxygen extraction fraction distribution.
In general, another implementation for generating one or more images of an object comprises obtaining multiecho complex magnetic resonance imaging data collected by a magnetic resonance scanner, wherein the multiecho data is complex and comprises magnitude and phase information regarding the object, determining oxygen extraction fraction distribution of the object based on the multiecho complex data, comprising determining a cost function, the cost function including a data fidelity term involving a comparison between the obtained multiecho complex magnetic resonance imaging data and a modeled complex signal at each echo time with a consideration of deoxyheme-iron effects on signal magnitude and phase, and a regularization term, and minimizing the cost function, and presenting, on a display device, one or more images of the object generated based on the determined oxygen extraction fraction distribution.
As examples of specific implementations, the consideration of deoxyheme-iron effects on signal magnitude and phase may include a model of deoxyheme-iron distributed in cylinders and nonblood susceptibility sources distributed diffusely in space for determining the modeled complex signal; the comparison between the obtained multiecho complex magnetic resonance data and the modeled complex signal may comprise an Lp norm of the difference of the obtained multiecho complex magnetic resonance data and the modeled complex signal. The one or more images of the object include the nonblood susceptibility distribution and/or venous blood volume; the regularization term may include a sparsity of edges in oxygenation and/or venous blood volume; minimizing the cost function may include a dipole deconvolution step and the regularization term includes a sparsity of edges in tissue susceptibility and/or penalties on variations of susceptibility values in regions; the regularization term may include a sparsity of temporal evolution along echo time of the magnitude of the modeled complex signal; minimizing the cost function may include coordinate descent along axes defined by variables of oxygenation and venous blood volume; the regularization term may be implicitly formed and determining the oxygen extraction fraction distribution is realized using a deep neural network, wherein the deep neural network may be trained with labeled data or unlabeled data, or is untrained, and/or the deep neural network is trained with network weights updated with a test data.
The modeled complex signal with a consideration of deoxyheme-iron effects on signal magnitude and phase at each echo time includes modeling deoxyheme iron as cylinders and other susceptibility sources as diffuse, and calculating the corresponding effects on signal magnitude and phase at each echo time. Deep neural networks can be used to minimize the cost function.
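The following highly simplified two-compartment sketch conveys the flavor of such a model; all names and the single frequency-offset approximation are assumptions for illustration, and the actual cylinder model integrates the signal over cylinder orientations and positions rather than using one offset:

    import numpy as np

    def two_compartment_magnitude(m0, R2t, nu, delta_omega, te):
        # nonblood tissue decays with R2t; the venous (deoxyheme-bearing
        # cylinder) compartment of volume fraction nu accrues an extra
        # frequency offset delta_omega (rad/s) set by blood oxygenation;
        # the compartments sum coherently before the magnitude is taken
        tissue = (1.0 - nu) * np.exp(-te * R2t)
        blood = nu * np.exp(-te * R2t) * np.exp(1j * delta_omega * te)
        return m0 * np.abs(tissue + blood)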
In general, another implementation for generating one or more images of an object comprises obtaining multiple timeframe imaging data about tracer transport through the object, determining tracer concentration at each timeframe from imaging data according to tracer effects on image signal, estimating tracer velocity and exchange coefficient distribution of the object by determining a cost function that includes a data fidelity term involving a comparison between the determined tracer concentrations and tracer concentrations according to a velocity-exchange model, minimizing the cost function, determining a velocity and exchange distribution based on the act of minimizing the cost function, and presenting, on a display device, one or more images of the object generated based on the determined velocity and/or exchange distribution.
As examples of specific implementations, the multiple timeframe imaging data may be obtained using magnetic resonance imaging, positron emission tomography, or computed tomography, or single photon emission tomography; the velocity-exchange model may include a temporal deconvolution involving an exchange coefficient; the comparison between the obtained tracer concentrations and tracer concentrations according to a velocity-exchange model may comprise an Lp norm of a difference between the obtained tracer concentrations and tracer concentrations according to a velocity-exchange model; the cost function may include a regularization term imposing a sparsity on velocity and/or exchange coefficient distribution; the regularization term may be implicitly formed and determining the velocity and exchange coefficient distribution is realized using a deep neural network, wherein the deep neural network may be trained with labeled data or unlabeled data, or is untrained, or the deep neural network is trained with network weights updated with a test data.
The velocity-exchange model for tracer passage through tissue is formulated in a transport equation accounting for tracer convection with a velocity and tracer binding with tissue. The cost function includes a regularization on the velocity and exchange coefficient, with the regularization formulated explicitly such as using sparsity and/or implicitly using a deep neural network. The cost function can be minimized using gradient descent or using a deep neural network.
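As an illustration of the forward direction of this velocity-exchange model, the one-dimensional sketch below (hypothetical names; the backward difference is upwind only for v ≥ 0) advects a tracer concentration with velocity v and removes tracer through exchange at rate k; fitting v and k to measured timeframes would invert this forward model:

    import numpy as np

    def simulate_transport(c0, v, k, dt, dx, n_steps):
        # advect tracer concentration with velocity v (backward/upwind
        # difference, valid for v >= 0) and remove tracer by exchange at
        # rate k: c <- c - dt*(d(v*c)/dx + k*c)
        c = c0.copy()
        frames = [c.copy()]
        for _ in range(n_steps):
            flux = v * c
            c = c - dt * (np.diff(flux, prepend=flux[0]) / dx + k * c)
            frames.append(c.copy())
        return frames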
For each general or specific implementation described above, there is a system for generating one or more images of an object, the system comprising a processor, a graphical output module communicatively coupled to the processor, an input module communicatively coupled to the processor, and a non-transitory computer storage medium encoded with a computer program, the program comprising instructions that, when executed by the processor, cause the processor to perform operations realizing the implementation.
1. Multiecho Complex Total Field Inversion (mcTFI) Method for Improved Signal Modeling in QSM
In this section, we present an improvement on constructing QSM by modeling the MRI signal more precisely, particularly accounting for noise in signal amplitude that has been ignored in the prior art.
Quantitative Susceptibility Mapping (QSM) is an MRI contrast mechanism that is capable of the mapping of the underlying tissue magnetic susceptibility. The magnetic susceptibility sources are magnetized when placed inside an external magnetic field such as the main field B0 of an MRI scanner, and the magnetic fields generated by the magnetized susceptibility sources, henceforth termed the susceptibility field, can then be measured by the MRI scanner for susceptibility map reconstruction. Accordingly, QSM can be used to study tissue susceptibility sources such as deoxyheme iron in blood, tissue nonheme iron, myelin, cartilage, and calcification. QSM robustness and reproducibility have been demonstrated for brain applications, and there are a wide range of QSM clinical applications, including deep brain stimulation target mapping, cerebral cavernous malformation monitoring, multiple sclerosis chronic inflammation imaging and MRI follow-up without gadolinium, neurodegeneration investigation in Parkinson's disease and Alzheimer's disease, and more.
Technical challenges remain for QSM in regions of low SNR, such as lesions with large susceptibility contrasts and regions near air-tissue interfaces, and these challenges become more problematic at higher field strengths. Typically, the multiecho gradient-echo (GRE) sequence is used for QSM data acquisition and the susceptibility field can be inferred as the change in the phase of the GRE images measured at the multiple echo times. QSM reconstruction comprises first estimating this susceptibility field from the GRE images, and then reconstructing the susceptibility map from the estimated field. While proper noise modeling of the estimated field is important, assumptions made during field estimation from GRE data may introduce errors. These errors can propagate into the reconstructed susceptibility map as artifacts. The MERIT method used for iterative tuning of the noise weighting helps to alleviate some of the resulting artifacts, but its implementation remains empirical.
Here we report a multiecho Complex Total Field Inversion (mcTFI) method to compute the susceptibility map directly from the acquired GRE images using an improved signal model and the Gaussian noise property of the complex data. We compared mcTFI with preconditioned Total Field Inversion (TFI), to demonstrate improvements in susceptibility reconstruction, especially in regions with low SNR.
In this work, the gradient echo signal Sj measured at echo time tj, with j = 1, . . . , #TE, is modeled as:

Sj = m0 e^(−tj R2*) e^(iϕ0) e^(i tj 2πf) [3]
where m0 is the initial magnetization, R2* is the effective transverse relaxation rate (the reciprocal of T2*), ϕ0 is the initial phase offset, and f is the susceptibility field (in a left-handed phase convention, which is equivalent to a right-handed phase convention but with opposite sign). The susceptibility field is modeled as a dipole field: it is the convolution of a susceptibility distribution χ with the magnetic field of a unit dipole. An operator D = d* is used to denote the convolution with the unit dipole field:
f = (ω0/2π) d*χ [4]

where ω0 = γB0, with B0 the main magnetic field and γ the gyromagnetic ratio. Therefore, Eq. 3 can also be written as an example of modeling a magnetic susceptibility effect on complex magnetic resonance data. The susceptibility modeled complex signal at the jth echo is:
Sj = m0 e^(−tj R2*) e^(iϕ0) e^(i tj ω0 Dχ) [5]
The goal of QSM is to reconstruct the susceptibility distribution map, χ, from the measured complex signal (GRE images) at the jth echo, Sj.
Here, we provide mcTFI to compute the susceptibility distribution directly from the complex gradient echo data, as an exemplary embodiment of a cost function comprising at least a data fidelity term comparing the measured complex signal and the susceptibility modeled signal. The data fidelity term can be associated with a Gaussian likelihood function involving a difference between the measured and a susceptibility modeled complex signal at each echo time, wherein the susceptibility modeled complex signal has time dependence in both signal magnitude and phase; together with a regularization term associated with structure prior information, we formulate the following cost function:

(y, m0, R2*, ϕ0) = argmin Σj ‖Sj − m0 e^(−tj R2*) e^(iϕ0) e^(i tj ω0 DPy)‖₂² + λ1‖MG∇(Py)‖₁ + λ2‖Mc(Py − χ̄Mc)‖₂² [6]

where χ̄Mc denotes the mean of Py over the mask Mc.
where m0, R2*, and ϕ0 are solved alongside the susceptibility map χ = Py, with P being a preconditioner. The second term is a weighted total variation designed to suppress streaking artifacts in y, where λ1 is the regularization parameter, MG is the binary edge mask reflecting the anatomy, and ∇ is the gradient operator. The third term, also known as the zero-reference term, enforces a constant susceptibility distribution in Py within a mask Mc, where λ2 is the regularization parameter.
where λ3 is the regularization parameter.
The optimization problem in Eq. 6 was solved iteratively using the Gauss-Newton (GN) method. In each GN iteration, Eq. 6 was linearized with respect to the four unknowns via first-order Taylor expansion. For computational efficiency, the linearized problem at the nth GN iteration was broken down into two subproblems using a coordinate descent approach: 1) keeping m0,n, R*2,n, and ϕ0,n fixed, yn+1 was found using the Conjugate Gradient (CG) solver (see Appendix A in the next section 1.3 for solver details):
and 2) keeping yn+1 fixed, m0,n+1, R*2,n+1, and ϕ0,n+1 were updated using a voxel-by-voxel pseudo-inversion (see Appendix B in the next section 1.3 for solver details):
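For illustration only, the alternation between these two subproblems can be sketched in Python as follows; the helper functions solve_y_cg and fit_voxelwise are hypothetical stand-ins for the Appendix A CG solver and the Appendix B voxel-by-voxel pseudo-inversion, and the stopping tolerance is an assumed choice, not the authors' setting.

    import numpy as np

    def mctfi_gauss_newton(S, P, y0, m0, R2s, phi0,
                           solve_y_cg, fit_voxelwise,
                           n_iter=20, tol=1e-3):
        # Alternating Gauss-Newton sketch for mcTFI (illustrative only).
        # solve_y_cg: callable for subproblem 1 (Eq. 7), returns updated y.
        # fit_voxelwise: callable for subproblem 2 (Eq. 8), returns m0, R2*, phi0.
        y = y0
        for _ in range(n_iter):
            # Subproblem 1: update y with m0, R2*, phi0 held fixed.
            y_new = solve_y_cg(S, P, y, m0, R2s, phi0)
            # Subproblem 2: update m0, R2*, phi0 with y held fixed.
            m0, R2s, phi0 = fit_voxelwise(S, P, y_new)
            if np.linalg.norm(y_new - y) < tol * np.linalg.norm(y):
                y = y_new
                break
            y = y_new
        return P * y, m0, R2s, phi0  # chi = P y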
The mcTFI method described above addresses important shortcomings in the prior art of field estimation and QSM. For example, in prior TFI, QSM is computed by fitting the susceptibility to the susceptibility field f, which is first estimated by a fitting that ignores the noise in the signal amplitude in Eq. 7 and solves only for ϕ0 and f:
The susceptibility map was then reconstructed according to the following prior method, which assumes Gaussian noise in the estimated field:
where w is a diagonal noise weighting matrix, typically calculated as the covariance matrix of the fitting in Eq. 9 under the assumption of Gaussian phase at each echo in a multiecho gradient echo sequence. Note that the noise in the signal magnitude may be substantial, and the noise in f may no longer be Gaussian, which could introduce model errors into the data fidelity term of the above prior method. Using a proper noise weighting in TFI can help mitigate this model error (as well as fitting errors from Eq. 9) by assigning small values to w in problematic voxels, thereby reducing their influence during the optimization. Because w itself can contain errors and may not completely account for the noise properties of the susceptibility field, MERIT is introduced to further retune w. With MERIT, at the end of the ith GN iteration, the noise weighting wi is updated to wi+1, which is used in the next GN iteration, as follows:
otherwise, where ρi = wi|DPyi−f| is the voxel-by-voxel data term residual at the ith iteration, T is the threshold determining the number of voxels whose noise weighting will be reduced, and R is the attenuation factor determining the strength of the weighting reduction. Careful selection of T and R is important: large values may cause reasonable voxels to be disregarded, and small values may not properly penalize all erroneous voxels. However, MERIT is a heuristic method, and its parameters T and R are currently determined empirically.
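For illustration, a minimal Python sketch of this retuning follows; the attenuation rule (ρ/T)**R for flagged voxels is our assumption for illustration, and the exact rule used by MERIT is the one in the MEDI open-source toolbox.

    import numpy as np

    def merit_retune(w, rho, R=2.0):
        # Illustrative MERIT-style retuning of the noise weighting w.
        # rho: voxel-wise data-term residual; T = 6*sigma per the text.
        # The attenuation by (rho/T)**R is an assumed form, not the
        # verbatim MEDI toolbox implementation.
        T = 6.0 * np.std(rho)
        w_new = w.copy()
        bad = rho > T                                # voxels flagged as unreliable
        w_new[bad] = w[bad] / (rho[bad] / T) ** R    # attenuate their weighting
        return w_new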
The proposed mcTFI method is hypothesized to be an improvement upon the TFI method because 1) mcTFI does not require fine-tuning of w, and 2) by avoiding a separate fitting of f, the Gaussian noise assumption in the mcTFI cost function still holds.
The accuracy of the proposed mcTFI algorithm was evaluated against TFI both without and with MERIT (TFI+MERIT) in a numerical brain simulation including a hemorrhage and a calcification, in patients suffering from intracerebral hemorrhage (ICH) scanned at 3T, and in healthy subjects scanned at 7T.
In TFI, P is the automated adaptive preconditioner, and λ1 = 0.001 and λ2 = 0.1 were chosen in accordance with the literature. The MERIT parameters T = 6σ, where σ is the standard deviation of the two-norm of the data term residual, and R = 2 were chosen according to the original work.
The same automated adaptive preconditioner was used in mcTFI. The following steps were taken to match the regularization parameters in mcTFI to those in TFI:
with #M the number of voxels in M. In mcTFI, the complex images, Sj were scaled by
so that
The Tikhonov regularization parameter λ3 was automatically determined for each case by first constructing an L-curve over a large range, from 10^−5 to 10^1 in increments of an order of magnitude, to determine a rough value for the L-curve corner. Then, a second L-curve was constructed within a range of one order of magnitude around this rough value, and λ3 was chosen as the corner of the second L-curve. The L-curve corner was determined using an adaptive pruning algorithm. To further improve the stability of mcTFI, m0 and R*2 were constrained to be greater than or equal to zero.
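For illustration, the coarse-to-fine λ3 search can be sketched as follows; solve is a hypothetical callable returning the (residual, penalty) pair for a given λ3, and the maximum-curvature corner detection is a simple stand-in for the adaptive pruning algorithm.

    import numpy as np

    def lcurve_corner(residuals, penalties):
        # Corner as the point of maximum discrete curvature of the
        # log-log L-curve (stand-in for the adaptive pruning algorithm).
        x, y = np.log(residuals), np.log(penalties)
        k = np.abs(np.gradient(x) * np.gradient(np.gradient(y)) -
                   np.gradient(y) * np.gradient(np.gradient(x)))
        return int(np.argmax(k))

    def two_stage_lambda3(solve):
        # Stage 1: coarse sweep 1e-5 ... 1e1 in decade steps.
        coarse = 10.0 ** np.arange(-5.0, 2.0)
        res, pen = zip(*(solve(lam) for lam in coarse))
        rough = coarse[lcurve_corner(np.array(res), np.array(pen))]
        # Stage 2: fine sweep within one order of magnitude around rough.
        fine = rough * 10.0 ** np.linspace(-0.5, 0.5, 7)
        res, pen = zip(*(solve(lam) for lam in fine))
        return fine[lcurve_corner(np.array(res), np.array(pen))]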
Proper initializations are important for solving the nonlinear optimization problems in mcTFI. Here, R*2,init was obtained using ARLO, and then m0,init was estimated as
ϕ0,init and finit were obtained via pseudo-inversion of the phase in the first two GRE echoes:
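A minimal sketch of such a two-echo pseudo-inversion follows, assuming linear phase evolution between the first two echoes; the function name is ours, and phase unwrapping (discussed next) is applied separately.

    import numpy as np

    def init_phi0_f(S1, S2, t1, t2):
        # Two-echo initialization sketch: the inter-echo phase difference
        # gives f_init; removing the field phase from echo 1 gives phi0_init.
        # Assumes the inter-echo phase difference is free of wraps.
        f_init = np.angle(S2 * np.conj(S1)) / (2.0 * np.pi * (t2 - t1))
        phi0_init = np.angle(S1 * np.exp(-1j * 2.0 * np.pi * f_init * t1))
        return phi0_init, f_init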
One challenge in initializing Eq. 6 is that the angle of a complex exponential is only defined within a range of 2π, so a large susceptibility field may cause wraps in the phase, leading in turn to wraps in the resulting susceptibility map. Such wrapping artifacts can be avoided if y is initialized without any wraps, which was done with the following steps:
A quality map guided spatial unwrapping algorithm was used for phase unwrapping.
A numerical brain phantom with calcification and hemorrhage was constructed based on the Zubal phantom. First, multiecho GRE image data (ignoring the T1 relaxation effect) were simulated according to Eq. 5 at 3T, with simulation parameters 1st TE (ms)/ΔTE (ms)/#TE/voxel size (mm3)=4.5/5/8/1×1×1, and with known tissue parameters m0 (a.u.)/R*2 (Hz)/ϕ0 (rad)/χ (ppm) set for different structures based on literature values: white matter=0.715/20/1/−0.046, grey matter=0.715/20/1/0.053, CSF=0.715/4/1/0, caudate nucleus=0.715/30/1/0.093, putamen=0.715/30/1/0.093, thalamus=0.9/20/1/0.073, globus pallidus=0.9/45/1/0.193, calcification=0.1/500/1/−2.4, ICH=0.1/150/1/2.68, air=0/0/0/9, sagittal sinus=0/45/0/0.27, and skull=0/4/0/−2. Next, random zero-mean Gaussian white noise at SNR=100 was added to the real and imaginary parts of the complex images independently. Finally, the GRE images were downsampled by a factor of 2, from which the QSM images were reconstructed using the different methods for comparison. This simulation experiment was repeated for SNR ranging from 10 to 150 to evaluate the performance of the QSM methods at different SNR levels, and at each SNR level the experiment was repeated 10 times. The accuracy of the reconstructed QSM was calculated as the mean of the
over the 10 repeats; RMSE was calculated inside the brain mask, only inside the lesions, and only outside the lesions. The lesion mask was set as the actual lesion plus 3 layers of voxels around it.
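For illustration, the noise addition and masked RMSE evaluation used in this simulation might look as follows in Python/NumPy; referencing sigma to unit signal magnitude is our assumption.

    import numpy as np

    def add_complex_noise(S, snr, rng=None):
        # Zero-mean Gaussian noise added to real and imaginary parts
        # independently; sigma = 1/SNR relative to unit signal (assumption).
        rng = rng or np.random.default_rng()
        sigma = 1.0 / snr
        return S + rng.normal(0, sigma, S.shape) + 1j * rng.normal(0, sigma, S.shape)

    def masked_rmse(chi, chi_true, mask):
        # RMSE within a mask: whole brain, lesion-only, or non-lesion.
        d = (chi - chi_true)[mask]
        return np.sqrt(np.mean(np.abs(d) ** 2))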
The 4 datasets (one brain, with and without calcification, each simulated at two different SNRs) provided in the QSM Challenge 2.0 were used to compare mcTFI, TFI, and TFI+MERIT. The results were evaluated according to the metrics of QSM Challenge 2.0, which are:
Both data and evaluation codes are available on the official QSM Challenge 2.0 website.
In vivo Brain
Eighteen ICH patients were scanned on a commercial 3T scanner (750/SIGNA, General Electric Healthcare, Waukesha, Wisconsin, USA) with a unipolar flow-compensated multiecho gradient echo sequence. The acquisition parameters were: FA=15°, FOV=25.6 cm, TE1=4.5-5.0 ms, TR=39-49 ms, #TE=6-8, ΔTE=4.6-5 ms, acquisition matrix=512×512×64-144, reconstructed voxel size=0.5×0.5 mm2, reconstructed slice thickness=1-3 mm, and BW=±62.5 kHz.
Six healthy volunteers were scanned on a prototype 7T MR scanner (MR950, Signa 7.0T, General Electric Healthcare, Waukesha, WI) using a two-channel transmit/32-channel receive head coil with a multiecho gradient echo sequence. The acquisition parameters were: FA=15°, FOV=220×176 mm2, TE1=3.81 ms, TR=45.03 ms, #TE=10, ΔTE=4.1 ms, acquisition matrix=320×320×74, reconstructed voxel size=0.5×0.5 mm2, reconstructed slice thickness=1 mm, and BW=±78.1 kHz.
The image quality of the reconstructed hemorrhage was scored semi-quantitatively based on visually assessed artifacts with a three-point scale (3=severe, 2=moderate, and 1=negligible), by an experienced radiologist. The image quality scores (presented as mean±standard deviation) were compared with each other using a two-tailed Wilcoxon paired-sample signed rank test, and a two-sided p<0.05 was deemed indicative of statistical significance.
Appendix A
Starting from Eq. 7, let Wj = m0 e^{−tj R*2} e^{iϕ0}.
First order Taylor expansion around y gives:
The regularization terms can be reformatted as follows so that the regularization terms at each j are scaled down by tj:
Gradient of E with respect to dy:
In both the λ1 and λ2 terms, since tj is the only summation variable and tj is a scalar, the above expression can be further simplified:
Setting the gradients to zero:
which can be written as:
Appendix B
First order Taylor expansion:
Gradient of E with respect to dm0, dR*2, and dϕ0:
Setting the gradients to zero:
Appendix C
The threshold, T, needs to be selected such that only the unreliable voxels in the data have their noise weighting reduced. As such, if T is smaller than the optimal value, then some reliable voxels are penalized, resulting in the loss of data; if T is larger than the optimal value, then some unreliable voxels are not penalized, allowing these voxels to continue to generate artifacts in the solution. The attenuation factor, R, determines how much the noise weightings in the voxels picked up by a given T are reduced. A large R could cause voxels to be over-penalized, leading to loss of data, and a small R could result in insufficient noise weighting reduction in unreliable voxels and insufficient suppression of artifacts. The current implementation of MERIT uses T = 6σ, where σ is the standard deviation of the two-norm of the data-term residual ρ; therefore, assuming that ρ follows a Gaussian distribution, 0.3% of the voxels are penalized by MERIT at every GN iteration. In practice, however, it is unlikely that precisely 0.3% of voxels in every GN iteration contain model errors. Furthermore, the current MERIT implementation uses R = 2, such that the noise weighting in the selected voxels is reduced by a factor of
but there is no guarantee that this reduction factor will always return a more appropriate noise weighting. An implementation of MERIT can be found in the MEDI open-source toolbox.
In Vivo Brains with Hemorrhages
The image quality scores of the reconstructed hemorrhages for TFI, TFI+MERIT, and mcTFI were 2.44±0.62, 1.78±0.35, and 1.11±0.21, respectively. There was a significant difference between the scores of TFI and TFI+MERIT (p=0.002), a significant difference between the scores of TFI and mcTFI (p<0.001), and a significant difference between the scores of TFI+MERIT and mcTFI (p<0.001).
For the healthy subjects scanned at 7T, the image quality scores for TFI, TFI+MERIT, and mcTFI were 2.86±0.24, 2.29±0.41, and 1.29±0.041, respectively. There was no significant difference between the scores of TFI and TFI+MERIT (p=0.102), a significant difference between the scores of TFI and mcTFI (p=0.002), and a significant difference between the scores of TFI+MERIT and mcTFI (p=0.012).
QSM computes the susceptibility map from a susceptibility field, typically with the field first estimated by fitting a signal model to the acquired GRE images. The fitting errors and the non-zero-mean Gaussian noise in the resulting fitted susceptibility field propagate through the subsequent dipole inversion step, causing artifacts in the final susceptibility map. The proposed mcTFI method computes the susceptibility map directly from the multiecho complex GRE images, minimizing errors in field estimation while retaining the Gaussian noise property in the data term. The mcTFI method was compared with TFI (without and with MERIT), and mcTFI demonstrated both quantitative and qualitative improvements over TFI in simulations, brains with ICH, and healthy brains scanned at 7T.
Proper biophysical modeling is important for parameter extraction in quantitative MRI. For field estimation in the presence of fat chemical shift, for example, modeling the fat chemical shift as part of the unknowns, as opposed to assuming it to be a constant value, substantially improved the quality and accuracy of the resulting estimated field. Prior to this work, the susceptibility field was first estimated via various approximations to Eq. 3, where different assumptions about the noise property of the field were made in the different approximations. For example, the TFI approach here estimated the susceptibility field by solving Eq. 9 (a simplified version of Eq. 3), in which the magnitude component of the complex images was assumed to be noiseless. The proposed mcTFI, on the other hand, solves the signal equation directly, which is a better signal model with fewer approximations and simplifications.
Another benefit of fitting the susceptibility map directly to the complex data is that the noise assumed in mcTFI is that of the acquired data. A separate field estimation commonly simplifies and alters the noise property, contributing to potential model errors in the subsequent dipole inversion step, which is typically formulated as a maximum likelihood estimation minimizing the quadratic data fidelity term ∥Dχ−f∥22, thereby causing artifacts in the final susceptibility map. A noise weighting term w needs to be introduced into the data fidelity term, ∥w(Dχ−f)∥22, to mitigate the model errors and suppress the artifacts, but the proper estimation of w remains a challenge. These noise-property-related errors are substantial in tissue lesions with large susceptibilities, such as hemorrhages, and at ultra-high field strengths such as 7T. Accordingly, mcTFI can significantly improve QSM performance for these applications, as demonstrated in this work.
Currently, for field estimation using Eq. 9, the noise weighting is calculated as the covariance matrix of the fit in Eq. 9, but it does not properly account for the noise properties, as demonstrated by the suboptimal TFI results in this work. Further retuning of w using MERIT allows better suppression of artifacts from these model errors. However, MERIT is a heuristic method, and dataset-specific MERIT parameters are required to ensure that MERIT suppresses the artifacts without introducing new ones (see Appendix C in section 1.3 above for details regarding the parameters). Looking at the examples in this work, when the MERIT parameters caused over-penalization, the streaking artifacts were effectively suppressed, but the over-penalized noise weighting in the voxels in and around the two lesions led the solver to fit the field around the lesions first, so that the lesions appeared larger in the resulting susceptibility map. In contrast, by using a more accurate model in the data term, the mcTFI results demonstrated sufficient suppression of streaking artifacts while retaining the true size of the lesions. Furthermore, the over-penalization of MERIT likely causes loss of data in TFI, resulting in a less accurate susceptibility map reconstruction in comparison to mcTFI. Similar behaviors were also demonstrated where streaking artifacts were effectively suppressed using MERIT but lesions appeared larger. On the other hand, mcTFI was able to suppress streaking artifacts while avoiding the introduction of new artifacts. In the exemplary case demonstrated here, MERIT failed to completely suppress all streaking artifacts because it did not sufficiently penalize the noise weighting in all problematic voxels; in contrast, the QSM reconstructed using mcTFI suppressed the streaking artifacts around the hemorrhage.
The current implementation of mcTFI can be readily extended in several aspects to fully embody the inventions conceptualized here. Firstly, the nonlinear and nonconvex objective function in mcTFI may have multiple local minima, and therefore its solutions may be sensitive to the choice of initial guess. There is no guarantee that the initial guess strategies used in this implementation are optimal. This may be overcome by an alternative implementation that estimates the field first. As an exemplary embodiment for modeling the measured complex data with consideration of noise in both the magnitude and phase information at each echo time, we have the following formulation to estimate the field and other model parameters from the measured complex signal at each echo time:
Then the susceptibility is determined by inverting the field using a robust initialization method with an alpha (α) scaling, which may be set to be much smaller than 1. As an exemplary embodiment for modeling a phase at each echo time according to a phase factor determined by the calculated magnetic field and the alpha scaling, we have the following formula to estimate susceptibility from the magnetic susceptibility dipole convolution modeling of the modeled phase at each echo time:
A very small α may be used to linearize the exponential function in Eq. 11, speeding up the computation with one deconvolution for all echoes (separation of echo summation and deconvolution or convolution) and overcoming initialization problems. Dividing the determined magnetic susceptibility distribution by the scaling factor α then yields the final QSM map.
Secondly, the modeled signal amplitude m0 e^{−tj R*2}
Thirdly, the zero-reference term from a region of known uniform susceptibility, ∥Mc P(y − ȳ_Mc)∥22, can be extended to penalize susceptibility variation over multiple bins of low R2*: Σg w(g)∥Mg(χ − χ̄g)∥22, where r̄g is the group average of R2* for the g-th group, w(g) a weighting such as a sigmoid function, Mg the binary mask for the g-th group with |Mg| voxels, and χ̄g the mean susceptibility within Mg.
Here [σ1, σ2] control the output range and (s1, s2) control the shape. The choices of R2* binning and sigmoid curve are motivated by connections between R2* and susceptibility contrast and by soft thresholding. w(g) shall apply a high penalty (close to 1) in regions of low susceptibility contrast (R2* close to 0), but little penalty (close to 0) in regions of high susceptibility contrast (R2* above a threshold). A step function approximating the sigmoid function, as commonly used in deep learning, may be used in Eq. 12. The penalties may be applied to the first two or three bins: the first bin of the lowest R2* corresponds to most of the cerebrospinal fluid in the whole brain, the second bin of the second-lowest R2* corresponds to most of the cortical gray matter in the whole brain, and the third bin corresponds to a large portion of white matter next to cortical gray matter, as shown in
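For illustration, one possible parameterization of the Eq. 12 sigmoid weighting is sketched below; the exact functional form and the example values of s1 and s2 are assumptions, not the authors' settings.

    import numpy as np

    def bin_weight(r2s_mean, s1, s2, sig1=0.0, sig2=1.0):
        # Sigmoid weighting w(g): near sig2 (high penalty) for low
        # group-average R2*, near sig1 (little penalty) above threshold.
        # [sig1, sig2] set the output range; (s1, s2) set the shape.
        return sig1 + (sig2 - sig1) / (1.0 + np.exp((r2s_mean - s1) / s2))

    # Example: weights for three low-R2* bins (hypothetical group means in Hz,
    # hypothetical s1 = 20 Hz, s2 = 5 Hz).
    weights = [bin_weight(r, s1=20.0, s2=5.0) for r in (5.0, 15.0, 25.0)]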
Preconditioning can be used to solve Eq. 13, as in Eqs. 4 & 11. An example of shadowing artifact reduction is illustrated in
This low-contrast fidelity strategy of penalizing variance in the low-contrast regions or bins, such as expressed by Σg w(g)∥Mg(χ − χ̄g)∥22, is applicable to QSM reconstruction in general.
Fourthly, the signal model in Eq. 5 can be further extended by considering the effects of deoxyheme iron on signal magnitude and phase. For example, the deoxyheme iron can be considered to reside in numerous cylinders of all orientations within a voxel, while nonblood susceptibility sources can be considered diffusely distributed in space. Then the magnitude signal at the jth echo at echo time tj, for a specific amount of deoxyheme iron as characterized by oxygenation Y, is
with 1F2 the generalized hypergeometric function,
and g accounts for macroscopic contributions due to the voxel sensitivity function. Here, γ is the gyromagnetic ratio (267.513 MHz/T), B0 is the main magnetic field (3T in our study), Hct is the hematocrit (0.357), Δχ0 is the susceptibility difference between fully oxygenated and fully deoxygenated blood (4π × 0.27 ppm), χba is the fully oxygenated blood susceptibility (−108.3 ppb), χnb the non-blood susceptibility, and v the venous blood volume fraction. This forms the deoxyheme-iron modeled signal magnitude. The corresponding phase at the jth echo time with deoxyheme iron effects is
Here a is the vein volume fraction in the total blood volume and is assumed to be constant (0.77), ψHb the hemoglobin volume fraction (0.0909 for tissue and 0.1197 for vein), and ΔχHb the susceptibility difference between deoxy- and oxy-hemoglobin (12522 ppb). Eqs. 14 & 15 exemplify a modeled complex signal with consideration of deoxyheme-iron effects on signal magnitude and phase at each echo time. By extending the signal model in Eq. 6 using Eqs. 14 & 15, we can formulate an optimization problem for determining the oxygen extraction fraction, OEF = 1 − Y/Ya, with Ya the arterial oxygen saturation (0.98), as
All regularizations considered before can be included in R(Y, v, χnb, s0, R2). A coordinate descent method can be used to solve Eq. 16, as exemplified in Eqs. 7 & 8. Alternatively, the dipole deconvolution may be solved first as quantitative susceptibility mapping, and then the inverse problem in Eq. 16 above is solved without spatial dipole deconvolution.
Finally, an increase of model complexity in mcTFI also comes with an increase in computation time; the Alternating Direction Method of Multipliers (ADMM), a much faster algorithm, may be used to efficiently solve the objective function in mcTFI.
In summary, this work provided the mcTFI method to directly estimate the susceptibility map from the GRE images, which is a more appropriate way to model the signal noise and bypass any errors from a separate field estimation. Compared to TFI, the provided mcTFI demonstrated improvements in susceptibility map reconstruction, with reduced streaking artifacts and improved performance in regions with low SNR.
In this section, we describe solving optimization problems in QSM and other inverse problems, such as fat-water separation, using deep neural networks.
R2*-corrected water/fat separation, which estimates fat, water, and the inhomogeneous field from gradient-recalled echo (GRE) data, is a necessary step in quantitative susceptibility mapping to remove the associated chemical shift contribution to the field. Several algorithms, including hierarchical multiresolution separation, multi-step adaptive fitting, and T2*-IDEAL, have been proposed to decompose the water/fat separation problem into linear (water and fat) and nonlinear (field and R2*) subproblems and solve these problems iteratively. Water/fat separation is a nonlinear nonconvex problem that requires a suitable initial guess to converge to a global minimum. Multiple solutions, including 2D and 3D graph cuts and in-phase echo-based acquisition, have been proposed to generate an initial guess. The performance of these methods depends on their inherent assumptions, including single-species-dominant voxels, field smoothness, or fixed echo spacing, to generate a suitable initial guess and avoid water/fat swapping.
Recently, deep neural networks (DNN) have been used to perform water/fat separation via conventional supervised training of a DNN with reference data (STD). This STD water/fat separation method does not require an initial guess, potentially allowing fewer echoes to shorten the scan time or improve SNR for the same scan time, and lessening the dependency on acquisition parameters compared to current standard approaches. However, the training of STD requires reference fat-water reconstructions (labels), which can be challenging to calculate, as discussed above.
In this work, we investigate an unsupervised training of DNN (UTD) method that uses the physical forward problem in T2*-IDEAL as the cost function during conventional training, without any need for reference fat-water reconstructions (no labels). We further investigate a no-training DNN (NTD) method that uses a cost function similar to that in unsupervised training to reconstruct fat-water images directly from a single dataset. We compare the results of the STD, UTD, and NTD methods with the current T2*-IDEAL method.
Water/fat separation and field estimation is a nonconvex problem of modeling multiecho complex GRE signal (S) in terms of water content (W), fat content (F), field (f) and R*2 decay per voxel:
where N refers to the number of echoes, Sj is the GRE signal S at echo time tj with j = 1, . . . , N, and νF is the fat chemical shift in a single-peak model. T2*-IDEAL solves Eq. 17 by decomposing it into linear (W, F) and nonlinear (f, R*2) subproblems. With an initial guess for f and R*2, the linear subproblem for W and F can be solved. With updated W and F, the nonlinear subproblem for f and R*2 is linearized through first-order Taylor expansion and solved using Gauss-Newton optimization. These steps are repeated until convergence is achieved. In this study, initial guesses for the field f and R*2 decay were generated using in-phase echoes. The subsequent solutions to Eq. 17 resulted in the reference reconstructions ΨREF(S)={W, F, f, R*2}.
In this work, we adapted the approaches in recent works, making W, F, f, and R*2 the target outputs of the network. A U-net type network Ψ(Si; θ) with network weights θ is trained on M training pairs {Si, ΨREF(Si)}, where Si and ΨREF(Si)={W, F, f, R*2} are the input complex gradient echo signal and the corresponding reference T2*-IDEAL reconstruction (reference), respectively, for the ith subject in the training dataset. The cost function for training the DNN is given by:
In the proposed method, termed unsupervised, we seek to use a deep learning framework to calculate W, F, f, and R*2 without access to reference reconstructions (labels). This is done by using the physical forward problem in Eq. 17 as the cost function during training. This cost function for training the DNN is given by:
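For illustration, a minimal TensorFlow sketch of such a physics-based (label-free) loss follows, using the single-peak model of Eq. 17 with real-valued W and F as a simplification; all names and the output layout (W, F, f, R2*) are our assumptions.

    import math
    import tensorflow as tf

    def utd_loss(S, W, F, f, R2s, te, nu_f):
        # Physics loss sketch: l2 misfit between measured multiecho complex
        # signal S (batch, x, y, echo) and the Eq. 17 forward model.
        # Simplifying assumption: W, F are real-valued magnitudes.
        te = tf.reshape(te, (1, 1, 1, -1))                 # echo times t_j
        W, F, f, R2s = (v[..., None] for v in (W, F, f, R2s))
        phase = 2.0 * math.pi * f * te                     # field-induced phase
        fat = 2.0 * math.pi * nu_f * te                    # fat chemical shift phase
        S_hat = tf.complex(W + F * tf.cos(fat), F * tf.sin(fat)) \
                * tf.exp(tf.complex(-R2s * te, phase))
        return tf.reduce_mean(tf.abs(S - S_hat) ** 2)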
Recently, a single test dataset has been used to update DNN weights in deep image prior and in fidelity-imposed network edit. This inspires the idea that DNN weights θ*, initialized randomly, may be updated on a single gradient echo dataset S to minimize the cost function in Eq. 17 on that dataset.
The resulting network weights are specific to the data S, and the resulting output Ψ(S; θ*) can be taken as the water/fat separation reconstruction of S. In contrast to the above STD and UTD, which involve conventional training data, no training is required here; the cost function is the same as that in the unsupervised training. Therefore, this method is referred to as no-training DNN (NTD).
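A minimal sketch of the NTD weight update on a single dataset S follows, reusing the utd_loss sketch above; splitting the complex input into real/imaginary channels and the optimizer choice are our assumptions, not the authors' implementation.

    import tensorflow as tf

    def ntd_reconstruct(model, S, te, nu_f, epochs=10000, lr=1e-4):
        # Fit randomly initialized network weights to one dataset S by
        # minimizing the same physics loss as in unsupervised training.
        x = tf.concat([tf.math.real(S), tf.math.imag(S)], axis=-1)
        opt = tf.keras.optimizers.Adam(learning_rate=lr)
        for _ in range(epochs):
            with tf.GradientTape() as tape:
                W, F, f, R2s = tf.unstack(model(x), axis=-1)
                loss = utd_loss(S, W, F, f, R2s, te, nu_f)  # Eq. 17 misfit
            grads = tape.gradient(loss, model.trainable_variables)
            opt.apply_gradients(zip(grads, model.trainable_variables))
        return tf.unstack(model(x), axis=-1)   # W, F, f, R2* specific to S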
The network Ψ(Si; θ) was a fully 2D convolutional neural network with encoding and decoding paths (
Data was acquired in healthy volunteers (n=12) and patients (n=19), including thalassemia major (n=11), polycystic kidney disease (n=7), and suspected iron overload (n=1). The study was approved by the Institutional Review Board and written informed consent was obtained from each participant.
Two 1.5T GE scanners (Signa HDxt, GE Healthcare, Waukesha, WI) with an 8-channel cardiac coil were used to acquire data. The healthy subjects were imaged on both scanners using identical protocols. Patients were scanned on one scanner, selected at random, using the same protocol. This protocol contained a multiecho 3D GRE sequence with the following imaging parameters: number of echoes=6, unipolar readout gradients, flip angle=5°, ΔTE=2.3 msec, TR=14.6 msec, acquired voxel size=1.56×1.56×5 mm3, BW=488 Hz/pixel, reconstruction matrix=256×256×(32-36), ASSET acceleration factor=1.25, and acquisition time of 20-27 sec.
Multiple experiments were performed to assess the performance of DNNs in water/fat separation and how the supervised (STD), unsupervised (UTD), and no-training (NTD) methods compare against the T2*-IDEAL reference method. The network architecture for Ψ(Si; θ) was identical between the three DNN methods. Parameters in STD and UTD included 2000 epochs, a batch size of 2, and learning rates of 0.001 for STD and 0.0001 for UTD; the learning rates were experimentally found to produce optimal results. NTD parameters included 10000 epochs, a batch size of 2, and a learning rate of 0.0001.
The supervised and unsupervised DNNs were trained on the combined dataset of healthy subjects (2 scans each) and patients. This data of n=43 scans was split into testing (256×256×61×6) and training (256×256×1522×6), with 80% of the latter used for training and the rest for validation. The weights corresponding to the lowest validation loss during training were selected as the optimal weights to be used during testing. Training time for each epoch was ~60 seconds. The testing data comprised two datasets, one healthy subject and one patient with iron overload. In the test data, ROIs were drawn on the proton density fat fraction map, the field map, and the R2* map in several regions, including the liver, adipose and visceral fat, aorta, spleen, kidney, and vertebrae. The ROI measurements for each DNN method were compared with those measured on the reference T2*-IDEAL maps using correlation analysis. Reference generation with T2*-IDEAL was performed on CPU (Intel, i7-5820k, 3.3 GHz, 64 GB) using MATLAB (MathWorks, Natick, MA), and all DNN trainings were performed on GPU (NVIDIA, Titan XP GP102, 1405 MHz, 12 GB) using Keras/TensorFlow.
Our results demonstrate the feasibility of an unsupervised deep neural network (DNN) framework, with conventional training and without training, to solve the water/fat separation problem. The proposed unsupervised training DNN (UTD) method does not require reference images as in supervised training DNN (STD), allowing the use of DNNs for training data that are unlabeled but for which the physical model is known. The no-training DNN (NTD) method further allows DNN reconstruction of a single dataset (subject) for which a physical model is known.
For the nonlinear nonconvex water/fat separation problem, the reference T2*-IDEAL method uses traditional gradient descent optimization and is dependent on the initial guess. This problem may be alleviated using deep learning, as long as the labeled training data are sufficiently large to capture test data characteristics. Labeled data may be difficult to obtain, as in the water/fat separation problem. Unlabeled data are easier to obtain, but it remains difficult to ensure that the training data do not lack test data characteristics. Accordingly, reconstruction directly from test data without training is desirable, as in the reference T2*-IDEAL but without its dependence on initialization. This can be achieved using the proposed NTD.
That the NTD method can overcome the initialization dependence of traditional gradient-descent-based nonconvex optimization begs some intuitive explanation or interpretation, though a rigorous explanation is currently not available. The cost function in a DNN is minimized by adjusting network weights through backpropagation, which is achieved through iterative stochastic gradient descent (SGD). Perhaps SGD allows escaping local traps encountered in traditional gradient descent, and the intensive computation in updating network weights facilitates an exhaustive search for a consistent minimum. The network weights updated on a single test dataset may converge, as demonstrated in fidelity-imposed network edit and in deep image prior. Our data suggest that NTD can converge to a consistent minimum without initialization dependence for the nonlinear nonconvex water/fat separation problem.
While some image details in the STD, UTD, and NTD methods differed from the reference images, quantitative measurements in the liver consist of ROI measurements over several voxels and regions, which are less dependent on image details. The network architecture can be further optimized for this problem. For instance, we found that changing the activation function (ReLU to sigmoid) significantly improved the field and R2* results in unsupervised training, while little change was observed in the water and fat maps. Since field and R2* have a nonlinear relationship with respect to the input complex signal while water and fat are linear, there is the possibility of designing task-specific blocks to learn the linear and nonlinear mappings during training for further improvement, potentially decreasing the number of learned weights.
The exemplary implementation in this study can be readily extended and improved in the following manners. A ready extension is to apply the DNN methods, including STD, UTD, and NTD, to solve the QSM inverse problem. For example, the UTD method Eq. 19 can be updated with the signal model in Eq. 5 for QSM reconstruction Ψ=(m0, R*2, ϕ0, χ): S̃j(Ψ) = m0 e^{−tj R*2} e^{iϕ0} e^{i tj ω0 Dχ}.
Then Eq. 21 describes unsupervised training for deep learning reconstruction of QSM using multiecho complex data as detailed in the above section 2. Similarly, Eqs. 18 & 20 can be extended in the above manner for QSM reconstruction using deep learning.
Another extension is to apply the DNN methods, including STD, UTD, and NTD, to solve the transport inverse problem, quantitative transport mapping (QTM), from multiple timeframes of time-resolved tracer passage through tissue. DNN methods offer particular advantages in denoising the poorly conditioned transport inversion problem for targeted tracers, including targeted theranostic agents and other targeted agents. The tracer concentration in a voxel r at time t, c(r, t), comprises an arterial flow or convection component with velocity u(r) and concentration ca(r, t), and a tissue-bound component with concentration c1(r, t). The transport equations of mass flux are:
∂t ca(r,t) = −∇·(ca(r,t) u(r)) − k1 ca(r,t) + k2 c1(r,t),
∂t c1(r,t) = k1 ca(r,t) − k2 c1(r,t), [22]
resulting in an example of the transport equation of the tracer velocity-exchange model for the observed concentration:
∂t c(r,t) = −∇·[(1 + T(k1,k2;t)*)^{−1} c(r,t) u(r)]. [23]
Here, the temporal convolution operator T(k1, k2; t), which accounts for transient binding or exchange effects characterized by exchange coefficients k1 and k2 (which may be equal for some tracers), is defined by solving the second part of the transport equation:
c1(r,t) = T(k1,k2;t) * ca(r,t) = k1 e^{−k2 t} * ca(r,t),
which allows a straightforward sequential calculation of (1 + T(k1,k2;t)*)^{−1} c(r, δti) for i = 1, . . . , n, with t = δtn, where δt is the imaging time frame. Then the observed signal ∂tci(r, δtj) = Sji of the ith training subject is fitted to the transport equation physics model S̃j(Ψ) = −∇·[(1 + T(k1,k2; δtj)*)^{−1} c(r, δtj) u(r)], which forms the velocity and exchange coefficient modeled transport equation. The DNN methods can be applied to reconstruct Ψ = (k1, k2, u(r)). For example, the NTD method Eq. 20 can be updated as
Here the regularization R(k1, k2, u(r)) can be a sparsity regularization formulated with the L1 norm, which is effective for denoising in image processing and reconstruction; other regularizations may also be considered, including smooth regularization using the L2 norm. The DNN approach to solving the inverse transport problem of determining transport parameters from space-time-resolved tracer concentration data can also be advantageously applied to other transport situations, including non-binding tracers with the transport equation simplified to ∂t c(r,t) = −∇·(c(r,t) u(r)), non-binding diffusing tracers with the transport equation ∂t c(r,t) = −∇·(c(r,t) u(r)) + ∇·(D(r)∇c(r,t)), and binding tracers with multiple compartments, where the temporal convolution operator is extended with more exchange coefficients. The QTM problem Eq. 25 can be solved without using DNN, as in Eq. 6. Noise propagation through the inversion can be easily overcome using DNN. Simulated data can be used to implement the STD method, or an unsupervised approach can be used to implement the UTD method by minimizing EQTM over all training cases. DNN training may also provide regularization implicitly, without a specific L1 or L2 formulation. An example of the QTM-processed velocity of the liver is illustrated in
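For illustration, the sequential evaluation of (1 + T(k1,k2;t)*)^{−1} c described above can be sketched as follows; the rectangle-rule discretization of the exponential-kernel convolution is an assumed choice.

    import numpy as np

    def apply_inverse_binding(c, k1, k2, dt):
        # Sequentially compute ca = (1 + T*)^(-1) c, frame by frame, using
        # c = ca + T*ca with (T*ca)(t) = (k1 exp(-k2 t) * ca)(t).
        n = c.shape[-1]                        # number of time frames
        ca = np.zeros_like(c)
        for i in range(n):
            # discrete convolution over frames before frame i
            conv = sum(k1 * np.exp(-k2 * (i - m) * dt) * ca[..., m] * dt
                       for m in range(i))
            ca[..., i] = c[..., i] - conv      # ca = c - T*ca
        return ca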
Another extension is to apply the DNN methods, including STD, UTD, and NTD, to solve the inverse problem of mapping the oxygen extraction fraction (OEF) from multiecho gradient echo complex data. The OEF problem formulated in Eq. 16 can be solved using the DNN method with the following cost function: Ψ=(Y, v, χnb, s0, R2), S̃j(Ψ) = FqBOLD(Y, v, χnb, s0, R2, tj) e^{iϕj}, with ϕj the deoxyheme-iron modeled phase at the jth echo time (Eq. 15).
DNN can be used to formulate the regularization in training using synthetic data and/or in vivo data. The DNN framework can be used to solve the inverse problem.
Alternatively, the multiecho phase images are processed using QSM according to Eq. 6 or Eq. 21 as described above. The multiecho magnitude images are processed according to the model in which deoxyheme iron resides in cylinders and other susceptibility sources are diffuse, i.e., the magnitude signal for the jth echo at echo time jΔTE is
|sj|=FqBOLD(Y,v,χnb, s0, R2, jΔTE), [27]
For determining the oxygen extraction fraction OEF = 1 − Y/Ya, with Ya the arterial oxygen saturation (0.98), the reconstruction problem from multiecho data is Ψ=(Y, v, χnb, s0, R2):
S̃j(Ψ) = |sj| = FqBOLD(Y, v, χnb, s0, R2, jΔTE) ∀ j = 1, 2, . . . , N, and w·S̃N+1(Ψ), with w the weight accounting for the different noise property from |sj|. S̃N+1(Ψ) forms the deoxyheme-iron modeled tissue magnetic susceptibility. DNN can be used to perform this reconstruction from the multiecho magnitude signal Sj ∀ j = 1, 2, . . . , N and QSM = SN+1. For example, the NTD method Eq. 20 can be updated as
Here the regularization R(Y, v, χnb, s0, R2) can be a sparsity regularization formulated with the L1 norm, which is effective for denoising in image processing and reconstruction. The OEF problem defined by the cost function Eq. 29 can be solved without using DNN, as in Eq. 6. Noise propagation through the inversion can be easily overcome using DNN, which is known to have denoising properties. Simulated data can be used to implement the STD method, or an unsupervised approach can be used to implement the UTD method by minimizing EOEF over all training cases. An example of an OEF map of the brain is illustrated in
In general, a ready improvement of the DNN exemplified in
Another extension is to allow different R2* decays for water and fat in the water/fat separation and field estimation problem. As an example, Eq. 17 can be updated by modeling the multiecho complex GRE signal (S) in terms of water content (W), fat content (F), field (f), R*2W for the water component decay, and R*2F for the fat component decay:
Different R2* decay rates for the fat and water components are important in certain biomedical imaging situations, such as a voxel containing a mixture of marrow fat and bone water. Eqs. 18-20 can then be updated accordingly for the fat-water separation problem.
The above Eq. 30 for the fat-water signal model can be further extended by including a spectrum of chemical shifts. For example, the single-peak factor e^{−i2πνF tj} can be replaced by a weighted sum over multiple spectral peaks.
While the input data are complex, the network only accepts real values, so two separate input channels were used instead. This can potentially change the noise properties and lead to suboptimal network performance, which could be addressed by using complex-valued networks. While the networks in both the supervised (STD) and unsupervised (UTD) methods were trained with specific acquisition parameters, it is possible to generalize by including more cases with different acquisition parameters during training. Generation of reference images requires solving the T2*-IDEAL problem, which can be challenging depending on the acquisition protocol; the advantage of the proposed unsupervised methods is that they only require the complex input signal for training. While the NTD method requires a large number of epochs to converge, which is computationally expensive, one could use a previously trained network (trained on data with the same imaging parameters) and update the weights for the new dataset. Only one scanner manufacturer/model was used for this study, and including additional models, manufacturers, and field strengths will help to further generalize. A limited number of cases was used for training the STD and UTD methods in this study, and including more cases will improve network performance and reliability.
In summary, we demonstrated the feasibility of using unsupervised DNNs to solve the water/fat problem with very good agreement compared to reference images.
In some cases, one or more of the above quantitative susceptibility mapping techniques can be implemented using the process 800 shown in
In some implementations, the process 800 can be used to transform MRI signal data corresponding to a subject into susceptibility-based images that quantitatively depict the structure and/or composition of the subject. As an example, in some cases, the process 800 can be used to obtain MRI data corresponding to a subject and process this MR data to generate a quantitative susceptibility map of the subject. Using this quantitative susceptibility map, one or more susceptibility-based images of the subject can be generated and displayed to a user. The user can then use these images for diagnostic or experimental purposes, for example to investigate the structure, composition, and/or function of the subject, and/or to diagnose and/or treat various conditions or diseases based, at least in part, on the images. As the process 800 can result in susceptibility-based images having higher quality and/or accuracy compared to other susceptibility mapping techniques, implementations of the process 800 can be used to improve a user's understanding of a subject's structure, composition, and/or function, and can be used to improve the accuracy of any resulting medical diagnoses or experimental analyses.
The process 800 begins by acquiring magnetic resonance (MR) data corresponding to a subject (step 810). In some cases, the MR data can correspond to a patient (e.g., the entirety of a patient or a portion of a patient, such as a particular portion of the patient's body). In some cases, the MR data can correspond to one or more samples (e.g., an imaging phantom, a sample of one or more materials, a sample of one or more types of tissue, and/or one or more other objects).
In some implementations, the MR data can be acquired using an MRI scanner using one or more suitable pulse sequences. As an example, in some cases, MR data can be acquired using a gradient echo sequence that acquires MR data at a single echo time or at multiple echo times (e.g., two, three, four, five, and so forth). Various scan parameters can be used. As another example, MR data can be obtained using a 3D multiecho spoiled gradient echo sequence on a 3T scanner (e.g., a GE Excite HD MR scanner) using a pulse sequence that samples at different echo times (TE) (e.g., 4 TEs, such as 3.8, 4.3, 6.3 and 16.3 ms) in an interleaved manner, and with the following imaging parameters: repetition time (TR)=22 ms; voxel size=0.5×0.5×0.5 mm3, matrix size=256×64×64; bandwidth (BW)=±31.25 kHz; flip angle=15°. As another example, MR data can be obtained using a 3D multi-gradient echo sequence on a 3T scanner using imaging parameters TE/ΔTE/#TE/TR/FA/BW/FOV/matrix size=5 ms/5 ms/8/50 ms/20°/±62.50 kHz/21.8×24×12.8 cm3/232×256×64. Although example sequences and example parameters are described above, these are merely illustrative. In practice, other sequences and parameters can be used, depending on various factors (e.g., the size of the region to be examined, a known or assumed range of susceptibility values of the subject, scan time considerations, device limitations, and so forth).
After acquiring MR data corresponding to a subject, the process 800 continues by determining a magnetic field based on the MR data (step 820). As noted above, in some cases, the MRI signal phase is affected by the magnetic field corresponding to the magnetic susceptibility distribution of the subject and the chemical shift of tissue components.
In some implementations as illustrated in
In some implementations as illustrated in
The process 800 then continues by determining prior knowledge about the tissue magnetic susceptibility distribution (step 850). One estimation of the tissue susceptibility distribution uses R2* values derived from the magnitude signal. High R2* values can be used to identify high-susceptibility regions, such as hemorrhages, for preconditioning to accelerate the convergence of the numerical optimization. An example of preconditioning is to separate the whole image volume into a region of high susceptibility (including hemorrhages, air regions, and background) and a region of normal tissue susceptibility. The low (near zero) R2* regions can also be used for identifying regions of low susceptibility contrast, such as cerebrospinal fluid in the ventricles of the brain, oxygenated arterial blood in the aorta, or regions of pure fat in the abdominal wall. As water and fat have known susceptibility values, they can serve as zero references (to water) to generate absolute susceptibility values using a minimal variance regularization and to suppress streaking and shadowing artifacts.
After obtaining the susceptibility prior information and the signal data noise property, the process 800 continues by estimating a magnetic susceptibility distribution of the subject based, at least in part, on the prior information and the data noise property (step 860). As described above, estimating the susceptibility distribution of the subject can include determining a cost function corresponding to the susceptibility distribution, the magnetic field or the multiecho complex data, and masks corresponding to regions of interest. The cost function in some implementations includes a data fidelity term, based on the data noise property, expressing the relationship between tissue susceptibility and the magnetic field in an integral or differential form, or expressing the relationship between tissue susceptibility and the multiecho complex data, and a regularization term expressing prior information. The estimated susceptibility distribution of the subject can be determined by identifying a particular susceptibility distribution that minimizes one or more of the cost functions described above; in some cases, this can be determined numerically.
After estimating a magnetic susceptibility distribution of the subject, the process 800 continues by generating one or more images of the subject based on the estimated susceptibility distribution of the subject (step 870). These images can be electronically displayed on a suitable display device (e.g., an LCD display device, LED display device, CRT display device, or other display device that can be configured to show images) and/or physically displayed on a suitable medium (e.g., printed, etched, painted, imprinted, or otherwise physically presented on a sheet or paper, plastic, or other material).
In some cases, the estimated susceptibility distribution of the subject can be visualized using a color scale, where each of several colors is mapped to a particular susceptibility value or range of susceptibility values. Accordingly, a two-dimensional or three-dimensional image can be generated, where each pixel or voxel of the image corresponds to a particular spatial location of the subject, and the color of that pixel or voxel depicts the susceptibility value of the subject at that location.
In some cases, the color scale can include a gradient of colors and/or a gradient of gray shades (e.g., a gray scale) in order to depict a range of susceptibility values. In some cases, for a color scale that includes a gradient of colors or a gradient of gray shades, one end of the gradient can correspond to the lowest susceptibility value in a particular window of susceptibility values (e.g., an arbitrarily selected window of values), and the other end of the gradient can correspond to the highest susceptibility value in the window of values. For instance, for a gray scale that includes a gradient of gray shades between pure white and pure black, pure white can be used to indicate the highest susceptibility value in a particular arbitrary window of values, and pure black can be used to indicate the lowest susceptibility value in the window of values. Other relationships between a color/gray scale and susceptibility values are also possible. For example, in some cases, pure white can instead be used to indicate the lowest susceptibility value in the window of values, and pure black the highest. Although these examples are described in the context of gray scales, in practice, similar relationships between color scales and susceptibility values are also possible.
Susceptibility values and colors can be mapped linearly (e.g., such that each absolute change in susceptibility value corresponds to a proportional change in color), logarithmically (e.g., such that each exponential change in susceptibility value corresponds to a linear change in color), or according to any other mapping. Although example mappings between color scales and susceptibility values are described above, these are merely illustrative examples. In practice, other mappings are also possible, depending on the implementation.
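As an illustration of such a linear mapping, a susceptibility map could be rendered to gray shades as follows; the function name and the example window limits are arbitrary choices, as noted in the text.

    import numpy as np

    def susceptibility_to_gray(chi, lo, hi):
        # Linear window [lo, hi] to 8-bit gray: lo -> black (0),
        # hi -> white (255); values outside the window are clipped.
        g = (np.clip(chi, lo, hi) - lo) / (hi - lo)
        return (255.0 * g).astype(np.uint8)

    # Example with a hypothetical display window of -0.2 to 0.5 ppm:
    # img = susceptibility_to_gray(chi_map, -0.2, 0.5)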
Implementations of the above described techniques can be performed using a computer system.
The system 900 includes a processor 910, a memory 920, a storage device 930, and an input/output device 940. Each of the components 910, 920, 930, and 940 can be interconnected, for example, using a system bus 950. The processor 910 is capable of processing instructions for execution within the system 900. In some implementations, the processor 910 is a single-threaded processor. In some implementations, the processor 910 is a multi-threaded processor. In some implementations, the processor 910 includes a graphics processing unit. In some implementations, the processor 910 is a quantum computer. The processor 910 is capable of processing instructions stored in the memory 920 or on the storage device 930. The processor 910 may execute operations such as performing one or more of the techniques described above.
The memory 920 stores information within the system 900. In some implementations, the memory 920 is a non-transitory computer-readable medium. In some implementations, the memory 920 is a volatile memory unit. In some implementations, the memory 920 is a non-volatile memory unit.
The storage device 930 is capable of providing mass storage for the system 900. In some implementations, the storage device 930 is a non-transitory computer-readable medium. In various different implementations, the storage device 930 can include, for example, a hard disk device, an optical disk device, a solid-state drive, a flash drive, magnetic tape, or some other large capacity storage device. In some implementations, the storage device 930 may be a cloud storage device, e.g., a logical storage device including multiple physical storage devices distributed on a network and accessed using a network. In some examples, the storage device may store long-term data. The input/output device 940 provides input/output operations for the system 900. In some implementations, the input/output device 940 can include one or more of network interface devices, e.g., an Ethernet card, a serial communication device, e.g., an RS-232 port, and/or a wireless interface device, e.g., an 802.11 card (e.g. 802.11ax), a 3G wireless modem, a 4G wireless modem, a 5G wireless modem, etc. A network interface device allows the system 900 to communicate, for example, transmit and receive data. In some implementations, the input/output device can include driver devices configured to receive input data and send output data to other input/output devices, e.g., a keyboard, a mouse, a printer, a sensor (e.g., a sensor that measures component or system-related properties, a sensor that measures environmental-related properties, or other types of sensors), and a display device 960. In some implementations, mobile computing devices, mobile communication devices, and other devices can be used.
A computing system can be realized by instructions that upon execution cause one or more processing devices to carry out the processes and functions described above, for example, storing, maintaining, and displaying information. Such instructions can include, for example, interpreted instructions such as script instructions, or executable code, or other instructions stored in a computer readable medium. A computing system can be distributively implemented over a network, such as a server farm, or a set of widely distributed servers or can be implemented in a single virtual device that includes multiple distributed devices that operate in coordination with one another. For example, one of the devices can control the other devices, or the devices may operate under a set of coordinated rules or protocols, or the devices may be coordinated in another fashion. The coordinated operation of the multiple distributed devices presents the appearance of operating as a single device.
Although an example processing system has been described in
The term “processing module” may encompass all apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, or multiple processors or computers. A processing module can include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them.
A computer program (also known as a program, software, software application, script, executable logic, or code) can be written in any form of programming language, including compiled or interpreted languages, or declarative or procedural languages, and it can be deployed in any form, including as a standalone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program does not necessarily correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
Computer readable media suitable for storing computer program instructions and data include all forms of non-volatile or volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks or magnetic tapes; magneto optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry. Sometimes a server is a general purpose computer, and sometimes it is a custom-tailored special purpose electronic device, and sometimes it is a combination of these things. Implementations can include a back end component, e.g., a data server, or a middleware component, e.g., an application server, or a front end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back end, middleware, or front end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), e.g., the Internet.
We present some experimental implementations and results, exemplifying the inventions disclosed above, particularly those related to Eqs. 22-25.
We train a deep neural network to estimate perfusion parameters: perfusion F, permeability Ktrans, capillary blood volume Vp, and extravascular extracellular space volume Ve from time-resolved 4D contrast-enhanced images, such as DCE MRI images. In order to generate training data, a set of vasculatures containing arteries, capillaries, and veins was generated based on the constrained constructive optimization (CCO) method. Perfusion parameter maps were assumed to follow a mixed Gaussian distribution. Tracer propagation inside the vasculature and extravascular space was simulated using the transport equation with a parabolic flow assumption, ignoring diffusion. Our test dataset included three parts: the first part is a set of concentration profiles generated using the same rules as the training dataset; the second part is the concentration profile in a real tumor vasculature; the third part is human DCE MRI images. Details on training and testing data generation are described in the next section.
A 3D U-net was chosen as the network architecture (
We considered the vasculature and tracer propagation inside a 64×64×64 mm3 volume. Perfusion F, permeability Ktrans, capillary blood volume Vp, and extravascular extracellular space volume Ve were assumed to follow a mixed Gaussian distribution:
Here x = [x, y, z] is the 3D coordinate, N0 is the number of Gaussian kernels and was set to 100, xi are the kernel centers, and Ri are the kernel radii, chosen from a uniform distribution between 2 mm and 10 mm. Ktrans, Vp, and Ve were calculated using the same method. Afterwards, F, Ktrans, Vp, and Ve were scaled to 0-120 mL/100 g/min, 0-0.1/min, 0.02-0.05, and 0.3-0.6, respectively. Representative generated F, Ktrans, Vp, and Ve maps are shown in
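For illustration, generating one such mixed-Gaussian parameter map might look as follows; treating Ri as the Gaussian standard deviation and the unit kernel amplitude are our assumptions.

    import numpy as np

    def mixed_gaussian_map(shape_mm=(64, 64, 64), n_kernels=100, rng=None):
        # Sum of randomly placed Gaussian kernels with radii 2-10 mm.
        rng = rng or np.random.default_rng()
        grid = np.stack(np.meshgrid(*[np.arange(s) for s in shape_mm],
                                    indexing="ij"), axis=-1).astype(float)
        m = np.zeros(shape_mm)
        for _ in range(n_kernels):
            xi = rng.uniform(0, np.array(shape_mm, dtype=float))  # kernel center (mm)
            Ri = rng.uniform(2.0, 10.0)                           # kernel radius (mm)
            m += np.exp(-np.sum((grid - xi) ** 2, axis=-1) / (2.0 * Ri ** 2))
        return m

    def rescale(m, lo, hi):
        # Scale a map to its physiological range, e.g. F to 0-120 mL/100 g/min.
        return lo + (hi - lo) * (m - m.min()) / (m.max() - m.min())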
Here, criterion 1 is satisfied if there is no other terminal inside the grid that P1 belongs to. Criterion 2 is satisfied if the distance between vessel segment Si and P1 is smaller than the threshold d1=5 mm. While minimizing the total vasculature volume, we assumed that the cube of the vessel diameter is proportional to the flow. The proportionality constant was set to one, as only relative vessel volume matters here. The venous vasculature was constructed separately using the same method.
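The two acceptance criteria and the diameter-flow rule admit a direct implementation; the following sketch assumes terminals are tracked per grid cell, with helper names that are illustrative only.

    # Sketch of the terminal-acceptance criteria and the diameter-flow rule
    # (NumPy). The grid bookkeeping and helper names are assumptions.
    import numpy as np

    def criterion_1(p1, occupied_grids, grid_size=1.0):
        # Satisfied if no other terminal occupies the grid cell containing P1.
        cell = tuple((np.asarray(p1) // grid_size).astype(int))
        return cell not in occupied_grids

    def point_segment_distance(p, a, b):
        # Euclidean distance from point p to the segment with endpoints a, b.
        ab, ap = b - a, p - a
        t = np.clip(np.dot(ap, ab) / np.dot(ab, ab), 0.0, 1.0)
        return np.linalg.norm(p - (a + t * ab))

    def criterion_2(p1, segment, d1=5.0):
        # Satisfied if segment S_i passes within d1 = 5 mm of P1.
        a, b = segment
        return point_segment_distance(np.asarray(p1, float),
                                      np.asarray(a, float),
                                      np.asarray(b, float)) < d1

    def diameter_from_flow(q):
        # Diameter cubed proportional to flow, proportionality constant one.
        return q ** (1.0 / 3.0)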
Based on criterion 1, within each grid ε with capillary vascular volume Vp(ε) there exists exactly one artery terminal and one vein terminal, carrying the same flow F(ε). Capillaries were then constructed to connect each artery-vein terminal pair. We assumed the capillaries stay inside the same grid as their terminals and that the flow inside the capillaries is plug flow; thus we only need to consider the length, radius, and velocity of each capillary rather than its morphology and bifurcation structure. A random integer N1, drawn from a uniform distribution between 100 and 300, was chosen as the number of capillaries between the artery and vein terminals. Taking the radius, length, and flow of the first capillary as a reference, N1−1 random numbers from a uniform distribution between 0.8 and 1.2 were chosen as the radii relative to the first capillary. The same process was repeated to determine the relative lengths and flows. Next, we scaled the radii of the capillaries so that the average was 4.5 μm, scaled the flow velocities so that the total flow was F(ε), and scaled the lengths so that the total vascular volume equals Vp(ε). In this way, a fully connected vascular network was constructed. As both the arterial and venous vasculatures are unidirectional trees and the flow at each terminal is known, the flow and velocity inside each vessel segment can be determined.
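A sketch of this scaling procedure, with schematic units and illustrative variable names, is given below.

    # Sketch of capillary bundle construction between one artery-vein terminal
    # pair (NumPy). Units are left schematic; names are illustrative.
    import numpy as np

    def build_capillaries(F_eps, Vp_eps, rng):
        n1 = rng.integers(100, 301)                # number of capillaries
        rel = lambda: np.concatenate(([1.0], rng.uniform(0.8, 1.2, n1 - 1)))
        radii, length, flow = rel(), rel(), rel()  # relative to capillary 1

        radii *= 4.5e-3 / radii.mean()             # average radius 4.5 um (mm)
        area = np.pi * radii ** 2
        flow *= F_eps / flow.sum()                 # total flow = F(eps)
        velocity = flow / area                     # plug flow: u = Q / (pi r^2)
        length *= Vp_eps / (area * length).sum()   # total volume = Vp(eps)
        return radii, length, velocity, flow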
After the flow for each vessel segment was determined, tracer propagation in the vascular network was simulated as follows. Ignoring diffusion inside the vessel network, there is only axial velocity in each vessel segment. On the arterial side we assumed parabolic flow. Within one segment, the concentration c at each point can be expressed as:
Here x is the distance from the point to the start of the cylinder, taking the velocity direction as positive, r is the distance from the point to the axis of the cylinder, and u is the velocity. At each bifurcation point, we assume the concentration is preserved:
In the capillaries we assumed plug flow. Apart from transport in the axial direction, we also considered the exchange between the capillaries and the extravascular space. Each capillary was further divided into 10 elements. Within each element ξ inside voxel ε, the tracer concentration was simulated as follows:
Here ξ−1 denotes the neighboring element immediately upstream of element ξ. We assumed a uniform concentration distribution in the extravascular space of each voxel ε. The size of voxel ε is set to a 1 mm3 cube, consistent with the resolution of the CCO method.
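Since the element-update equations are not reproduced here, the following sketch shows one plausible explicit discretization consistent with the stated assumptions (upwind plug-flow advection and Ktrans-mediated exchange with a well-mixed extravascular space); the exact update form is an assumption.

    # Plausible explicit update for one capillary divided into 10 plug-flow
    # elements exchanging tracer with the extravascular space (EES) of voxel
    # eps. The upwind advection and exchange form stand in for the equations
    # not reproduced above; stability requires u*dt/dx <= 1.
    import numpy as np

    def step_capillary(c, c_in, c_e, u, dx, dt, ktrans, vp, ve):
        # c    : concentrations of the 10 elements (array)
        # c_in : concentration flowing in from the upstream element (xi - 1)
        # c_e  : well-mixed EES concentration of the enclosing voxel eps
        c_up = np.concatenate(([c_in], c[:-1]))    # upstream neighbors
        adv = (u * dt / dx) * (c_up - c)           # upwind plug-flow advection
        exch = ktrans * dt * (c - c_e)             # Ktrans-mediated exchange
        c_new = c + adv - exch / vp                # plasma compartment
        c_e_new = c_e + exch.mean() / ve           # EES compartment
        return c_new, c_e_new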
In the venous vasculature, we again assumed parabolic flow. While Eq. 32 still holds inside one vessel segment, at each meeting point we assume the concentration of the father branch is the average of the concentrations in the daughter branches weighted by their flows:
On the arterial side, the father branch is the branch in the upstream direction, while on the venous side the father branch is the branch in the downstream direction. As long as flow is conserved at each connecting point, concentration is also conserved. A concentration input, interpolated from the tracer concentration profile of the hepatic artery in a DCE MRI scan, is used as the arterial input for the simulation. Eqs. 32-36 allow us to calculate the tracer concentration in each segment independently instead of updating all segments at the same time.
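The per-segment evaluation can be illustrated as follows: with diffusion neglected, a segment acts on its inlet concentration essentially as a flow-dependent delay (a simplification of the parabolic-flow solution), and venous meeting points combine daughter concentrations by flow weighting as stated above.

    # Sketch of segment-by-segment evaluation (NumPy). Treating a segment as
    # a pure transit-time delay is a simplifying assumption; the flow-weighted
    # junction average follows the text.
    import numpy as np

    def delay_segment(c_in, t, length, u):
        # Outlet concentration = inlet concentration delayed by length / u.
        return np.interp(t - length / u, t, c_in, left=0.0)

    def venous_junction(c_daughters, flows):
        # Father-branch concentration: flow-weighted average of daughters.
        return np.average(np.asarray(c_daughters), axis=0,
                          weights=np.asarray(flows))

Starting from the interpolated arterial input and walking the arterial tree from root to terminals, and the venous tree from terminals to root, each segment then needs to be evaluated only once.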
In total, 1600 vasculatures and corresponding concentration profiles were generated: 1200 were used as training data, 200 as validation data, and 200 as testing data.
Apart from the simulation on artificial vasculature, we also validated our deep learning method using simulation on a real tumor vasculature and real DCE MRI images. For the simulation based on real tumor vasculature, arteries and veins were segmented from an optical projection tomography image of a rat glioma and digitized into one-dimensional trees comprising nodes and connections with associated radii and lengths. Limited by the image resolution, more than 70% of the vessel segments are larger than 20 μm; the network thus lacks feeding arterioles and draining veins with diameters between 10 μm and 20 μm, as well as capillaries with diameters smaller than 10 μm. Therefore, we constructed feeding arterioles, draining veins, and capillaries using the same CCO method described in the previous section. The F, Ktrans, Vp, and Ve maps were assumed to follow the mixed Gaussian distribution described above, and tracer propagation was simulated based on Eqs. 32-36. The simulated concentration profiles were downsampled to 1 mm voxel size and 5 s temporal resolution and used as input for the neural network. The reconstructed parameter maps were then compared with the ground truth.
Six brain DCE MRI images of glioblastoma patients were acquired using a T1-weighted fast low-angle shot (FLASH/VIBE) sequence. The scanning parameters were as follows: 40 dynamic acquisitions, 4.9 s per dynamic acquisition, TR=4.09 ms, TE=1.47 ms, flip angle=9°, phase=75%, bandwidth=400 Hz, thickness=4 mm, slice gap=0 mm, FOV=180×180 mm2, matrix=192×144, TA=245 s. The contrast agent Gadodiamide (Omniscan, GE Medical Systems, Amersham, Ireland) was administered intravenously during the third dynamic acquisition using a power injector system (Spectris Solaris, MedRad, Indianola, PA, USA) at 0.1 mmol/kg body weight and 2 ml/sec, immediately followed by a 25-ml saline flush at 3.5 ml/sec. As the input size of the neural network is 64×64×64, the DCE MRI images were interpolated to 1 mm voxel size, and a sliding-window method with a step size of 3 voxels was applied to cover the whole volume.
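A sketch of this sliding-window inference, assuming overlapping patch predictions are averaged, is given below.

    # Sketch of sliding-window inference over an interpolated DCE volume
    # (PyTorch). Patch size 64 and stride 3 follow the text; averaging of
    # overlapping predictions is an assumption.
    import torch

    def sliding_window_predict(model, vol, patch=64, step=3):
        # vol: tensor of shape (T, X, Y, Z), 1 mm isotropic voxels.
        T, X, Y, Z = vol.shape
        out = torch.zeros(4, X, Y, Z)     # accumulated F, Ktrans, Vp, Ve
        cnt = torch.zeros(1, X, Y, Z)     # overlap counts for averaging
        model.eval()
        with torch.no_grad():
            for i in range(0, X - patch + 1, step):
                for j in range(0, Y - patch + 1, step):
                    for k in range(0, Z - patch + 1, step):
                        p = vol[:, i:i+patch, j:j+patch, k:k+patch][None]
                        out[:, i:i+patch, j:j+patch, k:k+patch] += model(p)[0]
                        cnt[:, i:i+patch, j:j+patch, k:k+patch] += 1
        return out / cnt.clamp(min=1)     # average overlapping predictions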
Our proposed deep learning method can accurately reconstruct F, Ktrans, Vp, and Ve from 4D tracer concentration profiles. In test dataset 1, where the vasculature, parameter maps, and tracer concentrations were constructed using the same method as the training dataset, the relative root mean squared error (rRMSE) is smaller than 1% for F, Ktrans, Vp, and Ve.
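The rRMSE is not defined explicitly above; one common convention, assumed here, normalizes by the norm of the ground-truth map: rRMSE = ∥p̂−p∥2/∥p∥2×100%, where p denotes the ground-truth parameter map and p̂ the network estimate.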
Our deep learning method can be applied to DCE MRI processing to estimate perfusion parameters without requiring an arterial input function.
Certain features that are described above in the context of separate implementations can also be implemented in combination in a single implementation. Conversely, features that are described in the context of a single implementation can be implemented in multiple implementations separately or in any sub-combinations.
The order in which operations are performed as described above can be altered. In certain circumstances, multitasking and parallel processing may be advantageous. The separation of system components in the implementations described above should not be understood as requiring such separation.
A number of embodiments of the invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the disclosure. Accordingly, other embodiments are within the scope of the following claims.
While subject matter of the present disclosure has been illustrated and described in detail in the drawings and foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive. Any statement made herein characterizing the invention is also to be considered illustrative or exemplary and not restrictive as the invention is defined by the claims. It will be understood that changes and modifications may be made, by those of ordinary skill in the art, within the scope of the following claims, which may include any combination of features from different embodiments described above.
The terms used in the claims should be construed to have the broadest reasonable interpretation consistent with the foregoing description. For example, the use of the article “a” or “the” in introducing an element should not be interpreted as being exclusive of a plurality of elements. Likewise, the recitation of “or” should be interpreted as being inclusive, such that the recitation of “A or B” is not exclusive of “A and B,” unless it is clear from the context or the foregoing description that only one of A and B is intended. Further, the recitation of “at least one of A, B and C” should be interpreted as one or more of a group of elements consisting of A, B and C, and should not be interpreted as requiring at least one of each of the listed elements A, B and C, regardless of whether A, B and C are related as categories or otherwise. Moreover, the recitation of “A, B and/or C” or “at least one of A, B or C” should be interpreted as including any singular entity from the listed elements, e.g., A, any subset from the listed elements, e.g., A and B, or the entire list of elements A, B and C.
This application claims priority to U.S. Provisional Patent Application Ser. No. 63/068,228, titled “SYSTEM AND METHOD OF ACCURATE QUANTITATIVE MAPPING OF BIOPHYSICAL PARAMETERS FROM MRI DATA,” filed on Aug. 20, 2020, said application being incorporated by reference herein in its entirety.
This invention was made with government support under Grant Nos. CA181566, DK116126, NS090464, NS095562 and NS105114 awarded by the National Institutes of Health. The government has certain rights in this invention.
Filing Document | Filing Date | Country | Kind
PCT/US2021/046744 | 8/19/2021 | WO |

Number | Date | Country
63068228 | Aug 2020 | US