This invention relates to methods, devices and system related to Specific Absorption Rate (SAR) prediction in Magnetic Resonance Imaging (MRI) using neural networks.
Obtaining a robust method of managing local Specific Absorption Rate (SAR) in high field Magnetic Resonance Imaging (MRI) has been challenging. High field MRI has been shown to produce superior image quality, but is limited by radio frequency wavelength effects that result in image inhomogeneities and heating risk for the patient. The pTx pulse has been shown to solve the image inhomogeneity problem reliably, but addressing the heating risk (quantified by SAR) is more challenging because an exact tissue model of the patient for predicting SAR is not available. The present invention addresses this problem.
The approach in this invention poses the SAR prediction problem in a deep learning/convolutional neural network framework. Fundamentally, the innovation and advantage of this approach is that it exploits information from an entire population to intelligently infer SAR for a particular patient with improved accuracy compared to prior approaches, most of which have been restricted to using a single tissue model.
In one embodiment, the dataset for training of the neural network pertains of multiple tissue models with a distribution of height, weight, body mass index, and sex that is representative of the target human subject population. On each of these models, electromagnetic simulations with a numerical approximation of the physical transmit coil are performed to produce electric field maps. Using these electric field maps, three-dimensional SAR distributions are computed for each model for several random transmit channel weightings; the SAR matrix formalism is used to make this computation as efficient as possible. For each model, the input to the neural network is the tissue properties for that model has relative permittivity, electrical conductivity, and mass density maps and a particular set of channel weightings. The output is the computed SAR for that model and a set of channel weightings. The entire training set has all combinations of channel weightings and tissue models.
A convolutional neural network architecture is used with multiple filters at each layer including maxpooling layers for downsampling. The channel weightings are input into the network at an intermediate layer and all subsequent layers are fully connected. For a new incoming tissue model, the initial layers that are independent of the channel weights are evaluated. Then the output of these layers as well as the learned weights of the final layers serve as an “oracle” for the IMPULSE optimization algorithm. IMPULSE queries this oracle for the value of SAR and its gradient for several different channel weightings and uses this information to produce a pulse that has minimum predicted SAR on the new tissue model while also satisfying image homogeneity requirements.
This invention can be applied to high field MRI where parallel transmission is necessary to produce a uniform transmit magnetic field. Another application of the method is to decrease local SAR for any MRI system that possesses multi-transmit capability. Embodiments of the invention make parallel transmission clinically viable by providing much greater confidence in SAR prediction than current methods. The method also speeds up SAR prediction and pTx pulse design due to the computational efficiency of convolutional neural networks.
Unlike prior methods that use a single tissue model, which maybe represent a poor approximation of the patient for predicting SAR, the approach of this invention involves the use of an entire population of tissue models. The trained neural network therefore produces more reliable estimates of SAR. Furthermore, with the approach of this invention there is no need for time-consuming electromagnetic simulations to calculate electric fields; this advantage may allow highly accurate SAR prediction to be accomplished in near real time, in other words while the patient is in the scanner.
Embodiments of the invention could be varied, for example, instead of using tissue models as inputs to the convolutional neural network, a possible variation would be to use MR images of the patient as inputs.
In summary, the use of deep learning to predict SAR is new and unique. All prior approaches for solving the SAR-aware parallel transmit problem have relied on the use of a single model for SAR estimation, and no prior method has exploited convolutional neural networks to solve any aspect of this problem. Embodiments of this invention could dramatically improve the practicality of using parallel transmission MRI, and could therefore making this form of MRI clinically feasible by effectively and efficiently managing the risk of tissue heating in the patient. Embodiments of this invention uses deep learning to robustly predict SAR in parallel transmission and is a key step to enabling routine use of ultra-high field MRI in a clinical setting.
In one embodiment, this invention is a DeepSAR method, which predicts local SAR using a three-dimensional convolutional neural network (CNN). In this embodiment, a patient-specific local specific absorption rate (SAR) prediction method is provided. A three-dimensional convolutional neural network (CNN) is trained using pairs of SAR maps and B1+ maps for different channel weights. The CNN has an input and an output, and is then provided as a computational device to compute SAR maps. As input to the trained CNN, measured B1+ maps, simulated B1+ maps or a combination thereof are used. The trained CNN then computes and output SAR maps in a form of a generative adversarial network (GAN) to predict a three-dimensional real-valued SAR map with both real and imaginary components to be used by a high field Magnetic Resonance Imaging (MRI) application for a patient.
Further to this embodiment, the CNN further could have additional computational devices such as a generator (G) and a discriminator (D) network.
In the case of a generator (G) network, the method could input to the generator network the measured B1+ maps, the simulated B1+ maps or the combination thereof, and the method via the generator network could compute and output the SAR maps.
In case of a discriminator (D) network, the method could further input to the discriminator network the SAR maps and the measured B1+ maps, the simulated B1+ maps or the combination thereof, and the method via the discriminator network could compute and output a probability for the SAR maps.
This method of this invention focusing on local SAR prediction using the CNN, a secondary benefit of this invention could be the use of the SAR maps as an improved tool for IMPULSE or other parallel transmit pulse design. As such, the method could further compute a pTx pulse design using the SAR maps computed by the CNN to be applied in high field Magnetic Resonance Imaging (MRI).
Ultra-high field (UHF) magnetic resonance imaging (MRI) can result in improved image quality due to increased polarization of nuclear spins which leads to a higher signal to noise ratio compared to conventional methods. However, the electromagnetic wavelength is inversely proportional to the field strength and at UHF, wavelength effects cause complex interactions between the load and the electric and magnetic fields. For different patients, the spatial variations in the fields will be different which requires that the radio frequency (RF) pulse used to excite the magnetic spins be tailored for each specific patient to homogenize the fields. Both the magnetic (B1+) and electric (E) fields are relevant for an MRI scan. Inhomogeneity in the B1+ field leads to image artifacts like shading and central brightening that can negatively impact the interpretability of the image. Inhomogeneity in the E fields can interact with conductive tissue to produce localized heating which constitutes a safety risk for the patient; this heating is characterized by a quantity known as the specific absorption rate (SAR).
A promising approach for addressing these concerns is to use a parallel transmit (pTx) RF coil which has multiple independent channels that can be excited with different amplitudes and phases such that the combined field is homogeneous. Assuming full knowledge of the B1+ field and the E field, there are established algorithms for finding the RF pulses for each channel of the pTx coil to produce the desired image by eliminating the B1+ inhomogeneity while also ensuring the safety of the patient by eliminating the SAR hotspots through tailoring of the E field. An intermediate step in this process involves precomputing spatially-averaged SAR matrices from the E field which are a convenient formulation that allows finding the SAR distribution for any set of channel weights through a matrix multiplication that takes <100 ms even for up to 10 million total voxels or matrices. The remaining challenge is to find a way to estimate these fields on every subject to feed into the pulse design algorithm. Reliable techniques have been developed, such as the Bloch-Siegert method, to measure the B1+ field in vivo while the patient is in the scanner. However, since MRI is not sensitive to the E component of the field there is no established method for measuring the E fields and there is no obvious way to derive the E field from the B1+ field from first principles. The current method (
Rather than relying on time-consuming numerical simulation that relies on principles from physics and approximations in the input tissue maps, the proposed method in this invention (
The deep learning model of this invention has a 3D convolutional neural network, specifically in the form of a Generative Adversarial Network (GAN), to predict a 3D real-valued SAR map from a 3D B1+ map with both the real and imaginary components. The architecture has a generator and a discriminator. The input to the generator is the B1+ map and the output is the SAR map. The input to the discriminator is a SAR map (either the output of the generator or the ground truth SAR map) and the B1+ map and the output is a probability that the SAR map originated from the generator rather than being ground truth (Table 1). The training proceeds in an adversarial manner whereby the generator tries to produce more realistic SAR maps while the discriminator tries to better distinguish the real SAR maps from the generated ones. At convergence, the generated and realistic SAR maps will be indistinguishable to the discriminator. In effect, the discriminator serves as a more sophisticated type of loss function compared to the conventional L1 or L2 loss.
In one example, the training set has pre-simulated B1+ and E fields using a library of numerical head models. These simulations were performed using an 8-channel parallel transmit coil tuned to 298 MHz corresponding to the 7T Larmor frequency. For details, please review to U.S. Provisional Patent Application 62/756,171 filed Nov. 6, 2018, which is incorporated herein by reference, in which the library of head models are shown in Table 2 and the meshing of the simulation including the coil model is shown in
The neural network predicts combined SAR maps from combined B1+ maps so it requires some channel weighting to combine the maps together. Therefore, during training at each iteration one random simulation from the library is chosen and the corresponding single channel B1+ maps and SAR matrices are loaded (
A detailed diagram of the network architecture is shown in
The training algorithm is described below.
Following the training, the discriminator will be about 50% accurate at correctly identifying the true SAR maps and generated SAR maps. It would be desirable to have a way to estimate the confidence of the SAR prediction. To do this the discriminator is retrained from scratch trying to predict the confidence of the prediction quantified by the error in the peak local SAR, penalizing underestimation more than overestimation.
Once the network is trained, it can be used on any patient, coil, or anatomy taking only the B1+ maps as the input. The pTx pulse design requires SAR matrices, but the neural network only outputs a single SAR map. For this reason, it is necessary to evaluate the network for several different channel weights and use the result to find the SAR matrices using a least-squares fit. The detailed procedure for doing this is as follows:
Find R using weighted least squares solution to R*B=S (minimize ∥W(RB−S)∥2)
R has size Nv×Nc{circumflex over ( )}2 where each row is a different voxel and each column is each cross term between the elements of the channel weighting vector
B has size Nc{circumflex over ( )}2×N corresponding to the cross terms of the N channel weighting vectors
S has size Nv×N and is the vectorized form of the 3D SAR maps for each of the
N channel weights
W has size N×N where the nth element is w_n
Embodiments of the invention could be described as:
Embodiment of the invention could be methods steps encoded as computer-implemented steps and executed by a computer processor or chip. Embodiments of the invention could be devices such as computer encoded devices, processors or chips. Such embodiments could be part of a system including MRI scanners, equipment, and devices.
This application claims priority from U.S. Provisional Patent Application 62/756,171 filed Nov. 6, 2018, which is incorporated herein by reference.
This invention was made with Government support under contract EB025131 awarded by the National Institutes of Health. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62756171 | Nov 2018 | US |