The present invention relates generally to methods for image reconstruction in medical diagnostic imaging. More specifically, it relates to methods for assessment of the uncertainty and risk of reconstructed images in AI-based image reconstruction.
Reliable MRI is crucial for accurate interpretation in therapeutic and diagnostic tasks. However, undersampling during MRI acquisition as well as the overparameterized and non-transparent nature of deep learning (DL) leaves substantial uncertainty about the accuracy of DL reconstruction.
Artificial intelligence (AI) has introduced a paradigm shift in medical image reconstruction in the last few years, offering significant improvements in speed and image quality [1], [2], [3], [4], [5], [6]. However, the prevailing methods for image reconstruction leverage historical patient data to train deep neural networks, and then use these models on new unseen patient data. This practice raises concerns about generalization, given that predictions can be biased towards the previously-seen training data, and consequently may struggle to detect novel pathological details in the test subjects. This is deeply problematic for both patients and physicians and can potentially alter diagnoses. It is thus crucial to ensure that reconstructions are accurate for new and unseen subjects. This well motivates developing effective and automated risk assessment tools for monitoring image reconstruction algorithms as a part of the imaging pipeline.
Despite the importance of assessing risk in medical image reconstruction, little work has explored the robustness of deep learning (DL) architectures in inverse problems. Not only is the introduction of image artifacts fairly common using such models, but there do not currently exist suitable and generalized empirical methods that enable the quantification of model uncertainty [7]. While other forms of uncertainty, such as those pertaining to the training data and acquisition model, are also important to consider, model uncertainty is especially critical to understand because it explains the unique challenges that DL models can face relative to more traditional techniques for MRI reconstruction.
Medical image reconstruction methods rooted in deep learning have been widely explored. Of the various model architectures that have been used, adversarial approaches based on generative adversarial networks (GANs) are notable for high reconstruction image quality and the ability to realistically model image details. The generator function for these types of architectures is usually a U-net or ResNet [7], [10].
Variational autoencoders (VAEs) have achieved high performance in generating natural images, while also providing probabilistic interpretability of the generation process [11], [12]. However, this approach has not yet been demonstrated in the realm of medical images.
Meanwhile, a small but growing body of work has examined uncertainty in general computer vision problems. Specifically, measurements of uncertainty have been computed by finding the mean and point-wise standard deviation of test images using Monte Carlo sampling with probabilistic models [13]. With such a method, comparing the mean intensities of regions containing an artifact and the surrounding area over several cases can provide statistical insights into the types of errors made by the model [14]. Approximate Bayesian inference based approaches have also been introduced as a way of quantifying model uncertainty [15]. The downside of these methods is that they are only applicable to probabilistic models and not deterministic ones.
Other studies have explored using invertible neural networks to learn the complete posterior of system parameters [16], [17]. Through bootstrapping, point estimate uncertainty can be obtained statistically and analyzed in a manner similar to posterior sampling. Models such as convolutional neural networks (CNNs) have also been applied on MRI reconstructions to obtain global uncertainty scores, though these models can introduce their own uncertainty and may themselves suffer from challenges with robustness [18].
Uncertainty has also been analyzed from the standpoint of data rather than variance introduced by generative models in the context of medical imaging [19]. Techniques such as the bootstrap and jackknife can be used on the sampled input data to produce accurate error maps that provide insight into the most risky ROIs in terms of reconstructions without having access to the ground truth.
Stein's Unbiased Risk Estimator (SURE) has also seen some use in imaging applications. Specifically, SURE has been utilized in regularization and for image denoising, where it is explicitly minimized during the optimization step [20], [21]. However, it has not been widely used in uncertainty estimation or in the context of medical imaging.
Given that most existing approaches of quantifying uncertainty do not apply broadly or are constrained by the chosen models, developing straightforward and effective techniques is very important. Such methods have the potential to enable holistic comparison and evaluation of model architectures across a range of problems, leading to increased robustness in sensitive domains.
The invention provides method for generating pixel risk maps for generic deep image reconstruction algorithms. The risk maps can be generated in real time, and are universal across various medical imaging modalities. These reconstruction risk maps can be paired along with or overlaid on the reconstructed medical images to guide physicians' analysis and interpretation. The technique calculates a quantitative generalization risk that each pixel is different from the ground-truth pixel, without having access to the ground-truth image which is usually impossible or impractical to obtain.
The techniques of the present invention may be used to quantify the uncertainty in image recovery with DL models. To this end, we first leverage variational autoencoders (VAEs) to develop a probabilistic reconstruction scheme that maps out (low-quality) short scans with aliasing artifacts to the diagnostic-quality ones. The VAE encodes the acquisition uncertainty in a latent code and naturally offers a posterior of the image from which one can generate pixel variance maps using Monte-Carlo sampling. Accurately predicting risk requires knowledge of the bias as well, for which we leverage Stein's Unbiased Risk Estimator (SURE) as a proxy for mean-squared-error (MSE). A range of empirical experiments is performed for Knee MRI reconstruction under different training losses (adversarial and pixel-wise) and unrolled recurrent network architectures. Our key observations indicate that: 1) adversarial losses introduce more uncertainty; and 2) recurrent unrolled nets reduce the prediction uncertainty and risk.
In one aspect, the invention provides a method for generating pixel risk maps for diagnostic image reconstruction comprising: feeding into a trained encoder a short-scan image acquired from a medical imaging scan to generate latent code statistics including the mean μy and variance σy; selecting random samples based on the latent code statistics, z˜N(μy,σy2); feeding the random samples into a trained decoder to generate a pool of reconstructed images; calculating, for each pixel of the pool of reconstructed images, the pixel mean and variance statistics across the pool of reconstructed images.
The method may further comprise calculating quantitative scores for risk of each pixel. This may be performed by computing an end-to-end Jacobian of a reconstruction network that is fed with a density-compensated short-scan image, and estimating the risk of each pixel based on the Stein Unbiased Risk Estimator (SURE).
Given the pernicious effects that the presence of image artifacts can have, reliable uncertainty quantification methods could have utility both as an evaluation metric and as a way of gaining interpretability regarding risk factors for a given model and dataset [8]. In principle, a radiologist could consider a combination of uncertainty heat maps and the scan being analyzed to make better informed interpretations overall. Additionally, uncertainty information could be leveraged in real time to inform the optimal measurements to acquire during scanning, or alternatively, separate models could be applied in a targeted fashion to enhance quality in specific regions of the image. For example,
To help realize these possibilities, this work introduces procedures that can provide insights into the model robustness of DL MR reconstruction schemes. In doing so, we develop a variational autoencoder (VAE) model for MR image recovery, which is notable for its low error and probabilistic nature which is well-suited to an exploration of model uncertainty. We first generate admissible reconstructions under an array of different model conditions. We then utilize Monte-Carlo sampling methods to better understand the variations and errors in the output image distribution under various model settings [9]. Finally, because accurately assessing bias and error with the Monte Carlo approach requires access to the true (fully-sampled) image, we leverage the Stein's Unbiased Risk Estimator (SURE) analysis of the DL model, which serves as a surrogate for mean squared error (MSE) even when the ground truth is unknown.
Empirical evaluations of these techniques are performed on real-world medical MR image data, considering the single coil acquisition model. Our key observations include: 1) using an adversarial loss function introduces more pixel uncertainty than a standard pixel-wise loss, while better recovering the high-frequencies; 2) a cascaded network architecture better leverages the physical constraint of the problem and results in higher confidence reconstructions 3) SURE effectively approximates MSE and serves as a valuable tool for assessing risk when the ground truth reconstruction (i.e. fully sampled image) is unavailable.
Important contributions of this work are as follows:
Preliminaries and Problem Statement
One key application of inverse problems is to Magnetic Resonance Imaging (MRI), where significant undersampling is typically employed to accelerate acquisition, leading to challenging image recovery problems. As a result, the development of accurate and rapid MRI reconstruction methods could enable powerful applications like diagnostic-guided surgery or cost-effective pediatric scanning without anesthesia [22], [23].
In MR imaging, the goal is to recover the true image x0∈Cn from undersampled k-space measurements y∈Cm with m n that admit
y=Φx
0
+v. (1)
Here, Φ in general includes the acquisition model with the sampling mask Ω, the Fourier operator F, as well as coil sensitivities (in the multi-coil setting). The noise term v also accounts for measurement noise and unmodeled dynamics.
Given the ill-posed nature of this problem, it is necessary to incorporate prior information to obtain high-quality reconstructions. This prior information spans across a manifold of realistic images 200 (S⊂Cn) as shown in
While DL models can be effectively used for learning the projection onto the intersection 204 S∩Cy, performance can be limited when seeing new data unlike the training examples. In particular, one risk is the introduction of realistic artifacts, or so-termed “hallucinations,” which can prove costly in a domain as sensitive as medical imaging by misleading radiologists and resulting in incorrect diagnoses [24], [25]. Hence, analyzing the uncertainty and robustness of DL techniques in MR imaging is essential.
To address this challenge, the techniques of the present invention are able to learn a projection onto the intersection 204 S∩Cy between the real image manifold S and the data consistent subspace Cy using a DL model, and then evaluate the uncertainty of the model in producing these reconstructions.
Vaes for Medical Image Recovery
For image recovery, we consider a VAE architecture (h), consisting of an encoder ƒ and a decoder g with latent space z. While VAEs have been used successfully in low-level computer vision tasks like super-resolution, they have not been applied to medical image recovery. Importantly, the VAE is capable of learning a probability distribution of realistic images that facilitates the exploration process of the manifold S. By randomly sampling latent code vectors corresponding to specific images and enforcing data consistency, we can traverse the space comprising S∩Cy and evaluate the results visually and statistically.
A processing pipeline is shown in
More specifically, the VAE functions in the following manner. First, the encoder 300 takes in input xzf 304, which is a poor linear estimate of the true image. Here, xzf=ΦHy, which is the zero-filled Fourier inverse for the single coil scenario, which we focus on here. For the multi-coil scenario, xzf becomes the SENSE reconstruction taking into account coil sensitivities [26]. The encoder output is q(z Ix) (an estimate of the posterior image distribution), where z 306 is an internal latent variable with low dimension. This latent variable formulation is valuable because it enables an expressive family of distributions and is also tractable from the perspective of computing maximum likelihood. Reconstruction occurs through sampling z from q(z|x) and passing the result to the decoder 302.
To facilitate sampling, the posterior is represented by a Gaussian distribution and constrained by a prior distribution p(z), which for simplicity of computation is chosen to be the unit normal distribution.
In generating reconstructions, the VAE balances error with the ability to effectively draw latent code samples from the posterior. As such, the VAE loss function used in training is comprised of a mixture of pixel-wise l2 loss and a KL-divergence term, which measures the similarity of two distributions using the expression
KL(p∥q)=∫p(x)log[p(x)/q(x)]dx. (2)
where the integral is from −∞ to +∞. In the VAE loss function, the KL term (which has weight η) is designed to force the posterior (based on μz, σz a for a given batch) to follow the unit normal distribution of our prior p(z) [27], [28]. As η increases, the integrity of the latent code (and thereby ease of sampling after training) is preserved at the expense of reconstruction quality.
With reconstruction output {circumflex over (x)}=h(xzf), the training cost used in updating the weights of the model h is formed as
minhEx,y[∥{circumflex over (x)}−x0∥22]+ηKL(N(μz,σz)∥N(0,1)). (3)
At test time, latent code vectors are sampled from a normal distribution z˜N(0,1) to generate new reconstructions. To ensure that these reconstructions do not deviate from physical measurement, data consistency (obtained by applying an affine projection based on the undersampling mask) is applied to all network outputs, which we find essential to obtaining high signal-to-noise ratio (SNR) [7].
To deepen our analysis of robustness, we also examine the effects of a revised model architecture that is cascaded (i.e. the VAE and data consistency portions of the model repeat) for a certain number of “recurrent blocks,” as this can positively impact reconstruction quality [29]. In the case of a model with two recurrent blocks, for example, the zero-filled image is passed into the first VAE, the output of the first VAE is passed as the input of a second VAE, and the output of the second VAE serves as the overall model reconstruction. Data consistency is applied to each VAE in question, and the VAEs have shared weights which ensures that new model parameters are not introduced. Note that we define a model with one recurrent block as the baseline model that has no repetition.
Uncertainty Analysis
The advantage of using a VAE model for image reconstruction is that it naturally offers the posterior of the image which can be used to draw samples and generate variance maps. While these variance maps are useful in letting radiologists understand which pixels differ the most between the model's reconstructions, they are not sufficient since the bias (difference between reconstructions and the fully-sampled ground truth image) is unknown. Thus, methods like SURE which can assess risk without explicitly using the ground-truth are a better alternative. The following subsections introduce these ideas in more detail.
A. Monte Carlo Sampling
An effective way of analyzing the uncertainty of probabilistic models in computer vision problems is to statistically analyze output images for a given input [8]. Nonetheless, this approach has not been widely employed in inverse problems or medical image recovery.
Utilizing the probabilistic nature of the VAE model, for a given zero-filled image xzf=ΦHy (i.e. the aliased input to the model), we can use our encoder function ƒ to find the mean μ and variance a of the latent code z, which we use to draw samples. We can then use our decoder function g to produce reconstructions of latent code samples z and then aggregate the results over k samples to produce pixel-wise mean and variance maps for the reconstructions. Algorithm 1 shows the full sequence of steps in more detail.
This Monte Carlo sampling approach allows one to evaluate variance as well as higher order statistics, which can be very useful in understanding the extent and impact of model uncertainty. However, despite the information the Monte Carlo approach can provide, some important statistics such as bias are dependent on knowledge of the ground truth, which motivates the use of metrics like SURE.
Algorithm 1 Monte Carlo uncertainty map
Input y,Ω,k,ƒ,g
Step 1. Form the input xzf=ΦHy
Step 2. Generate latent code statistics (μ,σ)=ƒ(xzf)
Step 3. Initialize i=0, {circumflex over (x)}=[0]n×n, and map=[0]n×n
Step 4. While i<k {
Step 5. Compute pixel-wise mean and variance: {circumflex over (x)}:=(1/k)Σi=1,k{circumflex over (x)}i,
map:=(1/k)Σi=1,k ({circumflex over (x)}i−{circumflex over (x)})2
return ({circumflex over (x)}, map)
B. Denoising SURE
A useful statistical technique for risk assessment when the ground truth is unknown is Stein's Unbiased Risk Estimator (SURE) [30]. Despite being well-established, SURE has not yet been used for uncertainty analysis in imaging or DL problems. Given the ground truth image x0, the zero-filled image can be written as
x
zf
=x
0
+r. (4)
where r is what we refer to as the residual term. With reconstruction {circumflex over (x)} that has dimension n, test MSE can be expanded as
E[∥{circumflex over (x)}−x0∥2]=E[∥x0−xzf+xzf−{circumflex over (x)}∥] (5)
=−nσ2+E[∥xzf−{circumflex over (x)}∥2]+2Cov(xzf,{circumflex over (x)}). (6)
Since x0 is not present in this equation, we see that SURE serves as a surrogate for MSE even when the ground truth is unknown. A key assumption behind SURE is that the residual process r that relates the zero-filled image to the ground truth is normal, namely r˜N(0,σ2I). With this assumption, we can apply Stein's formula which approximates the covariance to obtain an estimate for the risk as
SURE=−nσ2+∥{circumflex over (x)}−xzf∥2+σ2tr(∂{circumflex over (x)}/∂xzf). (7)
Note that SURE is unbiased, namely,
MSE=E[SURE]. (8)
With the risk expressed in the above form, we can separate the estimate into three terms, with the second term corresponding to residual sum of squares (RSS) and the third one corresponding to degrees of freedom (DOF). This form importantly does not depend on x0, and approximates the DOF with the trace of the end-to-end network Jacobian J=∂{circumflex over (x)}/∂xzf. The Jacobian represents the network sensitivity to small input perturbations and is a measure of interest when analyzing robustness in computer vision tasks [31].
In order to estimate the residual variance, it is reasonable to assume that error in the output reconstruction is not large, and as a result σ2 can be estimated (by setting the sum of the first two terms in the SURE expression to zero) as
σ2=∥{circumflex over (x)}−xsf∥2/n. (9)
Our evaluations validate this assumption when the undersampling rate is not very large. With this assumption, we can rewrite our expression for SURE as follows
SURE=σ2tr(J). (10)
DOF approximation. Due to the high dimension of MR images, computing the end-to-end network Jacobian for SURE can be computationally intensive. From a practical standpoint, making this computation efficient allows one to evaluate uncertainty in real time, in parallel with the reconstruction. To this end, we use an approximation for the Jacobian trace [32]. In particular, given n-dimensional noise vector b drawn from N(0,1), our trained model h and a small value ε, the Jacobian trace can be written equivalently as follows
tr{J}=limε→0Eb˜N(0,1)[bT(h(xzf+εb)−h(xzf)ε−1] (11)
Because MR images are high dimensional, taking even a single sample from the above results in a reasonable estimate of the expectation, which yields the following approximation (as a rule of thumb we choose c as the maximum pixel value in the zero-filled image divided by 103) [33].
tr{J}≈b
T(h(xzf+εb)−h(xzf))ε−1. (12)
In effect, this expression measures sensitivity of the model output to small perturbations in the input. To make the approximation more accurate, we average it over several (˜100) random realizations of b.
Pixel-wise SURE. It is also worth noting that SURE offers not only a global assessment of the reconstruction risk, but can also be used to obtain localized risk maps. To do so, we can easily extend the previous discussion for pixel (i,j) admitting yi,j=xi,j+ri,j. The SURE risk per pixel then follows (10) where the Jacobian is a scalar J(i,j)=∂xi,j/∂yi,j. This version of pixel-wise SURE measures the risk of a pixel in the reconstruction, considering only the pixel at the same spatial position in the input. The primary challenge is accurately estimating σ2i,j which we do by taking the RSS map and averaging each element with the values immediately surrounding it.
Remark 1 [SURE for deterministic networks]. Note that the scope of SURE risk assessment is not limited to variational networks. Indeed, SURE can also be applied to deterministic networks as long as the residuals obey the Gaussian assumption and one can find a reasonable estimate of the residual variance.
C. Gaussian Residuals with Density Compensation
As mentioned before, the key assumption behind SURE is that the residual model is Gaussian with zero mean. However, it is not safe to assume that the undersampling noise in MRI reconstruction inherits this property. For this reason, we introduce density compensation on the input image as a way of enforcing zero-mean residuals for the single coil setting. This approach has the added benefit of making artifacts independent of the underlying image and we find that it significantly increases residual normality (see
More specifically, given a 2D sampling mask Ω, we can treat each element of the mask as a Bernouli random variable with a certain probability Di,j, where E[Ω]=D (this is dependent on the sampling approach).
With this formulation, we can define a density compensated zero-filled image as follows: {tilde over (x)}zf F−1D−1ΩFx0. We can rewrite this expression using x0 as
{tilde over (x)}
zf
=x
0+(F−1D−1ΩF−I)x0. (13)
First, we observe that r has zero mean since E[D−1Ω]=I. In addition, the residual variance obeys
σ=tr(x0H(F−1D−1F−I)x0). (14)
Of course, in practice we do not have access to the ground truth image x0, and instead rely on the approximation in (9) for the residual variance. Given these main properties, the density compensation method that this work introduces represents an important step that can allow denoising SURE to be used effectively in medical imaging and other inverse problems. Algorithm 2 summarizes the steps for using density-compensated SURE in practice. Note that for the multi-coil case, density compensation can be extended to adjust for the coil effects using an initial input reconstruction such as SENSE.
Remark 2 [SURE beyond Gaussian residuals]. The primary assumption behind the SURE derivations is that the residuals obey an independent and identically distributed (i.i.d.) Gaussian distribution. Even though the density compensation in (13) enforces this property, one can also leverage the generalized SURE formulation that extends SURE derivations to colored noise distributions from the exponential family, though this is harder to compute in practice [21], [34]. Generalized SURE can be especially useful in the case of uniform sampling, with structured residuals.
Algorithm 2 SURE risk estimate (with density compensation)
Input y,Ω,D,n,h,b,ε
Step 1. Form density-compensated input {tilde over (x)}zf F−1D−1ΩFΦHy
Step 2. Reconstruct {circumflex over (x)}=h({tilde over (x)}zf)
Step 3. Compute the residual variance σ2=∥{circumflex over (x)}−{tilde over (x)}zf∥2/n
Step 4. Approximate the DOF: tr{J}≈bT(h({tilde over (x)}zf+εb)−h({tilde over (x)}zf))ε−1
Step 5. Obtain SURE risk for {circumflex over (x)}: SURE=σ2tr{J}
return SURE
Empirical Evaluations
In this section, we assess our model and methods on a dataset of Knee MR images. We first show reconstructions produced with the VAE model, before demonstrating representative results with the Monte Carlo and SURE methods for quantifying uncertainty. Here, we focus on the single coil acquisition model for simplicity, although the techniques can also be extended to the multi-coil model.
Dataset. The Knee dataset utilized in training and testing was obtained from 19 patients with a 3T GE MR750 scanner [35]. A 3D FSE CUBE sequence with proton density weighting including fat saturation was used for acquiring fully sampled images. The following parameters were used in acquisition: FOV=160 mm, TR=1550 for sagittal and 2,000 for axial, TE=25 for sagittal and 35 for axial, slice thickness=0.6 mm for sagittal and 2.5 mm for axial. Each volume consisted of 3202D axial slices of dimension 320×256, with all slices for a given patient being placed into either the training, validation, or test sets. A 5-fold variable density undersampling mask with radial view ordering (designed to preserve low-frequency structural elements) was used to produce aliased input images xin for the model to reconstruct [36]. Both inputs and outputs of the model are complex valued, with the real and imaginary components considered as two separate channels, which enables simple processing by the network.
A. Adversarial Loss
The effective modeling of high-frequency components is essential for capturing details in medical images. Thus, we train our model with adversarial loss in a GAN setup, which can better capture high-frequency details and has been shown to improve the perceptual quality of recovered images (even though reconstruction error rises) [7], [11]. In particular, we use a multi-layer CNN as the discriminator D along with the VAE generator G. The discriminator learns to distinguish between reconstructions and fully-sampled images, and sends feedback to the generator, which in turn adjusts the VAE's model weights to produce more realistic reconstructions.
The training process for the VAE is identical to the case with no adversarial loss, except an extra loss term is needed to capture the discriminator feedback (with weight λ referred to as GAN loss)
minGEx,y[∥{circumflex over (x)}−x0∥22]+ηKL(N(μz,σz)∥N(0,1))+λEy[(1−D({circumflex over (x)}))2] (15)
The discriminator weights are updated during training as
minDEx[(1−D(x))2]+Ey[(D({circumflex over (x)}))2]. (16)
The training process acts as a game, with the generator continuously improving its reconstructions and the discriminator distinguishing them from ground truth. As the GAN loss λ increases, the modeling of realistic image components is enhanced but uncertainty rises.
B. Network Architecture
As shown in
The discriminator function of the GAN (when adversarial loss is used) is an 8-layer CNN. The first seven layers are convolutional (8, 16, 32, 64, 64, 64, and 1 feature maps, respectively) with batch normalization and ReLU activations in all but the last layer. The first five layers use kernel size 3×3, while the following two use kernel size 1×1. The first four layers use a stride of 2, while the next three use a stride of 1. The eighth and final layer averages the output of the seventh layer to form the discriminator output.
The use of multiple recurrent blocks (RB) whereby the model repeats (the output feeds into the input of another VAE with the same model parameters) is also explored [29]. Using multiple RBs does not affect the discriminator network architecture, and the number of RBs can be optimized to maximize model performance as described in [29].
Training was completed over the course of 30,000 iterations, with loss converging over roughly 20,000 iterations. We utilize the Adam optimizer with a mini-batch size of 4, an initial learning rate of 5×10−5 that was halved every 5000 iterations, and a momentum parameter of 0.9. Models and experiments were developed using TensorFlow on an NVIDIA Titan X Pascal GPU with 12 GB RAM.
C. Individual Reconstructions
D. Variance, Bias, and Error Maps
Using the Monte Carlo method described earlier in Algorithm 1, 1K outputs corresponding to different reference slices were generated after feeding a test image into the trained model and sampling from the resulting posterior distribution. Note that for this process, only the VAE (generator portion) of the model is relevant to producing outputs, even with adversarial loss used for training.
We show the mean of the 1K reconstructed outputs for a representative slice, the pixel-wise variance, squared bias (difference between mean reconstruction and ground truth), and squared error in
The results indicate that variance, bias, and error increase as the GAN loss weight λ increases (
E. Residual Distribution with Density Compensation
As described earlier, denoising SURE builds on the Gaussian assumption for the residuals r=xzf−x0. To validate this assumption, we produce histograms and Q-Q plots of the residuals at various undersampling rates, by considering the differences for individual pixels across test images. From
To overcome the lack of normality in the residuals, we apply our density compensation method. For a given undersampling rate, 100 variable-density random sampling masks are generated and then averaged to obtain the sampling density D. The element-wise inverse of this density can then be multiplied with any given mask to produce a density-adjusted mask D−1Ω. This adjusted mask can be used to generate new zero-filled images as input to the network.
As
F. SURE Results
To evaluate the effectiveness of the density-compensated SURE approach, we produce correlations of SURE versus MSE (which depends on the ground truth and is a standard metric for assessing model error) using the results from our test images.
In addition, the average reconstruction SURE, RSS, and DOF values for different hyperparameters are shown in Table I and Table II. Increased GAN loss results in decreased SURE values (note the units of dB) and increased RSS and DOF. Meanwhile, more RBs result in higher SURE values and lower RSS and DOF, demonstrating a simple way of improving reconstruction quality while reducing uncertainty. Note that these results align closely with the Monte Carlo analysis from before, thereby reinforcing the effectiveness of the SURE approach in quantifying risk.
Furthermore, the uncertainty analysis of the sample reconstructions in
We have disclosed methods to analyze uncertainty in deep-learning based compressive MR image recovery, which can ultimately serve to improve the medical imaging workflow in both better diagnosis and acquisition. To thoroughly explore realistic and data-consistent images, we develop a probabilistic VAE model with low error. A Monte-Carlo approach is used to quantify the pixel variance and obtain uncertainty maps in parallel with reconstruction. Moreover, to fully assess the reconstruction risk (which requires the bias and thus fully sampled data), we leverage Stein's Unbiased Risk Estimator on density-compensated zero-filled images where the residuals obey a zero-mean Gaussian distribution. We validate the utility of these tools with evaluations on a dataset of Knee MR images.
In particular, our observations demonstrate that increased adversarial loss leads to larger uncertainty, indicating a trade-off when using adversarial loss to better retrieve high frequencies. On the other hand, using multiple recurrent blocks (cascaded VAE and data consistency layers), decreases uncertainty, which suggests an effective way of promoting robustness.
While we focus on model uncertainty in this work, there are other sources of uncertainty (pertaining to data and knowledge) that can also be taken into account. Evaluations with different MRI datasets and acquisition strategies may be used to assess the effects of data uncertainty on model reconstructions. Additionally, uncertainty analysis may be useful for pathological cases. Quantifying the likelihood of a diagnosis being altered by hallucinated artifacts, and finding regularization schemes to limit this, will improve the effectiveness of DL methods for MRI reconstruction.
Finally, extending pixel-wise SURE to include more accurate estimators for variance can further improve performance. The corresponding spatial risk maps can then be used alongside the Monte-Carlo results and the global SURE score to improve the medical imaging workflow in key areas such as diagnosis or automated image quality assessment.
This application claims priority from U.S. Provisional Patent Application 62/987,597 filed Mar. 10, 2020, which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
62987597 | Mar 2020 | US |