The invention is generally related to computer analysis of medical image data, and in particular to the analysis of cardiovascular medical images.
Tagged Cardiac Magnetic Resonance imaging (CMR) is a technique for detailed and non-invasive visualization of myocardium motion and deformation, with full 3D spatial geometric concordance. Local diseases, such as coronary atherosclerosis, as well as global diseases, such as heart failure and diabetes, result in wall dysfunction that manifests on tagged images. Magnetic Resonance (MR) tagging places a pre-specified pattern of virtual temporary markers (called tags) inside soft body tissues such that tag lines are created by patterns of magnetic spin in the examined tissue, and motion in the tagged tissue can be measured from the images. This technique complements traditional anatomical images and can capture detailed information about the heart over time. The tag lines allow for the computation of displacement, velocity, rotation, elongation, strain, and twist of the heart. While traditional MR techniques carry only information about the motion at the boundaries of an object, the tag lines facilitate examination of the strain and displacement of the tissue interior in close detail.
Methods for analyzing tagged MR images generally fall into the two broad spatial and spectral (frequency domain) categories. Spatial analysis estimates tissue motion and strain by identifying spatial locations of the tag lines in an image and using either model-based or model-free interpolation and differentiation. Because spatial methods track actual pixels throughout the image, these methods generally require substantial image preprocessing and segmentation, and are often computationally expensive.
Spectral analysis analyzes the harmonic phases of material points, where such material points generally do not change with motion. Because the material points generally do not change with motion, spectral analysis may build a tissue motion field. Spectral analysis methods generally perform faster as compared to spatial analysis methods, and in many cases, following a points' harmonic movement provides a more accurate tracking method as compared to spatial analysis methods.
One conventional spectral analysis method utilized in analyzing CMR images utilizes a Harmonic Phase (HARP) method. The HARP method typically computes phase images from sinusoidal tagged MR images by bandpass filtering in the Fourier domain. The MR images have both a spatial and a spectral representation (by Fourier transform). Furthermore, sharp lines in the spatial domain generally relate to noisy peaks in the spectral domain, and conversely smooth lines in the spatial domain relate to sharp peaks in the spectral domain. Therefore, spatial noise and corruption can notably affect spectral image analysis.
In general, spectral based tracking methods assume that points found in a tissue do not move a significant distance between two successive time frames. However, if the tissue in the area being tracked has a low temporal resolution or a high rate of movement, or if the MR tag parameters are selected incorrectly, corruption and noise may be introduced in the acquired images. The introduction of corruption may cause the points to be difficult to track, and the assumption that the points found in a tissue do not move much from one time frame to the next may appear to be violated. The corruption and noise may lead to spectral analysis methods failing to accurately track the tissue. In general, the primary causes of corruption and spectral based tracking failures may be classified into three types of causes: a high rate of motion between two successive frames, a through-plane motion, and boundary point tracking.
Errors due to a high rate of motion between two successive frames can be corrected by either improving the temporal resolution or decreasing the spatial frequency, or periodicity, of the tags. However, an increased temporal resolution generally reduces the overall image spatial resolution, and furthermore, decreasing the tag frequency may reduce the accuracy of spectral phase estimation. Through-plane motion errors typically appear because a material point is initially defined for the first time frame and is subsequently tracked through the remaining time series images. Points may disappear and reappear along the series of images, and this can make spectral tracking algorithms assume incorrect locations of points. In addition, boundary points being tracked near the edge may be shifted out of the tissue in subsequent frames due to noise or corruption. Spectral tracking failures that result from noise and data corruption, may require a user to manually identify and correct mistracked points. In some cases manual adjustments may be required for multiple data points in every image in a sequence. Such manual identification may be very time-consuming, especially in studies or clinical examinations when large numbers of data points must be tracked.
Therefore, a need continues to exist in the art for improved image processing systems and methods for use in analyzing magnetic resonance medical images.
The invention addresses these and other problems associated with the prior art by providing a computer aided processing system and automated method for processing tagged magnetic resonance (MR) images to generate improved tagged images in the spectral domain. Embodiments of the invention may process MR images to generate improved tagged images including reduced noise across one or more tag lines, augmented gradients across a tag profile, and/or amplified tag-to-background contrast. Embodiments of the invention may process Cardiac Magnetic Resonance (CMR) images modeling a distribution of CMR signals for the images corresponding to a cardiac cycle with an adaptive linear combination of discrete Gaussians (LCDG), where such modeled signals may be separated into signals corresponding to tag lines and signals corresponding to background. In addition, the images may be modeled by analyzing the images using a translation/rotation-invariant three-dimensional (3D) Markov-Gibbs random field model, where the images may be modeled by considering the CMR signals as voxel signals with multiple pairwise spatiotemporal (in-plane space and time) interactions. The gray-level signal of voxel-wise pairs may be analyzed to determine Gibbs potentials for interacting voxel pairs in the image set. A three-dimensional energy function may be determined based at least in part on the interaction of the voxel pairs and the determined Gibbs potentials. The energy function may be utilized to amplify homogeneity of and the contrast between signals corresponding to tags and signals corresponding to background. Images processed using embodiments of the invention may provide more accurate spectral images for spectral analysis utilizing one or more spectral analysis methods.
These and other advantages and features, which characterize the invention, are set forth in the claims annexed hereto and forming a further part hereof. However, for a better understanding of the invention, and of the advantages and objectives attained through its use, reference should be made to the Drawings, and to the accompanying descriptive matter, in which there is described exemplary embodiments of the invention.
Embodiments consistent with the invention provide for automated processing of tagged cardiovascular magnetic resonance (CMR) images to generate improved tagged CMR images. As compared to the unprocessed CMR images, the processed CMR images may include reduced noise across one or more tag lines, augmented gradients across a tag profile (i.e., a plurality of tag lines for a set of CMR images), and amplified tag-to background contrast for the spectral domain. The CMR images may include a plurality of virtual tags inside soft tissues of the images, and tag lines may be based on magnetic spin in the tagged tissues, such that motion in the tagged tissue may be measured from tagged images. The image noise may be reduced and the tag-to-background contrast may be improved by using a three dimensional energy analysis method based on probabilistic first- and second-order prior models of the visual appearance of tagged CMR images. The method may account for multiple pairwise spatiotemporal (in-plane space and time) signal interactions, and accurate voxel-wise classification based on the marginal probability distributions of the tag and background signals. Further details regarding the techniques described herein are provided in M. Nitzken, G. Beache, A. Elnakib, F. Khalifa, G. Gimel'farb, and A. El-Baz, “Improving Full-Cardiac Cycle Strain Estimation from Tagged CMR by Accurate Modeling of 3D Image Appearance Characteristics,” IEEE Int. Symposium on Biomedical Imaging (ISBI 2012), Barcelona, Spain, May 2-5, 2012, which is incorporated by reference in its entirety.
Some embodiments of the invention, for example, may perform post-processing of tagged CMR images in the spatial domain to reduce noise across the tag lines, which unsharpens the tag edges, and to amplify the tag-to-background contrast, leading to a representation that affords more efficient computations in the spectral domain. A 3D energy minimization framework may be used for each tagged CMR image, based on learning first- and second- order visual appearance models. The first-order appearance modeling may use adaptive Linear Combinations of Discrete Gaussians (LCDG) to optimally approximate the empirical marginal probability distribution of image signals, for both the tag and background submodels, and to classify the tag lines and background. The second-order modeling may consider the image as a translation- and rotation- invariant 3D Markov-Gibbs Random Field (MGRF) with multiple pairwise voxel interactions. A 3D energy function for this model may be derived by using the analytical estimation of the spatiotemporal geometry, and Gibbs potentials of interaction. To improve computations, via augmenting both the tag and the background homogeneity and contrast, the given image may be adjusted using comparisons to the minimization function.
In some embodiments of the invention, parameters associated with the tagged CMR images may be estimated based on appearance of the parameters, including estimating the parameters based on gray level intensities. Embodiments of the invention may estimate tag lines in a CMR image by generating a three dimensional (3D) model of the CMR images. The CMR images may be two dimensional (2D) comprising two spatial dimensions (e.g., ‘x’ and ‘y’), and the 3D model may map the 2D CMR images on a temporal dimension or a time axis (e.g., ‘t’), such that the 3D model comprises spatiotemporal three dimensional voxels with (x, y, t) coordinates. Signals included in the CMR images may correspond to tags or background, where the signals may be represented in the 2D images and the 3D model by gray level intensities.
In some embodiments of the invention, the tagged CMR images may be processed by determining a linear combination of discrete Gaussians (LCDG) that estimate the signals of the 3D model. The discrete Gaussians may be separated into two submodels: a submodel corresponding to the signals corresponding to the tags and a submodel corresponding to the signals corresponding to the background. Further details regarding the use of Expectation-Maximization (EM) techniques to separate LCDGs into submodels are provided in A. El-Baz and G. Gimel'farb, “EM based approximation of empirical distributions with linear combinations of discrete Gaussians,” in Proc. IEEE Int. Conf. on Image Processing (ICIP 2007), vol. 4, Oct. 16-19 2007, pp. 373-376.
In some embodiments of the invention, the tagged CMR images may be processed by modeling the signals of the tagged CMR images with a second-order probabilistic model of characteristic voxel-wise and pairwise voxel signal dependencies with a generic translation and rotation invariant central symmetric Markov-Gibbs random field (MGRF) model. Pairwise voxel potentials of the MGRF model may be based at least in part on signal differences. Such signal differences may be determined based on gray level intensities for voxels and the difference in intensity levels between one or more neighboring voxels. A Gibbs energy function of estimated signals of the CMR images may be generated based at least in part on the voxel potentials and pairwise voxel potential differences of the MGRF model.
Each estimated signal from the MGRF may be classified as a tag line or background signal based at least in part on the submodels of the LCDG model. Voxels and the corresponding pixels of the 2D CMR images may be adjusted by a small bias based on a threshold determined from the LCDG submodels to thereby generate processed tagged CMR images.
For example, in one illustrative embodiment, a set of images, DICOM or other format, is read into memory. The images are then stacked on top of one another to create a cube of the images, where the X- and Y-axes represent the images and the Z-axis represents time. Next, first-order modeling is used to approximate the empirical marginal probability distribution of the CMR signals within a cardiac cycle data set. To do this, an adaptive linear combination of discrete Gaussians (LCDG), with positive and negative components, may be used to separate and individually classify the tag lines and the background.
Once the classifier curves are constructed in this illustrative embodiment, a weighted sphere is constructed such that certain values within the sphere are given priority over others. Next, second-order modeling considers the images as samples of a translation/rotation-invariant 3D Markov-Gibbs random field (MGRF) of voxel signals with multiple pairwise spatiotemporal (in-plane space and time) interactions. The weighted sphere begins in the first image such that its center plane is positioned on the first image and it stretches horizontally in the current image as well as stretches into the future and past images (those below and above it, respectively). The points within the sphere are then extracted for analysis. The value of each extracted point in the sphere is adjusted by the weight of that location in the sphere, such that past information is more important than future information. Using an equation based on the MGRF model the minimum energy pixel may be found within the weighted sphere. If this pixel belongs to a tag line, by comparing it with the LCDG model, then the value is reduced by a delta. If it belongs to the non-tag class then the value is incremented by a delta.
The entire matrix of image information may then be processed image by image and in a fashion such that the algorithm progresses linearly from the image most in the past to the image most in the future. The results may be stored to a second matrix. After completion, the matrix may then be sub-divided back into the original images, which may be written to a separate “processed” folder using the format that the original images were recorded in, e.g., DICOM, with the finalized images possessing an improved spectral representation to the original images, and with the noise in the spectral domain significantly reduced.
Now turning to the Drawings, wherein like numbers denote like parts throughout the several views,
The process 10 determines Gibbs potentials of voxels of the 3D model utilizing a MGRF model (block 20), and the process determines a local minimum Gibbs energy function for the MGRF model (block 22). Based on the threshold determined in block 18, each estimated signal value of the Gibbs energy function is classified as corresponding to a tag line or background, and a bias is applied to the estimated signal value of each voxel based on the classification of the estimated signal value (block 24), and the 3D model is deconstructed into the corresponding 2D CMR images (26), The signals of each pixel of the 2D CMR images have been adjusted based on the bias applied to the corresponding voxel to thereby generate processed tagged CMR images 13, 14.
One or more steps in process 10 may be implemented in an automated fashion, utilizing a computer or other electronic device to implement such steps.
Computer 30 typically includes a central processing unit 38 including at least one microprocessor coupled to a memory 40, which may represent the random access memory (RAM) devices comprising the main storage of computer 30, as well as any supplemental levels of memory, e.g., cache memories, non-volatile or backup memories (e.g., programmable or flash memories), read-only memories, etc. In addition, memory 40 may be considered to include memory storage physically located elsewhere in computer 30, e.g., any cache memory in a processor in CPU 38, as well as any storage capacity used as a virtual memory, e.g., as stored on a mass storage device 42 or on another computer coupled to computer 30. Computer 30 also typically receives a number of inputs and outputs for communicating information externally. For interface with a user or operator, computer 30 typically includes a user interface 44 incorporating one or more user input devices (e.g., a keyboard, a mouse, a trackball, a joystick, a touchpad, and/or a microphone, among others) and a display (e.g., a CRT monitor, an LCD display panel, and/or a speaker, among others). Otherwise, user input may be received via another computer or terminal.
For additional storage, computer 30 may also include one or more mass storage devices 42, e.g., a floppy or other removable disk drive, a hard disk drive, a direct access storage device (DASD), an optical drive (e.g., a CD drive, a DVD drive, etc.), and/or a tape drive, among others. Furthermore, computer 30 may include an interface 46 with one or more networks 32 (e.g., a LAN, a WAN, a wireless network, and/or the Internet, among others) to permit the communication of information with other computers and electronic devices. It should be appreciated that computer 30 typically includes suitable analog and/or digital interfaces between CPU 36 and each of components 40, 42, 44 and 46 as is well known in the art. Other hardware environments are contemplated within the context of the invention.
Computer 30 operates under the control of an operating system 48 and executes or otherwise relies upon various computer software applications, components, programs, objects, modules, data structures, etc., as will be described in greater detail below. Moreover, various applications, components, programs, objects, modules, etc. may also execute on one or more processors in another computer coupled to computer 30 via network 32, e.g., in a distributed or client-server computing environment, whereby the processing required to implement the functions of a computer program may be allocated to multiple computers over a network.
As an example, computer 30 may include a computer aided diagnostic (CAD) system program 50 used to implement one or more of the steps described above in connection with process 10. For the purposes of implementing such steps, an image database 52, storing CMR scan images, may be implemented in computer 30. It will be appreciated, however, that some steps in process 10 may be performed manually and with or without the use of computer 30.
In general, the routines executed to implement the embodiments of the invention, whether implemented as part of an operating system or a specific application, component, program, object, module or sequence of instructions, or even a subset thereof, will be referred to herein as “computer program code,” or simply “program code.” Program code typically comprises one or more instructions that are resident at various times in various memory and storage devices in a computer, and that, when read and executed by one or more processors in a computer, cause that computer to perform the steps necessary to execute steps or elements embodying the various aspects of the invention. Moreover, while the invention has and hereinafter will be described in the context of fully functioning computers and computer systems, those skilled in the art will appreciate that the various embodiments of the invention are capable of being distributed as a program product in a variety of forms, and that the invention applies equally regardless of the particular type of computer readable media used to actually carry out the distribution. Examples of computer readable storage media include but are not limited to physical, tangible storage media such as volatile and non-volatile memory devices, floppy and other removable disks, hard disk drives, magnetic tape, optical disks (e.g., CD-ROMs, DVDs, etc.), among others.
In addition, various program code described herein may be identified based upon the application within which it is implemented in a specific embodiment of the invention. However, it should be appreciated that any particular program nomenclature that follows is used merely for convenience, and thus the invention should not be limited to use solely in any specific application identified and/or implied by such nomenclature. Furthermore, given the typically endless number of manners in which computer programs may be organized into routines, procedures, methods, modules, objects, and the like, as well as the various manners in which program functionality may be allocated among various software layers that are resident within a typical computer (e.g., operating systems, libraries, API's, applications, applets, etc.), it should be appreciated that the invention is not limited to the specific organization and allocation of program functionality described herein.
Embodiments of the invention may generate a spatiotemporal model consistent with embodiments of the invention. Based on the 2D CMR images, embodiments of the invention may generate a 3D model having a coordinate set r=(x, y, t) and a lattice R=[(x,y,t):x=0, . . . , X−1; y=0, . . . , Y−1; t=0, . . . , T−1] may denote a voxel with two spatial (x, y) and one time (t) coordinates of a spatiotemporal lattice of the size R=XYT. A finite set of signals for the 2D CMR images (i.e., gray level values) may be defined as Q={0, . . . , Q−1}. The lattice, R, supports 3D CMR images g=[g(r):r ∈ R; g(r) ∈ Q] comprising the 2D CMR images taken in successive time instants (“time slices”). The 3D model of the CMR images may be analyzed by describing the visual appearance of the images with a first-order model of marginal probability distributions of tag and background signals and a second-order probabilistic model of characteristic voxel-wise and pairwise voxel signal dependencies.
A translation- and rotation-invariant generic second-order MGRF of images g is specified by a certain number, N, of characteristic central-symmetric voxel neighborhoods nv; v=1, . . . , N on R.
Each voxel neighborhood nv may indicate a family of voxel pairs, Cv={cν=(r, r′): r′−r ∈ nν; r,r′∈ R}, such that their intervoxel distances (norms of the coordinate offsets o=r′−r) belong to an indexed semi-open interval [dv:min, dv:max):
dν:min≦{square root over ((x−x′)2+(y−y′)2+(t−t′)2)}{square root over ((x−x′)2+(y−y′)2+(t−t′)2)}{square root over ((x−x′)2+(y−y′)2+(t−t′)2)} <dν:max (1)
with fixed thresholds dv:min and dv:max. Such neighboring pairs may be considered second-order cliques of the neighborhood graph with nodes in the voxels.
Cliques from the family Cv support the same real-valued Gibbs potential Vv(g(r), g(r′)) of pairwise voxel interaction. In some embodiments, uniform brightness changes may be uniformly accounted for by determining a Gibbs potential based at least in part on the absolute intra-clique signal difference: Δ=|g(r)−g(r′|∈ D={0, 1, . . . , Q−1}. The potential values may be represented as one or more column vectors Vν=[Vν(Δ): Δ∈ D]. Characteristic cliques may be stratified into N families, {Cv: v=1, . . . , N} and the potentials Vv and embedded non-intersecting distance intervals : d1:min<d1:max≦d2:min< . . . ≦dN:min<dN:max.
As discussed in G. Gimel'farb, Image Textures and Gibbs Random Fields, Kluwer Academic, 1999, an MGRF consistent with some embodiments of the invention may include a Gibbs probability distribution:
where T indicates the transposition, Zv is the normalizing factor (the partition function) depending on the first and second order potentials V=[Vvox; Vvv=1, . . . , N] for the neighborhoods {nv: v=1, . . . , N},
and is the relative size of the clique family with respect to the lattice cardinality |R|=XYT, i.e., the relative number of cliques in the family Cv. A column vector F(g)=[Fvox(g); ρνFν(g): ν=1, . . . N] includes relative empirical frequencies ƒvox(q|g) of signals q ∈ Q in the voxels and frequencies ƒν(Δ|g) of absolute signal differences Δ∈ D in the cliques from the family Cv for the image g, respectively:
where the sublattice Rq(g) includes all the voxels r, such that g(r)=q, and the subfamily Cν:Δ(g) includes all the pairwise cliques cv=(r, r′) of the family such that |g(r)−r′)|=Δ. First approximations of the maximum likelihood estimates of the potentials may be considered:
V
vox(q)=λ(ƒ(q|g)−ƒirf:vox(q)); q ∈Q;
V
ν(Δ)=λ(ƒ
ν(Δ|g)−ƒirf:dif(Δ)); Δ∈ D; ν=1, . . . , N (3)
where the common scaling factor λ, is also computed analytically, and
may denote the probability of the voxel signal q and the intervoxel signal difference Δ, respectively, for the independent random field of equiprobable signals:
The factor λ may be omitted (i.e., set to λ=1) if only relative interaction energies Erel:v are computed for the clique families in order to select the most characteristic ones.
Turning now to
Analytical first approximation of the maximum likelihood estimates for equation (3) for MGRF potentials of a first and second order, V=(Vvox; V1, . . . , VN), in equation (2), may be based on analysis of one or more training images. The estimates may maximize in the space of potentials V the likelihood P(g°) of a given training image g°, or its specific log-likelihood:
where F(g°)=└Fvox(g°);ρνVνTFν(g°):ν=1, . . . , N┘ is the vector is the vector of empirical marginal probabilities of voxel signals and scaled empirical marginal probabilities of inter-voxel signal differences (see Second Order Model Above). The factor Zv normalizes the probability distribution over the entire set QR of the discrete spatiotemporal images sequences g:
where γv denote the first vectorial derivative, i.e., the gradient
of the log-likelihood at the point V of the potential space. As such,
as the gradient
of the factor Zv in the potential space is proportional to the mathematical expectation E{F(g)|V}=Σg∈g F(g)P(g) of the empirical marginals F(g) for the MGRF model of image sequences g. The gradient of equation (6) is considered equal to zero for the MLE of potentials, such that the marginal probabilities of signals and their differences for the estimated MGRF are equal to the relevant training marginals. Furthermore, the function L(V|g° is concave in the potential space and has a unique maximum because its second vectorial derivative (the Hessian matrix)
is equal to the negated covariance matrix for the marginals of the MGRF and thus is non-positive definite.
Both the factor Zv and its vectorial derivatives in the potential space are generally intractable, except of special cases when the corresponding marginals are computable, like e.g. an independent random field (IRF) with statistically independent voxel-wise components. In particular, the origin V=0 of the potential space corresponds to the IRF of equiprobable independent voxel signals. In such case, Zo=|g|=QR, L=(0|g°)=log Q, and the corresponding marginal probabilities of the voxel signals and inter-voxel signal differences to form the vector of the scaled expected marginals Firf=[Firf;vox;ρ84 Firf:dif: ν=1, . . . , N], and the uniformly distributed signals,firf:dif(q)=1/Q, and the triangularly distributed differences,
of the independent equiprobable signals, respectively.
The analytical approximation of the MLE in Eq. (3) is obtained by maximizing the truncated Taylor's series decomposition of the log-likelihood in the vicinity of the origin V=0 along the gradient direction V=λ|γo=λ(F(g°)−Firf):
The maximizer
follows from the condition
As discussed in the aforementioned publication Image Textures and Gibbs Random Fields, statistical dependencies between the pairwise signal differences in image sequences are weak and thus may be ignored, this maximizer is also computed analytically by approximating the Hessian matrix with the diagonal matrix of scaled variances of the marginal probabilities.
The CMR images of the 3D model may be described using a linear combination of discrete Gaussians as described in the aforementioned article “EM based approximation of empirical distributions with linear combinations of discrete Gaussians.” A discrete Gaussian (DG) Ψθ=(ψ(q|θ): q ∈ Q is defined as a discrete probability distribution with Q components obtained by integrating a continuous one dimensional (1D) Gaussian Density
with parameters θ=(μ, σ), where μ is the mean and σ2 is the variance over Q intervals related to the successive integer signal values in Q: if Φθ(q)=∫−∞qφ(z|θ)dz is the cumulative Gaussian probability function, then ψ(0|θ)=Φθ(0.5), ψ(q|θ)=Φθ(q+0.5)−Φθ(q−0.5) for q=1, . . . , Q−2, and ψ(Q−1|θ)−1−Φθ(Q−1.5)
The tag-to-background contrast may be improved by approximating the empirical marginal 1D signal distribution for the CMR images of the 3D model with a linear combination of discrete Gaussians (LCDG) Pw,Θ=[pw, Θ(q):q ∈ Q]; Σq∈ Qpw, Θ(q)=1, including two positive dominant and multiple sign-alternate subordinate DGs:
Kp; Kp≧2, and Kn; Kn≧0, may be total numbers of the positive and negative DGs, and w=[wp:k, wn:l] may be non-negative weights that meet the constraint Σk=1K
The first order LCDG model may be built and separated into the two LCDG submodels of the tag lines and their background. The building and separating the LCDG model into LCDG submodels may be performed using the Expectation- Maximization techniques provided in the aforementioned article, “EM based approximation of empirical distributions with linear combinations of discrete Gaussians.” The number of dominant DGs (K=2), the numbers Kp−K and Kn of the positive and negative subordinate DGs, as well as the weights and parameters (the means and variances) of all the DGs are initialized first with a sequential EM-based algorithm in order to produce a close initial LCDG approximation of the empirical distribution. Based on the fixed numbers of the components, Kp and Kn, all the weights and parameters are refined with a conventional EM algorithm, where the conventional EM algorithm may account for the sign-alternate components. The refined LCDG model may partitioned into the two LCDG submodels. The refined LCDG model may be separated into the two LCDG submodels Pvox:a|=[Pvox:a(q):q ∈ Q], one per class α ∈ {tag, background}, by associating the subordinate DGs with the dominant ones in such a way as to minimize the tag/background misclassification rate.
Turning to
The tagged CMR images g of the 3D model may be adjusted by utilizing a voxel-wise Iterative Conditional Mode (ICM) relaxation algorithm to search for a local minimum of the Gibbs energy function for the second order MGRF appearance model:
where the probability vectors Fvox(g) and Fv(g) are collected over the generated tagged CMR images. To improve the tag-background contrast, each estimated signal value, ĝ(r); r ∈ R, is classified as belonging to either the tag line or the background by using the first-order LCDG modeling. The voxels are nudged towards their proper grouping by incrementing or decrementing their signals by a small value δ in accord with the discriminant threshold.
Three metrics: (i) the reduction in the power scatter, defined as the ratio of the power of the spectral main lobe to the power of the spectral side lobes, (ii) the ability to restore noise corrupted strain slopes for synthetic phantoms, and (iii) the homogeneity of the strains in the image as indexed using the variance, were used to analyze and validate the effectiveness of the process 10 by testing on both synthetic phantoms and in-vivo data sets, analyzing the strain with the HARP technique, and quantifying the performance.
Relative power scatter between the main and side spectral lobes: for accurate and efficient spectral domain computations, the HARP technique requires that the information resides predominately within the central peak and the first (main) side lobe as described in N. F. Osman and J. L. Prince, “Visualizing myocardial function using HARP MRI,” Physics in Medicine and Biology, vol. 45, pp. 1665-1682, June 2000. When an image is noisy and has sharp edges in the spatial domain, a significant part of the total information resides in the outer side lobes in the spectral domain, so that HARP encounters difficulties in tracking phases of points. Therefore, minimizing the amount of information in the outer side lobes is important. Further details regarding the minimization of the amount of information in the outer side lobes are described in K. Hansen, F. Gran et al., “In-vivo validation of fast spectral velocity estimation techniques,” Ultrasonics, vol. 50, no. 1, pp. 52-59,2010 and T. Jiang and Y. Wu, “An overview: Peak-to-average power ratio reduction techniques for OFDM signals,” IEEE Transactions on Broadcasting, vol. 54, no. 2, pp. 257-268, June 2008.
Strain slope recovery: In cardiac strain analysis, the slope during the contraction phase of the cardiac cycle is a very important indicator of strain function, i.e. a cardiac systolic performance index, namely, the rate of peak contraction. Therefore, the accuracy of the slope recovery is a useful indicator to measure the effect of noise on the calculated strain for a tagged MR restoration algorithm. Additionally, the ability to derive accurate strain slope analysis indicates the HARP reliability in tracking tag intersection points through the cardiac cycle. Let Sobs denote the slope of the observed data set to be analyzed (e.g. of the noisy or processed observations) and let Sini be the ground truth slope, being known for the initial data set (phantom). The relative strain error is defined as
i.e. the absolute value of the relative slope change in the noisy data with respect to the ground truth was calculated for the synthetic phantom images by using the HARP technique for the Euler strain measurements.
Variance-indexed strain homogeneity: Restoring local strain homogeneity in tagged MR images is an additional important ability of the proposed approach. This ability follows directly from the reduced spectral noise and yields more accurate estimates of quantitative strain parameters for spectral-domain techniques. Bio- mechanical models of the heart tissue as a continuous medium suggest that neighboring voxels in the heart should have similar strains, so that strain does not randomly occur in an individual voxel as discussed in K. B. Chandran, H. S. Udaykumar et al., Image-Based Computational Modeling of the Human Circulatory and Pulmonary Systems: Methods and Applications. Springer, 2010. Thus, the variance in, or homogeneity, of strain in a realistic tagged MR image should be low. This should hold true in any small region, even in the presence of injury, where transition zones may occur. Therefore, to analyze the strain homogeneity in the images before and after processing using the disclosed method, the variance index for each pixel of color strain maps was extracted using the HARP processing software and is calculated as follows. The color strain map is scaled to obtain the corresponding numerical strain map over the entire image, and the variance for each strain location over the entire image is calculated within a moving 7×7 window. The stored matrix of these variances is referred to as the variance map that allows for determining the effect of noise on the strain parametric image, or a type of roughness index. The lower the strain variance, the more accurate the strain analysis. Conversely, the high strain variance indicates that the image is noisy, and that HARP would have difficulty in tracking points from one temporal frame to another throughout the cardiac cycle.
Phantoms were synthesized using a descriptive mathematical model that accounts for physical features of the left ventricle (LV) and physiological LV responses as the heart progresses through the cardiac cycle. The model uses a geometric transformation that covers the LV shearing, rotation, translation, torsion, and compression to describe the LV motion by mapping each location (i.e. a material point) defined in the model to a corresponding spatial point at a certain time instant during the cardiac cycle. Using this transformation, an inverse motion map is calculated analytically and is used to establish correspondences between two points at any two time instants. This allows for simulating tagged MR images as discussed in T. Arts, W. C. Hunter et al., “Description of the deformation of the left ventricle by a kinematic model,” Journal of Biomechanics, vol. 25, pp. 1119-1127, October 1992 and E. Waks, J. Prince et al., “Cardiac motion simulator for tagged MRI,” in Proc. Workshop on Mathematical Methods in Biomedical Image Analysis, June 1996, pp. 182-191. To derive the phantoms, a motion transformation is applied to a generated 3D LV model. Then, an image is generated by selecting an image plane that intersects the LV, and assigning every point on the image plane a value that depends on whether the point lies inside or outside the LV wall.
The known ground truth (i.e., the strain slope) for these phantoms facilitates determining that the proposed processing of noisy images may reduce the absolute strain error from approximately 94% to 36% in this simulated example.
The Fourier spectral profile of the corrupted and improved phantom images in
In addition, the homogeneity of strain was analyzed for the phantom data set. The resulting strain homogeneity is shown in Table II for the corrupted and processed data. The improvement is statistically significant according to the unpaired t-test.
The experiment data indicates that embodiments of the invention improve both the strain slope and strain homogeneity, as well as largely recover the strain variance profile of the original image. Pixel-wise parametric (color-coded) maps of the variances, shown in
Performance in real data for some embodiments of the invention were determined by processing 20 independent in-vivo data sets. Results before and after enhancing a representative example of the test images (
Table III below shows that embodiments of the invention may significantly reduce spectral noise in a variety of images. In these experiments the average noise reduction of 46±6% was observed in the spectral side lobes for the 20 independent in- vivo data sets. Additionally,
Furthermore, similar to the phantom data, the overall strain homogeneity of the image clearly increases in the example experiment.
The total processing time of a HARP analysis may be only slightly increased using embodiments of the invention because faster analysis of the restored images typically compensates for the extra image restoration time. The average processing time required for processing one 256×256 DICOM image was 9.4±0.2 seconds, i.e. about 145 seconds in average for a typical data set of 15 CMR images. Unlike other alternatives, embodiments of the invention may not require any user input or prior knowledge of model templates: all the parameters may be automatically learned from a given data set. This reduces the overall data analysis time and makes the analysis more robust due to elimination of possible human errors.
Spectral domain techniques, such as HARP, may be more efficient as compared to other methods for processing tagged MR images, such as to compute cardiac strain as the indicator of cardiac function. The experimental results discussed above indicate that the proposed processing of the tagged CMR images in the spatiotemporal domain may contribute to data de-noising and may improve the subsequent analysis in the spectral domain.
The first-order (LCDG) and second-order (MGRF) probabilistic modeling of the tagged MR images may exploit both the current spatial neighborhood information, and a prior and future temporal cardiac cycle information, to thereby improve spectral main-to-side-lobe ratio, which characterizes the noise reduction in the image spectra. Embodiments of the invention improve strain homogeneity, indexed with the strain variance in the spatial image representations. Additionally, the estimation of strain slopes, being an important clinical index of strain curves, may be improved by the noise reduction in the spectral domain. Overall, this reduces the high-spectral-noise-related errors in tracking material points between successive temporal frames, as the heart progresses through the cardiac cycle.
While embodiments of the invention have been described with respect to the HARP analysis of tagged heart CMR images, some embodiments of the invention may have a wide range of applications for de-noising image spectra and motion tracking in the spectral domain, particularly in other medical and non-medical image analysis applications involving spectral tracking of points where the images incorporate grids or grid-like image elements, tracking road intersections in moving or stationary images, tracking intersections of linear structures in an image using spectral analysis, etc. Unlike many of the other known techniques, subsequent spectral algorithms, such as the HARP, need no modification to underlying algorithms. Images processed by embodiments of the invention may ensure more efficient and robust estimates of functional parameters with state-of-the-art tools for spectral image analysis. For example, some embodiments of the invention may lead to more accurate clinical cardiac measurements and evaluations, while such embodiments may add only a very small amount of time to conventional HARP image analysis. For example, as indicated in some experimental data for some embodiments, the ratio between the mean spectral power in the main lobe and the side lobes of the image spectral representations, which relates inversely to the level of spectral noise, may increased by approximately 265% for an improved noisy phantom and approximately 46% for the improved real data. Thus, by reducing the spectral noise, some embodiments may improve both the strain slope estimation and the regional strain homogeneity. Thereby, the embodiments may provide improved CMR images, thereby providing clinicians with more accurate image functional data to make judgments for the individual patient.
Additional benefits may include an ability to be used in a modular manner, such that images are improved directly, and allowing for integration with existing systems as an “addon” to the system, and without requiring modification of commercial spectral tracking code. In addition, in many embodiments, no user input or prior knowledge of model templates is required, and all the parameters may be automatically learned from a given data set, which reduces the overall data analysis time and makes the analysis more robust due to elimination of possible human errors.
Other modifications will be apparent to one of ordinary skill in the art. Therefore, the invention lies in the claims hereinafter appended.
This application is a continuation of U.S. Provisional Application No. 61/641,598, filed on May 2, 2012 by Ayman El-Baz et al., the entire disclosure of which is incorporated by reference herein.
Number | Date | Country | |
---|---|---|---|
61641598 | May 2012 | US |