Intensity-based image registration using Earth Mover's Distance

Description

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is an exemplary method using Earth Mover's Distance as a metric for intensity-based image registration in conformance with the present invention.

FIG. 2A is a graph showing Ea with respect to rotation.

FIG. 2B is a graph showing ε_awith respect to translation.

FIG. 2C is a graph showing NMI with respect to rotation.

FIG. 2D is a graph showing NMI with respect to translation.

FIG. 3 is a graph of an average registration error for ε_aand NMI.

FIG. 4A is a graph of ε_bin gamma compression for distortion applied to a data set.

FIG. 4B is a graph of KL in gamma compression for distortion applied to a data set.

FIG. 4C is a graph of ε_bin gamma expansion for distortion applied to a data set.

FIG. 4D is a graph of KL in gamma expansion for distortion applied to a data set.

FIG. 4E is a graph of ε_bin gamma compression plus offset for distortion applied to a data set.

FIG. 4F is a graph of KL in gamma compression plus offset for distortion applied to a data set.

FIG. 5 is a exemplary embodiment of the arrangement for performing the method of FIG. 1.

FIG. 6A illustrates a sagittal overlay of set of brain data scans.

FIG. 6B illustrates a sagittal overlay of FIG. 6A after alignment.

FIG. 6C illustrates a coronal overlay of a set of brain data scans.

FIG. 6D illustrates a coronal overlay of FIG. 6C after alignment.

FIG. 6E is an axial overlay of a set of brain data scans.

FIG. 6F is an axial overlay of FIG. 6E after alignment

DETAILED DESCRIPTION

The present invention relates to aligning scanned images. In the exemplary embodiment provided in the present invention, two different intensity values of two images I₁and I_2,Tare modeled by random variables X₁and X₂. For the purposes of the exemplary embodiment of the method, I₁corresponds to a reference (known) image and T is a spatial transformation applied to a study image I₂to produce a transformed image, I_2,T.

For any given pixel location x, I_2,T(x)=I₂(T(x)). The value T belongs to an arbitrary class K of mappings ranging from rigid transformations to high-dimensional nonrigid deformations. Images are acquired on a finite sampling grid and values of I_2,Tare obtained from the study image I₂by interpolation. The joint density of the pair (X₁, X_2,T) is denoted, p_x₁_,x_2,Twith marginals p_x₁p_x_2,T. The intensities of I₁and I_2,Tand are a finite set of values. The joint intensity distribution and its marginals are represented by normalized histograms.

As opposed to the KL, MI and NMI evaluation techniques, the Earth Mover's Distance is used to allow for better evaluation of histogram similarity independently of quantization effects and deformations of the feature space. Earth Mover's Distance is referred to as a “cross-bin” distance as opposed to “bin to bin” distances such as used in KL. When applied to normalized histograms, Earth Mover's Distance is a discrete version of the Wasserstein metric in probability theory. This metric is the solution of an optimal mass transportation problem. The Earth Mover's Distance corresponds to the cost of optimally transporting one distribution into the other given a ground distance. The ground distance is typically derived from an L_pnorm, for instance d_p(x,y)=∥x−y∥_pwhere (p=2 yields the Euclidean distance). More formally, given two normalized histograms h₁and h₂, whose ith bins are centered at locations x_iand y_i, respectively, Earth Mover's Distance is defined by

$EMD (h_{1}, h_{2}) = \min_{f_{i, j}} \sum_{i, j} f_{i, j} d_{p} (x_{i}, y_{j})$

Subject to the following constraints:

$f_{i, j} \geq 0, \forall i, j, \sum_{j} f_{i, j} = h_{1} (i), \forall i, \sum_{i} f_{i, j} = h_{2} (j), \forall j .$

These values f_i,jcan be seen as elementary flows transporting elements of h₁and h₂(or h₂to h₁). This approach generalizes to multidimensional histograms such as joint intensity distributions. When the ground distance is arbitrary, the optimal solution to the linear programming problem is given by a transportation simplex algorithm.

With this definition, two Earth Mover's distance based similarity measures are provided. The first measure is:

ε_a(I₁,I_2,T)=EMD_L₁(p_x₁_,x_2,T,p_x₁p_x_2,T).

This value is maximized to achieve registration. The second is used when a learned joint intensity distribution p₀is available.

ε_b(I₁,I_2,T)=EMD_L₁(p_x₁_,x_2,T,p₀).

ε_bis minimized in order to align a pair of images. The implementation of these new measures raises several practical challenges. Computing Earth Mover's Distance for a pair of joint distribution and histograms involves solving a linear programming problem. The registration process requires multiple evaluations of Earth Mover's Distance on two dimensional histograms. To accomplish this, an algorithm providing the following 6 steps of pseudo-code allows for a compact solution.

Using this approach, the linear programming problem can be drastically simplified in order to get a much more efficient simplex algorithm. Computation times make the use of EMD_Lpossible for registration purposes.

Referring to FIG. 1 the method 10 of aligning at least two medical scan images is presented. A first medical scan image is obtained 20, the first medical scan image having a corresponding first data set. Next, a second medical scan image is obtained 30, the second medical scan image having a corresponding second data set. Next, a joint intensity distribution is learned 40 between two medical scan images. In the exemplary embodiment, two images from a patient are used, different than the first and second medical scan images. Following this step, a joint density is calculated, with marginals in the first medical scan image and second medical scan image 60. Following this step, the first medical scan image is compared with the second medical scan image using an Earth Mover's Distance 80. Finally, the first medical scan image is aligned with the second medical scan image with results obtained from the comparing of the first medical scan image with the second medical scan image using the Earth Mover's Distance 90. The above order of steps may also be augmented if a a learned joint intensity distribution is known p_o. For this instance, the images may be pre-aligned and the factor

ε_b(I₁,I_2,T)=EMD_L₁(p_x₁_,x_2,T,p₀).

minimized to align the images as completely as possible.

Referring to FIG. 5, an exemplary embodiment of an arrangement 108 of conducting analysis for alignment of medical scan images is presented. A computer 100 is used for the analysis of the data sets. The computer 100 is provided with an input capability to allow MRI scan data, for example, to be accepted into the computer 100. A Graphical User Interface (GUI) may be used by the physician/technician for easing input of the information into the computer 100. The computer 100 is connected, in the exemplary embodiment, to two output devices 102, 104. In the illustrated embodiment of the present invention, a first output device is a printer 102 that can superimpose the data sets for viewing. Additionally, a monitor 104 is provided such that the visual images of the superimposed data sets can be used by physicians. The computer 100 is connected to a data source input, in this case, an MRI machine 106, however other data acquisition systems, such as CT scanning devices may be used.

EXEMPLARY EMBODIMENT

Experiments with two pairs of Magnetic Resonance brain images were performed. The first pair was acquired from a volunteer with a Siemens Magnetom Avanto 1.5T machine. One of the datasets was obtained with a T2-weighted HASTE sequence (matrix size 512×384, 22 slices, voxel size of 0.45×0.45×5 mm³) and the other with a T1 weighted Spin-Echo sequence (matrix size 192×192, 36 slices, voxel size of 1.2×1.2×3 mm³) A volunteer was asked to move his head after the first acquisition to simulate a misalignment of large amplitude. A pair of simulated T1 and T2 weighted MR images from a database are used to generate ground truth data and measure registration errors. The database had a volume size of 181×217×181 and isotropic voxels (1 mm³). Their intensity non-uniformity was set to 20%. The noise level of the T1 and T2 weighted images was set to 3% and 9% respectively.

In all registration experiments, the number of quantization levels was set to 16. This value was chosen empirically and provides a good compromise between registration accuracy and computational efficiency. Joint histograms were computed using partial volume interpolation. In the rigid registration experiments we applied a multi-resolution hill-climbing optimization strategy. With a 3 GHz microprocessor, the evaluation EMD_Ltakes on average 1 ms for histograms of size 162. This computation time increases dramatically to 15 ms for a size of 32²and up to 313 ms for 64².

In the first experiment, the value of ε_aand NMI evolves when artificial rotations and translations are applied to one of the volumes. T1 and T2 weighted datasets which were perfectly aligned were used for analysis. Profiles of function ε_awith respect to rotation (FIG. 2A) and with respect to translation (FIG. 2B) are provided. Corresponding profiles for NMI with respect to rotation (FIG. 2C) and translation (FIG. 2D) were also computed for comparison.

Like NMI, ε_areaches a peak value at 0 and has a large capture range.

A first qualitative evaluation ε_ais conducted by rigidly aligning the real magnetic resonance data (T2 and T1 weighted scans with a large displacement). The configuration of the images before and after registration are shown in FIGS. 6A to 6F. FIG. 6A illustrates a sagittal overlay, while FIG. 6B illustrates a sagittal overlay after alignment. FIG. 6C illustrates a coronal overlay, while FIG. 6D is a coronal overlay after alignment. FIG. 6E is an axial overlay, while FIG. 6F is an axial overlay after alignment.

In the next experiment, the average registration errors of ε_a− and NMI based rigid registration algorithms for 400 points evenly placed inside the T1− and T2− weighted datasets are computed. The results of the evaluation are presented in FIG. 3. These measurements were obtained by applying 18 known synthetic transformations to the T2 weighted volume. The various levels of rotation and translation ranging approximately between ±10 degrees and ±10 millimeters. The optimization of both measures produces the same solution and success in aligning the image with a sub-voxel accuracy. ε_aoutperforms NMI on three datasets, but fails to converge to the correct solution on dataset 16 suggesting a smaller effective capture range.

In the final experiment, a prior distribution from aligned brain images is estimated and compared to the profiles of ε_band KL with respect to artificial transformations when the intensity values of one of the volumes have been distorted. A gamma compression/expansion operator and an intensity offset are applied to after the intensity profile of the images (with y equal to 0.3 and 1.7, and an offset value equal to 400). These operations are performed before quantization of the data.

In FIGS. 4A to 4F, ε_bis less sensitive than KL to intensity variations. When undergoing gamma compression, the result is particularly significant when combined to the application in an offset, which narrows considerably the capture range of KL.

The present invention provides for an advantage compared to conventional technologies that the method efficiently aligns two image data sets. The alignment of the image data sets allows physicians the capability to not only evaluate a procedures progress with respect to two dimensions, but also in a third dimension, if needed as the alignment is highly accurate. This additional capability, allows for safer and less intrusive medical procedures for patients.

The method and apparatus of the present invention may be used with multiple types of data acquisition systems. An example of this are MRI systems. Other types of scanning may be used and are applicable to use with the method of the present invention. These types of scanning include, for example, x-ray, ultrasound or computed tomography (CT) scanning.

In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereunto without departing from the broader spirit and scope of the invention as set forth in the appended claims. The specification and drawings are accordingly to be regarded in an illustrative rather than in a restrictive sense.

Claims

1. A method of aligning at least two medical scan images, comprising: (a) obtaining a first medical scan image, the first medical scan image having a corresponding first data set;(b) obtaining a second medical scan image, the second medical scan image having a corresponding second data set;(c) calculating a joint density, with marginals from the first medical scan image and the second medical scan image;(d) comparing the first medical scan image with the second medical scan image using an Earth Mover's Distance; and(e) aligning the first medical scan image with the second medical scan image with results obtained from the comparing of the first medical scan image with the second medical scan image using the Earth Mover's Distance.
2. The method according to claim 1, wherein image intensity values of the first data set and image intensity values of the second data set are used in estimating the joint density of the first medical scan image and the second medical scan image.
3. The method according to claim 1, wherein the first medical scan image and the second medical scan image are magnetic resonance images.
4. The method according to claim 1, wherein the first medical scan image and the second medical scan image are computed tomography scan images.
5. A method of aligning at least two images, comprising: (a) obtaining a first image, the first image having a corresponding first data set;(b) obtaining a second image, the second having a corresponding second data set;(c) learning a joint intensity distribution from a pair of prealigned images; and(d) aligning the first image and the second image by computing Earth Mover's Distance between their observed joint intensity distribution and the learned joint intensity distribution.
6. The method according to claim 5, wherein the step of aligning the first image and the second image by computing Earth Mover's Distance between their observed joint intensity distribution and the learned joint intensity distribution is accomplished using an Earth Mover's Distance induced through a formula of εb(I1,I2,T)=EMDL1(px1,x2,T,p0).
7. The method according to claim 5, wherein image intensity values of the first data set and image intensity values of the second data set are used in calculating the observed joint intensity distribution of the first medical scan image and the second medical scan image.
8. The method according to claim 5, wherein the first image and the second image are magnetic resonance images.
9. The method according to claim 5, wherein the first image and the second image are computed tomography scans.
10. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for aligning at least two medical scan images, comprising: (a) obtaining a first medical scan image, the first medical scan image having a corresponding first data set;(b) obtaining a second medical scan image, the second medical scan image having a corresponding second data set;(c) calculating a joint density, with marginals from the first medical scan image and the second medical scan image;(d) comparing the first medical scan image with the second medical scan image using an Earth Mover's Distance; and(e) aligning the first medical scan image with the second medical scan image with results obtained from the comparing of the first medical scan image with the second medical scan image using the Earth Mover's Distance.
11. The device according to claim 10, wherein the first medical scan image and the second medical scan image are magnetic resonance images.
12. The device according to claim 10, wherein the first medical scan image and the second medical scan image are computed tomography scans.
13. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for aligning at least two medical scan images, comprising: (a) obtaining a first image, the first image having a corresponding first data set;(b) obtaining a second image, the second having a corresponding second data set;(c) learning a joint intensity distribution from a pair of prealigned images; and(d) aligning the first image and the second image by computing Earth Mover's Distance between their observed joint intensity distribution and the learned joint intensity distribution.
14. The device according to claim 13, wherein the first image and the second image are magnetic resonance images.
15. The device according to claim 13, wherein the first image and the second image are computed tomography scans.
16. A method of aligning at least two images, comprising: (a) obtaining a first image, the first image having a corresponding first data set;(b) obtaining a second image, the second having a corresponding second data set; and(c) aligning the first image and the second image by computing Earth Mover's Distance between their observed joint intensity distribution and the product of its marginals.

CROSS-REFERENCE TO RELATED APPLICATIONS

This is a United States non-provisional application of U.S. provisional patent application Ser. No. 60/836,596 filed Aug. 9, 2006 by Christophe Chefd'hotel and Guillaume Bousquet, the entirety of which application is incorporated by reference herein.

Provisional Applications (1)

	Number	Date	Country
	60836596	Aug 2006	US

Intensity-based image registration using Earth Mover's Distance

Information

Publication Number

Date Filed

Date Published

Inventors

CPC

US Classifications

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATIONS

Provisional Applications (1)