The subject matter described herein relates to methods and systems for registration of 2D projection images to 3D volume images for use during real-time image-guided radiation therapy (IGRT) or any other real-time image-guided remote therapy. More particularly, the subject matter described herein relates to methods, systems, and computer readable media for real-time 2D/3D deformable registration using metric learning.
Tumor localization in 3D is the main goal of IGRT. Another important goal is the localization of non-cancerous objects at risk to radiation. They are usually accomplished by computing the patient's treatment-time 3D deformations based on an on-board imaging system, which is usually an x-ray based system. The treatment-time 3D deformations can be computed by doing image registration between the treatment-time reconstructed 3D image and the treatment-planning 3D image (3D/3D registration) or between the treatment-time on-board 2D projection images and the treatment-planning 3D image (2D/3D registration).
Recent advances of the IGRT registration methods emphasize real-time computation and low-dose image acquisition. Russakoff et al. [1,2], Khamene et al. [3], Munbodh et al. [4], Li et al. [5,6] rejected the time-consuming 3D/3D registration and performed 2D/3D registration by optimizing similarity functions defined in the projection domain. Other than the optimization-based methods, Chou et al. [7,8] recently introduced a faster and low-dose 2D/3D image registration by using a linear operator that approximates the deformation parameters. However, all of the above registration methods involve computationally demanding production of Digitally-Reconstructed Radiographs (DRRs) in each registration iteration (e.g., 15 ms on a modern GPU to produce a 256×256 DRR from a 256×256×256 volume [9]), which makes them difficult to be extended to support real-time (>30 fps) image registration.
Accordingly, there exists a need for real-time 2D/3D deformable registration that is fast, accurate, and robust. More specifically, there exists a need for metric-learning enabling real-time 2D/3D deformable registration.
According to one aspect, the subject matter described herein includes a method for real-time 2D/3D deformable registration using metric learning. The method includes creating a catalogue of simulated 2D projection images based on a reference 3D image and a shape space of 3D deformations, where each entry in the catalogue is created by applying to the reference 3D image a set of deformation parameters from the shape space of deformations, simulating a 2D projection of the result; associating the simulated 2D projection image with the deformation parameters used to create the simulated 2D projection image, and storing the simulated 2D projection image and associated deformation parameters in the catalogue. The method also includes receiving an acquired 2D image, and, in response to receiving the 2D image, calculating a value of distance between the acquired 2D image and a simulated 2D projection image for each of the simulated 2D projection images in the catalogue, using the calculated distances to calculate weighting factors to be applied to the deformation parameters of each of the simulated 2D projection images in the catalogue, and calculating 3D deformation parameters inferred from the acquired 2D image based on the weighted 3D deformation parameters in the catalogue. The calculated deformation parameters are then used to deform a 3D volume of interest to produce a 3D volume that represents the 3D layout of the tissue at the time that the received 2D image was acquired.
The method also includes two approaches that learn the distance metrics used in computing the distances between 2D images. The first approach uses linear regressions to learn the relationship between Euclidean inter-projection-image distance metrics and the respective 3D deformation parameter values. The second approach uses a leave-one-out strategy that optimizes, over Riemannian metrics, the accuracy of the prediction of the 3D deformation parameter values of the left-out projection images.
According to another aspect, the subject matter described herein includes a system for real-time 2D/3D deformable registration using metric learning. The system includes a data store for storing a catalogue of simulated 2D projection images that were created based on a reference 3D image and a shape space of 3D deformations, wherein each entry in the catalogue was computed by applying to the reference 3D image a set of deformation parameters from the shape space of deformations, simulating a 2D projection of the result; associating the simulated 2D projection image with the deformation parameters used to create the simulated 2D projection image, and storing the simulated 2D projection image and associated deformation parameters in the catalogue. The system also includes a hardware module for receiving an acquired 2D image, and, a software module that, in response to receiving the 2D image, calculates a value of distance between the acquired 2D image and a simulated 2D projection image for each of the simulated 2D projection images in the catalogue, uses the calculated distances to calculate weighting factors to be applied to the deformation parameters of each of the simulated 2D projection images in the catalogue, calculates 3D deformation parameters inferred from the acquired 2D image based on the weighted deformation parameters in the catalogue, and uses the calculated deformation parameters to deform a 3D volume of interest to produce a 3D volume that represents the 3D layout of the tissue at the time that the 2D image was acquired. The system also includes two software modules that learn the distance metrics used in computing the distances between 2D images. The first module uses linear regressions to learn the relation between Euclidean inter-projection-image distance metrics and respective 3D deformation parameter values. The second module uses a leave-one-out strategy that optimizes, over Riemannian metrics, the accuracy of the prediction of the 3D deformation parameter values of the left-out projection images.
The subject matter described herein can be implemented in software in combination with hardware and/or firmware. For example, the subject matter described herein can be implemented in software executed by a processor. In one exemplary implementation, the subject matter described herein can be implemented using a non-transitory computer readable medium having stored thereon computer executable instructions that when executed by the processor of a computer control the computer to perform steps. Exemplary computer readable media suitable for implementing the subject matter described herein include non-transitory computer-readable media, such as disk memory devices, chip memory devices, programmable logic devices, and application specific integrated circuits. In addition, a computer readable medium that implements the subject matter described herein may be located on a single device or computing platform or may be distributed across multiple devices or computing platforms.
Preferred embodiments of the subject matter described herein will now be explained with reference to the accompanying drawings, wherein like reference numerals represent like parts, of which:
In accordance with the subject matter disclosed herein, systems, methods, and computer readable media for real-time 2D/3D deformable registration using metric learning are provided.
We present a novel real-time 2D/3D registration method, called Registration Efficiency and Accuracy through Learning Metric on Shape (REALMS), that does not require DRR production in the registration. It calculates the patient's treatment-time 3D deformations by kernel regression. Specifically, each of the patient's deformation parameters is interpolated using a weighting Gaussian kernel on that parameter's training case values. In each training case, its parameter value is associated with a corresponding training projection image. The Gaussian kernel is formed from distances between training projection images. This distance for the parameter in question involves a Riemannian metric on projection image differences. At planning time, REALMS learns the parameter-specific metrics from the set of training projection images using a Leave-One-Out (LOO) training.
To the best of our knowledge, REALMS is the first 2D/3D deformable registration method that achieves real-time (>30 fps) performance. REALMS uses the metric learning idea firstly introduced in Weinberger and Tesauro [10] to tackle the 2D/3D image registration problem. Particularly, in order to make the metric learning work for the high dimensional (D>>103) projection space, REALMS uses a specially-designed initialization approximated by linear regression. The results have led to substantial error reduction when the special initialization is applied.
Given a 3D reference image, such as a computed tomography (CT) image, and a scale space of deformations, the REALMS (Registration Efficiency and Accuracy through Learning Metric on Shape) method described herein interpolatively and very quickly (a few milliseconds on a standard desktop computer) calculates a deformation in the scale space from a 2D projection image, e.g., a radiograph, consistent with the reference image. It is designed to improve radiation treatment of cancer via image-guided radiotherapy.
The subject matter described herein uses a scale space consisting of a mean deformation and a limited number of modes of variation, such that a deformation in the scale space is formed as the sum of the mean and linear combinations of the modes with mode weights given by the n-tuple c (formed by weights c1, c2, . . . cn).
The subject matter described herein involves computing n 2D “interest images” Ai n capture distance values σi, and a catalogue of simulated 2D projection images Ik, each computed by applying the deformation determined by an associated vector ck to the reference image and calculating a projection of the result. Then, given a new 2D image I, for each coefficient ci the method calculates the weight wik associated with the ith component of ck as follows, and interpolates each coefficient ci as Σk=1nwikcik/Σk=1nWik. The weight wik is computed as exp[−0.5 distance2(I,Ik)/σi2] e.g., as a decreasing function of the distance of I from Ik, with that distance computed by the Euclidean magnitude of the dot product AiT(I-Ik).
Each interest image Ai is computed as the regression matrix linearly predicting ci from the images J−Ik, where J ranges over all of the images Ij and k ranges over all images in the catalogue but the jth. The capture distance σi is selected to yield a minimum average error magnitude in the cik over all images in the catalogue. Ai can also be optimized using an objective function made from some norm of the REALMS prediction accuracies of the catalogue cjk values.
A multiscale version with successive locality in either or both spatial region of interest and coefficient ci region of interest can increase the accuracy of the method, at the expense of some of its speed.
Reference will now be made in detail to exemplary embodiments of the present invention, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers will be used throughout the drawings to refer to the same or like parts.
Step 100 includes creating a catalogue of simulated 2D projection images based on a reference 3D image and a shape space of deformations. In one embodiment, the reference 3D image is a 3D image of a tissue to be treated, usually taken before treatment time, during a planning stage. The process of creating the catalogue will be described in more detail in
Step 102 includes learning the inter-projection-image distance metrics that measure weighting factors on the 3D deformation parameters using the catalogue. In one embodiment, the learned projection metrics may be learned by performing linear regressions to learn the relation between Euclidean inter-projection-image distance metrics and respective 3D deformation parameter values. In another embodiment, the learned projection metrics may be learned by performing a leave-one-out analysis, over Riemannian metrics, the accuracy of the prediction of the 3D deformation parameter values of the left-out projection images.
At step 104, a 2D image is received, which may be a 2D image of a tissue being treated, acquired at treatment time.
At step 106, a distance value is calculated between the received 2D image and each of the simulated 2D projection images in the catalogue. In one embodiment, the distance value is calculated as a difference between the received 2D image and a 2D image from the catalogue, calculated on a pixel-by-pixel basis. In one embodiment, the distance value equals the sum of the differences between the intensity value of a pixel on one image and the intensity value of the corresponding pixel on the other image. In alternative embodiments, other distance functions may be used, including comparing groups of pixels, where the comparison may be weighted or un-weighted, and so on. In one embodiment, a distance value is calculated for each image in the catalogue. Alternatively, a distance value may be calculated for a select subset of images in the catalogue.
At step 108, the calculated distances are used to calculate weighting factors to be applied to the deformation parameters for each of the simulated 2D projection images in the catalogue, and at step 110, the weighted deformation parameters for each of the 2D images in the catalogue are used to calculate 3D deformation parameters inferred from the received 2D image. In this manner, the deformation parameters for catalogue images that are more similar to the received image are given more weight than is given to the deformation parameters for catalogue images that are less similar to the received image. In one embodiment, a weighting function is used to map distance to a weighting coefficient to be applied to the deformation parameters of the catalogue image. In one embodiment, a separate weighting function may be used for each separate parameter. In one embodiment, the weighting function is determined based on analysis of the 2D projection images in the catalogue, such as by a leave-one-out analysis or regression analysis.
At step 112, the equivalent deformation parameters that were calculated for the received 2D image are applied to deform a 3D volume of interest, such as a 3D volume of a tissue being treated (or a 3D model of such tissue), to produce a 3D volume that represents the 3D layout (e.g., shape and location) of the tissue at the time that the received 2D image was acquired. If this method is used to provide real-time 2D/3D deformable registration during treatment time, then at step 114, the 3D image volume produced by step 112 is used to inform or adjust treatment during treatment time. For example, the real-time 2D/3D registration allows accurate calculation of the target or actual location within the tissue that is being treated, as well as the target or actual dose, dose accumulation, or dose exposure for the tissue being treated.
Because the methods of 2D/3D registration described herein do not require using a 3D volume to generate, at treatment time, a set of 2D simulated projection images that are then compared to the 2D images received at treatment time, as is done by conventional methods, the registration methods described herein are very fast and can perform registration and 3D volume generation at more than 30 frames per second on currently available hardware.
Step 200 includes applying to the reference 3D image a set of deformation parameters from the shape space of deformations. For example, if it has been determined, via principal component analysis (PCA) or some other method, that the shape space of deformations of the tissue in question can be characterized using only four deformation parameters P1, P2, P3, and P4, and that each parameter has its own range of values, then the shape space of deformations comprises a four-dimensional space, in which case N different 4-tuples of parameter values may be generated. In step 200, the set of deformation parameters would comprise one of the N different 4-tuples. Application of the particular 4-tuple deforms the 3D volume according to the particular values of the deformation parameters in the 4-tuple.
At step 202, a 2D projection of the deformed 3D image is simulated. When the catalogue of images is being generated for the purpose of providing real-time 2D/3D registration during treatment time, the 2D projections are likely to be generated at a projection angle that is essentially the same as the expected angle from which the treatment time 2D images will be taken. However, 2D projection images of the deformed 3D image may be generated using other projection angles, or from multiple projection angles.
At step 204, the simulated 2D projection image is associated with the deformation parameters used to create the simulated 2D projection image.
At step 206, the simulated 2D projection image and its associated deformation parameters are stored in the catalogue.
The operation of REALMS will now be illustrated graphically in
In
In
In
This novel approach of using a distance metric in projection space to map to a location in parameter space and then using those parameters to deform a model of the structure in question allows registration to occur very quickly and accurately, with a mean target registration error of only 2.56±1.11 mm at 10.89±0.26 ms, or about 92 frames per second. As will be described in more detail below, metric tensor M and kernel width σ learning may be optimized via leave one out training.
The REALMS method will now be described in more detail. In this section, we describe REALMS's 2D/3D registration framework. REALMS uses kernel regression (eq. 1) to interpolate the patient's n 3D deformation parameters c=(c1, c2, . . . , cn) separately from the on-board projection image Ψ(θ) where θ is the projection angle. It uses a Gaussian kernel KM
In the treatment-time registration, each deformation parameter cii in c can be estimated with the following kernel regression:
where KM
REALMS limits the deformation to a shape space. It models deformations as a linear combination of a set of basis deformations calculated through PCA analysis. In our target problem—lung IGRT, a set of respiratory-correlated CTs (RCCTs, dimension: 512×512×120) {Jτ|τ=1, 2, 10} are available at planning, time. From these a mean image I=
Deformation Shape Space and Mean Image Generation. REALMS computes a respiratory Fréchet mean image
where Jτ(x) is the intensity of the pixel at position x in the image Jτ, ντ,γ is the fluid-flow velocity field for the image Jτ in flow time γ, s is the weighting variable on the image dissimilarity, and φτ(x) describes the deformation at the pixel location x: φτ(x)=x+∫01ντ,γ(x)dγ.
Statistical Analysis.
With the diffeomorphic deformation set {φτ|τ=1,2, . . . , 10} calculated, our method finds a set of linear deformation basis vectors φpci by PCA analysis. The scores λτi on each φpci yield φτ in terms of these basis vectors.
φτ=
We choose a subset of n eigenmodes that captures more than 95% of the total variation. Then we let the n scores form the n-dimensional parametrization c.
c=(c1,c2, . . . cn)=λ1,λ2, . . . λn) (6)
For most of our target problems, n=3 satisfies the requirement. We now describe how REALMS learns the metric tensor Mi and decides the kernel width σi.
Metric Learning and Kernel Width Selection. REALMS learns a metric tensor Mi with a corresponding kernel width σi for the patient's ith deformation parameter ci using a LOO training strategy. At planning time, it samples a set of N deformation parameter tuples {ck=(ck1, Ck2, . . . , ckn)|k=1, 2, . . . , N} to generate training projection images {P(I∘T(ck); θ)|k=1, 2, . . . , N} where their associated deformation parameters are sampled uniformly within three standard deviations of the scores λ observed in the RCCTs. For each deformation parameter ci in c, REALMS finds the best pair of the metric tensor Mi† and the kernel width σi† that minimizes the sum of squared LOO regression residuals among the set of N training projection images:
where ĉki(Mi, σi) is the estimated value for parameter cki interpolated by the metric tensor Mi and the kernel width σi from the training projection images X other than ; Mi needs to be a positive semi-definite (p.s.d.) matrix to fulfill the pseudo-metric constraint; and the kernel width σi needs to be a positive real number.
To avoid high-dimensional optimization over the constrained matrix Mi, we structure the metric tensor Mi as a rank-1 matrix formed by a basis vector αi:Mi=αiαiT. Therefore, we can transform eq. 7 into a optimization over the unit vector αi where ∥αi∥2=1:
Then we can rewrite the squared distance dM
dα
=P(I∘T(ck);θ)−P(I∘T(cx);θ), (12)
where is a vector of intensity differences between projection images generated by parameters and ; and αi is a metric basis vector where the magnitude of the inner product of αi and the intensity difference vector , αiT· gives the Riemannian distance for the parameter ci (eq. 11).
The learned metric basis vector αi† and the selected kernel width σi† form a weighting kernel Kα
Linear-Regression Implied Initial Metric.
Since the residual functional ζ (see eq. 7) that we want to minimize is non-convex, a good initial guess of the metric basis vector α is essential. Therefore, REALMS uses a vector wi as an initial guess of the metric basis vector αi for the parameter ci. Let w=(w1w2 . . . wn) list these initial guesses. The matrix W is approximated by a multivariate linear regression (eq. 13 and eq. 14) between the projection difference matrix R=(r1r2 . . . rN)T and the parameter differences matrix ΔC. In particular, the projection difference vector rK=P(I∘T(cK); θ)−P(I; θ) is the intensity differences between the DRRs calculated from the deformed image I∘T(cK) and the DRRs calculated from the mean image I (where c=0).
The inner product of the matrix W, calculated by the pseudo-inverse in eq. 14, and the projection intensity difference matrix R, WT R, gives the best linear approximation of the parameter differences ΔC. Therefore, we use wi as the initial guess of the metric basis vector αi for the parameter ci.
Optimization Scheme.
REALMS uses a two-step scheme to optimize the metric basis vector αi and the kernel width σi in eq. 10.
First, for each candidate kernel width σi, it optimizes the metric basis vector αi using the quasi-Newton method (specifically, the BFGS method) with the vector wi as the initialization. The gradient of the function Lc
Second, REALMS selects a kernel width σi† among the candidate kernel widths where its learned metric basis vector αi† yields minimum LOO regression residuals ζc
Projection Normalization.
To account for variations caused by x-ray scatter that produces inconsistent projection intensities, REALMS normalizes both the training projection images P(I∘T(); θ) and the on-board projection image Ψ(θ). In particular, it uses a localized Gaussian normalization that has shown promise in removing the undesired scattering artifacts.
Synthetic Tests.
We used coronal DRRs (dimension: 64×48) of the target CTs as synthetic on-board cone-beam projection images. The target CTs were deformed from the patient's Fréchet mean CT by normally distributed random samples of the first three deformation parameters. In our lung datasets, the first three deformation parameters captured more than 95% of lung variation observed in their RCCTs. We generated 600 synthetic test cases from 6 lung datasets and measured the registration quality by the average mTRE (mean Target Registration Error) over all cases and all voxels at tumor sites.
With REALMS's registrations, the average mTRE and its standard deviation are down from 6.89±3.53 mm to 0.34±0.24 mm using N=125 training projection images. The computation time for each registration is 11.39±0.73 ms (87.79 fps) on Intel Corel Quad CPU Q6700. As shown in
Tradeoff of Time Versus Accuracy.
We tested REALMS on 6 lung datasets with an on-board CBCT system where a single coronal on-board CB projection (dimension downsampled to 64×48 for efficient computation) at both EE (End-Expiration) and EI (End-Inspiration) phases were used for the testing. For each dataset, we generated N=125 training DRRs to learn the metrics and select optimal interpolation kernel widths. The learned metrics and the selected kernel widths were used to estimate deformation parameters for the testing EE and EI on-board projections. The estimated CTs were deformed from Fréchet mean CT with the estimated deformation parameters. The results were validated with reconstructed CBCTs at target phases.
Table 1 shows the 3D tumor centroid differences (TCDs) between REALMS-estimated CTs and the reconstructed CBCTs at the same respiratory phases. Tumor centroids were computed via Snake active segmentations. As shown in table 1, REALMS reduces the TCD from 5.58±3.14 mm to 2.56±1.11 mm in 10.89±0.26 ms (91.82 fps). Numbers inside the parentheses are the initial TCDs.
The Learned Metric Basis Vector.
As shown in
The subject matter described herein presents an accurate and real-time 2D/3D registration method, REALMS, that estimates 3D deformation parameters from a single projection image using kernel regressions with learned rank-1 projection distance metrics. The learned distance metrics that are optimized with an initialization approximated by linear regression results in success for this high dimensional metric learning, and avoids convergence to local minima and the wrong distance metrics that would result. With this initialization, the regression estimation on both synthetic and real test cases are well suited for real-time and low-dose IGRT by using a single projection image.
The disclosure of each of the following references is incorporated herein by reference in its entirety:
It will be understood that various details of the subject matter described herein may be changed without departing from the scope of the subject matter described herein. Furthermore, the foregoing description is for the purpose of illustration only, and not for the purpose of limitation.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 61/789,000, filed Mar. 15, 2013; the disclosure of which is incorporated herein by reference in its entirety.
This invention was made with government support under Grant No. CA128510 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2014/028251 | 3/14/2014 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2014/144019 | 9/18/2014 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5926568 | Chaney et al. | Jul 1999 | A |
6690816 | Aylward et al. | Feb 2004 | B2 |
7200251 | Joshi et al. | Apr 2007 | B2 |
8666128 | Chaney et al. | Mar 2014 | B2 |
20050180544 | Sauer et al. | Aug 2005 | A1 |
20090180678 | Kuduvalli et al. | Jul 2009 | A1 |
20120008734 | Thomson et al. | Jan 2012 | A1 |
20120027278 | Chaney et al. | Feb 2012 | A1 |
20120314927 | Chen | Dec 2012 | A1 |
Number | Date | Country |
---|---|---|
WO 2011156526 | Dec 2011 | WO |
Entry |
---|
Notification of Transmittal of the International Search Report and the Written Opinion of the International Searching Authority, or the Declaration for International Application No. PCT/US2014/028251 (Aug. 22, 2014). |
Russakoff et al., “Fast Intensity-Based 2D-3D Image Registration of Clinical Data Using Light Fields,” Proceedings of the Ninth IEEE International Conference on Computer Vision, vol. 1, pp. 1-7 (2003). |
Russakoff et al., “Fast Generation of Digitally Reconstructed Radiographs Using Attenuation Fields with Application to 2D-3D Image Registration,” IEEE Transactions on Medical Imaging, vol. 24, No. 11, pp. 1441-1454 (Nov. 2005). |
Khamene et al., “Automatic registration of portal images and volumetric CT for patient positioning in radiation therapy,” Medical Image Analysis 10, pp. 96-112 (2006). |
Munbodh et al., “Automated 2D-3D registration of a radiograph and a cone beam CT using line-segment enhancement,” Medical Physics 33, pp. 1-27 (2006). |
Li et al., “Real-time volumetric image reconstruction and 3D tumor localization based on a single x-ray projection image for lung cancer radiotherapy,” Medical Physics 37, pp. 1-8 (2010). |
Li et al., “3D tumor localization through real-time volumetric x-ray imaging for lung cancer radiotherapy,” Medical Physics 38, pp. 1-21 (2011). |
Chou et al., “A Learning-Based Patient Repositioning Method from Limited-Angle Projections,” In: Brain, Body and Machine, vol. 83 of Advances in Intelligent and Soft Computing, Springer, pp. 83-94 (2010). |
Chou et al., “CLARET: A Fast Deformable Registration Method Applied to Lung Radiation Therapy,” In: Fourth International (MICCAI) Workshop on Pulmonary Image Analysis, pp. 1-12 (2011). |
Miao et al., “A Hybrid Method for 2-D/3-D Registration Between 3-D Volumes and 2-D Angiography for Trans-Catheter Aortic Valve Implantation (TAVI),” In: IEEE International Symposium on Biomedical Imaging (ISBI) pp. 1215-1218 (2011). |
Weinberger et al., “Metric Learning for Kernel Regression,” In: Eleventh International Conference on Artificial Intelligence and Statistics, pp. 1-8 (2007). |
Lorenzen et al., “Multi-Modal Image Set Registration and Atlas Formation,” Medical Image Analysis Journal 10(3), pp. 1-24 (2006). |
Number | Date | Country | |
---|---|---|---|
20160012592 A1 | Jan 2016 | US |
Number | Date | Country | |
---|---|---|---|
61789000 | Mar 2013 | US |