The present invention relates to medical imaging of the heart, and more particularly, to using medical images of the heart to determine an angulation of a C-arm image acquisition system for an intervention to treat structural heart disease.
Many devices are available or under development for minimally invasive treatment of structural heart disease suing catheters instead of more invasive surgery using a heart-lung machine. This provides the opportunity to treat sicker patients and also opens the field to interventional cardiologists. Aortic valve disease is the most common valvular disease in developed countries, and has the second highest incidence among congenital valvular defects. Implantation of an artificial valve (i.e., valve prosthesis) is often necessary to replace a damaged natural valve. Transcatheter Aortic Valve Implantation (TAVI) is a minimally invasive intervention to replace the aortic valve. In TAVI, an aortic valve prosthesis is inserted via a catheter and X-ray imaging is used to support a physician in positioning and deployment of the valve prosthesis. In particular, during a valve implantation surgery, 2D fluoroscopic images (X-ray images) are often captured in real time using a C-arm image acquisition system to provide guidance to the physician.
In order to achieve good intervention results, it is desirable to have a dedicated C-arm angulation that provides an optimal view of the area of interest. For example, for TAVI, an x-ray view that is perpendicular to the aortic root is desirable. In conventional valve implantation procedures, physicians typically select an angulation for a C-arm X-ray device by iteratively acquiring angiograms using a contrast agent. From each angiogram, a physician manually predicts a good angulation until an appropriate angulation for the valve implantation procedure is selected. This selection process typically requires at least 2-3 iterations. Accordingly, this selection process typically requires a large amount of contrast agent and is time consuming.
The present invention provides a method and system for image fusion based planning of C-arm angulation for structural heart disease interventions.
In one embodiment of the present invention, a 3D ultrasound image including a cardiac region is received. The 3D ultrasound image is registered to a 3D coordinate system of a C-arm image acquisition system. A cardiac structure of interest is detected in the registered 3D ultrasound image. An angulation of the C-arm image acquisition system is determined based on the detected structure of interest in the registered 3D ultrasound image.
These and other advantages of the invention will be apparent to those of ordinary skill in the art by reference to the following detailed description and the accompanying drawings.
The present invention is directed to a method and system for image fusion based C-arm angulation planning for interventions to treat structural heart disease. Embodiments of the present invention are described herein to give a visual understanding of the method for determining an optimal angulation. A digital image is often composed of digital representations of one or more objects (or shapes). The digital representation of an object is often described herein in terms of identifying and manipulating the objects. Such manipulations are virtual manipulations accomplished in the memory or other circuitry/hardware of a computer system. Accordingly, it is to be understood that embodiments of the present invention may be performed within a computer system using data stored within the computer system.
Embodiments of the present invention utilize ultrasound and x-ray fusion to determine an optimal C-arm angulation for catheter based procedures to treat structural heart disease. Embodiments of the present invention utilize intra-operative images and therefore reflect the current state of the patient. Embodiments of the present do not require any extra operating room setting like in intra-operative 3D X-ray imaging. Embodiments of the present invention reduce the amount of radiation and contrast agent exposure required for a patient as compared to conventional techniques for determining a C-arm angulation.
In an advantageous embodiment, the image fusion based method is used to determine an optimum C-arm angulation for Transcatheter Aortic Valve Implantation (TAVI), but the present invention is not limited thereto. The method described here can similarly be used to determine optimum C-arm angulations for other procedures as well, including but not limited to mitral valve repair by MitraClip, transcatheter mitral valve implantation, left atrial appendage closure, and paravalvular leak closure.
At step 102, an ultrasound image of the region of interest is received. According to an advantageous implementation, the ultrasound image is a 3D ultrasound image acquired using transesophageal echo (TEE) or intracardiac echo (ICE). The 3D ultrasound image can be an intraopertative image acquired using an ultrasound probe at the beginning of the cardiac procedure. The 3D ultrasound image can be received directly from the ultrasound probe in real time during the cardiac procedure. In the application to TAVI, the ultrasound image is a 3D ultrasound image of the aortic root acquired before inserting the valve prosthesis via a catheter.
At step 104, the ultrasound image is registered to a coordinate system of the C-arm image acquisition device. In particular, the ultrasound image is registered to the “table” coordinate system of the C-arm image acquisition device. The table coordinate system is a mechanical coordinate system that is oriented with respect to the table of the C-arm device and thus remains constant even as the C-arm portion of the C-arm image acquisition device is rotated with respect to the table.
In an exemplary embodiment, a 2D x-ray image including the ultrasound probe (which is located within the patient's body) can be acquired using the C-arm image acquisition device at approximately the same time as the ultrasound image is acquired and the 3D position of the ultrasound probe relative to the 2D x-ray image can be determined, and from this 3D position the position of the ultrasound image relative to the table coordinate system of the C-arm image acquisition device can be derived.
At step 202, the ultrasound probe is detected in the 2D X-ray image. According to an advantageous implementation, a learning based method can be used for probe detection. Learning based methods are robust to noise and capable of handling large variations in appearance. Unlike matching or similarity measures, learning based methods are trained on a set of manually annotated or synthetically generated training data. In particular, a probe detector is trained using a learning based method offline prior to the cardiac intervention procedure, and the trained probe detector is used to detect an image patch in the 2D X-ray image that contains the ultrasound probe head. In order to train a probe detector, synthetic data can be generated by using a computed tomography (CT) volume of an ultrasound probe. DRR images are generated from the CT volume of the probe in a variety of known poses. Manually annotated training data is also chosen to contain a wide variety of pose orientations and locations in various fluoroscopic images. Additionally, the training data set can include images without a probe to enable to trained probe detector to correctly classify non-object regions. The training method is generic and independent of the probe type. The training data is probe specific and is performed offline prior to online detection.
In a possible implementation, a probabilistic boosting tree (PBT) can be used to train the probe detector from the training data. The PBT can be trained using Haar features extracted image patches in the training data annotated as positive (belonging to the probe) or negative (belonging to tissue other than the probe). At runtime, in order to detect the probe in the received 2D X-ray image, Haar features are extracted from image patches in the 2D X-ray image and the trained PBT classifier determines a probability score for each image patch. The image patch having the highest probability score is determined to be the position of the probe in the fluoroscopic image.
Returning to
Learning based techniques are used for each detection stage. This approach treats pose estimation as a classification problem. A training dataset of the probe in different poses is generated offline. The training set can include manually annotated and synthetically generated training data. In a possible implementation, separate PBT classifiers are trained for each detection stage (i.e., position and position-orientation) of the pose estimation. At run time, features (e.g., Haar features) are extracted from the fluoroscopic image and used by the sequence of trained classifiers to estimate the pose of the probe. This approach is fast and provides an initial estimate of the probe's position and orientation.
At step 404, the estimated initial pose of the probe is refined. In particular, 2D/3D registration can be used to iteratively refine the pose estimation.
At step 502, a DRR image is generated based on the estimated pose of the probe. A 3D model of the probe is generated offline prior to the cardiac intervention procedure using DynaCT/CT. This model is aligned to the initialized position of the probe in 3D and used to generate a DRR. The DRR produces a representation of the probe which is visually similar to the image captured by the 2D X-ray image. This enables a comparison between the DRR and the 2D X-ray image. At step 504, similarity between the 2D X-ray image and DRR is measured. The similarity may be measured using a difference value that represents an amount of difference between the 2D X-ray image and the DRR. That is, a small difference value indicates that the 2D X-ray image and the DRR are similar. At step 506, it is determined if the difference value between the fluoroscopic image and the DRR is below of given threshold. If the difference value is not below the threshold at step 506, the method proceeds to step 508. At step 508, the pose is refined based on the measured similarity. The pose can be refined by using a local search to determine a new pose that reduces the difference value measured between the 2D X-ray image and the DRR. After the pose is refined, the method returns to step 502 and a new DRR is generated based on the refined pose. The similarity is then measured between the new DRR and the 2D X-ray image at step 504 and the above described steps are repeated until the difference value is below the threshold. If the difference value is below the threshold at step 506, the method proceeds to step 510. At step 510, the pose of the probe is output and the method ends.
Returning to
In another exemplary embodiment, the registration of the 3D ultrasound image to the coordinate system of the C-arm image acquisition device can be implemented by equipping the ultrasound probe with a position sensor (e.g., an electro-magnetic tracking sensor). The position sensor tracks the position of the ultrasound probe relative to the C-arm image acquisition device, and the tracked position of the ultrasound probe can be used to derive the position of the ultrasound image in the coordinate system of the C-arm image acquisition device.
Returning to
In an exemplary implementation for TAVI, three “hinge points” of the aortic valve can be automatically detected in the 3D ultrasound image. The three hinge points are the lowest points on the three aortic cusps in the 3D ultrasound image. The three hinge points of the aortic valve define the aortic annulus plane, which is used to determine the C-arm angulation. In addition to the three hinge points, three aortic commissure points and the left and right coronary ostia can also be automatically detected in the 3D ultrasound image to improve the robustness of the detection of the three hinge points. Although it is possible to detect each of the aortic anatomic landmarks separately, the hinge points, commissure points, and coronary ostia can be detected in the 3D image using a hierarchical approach which first detects global object (e.g., bounding box) representing all eight anatomical landmarks (3 hinge points, 3 commissures, and 2 coronary ostia) and then refines each individual anatomic landmark using specific trained landmark detectors. The position, orientation, and scale of the global object is detected by classifiers trained based on annotated training data using marginal space learning (MSL). In order to efficiently localize an object using MSL, parameter estimation is performed in a series of marginal spaces with increasing dimensionality. Accordingly, the idea of MSL is not to learn a classifier directly in the full similarity transformation space, but to incrementally learn classifiers in the series of marginal spaces. As the dimensionality increases, the valid space region becomes more restricted by previous marginal space classifiers. In particular, detection of the global object in the 3D image is split into three stages: position estimation, position-orientation estimation, and position-orientation-scale estimation. A separate classifier is trained based on annotated training data for each of these steps. This object localization results in an estimated transformation (position, orientation, and scale) of the object, and a mean shape of the object is aligned with the 3D volume using the estimated transformation. Boundary delineation of the estimated object shape can then be performed by non-rigid deformation estimation (e.g., using an active shape model (ASM)). The specific landmark detectors for the hinge points, commissure points, and coronary ostia can be trained position detectors that search for the specific landmarks in a region constrained by the detected global object.
In another exemplary implementation for TAVI, a centerline of the aortic root can be detected. For example, the centerline of the aortic root can be detected by detecting 2D circles representing the intersection of the aortic root with horizontal slices or cross sections of the 3D image using a trained circle detector, and tracking the centerpoints of the detected 2D circles. The aortic annulus plane can then be defined as a plane that is perpendicular to the centerline at the aortic annulus point.
In another exemplary implementation for TAVI, the aortic root can be segmented in the 3D ultrasound image using MSL. As described above, in MSL-based segmentation, after estimating the pose (position, orientation, and scale) of an object, the mean shape of the object (e.g., a mean aortic root model generated from training data) is aligned with the estimated pose as an initial estimate of the object shape. After the initial estimate for the pose of the aortic root is detected a learning based boundary model and active shape model can be used to for final boundary delineation of the aortic root. Since an entire aortic root model is segmented, the aortic annulus plane is defined by the segmented aortic root. The segmentation of the aortic root using MSL is described in greater detail in U.S. patent application Ser. No. 12/725,679, filed Mar. 17, 2010, entitled “Method and System for Automatic Aorta Segmentation”, which is incorporated herein by reference. For interventions involving the mitral valve, the mitral valve can be similarly segmented using MSL to align a mean mitral valve model to the 3D ultrasound image. The segmented mitral valve model defines the annulus plane of the mitral valve in the 3D ultrasound image.
Returning to
Returning to
The above-described methods for determining an angulation of a C-arm image acquisition system may be implemented on one or more computers using well-known computer processors, memory units, storage devices, computer software, and other components. A high level block diagram of such a computer is illustrated in
The foregoing Detailed Description is to be understood as being in every respect illustrative and exemplary, but not restrictive, and the scope of the invention disclosed herein is not to be determined from the Detailed Description, but rather from the claims as interpreted according to the full breadth permitted by the patent laws. It is to be understood that the embodiments shown and described herein are only illustrative of the principles of the present invention and that various modifications may be implemented by those skilled in the art without departing from the scope and spirit of the invention. Those skilled in the art could implement various other feature combinations without departing from the scope and spirit of the invention.
This application claims the benefit of U.S. Provisional Application No. 61/764,141, filed Feb. 13, 2013, the disclosure of which is herein incorporated by reference.