This disclosure relates to imaging tools, and more particularly to systems and methods for registering and calibrating an endoscope during endoscopic procedures.
Endoscopy is a minimally invasive real-time imaging modality in which a camera is inserted into the body for visual inspection of internal structures such as the lung airways or the gastrointestinal system. Typically, the endoscope is a long flexible fiber-optic system connected to a light source at a proximal end outside of a patient's body and a lens at a distal end inside the patient's body. In addition, some endoscopes include a working channel through which the operator can perform suction or pass instruments such as brushes, biopsy needles or forceps. Video feedback gives a physician or technician cues to maneuver the scope to a targeted region.
Image guided endoscopy, as compared to traditional endoscopy, offers the advantage of a real-time connection to a three-dimensional (3D) roadmap of the lung while the interventional procedure is performed. It thus has been recognized as a valuable tool for many lung applications. This form of endoscopy requires tracking the tip of the endoscope in a global coordinate system, in order to associate the location of the endoscope with pre-operative computed tomography (CT) images and display fused images.
In research on bronchoscope localization, there are three ways to track the tip of the endoscope. Type (a) tracks based on a position sensor mounted to the tip of the endoscope; Type (b) tracks based on live image registration; and Type (c) is a combination of types (a) and (b). Electromagnetic (EM) guided endoscopy (a Type (a) system) has been recognized as a valuable tool for many lung applications, but it requires employing a supplemental guidance device. Although Type (b) is more desirable than Type (a), since it does not employ a supplemental guidance device, constant frame-by-frame registration can be time consuming and prone to errors, e.g., when fluids inside the airway obscure the video images.
The introduction of an electromagnetic (EM) position sensor to the endoscope (e.g., in Type (a) systems) may overcome this obstacle. In order to provide accurate data fusion between optical images (captured by an endoscope camera) and CT images for an endoscopic procedure, the endoscopic system needs to be calibrated and registered. Calibration refers to the process of determining the coordinate offsets between the camera coordinate system and an EM tracker attached to the tip of the scope (given that the camera intrinsic parameters have already been obtained). Registration refers to determining a coordinate transformation matrix between the EM tracker and the CT image space.
Calibration: In order to integrate data between EM space and camera space, calibration is employed to determine the position and orientation of an EM tracker mounted to the endoscope with respect to the camera coordinates (where the optical axis and center of projection are located). The results of this calibration take the form of six offset constants: three for rotation and three for translation. The goal of calibration in an interventional endoscopic procedure is that the camera pose can be determined dynamically from the EM readings of the attached endoscope tracker.
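As a minimal sketch of how such offsets are used (the function name, numeric values, and ZYX Euler convention below are illustrative assumptions, not taken from this disclosure), the six constants can be assembled into a single 4x4 homogeneous transform and composed with each live EM reading to recover the camera pose:

```python
# Hypothetical sketch: build the tracker-to-camera offset transform from
# six calibration constants (three rotations, three translations).
import numpy as np
from scipy.spatial.transform import Rotation

def offsets_to_matrix(rx, ry, rz, tx, ty, tz):
    T = np.eye(4)
    T[:3, :3] = Rotation.from_euler("zyx", [rz, ry, rx]).as_matrix()
    T[:3, 3] = [tx, ty, tz]
    return T

# Dynamic camera pose = live EM reading composed with the fixed offset.
T_cam_to_em = offsets_to_matrix(0.02, -0.01, 0.03, 1.5, -0.4, 8.2)
T_em_reading = np.eye(4)               # 4x4 pose reported by the EM tracker
T_camera = T_em_reading @ T_cam_to_em  # camera pose in EM tracking space
```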
Generally speaking, calibration is an offline procedure: the calibration parameters can be obtained by imaging an EM-tracked phantom (with a calibration pattern such as a checkerboard) that has known geometric properties, using an EM-tracked endoscope. This involves a cumbersome engineering procedure. Although the desired transformation in this context is between the camera coordinates and the endoscope tracker, a chain of calibration procedures is needed for each unit of the calibration phantom. For example, a calibration of a pointer tracker, a calibration between a test grid and a reference tracker on the grid, and a calibration between the camera coordinates and the test grid (camera calibration) are all needed to arrive at the destination calibration between the camera coordinates and the EM tracker coordinates.
Registration: Another procedure for EM guided endoscopy intervention is to align EM space with pre-operative CT space. Historically, three types of registration methods may be implemented: (1) external fiducial based, (2) internal fiducial based and (3) fiducial-free methods. The advantages and disadvantages of existing registration methods can be found in the following table (Table 1).
In the fiducial-free registration method cited above, a transformation matrix can be found by minimizing the spatial distance between EM readings from the endoscope tracker and a midline pathway extracted from the CT images. This means that, to perform the registration task, the operator has to move steadily along the medial line to make the data usable for registration. It is also unavoidable that when the operator twists the scope toward a sub-branch, or turns the camera aside to examine a wall, the trajectory of the endoscope becomes “off-track” (no longer on the medial line). These data are no longer usable for registration and have to be discarded until the scope goes back on track (i.e., onto the center line). This data constraint (selectiveness of usable frames) makes real-time registration difficult.
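The minimization itself can be pictured with a short sketch. This is an illustrative reconstruction under assumed names (fit_em_to_ct, em_points, centerline_points), not the cited method's actual implementation; off-track frames inflate the nearest-midline distances, which is why they must be discarded:

```python
# Hypothetical sketch: fit a rigid EM-to-CT transform by minimizing the
# distance from transformed EM readings to the CT airway midline.
import numpy as np
from scipy.optimize import minimize
from scipy.spatial import cKDTree
from scipy.spatial.transform import Rotation

def fit_em_to_ct(em_points, centerline_points):
    """em_points: Nx3 tracker readings; centerline_points: Mx3 midline samples."""
    tree = cKDTree(centerline_points)

    def cost(params):                       # 3 Euler angles + 3 translations
        R = Rotation.from_euler("zyx", params[:3]).as_matrix()
        moved = em_points @ R.T + params[3:]
        dists, _ = tree.query(moved)        # nearest-midline distance per point
        return np.mean(dists ** 2)

    return minimize(cost, np.zeros(6), method="Nelder-Mead").x
```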
In accordance with the present principles, a simplified calibration method is provided for circumventing the cumbersome off-line calibration by computing only the offset transformation matrix between the camera coordinates and the endoscope tracker (given that the camera intrinsic parameters have already been obtained). In one embodiment, a fly-through endoluminal view of a passageway (e.g., an airway) is rendered from 3D CT images, or virtual images (e.g., virtual bronchoscopic (VB) images). A software program is configured with an optimization scheme capable of identifying, from among a series of candidate real poses, the real image (e.g., real bronchoscopic (RB) image) most similar to a pre-operative image. A position of an EM position sensor (placed on the tip of the endoscope) is determined which is associated with the real image. The position is correlated to the pre-operative image to determine a transformation matrix that indicates how to associate real-time images with the virtual or pre-operative image.
A system that can achieve on-line calibration and marker-free registration is presented. Note that the two procedures are performed independently using the same principle: the two-dimensional image captured by the virtual camera and the video image captured by the real camera can be registered to obtain the desired transformation matrices. For the on-line calibration procedure to be successfully conducted, the registration transformation matrix has to be obtained in advance; likewise, for the marker-free registration procedure presented in this context, one has to assume that the calibration matrix is already available. The system is designed to achieve the desired transformation matrices between the EM tracker and the scope camera and between EM space and CT space intra-operatively. This approach streamlines the data integration procedure for EM-tracked endoscope applications.
The present embodiments may employ image based registration between two-dimensional (2D) video images from an endoscope camera and virtual fly-through endoluminal views derived from CT images with a simple on-line calibration method and a marker-free registration method.
A marker-free registration method is provided for bringing EM space and CT space into coincidence without the operator touching any surface fiducial markers or internal anatomic landmarks. The present principles are operator independent and do not require the scope to touch any external markers or anatomic landmarks to perform the registration. In addition, the scope does not need to be advanced along the middle line or track of the airway.
A system and method for utilizing two-dimensional real-to-virtual image alignment to obtain an EM-to-CT registration matrix and a CT-to-Camera calibration matrix are presented. This includes locating a feature in a pre-operative image and comparing real-time images with the pre-operative image taken of the feature to find a real-time image that closely matches the pre-operative image. A closest match real-time image is registered to the pre-operative image to determine a transformation matrix between a virtual camera pose of the pre-operative image and a real camera pose of the real-time image. This transformation matrix becomes the registration matrix between EM space and CT space (when the calibration matrix is known), or the calibration matrix (when the registration matrix is known). The presented system permits marker-free registration and on-line calibration, and thus streamlines the data integration procedure for image guided endoscopy applications.
A system and method for image-based registration between images includes locating a feature in a pre-operative image and comparing real-time images taken with a scope with the pre-operative image taken of the feature to find a real-time image that closely matches the pre-operative image. A closest match real-time image is registered to the pre-operative image to determine a transformation matrix between a position of the pre-operative image and a position of the real-time image such that the transformation matrix permits tracking real-time image coordinates in pre-operative image space.
These and other objects, features and advantages of the present disclosure will become apparent from the following detailed description of illustrative embodiments thereof, which is to be read in connection with the accompanying drawings.
This disclosure will present in detail the following description of preferred embodiments with reference to the following figures wherein:
The present disclosure describes systems and methods for scope calibration and registration. A simple method for calibrating an electromagnetic (EM) guided endoscopy system computes a transformation matrix for the offset between a camera coordinate system and an endoscope tracker. The offset distance between a camera frame and an endoscope tracker frame is reflected in a disparity in 2D projection images between a real video image and a virtual fly-through image. Either human eyes or a computer can differentiate this spatial difference and rebuild the spatial correspondence. The spatial offset becomes the calibration result.
An endoscopy system and method use marker-free, image-based registration, matching a single 2D video image from a camera on the endoscope with a CT image or other virtual image, to find a transformation matrix between CT space and EM (electromagnetic tracking) space. The present embodiments (in the form of a bronchoscope, for example) may include: (1) an EM position sensor placed on a tip of the bronchoscope, (2) reconstructed virtual bronchoscopic (VB) images from CT scans (or other technology, e.g., MRI, sonogram, etc.) and (3) software with an optimization scheme to identify, from among a series of candidate RB poses, the real bronchoscopic (RB) image most similar to the VB image. Progression of the bronchoscope only along a middle line of an airway is not required. Markers on or in the patient are not required. The system and method are operator independent, and do not require the scope to touch any external markers or anatomic landmarks to perform the registration.
In particularly useful embodiments, the scope may include a bronchoscope or any scope for pulmonary, digestive system, or other minimally invasive surgical viewing. In other embodiments, an endoscope or the like is employed for other medical procedures as well. These procedures may include minimally invasive endoscopic pituitary surgery, endoscopic skull base tumor surgery, intraventricular neurosurgery, arthroscopic surgery, laparoscopic surgery, etc. Other scoping applications are also contemplated.
It should be understood that the present invention will be described in terms of a bronchoscope; however, the teachings of the present invention are much broader and are applicable to any optical scope that can be employed in internal viewing of branching, curved, coiled or other shaped systems (e.g., digestive systems, circulatory systems, piping systems, passages, mines, caverns, etc.). Embodiments described herein are preferably displayed for viewing on a display monitor. Such monitors may include any suitable display device including but not limited to handheld displays (e.g., on personal digital assistants, telephone devices, etc.), computer displays, televisions, designated monitors, etc. Depending on the scope, the display may be provided as part of the system or may be a separate unit or device. Further, virtual images may be generated using CT scanning technology, although other imaging technology may also be employed such as, for example, sonograms, magnetic resonance images, computer generated images, etc.
It should also be understood that the optical scopes may include a plurality of different devices connected to or associated with the scope. Such devices may include a light, a cutting device, a brush, a vacuum line, a camera, etc. These components may be formed integrally with a head on a distal end portion of the scope. The optical scopes may include a camera disposed at a tip of the scope or a camera may be disposed at the end of an optical cable opposite the tip. Embodiments may include hardware elements, software elements or both hardware and software elements. In a preferred embodiment, the present invention is implemented with software, which includes but is not limited to firmware, resident software, microcode, etc.
Furthermore, the present principles can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. A computer-usable or computer readable medium can be any apparatus that may include, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device). Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk and an optical disk. Current examples of optical disks include compact disk—read only memory (CD-ROM), compact disk—read/write (CD-R/W) and DVD.
A data processing system suitable for storing and/or executing program code may include at least one processor coupled directly or indirectly to memory elements through a system bus. The processor or processing system may be provided with the scope system or provided independently of the scope system. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code to reduce the number of times code is retrieved from bulk storage during execution. Input/output or I/O devices (including but not limited to keyboards, displays, pointing devices, etc.) may be coupled to the system either directly or through intervening I/O controllers.
Network adapters may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters.
In accordance with the present principles, three local coordinate systems need to be inter-connected to permit a mapping of events therebetween. These include a camera coordinate system (where the center of projection and optical axis are located), an EM sensor coordinate system, and a CT coordinate system:
$$p_{CT} = T_{Cam}^{CT}\,p_{Cam} = T_{EM}^{CT}\,T_{Cam}^{EM}\,p_{Cam} \tag{1}$$
where $p_{CT}$ is a position (pose) in CT space, and $p_{Cam}$ is a position (pose) in camera space. Ultimately, one needs to identify the relationship $T_{Cam}^{CT}$ (the transformation between camera space and CT space) to use a pre-operative CT roadmap to guide an endoscopic procedure. Matrix $T_{Cam}^{EM}$ is the calibration matrix between the camera coordinate system and the EM sensor on the tip of the endoscope, and matrix $T_{EM}^{CT}$ is the registration matrix between EM and CT spaces. $T_{EM}^{CT}$ and $T_{Cam}^{EM}$ are employed to obtain the desired matrix $T_{Cam}^{CT}$.
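In code, Eq. (1) is a single matrix composition. A hedged sketch follows, with identity matrices standing in for measured calibration and registration results:

```python
# Sketch of Eq. (1): placeholders stand in for measured transforms.
import numpy as np

T_cam_to_em = np.eye(4)   # calibration: camera coords -> EM tracker coords
T_em_to_ct  = np.eye(4)   # registration: EM coords -> CT coords
T_cam_to_ct = T_em_to_ct @ T_cam_to_em   # Eq. (1)

p_cam = np.array([0.0, 0.0, 10.0, 1.0])  # homogeneous point in camera space
p_ct  = T_cam_to_ct @ p_cam              # the same point in CT space
```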
Referring now to the drawings in which like numerals represent the same or similar elements and initially to
A transformation matrix is estimated in block 16 by seeking out the pose of a camera which renders a real image most similar to the pre-operative image. In block 18, a mutual-information based registration method can be employed to find the most similar image, whose pose is denoted as $P_R$. The transformation matrix between $P_V$ and $P_R$ becomes the desired registration result and can be used to map real image space into pre-operative image space.
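The disclosure names mutual-information (MI) based registration without fixing an implementation; one common histogram-based MI score, offered only as an illustrative sketch, is:

```python
# Illustrative histogram-based mutual information between two grayscale
# images of equal size (higher score = more similar intensity structure).
import numpy as np

def mutual_information(img_a, img_b, bins=32):
    joint, _, _ = np.histogram2d(img_a.ravel(), img_b.ravel(), bins=bins)
    pxy = joint / joint.sum()
    px = pxy.sum(axis=1, keepdims=True)   # marginal of img_a
    py = pxy.sum(axis=0, keepdims=True)   # marginal of img_b
    nz = pxy > 0                          # avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))
```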
Referring to
Referring to
$$T_{EM}^{CT} = T_{Cam}^{CT}\,T_{EM}^{Cam} \tag{2}$$
The transformation $T_{Cam}^{CT}$ is estimated by taking the pose of a given captured VB image and seeking out the pose of the camera which renders a real image most similar to that virtual image.
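Once $T_{Cam}^{CT}$ has been estimated from such a match, Eq. (2) can be rearranged for whichever matrix is unknown. A sketch under the same placeholder convention as before ($T_{EM}^{Cam}$ is the inverse of the camera-to-EM offset):

```python
# Sketch of the two uses of Eq. (2); identities stand in for real values.
import numpy as np

T_cam_to_ct = np.eye(4)   # estimated from the real-to-virtual image match

# On-line calibration, when the registration T_em_to_ct is already known:
T_em_to_ct  = np.eye(4)
T_em_to_cam = np.linalg.inv(T_cam_to_ct) @ T_em_to_ct   # rearranged Eq. (2)

# Marker-free registration, when the calibration T_em_to_cam is known:
T_em_to_ct_est = T_cam_to_ct @ T_em_to_cam              # Eq. (2) as written
```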
A human operator only needs to bring the scope close enough to the VB pose by examining and comparing the similarities between the VB and RB images. Then, a number of RB frames are collected in a neighborhood centered on an initialization point $P_i$ (e.g., from pose $P_{i-N}$ to $P_{i+N}$).
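The neighborhood search then reduces to scoring each candidate RB frame against the VB image and keeping the best pose. The sketch below reuses the mutual_information helper from the earlier example; the frame layout and function names are assumptions:

```python
# Hypothetical sketch: pick the RB pose whose image best matches the VB view.
import numpy as np

def best_pose(frames, vb_image):
    """frames: list of {'pose': 4x4 EM reading, 'image': 2D RB frame}."""
    return max(frames, key=lambda f: mutual_information(f["image"], vb_image))

def registration_from_match(T_virtual, T_real):
    """Rigid transform carrying the matched real pose onto the virtual pose."""
    return T_virtual @ np.linalg.inv(T_real)
```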
Referring to
Referring to
In block 306, a series of video images is collected around the feature to attempt to replicate the pose of the virtual or pre-operative image. Then, in block 307, the video images are compared with the CT image to find the closest match between a video image and the CT image. This may include optimizing the matching procedure to find a maximum similarity between images to determine the real image closest to the CT image. In block 308, the video image is registered to the matched CT image using the pose associated with that real image, to create a transformation matrix based upon the rotations and translations needed to align the tracker pose with the pre-operative image pose. The transformation matrix between CT space and image tracking space is thereby determined, based solely on image registration. The method is operator independent and free of any external markers or anatomic landmarks that need to be contacted by a tracker for registration. The transformation matrix is employed to register coordinates of the CT images to electromagnetic tracking coordinates during an endoscopic procedure. The endoscope progression may be other than along a middle line of the passage being observed.
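During the procedure itself, the recovered matrices carry each live EM reading into CT space so that the virtual view can update alongside the video. A minimal sketch, assuming all poses are 4x4 local-to-world matrices:

```python
# Hypothetical sketch: map a live tracker reading into CT coordinates.
import numpy as np

def camera_pose_in_ct(em_pose, T_em_to_ct, T_cam_to_em):
    """em_pose: 4x4 EM tracker reading; returns the camera pose in CT space."""
    return T_em_to_ct @ em_pose @ T_cam_to_em
```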
Referring to
A computer implemented program 412 is stored in memory 404 of a computer device 414. The program 412 includes a module 416 configured to compare a real-time video image 452 taken by the camera 408 with CT images 450 to find a closest match between the real-time images and the CT images. The program 412 includes an optimization module 422 configured to find a maximum similarity to determine the closest matching image. The program 412 is configured to register the closest matched real-time image to a pre-operative image in CT space to find a transformation matrix 420 between the CT space and image tracking space such that the transformation matrix 420 is based solely on image registration, is operator independent, and requires no external markers or anatomic landmarks to perform the registration. The transformation matrix 420 is employed to register coordinates of the CT images to electromagnetic tracking coordinates during an endoscopic procedure. A display 456 may be employed to view the real-time and/or virtual/pre-operative images during the procedure. The display 456 is configured to show endoscope progression in pre-operative image space. The marker-free registration process assumes that a calibration process has been employed beforehand to determine the relationship between the camera coordinate system and the tracking (EM) coordinate system.
In accordance with another embodiment, an approach is provided which uses registration to provide a calibration. The registration in this embodiment includes any type of registration including fiducial marker registration. Calibration includes a calibration matrix and may also include camera or other parameter calibrations (e.g., focal length, etc.). An offset distance between a camera frame and an endoscope tracker frame is reflected in the disparity in 2D projection images between a real video image and a virtual fly-through image (from CT scans). Human eyes and computers have the capability to differentiate these spatial differences and rebuild the spatial correspondence.
The present principles include making use of (1) an EM tracking system, (2) a phantom with EM trackable fiducials on a surface, (3) a mechanical arm (optional) that holds and stabilizes an endoscope, (4) a computer with software that collects before and after poses of an endoscope tracker, (5) input from the stereo sense of a human operator to match a real endoscopic (for example, a bronchoscopic) image (RB) with a virtual endoscopic (bronchoscopic) image (VB), and (6) software that runs an optimization scheme to find the maximum similarity between the VB and RB images.
The data integration procedure is streamlined because the same phantom, designed for fiducial-based registration, is used for both calibration and registration tasks. A calibration procedure which is independent of camera calibration (the estimation of the internal and external parameters of the camera) is achieved.
Using an image-based approach, an on-line calibration method is presented, given that an EM-CT registration matrix has already been acquired. In this case, the fiducial-based registration method is first employed to register CT space with the EM-tracked endoscope. The registration brings the CT coordinates and the tracker coordinates into coincidence.
Referring again to
In
Relationships between an EM sensor coordinate system and a camera coordinate system provide calibration, while registration couples a CT coordinate system to the EM sensor coordinate system and the camera coordinate system. The three local coordinate systems use inter-registrations to track positions between them. One needs to identify the relationship $T_{Cam}^{CT}$ (Eq. (2)) to associate a pre-operative CT roadmap with intra-operative endoscopic videos. Fiducial based registration is a process that may be employed to align EM space with CT space and arrive at the transformation matrix $T_{EM}^{CT}$.
Usually after fiducial based registration, the CT and EM frames are largely aligned. These frames, however, may present a small spatial displacement owing to the unknown $T_{Cam}^{EM}$ (e.g., the EM sensor on the tip of the endoscope is un-calibrated with respect to the camera coordinate system).
Referring to
A computer tomography (CT) scanner (not shown) may be configured to collect pre-operative CT images (or other technology for generating, collecting and storing a virtual map or images) of a subject having fiducial markers 122. The pre-operative images may be stored in memory 111 and transferred to the system 100 using storage media or network connections. The memory 111 and/or scanner are employed to store/collect CT images of a subject, such as a patient for surgery. The endoscope 108 includes a camera 130 for collecting real-time images during a procedure. The endoscope 108 includes an electromagnetic tracker 104 for locating the tip of the endoscope 108.
A phantom reference 120 is employed for assisting in registering pre-operative scan images to EM tracked positions. By touching each of the markers 122 using the tracker device 104, the CT image is registered to the EM tracked positions obtained by the tracker 104. A calibrated pointer-tracker (EM tracker 104) is used to touch each of the surface fiducials 122, so that a point-based registration aligns the CT space with the EM position ($T_{EM}^{CT}$) such that when the tracker on the endoscope is advanced in the airway, the pre-operative or CT (VB) images will update together with the real (RB) images. The lung phantom 120 is employed to perform dual roles, assisting in both calibration and registration.
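The disclosure does not name the point-based algorithm; a closed-form Kabsch/Procrustes fit over the touched fiducials is one standard way to obtain $T_{EM}^{CT}$, sketched here with hypothetical names:

```python
# Illustrative Kabsch/Procrustes fit: rigid EM-to-CT transform from
# corresponding fiducial positions (em_pts touched by the pointer,
# ct_pts located in the CT volume), both Nx3 arrays.
import numpy as np

def point_register(em_pts, ct_pts):
    em_c, ct_c = em_pts.mean(axis=0), ct_pts.mean(axis=0)
    H = (em_pts - em_c).T @ (ct_pts - ct_c)      # 3x3 covariance
    U, _, Vt = np.linalg.svd(H)
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
    R = Vt.T @ D @ U.T                           # guard against reflections
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = ct_c - R @ em_c
    return T                                     # maps EM points into CT space
```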
For calibration, the endoscope 108 is inserted into the bronchus 123 and using the lung phantom 120, which has a few surface fiducials 122, a position is determined to perform the calibration. At the position, a real image (RB) and a closest corresponding CT image (VB) are provided (a VB image at pose 1 will be determined or captured). Due to a slight displacement between the VB and the RB images, the operator will adjust the scope 108 until the RB matches with the VB more closely. This yields a calibrated RB (at pose 2). Pose 1 refers to the RB pose after fiducial-based registration and pose 2 refers to the calibrated RB pose with the VB image. The RB video image from pose 2 matches most closely with the VB image. The rotation and translation matrix from pose 2 to pose 1 becomes the targeted calibration result. From the example in
Computer device 110 and its memory 111 store the rotation and translation information in a matrix 113 for calibrating the tracker 104 to a camera image by adjusting the endoscope 108 until the image obtained by a camera 130 associated with the tracker 104 matches with the registered CT image as described. The rotation and translation matrix 113 is employed to calibrate coordinates of the camera 130 to the tracker 104. A display 115 may be employed to view the real-time and/or virtual images during the procedure.
Referring to
In block 354, a real image is captured with an endoscope at a first position. In block 356, the endoscope is adjusted until the image obtained by a camera matches with a CT image of the same region at a second position. Adjusting the scope may include adjustment by an operator.
In block 358, a rotation and translation matrix is determined to calibrate the tracker based on the motion made during the adjustment stage (block 356). The rotation and translation matrix is employed to calibrate coordinates of a camera to the tracker such that the CT images will update together with the real-time images.
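As a sketch of block 358 (pose conventions are assumed rather than specified): if the tracker reports 4x4 poses before and after the operator's adjustment, the calibration offset is their relative transform:

```python
# Hypothetical sketch: relative transform between the tracker poses before
# (pose 1) and after (pose 2) the visual RB-to-VB adjustment.
import numpy as np

T_pose1 = np.eye(4)   # tracker pose after fiducial-based registration
T_pose2 = np.eye(4)   # tracker pose once the RB visually matches the VB

T_calibration = np.linalg.inv(T_pose1) @ T_pose2  # rotation + translation offset
```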
In interpreting the appended claims, it should be understood that:
Having described preferred embodiments for systems and methods (which are intended to be illustrative and not limiting), it is noted that modifications and variations can be made by persons skilled in the art in light of the above teachings. It is therefore to be understood that changes may be made in the particular embodiments of the disclosure disclosed which are within the scope of the embodiments disclosed herein as outlined by the appended claims. Having thus described the details and particularity required by the patent laws, what is claimed and desired protected by Letters Patent is set forth in the appended claims.
Filing Document | Filing Date | Country | Kind | 371(c) Date
PCT/IB2010/051454 | 4/2/2010 | WO | 00 | 11/17/2011

Number | Date | Country
61179031 | May 2009 | US