The present embodiments relate to tracking anatomical structures in images of a patient. During surgery, surgeons often use fluoroscopy as a visual aid to perform the procedure. However, the position of anatomical structures may not be clearly visible in fluoroscopic images and may be difficult to track visually due to cardiac, as well as respiratory, movement. Existing methods require an operator to carefully mark a predefined set of landmarks for the tracking of an anatomical structure. Often these markers must be updated during the procedure. Although detection based trackers that employ offline trained detectors perform well for tracking known structures, their performance to track certain anatomical structures without a known shape in advance is still limited. One method currently in use to track the position and orientation of, for example, an aortic root is by injecting an iodinated contrast agent (angiography) into the patient. However, such contrast agents are harmful for the patient and their use should be minimized. Due to this, surgeons often must carefully weigh the benefits of the use of contrast agents for a successful procedure compared to the patient's safety due to the contrast agent's side effects to the patients. Alternatively, semi-supervised trackers that learn an objects appearance perform well but are very sensitive to initialization.
By way of introduction, the preferred embodiments described below include methods, systems, instructions, and non-transitory computer readable media for tracking objects in medical imagery, such as fluoroscopic imagery. A catheter is inserted into a patient. The structure of the catheter is known in advance. This allows for the detection and tracking of the catheter. By knowing how the catheter looks inside the patient, the movement of the catheter is tracked. Using a catheter also negates the need to do manual initialization for the tracking algorithm. The system knows what to look for because the catheter imaged characteristics are known in advance. For example, during a Transcatheter Aortic Valve Implantation (TAVI) procedure, the catheter is located near the aortic root. If the surgeon needs to track anatomical structures, such as calcium deposits, the system tracks the movement of the catheter. The catheter moves in synchronization with the aortic root. The system detects any other anatomical structures which are moving in sync with the catheter. These structures are, for example, calcium deposits. As the procedure proceeds, the system tracks the detected structure using a sampling strategy. The sampling strategy, such as using a discrete structure forest, is applied to track the detected structure.
In a first aspect, a method is provided for tracking and detecting a structure in a patient. A patient with an inserted catheter is scanned over time. A processor detects the catheter in the patient from the scanned image data. The processor tracks the movement of the catheter. The processor also detects a structure represented in the data from the scanning that is detected as a function of movement with a catheter. The processor tracks the movement of the structure using sampling based on a previous location of the structure in the patient. The processor may output an image of the structure.
In a second aspect, a system is provided for object detection over time in a patient with an inserted catheter. The system includes a scanner and a processor. The scanner is configured to image the patient with the inserted catheter over time. The processor is configured to detect and track the catheter as represented in the scanned data from the imaging. The processor is further configured to detect and track an object represented in the data from the imaging where the object is detected as a function of movement with the catheter. The object is tracked using sampling based on a previous location of the object. The processor is configured to output an image of the object.
In a third aspect, a non-transitory computer readable storage medium having stored therein data representing instructions executable by a programmed processor for facilitating detection of an object in a patient. The instructions include scanning a patient over time having an inserted pig tail catheter. The processor, according to the instructions, detects the pig tail catheter. The movement of the pig tail catheter is tracked. The processor, according to the instructions, detects a structure represented in the data from the scanning. The structure is detected as a function of movement with the catheter. The processor tracks the structure represented in the data from the scanning. The tracking uses sampling based on a previous location of the structure. The processor may output an image of the structure.
The present invention is defined by the following claims, and nothing in this section should be taken as a limitation on those claims. Further aspects and advantages of the invention are discussed below in conjunction with the preferred embodiments and may be later claimed independently or in combination.
The components and the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention. Moreover, in the figures, like reference numerals designate corresponding parts throughout the different views.
Manual detection of objects in a patient may be tedious and difficult due to the high variability in object shape and the movement occurring from the patient's organs. For automatic detection, the initialization process may be complicated by determining where to place markers for detection algorithms to begin their detection and tracking of the objects. These problems may be solved by detecting and tracking a catheter that has been inserted into a patient. The catheter has a known shape and thus may act as the initialization marker for the detection and tracking algorithm to locate anatomy.
The method includes detecting and tracking a structure in a patient. The structure may be an anatomical structure, or the structure may be a foreign object inserted inside the patient, such as a stent. The detecting and tracking includes analyzing images from medical scanners, such as fluoroscopic scanners. For example, during a procedure, such as a TAVI procedure, a patient with an inserted catheter may be imaged by a fluoroscopic scanner. From the imaging data, additional objects that are moving in unison with the inserted catheter are detected and tracked. Once detected the objects may be tracked throughout the procedure. Other types of imaging, such as computed tomographic, ultrasound, or magnetic resonance may be used. Other inserted devices than a catheter may be used, such as a fiducial or stent.
The acts are performed in the order shown (e.g., top to bottom) or other orders. Additional, different, or fewer acts may be provided. For example, the method is performed without outputting the image in act 32. As another example, acts 24 and 26 are alternatives or not provided at all (e.g., detect in act 24 for each time or track over time in act 26 without further detection).
In act 22, a patient having an inserted catheter is scanned. In alternative embodiments the patient has an inserted stent. The stent may be used in the same manner as the catheter for detection and/or tracking purposes. The inserted catheter may be a pigtail catheter. In some embodiments a stent and catheter may both be inserted and work in conjunction to aid in the detecting and tracking process.
The scanning may be done by a fluoroscopic scanner, or another type of scanner used for medical imaging. The scanning may also aid in identifying the location and movement of the patients organs, such as, for example, the heart. The scanner may be affixed to the system of
In some embodiments the catheter is inserted at the aortic root of the patient. The catheter may have a hook which allows the catheter to stay affixed to the aortic root of the patient. In some embodiments, the catheter is kept inserted in the patient until an artificial valve is about to be inserted. The inserted catheter may be removed during the procedure. In other embodiments the catheter remains inserted during the entire procedure.
In act 24, the catheter is detected in data produced by the scanning of the patient with the scanner. The detecting may be done by the processor 14 of the system depicted in
The detection is performed for a given image, such as a first image in a sequence of images. The detection relies on spatial indication of the catheter. For other times or images in the sequence, the detection is repeated. Alternatively, tracking is used. The identified catheter is tracked rather than re-applying the detection. In other embodiments, the detection is repeated to update the tracking (e.g., repeat detection for each heart cycle) or detection is provided instead of tracking.
In act 26, a movement of the catheter may be tracked. The processor tracks the catheter. The tracking may include using stored images of the catheter in the memory to supplement the tracking. The catheter is detected through a sequence of images by redetecting the catheter in each image. As the patient is imaged, the movement of the inserted catheter is tracked based on an expected shape, such as detecting using a template matching or a machine-trained classifier. In other embodiments, the catheter is detected in one image or frame of data. The catheter is then tracked by determining a translation, rotation, and/or scale associated with a greatest similarity. For example, the intensities representing the catheter from one image are translated, rotated, and/or scaled and compared using sum of absolute differences or other correlation to a subsequent image. The translation, rotation, and/or scale with the greatest similarity indicates the catheter position in the subsequent image.
The disclosed embodiments do away with the need to manually indicate in the data markers for potential object detecting and tracking by tracking the movement of the catheter. This method is particularly useful given the rapid rate of movement caused by respiration and cardiac movement in the patient compared to the frame rate at which the patient is imaged.
In act 28, the processor may detect an object, or structure, represented in the data from the scanning. The object may be a support region. The object may be an anatomical structure. In some cases, the anatomical structure has no known features or structure in advance of the scanning. In other cases, the anatomical structure is known in advance and a template of what the structure may look like in the data may be used to help with detecting the structure. In some embodiments the object detected may be an inserted object separate from the catheter, such as a small tube, or stent.
The object may be detected as a function of movement with the catheter. A support region is defined as having a linear motion correlation with a known anatomical structure, such as an aortic root. Image regions may be easier to detect and track because the image regions allow use of appropriate non-local appearance models that are more robust to image noise, optical-flow failures. The catheter, having been inserted near a known anatomical structure, will move in sync with the movement of the known anatomical structure, such as for example a heart, aortic root, or lung. As such, any other structures moving in sync with the inserted catheter are necessarily moving in sync with the aortic root, heart, or lung. The structures may be anatomical structures, such as calcium deposits or tumors, as an example, with no known shape prior to a procedure on the patient. Tracked tumors may be tumor regions in lung fluoroscopic images.
In some embodiments, support regions are created by the use of local support points. Standard feature sampling techniques, such as image segmentation, image registration, image-based physiological modeling, or statistical analysis, are used to discover local support points in the initial frames or images that make up the data from the scanning. The local support points are assumed to lie in the same local motion space. The processor may use an autoregressive model to model point-to-point linear motion correlation between the inserted catheter and a potential object in a given time period. The correlation may be represented as a correlation score. A clustering algorithm may be used to group the point trajectories based on spatial and motion cues in the potential support regions. A metric is used that captures the similarity in spatial location and motion. In general, if two trajectories are close to each other and move similarly then the trajectories likely belong to the same region. A trajectory Ta={pai=(xai, yai): iεA} is represented as a sequence of points pai that spans frames in the set A, for example the data scanned from the patient. At any moment, in a current frame, the distance between trajectory Ta and Tb is the spatial distance dSi (Ta,Tb) and motion distances dMi (Ta, Tb) where d (Ta, Tb):
δMui and δMvi are standard deviations of motion in each axis that are used to normalize the weights of motion in both distances. δS is a normalizing parameter of spatial distance that can be tuned to balance motion and spatial distance. Δ is a motion smoothness parameter. Distance over nearby t frames in the fluoroscopic data is computed by:
Where X={x: x<t, xεA∩B} is the set of overlapping frames to time t. The median of frame-wise distance is taken, since the median is more robust to trajectory noise than maximum and average distance. Affinity matrix of all trajectories in current frame is constructed from distance by:
W
t=Exp(−(DMt+)DSt) (Eq. 5)
The number of clusters is automatically determining by analyzing low dimensional embedding of point trajectory space. This automatic motion compensation scheme has a strong motion correlation with an aortic root, for example, as well as improves the performance of motion estimation by calcium tracking. The technique recited above may obviate the need for contrast agents to detect human anatomy, such as a patient's aortic root.
By using the trajectories of the local support regions, the system is able to detect the structure by the structure's associated movement with the inserted catheter. The trajectories are used to detect the structure as the structure moves in synchronization with the catheter. This detection is of the structure in a given frame or image, but is based on the trajectories over a sequence. In some embodiments, only some structures are selected for tracking based on the correlation score. The structures with the highest correlation scores may be selected for the tracking.
In act 30, the structure, as represented in the data, may be tracked. The processor 14 of the system depicted in
One example method for tracking structures in the image of a patient is to use discrete structure forests. Discrete structure forests are a random forest classifier. A random forest includes a decision tree where each node in the tree corresponds to a decision that the system makes. In some embodiments, the features used as decision points in the tree follow geometric patterns that encode the underlying structure of the target being tracked. The use of discrete structure forests is a type of data mining or machine learning that allows a system to learn the structure of a target as the target is being tracked in the images of the patient. For example, features of the potential object moving in synchronization with the inserted catheter are sampled and feature point comparisons are performed to capture the underlying structure of the object in the images. The sampling techniques may be random sampling, data driven sampling, manually designed pattern sampling, or other sampling.
The discrete structure forest is constructed by first building directed random feature chains. The processor 14 may build the discrete structure forest, as well as the directed random feature chains, alone or in conjunction with the scanner 12 or memory 16. In one embodiment, sampling starts with a point X0 inside the region of the image where the object (e.g., anatomy of interest) has been detected. In one embodiment, if a bounding box is being used, the point X0 may be the center of the bounding box. An additional feature point X1 is sampled within the neighborhood of X0. The two points are used as a direction for continued sampling of points within the detected object. When sampling Xn, points Xn−1 and Xn−2 are used to avoid any cross over or overlaps in the direction for continued sampling. The direction may be a chain in the detected object represented as: Xi−1→Xi→Xi+1→Xi+2→ . . . Xn. The next point in the chain may be sampled by satisfying the following constraints:
max(Xi−1,Xi−1+α(Xi−Xi−1))<Xi+1 (Eq. 6)
X
i+1<min(Xi−1+β(Xi−Xi−1),Bdim) (Eq. 7)
where 0<α<1 and β>1. The processor 12 may perform the calculations for these equations alone or in conjunction with the memory 16.
Another approach may use data driven pairwise features. Data driven pairwise features try to encode gradient changes within the detected object, such as edges. The image is combined with a Gaussian kernel of a standard deviation of three, or other number of pixels, to suppress noise in the image. Other standard deviations of pixels may be used. Edge detection is then performed on a bounding box around the object. The number of points within the bounding box is approximated from 0 to K. The approximation limits the points to points within the neighborhood of the detected object. Points are sampled along the edge and the for each edge point Xmax, a pair of points around Xmax are sampled such that the pair of points are collinear, equidistant and on the opposite sides of Xmax. If Xj and Xk are two points sampled around Xmax then the following formula may be used:
This formula allows for the identification of edges of the object in the bounding box.
In another embodiment, data driven feature chains may be used. A data driven feature chain identifies and tracks the object by sampling maximum or minimum points within the object. An extremum point within the object is sampled and the next nearest extremum point is sampled. The sampled points form a chain. Additional points are sampled until a certain predefined length is reached for the chain of sampled points. The chain may be used as a known location at the time the image was taken and used as an input for tracking the location of the structure in future images of the patient. Data driven feature chains show significant robustness to intensity fluctuations in the image data because the fluctuations do not affect the order relationship between extrema.
Manually drawn feature trackers may be employed. Specific domain knowledge, for example typical density or shape of calcium deposits is known in advance, may be used to construct the manually drawn feature tracker, such as a geometric shape. The manually drawn feature tracker is then used in a similar fashion to a bounding box. The tracker is applied to the image, and then the location is used to project where the detected structure will be in subsequent images. In some embodiments, a triangular pyramid shape may be used to sample points inside the object corresponding to the position of the triangular pyramid. This approach may be used for dense calcium deposits that are not necessarily elongated tube like calcium deposits.
The above described sampling approaches, random sampling, data driven sampling, and manually designed pattern sampling may be implemented by the processor 12 alone or in conjunction with the memory 16. The sampling approaches may be used in conjunction with the object detection process of using an inserted catheter to detect objects via their movement in sync with the inserted catheter.
In act 32, a processor may output an image of the structure. The image may be output to a display that is a part of the system depicted in
Using the detection and tracking, the anatomical structure of interest may be highlighted. In one embodiment, the anatomical structure is segmented. The segmented structure being shown alone highlights the anatomy of interest. In other embodiments, color, graphics, or intensity change are used to highlight the anatomical structure relative to other anatomy or background in the image.
The processor 14, memory 16, and scanner 12 are part of the system. Alternatively, the processor 14, memory 16, and scanner 12 are part of a server, computer, and/or image processing system separate from the system depicted in
The scanner 12 may be a fluoroscopic image scanner. The fluoroscopic image scanner transmits x-rays through a patient. The x-rays are detected as intensities that vary based on the density of tissue or structure in the patient through which the x-rays pass. The scanner may also be used in conjunction with a contrast agent to help illuminate and identify human organs. The resulting fluoroscopic image portrays the anatomical structure that is detected and tracked.
The memory 16 may be a graphics processing memory, a video random access memory, a random access memory, system memory, cache memory, hard drive, optical media, magnetic media, flash drive, buffer, database, combinations thereof, or other now known or later developed memory device for storing data. The memory 16 is part of the system depicted in
The memory 16 stores image data, such as fluoroscopic image data of a patient. The fluoroscopic image data represents a projection through a volume of the patient. The memory 16 may alternatively or additionally store data during processing, such as storing filtered data, possible object locations, extracted feature values, actual object locations, cluster labels, imaging data, and/or other information discussed herein.
The memory 16 or other memory is alternatively or additionally a non-transitory computer readable storage medium storing data representing instructions executable by the programmed processor 14 for tracking anatomical structures in images of a patient. The instructions for implementing the processes, methods and/or techniques discussed herein are provided on non-transitory computer-readable storage media or memories, such as a cache, buffer, RAM, removable media, hard drive or other computer readable storage media. Non-transitory computer readable storage media include various types of volatile and nonvolatile storage media. The functions, acts or tasks illustrated in the figures or described herein are executed in response to one or more sets of instructions stored in or on computer readable storage media. The functions, acts or tasks are independent of the particular type of instructions set, storage media, processor or processing strategy and may be performed by software, hardware, integrated circuits, firmware, micro code and the like, operating alone, or in combination. Likewise, processing strategies may include multiprocessing, multitasking, parallel processing, and the like.
In one embodiment, the instructions are stored on a removable media device for reading by local or remote systems. In other embodiments, the instructions are stored in a remote location for transfer through a computer network or over telephone lines. In yet other embodiments, the instructions are stored within a given computer, CPU, GPU, or system.
The processor 14 is a general processor, central processing unit, control processor, graphics processor, digital signal processor, three-dimensional rendering processor, image processor, application specific integrated circuit, field programmable gate array, digital circuit, analog circuit, combinations thereof, or other now known or later developed device for detecting and tracking a catheter and anatomic or other structure. The processor 14 is a single device or multiple devices operating in serial, parallel, or separately. The processor 14 may be a main processor of a computer, such as a laptop or desktop computer, or may be a processor for handling some tasks in a larger system, such as in an imaging system. The processor 14 is configured by instructions, design, hardware, and/or software to perform the acts discussed herein. The processor 14 is configured to perform the acts discussed above. The processor 14 may be configured to generate a user interface for receiving corrections or verification of the detecting and tracking of objects.
The system depicted in
While the invention has been described above by reference to various embodiments, it should be understood that many changes and modifications can be made without departing from the scope of the invention. It is therefore intended that the foregoing detailed description be regarded as illustrative rather than limiting, and that it be understood that it is the following claims, including all equivalents, that are intended to define the spirit and scope of this invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP16/52988 | 2/12/2016 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62121538 | Feb 2015 | US | |
62121543 | Feb 2015 | US |