The present disclosure relates to determining positions of portions of an interventional device. A computer-implemented method, a processing arrangement, a system, and a computer program product, are disclosed.
Many interventional medical procedures are carried out under live X-ray imaging. The two-dimensional images generated during live X-ray imaging assist physicians by providing a visualization of both the anatomy, and interventional devices such as guidewires and catheters that are used in the procedure.
By way of an example, endovascular procedures require interventional devices to be navigated to specific locations in the cardiovascular system. Navigation often begins at a femoral, brachial, radial, jugular, or pedal access point, from which the interventional device passes through the vasculature to a location where imaging, or a therapeutic procedure, is performed. The vasculature typically has high inter-patient variability, moreso when diseased, and can hamper navigation of the interventional device. For example, navigation from an abdominal aortic aneurysm through the ostium of a renal vessel may be challenging because the aneurysm reduces the ability to use the vessel wall to assist in the device positioning and cannulation.
During such procedures, portions of interventional devices such as such as guidewires and catheters may become obscured or even invisible under X-ray imaging, further hampering navigation of the interventional device. An interventional device may for example be hidden behind dense anatomy. X-ray-transparent sections of the interventional device, and image artifacts may also confound a determination of the path of the interventional device within the anatomy.
Various techniques have been developed to address these drawbacks, including the use of radiopaque fiducial markers on the interventional device, and the interpolation of segmented images. However, there remains room for improvements in determining the position of interventional devices under X-ray imaging.
According to a first aspect of the present disclosure, a computer-implemented method of providing a neural network for predicting a position of each of a plurality of portions of an interventional device is provided. The method includes:
According to a second aspect of the present disclosure, a computer-implemented method of predicting a position of each of a plurality of portions of an interventional device is provided. The method includes:
Further aspects, features and advantages of the present disclosure will become apparent from the following description of examples, which is made with reference to the accompanying drawings.
Examples of the present disclosure are provided with reference to the following description and the figures. In this description, for the purposes of explanation, numerous specific details of certain examples are set forth. Reference in the specification to “an example”, “an implementation” or similar language means that a feature, structure, or characteristic described in connection with the example is included in at least that one example. It is also to be appreciated that features described in relation to one example may also be used in another example, and that all features are not necessarily duplicated in each example for the sake of brevity. For instance, features described in relation to a computer-implemented method may be implemented in a processing arrangement, and in a system, and in a computer program product, in a corresponding manner.
In the following description, reference is made to computer implemented methods that involve predicting a position of an interventional device within the vasculature. Reference is made to a live X-ray imaging procedure wherein an interventional device in the form of a guidewire is navigated within the vasculature. However, it is to be appreciated that examples of the computer implemented methods disclosed herein may be used with other types of interventional devices than a guidewire, such as, and without limitation: a catheter, an intravascular ultrasound imaging device, an optical coherence tomography device, an introducer sheath, a laser atherectomy device, a mechanical atherectomy device, a blood pressure device and/or flow sensor device, a TEE probe, a needle, a biopsy needle, an ablation device, a balloon, or an endograft, and so forth. It is also to be appreciated that examples of the computer implemented methods disclosed herein may be used with other types of imaging procedures, such as, and without limitation: computed tomographic imaging, ultrasound imaging, and magnetic resonance imaging. It is also to be appreciated that examples of the computer implemented methods disclosed herein may be used with interventional devices that, as appropriate, are disposed in other anatomical regions than the vasculature, including and without limitation, the digestive tract, respiratory pathways, the urinary tract, and so forth.
It is noted that the computer-implemented methods disclosed herein may be provided as a non-transitory computer-readable storage medium including computer-readable instructions stored thereon which, when executed by at least one processor, cause the at least one processor to perform the method. In other words, the computer-implemented methods may be implemented in a computer program product. The computer program product can be provided by dedicated hardware or hardware capable of running the software in association with appropriate software. When provided by a processor or “processing arrangement”, the functions of the method features can be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which can be shared. The explicit use of the terms “processor” or “controller” should not be interpreted as exclusively referring to hardware capable of running software, and can implicitly include, but is not limited to, digital signal processor “DSP” hardware, read only memory “ROM” for storing software, random access memory “RAM”, a non-volatile storage device, and the like. Furthermore, examples of the present disclosure can take the form of a computer program product accessible from a computer usable storage medium or a computer-readable storage medium, the computer program product providing program code for use by or in connection with a computer or any instruction execution system. For the purposes of this description, a computer-usable storage medium or computer-readable storage medium can be any apparatus that can comprise, store, communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system or device or device or propagation medium. Examples of computer-readable media include semiconductor or solid-state memories, magnetic tape, removable computer disks, random access memory “RAM”, read only memory “ROM”, rigid magnetic disks, and optical disks. Current examples of optical disks include compact disk-read only memory “CD-ROM”, optical disk-read/write “CD-R/W”, Blu-Ray™, and DVD.
The inventors have found an improved method of determining positions of portions of an interventional device.
With reference to
In general, the temporal shape data 110 may include: a temporal sequence of X-ray images including the interventional device 100; or a temporal sequence of computed tomography images including the interventional device 100; or a temporal sequence of ultrasound images including the interventional device 100; or a temporal sequence of magnetic resonance images including the interventional device (100); or a temporal sequence of positions provided by a plurality of electromagnetic tracking sensors or emitters mechanically coupled to the interventional device 100; or a temporal sequence of positions provided by a plurality of fiber optic shape sensors mechanically coupled to the interventional device 100; or a temporal sequence of positions provided by a plurality of dielectric sensors mechanically coupled to the interventional device 100; or a temporal sequence of positions provided by a plurality of ultrasound tracking sensors or emitters mechanically coupled to the interventional device 100. Thus, it is also contemplated to provide the temporal shape data 110 as three-dimensional shape data.
Simultaneously with the generation of the X-ray images at time steps t1 . . . tn-1, corresponding interventional device ground truth position data 120 representing a position of each of a plurality of portions of the interventional device 100 at each time step ti..tn in the sequence, may also be generated. The interventional device ground truth position data 120 serves as training data. In the illustrated example in
It is also contemplated to provide the ground truth position data 120 from other sources. In some implementations, the ground truth position data 120 may originate from a different source that of the temporal shape data 110. The ground truth position data 120 may for example be provided by a temporal sequence of computed tomography images including the interventional device 100. Thus, it is also contemplated to provide the ground truth position data as three-dimensional position data. The computed tomography images may for example be cone beam computed tomography, CBCT, or spectral computed tomography images. The ground truth position data 120 may alternatively be provided by a temporal sequence of ultrasound images including the interventional device 100, or indeed a temporal sequence of images from another imaging modality such as magnetic resonance imaging.
In other implementations, the ground truth position data 120 may be provided by tracked sensors or emitters mechanically coupled to the interventional device. In this respect, electromagnetic tracking sensors or emitters such as those disclosed in document WO 2015/165736 A1, or fiber optic shape sensors such as those disclosed in document W02007/109778 A1, dielectric sensors such as those disclosed in document US 2019/254564 A1, or ultrasound tracking sensors or emitters such as disclosed in document WO 2020/030557 A1, may be mechanically coupled to the interventional device 100 and used to provide a temporal sequence of positions that correspond to the position of each sensor or emitter at each time step t1 . . . tn in in the sequence.
When the ground truth position data 120 is provided by a different source to that of the temporal shape data 110, the coordinate system of the ground truth position data 120 may be registered to the coordinate system of the temporal shape data 110 in order to facilitate computation of the loss function.
The temporal shape data 110, and the ground truth position data 120 may be received from various sources, including a database, an imaging system, a computer readable storage medium, the cloud, and so forth. The data may be received using any form of data communication, such as wired or wireless data communication, and may be via the internet, an ethernet, or by transferring the data by means of a portable computer-readable storage medium such as a USB memory device, an optical or magnetic disk, and so forth.
Returning to
In some implementations, the neural network 130 includes multiple outputs, and each output predicts a position 140 of a different portion of the interventional device 100 at the current time step tn in the sequence. In the neural network 130 illustrated in
In more detail, the neural network 130 illustrated in
In some implementations, the neural network illustrated in
The training operation 5130 involves adjusting 5150 parameters of the neural network 130 based on a loss function representing a difference between the predicted position 140 of each portion of the interventional device 100 at the current time step tn, and the position of each corresponding portion of the interventional device 100 at the current time step tn from the received interventional device ground truth position data 120.
The training operation 5130 is described in more detail with reference to
As in other neural networks, the training of the LSTM cell illustrated in
The operation of the LSTM cell illustrated in
f
t=σ((whf×ht-1)+(wxf×xt)+bf) Equation 1
u
t=σ((whu×ht-1)+(wxu×xt)+bu) Equation 2
c{tilde over ( )}
t=tan h ((whc×ht-1)+(wxc×xt)+bc) Equation 3
o
t=σ((who×ht-1)+(wxo×xt)+bo) Equation 4
c
t=[c{tilde over ( )}t+ut]+[ct-1+ft] Equation 5
y
t
=[o
t×tan h ct] Equation 6
Training neural networks that include the LSTM cell illustrated in
Training a neural network typically involves inputting a large training dataset into the neural network, and iteratively adjusting the neural network parameters until the trained neural network provides an accurate output. Training is usually performed using a Graphics Processing Unit “GPU” or a dedicated neural processor such as a Neural Processing Unit “NPU” or a Tensor Processing Unit “TPU”. Training therefore typically employs a centralized approach wherein cloud-based or mainframe-based neural processors are used to train a neural network. Following its training with the training dataset, the trained neural network may be deployed to a device for analyzing new input data; a process termed “inference”. The processing requirements during inference are significantly less than those required during training, allowing the neural network to be deployed to a variety of systems such as laptop computers, tablets, mobile phones and so forth. Inference may for example be performed by a Central Processing Unit “CPU”, a GPU, an NPU, a TPU, on a server, or in the cloud.
As outlined above, the process of training a neural network includes adjusting the above-described weights and biases of activation functions. In supervised learning, the training process automatically adjusts the weights and the biases, such that when presented with the input data, the neural network accurately provides the corresponding expected output data. The value of a loss function, or error, is computed based on a difference between the predicted output data and the expected output data. The value of the loss function may be computed using functions such as the negative log-likelihood loss, the mean squared error, or the Huber loss, or the cross entropy. During training, the value of the loss function is typically minimized, and training is terminated when the value of the loss function satisfies a stopping criterion. Sometimes, training is terminated when the value of the loss function satisfies one or more of multiple criteria.
Various methods are known for solving the loss minimization problem such as gradient descent, Quasi-Newton methods, and so forth. Various algorithms have been developed to implement these methods and their variants including but not limited to Stochastic Gradient Descent “SGD”, batch gradient descent, mini-batch gradient descent, Gauss-Newton, Levenberg Marquardt, Momentum, Adam, Nadam, Adagrad, Adadelta, RMSProp, and Adamax “optimizers” These algorithms compute the derivative of the loss function with respect to the model parameters using the chain rule. This process is called backpropagation since derivatives are computed starting at the last layer or output layer, moving toward the first layer or input layer. These derivatives inform the algorithm how the model parameters must be adjusted in order to minimize the error function. That is, adjustments to model parameters are made starting from the output layer and working backwards in the network until the input layer is reached. In a first training iteration, the initial weights and biases are often randomized. The neural network then predicts the output data, which is likewise, random. Backpropagation is then used to adjust the weights and the biases. The training process is performed iteratively by making adjustments to the weights and biases in each iteration. Training is terminated when the error, or difference between the predicted output data and the expected output data, is within an acceptable range for the training data, or for some validation data. Subsequently the neural network may be deployed, and the trained neural network makes predictions on new input data using the trained values of its parameters. If the training process was successful, the trained neural network accurately predicts the expected output data from the new input data.
It is to be appreciated that the example LSTM neural network described above with reference to
In some implementations, the training of the neural network in operation S130 is further constrained. In one example implementation, the temporal shape data 110, or the interventional device ground truth position data 120, comprises a temporal sequence of X-ray images including the interventional device 100; and the interventional device 100 is disposed in a vascular region. In this example, the above-described method further includes:
In so doing, the position of the portions of the interventional device may be predicted with higher accuracy. The constraint may be applied by computing a second loss function based on the constraint, and incorporating this second loss function, together with the aforementioned loss function, into an objective function, the value of which is then minimized during the training operation 5130.
The vascular image data representing a shape of the vascular region may for example be determined from X-ray images by providing the temporal sequence of X-ray images 110 as one or more digital subtraction angiography, DSA, images.
Aspects of the training method described above may be provided by a processing arrangement comprising one or more processors configured to perform the method. The processing arrangement may for example be a cloud-based processing system or a server-based processing system or a mainframe-based processing system, and in some examples its one or more processors may include one or more neural processors or neural processing units “NPU”, one or more CPUs or one or more GPUs. It is also contemplated that the processing arrangement may be provided by a distributed computing system. The processing arrangement may be in communication with one or more non-transitory computer-readable storage media, which collectively store instructions for performing the method, and data associated therewith.
The above-described examples of the trained neural network 130 may be used to make predictions on new data in a process termed “inference”. The trained neural network may for example be deployed to a system such as a laptop computer, a tablet, a mobile phone and so forth. Inference may for example be performed by a Central Processing Unit “CPU”, a GPU, an NPU, on a server, or in the cloud.
The predicted position 140 of each of the plurality of portions of the interventional device 100 at a current time step tn in the sequence may be outputted by displaying the predicted position 140 on a display device, or storing it to a memory device, and so forth.
As mentioned above, the temporal shape data 210 may for example include:
The predicted position 140 of each of the plurality of portions of the interventional device 100 at a current time step tn in the sequence that that is predicted by the neural network 130 may be used to provide a predicted position of one or more portions of the interventional device at the current time step tn when the temporal shape data 210 does not clearly identify the interventional device. Thus, in one example, the temporal shape data 20210 includes a temporal sequence of X-ray images including the interventional device 100, and the inference method includes:
In so doing, the inference method alleviates drawbacks associated with the poor visibility of portions of the interventional device.
Other sources of temporal shape data 210 such as those described above during the training operation 5130 may likewise be received during inference and displayed in a corresponding manner.
By way of an example,
In some examples, a confidence score may also be computed and displayed on the display device for the displayed position of the interventional device. The confidence score may be provided as an overlay on the predicted position(s) of portion(s) of the interventional device 100 in the current X-ray image. The confidence score may for example be provided as a heat map of the probability of the device position being correct. Other forms of presenting the confidence score may alternatively be used, including displaying its numerical value, displaying a bargraph, and so forth. The confidence score may be computed using the output of the neural network, which may for example be provided by a Softmax layer at the output of each LSTM cell in
A system 200 is also provided for predicting a position of each of a plurality of portions of an interventional device 100. Thereto,
The above examples are to be understood as illustrative of the present disclosure and not restrictive. Further examples are also contemplated. For instance, the examples described in relation to the computer-implemented method, may also be provided by a computer program product, or by a computer-readable storage medium, or by a processing arrangement, or by the system 200, in a corresponding manner. It is to be understood that a feature described in relation to any one example may be used alone, or in combination with other described features, and may also be used in combination with one or more features of another of the examples, or a combination of other examples. Furthermore, equivalents and modifications not described above may also be employed without departing from the scope of the invention, which is defined in the accompanying claims. In the claims, the word “comprising” does not exclude other elements or operations, and the indefinite article “a” or “an” does not exclude a plurality. The mere fact that certain features are recited in mutually different dependent claims does not indicate that a combination of these features cannot be used to advantage. Any reference signs in the claims should not be construed as limiting their scope.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/082056 | 11/18/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63117543 | Nov 2020 | US |