The present disclosure relates to the technical field of autonomous operation robots, and in particular to dynamic tracking methods for an in-vivo three-dimensional key point and an in-vivo three-dimensional curve.
An autonomous operation robot capable of tele-operations solves the problem of uneven distribution of medical resources. With a three-dimensional key point and further an operation path tracked, the tele-operation can be guided more precisely and the operation robot can complete autonomous manipulation more remarkably.
Currently, it is common practice to label on a tissue or track, plan, and manipulate a three-dimensional key point and an operation path with the aid of a preoperative image in the prior art. For example, in “Supervised Autonomous Electrosurgery via Biocompatible Near-Infrared Tissue Tracking Techniques” (published on Transactions on Medical Robotics and Bionics, Institute of Electrical and Electronics Engineers (IEEE), November 2019), H. Saeidi, et al. have proposed that a supervised autonomous three-dimensional path planning, filtering, and control strategy is developed for a smart tissue autonomous robot (STAR) through a biocompatible near infrared (NIR) labeling method, red-green-blue+depth (RGBD), and a near infrared photographing system, so as to cut a complex soft tissue. For another example, in “Toward Autonomous Robotic Micro-Suturing using Optical Coherence Tomography Calibration and Path Planning” published in 2020, Tian, Y., et al. have proposed to use a robotic suturing system realizing imaging feedback by means of an optical coherence tomography (OCT) system. Accordingly, the imaging feedback is performed on the basis of OCT. A three-dimensional point cloud for suturing needle semantic segmentation is constructed. A suturing needle tip is precisely aligned with a point cloud target through an Iterative Closest Point (ICP).
However, the near infrared (NIR) labeling method is susceptible to various external factors such as light rays, bubbles, and bile, resulting in an invalid path. In the case of preoperative image based labeling, because of a flexible and dynamic in-vivo environment, a three-dimensional key point and an in-vivo three-dimensional curve are susceptible to an operation environment. Accordingly, a labeled path is affected and thus fails to be highly adaptive to changes of the in-vivo environment during an operation. In addition, in view of indistinct region features in the in-vivo environment, the doctor's intention cannot be accurately conveyed merely through registration, leading to mismatching during the operation. In view of that, dynamic tracking solutions for precisely locating an in-vivo three-dimensional key point and an in-vivo three-dimensional curve are to be provided immediately.
Aiming at the defects in the prior art, the present disclosure provides dynamic tracking methods for an in-vivo three-dimensional key point and an in-vivo three-dimensional curve, and an electronic apparatus. Therefore, the technical problem that a three-dimensional labeled path cannot be precisely located is solved.
In order to realize the above objective, the present disclosure employs the technical solutions as follows:
A minimally invasive key site navigation oriented dynamic tracking method for an in-vivo three-dimensional key point includes:
An electronic apparatus includes: one or more processors;
A minimally invasive key trajectory navigation oriented dynamic tracking method for an in-vivo three-dimensional curve includes:
The present disclosure provides the minimally invasive key site navigation oriented dynamic tracking method for an in-vivo three-dimensional key point and the minimally invasive key trajectory navigation oriented dynamic tracking method for an in-vivo three-dimensional curve. Compared with the prior art, the present disclosure has the beneficial effects as follows:
In the present disclosure, the doctor determines the key point on the curve on an intraoperative image upon his/her own knowledge and experience, and a transformation matrix is acquired through a three-dimensional affine transformation between the two point clouds; coordinates of a key point on a source point cloud are transformed through the transformation matrix; the first local region is mapped to the first local point cloud and the second local region is mapped to the second local point cloud according to the mapping relation between the endoscopic image and the point clouds; the first three-dimensional key point of the first two-dimensional key point on the first local point cloud is determined, and the second three-dimensional key point on the second local point cloud is acquired through the coordinate transformation; the second three-dimensional key point is mapped back to the second local region, so as to acquire the second two-dimensional key point on the next image; and the two-dimensional coordinates of the tracked key point are acquired by minimizing a preset optimization function in combination with the initial two-dimensional key point, and the corresponding three-dimensional coordinates are finally acquired. A selected key point is initially tracked through the three-dimensional affine transformation. The three-dimensional key point in the in-vivo environment is precisely and dynamically located and tracked in combination with texture information and optical flow information.
In the present disclosure, the first local region is mapped to the first local point cloud and the second local region is mapped to the second local point cloud according to the mapping relation between the endoscopic image and the point clouds; the first three-dimensional key point of the first two-dimensional key point on the first local point cloud is determined, and the second three-dimensional key point on the second local point cloud is acquired through the coordinate transformation; the dimension of the first local point cloud is reduced to obtain the first two-dimensional point cloud, and the second two-dimensional key point of the first three-dimensional key point on the first two-dimensional point cloud is acquired; the dimension of the second local point cloud is reduced to obtain the second two-dimensional point cloud, and the third two-dimensional key point of the second three-dimensional key point on the second two-dimensional point cloud is acquired; the two-dimensional coordinates of the tracked key point on the two-dimensional point cloud are acquired by minimizing the preset optimization function according to the second two-dimensional key point and the third two-dimensional key point; and the three-dimensional coordinates of each tracked key point are acquired according to the mapping relation between the point clouds before and after dimension reduction, curve fitting is performed, and the three-dimensional curve is finally obtained by means of tracking.
The local region is determined through a position of the key point and tracked to reduce mistracking of the three-dimensional key point. In combination with the texture information of the endoscopic image and shape information, the effect of an indistinct in-vivo environment feature is avoided to a certain extent. The three-dimensional key point is precisely located by constructing the optimization function on the point cloud after dimension reduction. Therefore, the inconsistency of a curve shape under different viewing angles is avoided.
In conclusion, according to the dynamic tracking methods for an in-vivo three-dimensional key point and an in-vivo three-dimensional curve of the present disclosure, the three-dimensional key point and the three-dimensional curve in the in-vivo environment can be precisely and dynamically located and tracked. Accordingly, the technical problem that the three-dimensional labeled path cannot be precisely located is solved while an operation path curve can be tracked in real time.
In order to describe the technical solutions in the embodiments of the present disclosure or in the prior art more clearly, the accompanying drawings required for describing the embodiments or the prior art are briefly described below. Apparently, the accompanying drawings in the following description show merely some embodiments of the present disclosure. Those of ordinary skill in the art can still derive other accompanying drawings from these accompanying drawings without creative efforts.
In order to make the objectives, technical solutions, and advantages in the embodiments of the present disclosure clearer, the technical solutions in the embodiments of the present disclosure are described clearly and completely. Apparently, the described embodiments are some embodiments rather than all embodiments of the present disclosure. All other embodiments derived by those of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts fall within the scope of protection of the present disclosure.
Embodiments of the present disclosure provide dynamic tracking methods for an in-vivo three-dimensional key point and an in-vivo three-dimensional curve. Accordingly, the technical problems that a three-dimensional key point cannot be precisely located, and an operation path curve cannot be tracked in real time are solved.
The technical solutions in the embodiments of the present disclosure are intended to solve the above technical problems. A general idea is as follows:
A minimally invasive key site navigation oriented dynamic tracking method for an in-vivo three-dimensional key point according to an embodiment of the present disclosure is configured to dynamically track a manual site on the basis of a three-dimensional point cloud in a robot based tele-operation and mainly applied to, but not limited to, minimally invasive endoscopic operation scenes. The technical solution can be specifically summarized as follows: a doctor selects a key point from an intraoperative image and maps the key point to the three-dimensional point cloud. The key point is determined preliminary through a three-dimensional affine transformation between two point clouds. Then, an optimization function is constructed in combination with a feature descriptor and texture information such as an optical flow. A position of the key point is precisely determined in a neighborhood. Therefore, an endoscopic image oriented three-dimensional key point tracking is realized.
Aiming at a flexible and dynamic in-vivo environment, in the embodiment of the present disclosure, the key point is selected manually through the intraoperative image and updated in real time on the three-dimensional point cloud. Therefore, an operation path is updated accurately under a complex and changeable environment. Aiming at an indistinct in-vivo environment feature, the selected key point is tracked initially through the three-dimensional affine transformation, and the three-dimensional key point in the in-vivo environment is precisely and dynamically located and tracked in combination with the texture information and optical flow information.
A minimally invasive key trajectory navigation oriented dynamic tracking method for an in-vivo three-dimensional curve according to an embodiment the present disclosure is mainly applied to, but not limited to, minimally invasive endoscopic operation scenes. Accordingly, a tele-operation can be guided more precisely, and an operation robot can complete autonomous manipulation more remarkably.
In a scene application, the doctor plans a curved operation path on an intraoperative image upon his/her own knowledge and experience and determines a key point on a curve. A transformation matrix is acquired through a three-dimensional affine transformation between two point clouds. Coordinates of a key point on a source point cloud are transformed through the transformation matrix to obtain an initial position of a three-dimensional key point on a target point cloud. An optimization function is constructed in combination with the texture information of the endoscopic image and shape information of the curve. The three-dimensional key point is precisely located near an initial key point by minimizing the optimization function, and curve fitting is performed to realize dynamic curve fitting on the three-dimensional point cloud.
Aiming at a flexible and dynamic in-vivo environment, the operation path is planned manually through the intraoperative images and updated in real time on the three-dimensional point cloud. Therefore, the operation path is updated accurately under a complex and changeable environment. Aiming at an indistinct in-vivo environment feature, the key point on the curve is initially tracked through three-dimensional point cloud registration, and the curve in the in-vivo environment is precisely and dynamically located and tracked in combination with the texture information and the shape information.
For a better understanding of the above technical solutions, the above technical solutions are described in detail below with reference to the accompanying drawings and particular embodiments of the description.
As shown in
In the embodiment of the present disclosure, a selected key point is initially tracked through a three-dimensional affine transformation. The three-dimensional key point in the in-vivo environment is precisely and dynamically located and tracked in combination with texture information and optical flow information.
Each step of the above technical solution will be described in detail below with reference to specific contents:
In step S11, the endoscopic image is read, and the first two-dimensional key point is acquired from the current image according to selection of the doctor.
In the present step, the doctor labels the two-dimensional key point on an intraoperative image for subsequent display and update of the key point on a three-dimensional point cloud. Accordingly, the information is transmitted intuitively and accurately, and an operation efficiency is improved.
Step S12 that a first local region encompassing the first two-dimensional key point on the current image is tracked, a second local region is acquired from a next image, and an initial two-dimensional key point of the first two-dimensional key point on the next image is determined specifically includes:
p
f(k)={p1f(k),p2f(k), . . . ,pmf(k)}
denotes a feature point on the image I(k), and m denotes the number of the feature point on the image I(k);
Apparently, the above preset region shape and side length may be selected as actually required and will not be strictly limited herein. Taking a rectangular region (k−1) with a size of L×L as an example, pc(k−1) denotes a central point of the rectangular region (k−1) on the endoscopic image I(k−1), where
In the embodiment of the present disclosure, when the three-dimensional key point is tracked, the local region is first determined according to a position of the key point and tracked to reduce mistracking of the three-dimensional key point.
In step S13, the first local region is mapped to the first local point cloud and the second local region is mapped to the second local point cloud according to the mapping relation between the endoscopic image and the point clouds, the first three-dimensional key point of the first two-dimensional key point on the first local point cloud is determined, and the second three-dimensional key point on the second local point cloud is acquired through the coordinate transformation.
In the present step, the three-dimensional key point is initially located. The three-dimensional key point may be initially located in the following two steps.
Firstly, a corresponding three-dimensional key point of a two-dimensional key point on the point cloud is determined through a position of the two-dimensional key point according to the mapping relation between the endoscopic image and the point cloud. Secondly, a tissue in the local region may be approximately deemed as a rigid body; and the transformation matrix between the point clouds is solved through the three-dimensional affine transformation, and a three-dimensional key point on a target point cloud is acquired through a coordinate transformation.
Correspondingly, S13 specifically includes:
P(k−1)=ψ(p(k−1))
ω=([X1]T[X1])−1[X1]TY
P(k)=TAP(k−1)
In step S14, the second three-dimensional key point is mapped back to the second local region, so as to acquire the second two-dimensional key point from the next image. The two-dimensional coordinates of the tracked key point are acquired by minimizing the preset optimization function in combination with the initial two-dimensional key point, and the corresponding three-dimensional coordinates are finally acquired.
In the present step, the three-dimensional key point is precisely located. In the in-vivo environment, the tissue is dynamic, flexible, and highly similar. Therefore, in the embodiment of the present disclosure, the optimization function is constructed through texture information of a key point neighborhood, and the three-dimensional key point is precisely located by minimizing the optimization function.
Specifically, firstly, the second three-dimensional key point P(k) is mapped back to the second local region (k) according to the mapping relation between the endoscopic image and the point cloud, and a second two-dimensional key point p(k) is acquired from the next image.
Then, the two-dimensional coordinates of the tracked key point are acquired by minimizing the preset optimization function in combination with the initial two-dimensional key point po(k), and the corresponding three-dimensional coordinates are finally acquired.
The above optimization function is as follows:
=2−sift−optical
a=(pu+Δu,v+Δv(k)−p(k))T,b=(po(k)−p(k))T.
Δû and Δ{circumflex over (v)} are acquired by traversing and searching for Δu and Δv, so as to satisfy the following expression:
Two-dimensional coordinates of a tracked key point pu+Δu,v+Δv(k) are acquired after ideal offset (Δû(k), Δ{circumflex over (v)}(k)) is obtained, and then corresponding three-dimensional coordinates are finally acquired according to the mapping relation between the endoscopic image and the point cloud.
In the embodiment of the present disclosure, after the transformation matrix is acquired through the three-dimensional affine transformation, the three-dimensional key point is precisely located by constructing the optimization function in combination with the texture information and the optical flow information of the endoscopic image. Therefore, the effect of the indistinct in-vivo environment feature on a tracking result is avoided to a certain extent.
In conclusion, compared with the prior art, the present embodiment has the beneficial effects as follows:
As shown in
In the embodiment of the present disclosure, the local region is determined through a position of the key point and tracked to reduce the mistracking of the three-dimensional key point. In combination with texture information of the endoscopic image and shape information, the effect of the indistinct in-vivo environment feature is avoided to a certain extent. The three-dimensional key point is precisely located by constructing the optimization function on the point cloud after dimension reduction. Therefore, the inconsistency of a curve shape under different viewing angles is avoided.
Each step of the above technical solution will be described in detail below with reference to specific contents:
In step S21, the endoscopic image is read, the operation path curve is acquired from the current image according to selection of the doctor, and the plurality of first two-dimensional key points through which the operation path curve passes are acquired.
A first two-dimensional key point acquisition process in S21 includes:
p
={p
0
p
, . . . ,p
j
p
. . . ,p
l−1
p}
In the present step, the doctor labels a three-dimensional curve on an intraoperative image for subsequent display and update of the curve on a three-dimensional point cloud. Accordingly, information is transmitted intuitively and accurately, and an operation efficiency is improved. Moreover, the key point is determined according to the curvature of the curve, and a computation speed is ensured for key point tracking.
Step S22 that a first local region encompassing the first two-dimensional key point on the current image is tracked, and a second local region is acquired from a next image specifically includes:
In the embodiment of the present disclosure, the local region is determined through a position of the key point and tracked to reduce mistracking of the three-dimensional key point.
In step S23, the first local region is mapped to the first local point cloud and the second local region is mapped to the second local point cloud according to the mapping relation between the endoscopic image and the point clouds, the first three-dimensional key point of the first two-dimensional key point on the first local point cloud is determined, and the second three-dimensional key point on the second local point cloud is acquired through the coordinate transformation.
In the present step, the three-dimensional key point is initially located. The three-dimensional key point may be initially located in the following two steps.
Firstly, a corresponding three-dimensional key point of a two-dimensional key point on the point cloud is determined through a position of the two-dimensional key point according to the mapping relation between the endoscopic image and the point cloud. Secondly, a tissue in the local region may be approximately deemed as a rigid body; and a transformation matrix between the point clouds is solved through a three-dimensional affine transformation, and a three-dimensional key point on a target point cloud is acquired through a coordinate transformation.
Correspondingly, S23 specifically includes:
P
i(k−1)=ψ(pi(k−1))
where A∈3×3, t∈3×1, ω=[A t]T∈4×3 denote parameters of a fitting function, ω is acquirable from the following formula through least squares:
ω=([X1]T[X1])−1[X1]TY
(k)=TA(k−1).
In step S24, the dimension of the first local point cloud is reduced to obtain the first two-dimensional point cloud, and the second two-dimensional key point of the first three-dimensional key point on the first two-dimensional point cloud is acquired;
In the present step, the three-dimensional key point is precisely located. In order to ensure the accuracy when the curve dynamically changes, it is required to optimize the initial position of the key point on the curve.
In the present step, firstly, in combination with the texture information of the endoscopic image and the shape information, the effect of the indistinct in-vivo environment feature is avoided to a certain extent. Secondly, the three-dimensional key point is precisely located by constructing the optimization function on the point cloud after dimension reduction. Therefore, the inconsistency of a curve shape under different viewing angles is avoided.
Specifically, firstly, a dimension of the first local point cloud (k−1) is reduced to obtain a first two-dimensional point cloud Q(k−1), and an ith second two-dimensional key point Ti(k−1) of the first three-dimensional key point (k−1) on the first two-dimensional point cloud Q(k−1) is acquired; and
Then two-dimensional coordinates of the tracked key point on the two-dimensional point cloud are acquired by minimizing the preset optimization function according to the second two-dimensional key point and the third two-dimensional key point.
The optimization function is as follows:
=1−sift+shape
is minimized by traversing and searching for neighborhood points of all third two-dimensional key points Ti(k), so as to satisfy:
{circumflex over (T)}
i=argmin(1−Jsift(Ti(k))+shape(Ti(k)))
An ideal key point Ti set can be acquired by minimizing the optimization function.
In the embodiment of the present disclosure, after the transformation matrix is acquired through the three-dimensional affine transformation, the three-dimensional key point is precisely located by constructing the optimization function in combination with the texture information of the endoscopic image and the shape information. Therefore, the effect of the indistinct in-vivo environment feature on a tracking result is avoided to a certain extent.
In step S25, the three-dimensional coordinates of each tracked key point are acquired according to the mapping relation between the point clouds before and after dimension reduction, curve fitting is performed, and the three-dimensional curve is finally obtained by means of tracking.
Specifically, in the present step, interpolation fitting is performed on a line through an equation of a B-spline curve, where a general equation of the B-spline curve is:
P(t)=Σi=0mPiFi,k(t)
In conclusion, compared with the prior art, the present embodiment has the beneficial effects as follows:
An embodiment of the present disclosure provides a storage medium. The storage medium stores an autonomous operation robot oriented computer program for three-dimensional key point tracking, where the computer program causes a computer to execute the above tracking method for an in-vivo three-dimensional key point.
An embodiment of the present disclosure provides an electronic apparatus. The electronic apparatus includes:
It can be understood that the storage medium and the electronic apparatus according to Embodiment 3 and Embodiment 4 of the present disclosure respectively correspond to the minimally invasive key site navigation oriented dynamic tracking method for an in-vivo three-dimensional key point according to Embodiment 1 of the present disclosure. Reference may be made to the corresponding parts of the tracking method for a three-dimensional key point for the explanations, instances, beneficial effects, etc. of the relevant contents of the storage medium and the electronic apparatus, which will not be repeated herein.
In conclusion, in all the above embodiments, the three-dimensional key point in the in-vivo environment can be precisely and dynamically located and tracked. In addition, for dynamic tracking of the in-vivo three-dimensional curve, since the three-dimensional key point can be precisely and dynamically located and tracked, the in-vivo three-dimensional curve can also be precisely and dynamically located and tracked while the operation path curve can be tracked in real time.
It is to be noted that relational terms herein such as first and second are merely used to distinguish one entity or operation from another entity or operation without necessarily requiring or implying any such an actual relation or order between these entities or operations. In addition, terms “comprise”, “include”, “encompass”, or any other their variations are intended to cover a non-exclusive inclusion. Therefore, a process, method, article, or apparatus including a series of elements not only includes those elements, but also includes other elements that are not explicitly listed, or further includes inherent elements of such a process, method, article, or apparatus. Without more restrictions, the elements defined by the sentence “comprise a . . . ” and “include a . . . ” do not exclude the existence of other identical elements in the process, method, article, or apparatus including the elements.
The above embodiments are only used to explain the technical solutions of the present disclosure, and are not intended to limit same. Although the present disclosure is described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still made modifications to the technical solutions described in all the foregoing embodiments, or make equivalent substitutions to some technical features in the embodiments. These modifications or substitutions do not make the essence of the corresponding technical solutions depart from the spirit and scope of the technical solutions in all the embodiments of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202210779481.5 | Jul 2022 | CN | national |
202210779486.8 | Jul 2022 | CN | national |