Surgical robotic systems are currently used in a variety of surgical procedures, including minimally invasive medical procedures. Some surgical robotic systems include a surgeon console controlling a surgical robotic arm and a surgical instrument having an end effector (e.g., forceps or grasping instrument) coupled to and actuated by the robotic arm. In operation, the robotic arm is moved to a position over a patient and then guides the surgical instrument through a small incision via a surgical port, or through a natural orifice of the patient, to position the end effector at a work site within the patient's body. However, a single endoscopic camera provides a limited field of view of the surgical site. Thus, there is a need for a system that receives video from multiple cameras and provides an augmented view of the surgical site.
According to one aspect of the above embodiment, an imaging system is disclosed which includes a first camera configured to capture a first video stream of a first tissue surface and a second camera configured to capture a second video stream of a second tissue surface, the first tissue surface opposing the second tissue surface of the tissue. The system also includes a video processing device configured to receive the first video stream from the first camera and the second video stream from the second camera and to modify at least one of the first video stream or the second video stream based on the other of the first video stream or the second video stream to generate a modified video stream. The system further includes a first screen coupled to the video processing device and configured to display the modified video stream.
Implementations of the above embodiment may include one or more of the following features. According to one aspect of the above embodiment, the imaging system may include a second screen coupled to the video processing device. The video processing device may be configured to output at least one of the first video stream, the second video stream, or the modified video stream on at least one of the first screen or the second screen. The video processing device may be further configured to generate a virtual marker, such as a shape, plane, boundary, or line, in the modified video stream to demarcate a region of tissue. The virtual marker may be placed at a representation of a location of the second camera. In modifying the first video stream, the video processing device may be further configured to remove a portion of the first video stream. In modifying the first video stream, the video processing device may also be configured to fill in the portion of the first video stream with a reconstructed portion of the second video stream. The video processing device may be further configured to generate a depth map of the first and second tissue surfaces and to generate the reconstructed portion based on the depth map.
According to another aspect of the above embodiment, a surgical robotic system is disclosed which includes a first robotic arm including a first camera located at a first position and configured to capture a first video stream of a first tissue surface. The system also includes a second robotic arm including a second camera located at a second position and configured to capture a second video stream of a second tissue surface, the first tissue surface opposing the second tissue surface of the tissue, and a third robotic arm including a surgical instrument. The system further includes a video processing device configured to receive the first video stream from the first camera and the second video stream from the second camera and to combine the first video stream with the second video stream, based on the first position and the second position, to generate a combined video stream. The video processing device is further configured to generate a 3D volume including a trajectory of at least one of the first camera, the second camera, or the surgical instrument based on the combined video stream. The system further includes a surgeon console including a first screen coupled to the video processing device and configured to display the combined video stream.
Implementations of the above embodiment may include one or more of the following features. According to one aspect of the above embodiment, the surgeon console may further include a second screen coupled to the video processing device. The video processing device may be configured to output at least one of the first video stream, the second video stream, or the combined video stream on at least one of the first screen or the second screen. The video processing device may be further configured to generate a virtual marker in the combined video stream. The virtual marker may be placed at a location within a field of view of the second camera. In combining the first video stream, the video processing device may be further configured to remove a portion of the first video stream. In combining the first video stream, the video processing device may also be configured to fill in the portion of the first video stream with a reconstructed portion of the second video stream. The video processing device may be further configured to generate a depth map of the first and second tissue surfaces and to generate the reconstructed portion based on the depth map.
According to a further aspect of the above embodiment, a method of imaging tissue is disclosed which includes capturing a first video stream of a first tissue surface at a first camera, and capturing a second video stream of a second tissue surface at a second camera, the first tissue surface opposing the second tissue surface of the tissue. The method also includes receiving the first video stream from the first camera at a video processing device and receiving the second video stream from the second camera at the video processing device. The method further includes modifying the first video stream based on the second video stream to generate a modified video stream. The method additionally includes displaying the modified video stream on a screen coupled to the video processing device.
Implementations of the above embodiment may include one or more of the following features. According to one aspect of the above embodiment, modifying the first video stream at the video processing device may further include removing a portion of the first video stream. Modifying the first video stream at the video processing device may also include filling in the portion of the first video stream with a reconstructed portion of the second video stream. Modifying the first video stream at the video processing device may additionally include generating a depth map of the first and second tissue surfaces and generating the reconstructed portion based on the depth map.
Various embodiments of the present disclosure are described herein with reference to the drawings wherein:
Embodiments of the presently disclosed surgical robotic system are described in detail with reference to the drawings, in which like reference numerals designate identical or corresponding elements in each of the several views.
As will be described in detail below, the present disclosure is directed to a surgical robotic system, which includes a surgeon console, a control tower, and one or more movable carts having a surgical robotic arm coupled to a setup arm. The surgeon console receives user input through one or more interface devices. The user input is processed by the control tower as movement commands for moving the surgical robotic arm and an instrument and/or camera coupled thereto. Thus, the surgeon console enables teleoperation of the surgical arms and attached instruments/camera. The surgical robotic arm includes a controller, which is configured to process the movement commands and to generate torque commands for activating one or more actuators of the robotic arm, which would, in turn, move the robotic arm in response to the movement commands.
With reference to
The surgical instrument 50 is configured for use during minimally invasive surgical procedures. In embodiments, the surgical instrument 50 may be configured for open surgical procedures. In further embodiments, the surgical instrument 50 may be an electrosurgical forceps configured to seal tissue by compressing tissue between jaw members and applying electrosurgical current thereto. In yet further embodiments, the surgical instrument 50 may be a surgical stapler including a pair of jaws configured to grasp and clamp tissue while deploying a plurality of tissue fasteners, e.g., staples, and cutting stapled tissue. In yet further embodiments, the surgical instrument 50 may be a surgical clip applier including a pair of jaws configured to apply a surgical clip onto tissue.
The surgeon console 30 includes a first screen 32, which displays a video feed of the surgical site provided by camera 51a disposed on the robotic arm 40, and a second screen 34, which displays a user interface for controlling the surgical robotic system 10. The first screen 32 and second screen 34 may be touchscreens allowing for displaying various graphical user inputs.
The surgeon console 30 also includes a plurality of user interface devices, such as foot pedals 36 and a pair of hand controllers 38a and 38b, which are used by a user to remotely control the robotic arms 40. The surgeon console 30 further includes an armrest 33 used to support the clinician's arms while operating the hand controllers 38a and 38b.
The control tower 20 includes a screen 23, which may be a touchscreen, configured to output graphical user interfaces (GUIs). The control tower 20 also acts as an interface between the surgeon console 30 and one or more robotic arms 40. In particular, the control tower 20 is configured to control the robotic arms 40, such as to move the robotic arms 40 and the corresponding surgical instrument 50, based on a set of programmable instructions and/or input commands from the surgeon console 30, in such a way that the robotic arms 40 and the surgical instrument 50 execute a desired movement sequence in response to input from the foot pedals 36 and the hand controllers 38a and 38b. The foot pedals 36 may be used to enable and lock the hand controllers 38a and 38b, reposition the camera, and activate/deactivate electrosurgical energy. In particular, the foot pedals 36 may be used to perform a clutching action on the hand controllers 38a and 38b. Clutching is initiated by pressing one of the foot pedals 36, which disconnects the hand controllers 38a and/or 38b from the robotic arm 40 and the corresponding instrument 50 or camera 51a attached thereto (i.e., prevents movement inputs from reaching them). This allows the user to reposition the hand controllers 38a and 38b without moving the robotic arm(s) 40 and the instrument 50 and/or camera 51a, which is useful when reaching control boundaries of the surgical space.
Each of the control tower 20, the surgeon console 30, and the robotic arm 40 includes a respective computer 21, 31, 41. The computers 21, 31, 41 are interconnected to each other using any suitable communication network based on wired or wireless communication protocols. The term “network,” whether plural or singular, as used herein, denotes a data network, including, but not limited to, the Internet, an intranet, a wide area network, or a local area network, and without limitation as to the full scope of the definition of communication networks as encompassed by the present disclosure. Suitable protocols include, but are not limited to, transmission control protocol/internet protocol (TCP/IP), user datagram protocol/internet protocol (UDP/IP), and/or datagram congestion control protocol (DCCP). Wireless communication may be achieved via one or more wireless configurations, e.g., radio frequency, optical, Wi-Fi, Bluetooth (an open wireless protocol for exchanging data over short distances from fixed and mobile devices using short-wavelength radio waves, creating personal area networks (PANs)), and ZigBee® (a specification for a suite of high-level communication protocols using small, low-power digital radios based on the IEEE 802.15.4-2003 standard for wireless personal area networks (WPANs)).
The computers 21, 31, 41 may include any suitable processor (not shown) operably connected to a memory (not shown), which may include one or more of volatile, non-volatile, magnetic, optical, or electrical media, such as read-only memory (ROM), random access memory (RAM), electrically-erasable programmable ROM (EEPROM), non-volatile RAM (NVRAM), or flash memory. The processor may be any suitable processor (e.g., control circuit) adapted to perform the operations, calculations, and/or set of instructions described in the present disclosure including, but not limited to, a hardware processor, a field programmable gate array (FPGA), a digital signal processor (DSP), a central processing unit (CPU), a microprocessor, and combinations thereof. Those skilled in the art will appreciate that the processor may be substituted by any logic processor (e.g., control circuit) adapted to execute the algorithms, calculations, and/or set of instructions described herein.
With reference to
The setup arm 61 includes a first link 62a, a second link 62b, and a third link 62c, which provide for lateral maneuverability of the robotic arm 40. The links 62a, 62b, 62c are interconnected at joints 63a and 63b, each of which may include an actuator (not shown) for rotating the links 62a and 62b relative to each other and the link 62c. In particular, the links 62a, 62b, 62c are movable in their corresponding lateral planes that are parallel to each other, thereby allowing for extension of the robotic arm 40 relative to the patient (e.g., surgical table). In embodiments, the robotic arm 40 may be coupled to the surgical table (not shown). The setup arm 61 includes controls 65 for adjusting movement of the links 62a, 62b, 62c as well as the lift 67. In embodiments, the setup arm 61 may include any type and/or number of joints.
The third link 62c may include a rotatable base 64 having two degrees of freedom. In particular, the rotatable base 64 includes a first actuator 64a and a second actuator 64b. The first actuator 64a is rotatable about a first stationary arm axis which is perpendicular to a plane defined by the third link 62c and the second actuator 64b is rotatable about a second stationary arm axis which is transverse to the first stationary arm axis. The first and second actuators 64a and 64b allow for full three-dimensional orientation of the robotic arm 40.
The actuator 48b of the joint 44b is coupled to the joint 44c via the belt 45a, and the joint 44c is in turn coupled to the joint 46b via the belt 45b. Joint 44c may include a transfer case coupling the belts 45a and 45b, such that the actuator 48b is configured to rotate each of the links 42b, 42c and a holder 46 relative to each other. More specifically, links 42b, 42c, and the holder 46 are passively coupled to the actuator 48b which enforces rotation about a pivot point “P” which lies at an intersection of the first axis defined by the link 42a and the second axis defined by the holder 46. In other words, the pivot point “P” is a remote center of motion (RCM) for the robotic arm 40. Thus, the actuator 48b controls the angle θ between the first and second axes allowing for orientation of the surgical instrument 50. Due to the interlinking of the links 42a, 42b, 42c, and the holder 46 via the belts 45a and 45b, the angles between the links 42a, 42b, 42c, and the holder 46 are also adjusted in order to achieve the desired angle θ. In embodiments, some or all of the joints 44a, 44b, 44c may include an actuator to obviate the need for mechanical linkages.
The joints 44a and 44b include actuators 48a and 48b configured to drive the joints 44a, 44b, 44c relative to each other through a series of belts 45a and 45b or other mechanical linkages, such as a drive rod, a cable, or a lever, and the like. In particular, the actuator 48a is configured to rotate the robotic arm 40 about a longitudinal axis defined by the link 42a.
With reference to
The robotic arm 40 also includes a plurality of manual override buttons 53 (
With reference to
The computer 41 includes a plurality of controllers, namely, a main cart controller 41a, a setup arm controller 41b, a robotic arm controller 41c, and an instrument drive unit (IDU) controller 41d. The main cart controller 41a receives and processes joint commands from the controller 21a of the computer 21 and communicates them to the setup arm controller 41b, the robotic arm controller 41c, and the IDU controller 41d. The main cart controller 41a also manages instrument exchanges and the overall state of the movable cart 60, the robotic arm 40, and the IDU 52. The main cart controller 41a also communicates actual joint angles back to the controller 21a.
Each of the joints 63a and 63b and the rotatable base 64 of the setup arm 61 is a passive joint (i.e., no actuators are present therein) allowing for manual adjustment thereof by a user. The joints 63a and 63b and the rotatable base 64 include brakes that are disengaged by the user to configure the setup arm 61. The setup arm controller 41b monitors slippage of each of the joints 63a and 63b and the rotatable base 64 of the setup arm 61 when the brakes are engaged; when the brakes are disengaged, these joints may be freely moved by the operator without impacting control of the other joints. The robotic arm controller 41c controls each of the joints 44a and 44b of the robotic arm 40 and calculates desired motor torques required for gravity compensation, friction compensation, and closed loop position control of the robotic arm 40. The robotic arm controller 41c calculates a movement command based on the calculated torque. The calculated motor commands are then communicated to one or more of the actuators 48a and 48b in the robotic arm 40. The actual joint positions are then transmitted by the actuators 48a and 48b back to the robotic arm controller 41c.
The IDU controller 41d receives desired joint angles for the surgical instrument 50, such as wrist and jaw angles, and computes desired currents for the motors in the IDU 52. The IDU controller 41d calculates actual angles based on the motor positions and transmits the actual angles back to the main cart controller 41a.
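As a non-limiting illustration, the following Python sketch shows one way a joint-to-motor mapping of this kind could be expressed. The gear ratio, torque constant, gains, and function names are assumptions made for illustration and do not represent the actual IDU controller 41d.

```python
# Minimal sketch of an IDU-style joint-to-motor mapping (names and constants
# are illustrative assumptions, not the actual IDU controller implementation).

GEAR_RATIO = 100.0      # assumed motor-to-joint reduction
KT = 0.05               # assumed motor torque constant [N*m/A]
KP, KD = 8.0, 0.2       # assumed PD gains for the position loop


def desired_motor_current(desired_angle, motor_position, motor_velocity):
    """Convert a desired joint angle (e.g., wrist or jaw) into a motor current."""
    actual_angle = motor_position / GEAR_RATIO          # joint angle from motor position
    angle_error = desired_angle - actual_angle
    joint_torque = KP * angle_error - KD * (motor_velocity / GEAR_RATIO)
    motor_torque = joint_torque / GEAR_RATIO             # reflect torque through the gearbox
    return motor_torque / KT, actual_angle                # current command and actual angle


current, actual = desired_motor_current(desired_angle=0.3, motor_position=25.0, motor_velocity=0.0)
print(f"commanded current: {current:.3f} A, actual joint angle: {actual:.3f} rad")
```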
The robotic arm 40 is controlled in response to a pose of the hand controller controlling the robotic arm 40, e.g., the hand controller 38a, which is transformed into a desired pose of the robotic arm 40 through a hand-eye transform function executed by the controller 21a. The hand-eye transform function, as well as other functions described herein, is embodied in software executable by the controller 21a or any other suitable controller described herein. The pose of one of the hand controllers 38a may be embodied as a coordinate position and roll-pitch-yaw (RPY) orientation relative to a coordinate reference frame, which is fixed to the surgeon console 30. The desired pose of the instrument 50 is relative to a fixed frame on the robotic arm 40. The pose of the hand controller 38a is then scaled by a scaling function executed by the controller 21a. In embodiments, the coordinate position may be scaled down and the orientation may be scaled up by the scaling function. In addition, the controller 21a may also execute a clutching function, which disengages the hand controller 38a from the robotic arm 40. In particular, the controller 21a stops transmitting movement commands from the hand controller 38a to the robotic arm 40 if certain movement limits or other thresholds are exceeded, in essence acting like a virtual clutch mechanism, e.g., preventing mechanical input from producing mechanical output.
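As a non-limiting illustration, the following Python sketch shows a hand-eye mapping with position scaled down, orientation scaled up, and a virtual clutch gate, in the spirit of the description above. The rotation matrix, scale factors, and function names are illustrative assumptions rather than the actual transform executed by the controller 21a.

```python
import numpy as np

# Illustrative sketch of the hand-eye mapping described above; the rotation,
# scale factors, and clutch handling are assumptions, not the actual controller.

R_CONSOLE_TO_CAMERA = np.array([[0, -1, 0],
                                [1,  0, 0],
                                [0,  0, 1]], dtype=float)  # assumed fixed hand-eye rotation
POSITION_SCALE = 0.25    # scale translations down (fine motion at the instrument)
ORIENTATION_SCALE = 1.5  # scale rotations up


def map_hand_motion(delta_position, delta_rpy, clutched):
    """Map an incremental hand-controller motion into the camera frame."""
    if clutched:
        # Virtual clutch: hand motion produces no instrument motion.
        return np.zeros(3), np.zeros(3)
    instrument_dp = POSITION_SCALE * (R_CONSOLE_TO_CAMERA @ np.asarray(delta_position, float))
    instrument_drpy = ORIENTATION_SCALE * np.asarray(delta_rpy, float)
    return instrument_dp, instrument_drpy


dp, drpy = map_hand_motion([0.01, 0.0, 0.005], [0.0, 0.02, 0.0], clutched=False)
print(dp, drpy)
```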
The desired pose of the robotic arm 40 is based on the pose of the hand controller 38a and is then processed by an inverse kinematics function executed by the controller 21a. The inverse kinematics function calculates angles for the joints 44a, 44b, 44c of the robotic arm 40 that achieve the scaled and adjusted pose input by the hand controller 38a. The calculated angles are then passed to the robotic arm controller 41c, which includes a joint axis controller having a proportional-derivative (PD) controller, a friction estimator module, a gravity compensator module, and a two-sided saturation block, which is configured to limit the commanded torque of the motors of the joints 44a, 44b, 44c.
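As a non-limiting illustration, the following Python sketch shows a per-joint torque command combining a PD term, gravity compensation, friction compensation, and two-sided saturation. The gains, compensation models, and torque limit are placeholder assumptions, not the actual joint axis controller.

```python
import numpy as np

# Sketch of a per-joint torque command in the spirit of the joint axis controller
# described above (PD term, gravity and friction compensation, two-sided
# saturation). Gains, models, and limits are placeholder assumptions.

KP, KD = 40.0, 2.0
TORQUE_LIMIT = 15.0  # N*m, assumed two-sided saturation limit


def gravity_torque(q):
    # Placeholder gravity model; a real controller uses the arm's mass/geometry.
    return 3.0 * np.cos(q)


def friction_torque(qd):
    # Placeholder viscous + Coulomb friction estimate.
    return 0.5 * qd + 0.2 * np.sign(qd)


def joint_torque_command(q_desired, q, qd):
    pd_term = KP * (q_desired - q) - KD * qd
    tau = pd_term + gravity_torque(q) + friction_torque(qd)
    return float(np.clip(tau, -TORQUE_LIMIT, TORQUE_LIMIT))  # two-sided saturation


print(joint_torque_command(q_desired=1.0, q=0.8, qd=0.1))
```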
With reference to
The system 10 may include a plurality of cameras, e.g., cameras 51a and 51b. The cameras 51a and 51b may be coupled to and controlled by their respective robotic arms 40. The cameras 51a and 51b may be configured for open and/or endoscopic imaging, i.e., coupled to an endoscope, which may be flexible or rigid. The cameras 51a and 51b may be monoscopic or stereoscopic, the latter being configured to capture two side-by-side (left and right) images of the surgical site to produce a video stream of the surgical scene. In further embodiments, the cameras 51a and 51b may be mounted to any suitable arm or holder, or held by a user.
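As a non-limiting illustration, the following Python sketch, which assumes OpenCV is available, splits a side-by-side stereo frame into left and right images and estimates a disparity map from which depth can be derived. The focal length and baseline values are illustrative assumptions.

```python
import cv2
import numpy as np

# Hedged sketch: splitting a side-by-side stereo frame from one of the cameras
# and estimating a disparity map with OpenCV block matching. Intended for use
# on full-resolution frames; camera parameters are illustrative assumptions.

def disparity_from_side_by_side(frame_bgr):
    h, w = frame_bgr.shape[:2]
    left, right = frame_bgr[:, : w // 2], frame_bgr[:, w // 2 :]
    left_gray = cv2.cvtColor(left, cv2.COLOR_BGR2GRAY)
    right_gray = cv2.cvtColor(right, cv2.COLOR_BGR2GRAY)
    matcher = cv2.StereoBM_create(numDisparities=64, blockSize=15)
    return matcher.compute(left_gray, right_gray).astype(np.float32) / 16.0


def depth_from_disparity(disparity, focal_px=800.0, baseline_m=0.004):
    # depth = f * B / d; invalid (non-positive) disparities are masked out.
    depth = np.full_like(disparity, np.nan)
    valid = disparity > 0
    depth[valid] = focal_px * baseline_m / disparity[valid]
    return depth
```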
The cameras 51a and 51b are coupled to a video processing device 56 (
The video processing device 56 also includes various other computer components, such as RAM, a storage drive, peripheral ports, and an input device (e.g., a touch screen). Additionally, the video processing device 56 is also coupled to one or more displays via output ports. The video processing device 56 is configured to output the processed images through any suitable video output port, such as DISPLAYPORT™, HDMI®, etc., that is capable of transmitting processed images at any desired resolution, display rate, and/or bandwidth.
With reference to
As shown in
The imaging system 100 may also be used in colorectal surgical procedures, such as anastomosis, with the second camera 51b being inserted transanally to provide the view of the anastomosis site while the first camera 51a is inside the abdominal cavity providing the exterior view of the colon or the anastomosis site. The imaging system 100 may also be used in localizing tumors in the colon or rectum by positioning the second camera 51b inside the colon and/or large intestine and the first camera 51a outside thereof.
In urological procedures, the second camera 51b may be inserted through the urethra while the first camera 51a is inserted into the abdomen. In gynecological procedures (e.g., hysterectomy), the second camera 51b may be inserted transvaginally and the first camera 51a may be inserted into the abdominal cavity through the access port 55b.
The imaging system 100 may also be used in thoracic cavity procedures, such as lung surgeries. The second camera 51b may be coupled to a bronchoscope and may be inserted transorally or transnasally, whereas the first camera 51a may be inserted through the abdominal wall as described above.
The imaging system 100 may also be used during the initial phases of abdominal surgeries, namely, establishing pneumoperitoneum and inserting cameras to establish visualization of the abdominal cavity. During these phases, the second camera 51b may be disposed outside of the patient while the first camera 51a is being inserted into the abdominal cavity. This configuration allows the second camera 51b to monitor the position and insertion of the first camera 51a.
The video processing device 56 is configured to receive multiple video streams from the cameras 51a and 51b and to combine and/or augment the video streams to generate one or more 3D volumes as shown in
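As a non-limiting illustration, the following Python sketch shows how depth maps from two opposing cameras could be unprojected into a common frame to form a fused point cloud, a simplified stand-in for the 3D volume described above. The intrinsics and camera poses are illustrative assumptions; a deployed system would use calibrated values and the tracked arm kinematics.

```python
import numpy as np

# Sketch of fusing two depth maps (one per camera) into a common frame.
# Intrinsics and poses are illustrative assumptions.

def unproject(depth, fx, fy, cx, cy):
    """Turn a depth map (meters) into an N x 3 point cloud in the camera frame."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth.ravel()
    valid = np.isfinite(z) & (z > 0)
    x = (u.ravel() - cx) * z / fx
    y = (v.ravel() - cy) * z / fy
    return np.stack([x, y, z], axis=1)[valid]


def to_world(points_cam, rotation, translation):
    """Transform camera-frame points into a shared world frame."""
    return points_cam @ rotation.T + translation


# Assumed poses: the two cameras face each other across the tissue.
R1, t1 = np.eye(3), np.array([0.0, 0.0, 0.0])
R2 = np.diag([1.0, -1.0, -1.0])           # rotated 180 degrees about x
t2 = np.array([0.0, 0.0, 0.12])           # 12 cm away, looking back

depth1 = np.full((4, 4), 0.05)            # toy depth maps for illustration
depth2 = np.full((4, 4), 0.06)
cloud = np.vstack([
    to_world(unproject(depth1, 500, 500, 2, 2), R1, t1),
    to_world(unproject(depth2, 500, 500, 2, 2), R2, t2),
])
print(cloud.shape)
```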
In embodiments, the user may also switch which of the first and second cameras 51a and 51b is used for hand-eye coordination when controlling the surgical robotic system 10 via the surgeon console 30. In particular, the surgeon console 30 is configured to track the eye gaze of the user and to coordinate the surgeon's gaze with user engagement with the hand controllers 38a and 38b. Thus, the hand-eye coordination depends on which camera is being controlled. The GUI of the screens 32 and 34 may be used to select which of the first and second cameras 51a and 51b is the main camera used for hand-eye coordination.
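As a non-limiting illustration, the following Python sketch shows a minimal camera-selection mechanism in which the active camera's extrinsics determine how hand-controller motion is expressed. The camera names, extrinsics, and selection function are assumptions for illustration only.

```python
import numpy as np

# Minimal sketch of switching which camera's pose defines the hand-eye mapping.
# The extrinsics and selection mechanism (e.g., a GUI callback) are assumptions.

CAMERA_EXTRINSICS = {
    "camera_51a": np.eye(3),                       # placeholder world-to-camera rotations
    "camera_51b": np.diag([1.0, -1.0, -1.0]),
}
active_camera = "camera_51a"


def set_main_camera(name):
    """Called when the user selects the main camera on the GUI."""
    global active_camera
    if name not in CAMERA_EXTRINSICS:
        raise ValueError(f"unknown camera: {name}")
    active_camera = name


def hand_to_active_camera(delta_position):
    """Express a hand-controller translation in the active camera's frame."""
    return CAMERA_EXTRINSICS[active_camera] @ np.asarray(delta_position, float)


set_main_camera("camera_51b")
print(hand_to_active_camera([0.01, 0.0, 0.0]))
```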
The first and second video streams may be combined in a variety of ways. The first video stream from the first camera 51a may be the main view shown on the first screen 32 of the surgeon console 30, with the second video stream being shown in a picture-in-picture (PIP) on the first screen 32 or on the second screen 34. In embodiments, the PIP view may be located and/or sized automatically by the surgeon console 30 based on the position of the first camera 51a or based on the surgical action, phase, or step of the surgical procedure.
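As a non-limiting illustration, the following Python sketch, which assumes OpenCV is available, composes a picture-in-picture view by overlaying a downscaled secondary stream onto the main stream. The corner selection and sizing rule are illustrative assumptions rather than the console's actual placement logic.

```python
import cv2
import numpy as np

# Sketch of the picture-in-picture composition described above. The sizing
# rule and corner selection are illustrative assumptions.

def compose_pip(main_frame, inset_frame, corner="bottom_right", scale=0.25, margin=16):
    """Overlay a downscaled inset stream onto the main stream."""
    out = main_frame.copy()
    h, w = out.shape[:2]
    inset = cv2.resize(inset_frame, (int(w * scale), int(h * scale)))
    ih, iw = inset.shape[:2]
    x = margin if "left" in corner else w - iw - margin
    y = margin if "top" in corner else h - ih - margin
    out[y:y + ih, x:x + iw] = inset
    return out


main = np.zeros((720, 1280, 3), dtype=np.uint8)
inset = np.full((480, 640, 3), 255, dtype=np.uint8)
combined = compose_pip(main, inset, corner="top_left")
print(combined.shape)
```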
A surgical procedure can include multiple phases, and each phase can include one or more surgical actions. A “surgical action” can include an incision, a compression, a stapling, a clipping, a suturing, a cauterization, a sealing, or any other such actions performed to complete a phase in the surgical procedure. A “phase” represents a surgical event that is composed of a series of steps (e.g., closure). A “step” refers to the completion of a named surgical objective (e.g., hemostasis). During each step, certain surgical instruments 50 (e.g., forceps) are used to achieve a specific objective by performing one or more surgical actions.
In some examples, the system 10 may include a machine learning processing system having a phase detector that uses machine learning models to identify the surgical action, phase, or step within the surgical procedure (“procedure”). The phase detector uses a particular procedural tracking data structure from a list of procedural tracking data structures. The phase detector selects the procedural tracking data structure based on the type of surgical procedure that is being performed. In one or more examples, the type of surgical procedure is predetermined or input by the user. The procedural tracking data structure identifies a set of potential phases that can correspond to a part of the specific type of procedure. In further embodiments, the phase detector may detect the surgical action, phase, or step by image processing the first and/or second video streams.
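As a non-limiting illustration, the following Python sketch shows one possible shape of a procedural tracking data structure and a stub phase detector. The procedure, phases, steps, and detection logic are placeholders, not the actual machine learning models.

```python
from dataclasses import dataclass, field
from typing import List

# Hedged sketch of a procedural tracking data structure in the spirit described
# above; the procedure, phases, and detection logic are illustrative placeholders.

@dataclass
class Step:
    name: str            # named surgical objective, e.g., "hemostasis"

@dataclass
class Phase:
    name: str            # surgical event, e.g., "closure"
    steps: List[Step] = field(default_factory=list)

@dataclass
class ProcedureTemplate:
    procedure_type: str
    phases: List[Phase] = field(default_factory=list)


TEMPLATES = {
    "cholecystectomy": ProcedureTemplate(
        "cholecystectomy",
        phases=[
            Phase("dissection", [Step("expose cystic duct")]),
            Phase("clipping and division", [Step("clip cystic duct"), Step("divide duct")]),
            Phase("closure", [Step("hemostasis"), Step("port closure")]),
        ],
    ),
}


def select_template(procedure_type: str) -> ProcedureTemplate:
    """Select the procedural tracking data structure for the procedure being performed."""
    return TEMPLATES[procedure_type]


def detect_phase(frame_features, template: ProcedureTemplate) -> Phase:
    """Placeholder for the model-based detector: return the most likely phase."""
    index = int(frame_features.get("phase_score", 0)) % len(template.phases)
    return template.phases[index]


template = select_template("cholecystectomy")
print(detect_phase({"phase_score": 1}, template).name)
```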
The first video stream, which includes the external view, may also be augmented by the second video stream, which includes the internal view. The first video stream may be augmented to provide a transparent or a partially transparent view of the organ “O”. In particular, the first and second video streams may be used to generate a 3D volume having a first augmented video stream, in which a portion of the exterior image of the organ “O” (i.e., of the first video stream) is removed and replaced with a reconstructed portion based on the interior image of the organ “O” (i.e., of the second video stream). Reconstruction of the image may be performed using machine learning image processing algorithms that generate textures and depth maps of the imaged objects. In particular, the video processing device 56 is configured to generate depth maps of the tissue surfaces in the first and second video streams. A second augmented stream may also be generated by replacing a portion of the interior image of the organ “O” (i.e., of the second video stream) with a reconstructed portion based on the exterior image of the organ “O” (i.e., of the first video stream). The 3D volume with the first augmented video stream may be shown on the first screen 32 and the second augmented video stream may be shown on the second screen 34, providing the user with multiple augmented views by removing obstructions between the first and second FOVs of the first and second cameras 51a and 51b.
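As a non-limiting illustration, the following Python sketch, which assumes OpenCV is available, removes a masked region of the exterior (first) stream and fills it with a patch derived from the interior (second) stream. A real implementation would warp the patch using the generated depth maps and camera poses; here the reconstruction is reduced to a resize and an assumed blend weight.

```python
import cv2
import numpy as np

# Sketch of the "transparent organ" augmentation described above. The resize
# stands in for depth-map-based reconstruction; the blend weight is an assumption.

def augment_exterior_view(exterior, interior, mask, alpha=0.85):
    """mask: uint8 image, 255 where the exterior surface should become transparent."""
    ys, xs = np.nonzero(mask)
    y0, y1, x0, x1 = ys.min(), ys.max() + 1, xs.min(), xs.max() + 1
    # Stand-in for depth-map-based reconstruction of the hidden surface.
    reconstructed = cv2.resize(interior, (x1 - x0, y1 - y0)).astype(np.float32)
    out = exterior.copy()
    roi = out[y0:y1, x0:x1].astype(np.float32)
    weight = (mask[y0:y1, x0:x1, None].astype(np.float32) / 255.0) * alpha
    out[y0:y1, x0:x1] = (weight * reconstructed + (1.0 - weight) * roi).astype(np.uint8)
    return out


exterior = np.zeros((480, 640, 3), np.uint8)
interior = np.full((480, 640, 3), 200, np.uint8)
mask = np.zeros((480, 640), np.uint8)
cv2.circle(mask, (320, 240), 80, 255, thickness=-1)
view = augment_exterior_view(exterior, interior, mask)
print(view.shape)
```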
With reference to
As shown in
With continued reference to
As shown in
Additional modifications and enhancements of the video streams may include providing an overhead (e.g., 2D) or 3D schematic map depicting the cameras 51a and 51b and their corresponding FOVs, as well as showing orientation indicators in each video stream. The 2D or 3D schematic map may be displayed on any of the screens 32 or 34 and allows the user to see the positioning of the instrument 50 and the first and second cameras 51a and 51b relative to each other, since these objects are not viewable by an outside camera.
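As a non-limiting illustration, the following Python sketch, which assumes Matplotlib is available, draws an overhead schematic map with camera positions, FOV wedges, and an instrument marker. All coordinates and FOV angles are illustrative placeholders.

```python
import matplotlib.pyplot as plt
import numpy as np

# Sketch of an overhead (2D) schematic map showing assumed camera and instrument
# positions and camera fields of view; all values are illustrative placeholders.

def fov_wedge(origin, heading_deg, fov_deg=70.0, reach=0.08, n=30):
    angles = np.radians(np.linspace(heading_deg - fov_deg / 2, heading_deg + fov_deg / 2, n))
    xs = np.concatenate([[origin[0]], origin[0] + reach * np.cos(angles)])
    ys = np.concatenate([[origin[1]], origin[1] + reach * np.sin(angles)])
    return xs, ys


fig, ax = plt.subplots()
for name, pos, heading in [("camera 51a", (0.00, 0.00), 45.0),
                           ("camera 51b", (0.12, 0.10), 225.0)]:
    xs, ys = fov_wedge(pos, heading)
    ax.fill(xs, ys, alpha=0.3, label=f"{name} FOV")
    ax.plot(*pos, "o")
ax.plot(0.06, 0.05, "x", label="instrument 50")
ax.set_aspect("equal")
ax.legend()
plt.savefig("schematic_map.png")
```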
The cameras 51a and 51b may also be used to image the same continuous object, e.g., abdominal cavity. With reference to
Multiple video streams may also be used to detect critical structures, such as the cystic duct, arteries, etc., using machine learning image processing techniques. In particular, different FOVs of the same object may be used to compare two images of the same object to provide for more accurate identification of the critical structures than a single camera view. Once identified, the critical structures may be highlighted on the first and/or second video streams to allow the surgeon to avoid accidentally injuring the structures during operation on neighboring tissue. Highlighting may be done by coloring the structures and/or drawing a boundary around them. In further embodiments, the critical structures may be labeled.
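As a non-limiting illustration, the following Python sketch, which assumes OpenCV 4.x is available, highlights a critical structure by tinting a segmentation mask, drawing its boundary, and adding a label. The mask is a placeholder input standing in for the output of the machine learning identification step.

```python
import cv2
import numpy as np

# Sketch of highlighting a detected critical structure (e.g., cystic duct) on a
# video frame. The segmentation mask is a placeholder for the model output.

def highlight_structure(frame, mask, tint=(0, 0, 255), label="cystic duct"):
    out = frame.copy()
    colored = np.zeros_like(out)
    colored[mask > 0] = tint
    out = cv2.addWeighted(out, 1.0, colored, 0.4, 0)      # translucent coloring
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    cv2.drawContours(out, contours, -1, tint, 2)          # boundary
    if contours:
        x, y, _, _ = cv2.boundingRect(contours[0])
        cv2.putText(out, label, (x, max(y - 5, 0)), cv2.FONT_HERSHEY_SIMPLEX, 0.6, tint, 2)
    return out


frame = np.zeros((480, 640, 3), np.uint8)
mask = np.zeros((480, 640), np.uint8)
cv2.rectangle(mask, (300, 200), (360, 320), 255, thickness=-1)
print(highlight_structure(frame, mask).shape)
```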
In further embodiments, at least one of the second camera 51b or the first camera 51a may be an infrared camera configured to capture near infrared (NIR) or infrared light that is used to illuminate tissue. The NIR images may be used to highlight specific tissue regions in the combined video stream or in the stream of the camera providing NIR images.
With reference to
While the present disclosure has been described using two cameras, the system and method according to the present disclosure may be implemented using three or more cameras. In particular, a plurality of video streams may be used to generate one, two, or more combined video streams and/or some or all of the video streams may be augmented with the video streams of counterpart cameras.
It will be understood that various modifications may be made to the embodiments disclosed herein. Therefore, the above description should not be construed as limiting, but merely as exemplifications of various embodiments. Those skilled in the art will envision other modifications within the scope and spirit of the claims appended thereto.
This application claims the benefit of the filing date of provisional U.S. Patent Application No. 63/418,854 filed on Oct. 24, 2022. The entire disclosure of the foregoing application is incorporated by reference herein.