This document pertains generally to evaluations, and more particularly, but not by way of limitation, to skill evaluation.
Human performance of a task, such as surgery, is evaluated for various reasons, including for example, developing skills and identifying expertise. Objective and subjective evaluation criteria can be established for evaluating or judging the performance of a subject. Some examples of tasks in which a subject uses physical controls to manipulate a mechanism include surgery, driving a vehicle and operating machinery.
Typical methods of evaluating performance entail human oversight and are, thus, financially burdensome and often imprecise.
In the drawings, which are not necessarily drawn to scale, like numerals describe substantially similar components throughout the several views. Like numerals having different letter suffixes represent different instances of substantially similar components. The drawings illustrate generally, by way of example, but not by way of limitation, various embodiments discussed in the present document.
The following detailed description includes references to the accompanying drawings, which form a part of the detailed description. The drawings show, by way of illustration, specific embodiments in which the invention may be practiced. These embodiments, which are also referred to herein as “examples,” are described in enough detail to enable those skilled in the art to practice the invention. The embodiments may be combined, other embodiments may be utilized, or structural, logical and electrical changes may be made without departing from the scope of the present invention. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present invention is defined by the appended claims and their equivalents.
In this document, the terms “a” or “an” are used, as is common in patent documents, to include one or more than one. In this document, the term “or” is used to refer to a nonexclusive or, unless otherwise indicated. Furthermore, all publications, patents, and patent documents referred to in this document are incorporated by reference herein in their entirety, as though individually incorporated by reference. In the event of inconsistent usages between this document and those documents so incorporated by reference, the usage in the incorporated reference(s) should be considered supplementary to that of this document; for irreconcilable inconsistencies, the usage in this document controls.
Overview
The present subject matter includes methods and systems for evaluating skills. Exemplary methods utilize a Markov model or Hidden Markov model for analyzing the departure of a specific signal from what is expected by that model.
The present subject matter is described in this document largely based on Markov and hidden Markov models. Nevertheless, other types of models are also contemplated, including algorithmic or rule-based models, dynamical system models and statistical models (of which Markov and hidden Markov models are but two examples).
In one example, the performances of surgical skills on a pig by several participants were recorded and a model based on data generated from experts performing the skills has been created. The present subject matter distinguishes between signals generated by experts and non-experts and can be applied to non-surgical manipulative tasks including, human or non-human operation of a machine. For example, the present subject matter can facilitate analysis of manipulations of physical controls used to operate a mechanism, such as driving a vehicle (steering wheel and pedals), flying an aircraft (yoke and pedals), operating machinery (such as a crane) and minimally invasive surgery.
Markov and hidden Markov models are exemplary statistical models which can be used for voice recognition of speech. Models of speech sounds are created in a controlled manner and a sample sound is recognized based on a comparison of the sample sound with those models. Statistical models, such as Markov and hidden Markov models, can tolerate variations in utterance of a particular word.
In the present subject matter, electrical signals derived from surgical instruments are used as a source input. The electrical signals are generated by sensors coupled to a surgical instrument when manipulated by operators performing at various skill levels. Surgical skill models are developed based on the recorded information. Once trained, data recorded by other surgeons (including experts and novices) are examined using the model. The model can be used to identify expert surgeons in a group. In one example, the present subject matter includes a skill measurement tool.
The analysis of the data recorded during surgery can be done off-line. That is, data analysis (and expert identification) is conducted after completion of the surgical procedure.
In one example, the data analysis is conducted in real time. That is, data processing and quantification of the skill level of subjects is performed concurrent with data acquisition.
In one example, large amounts of recorded data is compressed and simplified using vector quantization. Vector quantization was initially developed for image compression and it is adapted for use in the present subject matter.
The method includes receiving electric signals associated with a subject performing a particular task. Greater number of signals provides improved performance. In one example, the method includes receiving data recorded by experts to train a model.
In one example, a surgical robot is used to train subjects and subject performance evaluation is generated in real time. Feedback provided by the present system can augment skill development and reduce the burden of supervision.
In one example, a robotically controlled interface is coupled to one or more simulators for training purposes.
In one example, subjects are scored on their performance based on a simulated or actual manipulative task. In one example, performance is evaluated using a simulation prior to performing an actual complex procedure. Feedback derived from the evaluated simulation can be used to tailor actual performance. For example, surgeon performance using a surgical simulator can be evaluated prior to conducting actual surgery on a patient. The evaluation may reveal that the subject's performance is inferior to that of an expert because of fatigue or other correctable factor.
In one example, an interface includes a layer operating in the background of the surgical environment (actual, virtual or robotically controlled) which can interject upon detection of a departure from an expert performance. For example, if the conduct of a lower skilled surgeon is detected, then at a critical procedure, the layer will interrupt and prevent harmful movement or interrupt and suggest an improved course or provide tactile feedback (haptic) sensations to cause the surgeon to alter their performance. The layer can be implemented in hardware or in instructions executed by a computer of the present subject matter. In one example, the background layer fulfills a supervisory role as to a manipulative task.
The Markov decision process makes decisions by prioritizing possible choice as measured by evolving values criteria.
Assessing Skill with Medical Simulators
In the surgical context, procedurally-oriented skills can be performed utilizing three different modalities, (a) during actual open or minimally invasive clinical procedures; (b) in physical or virtual reality simulators with or without haptic feedback; and (c) during interaction with surgical robotic systems, as shown in
In each modality, the surgeon is separated from the treated tissue or medium by an instrument or a mechanical interface. In some examples, the interface includes a virtual component. The intermediate modality in all these examples can be considered interchangeable. A common element of these modalities is the human-machine interface in which visual, kinematics, dynamic, and haptic information is shared between the surgeon and the various modalities. This interface can provide multi-dimensional data to objectively assess technical surgical skill within the general framework of surgical ability.
The algorithm used for objective assessment of skill is independent of the modality actually used and therefore, the same algorithms can be incorporated into any of these technologies. Objective methodologies for assessing task or skill competence and performance can be used to enhance training, reduce cost and improve competency.
In one example, the surgical task is deconstructed or decomposed to expose and analyze the internal hierarchy of tasks. Task decomposition is associated with defining selected elements of the manipulative process. For example, in surgery, the procedure is divided into steps, stages, or phases with defined intermediate goals. Additional hierarchical decomposition is based on identifying tasks or subtasks and actions or states. Low-level elements of the task decomposition are associated with quantify measurable parameters. Definition of these states along with measurable, quantitative data allows for modeling of surgical tasks or medical examination.
The present subject matter can be applied to the various modalities and includes decomposing the medical procedure (such as an examination or surgical task) into fundamental states associated with discrete observations. The task is represented by a statistical model such as a multi-state Markov model, a hidden Markov model or other such model. A performance of a test subject is evaluated based on the statistical distance calculated between the test subject and at least one stored model. In one example, the stored models correspond to performance of the task at various skill levels, including that of a novice and an expert. The analysis can be conducted in real-time and provide feedback during the performance. Feedback, in various examples, can be in the form of audio, visual or tactile. The present subject matter can be used with various modalities and systems (including robotic systems and simulators) for evaluating performance of a manipulative task.
In the present subject matter, a prime element is modeled by a finite state. In the context of Markov modeling and speech recognition, the prime element is the spoken word. The prime element in the surgical context relates to tool-tissue interaction or hand-tissue interaction. Within a particular tool-tissue interaction or hand-tissue interaction, variations in forces and torque magnitudes can be noted for different skill levels and, in the context of speech recognition, this relates to variations in word pronunciation. The various force and torque magnitudes are simulated by discrete observations in the model. A sequence of tool-tissue or hand-tissue interactions comprise the steps of a medical procedure having intermediate and specific outcomes, and by analogy in the speech recognition context, a sequence of words represent a sentence or chapter.
A variety of sensors are used to generate signals corresponding to, for example, completion time, work space, force, position and tool path.
In one example, a physical simulator in the form of an instrumented teaching-mannequin representing the female pelvis and the breast exam, male prostate exam, and endotracheal intubation was used. Data was acquired from approximately 1800 students and clinicians, including quantitative measures of hands-on clinical exam techniques used while performing procedures. Background information for the students and clinicians, and a database of outcome measures including the user's clinical assessment scores and independent skilled observer ratings of the users' techniques while performing these examinations or procedures in physical simulators, was also collected.
Sensors coupled to surgical robotic systems were used to collect data on surgical tool positions and the torque commands between the master unit and the robotic instrument actuators.
Markov modeling, according to the present subject matter, provides an objective assessment of medical/surgical skills in a manner transparent to modality.
In one example, data mining is performed on a database corresponding to a manipulative task. A surgical robot provides data generated by sensors while performing surgical tasks on animal and human subjects.
In one example, two-handed, instrumented endoscopic tools and Markov models are used to perform task decomposition and objective skill assessment with the Markov modeling approach. Sensor arrays coupled to the tools and robotic systems provide quantitative data to allow data mining and clustering and multi-state Markov modeling and analysis of the particular tasks.
Objective assessment of surgical competence during minimally invasive surgery procedures is a multi-dimensional problem. Minimally invasive surgery (MIS) refers to a surgical procedure involving a minimally invasive surgical setup. Physiological constraints (stress, fatigue), equipment constraints (camera rotation and port location), team constraints (nurses), and physician ability are representative parameters that affect the outcome of a MIS procedure. Ability, with respect to surgery, is defined as the natural state or condition of being capable; innate aptitude (prior to training), which an individual brings for performing a surgical task. Minimally invasive surgery ability includes cognitive factors (knowledge and judgment) and technical factors (psychomotor ability, visio-spatial ability and perceptual ability). By definition, fundamental psychometric abilities are fixed at birth or early childhood and show little or no learning effect. However training enables the subject to perform as close as possible to his or her inherent psychometric abilities.
The methodology for objectively assessing surgical skill (as a subset of surgical ability), according to the present subject matter, includes objective and quantitative analysis. Such methodology is enabled by using instrumented tools, measurements of the surgeon's arm kinematics, gaze patterns, physical simulators, a variety of virtual reality simulators (those with and without haptics) and robotic systems. An instrumented tool can be used to generate data corresponding to kinematics (position, velocity, acceleration, and jerk), dynamics (force, and torque), contact information between the tool and the medium (a.go. real tissue or simulated tissue) and recorded display of the scene in the proximity of the tool.
Regardless of the modality being used or the clinical procedure being studied, task deconstruction or decomposition is one component of an objective skills-assessment methodology. Exposing and analyzing the internal hierarchy of tasks provides an objective means for quantifying training and skills acquisition.
Task decomposition is associated with defining the prime elements of the manipulative task. In surgery, a particular procedure is divided into steps, stages, or phases with well-defined intermediate goals. Additional hierarchical decomposition is based upon identifying tasks or subtasks including a sequence of actions or states. In addition, other measurable parameters such as workspace completion time, tool position, and forces and torques can be analyzed. Selecting low-level elements of the task decomposition allows one to associate these elements with quantifiable and measurable parameters. The definition of these states, along with measurable, quantitative data, are used for modeling and examining surgical tasks as a process.
In the proposed study, an analogy between minimally invasive surgery (MIS) and the human language inspires the decomposition of a surgical task into its prime elements. Modeling the sequential element expressions using a multi-finite states model (for example, a Markov model) reveals the internal structure of the surgical task which is utilized in assessing surgical performance. Markov modeling (MM) and hidden Markov modeling (HMM), a subset of MM, are used to characterize manipulative tasks.
Within the context of the three modalities (direct surgery/clinical examination, simulated procedures—either physical or virtual, and surgical robot), the procedure can be summarized as follows:
In one example, the present subject matter includes procedures for analyzing a database acquired from two modalities (simulator and instrumented surgical tools) using vector quantization algorithms.
According to one example, a method includes decomposing the task using expert knowledge and developing the Markov model architectures, training the Markov models based on the processed data, developing the learning curves based on measuring the statistical similarity between the models representing subjects at different levels of surgical training to enable an objective assessment of surgical skills and generalizing the methodology for assessing skill in the three modalities.
In the context of battlefield conditions, for example, military medical personnel may be called upon to perform tasks that may exceed the complexity or skill of a civilian medical personnel. Even extended experience in a civilian trauma center may be inadequate to prepare military personnel to perform under realistic conditions. As such, simulators are valuable tools in training military personnel. In addition, a mechanism for assessing skill can be helpful in a simulator and in particular, a simulator used to train military medical care providers.
Among other applications, a statistical model, such as a Markov model, can provide a tool in developing a methodology for studying models of the human operator in complex interactive tasks with machines.
Databases and Data Collection
A particular surgical robot, known popularly as the BlueDRAGON, is a system developed at the University of Washington for acquiring the kinematics and the dynamics of two endoscopic tools along with the visual view of the surgical scene while performing a MIS procedure. The system includes two four-bar passive mechanisms attached to two endoscopic tools. During a minimally invasive surgical procedure, the endoscopic tool is inserted into the body through a port located, for example, in the abdominal wall. The tool is rotated around a pivot point within the port that is generally inaccessible for sensors aimed to measure rotation of the tool. The position and orientation of the tool, with respect to the port, is tracked by sensors that are incorporated into the joints of the mechanism. The two mechanisms are equipped with three classes of sensors.
A first class of sensor include position sensors (such as potentiometers) incorporated into four of the joints of the mechanisms for measuring the position, orientation and translation of the two instrumented endoscopic tools attached thereto. In addition, two linear potentiometers are attached to the handles of the tools and used for measuring the endoscopic handle and tool tip angles.
A second class of sensors include three-axis force/torque (F/T) sensors (with holes drilled at their center) that are inserted and clamped to the proximal end of the shafts of the endoscopic tools. In addition, double beam force sensors are inserted into the handles of the tools for measuring the grasping forces at the hand-tool interface.
A third class of sensors include contact sensors, based on a resistance-capacitance (RC) circuit, which provides a binary indication of tool-tip/tissue contact.
Data measured by the sensors are acquired using two 12-bit USB A/D cards sampling the 26 channels (4 rotations, 1 translation, 1 tissue contact, and 7 channels of forces and torques from each instrumented grasper) at a frequency of 30 Hz. In addition to data acquisition, the synchronized view of the surgical scene is incorporated into a graphical user interface displaying data in real-time.
Preliminary tests acquiring data at a sampling rate of 1 KHz indicated that 95% of the signals accumulated energy is in a bandwidth 0-5 Hz. In addition, a graphical user interface (GUI) is provided to display information measured by the surgical robot in real-time while incorporating endoscopic view of the surgical scene acquired by the endoscopes video camera. On the top right side of the GUI, a virtual representation of the two endoscopic tools are shown along with vectors representing the instantaneous velocities. On the bottom left a three dimensional representation of the forces and torque vectors are presented. Surrounding the endoscopic image are bars representing the grasping/spreading forces applied on the handle and transmitted to the tool tip via the tool's internal mechanism, along with virtual binary LED indicating contact between the tool tips and the tissues.
A representative physical simulator is popularly known as the E-Pelvis. The E-pelvis is a physical simulator developed at Stanford University that consists of a partial mannequin (umbilicus to mid-thigh) constructed in the likeness of an adult human female. The mannequin is instrumented internally with force sensors that are connected to a computer having a graphical user interface for providing a real-time visual feedback. Test subjects perform simulated clinical female pelvic examinations on the mannequin and the data is collected at a sampling frequency of 30 Hz and stored in memory for off-line analysis.
A representative surgical robot system, popularly known as DaVinci, is commercially available from Intuitive Surgical (Sunnyvale, Calif.) and is FDA approved for selected surgical procedures. The system is equipped with an interface card that allows passive acquisition of internal variables of the robot during operation. Examples of data generated include position of the surgical tools and motor commands. The data is sampled at 30 Hz, displayed in real time by using a user interface and stored for off-line analysis.
Protocol for the Surgical Robot
The protocol using the surgical robot included collecting data from task performances conducted by surgeons having different levels of expertise. In one example, the performances of 30 surgeons were monitored. Levels of expertise ranged from surgeons in training to surgical attending physicians. Five subjects in each group represented the five years of surgical training, (5×R1, R2, R3, R4, R5-where the numeral denotes year of training) and five expert surgeons. For the purpose of this example, an expert surgeon (E) was defined as a board certified laparoscopic surgeon who performed at least 800 surgeries and practices medicine as an attending physician. Each subject was given instruction through a multimedia presentation on how to perform three basic surgical tasks involving (1) tying an intracorporeal knot; (2) manipulating tissue; and (3) tissue dissection. The multimedia presentation included a written description of the task and a video clip of the surgical scene with audio explanation of the task. Subjects were then given 15 minutes in which to complete this task in a swine model.
In addition to the surgical task, each subject performed 15 predefined tool/tissue and tool/needle-suture interactions as shown in
The kinematics (that is, the position/orientation (P/O) of the tools in space with respect to the port), and the dynamics (that is, forces and torque—F/T—applied by the surgeons on the tools) of the left and right endoscopic tools along with the visual view of the surgical scene were acquired by a passive mechanism coupled to the surgical robot. This data provided the F/T and velocity signatures associated with each interaction that were then used as the model observations associated with each state of the model.
Protocol for the Physical Simulator
The experimental protocol for the simulator included 400 students and 375 clinicians performing pelvic examinations using the simulator. The data include forces as a function of time recorded from sensors distributed in the simulator. In addition, background information on all of the users was also recorded. These records include a database of outcome measures, the user's clinical assessment scores, and independent skilled observer ratings of the users' techniques while performing examinations or procedures on the simulators.
Data Analysis
The methodology for analyzing the data includes a multi-step processes of data reduction starting from multi-dimensional raw data and ending with a single objective performance score. The methodology is linked directly to the physics of the medium being treated. Data processing provides insights into the process being analyzed as opposed to a black box approach where only the inputs and outputs are well defined and the modal internal architecture is arbitrarily selected and unlinked to the physical world.
Multi-Dimensional Raw Data
Multi-dimensional data was collected as a function of time for each modality under study. Time charts of the typical plots are depicted in
The vector representation of the data allows spatial graphical representation rather than time charts. Vector representation of exemplary data is shown in
The complexity of the surgical task and the multi-dimensional data can be noted in the raw data. This complexity can be resolved, in part, by decomposing the surgical task into primary elements, thus enabling insights into the clinical procedure as a process.
Vector Quantization
Data quantization is used to reduce the dimensions of the data. The data can be envisioned as a non-homogeneous discrete cloud encompassing the acquired data points, as illustrated in
Each cluster center can be defined by a discrete symbol (e.g.
In one example, the number of states of a Markov model is selected based on user-selected criteria. For example, a 30-state Markov model can be used to represent two tools working collaboratively or a 3-state or 15-state hidden Markov model can be used to represent a single tool.
Each one of the 15 states was associated with a unique set of forces, torques, angular and linear velocities, as indicated in the table of
Data reduction can be performed in three phases. During the first phase a subset of the database is created by appending the 13-dimensional vectors associated with each state measured by the left and the right tools and performed by all subjects. The 13-dimensional subset of the database (ωx, ωy, ωz, ωg, VZ, Fx, Fy, Fz, Tx, Ty, Tz, Fg, U) was transformed into a 9-dimensional vector
The subscripts x, y and z are used to associate the angular and linear velocities (ω, V) the forces (F), and torques (T) with the stationary coordinate system and an origin located at the surgical port. The combined axes x-y, x-z and y-z define planes parallel to the coronal, sagittal, transverse planes respectively. The Z-axis is pointing toward the anterior side of the abdominal wall. The subscript g is used to associate the angular velocities (ω) and the forces (F) with the tool's grasping handle. The binary variable U indicates whether the tool is in contact with the tissue or any other element in the surgical scene.
In the second phase, a K-means vector quantization algorithm is used to identify 10 cluster centers associated with each state.
Mathematically the process is defined as follows: Given M patterns
The K-means algorithm is based on minimization of the sum of squared distances from all points in a cluster domain to the cluster center,
where Si(k) was the cluster domain for cluster center
The cluster regions
In a third phase, the 10 cluster centers
Ten signatures of forces, torques, linear and angular velocities are associated with the 15 types of states (tool/tissue or tool/object interaction) defined by the table illustrated in
In the graph of
Both static, quasi-static and dynamic tool/tissue or tool/object interactions are represented by the various cluster centers. Even in static conditions, the forces and torques provide a unique and un-ambivalent signature that can be associated with each one of the 15 states.
Markov Model
In one example, data analysis included developing a model that represents the process of performing MIS and methodology for objectively evaluating surgical skill. A Markov model provides a statistical method to summarize a relatively complex task such as a step or a task of a MIS procedure. In one example, skill level was incorporated into the Markov model by developing different models based on data acquired for different levels of expertise ranging from a first year resident to an expert surgeon.
A model is generated to represent the clinical procedure for analyzing the data. The model includes multiple interconnected states where each state represents an interaction between the physician using a tool or between the physician's hands and the tissues. After the physician is engaged in a specific interaction with the tissue, different forces, torques (along with the tool kinematics) are generated through the interaction. The action/reaction information transmitted between the tool or the hand and the tissue is referred to as an observation and can be measured by an array of sensors incorporated into the various modalities previously noted.
The medical procedure can be described as a dynamic process in which the physician is moving between states while interacting with the tissue. During the physician's interaction with the tissue in each state, different types of information is exchanged between the tools or the hand and the tissue by utilizing the various observations typical to a specific state. After the physician is engaged with the tissue, the physician may remain in this state for a period of time and then perform a transition and engage with the tissue (again utilizing a different state), while using its associated observations.
This process can be modeled by a finite state machine or in a generalized form as a Markov model. The statistical nature of the model arises from the fact that each transition between two states or utilization of an observation in a state is associated with a probability. There is a particular probability that the physician will use certain transitions between the states that facilitates a specific observation while interacting in the tissue in a certain state. The model, as a whole, along with its states and observations, represents the clinical procedure. Moreover a specific navigation pattern between the model states and utilizing specific observations is associated with a particular skill. Physicians with a similar skill level are more likely to navigate through similar states of the model and leave the same trace. However, differences between the various skills level are related to different traces in the model. Each trace can be quantified by accumulating the probabilities associated with each transition. These accumulating probabilities define an objective score which can be used to differentiate between various skill levels.
The Markov model has a generic architecture (including the prime elements) such as states and observation. A specific model architecture defined for a particular medical procedure is based on an expert knowledge. Using expert knowledge, the various states and their interconnection are defined, and form a step in the model development. Each procedure has a unique model architecture and the generic methodology for assessing skill is independent of a specific procedure. The following sections will use MIS as an example of the methodology, thus demonstrating how the Markov model is translated into practice.
Analyzing the degrees of freedom (DOF) of a tool in MIS reveals that, due to the introduction of the port through which the surgeon inserts tools into the body cavity, two DOF of the tool are restricted. The six DOF of a typical open surgical tool is reduced to four DOF in a minimally invasive setup. These four DOF include rotation along the three orthogonal axes (x, y and z) and translation along the long axis of the tool's shaft (z). A fifth DOF is defined as the tool-tip jaws angle, which is mechanically linked to the tool's handle such as, when a grasper or a scissor is used. Additional one or two degrees of freedom can be obtained by adding a wrist joint to the MIS tool. The wrist joint enhances the dexterity of the tool within the body cavity.
Surgeons, while performing MIS procedures, utilize various combinations of the DOF while manipulating the tool during the interaction with the tissues or other items in the surgical scene (such as a needle, a suture or a staple) in order to achieve the desired outcome. In one example, quantitative analysis of the position and orientation of the tool during surgical procedures revealed 15 different combinations of the five DOF for a tool while interacting with the tissues and other objects. These 15 DOF combinations will be further referred to, and modeled as states (see
The modeling approach underling the methodology for decomposing and statistically representing a surgical task is based on a fully connected, symmetric finite-states (30 states) Markov model where the left and the right tools are represented by 15 states each as illustrated in
The Markov model is defined by the notation in Equation 4. Each Markov sub-model representing the left and the right tool is defined by λL and λR (Equation 4). The sub-model is defined by:
(i) The number of states—N whereas individual states are denoted as S={s1, s1, . . . sN}, and the state at time t as qt.;
(ii) The number of distinct (discrete) observation symbol—M whereas individual symbols are denoted as V={v1, v1, . . . vM};
(iii) The state transition probability distribution matrix indicating the probability of the transition from state qt=si at time T to state qt+1=sj at time t+1−A={aij}, where aij=P[qt+1=sj|qt=si] 1≦i, j≦N;
Note that A={aij} is a non-symmetric matrix (aij≠aji) since the probability of performing a transition from state i to state j using each one of the tools is different from the probability of performing a transition from state j to state i.
(iv) The observation symbol probability distribution matrix indicating the probability of using the symbol Vk while staying at state sj at time t−B={bj(k)}, where for state j bj(k)=P[vk at t|qi=sj ]1≦j≦N, 1≦k≦M;
(v) The initial state distribution vector indicating the probability of starting the process with state si at time t=1−π where πi=P[q1=si] 1≦i≦N.
The two sub-models are linked to each other by the left-right interstate transition probability matrix or the cooperation matrix indicating the probability for staying in states si with the left tool sr with the right tool at time t−C={cir}, where cir=P[qtL=s1∪qtR=sr] 1≦l, r≦N
Note that C={clr} is a non-symmetric matrix clr≠crl since it representing the combination of using two states simultaneously by the left and the right tools.
The probability of observing the state transition Q={q1, q2, . . . qT} and the associated observation sequence O={o1, o2, . . . oT}, given the two Markov sub-models (Equation 4) and interstate transition probability matrix, is defined by Equation 5
Since probabilities, by definition, have numerical value in the range of 0 to 1, the probability calculated by Equation 5 converges exponentially to zero and therefore exceeds the precision range of a machine. Hence, by using logarithmic transformation, the resulting values of Equation 5 in the range of [0 1] are mapped by Equation 6 into [−∞0].
Due to the nature of the process associated with surgery in which the procedure, by definition, always starts in the idle state (state 1), the initial state distribution vector is defined as follows in Equation 7.
π1L=π1R=1
λiL=πiR=0 2≦i≦N. (Equation 7)
Given the encoded data, 30 Markov models, (one for each subject) are calculated defining the probabilities for performing certain tool transitions ([A] matrix), the probability of combining two states ([C] matrix), and the probability of using the various signatures in each state ([B] matrix).
An element in the [A] matrix is calculated as the ratio between the number of times a specific transition between state i to state j took place n(qt=sj|qt-1=si) and the total number of state transitions n which is also equal to one minus the number of data points. There are N numbers of potential transition between two state and therefore the order of [A] is N×N. The sum of each line in the [A] matrix is equal to one. An element in the [B] matrix is calculated as the ratio between the number of times a specific observation vk was used while staying in state sj, m(vk|qt=sj) and the total number of visits of state j, m(qt=sj) which is also equal to the number of times any observation was used while visiting that state. There are N number states and M number of potential transition between two states and therefore the order of [A] is N×N. The sum of each line in the [B] matrix is equal to one. An element in the [C] matrix is calculated as the ratio between the number of times the left hand side model is in state sl as well as the right hand side of the model is in state sr, c(qLt=sl∩qRt=Sr) and the total number of state combinations observed n which is also equal to the number of data points. The sum of all lines and columns of the [C] matrix is equal to one.
In models extracted as described above from the sample surgical data, the highest probability values in the [A] matrix appear along the diagonal. Accordingly, a transition associated with remaining at the same state is more likely to occur rather than a transition to any one of the other 15 potential states. In minimally invasive surgical suturing, for example, the default transition from any state is to the grasping state (state number 2) as indicated by the high probability values along the second column of the [A] matrix. The probability of using one out of the 150 cluster centers (illustrated in
Each tool (left and right) can be only in one out of the 15 states. However, there are potentially 225 (15×15) different combinations in which the left tool is in state i and the right tool is in state j. For that reason the dimensions of the [C] matrix is 15×15.
The idle state (state 1) in which no tool/tissue interaction is performed was mainly used, in most of the surgical tasks (by both expert and novice surgeons), to move from one operative state to another. The expert surgeons used the idle state as a transition state while the novices spent a significant amount of time in this state planning the next tool/tissue or tool/object interaction. In the case of surgical suturing and knot tying, the grasping state (state 2) dominated the transition phases since the grasping state, in this case, maintains the scene in an operative state in which both the suture and the needle were held by the two surgical tools.
Objective Skill Assessment
Once the Markov modes are defined for specific subjects with specific skill levels, it becomes possible to calculate the statistical distance factors between them. The statistical distance factors are considered to be an objective criterion for evaluating skill level if, for example, the statistical distance factor between a trainee (indicated by index R) and an expert (indicated by index E) is being calculated.
Given two Markov models λEi=(λLEi, λREi, CEi)(expert) and λTi=(λLTj, λRTj, CTj) (trainee) the asymmetric statistical distances between them are defined as D1(λTj, λEi) and D2(λEi, λTj). The natural expression of the symmetric statistical distance version DEiTj is defined by Equation 8.
Setting an expert level as the reference level of performance, the symmetric statistical distance of a model representing a given subject from a given expert (DEiTj) is normalized with respect to the average distance between the models representing all the experts associated with the expert group (
For the purpose of calculating the normalized learning curve, the distances between all the subjects associated with the group of experts was first calculated DE
Once the reference level of expertise was determined, the statistical distances between each one of the 25 subjects, grouped into five levels of training (R1, R2, R3, R4, R5), and each one of the experts was calculated (5 distances for each individual, 25 distances for each group of skill level and 125 distances for the entire data base) using Equation 8. The average statistical distance and its variance defines the learning curve of a particular task.
Complimentary Objective Indexes
In addition to the Markov models and the statistical similarity analysis, two other objective indexes of performance can be measured and calculated, including the task completion time and the overall length (L) of the path generated by the left and the right tool tips. Where DL,DR are the distances between two consecutive tool tip positions PL(t−1), PR(t−1) and PL(t), PR(t) as a function of time of the left and the right tools respectively.
These complimentary performance indexes are available for the particular surgical robot database in which motion of the tool was acquired. Acquisition of tool motion in the other modalities is also contemplated.
FIGS. 10A-C illustrate normalized Markov model-based statistical distance as a function of the training level, normalized completion time and normalized path length of the two tool tips respectively. The complementary subjective normalized scoring is depicted in
In particular,
The data illustrates that substantial suturing skills are acquired during the first year of the residency training. The learning curves do not indicate significant improvement during the second and the third years of training. The rapid improvement of the first year is followed by lower gradient of the learning curve as the trainees progress toward the expert level. The Markov model-based statistical distance along with the completion time criteria indicate another gradient in the learning curve that occurs during the fourth year of the residency training followed by slow conversion to expert performance. Similar trends in the learning curve are also demonstrated by the subjective assessment. One particular subject in the R2 group outperformed his peers in his own group and some subjects in a more advanced groups (R3, R4) which slightly altered the overall trend of the learning curves as defined by the different criteria.
Exemplary Method
A clinical procedure, regardless of the performance modality, entails synthesis between visual and kinesthetic information. Analyzing the procedure in terms of these two sources of information facilitates development of objective criteria for training physicians and evaluating the performance in different modalities including real procedures, master/slave robotic systems or virtual reality or physical simulators.
The Markov model and the vector quantization described herein is suitable for multi-modal sources of information, including low level data (such as tool kinematics and dynamics defining the model observations) and high level methodological processes (such as tool/tissue interactions formulating the model's state). The Markov model provides a mathematical representation of the process associated with manipulative tasks including complex medical procedures such as surgery. In one example, the present subject matter provides a quantitative and objective measure of surgical performance.
Exemplary outcomes of analysis of minimally invasive surgical procedures using the present subject matter revealed differences between surgeons at different skill levels including, (i) the types of tool/tissue/object interactions being used, (ii) the transitions between tool/tissue/object interactions being applied by each hand, (iii) time spent while performing each tool/tissue/object interaction, (iv) the overall completion time, (v) the various F/T/velocity magnitudes being applied by the subjects through the endoscopic tools, and (vi) two-handed collaboration. In addition, the F/T associated with each state revealed that the F/T magnitudes are relatively task-dependent with relatively high F/T magnitudes applied by novices compared to experts during tissue manipulation, and vice versa during tissue dissection. High efficiency of surgical performance was demonstrated by the expert surgeons and expressed by shorter tool tip displacements, shorter periods of time spent in the ‘idle’ state and sufficient application of F/T on the tissue to safely accomplish the task.
In various examples, the present subject matter facilitates development of objective criteria for decomposing a medical procedure and analysis using models. In one example, objective measures of skill and competency enables training and evaluating performance. In real-time, the present subject matter provides feedback to the trainee or as an artificial intelligent background layer which may increase performance efficiency in medicine and improve patient safety and outcome.
Indexes of Performance
Following two steps of data reduction, data that were collected by the surgical robot and were used to develop models representing MIS as a process. In data reduction, there is a compromise between decreasing the input dimensionality while retaining sufficient information to characterize and model the process under study. Utilizing the VQ algorithm the 13 dimensional stream of acquired data were quantized into 150 symbols with nine dimensions each.
The data quantization included identification of the cluster centers and encoding the database based on the identified cluster centers. Every data point meeting two criteria is then associated with one of the 150 identified cluster centers. The first criterion is to have the minimal geometrical distance to one of the cluster centers. Once the data point was associated with a specific cluster center it is, by definition, associated with a specific state out the 15 defined. Based on expert knowledge of surgery, the table in
MIS is recognized both qualitatively and quantitatively as a multidimensional process. As such, studying one parameter (e.g. completion time, tool-tip paths, or force/torque magnitudes) reveals only one aspect of the process. A model that describes MIS as a process can facilitate study of the internal process and provide information. At the high level, a tremendous amount of information is encapsulated into a single objective indicator of surgical skill level and expressed as the statistical distance between the surgical performance of a particular subject under study from a surgical performance of an expert. As part of an alternative approach a combined score could be calculated by studying each parameter individually (e.g. force, torque, velocity, tool path, completion time etc.), assigning a weight to each one of these parameters, which is a subjective process by itself, and combining them into a single score. The assumption underlying this approach is that a collection of aspects associated with surgery may be used to assess the overall process. However this alternative approach ignores the internal process that is more likely to be revealed by a model such as the Markov model. In addition, as opposed to analyzing individual parameters, studying the low levels of the model provides profound insight into the process of MIS in a way that allows one to offer constructive feedback for a trainee regarding performance aspects like the appropriate application of F/T, economy of motion, and two handed manipulation.
The application of F/T on the tissue has an impact on the surgical performance efficiency and outcome of surgery. Some results indicate that the F/T magnitudes are task dependent. Experts applied high F/T magnitudes on the tissues during tissue dissection as apposed to low F/T magnitudes applied on the tissues by trainees that were trying to avoid irreversible damage. An inverse relationship regarding the F/T magnitudes was observed during tissue manipulation in which high F/T magnitudes applied on the tissue by trainees exposed them to acute damage. These differences were observed in particular states (e.g. those states including grasping for tissue manipulation and states involving spreading for tissue dissection). Due to the inherent variance in the data, even multidimensional ANOVA failed to identify this phenomena once the F/T magnitudes are removed from the context of the multi state model. Given the nature of surgical task, the Markov model [B] Matrix, encompassing information regarding the frequency in which the F/T magnitudes were applied, may be used to assess whether the appropriate magnitudes F/T were applied for each particular state. Tissue damage is correlated with surgical outcome and linked to the magnitudes and the directions in which F/T were applied on the tissues. As such, tissue damage boundaries may be incorporated into the [B] matrix for each particular state. Given the surgical task, this additional information may refine the constructive feedback to the trainee and the objective assessment of the performance.
The economy of motion and the two hand collaboration may be further assessed by retrieving the information encapsulated into the [A], and [C] matrices. The amount of information incorporated into these two data structures exceeds the information provided by a single indicator (such as tool-tip path length or completion time) for the purpose of formulating constructive feedback to the trainee. Given a surgical task, utilizing the appropriate sets of states and state transitions are skill dependent. This information is encompassed in the [A] matrix indicating the states that were in use and the state transitions that were performed. Moreover, the ability to refine the time domain analysis using the multi-state Markov model indicated, as was observed in previous studies, that the ‘idle’ state is utilized as a transition state by expert surgeons whereas a significant amount of time is spent in that state by trainees.
Coordinated movements of the two tools is yet another indication of high skill leveling MIS. At a lower skill level the dominant hand is more active than the non-dominant hand as opposed to a high skill level in which the two tools are utilized equally. The collaboration [C] matrix encapsulates this information and quantifies the level of collaboration between the two tools.
The Markov model provides insight into the process of performing MIS. This information can be translated into a constructive feedback to the trainee as indicated by the three model matrices [A], [B] and [C]. Moreover, the capability of running the model in real-time and its inherent memory allows a senior surgeon supervising the surgery or an artificially intelligent expert system incorporated into a surgical robot or a simulator to provide immediate constructive feedback during the process as previously described.
Although the notations and the model architecture of the Markov model and the hidden Markov model approach are similar, there are several differences between them. The Markov model can be perceived as a white box model in which each state has a physical meaning describing a particular interaction between the tools and tissue or other objects in the surgical scene (such as sutures and needles). The hidden Markov model can be perceived as a black box model in which the states are abstract and are not related to a specific physical interaction. In the white box model, each state has a unique set of observations that characterize only the specific state. By definition, once the discrete observation is matched with a vector quantization code-word the state is also defined. States in the hidden Markov model share the same observations, however different observation distributions differentiate between them.
Other sensors can be used to generate data for the present subject matter including, for example, sensors configured to measure position, orientation, force, torque, pressure, physiological variables and contact. In addition, other sensors, including a velocity sensor, an acceleration sensor, a pressure sensor, a visual display of a scene being analyzed, a clock, and a temperature sensor can also be used to generate data for the present subject matter.
In one example, a hybrid model is generated which represents the topology between a Markov model and a hidden Markov model. The hybrid model adds another layer of complexity to the Markov model by introducing the observation elements for each state. The hybrid model provides insight into the process by linking the states to physical and meaningful interactions. The hybrid model includes the collaboration matrix [C] in addition to the Markov model notation. The collaboration matrix [C] is not normally present in either the Markov model or the hidden Markov model. The collaboration matrix [C] links the models representing the left and right hand tools since surgery is a two-handed task.
In one example, the Markov model provides physical meaning to the process being modeled. In one example, the hidden Markov model provides a compact model topology and does not rely on expert knowledge incorporated into the model.
In one example, a method of the present subject matter includes defining the scope of the model and the fundamental elements, the state and the observation. For example, in the case of minimally invasive surgery, the surgical task is modeled by a fully connected model topology were each tool/tissue/object interaction is modeled as a state. In one example, each phenomenon is represented by a model with abstract states wherein each tool/object interaction is modeled by an entire model using more generalized definitions for these interactions e.g. place position, insert remove. In one example, additional models are used with a predetermined overall structure that represents the overall process.
In one example, the scope of the model is limited to objectively assess technical factors of surgical ability. Cognitive factors can be assessed by the model where a specific action is taken as a result of a decision making process.
Decomposing MIS and analyzing it using a Markov model is one approach for developing objective criteria for surgical performance.
In one example, the present subject matter, when used in real-time during the course of learning as feedback to the trainee surgeons or as an artificial intelligent background layer, may increase performance efficiency in MIS and improve patient safety and outcome.
One example of the present subject matter utilizes a plurality of models and a performance of a specimen is correlated to a particular model based on a generated distance that describes the probability that the specimen matches a particular one of the plurality of models.
The present subject matter can be applied to other types of human machine interfaces, including, for example, flight simulators and vehicle simulators and other multi-state non-medical devices and simulators.
In one example, an intelligent layer or expert system is configured to interject a message or interrupt a process performed by a robotic device. For example, an imprudent manipulation by a low skilled surgeon will trigger delivery of a message, either visually, audibly or tactile. In one example, the robotic device will prevent an imprudent manipulation or provide cues to suggest adoption of an alternate manipulation.
In one example, the models are adapted or trained against a data set. For example, a first year resident performing a minimally invasive surgical procedure will generate a particular set of performance data. In one example, a Baum-Welch algorithm is executed by a set of computer implemented instructions. A Baum-Welch algorithm is used to train the models for each skill level based on data from the training groups of known skill levels. In other words, the Baum-Welch algorithm facilitates the determination that the hidden Markov model can generate data matching the particular specimen performance. The Baum-Welch algorithm is but one example of a class of algorithms known as forward-backward algorithms, machine learning algorithms or pattern recognition algorithms and other alorgithms are also contempalted for use with the present subject matter. In one example, a forward-backward algorithm is used to determine the probability that the specimen performance correlates to a particular Markov model.
In one example, the surgical robot is equipped with 26 sensors and at a sampling rate of 100 readings per second, 2,600 data points are generated per second.
Execution of the Baum-Welch algorithm facilitates adaptation or modification of the data to represent a particular subject performance. In one example, the Baum-Welch algorithm is executed for each particular skill level in order to train the model. In one example, specimen data is used in the forward-backward algorithm and applied to the data corresponding to each of the six models generated and the present subject matter selects the one model with the highest probability. In one example, a correlation function is executed to determine a performance grade for a particular specimen.
In one example, a “distance” is calculated between each mode and the specimen data set. The shortest distance correlates to the highest probability for a match.
In one example, a recurrent neural networks (ARMA, autoregressive moving average) is calculated to correlate specimen performance to a particular model data set.
In various examples, measurements of the tool path length (a measure of the movement of a tool tip), time, force applied or other parameter is used to judge performance. Other parameters include torque, position, displacement, electrical contact measurement (resistance) and temperature. Such parameters can be used in the analysis of surgical tasks such as suturing, cutting, cauterizing and ablating.
In one example, a hidden Markov model is applied to physical signals generated by a performance of a manipulative task conducted by a specimen. The internal parameters are adjusted to improve stability of the signal generated. For example, a window is established around a particular signal to a limit the amount of variable changes. By establishing a window or boundaries, the asymptotic change of a value is bracketed and convergence is accelerated. In one example, a trial and error approach is performed in establishing the boundaries for a particular signal value.
The present subject matter can be operated in real-time and provide feedback (any of visual, aural, tactile) regarding performance during the manipulative task.
The methodology is independent of the modality used and can be incorporated into an example of the present subject matter including any of an instrumented surgical tool, a simulator, and a robotic system. In addition, the present subject matter can include an instrumented tool configured to provide performance data where the tool is a non-surgical device.
In one example, the present subject matter executes an algorithm that can be described as a black box model of skill. The black box model generates generalized findings such as probabilities, fuzzy logic membership functions, or similar abstract numbers. In one example, the algorithm generates generalized findings of skill using a model based on fuzzy logic.
It is to be understood that the above description is intended to be illustrative, and not restrictive. For example, the above-described embodiments (and/or aspects thereof) may be used in combination with each other. Many other embodiments will be apparent to those of skill in the art upon reviewing the above description. The scope of the invention should, therefore, be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled. In the appended claims, the terms “including” and “in which” are used as the plain-English equivalents of the respective terms “comprising” and “wherein.” Also, in the following claims, the terms “including” and “comprising” are open-ended, that is, a system, device, article, or process that includes elements in addition to those listed after such a term in a claim are still deemed to fall within the scope of that claim. Moreover, in the following claims, the terms “first,” “second,” and “third,” etc. are used merely as labels, and are not intended to impose numerical requirements on their objects.
The Abstract of the Disclosure is provided to comply with 37 C.F.R. §1.72(b), requiring an abstract that will allow the reader to quickly ascertain the nature of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, various features may be grouped together to streamline the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter may lie in less than all features of a single disclosed embodiment. Thus the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment.
This patent document claims the benefit of priority, under 35 U.S.C. Section 119(e), to Blake Hannaford, U.S. Provisional Patent Application Ser. No. 60/711,514, entitled “SKILL EVALUATION,” filed on Aug. 26, 2005 (Attorney Docket No. 2082.006PRV).
The invention was made with Government support under contract or grant number DAMD17-97-1-7256 entitled “Force/Torque Signatures in Minimally Invasive Surgery: Quantification of Skill and Improvement of Outcomes,” project period 6/1997-5/1999, awarded by the Defense Advanced Research Projects Agency (DARPA) and under contract or grant number W81XWH-04-1-0464 entitled “Markov Models,” project period March 2004-May 2006, awarded by the Department of Defense (DOD). An Information Technology Research (ITR) award from the National Science Foundation (NSF) via the John Hopkins University supported this work. The Government has certain rights in this invention.
Number | Date | Country | |
---|---|---|---|
60711514 | Aug 2005 | US |