Artifact detection and removal can be performed for physiological data collected in uncontrolled or ‘messy’ situations like in the hospital or at home to ensure data quality. As sensors become more ubiquitous and optimized for comfort and convenience over signal quality, ensuring data quality is increasingly the responsibility of analysis processes that can quickly detect and correct artifacts. Specifically, robust artifact removal helps physiological modalities to become clinical standards, since artifact removal should be integrated into hardware systems to ensure high quality data for clinicians.
Many artifacts are clearly identifiable by eye and attributable to sources such as patient movement, accidental removal or repositioning of sensors, or interference from other equipment. However, automating removal of artifacts that can be detected by eye can be challenging. Common methods for artifact removal in simpler situations, such as thresholding, may not be sufficient for complex clinical environments. Supervised learning tools can be impractical for detecting and removing artifacts from clinical data because they are trained on sets of labeled data, which can be difficult and time-consuming to prepare, especially on the sub-second timescale of most artifacts. In addition, artifact rejection strategies should not remove or distort true data, especially in cases where temporal dependencies exist. Temporal dependencies may also warrant special considerations in methods development, for example, favoring removal of multiple smaller chunks of data rather than a single continuous chunk.
Electrodermal activity (EDA) is a physiological measure that is inexpensive and convenient to collect but is not yet a clinical standard because there are no rigorous tools to process and analyze it. EDA represents the changing electrical conductance of the skin due to the activity of sweat glands, which provide part of the body's sympathetic ‘fight or flight’ reflex. EDA has immense potential as a physiological marker to track sympathetic activation in situations involving pain or stress. Developing frameworks and methodologies to process EDA, including artifact detection and removal specific to clinical situations, would bring EDA closer to being a clinical standard. In particular, improved frameworks and methodologies to process EDA, including artifact detection and removal, can be useful in estimating a patient's nociceptive state.
For example, during surgery, while a patient is sedated, an anesthesiologist typically relies on changes in heart and blood pressure, the epidemiologic profile of the patient, the pharmacology of the anesthetics being used, and previous experience with similar patients and similar surgeries to decide how and when anti-nociceptive agents should be administered to the patient. However, these conventional methods of predicting a sedated patient's nociceptive state can be unreliable. Several confounding factors, including drug-induced hemodynamic instability, respiratory variability, and hypnotic effect make it difficult to track nociception. These confounding factors restrict an anesthesiologist's ability to predict the patient's underlying nociceptive state. As a consequence, an anesthesiologist may underdose or overdose the patient with anti-nociceptive agents, both of which have consequences. Patients that are grossly underdosed suffer in pain and patients that are overdosed remain sedated or delirious for many hours following surgery.
EDA has the potential to be used for real-time tracking of a patient's nociceptive state during surgery so that an anesthesiologist can supply appropriate amounts of anti-nociceptive agents. However, previous attempts to use EDA to estimate a patient's nociceptive state have suffered from inaccuracy, as well as a lack of statistical rigor and lack of validation. EDA measurements can be corrupted by artifacts caused by a number of factors including patient movement, the surgical techniques, and the surgical instruments. For example, when surgical instruments that use an electric current (e.g., electrocautery) are applied to the patient, these instruments cause electrical interference in the EDA data collected from the patient, usually in the form of large spikes in the data. These spikes make it difficult to accurately estimate the patient's nociceptive state from raw EDA data.
Previously, these artifacts have been removed by high-pass filtering with an arbitrary threshold, which risked filtering out true EDA data if the threshold was too low or interpreting artifacts as true EDA data if the threshold was too high, both of which contribute to inaccurate estimates of nociceptive state. Other artifact removal methods, including wavelet decomposition and other decomposition-based methods, also filter out true EDA data. The anesthesiologist's need for real-time feedback on the patient's nociceptive state prevented the use of more rigorous frameworks and methodologies to process EDA to detect and remove artifacts.
Unfortunately, simple filters do not remove artifacts precisely enough from EDA data for that EDA data to be used for clinical decision making. Postprocessing can remove artifacts from EDA data more precisely but takes too long to provide clinically relevant information—by the time the postprocessed EDA data is available, the surgery may be over. Simple filters and postprocessing also cannot remove every type of artifact from EDA data. For example, sharp artifacts tend to be partially mitigated rather than fully removed. Because conventional techniques cannot produce accurate, artifact-free EDA data in real time or near-real time in a clinical or surgical setting, EDA data is not used today to inform anesthesiology care or other clinical decisions.
The present techniques for identifying and removing artifacts from EDA use unsupervised (machine) learning methods, including isolation forest, K-nearest neighbor distance, and 1-class support vector machine (SVM). Unlike supervised learning methods, unsupervised learning methods are not trained with labeled sets of training data. Instead, they assign data to groups based on patterns detected in the data. The unsupervised methods are able to differentiate between artifact and true data given the appropriate features. Our unsupervised learning methods distinguish artifacts from true data by tracking complex patterns in half-second windows among twelve physiological features based on our own experimentation and the scientific literature.
In one example, our techniques work on EDA collected during surgery in the operating room, where artifacts are caused by interference from surgical cautery equipment. This is one of the most intense clinical situations, and our techniques robustly remove artifacts from this data, demonstrating that they are adaptable for any clinical situation. Our artifact identification and removal techniques move EDA closer to being a clinical standard by removing artifacts in real-time so that clinicians can use them to determine patient's nociceptive state and adjust the patient's anesthetic accordingly.
To demonstrate our techniques, we collected EDA data continuously during lower abdominal surgery in 69 human subjects. The source of most artifacts was surgical cautery, which caused large visible deflections in the EDA data every time the cautery instrument was turned on and off. These cautery-induced deflections can occur over 150 times in an average surgery at short, irregular intervals. The magnitude, sharpness, and direction of artifactual deflections varied across subjects. Each time the cautery was turned on, it typically only remained on for a few seconds. While the cautery-induced deflections are clearly visible in the EDA data, to complicate matters, there are segments of intact, shifted (down typically) EDA data between the deflections.
Our techniques can be implemented as a method of identifying and removing artifacts from raw EDA data. This method can be carried out in real time (e.g., during surgery as the EDA data is collected from the patient) or off-line (e.g., after surgery is complete). Our techniques include dividing the raw EDA data into segments, each of which has a duration corresponding to a timescale of the artifacts (e.g., about 0.5 seconds); extracting respective feature vectors from the raw EDA data segments; determining statistical properties (e.g., the inter-artifact interval distribution) of the raw EDA data segments based on the respective feature vectors using unsupervised learning (e.g., isolation forest, K-nearest neighbor (KNN) distance, or 1-class support vector machine (SVM)); determining an appropriate cut-off for each unsupervised learning method based on the statistical properties of the distribution of artifact; and removing the artifacts from the EDA data using unsupervised learning with the appropriate cut-off.
The features in the respective feature vectors may include a standard deviation of signal, difference between maximum amplitude and minimum amplitude, mean of a first derivative, median of the first derivative, standard deviation of the first derivative, minimum amplitude of the first derivative, maximum amplitude of the first derivative, mean of level 4 Haar wavelet coefficients, median of level 4 Haar wavelet coefficients, standard deviation of level 4 Haar wavelet coefficients, minimum of level 4 Haar wavelet coefficients, and maximum of level 4 Haar wavelet coefficients from each of the segments of the EDA data.
Identifying the artifacts in the segments of the EDA data may include generating scores, using unsupervised learning, for the segments based on the respective feature vectors and determining if the segments have artifacts based on the scores. Determining if the segments have artifacts can be accomplished by comparing the scores to a threshold based on skewness and/or kurtosis of an inter-artifact interval distribution of the raw EDA data.
Removing the artifacts identified using unsupervised learning from the raw EDA data can include interpolating at least one of the segments of the raw EDA data and adjusting a bias of the at least one of the segments of the raw EDA data. The raw EDA data can be collected from a patient undergoing surgery, in which case the artifacts may be caused by positioning the patient and/or performing electrocautery on the patient. The processed EDA data can be used to estimate the patient's nociceptive state, which can in turn be used to adjust a dosage of an anesthetic agent administered to the patient.
Other embodiments include an EDA sensor with first and second electrodes, an analog front end (AFE) coupled to the first and second electrodes, a processor coupled to the AFE, and a display operably coupled to the processor. In operation, the first and second electrodes electrically couple with first and second portions, respectively, of skin of a person and with each other. The AFE receives and conditions EDA data collected by the first and second electrodes. The processor identifies and removes artifacts from the EDA data by dividing the EDA data into segments whose durations correspond to a timescale of the artifacts; extracting respective feature vectors from the EDA data segments; identifying, using unsupervised machine learning, the artifacts in the segments based on the respective feature vector; and removing the artifacts from the EDA data to yield corrected EDA data. The display can display the corrected EDA data in real time, e.g., for use in clinical or surgical decision making.
The AFE can supply a voltage of about 0.2 V to about 2.5 V to the first electrode. The first and second electrodes can include or have first and second fasteners, respectively, so that they can be secured against the first and second portions of human skin (e.g., the first proximal phalange of a first finger on a hand and a second proximal phalange of a second finger on the hand).
The EDA sensor may be part of a system for tracking a nociceptive state of the person, in which case the processor is configured to determine the nociceptive state of the person in real time based at least in part on the corrected EDA data and the display is configured to display a real-time indication of the nociceptive state. Such a system can also include a sensor to measure a heart rate and a heart rate variability of the person, in which case the processor determines the nociceptive state of the person based at least in part on the heart rate and the heart rate variability.
Yet another embodiment includes a method of administering anesthetic agents to a person. This method includes collecting EDA data of the person and identifying and removing artifacts from the EDA data to yield corrected EDA data. The person's nociceptive state is determined based on the corrected EDA data and used to adjust a dosage of an anesthetic agent administered to the person. Identifying and removing artifacts from the EDA data can include dividing the EDA data into segments, each of which has a duration corresponding to a timescale of the artifacts; extracting respective feature vectors from the segments; identifying, using unsupervised machine learning, the artifacts in the segments based on the respective feature vectors; and removing the artifacts from the EDA data. Collecting the EDA data can occur in real time, in which case identifying the artifacts, removing the artifacts, and determining the nociceptive state occur in less than five minutes.
All combinations of the foregoing concepts and additional concepts discussed in greater detail below (provided such concepts are not mutually inconsistent) are contemplated as being part of the inventive subject matter disclosed herein. In particular, all combinations of claimed subject matter appearing at the end of this disclosure are contemplated as being part of the inventive subject matter disclosed herein. Terminology explicitly employed herein that also may appear in any disclosure incorporated by reference should be accorded a meaning most consistent with the particular concepts disclosed herein.
The skilled artisan will understand that the drawings primarily are for illustrative purposes and are not intended to limit the scope of the inventive subject matter described herein. The drawings are not necessarily to scale; in some instances, various aspects of the inventive subject matter disclosed herein may be shown exaggerated or enlarged in the drawings to facilitate an understanding of different features. In the drawings, like reference characters generally refer to like features (e.g., functionally similar and/or structurally similar elements).
Electrodermal activity (EDA) is typically measured with electrodes placed on the fingers or the palm. The electrodes sense changes in the skin's resistance or potential differences between different parts of the skin. These changes or potential differences represent neurally mediated effects on sweat gland permeability, e.g., in response to painful stimuli, significant stress, fear, anxiety, etc. EDA measurements can be used to track a nociception, or the perception of a painful or injurious stimuli, for a patient under anesthesia as disclosed in International Application No. WO 2021/011588 A1, which is from the same inventors, is entitled “Tracking Nociception under Anesthesia Using a Multimodal Metric,” and is incorporated herein by reference in its entirety for all purposes.
Unfortunately, EDA measurements can be corrupted by artifacts caused by the surgical techniques and instruments themselves. In laparoscopy, for instance, electrocautery tools are often used to remove unwanted tissue or to burn and seal blood vessels. Electric current running through the tip of a resistive metal electrocautery probe generates heat, which can be used to cauterize the tissue. The electric current running through the probe tip can also be picked up by the EDA electrodes, leading to artifacts in the EDA data that cannot be removed quickly or easily using conventional techniques.
System for Removing Artifacts from EDA Data
The EDA sensor 120 may use one or two of two different EDA sensing methods—endosomatic and exosomatic. An endosomatic sensor measures only potential differences originating in the skin itself without using an external current. An exosomatic sensor applies an external alternating current (AC) or direct current (DC) to the patient's skin for measuring the skin conductance. In some versions, the EDA device 120 may make exosomatic DC measurements. The EDA electrodes 130 can also be either wet or dry. In both cases, strong contact should be maintained between the electrodes 130 and the patient's skin to ensure good signal quality.
The EDA sensor 120 may be coupled to or attached to the patient's fingers via two electrodes 130. The two electrodes 130 may be Ag/AgCl electrodes connected to the EDA sensor 120 with electrical leads. In some versions, the electrodes 130 are attached to proximal phalanges on different fingers of the patient's hand via fasteners. The fasteners secure the electrodes 130 to the patient's skin so that the electrodes do not move during operation. The fasteners may be hook-and-loop fasteners or buckles. The EDA electrodes 130 can alternatively be placed on the patient's wrist(s) or on the sole(s) of the patient's foot or feet.
The AFE 122 provides a low-frequency excitation of less than about 300 Hz (e.g., 256 Hz) that limits penetration of the excitation signal to deeper layers of human tissue. The electrodes 130 are electrically connected to the AFE 122 via electrical leads to create a two-wire circuit. The AFE 122 supplies a source voltage (e.g., about 0.2 V to about 2.5 V DC) to the electrodes 130 to induce a small current across a patch of skin between the electrodes 130, and measures small current fluctuations across the patch of skin, where the fluctuations indicate changes in epidermal conductivity. The AFE 122 receives the analog EDA data of the fluctuations in current across the patch of skin and conditions the EDA data for further processing, for example, by converting the EDA data from the analog domain to the digital domain with an analog-to-digital converter.
The processor 124 receives the digital EDA data from the AFE 122. The processor 124 processes the digital EDA data by removing artifacts from the EDA data using one, two, three, or more different unsupervised learning techniques, including: isolation forest, K-nearest neighbor (KNN) distance, and 1-class support vector machine (SVM). These techniques are discussed in more detail below. The processor 124 may be coupled to the memory 128, which stores data and computer-executable instructions for processing the EDA data as described below. The processor 124 may also be integrated with a display or monitor 126 or the display or monitor 126 may be a separate device (e.g., a smartphone, laptop, or tablet) wirelessly coupled to the processor 124.
Some versions of the processor 124 can determine or estimate the nociceptive state of the patient 110 based on the artifact-free (or substantially artifact-free) EDA data. The processor 124 uses point process data and regression, state-state models, neural networks, and/or other statistical frameworks to compute a probability of the patient's perception of nociception. The EDA sensor 120 sends EDA data to the processor 124 (via the AFE 122) in real-time, and the processor 124 processes the EDA data in real-time or with a lag of at most a few seconds (e.g., 1 second, 2 seconds, 3 seconds, 4 seconds, 5 seconds, 10 seconds, or 30 seconds) or a few minutes (e.g., 1 minute, 2 minutes, or 5 minutes).
In particular, the processor 124 uses a point process model to capture the autonomic dynamics in EDA data. The processor 124 can display an indication of the resulting probability of the patient's perception of nociception based on the autonomic dynamics in the EDA data on the monitor 126 to an anesthesiologist 140. This indication may take the form of a graph or trace of the probability of the patient 110 perceiving nociception along with some of the EDA indices, including mean+/−standard deviation for EDA.
The anesthesiologist 140 can respond to the indication of the patient's nociceptive state by adjusting the type and dosage(s) of the drug(s) 142 (or other therapeutic agents) administered to the patient 110 (e.g., by an intravenous (IV) line 144). Because the indication of the patient's perception of nociception is provided in real time or with little lag, the anesthesiologist 140 can tailor the drug regimen more effectively, thereby reducing or eliminating side effects caused by overdosage or underdosage of the drugs 142. Too much anesthetic can cause confusion, nausea, and vomiting upon reawakening and difficultly urinating and defecating. Too little anesthetic can cause persistent or intermittent pain for months to years after surgery, which in turn can lead to dependence on or addiction to opioids. And too much or too little anesthetic can lengthen the patient's post-operative stay in the hospital, driving up healthcare costs.
In some versions, the system 100 may include one or more additional physiological sensors 150. The additional sensors 150 may be used in conjunction with the EDA sensor 120 so that the processor 124 may determine or estimate the patient's perception of nociceptive state using a multimodal metric, which may provide a more precise indication of probability of perception of nociception. The sensors 150 may include a heart rate (HR) sensor, such as an electrocardiogram (ECG) or pulse pressure wave sensor, a respiration sensor, a blood pressure (BP) sensor, and/or a skin temperature sensor. The sensors can be coupled to the processor 124 via the AFE 122 and all of the sensors may be synchronized to each other or to the same clock (e.g., a separate reference or the processor's internal clock).
Conventional filtering techniques and existing unsupervised methods for artifact removal are generally unsuitable for removing electrocautery artifacts from EDA data. Existing unsupervised methods are generally specific to the datasets for which they were created, which are typically in mostly controlled experimental settings with occasional but minimal artifact. None have the degree of artifact that surgical cautery interference produces in clinical EDA data. Neither variational mode decomposition nor wavelet decomposition can successfully identify and remove cautery-related artifacts from clinical EDA data like the data in
The EDA sensor 120 identifies and removes artifacts caused by surgical cautery, movement, etc. from the clinical EDA data using unsupervised machine learning methods, also called unsupervised learning, conducted by the processor 124. The EDA sensor 120 can successfully remove even heavy cautery artifacts from clinical EDA data. In addition, the EDA sensor 120 can do so while preserving true EDA data, including small snippets of true EDA data in between sections of EDA data corrupted by cautery artifacts.
Features and Unsupervised Learning for Removing Artifacts from EDA Data
Step 314 involves screening across thresholds (also called biases) on the scores for artifact for all three unsupervised machine learning methods. The labeled artifacts at each threshold are described by an associated inter-artifact interval distribution. The skewness and/or kurtosis of that distribution can be computed and plotted by threshold. Selecting a set of local maxima from skewness vs. threshold curve yields a set of candidate thresholds on the scores to evaluate each unsupervised method. In step 316, the candidate thresholds are themselves ‘scored’ based on the characteristics of the resulting ‘cleaned’ EDA data in terms of local standard deviation. Three specific characteristics are compared to quantitative limits set from examining artifact-free data and the optimal threshold is chosen as the one that most closely meets all limits. Generally, the optimal threshold is the threshold that removes the most artifact without sacrificing too much actual EDA data.
More specifically, in step 316, each of the candidate thresholds was assessed by detecting and removing artifact using that threshold and then computing three metrics on the corrected signal: the maximum standard deviation in any half-second window; the ratio of the maximum standard deviation in any half-second window to the 90th percentile standard deviation in any half-second window; and the ratio of the maximum standard deviation in any half-second window to the median standard deviation in any half-second window. Then, the three metrics were converted into score components by computing their differences from the limits of 0.48, 6, and 10 respectively, and penalizing the distance above each limit by twice as much as the distance below,
In step 318, the best combination of unsupervised machine learning method and threshold on scores is chosen as the one which has the lowest proportion of labeled artifact while still meeting all other conditions (being the optimal threshold). In step 320, the chosen combination of unsupervised machine learning method and threshold is used to remove artifacts from the EDA data and interpolate gaps in the data resulting from artifact removal.
In step 312, each of the three unsupervised methods was applied to each 0.5-second window of data and yielded a score for each window, where a higher score indicates a higher likelihood of an artifact in that window. In other examples, fewer or more than three unsupervised methods (e.g., one, two, four, or five unsupervised methods) can be applied to the data windows. For instance, multiple unsupervised methods can be applied to sample data or initial data and evaluated, then the best-performing unsupervised method (e.g., isolation forest) can be applied to the remaining data.
TABLE 1 shows twelve features that are extracted from each 0.5-second window of raw EDA data. These features are a combination of those used by other existing methods (e.g., wavelet-based methods) as well as additional ones that we discovered were useful based on experimentation. They summarize characteristics of the data that differ between artifact and true signal, such as sudden large changes in amplitude. A subset of these features may achieve the same or similar performance. Other features, including those that capture sudden high amplitude changes, could be useful in addition to or instead of the features in TABLE 1.
We computed these features for each 0.5-second window (128 samples) for each dataset to match with the timescale of most artifacts. These feature vectors were then fed as inputs into the three unsupervised learning methods. Each unsupervised learning technique operated on the feature vectors in a different way to produce a score for that feature vector.
Isolation forest is like random forest; however, each vector of features is scored based on the average length of the path to isolate it down to a leaf in an ensemble of decision trees. Data with artifacts are thought to have a shorter path length than true data, since they are fundamentally different in nature from true data. In this case, each isolation forest included 100 decisions trees, and the isolation scores were computed as the median of 10 such forests.
KNN distance computes the mean distance between each vector of features and the K nearest analogous vectors in the dataset. In this case, KNN distance was computed using Euclidean distance and K=50. Artifactual data are thought to be further in distance from true data.
1-class SVM is like regular SVM, except that it is trained on only true data (only one class) and then tested on its ability to detect data that are not sufficiently similar to true data, in this case, artifact. These artifacts are assumed to be rare in occurrence compared to true data. In this case, 1-class SVM was trained on 90% of the data, based on the 90% with the lowest KNN distance as a conservative estimate of true data and excluding the 10% of data points with the greatest KNN distance.
Each unsupervised learning method yielded a score for each window of data quantifying the degree of abnormality. The formula for determining the score may be different for each unsupervised learning method. For example, for isolation forest, the score is based on the average length of the path in a tree to isolate that specific node from all others. For KNN distance, the score may an aggregate measure of the distance from the K nearest neighbors, such as the sum, mean, median. For 1-class SVM, the score may be the distance from the margin used to envelop the trained data. The higher the score, the more likely that segment of data included artifact(s). The isolation forest scores were made negative to match the directionality of the other two methods.
In step 314, a set of candidate thresholds were identified for evaluation to determine the appropriate threshold for identifying artifacts for each unsupervised method for each subject. The labeled artifact at each threshold can be described by an associated inter-artifact interval distribution. The skewness and/or kurtosis of that distribution can be computed and plotted by threshold.
The process used to select these thresholds relies on specific insight about how the unsupervised learning methods label artifacts. For each dataset, as the threshold on any of the unsupervised method scores is decreased, the portions of data that are labeled artifact increase in discrete jumps with more subtle changes in between. The most ‘correct’ labeling of artifact is likely to occur at one of these discrete jumps since each jump represents the additional labeling of one similar cluster of data as artifact whereas gradual changes represent a continuous spectrum of subtle differences within similar clusters. True artifacts are very similar to each other and distinctly different from true data; therefore, there should be no need to rely on subtle differences.
To identify the discrete jump that represents the most ‘correct’ labeling of artifact, we can take advantage of the fact that each discrete jump dramatically changes the inter-artifact interval distribution by introducing long gaps between subsequent artifact labels. Therefore, the skewness and kurtosis (3rd and 4th moments) of the inter-artifact interval distribution are computed across thresholds for each unsupervised method. Since discrete jumps in labeled artifact skew the inter-artifact interval distribution, the jumps can be identified by local maxima in skewness and kurtosis.
As an example, the candidate thresholds for each unsupervised method were identified using the findpeaks function in Matlab to identify local maxima in the skewness vs threshold curve, using minimum peak prominence of 0.1. Peak prominence is defined as the height of a local maximum above the higher of the two neighboring troughs on either side of the peak.
As another example, the thresholds at which local maxima in skewness and kurtosis occur for each unsupervised method can be tested by visually inspecting the labeled artifact. By using a binary search method to streamline which local maxima are tested, about 5-6 thresholds are visually inspected for each dataset. The best of these thresholds for each unsupervised method is selected by visual inspection. For each unsupervised learning method, once the ideal threshold is chosen, the total proportion of data labeled artifact can be computed, as can the longest single continuous artifact.
In step 316, optimal thresholds on the scores to determine artifact returned by the unsupervised machine learnings methods were selected. Each of the candidate thresholds was assessed by detecting and removing artifact using that threshold and then computing three metrics on the corrected signal: the maximum standard deviation in any half-second window (localSTDmax); the ratio of the maximum standard deviation in any half-second window to the 90th percentile standard deviation in any half-second window (localSTDmax/localSTD90); and the ratio of the maximum standard deviation in any half-second window to the median standard deviation in any half-second window (localSTDmax/localSTDmed). Then, the three metrics were converted into score components by computing their differences from limits of 0.48, 6, and 10 respectively, and penalizing the distance above each limit by twice as much as the distance below, as described in the formula below. The limits were chosen by examining characteristics of artifact-free EDA data.
The final score for each candidate threshold was computed as the sum of the three score components and the value of the threshold itself (penalizing higher thresholds).
In step 318, the best combination of unsupervised method and optimal threshold were chosen. The best unsupervised method for each dataset was chosen as the one that labeled the smallest proportion of artifact at its optimal threshold (implying that it also satisfied all previous conditions). The proportion of data labeled artifact at this threshold was recorded. The goal is to select the combination of unsupervised method and optimal threshold that is the most precise in its selection of artifact without compromising true data.
In step 320, the EDA data were corrected after artifact detection. After identifying and removing the artifact in the EDA data while preserving as much of the true data as possible, any ‘islands’ of true data that are shifted upward or downward due to artifactual deflection are translated back (downward or upward) based on the linearly interpolated mean of the data at that time. The islands are typically clear in visual inspection and quantitatively identifiable based on their duration and distance shifted up or down from the neighboring EDA data. The duration and average distance are hyperparameters that can be adjusted by subject to ensure no islands were excluded. Islands were defined as being shorter than 20 seconds in duration and more than 0.12 μS from the linearly interpolated mean of the neighboring EDA data.
After translating the ‘islands’ back, the gaps created by removing artifacts can be filled using linear interpolation once more to create continuous data. This is where the duration of the longest continuous artifact can be useful. Using linear interpolation to fill in a few seconds of data at a time will likely not affect downstream analysis; however, interpolating a few minutes at a time could.
Artifact Removal from Clinical EDA Data
We tested our artifact identification and removal process on EDA data recorded from 70 subjects (38 female), collected under protocol approved by the Massachusetts General Hospital (MGH) Human Research Committee. All subjects were undergoing laparoscopic urologic or gynecologic surgery at MGH. The EDA data were recorded from the most proximal phalanges of two digits of each subject's left hand at 256 Hz using the Thought Technology Neurofeedback System. The electrodes were placed as soon as the patient entered the operating room, before induction of anesthesia, and were removed only after extubation at the end of surgery. The data were fed in real-time to a laptop located at the head of the operating table, near the anesthesiologist, and monitored the whole time by a member of the study team to ensure signal quality. Due to logistical concerns, EDA data collection from one subject (Subject 31) was ended before the onset of cautery, and therefore we excluded data from that subject from this analysis. EDA data from the remaining 69 subjects were analyzed using Matlab 2020b.
TABLE 2 (below) summarizes the results from all three unsupervised learning methods for all 69 subjects, including the final threshold chosen for each subject, proportion of data labeled artifact, and longest single continuous artifact for each unsupervised learning method. The best method, by smallest proportion of artifact removed (removing the least excess signal) and shortest maximum continuous artifact, is in bold for each subject. According to the proportion labeled artifact, isolation forest was the best method for 50 of the 69 subjects, KNN distance for 14 subjects, and 1-class SVM for 4 subjects, and both 1-class SVM and isolation forest were identical for one subject. Across all of the subjects, using isolation forest, the proportions of artifact ranged from 0.7% to just under 18% and the longest contiguous artifact from 6 seconds to 194 seconds. For 52 of the 69 subjects (˜75%), the proportion labeled artifact was 10% or less and the longest continuous artifact was 30 seconds or shorter.
0.0418
20.0039
20.0039
0.0655
29.5039
0.0436
17.5039
0.1009
30.5039
0.0583
21.5039
21.5039
21.5039
0.1565
34.5039
0.1155
18.5039
18.5039
18.5039
0.0601
17.5039
17.5039
17.5039
0.1154
16.5039
16.5039
16.5039
0.0553
42.0039
0.0629
0.0629
21.5039
21.5039
21.5039
0.0508
29.0039
0.0507
9.0039
0.1014
18.5039
0.0805
19.5039
19.5039
19.5039
0.0496
20.0039
20.0039
20.0039
0.1768
8.5039
0.0677
14.0039
14.0039
14.0039
0.069
28.5039
28.5039
28.5039
0.1057
138.0039
0.0697
21.5039
0.0319
13.0039
13.0039
0.0457
6.0039
6.0039
0.0365
28.5039
28.5039
28.5039
0.06
28.5039
28.5039
28.5039
0.0828
13.5039
0.1349
30.5039
30.5039
0.0549
23.5039
23.5039
23.5089
0.0812
40.5039
40.5039
0.2180
29.5039
0.1215
193.5039
193.5039
193.5039
0.0563
16.0039
0.0977
16.0039
0.0935
24.5039
0.1484
22.5039
22.5039
22.5039
0.0933
13.5039
13.5039
13.5039
0.1172
28.5039
28.5039
28.5039
0.0879
17.5039
17.5039
17.5039
0.0762
21.0039
0.0425
15.0039
15.0039
15.0039
0.0696
11.5039
11.5039
0.1032
12.0039
0.0915
31.5039
0.1126
32.5039
32.5039
32.5039
0.0934
27.0039
27.0039
0.0234
17.0039
17.0039
0.0512
11.0039
11.0039
0.0946
18.0039
18.0039
0.1002
16.5039
16.5039
16.5039
0.0911
19.0039
19.0039
0.0888
21.0039
21.0039
0.1382
32.5039
32.5039
32.5039
0.0825
26.0039
0.029
15.5039
15.5039
15.5039
0.1435
24.5039
24.5039
0.0352
23.5039
0.0044
11.5039
0.0129
10.0039
10.0039
0.0974
14.5039
14.5039
14.5039
0.0659
17.0659
0.0527
13.5039
13.5039
0.1179
23.6294
0.0066
6.0039
6.0039
0.0741
23.5039
23.5039
23.5039
0.167
14.5039
0.08
8.0039
8.0039
0.1069
26.0039
0.0853
14.0039
0.1002
20.0039
TABLE 3 (below) contains the hyperparameter values for identification of islands for each subject for all three unsupervised methods.
Without being bound by any particular theory, isolation forest may perform best because it is nonparametric. Random forest, its cousin for supervised learning, is also a really powerful process for similar reasons and random forest is often used as the benchmark to beat for deep learning processes.
Simulated EDA data containing artifacts was created using artifact-free EDA data processed using the method in
Discussion of Experimental Artifact Removal from Raw EDA Data
Unsupervised (machine) learning methods as implemented above successfully removed artifacts due to surgical cautery and movement from the EDA data collected from 69 patients. This overcomes a major barrier for EDA to be used clinically. We specifically focused on unsupervised learning methods to emphasize practicality at the implementation stage, since unsupervised learning does not require a manually labeled training set. We tested three unsupervised learning methods: isolation forest, K-nearest neighbor (KNN) distance, and 1-class support vector machine. We also compared existing methods such as variational mode decomposition and wavelet decomposition, as well as intuitive heuristic-based rules such as thresholding at zero or thresholding the derivative of the signal, for a representative subsample of subjects. Across all 69 subjects, the three unsupervised methods were able to remove all artifacts. No other methods was able to fully remove artifacts in the tested subsample of subjects. Of the three unsupervised learning methods tested, isolation forest was the most discerning and parsimonious in not mislabeling true EDA as artifact for the majority of subjects (51 out of 69). Therefore, we used isolation forest for all subjects to arrive at the final artifact-free EDA data.
Our methodology did not require any manual labeling of training data, which would be extremely time-intensive and impractical in clinical settings. Despite the absence of training data, our methodology successfully removed cautery artifact from the data even when true EDA data were interspersed between sections of heavy artifact. This is indicated by the fact that even when the cautery seems to be continuous to the eye, the total proportion of labeled artifact in most cases (52 out of 69 subjects) was 10% or lower. In the subset of subjects in which comparison methods are shown in
In addition, most of the labeled artifact was in short segments of under 30 seconds. The longest continuous artifact only exceeded 60 seconds for 9 of the 69 subjects. This is useful to consider in terms of downstream analysis since relevant information about sympathetic activity is contained in the dynamic pulse-like phenomena in EDA. Any method that modifies long, continuous chunks of data could affect the readout of dynamic activity in that timeframe. In contrast, short regions of missing data can be interpolated since they are likely to contain only a few pulses, and the missing data can be account for in estimation of uncertainty.
While our methodology used some of the same features as existing methods, we allowed the unsupervised methods to ‘learn’ the difference between artifact and true signal for each dataset on their own rather than hardcoding rules. The selected features, including those that overlap with existing methods, simply highlighted relevant characteristics of the data, based on the physiology of EDA and observations about the nature of cautery-related artifact. A straightforward expansion of this approach for other types of “clearly visible” artifact in modalities such as ECG and EEG could be implemented using custom feature definition, again informed by the physiology and nature of artifact in those signals.
EDA data have great potential as a clinical marker of sympathetic activation; however, it is limited by the lack of hardware systems and software tools that have been built specifically for the clinical setting. This includes the crucial steps of artifact removal, specific to the degree and types of artifacts that occur in clinical settings. Cautery interference during surgery is among the most intense cause of artifacts since there is an abundance of high-powered electrical equipment in use during surgery. Since existing methods were built for purely experimental settings, none are effective in this scenario. However, we have demonstrated that our method is successfully able to recover viable EDA signal even in this situation, which allows EDA to be of clinical use as a marker of sympathetic activation even in the operating room. An inventive clinical EDA system can identify and remove artifacts as they occur using these techniques. This enables the eventual of EDA into clinical workflows as a biomarker of sympathetic activation, for example, to track unconscious pain during surgery.
The EDA sensor 820 identifies and removes artifacts from its EDA data using unsupervised machine learning methods, also called unsupervised machine learning or unsupervised learning, conducted by the processor 824, according to the methods described above. If it has access to a large enough memory, the processor 824 can perform all of the steps of the artifact-removal methods discussed above with respect to
If desired, the processor 824 can be trained on large EDA datasets before being deployed to eliminate the need for performing the threshold selection in real time. Alternatively, if the processor 824 is calibrated on a certain person's data, for example, every 30 days or so, the artifact-removal parameters (e.g., selected unsupervised machine learning method and optimal thresholds) can be fit specifically to that person, trained on their EDA data, and then frozen so the artifacts can be removed with less processing.
While various inventive embodiments have been described and illustrated herein, those of ordinary skill in the art will readily envision a variety of other means and/or structures for performing the function and/or obtaining the results and/or one or more of the advantages described herein, and each of such variations and/or modifications is deemed to be within the scope of the inventive embodiments described herein. More generally, those skilled in the art will readily appreciate that all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the inventive teachings is/are used. Those skilled in the art will recognize or be able to ascertain, using no more than routine experimentation, many equivalents to the specific inventive embodiments described herein.
The foregoing embodiments are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, inventive embodiments may be practiced otherwise than as specifically described and claimed. Inventive embodiments of the present disclosure are directed to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, kits, and/or methods, if such features, systems, articles, materials, kits, and/or methods are not mutually inconsistent, is included within the inventive scope of the present disclosure.
Also, various inventive concepts may be embodied as one or more methods, of which an example has been provided. The acts performed as part of the method may be ordered in any suitable way. Accordingly, embodiments may be constructed in which acts are performed in an order different than illustrated, which may include performing some acts simultaneously, even though shown as sequential acts in illustrative embodiments.
All definitions, as defined and used herein, should be understood to control over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms.
The indefinite articles “a” and “an,” as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to mean “at least one.”
The phrase “and/or,” as used herein in the specification and in the claims, should be understood to mean “either or both” of the elements so conjoined, i.e., elements that are conjunctively present in some cases and disjunctively present in other cases. Multiple elements listed with “and/or” should be construed in the same fashion, i.e., “one or more” of the elements so conjoined. Other elements may optionally be present other than the elements specifically identified by the “and/or” clause, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, a reference to “A and/or B”, when used in conjunction with open-ended language such as “comprising” can refer, in one embodiment, to A only (optionally including elements other than B); in another embodiment, to B only (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
As used herein in the specification and in the claims, “or” should be understood to have the same meaning as “and/or” as defined above. For example, when separating items in a list, “or” or “and/or” shall be interpreted as being inclusive, i.e., the inclusion of at least one, but also including more than one, of a number or list of elements, and, optionally, additional unlisted items. Only terms clearly indicated to the contrary, such as “only one of” or “exactly one of,” or, when used in the claims, “consisting of,” will refer to the inclusion of exactly one element of a number or list of elements. In general, the term “or” as used herein shall only be interpreted as indicating exclusive alternatives (i.e., “one or the other but not both”) when preceded by terms of exclusivity, such as “either,” “one of” “only one of” or “exactly one of.” “Consisting essentially of,” when used in the claims, shall have its ordinary meaning as used in the field of patent law.
As used herein in the specification and in the claims, the phrase “at least one,” in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements. This definition also allows that elements may optionally be present other than the elements specifically identified within the list of elements to which the phrase “at least one” refers, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, “at least one of A and B” (or, equivalently, “at least one of A or B,” or, equivalently “at least one of A and/or B”) can refer, in one embodiment, to at least one, optionally including more than one, A, with no B present (and optionally including elements other than B); in another embodiment, to at least one, optionally including more than one, B, with no A present (and optionally including elements other than A); in yet another embodiment, to at least one, optionally including more than one, A, and at least one, optionally including more than one, B (and optionally including other elements); etc.
In the claims, as well as in the specification above, all transitional phrases such as “comprising,” “including,” “carrying,” “having,” “containing,” “involving,” “holding,” “composed of,” and the like are to be understood to be open-ended, i.e., to mean including but not limited to. Only the transitional phrases “consisting of” and “consisting essentially of” shall be closed or semi-closed transitional phrases, respectively, as set forth in the United States Patent Office Manual of Patent Examining Procedures, Section 2111.03.
This application claims the priority benefit, under 35 U.S.C. 119(e), of U.S. Application No. 63/251,107, filed on Oct. 1, 2021, and entitled “Surgical Cautery Artifact Removal from Electrodermal Activity Data,” which is incorporated herein by reference in its entirety for all purposes.
Number | Date | Country | |
---|---|---|---|
63251107 | Oct 2021 | US |