This application relates generally to methods and systems of active learning. More specifically, this application relates to long-term active learning from large continually changing data sets, including the autonomous development of predictive models. This application also relates to methods and systems that apply active learning models to predict specific out comes. These outcomes can be in the medical, military, and/or robotics arenas, to name a few.
There are numerous applications in which active-learning techniques are needed, ranging among medical applications, engineering applications, manufacturing applications to and others. Examples of such active-learning techniques include expert-system techniques, iterative techniques, neural-network techniques and genetic algorithms, among others.
An expert system essentially uses a machine to reproduce the performance of human experts. It typically relies on the creation of a knowledgebase, that uses a knowledge-representation formalism to capture the knowledge of subject-matter experts. The knowledgebase is populated by gathering the relevant knowledge from the subject-matter experts and codifying it according to the representation formalism. Commonly, a learning component is included so that the content of the knowledgebase may be modified as the expert system is used in the same real-world problem-solving circumstances as are considered by the subject-matter experts, thereby improving its performance.
Iterative techniques begin with a seed solution to a defined problem that is processed by a formalism to produce a result that is compared with an observed result. If the formal result differs by more than a defined amount from the observed result, the solution is modified and reprocessed by the formalism. Various techniques are applied so that the modifications of the solution are driven towards converging the formal result with the observed result. When the convergence is achieved at a satisfactory level, the solution is taken as well approximating the real-world conditions that produced the observed result.
Neural networks typically include a plurality of nodes, with each node having a weight value associated with it. One layer of nodes is an input layer that has a plurality of input nodes and another layer of nodes is an output layer that has a plurality of output nodes, with at least one intermediate layer of nodes there between. Input data are provided to the layer of input nodes and the weight values applied by the network to generate results at the layer of output nodes. To train the neural network, the resulting output values are compared against correct interpretations of known samples. If the output value in such a comparison is incorrect, the network modifies itself to arrive at the correct value. This is achieved by connecting or disconnecting certain nodes and/or adjusting the weight values of the nodes during the training. Once the training is completed, the resulting layer/node configuration and corresponding weights represent a trained neural network, which is then ready to receive unknown data and make interpretations based on the data. Self learning and/or predictive models that can handle large amounts of possibly complex, continually changing data have not been described or successfully implemented for medical care.
Appropriate resuscitation of an injured patient demands an accurate assessment of physical exam findings, correct interpretation of physiological changes and an understanding of treatment priorities. Resuscitative trauma care is provided by a broad range of individuals with varying levels of interest and experience. It can require a large amount of information be quickly gathered, accurately interpreted and meaningfully conveyed to a coordinated group of local and downstream healthcare providers.
Traumatic brain injury (TBI) and exsanguination are the two most common causes of death during the resuscitative phase of trauma care. The management of head injury, hemorrhage and fluid resuscitation are therefore integral parts of early trauma care.
Traumatic brain injury (TBI) is a common and devastating condition. It is the number one cause of death and disability in the pediatric population, affecting over half a million children annually in the U.S. TBI accounts for approximately 60,000 adult and pediatric deaths in the U.S. each year. TBI outcome depends on the severity of primary brain injury (direct injury to the brain due to mechanical insult) and the effectiveness of preventing or limiting secondary brain injury (defined as damage to the brain due to the body's physiological response to the initial mechanical insult). The cranium is a bony compartment with a fixed volume. Following head trauma, blood vessels within and around the brain may rupture and bleed into the brain (causing intracerebral hemorrhage) and/or around the brain (causing development of an epidural and/or subdural hematoma to form). Bleeding in this fashion compresses the brain. The brain also swells as a result of injury. These types of secondary injury increase the intracranial pressure and decrease cerebral perfusion, leading to brain ischemia. Brain ischemia causes further brain swelling, more ischemia and if not treated and managed appropriately, brain herniation through the base of the skull (where the spinal cord exits) and death.
Evidence based guidelines for the management of severe traumatic brain injury have been developed, yet a wide spectrum of methods still characterizes most monitoring and treatment strategies. The most widely used, current method for intracranial pressure monitoring involves placement of an intracranial pressure monitoring device. This is an invasive procedure that involves cutting the scalp and drilling a hole through the patient's cranium, so that a pressure transducer can be inserted in or on top of the brain. Newer, non-invasive methods for intracranial pressure and cerebral perfusion monitoring have been described; however, these methods are still considered experimental and none are in clinical practice. These non-invasive, intracranial pressure monitoring methods include: to transcranial Doppler ultrasonography; transcranial optical radiation, such as near-infrared spectroscopy; ophthalmodynamometry; arterial pulse phase lag; and ocular coherence tomography.
Posttraumatic seizure (PTS) is associated with severe primary brain injury and, importantly, could itself also act as a type of secondary brain injury. Electrographic only posttraumatic seizures, which can be seen in up to 45% of pediatric moderate-severe TBI patients, have been shown to cause elevated ICP and metabolic stress. Moreover, posttraumatic seizures (occurring ≦7 days post-injury) have been shown to negatively impact outcome and increase morbidity. Thus, posttraumatic seizure is a potential therapeutic target and one of the few potentially preventable causes of secondary brain injury following TBI.
It is difficult to identify at-risk patients who will benefit from early anti-seizure prophylaxis and prevention of acute secondary brain injury. Clinical markers, such as mental status and seizure-like movements, can be monitored; however, these markers of PTS are often masked by altered mental status/coma, sedatives and paralytics, and even anticonvulsants. Continuous electroencephalographic (cEEG) monitoring in moderate-severe TBI has been shown in the adult literature to increase PTS detection rates by 22-33%. This is a labor intensive method requiring the collection of visual and continuous 21 channel EEG data. This large volume of data must then be reviewed by a trained epileptologist. Further, it is unclear which of the available anticonvulsants are most useful in adults and children, based on antiepileptic effect, antiepileptogenic effects, duration of treatment, and effect on outcome.
Prior research has been done on the automated identification of seizures in cEEG data, achieving detection rates of 70-80% and 1-3 false positives per hour, but the work has not yet yielded a product or prototype. These systems have typically been rule-based, where a set of feature detectors are combined using thresholds and qualitative or quantitative constraints.
Fluid resuscitation strategies are poorly understood, difficult to study and variably practiced. Inadequate resuscitation poses the risk of hypotension and end organ damage. Conversely, aggressive fluid resuscitation may dislodge clots from vascular injuries, resulting in further blood loss, hemodilution and death. How to best proceed when one is dealing with a multiply-injured patient who has a traumatic brain injury and exsanguinating hemorrhage can be especially difficult. Under resuscitation can harm the already injured brain, whereas overresuscitation can reinitiate intracranial bleeding and exacerbate brain swelling, leading to brain herniation, permanent neurological injury and oftentimes death.
Embodiments of the invention can be implemented to use high dimensional, complex domains, where large amounts of variable, possibly complex data exist on a continuous, and/or possibly dynamically changing timeline. Various embodiments can be implemented in disparate fields of endeavor. For example, embodiments of the invention can be implemented in the fields of robotics and medicine. In the field of robotics, embodiments of the invention can use real-time image (and information derived from other sensors modalities) analysis, high speed data processing and highly accurate decision-making to enable robot navigation in outdoor, unknown unstructured environments. Embodiments of the invention can also be applied to physiological (vital sign) and clinical data analysis in the field of medicine. In such embodiments, an algorithm can discover and model the natural, complex, physiological and clinical relationships that exist between normal, injured and/or diseased organ systems, to accurately predict the current and future states of a patient.
In some embodiments of the invention methods are provided for autonomously building a predictive model of outcomes. A most-predictive set of signals Sk can be autonomously identified out of a set of signals s1, s2, . . . , sD for each of one or more outcomes ok. A set of probabilistic predictive models ôk=Mk(Sk) can be autonomously learned, where ôk is a prediction of outcome ok derived from the model Mk that uses as inputs values obtained from the set of signals Sk. The step of autonomously learning can be repeated incrementally from data that contains examples of values of signals s1, s2, . . . , sD (possibly dynamically changing) and corresponding outcomes o1, o2, . . . , oK.
In some embodiments autonomously learning can include using a linear model framework to identify predictive variables for each increment of data. The linear model framework may be constructed with the form
where fk is any mapping from one input to one output and a0, a1, . . . , ad are linear model coefficients. In some embodiments, autonomously learning can include determining or estimating which signals are not predictive from the set of signals and outcomes. The corresponding coefficients for these signals can then be set to 0. An autonomous learning method can then build a predictive density model using these predictive coefficients, signals, and/or outcomes. In some embodiments, the method can repeat each time a new signal outcome pair is received or encountered that is predictive.
Embodiments of the invention also provide methods for predicting volume of acute blood loss from a patient. Data values are collected from one or more physiological sensors attached to the patient. A hemodynamic compensation model is applied to the collected data values to predict the volume of acute blood loss from the patient. The hemodynamic compensation model can be previously generated from a plurality of data values collected from physiological sensors attached to a plurality of subjects.
Embodiments of the invention can also provide methods for predicting volume of acute blood loss from a patient that will cause hemodynamic decompensation, also termed cardiovascular (CV) collapse. Data values are collected from one or more physiological sensors attached to the patient. A hemodynamic compensation model is applied to the collected data values to predict the volume of acute blood loss from the patient that will cause CV collapse. The hemodynamic compensation model can be previously generated from a plurality of data values collected from physiological sensors attached to a plurality of subjects.
In some embodiments, the one or more physiological sensors may comprise an electrocardiograph, a pulse oximeter, a transcranial Doppler sensor, or a capnography sensor, among others. The collected data values may include a photoplethysmograph, a perfusion index, a pleth variability index, cardiac output, heart stroke volume, arterial blood pressure, systolic pressure, diastolic blood pressure, mean arterial pressure, systolic pressure variability, pulse pressure, pulse pressure variability, stroke volume, cardiac index, or near-infrared spectroscopy data, among others.
Embodiments of the invention also provide methods for determining brain pressures within a subject. A plurality of parameters are measured from the subject. The parameters are applied to a model that relates the parameters to various brain pressures, with the model having been derived from application of a machine-learning algorithm. This allows the brain pressures to be determined from the model.
The brain pressure may comprise an intracranial pressure or a cerebral perfusion pressure in different embodiments. The plurality of parameters may comprise heart rate, systolic blood pressure, diastolic blood pressure, mean arterial pressure, cardiac output, pulse oximetry data, carotid blood flow, among others.
Embodiments of the invention also provide methods detecting seizures based on continuous EEG waveform data from a subject. A plurality of parameters can be derived from cEEG data measured from the subject. The parameters are applied to a model that relates the parameters to seizure waveform activity, with the model having been derived from application of a machine-learning algorithm. This allows seizure activity to be determined from the model.
Autonomous learning methods, robot navigation methods, acute blood loss determination methods, prediction of CV collapse, brain pressure determination methods and detection, as well as prediction, of seizure activity can be embodied on a system having an input device and a processor provided in electrical communication with the input device. The processor can include a computer-readable storage medium that includes instructions for implementing the method as described.
A further understanding of the nature and advantages of the present invention may be realized by reference to the remaining portions of the specification and the drawings.
Embodiments of the invention provide methods and systems for autonomously building predictive models of current and future outcomes using large amounts of possibly complex, continually changing, incrementally available data. A general predictive model is disclosed followed by specific augmentation to the predictive model in specific applications. Prior to describing the predictive model, an example of a computational device is disclosed that can be used to implement various embodiments of the invention. Following the description of the predictive model, specific embodiments are disclosed implementing the predictive model in various aspects.
Embodiments of the invention provide methods and systems for autonomously building predictive models of current and future outcomes, using large amounts of possibly complex, continually changing, incrementally available data. Such embodiments find application in a diverse range of applications. Merely by way of illustration, some exemplary applications include autonomous robot navigation in unknown, outdoor unstructured, environments; a human hemorrhaging model for the continuous, noninvasive detection of acute blood loss; and/or a human hemorrhaging model for fluid resuscitation and the prediction of cardiovascular collapse and intracranial pressure. Such examples are not intended to limit the scope of the invention, which is more generally suitable for any to application in which current and future outcomes are desired to be known on the basis of large, continually changing datasets.
Computation Device
The predicative and/or self learning models may be embodied on computation devices, a typical structure for which is shown schematically in
Computational device 100 can also include software elements, shown as being currently located within working memory 120, including an operating system 124 and other code 122, such as a program designed to implement methods of the invention such as predictive and/or self learning algorithms disclosed throughout the specification. It will be apparent to those skilled in the art that substantial variations may be made in accordance with specific requirements. For example, customized hardware might also be used and/or particular elements might be implemented in hardware, software (including portable software, such as applets), or both. Further, connection to other computing devices such as network input/output devices may be employed.
A Self-Learning Predictive Model
A self-learning predictive model (or machine learning) method is provided with the flow diagram 200 of
As indicated at block 216, this is achieved by identifying the most predictive set of signals Sk, where Sk contains at least some (and perhaps all) of the derived signals s1, . . . , sD for each outcome ok, where kε{1, . . . , K}. A probabilistic predictive model ok=Mk (Sk) is learned at block 220, where ôk is the prediction of outcome ok derived from the model Mk that uses as inputs values obtained from the set of signals Sk, for all kε{1, . . . , K}. Method 200 can learn the predictive models ôk=Mk(Sk) incrementally from data that contains example values of signals s1, . . . , sD and the corresponding outcomes ok, . . . , oK. As the data become available, the method loops so that the data are added incrementally to the model for the same or different sets of signals Sk (for all kε{1, . . . , K}).
While the above outlines the general characteristics of the methods, additional features are noted. A linear model framework may be used to identify predictive variables for each new increment of data. In a specific embodiment, given a finite set of data of signals and outcomes {({right arrow over (s)}1,{right arrow over (o)}1), ({right arrow over (s)}2,{right arrow over (o)}2), . . . }, a linear model may be constructed that has the form, for all kε{1, . . . , K}:
where fk is any mapping from one input to one output, and a0, a1, . . . ad are the linear model coefficients. The framework used to derive the linear model coefficients may estimate which signals s1, s2, . . . , sd are not predictive and accordingly sets the corresponding coefficients a1, a2, . . . , ad to zero. Using only the predictive variables, the model builds a predictive density model of the data, {({right arrow over (s)}1,{right arrow over (o)}1), ({right arrow over (s)}2,{right arrow over (o)}2), . . . }. For each new increment of data, a new predictive density models can be constructed.
In some embodiments, a prediction system can be implemented that can predict future results from previously analyzed data using a predictive model and/or modify the predictive model when data does not fit the predictive model. In some embodiments, the prediction system can make predictions and/or adapt the predictive model in real-time. Moreover, in some embodiments, a prediction system can use large data sets to not only create the predictive model, but also predict future results as well as adapt the predictive model.
In some embodiments, a self-learning, prediction device can include a data input, a processor and an output. Memory can include application software that when executed can direct the processor to make a prediction from input data based on a predictive model. Any type of predictive model can be used that operates on any type of data. In some embodiments, the predictive model can be implemented for a specific type of data. In some embodiments, when data is received the predictive model can determine whether it understands the data according to the predictive model. If the data is understood, a prediction is made and the appropriate output provided based on the predictive model. If the data is not understood when received, then the data can be added to the predictive model to modify the model. In some embodiments, the device can wait to determine the result of the specified data and can then modify the predictive model accordingly. In some embodiments, if the data is understood by the predictive model and the output generated using the predictive model is not accurate, then the data and the outcome can be used to modify the predictive model. In some embodiments, modification of the predictive model can occur in real-time.
In some embodiments, a predictive model can be used for medical data, robotics data, weather data, financial market data, traffic pattern data, etc.
General Physiological Predictions
Embodiments of the present invention provide for real time prediction of physiological conditions using various physiological data. Physiological data can be received (e.g., input) from a physiological sensor that is measuring a physiological state of a patient. Physiological feature data can then be derived from the physiological data. For example, a Finometer (physiological sensor) can be used to measure the blood pressure of a patient and provide blood pressure data (physiological data). From the blood pressure data blood volume data (physiological feature data) can be derived. Various other physiological feature data can be derived from the physiological data. From the physiological feature data a prediction can be made about a physiological threshold where patient state is reached (e.g., trauma or shock). The prediction can be based on a large data set of physiological feature data. Moreover, the prediction can use any type of predictive algorithm and/or can be self learning. In some embodiments, a user interface can provide the physiological feature data along with the predicted threshold. Such a user interface can allow a user to determine whether the physiological feature data is converging and/or diverging with the threshold data.
Patient Blood Volume
Hemorrhage is a problem that surgeons commonly face. It accounts for 40% of all trauma deaths and it is the most frequent cause of preventable death after severe injury. Tissue trauma can cause hemorrhage, which initiates coagulation and fibrinolysis. Shock is a primary driver of early coagulopathy. In fact, several groups have noted a linear correlation between the severity of tissue hypoperfusion and the degree of admission coagulopathy as measured by the prothrombin time (PT) and partial thromboplastin time (PTT). Recent evidence suggests that the early identification of hemorrhage, together with treatment directed at the prevention of hypotension, correction of post-injury coagulopathy and stopping the bleeding can lead to dramatic reductions in the morbidity and mortality of severely injured patients.
The problem is that humans cannot detect early signs of hemorrhage by looking at a patient's vital signs. Standard vital signs, such as heart rate, blood pressure, and arterial oxygen saturation appear to a human to change very little until a patient has lost about 30% of their total blood volume. Late detection of acute blood loss is associated with inadequate fluid resuscitation. Inadequate resuscitation poses the risk of hypotension, end organ damage and worsening coagulopathy. Conversely, aggressive fluid resuscitation may dislodge clots from vascular injuries, resulting in further blood loss, hemodilution and possibly death.
In some embodiments, the predictive model can be used to predict blood loss volume. Such embodiments can be used to detect the early signs of hemorrhage. In order to make bleeding related treatment decisions, embodiments described herein can provide information about how much blood a patient has lost. In some embodiments, the self-learning predictive model described above can be implemented to measure blood loss volume. Such predictions can be useful, for example, to aide in determining whether a wounded soldier is safe to remove from the battlefield without an IV, or whether the wounded soldier should receive intravenous fluid(s) (such as blood or saline) and/or medication prior to and during extraction.
Embodiments of the invention can also predict when an individual patient will experience CV collapse. This can be important, because individual patients experience hemodynamic decompensation at differing volumes of blood loss. On the battlefield, medics must also establish a triage order and evacuate potential survivors at greatest risk for CV collapse first. In civilian settings, paramedics and emergency medicine technicians (EMTs) must respond similarly to quickly determine who should be transported first and where. Some embodiments of the invention can provide objective, real time guidance during this critical decision-making process.
A general overview of a structure used in embodiments of the invention is provided with
There are numerous sensors 304 that may be used in different embodiments, some of which are described herein. For example, an electrocardiograph may be used to measure the heart's electrical activity using electrodes placed specifically on the subject's 308 body. A pulse oximeter or a photoplethysmograph can be used, for example, to measure ratios of deoxygenated and oxygenated blood. As another example, a Finometer, impedance cardiography, and Finopres systems can be used to measure systolic blood pressure, diastolic blood pressure, mean arterial blood pressure, pulse pressure variability, stroke volume, cardiac output, cardiac index, and/or systolic blood pressure variability (mmHg). In another example, an infrared spectrometer can be used to measure tissue oxygenation. As another example, a transcranial Doppler system can be used to measure blood flow velocities in intracranial blood vessels. As another example, a capnogram can be used to monitor the inhaled and exhaled concentration or partial pressure of carbon dioxide (CO2). As yet another example, an impedance cardiograph can be used to measure stroke volume.
While the following describes the use of a few specific sensors, this disclosure can be extended to data collected using other measurement devices, such as those described above. The output of an electrocardiograph describes cardiac muscle activity through voltages along different directions between electrode pairs. The typical electrocardiograph waveform is described as a P wave, a QRS complex, and a T wave. Heart rate can be extracted from the waveform and considerable attention has been given to heart rate variability for evaluating autonomic dysfunction and its correlation to events such as increased intracranial pressure and death due to traumatic injury. The performance of heart rate variability for predicting traumatic head injury is improved by considering factors such as heart rate, blood pressure, sedation, age, and gender. There are various algorithmic definitions for computing heart rate variability from R-R intervals, which appear to perform equivalently as long as they are calculated over extended intervals, such as over five minutes or more.
Pulse oximeters and photoplethysmographs may also be used. In their basic form, pulse oximeters use the differing properties of deoxygenated and oxygenated hemoglobin for absorbing red and infrared light. Red and infrared LEDs shine through a relatively translucent site such as the earlobe or finger and a photodetector on the other side receives the light that passes through. The observed values are used to compute the ratio of red to infrared intensity, which can be used to look up the subject's saturation of peripheral oxygen level from precomputed tables. As the heart beats, blood pulses through the arteries in the measurement location, causing more light to be absorbed, thus yielding a waveform of light signals over time. This photoplethysmograph (“PPG”) can be used to determine heart rate, but also analyzed in its own right. Subtracting the trough DC values, which represent constant light absorbers, what remains are the absorption properties for the varying AC component, which is arterial blood. Advances in technology have seen more light wavelengths used to distinguish oxygen (O2) and carbon dioxide (CO2), thus making these systems more reliable.
Use of the raw PPG signal has been shown to be correlated to systolic pressure variation (“SPV”), which in turn is correlated with hypovolemia. A comparison of the correlation of ear and finger pulse oximeter waveforms to systolic blood pressure (“SBP”) has evaluated pulse amplitude, width, and area under the curve as extracted features. Metrics on the envelope of the PPG waveform have been used to reliably detect blood sequestration of more than one liter induced by LBNP. A linear predictor for cardiac output (“CO”) has been constructed based on heart rate and features extracted from the ear PPG waveform.
The perfusion index (“PI”) expresses the varying versus stationary components of infrared light in the PPG as a percentage:
The correlation of PI and core-to-toe temperature difference has been shown for critically ill patients.
The Pleth Variability Index (“PVI”) describes changes in PI over at least one respiratory cycle:
It has been demonstrated that PVI can predict fluid responsiveness in anaesthetized and ventilated subjects. It has also been demonstrated that PPG variation, pulse pressure variation (“PPV”), and systolic pressure variation (“SPV”) are well correlated to gradual autodonation to a reduction of 20% in systolic blood pressure.
Blood pressure and volume measurements may use the Finopres system, which may in turn use a volume clamp mechanism to measure the finger arterial pressure waveform as well as estimating parameters such as cardiac output (“CO”) and stroke volume (“SV”). The mechanism combines an infrared plethysmograph to determine baseline unloaded artery diameter and monitor blood volume, and an inflatable finger cuff that is controlled to maintain baseline diameter. Variation in cuff pressure provides an indirect way of measuring intra-arterial pressure.
Similar parameters can be obtained using impedance cardiography (“ICG”), which measures volumetric changes due to the cardiac cycle by observing changes in thoracic impedance. Current is passed through the chest between sensors, traveling through the aorta as the path of least resistance. As blood velocity and volume change in the aorta, corresponding changes in impedance are recorded as a continuous waveform, from which hemodynamic parameters such as CO and SV can be computed.
Many standard hemodynamic parameters intended to capture the behavior of the cardiac cycle are derived from blood pressure and heart-rate measurements. For example, arterial blood pressure (“ABP”) is the pressure in the arteries, which varies through the systolic and diastolic phases of the cardiac cycle. Systolic blood pressure (“SBP”) is the maximum ABP as the left ventricle contracts. It can be extracted as the peak values of the raw Finopres ABP waveform. Diastolic blood pressure (“DBP”) is the ABP when the heart is at rest. It can be measured from the troughs of the ABP waveform.
Mean arterial pressure (“MAP”) describes the mean arterial blood pressure over a cardiac cycle,
MAP=(CO×SVR)+CVP,
where CO is the cardiac output, SVR is the systemic vascular resistance, and CVP is the central venous pressure. The MAP can be approximated using more accessible parameters as
MAP≅DBP+⅛(SBP−DBP).
Systolic pressure variability SPV attempts to measure the change or variability in SBP over a respiration cycle. In general, it is the difference (or % change) between minimum and maximum SBP,
SPV=SBPmaxR−SBPminR.
Distinctions are also frequently made between delta up (dUp) and delta down (dDown) components. Correlations between SPV and dDown have been examined for hemorrhage and volume replacement, finding that they follow intravascular volume for mechanically ventilated patients. One conclusion has been drawn that dDown is an effective indicator of CO response to volume replacement for mechanically ventilated septic shock patients. In some embodiments, SPV and dDown are calculated as percentages of SBP in the case of hypotension.
Pulse pressure (“PP”) is the beat-to-beat change in blood pressure:
PP=SBP−DBP.
Pulse pressure variability (“PPV”) is also computed using minimum and maximum PP over the respiratory cycle:
PPV=PPmaxR−PPminR.
It has been shown that higher PPV percentages indicate which subjects in septic shock respond to fluids and also demonstrated a correlation between PPV and cardiac index. PPV can be an effective measure for fluid management.
Stroke volume (“SV”), or volume of blood pumped by the left ventricle in a single contraction, is the difference between the amount of blood in the ventricle at the end of the diastolic phase minus the blood remaining after the heart beat:
SV=(end diastolic volume)−(end systolic volume).
Since these constituent parameters are difficult to measure, SV is generally estimated from the ABP waveform. It has been shown that SV and PP derived from finometer BP estimates are correlated with blood loss.
Cardiac output (“CO”) is the volume of blood pumped per unit time:
CO=SV×HR.
Cardiac index (“CI”) relates the performance of the heart to the size of the subject using body surface area (“BSA”):
BSA can be estimated using height and mass of the individual, and it has been found that CI and mixed venous oxygen saturation show a linear relationship to blood loss.
In other embodiments, near-infrared spectroscopy is used for measuring tissue oxygenation. In such embodiments, near-infrared light is shone on the body and deeply penetrates skin, fat, and other layers where it is either scattered or absorbed. As with pulse oximeters, the differing absorption characteristics of oxyhemoglobin (O2Hb) and deoxyhemoglobin (HHb) are used to calculate concentrations based on light received by a detector. Other parameters such as pH and hematocrit can also be extracted from the spectra. This process has been modified to compensate for the interference of skin and fat layers to better measure muscle oxygen saturation (SmO2). Near-infrared spectroscopy measurements of SmO2 and pH have been tested as indicators of hemodynamic instability with subjects undergoing LBNP, with the conclusion that SmO2 is an early indicator of vasoconstriction and impending measurements of SmO2 and muscle oxygen tension (PmO2) to StO2 measured at the thenar eminence with a commercial device. Spectroscopic observations of PmO2 and SmO2 are thus early indicators of hemodynamic decompensation due to LBNP, while thenar StO2 did not change through the test.
Other noninvasive sensors, although less well investigated for monitoring hemorrhage, offer different system measurements that may contribute to the prediction system. Transcranial Doppler uses sound waves in the form of a pulsed Doppler probe to measure blood flow velocities in cerebral blood vessels (cerebral blood flow CBF). It poses challenges in determining recording locations with a clear path to the vessels of interest. CBF velocities have been used as an indicator for dynamic cerebral autoregulation under hypervolemia with hemodilution.
The respiration cycle is intimately related to the cardiac cycle and may offer relevant measurements. Capnography measures the concentration of carbon dioxide (CO2) in respiratory gases and is an indirect measure of the CO2 in arterial blood. Infrared light is passed through the gas sample, where CO2 absorbs it and a detector on the other side observes this decrease in light. End tidal CO2 (EtCO2), or the CO2 concentration at the end of exhalation, has been determined to have a logarithmic relationship to cardiac output. It has also been found that EtCO2 tracks SV in an LBNP model at progressive levels of central hypovolemia, but that the decreases are small relative to baseline measurements for subjects.
Thus, in some embodiments, a computational method for predicting the blood loss volume at which a patient will experience hemodynamic decompensation can be characterized by generating a predictive model that includes data signals {right arrow over (s)}=(s1, . . . , sD) that result in outcomes {right arrow over (o)}=(o1,o2) that ends or does not end in hemodynamic compensation.
At block 412 patient specific predictions based on the hemodynamic compensation models can be made from new data signals. At block 416 the predictions can be provided to a medical practitioner, who may provide semantic (machine readable) text to the predictive model, thus augmenting the result. At block 420, the results can be saved for future model building and or predictions.
In some embodiments, a computational device (e.g., computational device 100) can simultaneously predict: 1) blood loss volume and 2) individual specific blood loss volume for CV collapse. In some embodiments, the computational device can simultaneously graph predicted blood loss volume 1105 with predicted, individual specific blood loss volume for CV collapse to occur 1110, as shown in
In some embodiments, a computational device (e.g., computational device 130 in
In some embodiments, a device can implement embodiments of the present invention for monitoring fluid levels in a patient during the delivery of intravenous fluids. As a patient is being treated with IV fluids, the device can provide medical personal with real-time information on the effectiveness of IV fluid therapy as shown in
Noninvasive Prediction of Intracranial Pressure and Cerebral Perfusion Pressure
Embodiments of the invention provide a number of methods and systems related to monitoring and treating various cerebral parameters. According to some embodiments of the invention, hemodynamic and/or cerebral parameters can be diligently recorded and time-synchronized. Machine learning techniques and/or predictive models can be used with this data to determine whether there are undiscovered correlations between central and cerebral physiological variables, and such correlations may be used to diagnose, trend, and predict nearly instantaneous changes in intracranial (ICP) and cerebral perfusion pressures. These hemodynamic and/or cerebral parameters can include electrocardiograph measurements, arterial blood pressures, venous pressures, carotid blood flow, intrathoracic pressures, heart rate, cardiac output, intracranial pressures, end tidal carbon dioxide values, and blood gases.
A general overview of how embodiments of the invention may be implemented is illustrated with the flow diagram of
In addition to these parameters, the intracranial pressure and/or the cerebral perfusion pressure may be measured and calculated so that a model of intracranial pressure may be applied at block 512 to relate such values with the various parameters obtained at blocks 504 and 508. A machine-learning paradigm (e.g., the predictive model described above) can be applied at block 516 to enable the extraction of those parameters that are most relevant to determining the intracranial pressure and/or the cerebral perfusion pressure; the model may then be tailored for prediction of those quantities at block 520.
The resultant model may then be used diagnostically as indicated in the drawing. For instance, the relevant parameters determined at block 520 may be collected at block 524 for a patient presented for diagnosis and the intracranial pressure and/or the cerebral perfusion pressure determined at block 528 by application of the model. If the determined pressure is outside of an acceptable range, medical action may be taken at block 532. In some embodiments, it can be possible for revisions to the model to be made at block 536, particularly after treatment of the patient, in order to improve the value and application of the model.
Evaluation of the model may be made in any of several different ways. For example, a mean square difference of the intracranial pressure predicted by the model and the true estimated intracranial pressure may be calculated. Similarly, mean square difference between the predicted cerebral perfusion pressure and the true estimated cerebral perfusion pressure may be calculated. When a change in intracranial pressure is detected, the time taken for the model to respond to this change in the predicted intracranial pressure or to the predicted cerebral perfusion pressure may be relevant in evaluating the model. In addition, detection of a change in intracranial pressure may be used to calculate the time taken for carotid artery blood flow to diminish and to compare this with the time taken for the model to respond to such a change.
Various studies testing embodiments of the method have enabled the prediction of ICP using hemodynamic measures such as heart rate variability and central hemodynamic pressure. The ability to predict ICP directly from these central hemodynamic parameters stems from the experimentally proven ability to predict blood volume loss and CV collapse onset, using only cranial measures of blood flow derived from intracranial Doppler signals.
Management of traumatic brain injury may include therapies and diagnostic techniques that optimize and monitor cerebral metabolism and function by minimizing global cerebral ischemia. Such therapies may be included in algorithm modifications to allow noninvasive tracking of cerebral pressures.
The machine-learning paradigm accordingly permits the establishment of models that relate such parameters as described above to the intracranial and cerebral perfusion pressures. In particular, it enables the otherwise invasive intracranial and cerebral perfusion pressures to be determined through measurement of noninvasive parameters.
Noninvasive Prediction of Central Blood Volume Loss
In further embodiments, lower-body negative pressure (“LBNP”) can be used to simulate loss of central blood volume in humans. Such a model provides a method for investigating physiological signals under conditions of controlled, experimentally induced hypovolemic hypotension in otherwise healthy humans. In one set of studies, each subject was placed in an LBNP chamber and connected to a variety of noninvasive monitoring devices. Baseline measurements were made. Subjects were exposed to progressively greater amounts of LBNP to the point of cardiovascular collapse. At that point, the LBNP was released and central volume returned to normal. The experiments lasted between 25 and 50 minutes and were dependent on the level of LBNP at which the subject exhibited to cardiovascular collapse. Each LBNP level equates to about 250 cc's of blood loss.
The inventors used the method described above to derive a machine-learning paradigm that is capable of the following in real time: (1) detecting early, primary signs of LBNP, which equate to acute blood loss; (2) estimating the rate and volume of blood loss in a bleeding patient to guide resuscitation therapy; and (3) predicting a timeframe for when a bleeding patient will progress to cardiovascular collapse. The method uses hemodynamic features as inputs derived from commercially available physiological sensors, i.e. heart rate, blood pressure and RR interval from the electrocardiograph. The sample size was 64 heart beats. For this particular embodiment, the method is about 96.5% accurate in predicting the presence of active bleeding; is about 96% accurate in identifying the level of bleeding to within 250 cc's; is about 85% accurate for predicting individual specific LBNP level that a subject will experience cardiovascular collapse. Further training of the algorithm with data from 104 LBNP subjects shows greater than 95% prediction accuracy for both LNPB level and individual specific CV collapse levels.
Other Healthcare Applications
Foreseeing the clinical course of a patient whose physiology is possibly complex and constantly changing due to injury, patient disease and/or our efforts to stabilize and correct the underlying disease process depends on a practitioner's ability to identify, understand and continuously monitor a range of clinical features. Practitioners cannot, of course, physically reside at a patient's bedside at all times. Nor can they rapidly abstract, discern and respond to the many unique and subtle features that are characteristic of normal and abnormal physiological signals. Embodiments of the invention can apply a new polynomial Mahalanobis distance metric for use in classifying continuous physiological data (e.g., any waveform data), to enable active, long term learning from extremely large continually changing physiological datasets. The application of such embodiments to human vital sign data has led to the discovery of several previously hidden hemodynamic relationships that are predictive of acute blood loss and individual specific risk for cardiovascular collapse. Implementation of embodiments of the invention have broad applicability in many areas of medicine and surgery. It is especially applicable to the care of severely injured patients, whose physiology is acute, complex, constantly changing and human interpretation is required on an ongoing basis.
Embodiments of the invention incorporate dynamic, multi-objective optimization schemes. Such schemes can become increasingly more complex as greater amounts of high fidelity clinical data is captured and becomes available for analysis. Dynamic multi-objective optimization schemes can enable the development of predictive models using real-time physiological data, while autonomously controlling the management of competing therapies. An example is IV fluid management for an injured soldier with a traumatic brain injury and an exsanguinating solid organ injury. IV fluid therapy in this type of setting must be provided at a rate that will optimize systemic and cerebral perfusion, avoid re-bleeding and maintain the patient until bleeding can be controlled. Competing injuries add complexity to any fluid resuscitation strategy and the invention described herein solves this problem.
In some embodiments, the inputs to a predictive device can include non-invasively measured physiological signals, derived from existing products used in medical facilities. In some instances, the device comprises a laptop computer (e.g., as schematically shown in
Some methods and devices of the invention provide an intuitive user interface to allow medical professionals to interact with the device. In some embodiments, the user interface can allow the user to specify which sensors are available, which can then define which model to use. The user interface can also allow the device to intuitively interact with the medical professional to ensure correct sensor functioning and/or allow the medical professional to enter patient specific clinical information such as gender, weight, age, historical information, physical exam findings, various forms of treatment and information on the clinical response to treatment. In some embodiments, this clinical information can be retrieved from various data sources include computer hard drivers, network drives, etc. In some embodiments this information can be retrieved from central servers that have historical health and patient information stored thereon.
These results indicate that methods of the invention for analyzing noninvasive hemodynamic parameters is not only fast and accurate, but a viable platform for a device that could provide medical personnel with early, reliable and critically important information on blood loss, injury severity and the time to act.
Devices and methods of the invention can be seamlessly integrated into existing hospital and pre-hospital care settings because: they can be applied in parallel with existing physiological monitors, medical personnel need not change standard procedures, the method alone could be licensed to device manufactures, to enable existing, in-hospital monitors to become “smart” monitors.
Some devices of the invention utilize advanced hemodynamic measures, derived from traditional monitoring devices (blood pressure, EKG, etc). Some devices of the invention can be used to collect non-invasive data. A large amount of data can be collected from individual patients, requiring relatively few subjects for verification. Verification can be done in a short period of time, as no lengthy experimental procedures and no blood work are required. Some devices of the invention have low computational requirements (i.e. they can effectively run on inexpensive processors and laptop computers).
Methods and devices of the invention can save lives by providing early, critical information on acute blood loss, injury severity, and resuscitation effectiveness. This invention will be of great commercial interest to all branches of the U.S. armed services, trauma and non-trauma surgeons, anesthesiologists and critical care physicians worldwide. It is equally useful during the management of trauma and non-trauma patients, who are experiencing or are at risk volume loss, whether it be due to the acute loss of blood, dehydration and/or myocardial dysfunction.
Robot Navigation
The problem of planning smooth trajectories for mobile robots traveling at relatively high speed in natural environments, depends on being able to identify navigable terrain a significant distance ahead. Labeling safe or path regions in an image sequence is a common way to achieve this far field classification. Many pixel-wise classification techniques fail at this task because their similarity metric is not powerful enough to tightly separate path from nonpath, resulting in outliers distributed across the image. Some embodiments of the invention provide for a new and more powerful polynomial Mahalanobis distance metric for use in classifying path regions in images of natural outdoor environments. Some embodiments use only an initial positive sample of a path region to capture the relationships in the data, which are most discriminative for path/nonpath classification. Performance of some embodiments have been compared with Euclidean and standard Mahalanobis distance for illustrative synthetic data as well as for challenging outdoor scenes. For both normalized color and texture features embodiments provided herein produces significantly better results.
Robot navigation can implement predictive models as described throughout this disclosure for navigation and other processes. In some embodiments, a predictive model can learn and distinguish between traversable regions from non-traversable regions using image labeling techniques. For example,
In some embodiments, for an input x, a model has the following Bayesian form for estimating the class ŷ:
where cε{1, . . . . C} designates the class, {circumflex over (p)}c is an estimate of the prior probability Pr(c) of class c, and Ĥc(x)Ĥc(x) is the estimate of density of class c at input x (this is analogous to Pr(c|x)). We can estimate {circumflex over (p)}c (unbiased) by dividing the number of times class c appeared in the training sets {S1, . . . , SK}, by the total number of examples seen.
Note that one difference between the standard Bayesian use of equation (1) and the one adopted here is the following: If Ĥ1(x)=Ĥ2(x)= . . . =Ĥc(x)=0 (or some other small probability threshold deemed applicable), we can predict that the current model cannot make a class prediction for the input x because x falls outside of the type of data seen so far by the long term learning algorithm. This essentially means the learning algorithm must see labeled examples representative of x before a prediction is made.
A key focus of our research and development efforts under the LAGR program has been a development of a novel framework for learning class density models Ĥc(x) that are suitable for long term learning. Each class density model has the following form:
where chi(x) is a local density model, cαi≧0 are scaling factors, for all i=1, . . . , τc, and τc is the number of density models associated with class c.
Therefore, the learning paradigm involves learning local density models chi(x) that represent traversable and non-traversable terrain. These local density models are combined as defined above to label pixels in the image as being traversable or non-traversable. Therefore, long term ongoing learning is defined by learning as many local density models, and using a weighted subset of the most relevant ones given the robot's current environment.
At block 808 pixels within the image data can be clustered into regions that contradict the robots current set of models (e.g., models produce wrong labels), or which cannot be labeled with its current set of models. The resulting clusters constitute knowledge about the environment that the robot currently does not have. In some embodiments, the clustering algorithm can include the property that it identifies as clusters on nonlinear manifolds, and determines which examples in each cluster are most outside the manifold and therefore likely to be noise. These noisy examples can be discarded, and learning takes place only on the clean clusters. Thus new models are only constructed of previously unexplained (by the model), clean, sensor data.
For example, clusters can be constructed separately from each class that does not match data found in any model. For the far field navigation embodiments, traversable image pixel examples and the non-traversable examples can be separately clustered into groups. Clustering can use any number of algorithms. In some embodiments, the clustering algorithm can be computationally efficient at clustering thousands of training examples. For example, in the far field navigation application domain, we typically see several thousand training examples from each class. In some embodiments, the clustering algorithm can find clusters that lie on nonlinear manifolds. This property can be motivated by the observation that pixels associated with paths typically lie on locally nonlinear structures. In some embodiments, the clustering algorithm can identify examples that are outliers. These examples are often associated with sensor noise, and should not be used when learning new density models.
In some embodiments, a rank based clustering algorithm can be used. This algorithm clusters by ranking the ordering of points along nonlinear manifold structures. It therefore can allow direct identification of points that lie most in a cluster manifold (i.e. the center points), as well as points that lie most outside the manifold (i.e. the outlier points).
At block 812 the appropriate image features which separate each clustered group from all clusters in a different class are selected. For example, if a cluster is associated with traversable terrain, then the features chosen will be those that best separate it from non-traversable terrain- and similarly for clusters of non-traversable terrain. Thus each cluster involves using a unique set of features as a foundation for separating it from other clusters.
For each cluster identified, in some embodiments, the goal of feature selection is to efficiently identify the features that separate it from other clusters representing examples of a different class. For the far field navigation learning example, this amounts to finding features that best separate traversable from non-traversable terrain in the robot's current environment. This can be difficult because in some cases regions in the image that are associated with traversibility (e.g., grass on the ground) can look very similar to regions associated with obstacles (e.g., green shrubs).
In some embodiments, the framework used to discover the most discriminative image features can use a Sparse Linear Classifiers. In some embodiments, the Sparse Huber Loss algorithm can be used because of its computational efficiency and its effectiveness in building sparse linear classifiers. This algorithm is used to find the best sparse linear classifier between each cluster and all clusters corresponding to examples in a different class. The boundary of this classifier has the following form:
where {a0, . . . , ad} are the model coefficients, and xj represents dimension j of the inputs. The model is sparse because most of {a1, . . . , ad} are zero. For each cluster, the image features that are associated with non-zero coefficients {a1, . . . , ad}, are the most discriminative features for that cluster. These features can then be used to construct a local density model for the cluster.
At block 816 a nonlinear distance metric model can be built for each cluster, which measures how far points from one clusters are from another cluster. For each cluster identified in block 808 the relevant features found with the feature selection in block 812 are used to construct a distance model for the cluster. This distance model can be denoted by cdi(x), where c is the class the cluster falls in, and i refers to the cluster. The distance cdi(x) measures the distance from any point x to the cluster. It can be constructed, for example, using the Polynomial Mahalanobis Distance framework. This framework can efficiently allow locally nonlinear manifold data structures to be identified, allowing clusters to be modeled. The Polynomial Mahalanobis distance metric is illustrated in
At block 820 this distance metric can be used to build a density model for each cluster. This density model can be used to measure how close a new pixel (in either the current or new image) is to the cluster for which a model has been constructed. This process can generate many thousands of image models, and only a few of these are appropriate for any environment. For example, density models appropriate for the desert may not be useful in wooded areas.
Given the distance model cdi(x) of a cluster as constructed in block 816, a locally nonlinear density model chi(x), in some embodiments, can be constructed using a one dimensional histogram density. Therefore, the specific form of our density models can be denoted:
c
h
k(x)=DenHist(cdk(x))
where DenHist(cdk(x)) can be a one dimensional histogram density model constructed from the distance values of points within the cluster i associated with class c when put through the model cdi(x). This process is depicted in
At block 824, the current alphabet of terrain density models can be combined to make predictions of traversibility in the far field (e.g., beyond vision). Models that are relevant to the current environment can be chosen for making predictions. Relevance can be measured by how well these models predict the near field vision based classification of traversable and non-traversable terrain, as well as how relevant they are to the far field image data.
Using a classification model defined as
This model uses the density functions chi(x) (computed as described the learning of which is described above) as defined in Equation (2). Therefore, to make a prediction for an input x, the values of the scaling cai≧0 for all i=1, . . . , τc, associated with each chi(x) must be defined. These scaling factors are environment specific, and can be chosen in real time as the robot executes a task.
In some embodiments, the magnitude of the scaling factor cai can be proportional to the relevance of the density model chi(x) in the robot's current environment. If chi(x) is irrelevant to the current situation the robot is in, then it should be the case that cai=0. Note that the density models chi(x) respond (i.e. output values greater than zero), when the current examples (i.e. image features) have similar properties to examples used to construct it. Therefore one can set cai=0 whenever chi(x) has low response in the image. Furthermore, one can set cai=0 whenever chi(x) disagrees with the current image, because the stereo labeled examples in the current image where chi(x)>0, belong to a class other than c.
In some embodiments, cai=0 if either of the following conditions are met: 1)
where Ψ is the set of all examples in the current image, taken from both near and far field parts of the image. The threshold Tα2, defines a minimum on how much support the density function has in the image (for all experiments and tests under the LAGR program, this threshold is set to 10−6, but any small enough positive value can work equally well). When this threshold is violated, the density function chi(x)>0 has very little to do with the current image (e.g. perhaps it was learned when the robot was in a desert environment, whereas the robot currently is navigating in the woods). 2)
where ⊖ is the set of all examples that stereo has NOT labeled as to class c. The threshold Tα2, defines how wrong a density model can be with respect stereo labeling, and still be used. Once again, in the experiments presented here, Tα2, is set this to a small positive value of 10e−6. When this threshold is violated, then chi(x)>0 is not appropriate to the current environment, leading to incorrect classifications. For all remaining chi(x) for which Ca, is not set to zero by the above conditions, the following formula for cai can be used:
where Ψ is the set of all examples in the current image. Therefore, the value of cai is defined by how relevant the density model chi(x) is to the current image.
Embodiments of the invention can be adapted to any condition for which there exists subject data. In the medical arena this type of data will increase exponentially in the coming years, as physiological data from individual illness events becomes incorporated into each patient's electronic medical record. The matching of physiological patient data with semantically driven medical records containing various diagnoses, the timing of therapy and response to treatment, will allow methods and devices of the invention to gain insight into the practice of medicine and expected outcomes. For example, self-learning predictive systems may provide predictions based not only on real-time physiological measurements, but also on a patient's medical history such as age, diet, previous diagnoses, exercise routine, smoking habits, caffeine intake, alcohol consumption, travel history, various medical risk factors, familial history, allergies, pharmaceutical intake, weight, physical exam findings, practitioner impressions and treatment effects, etc. Moreover, multiple physiological measurements can be used to make predictions and/of for self learning.
Examples of medical and surgical conditions that could be analyzed and potentially linked and evaluated in real-time using aspects of the various embodiments include: 1) closed head injury monitoring and management, including cEEG; 2) differentiation of shock states; 3) resuscitation monitoring and management; 4) asthma, pneumonia and other respiratory diseases; 5) diabetes monitoring and prevention of diabetic ketoacidosis; 6) myocardial ischemia and infarction; 7) stroke; 8) congestive heart failure; 9) intra-operative monitoring, including depth of anesthesia; 10) pain control monitoring and management; 12) post-operative monitoring; 13) sleep apnea monitoring; 14) rehabilitation monitoring, including gait, stability and range of motion; cognitive function; activities of daily living; 15) progressive neurological disorders, e.g. Alzheimer's disease, multiple sclerosis, epilepsy, etc.; and 16) therapeutic oncology, to name a few.
This application is a non-provisional, and claims the benefit, of U.S. Provisional Patent Application No. 61/109,490, entitled “Method For Determining Physiological State Or Condition,” filed Oct. 29, 2008, the entire disclosure of which is incorporated herein by reference for all purposes. This application is a non-provisional, and claims the benefit, of U.S. Provisional Patent Application No. 61/166,472, entitled “Long Term Active Learning From Large to Continually Changing Data Sets,” filed Apr. 3, 2009, the entire disclosure of which is incorporated herein by reference for all purposes. This application is a non-provisional, and claims the benefit, of U.S. Provisional Patent Application No. 61/166,486, entitled “Statistical Methods For Predicting Patient Specific Blood Loss Volume Causing Hemodynamic Decompensation,” filed Apr. 3, 2009, the entire disclosure of which is incorporated herein by reference for all purposes. This application is a non-provisional, and claims the benefit, of U.S. Provisional Patent Application No. 61/166,499, entitled “Advances In Pre-Hospital Care,” filed Apr. 3, 2009, the entire disclosure of which is incorporated herein by reference for all purposes. This application is a non-provisional, and claims the benefit, of U.S. Provisional Patent Application No. 61/252,978, entitled “Long Term Active Learning From Large Continually Changing Data Sets,” filed Oct. 19, 2009, the entire disclosure of which is incorporated herein by reference for all purposes.
The United States Federal Government may have rights to this invention pursuant to DOD AFRL Award No. FA8650-07-C-7702 and/or pursuant to NSF Grant No. 0535269.
Number | Date | Country | |
---|---|---|---|
61252978 | Oct 2009 | US | |
61166472 | Apr 2009 | US | |
61166486 | Apr 2009 | US | |
61166499 | Apr 2009 | US | |
61109490 | Oct 2008 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13126727 | Aug 2011 | US |
Child | 15007489 | US |