The invention relates to a system and a method for extubation prediction, and more particularly to a system and a method thereof for establishing an extubation prediction using a machine learning model.
According to statistics, more than 80% of patients in the intensive care unit need to rely on mechanical ventilation assistance to maintain their lives, and research shows that the longer the patient relies on mechanical ventilation assistance, the more difficulty for the patient to be removed from mechanical ventilation assistance, and can even evolve into a long-term dependence on mechanical ventilation assistance to survive, or in a short period of time after being removed from a ventilator, the patient has to be re-intubated again to rely on mechanical ventilation assistance to maintain survival.
In order to increase the success rate of patients to be removed from mechanical ventilation assistance, many studies are currently using various research methods to predict the time score and success rate of patients to be removed from mechanical ventilation assistance. For example, some studies conduct statistical analysis by collating the data of patients in intensive care units and chronic ventilatory care units, their clinical features, reasons for using ventilators, chronic complications, and reasons for difficulty in removing ventilators in order to summarize the features of patients who have difficulty in removing from mechanical ventilation assistance. However, the conditions of each of the patients is not the same and the conditions change rapidly. The results obtained through the retrospective statistical analysis method will not be able to accurately predict the time score suitable for extubation and removal from a ventilator.
A main object of the invention is to provide a system and a method thereof for establishing an extubation prediction using a machine learning model capable of automatically predicting a possibility of extubation of hospitalized patients in real time to be used as a tool for clinicians to evaluate a success rate of extubation of patients.
Another object of the invention is to provide a system and a method thereof for establishing an extubation prediction using a machine learning model capable of providing reasons or related instructions for predicting a possibility of extubation in order to increase reliability and interpretability of a possibility of extubation.
In order to achieve the above objects, the invention provides a system for establishing an extubation prediction using a machine learning model comprising a processing device for analyzing key feature data of a patient to be predicted through an extubation prediction model to generate a possibility of extubation of the patient to be predicted, wherein the key feature data comprise physiological parameter data, and consciousness data, input/output liquid data and ventilatory function data obtained 1 day or/and 2 days before a day of performing the extubation prediction.
Wherein the extubation prediction model is selected from a group consisting of XGBoost (Extreme Gradient Boosting), CatBoost (categorical boosting), LightGBM (light gradient boosting machine), random forest algorithm (RF) and logistic regression (LR).
Wherein the possibility of extubation is a possibility of the patient to be predicted to be removed from a ventilatory assistance device within 24 hours after the day of performing the extubation prediction.
Wherein the processing device has a data processing module and an extubation prediction module, and the data processing module is used to extract the key feature data from feature data of the patient to be predicted; the extubation prediction module analyzes the key feature data with the extubation prediction model.
In one embodiment of the invention, the system for establishing the extubation prediction using the machine learning model further comprises a database for collecting feature data of patients, wherein the feature data comprise consciousness data, input/output liquid data, ventilatory function data and physiological parameter data; the processing device comprises a data processing module; and each of the patients has a record of using a ventilatory assistance device during hospitalization.
In another embodiment of the invention, the data processing module can be used for performing a data preprocessing procedure on the feature data of the patients in the database in order to delete data in the feature data exceeding a preset reasonable range, and/or complement data missing in the feature data.
In another embodiment of the invention, the extubation prediction module analyzes the key feature data with the extubation prediction model, and then generates a correlated information between the key feature data and a probability of extubation.
In another embodiment of the invention, the processing device further comprises a visualization module for converting the correlated information between the key feature data and the probability of extubation into a visualization interface, such as bar graph, broken line graph, SHAP force plot, partial dependence plot (PDP).
In yet another embodiment of the invention, the processing device further comprises a model training module for training a machine learning model with at least a part of the feature data of the patients, and verifying to generate the extubation prediction model and a key feature.
Wherein the key feature comprises age, a number of days of using a ventilator, and GCS score, urine volume, injection volume, nutrition amount, RASS score, PIP, MAP, ventilatory rate and heart rate of 1 day and 2 days before the day of performing the extubation prediction.
Furthermore, another embodiment of the invention discloses a method for establishing an extubation prediction using a machine learning model for predicting a possibility of extubation in real time through the machine learning model. Specifically, the method for establishing the extubation prediction using the machine learning model comprises following steps of: (a) obtaining feature data used for training; (b) performing training with the machine learning model and obtaining an extubation training model; (c) obtaining key feature data of a patient to be predicted; and (d) analyzing the key feature data with the extubation training model to obtain a possibility of extubation of the patient to be predicted.
Wherein obtaining, training, performing, analyzing are performed by a processor using computer readable instructions stored in a database.
Wherein the step a further comprises a data preprocessing step for removing the feature data used for training that do not meet a standard, and/or complementing an insufficient part of the feature data used for training by means of interpolation.
Wherein the step d further comprises obtaining a correlation between each of the key features and the probability of extubation, and converting the correlation into a visualization interface.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
The invention discloses a system and a method thereof for establishing an extubation prediction using a machine learning model capable of obtaining an extubation prediction model and key feature used by the extubation prediction model through training and/or verification of a machine learning model, and analyzing key feature data of a patient in real time through the extubation prediction model in order to obtain a possibility of extubation of the patient and its related explanation. Accordingly, the system and the method thereof for establishing the extubation prediction using the machine learning model disclosed in the invention are used as a tool for clinical caregivers to evaluate extubation in order to reduce a possibility of being unable to breathe spontaneously or requiring reintubation after extubation.
It must be emphasized that the system and method thereof for establishing extubation prediction using machine learning model disclosed in the present invention are to incorporate patients' consciousness data and input/output fluid data, so the extubation prediction results can be closer to the results of the doctor's clinical judgment. It means that the system and method thereof for establishing extubation prediction using machine learning model disclosed in the present invention can achieve a more accurate extubation prediction rate.
The terms used in the invention will be described below, for ones that are not listed in the following description, they will be explained based on reference materials recognized by a person having ordinary skill in the art to which the invention pertains, such as thesaurus, dictionary, literature or common knowledge.
The term “dynamic parameter” refers to data or results obtained from regular or irregular testing of patients during hospitalization.
The term “consciousness/awareness domain” refers to data obtained based on a patient's state of consciousness assessed by a system, a scale or a method, such as the Glasgow Coma Scale (GCS), the Richmond Agitation-Sedation Scale (RASS).
The term “fluid balance domain” refers to data related to fluids entering a patient's body or fluids eliminated by a patient, such as urine volume, injection volume, infusion volume, and total force-fed volume.
The term “ventilatory function domain,” refers to data related to a patient's ventilatory function or cardiopulmonary function, such as peak airway pressure (PIP), mean airway pressure (MAP), a number of days of using a ventilator, and ventilatory rate.
The term “physiological parameter domain” refers to physiological data of a patient, such as weight, body mass index, height.
The term “ventilatory assistance device” or “ventilator” refers to an external device installed on a patient to assist the patient in ventilation/inhalation.
The term “module” refers to a component with specific functions composed of several basic functional elements, which can be used to form a system, a device or a program with complete functions, for example, a module can be an electronic circuit.
The term “ICU” is an abbreviation of Intensive Care Unit, which refers to an intensive care ward for severe illness in a hospital, and different wards can belong to different departments according to division.
The term “machine learning” refers to the use of a machine learning model to learn and improve through data, find patterns and correlations, and make decisions and predictions based on learning and analysis results. Taking an example listed in the invention as an illustration, the machine learning model comprises XGBoost (Extreme Gradient Boosting), CatBoost (categorical boosting), LightGBM (light gradient boosting machine), random forest algorithm (RF) and logistic regression (LR).
“Tube” in the term “extubation” refers to a tube used to ventilate or provide a gas, thus, “extubation” means removal from a ventilator or other ventilatory assistance devices.
The term “APACHE II” is an abbreviation of acute physiology and chronic health evaluation II, which is a disease severity scoring system ranging from 0 to 71 scores. The scoring system is based on 12 physiological data, patient age and health status as input scores, and each of the data is a worst value of a patient 24 hours after admission.
The term “SOFA” is an abbreviation of Sequential Organ Failure Assessment, which is a system for assessing disease severity. It divides organs into lungs, blood coagulation, liver, heart, nerves and kidneys, and evaluates them separately. A highest score for each of the organs is 4 scores, a lowest score is 0 score, 24 hours after entering an ICU, a total score will be calculated every 48 hours.
The term “GCS (Glasgow Coma Scale)” is a coma index and is one of the most widely used coma indices. Its assessment comprises three aspects: eye opening response, speech response, movement response, a highest score for each of the aspects is 5 scores, a lowest score is 1 score. The lower a total score, the more severe a coma procedure.
The term “RASS (Richmond Agitation-Sedation Scale)” is a scale used to measure a degree of agitation or sedation of patients with scores ranging from −4 to +4, representing different behavioral states.
The term “SHAP value” or “SHAP force plot” refers to a numerical value assigned to each feature in a datum or a set of data when a predetermined machine learning model generates a predicted value for the datum or the set of data.
The term “LIME (Local Interpretable Model-Agnostic Explanations)” refers to locally interpretable models with an object to understand complex and unexplainable models, and a principle used is to provide a locally interpretable or understandable model according to an individual to be interpreted.
The term “visualization interface” refers to the graphical display of data or information through graphics, so that users can better understand the data and information, and the so-called graphics include diagrams, tables, colors, symbols, marks, icon or other visual element.
Specifically, referring to
Wherein the database 20 is a structured information or collection of data to be stored in a predetermined way, such as electronically stored in a computer system, a hard disk or a cloud space. The database 20 collects and stores feature data of patients, wherein each of the patients has a record of using a ventilatory assistance device in an intensive care unit during hospitalization, and the feature data comprise consciousness data, input/output liquid data, ventilatory function data and physiological parameter data. A source of data of the database 20 can be directly input or obtained after being connected with other external databases.
In this embodiment, the consciousness data are results of assessment of a patient's degree of consciousness or awareness, such as scores obtained from the Glasgow Coma Scale (GCS) and scores obtained from the Richmond Agitation-Sedation Scale (RASS); the input/output liquid data is an amount of liquid injected into a patient's body within a predetermined period of time, or an amount of liquid excreted by a patient, such as infusion volume, injection volume, amount of liquid intake in a diet, urine volume; the ventilatory function data are data related to a patient's ventilatory or/and cardiopulmonary function status, such as a number of days wearing a ventilatory assistance device, peak airway pressure (PIP), mean airway pressure (MAP), ventilatory rate; the physiological parameter data are a patient's physiological data, such as age, weight, body mass index.
The processing device 30 is hardware, such as a computer, a computer processor capable of handling data by a series of executing programs, instructions. The processing device 30 has a data processing module 31, a model training module 32, an extubation prediction module 33 and a visualization module 34.
Wherein the data processing module 31 is used to process feature data from patients, including performing a data preprocessing procedure and a feature data extraction procedure. Specifically, the data preprocessing procedure refers to a step of deleting or/and complementing the feature data after the data processing module 31 reads the feature data of the patients in the database 20, that is, the data processing module 31 receives a preset reasonable range of each feature and uses the preset reasonable range as a judgment standard, when one of the feature data does not fall within the preset reasonable range, the feature datum is deleted; and checks whether a quantity of the feature data meets a predetermined value, if the quantity does not meet the predetermined value, it is determined that the feature data is missing a part, and the missing part of the feature data should be complemented by taking an average value of the feature data. The feature data extraction procedure refers to the data processing module 31 reading the feature data of a patient to be predicted, and then extracting data related to a key feature therefrom for using as key feature data.
The model training module 32 receives the feature data processed by the data processing module 31, and uses at least a part of the feature data to train a machine learning model, and uses at least a part of the feature data to verify the trained machine learning model and results obtained to generate an extubation prediction model and the key feature data used in the extubation prediction model.
In this embodiment, the machine learning model is an algorithmic logic or an algorithm well-known in the technical field to which the invention pertains, such as XGBoost (Extreme Gradient Boosting), CatBoost (categorical boosting), LightGBM (light gradient boosting machine), random forest algorithm (RF) and logistic regression (LR).
In this embodiment, there are 20 key features, including age, a number of days of using a ventilatory assistance device, GCS score, urine volume, injection volume, nutrient amount, RASS score, PIP, MAP, ventilatory rate, heart rate measured on one day before and two days before a day of performing the extubation prediction.
The extubation prediction module 33 obtains the extubation prediction model from the model training module 32, and uses the extubation prediction model to analyze the key feature data from the patient to be predicted in order to generate a probability of extubation of the patient to be predicted within 24 hours after the day of performing the extubation prediction, and a correlated information between each key feature and the probability of extubation.
The visualization module 34 converts the correlated information between each key feature and the probability of extubation into a visualization interface for presenting on a display or a device with a display, such as computer, screen, tablet computer, mobile phone.
With a composition of the above components, the system 10 for establishing the extubation prediction using the machine learning model disclosed by the invention is capable of automatically analyzing in real time whether a patient hospitalized in an intensive care unit is in a state suitable for removing a ventilatory assistance device. A quantitative method provides clinicians with a probability of extubation as a reference for assessing extubation, and can further provide clinicians with a correlated information between each key feature and the probability of extubation to achieve an efficacy of increasing a reliability of the probability of extubation by increasing an interpretability of the probability of extubation.
Another embodiment of the invention provides a method for establishing an extubation prediction using a machine learning model comprising following steps:
In order to illustrate the efficacies, the technical features of the invention can achieve, several examples are provided as detailed description as follows.
All tests in the following examples are reviewed and approved by the Research Ethics Review Committee of Taichung Veterans General Hospital, and follow the guidelines of informed consent and anonymization.
Data of patients admitted to intensive care units of Taichung Veterans General Hospital between July 2015 and July 2019 are collected, the data of patients who did not use ventilators and the data of patients who used ventilators for less than 72 hours are excluded, and finally 5940 subjects who used ventilators for more than 48 hours are screened out, and statistical analysis results of feature data of the subjects are shown in Table 1, wherein data in Table 1 are expressed as mean value±standard deviation or amount (percentage); CCI stands for Charlson Comorbidity Index. From the contents of Table 1, it can be known that there are 3657 subjects who removed a ventilator during an ICU period, and 2283 subjects who did not remove a ventilator during an ICU period; a total of 65 feature data are used; an average age of the subjects is 66.2±16.2 years old, and 64.0% of the subjects are male; a disease severity of the subjects is significantly high, APACHE II score and SOFA score are 25.7±6.6 and 8.5±3.6 respectively, and 61.5% of the subjects are extubated during ICU hospitalization, however, distributions of age, gender, and CCI index between the extubated subjects and the non-extubated subjects are similar, but APACHE II score and SOFA score of the non-extubated subjects are higher than those of the extubated subjects.
Then, according to four main clinical domains in a clinical workflow, dynamic parameters of the subjects during ICU hospitalization are classified and processed and statistically analyzed, results are shown in Table 2. Wherein the four main clinical domains are consciousness/awareness domain, fluid balance domain, ventilatory function domain, and physiological parameter domain respectively, wherein the consciousness/awareness domain comprises the Glasgow Coma Scale (hereinafter referred to as GCS) and the Richmond Agitation-Sedation Scale (hereinafter referred to as RASS); the fluid balance domain comprises liquid, urine volume, total amount of force-feeding given to a patient; the ventilatory function domain comprises peak airway pressure (PIP), mean airway pressure (MAP), number of days of using a ventilator, ventilatory rate; the physiological parameter domain comprises heart rate. From the results in Table 2, it can be known that during ICU hospitalization the extubated patients' consciousness continues to improve, sedation state decreases, heart rate and infusion volume gradually decrease, and urine volume and force-fed volume show a steady increase.
Please refer to
Please refer to
Before analysis, the data preprocessing procedure can be performed on the feature data of the subjects collected in Example 1. The so-called data preprocessing comprises removing abnormal data and inputting missing data, wherein the abnormal data refer to numerical values exceeding a reasonable range of variables. In this example, the reasonable range of variables is set by a doctor. For example, a reasonable range of each of the variables is as follows: age is 1-100 years old, a number of days wearing a ventilator is 1-60 days, GCS is 3-15, urine volume is 0-5000 ml, injection volume is 0-10000 ml, nutrition amount is 0-3000 ml, RASS is −5-+4, PIP is 0-50, MAP is 10-40, ventilatory rate is 0-40, heart rate is 0-300; and in this example, an average value of each of the variables is input to complement the missing data.
In addition, before the machine learning models are used for data analysis, all data are standardized from +1 to −1; and, in order to avoid sampling bias, two sets of data are used in the extubated subjects, one of the sets is data of 1 day before extubation, the other set is random data, and five sets of data are randomly selected from the non-extubated subjects; a ratio of the feature data of the extubated subjects to the feature data of the non-extubated subjects is 1:3.4.
Analysis results of each of the machine learning models are shown in
From the results in
In this example, the machine learning model used is XGBoost.
According to the four main clinical domains disclosed in Example 1, the 20 key features obtained in Example 2 are respectively classified into the corresponding main clinical domains, an importance degree of each of the key features is analyzed, and by accumulating importance values of the key features in each of the main clinical domains, importance degrees between each of the main clinical domains and extubation prediction can be obtained, as shown in the results in
Then, how the 20 key features affect the possibility of extubation can be obtained by using SHAP values, and the results are shown in
Further, as shown in
With reference to the content disclosed in Example 2, the different machine learning models are used to analyze the feature data of the subjects, but the difference is that the feature data of the subjects used for analyzing do not comprise features of consciousness domain: GCS and RASS, and then preciseness, specificity, sensitivity, accuracy and AUROC of the results obtained by each of the machine learning models are checked, and the results are shown in Table 4 below.
Comparing the results of Table 3 with those in Table 4, it can be known that when the key features disclosed in the invention are missing, a reliability of the obtained possibility of the extubation prediction will be reduced.
Although the invention has been disclosed as above with the embodiments, it is not intended to limit the invention. A person having ordinary skill in the art to which the invention pertains can make various changes and modifications without departing from the spirit and scope of the invention. Therefore, scope of protection of the invention shall be subject to what is defined in the pending claims.
Number | Date | Country | Kind |
---|---|---|---|
112119171 | May 2023 | TW | national |