The present invention relates to a method and a system for predicting types of pathogens in patents with septicemia before the culture result of the pathogenic bacteria.
Sepsis is the major death cause of the hospitalized patents. Administration of effective antibiotics can decrease the mortality rate of the patents with septicemia. However, the method of accurately identifying the type of pathogens before the culture results available is still lacking. Physicians usually give empiric antibiotics based on individual judgement without solid supporting evidences. It is concerned by people in the field about how to determine whether a patient is infected or infected by what type of pathogen before the culture result of the pathogen is available.
Embodiments of the present disclosure provide a system for predicting types of pathogens in patients with septicemia. The system includes at least one sensor and a processor. The sensor is configured to sense current physiological data, wherein a type of the current physiological data includes at least one of body temperature, blood pressure, and pulse. The processor is configured to calculate at least one feature value according to the current physiological data, and input the feature value into a machine learning model to determine one of categories including at least two of uninfected, fungal infected, contaminated bacteria infected, Gram-negative infected, and Gram-positive infected.
In some embodiments, the processor is further configured to perform steps of: obtaining healthy physiological data that changes over time; calculating a mean of the healthy physiological data as a healthy mean; calculating a variance of the healthy physiological data as a healthy variance; calculating a variance of the current physiological data as a current variance; calculating a variance of the current physiological data respect to the healthy mean as a reference variance; dividing the reference variance by the healthy variance as a first feature value; and dividing the current variance by the healthy variance as a second feature value.
In some embodiments, the reference variance is calculated according to the following equation (1) where Xcurrent is a value of samples of the current physiological data, μhealth is the healthy mean, and # current is a number of the samples of the current physiological data.
In some embodiments, the at least one sensor includes a gravity sensor, and the processor is configured to determine if a user is stationary according to a signal sensed by the gravity sensor, and obtain the current physiological data only when the user is stationary.
In some embodiments, the machine learning model is a random forest algorithm.
In some embodiments, the processor is further configured to generate an image for each type of the current physiological data according to a following equation (2).
p
i,j=(Xcurrent,i−μcurrent)×(Xhealth,j−μhealth) (2)
pi,j is a pixel at ith column and jth row of the image. Xcurrent,i is a ith value of the current physiological data. and Xhealth,j is jth value of the healthy physiological data. i and j are positive integers. The processor is further configured to input the image into a convolutional neural network to determine one of the categories.
From another aspect, a method for predicting types of pathogens in patients with septicemia that is performed on a processor is provided. The method includes: sensing, by at least one sensor, current physiological data in which a type of the current physiological data includes at least one of body temperature, blood pressure, and pulse; and calculating, by a processor, at least one feature value according to the current physiological data, and inputting the feature value into a machine learning model to determine one of categories including at least two of uninfected, fungal infected, contaminated bacteria infected, Gram-negative infected, and Gram-positive infected.
The invention can be more fully understood by reading the following detailed description of the embodiment, with reference made to the accompanying drawings as follows.
Specific embodiments of the present invention are further described in detail below with reference to the accompanying drawings, however, the embodiments described are not intended to limit the present invention and it is not intended for the description of operation to limit the order of implementation. Moreover, any device with equivalent functions that is produced from a structure formed by a recombination of elements shall fall within the scope of the present invention. Additionally, the drawings are only illustrative and are not drawn to actual size.
The using of “first”, “second”, “third”, etc. in the specification should be understood for identifying units or data described by the same terminology, but are not referred to particular order or sequence.
How the types of infection are determined is described herein. First, the physiological data of body temperature, blood pressure, pulse, and heat rate are signals that change overtime. The processor 120 obtains the physiological data in a period (e.g. few seconds, but the length of the period is not limited) through the sensors 110. For example, if the sampling frequency is 60 Hz, then five seconds of the physiological data include 60×5=300 values. The sampling frequency is not limited in the invention. The physiological data obtained by the sensors 110 are referred to current physiological data.
In addition, the processor 120 obtains physiological data of body temperature, blood pressure, pulse, and heat rate corresponding to a healthy state from a database (not shown). The obtained data is also referred to healthy physiological data. The healthy physiological data is measured from the people who are healthy (e.g. uninfected). The healthy physiological data also changes overtime, but the length and the sampling frequency of the healthy physiological data are not limited in the invention. In other words, the length of the healthy physiological data may be different from that of the current physiological data.
The processor 120 calculates two features values for each type of the physiological data (i.e. body temperature, blood pressure, pulse, or heat rate). Herein, a value of a sample included in the healthy physiological data is written as Xhealth. # health is the number of the values Xhealth. # health is also referred to the length of the healthy physiological data. A value of a sample included in the current physiological data is written as Xcurrent. # current is the number of the values Xcurrent. # current is also referred to the length of the current physiological data or the number of the samples of the current physiological data. The processor 120 calculates the man of the healthy physiological data as a healthy mean which is written as μhealth in the following equations. The mean of the current physiological data is written as μcurrent. In addition, the variance of the healthy physiological data is calculated based on the following equation (1) as a healthy variance σhealth. The variance of the current physiological data is calculated based on the following equation (2) as a current variance σsick-sick. The variance of the current physiological data with respect to the healthy mean is calculated based on the following equation (3) as a reference variance σcurrent-health.
A first feature value f1 is obtained by dividing the reference variance by the healthy variance that is written in the following equation (4). A second feature value f2 is obtained by dividing the current variance by the healthy variance that is written in the equation (5).
There are four types of physiological data (i.e. body temperature, blood pressure, pulse, and heat rate) in the embodiment, and therefore total of eight features values (including four of first feature values f1 and four of second feature values f2) are calculated. Alternatively, the blood pressure includes diastolic blood pressure that corresponds to two feature values f1 and f2 and systolic blood pressure that corresponds to two feature values f1 and f2, and thus total of ten feature values are calculated. All the calculated first feature values f1 and second feature values f2 constitute a feature vector which is inputted into a machine learning model such as a random forest algorithm, a support vector machine, a neural network, and so on that is not limited in the invention. The machine learning model is trained to determine if the patients are infected and determine the types of the pathogens. In some embodiments, the categories outputted by the machine learning model include at least two of virus infected, uninfected, fungal infected, contaminated bacteria infected, Gram-negative infected, and Gram-positive infected. Herein, “contaminated bacteria infected” means that the pathogen in the patient is caused by some sources of pollution instead of sepsis.
Referring to
Among the aforementioned physiological data, body temperature is important for determining if the patient is infected. However, the patient may get up and move, and thus affecting the value of the body temperature. In some embodiments, the sensors 110 of
Note that the feature values f1 and f2 are merely a portion of the feature vector which may include other information such as user's age, gender, medical history that would be digitized as part of the feature vector. Alternatively, the signals sensed by the sensors 110 may be used to calculate other feature values to constitute the feature vector, which is not limited in the invention.
In some embodiments, the system 100 is a wearable device carried by the patient who can go anywhere. The system 100 can determine whether the patient is infected from time to time or periodically, and can also transmit the collected physiological data or the classification result to a server or the doctor's mobile phone through the communication module 130. This allows the hospital or doctor to notify the patient to seek immediate medical attention for effective medical treatment.
In some embodiments, the physiological data is transformed into images which are inputted into a convolutional neural network for classification. For example, for each type of physiological data, an image is generated according to co-variance between the current physiological data and the healthy physiological data. To be specific, the pixel at ith column and jth row of the image is written as pi,j calculated as the following equation (6). Xcurrent,i is the ith value of the current physiological data, and Xhealth,j is the jth value of the healthy physiological data where i and j are positive integers.
p
i,j=(Xcurrent,i−μcurrent)×(Xhealth,j−μhealth) (6)
Each type of the physiological data can be used to generate one image, and therefore total of four images are generated. The four images are combined as a two-dimensional image with four channels. This two-dimensional image is inputted to a convolutional neural network to perform the classification. From another aspect, the pixel pi,j may be referred to a feature value.
In some embodiments, an image is generated based on the following equation (7).
p
i,j=(xi−xj)2 (7)
xi is the ith value of the current physiological data or the healthy physiological data. Note that the equation (7) can be applied to both of the current physiological data and the healthy physiological data, and hence two images can be generated for each type of the physiological data. Accordingly, total of eight images are generated and combined as a two-dimensional image with eight channels. The two-dimensional image is inputted into a convolutional neural network to perform the classification.
In the system and method described above, whether the patient is infected and the types of the pathogen can be predicted before the blood culture result is released. In addition, the clinician can refer to the predicted result to open a suitable antibiotic to treat the sepsis patient, resulting in higher survival rate of sepsis patients. Moreover, the prediction method is non-invasive, and no additional blood test is needed.
Although the present invention has been described in considerable detail with reference to certain embodiments thereof, other embodiments are possible. Therefore, the spirit and scope of the appended claims should not be limited to the description of the embodiments contained herein. It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the present invention without departing from the scope or spirit of the invention. In view of the foregoing, it is intended that the present invention cover modifications and variations of this invention provided they fall within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
108129894 | Aug 2019 | TW | national |
This application claims priority to U.S. Provisional Application Ser. No. 62/785,699 filed Dec. 28, 2018, and Taiwan Application Serial Number 108129894, filed Aug. 21, 2019, the disclosures of which are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
62785699 | Dec 2018 | US |