The present invention belongs to the field of medical treatment and machine learning technologies, and in particular relates to a system for the prognostics of the chronic diseases after the medical examination based on the multi-label learning.
Various chronic diseases, including a diabetes, a heart disease, a coronary heart disease, and a chronic kidney disease, have become the most important diseases that have caused a significant decline in a quality of life of people and a substantial increase in a medical economic burden on a global scale. Chronic diseases have features of a high concealment in an early stage, a low awareness rate, a high damage in a later stage, and an extremely low cure rate. According to statistics of the World Health Organization (WHO), the number of deaths caused by cardiovascular diseases and a diabetes in the world was 17 million in 2012, accounting for 50.2% of deaths from non-communicable diseases. In 2015 report on prevention and control of chronic diseases, WHO pointed out that effects of prevention and treatment of chronic diseases on a global scale were significantly lower than expected.
Early awareness and early warning are of great significance to the prevention and treatment of chronic diseases. Chronic diseases can generally be prevented by controlling work and rest, diets, exercise and other methods after early warning. However, once a patient has experienced organic lesions, the diseases can hardly be truly cured. Another difficulty in the prevention and treatment of chronic diseases is that various diseases often occur concurrently. According to a review report of 35 studies compiled by Lehnert et al. in 2011, the number of chronic diseases (MCCs) of an individual was positively or exponentially correlated to costs of diagnosis and treatment. Therefore, under a premise that a small number of chronic diseases or related abnormal physiological indicators are known, predicting an occurrence of other chronic diseases and carrying out effective interventions are of great significance to improve a control effect of chronic diseases and greatly reduce a medical burden.
Medical examination is a medical method that can be performed by both healthy and unhealthy people, and has a good prospective ability to predict the occurrence of chronic diseases. However, a current medical examination system mainly determines whether a patient has a specific disease at a current time based on current examination indicators and has a lack of systematic technical means to perform prognosis of the occurrence of chronic diseases in the next several years via current examination data and an existing chronic disease state of a medical examinee.
A medical examination clinical decision support system in the prior art that predicts various diseases with a traditional single-label machine learning method cannot extract a concurrent correlation between different chronic diseases, resulting in a decrease in an accuracy of prediction, and an outstanding medical logic contradiction in prediction results of a plurality of diseases. Currently, very few clinical decision support systems are provided for medical examination using multi-label machine learning, and relevant research can only assist in a diagnosis of diseases at a current time node, but cannot perform prognostics on an occurrence of future chronic diseases of a medical examinee.
In view of deficiencies of the prior art, an objective of the present invention is to provide a system for the prognostics of the chronic diseases after the medical examination based on the multi-label learning. Modules of the system aim at a multi-label learning model for a physical examination scenario, and fully extract time sequence information between a medical examination time point in existing medical data and an occurrence time of subsequent chronic diseases to complete a prediction of an occurrence of the chronic diseases in the next 1 to 5 years. The system can better tap a medical value which a single medical examination can bring, so that the public can have a better understanding of current and future personal medical conditions via the medical examination, and improve an actual effect of the medical examination on early awareness of the chronic diseases.
The objective of the present invention is realized through the following technical solutions: a system for the prognostics of the chronic diseases after the medical examination based on the multi-label learning includes a data acquisition module, a data preprocessing module, a basic predicting model constructing module, and a local predicting module.
The data acquisition module is configured to acquire physical examination data of a physical examination user. The physical examination data includes basic physiological indicators and routine assay indicators. The basic physiological indicators include a height, a weight, a blood pressure and other indicators that can be directly measured. The routine assay indicators include a blood routine, a urine routine and other indicators obtained through a sample assay. All the forgoing indicators of the physical examination user are expressed as a feature vector X=[x1, x2, . . . xp]T. p is the total number of indicators. An occurrence time t0 of a current medical examination is recorded. The earliest occurrence time data T=[t1, t2, . . . tq]T of a diagnosis of various chronic diseases (such as a diabetes, a hypertension, a coronary heart disease, a chronic kidney disease, or the like) from diagnosis data of the chronic diseases in the user's electronic medical record before and after the medical examination is extracted. q is the number of categories of the chronic diseases preset by the system. In the forgoing data, X, t0 is a necessary data. Each of components in T is set as null when the diagnosis does not exist.
A processing process of the data preprocessing module is specifically as follows:
performing standardization processing ϕi on each of the components xi in X based on the component, so that a standard deviation of all data on this component is 1, and a mean value of all the data on this component is 0. A standardized feature vector is denoted as X′=[x′1, x′2, . . . x′p]T;
where, xi′ is a standardized data. λi is the mean value of all the data on the component xi. σi is the standard deviation of all the data on the component xi;
performing an exponential operation on T based on a base number α(0<α<1) to generate a prognostic index vector Y=[y1, y2, . . . yq]T:
The basic predicting model constructing module is configured to construct a multi-label learning model for a physical examination scenario, and a constructing process is as follows:
(1) constructing a multilayer neural network, and completing data processing from input to output via the multilayer neural network, where, specific hyperparameters of the network include: the number K of layers of the network, the number n1, n2, . . . nK of nodes in each of the layers of the network, and an activation function between two adjacent layers, where n1=p, nK=q; transfer weight matrixes between two adjacent layers are denoted as W1, W2, . . . WK−1; and an output value of the nodes of the last layer is denoted as a predicted prognostic index C=[c1, c2, . . . cq]T;
(2) designing a loss function E, where, the loss function is a key definition that enables the model to adaptively learn an occurrence state and an occurrence sequence of various chronic diseases to perform prognostics of future chronic diseases of a medical examinee;
The loss function can be regarded as a weighted mean value of three loss functions set for different objectives, where, N represents a data sample amount used in a single batch of gradient descent, and a superscript k of Eik indicates that a component of the loss function is obtained by calculation of a k-th sample in the batch of gradient descent. λiw represents respective weight values of the three loss functions;
a) E0 is a single-label loss function, which characterizes a difference between a predicted prognostic index ci and an actual prognostic index yi of disease prediction:
b) E1 is an interval loss function, which characterizes a difference between a predicted prognostic index difference Δci,j and an actual prognostic index difference Δyi,j of two different chronic diseases:
c) E2 is a ranking loss function, which characterizes a difference between a predicted occurrence order and an actual occurrence order of the two different chronic diseases:
With this loss function, an actual occurrence time of the diseases, an occurrence logical relation between the diseases, and a time interval between the diseases can be considered during model training, so that an occurrence of a plurality of diseases in the future can be better predicted based on a single physical examination data.
(3) Learning parameters: according to the physical examination data of a sample medical institution, several models M1, M2, . . . ML are constructed by matrix hyperparameter scanning (i.e., hyperparameters for scanning: the number K of the layers of the network, the number n2, . . . nK−1 of nodes in middle layers of the network, and activation functions between two adjacent layers). Parameters of each of the models are learned based on a mini-batch gradient descent (MBGD). Optimal parameters are determined via k-fold cross validation. An optimal model is used as a basic predicting model Mbest for migration to other medical institutions. Mbest is solidified into the basic predicting model constructing module.
The local predicting module is arranged in a specific local medical institution and includes a local model training unit and a predicting unit.
The local model training unit obtains an optimal basic predicting model Mbest via the basic predicting model constructing module that is adjusted via real data of a sample medical institution. However, due to different testing instruments and methods used by different medical institutions, the parameters need to be adaptively adjusted based on specific physical examination data of the specific local medical institution to determine an local predicting model MbestH for the specific local medical institution;
The local model training unit receives the optimal basic predicting model Mbest provided by the basic predicting model constructing module, acquires the physical examination data X, t0, T of the specific local medical institution via the data acquisition module, generates X′, Y via the data preprocessing module, performs model training the same as a Mbest training method with model parameters of Mbest as initial parameters based on X′, Y, and solidifies a trained local predicting model MbestH into the local predicting module after the parameters are converged.
The predicting unit performs the prognostics of the chronic diseases based on physical examination data of a new medical examinee according to the local predicting model MbestH, outputs a predicted prognostic index C=[c1, c2, . . . cq]T of the occurrence of the plurality of chronic diseases, and then obtains a future expected occurrence time t′i of the corresponding chronic disease via an inverse function t′i=y−1(ci) of y(ti). For a disease whose occurrence time is greater than a cut-off time tcutoff, a risk of occurrence of the diseases in a short term is considered to be low. A predicted occurrence time of the chronic diseases is fed back to the medical examinee via a natural language generation method, so that the medical examinee can understand high-risk diseases in the future, thereby providing a reference for a targeted adjustment of living habits and better prevention and treatment of high-incidence diseases.
Further, the medical institution stores the physical examination data as a .csv file locally. A selected sample medical institution generates an encapsulated basic predicting model Mbest from the physical examination data of the sample medical institution. The physical examination data of a local medical institution is sent to its local predicting module via an interface service, and the future expected occurrence time of the corresponding chronic diseases is returned via an interface response.
The beneficial effects of the present invention are as follows: the system of the present invention can extract an internal relation in a case of a concurrence of the chronic diseases with a multi-label learning method, which is more in line with a feature of high concurrency of the chronic diseases, and can better accurately predict an occurrence of the future chronic diseases. In addition, a complete set of solutions including data acquiring, data preprocessing, basic predicting model construction and local prediction output is completed.
The present invention is further described in detail below in conjunction with the drawings and specific embodiments.
As shown in
The system includes a data acquisition module, a data preprocessing module, a basic predicting model constructing module, and a local predicting module.
The data acquisition module is configured to acquire physical examination data of a physical examination user. The physical examination data includes basic physiological indicators and routine assay indicators. The basic physiological indicators include a height, a weight, BMI, a systolic blood pressure, and a diastolic blood pressure. The routine assay indicators include a blood routine (total protein, albumin, globulin, an albumin/globulin ratio, alanine aminotransferase, aspartate aminotransferase, alkaline phosphatase, cholinesterase, total bile acid, total bilirubin, direct bilirubin, indirect bilirubin, adenylate deaminase, glutamyl transpeptidase, glomerular filtration rate, creatinine, urea, uric acid, bronostatin C, triglycerides, total cholesterol, high-density lipoprotein-C, low-density lipoprotein-C, very low-density lipoprotein-C, fasting blood glucose, potassium, sodium, chloride, total calcium, inorganic phosphorus, glyphosate dipeptide aminopeptidase, a-fucosidase), and a urine routine (urine protein, urine ketone bodies, urine glucose, urine sediment red blood cells, urine bilirubin, urine sediment white blood cells, urine sediment red blood cells, urobilinogen, uric acidity). All the forgoing indicators of the physical examination user are expressed as feature vectors X=[x1, x2, . . . Xp]T. p=46 is the total number of indicators. An occurrence time t0 of a current medical examination is recorded. The earliest occurrence time data T=[t1, t2, . . . tq]T of diagnosis of the chronic diseases (a diabetes, a hypertension, a hyperlipidemia, a hyperuricemia, a fatty liver, a coronary heart disease, and a chronic kidney disease) from diagnosis data of the chronic diseases in the user's electronic medical record before and after the medical examination is extracted. q=7 is the number of categories of the chronic diseases preset by the system. In the forgoing data, X, t0 is necessary data, and each of components in T is set to null when the diagnosis does not exist;
The data preprocessing module performs specifically the following:
performing standardization processing ϕi on each of the components xi in X based on the component, so that a standard deviation of all data on this component is 1, and a mean value of all the data on this component is 0. A standardized feature vector is denoted as X′=[x′1, x′2, . . . x′p]T;
where, xi′ is a standardized data. λi is the mean value of all the data on the component xi. σi is the standard deviation of all the data on the component xi;
performing an exponential operation on T based on a base number α(0<α<1) to generate a prognostic index vector Y=[y1, y2, . . . yq]T:
The basic predicting model constructing module is configured to construct a multi-label learning model for a physical examination scenario. As shown in
(1) constructing a multilayer neural network, and completing data processing from input to output via the multilayer neural network as shown in
(2) designing a loss function E, where, the loss function is a key definition that enables the model to adaptively learn an occurrence state and an occurrence sequence of various chronic diseases to perform prognostics of future chronic diseases of a medical examinee.
The loss function can be regarded as a weighted mean value of three loss functions set for different objectives, where, N=20 represents a data sample amount used in a single batch of gradient descent, and a superscript k of Eik represents that a component of the loss function is obtained by calculation of a k-th sample in the batch of gradient descent. λiw represents respective weight values of the three loss functions; λ0w=λ1w=λ2w=1 is used in this embodiment.
a) E0 is a single-label loss function, which characterizes a difference between a predicted prognostic index ci and an actual prognostic index yi of disease prediction:
b) E1 is an interval loss function, which characterizes a difference between a predicted prognostic index difference λci,j and an actual prognostic index difference λyi,j of two different chronic diseases:
c) E2 is a ranking loss function, which characterizes a difference between a predicted occurrence order and an actual occurrence order of the two different chronic diseases:
With this loss function, an actual occurrence time of the diseases, an occurrence logical relation between the diseases, and a time interval between the diseases can be considered during model training, so that an occurrence of a plurality of diseases in the future can be better predicted based on a single physical examination data.
(3) Learning parameters: according to the physical examination data of a sample medical institution, several models M1, M2, . . . ML are constructed by matrix hyperparameter scanning (i.e., hyperparameters for scanning: the number K∈{3, 4, 5,6} of the layers of the network, the number n2, . . . nK−1∈{50, 30, 20, 10} of nodes in middle layers of the network, and activation functions ∈{ReLU, sigmoid, Tanh} between two adjacent layers). Parameters of each of the models are learned based on a mini-batch gradient descent (MBGD). Optimal parameters are determined via 10-fold cross validation. An optimal model is used as a basic predicting model Mbest for migration to other medical institutions. Mbest is solidified into the basic predicting model constructing module.
The local predicting module is arranged in a specific local medical institution and includes a local model training unit and a predicting unit as shown in
The local model training unit obtains an optimal basic predicting model Mbest via the basic predicting model constructing module that is adjusted via real data of a sample medical institution. However, due to different testing instruments and methods used by different medical institutions, the parameters need to be adaptively adjusted based on specific physical examination data of the specific local medical institution to determine an local predicting model MbestH for the specific local medical institution.
The local model training unit receives the optimal basic predicting model Mbest provided by the basic predicting model constructing module, acquires the physical examination data X, t0, T of the specific local medical institution via the data acquisition module, generates X′, Y via the data preprocessing module, performs model training the same as a Mbest training method with model parameters of Mbest as initial parameters based on X′, Y, and solidifies a trained local predicting model MbestH into the local predicting module after the parameters are converged.
The predicting unit performs the prognostics of the chronic diseases based on physical examination data of a new medical examinee according to the local predicting model MbestH, outputs a predicted prognostic index C=[c1, c2, . . . cq]T of the occurrence of the plurality of chronic diseases, and then obtains a future expected occurrence time t′i of the corresponding chronic disease via an inverse function t′i=y−1(ci) of y(ti). For a disease whose occurrence time is greater than a cut-off time tcutoff=5 year, a risk of occurrence of the diseases in a short term is considered to be low. A predicted occurrence time of the chronic diseases is fed back to the medical examinee via a natural language generation method, so that the medical examinee can understand high-risk diseases in the future, thereby providing a reference for a targeted adjustment of living habits and better prevention and treatment of high-incidence diseases.
Further, the medical institution stores the physical examination data as a .csv file locally. A selected sample medical institution generates an encapsulated basic predicting model Mbest from the physical examination data of the sample medical institution. The physical examination data of a local medical institution is sent to its local predicting module via an interface service, and the future expected occurrence time of the corresponding chronic diseases is returned via an interface response.
The forgoing is only examples of implementation of the present invention, and is not used to limit the protection scope of the present invention. Any modification, equivalent replacement, improvement, or the like made without creative labor within the spirit and principle of the present invention are all included in the protection scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
202010038223.2 | Jan 2020 | CN | national |
The present application is a continuation of International Application No. PCT/CN2021/071826, filed on Jan. 14, 2021, which claims priority to Chinese Application No. 202010038223.2, filed on Jan. 14, 2020, the contents of both of which are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2021/071826 | Jan 2021 | US |
Child | 17543736 | US |