This application is based upon and claims priority to Chinese Patent Application No. 2023116173052, filed on Nov. 29, 2023, the entire contents of which are incorporated herein by reference.
The present invention belongs to the technical field of prediction of adverse reactions between drugs, and particularly relates to a method for predicting adverse reactions between drugs based on multi-attribute and multi-kernel representation learning.
Adverse reactions between drugs mean that the drug efficacy or the pharmacological action of one drug is destroyed by the other drug after two drugs are combined in use, which causes changes in original in-vivo metabolic and absorption processes of the drugs, generating adverse reactions or toxic and side effects harmful to human bodies. For example, the combined use of a triazole antifungal agent voriconazole, and a synthesized adrenal corticosteroid will induce adverse reactions such as osteoporosis, ichorrhemia, and attenuation of vision. The common use of a polyamine sevelamer and a sulfomethylaminobenzoic acid derivative furosemide will cause adverse reactions such as bradycardia, asystole, and adynamic ileus. Identification and prediction of the adverse reactions between drugs have received attention gradually by scientific researchers in various disciplines, which are also a research hotspot in the current field of clinical pharmacy and medical treatment.
To address the problem of the adverse reactions between drugs, pharmaceutical enterprises all over the world have invested much capital in a new drug research and development stage for clinical tests before the drug is listed, so that the risk of the adverse reactions of the new listed drug is reduced, and the safety of the drug is improved. Despite this, there are still adverse reaction events happening frequently every year. The costs of the pharmaceutical enterprises in a safety monitoring stage for the new listed drug are increased, and the risk for the drug being discarded and withdrawn is also improved. On the one hand, the factors affecting the adverse reactions between drugs are complicated and diverse. For example, there are differences between biochemical attributes (molecular structure, target, enzyme, pathway, and side effect) of different drugs and the absorptive and metabolic capabilities of patients to drugs are also different, resulting in that extensive analysis and researches cannot be carried out in clinical tests for the various using conditions of the drugs and different human body states. On the other hand, there are a large number of drugs, leading to their huge combination space when they are used together, leading to high clinical test cost and long time consumption in clinical tests. Therefore, efficient identification and prediction of the adverse reactions between drugs cannot be conducted on a large scale by the clinical test methods, and it is not enough to find potential adverse reactions between drugs merely dependent on the clinical tests before new drug is listing. Therefore, predicting the adverse reactions between drugs makes up for the limitation of the clinical test methods and hysteresis of safety monitoring after the new drug is listed, thereby lowering the research and development cost of the new drug and the safety monitoring cost of the new listed drug.
Research work for predicting the adverse reactions between drugs at present is mainly divided into two categories: methods for predicting adverse reactions between drugs based on a knowledge base and methods for predicting adverse reactions between drugs based on drug attributes.
Public medical and sanitary institutions collect drug safety monitoring data submitted by drug users and medical workers through a spontaneous reporting system to construct a knowledge base for the adverse reactions between drugs. Researchers proposed the methods for predicting adverse reactions between drugs based on a knowledge base to detect the adverse reactions between drugs in biochemical texts, electronic medical records, and a biological heterogeneous database.
With deep research of pharmaceutical experts on drug attributes (molecular structure, side effect, target, enzyme, pathway, phenotype, gene, diseases, and the like), the researchers extract drug attribute information from the heterogeneous database of drugs, design the methods for predicting adverse reactions between drugs based on drug attributes, and reveal attribute rules of the adverse reactions of the drugs. Common research methods include multi-task learning, feature selection, graph convolutional neural networks, et al.
Although the above methods lay a good foundation for predicting the adverse reactions between drugs, they have the following defects: by calculating the similarities of multi-attribute representations between drugs by similarity measure functions, the current methods model the relationship of adverse reactions between drugs. Specifically, the similarity measure function can be regarded as a kernel function that first maps the attribute representations of the drugs to a high-dimensional space by using a projection strategy and then calculates inner products of the attributes representations of the two drugs in high-dimensional space representations, which is seen as adverse reaction scores between the drugs. Due to that different attributes are usually distinct in revealing the underlying characteristics of adverse reactions between drugs, the kernel function most compatible with its potential characteristics is preferably selected to model the adverse reactions between drugs. The current methods usually make attempts at various kernel functions (such as linear kernel, Gaussian kernel, and Sigmoid kernel), and select the kernel function with the best result as the optimal kernel function to measure the attribute similarity measure between drugs. However, because the potential characteristics of different drug attributes are diverse and complex, the selected optimal kernel function is limited by its representation capability, and the optimal kernel function only can approximate the potential characteristics of the attributes and cannot fully reflect the characteristics of the attributes. On the other hand, from the perspective of the kernel function, different kernel functions have their own preference and tendencies when calculating the attribute representation similarities between drugs.
Due to that different attributes are usually distinct in revealing the underlying characteristics of adverse reactions between drugs, and different kernels typically have their respective preferences and tendencies in similarity estimation, based on multiple attributes of the drugs for predicting the adverse reactions between drugs, it is proposed to construct the optimal kernel function combination to better reveal the potential characteristics of different attributes in modeling the adverse reactions between drugs, that is, to find an integrated similarity measure function which is capable of being compatible to diverse and complex characteristics of the different attributes. Specifically, this optimal kernel function combination integrates preference and tendencies of various kernel functions in calculating the multi-attribute similarities, which can better reveal the relationship between the multi-attribute similarities of the drugs and the adverse reactions between drugs, thereby improving the accuracy of predicting the adverse reactions between drugs. For the differences of multi-attributes in revealing the potential characteristics of the adverse reactions between drugs, the problem to be solved in the present invention is to construct the optimal kernel function combination for accurate prediction of adverse reactions between drugs.
To address the above problems, the present invention provides a method for predicting adverse reactions between drugs based on multi-attribute and multi-kernel representation learning. Based on multi-attribute information (molecular structure, target, pathway, side effect, phenotype, and disease) of the drugs, in view of highly dimensional and sparse multi-attribute feature space of the drugs and divergence among feature dimensions of different attributes, the method in the present invention includes: first, projecting a multi-attribute feature space of the drugs to a same low-dimension and dense feature space to learn their shared and private representations; then developing a multi-kernel representation learning model, and designing a distance learning strategy and a reconstruction strategy of the kernel functions by calculating an incidence relationship among the kernel functions so as to select representative kernel functions; and finally, constructing an optimal kernel function combination based on the incidence relationship between representative kernel functions and the original kernel functions for revealing the relationship between the multi-attribute similarities of the drugs and the adverse reactions between drugs, so as to predict the adverse reactions between drugs.
A technical solution for the present invention is as follows:
A method for predicting adverse reactions between drugs based on multi-attribute and multi-kernel representation learning, including the following steps:
Further, the objective function constructed in S2 is as follows:
the objective function can be formulated by augmented Lagrangian function and further solved by an alternating direction multipliers method and a non-negative matrix factorization optimization method so as to obtain iterative update solutions of the shared representation P and private representation Qm of the multi-attribute feature space of the drugs and the reconstructed coefficient matrix Um of the multi-attribute feature space, m=1, . . . , M, and the maximum number of iterations or a minimum change threshold of the objective function are set and variables are iteratively updated to obtain an optimal solution of the objective function.
Further, the distance learning strategy of the kernel functions constructed in S3 is as follows:
and
Further, the objective function of the multi-kernel representation learning model constructed in S4 is as follows:
Further, the mapping relationship between the multi-attribute and multi-kernel representations between drugs and the adverse reactions of the R-layer neural network in S5 is as follows:
The present invention has the following beneficial effects: compared with a conventional prediction method, in light of the differences of multiple attributes of the drugs in revealing the potential characteristics of the adverse reactions between drugs, the consistent optimal kernel function combination is constructed. The optimal kernel function embodies preference and tendencies of various kernel functions when calculating the multi-attribute similarities, and the potential relationship between the multi-attribute similarities of the drugs and the adverse reactions between drugs is established, thereby improving the accuracy of predicting the adverse reactions between drugs. The present invention is capable of providing data support to experimental research on the adverse reactions between drugs, improves the clinical experimental research of the adverse reactions between drugs, and is of great significance in promoting clinical medication safety.
The present invention will be described in detail below in combination with drawings and specific embodiments.
As shown in
As shown in
In the step S1, collecting the data of the adverse reactions between drugs and the multi-attribute information of the drugs. In the embodiment, data of the adverse reactions between drugs is collected from a TWOSIDES database [N. P. Tatonetti, P. P. Ye, D. Roxana, R. B. Altman, Data-driven prediction of drug effects and interactions, Science Translational Medicine 4 (125) (2012) 1-26]. The TWOSIDES data base records adverse reactions of the combined use of two drugs. In the embodiment, a relationship for predicting the adverse reactions between drugs is established by means of attribute data such as molecular structures, target, pathways, side effects, phenotypes, and diseases of the drugs. Molecular structure and target information of the drugs is originated from a DrugBank database [D. S. Wishart, Y. D. Feunang, A. C. Guo, et al. DrugBank 5.0: A major update to the DrugBank database for 2018[J]. Nucleic Acids Research, 2018, 46(D1): D1074-D1082.]. Pathway and disease information of the drugs is originated from a KEGG database [M. Kanehisa, M. Furumichi, Y. Sato, et al. KEGG: Integrating viruses and cellular organisms[J]. Nucleic Acids Research, 2021, 49(D1): D545-D551.]. Side effect information of the drugs is originated from an SIDER database [M. Kuhn, I. Letunic, L. J. Jensen, et al. The SIDER database of drugs and side effects[J]. Nucleic Acids Research, 2016, 44(D1): D1075-D1079.]. Phenotype information of the drugs is originated from a CTD database [A. P. Davis, C. J. Grondin, R. J. Johnson, et al. Comparative toxicogenomics database (CTD): Update 2021[J]. Nucleic Acids Research, 2021, 49(D1): D1138-D1143.]. Based on data of the adverse reactions between drugs and the multi-attribute data of the drugs, totally 1188258 kinds of adverse reactions between drugs are acquired in the embodiment, involving N=567 drugs and K=258 adverse reactions. Basically, common drugs and adverse reactions are covered. The data collected by using the collection method is of higher reliability.
A drug set D={d1, d2, . . . , dN} is given, for adverse reactions between drugs di and dj, constructing a vector rij∈{0, 1}K to denote an adverse reaction relationship between the ith aj, where K denotes the number of of the types of adverse reactions, drug di and the jth drug dj, where K denotes the number of of the types of adverse reactions, and if the kth adverse reaction is induced by the interaction between di and dj, then, rkij=1; otherwise, rkij=0. For the attribute data such as molecular structures, target, pathways, side effects, phenotypes, and diseases of the drugs, binary vectors are used to denote the attribute representation of the drugs. Vector elements 1 and 0 respectively denote whether the drugs contain representation information of corresponding attributes. The source databases and representation dimensions of the multi-attribute information of the drugs are shown in Table 1. A matrix Xm∈RN×L
In the step S2 of the embodiment, the shared and private representation of multi-attribute representations of drugs are learned. As shown in Table 1, due to highly dimensional and sparse representations of different attributes of the drugs and differences in representations dimensions of different attributes, the multi-attribute feature spaces of the drugs are projected to a same low-dimensional dense space to learn the shared and private representations of the multi-attribute informative information of the drugs.
In the step, the shared representations denote that different attributes have consistent contribution information for predicting the adverse reactions between drugs, and the private representations denote that different attributes contain specific information of each attribute, which plays a supplementing role in predicting the adverse reactions between drugs. In the step, by establishing the potential relationship between the shared and private representations and the original feature space of the drug attributes, that is, the feature space of each attribute consists of the shared representations and respective private representations in the projected low-dimension sense space, consistency information and supplementary information of the multi-attribute feature space are revealed. Therefore, a objective function of the shared representations and private representations denoted by the multi-attribute representations of the drugs can be written as follows:
The equation (1) can be formulated by the augmented Lagrangian function and further optimized by the alternating direction multipliers method (ADMM), and the non-negative matrix factorization optimization method so as to acquire iterative update solutions of the shared representations P and private representations Qm of the multi-attribute feature space of the drug and the reconstructed coefficient matrix Um of the multi-attribute feature space, m=1, . . . , M. The maximum number of iterations or a minimum change threshold of the objective function are set and variables are iteratively updated to obtain an optimal solution of the objective function.
In the step S3 of the embodiment, the distance learning strategy of kernel functions and the reconstruction strategy of the kernel functions are designed. Based on the shared representations P and the private representations Qm of M attributes acquired in step 2, the shared representations of the drug di are denoted as Pi⋅, and the private representations of the mth attribute of the drug di are denoted as Qi⋅m, m=1, . . . , M. The private representations of the M attribute spaces of the drug di are concatenated to acquire the private representation Qi⋅=[Qi⋅1, Qi⋅2, . . . , Qi⋅M] of the drug di.
K={κ1, κ2, . . . , κL} is given to denote L kernel function sets, where κl denotes the lth kernel function κl. First, the share and private representations of drugs are projected to the high-dimensional space with a projection function ϕl(⋅), and the inner products of two drug attribute representations are calculated in the high-dimensional space. For the adverse reactions between drugs di and dj, the shared representations Pi⋅, Pj⋅ are taken as inputs of the kernel function κl, to be regarded as similarity measures κl(Pi⋅, Pj⋅)=ϕl(Pi⋅)Tϕl(Pj⋅), the private representations Qi⋅, Qj⋅ of di and dj are also taken as inputs of the kernel function κl, to be regarded as similarity measures κl(Qi⋅, Qj⋅)=ϕl(Qi⋅)Tϕl(Qj⋅), l=1, 2, . . . , L. w=[w1, w2, . . . , wL] is given to denote the weight vectors of L kernel functions. Therefore, multi-kernel representations of the shared representations and private representations of the drugs di and dj can be written as follows:
For the kernel function κl, κlP, κlQ∈RN×N respectively denote similarity matrixes of the shared representations and the private representations of the adverse reactions between drugs based on the kernel function κl, with matrix elements [κlP]ij=κl(Pi⋅, Pj⋅), [κlQ]ij=κl(Qi⋅, Qj⋅).
To select the appropriate representative kernel function to construct the optimal kernel function combination, the distance between the kernel functions κl and κs can be regarded as the similarity between the kernel functions κl and κs. The less the distance between the kernel functions κl and κs is, the larger the similarity between the kernel functions κl and κs is, so the probability that the kernel function κl is capable of being used to represent the kernel function κs is higher. A kernel function incidence matrix Y is designed, and a matrix element Yls is a probability that the kernel function κl can be used to represent the kernel function κs. Therefore, for the similarity matrixes κlP of the shared representations and κlQ of the private representations in terms of the kernel function κl, the distance between the kernel functions κl and κs of the shared representations and the private representations can be denoted as follows:
The matrixes DP, DQ∈RL×L respectively denote the similarity matrixes of the L kernel functions on the shared representations and the private representations. Therefore, in combination with the kernel function incidence matrix Y, the distance learning strategy of the kernel functions can be written in the following:
1 and 0 respectively denote an all-1 vector and an all-0 vector, Y≥0 denotes non-negativity of elements in the kernel function incidence matrix Y, diag(Y)=0 denotes that the diagonal element of the matrix Y is 0,
guarantees that a probability sum of L kernel functions as the representative kernel functions to represent the kernel function κs is 1, equivalent to a bound term YT=1 in the equation (4). Yls is the probability that the kernel function κl represent the kernel function κs, the weight wl of the kernel function κl can be regarded as a mean value of the probability that κl serves as the representative kernel function to represent the L kernel functions, that is,
To estimate the kernel function incidence matrix Y, the shared representations matrix κlP of the adverse reactions between drugs based on the kernel function κl can be represented by the shared representations matrix κsP of the adverse reactions between drugs of other kernel functions κs, and the private representations matrix κlQ of the adverse reactions between drugs based on the kernel function κl can also be represented by the private representations matrix κsQ of the adverse reactions between drugs of other kernel functions κs, s=1, 2, . . . , L, s≠l. Therefore, the reconstruction strategy of the kernel functions can be written as the following rule items:
In the step S4 of the embodiment, the multi-kernel representation learning model is designed, the representative kernel functions are selected, and the optimal kernel function combination is constructed. Based on the distance learning strategy of the kernel functions given in the equation (4) and the reconstruction strategy of the kernel functions given in the equation (5), the multi-kernel representation learning model is constructed, and the objective function of the model can be written as follows:
The regularization parameter λ controls the wight of the distance learning of the kernel functions. The objective function can be formulated by the augmented Lagrangian function and further optimized by the ADMM method, to acquire the iterative update solution of the kernel function incidence matrix Y. By setting a maximum number of iterations or a minimum change threshold of the objective function, matrix Y is iteratively updated to finally acquire the optimal solution of the objective function.
Based on the optimized kernel function incidence matrix Y, the weight wl of the kernel function κl can be denoted as
The weights of the L kernel functions are sequenced, rL kernel functions with the maximum weights of the kernel functions are selected as the representative kernel functions, and the representative kernel function set can be denoted as
and the private representations of the adverse reactions between drugs of kernel function κl can be also reconstructed by the similarity matrix of the private representations of the adverse reactions between drugs of the representative kernel functions, i.e.,
where Y
denotes the reconstruction of the similarity matrix κlP of the shared representations of the adverse reactions between drugs of the kernel function κl by the similarity matrix κ
The representative kernel function set
In the step S5 of the embodiment, the model for predicting adverse reactions between drugs is constructed by using the optimal kernel function combination. A vector rij is given to denote the adverse reaction relationship between the drugs di and dj, and an R-layer neural network is designed to mine a potential relationship between the multi-attribute and multi-kernel representations of the drugs and the adverse reactions. The equation (7) gives the multi-kernel representations κw(Pi⋅, Pj⋅), κw(Qi⋅, Qj⋅) of the shared representations and the private representations of the drugs di and dj based on the representative kernel functions, so that the mapping relationship between the multi-attribute and multi-kernel representations between drugs and the adverse reactions can be written as follows:
h(r), E(r), and b(r) respectively denote the output vector, the coefficient matrix, and the offset vector of the rth layer of the neural network. κwij denotes the concatenation of the multi-kernel representations of the shared representations and the private representations of the drugs di, and dj, that is, κwij=[κw(Pi⋅, Pj⋅), κw(Qi⋅, Qj⋅)]. Particularly, the output vector h(R) of the Rth layer of the neural network can be regarded as the predicted value
In the step S6 of the embodiment, multi-attribute information Xa⋅m and Xb⋅m, m=1, 2, . . . , M, of the drugs da and db is given to predict the adverse reactions between the drugs da and ab. Through the step S2 of the embodiment, the shared representations Pa⋅ and Pb⋅ and the private representations Qa⋅m, Qb⋅m, of the multi-attribute information of the drugs da and db are obtained, m=1, 2, . . . , M, and the private representations of M attributes of da and db are concatenated, i.e. Qa⋅=[Qa⋅1, Qa⋅2, . . . , Qa⋅M], Qb⋅=[Qb⋅1, Qb⋅2, . . . , Qb⋅M]. Based on the kernel function set K={κ1, κ2, . . . , κL}, the similarity measures κl(Pa⋅, Pb⋅), κl(Qa⋅, Qb⋅), of the shared representations and the private representations of the drugs da and ab are calculated, l=1, 2, . . . , L. Then, based on the representative kernel function set
The method for predicting adverse reactions between drugs based on multi-attribute and multi-kernel representation learning provided in the embodiment, by constructing the optimal kernel function combination, explores the potential characteristic rules of different attributes in modeling the adverse reactions between drugs, and reveals the relationship between the multi-attribute similarities of the drugs and the adverse reactions between drugs, thereby realizing prediction of the adverse reactions between drugs. The predicted result can provide data support for research on the adverse reactions between drugs based on the bio-experimental method and research on the safety of the new drug.
Number | Date | Country | Kind |
---|---|---|---|
2023116173052 | Nov 2023 | CN | national |