The present disclosure relates to a system for obtaining a therapeutic solution for treatment of a disease or a disorder and also relates to a method of drug discovery and obtaining therapeutic solutions for various medical conditions, through the system.
The process of discovering a drug and bringing it to the market is highly complicated and takes an average of 10-15 years and costs around $800 million to $1 billion. This cost takes into account all the failures. For every 5000-10000 compounds that enter the research and development pipeline, ultimately only one compound receives approval. Every successful drug that reaches the market has to undergo a long process of pre-clinical and clinical testing. The entire process starting from understanding of a disease to the pre-clinical testing consumes about three to six years whereas the clinical testing phase consumes an average time span of six to seven years. A candidate drug must go through extensive studies in humans, and must prove to be safe and effective before the Food and Drug Administration (FDA) and other regulatory agencies approves the drug.
The success rates for different therapeutics vary among the different stages of clinical trials. For instance, there's only a 66% chance that a compound will get through Phase I, but only a 33% chance of success in Phase II, and a 55% chance in Phase III. Once the drug has been submitted for regulatory approval, the success rate shoots to 80%. In the year 2008 to 2010, the reported Phase II failures for new drugs and major new indications of existing drugs included 51% due to insufficient efficacy, 29% due to strategic reasons like inadequate differentiation from more advanced drugs in the same class or from drugs with similar indications in another mechanistic class and 19% due to clinical or preclinical safety reasons. Although, the success rate in the Phase II trials is a significant determining factor, the benefit-risk ratio and the increasing complexity of diseases are the major reasons for determining the probability of a compound getting approved.
Disorders like Rheumatoid Arthritis (RA), Cancer, Atherosclerosis, Leukocyte defects, autoimmune disorders, inflammatory disorders etc. are considered to be complex diseases severely affecting large populations on a global scale. Developing therapeutic drugs to these disorders with complex pathophysiology poses a major challenge owing to the very low success rate of such developed drug molecules in clinical trials.
For instance, Phase I success rate for RA therapeutics is known to be 88%, much higher than the industry standard of 64%. The Phase II success rate is 22%, which is significantly lower than the industry standard of 39%. Phase III yields a success rate of 86% for this indication, higher than the industry standard of 66%. Cumulatively, the success rate for RA is 16%, which is at par with the industry standard of 16%. This means that out of 6 drug candidates clinically investigated in this disease area, only one will make it to market successfully. [Jayasundara K S et al. J Rheumatol. 2012; Sep. 1]
Further, oncology drug development has a poor track record in clinical development and the lowest success rate overall. In the therapeutics sector, the highest overall success rate from Phase I through likelihood of approval is infectious diseases, followed by endocrine system drugs at 10.4%, autoimmune diseases at 9.4% and cancer drug success rate of 4.7% is the lowest. The reasons for a very low success rate in cancer is due to the complexity of the multi-phenotype disease that in itself is not one disease, but highly heterogeneous. At the molecular level, two cancers are hardly identical and this results in extremely varied drug responses between individual patients even for the same kind of cancer. The cancer preclinical models used to predict clinical efficacy and toxicity are insufficient and unreliable in predicting clinical outcomes in different patient populations. They lack the complexity of the differences between individual patients and also the heterogeneity of the tumor microenvironment within a particular tumor tissue. Additionally, issues related to the design of clinical trials in oncology also contribute to the low success rate in this therapeutic area.
Rheumatoid Arthritis, a complex auto-immune disorder affecting the joints and cancer, another complex disorder caused by uncontrolled growth of cells affecting specific and different tissues, affect a significant population for which the current treatment regimens are inadequate in treating or curing the disease. There is a need for developing newer therapies for these complex diseases. One of the key reasons for the failure of newly developed drug treatments, whether small molecule or biologics, is not being able to get a thorough insight and understanding of how the drug works on the disease endpoints and not being able to predict clinical outcomes early on in the process. The preclinical experimental systems used, whether in vitro cells or animal models, are all black boxes in terms of being able to see how the particular drug works. All that can be monitored is the endpoint being assayed. The biological system is a very complex network with a maze of interactions intra- and inter-cellularly and due to this cross-talk, there are multiple effects of manipulating a single target and pathway besides what is being observed. Some of the effects, if acute, are directly observed through a phenotypic manifestation while some other effects could be more indirect and could manifest to a phenotype more subtly and over a longer time interval. A thorough understanding of the biology of the manipulations in the disease and control systems is needed to avoid pitfalls in lack of efficacy or toxicity effects later in the process of drug development.
Lack of a transparent and predictive experimental system has been a key missing link in the success rate of drug discovery and development. This when compared to other orthogonal and equally complex areas of semi-conductor engineering, space, automotive, climate, traffic etc. where we have technology to predict outcomes based on different ‘what-if’ scenarios, is what the key regulatory agencies like the FDA realized is missing in the Pharma industry. Early identification of failures is equally important to increase the success rate of drug development and this insight can be obtained through creating a virtual physiology model of the disease that makes the complex network maze transparent and dynamic and allows prediction of endpoints and clinical outcomes.
The current treatment regimens to these diseases have issues with toxicity and also developing resistance to the treatment over time. This is the case with the disease modifying (DMARD) drugs for RA that are known to stop being effective after some time. For cancer too, the cytotoxic drugs that work by killing the cancer cells also affect all the rapidly dividing cells in the body thereby having severe adverse consequences on the entire system as a whole. These drugs have a very narrow therapeutic window. The targeted drugs for cancer as in RA are known to become resistant after sometime. The common theme across these multi-phenotype, heterogeneous diseases is that we need treatments that are targeting multiple aspects of the disease and those which do not become resistant over time. One way to tackle this is to have a combination drug approach with combining drugs affecting different targets and pathways. To be able to test these combinations in a high throughput manner at different concentrations and monitor the impact on multiple disease endpoints instead of one phenotype assay, a simulation based predictive approach is required at the cellular and molecular abstraction level that can in turn be transformed to disease phenotype functions based on biomarkers. This can tremendously decrease the time taken for the development of a novel therapy for complex diseases and also bring down the failure rate of therapies and hence the cost involved in the initial phases of drug development including target identification etc. This emphasizes the need for novel comprehensive virtual experimental systems using in-silico approach that aids in rationalizing target selection and prioritization.
The disclosure includes a novel methodology of designing therapeutic solutions for complex, multi-phenotype human diseases such as the autoimmune disorders like Rheumatoid Arthritis and the varied kinds of cancers, using a virtual, predictive, physiology aligned disease model. This system comprises of a comprehensive network of signaling and metabolic regulation at the intra-cellular and inter-cellular level underlying the various phenotypes of the disease. The disease is induced through various triggers and/or cellular changes caused by mutations. A digital drug library is created through representing the functional modulation of the drug targets. This digital library is run in an automated high-throughput engine and the impact on various biomarkers and phenotypes is assessed. Millions of studies of the combinations at different concentrations are tested and through designed cost functions of efficacy, low toxicity and PKPD compatibility of the drugs and compounds, the combinations for specific disease profiles and indications are identified. This process which would normally take 6-8 years conventionally can be achieved in 1-2 months. The designs are then tested and validated in appropriate and standard preclinical models of the disease. The technology and methodology allows the generation of a pipeline of effective combination therapies that can be personalized to specific profiles of the disease.
One example aspect disclosed in the present disclosure includes a system for obtaining a therapeutic solution for treatment of a disease. In one example embodiment, the system includes a database. One example embodiment of the database comprises one or more unit models, wherein the one or more unit models are configured by collating information on parameters of biological system from plurality of information sources. Further there may be one or more cell system models. The one or more cell system models are obtained by integrating the one or more unit models, the one or more cell system models facilitates simulation of at least one of the biological system or homeostatic state of the biological system. In addition, one or more disease models are obtained by integrating the one or more cell system models. The disease models facilitate simulation of the perturbed state of the biological system.
A digital drug library may be formed by combining information on plurality of categories of digital drug capsules.
A processor is communicatively connected to the database and the digital drug library and the processor is configured to transmit one of the one or more disease models along with a set of drugs from the digital drug library to a scheduler. It is noted that the set of drugs are selected from the digital drug library based on a predefined cost function, wherein the scheduler distributes one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. In addition, the scheduler is connected to the one or more computing devices through a network.
The processor also is configured to receive an output comprising the effect of the set of drugs on the disease model from the scheduler, wherein the scheduler transmits the output to the processor by combining matching results from the one or more computing devices. The processor is also configured to analyze the output to obtain the therapeutic solution in context of said disease.
The present disclosure further relates to a method of obtaining a therapeutic solution for treatment of a disease. In one example embodiment, the method comprises transmitting one of one or more disease model along with a set of drugs from a digital drug library to a scheduler. The set of drugs are selected from the digital drug library based on a predefined cost function. Then, the scheduler distributes the one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. Thescheduler is connected to the one or more computing devices through a network. The method also comprises receiving an output comprising the effect of the set of drugs on the diseased model from the scheduler, wherein the scheduler transmits the output to the processor by combining matching results from the one or more computing devices. Further the method comprises analyzing the output to obtain the therapeutic solution in context of said disease.
In an embodiment, the present disclosure further relates to a non-transitory computer readable medium wherein operations are stored which are processed by at least one processing unit. This processing causes the system to perform the acts of transmitting one of one or more disease model along with a set of drugs from a digital drug library to a scheduler. The set of drugs are selected from the digital drug library based on a predefined cost function. The scheduler then distributes one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. The scheduler is connected to the one or more computing devices through a network.
The processing unit thereafter receives an output comprising the effect of the set of drugs on the diseased model from the scheduler. The scheduler transmits the output to the processor by combining matching results from the one or more computing devices. The processing unit further analyzes the output to obtain the therapeutic solution in context of said disease.
In another embodiment, the present disclosure relates to a computer program for obtaining a therapeutic solution for treatment of a disease. The computer program comprises code segment for transmitting one of one or more disease model along with a set of drugs from a digital drug library to a scheduler. The set of drugs are selected from the digital drug library based on a predefined cost function. The scheduler then distributes the one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. The scheduler is connected to the one or more computing devices through a network, code segment for receiving an output comprising the effect of the set of drugs on the diseased model from the scheduler. The scheduler transmits the output to the processor by combining matching results from the one or more computing devices, and code segment for analyzing the output to obtain the therapeutic solutions in context of said disease.
The features of the present disclosure are set forth with particularity in the appended claims. The embodiments of the disclosure itself, together with further features and attended advantages, will become apparent from consideration of the following detailed description, taken in conjunction with the accompanying drawings. One or more embodiments of the present disclosure are now described, by way of example only, with reference to the accompanied drawings wherein like reference numerals represent like elements and in which:
The figures depict embodiments of the disclosure for purposes of illustration only. One skilled in the art will readily recognize from the following description that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles of the disclosure described herein.
The present disclosure relates to a system for obtaining a therapeutic solution for treatment of a disease. The system comprises a database having one or more unit models. The one or more unit models are configured by collating information on parameters of biological system from plurality of information sources. The database further comprises one or more cell system models, wherein the one or more cell system models are obtained by integrating the one or more unit models. The one or more cell system models facilitate simulation of at least one of the biological system or homeostatic state of the biological system. The database further comprises one or more disease models, wherein the one or more disease models is obtained by integrating the one or more cell system models. The disease models facilitate simulation of the perturbed state of the biological system.
The system further comprises a digital drug library formed by combining information on plurality of categories of digital drug capsules.
The system also comprises a processor, communicatively connected to the database and the digital drug library. The processor is configured to transmit one of the one or more disease model along with a set of drugs from the digital drug library to a scheduler. The set of drugs are selected from the digital drug library based on a predefined cost function. The scheduler distributes one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. The scheduler is connected to the one or more computing devices through a network. The processing unit receives an output comprising the effect of the set of drugs on the disease model from the scheduler, wherein the scheduler transmits the output to the processor by combining matching results from the one or more computing devices. The processing unit also analyzes the output to obtain the therapeutic solution in context of said disease.
In an embodiment of the present disclosure, the network is a cloud based network.
In another embodiment of the present disclosure, the one or more computing devices test effect of at least one of the set of drugs from the digital drug library to analyze the efficacy of the set of drugs on disease model based on the predefined cost function.
In another embodiment of the present disclosure, the one or more unit models comprise assimilation of information. The information is at least one selected from pathways or networks or biomarkers. The pathways are selected from a group comprising signaling pathway, metabolic pathway, apoptotic signaling pathway and signal transduction pathway. The networks are selected from a group comprising genomic networks, and protein networks; and the biomarkers are specific for the unit model.
In yet another embodiment of the present disclosure, the at least one information is selected from a group comprising biological information, therapeutic information, pathological information, computational information, information on mutation, information pertaining to kinetic rate laws and information pertaining to kinetic rate parameters or any combination thereof.
In still another embodiment of the present disclosure, the parameters are selected from at least one of biological pathway, biological network and biomarker.
In still another embodiment of the present disclosure, the biomarkers are selected from a group comprising proteins, metabolites, nucleic acids, ions, nutrients, hormones, lipids, transporters, receptors and enzymes or any combination thereof.
In still another embodiment of the present disclosure, the cell system model comprises cell selected from a group comprising but not limiting to white blood cells, dendritic cell, B Lymphocyte, Helper T Lymphocytes, Cytotoxic T Lymphocytes, Mast cell, Beta-Pancreatic cell, Cardiomyocyte, E. Coli, Endothelial Cell, Fibroblast, Adipocyte, Hepatocyte, Keratinocyte, Macrophage, Melanocyte, Mycobacterium Tuberculosis, Neutrophil, Osteoblast, Osteoclast, Skeletal Muscle, Tumor Cell, Epithelial cells, Plasma cells, Natural killer cells, other inflammatory related cells and any other human cell system or cell lines or any combination thereof.
In still another embodiment of the present disclosure, the digital drug capsule comprises at least one of a specific drug, small molecule, biomolecule, small inhibitory molecule or ligand or a combination thereof; and containing information selected from a group comprising mechanism of action (MOA) of the drug, pharmacological properties of the drug, including IC50, Cmax, bioavailability, AUC, Tmax and half-life, physical properties of the drug, including compound structure, molecular formula and molecular weight, pharmaceutical formulation information of the compounds within the drug, information pertaining to approved or safe dosing range of the drug, therapeutic category to which the drug belongs, conditions for which the drug has been indicated, information pertaining to off-target effects, drug interactions and adverse events associated with the drug, manufacturer details specific for the drug, patent information specific to the drug and indication specific alignment information including trends observed for biomarkers, phenotypes and disease scores in experiments performed on patients, animal models and cell-line cultures, or any combination thereof.
In still another embodiment of the present disclosure, the digital drug library comprises at least one of a single target drug, a multi-target drug, pseudo-drug and hypothetical drug comprising action selected from a group comprising single target or multi-target or any combination thereof.
In still another embodiment of the present disclosure, the biological system comprises processes selected from a group comprising gene transcription, RNA translation, signaling pathway, metabolic pathway, antigen presentation, signal transduction pathway, gene over-expression, gene knock-down, gene knock out, gene inhibition, genomic network, protein network, cell cycle, whole cell simulation and cell growth or any combination thereof.
In still another embodiment of the present disclosure, the therapeutic solution is selected from a group comprising drug, combination of drug or targets, combination of drug and target, combination of pseudo-drug and a target.
The present disclosure relates to a method for obtaining a therapeutic solution for treatment of a disease. The method comprises transmitting one of the one or more disease model along with a set of drugs from the digital drug library to a scheduler. The set of drugs are selected from the digital drug library based on a predefined cost function. The scheduler distributes one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. The scheduler is connected to the one or more computing devices through a network. The processing unit receives an output comprising the effect of the set of drugs on the disease model from the scheduler, wherein the scheduler transmits the output to the processor by combining matching results from the one or more computing devices. The processing unit also analyzes the output to obtain the therapeutic solution in context of said disease.
In an embodiment, the disease models facilitates simulation of the perturbed state of the biological system. The one or more disease model is obtained by configuring one or more unit models by collating information on parameters of biological system from plurality of information sources. The disease model further comprises one or more cell system models, wherein the one or more cell system models are obtained by integrating the one or more unit models. The one or more cell system models facilitate simulation of at least one of the biological system or homeostatic state of the biological system.
In another embodiment of the present disclosure, the digital drug library is formed by combining information on plurality of categories of digital drug capsules.
In yet another embodiment of the present disclosure, the biological system comprises processes selected from a group comprising gene transcription, RNA translation, signaling pathway, metabolic pathway, antigen presentation, signal transduction pathway, gene over-expression, gene knock-down, gene knock out, gene inhibition, genomic network, protein network, cell cycle, whole cell simulation and cell growth or any combination thereof.
In another embodiment of the present disclosure, the one or more unit models comprises assimilation of information. The information is at least one selected from pathways or networks or biomarkers. The pathways are selected from a group comprising signaling pathway, metabolic pathway, apoptotic signaling pathway and signal transduction pathway. The networks are selected from a group comprising genomic networks, and protein networks; and the biomarkers are specific for the unit model.
In still another embodiment of the present disclosure, the at least one information is selected from a group comprising biological information, therapeutic information, pathological information, computational information, information on mutation, information pertaining to kinetic rate laws and information pertaining to kinetic rate parameters or any combination thereof.
In still another embodiment of the present disclosure, the unit model comprises species and bio-molecular interactions across different parts of a cell selected from at least one of cytoplasm, nucleus, mitochondria, Endosome, Endoplasmic Reticulum, Golgi Apparatus, Inner mitochondrial membrane, Inner membrane space, Lysosome, Membrane, Melanosome, Rough Endoplasmic Reticulum, Mitochondrial Matrix, Accessory Compartment, and extracellular space.
In still another embodiment of the present disclosure, obtaining one or more cell system models comprises assimilation of information on plurality of cell lines and mutation profiles.
In still another embodiment of the present disclosure, the cell system model comprises cell selected from a group comprising white blood cells, dendritic cell, B Lymphocyte, Helper T Lymphocytes, Cytotoxic T Lymphocytes, Mast cell, Beta-Pancreatic cell, Cardiomyocyte, E. Coli, Endothelial Cell, Fibroblast, Adipocyte, Hepatocyte, Keratinocyte, Macrophage, Melanocyte, Mycobacterium Tuberculosis, Neutrophil, Osteoblast, Osteoclast, Skeletal Muscle, Tumor Cell, Epithelial cells, Plasma cells, Natural killer cells, other inflammatory related cells and any other human cell system or cell lines or any combination thereof.
In still another embodiment of the present disclosure, the method further comprises validation for optimization of model parameters and alignment of datasets.
In still another embodiment of the present disclosure, the model parameters are selected from at least one of biological pathway, biological network and biomarkers.
In still another embodiment of the present disclosure, changes in levels of the parameter are defined by specific trigger which represent perturbation in the homeostatic state, thereby inducing and representing the disease, wherein said perturbation leads to change in level of the biomarkers.
In still another embodiment of the present disclosure, the biomarkers are selected from a group comprising proteins, metabolites, nucleic acids, ions, nutrients, hormones, lipids, transporters, receptors and enzymes or any combination thereof.
In still another embodiment of the present disclosure, the perturbation lead to assertive statement which indicate the positive or negative adherence of the alignment and the validation dataset in the model, wherein the assertive statement is based on at least one of quality and quantity of the expected trend of a biomarker specific for the perturbed parameters within said model.
In still another embodiment of the present disclosure, the digital drug capsule comprises at least one of a specific drug, small molecule, biomolecule, small inhibitory molecule or ligand or a combination thereof; and containing information selected from a group comprising mechanism of action (MOA) of the drug, pharmacological properties of the drug, including IC50, Cmax, bioavailability, AUC, Tmax and half-life, physical properties of the drug, including compound structure, molecular formula and molecular weight, pharmaceutical formulation information of the compounds within the drug, information pertaining to approved or safe dosing range of the drug, therapeutic category to which the drug belongs, conditions for which the drug has been indicated, information pertaining to off-target effects, drug interactions and adverse events associated with the drug, manufacturer details specific for the drug, patent information specific to the drug and indication specific alignment information including trends observed for biomarkers, phenotypes and disease scores in experiments performed on patients, animal models and cell-line cultures, or any combination thereof.
In still another embodiment of the present disclosure, the digital drug library comprises at least one of a single target drug or a multi-target drug, pseudo-drug and hypothetical drug comprising novel mechanism of action selected from a group comprising single target or multi-target or any combination thereof.
In still another embodiment of the present disclosure, the matching comprises: simulating the perturbed state of the disease model with the information from the set of drugs; and
optimizing the dosage recursively by the drug efficacy characterization to achieve perfect alignment to trends observed in disease specific biomarker or phenotype or a combination thereof.
In still another embodiment of the present disclosure, the therapeutic solution is selected from a group comprising drug, combination of drug or targets, combination of drug and target, combination of pseudo-drug and a target.
In still another embodiment of the present disclosure, analyzing the output comprises analyzing effect of the therapeutic solution by clinical assessment selected from at least one of disease scores, synergy scores, phenotype scores and biomarker scores. In another embodiment, the analysis is done either manually or in an automated manner, preferably in an automated manner, to arrive at a therapeutic solution based on predefined cost functions.
In still another embodiment of the present disclosure, the perturbed state representing the disease is selected from at least one of autoimmune diseases, cancers, dermatological diseases, infectious diseases, cardiac conditions, pulmonary diseases, renal diseases, nerve diseases or neurological disorders, inflammatory disorders or any other human diseases.
In an embodiment, the present disclosure further relates to a non-transitory computer readable medium wherein operations are stored which are processed by at least one processing unit. This processing causes the system to perform the acts of transmitting one of one or more disease model along with a set of drugs from a digital drug library to a scheduler. The set of drugs are selected from the digital drug library based on a predefined cost function. The scheduler then distributes one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. The scheduler is connected to the one or more computing devices through a network.
The processing unit thereafter receives an output comprising the effect of the set of drugs on the diseased model from the scheduler. The scheduler transmits the output to the processor by combining matching results from the one or more computing devices. The processing unit further analyzes the output to obtain the therapeutic solution in context of said disease.
In another embodiment, the present disclosure relates to a computer program for obtaining a therapeutic solution for treatment of a disease. The computer program comprises code segment for transmitting one of one or more disease model along with a set of drugs from a digital drug library to a scheduler. The set of drugs are selected from the digital drug library based on a predefined cost function. The scheduler then distributes the one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. The scheduler is connected to the one or more computing devices through a network, code segment for receiving an output comprising the effect of the set of drugs on the diseased model from the scheduler. The scheduler transmits the output to the processor by combining matching results from the one or more computing devices, and code segment for analyzing the output to obtain the therapeutic solutions in context of said disease.
The present disclosure thus aims at arriving at a system which provides innovative therapies by overcoming the difficulties in drug discovery and deficiencies of the prior art to treat a host of varied disease conditions, such as rheumatoid arthritis, cancer, dermatological disorder and inflammatory disorders. Such a system involves a process of employing a translational research methodology towards designing therapeutic solutions to treat said disease condition(s). The said methodology involves the following steps:
(1) translation of data obtained from data mining in to an in-silico model, (2) creation of a digital drug library, (3) high-throughput simulation of a digital drug capsule in an in-silico disease model and (4) sorting and filtering of viable drugs to arrive at a therapy to treat said disease condition(s).
In the present disclosure, various aspects are described by taking certain conditions such as RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof, as examples, and the scope of the instant disclosure should not be limiting to only these diseases or disorders or conditions. The scope of the present disclosure encompasses the efficient concept of drug delivery including single drugs or combination of drugs for any disease or disorder or condition known to a person skilled in the art, and for which efficient way of drug discovery is required.
The present disclosure therefore relates to an in-silico disease model, which is a highly comprehensive and complex virtual network of multiple interacting in-silico cell systems. Such disease models are made to represent a host of varied disease conditions including, but not limiting to, rheumatoid arthritis, cancer, autoimmune diseases, dermatological disorder and inflammatory disorders.
In an embodiment of the present disclosure, an in-silico model development process for any medical condition such as that for RA or cancer or for that matter, any other disease condition is based on assimilation of data from various sources wherein the sources shall vary from literature to in-house experiments, data from clinical trials etc.
In an embodiment, the models illustrate a biological system by defining the cells responsible for aggravating the disease condition(s) and how these cells communicate differently in a disease environment in comparison to a homeostatic [non-diseased] environment. Once the identity of the relevant cells is established, the developer needs to identify as to what stimuli trigger these cells in to a disease state.
In an embodiment, the basis of any disease is mainly attributed to an increase or decrease of a single or multiple disease stimuli resulting in an increase or decrease of endpoint biomarkers secreted from the different interacting cells. In order that a cell responds to these disease stimuli, several stimuli receptors connected to a complex network of signal transduction pathways are expressed exclusively on a disease cell.
In an embodiment, assimilation of data from literature and identification of disease specific cell receptors and their associated signaling/metabolic pathways is important. For the disease triggers or stimuli to actually have an impact, it is necessary that they affect the homeostatic phenotypes associated with the non-disease or control systems. Identification of disease affected phenotypes and the associated change in different end-point cell biomarkers using an efficient data mining technique illustrates a disease. These disease conditions would have interacting cells and every cell would have responsible signal transduction pathways involving proteins, metabolites, ions and nucleic acids. Hence assimilation of data from the literature and identifying these relevant signal transduction networks with associated entities is of prime importance to develop a system.
In an embodiment, every signal transduction pathway would have a stimulus, also referred to as a ligand and a receptor protein which are stimuli specific. The signal received by a receptor is transmitted with the help of adaptor proteins. The adaptor proteins can communicate to different enzymes or proteins known as transcription factors to enhance or suppress the expression of certain biomarker genes. For development of an in-silico model all the above collected data would only help in identifying the relative positions of interacting entities and designing the complete network. However to simulate the network for dynamic behavior, data pertaining to kinetic rate parameters such as dissociation constants for ligand-receptor binding, rate constants for mass transfer, enzyme turn-over and efficiency values etc. is collated.
In an embodiment, if the model does not show the intended behavior. Hence, to test the model for alignment to an intended behavior, a large comprehensive set of alignment data is developed. This alignment dataset contains information from various literatures about different in-vivo and in-vitro experiments conducted on a disease animal model, a patient, a cell line co-culture, a single entity or a multiple entity perturbation on a cell line culture. The alignment dataset contains information pertaining to the experiment protocols and the observed results. An in-silico model trained and validated against such a large alignment dataset illustrates all the known behavior pertaining to the actual biological cell or disease system.
In an embodiment of the present disclosure, one of the aims of designing an in-silico disease model is to identify therapies to suppress or control the disease; hence it becomes important that the model also exhibits the known effects of a therapy. To achieve alignment for a therapy, data/information from the literature is assimilated. This information is categorized and comprehensive alignment dataset is developed to cover the experiment protocol, the triggers used and the published results of a drug's effect. These affect the level of target efficacy, end-point biomarkers and the phenotypes. For simulating a drug effectively on a model, data pertaining to the drug's pharmacological properties (such as and not limited to IC50, Cmax, bioavailability, AUC, Tmax and half-life), mechanism of action (MOA) covering all the known targets, information pertaining to dosing of the drug, the therapeutic category to which the drug belongs (such as and not limited to NSAIDS, DMARDs, Biologics, natural compounds etc.) and information pertaining to off-target effects and adverse events associated with the drug have to be searched and categorized. All this information regarding a drug is categorized and curated into a digital drug capsule and placed in to a digital drug library. The model that is perfectly aligned with all the information present in the digital drug library is used to identify and design new and better therapies to cure diseases including, but not limited to, rheumatoid arthritis, cancer, autoimmune diseases, dermatological disorder, inflammatory diseases and various diseases/disorders.
In one non limiting exemplary embodiment, the present disclosure relates to an in-silico rheumatoid arthritis (RA) model, cancer model and dermatological disorder model. The model has been designed to emulate the characteristics and phenotypic properties associated with the RA, cancer and dermatological disorder by responding to self-antibodies specific to a multiplicity of immunogens such as bacterial lipopolysaccharide, external antigenic stimuli, citrullinated peptides etc. The main aim of developing the system of the instant disclosure is to bring together, the information from all the research journals on immunology, molecular biology, cell biology, etc. that have been published over the years into a single unified virtual research platform. This virtual research platform is advantageous both with respect to time, research costs and at the same time to avoid failures pertaining to the benefit-risk ratio.
In an embodiment, scientific and technical terms as used herein connection with the present disclosure shall have the meanings as further described below:
Data mining: Data mining also referred to as scoping, is the process of extraction and assimilation of concept specific information from literature resources inclusive of online peer-reviewed journals, published texts and any other reviewed literature source.
Cloud computing system: Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over a network.
High level language: A high level language is a biological programming language that enables digital execution of virtual biological experiments in a user environment designed with a strong abstraction, from intricate machine details.
In-silico model: An in-silico model is a virtual biological experimental platform designed to predict stimuli based biological reactions and functions.
Disease model: An in-silico disease model is a virtual biological experimental platform designed to predict disease trigger based biological reactions and functions.
Alignment datasets: Alignment dataset refers to the group of assimilated biological data used to train the response function of an in-silico model.
Autocrine: An autocrine relates to a substance secreted by a cell and acting on surface receptors of the same cell.
Paracrine: A paracrine relates to a substance secreted by a cell and acting on surface receptors of other cells.
In an embodiment, the in-silico disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof is a highly comprehensive, complex virtual network. This network is designed using data from various publications over the past many years, say, for illustrating purpose, in the range of 5 to 10 years. This is built on a backbone of a large number of mathematical equations representing interactions between a large number of biochemical entities. The in-silico model also boasts of an alignment of results from various different experiments published in high impact journals.
In another embodiment, the process of obtaining information from data mining and providing to an in-silico model involves (1) creation and validation of unit pathway models, (2) creation and validation of cell system models and (3) creation and validation of disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof.
In another embodiment, the disclosure relates to a method of assigning dynamism to an in-silico model, said method involving (1) establishing viable representations for interactions between proteins, ions, metabolites and nucleic acids, (2) defining kinetic rate parameters for interactions between proteins, ions, metabolites and nucleic acids and (3) defining kinetic rate equations for interactions between proteins, ions, metabolites and nucleic acids.
In another embodiment, the disclosure relates to a method of automated integration of in-silico models, which involves (1) integration of in-silico unit pathway models to create a cell system model and (2) integration of in-silico cell system models to create a homeostatic model which on perturbations simulate a disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof.
In another embodiment, the disclosure relates to a method of translating experimental alignment data to a digital alignment data set, which involves (1) creating digital alignment data sets for an in-silico unit pathway model, (2) creating digital alignment data sets for an in-silico cell system model and (3) creating digital alignment data sets for an in-silico disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof.
In another embodiment, the disclosure relates to a method of obtaining digital alignment and validation of an in-silico model of RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof, which involves (1) optimizing the kinetic rate parameters to obtain digital alignment to the datasets and (2) high-throughput validation of the in-silico unit pathway model, in-silico cell system model and the in-silico disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof for alignment to the digital alignment data sets.
In an embodiment, the validation is a process to determine whether the given model is an accurate representation of the real system intended to study with the predictive model. In the validation process, the model is compared to actual system to calibrate the model and to determine the discrepancies between the model using validation datasets. This process is repeated until the acceptable model accuracy/predictability is reached.
In an embodiment of the present disclosure, the unit model, the cell system model and the disease models are validated with respective validation datasets. The unit models are validated with experimental data on biomarkers from perturbation on particular pathway and signaling. The cell system model are validated with data from cell line or cell culture based experiments where single or multi pathway/signaling are perturbed, and along with unit model validation datasets. The disease models, represents multi-cellular co-culture systems developed to represent a disease physiology/phenotype. Therefore, usually dataset from in-vivo animal models and human clinical trials are used to validate the model along with prospective validations from each of the cell system model.
In another embodiment, the disclosure relates to a method of employing a system for simultaneous validation of a large number of digital alignment data sets to in-silico model comprising an in-silico unit pathway model, an in-silico cell system model and an in-silico disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof over a distributed network.
In another embodiment, the disclosure relates to methods of computing and reporting the validation of an in-silico model comprising in-silico unit pathway model, an in-silico cell system model and an in-silico disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof against a large number of digital alignment datasets.
In another embodiment, the disclosure relates to a method of creating a digital drug capsule and a digital drug library.
In another embodiment, the disclosure relates to a method of automated high-throughput simulation of:
(1) “c/n” different dosages of individual digital drug capsule information in the in-silico disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof, where “c” represents the approved therapeutic dose for the drug and “n” is any number greater than 0; and
(2) x-degree combination (where x is any number greater than 0) of “c/n” different dosages of different individual drug capsule information in the disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof involving:
(2.1) Exhaustive “x-degree” combination of information from all “y” digital drug capsules present in the digital drug library; and
(2.2) Exhaustive “x-degree” combination of information from “y-m” digital drug capsules against information from “m” anchor digital drug capsules present in the digital drug library.
In another embodiment, the disclosure relates to the process of digital curation, which involves (1) curation of an in-silico model, (2) curation of digital alignment data set and (3) curation of digital drug capsules.
In another embodiment, the disclosure relates to a method of computing and reporting of simulated drug data towards sorting and filtering based on selected sort or filter parameters.
In another embodiment, the disclosure relates to a method of automated sorting and filtering of simulated drug data involving sorting and filtering on the basis of (1) N-ACR scores, (2) synergy scores, (3) phenotype scores, (4) biological target compatibility, (5) PK/PD data and/or (6) compound structural properties.
In another embodiment, data mining forms the core process that shapes the definition and functionality of the in-silico model representing biological concepts, in this case, a disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof.
In another embodiment, the development of unit model, cell system model, disease model representing RA or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof or co-culture model representing said condition, digital drug library and designing therapeutic solution is provided in the sections below. Although the development of these models and designing therapeutic solution in the below sections represent a specific disease condition, i.e., rheumatoid arthritis or cancer or autoimmune disease or dermatological disorder or inflammatory disorder or any combination thereof, the same should not be considered limiting to the said disease condition. In other words, similar procedure to arrive at the said models and therapeutic solution may be employed for treatment of other varied disease conditions. All such development and designing for all such disease conditions fall within the purview of the instant disclosure.
Referring now to
In an embodiment, the processor 102 transmits one of the one or more disease models along with a set of drugs from the digital drug library 106 to the scheduler 110. The said set of drugs are selected from the digital drug library 106 based on a predefined cost function. Then, the scheduler 110 distributes one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices 112 through the network 114. In a non-limiting embodiment, the network 114 is a distributed network which provides dynamic load balancing and transparent access to the resources available over the network 114. The network monitors the process for all possible errors in each client machine.
The scheduler 110 of the system is configured to split the job of processing the plurality of digital drug capsules with the one or more disease models into multiple jobs based on a set of commands representing one or more perturbed state of the model. The scheduler 110 distributes the logins on the basis including, but not limited to, job priority, number of pending jobs with the one or more computing devices 112, the client machine configuration and the number of available client machines. The scheduler 110 is additionally configured to check the progress of the job at each of the one or more computing devices 112.
The one or more computing devices 112 processes the one or more of digital drug capsules with the one or more disease models and provides an output accordingly to the scheduler 110. The output comprises the effect of the set of drugs on the diseased model. The scheduler 110 transmits the output to the processor 102 by combining matching results from the one or more computing devices 112. Finally the processor 102 analyzes the output to shortlist the therapeutic solutions in context of said disease or disorder or the combination thereof.
The described steps may be implemented as a method or a system using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof. The described steps may be implemented as code maintained in a “non-transitory computer readable medium”, where a processing unit may read and execute the code from the computer readable medium. The processing unit is at least one of a microprocessor and a processor capable of processing and executing the queries. A non-transitory computer readable medium may comprise media such as magnetic storage medium (e.g., hard disk drives, floppy disks, tape, etc.), optical storage (CD-ROMs, DVDs, optical disks, etc.), volatile and non-volatile memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, Flash Memory, firmware, programmable logic, etc.), etc. The non-transitory computer-readable media comprise all computer-readable media except for a transitory. The code implementing the described operations may further be implemented in hardware logic (e.g., an integrated circuit chip, Programmable Gate Array (PGA), Application Specific Integrated Circuit (ASIC), etc.).
The unit model is developed by assimilating data from plurality of information sources on a specific signaling/metabolic pathway, which is diligently used to create meaningful information. The alignment data gathered in parts is integrated logically and correlated and the flow of the events is charted out and the various bio-molecules, metabolites, ions in the signaling or metabolic network are positioned accordingly to develop a comprehensive pathway.
Every event in a pathway is represented as a static map, as illustrated in
For instance, from a literature, the details on different cascades getting activated through a particular pathway is obtained, but, the detailed regulation of each of these cascades might not be covered in the same literature and these regulations will have to be studied separately from data present in various other literatures. The data so gathered in parts is integrated logically to develop a comprehensive pathway. Further, different signaling pathways can, in common, have one or more cascades getting activated. Hence, while developing a static model for a pathway which has common cascades activated as that of various other existing comprehensive models in the library, the common parts present in the existing models are re-used.
In an embodiment, the unit model developed are further used to develop multiple cell systems, with addition and deletion of bio-molecular interactions, events and gene expression specific to the new cell-type based on assimilated data/information.
In another embodiment, bio-molecules, metabolites, ions are represented as a separate entity known as species in the static unit model. Further, the molecules in each state are again defined as a separate species (e.g.: inactive and active forms of a protein are shown as two separate species).
In another embodiment, the various events involved in a pathway include bio-molecular interactions like binding of bio-molecules (e.g.: Ligand receptor binding), sequestration, phosphorylation, activation, inhibition, transport, translocation (unidirectional/bidirectional) of biomolecules, metabolites and ions from one part of the cell to another, transcription, translation, degradation etc. Every single event involved in a pathway is represented as a specific reaction while creating a static unit model.
In another embodiment, it is to be noted that a pathway has species and bio-molecular interactions across different parts of a cell including cytoplasm, nucleus, mitochondria, extracellular space etc. These different parts of the cell are represented in a model as a separate category of entities named compartments. Species can also be transported across different compartments within a cell. All the entities in a model including species, reactions and compartments are named based on pre-defined standards as defined in Tables 3 and 4 below. The data assimilated from all the literatures are captured in the static unit model. The static unit model thus developed logically correlates with the datasets created. The data obtained from literature as shown in
In another embodiment, different species present in the static unit model are assigned with specific concentration values based on the data extracted from the literatures.
In another embodiment, every event in a pathway is represented as a reaction and each reaction is defined by specific kinetic rate law equations. Different equations defined for reactions include those based on Mass-Action Kinetics (reversible/irreversible), Simple Michaelis Menten equations, First-Order Transport or a modified form of these (Refer Table 1). Table 2 further also describes the different kinetic rate law equations defined for specific reactions of Table 1.
In another embodiment, the equation selected and defined in the reactions, is based on the type of biological events and thereby aligns with physiological outcomes (for example, as described with
In another embodiment, the alignment is created either manually or through automated processes which define equations based on the biological interactions defined according to the pre-defined standard as defined in Tables 3 and 4 nomenclatures for reactions.
Table 1: Describes the various biological events and their respective equation types.
Table 2: Describes the different kinetic rate law equations defined for specific reactions.
E. Coli
Mycobacterium
Tuberculosis
<cell type>_<prefix>_<species_name>_<suffix>
<cell type> represents the name of the virtual cell system (refer tables above)
<prefix> any pre-defined prefix (for details, refer table 5 below)
<species_name> can be NCBI Gene name or common name.
<suffix> any pre-defined suffix (for details, refer table 6 below)
Reaction nomenclature Standards:
<Cell Type>_<Substrate name(s)>_<reaction type>
<cell type> represents the name of the virtual cell system (refer tables above).
<Substrate name(s)> names of the substrate involved in the reaction.
<reaction type>, few scenarios mentioned in table 7 below:
In another embodiment, the values for different kinetic parameters are assigned based on the data extracted from literatures. In case of inconsistent or unknown enzyme kinetic parameters, optimization strategy is employed to attain physiological outcome.
In another embodiment, the pathway dynamics are simulated by interconnecting ordinary differential equations describing kinetic behavior of each species in the pathway.
In another embodiment, training the static unit model with alignment dataset is carried out for developing a model which validates or predicts the responses to perturbations given to the model similar to that of the natural physiological outcomes.
In another embodiment, values for the kinetic rate parameters for a reaction are defined and it is carried out for all the reactions in a model in such a way that: flux balances are maintained with the system reaching a steady state, and the model is sensitive enough for a wide range of input values.
In another embodiment, the unit model developed is robust as it does not reach saturation within a very small range of input values, which would make the model less sensitive to respond to different user-defined perturbations.
In another embodiment, a unit model can have multiple pathways or activators activated within the pathways or activators converging at different common points (reactions). Maintaining the contribution of each pathway or activator at a convergent point is crucial and this is carried out logically so as to attain proper alignment with the datasets created from assimilated data. Different tools are developed that aid in defining the parameter values for setting the desired percentage contributions from each activator or pathway towards a common convergent point.
In an embodiment, datasets for alignment and validations are represented in a high level language so as to facilitate its execution on the model. This includes assertive statements which indicate the positive or negative adherence of the alignment and validation datasets on the model. These assertive statements are based on both quality and quantity of the expected trend of the biomarker in focus which is usually based on assimilated data.
In another embodiment, validation of unit model is carried out by running iterative regression of the alignment and validation datasets represented in high level language either manually or through an automated high throughput system, preferably by cloud computing system. While testing the model by running iterative regression, informative reports are generated based on the assertive statements which aids in analysis of the results. The above described processes are depicted in
In another embodiment, analysis of different perturbations like over-expressions, knock-down, knock out, inhibition etc. of different species or reactions before confirming the final parameter values is carried out manually or through different automated tools, preferably by the automated tools. The automated tools are as follows—(a) One that shows the effect of a user-defined perturbation on the model based on color-codes with the intensity of color changing based on the extent to which the species is affected due to the perturbation, and/or (b) One that highlights all the possible paths, such that a signal can flow from one point to the other with details on the shortest and longest path highlighted, when the two points (can be species, reactions etc.) in a model are given as inputs.
In an embodiment, the dynamic unit model developed elicits response to perturbations in a fashion similar to the way the natural biological system would respond.
In another embodiment, development of unit model undergoes a series of checks for ensuring its correctness at every stage. Testing for inherent errors in the model is carried out manually or by automated checks on the model or a combination thereof. Biological networks including flow of the pathway, various links and interactions present in the model, etc. are checked manually. Correctness of the protein function representations in the model including equations defined for depicting each reaction are also checked manually or by using automated scripts or a combination thereof.
In another embodiment, the unit model is checked to ensure the correctness of the protocols used for alignment and validation data-sets mentioned in high level language.
In another embodiment, the reports generated post running iterative regression of alignment and validation datasets represented in the high level language are based on the assertive statements mentioned in the high level language. Thus, a check is always carried out on the assertive statements for biomarkers and the trends defined in the unit model.
In another embodiment, the model is always checked for computational rigidness so as to ensure smooth simulation of the model in the simulator. Further, the model is also checked for adherence to pre-defined standards to ensure compatibility between two models during integration.
In another embodiment, the unit model is perturbed to different condition through a set of commands. The perturbations can be disease setting, overexpression, knock-down, addition of a drug or addition of a function. Apart from these, there are other set of commands to assert the anticipated output from the set of commands.
In another embodiment, set of executable commands are coded based on the assimilated data, which is executed on the model using the computational solver and the results are validated, which helps in developing predictive computational simulation
The computational solver is a simulation engine designed to solve biochemical equations built into the physiological models herein that are transformed mathematically to Ordinary Differential Equations (ODE), in which there is a single independent variable (t) and one or more dependent variables x, (t). Multiple techniques to solve single ODEs as well as system of ODEs are provided by way of the solver.
The cell system model is developed by integrating the one or more developed unit models. Development of a cell type model starts with thorough data mining by understanding cell physiology and functions, key phenotype markers, different stimuli that triggers the cell system, key unit pathways involved in the cell physiology, crosstalk between different unit pathways and details of the outputs of the cell that acts in autocrine or paracrine fashion. To emulate the responses of the cell-type to a specific disease, deciding the right set of triggers specific to the disease based on data assimilated from literatures and defining those in the model becomes crucial. This is carried out by either increasing or decreasing single or multiple stimuli leading to a variation (increase or decrease) in the levels of the bio-markers. The in-silico cell system developed is capable of responding to a multiplicity of disease stimuli.
In an embodiment, the in-silico cell system is developed for white blood cells, bone cells, synovial fibroblast, endothelilia cells and dendritic cells, osteoblast, osteoclast, Keratinocytes, Melanocytes and Langerhans cells and also to other cells involved in inflammation, cells involved in autoimmune disease, and the cells that turn cancerous.
In an embodiment, a cell system model integrates variety of cell signaling pathways which are represented by different unit models to represent intended physiologic phenotypes or functions of the cell. These different pathways converge at different levels of signaling and each of those convergence points will have varied signaling contribution from different pathways.
In an embodiment, thoroughly validated cell-specific unit models which have passed through all the inherent error checks are integrated together to develop a comprehensive cell-system.
In another embodiment, integration of unit models or group of unit models is carried out either manually or by using an automated integration management system, preferably by automated system which automatically identifies the similarity and differences between the two models (intermediate cell type model and unit model to be integrated to it) and flags the anomalies for required action.
In another embodiment, integration of unit model to develop a cell type is carried out by the following methods: (a) Flat method, which involves integrating all pathways present in the same compartment. The model developed using this method appears to be more complex in figuring out the common components between pathways including bio-molecules, metabolites, ions and their interactions, gene expressions etc.; (b) Hierarchy level integration method, which involves integration of each pathway in a sub-compartment which is present in the common compartment. Unlike flat method, the components specific to the unit pathways are present in the sub-compartments, whereas common components between different pathways are present in the common compartment which makes it easy for the user to understand the complex model interactions.
In another embodiment, hierarchy level integration method is carried out to integrate unit models that are developed for specific cell type, which is carried out by the following ways: (a) sequentially integrating the unit models one by one to create the intermediate virtual cell-type and continuing integration of other unit models one by one; and (b) sequentially integrating a group of unit models with the total number of models divided into different groups based on similar functionality, and integrating each group to already existing intermediate virtual cell-type model.
In another embodiment, based on literature, post integrating, the autocrine modules for specific biomarkers expressed in the model is well defined in order to complete feedback regulations.
In another embodiment, according to the data gathered from literatures, various factors are defined in the model based on changes in specific bio-markers or phenotypes or a combination to show the effect on cell proliferation, survival, apoptosis, cell-counts etc.
In another embodiment, integrated models are trained with alignment data gathered specifically for the cell type prior to using the model for validating or predicting cellular response to perturbations made to the system. Optimal parameter values are defined to get the system to steady state and ensure proper sensitivity of the model.
In another embodiment, during integration of the unit models, multiple models will converge at multiple common points. Maintaining the right contributions from each model/pathway at a converging point, based on alignment data, by defining the optimal parameter values is very critical. This is carried out manually or by using different tools similar to unit model development process.
In another embodiment, the alignment & validation datasets used in validating the unit models are integrated in a high level language so as to facilitate its execution in the model. This is further analyzed to check the unit model integrity after it is integrated into the cell type system.
In another embodiment, new cell-type specific alignment and validation data-sets, is checked only in integrated systems and those based on protocols involving simultaneous perturbations of species from different unit models (multi-pathway related studies), are also integrated in the high level language.
In another embodiment, information from Digital drug library 106 and the alignment and validation data-sets for the same, created from assimilated data, are also added in the high level language so as to test the efficacy of a drug on the cell-type model. In another embodiment, the model is tested by running iterative regression of the alignment and validation datasets represented in high level language either manually or through an automated high throughput technology, preferably by cloud computing technology and the informative reports are generated for analysis based on defined assertive statements (based on the assimilated data).
In another embodiment, analysis of different perturbations like over-expressions, knock-down, knock out, inhibition etc. are carried out manually or through different automated tools, preferably by the automated tools. The automated tools are—(a) One that shows the effect of a user-defined perturbation on the cell model based on color-codes with the intensity of color changing based on the extent to which the species is affected due to the perturbation; and/or (b) One that highlights all the possible paths, such that a signal can flow from one point to the other with details on the shortest and longest path highlighted, when the two points (can be species, reactions etc.) in a cell model are given as inputs.
In another embodiment, the cell-type model developed elicits response to perturbations in a fashion similar to the way the natural biological system would respond.
In another embodiment, to ensure correctness and re-usability of the cell-type model developed, the model is passed through a series of checks for inherent errors, including checks on disease trigger definitions, biological networks including inter-pathway interactions, protein function representations, protocols used for alignment and validation data-sets, assertive statements for the biomarkers and the trends, computational rigidness and adherence to pre-defined standards as defined in Tables 3 and 4.
In another embodiment, series of check on the cell type model is carried out to ensure correctness and re-usability of the model and also to ensure compatibility between two cell-type models when they are integrated to develop a comprehensive disease or co-culture model/platform.
The disease model is developed by integrating the one or more cell system models. Development of an in-silico model for any complex diseases selected from a group comprising Rheumatoid Arthritis (RA), cancer, dermatological disorder, inflammatory disorder, autoimmune disease and any combination thereof requires intensive research on the disease patho-physiology. This involves thorough data mining and aggregation of details including the different cell systems involved in the patho-physiology of these diseases and their dominance in various stages of the disease, the set of stimuli that can trigger the onset of the disease, the in-depth details of the different inter-cellular interactions and details on disease specific phenotype markers.
In an embodiment, the in-silico model representing RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or any combination thereof is developed by integrating a group of different comprehensive in-silico cell-type models involved in the disease patho-physiology comprising, white blood cells, bone cells, synovial fibroblast, macrophages, endothelial cells, osteoblast, osteoclast, Keratinocytes, Melanocytes and Langerhans cells and dendritic cells or a combination thereof.
In another embodiment, based on the data assimilated from literatures, a set of disease triggers (increase or decrease of specific stimuli) are decided and defined in the model which results in an increase or decrease of endpoint biomarkers secreted from the thoroughly validated disease specific cell-type models, based on the datasets gathered for the disease. Thus, the integration of cell system models lead to a homeostatic model, in which perturbations by way of disease triggers are simulated to obtain a disease model. The outputs from one cell-type may act as stimuli to the same cell-type (autocrine) or a different cell-type (paracrine) which consists of stimuli receptors connected to a complex network of signal transduction. The details of these inter-cellular interactions involved in a disease of interest are represented in a common compartment for extra-cellular space in the in-silico model, where the different cell-type models with their respective compartments are placed.
In another embodiment, the key disease phenotypic markers inferring different outcomes of RA comprises synovitis, cartilage degradation, bone erosion, pain, angiogenesis etc. are defined to relate the responses elicited by the system to the intensity of the disease.
In another embodiment, the key disease phenotypic markers inferring different outcomes of cancer comprises cell proliferation, cell viability, apoptosis, pro-angiogenesis, anti-angiogenesis and metastasis etc. are defined to relate the responses elicited by the system to the intensity of the disease.
In another embodiment, the key disease phenotypic markers inferring different outcomes of dermatological disorder comprises epidermal barrier function, skin hydration, inflammation etc. are defined to relate the responses elicited by the system to the intensity of the disease.
In another embodiment, in-silico disease model developed in the present disclosure is designed using data from various publications, for example, in the range of 5 to 10 years and built on a backbone of a large number of mathematical equations representing interactions between a large number of biochemical entities.
In another embodiment, integrated model with alignment datasets designed specifically for the disease [RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disorder or a combination thereof] is trained prior to using the integrated disease model for predictions and validations.
In another embodiment, the key converging points between multiple cell types integrated in a disease/co-culture model are majorly at the biomarker expression levels, where the contribution of each cell system towards a particular bio-marker is set on the basis of disease specific alignment datasets. Defining optimal parameter values at these converging points to set the right contributions from each cell type is crucial.
In another embodiment, the kinetic rate parameters are optimized at the inter-cellular interaction points so that the in-silico system elicits responses to the disease triggers similar to that of the natural biological systems.
In another embodiment, a comprehensive dataset specific for a disease of interest such as RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or a combination thereof comprising data from animal studies, humans, clinical trials etc. is extracted and integrated to high level language along with alignment and validation datasets designed for different cell-type models so as to facilitate its execution in the model.
In another embodiment, information from Digital drug library 106 and the alignment and validation datasets specific to the disease of interest such as RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or a combination thereof, developed from assimilated data, are integrated to high level language so as to test the efficacy of drugs on the in-silico model representing the disease.
In another embodiment, validating the model's alignment comprising a huge comprehensive dataset is carried out by using a comprehensive cloud computing technology over a distributed network to perform repeated large-scale high-throughput virtual experiments in an extremely short period of time.
In another embodiment, informative reports are generated based on the assertive statements defined in the high level language based assimilated data.
In another embodiment, analysis of different perturbations like over-expressions, knock-down, knock out, inhibition etc. are carried out manually or through different automated tools, preferably by the automated tools on the disease in-silico model. The automated tools are—(a) One that shows the effect of a user-defined perturbation on the model based on color-codes with the intensity of color changing based on the extent to which the species is affected due to the perturbation; and (b) One that highlights all the possible paths, such that a signal can flow from one point to the other with details on the shortest and longest path highlighted, when the two points (can be species, reactions etc.) in a model are given as inputs.
In another embodiment, in-silico disease model is aligned and validated for results from over 4000 different experimental information obtained from the assimilated data
In another embodiment, the disease in-silico model is subjected to a series of checks for inherent errors in the model including checks on the disease trigger definitions, biological networks including inter-cellular interactions, autocrine and paracrine definitions, protein function representations, protocols used for alignment and validation datasets, assertive statements for the biomarkers and the trends, computational rigidness and adherence to pre-defined standards as defined in Tables 3 and 4.
In another embodiment, the validated in-silico model showing good correlation with validation data collected for Rheumatoid Arthritis or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or any combination thereof serves as the validation dataset over the in-silico model's credibility for predicting drug effects. The in-silico disease/co-culture model is hence used for finding novel therapies for the treatment of the disease of interest, such as those exemplified above.
The digital drug library 106 is a comprehensive electronic repository containing multiple categories of digital drug capsules. A digital drug capsule is an electronic file representing a specific drug and containing intensive information such as—
The mechanism of action (MOA) of the drug, covering information regarding all the known and speculated targets of the drugs.
The pharmacological properties of the drug, such as IC50, Cmax, bioavailability, AUC, Tmax and half-life.
The physical properties of the drug, such as compound structure, molecular formula, molecular weight etc.
The pharmaceutical formulation information of the compounds within the drug.
Information pertaining to approved or safe dosing range of the drug.
The therapeutic category to which the drug belongs, such as and not limited to NSAIDS, DMARDs, Biologics, natural compounds etc.
Conditions for which the drug has been indicated.
Information pertaining to off-target effects, drug interactions and adverse events associated with the drug.
Manufacturer details specific for the drug.
Patent information specific to the drug.
Indication specific alignment information not limited to trends observed for biomarkers, phenotypes, disease scores etc. in experiments performed on patients, animal models, cell-line cultures etc.
In another embodiment, digital drug library 106 for in-silico disease model representing RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or any combination thereof is developed using data from various publications over the past many years, say, for illustrating purpose, in the range of 5 to 10 years and built on a backbone of a large number of mathematical equations representing drug pharmacodynamics, drug pharmacokinetics, drug identifier, drug patent information, drug physicochemical properties and drug development information.
In another embodiment, the digital drug library 106 comprises intensive information about all under-trials and FDA approved drugs specific for RA and also other inflammatory conditions.
In another embodiment, the disease in-silico model ensures that the model will replicate intensively, the known or published effects of different drugs on RA patients or animal models of RA such as Collagen-induced arthritis (CIA), Adjuvant-Induced Arthritis (AIA), Collagen Antibody-Induced Arthritis (CAIA) etc.
In another embodiment, the disease in-silico model ensures that the model replicates intensively, the known or published effects of different drugs on cancer patients or animal models of cancer such as carcinoma, sarcoma, lymphoma, leukemia, adenoma etc. The model also represents the cancers mentioned above with KRAS and non-KRAS mutation.
In another embodiment, the disease in-silico model ensures that the model replicates intensively, the known or published effects of different drugs on dermatological conditions such as atopic dermatitis, Contact dermatitis, Irritant contact dermatitis, psoriasis, Eczema, and Xerosis etc.
In another embodiment, the disease in-silico model ensures that the model replicates intensively, the known or published effects of different drugs on inflammatory conditions such as acne vulgaris, asthma, autoimmune disease, celiac disease, chronic prostatis, glomerulonephritis, hypersensitivities, inflammatory bowel disease, pelvic inflammatory disease, reperfusion injury, rheumatoid arthritis, sarcoisosis, transplant rejection, vasculitis etc.
In another embodiment, the disease model is simulated with the information from the digital drug capsule and the dosing optimized recursively in a process known as drug efficacy characterization to achieve perfect alignment to trends observed in disease specific biomarkers and phenotypes of RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or any combination thereof. Tables 9 and 10 provide a comprehensive list of all associated biomarker for all the phenotypes associated with RA pathogenesis and cancer.
Table 9: Shows various RA phenotypes and their associated biomarkers:
Table 10: Shows various cancer phenotypes and their associated biomarkers:
In an embodiment, the insilico model represents major phenotypes associated with RA pathogenesis such as swollen joints, tender joints, bone destruction, cartilage degradation, pannus formation, pain and synovitis.
In another embodiment, swelling of joints is a major clinical assessment phenotype associated with RA disease. The biomarkers directly and indirectly influencing joint swelling are selected and not limited to a group comprising Tumor Necrosis Factor-alpha (TNF), Interleukin-1 beta (IL1B), Interleukin-6 (IL6), Interleukin-17 (IL17), Interleukin-18 (IL18), chemokine [C-C motif] ligand 2 (CCL2), Vascular Endothelial Growth Factor alpha (VEGFA) etc.
In another embodiment, joint tenderness is a phenotype that is critically observed during the clinical assessment of RA. The biomarkers that influence Joint tenderness are selected from and not limited to a group comprising RANK Ligand (TNFSF11), Matrix Metallopeptidase 9 (MMP9), Cathepsin K (CTSK), Matrix Metallopeptidase 3 (MMP3), C-telopeptide of type 1 collagen (CTX1) etc.
In another embodiment, the biomarkers that influence bone destruction are selected from and not limited to a group comprising RANK Ligand (TNFSF11), Cathepsin K (CTSK), Matrix Metallopeptidase 3 (MMP3), Matrix Metallopeptidase 9 (MMP9), C-telopeptide of type 1 collagen (CTX-1), cross-linked telopeptide of type I collagen (ICTP), N-telopeptide of type 1 collagen (NTX-1), Bone Sialoprotein (IBSP) etc.
In another embodiment, the biomarkers that influence cartilage degradation are selected from and not limited to a group comprising Matrix Metallopeptidase 3 (MMP3), Matrix Metallopeptidase 13 (MMP13), C-telopeptide of type 2 collagen (CTX-2), Cartilage glycoprotein 39 (CHI3L1) etc.
In another embodiment, the biomarkers that influence pannus formation are selected from and not limited to a group comprising Vascular Endothelial Growth Factor-alpha (VEGFA), Fibroblast Growth Factor 2 (FGF2), Angiopoietin 1 (ANGPT1) etc.
In another embodiment, pain associated with the RA disease is directly characterized by biomarkers selected from and not limited to a group comprising Prostaglandin E2 (PGE2), Nerve Growth Factor (NGF), Substance P (TAC1) etc.
In another embodiment, the biomarkers that influence inflammation or synovitis are selected from and not limited to a group comprising C-Reactive Protein (CRP), Erythrocyte Sedimentation Rate (ESR), Tumor Necrosis Factor-alpha (TNF), Interleukin-1 beta (IL1B), Interleukin-6 (IL6), Interleukin-17 (IL17), Interleukin-8 (IL8), chemokine [C-C motif] ligand 2 (CCL2), chemokine [C-C motif] ligand 5 (CCL5), chemokine [C-X-C motif] ligand 13 (CXCL13) etc.
In another embodiment, the disease in-silico model is evaluated on the effect of a therapy based on the phenotypic markers, comprising mathematical scoring functions representing each phenotype. This Phenotype Scoring function involves multiplicity of mathematical operations pertaining to the levels of the associated biomarkers derived from the in-silico RA disease model.
In another embodiment, the scores generated from the mathematical scoring functions are correlated to the published clinically assessed scores stored in the digital drug capsule to ensure perfect alignment of the in-silico model to a particular therapy.
In another embodiment, American College of Rheumatoid (ACR) scoring system and DAS28 scoring system are used to determine the extent of improvement in the RA disease symptoms on treatment with a drug.
In another embodiment, the ACR scoring system involves three main scores: ACR20 score pertaining to 20% improvement in the disease, ACR50 score pertaining to 50% improvement in the disease and ACR70 score pertaining to 70% improvement in the disease. Clinically, the calculation of the ACR score is dependent on inputs from a patient Health Assessment Questionnaire (HAQ) and analysis of the percentage improvement in the RA disease phenotypes.
In another embodiment, the DAS28 scoring system employs a complex formula taking into account tender joints, swollen joints, synovitis, and global health of the biological system as parameters to assess improvement in the disease condition upon administration of a drug or combination of drugs. The formula is as follows: [DAS28=0.56*sqrt(tender joints)+0.28*sqrt(swollen joints)+0.70*Ln(synovitis)+0.014*Global Health]. A DAS28 score of above 5.1 represents a high disease activity and highlights the ineffectiveness of a therapy. A DAS28 score in a range of 5.1 and 3.2 represents medium disease activity on administration of a therapy. A DAS28 score in a range of 3.2 and 2.6 represents a low disease activity on administration of a therapy and a DAS28 score below 2.6 signifies disease remission.
In another embodiment, to evaluate the ACR or DAS28 score for a drug in an in-silico RA disease model, an ACR scoring function referred to as N-ACR and a DAS28 scoring function are designed and formulated to include, a multiplicity of mathematical operations on the phenotype scores generated from the Phenotype Scoring function.
In an embodiment, in silico model represents the major phenotypes associated with cancer such as proliferation, viability, apoptosis, pro-angiogenesis, anti-angiogenesis and metastasis.
In another embodiment, proliferation is a major clinical assessment phenotype associated with cancer. The biomarkers directly and indirectly influencing proliferation are selected and not limited to a group comprising CDK4-CCND1 active complex, CDK2-CCNE active complex, CDK2-CCNA active complex AND CDC2-CCNB1 active complex etc., wherein proliferation is calculated by a formula: Proliferation=average (0.4*CDK4−CCND1+0.2*CDK2−CCNE+0.2*CDK2−CCNA+0.2*CDC2−CCNB1) to determine the improvement in disease on administering a drug or a combination of drug from digital drug library 106.
In another embodiment, apoptosis is a phenotype that is critically observed during the clinical assessment of cancer. The biomarkers that influence apoptosis are selected from and not limited to a group comprising Bcl-2-associated X protein, caspase-3, caspase-8, NOXA/PMA-induced protein 1 and BCL2 interacting protein BIM etc., wherein apoptosis is calculated by a formula:
Apoptosis=average (0.2*BAX+0.2*CASP3+0.2*CASP8+0.2*NOXA+0.2*BIM)
to determine the improvement in disease on administering a drug or a combination of drug from digital drug library 106.
In another embodiment, the biomarkers that influence survival of cells in cancer disease phenotype are selected from and not limited to a group comprising ATK1 kinase, BCL2 protein, MCL1, X-linked inhibitor of apoptosis, Baculoviral IAP repeat containing 2 and Baculoviral IAP repeat containing 5 etc., wherein apoptosis is calculated by a formula:
Apoptosis=average (0.2*BAX+0.2*CASP3+0.2*CASP8+0.2*NOXA+0.2*BIM)
to determine the improvement in disease on administering a drug or a combination of drug from digital drug library 106.
In another embodiment, the biomarkers that influence viability of cancer cells are selected and not limited to a group comprising Bcl-2-associated X protein, caspase-3, caspase-8, NOXA/PMA-induced protein 1, BCL2 interacting protein BIM, ATK1 kinase, BCL2 protein, MCL1, X-linked inhibitor of apoptosis, Baculoviral IAP repeat containing 2 and Baculoviral IAP repeat containing 5 etc.
In another embodiment, the biomarkers that influence Pro-angiogenesis are selected from and not limited to a group comprising interleukin-8 and vascular endothelial growth factor A etc., wherein pro-angiogenesis is calculated by a formula:
Pro-angiogenesis=average (0.5*IL8+0.5*VEGFA)
to determine the improvement in disease on administering a drug or a combination of drug from digital drug library 106.
In another embodiment, the biomarkers that influence anti-angiogenesis are selected from and not limited to a group comprising serpin peptidase inhibitor, Glade F and thrombospondin etc., wherein anti-angiogenesis is calculated by a formula:
Anti-angiogenesis=average (0.5*SERPINF1+0.5*THBS2)
to determine the improvement in disease on administering a drug or a combination of drug from digital drug library 106.
In another embodiment, metastasis of a insilico cancer model is defined by matrix degradation, motility, chemotaxis, and EMT, wherein;
Matrix degradation is calculated by the formula:
Matrix degradation=0.3*MMP2+0.2*MMP9+0.2*MMP1+0.3*PLAU−PLAUR;
Motility is calculated by a formula:
Motility=0.33*RAC1−GTP+0.33*RHOA−GTP+0.33*CDC42−GTP;
Chemotaxis is calculated by the formula
Chemotaxis=100*CXCR4;
EMT is calculated by the formula:
EMT=100*CTNNA1−CTNNB1−CTNND1.
Once all the individual parameter of metastasis is calculated, the overall metastasis rate of a cancer is determined by a formula:
Metastasis=average (0.35*Matrix Degradation+0.35*Motility+0.1*Chemotaxis+0.2*EMT)
to determine the improvement in disease on administering a drug or a combination of drug from digital drug library 106.
The metastasis is defined by considering matrix degradation, motility, chemotaxis, and EMT as parameters, each of which further comprise specific calculations of biomarkers representing the said parameter. For example, matrix degradation takes into account the levels of MMP2, MMP9, MMP1m PLAU-PLAUR biomarkers, whereas chemotaxis takes into account the levels of CXCR4 biomarker.
In another embodiment, angiogenesis of in silico cancer model is defined by factors such as BM degree, endothelial cell motility, endothelial cell proliferation, endothelial cell survival, endothelial cell apoptosis, endothelial cell viability, wherein these factors are calculated in the model by the below mentioned formula in the Table 11.
Further, improvement of the disease condition, i.e. cancer upon administering a drug or a combination of drug from the digital drug library 106 is analyzed by calculating the overall angiogenesis score by a formula:
Angiogenesis=average (0.3*Factor_BM_Deg)+(0.3*Factor_ENC_Motility)+(0.2*Factor_ENC_Proliferation)+(0.2*Factor_ENC_Viability).
The angiogenesis score is obtained by considering Factor_BM_Deg, Factor_ENC_Motility, Factor_ENC_Proliferation and Factor_ENC_Viability as parameters, each of which further comprise specific calculations of biomarkers representing the said parameter. For example, Factor_ENC_Motility takes into account the levels of MMP2 and MMP9 biomarkers, whereas Factor_ENC_Proliferation takes into account the levels of CCND1 and CDKN1A biomarkers.
In another embodiment, overall response rate of tumor co-culture system in cancer model is calculated by a formula:
{((0.25*PROLIFERATION)+(0.25*VIABILITY)+(0.25*ANGIOGENESIS-CO-CULTURE)+(0.25*METASTASIS))}
to determine the improvement in the disease after administering a drug or combination of drug from digital drug library 106.
The overall response rate takes into consideration the angiogenesis score obtained above, along with proliferation, viability and metastasis as parameters, each of which is given equal weightage to arrive at an average overall response rate.
In another embodiment, tumor volume of tumor co-culture system in cancer model is calculated by a formula:
{((0.4*VIABILITY)+(0.4*PROLIFERATION)+(0.2*ANGIOGENESIS-CO-CULTURE))}
to determine the improvement in the disease after administering a drug or combination of drug from digital drug library 106.
The tumor volume takes into consideration the angiogenesis score obtained above, along with proliferation and viability as parameters, wherein each of proliferation and viability are given equal weightage, whereas the angiogenesis score is given half weightage when compared to the other two parameters to arrive at an average tumor volume.
In another embodiment, tumor shrinkage of tumor co-culture system in cancer model is calculated by a formula:
{((0.5*VIABILITY)±(0.5*PROLIFERATION))}
to determine the improvement in the disease after administering a drug or combination of drug from digital drug library 106.
The tumor shrinkage takes into consideration the proliferation and viability as parameters, wherein each of proliferation and viability are given equal weightage, to arrive at average tumor shrinkage.
In another embodiment, tumor burden of tumor co-culture system in cancer model is calculated by a formula:
{((0.2*TUMOR SHRINKAGE)±(0.4*ANGIOGENESIS-CO-CULTURE)+(0.4*METASTASIS))}
to determine the improvement in the disease after administering a drug or combination of drug from digital drug library 106.
The tumor burden takes into consideration the angiogenesis score and tumor shrinkage score obtained above, along with metastasis as parameters, wherein each of angiogenesis score and metastasis are given equal weightage, whereas the tumor shrinkage score is given half weightage when compared to the other two parameters to arrive at average tumor burden.
In another embodiment, the phenotypes representing cancer model mentioned in the previous embodiments are all supported in the single cell system and gives a good insight into the therapeutic effect of a drug, as a measure of these indexes, in terms of reduction in tumor cell proliferation and viability. However, for aspects of measuring tumor burden, angiogenic factors need to be accounted which is covered better in a co-culture system wherein, the overall angiogenic potential of the tumor, mainly a property of the endothelial cells and other neighboring cells like macrophages and fibroblast which support tumor cell survival, gets factored in the overall tumor burden index
In another embodiment, the cancer in-silico models are evaluated on the effect of a therapy based on the phenotypic markers, comprising mathematical scoring functions representing each phenotype. This Phenotype Scoring function involves multiplicity of mathematical operations pertaining to the levels of the associated biomarkers derived from the in-silico model.
The basic idea behind designing a novel therapeutic solution would find its roots in the fact that most single drug therapies are either in-efficacious or efficacious but have associated toxicities.
In an embodiment, designing an effective combination therapy involves bringing together two or more drugs at varying concentrations (concentrations lower than the approved concentration in the pharmacopeia) for administration either as a single formulation or as separate therapies that can be administered concomitantly or sequentially, such that there is a synergistic effect in the overall efficacy of the participating drugs. The design of a combination therapy involves the following-Combining multiple in-efficacious drugs to increase the overall efficacy. Combining multiple efficacious but toxic drugs in lower doses to increase efficacy but negate acute toxicities.
Combining in-efficacious drugs with lower doses of efficacious drugs to obtain synergistic efficacy and negate acute toxicities.
In another embodiment, designing and testing a combination therapy in the in-silico disease/co-culture model representing RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or any combination thereof, involves selecting digital drug capsules from the library using either an exclusive or a non-exclusive selection process and the information contained in the capsules are combined exhaustively using a mixer tool in to a digital combination drug capsule. This combination capsules are simulated in the model to calculate the therapeutic benefits.
In another embodiment, the digital capsules from a digital drug library 106 are combined in a plurality of ways such as—
‘x’-degree exhaustive combination of ‘v’ capsules in full dose (c dose) with c dose of ‘y−1’ other capsules present in the library. Here y′ represents the number of capsules present in the library and ‘x’ is any non-zero whole number such that ‘x-degree’ represents the number of drugs present in the combination, for example, a 2-degree combination would represent a combination of 2 drugs.
In another embodiment, the combination of digital capsules in a system of the instant invention are exemplified by the following ways but not limited to the same, wherein A, B and C represent individual drug molecules used to treat one or more disease/disorder which is being described in the instant invention, for the ease of representation the drug molecules have been designated as A, B and C—
The library with 3 capsules say A, B and C, then a 2-degree exhaustive combination of full dose (c) is designed to yield the following combinations:
‘x’-Degree Combination of ‘y’ Capsules in c/n Dose with c/n Dose of ‘y−1’ Other Capsules Present in the Library. Here n is any Non-Zero Whole Number and c/n Represents a Fractional Dose.
The library with 3 capsules say A, B and C, then a 3-degree exhaustive combination of full dose (c) and half dose (c/2) is designed to yield the following combinations:
‘x’-Degree Combination of ‘y-m’ Capsules in c Dose with c Dose of ‘m’ Selected Anchor Capsules Present in the Library.
The library with 3 capsules say A, B and C, then a 2-degree exhaustive combination of full dose (c) of A as anchor with full dose (c) of B and C is designed to yield the following combinations:
‘x’-Degree Combination of ‘y-m’ Capsules in c/n Dose with c Dose of ‘m’ Selected Anchor Capsules Present in the Library.
The library with 3 capsules say A, B and C, then a 2-degree exhaustive combination of full dose (c) of A as anchor with half dose (c/2) of B and C is designed to yield the following combinations
‘x’-Degree Combination of Capsules in c/n Dose with c/n Dose of ‘m’ Selected Anchor Capsules Present in the Library.
The library with 3 capsules say A, B and C, then a 2-degree exhaustive combination of half dose (c/2) of A as anchor with full dose (c) and half dose (c/2) of B and C is designed to yield the following combinations
In another embodiment, the combinations of the digital capsules are stored automatically in a digital combinatorial drug library and using a high-throughput cloud computing technology simulated in the in-silico disease model representing RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or any combination thereof.
In another embodiment, filtering of efficacious combination of digital capsule is carried out by a data extraction and reporting tool (DERT), which automatically retrieves preset biomarker level values pertaining to control, disease and drug simulation time points and calculates the percentage change in the defined biomarker.
In another embodiment, the percentage change for the biomarkers is calculated by the following—
percentage change in the levels of the biomarker with drug administration compared to its levels in the diseased state; and/or
percentage change in the levels of the biomarkers with drug administration compared to its levels in both the diseased and control states.
In another embodiment, the percentage change values of all the biomarkers for all the capsule combinations simulated in the model are recorded in data extraction and reporting tool (DERT) to calculate the phenotype scores using phenotype scoring function.
In another embodiment, drug efficacy scores, mainly ACR or DAS28 scores, are calculated by data extraction and reporting tool (DERT) using the biomarker scores and the phenotype scores as inputs. These scores are also used to calculate synergy values for the combinations. Synergy value in this context refers to the percentage improvement, observed in the efficacy scores with combination drug therapy with respect to the sum of efficacy scores observed with the individual, constituent drug therapies. The calculated values are provided to an electronic file which is referred to here as a drug efficacy report (DER).
In another embodiment, Drug Efficacy Report (DER) is subjected to multi-level filtering using automated or manual techniques to identify viable drug combinations, that show good efficacy, less associated toxicities, good pharmacological affinities and formulation compatibility.
In another embodiment, the filtering process of combination of drugs or individual drugs at lower concentration than the approved value, for their efficacy towards the improvement of RA condition mainly involves the following strategies—
Level 1-filtering of the combinations on the basis of drug efficacy, which involves:
Filtering on the basis of ACR scores;
Filtering on the basis of DAS28 scores;
Filtering on the basis of synergy scores;
Filtering on the basis of phenotype scores;
Filtering on the basis of biomarker scores; or
Any degree of combination of the above strategies.
Level 2-filtering of the combinations on the basis of drug properties, which involves:
Filtering on the basis of logical drug target preference;
Filtering on the basis of logical affinities in drug pharmacological properties;
Filtering on the basis of multi-drug formulation compatibility;
Filtering on the basis of associated side effects;
Filtering on the basis of novelty of the combination, involving and not limited to, the following:
Filtering on the basis of non-obviousness of the target set; or
Filtering on the basis of existing prior art pertaining to the combination;
Filtering on the basis of ease of procurement and patent restrictions on the compounds; or
Any degree of combination of the above strategies.
In another embodiment, the filtering process of combination of drugs or individual drugs at lower concentration than the approved value, for their efficacy towards the improvement of cancer condition mainly involves the following strategies—
Level 1-filtering of the combinations on the basis of drug efficacy, which involves:
Filtering on the basis of synergy scores;
Filtering on the basis of phenotype scores such as proliferation, viability, survival, pro and anti-angiogenesis;
Filtering on the basis of biomarker scores; or
Any degree of combination of the above strategies.
Level 2-filtering of the combinations on the basis of drug properties, which involves:
Filtering on the basis of logical drug target preference;
Filtering on the basis of logical affinities in drug pharmacological properties;
Filtering on the basis of multi-drug formulation compatibility;
Filtering on the basis of associated side effects;
Filtering on the basis of novelty of the combination, involving and not limited to, the following:
Filtering on the basis of non-obviousness of the target set; or
Filtering on the basis of existing prior art pertaining to the combination;
Filtering on the basis of ease of procurement and patent restrictions on the compounds; or
Any degree of combination of the above strategies.
In another embodiment, the filtering process of combination of drugs or individual drugs at lower concentration than the approved value, for their efficacy towards the improvement of dermatological disorder condition mainly involves the following strategies—
Level 1-filtering of the combinations on the basis of drug efficacy, which involves:
Filtering on the basis of synergy scores;
Filtering on the basis of phenotype scores such as epidermal barrier/skin barrier, skin
hydration and inflammation etc;
Filtering on the basis of biomarker scores; or
Any degree of combination of the above strategies.
Level 2-filtering of the combinations on the basis of drug properties, which involves:
Filtering on the basis of logical drug target preference;
Filtering on the basis of logical affinities in drug pharmacological properties;
Filtering on the basis of multi-drug formulation compatibility;
Filtering on the basis of associated side effects;
Filtering on the basis of novelty of the combination, involving and not limited to:
Filtering on the basis of non-obviousness of the target set; or
Filtering on the basis of existing prior art pertaining to the combination;
Filtering on the basis of ease of procurement and patent restrictions on the compounds; or
Any degree of combination of the above strategies.
In another embodiment, upon arriving at the desired set of combinations, the individual compounds are procured, formulated and tested pre-clinically in animal models to validate the predictive data.
In another embodiment, the validated in-silico disease model/co-culture model representing RA or cancer or dermatological disorder or inflammatory disorder or autoimmune disease or any combination thereof is used as a powerful platform upon which new and effective combination therapies is designed to achieve remission from various diseases.
Thus in brief, the present disclosure relates to a system for obtaining a therapeutic solution for treatment of a disease, wherein said system comprises database, digital drug library and processor. The database within the system comprises unit models configured by collating information on parameters of biological system from plurality of information sources. These unit models further integrate to form cell system models which facilitates simulation of at least one of the biological system or homeostatic state of the biological system. The unit and cell system models then come together to arrive at a homeostatic model, which when perturbed provides a user with a model which depict the conditions of a disturbed or diseased biological system, known as disease model.
The digital drug library of the instant system is formed by combining information on plurality of categories of digital drug capsules, whereas the processor is communicatively connected to the database and the digital drug library. The processor, based on a predefined cost function, therefore transmits one of the one or more disease model along with a set of drugs from the digital drug library to a scheduler. The scheduler distributes one or more of the plurality of digital drug capsules with the one or more disease models to one or more computing devices. The scheduler also is connected to the one or more computing devices through a network. The processor then receives an output comprising the effect of the set of drugs on the disease model from the scheduler, wherein the scheduler transmits the output to the processor by combining matching results from the one or more computing devices. The processor thereafter analyzes the output to obtain the therapeutic solution in context of said disease.
The present disclosure is not to be limited in terms of the particular embodiments described in this application, which are intended as illustrations of various aspects. Many modifications and variations can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. Functionally equivalent methods and devices within the scope of the disclosure, in addition to those enumerated herein, will be apparent to those skilled in the art from the foregoing descriptions. Such modifications and variations are intended to fall within the scope of the appended claims. The present disclosure is to be limited only by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting.
With respect to the use of substantially any plural and/or singular terms herein, those having skill in the art can translate from the plural to the singular and/or from the singular to the plural as is appropriate to the context and/or application. The various singular/plural permutations may be expressly set forth herein for sake of clarity.
Any discussion of documents, acts, materials, devices, articles and the like that has been included in this specification is solely for the purpose of providing a context for the disclosure. It is not to be taken as an admission that any or all of these matters form a part of the prior art base or were common general knowledge in the field relevant to the disclosure as it existed anywhere before the priority date of this application.
In addition, where features or aspects of the disclosure are described in terms of Markush groups, those skilled in the art will recognize that the disclosure is also thereby described in terms of any individual member or subgroup of members of the Markush group.
While various aspects and embodiments have been disclosed herein, other aspects and embodiments will be apparent to those skilled in the art. The various aspects and embodiments disclosed herein are for purposes of illustration and are not intended to be limiting, with the true scope and spirit being indicated by the following claims.
The disclosed configurations are further illustrated by the following examples. The following examples are provided for illustrative purposes only and are not intended to limit the scope of the disclosure.
Rheumatoid arthritis (RA) is complex autoimmune disease affecting mainly joints; wherein the disease involves multiple cell interactions through various inter-cellular mediators like cytokines, chemokines, etc. The main clinical manifestations of the disease are inflammation, synovial hyperplasia, endothelial dysfunction, cartilage degradation and bone degradation. These clinical features can be linked to abnormal activity of various cells. The local or systemic inflammation can be linked to abnormal activation of immune cells like Dendritic cells, T lymphocytes, B lymphocytes and Macrophages, Neutrophils, whereas the synovial hyperplasia is due to abnormal proliferation of Fibroblasts like Synoviocytes.
These abnormal activities of the cells in turn cause alterations in the expression levels of the inter-cellular mediators causing the disease cascade to increase severity from early phase to late phase of the disease.
Some key cytokines produced from inflammatory cells & Fibroblast like Synoviocytes are IL1B (Interleukin 1-beta), TNFA (Tumor Necrosis Factor-alpha), IL6 (Interleukin 6), IL18 (Interleukin 18), IL17 (Interleukin 17), GMCSF (Granulocyte Colony Stimulating Factor), VEGF (Vascular Endothelial Growth Factor), EGF (Epidermal Growth Factor) and CC & CXC chemokines.
Similarly Chondrocyte, Osteoclasts produce matrix degradation enzymes like MMPs (Metalloproteinase), CTSK (Cathepsin K) & RANKL causing cartilage and bone damage.
These inter-cellular mediators exert their specific functions in each target cell through cell signaling or signal transduction pathways through specific receptors which involve activation of respective adapter proteins, accessory proteins, kinases, transcription factors and leading to specific gene expression pattern. Multiple such cell signaling pathways converge synergistically or antagonistically to represent different phenotypes of the cell.
Therefore for the development of in-silico disease model for rheumatoid arthritis (RA), considering the above mentioned understandings on RA disease pathophysiology; a bottom-up block development method have been adopted, which involves development of multiple unit models depicting each important intra-cellular signal transduction pathways for specific cell systems,
these unit models developed for specific cell systems are strategically integrated to form a cell system model which will be able to predict cell phenotypes & biomarkers expression based on different trigger sets or disease triggers,
Such different cell system models based on their relevance in the disease focus are integrated considering their cell to cell interactions and disease phenotypes to form a robust disease model for rheumatoid arthritis (RA).
For the development of unit model representing Rheumatoid arthritis (RA), data is assimilated from literature on a specific signaling/metabolic pathway. The instant example illustrates the development of a unit model representing Interleukin 6 (IL6) signaling in the Cell system model “T Lymphocyte” (TCELL) with a disease focus of “Rheumatoid Arthritis” (RA). Here, the example describes the development of unit model representing interleukin 6 (IL6), as IL6 is one of the key markers in the disease condition of RA. The below Tables 12 and 13 exemplify the literatures that are referred to assimilate data regarding the role of IL6 in RA and the inference obtained. Further, Tables 14, 15, 16, 17 and 18 illustrate on how the data assimilated from the literatures on IL6 shows a biological relationship with STAT3 signaling and how the relationship is represented by a specific kinetic rate law equations.
Based on the information/data made available in the Tables 14, 15, 16, 17 and 18 static representation of IL6-STAT3 signaling is dynamically simulated (refer FIG. 11)—by evaluating the specific kinetic rate law equations with a computational solver to predict the dynamicity of the different cellular interactions for a given simulation time.
The
As shown above, these kinetics rate parameter values are optimized recursively to get the preferred steady state conditions which are defined based on model validations i.e. alignment datasets.
Similarly each biological relationship can be modeled among different ligands, receptors, adapter proteins, kinases, transcription factors, gene transcriptional regulations, translation, post-translation modifications, and gene expressions with respect to the phenotype or biomarker of interest in a specific cell system.
The focus above in developing unit model is on the role of IL6 in T-cell, which is exploited later to study their implications in RA model upon administering individual drug or a combination of drugs. On similar lines various biomarkers (as illustrated in Table 19) are studied individually from various literature to analyze their biological relationship with the corresponding biomarker in a respective signaling/metabolic pathway in T-lymphocyte and thereupon the specific kinetic rate law equations based on the relationship is developed. The below table-19 illustrates the key steps in cell system model development for T-lymphocyte with a disease focus of rheumatoid arthritis.
In-silico disease model for RA is developed by aggregating details from different cell systems involved in the pathophysiology of RA and their dominance in various stages of the disease, the set of stimuli that will trigger the onset of the disease, the in-depth details of the different inter-cellular interactions and details on disease specific phenotype markers.
The different cells within the cell system represented herein provides for the following types of cells:
Dendritic Cells (DC) functions mainly as APC (Antigen Presenting Cell) and activation of T lymphocytes;
B-Lymphocytes (BCELL) which are also responsible for antigen recognition and production of specific antibodies;
T-Lymphocytes (TCELL) (CD4+ Cells) are important decision making cells which directly controls the activation of macrophage and B lymphocytes;
Macrophages (MCP) which are responsible for responding to antigens or a bacterial insult and thereby invoking a profound inflammatory response;
Fibroblast (FB) which synthesizes the extracellular matrix and collagen;
Endothelial cell (ENC) which is responsible for angiogenesis in RA synovium.
Bone remodeling system involving the cells Osteoclast (OSC) and Osteoblast (OSB) are selected to highlight their effect on the bone in response to the inflammatory cytokines.
Each of the above mentioned cell systems are converged by careful analysis of literatures describing the pathophysiology of the disease and relating the function or phenotypes of these cell-systems with the phenotypes of the disease.
Once the individual cell system which is involved in the development of RA is analyzed and the biological relationship of the biomarkers with all the cells involved is established, the specific expression and signaling pattern across these different cell systems is analyzed in order to arrive at specific kinetic rate law equations. The below Table 20 represents the specific expression and signaling patterns:
Further, the instant example describes the perturbation which is applied to the cell system model/co-culture model to develop a perturbed model replicating the condition of RA in a cell or in a co-culture model. For illustrative purposes, the example describes the perturbation of IL6 to describe the overexpression of STAT3 in T-cell to replicate a condition similar to biological system with RA condition.
These set of commands represent the highlighted experimental protocol (as provided in Table 21) obtained from the literature which provides a study on overexpression of Interleukin 6 (IL6) in a T lymphocyte (TCELL) cell system and asserting its effect on STAT3 phosphorylation.
The validations are analyzed based on the assertion report generated at the end of the validation which defines the dynamism of the generated perturbed RA co-culture model/disease model. For illustrative purposes, find below the assertion report of IL6 perturbation.
There is no false assertion.
True assertion is:
The developed perturbed co-culture model upon validation as provided above, is simulated with various combinations of drugs. The drugs which are selected to arrive at the combination is based on the predictive disease activity score which shows high correlation with various RA drug's published clinical trial data like ACR disease remission score. Table 23 illustrates the predictive activity score of various approved drugs. Further, Table 24 illustrates the effect of individual drug on the specific phenotype in relation to the specific biomarker.
Based on the predictive score, various combinations of drugs are tested from the digital drug library 106 to analyze the efficacy of the combination on RA disease model. The effect of the drug combinations in terms of predictive disease activity score and key RA disease phenotypes sorted based on level of synergy is also shown in below Table 25. In an embodiment, the effect of drug combinations is displayed on a user interface 108 as illustrated in
The drugs within the digital drug library 106 are represented by proprietary designations, such as CWxxx, wherein xxx represents a combination of numbers, depending on the user defined criteria.
The combinations are tested through designed cost functions of concentrations, efficacy, low toxicity and PKPD compatibility of the drugs and compounds, in order to optimize the selectivity and to arrive at the most efficient therapeutic solution, based on which the combinations for specific disease profiles and indications are identified.
Cost functions such as the following are designed for the optimization of the process:
Inclusion of small molecule drugs;
Exclusion of drugs that are approved for Rheumatoid Arthritis;
Exclusion of drugs that have Specific Adverse reactions, Toxicity;
Inclusion of drugs which will be off-patent in near future;
Also other criteria like 2-drug or 3-drug combination is given.
Based on these cost function, the matching drugs are selected from the drug library and combined exclusively and the set of commands for the execution of predictive computational simulation studies are automatically generated and execute through a high through-put systems using cloud computing architecture. These cost functions are programmable based on the fields present in digital drug library.
The results from the above process are used for the analysis of the likely outcome.
The following cost function is designed for the analytical purposes:
The novel therapeutic solution should be efficacious on the predictive RA disease score i.e more than 70%
The novel therapeutic solution should be efficacious on the predictive RA disease phenotypes i.e more than 70%
Also if the novel therapeutic solution is a combination, then the composition should be synergistic at least by >5%
Based on the above mentioned cost functions, the output result is represented as shown below. The effect of the drug combinations in terms of Predictive disease activity score & key RA disease phenotypes sorted based on level of synergy is also shown in below table.
The best combination drug from Table 25 based on the cost function is taken forward for further analysis. Below Table 26, illustrates the percentage change in predictive disease activity score and synergy of the chosen drug combination on RA disease model.
The effect of the drug combination in terms of key RA disease phenotypes associated biomarker is illustrated in below Table 27.
From table 27, it is inferred that the combination comprising CW299, CW305 and CW302 showed highest synergy of 25.62% on improving the RA condition on the developed RA model. Further, this combination of drug is further analyzed to study the effect of the drug combination in terms of key RA disease phenotypes, associated biomarkers, and predictive disease activity score with Marker level synergy, which is illustrate in Table 28 below:
Thus by using these cost functions, a 3-drug combination of CW299, CW305 and CW302 is arrived at for treatment of RA. The efficacy of this combination has been validated using Murine Collagen Induced Arthritis Animal Model, which proved the predicted outcome.
For the development of unit model, the data assimilated from literature on a specific signaling/metabolic pathway, which is diligently used for the creation of a static unit model following the same methodology and strategy as explained for the IL6 unit model in the RA example above.
The development of a cell type model mainly involves understanding cell systems, key phenotype and related markers, different stimuli that trigger the cell system to turn cancerous, key unit pathways involved in the cell physiology, crosstalk between different unit pathways and details of the outputs of the cell that acts in autocrine or paracrine fashion.
The below Table 29 illustrates the cell system model development for tumor cell in an in-silico cancer model:
The markers defined in the above table is converged diligently at different levels to yield a cell system model depicting various signaling/metabolic pathway and each of these convergence points will have varied signaling contribution from different pathways.
Development of an in-silico model for Cancer requires aggregation of details including the different cell systems involved in the patho-physiology of Cancer and their dominance in various stages of the disease, the set of stimuli that triggers the onset of the disease, the in-depth details of the different inter-cellular interactions and details on disease specific phenotype markers. For a predictive computational cancer co-culture disease model designed to represent tumor microenvironment the following cell systems are integrated:
Macrophages which are responsible for synthesizing cytokines and chemokines that help support metastasis;
Fibroblast which synthesizes the extracellular matrix and collagen; and
Endothelial cell which is responsible for angiogenesis
The various inter-cellular interactions between the cell systems are charted out based on the specific input and output of all these cell system models and their biological relationships are represented as specific kinetic rate law equations.
The specific kinetic rate law equations are evaluated with a computational solver to predict the dynamicity of the different cellular interactions for a given simulation time.
The predictive cancer disease model is set for various stages of disease by selective perturbation of trigger or stimulus. Also different disease profiles or mutation profiles is created to evaluate the efficacy of therapy in different disease patient populations.
Various predictive computational simulation studies experiments are coded using set of executable commands. All these different set of predictive computational simulation studies are executed in a parallel mode, through a high through-put system using cloud computing architecture.
The model can be perturbed to different conditions through the set of these commands. The perturbations can be disease setting, an overexpression, knock-down, addition of a drug, addition of a function. Therefore for any predictive computational simulation studies of the model, a set of executable commands can be coded, which can be executed on the model using the computational solver and the results can be validated.
An automated screening of all the output data from the predictive computational simulation studies executed through high through-put systems is carried out and relevant reports are generated for further analysis.
Correlation of the predictive disease activity score
The predictive disease activity score showed high correlation with various Cancer drug's published data like impact on proliferation and viability. The below table 30 shows the correlation details:
Based on the predictive data, various combinations of drugs are tested from the digital drug library 106 to analyze the efficacy of the combination on cancer disease model.
The drugs within the digital drug library 106 is represented by proprietary designations, such as CWxxx, wherein xxx represents a combination of numbers, depending on the user defined criteria.
The combinations are tested through designed cost functions of concentrations, efficacy, low toxicity and PKPD compatibility of the drugs and compounds, in order to optimize the selectivity and to arrive at the most efficient therapeutic solution, based on which the combinations for specific disease profiles and indications are identified.
Cost functions such as the following are designed for the optimization of the process:
Inclusion of small molecule drugs
Inclusion of drugs that show good efficacy as anti-cancer agents
Exclusion of drugs with existing prior art
Exclusion of drugs with saturating effect of the single agent
Exclusion of drugs that have Specific Adverse reactions, Toxicity
Inclusion of drugs which will be off-patent in near future
Also other criteria like 2-drug or 3-drug combination is given
Based on these cost function, the matching drugs are selected from the drug library and combined exclusively and the set of commands for the execution of predictive computational simulation studies are automatically generated and execute through a high through-put systems using cloud computing architecture. These cost functions are programmable based on the fields present in digital drug library.
The results from the above process are used by the analytical engine. The following cost function is designed for the analytical purposes:
The novel therapeutic solution should be efficacious on the predictive Cancer disease phenotypes i.e., more than 50%
Also if the novel therapeutic solution is a combination, then the composition should be synergistic or show an enhancement over the individual effects
Based on the above mentioned cost functions, the output result is represented as shown below. The effect of the drug combinations in terms of key cancer disease phenotypes sorted based on level of synergy along with the inference is shown in below Tables 31 and 32:
Thus by using these cost functions, a 2-drug combination of drug comprising CW229 and CW178 is arrived at for treatment of cancer. Based on the cost function, the combination of drug comprising CW229 and CW178 showed enhanced efficacy with percentage decrease in the proliferation by 89.91% and percentage decrease in the viability by 93.95%.
Number | Date | Country | Kind |
---|---|---|---|
4718/CHE/2012 | Nov 2012 | IN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2013/060011 | 11/8/2013 | WO | 00 |