This disclosure relates generally to systems and methods for classifying the status of a transplant.
Transplantation of cells, tissues, partial or whole organs are life-saving medical procedures in cases where an individual experiences acute organ failure or suffers from some malignancy. Many organs including, but not limited to, heart, kidney, liver, lung, and pancreas can be successfully transplanted, and one of the most common types of organ transplantations performed nowadays is kidney transplantation.
Upon transplantation of non-self (allogeneic) cells, tissues, or organs (allograft) into a recipient, the transplant recipient's immune system recognizes the allograft to be foreign to the body and activates various mechanisms to reject the allograft. Thus, it is necessary to medically suppress such an immune response to minimize the risk of transplant rejection. After transplantation, the status of the transplant may be monitored by a variety of clinical laboratory diagnostics tests including histopathologic assessment of transplant biopsy tissue. The status may be monitored to guide clinical care and immunosuppressive treatment options. Although a histopathological evaluation (e.g., a biopsy) is the current standard for diagnosis of rejection, improving its diagnostic accuracy for determining and monitoring the status of a transplant, such as an organ transplant, is critical due to the invasive nature of the procedure and the associated risk to the transplant, potential sampling error, and subjective nature of histopathological interpretation.
What is needed are systems and methods for classifying and monitoring the status of a transplant, such as an organ transplant, with improved diagnostic accuracy, as provided by the present disclosure.
A method for classifying a status of a transplant is disclosed. The method comprises: receiving expression levels of a plurality of genes from a biological sample of a transplant recipient; receiving a plurality of sets of weights for the plurality of genes; generating one or more probability rejection scores of one or more rejection labels based on the plurality of sets of weights and the expression levels; and assigning a predictive rejection classification of the biological sample of the transplant recipient based on the one or more probability rejection scores, wherein the predictive rejection classification classifies the status of the transplant. In some embodiments, at least one of the plurality of genes is associated with one or more of: immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, or transcription regulation. In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing antibody-mediated rejection (ABMR), T-cell mediated rejection (TCMR), mixed ABMR+TCMR, or no rejection. In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing antibody-mediated rejection (ABMR). In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing T-cell mediated rejection (TCMR). In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing mixed ABMR+TCMR rejection. In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing no rejection. In some embodiments, generating one or more probability rejection scores of one or more rejection labels comprises: for each rejection label of a plurality of rejection labels, generating a probability rejection score based on the plurality of sets of weights and the expression levels. In some embodiments, each set of weights comprises a weight for a corresponding rejection label. In some embodiments, assigning a predictive rejection classification of the biological sample of the transplant recipient comprises assigning the rejection label having the highest probability rejection score amongst the plurality of rejection labels as the predictive rejection classification. In some embodiments, the plurality of sets of weights for the plurality of genes is from a machine-learning model trained to: receive a discovery dataset from biological samples of a discovery cohort of transplant recipients, wherein the discovery dataset comprises gene expression levels of a plurality of genes and rejection classifications; analyze the gene expression levels of the discovery dataset for associations with the rejection classifications in the discovery dataset; identify a subset of genes from the plurality of genes of the discovery dataset; and generate the plurality of sets of weights for the subset of genes based on the associations between the gene expression levels of the discovery dataset and the rejection classifications of the discovery dataset, wherein each set of weights is associated with one gene of the subset of genes. In some embodiments, the gene expression levels are analyzed by analyzing nucleic acids from the biological samples of the discovery cohort. In some embodiments, the gene expression levels are analyzed by analyzing RNA from the biological samples of the discovery cohort. In some embodiments, at least some of the rejection classifications of the discovery dataset comprise antibody-mediated rejection (ABMR). In some embodiments, at least some of the rejection classifications of the discovery dataset comprise T-cell mediated rejection (TCMR). In some embodiments, at least some of the rejection classifications of the discovery dataset comprise mixed ABMR+TCMR rejection. In some embodiments, at least some of the rejection classifications of the discovery dataset comprise no rejection. In some embodiments, the expression levels of one or more genes from the plurality of genes of the discovery dataset are normalized relative to gene expression levels of one or more reference genes. In some embodiments, the machine-learning model was validated by: acquiring a validation dataset from biological samples of a validation cohort of transplant recipients, wherein the validation dataset comprises gene expression levels for a plurality of genes and rejection classifications; determining one or more computer-determined predictive rejection classifications from the validation dataset; comparing one or more of the rejection classifications in the validation dataset and the one or more computer-determined predictive rejection classifications; and determining a diagnosis accuracy based on the comparison, wherein the diagnosis accuracy is greater than a predetermined value. In some embodiments, the predetermined value is 60 percent, 70 percent, 80 percent, or 90 percent. In some embodiments, at least one gene of the plurality of genes comprises a gene identified from a group consisting of KIR_Inhibiting_Subgroup_1, IL7R, KLRK1, BK large T Ag, PLA1A, LGALS3, HLA-F, SMAD3, HLA-C, SH2D1B, CXCL11, GBP4, SFTPC, SOST, AGT, HSPA12B, NCAM1, NCR1, ITGA4, LCN2, HLA-DPB1, XCL1/2, BK VP1, COL4A1, ARG2, MCM6, CD59, CD69, SMARCA4, IL18, CMV UL83, SIGIRR, KIT, CD160, SERPINE1, TFRC, CCR7, HLA-B, CXCL8, AQP2, SOD2, SFTPB, HLA-DQA1, IFI6, HFE, MAPK12, GDF15, IFIT1, KLRF1, SERINC5, FOXP3, BCL2L1, FABP1, CCL21, LOX, ROBO4, MYBL1, AGR3, CXCR6, CXCL13, FCER1A, BTG2, CTLA4, CASP3, SPRY4, RAF1, MAPK13, IGF2R, RHOU, LYVE1, CD80, KAAG1, CCL18, EHD3, IL1RL1, CRIP2, TNFSF9, CDH5, CD8B, PRDM1, SIRPG, ABCA1, ADORA2A, RASSF9, JUN, COL4A4, TRAF4, PIN1, SOX7, CFB, CFH, SFTPD, THBS1, AIRE, RAMP3, IL1R2, GNG11, RAPGEF5, DEFB1, GNLY, PHEX, ENG, BMP7, RELA, COL1A1, PLAAT4, CD81, ICAM2, PLAT, CD40LG, NPHS2, IL33, CD58, TIPARP, TNC, PECAM1, C5, EGFR, CD2, BMP2, CTNNB1, MYB, CRHBP, MT2A, EEF1A1, BCL2, SLC19A3, VMP1, PSEN1, MAPK3, TFF3, TNFSF4, CD55, PDPN, IL17RB, IGHG2, CXCL12, CD207, MICA, MMP9, EOMES, EPO, NOS3, KLF2, KLF4, SLC4A1, P2RX4, CCL3/L1, and HPRT1. In some embodiments, the transplant recipient received a transplant comprising one or more of: a kidney transplant, a heart transplant, a lung transplant, a pancreas transplant, a liver transplant, an intestinal transplant, or a vascularized composite allograft transplant. In some embodiments, the transplant recipient received a transplant that is an allograft or a xenograft. In some embodiments, the biological sample is an organ tissue sample. In some embodiments, a step of administering an immunosuppressive treatment.
A kit for classifying the status of a transplant is disclosed. The kit may comprise: one or more probesets specific for one or more genes identified from a group consisting of KIR_Inhibiting_Subgroup_1, IL7R, KLRK1, BK large T Ag, PLA1A, LGALS3, HLA-F, SMAD3, HLA-C, SH2D1B, CXCL11, GBP4, SFTPC, SOST, AGT, HSPA12B, NCAM1, NCR1, ITGA4, LCN2, HLA-DPB1, XCL1/2, BK VP1, COL4A1, ARG2, MCM6, CD59, CD69, SMARCA4, IL18, CMV UL83, SIGIRR, KIT, CD160, SERPINE1, TFRC, CCR7, HLA-B, CXCL8, AQP2, SOD2, SFTPB, HLA-DQA1, IFI6, HFE, MAPK12, GDF15, IFIT1, KLRF1, SERINC5, FOXP3, BCL2L1, FABP1, CCL21, LOX, ROBO4, MYBL1, AGR3, CXCR6, CXCL13, FCER1A, BTG2, CTLA4, CASP3, SPRY4, RAF1, MAPK13, IGF2R, RHOU, LYVE1, CD80, KAAG1, CCL18, EHD3, IL1RL1, CRIP2, TNFSF9, CDH5, CD8B, PRDM1, SIRPG, ABCA1, ADORA2A, RASSF9, JUN, COL4A4, TRAF4, PIN1, SOX7, CFB, CFH, SFTPD, THBS1, AIRE, RAMPS, IL1R2, GNG11, RAPGEF5, DEFB1, GNLY, PHEX, ENG, BMP7, RELA, COL1A1, PLAAT4, CD81, ICAM2, PLAT, CD40LG, NPHS2, IL33, CD58, TIPARP, TNC, PECAM1, C5, EGFR, CD2, BMP2, CTNNB1, MYB, CRHBP, MT2A, EEF1A1, BCL2, SLC19A3, VMP1, PSEN1, MAPK3, TFF3, TNFSF4, CD55, PDPN, IL17RB, IGHG2, CXCL12, CD207, MICA, MMP9, EOMES, EPO, NOS3, KLF2, KLF4, SLC4A1, P2RX4, CCL3/L1, and HPRT1, reagents, controls, and instructions for use. In some embodiments, the kit further comprises instructions for: receiving expression levels of a plurality of genes from a biological sample of a transplant recipient; receiving a plurality of sets of weights for the plurality of genes; generating one or more probability rejection scores of one or more rejection labels based on the plurality of sets of weights and the expression levels; and assigning a predictive rejection classification of the biological sample of the transplant recipient based on the one or more probability rejection scores, wherein the predictive rejection classification classifies the status of the transplant. In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing antibody-mediated rejection (ABMR), T-cell mediated rejection (TCMR), mixed ABMR+TCMR, or no rejection. In some embodiments, generating one or more probability rejection scores of one or more rejection labels comprises: for each rejection label of a plurality of rejection labels, generating a probability rejection score based on the plurality of sets of weights and the expression levels. In some embodiments, each set of weights comprises a weight for a corresponding rejection label. In some embodiments, the transplant recipient received a transplant comprising one or more of: a kidney transplant, a heart transplant, a lung transplant, a pancreas transplant, a liver transplant, an intestinal transplant, or a vascularized composite allograft transplant. In some embodiments, the transplant recipient received a transplant that is an allograft or a xenograft. In some embodiments, the biological sample is an organ tissue sample. In some embodiments, assigning a predictive rejection classification of the biological sample of the transplant recipient comprises assigning the rejection label having the highest probability rejection score amongst the plurality of rejection labels as the predictive rejection classification.
A system for classifying a status of a transplant is disclosed. The system may comprise: a scoring unit that: receives expression levels of a plurality of genes from a biological sample of a transplant recipient; receives a plurality of sets of weights for the plurality of genes; generates one or more probability rejection scores of one or more rejection labels based on the plurality of sets of weights and the expression levels; and assigns a predictive rejection classification of the biological sample of the transplant recipient based on the one or more probability rejection scores, wherein the predictive rejection classification classifies the status of the transplant. In some embodiments, at least one of the plurality of genes is associated with one or more of: immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, or transcription regulation. In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing antibody-mediated rejection (ABMR), T-cell mediated rejection (TCMR), mixed ABMR+TCMR, or no rejection. In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing antibody-mediated rejection (ABMR). In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing T-cell mediated rejection (TCMR). In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing mixed ABMR+TCMR rejection. In some embodiments, the predictive rejection classification classifies the status of the transplant as experiencing no rejection. In some embodiments, generate one or more probability rejection scores of one or more rejection labels comprises: for each rejection label of a plurality of rejection labels, generate a probability rejection score based on the plurality of sets of weights and the expression levels. In some embodiments, each set of weights comprises a weight for a corresponding rejection label. In some embodiments, assign a predictive rejection classification of the biological sample of the transplant recipient comprises assign the rejection label having the highest probability rejection score amongst the plurality of rejection labels as the predictive rejection classification. In some embodiments, the plurality of sets of weights for the plurality of genes is from a machine-learning model trained to: receive a discovery dataset from biological samples of a discovery cohort of transplant recipients, wherein the discovery dataset comprises gene expression levels of a plurality of genes and rejection classifications; analyze the gene expression levels of the discovery dataset for associations with the rejection classifications in the discovery dataset; identify a subset of genes from the plurality of genes of the discovery dataset; and generate the plurality of sets of weights for the subset of genes based on the associations between the gene expression levels of the discovery dataset and the rejection classifications of the discovery dataset, wherein each set of weights is associated with one gene of the subset of genes. In some embodiments, the gene expression levels are analyzed by analyzing nucleic acids from the biological samples of the discovery cohort. In some embodiments, the gene expression levels are analyzed by analyzing RNA from the biological samples of the discovery cohort. In some embodiments, at least some of the rejection classifications of the discovery dataset comprise antibody-mediated rejection (ABMR). In some embodiments, at least some of the rejection classifications of the discovery dataset comprise T-cell mediated rejection (TCMR). In some embodiments, at least some of the rejection classifications of the discovery dataset comprise mixed ABMR+TCMR rejection. In some embodiments, at least some of the rejection classifications of the discovery dataset comprise no rejection. In some embodiments, the expression levels of one or more genes from the plurality of genes of the discovery dataset are normalized relative to gene expression levels of one or more reference genes. In some embodiments, the machine-learning model was validated by: acquiring a validation dataset from biological samples of a validation cohort of transplant recipients, wherein the validation dataset comprises gene expression levels for a plurality of genes and rejection classifications; determining one or more computer-determined predictive rejection classifications from the validation dataset; comparing one or more of the rejection classifications in the validation dataset and the one or more computer-determined predictive rejection classifications; and determining a diagnosis accuracy based on the comparison, wherein the diagnosis accuracy is greater than a predetermined value. In some embodiments, the predetermined value is 60 percent, 70 percent, 80 percent, or 90 percent. In some embodiments, at least one gene of the plurality of genes comprises a gene identified from a group consisting of KIR_Inhibiting_Subgroup_1, IL7R, KLRK1, BK large T Ag, PLA1A, LGALS3, HLA-F, SMAD3, HLA-C, SH2D1B, CXCL11, GBP4, SFTPC, SOST, AGT, HSPA12B, NCAM1, NCR1, ITGA4, LCN2, HLA-DPB1, XCL1/2, BK VP1, COL4A1, ARG2, MCM6, CD59, CD69, SMARCA4, IL18, CMV UL83, SIGIRR, KIT, CD160, SERPINE1, TFRC, CCR7, HLA-B, CXCL8, AQP2, SOD2, SFTPB, HLA-DQA1, IFI6, HFE, MAPK12, GDF15, IFIT1, KLRF1, SERINC5, FOXP3, BCL2L1, FABP1, CCL21, LOX, ROBO4, MYBL1, AGR3, CXCR6, CXCL13, FCER1A, BTG2, CTLA4, CASP3, SPRY4, RAF1, MAPK13, IGF2R, RHOU, LYVE1, CD80, KAAG1, CCL18, EHD3, IL1RL1, CRIP2, TNFSF9, CDH5, CD8B, PRDM1, SIRPG, ABCA1, ADORA2A, RASSF9, JUN, COL4A4, TRAF4, PIN1, SOX7, CFB, CFH, SFTPD, THBS1, AIRE, RAMPS, IL1R2, GNG11, RAPGEF5, DEFB1, GNLY, PHEX, ENG, BMP7, RELA, COL1A1, PLAAT4, CD81, ICAM2, PLAT, CD40LG, NPHS2, IL33, CD58, TIPARP, TNC, PECAM1, C5, EGFR, CD2, BMP2, CTNNB1, MYB, CRHBP, MT2A, EEF1A1, BCL2, SLC19A3, VMP1, PSEN1, MAPK3, TFF3, TNFSF4, CD55, PDPN, IL17RB, IGHG2, CXCL12, CD207, MICA, MMP9, EOMES, EPO, NOS3, KLF2, KLF4, SLC4A1, P2RX4, CCL3/L1, and HPRT1. In some embodiments, the transplant recipient received a transplant comprising one or more of: a kidney transplant, a heart transplant, a lung transplant, a pancreas transplant, a liver transplant, an intestinal transplant, or a vascularized composite allograft transplant. In some embodiments, the transplant recipient received a transplant that is an allograft or a xenograft. In some embodiments, the biological sample is an organ tissue sample.
Disclosed herein are systems, kits, and methods for classifying the status of a transplant. The status of a transplant may be classified based on expression levels of a plurality of genes from a biological sample of a transplant recipient. The status of a transplant may be classified based on a predictive rejection classification. Example predictive rejection classifications may include, but not limited to, antibody-mediated rejection (ABMR), T-cell mediated rejection (TCMR), mixed ABMR+TCMR, and no rejection. The predictive rejection classification may be assigned based on probability rejection scores. In some embodiments, a probability rejection score may be assigned to each rejection label. In some embodiments, the rejection label having the highest probability rejection score amongst the plurality of rejection labels may be assigned as the predictive rejection classification. Non-limiting rejection labels may include ABMR, TCMR, mixed ABMR+TCMR, and no rejection. The probability rejection score of each rejection label may be generated based on a plurality of sets of weights and expression levels of genes.
The computer-determined status of a transplant may be provided by way of a medical analysis tool that is readily accessible to a physician or medical expert. The medical analysis tool may display the status of a transplant on, e.g., a user interface, a report printout, etc. The physician or medical expert may use the computer-determined status in addition to, or instead of, the physician's or medical expert's assessment of the status of the transplant. The computer-determined status may be provided to the physician or medical expert as the predictive rejection classification and/or probability rejection score(s) for one or more rejection labels. For example, the medical analysis tool may output ABMR, TCMR, mixed ABMR+TCMR, or no rejection as the predictive rejection classification for a given biological sample of a transplant recipient. As another non-limiting example, the medical analysis tool may output 30% ABMR, 50% TCMR, 15% mixed ABMR+TCMR, and 5% no rejection as the probability rejection scores for the rejection labels for a given biological sample of an transplant recipient. The computer-determined status may be used by the physician or medical expert as a guide for treatment options, monitoring protocols, and/or clinical diagnosis.
By quantifying the status of a transplant or classifying the status based on quantified values (e.g., probability rejection scores), the status of a transplant may be objective, consistent, and reliable. The disclosed computer-implemented method can be used to compare the status of a transplant at one point in time to another point in time. Additionally or alternatively, the status may be used as a guide for deciding treatment options and related timing. A systematic assessment may help to better characterize a transplant recipient's response to therapy and help inform subsequent management and care. The results of the computer-implemented methods may be more reproducible such that variations in results between transplant recipients, or from different measurement times for a given transplant recipient, may be reduced.
The plurality of sets of weights may correspond to the plurality of genes and may be received by a machine-learning model. The machine-learning model may generate the plurality of sets of weights based on a discovery dataset (from biological samples from a discovery cohort of transplant recipients) and rejection classifications. The machine-learning model may analyze the gene expression levels of the discovery dataset for associations with rejection classifications in the discovery dataset. A subset of genes from the plurality of genes of the discovery dataset may be identified. A machine-learning model may generate the plurality of sets of weights for the subset of genes. In some embodiments, the plurality of sets of weights may be based on associations between the gene expression levels of the discovery dataset and the rejection classifications of the discovery dataset. In some embodiments, each set of weights may be associated with one gene of the subset of genes. For example, a first set of weights of 100.0, 0.0, 0.0, and 0.0 for no rejection, ABMR, TCMR, and mixed ABMR+TCMR, respectively, may be associated with the gene KIR_Inhibiting_Subgroup_1.
The following description is presented to enable a person of ordinary skill in the art to make and use the various embodiments. Descriptions of specific devices, techniques, and applications are provided only as examples. These examples are being provided solely to add context and aid in the understanding of the described examples. It will thus be apparent to a person of ordinary skill in the art that the described examples may be practiced without some or all of the specific details. Other applications are possible, such that the following examples should not be taken as limiting. Various modifications in the examples described herein will be readily apparent to those of ordinary skill in the art, and the general principles defined herein may be applied to other examples and applications without departing from the spirit and scope of the various embodiments. The various embodiments are not intended to be limited to the examples described herein and shown, but are to be accorded the scope consistent with the claims.
Various techniques and process flow steps will be described in detail with reference to examples as illustrated in the accompanying drawings. In the following description, numerous specific details are set forth in order to provide a thorough understanding of one or more aspects and/or features described or referenced herein. It will be apparent, however, to a person of ordinary skill in the art, that one or more aspects and/or features described or referenced herein may be practiced without some or all of these specific details. In other instances, well-known process steps and/or structures have not been described in detail in order to not obscure some of the aspects and/or features described or referenced herein.
In the following description of examples, reference is made to the accompanying drawings which form a part hereof, and in which, by way of illustration, specific examples are shown that can be practiced. It is to be understood that other examples can be used, and structural changes can be made without departing from the scope of the disclosed examples.
The terminology used in the description of the various described embodiments herein is for the purpose of describing particular embodiments only and is not intended to be limiting. As used in the description of the various described embodiments and the appended claims, the singular forms “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will also be understood that the term “and/or” as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items. It will be further understood that the terms “includes,” “including,” “comprises,” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The term “sample” or “biological sample,” as used herein, refers to any sample obtained from a transplant recipient including, but not limited to, tissue and/or cells from a biopsy, whole blood, plasma, serum, lymph, peripheral blood mononuclear cells, buccal swabs, saliva, or urine.
The term “transplant” includes solid organ transplants as well as hollow organ transplants, e.g., gastrointestinal transplants, from an allogeneic, i.e., non-self, origin within the same species, or across species from a xenogeneic origin, such as a xenotransplant or xenograft. The term “transplant” also includes cellular transplants such as hematopoietic stem cells, pancreatic islet cells, pluripotent cells, skin tissue, skin cells, immune cells including, but not limited to, NK cells and T cells, from allogeneic or xenogeneic origin. The term “transplant” also includes cellular transplants of autologous, i.e., self, origin, e.g., a transplant comprising autologous cells that originate from the recipient, including autologous cells that were genetically engineered before re-administration into the recipient. The terms “transplant” and “allograft” are used interchangeably herein and, in meaning, include a xenograft. A “transplant” refers to any transplant that is transplanted on its own or in combination with one or more transplants.
The term “solid organ transplant,” as used herein, refers to any transplant of a solid organ including, but not limited to, a kidney transplant, a heart transplant, a lung transplant, a liver transplant, a pancreas transplant, a vascularized composite allograft transplant, or combinations of the above transplants.
The term “gene cluster” or “cluster,” as used herein, refers to a group of two or more genes with a related gene expression pattern, e.g., gene expression levels that have a level or degree of correlation or association.
The term “TCMR,” as used herein, refers to cellular or T-cell mediated (allograft or xenograft) rejection including, but not limited to, acute active cellular or T-cell-mediated rejection, chronic active cellular or T-cell-mediated rejection, and chronic stable cellular or T-cell-mediated rejection.
The term “ABMR,” as used herein, refers to antibody-mediated (allograft or xenograft) rejection including, but not limited to, acute active antibody-mediated rejection, chronic active antibody-mediated rejection, and chronic stable antibody-mediated rejection.
The terms “mixed rejection,” and “mixed ABMR+TCMR” refer to rejection that shows characteristics of both ABMR and TCMR.
The terms “no rejection” and “non-rejection,” as used herein, refer to a state that is characterized by an absence of biopsy-confirmed ABMR, TCMR, and/or mixed ABMR+TCMR, or absence of significant rejection-associated clinical symptoms, e.g., as indicated by elevated serum creatinine levels, decreased estimated glomerular filtration rate, abnormal echocardiogram results or some other clinical concern that would indicate a clinical need for a biopsy. The terms “no rejection” and “non-rejection,” as used herein, may also refer to a state that is characterized by low levels of immune activity indicating a resting, quiescent state of the immune system.
The term “nucleic acid,” as used herein, refers to RNA or DNA that is linear or branched, single or double stranded, or a hybrid thereof. The term also encompasses RNA/DNA hybrids.
The term “gene,” as used herein, refers to a nucleic acid, e.g., DNA or RNA, sequence that comprises coding sequences necessary for the production of RNA or a polypeptide. A polypeptide can be encoded by a full-length coding sequence or by any part thereof.
The term “gene expression,” as used herein, refers to the production of a transcriptional or translational product of a gene, e.g., total RNA, mRNA, a splice variant mRNA, or polypeptide. Unless otherwise apparent from the context, gene expression levels can be measured at the RNA and/or polypeptide level. The measurement of gene expression may provide an indication of the presence of transplant rejection or presence of a likelihood or probability of transplant rejection, characterized by elevated activity of cells of the immune system, or an indication of the absence of transplant rejection or absence of a likelihood or probability of transplant rejection, characterized by a resting state of cells of the immune system demonstrating absence of immune activity or low levels of immune activity. The gene expression measurements, optionally normalized relative to gene expression levels of one or more reference genes, may be used to compute probability rejection scores in accordance with an indication of the presence or absence of a probability of transplant rejection. Such probability rejection scores may be used to predict the likelihood of a clinical outcome, e.g., the likelihood of transplant rejection or the likelihood of “no rejection”, in a transplant recipient. For example, such probability rejection scores would enable a treating physician or medical expert to identify transplant recipients who have a high likelihood of “no rejection” and therefore do not require adjustment, e.g., increase, decrease, change, or initiation, of their immunosuppressive treatment, or have a high likelihood of transplant rejection and therefore would require adjustment of their immunosuppressive treatment. The probability rejection scores may be the basis for assigning a predictive rejection classification to classify the status of a transplant.
The term “machine-readable medium,” as used herein, refers to both a single medium and multiple media (e.g., a centralized or distributed database and/or associated caches and servers) that store one or more sets of instructions, and includes any medium that is capable of storing, encoding, or carrying a set of instructions for execution by a device and that causes a device to perform any method disclosed herein and more. The term “machine-readable medium,” as used herein, includes but is not be limited to solid-state memories, optical and magnetic media, and carrier wave signals.
System 100 may include an interface 160 and a scoring unit 170. Examples of the disclosure may include some or all of the components shown in the figure, or other components not shown in the figure. The system 100 may be, for example, a medical analysis tool. A treating physician or medical expert may use the medical analysis tool to help monitor the status of a transplant in a transplant recipient, as well as monitor and/or suggest an adjustment to an immunosuppressive therapy administered, or to be administered, to a transplant recipient. Monitoring the status of a transplant involves analyzing various aspects that provide useful information about the physiological state of the transplant. The methods of the present disclosure may be used to classify the status of a transplant by way of a predictive rejection classification 180. The predictive rejection classification 180 may indicate the diagnosis that has the highest probability rejection score amongst the plurality of other diagnoses.
The interface 160 may receive the expression levels of a plurality of genes 140 of a biological sample of a transplant recipient. In some embodiments, the expression levels may be provided as user input (e.g., input from a physician or medical expert). The scoring unit 170 may be a tool for assessing the status of a transplant. The scoring unit 170 may receive a plurality of sets of weights 150 (e.g., from a machine-learning model) and the expression levels of a plurality of genes 140. The scoring unit 170 may assign a predictive rejection classification 180 to the biological sample of the transplant recipient.
In some embodiments, system 100 may be a kit used by a treating physician or medical expert for post-transplant monitoring. The kit may classify the status of a transplant, e.g., an organ transplant, in a biological sample from a recipient of a transplant, according to one or more methods disclosed herein. The kit may comprise a set of probesets specific for one or more genes from the plurality of genes.
In step 204, the system may receive a plurality of sets of weights for the plurality of genes. The plurality of sets of weights may be received from a machine-learning model, for example. As discussed in more detail below, the machine-learning model may be trained to generate the plurality of sets of weights based on a discovery dataset and corresponding rejection classifications. As one non-limiting example, each gene, or subset of genes, of the plurality of genes may have a corresponding set of weights. The generation of the plurality of sets of weights is discussed in more detail below.
In step 206, the system may use the scoring unit 170 of
In step 208, the system may assign a predictive rejection classification of the biological sample of a transplant recipient, for example, an organ transplant recipient. The predictive rejection classification may classify the status of the transplant, e.g., the organ transplant. The system may assign each biological sample (e.g., organ tissue sample) one of multiple classifications or diagnoses, such as four diagnoses comprising three different types of rejection and no rejection. A predictive rejection classification may include but is not limited to, ABMR, TCMR, mixed ABMR+TCMR, or no rejection.
In some embodiments, the predictive rejection classification may be assigned based on one or more probability rejection scores. The sum of the probability rejection scores may be equal to 1 or 100%, for example. The rejection label having the highest probability rejection score amongst the plurality of rejection labels may be assigned as the predictive rejection classification, in some embodiments. Returning to the previous example of 30% ABMR, 50% TCMR, 15% mixed ABMR+TCMR, and 5% no rejection for probability rejection scores, the system may assign a predictive rejection classification of TCMR due to TCMR having the highest probability rejection score of 50% amongst the plurality of rejection labels. The assignment of the predictive rejection classification is discussed in more detail below.
Embodiments of the disclosure may include repeating one or more steps of method 200 and/or method 400 (discussed below). Although the descriptions and figures show particular steps of the method occurring in a particular order, the steps of the method may occur in other orders not described or shown. Additionally or alternatively, embodiments of the disclosure may include performing all, some, or none of the steps of method 200 and/or method 400, where appropriate. Furthermore, although certain components, devices, or systems are described as carrying out the steps of method 200 and/or method 400, any suitable combination of components, devices, or systems (including ones not explicitly disclosed) may be used to carry out the steps.
As discussed above, the system 100 (e.g., a medical analysis tool) may receive expression levels of a plurality of genes from a biological sample of a transplant recipient, e.g., an organ transplant recipient. In some embodiments, the expression levels may be used to generate one or more probability rejection scores (e.g., step 206 of method 200 in
The plurality of sets of weights may be generated by a machine-learning model 330, for example, as shown in the example system of
In some embodiments, before determining expression levels of a plurality of genes from a biological sample of a transplant recipient, the biological sample may be processed using, e.g., light, immunofluorescence, and electron microscopy. For example, transplant biopsies may undergo immunohistochemical staining for polyomavirus by SV40 on formalin-fixed paraffin embedded (FFPE) tissue or immunofluorescence staining for C4d on unfixed tissue. One or more tissue sections from each FFPE block of renal core biopsy tissue may be dissected using a cutting tool such as a microtome. Dissected tissue sections may be used directly or stored at conditions that maintain the integrity of the nucleic acids, e.g., RNA, and prevent degradation and/or contamination of the tissue sections, until further processed, e.g., for RNA extraction.
The biomarker unit 310 can be configured to determine one or more characteristics of a biological sample of a transplant recipient, e.g., an organ transplant recipient. For example, the biomarker unit 310 may analyze expression levels of a plurality of genes from the biological sample. The transplant recipient may have received a transplant comprising one or more of: a kidney transplant, a heart transplant, a lung transplant, a pancreas transplant, a liver transplant, an intestinal transplant, or a vascularized composite allograft transplant. In some embodiments, the transplant recipient may have received a transplant that is an allograft or a xenograft. In some embodiments, the analysis of the gene expression levels may comprise analyzing for associations with rejection classifications (e.g., of the discovery dataset). In certain embodiments, the analysis of the gene expression levels may comprise analyzing for associations with ABMR. In other embodiments, the analysis of the gene expression levels may comprise analyzing for associations with TCMR. In certain embodiments, the analysis of the gene expression levels may comprise analyzing for associations on the basis of association strength, e.g., low, moderate, or high association strength, as generally interpreted by a person skilled in the art based on the statistical significance of the determined association strength.
Example genes that may be informative with respect to analyzing associations with transplant rejection classifications on the basis of their gene expression levels, and, thus, informative with respect to the status of a transplant in a transplant recipient, in accordance with embodiments of the disclosure, may include, but are not limited to, one or more genes that are associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, and/or transcription regulation. For example, in some embodiments such genes may be informative, on the basis of their gene expression levels, with respect to whether the transplant recipient experiences “no rejection” or active rejection, e.g., TCMR, ABMR, or mixed ABMR+TCMR. In some embodiments, such genes may be informative, on the basis of their gene expression levels, with respect to the strength of association with one or more rejection classification. The same one or more informative genes may be used for each transplant recipient; there may not be a need to customize the one or more informative genes to different recipients of transplants.
Table 1 lists non-limiting example informative genes that are associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, and/or transcription regulation; some of these genes belong to correlating gene clusters, as shown in Table 2, exhibiting a correlation of at least 0.6 or 60%. Genes that exhibit a level or degree of correlation of at least 0.6 or 60% with the example informative genes described herein are also considered to be informative, at least on the basis of their gene expression levels, with respect to whether a transplant recipient experiences “no rejection” or active rejection, e.g., TCMR, ABMR, or mixed ABMR+TCMR, and are considered to be within the scope of the present disclosure.
In some embodiments, two or more genes are determined to be correlated if they exhibit similar expression patterns across a set of samples from transplant recipients, some of whom have experienced transplant rejection and some of whom have not experienced transplant rejection. In some embodiments, two or more genes are determined to be correlated when their expression levels increased or decreased to a similar extent in the same samples. Exemplary methods for clustering based on gene expression patterns are described, for example, in Oyelade, J. et al., Bioinform Biol Insights. 2016; 10: 237-253 which is hereby incorporated by reference in its entirety. In some embodiments, clustering is based on genes, samples, and/or other variables, and is performed using various clustering methods such as hierarchical clustering (HC), self-organizing maps (SOM), and/or K-means clustering.
In some embodiments, the plurality of genes associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, and/or transcription regulation may comprise 2-10, 11-20, 21-30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 91-100, 101-120, 121-150, 151-200, 201-250, 251-300, 301-400, 401-500, 501-600, 601-700, 701-800, 801-1000, or more, genes. In some embodiments, at least one gene of the plurality of genes may comprise a gene identified from a group consisting of KIR_Inhibiting_Subgroup_1, IL7R, KLRK1, BK large T Ag, PLA1A, LGALS3, HLA-F, SMAD3, HLA-C, SH2D1B, CXCL11, GBP4, SFTPC, SOST, AGT, HSPA12B, NCAM1, NCR1, ITGA4, LCN2, HLA-DPB1, XCL1/2, BK VP1, COL4A1, ARG2, MCM6, CD59, CD69, SMARCA4, IL18, CMV UL83, SIGIRR, KIT, CD160, SERPINE1, TFRC, CCR7, HLA-B, CXCL8, AQP2, SOD2, SFTPB, HLA-DQA1, IFI6, HFE, MAPK12, GDF15, IFIT1, KLRF1, SERINC5, FOXP3, BCL2L1, FABP1, CCL21, LOX, ROBO4, MYBL1, AGR3, CXCR6, CXCL13, FCER1A, BTG2, CTLA4, CASP3, SPRY4, RAF1, MAPK13, IGF2R, RHOU, LYVE1, CD80, KAAG1, CCL18, EHD3, IL1RL1, CRIP2, TNFSF9, CDH5, CD8B, PRDM1, SIRPG, ABCA1, ADORA2A, RASSF9, JUN, COL4A4, TRAF4, PIN1, SOX7, CFB, CFH, SFTPD, THBS1, AIRE, RAMPS, IL1R2, GNG11, RAPGEF5, DEFB1, GNLY, PHEX, ENG, BMP7, RELA, COL1A1, PLAAT4, CD81, ICAM2, PLAT, CD40LG, NPHS2, IL33, CD58, TIPARP, TNC, PECAM1, C5, EGFR, CD2, BMP2, CTNNB1, MYB, CRHBP, MT2A, EEF1A1, BCL2, SLC19A3, VMP1, PSEN1, MAPK3, TFF3, TNFSF4, CD55, PDPN, IL17RB, IGHG2, CXCL12, CD207, MICA, MMP9, EOMES, EPO, NOS3, KLF2, KLF4, SLC4A1, P2RX4, CCL3/L1, and HPRT1.
In some embodiments, at least one gene of the plurality of genes may comprise a gene that is determined to be correlated with a gene that is associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, or transcription regulation.
In some embodiments, the biomarker unit 310 may provide gene expression levels by testing a gene panel comprising one or more informative genes from a plurality of genes associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, and/or transcription regulation, utilizing a biological sample, such as FFPE renal allograft biopsy tissue, comprising nucleic acids. In some embodiments, the nucleic acids from the biological sample comprise mRNA. In some embodiments, the nucleic acids from the biological sample comprise total RNA. In some embodiments, the nucleic acids, e.g., total RNA, may be extracted from the biological sample, e.g., from tissue curls of an organ tissue sample. Various methods of extracting nucleic acids, such as mRNA or total RNA, are known in the art, e.g., methods as described in Sambrook et al. Molecular Cloning: A Laboratory Manual 4th edition (2014) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; Ausubel, et al., Current Protocols in Molecular Biology (2010). Nucleic acid extraction may also be performed using commercial purification kits, buffer sets, and proteases in accordance to the manufacturers' instructions or any suitable method. Once the nucleic acids are extracted, they can be frozen or otherwise stored in a condition that maintains the integrity and prevents degradation and/or contamination of the nucleic acids, or used directly for downstream applications and analysis, such as analysis of gene expression levels of one or more informative genes. In some embodiments, gene expression levels may be determined by analyzing total RNA from the sample, e.g., using RNA-sequencing. In some embodiments, gene expression levels may be determined by analyzing mRNA from the sample. In some embodiments, the RNA may be fragmented and used as a template to synthesize cDNA. The cDNA may be then subjected to 3′-adenylation and 5′-end repair. Sequencing adaptors may be ligated onto the 3′-adenylation and 5-end repaired cDNA, and the adaptor-ligated cDNA may then be amplified prior to sequencing. In some embodiments, gene expression levels are determined by quantifying RNA levels, e.g., mRNA transcript levels, without amplification and/or reverse transcription to cDNA, e.g., using a gene expression platform such as the NanoString Technologies nCounter® system. In some embodiments, a gene expression platform may quantify mRNA transcript levels for one or more informative genes from the gene panel. As discussed in more detail below, the gene panel may be a subset of genes identified from a plurality of genes of the biological sample, associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, and/or transcription regulation. In some embodiments, a gene expression platform may be utilized that does not require instant preservation in RNA stabilization and storage reagents after sample collection, e.g., by using the same biopsy core from the routine histopathologic assessment. In some embodiments, a FFPE organ tissue sample, e.g., a renal allograft biopsy tissue sample, is obtained from routine clinical pathology practice and used for determining gene expression levels. In some embodiments, the FFPE organ tissue sample used for the determination of gene expression levels may be archived clinical samples, including older samples (e.g., 5, 6, 10, 13 years old, etc.). In some embodiments, a gene expression platform, such as the NanoString Technologies nCounter® system, may be used to develop gene expression signatures for transplant rejection diagnosis in recipients of transplants.
In some embodiments, the database 320 may store various characteristics, such as gene expression levels of a plurality of genes in a biological sample, e.g., an organ tissue sample, from a transplant recipient that may be informative with regards to determining the status of the transplant, or transplant lesion scores, e.g., organ transplant lesion scores, that may have been assigned by one or more pathologists, e.g., renal pathologists, upon histopathological evaluation of a biological sample, e.g., an organ tissue sample, from a kidney transplant recipient. In some embodiments, one or more lesion scores may be stored in the database 320. In some embodiments, one or more rejection classifications may be stored in the database 320 that may have been assigned by one or more pathologists, e.g., renal pathologists, based on the one or more lesion scores that were assigned upon histopathological evaluation of a biological sample, e.g., an organ tissue sample. In some embodiments, one or more rejection classifications may be stored in the database 320 that may have been assigned by one or more pathologists, e.g., renal pathologists, based on the one or more lesion scores alone or in combination with additional lab test results. In some embodiments, one or more rejection classifications may be stored in the database 320 that may have been assigned based on the one or more lesion scores alone or in combination with additional lab test results, in accordance with guidelines for the classification of human transplants, e.g., Banff 2019 classification guidelines for human organ transplants (Mengel et al. (2019) Am J Transplant. 2020 20: 2305-2317.)
In some embodiments, the data stored in the database 320 may comprise a discovery dataset from biological samples of a discovery cohort of transplant recipients, e.g., organ transplant recipients, and a validation dataset from biological samples of a validation cohort of transplant recipients, e.g., organ transplant recipients. In some embodiments, one or more of the transplant recipients (of the discovery dataset, validation dataset, or both) may have received an organ transplant comprising one or more of: a kidney transplant, a heart transplant, a lung transplant, a pancreas transplant, a liver transplant, an intestinal transplant, or a vascularized composite allograft transplant. In some embodiments, one or more of the transplant recipients may have received a transplant that is an allograft or a xenograft. In some embodiments, the discovery dataset may comprise gene expression levels of a plurality of genes and rejection classifications for the discovery cohort biological samples. In some embodiments, the discovery dataset may comprise gene expression levels of a plurality of genes associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, and/or transcription regulation. In some embodiments, the validation dataset may comprise gene expression levels of a plurality of genes and rejection classifications for the validation cohort biological samples. The discovery dataset may also comprise rejection classifications for the discovery cohort biological samples. In some embodiments, the validation dataset may comprise gene expression levels of a plurality of genes associated with immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, and/or transcription regulation.
In some embodiments, the discovery dataset may comprise data from biological samples, obtained from transplant recipients, exhibiting diverse histologic findings (e.g., different types of rejection, non-diagnostic for rejection from both renal allograft and native kidney, etc.). For example, in some embodiments, at least some of the rejection classifications of the discovery dataset may comprise ABMR. In some embodiments, at least some of the rejection classifications of the discovery dataset may comprise TCMR. In some embodiments, at least some of the rejection classifications of the discovery dataset may comprise mixed ABMR+TCMR. In some embodiments, at least some of the rejection classifications of the discovery dataset may comprise no rejection.
Embodiments of the disclosure may comprise systems and methods capable of differentiating conditions of inflammation associated with renal allograft rejection from conditions caused by other pathologic conditions unrelated to rejection (including various types of viral or bacterial infection, or various types of glomerulopathy). In some embodiments, the discovery dataset may include data from biological samples, e.g., organ tissue samples such as biopsy samples, that originate from both a native organ (e.g., native kidney(s)), an organ transplant (e.g., kidney transplant), and exhibit various types of inflammation, such as cytomegalovirus (CMV) or BK virus (BKV) nephropathy, acute pyelonephritis, diabetic nephropathy, etc.
The machine-learning model 330 may be trained to generate a plurality of sets of weights to be used by system 100 in generating one or more probability rejection scores and assigning a predictive rejection classification. In some embodiments, the machine-learning model 330 may be trained to analyze gene expression levels of a discovery dataset for associations with rejection classifications in the discovery dataset. In some embodiments, the machine-learning model 330 may narrow down the set of genes (to which sets of weights are generated) by identifying a subset of genes (from the plurality of informative genes of the discovery dataset) based on the fitting process disclosed herein. In some embodiments, the machine-learning model 330 may generate a plurality of sets of weights for the subset of genes.
For example, as shown in
In some embodiments, at least one of the plurality of informative genes of the discovery dataset may be associated with one or more of: immune cell activation, organ-specific defense against pathogens, regulation of tissue and cellular processes, or transcription regulation.
In some embodiments, at least one gene of the plurality of informative genes of the discovery dataset may comprise a gene identified from a group consisting of KIR_Inhibiting_Subgroup_1, IL7R, KLRK1, BK large T Ag, PLA1A, LGALS3, HLA-F, SMAD3, HLA-C, SH2D1B, CXCL11, GBP4, SFTPC, SOST, AGT, HSPA12B, NCAM1, NCR1, ITGA4, LCN2, HLA-DPB1, XCL1/2, BK VP1, COL4A1, ARG2, MCM6, CD59, CD69, SMARCA4, IL18, CMV UL83, SIGIRR, KIT, CD160, SERPINE1, TFRC, CCR7, HLA-B, CXCL8, AQP2, SOD2, SFTPB, HLA-DQA1, IFI6, HFE, MAPK12, GDF15, IFIT1, KLRF1, SERINC5, FOXP3, BCL2L1, FABP1, CCL21, LOX, ROBO4, MYBL1, AGR3, CXCR6, CXCL13, FCER1A, BTG2, CTLA4, CASP3, SPRY4, RAF1, MAPK13, IGF2R, RHOU, LYVE1, CD80, KAAG1, CCL18, EHD3, IL1RL1, CRIP2, TNFSF9, CDH5, CD8B, PRDM1, SIRPG, ABCA1, ADORA2A, RASSF9, JUN, COL4A4, TRAF4, PIN1, SOX7, CFB, CFH, SFTPD, THBS1, AIRE, RAMPS, IL1R2, GNG11, RAPGEF5, DEFB1, GNLY, PHEX, ENG, BMP7, RELA, COL1A1, PLAAT4, CD81, ICAM2, PLAT, CD40LG, NPHS2, IL33, CD58, TIPARP, TNC, PECAM1, C5, EGFR, CD2, BMP2, CTNNB1, MYB, CRHBP, MT2A, EEF1A1, BCL2, SLC19A3, VMP1, PSEN1, MAPK3, TFF3, TNFSF4, CD55, PDPN, IL17RB, IGHG2, CXCL12, CD207, MICA, MMP9, EOMES, EPO, NOS3, KLF2, KLF4, SLC4A1, P2RX4, CCL3/L1, and HPRT1.
In some embodiments, at least one gene of the plurality of informative genes of the discovery dataset may comprise a gene that exhibits a correlation of at least 0.6 or 60% with a gene identified from a group consisting of KIR_Inhibiting_Subgroup_1, IL7R, KLRK1, BK large T Ag, PLA1A, LGALS3, HLA-F, SMAD3, HLA-C, SH2D1B, CXCL11, GBP4, SFTPC, SOST, AGT, HSPA12B, NCAM1, NCR1, ITGA4, LCN2, HLA-DPB1, XCL1/2, BK VP1, COL4A1, ARG2, MCM6, CD59, CD69, SMARCA4, IL18, CMV UL83, SIGIRR, KIT, CD160, SERPINE1, TFRC, CCR7, HLA-B, CXCL8, AQP2, SOD2, SFTPB, HLA-DQA1, IFI6, HFE, MAPK12, GDF15, IFIT1, KLRF1, SERINC5, FOXP3, BCL2L1, FABP1, CCL21, LOX, ROBO4, MYBL1, AGR3, CXCR6, CXCL13, FCER1A, BTG2, CTLA4, CASP3, SPRY4, RAF1, MAPK13, IGF2R, RHOU, LYVE1, CD80, KAAG1, CCL18, EHD3, IL1RL1, CRIP2, TNFSF9, CDH5, CD8B, PRDM1, SIRPG, ABCA1, ADORA2A, RASSF9, JUN, COL4A4, TRAF4, PIN1, SOX7, CFB, CFH, SFTPD, THBS1, AIRE, RAMPS, IL1R2, GNG11, RAPGEF5, DEFB1, GNLY, PHEX, ENG, BMP7, RELA, COL1A1, PLAAT4, CD81, ICAM2, PLAT, CD40LG, NPHS2, IL33, CD58, TIPARP, TNC, PECAM1, C5, EGFR, CD2, BMP2, CTNNB1, MYB, CRHBP, MT2A, EEF1A1, BCL2, SLC19A3, VMP1, PSEN1, MAPK3, TFF3, TNFSF4, CD55, PDPN, IL17RB, IGHG2, CXCL12, CD207, MICA, MMP9, EOMES, EPO, NOS3, KLF2, KLF4, SLC4A1, P2RX4, CCL3/L1, and HPRT1.
Returning to
In some embodiments, the expression levels of one or more genes from the plurality of genes of the discovery dataset may be normalized relative to gene expression levels of one or more reference genes. In some embodiments, normalization may be performed using housekeeping genes. In some embodiments, normalization may be performed using normalization quality control metrics.
In step 406, a subset of genes from the plurality of genes of the discovery dataset may be identified. In one example embodiment of the disclosure, the plurality of genes of the discovery dataset represented more than 700 genes, and a subset of less than 200 genes was identified for subsequent predictive rejection classification.
In step 408, the machine-learning model 330 may generate a plurality of sets of weights for the subset of genes (from step 406). The plurality of sets of weights may be generated based on the associations between the gene expression levels of the discovery dataset and the rejection classifications of the discovery dataset. In some embodiments, each set of weights may be associated with one gene of the subset of genes. The plurality of sets of weights may be calculated using a prediction model such as lasso regularized regression, elastic net random forests, gradient boosted machine, k nearest neighbors, or support vector machine. The process performed by the prediction model may involve fitting by cross-validation (e.g., a 10-fold cross-validation) to determine one or more hyper-parameter.
Embodiments of the disclosure may include training the machine-learning model. The machine-learning model may be trained by using a discovery dataset from biological samples of a discovery cohort of transplant recipients, e.g., organ transplant recipients. The machine-learning model may be trained to receive the discovery dataset, analyze gene expression levels of the discovery dataset, identify a subset of genes, and generate a plurality of sets of weights for the subset of genes.
The machine-learning model may be validated using a validation cohort to determine whether it was trained according to certain criteria, e.g., diagnosis accuracy. In some embodiments, a diagnosis accuracy being greater than a predetermined value may be one criterion. In some embodiments, the diagnosis accuracy may be determined based on a comparison of one or more rejection classifications in a dataset and one or more computer-determined predictive rejection classifications.
In some embodiments, a dataset used for validating the machine-learning model may be a validation dataset or cohort, as shown in
The computer-determined predictive rejection classifications may be determined for a validation dataset using probability rejection scores generated based on the expression levels of a plurality of genes with a plurality of sets of weights generated by the machine-learning model from biological samples of the validation cohort. In some embodiments, the computer-determined predictive classifications will be acceptable if diagnosis accuracy exceeds a predetermined value. In some embodiments, the predetermined value may be 60%, 70%, 80%, or 90%. The diagnosis accuracy may represent the percentage of the predictive rejection classifications in the validation dataset that match the computer-determined predictive rejection classifications (determined from the validation dataset).
In some embodiments, the diagnosis accuracy may be different for different predictive rejection classifications and/or different datasets.
In some embodiments, the performance characteristics, for example, the diagnosis accuracy, sensitivity, specificity, of the disclosed systems and methods may be different for different predictive rejection classifications and/or different datasets.
If the machine-learning model is not trained adequately (e.g., the diagnosis accuracy is not greater than the predetermined value), the training data (e.g., discovery dataset) may be revised to provide feedback to the model. In some embodiments, the output of the machine-learning model between training iterations may be evaluated by a medical expert or treating physician to determine which data in the training data should be revised. The treating physician or medical expert can revise certain data in areas of potential improvement, such as the weights of the expression levels of the plurality of genes.
Immunosuppressive therapy generally refers to the administration of an immunosuppressant or other therapeutic agent that suppresses immune responses to a transplant recipient. Example immunosuppressant agents may include, for example, calcineurin inhibitors, mTor inhibitors, anticoagulants, antimalarials, cardiovascular agents including but not limited to ACE inhibitors and β-blockers, non-steroidal anti-inflammatory drugs (NSAIDs), aspirin, azathioprine, B7RP-1-fc, brequinar sodium, campath-1H, celecoxib, chloroquine, corticosteroids, coumadin, cyclophosphamide, cyclosporin A, DHEA, deoxyspergualin, dexamethasone, diclofenac, dolobid, etodolac, everolimus, FK778, feldene, fenoprofen, flurbiprofen, heparin, hydralazine, hydroxychloroquine, CTLA-4 or LFA3 immunoglobulin, ibuprofen, indomethacin, ISAtx-247, ketoprofen, ketorolac, leflunomide, meclophenamate, mefenamic acid, mepacrine, 6-mercaptopurine, meloxicam, methotrexate, mizoribine, mycophenolate mofetil, naproxen, oxaprozin, Plaquenil, NOX-100, prednisone, methylprednisolone, rapamycin (sirolimus), sulindac, tacrolimus (FK506), thymoglobulin, tolmetin, tresperimus, UO126, and antibodies including, for example, alpha lymphocyte antibodies, adalimumab, anti-CD3, anti-CD25, anti-CD52, anti-IL2R, anti-TAC antibodies, basiliximab, daclizumab, etanercept, hu5C8, infliximab, OKT4, natalizumab, and any combination thereof. Immunosuppressive therapy may be adjusted in response to the classification of the status of a transplant as experiencing “no rejection,” ABMR, TCMR, or mixed ABMR+TCMR rejection. For example, in response to the classification of the status of a transplant as experiencing TCMR, bolus steroid treatment may be initiated, or maintenance immunosuppressive therapy may be increased with respect to dosage and/or frequency. In response to the classification of the status of a transplant as experiencing ABMR, for example, plasmapheresis or intravenous immunoglobulin (IVIg) may be initiated.
In some embodiments, no change in the status of a transplant (e.g., as indicated by no change in the predictive rejection classification) may indicate no need to adjust immunosuppressive therapy being administered to the transplant recipient, or that the immunosuppressive therapy being administered may be maintained. The decision to maintain immunosuppressive therapy being administered to a transplant recipient may be based on additional clinical factors, such as, for example, the health, age, comorbidities of the transplant recipient.
In some embodiments, adjustment of immunosuppressive therapy includes changing the type, form, or frequency of immunosuppressive therapy or other transplant-related therapy being administered to the transplant recipient. In some embodiments, where the transplant recipient is not receiving immunosuppressive therapy, the methods of the present disclosure may indicate a need to begin administering immunosuppressive therapy to the transplant recipient.
Other transplant-related therapies include treatments or therapies besides transplantation or immunosuppressive therapy that are administered to a transplant recipient to promote survival of the transplant or to treat transplant-related symptoms (e.g., cytokine release syndrome, neurotoxicity). Examples of other transplant-related therapies include, but are not limited to, administration of antibodies, antigen-targeting ligands, non-immunosuppressive drugs, and other agents that stabilize or destabilize components of transplants that are critical to transplant activity or that directly activate or inhibit one or more transplant activity. These activities may include the ability to induce an immune response, recognize particular antigens, replicate, and/or induce repair of damaged tissues. Adjusting immunosuppressive therapy may be combined with adjusting, initiating, or discontinuing other transplant-related therapies.
The methods of the disclosure may classify the status of a transplant, e.g., an organ transplant. The status of the transplant can be used to inform the need to adjust monitoring of the transplant recipient. In general, changes in the predictive rejection classification over time may be informative with regard to determining a need to adjust monitoring of a transplant recipient. In some embodiments, classifying the status of a transplant, as described above, is informative with regard to determining a need to adjust monitoring of a transplant recipient.
Depending on the status of the transplant, monitoring of the transplant recipient may be adjusted accordingly. For example, monitoring may be adjusted by increasing or decreasing the frequency of monitoring, as appropriate. Monitoring may be adjusted by altering the means of monitoring, for example, by altering the metric that is used to monitor the transplant recipient.
The system and methods discussed herein may be implemented by a device.
The device 1002 may also include a display 1012, an input/output device 1014 (e.g., a touch screen), a transceiver 1016, and storage 1018. Storage 1018 includes a machine-readable medium 1020 on which is stored one or more sets of instructions 1024 (e.g., software) embodying any of the methods or functions described herein. The software may also reside, completely or at least partially, within the main memory 1006 and/or within the processor 1004 during execution thereof by the device 1002. The one or more sets of instructions 1024 (e.g., software) may further be transmitted or received over a network via a network interface device 1022.
While the machine-readable medium 1020 is shown in an embodiment to be a single medium, the term “machine-readable medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database and/or associated caches and servers) that store the one or more sets of instructions. The term “machine-readable medium” shall also be taken to include any medium that is capable of storing, encoding, or carrying a set of instructions for execution by the device and that causes the device to perform any one or more of the methods of the present invention. The term “machine-readable medium” shall accordingly be taken to include, but not be limited to, solid-state memories, optical and magnetic media, and carrier wave signals.
The system and methods described herein and the corresponding data can be stored in storage 1018, main memory 1006, static memory 1008, or a combination thereof. The display 1012 may be used to present a user interface to a physician who is treating transplant recipients or a medical expert, and the input/output device 1014 may be used to receive input (e.g., clicking on a graphic representative of a microblog) from the treating physician or medical expert. The transceiver 1016 may be configured to communicate with a network, for example.
Although examples of this disclosure have been fully described with reference to the accompanying drawings, it is to be noted that various changes and modifications will become apparent to those skilled in the art. Such changes and modifications are to be understood as being included within the scope of examples of this disclosure as defined by the appended claims.
This application claims the benefit of U.S. Provisional Application No. 63/336,870, filed Apr. 29, 2022, which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
63336870 | Apr 2022 | US |