CANCER NEOANTIGENS AND THEIR UTILITIES IN CANCER VACCINES AND TCR-BASED CANCER IMMUNOTHERAPY

Information

  • Patent Application
  • 20220307087
  • Publication Number
    20220307087
  • Date Filed
    April 25, 2019
    5 years ago
  • Date Published
    September 29, 2022
    a year ago
Abstract
Disclosed are compositions and methods for identifying neoantigens and using neoantigens in the use of treating cancer, as well as autoimmune diseases, where antigens causing tissue destruction.
Description
I. BACKGROUND

Cancer immunotherapy through checkpoint blockade and T cell-based approaches has produced impressive and durable clinical benefits for cancer patients. Increasing evidence indicates that tumor regression induced by checkpoint blockade therapy relies on the recognition of mutation-derived neoantigens expressed by cancer cells. Mutational landscape of cancer patients is the strong prediction biomarker for checkpoint blockade therapy. Despite the larger numbers (several hundreds) of somatic mutations can be identified from each cancer tissue to serve as neoantigens, only a few (1-3 immunogenic epitopes per cancer patient) have been identified for T cell recognition. What are needed are new methods for identifying neoantigens and methods and compositions for using neoantigens identified by said methods.


II. SUMMARY

Disclosed are methods and compositions related to identifying novel neoantigens, and methods of using said neoantigens in the treatment of cancer.


In one aspect, disclosed herein are methods of identifying neoantigens from a cancer in a human subject comprising a) performing whole exome sequencing on a nucleic acid sample from a cancer cell; b) mapping the sequence to a reference genome sequence (such as, for example a human genome); c) filtering sequence variations to remove common variations in tumors and normal cells; d) creating one or more single mutation peptide constructs comprising at least one uncommon amino acid variation and one or more flanking amino acids; e) synthesizing one or more minigenes encoding one or more single mutation peptide constructs of step d; f) transfecting one or more minigenes into one or more cells or cell lines expressing MHC class I or MHC class II molecules; g) co-culturing one or more T cells or T cells lines with the transfected cells of step f); h) measuring T cell activity of the co-cultured T cells or T cell lines; i) and assaying single mutation peptides from each minigene in transfected cell lines that induced T cell activity for the ability to induce T cell activity alone.


In one aspect, also disclosed herein are methods of identifying neoantigens of any preceding aspect, further comprising mapping the mutation to a specific peptide epitope and/or further comprising assaying the HLA restriction of the peptide epitope.


Also disclosed herein are methods of identifying neoantigens of any preceding aspect, wherein the single mutation constructs comprise at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino acids on either side of the mutation.


Also disclosed herein are methods of identifying neoantigens of any preceding aspect, wherein the minigene comprises at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 single mutation constructs.


In one aspect, also disclosed herein are methods of identifying neoantigens of any preceding aspect, wherein the one or more cell or cell lines (such as, for example HEK293 cells, HEK293T cells, Cos-7, MA104 cells, CHO cells, Fibroblasts, B cells, VERO cells, Madin-Darby Canine Kidney (MDCK) cells, HEp-2 cells, HeLa cells, MRC-5 cells, WI-38 cells, EB66, or PER C6 cells) expressing MHC class I or class II molecules, are engineered to express an MHC molecule.


Also disclosed herein are methods of identifying neoantigens of any preceding aspect, wherein the T cell activity (such as, for example, release of cytokines including, but not limited to IFN-γ, TGF-β, lymphotoxin-α, IL-2, IL-4, IL-10, IL-17, or IL-25) is measured by ELISA, ELISpot, Intracellular cytokine staining, or Chromium Release.


In one aspect, also disclosed herein are methods of identifying neoantigens of any preceding aspect, wherein the cancer is selected from the group consisting of B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers, small cell lung cancer, non-small cell lung cancer, neuroblastoma, glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancers, melanoma, basal cell carcinoma, squamous cell carcinoma, liver cancer, squamous cell carcinomas of the mouth, throat, larynx, and lung, cervical cancer, cervical carcinoma, breast cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, AIDS-related lymphomas, or AIDS-related sarcomas.


In one aspect, also disclosed herein are neoantigens (including, but not limited to peptides, polypeptides, and proteins of the neoantigens) identified by the methods of identifying neoantigens of any preceding aspect. For example, disclosed herein are a peptides, polypeptides, or proteins comprising the amino acid sequence AEPKRKSSLFWHAFNRLTPFRK (SEQ ID NO: 2) or a fragment thereof comprising at least 9 amino acids, wherein any fragment of the peptide, polypeptide, or proteinat least comprises the sequence LFWHAFNRL (SEQ ID NO: 7) (for example, SSLFWHAFNRLTP (SEQ ID NO: 4), RKSSLFWHAFNRL (SEQ ID NO: 3), RKSSLFWHAFNRLTPFR (SEQ ID NO: 6), and LFWHAFNRLTPFR (SEQ ID NO: 5); a polypeptide comprising the amino acid sequence FQLLLEKPFQIFCAELWVRDINDHA (SEQ ID NO: 9) or a fragment thereof comprising at least 13 amino acids, wherein any fragment of the polypeptide at least comprises the sequence LEKPFQIFCAELW (SEQ ID NO: 12); a polypeptide comprising the amino acid sequence ENSPLGTEFPLNYALDLDVGSNNVQ (SEQ ID NO: 14) or a fragment thereof comprising at least 13 amino acids, wherein any fragment of the polypeptide at least comprises the sequence LGTEFPLNYALDL (SEQ ID NO: 17); a polypeptide comprising the amino acid sequence MTDDKDVLRNVWFGRIPTCFT (SEQ ID NO: 19) or a fragment thereof comprising at least 11 amino acids, wherein any fragment of the polypeptide at least comprises the sequence KDVLRNVWFGR (SEQ ID NO: 23) (for example, DDKDVLRNVWFGR (SEQ ID NO: 22), KDVLRNVWFGRIP (SEQ ID NO: 21), and DDKDVLRNVWFGRIP (SEQ ID NO: 24)); a polypeptide comprising the amino acid sequence RLKASLDRPFTNSESAFYSIVGLSS (SEQ ID NO: 26) or a fragment thereof comprising at least 11 amino acids, wherein any fragment of the polypeptide at least comprises the sequence RPFTNSESAFY (SEQ ID NO: 30) (for example, DRPFTNSESAFYS (SEQ ID NO: 28)); a polypeptide comprising the amino acid sequence GSGEKVAGRVIVKVCEVTRVKAVRI (SEQ ID NO: 32) or a fragment thereof comprising at least 9 amino acids, wherein any fragment of the polypeptide at least comprises the sequence RVIVKVCEV (SEQ ID NO: 36) (for example, KVAGRVIVKVCEV (SEQ ID NO: 33), AGRVIVKVCEVTR (SEQ ID NO: 34), KVAGRVIVKVCEVTRVK (SEQ ID NO: 32), and RVIVKVCEVTRVK (SEQ ID NO: 35)); a polypeptide comprising the amino acid sequence YGMYFCMNISSQEDGACVLLRALEP (SEQ ID NO: 39) or a fragment thereof comprising at least 10 amino acids, wherein any fragment of the polypeptide at least comprises the sequence ISSQEDGACV (SEQ ID NO: 43)(for example, MNISSQEDGACVL (SEQ ID NO: 41), ISSQEDGACVLLR (SEQ ID NO: 42), MNISSQEDGACVLLR (SEQ ID NO: 44), ISSQEDGACVL (SEQ ID NO: 132), ISSQEDGACVLL (SEQ ID NO: 133), NISSQEDGACV (SEQ ID NO: 134), NISSQEDGACVL (SEQ ID NO: 135), NISSQEDGACVLL, (SEQ ID NO: 136), NISSQEDGACVLLR (SEQ ID NO: 137), and MNISSQEDGACVLL (SEQ ID NO: 138)); a polypeptide comprising the amino acid sequence MKLTSLMCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 46) or a fragment thereof comprising at least 26 amino acids, wherein any fragment of the polypeptide at least comprises the sequence VSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 60) (for example, MCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 49), KSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 50), GCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 51), GHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 52), SMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 53), VSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 54), MCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 57), VSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 139), CVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 140), MCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 141), MCVSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 142), SMCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 143), and SMCVSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 144)); and/or a polypeptide comprising the amino acid sequence MLMAQEALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRR MEGAPAGPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 67) or a fragment thereof comprising at least 61 amino acids, wherein any fragment of the polypeptide at least comprises the sequence VPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSRR (SEQ ID NO: 73) (for example, QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWKR (SEQ ID NO: 74), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR R (SEQ ID NO: 145), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RR (SEQ ID NO: 146), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRR (SEQ ID NO: 147), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRR (SEQ ID NO: 148), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RP (SEQ ID NO: 149), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRP (SEQ ID NO: 150), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRP (SEQ ID NO: 151), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRP (SEQ ID NO: 152), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPW (SEQ ID NO: 153), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPW (SEQ ID NO: 154), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPW (SEQ ID NO: 155), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPW (SEQ ID NO: 156), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPWK (SEQ ID NO: 157), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPWK (SEQ ID NO: 158), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPWK (SEQ ID NO: 159), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWK (SEQ ID NO: 160), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPWKR (SEQ ID NO: 161), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPWKR (SEQ ID NO: 162), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPWKR (SEQ ID NO: 163), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSP (SEQ ID NO: 61), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGM (SEQ ID NO: 62), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAG (SEQ ID NO: 63), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKR (SEQ ID NO: 64), LMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGG RTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 68), AMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACL SCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 69), and QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 70)). Also disclosed herein are variants of any polypeptide or polypeptide fragment disclosed herein wherein the variant comprises a conservative amino acid substitution.


In one aspect, also disclosed herein are nucleic acids encoding a polypeptide neoantigen of any preceding aspect.


Also disclosed herein are compositions comprising a therapeutically effective amount of one or more of the neoantigens of any preceding aspect (including, but not limited to peptides, polypeptides, and proteins of the neoantigens) or one or more minigene mRNAs encoding one or more neoantigens of any preceding aspect.


Also disclosed herein are compositions comprising a therapeutically effective amount of one or more chimeric antigen receptor (CAR) T cells, T cell receptor (TCR) T-cells, and/or tumor infiltrating lymphocytes (TILs); wherein the CAR T cell, TCR T cell, and/or TIL has been engineered to express a receptor (such as, for example, a T cell receptor or chimeric antigen receptor) that recognizes one or more of the neoantigens of any preceding aspect (including, but not limited to peptides, polypeptides, and proteins of the neoantigens). In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens of any preceding aspect is further engineered to knockout or knockdown Zinc finger and BTB domain-containing protein 7B (ThPOK), Lysine-specific histone demethylase 1 (LSD1), programmed cell death protein (PD1), Proteine phosphatase 2 (PP2A) to enhance their function such as cytotoxic activity and persistence or survival in vivo after adoptive transfer to a cancer patient. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens of any preceding aspect is treated with a small molecular inhibitor such 2-PAPC to enhance T cell cytotoxic activity or the ability to proliferate and survive in vivo.


In one aspect, also disclosed herein are methods of stimulating an immunological response against a cancer or treating, inhibiting, and/or preventing a cancer comprising administering to a subject a composition comprising a therapeutically effective amount of a neoantigen of any preceding aspect (including, but not limited to peptides, polypeptides, and proteins of the neoantigens) and/or identified by the method of identifying a neoantigen of any preceding aspect.





III. BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments and together with the description illustrate the disclosed compositions and methods.



FIGS. 1A, 1B, and 1C show specificity of tumor-reactive TILs and T-cell clones. FIG. 1A shows tumor-infiltrating lymphocytes (TILs) expanded from cancer tissues of patient #135 and #136 and their composition analyzed by flow cytometry analysis with CD4 or CD8 antibody. Both CD4+ and CD8+ T-cell populations were isolated from the 135TIL or 136TIL lines, and activation of these T-cell populations by 135 mel or 136 mel was determined based on the expression of interferon-γ (upper, patient #135; lower, patient #136). FIG. 1B shows the specificity of 135TIL and 136TIL was determined after incubation with multiple cell lines. FIG. 1C shows the specificity and activation of 14 of 78 CD4+ and 14 of 40 CD8+ T-cell clones of patient #135 were determined by the release of IFN-γ after incubation with 135 mel or EBV-B and other melanoma cell lines. (upper, CD4+; lower, CD8+). Data are presented as representatives of three independent experiments.



FIG. 2 shows the specificity of other CD4+ tumor-reactive T-cell clones. The activation of 64 (out of 78) CD4+ T-cell clones was determined by the release of IFN-γ after incubation with 135 mel as well as EBV-B and melanoma cell lines of other patients. Data are presented as representatives of three independent experiments.



FIGS. 3A, 3B, 3C, 3D, 3E, 3F, 3G, and 3H show the identification of multiple neoantigens of patient #135 using whole-exome sequencing and tandem minigene screening with T cells. FIG. 3A shows somatic mutations of 135 mel cells were identified by whole-exome sequencing and analyzed by a series of filtering criteria, resulting in 232 somatic mutations and 24 TMG constructs. FIG. 3B shows screening of TMGs presented by MHC-II molecules in artificial APCs with use of 135TILs (CD4+ population). IFN-γ ELISA showed that TMG6, TMG17, TMG18 were strongly recognized by CD4+ TILs respectively. FIG. 3C shows the identification of MHC-II neoantigens in positive TMGs, based on IFN-γ ELISA of four HLA class II neoantigens: SPATA13 (T>A) [TMG6-1] (SEQ ID NO: 1(WT) and SEQ ID NO: 2 (MT)), PCDHB7 (R>C) [TMG17-6] (SEQ ID NO: 8 (WT) and SEQ ID NO: 9 (MT)), PCDHB16 (H>Y) [TMG17-7] (SEQ ID NO: 13 (WT) and SEQ ID NO: 14 (MT)), and ATG5 (D>N) [TMG18-10] (SEQ ID NO: 18 (WT) and SEQ ID NO: 19 (MT)). The wild type of each neoantigen was not recognized. FIG. 3D shows the screening of TMGs presented by MHC-I molecules in artificial APCs with use of 135TILs (CD8+ population). IFN-γ ELISA showed that TMG1 and TMG14 were strongly recognized by CD8+ TILs respectively. FIG. 3E shows the identification of MHC-I neoantigens in positive TMGs, based on IFN-γ ELISA of two MHC-I neoantigens: RPN2 (L>S) [TMG14-4] (SEQ ID NO: 25 (WT) and SEQ ID NO: 26 (MT)) and TXNIP (E>K) [TMG1-10] (SEQ ID NO: 31 (WT) and SEQ ID NO: 32 (MT)). The wild type of each neoantigen was not recognized. FIG. 3F shows the screening of TMGs presented by 135EBV-B cells with use of 135TILs. Besides positive TMGs identified in (D), TMG7 was also recognized by 135TILs. FIG. 3G shows the identification of one MHC-I neoantigens based on IFN-γ ELISA after 135TILs cocultured with 24 TMGs presented by autologous EBV-B cells: MPG (G>E) [TMG7-5] (SEQ ID NO: 38 (WT) and SEQ ID NO: 39 (MT)). The wild type was not recognized. FIG. 3H shows IFN-γ ELISA of serially diluted peptides of HLA class II and class I neoantigens (mutant and wild type) incubated with 135TIL. Data in B-H are plotted as means±SD of three independent experiments. * P<0.05, ** P<0.01 by comparison with mock controls or WT peptide controls using Student's t-test.



FIGS. 4A and 4B show the identification of subcloned MHC-II neoantigens of 135 mel from TMGs and their epitopes. FIG. 4A shows that each positive MHC-II TMG recognized by CD4+ 135TILs was subcloned into 10 separated constructs encoding one mutated gene each, and then assayed with CD4+ 135TILs. IFN-γ ELISA showed one or two subcloned genes in each TMG, which were recognized by CD4+ 135TILs when presented by 293IMDR3/DP4 and 293IMDR4/DP4 respectively. Each positive subclone was sequenced and identified as a patient-derived neoantigen. FIG. 4B shows that SPATA13 (SPATA13 WT (SEQ ID NO: 1), SPATA13 MT (SEQ ID NO: 2), SPATA13 13 mer-1 (SEQ ID NO: 3), SPATA13 13 mer-2 (SEQ ID NO: 4), SPATA13 13 mer-3 (SEQ ID NO: 5)), PCDHB16 (PCDHB16 WT (SEQ ID NO: 13), PCDHB16 MT (SEQ ID NO: 14), PCDHB16 13 mer-1 (SEQ ID NO: 15), PCDHB16 13 mer-2 (SEQ ID NO: 16), PCDHB16 13 mer-3 (SEQ ID NO: 17)), PCDHB7 (PCDHB7 WT (SEQ ID NO: 8), PCDHB7 MT (SEQ ID NO: 9), PCDHB7 13 mer-1 (SEQ ID NO: 10), PCDHB7 13 mer-2 (SEQ ID NO: 11), PCDHB7 13 mer-3 (SEQ ID NO: 12)), and ATG5 (ATG5 WT (SEQ ID NO: 18), ATG5 MT (SEQ ID NO: 19), ATG5 13 mer-1 (SEQ ID NO: 20), ATG5 13 mer-2 (SEQ ID NO: 21), ATG5 13 mer-3 (SEQ ID NO: 22)) neoantigens were truncated to three constructs each encoding 13 aa of the mutation. The truncated peptides were assayed with 135TIL, and recognition was determined by the release of IFN-γ. The epitope of each neoantigen was refined to 9-13 aa. Data are plotted as means±SD of three independent experiments. * P<0.05, ** P<0.01 by comparison with mock controls using Student's t-test. NS, not significant.



FIGS. 5A, 5B, and 5C show the identification of subcloned HLA class I neoantigens of 135 mel from TMG and their epitopes. FIG. 5A shows positive MHC-I TMGs were subcloned and assayed with CD8+ 135TIL. Each positive subclone presented by artificial APCs was sequenced and identified as a neoantigen. FIG. 5B shows that TMG7 were subcloned and assayed with CD8+ 135TIL presented by 135EBV-B cells. The positive subclone was sequenced and identified as a neoantigen. FIG. 5C shows that RPN2 (RPN2 WT (SEQ ID NO: 25), RPN2 MT (SEQ ID NO: 26), RPN2 13 mer-1 (SEQ ID NO: 27), RPN2 13 mer-2 (SEQ ID NO: 28), RPN2 13 mer-3 (SEQ ID NO: 29)), TXNIP (TXNIP WT (SEQ ID NO: 31), TXNIP MT (SEQ ID NO: 32), TXNIP 13 mer-1 (SEQ ID NO: 33), TXNIP 13 mer-2 (SEQ ID NO: 34), TXNIP 13 mer-3 (SEQ ID NO: 35)), and MPG (MPG WT (SEQ ID NO: 38), MPG MT (SEQ ID NO: 39), MPG 13 mer-1 (SEQ ID NO: 40), MPG 13 mer-2 (SEQ ID NO: 41), MPG 13 mer-3 (SEQ ID NO: 42)) neoantigens were truncated to three constructs encoding 13 aa containing the mutation and assayed with 135TIL as well. The epitope of RPN2 was truncated to 13 aa, the epitope of MPG was truncated to 11 aa and the epitope of TXNIP was truncated to 9 aa. Data are plotted as means±SD of three independent experiments. ** P<0.01 by comparison with mock controls using Student's t-test. NS, not significant.



FIGS. 6A, 6B, 6C, and 6D show the identification of two non-mutated antigens of patient #135 by cDNA library screening with T-cell clones. FIG. 6A shows representative data after all CD4+ T-cell clones assayed with identified MHC-II neoantigens in 135 mel presented by EBV-B cells. Two CD4+ tumor-reactive T-cell clones (1C1 and 1E1) did not recognize any MHC-II neoantigens. FIG. 6B shows two cDNA pools (pools #4 and #26) were recognized by CD4+ T-cell clones 1C1 and 1E1, respectively. cDNA pools were transfected into 293IMDR3/DP4 or 293IMDR4/DP4 cells and then cocultured with different T-cell clones. FIG. 6C shows IFN-γ ELISA of cloned antigens in pool #4 for 1C1 and pool #26 for 1E1, showing that both were recognized by 1C1 and 1E1, respectively, when presented by 293IMDR4 or 293IMDR3 cells. Identification of positive antigens by sequencing analysis (shown are 3′ and 5′ ends of SEQ ID NOs 45 and 46 of ADIPOR2 3′ UTR (ellipsis representing nucleotides or amino acids not shown in figure) and 3′ and 5′ ends of SEQ ID NOs: 47 and 48 for LAGE1b ORF2 (ellipsis representing nucleotides or amino acids not shown in figure)). FIG. 6D shows IFN-γ ELISA of serially diluted peptides of ADIPOR2 and LAGE1b in alternative ORFs incubated with 135TILs and antigen-specific T-cell clones, respectively. Data in C and D are plotted as means±SD of three independent experiments. ** P<0.01 compared with controls using Student's t-test.



FIGS. 7A, 7B, 7C, 7D, and 7E show the identification of two non-mutated antigens of 135 mel. FIG. 7A shows the sequence of non-mutated antigens identified by CD4+ T-cell clones. The antigen recognized by 1C1 was a 56 aa reading frame located in the 3′UTR sequence of gene ADIPOR2 (SEQ ID NO: 45 shows the nucleic acid sequence of ADIPOR2 and SEQ ID NO: 46 shows the amino acid sequence of the 3′UTR and epitope of T cell recognition). The antigen recognized by 1E1 was a 109 aa peptide located in the coding sequence of gene CTAG2 (LAGE1b) (SEQ ID NO: 47 shows the nucleic acid sequence of LAGE1b and SEQ ID NO: 48 shows the amino acid sequence of the ORF2 and epitope of T cell recognition), but in a frame-shifted reading frame. FIG. 7B shows IFN-γ ELISA of ADIPOR2 coding sequence and the antigen in 3′UTR assayed with 1C1. 1C1 only recognized the identified antigen in the 3′UTR, but not ADIPOR2 coding sequence. FIG. 7C shows IFN-γ ELISA of LAGE1b in normal reading frame (ORF1) and alternative reading frame (ORF2) assayed with 1E1. 1E1 only recognized ORF2 but not ORF1. Data are presented as means±SD of three independent experiments. * P<0.05, ** P<0.01 by comparison with mock or specified controls using Student's t-test. FIGS. 7D and 7E show that serial truncations were carried out on ADIPOR2 and LAGE1b antigens. Each truncate was assayed with the corresponding T-cell clone and tested by IFN-γ ELISA. The minimum linear truncate of ADIPOR2 recognized by 1C1 was 26 aa, while the minimum linear truncate of LAGE1b recognized by 1E1 was 69 aa. Data are plotted means±SD of three independent experiments. * P<0.05, ** P<0.01 by comparison with T cell alone using Student's t-test. NS, not significant.



FIGS. 8A, 8B, 8C, 8D, 8E, 8F, 8G, 8H, and 8I show the identification of multiple neoantigens of patient #136. FIG. 8A shows the identification of somatic mutations of 136 mel cells, resulting in 348 somatic mutations and 35 TMG constructs. FIG. 8B shows the screening of TMGs presented by MHC-II molecules in artificial APCs with use of 136TILs (CD4+ population). IFN-γ ELISA showed that TMG19, TMG31, TMG32 were strongly recognized by CD4+ TILs respectively. FIG. 8C shows the identification of MHC-II neoantigens in positive TMGs, based on IFN-γ ELISA of four HLA class II neoantigens: ZFYVE1 (H>Y) [TMG19-1] (SEQ ID NO: 75 (WT) and SEQ ID NO: 76 (MT)), LMAN2 (L>R) [TMG31-2] (SEQ ID NO: 77 (WT) and SEQ ID NO: 78 (MT)), MAPK9 (W>C) [TMG31-3] (SEQ ID NO: 79 (WT) and SEQ ID NO: 80 (MT)), and ANKIB1 (P>L) [TMG32-8] (SEQ ID NO: 81 (WT) and SEQ ID NO: 82 (MT)). The wild type of each neoantigen was not recognized. FIG. 8D shows the screening of TMGs presented by MHC-I molecules in artificial APCs with use of 136TILs (CD8+ population). IFN-γ ELISA showed that only TMG2 was strongly recognized by CD8+ TILs respectively. FIG. 8E shows the identification of MHC-I neoantigens in positive TMGs, based on IFN-γ ELISA of the only one MHC-I neoantigen: HHAT (G>E) [TMG2-3] (SEQ ID NO: 165 (WT) and SEQ ID NO: 166 (MT)). The wild type of this neoantigen was not recognized. FIG. 8F shows the screening of TMGs presented by fibroblasts of patient #136 pretreated by IFN-γ for expression of HLA molecules. No additional positive TMGs were identified. FIG. 8G shows the screening of TMGs presented by MHC-II molecules in artificial APCs with use of CD4+ T-cell clones of 136TIL (C13 and C22). IFN-γ ELISA showed that TMG13 and TMG8 were strongly recognized by clone C13 and C22 respectively. FIG. 8H shows the identification of MHC-II neoantigens only recognized by T-cell clones, based on IFN-γ ELISA of two MHC-II neoantigens: C1GALT1C1 (L>F) [TMG13-1] (SEQ ID NO: 83 (WT) and SEQ ID NO: 84 (MT)) and HSPA13 (P>L) [TMG8-1] (SEQ ID NO: 85 (WT) and SEQ ID NO: 86 (MT)). The wild type of each neoantigen was not recognized. FIG. 8I shows IFN-γ ELISA of serially diluted peptides MHC-II and MHC-I neoantigens (mutant and wild type) incubated with 136TIL or T-cell clones. Data in B-I are plotted as means±SD of three independent experiments. * P<0.05, ** P<0.01 by comparison with mock controls or WT peptide controls using Student's t-test.



FIGS. 9A, 9B, 9C, and 9D show the identification of subcloned HLA class I and class II neoantigens of 136 mel from TMGs. FIG. 9A shows positive MHC-II TMGs were subcloned and assayed with CD4+ 136TIL. Each positive subclone presented by 293IMDR1/DP4 or 293IMDR11/DP4 cells was sequenced and identified as a neoantigen. FIG. 9B shows the only positive MHC-I TMG2 recognized by CD8+ 136TILs was subcloned into 10 separated constructs encoding one mutated gene each, and then assayed with CD8+ 136TILs. IFN-γ ELISA showed one subcloned gene in each TMG, which were recognized by CD8+ 136TILs when presented by Cos7-A1 cells and then sequenced and identified as a patient-derived neoantigen. FIG. 9C shows that 136 Fibroblasts were treated with IFN-γ for three days and the expression of MHC-II molecules before and after treatment were compared. FIG. 9D shows two positive MHC-II TMGs only recognized by CD4+ T-cell clones of 136TILs were subcloned and assayed with CD4+ T-cell clones respectively. Each positive subclone presented by 293IMDR1/DP4 cells was sequenced and identified as a neoantigen.



FIGS. 10A, 10B, 10C, 10D, 10E, and 10F show the recognition of given neoantigens of patient #135 and #136 by dominant population of tumor-reactive T cells. FIG. 10A shows IFN-γ ELISA of all 78 CD4+ tumor-reactive T-cell clones of 135TIL assayed with MHC-II neoantigens: SPATA13, PCDHB7, PCDHB16, ATG5, ADIPOR2 26 mer and LAGE1b ORF2 respectively. FIG. 10B shows IFN-γ ELISA of all 40 CD8+ tumor-reactive T-cell clones of 135TIL assayed with MHC-I neoantigens: RPN2, MPG and TXNIP respectively. FIG. 10C shows the tumor and neoantigen peptides-reactive populations of CD4+ (upper) and CD8+ (lower) 135TILs stained intracellularly by IFN-γ antibody and sorted by FACS. FIG. 10D shows the statistic result of the ratio of each identified neoantigen in 135 mel in the summary of all peptide-reactive CD4+ or CD8+ T cell populations respectively. FIG. 10E shows PCDHB16 and MPG peptides-bound T cells in 135TIL were sorted. TCRβ CDR3 profiling were performed with sorted T cells and 135TIL. Top sequence frequencies of neoantigen-specific TCRs in sorted T cell groups are shown. FIG. 10F shows the percentage in 135TIL of PCDHB16 or MPG-bound TCRs after TCRβ CDR3 profiling. FIGS. 11A, 11B, 11C, and 11D show TCRβ CDR3 repertoire analysis of dominant neoantigen-reactive T cells in 135TIL and 136TIL. FIG. 11A shows the tumor and neoantigen peptides-reactive populations of CD4+ (upper) and CD8+ (lower) 136TILs stained intracellularly by IFN-γ antibody and sorted by FACS. FIG. 11B shows the statistic result of the ratio of each identified neoantigen in 136 mel in the summary of all peptide-reactive CD4+ or CD8+ T cell populations respectively. FIG. 11C shows the usage of combined V-J sequences in TCR β CDR3 of PCDHB16-reactive T cells (left) and MPG-reactive T cells (right) in 135TILs. FIG. 11D shows the amino acid usage of V-J joint in TCR β CDR3 of PCDHB16-reactive T cells (left, SEQ ID NO: 87) and MPG-specific T cells (right, SEQ ID NO: 164) in 135TILs.



FIGS. 12A, 12B, 12C, 12D, 12E, and 12F show the anti-tumor effect in vitro and in vivo when targeting dominant neoantigen of 135 mel. FIG. 12A shows single cell tumor clones were generated from 135 mel tumor cell lines by limiting dilution or single cell cloning method. Dominant PCDHB16 neoantigen-specific T-cell clone 1H1, as well as subdominant/cryptic neoantigen 2C10 (PCDHB7-specific) and JF6 (SPATA13-specific) were used to test their ability to recognize single tumor clones and bulk 135 mel tumor cell line. Data are plotted as means±SD of three independent experiments. FIG. 12B shows the dominant neoantigen PCDHB16-specific T-cell clones 1H1 showed higher percentage of tumor cell killing than subdominant/cryptic neoantigen-specific T-cell clones on 135 mel against bulk and selected single tumor cell clones. FIG. 12C shows sanger sequencing of 135 mel tumor clones and 135TIL at the mutation sites of PCDHB16, PCDHB7 and SPATA13, revealing that the somatic mutation of each neoantigen existed in the genome of each clone. FIG. 12D shows the scheme of NSG mice injected with 135 mel and treated by 135 T-cell clones. FIG. 12E shows murine serum level of IFN-γ in vivo on day 4 after T-cell injection. FIG. 12F shows the change of 135 mel tumor size in vivo after injection of T-cell clones targeting different neoantigens. Data are plotted as means±SD of three independent experiments. ** P<0.01 compared with control using Student's t-test. NS, not significant.



FIGS. 13A, 13B, 13C, and 13D show the recognition of multiple neoantigens by a single TCR. FIG. 13A shows that T-cell clone 4B8 simultaneously recognized the product translated from the ADIPOR2 3′UTR mRNA and PCDHB16 neoantigen by IFN-γ ELISA of serially diluted peptides. The same assay of naïve T cells transduced with 4B8 TCR confirmed this dual recognition. FIG. 13B shows IFN-γ ELISA of separated CD4+ and CD8+ T cells transduced with 4B8 TCR. Only CD4+ T cells recognized both antigens after the transduction. FIG. 13C shows flow cytometric analysis of T-cell clone C76, showing its CD4+/CD8+ double positivity. Naïve T cells transduced by C76 TCR recognized HLA class I-restricted RPN2 neoantigen and HLA class II-restricted PCDHB16 neoantigen by IFN-γ release assay. FIG. 13D shows that CD8+ T cells transduced with C76 TCR recognized HLA-A1-presented RPN2 neoantigen, while transduced CD4+ T cells recognized only HLA-DR4-restricted PCDHB16 neoantigen. Data are presented as means±SD of three independent experiments. ** P<0.01 compared with controls using Student's t-test. NS, not significant.



FIGS. 14A, 14B, and 14C show that one T-cell receptor can recognize dual neoantigens. FIG. 14A shows FACS analysis of CD4+ T-cell clone 4B8 of 135TILs reacted with two neoantigens, PCDHB16 and ADIPOR2 26 mer. FIG. 14B shows a map of TCRα and β chains cloned into the retroviral vector pMSGV. FIG. 14C shows IFN-γ ELISA of all 78 CD4+ T-cell clones assayed with HLA class I neoantigens. One PCDHB16-recognizing CD4+ T-cell clone C76 was discovered to recognize RPN2 presented by HLA-A1 spontaneously, leading to the further identification of C76 TCR dual recognition. Data are presented as representatives of three independent experiments.



FIGS. 15A, 15B, 15C show that ThPOK expression correlates with 135 TIL CD4+ clones cytotoxicity. FIG. 15A shows the cytotoxicity of 135 TIL CD4+ clones was conducted by LDH assay. Effectors: Targets=20:1. FIG. 15B shows quantitative real-time PCR of mRNA level. FIG. 15C shows western blot analysis of ThPOK expression in 135 CD4+ TIL clones.



FIGS. 16A, 16B, 16C, 16D, and 16E show the knockdown of ThPOK in human 135 CD4+ 135TILs increases their cytotoxicity. FIG. 16A shows western blot analysis of ThPOK Knockdown efficiency. FIG. 16B shows that an LDH assay was conducted after ThPOK shRNA first transduction 72 hours at different E:T ratio. FIG. 16C shows that 135 mel tumor cells co-cultured with or without ThPOK shRNA knockdown in CD4+ 135TIL clone 1D1 for three days. FIG. 16D shows quantitative real-time PCR analysis of typical CD8+ CTL cytolytic molecules after ThPOK knockdown 72 hours. FIG. 16E shows ThPOK knockdown in CD4+ 135TIL clone 1D1 maintains the CD4 property in vitro, ELISA test of cytokine secretion of IFN-γ, GM-CSF, IL-4, IL-10.



FIGS. 17A, 17B, 17C, and 17D show knockdown of hThPOK enhances the anti-tumor efficacy of tumor specific T cells in vivo. FIG. 17A shows that 2×106 135 mel tumor cells were inoculated into the flank of SCID (beige) mice (n=5). 5×106 CD4+ 135TILs modified with hThPOK shRNA were adoptively transferred on Day 5. Tumor growth was monitored every 2-3 days with an electronic caliper. FIG. 17B shows mice were sacrificed when control group length reach 10 mm. Tumor was excised and weighed on Day 26 after inoculation. FIG. 17C shows human CD4+ or CD8+ T cells transduced with A2ESO TCR or A2ESO TCR-ThPOK shRNA were injected into mouse at day 4 after tumor cell injection. Tumor volume was measured every day. FIG. 17D shows mice were sacrificed at day 9 and tumor morphology were measured.



FIGS. 18A, 18B, 18C, and 18D show that ThPOK inhibits CD4+ tumor specific T cell cytotoxicity by interacting with LSD1. FIG. 18A shows epigenetic inhibitor screening identified LSD1 inhibition increases cytotoxicity of antigen specific CD4+ T cells. FIG. 18B shows ThPOK interacting epigenetic gene screening identified LSD1 could directly interact with ThPOK. FIG. 18C shows endogenous immunoprecipitation in CD4+ T cells validated ThPOK could interact with LSD1 in CD4+ T cells. FIG. 18D shows the relative occupancy of ThPOK and LSD1 in GZMB and PRF1 gene promoters. Q-PCR analysis of recovered DNA from the anti-ThPOK and LSD1 Chromatin immunoprecipitation assay.



FIGS. 19A and 19B shows the identification of 2-PCPA as a positive regulator of the generation of stem cell-like memory T cells through in vitro screening. FIG. 19A shows that T cells were activated by plate-bound OKT3 for 2 days and cultured for additional 12 days before flow cytometry analysis of stem cell-like memory T cells (Tscm) percentage. Metabolites or inhibitors were added at the time of activation. Results were shown as the fold change of Tscm percentage between treated and untreated T cells. Tscm subset was identified as CCR7CD62L+ CD45RO−. FIG. 19B shows the determining other Tscm markers CD95 and CD45RA expression on control of 2-PCPA treated T cells. Isotype controls were shown as blue peaks.



FIGS. 20A, 20B, 20C, 20D, and 20E show 2-PCPA treatment promoted the in vitro and in vitro function of A2ESO TCR-T cells. FIG. 20A shows that T cells from healthy donors were activated by plate-bound OKT3 for 2 days and transduced with PG13-A2ESO TCR virus. Transduced T cells were cultured for additional 8 days. 2-PCPA or DMSO (control) treatment were added before activation. CD4 and CD8 memory phenotype was detected by flow cytometry at day 10. FIG. 20B shows DMSO or 2-PCPA treated A2ESO TCR-T cells were stimulated by PMA and ionomycin for 5 hours, fixed, permeabilized and IFN-γ, IL-2 and TNFα expression in CD4+ and CD8+ T cells were checked. FIG. 20C shows DMSO or 2-PCPA treated A2ESO TCR-T cells were labeled with CFSE and cultured under IL-2 or stimulated by 231-ESO cells for 3 days. Division Index was calculated by FlowJo software. FIG. 20D shows 2-PCPA treatment render A2ESO TCR-T cells with increased OXPHOS and spare respiratory capacity (SRC). OCR was detected by Seahorse XFe96 Analyzer. FIG. 20E shows NSG mice were injected with 231-ESO breast cancer cell line at fat pad. 3 days later, DMSO or 2-PCPA treated A2ESO TCR-T cells were injected intravenously. Tumors were isolated and weight after 2 weeks.



FIGS. 21A, 21B, 21C, 21D, and 21E show that 2-PCPA treatment promoted the in vitro and in vitro function of CAR-T cells. FIG. 21A shows that T cells from healthy donors were activated by plate-bound OKT3 for 2 days and transduced with PG13-CD19 CAR virus. Transduced T cells were cultured for additional 10 days. 2-PCPA or DMSO (control) treatment were added before activation. CD4+ and CD8+ memory phenotype was detected by flow cytometry at day 12. FIG. 21B shows 2-PCPA treatment render CAR-T cells with increased OXPHOS and spare respiratory capacity (SRC). OCR was detected by Seahorse XFe96 Analyzer. FIG. 21C shows DMSO or 2-PCPA treated CAR-T cells were washed extensively and cultured in IL-2-free medium for 4 days and survived cells were detected by PI staining. FIG. 21D shows that NSG mice were intravenously injected with Raji lymphoma cells. 10 days later, DMSO or 2-PCPA treated CD19 CAR and luciferase-transduced T cells were injected intravenously. Proliferation of CAR-T cells were tracked through in vivo imaging. FIG. 21E shows Human CD3+ cells percentage within spleen at day 8 detected by flow cytometry.





IV. DETAILED DESCRIPTION

Before the present compounds, compositions, articles, devices, and/or methods are disclosed and described, it is to be understood that they are not limited to specific synthetic methods or specific recombinant biotechnology methods unless otherwise specified, or to particular reagents unless otherwise specified, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.


A. Definitions

As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a pharmaceutical carrier” includes mixtures of two or more such carriers, and the like.


Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another embodiment includes from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another embodiment. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. It is also understood that when a value is disclosed that “less than or equal to” the value, “greater than or equal to the value” and possible ranges between values are also disclosed, as appropriately understood by the skilled artisan. For example, if the value “10” is disclosed the “less than or equal to 10” as well as “greater than or equal to 10” is also disclosed. It is also understood that the throughout the application, data is provided in a number of different formats, and that this data, represents endpoints and starting points, and ranges for any combination of the data points. For example, if a particular data point “10” and a particular data point 15 are disclosed, it is understood that greater than, greater than or equal to, less than, less than or equal to, and equal to 10 and 15 are considered disclosed as well as between 10 and 15. It is also understood that each unit between two particular units are also disclosed. For example, if 10 and 15 are disclosed, then 11, 12, 13, and 14 are also disclosed.


In this specification and in the claims which follow, reference will be made to a number of terms which shall be defined to have the following meanings:


“Optional” or “optionally” means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.


The term “subject” refers to any individual who is the target of administration or treatment. The subject can be a vertebrate, for example, a mammal. In one aspect, the subject can be human, non-human primate, bovine, equine, porcine, canine, or feline. The subject can also be a guinea pig, rat, hamster, rabbit, mouse, or mole. Thus, the subject can be a human or veterinary patient. The term “patient” refers to a subject under the treatment of a clinician, e.g., physician.


Throughout this application, various publications are referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this pertains. The references disclosed are also individually and specifically incorporated by reference herein for the material contained in them that is discussed in the sentence in which the reference is relied upon.


B. Methods of Identifying Neoantigens

Although in vitro stimulation of T cells from normal healthy donors with neoantigen peptides could increase the number of neoantigens capable of activating T cells, prior to the present disclosure it remained unknown about the number and recognition pattern of neoantigens by T cells in cancer patients. Furthermore, it was largely unknown whether a single T cell receptor (TCR) can recognize two or multiple neoantigens in cancer patients. Despite recent identification of many HLA class I-restricted neoantigens using exome sequencing and computer-assisted prediction programs, increasing evidence demonstrates that neoantigen-specific CD4+ T cells are more dominant than neoantigen-specific CD8+ T cells, and play a critical role in tumor regression. However, the current prediction programs in use in the art for MHC II-restricted neoantigens are not accurate and much poorer than the prediction programs of MHC I-restricted neoantigens due to the open pockets of MHC II molecules, thus limiting the ability to identify neoantigens and develop neoantigen-based immunotherapy.


Tumor regression induced by checkpoint blockade therapy and T cell-based immunotherapy relies on T-cell recognition of tumor antigens, particularly mutation-derived neoantigens, expressed by cancer cells. However, despite the larger numbers of somatic mutations identified in cancer tissues that could generate neoantigens, only a few immunogenic peptide epitopes (per cancer patient) for T-cell recognition have been reported. The underlying mechanisms of these observations remain poorly understood. Herein identification of four MHC class II-restricted neoantigens and two MHC class I-restricted neoantigens is reported. Further analysis of neoantigens for recognition by CD4+ T-cell clones revealed the dominance of one neoantigen, which was recognized by 87% of the T-cell clones, while several low-frequency T-cell clones recognized 3 additional neoantigens. Overall, 98% of the CD4+ T cells recognized neoantigens, compared with only 34% of the CD8+ T cell clones. Additionally, 2 T-cell clones did not recognize mutation-derived neoantigen, but instead recognized neoepitopes derived from gene products translated from the 3′ untranslated region of the ADIPOR2 gene or from an alternative open reading frame of LAGE1b. Finally, it was demonstrated herein that a single TCR can recognize two neoantigens, but not other antigens in normal cells. Also demonstrated was the utility and power of neoantigen-specific T cells in inhibiting tumor growth. The findings have identified immune regulatory mechanisms that restrain T-cell response to a limited number of neoantigens with one as dominant, and should have important implications in personalized immunotherapy that targets dominant neoantigens and TCRs with multiple neoantigen specificity.


In one aspect, disclosed herein are methods of identifying neoantigens from a cancer in a human subject comprising a) exome sequencing on a nucleic acid sample from a cancer cell (in some instances the method can further comprise isolating DNA from cancer cells, capturing DNA fragments, and constructing an exome library); b) mapping the sequencing result to a reference genome sequence data (for example human genome GRCh38/hg38); c) filtering sequence variations to remove common variations in tumors and normal cells; d) creating one or more single mutation peptide constructs comprising at least one uncommon amino acid variation and one or more flanking amino acids; e) synthesizing one or more minigenes encoding one or more single mutation peptide constructs of step d; f) transfecting one or more minigenes into one or more cells or cell lines expressing MHC class I or MHC class II molecules; g) co-culturing one or more T cells or T cells lines with the transfected cells of step f); h) measuring T cell activity of the co-cultured T cells or T cell lines; i) and assaying single mutation peptides from each minigene in transfected cell lines that induced T cell activity for the ability to induce T cell activity alone.


In one aspect, also disclosed herein are methods of identifying any neoantigendisclosed herein, further comprising mapping the mutation to a specific peptide epitope by serial truncation and/or further comprising assaying the HLA restriction of the peptide epitope by HLA antibody blocking.


As noted herein, the single mutation constructs created in the disclosed methods can have one or more amino acids on either or both sides of the mutations. Thus, for example, disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the single mutation constructs comprise at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino acids on either side of the mutation.


It is understood and herein contemplated that the minigenes created in the disclosed methods can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or more single mutation constructs. Thus, in one aspect, disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the minigene comprises at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 single mutation constructs.


In one aspect, it is understood and herein contemplated that the disclosed also disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the one or more cell or cell lines (such as, for example HEK293 cells, HEK293T cells, Cos-7, MA104 cells, CHO cells, Fibroblasts, B cells, VERO cells, Madin-Darby Canine Kidney (MDCK) cells, HEp-2 cells, HeLa cells, MRC-5 cells, WI-38 cells, EB66, or PER C6 cells) expressing MHC class I or class II molecules, are engineered to express an MHC molecule.


It is understood and herein contemplated that there are numerous methods known in the art for measuring T cell activity that could be used in the disclosed methods of identifying neoantigens, including, but not limited to chromium release assays, ELISA, ELISpot, intracellular cytokine staining, flow cytometry, MHC class I tetramer staining, MHC class II tetramer staining, radioimmunoassay, or other immunoassay. The immunoassays can measure the release of any number of effector cytokines (such as, for example, release of cytokines including, but not limited to IFN-γ, TGF-β, lymphotoxin-α, IL-2, IL-4, IL-10, IL-17, or IL-25) or expression of cell surface markers associated with T cell activation (CD69, CD62L, CD44, CD45 RA, CD45RO, and/or CCR7).


1. Immunoassays and Fluorochromes 52. The steps of various useful immunodetection methods have been described in the scientific literature, such as, e.g., Maggio et al., Enzyme-Immunoassay, (1987) and Nakamura, et al., Enzyme Immunoassays: Heterogeneous and Homogeneous Systems, Handbook of Experimental Immunology, Vol. 1: Immunochemistry, 27.1-27.20 (1986), each of which is incorporated herein by reference in its entirety and specifically for its teaching regarding immunodetection methods. Immunoassays, in their most simple and direct sense, are binding assays involving binding between antibodies and antigen. Many types and formats of immunoassays are known and all are suitable for detecting the disclosed biomarkers. Examples of immunoassays are enzyme linked immunosorbent assays (ELISAs), radioimmunoassays (RIA), radioimmune precipitation assays (RIPA), immunobead capture assays, Western blotting, dot blotting, gel-shift assays, Flow cytometry, protein arrays, multiplexed bead arrays, magnetic capture, in vivo imaging, fluorescence resonance energy transfer (FRET), and fluorescence recovery/localization after photobleaching (FRAP/FLAP).


In general, immunoassays involve contacting a sample suspected of containing a molecule of interest (such as the disclosed biomarkers) with an antibody to the molecule of interest or contacting an antibody to a molecule of interest (such as antibodies to the disclosed biomarkers) with a molecule that can be bound by the antibody, as the case may be, under conditions effective to allow the formation of immunocomplexes. Contacting a sample with the antibody to the molecule of interest or with the molecule that can be bound by an antibody to the molecule of interest under conditions effective and for a period of time sufficient to allow the formation of immune complexes (primary immune complexes) is generally a matter of simply bringing into contact the molecule or antibody and the sample and incubating the mixture for a period of time long enough for the antibodies to form immune complexes with, i.e., to bind to, any molecules (e.g., antigens) present to which the antibodies can bind. In many forms of immunoassay, the sample-antibody composition, such as a tissue section, ELISA plate, dot blot or Western blot, can then be washed to remove any non-specifically bound antibody species, allowing only those antibodies specifically bound within the primary immune complexes to be detected.


Immunoassays can include methods for detecting or quantifying the amount of a molecule of interest (such as the disclosed biomarkers or their antibodies) in a sample, which methods generally involve the detection or quantitation of any immune complexes formed during the binding process. In general, the detection of immunocomplex formation is well known in the art and can be achieved through the application of numerous approaches. These methods are generally based upon the detection of a label or marker, such as any radioactive, fluorescent, biological or enzymatic tags or any other known label.


As used herein, a label can include a fluorescent dye, a member of a binding pair, such as biotin/streptavidin, a metal (e.g., gold), or an epitope tag that can specifically interact with a molecule that can be detected, such as by producing a colored substrate or fluorescence. Substances suitable for detectably labeling proteins include fluorescent dyes (also known herein as fluorochromes and fluorophores) and enzymes that react with colorometric substrates (e.g., horseradish peroxidase). The use of fluorescent dyes is generally preferred in the practice of the invention as they can be detected at very low amounts. Furthermore, in the case where multiple antigens are reacted with a single array, each antigen can be labeled with a distinct fluorescent compound for simultaneous detection. Labeled spots on the array are detected using a fluorimeter, the presence of a signal indicating an antigen bound to a specific antibody.


Fluorophores are compounds or molecules that luminesce. Typically fluorophores absorb electromagnetic energy at one wavelength and emit electromagnetic energy at a second wavelength. Representative fluorophores include, but are not limited to, 1,5 IAEDANS; 1,8-ANS; 4-Methylumbelliferone; 5-carboxy-2,7-dichlorofluorescein; 5-Carboxyfluorescein (5-FAM); 5-Carboxynapthofluorescein; 5-Carboxytetramethylrhodamine (5-TAMRA); 5-Hydroxy Tryptamine (5-HAT); 5-ROX (carboxy-X-rhodamine); 6-Carboxyrhodamine 6G; 6-CR 6G; 6-JOE; 7-Amino-4-methylcoumarin; 7-Aminoactinomycin D (7-AAD); 7-Hydroxy-4-I methylcoumarin; 9-Amino-6-chloro-2-methoxyacridine (ACMA); ABQ; Acid Fuchsin; Acridine Orange; Acridine Red; Acridine Yellow; Acriflavin; Acriflavin Feulgen SITSA; Aequorin (Photoprotein); AFPs—AutoFluorescent Protein—(Quantum Biotechnologies) see sgGFP, sgBFP; Alexa Fluor 350™; Alexa Fluor 430™; Alexa Fluor 488™; Alexa Fluor 532™; Alexa Fluor 546™; Alexa Fluor 568™; Alexa Fluor 594™; Alexa Fluor 633™; Alexa Fluor 647™; Alexa Fluor 660™; Alexa Fluor 680™; Alizarin Complexon; Alizarin Red; Allophycocyanin (APC); AMC, AMCA-S; Aminomethylcoumarin (AMCA); AMCA-X; Aminoactinomycin D; Aminocoumarin; Anilin Blue; Anthrocyl stearate; APC-Cy7; APTRA-BTC; APTS; Astrazon Brilliant Red 4G; Astrazon Orange R; Astrazon Red 6B; Astrazon Yellow 7 GLL; Atabrine; ATTO-TAG™ CBQCA; ATTO-TAG™ FQ; Auramine; Aurophosphine G; Aurophosphine; BAO 9 (Bisaminophenyloxadiazole); BCECF (high pH); BCECF (low pH); Berberine Sulphate; Beta Lactamase; BFP blue shifted GFP (Y66H); Blue Fluorescent Protein; BFP/GFP FRET; Bimane; Bisbenzemide; Bisbenzimide (Hoechst); bis-BTC; Blancophor FFG; Blancophor SV; BOBO™-1; BOBO™-3; Bodipy 492/515; Bodipy 493/503; Bodipy 500/510; Bodipy; 505/515; Bodipy 530/550; Bodipy 542/563; Bodipy 558/568; Bodipy 564/570; Bodipy 576/589; Bodipy 581/591; Bodipy 630/650-X; Bodipy 650/665-X; Bodipy 665/676; Bodipy Fl; Bodipy FL ATP; Bodipy Fl-Ceramide; Bodipy R6G SE; Bodipy TMR; Bodipy TMR-X conjugate; Bodipy TMR-X, SE; Bodipy TR; Bodipy TR ATP; Bodipy TR-X SE; BO-PRO™-1; BO-PRO™-3; Brilliant Sulphoflavin FF; BTC; BTC-5N; Calcein; Calcein Blue; Calcium Crimson—; Calcium Green; Calcium Green-1 Ca2+ Dye; Calcium Green-2 Ca2+; Calcium Green-5N Ca2+; Calcium Green-C18 Ca2+; Calcium Orange; Calcofluor White; Carboxy-X-rhodamine (5-ROX); Cascade Blue™; Cascade Yellow; Catecholamine; CCF2 (GeneBlazer); CFDA; CFP (Cyan Fluorescent Protein); CFP/YFP FRET; Chlorophyll; Chromomycin A; Chromomycin A; CL-NERF; CMFDA; Coelenterazine; Coelenterazine cp; Coelenterazine f; Coelenterazine fcp; Coelenterazine h; Coelenterazine hcp; Coelenterazine ip; Coelenterazine n; Coelenterazine O; Coumarin Phalloidin; C-phycocyanine; CPM I Methylcoumarin; CTC; CTC Formazan; Cy2™; Cy3.1 8; Cy3.5™; Cy3™; Cy5.1 8; Cy5.5™; Cy5™; Cy7™; Cyan GFP; cyclic AMP Fluorosensor (FiCRhR); Dabcyl; Dansyl; Dansyl Amine; Dansyl Cadaverine; Dansyl Chloride; Dansyl DHPE; Dansyl fluoride; DAPI; Dapoxyl; Dapoxyl 2; Dapoxyl 3′DCFDA; DCFH (Dichlorodihydrofluorescein Diacetate); DDAO; DHR (Dihydorhodamine 123); Di-4-ANEPPS; Di-8-ANEPPS (non-ratio); DiA (4-Di 16-ASP); Dichlorodihydrofluorescein Diacetate (DCFH); DiD-Lipophilic Tracer; DiD (Di1C18(5)); DIDS; Dihydorhodamine 123 (DHR); Dil (Di1C18(3)); I Dinitrophenol; DiO (DiOC18(3)); DiR; DiR (Di1C18(7)); DM-NERF (high pH); DNP; Dopamine; DsRed; DTAF; DY-630-NHS; DY-635-NHS; EBFP; ECFP; EGFP; ELF 97; Eosin; Erythrosin; Erythrosin ITC; Ethidium Bromide; Ethidium homodimer-1 (EthD-1); Euchrysin; EukoLight; Europium (111) chloride; EYFP; Fast Blue; FDA; Feulgen (Pararosaniline); FIF (Formaldehyd Induced Fluorescence); FITC; Flazo Orange; Fluo-3; Fluo-4; Fluorescein (FITC); Fluorescein Diacetate; Fluoro-Emerald; Fluoro-Gold (Hydroxystilbamidine); Fluor-Ruby; FluorX; FM 1-43™; FM 4-46; Fura Red™ (high pH); Fura Red™/Fluo-3; Fura-2; Fura-2/BCECF; Genacryl Brilliant Red B; Genacryl Brilliant Yellow 10GF; Genacryl Pink 3G; Genacryl Yellow SGF; GeneBlazer; (CCF2); GFP (S65T); GFP red shifted (rsGFP); GFP wild type′ non-UV excitation (wtGFP); GFP wild type, UV excitation (wtGFP); GFPuv; Gloxalic Acid; Granular blue; Haematoporphyrin; Hoechst 33258; Hoechst 33342; Hoechst 34580; HPTS; Hydroxycoumarin; Hydroxystilbamidine (FluoroGold); Hydroxytryptamine; Indo-1, high calcium; Indo-1 low calcium; Indodicarbocyanine (DiD); Indotricarbocyanine (DiR); Intrawhite Cf; JC-1; JO JO-1; JO-PRO-1; LaserPro; Laurodan; LDS 751 (DNA); LDS 751 (RNA); Leucophor PAF; Leucophor SF; Leucophor WS; Lissamine Rhodamine; Lissamine Rhodamine B; Calcein/Ethidium homodimer; LOLO-1; LO-PRO-1; Lucifer Yellow; Lyso Tracker Blue; Lyso Tracker Blue-White; Lyso Tracker Green; Lyso Tracker Red; Lyso Tracker Yellow; LysoSensor Blue; LysoSensor Green; LysoSensor Yellow/Blue; Mag Green; Magdala Red (Phloxin B); Mag-Fura Red; Mag-Fura-2; Mag-Fura-5; Mag-lndo-1; Magnesium Green; Magnesium Orange; Malachite Green; Marina Blue; I Maxilon Brilliant Flavin 10 GFF; Maxilon Brilliant Flavin 8 GFF; Merocyanin; Methoxycoumarin; Mitotracker Green FM; Mitotracker Orange; Mitotracker Red; Mitramycin; Monobromobimane; Monobromobimane (mBBr-GSH); Monochlorobimane; MPS (Methyl Green Pyronine Stilbene); NBD; NBD Amine; Nile Red; Nitrobenzoxedidole; Noradrenaline; Nuclear Fast Red; i Nuclear Yellow; Nylosan Brilliant lavin EBG; Oregon Green™; Oregon Green™ 488; Oregon Green™ 500; Oregon Green™ 514; Pacific Blue; Pararosaniline (Feulgen); PBFI; PE-Cy5; PE-Cy7; PerCP; PerCP-Cy5.5; PE-TexasRed (Red 613); Phloxin B (Magdala Red); Phorwite AR; Phorwite BKL; Phorwite Rev; Phorwite RPA; Phosphine 3R; PhotoResist; Phycoerythrin B [PE]; Phycoerythrin R [PE]; PKH26 (Sigma); PKH67; PMIA; Pontochrome Blue Black; POPO-1; POPO-3; PO-PRO-1; PO-I PRO-3; Primuline; Procion Yellow; Propidium lodid (P1); PyMPO; Pyrene; Pyronine; Pyronine B; Pyrozal Brilliant Flavin 7GF; QSY 7; Quinacrine Mustard; Resorufin; RH 414; Rhod-2; Rhodamine; Rhodamine 110; Rhodamine 123; Rhodamine 5 GLD; Rhodamine 6G; Rhodamine B; Rhodamine B 200; Rhodamine B extra; Rhodamine BB; Rhodamine BG; Rhodamine Green; Rhodamine Phallicidine; Rhodamine: Phalloidine; Rhodamine Red; Rhodamine WT; Rose Bengal; R-phycocyanine; R-phycoerythrin (PE); rsGFP; S65A; S65C; S65L; S65T; Sapphire GFP; SBFI; Serotonin; Sevron Brilliant Red 2B; Sevron Brilliant Red 4G; Sevron I Brilliant Red B; Sevron Orange; Sevron Yellow L; sgBFP™ (super glow BFP); sgGFP™ (super glow GFP); SITS (Primuline; Stilbene Isothiosulphonic Acid); SNAFL calcein; SNAFL-1; SNAFL-2; SNARF calcein; SNARF1; Sodium Green; SpectrumAqua; SpectrumGreen; SpectrumOrange; Spectrum Red; SPQ (6-methoxy-N-(3 sulfopropyl) quinolinium); Stilbene; Sulphorhodamine B and C; Sulphorhodamine Extra; SYTO 11; SYTO 12; SYTO 13; SYTO 14; SYTO 15; SYTO 16; SYTO 17; SYTO 18; SYTO 20; SYTO 21; SYTO 22; SYTO 23; SYTO 24; SYTO 25; SYTO 40; SYTO 41; SYTO 42; SYTO 43; SYTO 44; SYTO 45; SYTO 59; SYTO 60; SYTO 61; SYTO 62; SYTO 63; SYTO 64; SYTO 80; SYTO 81; SYTO 82; SYTO 83; SYTO 84; SYTO 85; SYTOX Blue; SYTOX Green; SYTOX Orange; Tetracycline; Tetramethylrhodamine (TRITC); Texas Red™; Texas Red-X™ conjugate; Thiadicarbocyanine (DiSC3); Thiazine Red R; Thiazole Orange; Thioflavin 5; Thioflavin S; Thioflavin TON; Thiolyte; Thiozole Orange; Tinopol CBS (Calcofluor White); TIER; TO-PRO-1; TO-PRO-3; TO-PRO-5; TOTO-1; TOTO-3; TriColor (PE-Cy5); TRITC TetramethylRodaminelsoThioCyanate; True Blue; Tru Red; Ultralite; Uranine B; Uvitex SFC; wt GFP; WW 781; X-Rhodamine; XRITC; Xylene Orange; Y66F; Y66H; Y66W; Yellow GFP; YFP; YO-PRO-1; YO-PRO 3; YOYO-1; YOYO-3; Sybr Green; Thiazole orange (interchelating dyes); semiconductor nanoparticles such as quantum dots; or caged fluorophore (which can be activated with light or other electromagnetic energy source), or a combination thereof.


A modifier unit such as a radionuclide can be incorporated into or attached directly to any of the compounds described herein by halogenation. Examples of radionuclides useful in this embodiment include, but are not limited to, tritium, iodine-125, iodine-131, iodine-123, iodine-124, astatine-210, carbon-11, carbon-14, nitrogen-13, fluorine-18. In another aspect, the radionuclide can be attached to a linking group or bound by a chelating group, which is then attached to the compound directly or by means of a linker. Examples of radionuclides useful in the apset include, but are not limited to, Tc-99m, Re-186, Ga-68, Re-188, Y-90, Sm-153, Bi-212, Cu-67, Cu-64, and Cu-62. Radiolabeling techniques such as these are routinely used in the radiopharmaceutical industry.


The radiolabeled compounds are useful as imaging agents to diagnose neurological disease (e.g., a neurodegenerative disease) or a mental condition or to follow the progression or treatment of such a disease or condition in a mammal (e.g., a human). The radiolabeled compounds described herein can be conveniently used in conjunction with imaging techniques such as positron emission tomography (PET) or single photon emission computerized tomography (SPECT).


Labeling can be either direct or indirect. In direct labeling, the detecting antibody (the antibody for the molecule of interest) or detecting molecule (the molecule that can be bound by an antibody to the molecule of interest) include a label. Detection of the label indicates the presence of the detecting antibody or detecting molecule, which in turn indicates the presence of the molecule of interest or of an antibody to the molecule of interest, respectively. In indirect labeling, an additional molecule or moiety is brought into contact with, or generated at the site of, the immunocomplex. For example, a signal-generating molecule or moiety such as an enzyme can be attached to or associated with the detecting antibody or detecting molecule. The signal-generating molecule can then generate a detectable signal at the site of the immunocomplex. For example, an enzyme, when supplied with suitable substrate, can produce a visible or detectable product at the site of the immunocomplex. ELISAs use this type of indirect labeling.


As another example of indirect labeling, an additional molecule (which can be referred to as a binding agent) that can bind to either the molecule of interest or to the antibody (primary antibody) to the molecule of interest, such as a second antibody to the primary antibody, can be contacted with the immunocomplex. The additional molecule can have a label or signal-generating molecule or moiety. The additional molecule can be an antibody, which can thus be termed a secondary antibody. Binding of a secondary antibody to the primary antibody can form a so-called sandwich with the first (or primary) antibody and the molecule of interest. The immune complexes can be contacted with the labeled, secondary antibody under conditions effective and for a period of time sufficient to allow the formation of secondary immune complexes. The secondary immune complexes can then be generally washed to remove any non-specifically bound labeled secondary antibodies, and the remaining label in the secondary immune complexes can then be detected. The additional molecule can also be or include one of a pair of molecules or moieties that can bind to each other, such as the biotin/avidin pair. In this mode, the detecting antibody or detecting molecule should include the other member of the pair.


Other modes of indirect labeling include the detection of primary immune complexes by a two-step approach. For example, a molecule (which can be referred to as a first binding agent), such as an antibody, that has binding affinity for the molecule of interest or corresponding antibody can be used to form secondary immune complexes, as described above. After washing, the secondary immune complexes can be contacted with another molecule (which can be referred to as a second binding agent) that has binding affinity for the first binding agent, again under conditions effective and for a period of time sufficient to allow the formation of immune complexes (thus forming tertiary immune complexes). The second binding agent can be linked to a detectable label or signal-genarating molecule or moiety, allowing detection of the tertiary immune complexes thus formed. This system can provide for signal amplification.


Immunoassays that involve the detection of as substance, such as a protein or an antibody to a specific protein, include label-free assays, protein separation methods (i.e., electrophoresis), solid support capture assays, or in vivo detection. Label-free assays are generally diagnostic means of determining the presence or absence of a specific protein, or an antibody to a specific protein, in a sample. Protein separation methods are additionally useful for evaluating physical properties of the protein, such as size or net charge. Capture assays are generally more useful for quantitatively evaluating the concentration of a specific protein, or antibody to a specific protein, in a sample. Finally, in vivo detection is useful for evaluating the spatial expression patterns of the substance, i.e., where the substance can be found in a subject, tissue or cell.


Provided that the concentrations are sufficient, the molecular complexes ([Ab-Ag]n) generated by antibody-antigen interaction are visible to the naked eye, but smaller amounts may also be detected and measured due to their ability to scatter a beam of light. The formation of complexes indicates that both reactants are present, and in immunoprecipitation assays a constant concentration of a reagent antibody is used to measure specific antigen ([Ab-Ag]n), and reagent antigens are used to detect specific antibody ([Ab-Ag]n). If the reagent species is previously coated onto cells (as in hemagglutination assay) or very small particles (as in latex agglutination assay), “clumping” of the coated particles is visible at much lower concentrations. A variety of assays based on these elementary principles are in common use, including Ouchterlony immunodiffusion assay, rocket immunoelectrophoresis, and immunoturbidometric and nephelometric assays. The main limitations of such assays are restricted sensitivity (lower detection limits) in comparison to assays employing labels and, in some cases, the fact that very high concentrations of analyte can actually inhibit complex formation, necessitating safeguards that make the procedures more complex. Some of these Group 1 assays date right back to the discovery of antibodies and none of them have an actual “label” (e.g. Ag-enz). Other kinds of immunoassays that are label free depend on immunosensors, and a variety of instruments that can directly detect antibody-antigen interactions are now commercially available. Most depend on generating an evanescent wave on a sensor surface with immobilized ligand, which allows continuous monitoring of binding to the ligand. Immunosensors allow the easy investigation of kinetic interactions and, with the advent of lower-cost specialized instruments, may in the future find wide application in immunoanalysis.


The use of immunoassays to detect a specific protein can involve the separation of the proteins by electrophoresis. Electrophoresis is the migration of charged molecules in solution in response to an electric field. Their rate of migration depends on the strength of the field; on the net charge, size and shape of the molecules and also on the ionic strength, viscosity and temperature of the medium in which the molecules are moving. As an analytical tool, electrophoresis is simple, rapid and highly sensitive. It is used analytically to study the properties of a single charged species, and as a separation technique.


Generally the sample is run in a support matrix such as paper, cellulose acetate, starch gel, agarose or polyacrylamide gel. The matrix inhibits convective mixing caused by heating and provides a record of the electrophoretic run: at the end of the run, the matrix can be stained and used for scanning, autoradiography or storage. In addition, the most commonly used support matrices—agarose and polyacrylamide—provide a means of separating molecules by size, in that they are porous gels. A porous gel may act as a sieve by retarding, or in some cases completely obstructing, the movement of large macromolecules while allowing smaller molecules to migrate freely. Because dilute agarose gels are generally more rigid and easy to handle than polyacrylamide of the same concentration, agarose is used to separate larger macromolecules such as nucleic acids, large proteins and protein complexes. Polyacrylamide, which is easy to handle and to make at higher concentrations, is used to separate most proteins and small oligonucleotides that require a small gel pore size for retardation.


Proteins are amphoteric compounds; their net charge therefore is determined by the pH of the medium in which they are suspended. In a solution with a pH above its isoelectric point, a protein has a net negative charge and migrates towards the anode in an electrical field. Below its isoelectric point, the protein is positively charged and migrates towards the cathode. The net charge carried by a protein is in addition independent of its size—i.e., the charge carried per unit mass (or length, given proteins and nucleic acids are linear macromolecules) of molecule differs from protein to protein. At a given pH therefore, and under non-denaturing conditions, the electrophoretic separation of proteins is determined by both size and charge of the molecules.


Sodium dodecyl sulphate (SDS) is an anionic detergent which denatures proteins by “wrapping around” the polypeptide backbone—and SDS binds to proteins fairly specifically in a mass ratio of 1.4:1. In so doing, SDS confers a negative charge to the polypeptide in proportion to its length. Further, it is usually necessary to reduce disulfide bridges in proteins (denature) before they adopt the random-coil configuration necessary for separation by size; this is done with 2-mercaptoethanol or dithiothreitol (DTT). In denaturing SDS-PAGE separations therefore, migration is determined not by intrinsic electrical charge of the polypeptide, but by molecular weight.


Determination of molecular weight is done by SDS-PAGE of proteins of known molecular weight along with the protein to be characterized. A linear relationship exists between the logarithm of the molecular weight of an SDS-denatured polypeptide, or native nucleic acid, and its Rf. The Rf is calculated as the ratio of the distance migrated by the molecule to that migrated by a marker dye-front. A simple way of determining relative molecular weight by electrophoresis (Mr) is to plot a standard curve of distance migrated vs. log 10 MW for known samples, and read off the log Mr of the sample after measuring distance migrated on the same gel.


In two-dimensional electrophoresis, proteins are fractionated first on the basis of one physical property, and, in a second step, on the basis of another. For example, isoelectric focusing can be used for the first dimension, conveniently carried out in a tube gel, and SDS electrophoresis in a slab gel can be used for the second dimension. One example of a procedure is that of O'Farrell, P. H., High Resolution Two-dimensional Electrophoresis of Proteins, J. Biol. Chem. 250:4007-4021 (1975), herein incorporated by reference in its entirety for its teaching regarding two-dimensional electrophoresis methods. Other examples include but are not limited to, those found in Anderson, L and Anderson, NG, High resolution two-dimensional electrophoresis of human plasma proteins, Proc. Natl. Acad. Sci. 74:5421-5425 (1977), Ornstein, L., Disc electrophoresis, L. Ann. N.Y. Acad. Sci. 121:321349 (1964), each of which is herein incorporated by reference in its entirety for teachings regarding electrophoresis methods. Laemmli, U.K., Cleavage of structural proteins during the assembly of the head of bacteriophage T4, Nature 227:680 (1970), which is herein incorporated by reference in its entirety for teachings regarding electrophoresis methods, discloses a discontinuous system for resolving proteins denatured with SDS. The leading ion in the Laemmli buffer system is chloride, and the trailing ion is glycine. Accordingly, the resolving gel and the stacking gel are made up in Tris-HCl buffers (of different concentration and pH), while the tank buffer is Tris-glycine. All buffers contain 0.1% SDS.


One example of an immunoassay that uses electrophoresis that is contemplated in the current methods is Western blot analysis. Western blotting or immunoblotting allows the determination of the molecular mass of a protein and the measurement of relative amounts of the protein present in different samples. Detection methods include chemiluminescence and chromogenic detection. Standard methods for Western blot analysis can be found in, for example, D. M. Bollag et al., Protein Methods (2d edition 1996) and E. Harlow & D. Lane, Antibodies, a Laboratory Manual (1988), U.S. Pat. No. 4,452,901, each of which is herein incorporated by reference in their entirety for teachings regarding Western blot methods. Generally, proteins are separated by gel electrophoresis, usually SDS-PAGE. The proteins are transferred to a sheet of special blotting paper, e.g., nitrocellulose, though other types of paper, or membranes, can be used. The proteins retain the same pattern of separation they had on the gel. The blot is incubated with a generic protein (such as milk proteins) to bind to any remaining sticky places on the nitrocellulose. An antibody is then added to the solution which is able to bind to its specific protein.


The attachment of specific antibodies to specific immobilized antigens can be readily visualized by indirect enzyme immunoassay techniques, usually using a chromogenic substrate (e.g. alkaline phosphatase or horseradish peroxidase) or chemiluminescent substrates. Other possibilities for probing include the use of fluorescent or radioisotope labels (e.g., fluorescein, 125I). Probes for the detection of antibody binding can be conjugated anti-immunoglobulins, conjugated staphylococcal Protein A (binds IgG), or probes to biotinylated primary antibodies (e.g., conjugated avidin/streptavidin).


The power of the technique lies in the simultaneous detection of a specific protein by means of its antigenicity, and its molecular mass. Proteins are first separated by mass in the SDS-PAGE, then specifically detected in the immunoassay step. Thus, protein standards (ladders) can be run simultaneously in order to approximate molecular mass of the protein of interest in a heterogeneous sample.


The gel shift assay or electrophoretic mobility shift assay (EMSA) can be used to detect the interactions between DNA binding proteins and their cognate DNA recognition sequences, in both a qualitative and quantitative manner. Exemplary techniques are described in Ornstein L., Disc electrophoresis—I: Background and theory, Ann. NY Acad. Sci. 121:321-349 (1964), and Matsudiara, PT and DR Burgess, SDS microslab linear gradient polyacrylamide gel electrophoresis, Anal. Biochem. 87:386-396 (1987), each of which is herein incorporated by reference in its entirety for teachings regarding gel-shift assays.


In a general gel-shift assay, purified proteins or crude cell extracts can be incubated with a labeled (e.g., 32P-radiolabeled) DNA or RNA probe, followed by separation of the complexes from the free probe through a nondenaturing polyacrylamide gel. The complexes migrate more slowly through the gel than unbound probe. Depending on the activity of the binding protein, a labeled probe can be either double-stranded or single-stranded. For the detection of DNA binding proteins such as transcription factors, either purified or partially purified proteins, or nuclear cell extracts can be used. For detection of RNA binding proteins, either purified or partially purified proteins, or nuclear or cytoplasmic cell extracts can be used. The specificity of the DNA or RNA binding protein for the putative binding site is established by competition experiments using DNA or RNA fragments or oligonucleotides containing a binding site for the protein of interest, or other unrelated sequence. The differences in the nature and intensity of the complex formed in the presence of specific and nonspecific competitor allows identification of specific interactions.


Gel shift methods can include using, for example, colloidal forms of COOMASSIE (Imperial Chemicals Industries, Ltd) blue stain to detect proteins in gels such as polyacrylamide electrophoresis gels. Such methods are described, for example, in Neuhoff et al., Electrophoresis 6:427-448 (1985), and Neuhoff et al., Electrophoresis 9:255-262 (1988), each of which is herein incorporated by reference in its entirety for teachings regarding gel shift methods. In addition to the conventional protein assay methods referenced above, a combination cleaning and protein staining composition is described in U.S. Pat. No. 5,424,000, herein incorporated by reference in its entirety for its teaching regarding gel shift methods. The solutions can include phosphoric, sulfuric, and nitric acids, and Acid Violet dye.


Radioimmune Precipitation Assay (RIPA) is a sensitive assay using radiolabeled antigens to detect specific antibodies in serum. The antigens are allowed to react with the serum and then precipitated using a special reagent such as, for example, protein A sepharose beads. The bound radiolabeled immunoprecipitate is then commonly analyzed by gel electrophoresis. Radioimmunoprecipitation assay (RIPA) is often used as a confirmatory test for diagnosing the presence of HIV antibodies. RIPA is also referred to in the art as Farr Assay, Precipitin Assay, Radioimmune Precipitin Assay; Radioimmunoprecipitation Analysis; Radioimmunoprecipitation Analysis, and Radioimmunoprecipitation Analysis.


While the above immunoassays that utilize electrophoresis to separate and detect the specific proteins of interest allow for evaluation of protein size, they are not very sensitive for evaluating protein concentration. However, also contemplated are immunoassays wherein the protein or antibody specific for the protein is bound to a solid support (e.g., tube, well, bead, or cell) to capture the antibody or protein of interest, respectively, from a sample, combined with a method of detecting the protein or antibody specific for the protein on the support. Examples of such immunoassays include Radioimmunoassay (RIA), Enzyme-Linked Immunosorbent Assay (ELISA), Flow cytometry, protein array, multiplexed bead assay, and magnetic capture.


Radioimmunoassay (RIA) is a classic quantitative assay for detection of antigen-antibody reactions using a radioactively labeled substance (radioligand), either directly or indirectly, to measure the binding of the unlabeled substance to a specific antibody or other receptor system. Radioimmunoassay is used, for example, to test hormone levels in the blood without the need to use a bioassay. Non-immunogenic substances (e.g., haptens) can also be measured if coupled to larger carrier proteins (e.g., bovine gamma-globulin or human serum albumin) capable of inducing antibody formation. RIA involves mixing a radioactive antigen (because of the ease with which iodine atoms can be introduced into tyrosine residues in a protein, the radioactive isotopes 125I or 131I are often used) with antibody to that antigen. The antibody is generally linked to a solid support, such as a tube or beads. Unlabeled or “cold” antigen is then adding in known quantities and measuring the amount of labeled antigen displaced. Initially, the radioactive antigen is bound to the antibodies. When cold antigen is added, the two compete for antibody binding sites—and at higher concentrations of cold antigen, more binds to the antibody, displacing the radioactive variant. The bound antigens are separated from the unbound ones in solution and the radioactivity of each used to plot a binding curve. The technique is both extremely sensitive, and specific.


Enzyme-Linked Immunosorbent Assay (ELISA), or more generically termed EIA (Enzyme ImmunoAssay), is an immunoassay that can detect an antibody specific for a protein. In such an assay, a detectable label bound to either an antibody-binding or antigen-binding reagent is an enzyme. When exposed to its substrate, this enzyme reacts in such a manner as to produce a chemical moiety which can be detected, for example, by spectrophotometric, fluorometric or visual means. Enzymes which can be used to detectably label reagents useful for detection include, but are not limited to, horseradish peroxidase, alkaline phosphatase, glucose oxidase, β-galactosidase, ribonuclease, urease, catalase, malate dehydrogenase, staphylococcal nuclease, asparaginase, yeast alcohol dehydrogenase, alpha.-glycerophosphate dehydrogenase, triose phosphate isomerase, glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase.


Variations of ELISA techniques are known to those of skill in the art. In one variation, antibodies that can bind to proteins can be immobilized onto a selected surface exhibiting protein affinity, such as a well in a polystyrene microtiter plate. Then, a test composition suspected of containing a marker antigen can be added to the wells. After binding and washing to remove non-specifically bound immunocomplexes, the bound antigen can be detected. Detection can be achieved by the addition of a second antibody specific for the target protein, which is linked to a detectable label. This type of ELISA is a simple “sandwich ELISA.” Detection also can be achieved by the addition of a second antibody, followed by the addition of a third antibody that has binding affinity for the second antibody, with the third antibody being linked to a detectable label.


Another variation is a competition ELISA. In competition ELISA's, test samples compete for binding with known amounts of labeled antigens or antibodies. The amount of reactive species in the sample can be determined by mixing the sample with the known labeled species before or during incubation with coated wells. The presence of reactive species in the sample acts to reduce the amount of labeled species available for binding to the well and thus reduces the ultimate signal.


Regardless of the format employed, ELISAs have certain features in common, such as coating, incubating or binding, washing to remove non-specifically bound species, and detecting the bound immunecomplexes. Antigen or antibodies can be linked to a solid support, such as in the form of plate, beads, dipstick, membrane or column matrix, and the sample to be analyzed applied to the immobilized antigen or antibody. In coating a plate with either antigen or antibody, one will generally incubate the wells of the plate with a solution of the antigen or antibody, either overnight or for a specified period of hours. The wells of the plate can then be washed to remove incompletely adsorbed material. Any remaining available surfaces of the wells can then be “coated” with a nonspecific protein that is antigenically neutral with regard to the test antisera. These include bovine serum albumin (BSA), casein and solutions of milk powder. The coating allows for blocking of nonspecific adsorption sites on the immobilizing surface and thus reduces the background caused by nonspecific binding of antisera onto the surface.


In ELISAs, a secondary or tertiary detection means rather than a direct procedure can also be used. Thus, after binding of a protein or antibody to the well, coating with a non-reactive material to reduce background, and washing to remove unbound material, the immobilizing surface is contacted with the control clinical or biological sample to be tested under conditions effective to allow immunecomplex (antigen/antibody) formation. Detection of the immunecomplex then requires a labeled secondary binding agent or a secondary binding agent in conjunction with a labeled third binding agent.


Enzyme-Linked Immunospot Assay (ELISpot) is an immunoassay that can detect an antibody specific for a protein or antigen. In such an assay, a detectable label bound to either an antibody-binding or antigen-binding reagent is an enzyme. When exposed to its substrate, this enzyme reacts in such a manner as to produce a chemical moiety which can be detected, for example, by spectrophotometric, fluorometric or visual means. Enzymes which can be used to detectably label reagents useful for detection include, but are not limited to, horseradish peroxidase, alkaline phosphatase, glucose oxidase, β-galactosidase, ribonuclease, urease, catalase, malate dehydrogenase, staphylococcal nuclease, asparaginase, yeast alcohol dehydrogenase, alpha.-glycerophosphate dehydrogenase, triose phosphate isomerase, glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. In this assay a nitrocellulose microtiter plate is coated with antigen. The test sample is exposed to the antigen and then reacted similarly to an ELISA assay. Detection differs from a traditional ELISA in that detection is determined by the enumeration of spots on the nitrocellulose plate. The presence of a spot indicates that the sample reacted to the antigen. The spots can be counted and the number of cells in the sample specific for the antigen determined.


“Under conditions effective to allow immunecomplex (antigen/antibody) formation” means that the conditions include diluting the antigens and antibodies with solutions such as BSA, bovine gamma globulin (BGG) and phosphate buffered saline (PBS)/Tween so as to reduce non-specific binding and to promote a reasonable signal to noise ratio.


The suitable conditions also mean that the incubation is at a temperature and for a period of time sufficient to allow effective binding. Incubation steps can typically be from about 1 minute to twelve hours, at temperatures of about 20° to 30° C., or can be incubated overnight at about 0° C. to about 10° C.


Following all incubation steps in an ELISA, the contacted surface can be washed so as to remove non-complexed material. A washing procedure can include washing with a solution such as PBS/Tween or borate buffer. Following the formation of specific immunecomplexes between the test sample and the originally bound material, and subsequent washing, the occurrence of even minute amounts of immunecomplexes can be determined.


To provide a detecting means, the second or third antibody can have an associated label to allow detection, as described above. This can be an enzyme that can generate color development upon incubating with an appropriate chromogenic substrate. Thus, for example, one can contact and incubate the first or second immunecomplex with a labeled antibody for a period of time and under conditions that favor the development of further immunecomplex formation (e.g., incubation for 2 hours at room temperature in a PBS-containing solution such as PBS-Tween).


After incubation with the labeled antibody, and subsequent to washing to remove unbound material, the amount of label can be quantified, e.g., by incubation with a chromogenic substrate such as urea and bromocresol purple or 2,2′-azido-di-(3-ethyl-benzthiazoline-6-sulfonic acid [ABTS] and H2O2, in the case of peroxidase as the enzyme label. Quantitation can then be achieved by measuring the degree of color generation, e.g., using a visible spectra spectrophotometer.


Protein arrays are solid-phase ligand binding assay systems using immobilized proteins on surfaces which include glass, membranes, microtiter wells, mass spectrometer plates, and beads or other particles. The assays are highly parallel (multiplexed) and often miniaturized (microarrays, protein chips). Their advantages include being rapid and automatable, capable of high sensitivity, economical on reagents, and giving an abundance of data for a single experiment. Bioinformatics support is important; the data handling demands sophisticated software and data comparison analysis. However, the software can be adapted from that used for DNA arrays, as can much of the hardware and detection systems.


One of the chief formats is the capture array, in which ligand-binding reagents, which are usually antibodies but can also be alternative protein scaffolds, peptides or nucleic acid aptamers, are used to detect target molecules in mixtures such as plasma or tissue extracts. In diagnostics, capture arrays can be used to carry out multiple immunoassays in parallel, both testing for several analytes in individual sera for example and testing many serum samples simultaneously. In proteomics, capture arrays are used to quantitate and compare the levels of proteins in different samples in health and disease, i.e. protein expression profiling. Proteins other than specific ligand binders are used in the array format for in vitro functional interaction screens such as protein-protein, protein-DNA, protein-drug, receptor-ligand, enzyme-substrate, etc. The capture reagents themselves are selected and screened against many proteins, which can also be done in a multiplex array format against multiple protein targets.


For construction of arrays, sources of proteins include cell-based expression systems for recombinant proteins, purification from natural sources, production in vitro by cell-free translation systems, and synthetic methods for peptides. Many of these methods can be automated for high throughput production. For capture arrays and protein function analysis, it is important that proteins should be correctly folded and functional; this is not always the case, e.g. where recombinant proteins are extracted from bacteria under denaturing conditions. Nevertheless, arrays of denatured proteins are useful in screening antibodies for cross-reactivity, identifying autoantibodies and selecting ligand binding proteins.


Protein arrays have been designed as a miniaturization of familiar immunoassay methods such as ELISA and dot blotting, often utilizing fluorescent readout, and facilitated by robotics and high throughput detection systems to enable multiple assays to be carried out in parallel. Commonly used physical supports include glass slides, silicon, microwells, nitrocellulose or PVDF membranes, and magnetic and other microbeads. While microdrops of protein delivered onto planar surfaces are the most familiar format, alternative architectures include CD centrifugation devices based on developments in microfluidics (Gyros, Monmouth Junction, N.J.) and specialised chip designs, such as engineered microchannels in a plate (e.g., The Living Chip™, Biotrove, Woburn, Mass.) and tiny 3D posts on a silicon surface (Zyomyx, Hayward Calif.). Particles in suspension can also be used as the basis of arrays, providing they are coded for identification; systems include colour coding for microbeads (Luminex, Austin, Tex.; Bio-Rad Laboratories) and semiconductor nanocrystals (e.g., QDots™, Quantum Dot, Hayward, Calif.), and barcoding for beads (UltraPlex™, SmartBead Technologies Ltd, Babraham, Cambridge, UK) and multimetal microrods (e.g., Nanobarcodes™ particles, Nanoplex Technologies, Mountain View, Calif.). Beads can also be assembled into planar arrays on semiconductor chips (LEAPS technology, BioArray Solutions, Warren, N.J.).


Immobilization of proteins involves both the coupling reagent and the nature of the surface being coupled to. A good protein array support surface is chemically stable before and after the coupling procedures, allows good spot morphology, displays minimal nonspecific binding, does not contribute a background in detection systems, and is compatible with different detection systems. The immobilization method used are reproducible, applicable to proteins of different properties (size, hydrophilic, hydrophobic), amenable to high throughput and automation, and compatible with retention of fully functional protein activity. Orientation of the surface-bound protein is recognized as an important factor in presenting it to ligand or substrate in an active state; for capture arrays the most efficient binding results are obtained with orientated capture reagents, which generally require site-specific labeling of the protein.


Both covalent and noncovalent methods of protein immobilization are used and have various pros and cons. Passive adsorption to surfaces is methodologically simple, but allows little quantitative or orientational control; it may or may not alter the functional properties of the protein, and reproducibility and efficiency are variable. Covalent coupling methods provide a stable linkage, can be applied to a range of proteins and have good reproducibility; however, orientation may be variable, chemical derivatization may alter the function of the protein and requires a stable interactive surface. Biological capture methods utilizing a tag on the protein provide a stable linkage and bind the protein specifically and in reproducible orientation, but the biological reagent must first be immobilized adequately and the array may require special handling and have variable stability.


Several immobilization chemistries and tags have been described for fabrication of protein arrays. Substrates for covalent attachment include glass slides coated with amino- or aldehyde-containing silane reagents. In the Versalinx™ system (Prolinx, Bothell, Wash.) reversible covalent coupling is achieved by interaction between the protein derivatised with phenyldiboronic acid, and salicylhydroxamic acid immobilized on the support surface. This also has low background binding and low intrinsic fluorescence and allows the immobilized proteins to retain function. Noncovalent binding of unmodified protein occurs within porous structures such as HydroGel™ (PerkinElmer, Wellesley, Mass.), based on a 3-dimensional polyacrylamide gel; this substrate is reported to give a particularly low background on glass microarrays, with a high capacity and retention of protein function. Widely used biological coupling methods are through biotin/streptavidin or hexahistidine/Ni interactions, having modified the protein appropriately. Biotin may be conjugated to a poly-lysine backbone immobilized on a surface such as titanium dioxide (Zyomyx) or tantalum pentoxide (Zeptosens, Witterswil, Switzerland).


Array fabrication methods include robotic contact printing, ink-jetting, piezoelectric spotting and photolithography. A number of commercial arrayers are available [e.g. Packard Biosciences] as well as manual equipment [V & P Scientific]. Bacterial colonies can be robotically gridded onto PVDF membranes for induction of protein expression in situ.


At the limit of spot size and density are nanoarrays, with spots on the nanometer spatial scale, enabling thousands of reactions to be performed on a single chip less than 1 mm square. BioForce Laboratories have developed nanoarrays with 1521 protein spots in 85 sq microns, equivalent to 25 million spots per sq cm, at the limit for optical detection; their readout methods are fluorescence and atomic force microscopy (AFM).


Fluorescence labeling and detection methods are widely used. The same instrumentation as used for reading DNA microarrays is applicable to protein arrays. For differential display, capture (e.g., antibody) arrays can be probed with fluorescently labeled proteins from two different cell states, in which cell lysates are directly conjugated with different fluorophores (e.g. Cy-3, Cy-5) and mixed, such that the color acts as a readout for changes in target abundance. Fluorescent readout sensitivity can be amplified 10-100 fold by tyramide signal amplification (TSA) (PerkinElmer Lifesciences). Planar waveguide technology (Zeptosens) enables ultrasensitive fluorescence detection, with the additional advantage of no intervening washing procedures. High sensitivity can also be achieved with suspension beads and particles, using phycoerythrin as label (Luminex) or the properties of semiconductor nanocrystals (Quantum Dot). A number of novel alternative readouts have been developed, especially in the commercial biotech arena. These include adaptations of surface plasmon resonance (HTS Biosystems, Intrinsic Bioprobes, Tempe, Ariz.), rolling circle DNA amplification (Molecular Staging, New Haven Conn.), mass spectrometry (Intrinsic Bioprobes; Ciphergen, Fremont, Calif.), resonance light scattering (Genicon Sciences, San Diego, Calif.) and atomic force microscopy [BioForce Laboratories].


Capture arrays form the basis of diagnostic chips and arrays for expression profiling. They employ high affinity capture reagents, such as conventional antibodies, single domains, engineered scaffolds, peptides or nucleic acid aptamers, to bind and detect specific target ligands in high throughput manner.


Antibody arrays have the required properties of specificity and acceptable background, and some are available commercially (BD Biosciences, San Jose, Calif.; Clontech, Mountain View, Calif.; BioRad; Sigma, St. Louis, Mo.). Antibodies for capture arrays are made either by conventional immunization (polyclonal sera and hybridomas), or as recombinant fragments, usually expressed in E. coli, after selection from phage or ribosome display libraries (Cambridge Antibody Technology, Cambridge, UK; Biolnvent, Lund, Sweden; Affitech, Walnut Creek, Calif.; Biosite, San Diego, Calif.). In addition to the conventional antibodies, Fab and scFv fragments, single V-domains from camelids or engineered human equivalents (Domantis, Waltham, Mass.) may also be useful in arrays.


The term “scaffold” refers to ligand-binding domains of proteins, which are engineered into multiple variants capable of binding diverse target molecules with antibody-like properties of specificity and affinity. The variants can be produced in a genetic library format and selected against individual targets by phage, bacterial or ribosome display. Such ligand-binding scaffolds or frameworks include ‘Affibodies’ based on Staph. aureus protein A (Affibody, Bromma, Sweden), ‘Trinectins’ based on fibronectins (Phylos, Lexington, Mass.) and ‘Anticalins’ based on the lipocalin structure (Pieris Proteolab, Freising-Weihenstephan, Germany). These can be used on capture arrays in a similar fashion to antibodies and may have advantages of robustness and ease of production.


Nonprotein capture molecules, notably the single-stranded nucleic acid aptamers which bind protein ligands with high specificity and affinity, are also used in arrays (SomaLogic, Boulder, Colo.). Aptamers are selected from libraries of oligonucleotides by the Selex™ procedure and their interaction with protein can be enhanced by covalent attachment, through incorporation of brominated deoxyuridine and UV-activated crosslinking (photoaptamers). Photocrosslinking to ligand reduces the crossreactivity of aptamers due to the specific steric requirements. Aptamers have the advantages of ease of production by automated oligonucleotide synthesis and the stability and robustness of DNA; on photoaptamer arrays, universal fluorescent protein stains can be used to detect binding.


Protein analytes binding to antibody arrays may be detected directly or via a secondary antibody in a sandwich assay. Direct labelling is used for comparison of different samples with different colors. Where pairs of antibodies directed at the same protein ligand are available, sandwich immunoassays provide high specificity and sensitivity and are therefore the method of choice for low abundance proteins such as cytokines; they also give the possibility of detection of protein modifications. Label-free detection methods, including mass spectrometry, surface plasmon resonance and atomic force microscopy, avoid alteration of ligand. What is required from any method is optimal sensitivity and specificity, with low background to give high signal to noise. Since analyte concentrations cover a wide range, sensitivity has to be tailored appropriately; serial dilution of the sample or use of antibodies of different affinities are solutions to this problem. Proteins of interest are frequently those in low concentration in body fluids and extracts, requiring detection in the pg range or lower, such as cytokines or the low expression products in cells.


An alternative to an array of capture molecules is one made through ‘molecular imprinting’ technology, in which peptides (e.g., from the C-terminal regions of proteins) are used as templates to generate structurally complementary, sequence-specific cavities in a polymerizable matrix; the cavities can then specifically capture (denatured) proteins that have the appropriate primary amino acid sequence (ProteinPrint™, Aspira Biosystems, Burlingame, Calif.).


Another methodology which can be used diagnostically and in expression profiling is the ProteinChip® array (Ciphergen, Fremont, Calif.), in which solid phase chromatographic surfaces bind proteins with similar characteristics of charge or hydrophobicity from mixtures such as plasma or tumour extracts, and SELDI-TOF mass spectrometry is used to detection the retained proteins.


Large-scale functional chips have been constructed by immobilizing large numbers of purified proteins and used to assay a wide range of biochemical functions, such as protein interactions with other proteins, drug-target interactions, enzyme-substrates, etc. Generally they require an expression library, cloned into E. coli, yeast or similar from which the expressed proteins are then purified, e.g. via a His tag, and immobilized. Cell free protein transcription/translation is a viable alternative for synthesis of proteins which do not express well in bacterial or other in vivo systems.


For detecting protein-protein interactions, protein arrays can be in vitro alternatives to the cell-based yeast two-hybrid system and may be useful where the latter is deficient, such as interactions involving secreted proteins or proteins with disulphide bridges. High-throughput analysis of biochemical activities on arrays has been described for yeast protein kinases and for various functions (protein-protein and protein-lipid interactions) of the yeast proteome, where a large proportion of all yeast open-reading frames was expressed and immobilised on a microarray. Large-scale ‘proteome chips’ promise to be very useful in identification of functional interactions, drug screening, etc. (Proteometrix, Branford, Conn.).


As a two-dimensional display of individual elements, a protein array can be used to screen phage or ribosome display libraries, in order to select specific binding partners, including antibodies, synthetic scaffolds, peptides and aptamers. In this way, ‘library against library’ screening can be carried out. Screening of drug candidates in combinatorial chemical libraries against an array of protein targets identified from genome projects is another application of the approach.


A multiplexed bead assay, such as, for example, the BD™ Cytometric Bead Array, is a series of spectrally discrete particles that can be used to capture and quantitate soluble analytes. The analyte is then measured by detection of a fluorescence-based emission and flow cytometric analysis. Multiplexed bead assay generates data that is comparable to ELISA based assays, but in a “multiplexed” or simultaneous fashion. Concentration of unknowns is calculated for the cytometric bead array as with any sandwich format assay, i.e. through the use of known standards and plotting unknowns against a standard curve. Further, multiplexed bead assay allows quantification of soluble analytes in samples never previously considered due to sample volume limitations. In addition to the quantitative data, powerful visual images can be generated revealing unique profiles or signatures that provide the user with additional information at a glance.


Accordingly, in one aspect, disclosed herein are methods of identifying neoantigens any neoantigen disclosed herein, wherein the T cell activity (such as, for example, release of cytokines including, but not limited to IFN-γ, TGF-β, lymphotoxin-α, IL-2, IL-4, IL-10, IL-17, or IL-25) is measured by ELISA, ELISpot, Intracellular cytokine staining, or Chromium Release.


It is understood and herein contemplated that the disclosed methods of identifying neoantigens can be used to identify neoantigens in any cancer, including, but not limited to B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers, small cell lung cancer, non-small cell lung cancer, neuroblastoma, glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancers, melanoma, basal cell carcinoma, squamous cell carcinoma, liver cancer, squamous cell carcinomas of the mouth, throat, larynx, and lung, cervical cancer, cervical carcinoma, breast cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, AIDS-related lymphomas, or AIDS-related sarcomas. Therefore, in one aspect, also disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the cancer is selected from the group consisting of B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers, small cell lung cancer, non-small cell lung cancer, neuroblastoma, glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancers, melanoma, basal cell carcinoma, squamous cell carcinoma, liver cancer, squamous cell carcinomas of the mouth, throat, larynx, and lung, cervical cancer, cervical carcinoma, breast cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, AIDS-related lymphomas, or AIDS-related sarcomas.


C. Compositions

Disclosed are the components to be used to prepare the disclosed compositions as well as the compositions themselves to be used within the methods disclosed herein. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference of each various individual and collective combinations and permutation of these compounds may not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a particular neoantigen is disclosed and discussed and a number of modifications that can be made to a number of molecules including the neoantigen are discussed, specifically contemplated is each and every combination and permutation of neoantigen and the modifications that are possible unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, A-D is disclosed, then even if each is not individually recited each is individually and collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered disclosed. Likewise, any subset or combination of these is also disclosed. Thus, for example, the sub-group of A-E, B-F, and C-E would be considered disclosed. This concept applies to all aspects of this application including, but not limited to, steps in methods of making and using the disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the disclosed methods.


The methods disclosed herein identify neoantigens that can be used in compositions for the treatment of cancer (either as a therapeutic treatment or prophylactic treatment) as well as for the creating of CAR T cells, TCR T cells, and/or TILs that can be used to treat cancer. Thus, in one aspect, also disclosed herein are neoantigens identified by the methods of identifying any neoantigen disclosed herein. For example, disclosed herein are polypeptides comprising the amino acid sequence AEPKRKSSLFWHAFNRLTPFRK (SEQ ID NO: 2) or a fragment thereof comprising at least 9 amino acids, wherein any fragment of the sequence at least comprises LFWHAFNRL (SEQ ID NO: 7), for example, SSLFWHAFNRLTP (SEQ ID NO: 4), RKSSLFWHAFNRL (SEQ ID NO: 3), RKSSLFWHAFNRLTPFR (SEQ ID NO: 6), LFWHAFNRLTPFR (SEQ ID NO: 5), SLFWHAFNRL (SEQ ID NO: 88), SSLFWHAFNRL (SEQ ID NO: 89), SSLFWHAFNRLT (SEQ ID NO: 90), LFWHAFNRLT (SEQ ID NO: 91), LFWHAFNRLTP (SEQ ID NO: 92), SLFWHAFNRLT (SEQ ID NO: 93), LFWHAFNRLTPF (SEQ ID NO: 94); a polypeptide comprising the amino acid sequence FQLLLEKPFQIFCAELWVRDINDHA (SEQ ID NO: 9) or a fragment thereof comprising at least 13 amino acids, wherein any fragment of the sequence at least comprises LEKPFQIFCAELW (SEQ ID NO: 12), for example, LEKPFQIFCAELWV (SEQ ID NO: 95) and LEKPFQIFCAELWVR (SEQ ID NO: 96); a polypeptide comprising the amino acid sequence ENSPLGTEFPLNYALDLDVGSNNVQ (SEQ ID NO: 14) or a fragment thereof comprising at least 13 amino acids, wherein any fragment of the sequence at least comprises LGTEFPLNYALDL (SEQ ID NO: 17), for example LGTEFPLNYALDLD (SEQ ID NO: 97), LGTEFPLNYALDLDV (SEQ ID NO: 98), LGTEFPLNYALDLDVG (SEQ ID NO: 99), LGTEFPLNYALDLDVGS (SEQ ID NO: 100), LGTEFPLNYALDLDVGSN (SEQ ID NO: 101), LGTEFPLNYALDLDVGSNN (SEQ ID NO: 102), LGTEFPLNYALDLDVGSNNV (SEQ ID NO: 103), LGTEFPLNYALDLDVGSNNVQ (SEQ ID NO: 104), ENSPLGTEFPLNYALDL (SEQ ID NO: 105), NSPLGTEFPLNYALDL (SEQ ID NO: 106), SPLGTEFPLNYALDL (SEQ ID NO: 107), PLGTEFPLNYALDL (SEQ ID NO: 108), PLGTEFPLNYALDLD (SEQ ID NO: 109), PLGTEFPLNYALDLDV (SEQ ID NO: 110), SPLGTEFPLNYALDLD (SEQ ID NO: 111), SPLGTEFPLNYALDLDV (SEQ ID NO: 112), NSPLGTEFPLNYALDLDV (SEQ ID NO: 113), and ENSPLGTEFPLNYALDLDV (SEQ ID NO: 114); a polypeptide comprising the amino acid sequence MTDDKDVLRNVWFGRIPTCFT (SEQ ID NO: 19) or a fragment thereof comprising at least 11 amino acids, wherein any fragment of the sequence at least comprises KDVLRNVWFGR (SEQ ID NO: 23), for example, DDKDVLRNVWFGR (SEQ ID NO: 22), KDVLRNVWFGRIP (SEQ ID NO: 21), DDKDVLRNVWFGRIP (SEQ ID NO: 24), DKDVLRNVWFGR (SEQ ID NO: 115), KDVLRNVWFGRI (SEQ ID NO: 116), DKDVLRNVWFGRI (SEQ ID NO: 117), DKDVLRNVWFGRIP (SEQ ID NO: 118), DDKDVLRNVWFGRI (SEQ ID NO: 119), DDKDVLRNVWFGRIPT (SEQ ID NO: 120), DDKDVLRNVWFGRIPTC (SEQ ID NO: 121), and TDDKDVLRNVWFGRIPT (SEQ ID NO: 122); a polypeptide comprising the amino acid sequence RLKASLDRPFTNSESAFYSIVGLSS (SEQ ID NO: 26) or a fragment thereof comprising at least 10 amino acids, wherein any fragment of the sequence at least comprises PFTNSESAFY (SEQ ID NO: 30), for example, DRPFTNSESAFYS (SEQ ID NO: 28), PFTNSESAFYS (SEQ ID NO: 123), PFTNSESAFYSI (SEQ ID NO: 124), PFTNSESAFYSIV (SEQ ID NO: 29), RPFTNSESAFY (SEQ ID NO: 125), RPFTNSESAFYS (SEQ ID NO: 126), RPFTNSESAFYSI (SEQ ID NO: 127), RPFTNSESAFYSIV (SEQ ID NO: 128), DRPFTNSESAFY (SEQ ID NO: 129), DRPFTNSESAFYSI (SEQ ID NO: 130), and DRPFTNSESAFYSIV (SEQ ID NO: 131); a polypeptide comprising the amino acid sequence GSGEKVAGRVIVKVCEVTRVKAVRI (SEQ ID NO: 32) or a fragment thereof comprising at least 9 amino acids, wherein any fragment of the sequence at least comprises RVIVKVCEV (SEQ ID NO: 36), for example, KVAGRVIVKVCEV (SEQ ID NO: 33), AGRVIVKVCEVTR (SEQ ID NO: 34), KVAGRVIVKVCEVTRVK (SEQ ID NO: 32), RVIVKVCEVTRVK (SEQ ID NO: 35), and KVAGRVIVKVCEVTRVK (SEQ ID NO: 37); a polypeptide comprising the amino acid sequence YGMYFCMNISSQEDGACVLLRALEP (SEQ ID NO: 39) or a fragment thereof comprising at least 10 amino acids, wherein any fragment of the sequence at least comprises ISSQEDGACV (SEQ ID NO: 43), for example, MNISSQEDGACVL (SEQ ID NO: 41), ISSQEDGACVLLR (SEQ ID NO: 42), MNISSQEDGACVLLR (SEQ ID NO: 44), ISSQEDGACVL (SEQ ID NO: 132), ISSQEDGACVLL (SEQ ID NO: 133), NISSQEDGACV (SEQ ID NO: 134), NISSQEDGACVL (SEQ ID NO: 135), NISSQEDGACVLL, (SEQ ID NO: 136), NISSQEDGACVLLR (SEQ ID NO: 137), and MNISSQEDGACVLL (SEQ ID NO: 138); a polypeptide comprising the amino acid sequence MKLTSLMCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 46) or a fragment thereof comprising at least 26 amino acids, wherein any fragment of the sequence at least comprises VSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 60), for example, MCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 49), KSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 50), GCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 51), GHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 52), SMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 53), VSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 54), MCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 57), VSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 139), CVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 140), MCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 141), MCVSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 142), SMCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 143), SMCVSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 144); a polypeptide comprising the amino acid sequence MLMAQEALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRR MEGAPAGPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 67) or a fragment thereof comprising at least 61 amino acids, wherein any fragment of the sequence at least comprises VPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSRR (SEQ ID NO: 73), for example, QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWKR (SEQ ID NO: 74), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR R (SEQ ID NO: 145), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RR (SEQ ID NO: 146), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRR (SEQ ID NO: 147), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRR (SEQ ID NO: 148), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RP (SEQ ID NO: 149), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRP (SEQ ID NO: 150), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRP (SEQ ID NO: 151), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRP (SEQ ID NO: 152), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPW (SEQ ID NO: 153), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPW (SEQ ID NO: 154), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPW (SEQ ID NO: 155), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPW (SEQ ID NO: 156), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPWK (SEQ ID NO: 157), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPWK (SEQ ID NO: 158), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPWK (SEQ ID NO: 159), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWK (SEQ ID NO: 160), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPWKR (SEQ ID NO: 161), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPWKR (SEQ ID NO: 162), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPWKR (SEQ ID NO: 163), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSP (SEQ ID NO: 61), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGM (SEQ ID NO: 62), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAG (SEQ ID NO: 63), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKR (SEQ ID NO: 64), LMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGG RTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 68), AMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACL SCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 69), and QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 70); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 76 (KDAARPAYWVPDYEILHCHNCRKEF); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 78 (SMPLWDFQGSTMRTSQYVRLTPDER); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 80 (YPGIKFEELFPDCIFPSESERDKIK); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 82 (LLKTRRILKCSYLYGFFLEPKSTKK); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 84 (PFYLGHTIKSGDFEYVGMEGGIVLS); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 86 (LLLAGYLAQQYLLLPTPKVIGIDLG); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 166 (TSMLILSNLVFLEGNEVGKTYWNRI); and/or variant of any of the preceding polypeptides or polypeptide fragments comprising a conservative amino acid substitution. Also disclosed are CAR T cells engineered to express a receptor (such as, for example, a T cell receptor) that can recognize one or more of the neoantigens disclosed herein. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein can be further engineered to knockout or knockdown Zinc finger and BTB domain-containing protein 7B (ThPOK), Lysine-specific histone demethylase 1 (LSD1), programmed cell death protein (PD1), and/or protein phosphatase (PP2A) to enhance their function such as cytotoxic activity and persistence or survival in vivo after adoptive transfer to a cancer patient. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein is treated with a small molecular inhibitors such 2-PAPC to enhance T cell cytotoxic activity or the ability to proliferate and survive in vivo.


It is understood and herein contemplated that the disclosed that once the sequence of the neoantigens is identified, the skilled artisan would have full knowledge of the nucleic acids that would encode said amino acid neoantigens and it would be well within the skill set of the skilled artisan to make said nucleic acid constructs. Thus, in one aspect, also disclosed herein are nucleic acids encoding a polypeptide for any neoantigen disclosed herein.


1. Homology


It is understood that one way to define any known variants and derivatives or those that might arise, of the disclosed genes and proteins herein is through defining the variants and derivatives in terms of homology and/or identify to specific known sequences. For example SEQ ID NO: 1 sets forth a particular sequence of an MHC Class II epitope peptide. Specifically disclosed are variants of these and other genes and proteins herein disclosed which have at least, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 percent homology to the stated sequence. Those of skill in the art readily understand how to determine the homology of two proteins or nucleic acids, such as genes. For example, the homology can be calculated after aligning the two sequences so that the homology is at its highest level.


Another way of calculating homology can be performed by published algorithms. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by inspection.


The same types of homology can be obtained for nucleic acids by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which are herein incorporated by reference for at least material related to nucleic acid alignment.


Nucleic Acids


There are a variety of molecules disclosed herein that are nucleic acid based, including for example the nucleic acids that encode, for example SEQ ID NO: 1, or any of the nucleic acids disclosed herein or fragments thereof, as well as various functional nucleic acids. The disclosed nucleic acids are made up of for example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and other molecules are discussed herein. It is understood that for example, when a vector is expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is understood that if, for example, an antisense molecule is introduced into a cell or cell environment through for example exogenous delivery, it is advantageous that the antisense molecule be made up of nucleotide analogs that reduce the degradation of the antisense molecule in the cellular environment.


a) Nucleotides and Related Molecules


A nucleotide is a molecule that contains a base moiety, a sugar moiety and a phosphate moiety. Nucleotides can be linked together through their phosphate moieties and sugar moieties creating an internucleoside linkage. The base moiety of a nucleotide can be adenin-9-yl (A), cytosin-1-yl (C), guanin-9-yl (G), uracil-1-yl (U), and thymin-1-yl (T). The sugar moiety of a nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide is pentavalent phosphate. An non-limiting example of a nucleotide would be 3′-AMP (3′-adenosine monophosphate) or 5′-GMP (5′-guanosine monophosphate). There are many varieties of these types of molecules available in the art and available herein.


A nucleotide analog is a nucleotide which contains some type of modification to either the base, sugar, or phosphate moieties. Modifications to nucleotides are well known in the art and would include for example, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, and 2-aminoadenine as well as modifications at the sugar or phosphate moieties. There are many varieties of these types of molecules available in the art and available herein.


Nucleotide substitutes are molecules having similar functional properties to nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide substitutes are molecules that will recognize nucleic acids in a Watson-Crick or Hoogsteen manner, but which are linked together through a moiety other than a phosphate moiety. Nucleotide substitutes are able to conform to a double helix type structure when interacting with the appropriate target nucleic acid. There are many varieties of these types of molecules available in the art and available herein.


It is also possible to link other types of molecules (conjugates) to nucleotides or nucleotide analogs to enhance for example, cellular uptake. Conjugates can be chemically linked to the nucleotide or nucleotide analogs. Such conjugates include but are not limited to lipid moieties such as a cholesterol moiety. (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556). There are many varieties of these types of molecules available in the art and available herein.


A Watson-Crick interaction is at least one interaction with the Watson-Crick face of a nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a nucleotide, nucleotide analog, or nucleotide substitute includes the C2, N1, and C6 positions of a purine based nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3, C4 positions of a pyrimidine based nucleotide, nucleotide analog, or nucleotide substitute.


A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face of a nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The Hoogsteen face includes the N7 position and reactive groups (NH2 or 0) at the C6 position of purine nucleotides.


b) Primers and Probes


Disclosed are compositions including primers and probes, which are capable of interacting with the disclosed nucleic acids, such as the neoantigens disclosed herein. In certain embodiments the primers are used to support DNA amplification reactions. Typically the primers will be capable of being extended in a sequence specific manner. Extension of a primer in a sequence specific manner includes any methods wherein the sequence and/or composition of the nucleic acid molecule to which the primer is hybridized or otherwise associated directs or influences the composition or sequence of the product produced by the extension of the primer. Extension of the primer in a sequence specific manner therefore includes, but is not limited to, PCR, DNA sequencing, DNA extension, DNA polymerization, RNA transcription, or reverse transcription. Techniques and conditions that amplify the primer in a sequence specific manner are preferred. In certain embodiments the primers are used for the DNA amplification reactions, such as PCR or direct sequencing. It is understood that in certain embodiments the primers can also be extended using non-enzymatic techniques, where for example, the nucleotides or oligonucleotides used to extend the primer are modified such that they will chemically react to extend the primer in a sequence specific manner. Typically the disclosed primers hybridize with the disclosed nucleic acids or region of the nucleic acids or they hybridize with the complement of the nucleic acids or complement of a region of the nucleic acids.


The size of the primers or probes for interaction with the nucleic acids in certain embodiments can be any size that supports the desired enzymatic manipulation of the primer, such as DNA amplification or the simple hybridization of the probe or primer. A typical primer or probe would be at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


In other embodiments a primer or probe can be less than or equal to 6, 7, 8, 9, 10, 11, 12 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


In certain embodiments this product is at least 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


In other embodiments the product is less than or equal to 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


Peptides


a) Protein Variants


As discussed herein there are numerous variants of the neoantigens that are known and herein contemplated. Protein variants and derivatives are well understood to those of skill in the art and in can involve amino acid sequence modifications. For example, amino acid sequence modifications typically fall into one or more of three classes: substitutional, insertional or deletional variants. Insertions include amino and/or carboxyl terminal fusions as well as intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be smaller insertions than those of amino or carboxyl terminal fusions, for example, on the order of one to four residues. Immunogenic fusion protein derivatives, such as those described in the examples, are made by fusing a polypeptide sufficiently large to confer immunogenicity to the target sequence by cross-linking in vitro or by recombinant cell culture transformed with DNA encoding the fusion. Deletions are characterized by the removal of one or more amino acid residues from the protein sequence. Typically, no more than about from 2 to 6 residues are deleted at any one site within the protein molecule. These variants ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding the protein, thereby producing DNA encoding the variant, and thereafter expressing the DNA in recombinant cell culture. Techniques for making substitution mutations at predetermined sites in DNA having a known sequence are well known, for example M13 primer mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single residues, but can occur at a number of different locations at once; insertions usually will be on the order of about from 1 to 10 amino acid residues; and deletions will range about from 1 to 30 residues. Deletions or insertions preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final construct. The mutations must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure. Substitutional variants are those in which at least one residue has been removed and a different residue inserted in its place. Such substitutions generally are made in accordance with the following Tables 1 and 2 and are referred to as conservative substitutions.









TABLE 1







Amino Acid Abbreviations











Amino Acid
Abbreviations















Alanine
Ala
A



allosoleucine
AIle



Arginine
Arg
R



asparagine
Asn
N



aspartic acid
Asp
D



Cysteine
Cys
C



glutamic acid
Glu
E



Glutamine
Gln
Q



Glycine
Gly
G



Histidine
His
H



Isolelucine
Ile
I



Leucine
Leu
L



Lysine
Lys
K



phenylalanine
Phe
F



proline
Pro
P



pyroglutamic acid
pGlu



Serine
Ser
S



Threonine
Thr
T



Tyrosine
Tyr
Y



Tryptophan
Trp
W



Valine
Val
V

















TABLE 2





Amino Acid Substitutions Original Residue Exemplary Conservative


Substitutions, others are known in the art.


















Ala
Ser



Arg
Lys; Gln



Asn
Gln; His



Asp
Glu



Cys
Ser



Gln
Asn, Lys



Glu
Asp



Gly
Pro



His
Asn; Gln



Ile
Leu; Val



Leu
Ile; Val



Lys
Arg; Gln



Met
Leu; Ile



Phe
Met; Leu; Tyr



Ser
Thr



Thr
Ser



Trp
Tyr



Tyr
Trp; Phe



Val
Ile; Leu










Substantial changes in function or immunological identity are made by selecting substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for example as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The substitutions which in general are expected to produce the greatest changes in the protein properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this case, (e) by increasing the number of sites for sulfation and/or glycosylation.


For example, the replacement of one amino acid residue with another that is biologically and/or chemically similar is known to those skilled in the art as a conservative substitution. For example, a conservative substitution would be replacing one hydrophobic residue for another, or one polar residue for another. The substitutions include combinations such as, for example, Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn, Gln; Ser, Thr; Lys, Arg; and Phe, Tyr. Such conservatively substituted variations of each explicitly disclosed sequence are included within the mosaic polypeptides provided herein.


Substitutional or deletional mutagenesis can be employed to insert sites for N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, e.g. Arg, is accomplished for example by deleting one of the basic residues or substituting one by glutaminyl or histidyl residues.


Certain post-translational derivatizations are the result of the action of recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post-translationally deamidated to the corresponding glutamyl and asparyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Other post-translational modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the o-amino groups of lysine, arginine, and histidine side chains (T. E. Creighton, Proteins: Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 79-86 [1983]), acetylation of the N-terminal amine and, in some instances, amidation of the C-terminal carboxyl.


It is understood that one way to define the variants and derivatives of the disclosed proteins herein is through defining the variants and derivatives in terms of homology/identity to specific known sequences. Specifically disclosed are variants of these and other proteins herein disclosed which have at least, 70% or 75% or 80% or 85% or 90% or 95% homology to the stated sequence. Those of skill in the art readily understand how to determine the homology of two proteins. For example, the homology can be calculated after aligning the two sequences so that the homology is at its highest level.


Another way of calculating homology can be performed by published algorithms. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by inspection.


The same types of homology can be obtained for nucleic acids by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989.


It is understood that the description of conservative mutations and homology can be combined together in any combination, such as embodiments that have at least 70% homology to a particular sequence wherein the variants are conservative mutations.


As this specification discusses various proteins and protein sequences it is understood that the nucleic acids that can encode those protein sequences are also disclosed. This would include all degenerate sequences related to a specific protein sequence, i.e. all nucleic acids having a sequence that encodes one particular protein sequence as well as all nucleic acids, including degenerate nucleic acids, encoding the disclosed variants and derivatives of the protein sequences. Thus, while each particular nucleic acid sequence may not be written out herein, it is understood that each and every sequence is in fact disclosed and described herein through the disclosed protein sequence.


It is understood that there are numerous amino acid and peptide analogs which can be incorporated into the disclosed compositions. For example, there are numerous D amino acids or amino acids which have a different functional substituent then the amino acids shown in Table 1 and Table 2. The opposite stereo isomers of naturally occurring peptides are disclosed, as well as the stereo isomers of peptide analogs. These amino acids can readily be incorporated into polypeptide chains by charging tRNA molecules with the amino acid of choice and engineering genetic constructs that utilize, for example, amber codons, to insert the analog amino acid into a peptide chain in a site specific way.


Molecules can be produced that resemble peptides, but which are not connected via a natural peptide linkage. For example, linkages for amino acids or amino acid analogs can include CH2NH—, —CH2S—, —CH2—CH2—, —CH═CH—(cis and trans), —COCH2—, —CH(OH)CH2—, and —CHH2SO—(These and others can be found in Spatola, A. F. in Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins, B. Weinstein, eds., Marcel Dekker, New York, p. 267 (1983); Spatola, A. F., Vega Data (March 1983), Vol. 1, Issue 3, Peptide Backbone Modifications (general review); Morley, Trends Pharm Sci (1980) pp. 463-468; Hudson, D. et al., Int J Pept Prot Res 14:177-185 (1979) (—CH2NH—, CH2CH2—); Spatola et al. Life Sci 38:1243-1249 (1986) (—CH H2—S); Hann J. Chem. Soc Perkin Trans. I 307-314 (1982) (—CH—CH—, cis and trans); Almquist et al. J. Med. Chem. 23:1392-1398 (1980) (—COCH2—); Jennings-White et al. Tetrahedron Lett 23:2533 (1982) (—COCH2—); Szelke et al. European Appln, EP 45665 CA (1982): 97:39405 (1982) (—CH(OH)CH2—); Holladay et al. Tetrahedron. Lett 24:4401-4404 (1983) (—C(OH)CH2—); and Hruby Life Sci 31:189-199 (1982) (—CH2—S—); each of which is incorporated herein by reference. A particularly preferred non-peptide linkage is —CH2NH—. It is understood that peptide analogs can have more than one atom between the bond atoms, such as b-alanine, g-aminobutyric acid, and the like.


Amino acid analogs and analogs and peptide analogs often have enhanced or desirable properties, such as, more economical production, greater chemical stability, enhanced pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity (e.g., a broad-spectrum of biological activities), reduced antigenicity, and others.


D-amino acids can be used to generate more stable peptides, because D amino acids are not recognized by peptidases and such. Systematic substitution of one or more amino acids of a consensus sequence with a D-amino acid of the same type (e.g., D-lysine in place of L-lysine) can be used to generate more stable peptides. Cysteine residues can be used to cyclize or attach two or more peptides together. This can be beneficial to constrain peptides into particular conformations.


4. Pharmaceutical carriers/Delivery of pharmaceutical products


In one aspect, it is understood and herein contemplated that the disclosed neoantigens can be administered as compositions to a subject with a cancer or likely to develop a cancer. Accordingly, disclosed herein are compositions comprising a therapeutically effective amount of one or more of the neoantigens disclosed herein (including, but not limited to peptides, polypeptides, and proteins of the neoantigens). Additionally, disclosed herein are compositions comprising a therapeutically effective amount of one or more CAR T cells, TCR T-cells and/or TILs; wherein the CAR T cell, TCR T cell, and/or TIL has been engineered to express a receptor for one or more of the neoantigens disclosed herein. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein can be further engineered to knockout or knockdown Zinc finger and BTB domain-containing protein 7B (ThPOK), Lysine-specific histone demethylase 1 (LSD1), programmed cell death protein (PD1), and/or Protein phosphatase 2 (PP2A) to enhance their function such as cytotoxic activity and persistence or survival in vivo after adoptive transfer to a cancer patient. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein is treated with a small molecular inhibitors such 2-PAPC to enhance T cell cytotoxic activity or the ability to proliferate and survive in vivo.


As described above, the compositions can also be administered in vivo in a pharmaceutically acceptable carrier. By “pharmaceutically acceptable” is meant a material that is not biologically or otherwise undesirable, i.e., the material may be administered to a subject, along with the nucleic acid or vector, without causing any undesirable biological effects or interacting in a deleterious manner with any of the other components of the pharmaceutical composition in which it is contained. The carrier would naturally be selected to minimize any degradation of the active ingredient and to minimize any adverse side effects in the subject, as would be well known to one of skill in the art.


The compositions may be administered orally, parenterally (e.g., intravenously), by intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally, topically or the like, including topical intranasal administration or administration by inhalant. As used herein, “topical intranasal administration” means delivery of the compositions into the nose and nasal passages through one or both of the nares and can comprise delivery by a spraying mechanism or droplet mechanism, or through aerosolization of the nucleic acid or vector. Administration of the compositions by inhalant can be through the nose or mouth via delivery by a spraying or droplet mechanism. Delivery can also be directly to any area of the respiratory system (e.g., lungs) via intubation. The exact amount of the compositions required will vary from subject to subject, depending on the species, age, weight and general condition of the subject, the severity of the allergic disorder being treated, the particular nucleic acid or vector used, its mode of administration and the like. Thus, it is not possible to specify an exact amount for every composition. However, an appropriate amount can be determined by one of ordinary skill in the art using only routine experimentation given the teachings herein.


Parenteral administration of the composition, if used, is generally characterized by injection. Injectables can be prepared in conventional forms, either as liquid solutions or suspensions, solid forms suitable for solution of suspension in liquid prior to injection, or as emulsions. A more recently revised approach for parenteral administration involves use of a slow release or sustained release system such that a constant dosage is maintained. See, e.g., U.S. Pat. No. 3,610,795, which is incorporated by reference herein.


The materials may be in solution, suspension (for example, incorporated into microparticles, liposomes, or cells). These may be targeted to a particular cell type via antibodies, receptors, or receptor ligands. The following references are examples of the use of this technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem., 2:447-451, (1991); Bagshawe, K. D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br. J. Cancer, 58:700-703, (1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al., Cancer Immunol. Immunother., 35:421-425, (1992); Pietersz and McKenzie, Immunolog. Reviews, 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol, 42:2062-2065, (1991)). Vehicles such as “stealth” and other antibody conjugated liposomes (including lipid mediated drug targeting to colonic carcinoma), receptor mediated targeting of DNA through cell specific ligands, lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting of murine glioma cells in vivo. The following references are examples of the use of this technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research, 49:6214-6220, (1989); and Litzinger and Huang, Biochimica et Biophysica Acta, 1104:179-187, (1992)). In general, receptors are involved in pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated vesicles, pass through an acidified endosome in which the receptors are sorted, and then either recycle to the cell surface, become stored intracellularly, or are degraded in lysosomes. The internalization pathways serve a variety of functions, such as nutrient uptake, removal of activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins, dissociation and degradation of ligand, and receptor-level regulation. Many receptors follow more than one intracellular pathway, depending on the cell type, receptor concentration, type of ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms of receptor-mediated endocytosis has been reviewed (Brown and Greene, DNA and Cell Biology 10:6, 399-409 (1991)).


a) Pharmaceutically Acceptable Carriers


The compositions, including antibodies, can be used therapeutically in combination with a pharmaceutically acceptable carrier.


Suitable carriers and their formulations are described in Remington: The Science and Practice of Pharmacy (19th ed.) ed. A. R. Gennaro, Mack Publishing Company, Easton, Pa. 1995. Typically, an appropriate amount of a pharmaceutically-acceptable salt is used in the formulation to render the formulation isotonic. Examples of the pharmaceutically-acceptable carrier include, but are not limited to, saline, Ringer's solution and dextrose solution. The pH of the solution is preferably from about 5 to about 8, and more preferably from about 7 to about 7.5. Further carriers include sustained release preparations such as semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped articles, e.g., films, liposomes or microparticles. It will be apparent to those persons skilled in the art that certain carriers may be more preferable depending upon, for instance, the route of administration and concentration of composition being administered.


Pharmaceutical carriers are known to those skilled in the art. These most typically would be standard carriers for administration of drugs to humans, including solutions such as sterile water, saline, and buffered solutions at physiological pH. The compositions can be administered intramuscularly or subcutaneously. Other compounds will be administered according to standard procedures used by those skilled in the art.


Pharmaceutical compositions may include carriers, thickeners, diluents, buffers, preservatives, surface active agents and the like in addition to the molecule of choice. Pharmaceutical compositions may also include one or more active ingredients such as antimicrobial agents, antiinflammatory agents, anesthetics, and the like.


The pharmaceutical composition may be administered in a number of ways depending on whether local or systemic treatment is desired, and on the area to be treated. Administration may be topically (including ophthalmically, vaginally, rectally, intranasally), orally, by inhalation, or parenterally, for example by intravenous drip, subcutaneous, intraperitoneal or intramuscular injection. The disclosed antibodies can be administered intravenously, intraperitoneally, intramuscularly, subcutaneously, intracavity, or transdermally.


Preparations for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Parenteral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like.


Formulations for topical administration may include ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable.


Compositions for oral administration include powders or granules, suspensions or solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, diluents, emulsifiers, dispersing aids or binders may be desirable.


Some of the compositions may potentially be administered as a pharmaceutically acceptable acid- or base-addition salt, formed by reaction with inorganic acids such as hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric acid, or by reaction with an inorganic base such as sodium hydroxide, ammonium hydroxide, potassium hydroxide, and organic bases such as mono-, di-, trialkyl and aryl amines and substituted ethanolamines.


b) Therapeutic Uses


Effective dosages and schedules for administering the compositions may be determined empirically, and making such determinations is within the skill in the art. The dosage ranges for the administration of the compositions are those large enough to produce the desired effect in which the symptoms of the disorder are effected. The dosage should not be so large as to cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage will vary with the age, condition, sex and extent of the disease in the patient, route of administration, or whether other drugs are included in the regimen, and can be determined by one of skill in the art. The dosage can be adjusted by the individual physician in the event of any counterindications. Dosage can vary, and can be administered in one or more dose administrations daily, for one or several days. Guidance can be found in the literature for appropriate dosages for given classes of pharmaceutical products. For example, guidance in selecting appropriate doses for antibodies can be found in the literature on therapeutic uses of antibodies, e.g., Handbook of Monoclonal Antibodies, Ferrone et al., eds., Noges Publications, Park Ridge, N.J., (1985) ch. 22 and pp. 303-357; Smith et al., Antibodies in Human Diagnosis and Therapy, Haber et al., eds., Raven Press, New York (1977) pp. 365-389. A typical daily dosage of the antibody used alone might range from about 1 μg/kg to up to 100 mg/kg of body weight or more per day, depending on the factors mentioned above.


D. Methods of Using the Compositions

1. Method of Treating Cancer


The disclosed compositions can be used to treat any disease where uncontrolled cellular proliferation occurs such as cancers. Accordingly, in one aspect, disclosed herein are methods of stimulating an immunological response against a cancer or treating, inhibiting, and/or preventing a cancer comprising administering to a subject a composition comprising a therapeutically effective amount of any of the neoantigens disclosed herein (for example SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163) and/or any neoantigens identified by the method of identifying a neoantigen disclosed herein. In one aspect, it is understood and herein contemplated that the disclosed neoantigens include peptide, polypeptides, and/or proteins of any disclosed neoantigen.


The term “therapeutically effective” refers to the amount of the composition used is of sufficient quantity to ameliorate one or more causes or symptoms of a disease or disorder. Such amelioration only requires a reduction or alteration, not necessarily elimination.


The term “treatment” refers to the medical management of a patient with the intent to cure, ameliorate, stabilize, or prevent a disease, pathological condition, or disorder. This term includes active treatment, that is, treatment directed specifically toward the improvement of a disease, pathological condition, or disorder, and also includes causal treatment, that is, treatment directed toward removal of the cause of the associated disease, pathological condition, or disorder. In addition, this term includes palliative treatment, that is, treatment designed for the relief of symptoms rather than the curing of the disease, pathological condition, or disorder; preventative treatment, that is, treatment directed to minimizing or partially or completely inhibiting the development of the associated disease, pathological condition, or disorder; and supportive treatment, that is, treatment employed to supplement another specific therapy directed toward the improvement of the associated disease, pathological condition, or disorder.


A non-limiting list of different types of cancers that can be treated by the disclosed methods is as follows: lymphomas (Hodgkins and non-Hodgkins), leukemias, carcinomas, carcinomas of solid tissues, squamous cell carcinomas, adenocarcinomas, sarcomas, gliomas, high grade gliomas, blastomas, neuroblastomas, plasmacytomas, histiocytomas, melanomas, adenomas, hypoxic tumors, myelomas, AIDS-related lymphomas or sarcomas, metastatic cancers, or cancers in general.


A representative but non-limiting list of cancers that the disclosed compositions can be used to treat is the following: lymphoma, B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers such as small cell lung cancer and non-small cell lung cancer, neuroblastoma/glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancer, liver cancer, melanoma, squamous cell carcinomas of the mouth, throat, larynx, and lung, colon cancer, cervical cancer, cervical carcinoma, breast cancer, and epithelial cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon cancer, and/or rectal cancers.


2. Methods of Treating Autoimmune Diseases


In one aspect, it is understood and herein contemplated that the disclosed neoantigens can be used in the treatment of autoimmune diseases. As used herein, “autoimmune disease” refers to a set of diseases, disorders, or conditions resulting from an adaptive immune response (T cell and/or B cell response) against the host organism. In such conditions, either by way of mutation or other underlying cause, the host T cells and/or B cells and/or antibodies are no longer able to distinguish host cells from non-self-antigens and attack host cells baring an antigen for which they are specific. Examples of autoimmune diseases that can cause an inflammatory skin disorder include, but are not limited to Achalasia, Acute disseminated encephalomyelitis, Acute motor axonal neuropathy, Addison's disease, Adiposis dolorosa, Adult Still's disease, Agammaglobulinemia, Alopecia areata, Alzheimer's disease, Amyloidosis, Ankylosing spondylitis, Anti-GBM/Anti-TBM nephritis, Antiphospholipid syndrome, Aplastic anemia, Autoimmune angioedema, Autoimmune dysautonomia, Autoimmune encephalomyelitis, Autoimmune enteropathy, Autoimmune hemolytic anemia, Autoimmune hepatitis, Autoimmune inner ear disease (AIED), Autoimmune myocarditis, Autoimmune oophoritis, Autoimmune orchitis, Autoimmune pancreatitis, Autoimmune polyendocrine syndrome, Autoimmune retinopathy, Autoimmune urticaria, Axonal & neuronal neuropathy (AMAN), Baló disease, Behcet's disease, Benign mucosal emphigoid, Bickerstaffs encephalitis, Bullous pemphigoid, Castleman disease (CD), Celiac disease, Chagas disease, Chronic fatigue syndrome, Chronic inflammatory demyelinating polyneuropathy (CIDP), Chronic recurrent multifocal osteomyelitis (CRMO), Churg-Strauss Syndrome (CSS), Eosinophilic Granulomatosis (EGPA), Cicatricial pemphigoid, Cogan's syndrome, Cold agglutinin disease, Congenital heart block, Coxsackie myocarditis, CREST syndrome, Crohn's disease, Dermatitis herpetiformis, Dermatomyositis, Devic's disease (neuromyelitis optica), Diabetes mellitus type 1, Discoid lupus, Dressler's syndrome, Endometriosis, Enthesitis, Eosinophilic esophagitis (EoE), Eosinophilic fasciitis, Erythema nodosum, Essential mixed cryoglobulinemia, Evans syndrome, Felty syndrome, Fibromyalgia, Fibrosing alveolitis, Giant cell arteritis (temporal arteritis), Giant cell myocarditis, Glomerulonephritis, Goodpasture's syndrome, Granulomatosis with Polyangiitis, Graves' disease, Guillain-Barre syndrome, Hashimoto's encephalopathy, Hashimoto's thyroiditis, Hemolytic anemia, Henoch-Schonlein purpura (HSP), Herpes gestationis or pemphigoid gestationis (PG), Hidradenitis Suppurativa (HS) (Acne Inversa), Hypogammalglobulinemia, IgA Nephropathy, IgG4-related sclerosing disease, Immune thrombocytopenic purpura (ITP), Inclusion body myositis (IBM), Interstitial cystitis (IC), Inflamatory Bowel Disease (IBD), Juvenile arthritis, Juvenile diabetes (Type 1 diabetes), Juvenile myositis (JM), Kawasaki disease, Lambert-Eaton syndrome, Leukocytoclastic vasculitis, Lichen planus, Lichen sclerosus, Ligneous conjunctivitis, Linear IgA disease (LAD), Lupus nephritis, Lupus vasculitis, Lyme disease chronic, Meniere's disease, Microscopic polyangiitis (MPA), Mixed connective tissue disease (MCTD), Mooren's ulcer, Mucha-Habermann disease, Multifocal Motor Neuropathy (MMN) or MMNCB, Multiple sclerosis, Myasthenia gravis, Myositis, Narcolepsy, Neonatal Lupus, Neuromyelitis optica, Neutropenia, Ocular cicatricial pemphigoid, Optic neuritis, Ord's thyroiditis, Palindromic rheumatism (PR), PANDAS, Paraneoplastic cerebellar degeneration (PCD), Paroxysmal nocturnal hemoglobinuria (PNH), Parry Romberg syndrome, Pars planitis (peripheral uveitis), Parsonnage-Turner syndrome, Pemphigus, Peripheral neuropathy, Perivenous encephalomyelitis, Pernicious anemia (PA), POEMS syndrome, Polyarteritis nodosa, Polyglandular syndromes type I, II, III, Polymyalgia rheumatica, Polymyositis, Postmyocardial infarction syndrome, Postpericardiotomy syndrome, Primary biliary cirrhosis, Primary sclerosing cholangitis, Progesterone dermatitis, Psoriasis, Psoriatic arthritis, Pure red cell aplasia (PRCA), Pyoderma gangrenosum, Raynaud's phenomenon, Reactive Arthritis, Reflex sympathetic dystrophy, Relapsing polychondritis, Restless legs syndrome (RLS), Retroperitoneal fibrosis, Rheumatic fever, Rheumatoid arthritis, Rheumatoid vasculitis, Sarcoidosis, Schmidt syndrome, Schnitzler syndrome, Scleritis, Scleroderma, Sjögren's syndrome, Sperm & testicular autoimmunity, Stiff person syndrome (SPS), Subacute bacterial endocarditis (SBE), Susac's syndrome, Sydenham chorea, Sympathetic ophthalmia (SO), Systemic Lupus Erythematosus, Systemic scleroderma, Takayasu's arteritis, Temporal arteritis/Giant cell arteritis, Thrombocytopenic purpura (TTP), Tolosa-Hunt syndrome (THS), Transverse myelitis, Type 1 diabetes, Ulcerative colitis (UC), Undifferentiated connective tissue disease (UCTD), Urticaria, Urticarial vasculitis, Uveitis, Vasculitis, Vitiligo, Vogt-Koyanagi-Harada Disease, and Wegener's granulomatosis (or Granulomatosis with Polyangiitis (GPA)). Accordingly, in one aspect, disclosed herein are methods of stimulating an immunological response against an autoimmune disease or treating, inhibiting, and/or preventing an autoimmune disease comprising administering to a subject a composition comprising a therapeutically effective amount of any of the neoantigens disclosed herein (for example SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163) and/or any neoantigens identified by the method of identifying a neoantigen disclosed herein. In one aspect, it is understood and herein contemplated that the disclosed neoantigens include peptide, polypeptides, and/or proteins of any disclosed neoantigen.


E. EXAMPLES

The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary and are not intended to limit the disclosure. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations have been accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in ° C. or is at ambient temperature, and pressure is at or near atmospheric.


Example 1: Dominant Neoantigen-Specific T Cell Response and Multiple-Antigen Specificity of Single T Cell Receptors

Tumor regression induced by checkpoint blockade therapy and T-cell-based immunotherapy depends on T-cell recognition of tumor antigens, particularly mutation-derived neoantigens. Increasing evidence indicates that tumor mutation loads are positively correlated with clinical response to checkpoint blockade therapy. However, despite the relatively larger numbers of somatic mutations in cancer tissues, the number of neoantigens identified from human cancer patients using tumor-reactive T cells has been unexpectedly low. The molecular mechanisms underlying the low number of neoantigens for T cell recognition remain poorly understood.


The mutated proteins must be naturally processed to short peptides [generally 9 amino acids for MHC class I molecules and variable length (13-15 amino acids) for MHC-II molecules] with specific MHC-I or -II binding motifs, and then presented on the cell surface for T cell stimulation. Furthermore, the binding affinity of neoantigens to MHC-I or -II molecules is a critical determinant of its immunogenicity and dependent on MHC-I (HLA-A, -B and -C) or MHC-II (HLA-DR, -DQ and DP) molecules. More importantly, there is accumulating evidence supporting that the mutated amino acid of neoantigenic peptides either increases the binding affinity of peptide-MHC complexes or triggers T cell receptor (TCR) to elicit T cell response through MHC/peptide-TCR contact. Therefore, antigen processing, HLA genotypes and TCRs can influence the number and quality of neoantigens that elicit T cell response against cancer cells. Recent studies show that tumor cells may develop multiple strategies to invade immune destruction by regulating antigen processing and specific loss of HLA alleles, thus rendering them susceptible or resistant to immunotherapy. Despite recent identification of many HLA class I-restricted neoantigens and their importance in cancer immunotherapy, growing evidence indicates that neoantigen-specific CD4+ T cells and their cognate neoantigens play a critical role in tumor regression. However, the current prediction programs for neoantigens are not accurate, particularly for MHC II-restricted neoantigens owing to the open pockets of MHC II molecules, thus limiting our ability to identify the full-spectrum of immunogenic neoantigens.


Because of the importance of mutation-derived neoantigens in tumor regression induced by checkpoint blockade therapy, a systemic approach may be required for identifying mutation-derived neoantigens for better understanding of T-cell recognition repertoire and potential mechanisms that control the number and quality of neoantigens for T cell recognition. Furthermore, whether a single T-cell receptor (TCR) can recognize two or more neoantigens in cancer patients remains largely unknown. In this study, is described a systemic approach to identify all possible mutation-derived neoantigens recognized by CD4+ and CD8+ T cells, and discover the immunodominance of neoantigen recognition by T cell lines and clones derived from cancer patients. The immunodominant neoantigen recognition by T cells is further supported by T cell clone analysis, intracellular staining and TCR profiling of primary T cells. Interestingly, one single TCR has the ability to recognize two neoantigen epitopes in an MHC II restricted fashion, while another TCR recognizes a MHC-I restricted neoantigen epitope when it is expressed in CD8+ T cells, but recognizes a MHC II-restricted neoantigen epitope when it is expressed in CD4+ T cells.


a) Results


(1) Systemic Identification of Somatic Mutation-Derived Neoantigens in 135 Mel

To systemically identify the maximal number of neoantigens recognized by T cells, primary tumor-infiltrating lymphocyte (TIL) lines were generated from fresh cancer tissues that had been surgically removed from two melanoma patients (#135 and #136). The resultant two TIL cell lines consisted of 54.5% CD4+, 41.9% CD8+ T cells (#135) and 82.2% CD4+, 17.2% CD8+ T cells (#136) and elicited tumor-specific responses against 135 mel and 136 mel tumor cells, respectively (FIG. 1A). Meanwhile two melanoma cell lines derived from primary patients' tumor tissues were passaged (135 mel and 136 mel). The ability of 135TIL and 136TIL to recognize autologous tumor cell lines (135 mel and 136 mel, respectively) was tested along with other control cells, including allogeneic tumor cell lines, Epstein-Barr virus-transformed B (EBV-B) cells and 293 cells (FIG. 1B), demonstrating that both 135TIL and 136TIL were autologous tumor-specific. After purification of CD4+ and CD8+ T cell populations from early TILs, tumor-reactive T-cell clones were established by limited dilution (0.3 cells/well) to eliminate the competition and loss of tumor-reactive T cells due to their differential growth property in bulk T cells. The distribution of T-cell clones on 96-well plates was completely unbiased, from which each tumor-reactive clone was picked and expanded. Tumor-reactivity of CD4+ and CD8+ T-cell clones was tested against autologous EBV-B cells, autologous tumor cells, and other allogeneic tumor cell lines, and found that these T cell clones were specific for autologous tumor cells, but did not respond to autologous EBV-B cells and other targets. Some representative T-cell clones are shown in FIG. 1C and FIG. 2, indicating that like their parental T cell lines, these T cell clones maintain specific recognition of autologous tumor cells.


Whole-exome sequencing was performed on libraries generated from genomic DNAs of 135 mel and 135TIL cells. Based on the next-generation sequencing results, around 40,000 single nucleotide variations (SNVs) were distinguished in 135 mel and 135TIL respectively after alignment with the Human Genome Version 19 (hg19). Then a multiple-step filtering pipeline was applied to determine the lower bound of SNV frequency, remove common variations in tumor and normal T cells and limit the SNV type. Finally, 232 somatic missense mutations unique to 135 mel tumor cells (FIG. 3A) were identified. Because of poor prediction of neoantigens by currently available computer-assistant algorithm, a library of 24 tandem minigenes (TMGs) containing all 232 mutations was designed and constructed. Each mutation was included in a 25 amino acid (aa) peptide sequence with 12 aa flanking each side of the mutated amino acid. The synthesized TMGs with 10 mutations per TMG were cloned into the Ii80-fusion targeting expression vector pTSX for efficient processing and presentation of T-cell epitopes to T cells for recognition.


To identify true neoantigens among the TMGs recognized by tumor-reactive T cells, HEK293 cells engineered to express MHC-I (HLA-A1, HLA-B8 or HLA-C7) and/or MHC-II molecules (HLA-DR1, HLA-DR3, HLA-DR4, HLA-DR11 and/or HLA-DP4) were used as artificial antigen-presenting cells (APCs). Based on HLA typing of patient #135, 293T-A1 and 293T-B8 (which naturally express HLA-C7) were generated for MHC-I antigen presentation, and 293IMDR3/DP4 and 293IMDR4/DP4 (expressing various HLA-DR molecules and HLA-DP4) for MHC-II antigen presentation. Neoantigens were screened for T cell recognition by transfection of 24 TMG constructs into 2931IMDR3/DP4 and 293IMDR4/DP4 cells, respectively, followed by adding CD4+135TILs for co-culture overnight. Significant IFN-γ release was found from T cells that were co-cultured with 293IMDR3/DP4 or 293IMDR4/DP4 cells transfected with TMG6, TMG17 or TMG18 (FIG. 3B). To identify individual neoantigens, all single mutations (25 aa) from TMG6, 17 and 18 were subcloned to the same vector. Each single mutation-containing construct was tested with CD4+ tumor-reactive T cells in the same MHC-II APCs. These experiments led to identification of four mutation-derived neoantigens [SPATA13 (T>A) in TMG6, PCDHB7 (R>C) and PCDHB16 (H>Y) in TMG17, and ATG5 (D>N) in TMG18] that were strongly recognized by CD4+135TILs (FIG. 4A). Next, the wild-type (WT) counterparts of each mutation-derived neoantigen were generated and tested them together with the mutated constructs, and found that 135TIL strongly recognized four neoantigens, but not WT counterparts (FIG. 3C). To further define the T-cell epitopes of these neoantigens, peptides encoding different 13-mers locating around the mutation sites were synthesized and tested for 135TIL cell recognition. Based on their ability to activate T cell response, these neoepitopes were narrowed down to 9-13 mer long through a series of deletions and truncation of T cell epitopes (FIG. 4B).


To identify MHC-I restricted neoantigens, 293T-A1 or 293T-B8 cells were co-transfected with 24 TMGs, followed by co-culture with CD8+135TIL cells, and found that CD8+ 135TIL cells recognized TMG1 presented by 293T-B8 cells, as well as TMG14 presented by 293T-A1 cells (FIG. 3D). Further experiments through subcloning each mutation from the positive TMG (containing 10 mutations), the two neoantigens [TXNIP (E>K) in TMG1 and RPN2 (L>S) in TMG14] were identified (FIG. 3E and FIG. 5A). However, These CD8+135TIL cells did not respond to WT counterparts (FIG. 3E).


Because the 293-engineered APCs do not express all HLA types, immortalized autologous APCs (135EBV-B) were used to serve as APCs for identification of all possible neoantigens presented different MHC-I and II molecules. Due to low transfection efficiency of EBV-B cells, in vitro transcribed (IVT) RNA of 24 TMGs were generated and then electroporated them into 135EBV-B cells, followed by adding 135TIL cells for antigen recognition based on cytokine release assay. All neoantigen-containing TMGs identified above positively responded for T cell recognition, but one new construct (TMG7) presented by EBV-B cells was identified that was strongly recognized by 135TIL, indicating more neoantigens presented by HLA molecules that the artificial APCs did not cover (FIG. 3F). Further investigation by TMG subcloning and re-testing revealed that MPG (G>E) in TMG7 was another MHC-I presented neoantigen (FIG. 3G and FIG. 5B), while other positive TMGs were the same as those identified above. Using the same approach to MHC-II neoantigens, all three MHC-I neoantigens were further defined to 9-11 mer peptides for T cell recognition (FIG. 5C). Titration experiments of all MHC-I and MHC-II neoantigens were performed and showed that tumor-reactive T cells strongly recognized these neoantigen peptides in a dose-dependent manner, but with little or no recognition of WT peptide sequences (FIG. 3H).


(2) Identification of Two Non-Mutated Antigens Derived from Alternative Open Reading Frames and 3′ UTR Region in 135Mel Tumor Cells

It is shown herein that tumor-reactive T cells are capable of recognizing T cell epitopes derived alternative open reading frames (ORF) and long-noncoding genes. Thus, immunogenic antigens can be generated from various sources or mechanisms [such as alternative ORFs, noncoding regions (3′ and 5′ UTRs, introns, noncoding RNA)]. To explore the possibility that some tumor-reactive T cells can recognize the cryptic T cell epitope, rather than just in the coding region based on exome sequence, the established CD4+ T-cell clones were tested for their ability recognize four newly identified MHC-II neoantigens, and CD8+ T-cell clones for their ability to recognize three MHC-1 neoantigens in 135 mel. Among all T cell clones, two T cell clones were identified that recognized 135 mel tumor cells, but did not respond to any neoantigen identified. A representative data are shown in FIG. 6A. All other tumor-reactive CD4+ T-cell clones and CD8+ T-cell clones recognized the mutation-derived neoantigens. These two T-cell clones did not respond to EBV-B cells transfected with 24 TMGs RNAs, indicating that these T cell clones recognize antigenic peptides presented by 135 mel cells through other sources, but not from somatic mutations in the coding regions. To identify these two antigens recognized by these two CD4+ T-cell clones, an Ii-fusion cDNA library derived from 135 mel tumor cells was constructed using experimental procedures. The Ii-fusion cDNA library pools (approximately 100 cDNA clones per pool) were transfected into 293IMDR3/DP4 or 293IMDR4/DP4 cells, followed by co-culturing with CD4+ T-cell clones (135-1C1 or 135-1E1). After 2×105 cDNA clones, positive cDNA library pools that stimulate two T-cell clones, respectively, were identified based on IFN-γ release by ELISA (FIG. 6B). The positive pools were then transformed into Stbl3™ Escherichia coli, and single bacteria colonies were picked for plasmid DNA isolation. The screening assays were repeated in the same APCs transfected with the single cDNA-containing plasmids, and identified three clones that were able to stimulate 135-1C1 T cells, and two clones that reacted with 135-1E1 T cells (FIG. 6C). Sequencing of each positive cDNA clone and a database search revealed that 135-1C1 T cells recognized the same antigen, which had a 56 aa open reading frame (ORF) translated from the 3′UTR of the ADIPOR2 gene (FIG. 6C and FIG. 7A). The 135-1E1 T cell-reactive cDNA clones encoded the product translated from an alternative reading frame of CTAG2 (LAGE1b) (FIG. 6C and FIG. 7A). To further demonstrate the reactivity and specificity of these two T-cell responses, ADIPOR2 and LAGE1b were constructed with their normal reading frames in the Ii-fusion vector, and found that 135-1C1 T-cell clones only responded to peptides translated from the 3′ UTR region of ADIPOR2 gene (FIG. 7B). By contrast, 135-1E1 T cells recognized the gene products translated from the alternative ORF2 of the CTAG2 gene, but did not respond to the gene products of their normal ORFs (FIG. 7C). LAGE1 is a cancer-testis antigen, with over 80% amino acid sequence identical to NY-ESO-1 protein, which is the most immunogenic tumor antigen identified to date. To define the T cell epitopes, wee generated a series of deletions and truncations of 3′ UTR region of ADIPOR2 gene, and narrowed down T cell epitope to a 26 aa peptide-coding sequence that stimulate 135-1C1 T cell response (FIG. 7D). Using similar approach, T cell epitopes were narrowed down to a 69 aa peptide-coding sequence of the LAGE1b alternative ORF2 for 135-1E1 T-cell recognition (FIG. 7E). Peptide titration experiments showed that 135-1C1 and 135-1E1 T-cell clones strongly recognized antigen peptides in a dose-dependent manner, but the bulk 135TIL cell lines showed little or no activity (FIG. 6D), indicating that 135-1C1 and 135-1E1 T-cell clones are relatively rare in the original T cell population. Taken together, these results indicate that these two tumor-reactive CD4+ T cell clones recognize nonmutated but aberrant neoantigen peptides translated from either the 3′ UTR sequence of ADIPOR2 gene or alternative ORF of CTAG2 gene.


(3) Systemic Identification of Neoantigens from the Second Patient Tumor by Tumor-Reactive T Cells

Using several different approaches (293-engineered APCs and B cells, as well as different libraries) and tumor-reactive T cells (TIL lines and clones), 6 MHC-II antigens (4 mutation-derived neoantigens and 2 derived from alternative ORF and 3′ UTR region) were identified and 3 MHC-I neoantigens in 135 mel tumor cells. It appears that the number of neoantigens recognized by T cells is rather limited, even though there are 232 somatic mutations (Table 3). To extend the findings, exome sequencing of genomic DNAs isolated from the second patient tumor (136 mel) and 136TIL cells (as a control) was performed and identified 348 somatic mutations that were unique to 136 tumor cells (FIG. 8A) (Table 4). A library was designed and constructed containing 35 TMGs (10 mutations in each TMG) in pTSX targeting expression vector based on somatic mutations. To identify MHC-II neoantigens presented by HLA-DR1, DR11 and DP4 molecules based on the HLA typing of patient #136, 293IMDR1/DP4 and 293IMDR11/DP4 cells were transfected with 35 TMGs, respectively, followed by adding tumor-reactive CD4+ T cells isolated from 136TIL cells by antibody-coated magnetic beads. As shown in FIG. 8B, three TMGs (TMG19, TMG31 and TMG 32) were identified that were positive for T cell recognition in 293IMDR1/DP4 cells, and found one TMG (TMG32) that was able stimulate CD4+ T cells to release IFN-γ in 293IMDR11/DP4 cells. After subcloning of 10 minigenes of each positive TMG into pTSX expression vector, they were tested for their ability to stimulate T cell activation in 293IMDR1/DP4 cells, and found that four neoantigens, including ZFYVE1 (H>Y) in TMG19, LMAN2 (L>R) and MAPK9 (W>C) in TMG31, and ANKIB1 (P>L) in TMG32, were capable of stimulating CD4+ T cells for IFN-γ release after overnight co-culture with T cells (FIG. 8C and FIG. 9A). By contrast, no appreciable T cell activity was observed against the corresponding WT counterparts (FIG. 8C). Because ANKIB1 can be presented by both 293IMDR1/DP4 and 293IMDR11/DP4 cells (FIG. 8B), it is likely that ANKIB1 is presented by HLA-DP4 molecules for T cell recognition. These results indicate that four MHC-II restricted neoantigens are processed and presented for T cell recognition.









TABLE 3







Amino acid sequences of somatic mutations (patient #135) for


screening in each TMG









TMG




No.
Gene name and locus
Mutant AA sequence





TMG1
AGRN location:chr1:987171 the
VDTLAFDGRTFVGYLNAVTESEKAL



33 -th exon(s) of
(SEQ ID NO: 167)



ENST00000379370




MORN1 location:chr1:2267954
KKAGGRSRGGLHSRGTPPTAQEPPG



the 12 -th exon(s) of
(SEQ ID NO: 168)



ENST00000378531




CLCN6 location:chr1:11894611
GIYDIHVGLRGVLLLEWETEVEMDK



the 16 -th exon(s) of
(SEQ ID NO: 169)



ENST00000312413




ATP13A2 location:chr1:17320224
PLVPEPRRLPVGLLLRALATCHALS



the 16 -th exon(s) of
(SEQ ID NO: 170)



ENST00000452699




KIF17 location:chr1:20998671 the
EKMQRKLRAAEVKIKDLQSEFQLEK



12 -th exon(s) of
(SEQ ID NO: 171)



EN5T00000247986




CELA3B location:chr1:22310198
WNRSCVACGNDIVLIKLSRSAQLGD



the 5 -th exon(s) of
(SEQ ID NO: 172)



ENST00000337107




EPHA10 location:chr1:38197088
QSFNPSIEVQTLEEAASGSRDQSPA



the 7 -th exon(s) of
(SEQ ID NO: 173)



ENST00000373048




TIE1 location:chr1:43773506 the
CAPGHFGADCRLRCQCQNGGTCDRF



7 -th exon(s) of EN5T00000372476
(SEQ ID NO: 174)



AGBL4 location:chrl:48999906
KHPLLRGPASNYLNSKGDKKSSVNH



the 14 -th exon(s) of
(SEQ ID NO: 175)



ENST00000371839




TXNIP location:chr1:145438902
GSGEKVAGRVIVKVCEVTRVKAVRI



the 1 -th exon(s) of
(SEQ ID NO: 176)



ENST00000582401






TMG2
POGZ location:chr1:151380993
PRTVPVSSNDTPLSALQEAAPLTSS



the 14 -th exon(s) of
(SEQ ID NO: 177)



ENST00000271715




RORC location:chr1:151789697
KICGDKSSGIHYRVITCEGCKGFFR



the 2 -th exon(s) of
(SEQ ID NO: 178)



ENST00000356728




FLG2 location:chr1:152326075 the
VHERHETTYGQTEEATGHGHSGHGQ



3 -th exon(s) of
(SEQ ID NO: 179)



ENST00000388718




IVL location:chr1:152883669 the
EQQVGQPKNLEQKEKQLELPEQQEG



2 -th exon(s) of ENST00000368764
(SEQ ID NO: 180)



FCRL1 location:chr1:157767607
TPGQLQPIYENELREQSVAVHGRQQ



the 8 -th exon(s) of
(SEQ ID NO: 181)



ENST00000358292




LY9 location:chr1:160783603 the
SESNGGSILTVSQTPCDPDLPYICT



3 -th exon(s) of
(SEQ ID NO: 182)



ENST00000263285




TBX19 location:chr1:168278072
WTSLSSTPHASIMSVPHTNGPINPG



the 7 -th exon(s) of
(SEQ ID NO: 183)



ENST00000367821




RGS16 location:chr1:182572416
LGIFLHKSELGCNTGSTGKFEWGSK



the 2 -th exon(s) of
(SEQ ID NO: 184)



ENST00000367558




FAM129A
LEEAYTLVQHQVPEGLSALKEECRA



location:chr1:184792424 the 8 -th
(SEQ ID NO: 185)



exon(s) of ENST00000367511




BRINP3 location:chr1:190129856
QLFLKAQKIVHKFFSLSKRCHKQPL



the 7 -th exon(s) of
(SEQ ID NO: 186)



ENST00000367462






TMG3
CRB1 location:chr1:197403848
QPVLQGFECIANVVFNGQSGQILFR



the 7 -th exon(s) of
(SEQ ID NO: 187)



ENST00000367399




NENF location:chr1:212619328
NLDFKPEDQPHFYIKDEF



the 4 -th exon(s) of
(SEQ ID NO: 188)



ENST00000366988




USH2A location:chr1:216074197
LPPRLSSATPTSIQVVWSTPARNNA



the 39 -th exon(s) of
(SEQ ID NO: 189)



ENST00000307340




CCDC185
HAVEGQKKVQDTSLSSLINYQARKV



location:chr1:223568085 the 1 -th
(SEQ ID NO: 190)



exon(s) of ENST00000366875




ABCB10 location:chrl :229654072
LMDGRTVLVIAHCLSTIKNANMVAV



the 13 -th exon(s) of
(SEQ ID NO: 191)



ENST00000344517




RYR2 location:chr1:237777461
ALGNHRVAHALCNHVDEPQLLYAIE



the 37 -th exon(s) of
(SEQ ID NO: 192)



ENST00000366574




OR2L3 location:chr1:248224771
YAPFVYTYLRPRFLRSPTEDKVLAV



the 1 -th exon(s) of
(SEQ ID NO: 193)



ENST00000359959




CUBN location:chr10:16893368
NNTFASPDSDSNRMYDKNLNCVWII



the 60 -th exon(s) of
(SEQ ID NO: 194)



ENST00000377833




SH2D4B location:chr10:82363415
QRARDEYRHHSLCAIQKGTVAGLSS



the 5 -th exon(s) of
(SEQ ID NO: 195)



ENST00000339284




ATRNL1
ANICHLHTGKCFRTTKGIKGDQCQL



location:chr10:117154225 the 20-
(SEQ ID NO: 196)



th exon(s) of ENST00000355044






TMG4
MUC6 location:chr11:1031883 the
VQLRRGPDGSISWIIVELGASVVTV



3 -th exon(s) of
(SEQ ID NO: 197)



ENST00000421673




OR51A4 location:chr11:4967678
LCLMVDFILIAVFYTLILKTVLGIA



the 1 -th exon(s) of
(SEQ ID NO: 198)



ENST00000380373




HBD location:chr11:5255429 the
VGGEALGRLLVVFPWTQRFFESFGD



2 -th exon(s) of ENST00000380299
(SEQ ID NO: 199)



OR52E4 location:chr11:5906376
IHILLANLYVVVLPALNPVIYGVRT



the 1 -th exon(s) of
(SEQ ID NO: 200)



ENST00000316987




NCR3LG1
IGVGLVLLIVLISWKKICNKSSSAY



location:chr11:17390500 the 4 -th
(SEQ ID NO: 201)



exon(s) of ENST00000338965




E2F8 location:chr11:19247118 the
PVTSSELTAVNFSSFHVTPLKLMVS 202



12 -th exon(s) of




ENST00000250024




OR5D16 location:chr11:55607024
HGTILFLYCVPNFKNSRHTVKVASV 203



the 1 -th exon(s) of




ENST00000378396




TRIM49 location:chr11:89537525
VTIDCGHSFCRPSFYLNWQDIPFLV 204



the 3 -th exon(s) of




ENST00000329758




TMPRSS5
LRCSECGARPLAFRIVGGQSVAPGR 205



location:chr11:113565338 the 8 -th




exon(s) of ENST00000299882




CEP164 location:chr11:117258122
EEILRLHQQKEQFLSSLRERLQKAI 206



the 15 -th exon(s) of




ENST00000278935






TMG5
PLEKHG6
PLLLHAVLKRSPKARAQEALNAMIE 207



location:chr12:6426830 the 9 -th




exon(s) of ENST00000396988




C1S location:chr12:7169847 the
VYAEPTMYGEILFPNYPQAYPSEVE 208



3 -th exon(s) of ENST00000328916




GDF3 location:chr12:7843214 the
KLLYFNLSAIKEGEQLTLAQLGLDL 209



2 -th exon(s) of




ENST00000329913




KRT83 location:chr12:52709846
AAVAQSEQQGEATLSDARCKLAELE 210



the 7 -th exon(s) of




ENST00000293670




LRP1 location:chr12:57589739 the
SRQWECDGENDCLDQSDEAPKNPHC 211



54 -th exon(s) of




ENST00000243077




KIF5A location:chr12:57965854
RLQEVSGHQRKRIAEVLNGLMKDLS 212



the 14 -th exon(s) of




ENST00000286452




NR2C1 location:chr12:95442978
SRAFDTLAKALNAGESTACQSSVAG 213



the 9 -th exon(s) of




ENST00000333003




CUX2 location:chr12:111757993
EVAPRGRSVPPSLPERPSLATASQN 214



the 17 -th exon(s) of




ENST00000261726




PTPN11
LKYDVGGGERFDALTDLVEHYKKNP 215



location:chr12:112892407 the 5 -th




exon(s) of ENST00000392597




TBX3 location:chr12:115117732
FPSDHATWQGNYGFGTQTILNSMHK 216



the 3 -th exon(s) of




ENST00000257566






TMG6
NCOR2 location:chr12:124810031
PPPPGLPAGSGPFAGPHHAWDEEPK 217



the 48 -th exon(s) of




ENST00000429285




SPATA13
AEPKRKSSLFWHAFNRLTPFRK 218



location:chr13:24876879 the 13 -th




exon(s) of ENST00000382108




CCDC168
PSQPKLPISSGAEKSRLANSNEGIS 219



location:chr13:103388489 the 4 -th




exon(s) of ENST00000322527




COL4A1
TPGPTGPAGQKGKPGSDGIPGSAGE 220



location:chr13:110826298 the




40 -th exon(s) of ENST00000375820




ARHGAP5
RHQREIVEKAKEKFQEMLFEHSELF 221



location:chr14:32561340 the 2 -th




exon(s) of ENST00000345122




MIPOL1 location:chr14:37838743
AALSKCKRLEQEFHHVKEQNQTSAN 222



the 11 -th exon(s) of




ENST00000327441




SPTB location:chr14:65263382 the
EEAEYRRELALRHELIRQEKLEQLA 223



10 -th exon(s) of




ENST00000389722




NPAP1 location:chr15:24924213
GQAAWDPTGHSMTAAPQGASNIPVF 224



the 1 -th exon(s) of




ENST00000329468




RYR3 location:chr15:34065794
YHILCSLYSLGTRKNIYVERQRPAL 225



the 64 -th exon(s) of




ENST00000389232




RASGRP1
LLFDHLEPEELSKHLTYLEFKSFRR 226



location:chr15:38808442 the 6 -th




exon(s) of ENST00000310803






TMG7
RASGRP1
MGTLGKEREAPRKPSHGCR 227



location:chr15:38856810 the 1 -th




exon(s) of ENST00000310803




TRIM69 location:chr15:45047438
KKLPLLKGHPQCQEHGENLKLFSKP 228



the 2 -th exon(s) of




ENST00000329464




CYP11A1
FGERQGMLEEVVSPEAQRFIDAIYQ 229



location:chr15:74636279 the 4 -th




exon(s) of ENST00000268053




CHSY1 location:chr15:101775370
RRMVPHIGKCLRKMYTTHEDVEVGR 230



the 2 -th exon(s) of




ENST00000254190




MPG location:chr16:135400 the
YGMYFCMNISSQEDGACVLLRALEP 231



4 -th exon(s) of ENST00000397817




CLEC16A
VARSAAVETASLFPSLVPARQPTIS 232



location:chr16:11272324 the 23 -th




exon(s) of EN5T00000409790




TXNDC11
VIIPAKPPVSFFFLRSPVLDLFQGQ 233



location:chr16:11830039 the 2 -th




exon(s) of ENST00000283033




ABCC1 location:chr16:16139715
PQILKLLIKFVNNTKAPDWQGYFYT 234



the 9 -th exon(s) of




ENST00000399410




Cl6orf58 location:chr16:31504800
PAPSLSLGVPLHLLVSSVFELQQLV 235



the 9 -th exon(s) of




EN5T00000327237




ADCY7 location:chr16:50324484
YVECLLRRWLRAFALLTWACLVALG 236



the 2 -th exon(s) of




ENST00000254235






TMG8
ADCY7 location:chr16:50324485
VECLLRRWLRALTLLTWACLVALGY 237



the 2 -th exon(s) of




ENST00000254235




DYNC1LI2
KLQGAEHGKKGRDLEYLYLSVHDED 238



location:chr16:66783144 the 3 -th




exon(s) of ENST00000258198




FBXL8 location:chr16:67196929
RAPGLRGLRLECCGEKPLFDAGRDV 239



the 3 -th exon(s) of




ENST00000258200




SLC12A4
IRPKVSSLLGKLISYTNLTQGAKEH 240



location:chr16:67995561 the 3 -th




exon(s) of ENST00000316341




PMFBP1 location:chr16:72174425
DHSKVRIYTSPCIIQEHQETQKRLS 241



the 6 -th exon(s) of




ENST00000237353




MYH4 location:chr17:10356983
DDLELTLAKVEKKKHATENKVKNLT 242



the 23 -th exon(s) of




ENST00000255381




DNAH9 location:chr17:11833330
SPEGHIIPQGILQNSIKITNEPPTG 243



the 9 -th exon(s) of




ENST00000608377




FLCN location:chr17:17117084
EDTQKLLSILGAFEEDNVKLLKFWM 244



the 14 -th exon(s) of




ENST00000285071




FLII location:chr17:18160249 the
LKLNRTGLCYLPKELAALQKLEHLS 245



2 -th exon(s) of




ENST00000545457




SHMT1 location:chr17:18232625
TPALTSRGLLEKNFQKVAHFIHRGI 246



the 10 -th exon(s) of




ENST00000352886






TMG9
FBXW10 location:chr17:18682399
RIYTALDPFRVNAEFVLLTVKEEKE 247



the 13 -th exon(s) of




ENST00000301938




SLFN14 location:chr17:33884597
LREKGFRAQRGRLRVKKLHPQQVLN 248



the 1 -th exon(s) of




ENST00000415846




KRT24 location:chr17:38859509
GLFSGGEKQTMQSLNDRLANYLDKV 249



the 1 -th exon(s) of




ENST00000264651




KRT31 location:chr17:39551303
LQAQHNLRDSLEKTLTESEARYSSQ 250



the 6 -th exon(s) of




ENST00000251645




CDC27 location:chr17:45234359
PDTVPLGTGTSIFSKQVQNKPKTGR 251



the 7 -th exon(s) of




ENST00000066544




CDC27 location:chr17:45234397
SSVSYIDSAVISSDTVPLGTGTSIL 252



the 7 -th exon(s) of




ENST00000066544




BZRAP1 location:chr17:56395792
GSGPKDLDLPPGFPGRCTPKSSEPA 253



the 13 -th exon(s) of




ENST00000268893




ABCA8 location:chr17:66871534
VAIMVSGRLRCICSIQHLKSKFGKD 254



the 36 -th exon(s) of




ENST00000430352




Cl7orf62 location:chr17:80402458
TLFRAGHDQVVVQLHDVRDVSVEEE 255



the 5 -th exon(s) of




ENST00000434650




FOXK2 location:chr17:80543800
AQSAPGSPLSSQSVLITVQRQLPQA 256



the 7 -th exon(s) of




ENST00000335255






TMG10
L3MBTL4 location:chr18:6244527
NGFQIGMRLEGIHPRHPSVFCVLSV 257



the 6 -th exon(s) of




ENST00000400105




DSC1 location:chr18:28720131
MCTTTVTVKIIDNDEGPECHPPVKV 258



the 10 -th exon(s) of




ENST00000257197




CCDC178
GTLFHLTKHKTDKMEDKIAEVRRKF 259



location:chr18:30825250 the 14 -th




exon(s) of ENST00000403303




TCEB3B location:chr18:44560939
VSHSKGHKSSRQKKRPLCAQGDWHS 260



the 1 -th exon(s) of




ENST00000332567




CPLX4 location:chr18:56964118
EMDENQIQMAGDNVDLPEDLRKMVD 261



the 3 -th exon(s) of




ENST00000299721




MUC16 location:chr19:9046760
VDTRSGVPTTTILPSIPGVVTSQVT 262



the 5 -th exon(s) of




ENST00000397910




ZNF560 location:chr19:9577898
LRTHAGEKPYECIKCGKAFTERSYL 263



the 10 -th exon(s) of




ENST00000301480




ATG4D location:chr19:10662790
CQPTVDVSQADFLLESFHCTSPRKM 264



the 8 -th exon(s) of




ENST00000309469




ZNF91 location:chr19:23544724
KRIHTGEKPYKCEECGKAFSNSSTL 265



the 4 -th exon(s) of




ENST00000300619




KIRREL2
LVPPEAPQVLGGLSVSLVAGVPANL 266



location:chr19:36349639 the 4 -th




exon(s) of ENST00000360202






TMG11
ZNF607 location:chr19:38189938
TAPHTFESVEKPFKCEECGKAFSVH 267



the 5 -th exon(s) of




ENST00000355202




CYP2A13
ATFDWLFKGYGVVFSNGERAKQLRR 268



location:chr19:41595958 the 3 -th




exon(s) of ENST00000330436




PRKD2 location:chr19:47177874
RYITHESDDARWEQFAAEHPLPGSG 269



the 19 -th exon(s) of




ENST00000595515




PRKD2 location:chr19:47177874
RWEQFAAEHPLPESGLPTDRDLGGA 270



the 17 -th exon(s) of




ENST00000600194




LRRC4B location:chr19:51021783
KCRTGTSMTSVNLLTPNGTLMTHGS 271



the 3 -th exon(s) of




ENST00000389201




SIGLEC10
STQAPESQESQEKLHYATLNFPGVR 272



location:chr19:51914457 the 10 -th




exon(s) of ENST00000353836




KIR2DL1
FPLGPATHGGTYKCFGSFHDSPYEW 273



location:chr19:55286836 the 4 -th




exon(s) of ENST00000336077




NLRP2 location:chr19:55494015
CGDWEKKKPVPVILGSLLNRVMLPK 274



the 5 -th exon(s) of




ENST00000537859




NLRP4 location:chr19:56379171
NNKKLTYLNVSCKQLDTGVPLLCEA 275



the 6 -th exon(s) of




ENST00000301295




OTOF location:chr2:26684954 the
FFTGEKSSDIFVKGWLKGQQEDKQD 276



25 -th exon(s) of




ENST00000338581






TMG12
SOS1 location:chr2:39213258 the
VFSSSPLHLQPPSLGKKSDHGNAFF 277



23 -th exon(s) of




ENST00000402219




EGR4 location:chr2:73519196 the
APGDLGEGAEGLSGLLTPPSGEGGS 278



2 -th exon(s) of




ENST00000545030




DNAH6 location:chr2:84949894
MYFVIASLSEIDLMYQYSLKYFKQL 279



the 60 -th exon(s) of




ENST00000389394




TBC1D8 location:chr2:101654034
KSPLMHPDALVTVFQQSGSQSPDSR 280



the 8 -th exon(s) of




ENST00000376840




RNF149 location:chr2:101898473
GDVQEMPAPESPLGRDPAANLSLAL 281



the 6 -th exon(s) of




ENST00000295317




ZC3H6 location:chr2:113089764
GGLKSSDKTEPSLGEAILPQKPSPN 282



the 12 -th exon(s) of




ENST00000409871




PTPN4 location:chr2:120734607
RFVCEAILKVYEDGFVKPLTTSTNK 283



the 27 -th exon(s) of




ENST00000263708




PTPN4 location:chr2:120734627
LKVYEEGFVKPLKTSTNK 284



the 27 -th exon(s) of




ENST00000263708




BIN1 location:chr2:127821176 the
FYVNTFQSIAGLQENFHKEMSKLNQ 285



9 -th exon(s) of




ENST00000351659




NEB location:chr2:152380916 the
QAAKQASEVEYRVKHRKEGSHGLSM 286



153 -th exon(s) of




ENST00000427231






TMG13
NEB location:chr2:152380916 the
TVADRPDIKKATLAAKQASEVEYRA 287



125 -th exon(s) of




ENST00000409198




NEB location:chr2:152527587 the
QHPDTVKFTSVPNSMGMVLAQHNTK 288



38 -th exon(s) of




ENST00000427231




PMS1 location:chr2:190656594
TVRLLSSSQIITLVVSVVKELIENS 289



the 2 -th exon(s) of




ENST00000441310




ABCA12 location:chr2:215917230
GTYTFNGSQVLAQILGLEKLLKQNS 290



the 5 -th exon(s) of




ENST00000272895




ZNF142 location:chr2:219521044
LLIPPPLSNRGIMGPVQSPCPSRDP 291



the 4 -th exon(s) of




ENST00000449707




SLC23A3
PLPEDPGDEEGGFSEPEEMADLLPG 292



location:chr2:220026722 the 11 -th




exon(s) of ENST00000295738




TRPM8 location:chr2:234888922
LGLFYFIAGIVFWLHSSNKSSLYSG 293



the 18 -th exon(s) of




ENST00000324695




COL6A3 location:chr2:238274484
VSVVANTPSGPVKAFDFDEYQPEML 294



the 11 -th exon(s) of




ENST00000353578




UBE2F location:chr2:238925240
PDEGYYQGGKFQLETEVPDAYNMVP 295



the 5 -th exon(s) of




ENST00000272930




EBF4 location:chr20:2690234 the
LTHEIMCSRCCDQKSCGNRNETPSD 296



7 -th exon(s) of




ENST00000380648






TMG14
PLCB1 location:chr20:8862399
ISEDSNHGSAPLFLSSDPGKVNHKT 297



the 32 -th exon(s) of




ENST00000338037




PAK7 location:chr20:9561130 the
LSKPSEYSDLKWKYQRASSSSPLDY 298



4 -th exon(s) of




ENST00000353224




FLRT3 location:chr20:14306307
ISKEEFVIHTIFSPNGMNLYKNNHS 299



the 3 -th exon(s) of




ENST00000341420




RPN2 location:chr20:35812712
RLKASLDRPFTNSESAFYSIVGLSS 300



the 2 -th exon(s) of




ENST00000373622




ZNFX1 location:chr20:47887948
DNFQQWRTPHQKLTEQPQQAKKLGY 301



the 3 -th exon(s) of




ENST00000396105




RNF114 location:chr20:48561937
HGCRKNFFLSKIWSHVATCSKYQNY 302



the 3 -th exon(s) of




ENST00000244061




ATP9A location:chr20:50290746
LVMVALQHFAGRSYLQIIRFLLLFS 303



the 11 -th exon(s) of




ENST00000338821




CTCFL location:chr20:56093816
PFKCSMCKYASVKVKPFLDLKLHGI 304



the 4 -th exon(s) of




ENST00000608158




CTCFL location:chr20:56093816
PFKCSMCKYASVKERHMTAHIRTHT 305



the 4 -th exon(s) of




ENST00000432255




CLIC6 location:chr21:36079611
KAGYDGESIGNCSFSQRLFMILWLK 306



the 2 -th exon(s) of




ENST00000349499






TMG15
CECR1 location:chr22:17662383
WKKRWDKFIADVDTK 307



the 10 -th exon(s) of




ENST00000399837




MYO18B
QVAQMRIEYLEQFTVDRAIVSRQEA 308



location:chr22:26348292 the 38 -th




exon(s) of EN5T00000335473




MYO18B
LAAVRQTLQTDLKTSIRRIADLQAA 309



location:chr22:26388383 the 40 -th




exon(s) of ENST00000335473




SEZ6L location:chr22:26709798
GVVLSPNWPEPYLEGEDCIWKIHVG 310



the 9 -th exon(s) of




ENST00000360929




ELFN2 location:chr22:37770923
VTKNYDRLQCESSREFAGYPLLVPR 311



the 3 -th exon(s) of




ENST00000402918




GTPBP1 location:chr22:39122341
IKLNDTLLLGPDSLGNFLSIAVKSI 312



the 8 -th exon(s) of




ENST00000216044




EFCAB6 location:chr22:44064813
GPPTVSPVLVPKNQLLSEHLQKDEQ 313



the 16 -th exon(s) of




ENST00000262726




PNPLA3 location:chr22:44323024
VSDGENVLVSDFWSKDEVVDALVCS 314



the 2 -th exon(s) of




ENST00000216180




GRAMD4
DRLNEIKGHLEIGLLEKHFLQEELR 315



location:chr22:47033834 the 2 -th




exon(s) of ENST00000361034




NUP210 location:chr3:13383354
TLVALDEALDNYIITFLIRGVAIGQ 316



the 23 -th exon(s) of




ENST00000254508






TMG16
SEMA3G location:chr3:52469912
IVASQLDNLFPPKPKPEEPPARGGL 317



the 16 -th exon(s) of




ENST00000231721




FOXP1 location:chr3:71026116
EIYNWFTRMFAYLRRNAATWKGAIR 318



the 17 -th exon(s) of




ENST00000615603




ABI3BP location:chr3:100513845
KPYPEVSQSEPASLETRGIPFIPMI 319



the 22 -th exon(s) of




ENST00000284322




RBPJ location:chr4:26426018 the
TKVALFNRLRSQRVSTRYLHVEGGN 320



7 -th exon(s) of




ENST00000355476




MMRN1 location:chr4:90856108
SSLSEDLESTRQKIQKVNESVVSIA 321



the 6 -th exon(s) of




ENST00000264790




BANK1 location:chr4:102965052
RRQSDDDKFCGLSKKQDRARIESPA 322



the 11 -th exon(s) of




ENST00000444316




NPY1R location:chr4:164247357
HWVFGEAMCKLNLFVQCVSITVSIF 323



the 2 -th exon(s) of




ENST00000296533




TLL1 location:chr4:166915559 the
IGFGLEQNNTVKRKVPLQFSGQNEK 324



4 -th exon(s) of




ENST00000061240




TLL1 location:chr4:166924631 the
GKNCDKFGIVVHKLGHVIGFWHEHT 325



6 -th exon(s) of




ENST00000061240




FRG1 location:chr4:190878626
IRCNEAGDIEAKNKTAGEEEMIKIR 326



the 6 -th exon(s) of




ENST00000226798






TMG17
DNAH5 location:chr5:13901576
DMISKLYTKQKYNPPLARNQPPIAG 327



the 14 -th exon(s) of




ENST00000265104




IL7R location:chr5:35873708 the
DHYFKGFWSEWSSSYYFRTPEINNS 328



5 -th exon(s) of ENST00000303115




C7 location:chr5:40979942 the
HCQGRNYTLTGRNSCTLPASAEKAC 329



17 -th exon(s) of ENST00000313164




FBN2 location:chr5:127640712
NVCSHGLCVDLQRSYQCICHNGFKA 330



the 45 -th exon(s) of




ENST00000262464




PCDHA9 location:chr5:140230541
IIFFLERYYRLLSGAVQIVLFIFLE 331



the 1 -th exon(s) of




ENST00000378122




PCDHB7 location:chr5:140552768
FQLLLEKPFQIFCAELWVRDINDHA 332



the 1 -th exon(s) of




ENST00000231137




PCDHB16
ENSPLGTEFPLNYALDLDVGSNNVQ 333



location:chr5:140562600 the 1 -th




exon(s) of ENST00000609684




GRIA1 location:chr5:153085554
EGRDQTTSDQSNKFGIFNSLWFSLG 334



the 11 -th exon(s) of




ENST00000518783




PWWP2A
RKDNGLLVRQEACISWFGSPTTSFL 335



location:chr5:159519583 the 2 -th




exon(s) of ENST00000307063




GABRP location:chr5:170235679
RNVLYFILETYVLSTFLVVLSWVSF 336



the 8 -th exon(s) of




ENST00000265294






TMG18
F13A1 location:chr6:6318782 the
LPTVELQGVVPRDVNLQEFLNVTSV 337



2 -th exon(s) of




ENST00000264870




ZSCAN31 location:chr6:28294281
NEHRRSHTGEKPHQCKECGKAFSAS 338



the 3 -th exon(s) of




ENST00000446474




GLTSCR1L
SQIILKGSGQQASSNVSGGLLVHRQ 339



location:chr6:42796807 the 5 -th




exon(s) of ENST00000394168




ADGRF5 location:chr6:46834827
CIFRYKNSYSIAAKDVIVHPLPLKL 340



the 13 -th exon(s) of




ENST00000265417




PKHD1 location:chr6:51893144
TLSRNISNIAGGKTLVIGVARLMNY 341



the 30 -th exon(s) of




ENST00000340994




GFRAL location:chr6:55264062
KHANKITLTGFHFPFNGEVIYAAMC 342



the 7 -th exon(s) of




ENST00000340465




EYS location:chr6:66044988 the
CSCLSEEDSQEYWYLCFLRWAGNMY 343



11 -th exon(s) of




ENST00000503581




TBX18 location:chr6:85447046
VPATHPHLLSGSFCSSPAFHLGPNT 344



the 8 -th exon(s) of




ENST00000369663




MAP3K7 location:chr6:91296538
SSSSSAGEMIEALSQVLNFEEIDYK 345



the 1 -th exon(s) of




ENST00000369325




ATG5 location:chr6:106764056
MTDDKDVLRNVWFGRIPTCFT 346



the 2 -th exon(s) of




ENST00000343245






TMG19
REV3L location:chr6:111678229
AADEKALFHEIASIIKRYDPDILLG 347



the 19 -th exon(s) of




ENST00000358835




NHSL1 location:chr6:138753933
APANRENGSQAMSYNCRNNLAFPAH 348



the 6 -th exon(s) of




ENST00000343505




DGKB location:chr7:14733729 the
LEEWIQGGMTTISLLVLLGLENNVK 349



8 -th exon(s) of




ENST00000399322




SP4 location:chr7:21468406 the
PENNNKKPKTSGFQDSQPSPLALLA 350



2 -th exon(s) of ENST00000222584




GHRHR location:chr7:31015410
IVLSVGVNFGLFFNIIRILVRKLEP 351



the 10 -th exon(s) of




ENST00000326139




SEPT14 location:chr7:55910709
LFEYHDSRVHVCFYFISPTGHSLKS 352



the 5 -th exon(s) of




ENST00000388975




PCLO location:chr7:82785328 the
KEQGKPEGIIKPLLQQQPPKPIPKQ 353



2 -th exon(s) of




ENST00000333891




LRRD1 location:chr7:91793994
IKYVKYLYLDKNKIKTFQGADSGDL 354



the 1 -th exon(s) of




ENST00000430130




TSPAN12
SVELLKPSLSRILEHTSMANSFNTH 355



location:chr7:120428705 the 8 -th




exon(s) of EN5T00000222747




ZNF783 location:chr7:148963799
RNRNFWILRLPPDSKGEAPKVPVTF 356



the 2 -th exon(s) of




ENST00000434415






TMG20
TMEM176A
VAVLAGAAAFIYKKRGGTYWALLRT 357



location:chr7:150500521 the 4 -th




exon(s) of ENST00000004103




MSRA location:chr8:10102709 the
TQVGFAGGYTSNSTYKEVCSEKTGH 358



3 -th exon(s) of




ENST00000317173




TEX15 location:chr8:30701132
QVNECEAIMEHCFDCFDFSLSVPFT 359



the 1 -th exon(s) of




ENST00000256246




FGFR1 location:chr8:38287286
TRITGEEVEVQDFVPADSGLYACVT 360



the 3 -th exon(s) of




ENST00000397091




PCMTD1 location:chr8:52732961
PQNLLREKIMKLSLPESLKAYLTYF 361



the 4 -th exon(s) of




ENST00000544451




TOX location:chr8:59728090 the
MHPSLPRNIAPKLNNQMPVTVSIAN 362



7 -th exon(s) of EN5T00000361421




COPS5 location:chr8:67958099
AQLGRGSFMLGLQTHDRKSEDKLAK 363



the 7 -th exon(s) of




ENST00000357849




TG location:chr8:133925498 the
FYQVLTSEASQDRLGCVKCPEGSYS 364



20 -th exon(s) of




ENST00000220616




GSDMD location:chr8:144642058
APGQAKIAGGAAESDSSSTSMNVYS 365



the 3 -th exon(s) of




ENST00000262580




VPS13A location:chr9:79986036
GAVARPTGGIIDIASSTFQGIKRAT 366



the 67 -th exon(s) of




ENST00000357409






TMG21
DIRAS2 location:chr9:93375725
MLVGNKCDESPSCEVQSSEAEALAR 367



the 2 -th exon(s) of




ENST00000375765




CDK5RAP2
EERIQALEEDLRKKEREIATEKKNS 368



location:chr9:123290182 the 10 -th




exon(s) of ENST00000360190




FBXW2 location:chr9:123550377
RHLSNNLETLLKWDFLKLLPLELSF 369



the 3 -th exon(s) of




ENST00000608872




ST6GALNAC6
LWDLRRVRGEAAFAQPLGQGPSSGQ 370



location:chr9:130648830 the 7 -th




exon(s) of ENST00000373142




TLR7 location:chrX:12904948 the
SVNKISPSGDSSKVGFCSNARTSVE 371



3 -th exon(s) of




ENST00000380659




MAGEB1 location:chrX:30269305
IWKFMNVLGAYDEEEHLIYGEPRKF 372



the 4 -th exon(s) of




ENST00000378981




FAM47C location:chrX:37027628
PPEAGVSHLCPELPKTRVPPLRPET 373



the 1 -th exon(s) of




ENST00000358047




GPR34 location:chrX:41555747
FTICFVPYHAFRLIYISSQLNVSSC 374



the 3 -th exon(s) of




ENST00000378142




ZNF81 location:chrX:47775337
CGKAFTQKSTLRTHQRIHTGERSYI 375



the 5 -th exon(s) of




ENST00000338637




HDAC6 location:chrX:48673271
LEFQPQLVLVAARFDALQGDPKGEM 376



the 13 -th exon(s) of




ENST00000334136






TMG22
XAGE3 location:chrX:52896139
MIWRGRSTCRPRPRRSV 377



the 2 -th exon(s) of




ENST00000346279




PHF8 location:chrX:54040917 the
SWVENLWPEECVVERPNVQKYCLMS 378



7 -th exon(s) of




ENST00000322659




TEX11 location:chrX:69828949
SLESRANEAQWFQKTAWNLAVQCDK 379



the 23 -th exon(s) of




ENST00000395889




ACRC location:chrX:70823968
SDDSEAPDDSSDNSEASDDSSDDSE 380



the 8 -th exon(s) of




ENST00000373696




PCDH19 location:chrX:99551483
PLHLKSSLPTKPFVSYTIALAPPAR 381



the 5 -th exon(s) of




ENST00000420881




NRK location:chrX:105167164 the
VGKISPPVYLTNKWVGYNALSEIFR 382



18 -th exon(s) of




ENST00000243300




COL4A6
TGPQGFTGSTGLLGLKGERGFPGLL 383



location:chrX:107464534 the 4 -th




exon(s) of ENST00000334504




OR13H1 location:chrX:130678638
KLTCSDTSLNEFIILITSIFTLLLP 384



the 1 -th exon(s) of




ENST00000338616




MAGEC1
EDSLSPLHFPQFSPQGEDFQSSLQS 385



location:chrX:140995378 the 4 -th




exon(s) of ENST00000285879




MAMLD1
KSMLPHFAMVGNCQEPRKLQESGTV 386



location:chrX:149613849 the 1 -th




exon(s) of ENST00000426613






TMG23
PDZD4 location:chrX:153073863
GTQTDITFEHIMVLGKLRPPTPPMV 387



the 2 -th exon(s) of




ENST00000164640




FLNA location:chrX:153593209
VGKSADFVVEAIEDDVGTLGFSVEG 388



the 12 -th exon(s) of




ENST00000369850




F8 location:chrX:154157374 the
IKWNEANRPGKVLFLRVATESSAKT 389



14 -th exon(s) of




ENST00000360256




ATP13A2 location:chr1:17320225
PLVPEPRRLPVGSLLRALATCHALS 390



the 16 -th exon(s) of




ENST00000452699




UBE2F location:chr2:238925240
KDEGYYQGGKFQLETEVPDAYNMVP 391



the 3 -th exon(s) of




ENST00000414443




ZNFX1 location:chr20:47887949
DNFQQWRTPHQKSTEQPQQAKKLGY 392



the 3 -th exon(s) of




ENST00000396105






TMG24
CTCFL location:chr20:56093816
PFKCSMCKYASVKASKLKRHVRSHT 393



the 5 -th exon(s) of




ENST00000371196




FOXP1 location:chr3:71026116
EIYNWFTRMFAYLRRNAATWKNAVR 394



the 11 -th exon(s) of




ENST00000614176




F13A1 location:chr6:6318783 the
LPTVELQGVVPRSVNLQEFLNVTSV 395



2 -th exon(s) of




ENST00000264870




TG location:chr8:133925499 the
FYQVLTSEASQDELGCVKCPEGSYS 396



20 -th exon(s) of




ENST00000220616




HDAC6 location:chrX:48673272
LEFQPQLVLVAAEFDALQGDPKGEM 397



the 13 -th exon(s) of




ENST00000334136




MAMLD1
KSMLPHFAMVGNCQEPRKLQESGKK 398



location:chrX:149613849 the 1 -th




exon(s) of ENST00000262858
















TABLE 4







Amino acid sequences of somatic mutations (patient #136) for


screening in each TMG









TMG No.
Gene name and locus
Mutant AA sequence





TMG1
PRDM16:NM_022114:exon14:
APVSQHPGVLTNYLGTSASSPTSES



c.C3139T:p.H1047Y,
399



C1QC:NM_172369:exon3:c.G446A:
LIRFNAVLTNPQEDYDTSTGKFTCK



p.G149E,
400



PIGV:NM_017837:exon3:c.G1111A:
QRSKNNKTLEKPNLGFLSPQVFVYV



p.D371N,
401



HPDL:NM_032756:exon1:c.G377C:
GAATYAVVSSPAAILSLTLLERAGY



p.G126A,
402



MROH7:NM 001039464:exon12:
FLGPQQIKDLLLGALEGLKGSSEAP



c.C2099G:p.A700G,
403



LRRIQ3:NM_001105659:exon7:
NAVLREKKQHFFSAYPQPIYTTHPK



c.C1135T:p.P379S,
404



FAM19A3:NM_182759:exon4:
WCQMEPCLPGEEYKVLPDLSGWSCS



c.G326A:p.C109Y,
405



FCRL5:NM_001195388:exon12:
GPFATGVAGGLLTIAGLAAGALLLY



c.G2573C:p.S858T,
406



FCRL1:NM_001159398:exon2:
MLPRLLLLICALLCEPAELFLIAS



c.C35T:p.P12L,
407



OR6K2:NM_001005279:exon1:
DHYLAICSPLHYSSIMTPKLCTQLT



c.C394T:p.P132S,
408





TMG2
VSIG8:NM_001013661:exon3:
SINLMNLQVSDTVTYECRVKKTTMA



c.C365T:p.A122V,
409



RASAL2:NM_170692:exon14:
QNGSRSRQQSSSFRESPVPKVRAIQ



c.C3197T:p.S1066F,
410



HHAT:NM_001170588:exon10:
TSMLILSNLVFLEGNEVGKTYWNRI



c.G1148A:p.G383E,
411



KCNK1:NM_002245:exon2:c.G727A:
YVPGEGYNQKFRKLYKIGITCYLLL



p.E243K,
412



OR2G3:NM_001001914:exon1:
TECILLADMALDWYIAVCKPLHYVV



c.C364T:p.R122W,
413



OSBPL5:NM_145638:exon12:
TDSRTFYIAEQVFHHPPVSAFHVSN



c.C1226T:p.S409F,
414



MRGPRX1:NM_147199:exon1:
TISKILYPVMMFFYFAGLSFLSAVS



c.C314T:p.S105F,
415



AGBL2:NM_024783:exon10:c.C955T:
FRVQNTRKDATYCFTIVNLLKPKSL



p.R319C,
416



OR4A16:NM_001005274:exon1:
DAIYSTAMSPKLIIDLLCDKIAISL



c.G240T:p.M80I,
417



OR4C15:NM_001001920:exon1:
RLYMIPVGAFIFFLGNMQNQSFVTE



c.C152T:p.S51F,
418





TMG3
OR4C15:NM_001001920:exon1:
FLGFLSFLDACFASVITPKMIVDSL



c.T379G:p.S127A,
419



OR5F1:NM_003697:exon1:c.G886A:
MLNPLIYSLRSKKVKKALANVISRK



p.E296K,
420



OR8K3:NM_001005202:exon1:
MIYVISVMGNLGIIVLTKLDSRLQT



c.G135A:p.M45I,
421



SLC22A10:NM_001039752:exon5:
FFVFFLLSRWLVKSARWLIITNKLD



c.G841A:p.E281K,
422



PPME1:NM_016147:exon6:c.G476C:
PPPIMLIGHSMGAAIAVHTASSNLV



p.G159A,
423



MMP10:NM_002425:exon8:c.G1135A:
VQAGYPRGIHTLSFPPTIRKIDAAV



p.G379S,
424



BUD13:NM_032725:exon3:c.G271T:
VVAEFVDERPEELKQMEAFRSSAKW



p.V91L,
425



ADAMTS15:NM_139055:exon8:
TVEVLSVGKMTPLRVRYSFYLPKEP



c.C2375T:p.P792L,
426



SLC6A12:NM_001122848:exon11:
QGVPISEVAESGLGLAFIAFPKAVT



c.C1079T:p.P360L,
427



CD163L1:NM_001297650:exon7:
AGVICSDKADLDIRLVGAHSPCYGR



c.C1456A:p.L486I,
428





TMG4
OR6C75:NM_001005497:exon1:
VTEFYLLAAMSYNRCMAICKPLHYT



c.G355A:p.D119N,
429



SLC5A8:NM_145913:exon10:
LSWISQGMSVVYEALCIGMAALASL



c.G1178A:p.G393E,
430



BTBD11:NM_001018072:exon2:
MFSQSELRTIEQFLLATRVGSIAEL



c.C1247T:p.S416F,
431



RASAL1:NM_001193520:exon17:
KKRYVWLSGETLFFSKSPEWQMCHS



c.C1808T:p.S603F,
432



OR4K5:NM_001005483:exon1:
MSRRTCTVLVMIFWAVSLVHTLSQL



c.C443T:p.S148F,
433



STON2:NM_033104:exon6:c.C2686T:
PFVHPTTLPLLFFLAMLTMFAW



p.L896F,
434



CHD2:NM_001271:exon17:c.G2096A:
QSLHKVLEPFLLQRVKKDVEKSLPA



p.R699Q,
435



CHD2:NM_001271:exon20:c.C2519T:
TIKHYPFQRLDGFIKGEIRKQALDH



p.S840F,
436



TSC2:NM_000548:exon31:c.C3680T:
PLSPFSSDINNMLLQELSNALMAAE



p.P1227L,
437



CACNG3:NM_006539:exon3:c.G436A:
VILSAGIFFVSARLSNIIGIIVYIS



p.G146R,
438





TMG5
LPCAT2:NM_017839:exon1:c.C127T:
RQASFFPPPVPNSFVQQTQIGSARR



p.P43S,
439



LPCAT2:NM_017839:exon1:c.C128T:
RQASFFPPPVPNLFVQQTQIGSARR



p.P43L,
440



OR1G1:NM_003555:exon1:c.A125G:
SFLFMYLVTVAGSLLIILVIITDTQ



p.N42S,
441



DNAH9:NM_001372:exon45:
TLRKGYQIQDFKIDLASLCLKAGVK



c.G8586A:p.M2862I,
442



WSB1:NM_134265:exon5:c.C463T:
DILMEFGHLFPPSTPIFAGGANDRW



p.P155S,
443



GPATCH8:NM_001304943:exon5:
AQVHHIPQPHLTLISLSHLTHSIIP



c.C4010T:p.P1337L,
444



ANKRD29:NM_173505:exon5:c.C415T:
VVETLLKHGANIYDQLYDGATALFL



p.H139Y,
445



LRG1:NM_052972:exon2:c.A808T:
GLRQLDMLDLSNYSLASVPEGLWAS



p.N270Y,
446



ZNF559-ZNF177:NM_001172650:
PYECSDCGKAFISQSSLKKHMRSHT



exon12:c.T404C:p.F135S,
447



ANKLE1:NM_001278445:exon8:
CFQHVVAVEAYTLEACIVEALGIQT



c.G1637T:p.R546L,
448





TMG6
ZNF585B:NM_152279:exon5:c.C764T:
ECTDCGKAFTQKFTLKIHQKIHTGE



p.S255F,
449



RYR1:NM_001042723:exon33:
CQEPLTMMALHISEENRCMDILELS



c.C4921T:p.P1641S,
450



RYR1:NM_001042723:exon33:
CQEPLTMMALHILEENRCMDILELS



c.C4922T:p.P1641L,
451



FCGBP:NM_003890:exon28:
CMGGGDRDILCKDLASYVAACQAAG



c.C12971A:p.A4324D,
452



C19orf47:NM_001256440:exon5:
VPCSPSPLAGEICRGTSAASRMITN



c.C397T:p.R133C,
453



FOXA3:NM_004497:exon2:c.C575T:
RSPDKPGKGSYWVLHPSSGNMFENG



p.A192V,
454



TULP2:NM_003323:exon8:c.C908T:
TRDKHGVDKGLFLLYYLYLETSDSL



p.P303L,
455



FPR2:NM_001005738:exon2:c.C866T:
YKIIDILVNPTSFLAFFNSCLNPML



p.S289F,
456



USP29:NM_020903:exon4:c.A875G:
SQQLQQGFPNLGSTCYMNAVLQSLF



p.N292S,
457



ZNF418:NM_001317029:exon5:
GERPYECSECGKLFHRSSSLLRHQR



c.C1964T:p.S655L,
458





TMG7
APOB:NM_000384:exon9:c.G1081A:
NKLVTELRGLSDKAVTSLLPQLIEV



p.E361K,
459



MAP4K3:NM_003618:exon9:c.A651T:
IELAELQPPMFDFHPMRALFLMTKS



p.L217F,
460



ANTXR1:NM_053034:exon3:c.A271G:
RMSFIVFSTRGTALMKLTEDREQIR



p.T91A,
461



STEAP3:NM_182915:exon5:c.G1121A:
EEVWRMEIYLSLEVLALGTLSLLAV



p.G374E,
462



GALNT13:NM_052917:exon5:c.C370T:
DELPNTSVVIVFYNEAWSTLLRTVY



p.H124Y,
463



UBR3:NM_172070:exon8:c.T1238C:
FLLNMLPDQEYKAAFTKTFVQHYAF



p.V413A,
464



TTN:NM_001267550:exon326:
FEVTGLVEDHRYKFRVIARNAAGVF



c.G75232A:p.E25078K,
465



DNAH7:NM_018897:exon49:c.C9271T:
FYITTKLRNPHYFPETSVKVTLLNF



p.L3091F,
466



HECW2:NM_020760:exon4:c.C464T:
YHGISGALRATTLCITVKNPAVMMG



p.P155L,
467



NPBWR2:NM_005286:exon1:c.C423A:
FSSIYFLAVMSVERYLVVLATVRSR



p.D141E,
468





TMG8
HSPA13:NM_006948:exon2:c.C80T:
LLLAGYLAQQYLLLPTPKVIGIDLG



p.P27L,
469



SLC5A3:NM_006933:exon2:c.T953G:
KLLPMFIIVVPGRISRILFTDDIAC



p.M318R,
470



DSCAM:NM_001271534:exon15:
DSWDSAQRTKDVFPQLNSATIIDIH



c.C2816T:p.S939F,
471



APOL3:NM_145642:exon4:c.C322T:
RAIRQARARARLSVTTWRISAGSGG



p.P108S,
472



SLC4A7:NM_001258379:exon16:
CVILFFTTFFLSLFLKQFKTKRYFP



c.C2162T:p.S721L,
473



GADL1:NM_207359:exon14:c.A1381G:
LREMEEGPEFWAELNLVAPAIKERM



p.K461E,
474



SCN5A:NM_001099405:exon23:
TKVKVNFDNVGAEYLALLQVYEEQP



c.G4223A:p.G1408E,
475



COL7A1:NM_000094:exon5:c.G638A:
FSILRTLLPLVSQRVCTTAGGVPVT



p.R213Q,
476



MON1A:NM_032355:exon4:c.C1388T:
SSSSFREGEAWTLVCLPKFNAAGFF



p.P463L,
477



SLC38A3:NM_006841:exon8:c.T553C:
QTFLNLEEKTSDRYMNGNYLVILVS



p.W185R,
478





TMG9
POLQ:NM_199420:exon29:c.C7615T:
CPIRGGFFILQLYDELLYEVAEEDV



p.H2539Y,
479



SAMD7:NM_182610:exon6:c.G484A:
LHFHRSTLRNLQRNPMLAATAPHFE



p.G162R,
480



SPATA16:NM_031955:exon2:c.C425T:
IDEMGVRYEFVEFFMSTGSQPTCQA



p.S142F,
481



ATP13A5:NM_198505:exon26:
AFLYVKQQPWYCKVYQYSECFLANQ



c.G3007A:p.E1003K,
482



MUC4:NM_138297:exon13:c.C1751T:
GSPEEMLFHFGMIWQINGTGLLGKR



p.T584I,
483



MUC4:NM_018406:exon2:c.G8482A:
SLPVTDASSVFTSHATSLPVTIPSS



p.G2828S,
484



GABRA4:NM_001204266:exon7:
IQTYIPCIMTVIPSQVSFWINKESV



c.T761C:p.L254P,
485



PDGFC:NM_016205:exon6:c.C968T:
LRPKTGVRGLHKLLTDVALEHHEEC



p.S323L,
486



SLC6A18:NM_182632:exon9:
LDVGVLPRWVPKKALTGLVCLVCFL



c.G1324A:p.E442K,
487



NIPBL:NM_133433:exon23:c.C4753T:
VNKPEWPAAELLFSLLGRLLVHQFS



p.L1585F,
488





TMG10
C6:NM_000065:exon7:c.G769A:
TAEDDLKTDFYKNLTSLGHNENQQG



p.D257N,
489



FAM170A:NM_182761:exon3:c.G805A:
CHVFHLTMAQLTSNMESESTQDEQE



p.G269S,
490



PCDHB8:NM_019120:exon1:c.C1709T:
NSPFVLYPLQNGFAPCTELVPRAAE



p.S570F,
491



NDST1:NM_001543:exon7:c.C1444T:
RRGFIHNGIMVLSRQTCGLFTHTIF



p.P482S,
492



SLIT3:NM_001271946:exon1:c.G44C:
PGWAGVGAAVRAPLALALALASVLS



p.R15P,
493



BMP5:NM_021073:exon1:c.G160A:
RLRNHERREIQRKILSILGLPHRPR



p.E54K,
494



SOGA3:NM_001012279:exon6:
VRNIRCLTPTRSLYPAPGPWPKSFS



c.T2392C:p.F798L,
495



SLC22A2:NM_003058:exon7:
KHTMILMYNWFTNSVLYQGLIMHMG



c.G1073A:p.S358N,
496



MAD1L1:NM_003550:exon19:
DCLIFKATSPSGFKMQLLETEFSHT



c.C2018T:p.S673F,
497



AMZ1:NM_001284355:exon2:c.C259T:
EDFQTFHASLQHWKPRLARKHIYLQ



p.R87W,
498





TMG11
COL1A2:NM_000089:exon44:
HKGERGYPGNIGLVGAAGAPGPHGP



c.C2867T:p.P956L,
499



PLXNA4:NM_020911:exon2:c.G721A:
IPSDTFTIIPDFNIYYVYGFSSGNF



p.D241N,
500



PTPRN2:NM_001308268:exon6:
QYLLRAPSRMPRSLLAPAAPQKWPS



c.C901T:p.P301S,
501



CLVS1:NM_173519:exon2:c.C83T:
WNGDLAKMTHLQVGLSPETIEKARL



p.A28V,
502



DNAJC5B:NM_033105:exon4:c.G250A:
TDISKRSIYDKYRSLGLYVAEQFGD



p.G84R,
503



ANGPT1:NM_001146:exon2:c.G336A:
LENYIVENMKSEIAQIQQNAVQNHT



p.M112I,
504



CYHR1:NM_001129888:exon2:c.G19A:
MAPKPGTEWSTALSHLVLG



p.A7T,
505



CNTNAP3:NM_033655:exon13:
APSGHPRSAVSFVYAAGAGQLRSAV



c.C1973T:p.A658V,
506



NTRK2:NM_001018065:exon14:
SSSEGGPDAVIIRMTKIPVIENPQY



c.G1558A:p.G520R,
507



OR13C3:NM_001001961:exon1:
ILAVLKLACADIFLNIITMVISNMA



c.C668T:p.S223F,
508





TMG12
OR13C8:NM_001004483:exon1:
PKLQTVFFVLILLMYLMILLGNGVL



c.G98T:p.W33L,
509



OR13C8:NM_001004483:exon1:
VLILWMYLMILLENGVLISVIIFDS



c.G122A:p.G41E,
510



SLC2A6:NM_001145099:exon6:
IQDNVRRQSSRVLWAEARAPHVCRP



c.C788T:p.S263L,
511



TLR8:NM_138636:exon2:c.A1422C:
DPHSNFYHFTRPFIKPQCAAYGKAL



p.L474F,
512



SMS:NM_004595:exon5:c.G476A:
YQNIKILHSKQFENILILSGDVNLA



p.G159E,
513



CASK:NM_001126054:exon18:
NQTVEQLQKMLRKMRGSITFKIVPS



c.G1672A:p.E558K,
514



CCNB3:NM_033031:exon5:c.G2092A:
SGSLFQEALVLQKKTDAEEDSLKNL



p.E698K,
515



FAM120C:NM_001300788:exon3:
AAFHWSLLGPEHSLASLKVRAHQLV



c.C1012T:p.P338S,
516



SRPX2:NM_014467:exon5:c.C401T:
CHALPFITSGTYICTNGVLLDSRCD



p.T134I,
517



XKRX:NM_212559:exon2:c.C593T:
YQLYVSLISAEVLLGRVVLMVFSLV



p.P198L,
518





TMG13
C1GALT1C1:NM_001011551:exon2:
PFYLGHTIKSGDFEYVGMEGGIVLS



c.C532T:p.L178F,
519



USP26:NM_031907:exon1:c.C208T:
NVVLKSYRGNQNYLHLTLQNNNGLF



p.H70Y,
520



MAGEC1:NM_005462:exon4:c.C3196T:
LEYREVPNSSPPCYEFLWGPRAHSE



p.R1066C,
521



JAKM1P2:NM_014790:exon21:
IKDLEEKSNRKHE



c.G2429A:p.G810E,
522



IRF6:NM_006147:exon9:c.C1397T:
PTPSMQLPPALPLQ



p.P466L,IRF6:NM_001206696:
523



exon7:c.C1112T:p.P371L,




AFF2:NM_001169123:exon1:c.G4A:
MNLFDFFRDWDLEQ



p.D2N,
524





TMG14
IFNLR1:NM_173064:exon7:c.C1039T:
GVDSGRPRAPLVSSEGSSAWDSSDR



p.P347S,
525



HMGB4:NM_145205:exon2:c.G220A:
ALAKLDKARYQEKMMNYVGKRKKRR



p.E74K,
526



RIMS3:NM_014747:exon3:c.G101A:
SISGEICGSQQAEGGAGTTTAKKRR



p.G34E,
527



KANK4:NM_181712:exon3:c.C59T:
DQSSQGDEEKDPLKSHPYSVETPYG



p.P20L,
528



SLC44A5:NM_152697:exon4:c.G70A:
SEEEDFGDPRTYNPDFKGPVANRSC



p.D24N,
529



FCGR1A:NM_000566:exon6:c.G899A:
PVWFHVLFYLAVEIMFLVNTVLWVT



p.G300E,
530



SV2A:NM_014849:exon2:c.C305T:
DDEIYEGEYQGILRAESGGKGERMA



p.P102L,
531



TCHHL1:NM_001008536:exon3:
FGEQEGNLATQSLPPKEATQRPCED



c.C788T:p.S263L,
532



HRNR:NM_001009931:exon3:
SSSSYGQHGSGSHQSLGHGQHGSGS



c.G5792A:p.R1931H,
533



FLG:NM_002016:exon3:c.G10285A:
QARDSSRHSASQKGQDTIRGHPGSS



p.E3429K,
534





TMG15
UBAP2L:NM_001287816:exon7:
GSGRRGGRFSAQEMGTFNPADYAEP



c.G617A:p.G206E,
535



CD1E:NM_030893:exon3:c.C520T:
GIRAQNICKVLNCYLDIKEILQSLL



p.R174C,
536



OR10J5:NM_001004469:exon1:
IIVTIICIDHHLYTPMYFFLSMLAS



c.C166T:p.H56Y,
537



KCNT2:NM_001287819:exon9:
IFTCICGIQHLEQIGKKLNLFDSLY



c.G665A:p.R222Q,
538



CFHR2:NM_001312672:exon3:
GDIVEFVCKSGYQPTKSHSFRAMCQ



c.T366G:p.H122Q,
539



NFASC:NM_015090:exon6:c.C487T:
PLTLQCNPPPGLSSPVIFWMSSSME



p.P163S,
540



TUBB8:NM_177987:exon4:c.A1241G:
GMDEMEFTEAESSMNDLVSEYQQYQ



p.N414S,
541



FAM208B:NM_017782:exon12:
GAEVLTAQFVQKIKLDRKNQEAPIS



c.C1241T:p.T414I,
542



ARID5B:NM_001244638:exon7:
EIQEGKDKLLEKKALPHSHMPSFLA



c.G1721A:p.R574K,
543



SFTPA2:NM_001098668:exon3:
GRDGVKGDPGPPSPMGPPGETPCPP



c.G172A:p.G58S,
544





TMG16
SFTPA1:NM_001164644:exon3:
GRDGLKGDPGPPSPMGPPGEMPCPP



c.G172A:p.G58S,
545



DNMBP:NM_015221:exon5:c.C2401T:
IEELLQTERDYIWDLEMCIERIMVP



p.R801W,
546



TECTB:NM_058222:exon7:c.G682A:
LQWQLINKGCPTNETVLVHENGRDH



p.D228N,
547



MUC5B:NM_002458:exon31:c.G1229
TPGHTTATSRTTTTATPSKTRTSTL



2A:p.A4098T,
548



TRPM5:NM_014555:exon12:c.C1754T:
EAKYERLALDLFFECYSNSEARAFA



p.S585F,
549



OR51G1:NM_001005237:exon1:
QGLEGLHGWISISFCFIYLTVILGN



c.C94T:p.P32S,
550



TPP1:NM_000391:exon11:c.C1424T:
SNRVPIPWVSGTLASTPVFGGILSL



p.S475L,
551



OR10A4:NM_207186:exon1:c.T452G:
HISCAQLAAASWCSGFSVATVQTTW



p.F151C,
552



AMPD3:NM_001172431:exon10:
VDDESKHSDHMFFDKSPNPDVWTSE



c.C1136T:p.S379F,
553



LUZP2:NM_001009909:exon11:
QDEGRPCSMKHKKSPPSNATAETEP



c.G889A:p.E297K,
554





TMG17
BBOX1:NM_003986:exon6:c.G589T:
DKIDANNVAYTTWKLSFHTDYPALH



p.G197W,
555



KCNA4:NM_002233:exon2:c.C1148T:
FFIVETVCIVWFFFEFVVRCFACPS



p.S383F,
556



PAMR1:NM_001282675:exon8:
CLAGYTGQRCENFLEERNCSDPGGP



c.C700T:p.L234F,
557



NARS2:NM_024678:exon2:c.C164T:
RIKIQGWIRSVRFQKEVLFLHVNDG



p.S55F,
558



CRACR2A:NM_001144958:exon16:
VDNSQVALQLWDMAGQERYRCITQQ



c.C1802T:p.T601M,
559



CCND2:NM_001759:exon5:c.G803A:
NSLQQYRQDQRDESKSEDELDQAST



p.G268E,
560



TAS2R46:NM_176887:exon1:c.G198C:
SRVGLLWVLVLNCYATELNPAFNSI



p.W66C,
561



SLC38A4:NM_018018:exon17:
GSMALIIIDWIYNPPNSKHH



c.G1618A:p.D540N,
562



KMT2D:NM_003482:exon10:c.C2584T:
HLSPELEKPPLSSRPEKPPEEPGQC



p.P862S,
563



LUM:NM_002345:exon2:c.G743A:
LSHNELADSGIPENSFNVSSLVELD



p.G248E,
564





TMG18
CFAP54:NM_001306084:exon60:
VSEAVLAINLLIEKKNTRMHKVNQV



c.G8177A:p.G2726E,
565



HECTD4:NM_001109662:exon63:
TPAKPIRVSDIYFSKEQINSQTPGN



c.C10705T:p.L3569F,
566



PARP4:NM_006437:exon13:
FSDSLSTSIKYSQPGETDGTRLLLI



c.C1470A:p.H490Q,
567



FREM2:NM_207361:exon1:c.C3661T:
DTPILNAADADVSLDDLTFTITQFP



p.P1221S,
568



PROSER1:NM_025138:exon11:
PVMIKTEPTSPTSSAFKGPSHSGNP



c.C1846T:p.P616S,
569



SLITRK6:NM_032229:exon2:
PATTTNTADTILQSLTDAVPLSVLI



c.G1796A:p.R599Q,
570



METTL21C:NM_001010977:exon2:
YASYTQEHYRFAEKEIVIQESIESY



c.G224A:p.G75E,
571



CCDC168:NM_001146197:exon4:
PLSPKRKDQKGRITIRDLKRELSTK



c.T18244A:p.L6082I,
572



TRIM9:NM_015163:exon8:c.G1871A:
DDKAWAMYVDNNQSWFMHNNSHTNR



p.R624Q,
573



SMOC1:NM_001034852:exon8:
EAQQNPREGIVISECAPGGLYKPVQ



c.C745T:p.P249S,
574





TMG19
ZFYVE1:NM_021260:exon12:
KDAARPAYWVPDYEILHCHNCRKEF



c.C2146T:p.H716Y,
575



SLC12A1:NM_001184832:exon11:
VAICVGACVVRDTTGNMNDTIISGM



c.G1321A:p.A441T,
576



UNC13C:NM_001080534:exon1:
STHNLSTEEDEAGKEFSLSPTFSYR



c.A247G:p.S83G,
577



CGNLI:NM_001252335:exon3:
PVLDGARSRRSSLSSTTPTSANSLY



c.C893T:p.S298L,
578



ITGA11:NM_001004439:exon14:
LNQDSYNDVVVGTPLEDNHAGAIYI



c.G1690A:p.A564T,
579



GOLGA6C:NM_001164404:exon17:
AQNPADEPTPGASAPQELGAAGEQD



c.C1915T:p.P639S,
580



NTRK3:NM_001007156:exon10:
LEEPELRLEHCIKFVVRGNPPPTLH



c.G964A:p.E322K,
581



RGS11:NM_003834:exon16:
LAEAGIPLEMKRHVFPFTWRPRHSS



c.G1226A:p.R409H,
582



SMIM122:NM_001253793:exon4:
PRRESPRKERPKEVDNLALEP



c.G224A:p.G75E,
583



DCUN1D3:NM_173475:exon3:
LSNYSEDEAWPSFFDTFVEWEMERR



c.C805T:p.L269F,
584





TMG20
MVP:NM_005115:exon3:c.C244T:
QGLVLFDVTGQVWLRHADLEIRLAQ



p.R82W,
585



GPT2:NM_001142466:exon10:
AKLTEDLFNQVPEIHCNPLQGAMYA



c.G986A:p.G329E,
586



AMFR:NM_001144:exon11:c.C1507T:
DNILEGRIQVPFSTQRSDSIRPALN



p.P503S,
587



NFATC3:NM_004555:exon9:c.C2797T:
GSATTASPAASHSLASSPLSGPPSP



p.P933S,
588



ZNF469:NM_001127464:exon2:
QLPASPSCRDPPCPQQLLACSPAWA



c.G6397T:p.G2133C,
589



CDH15:NM_004933:exon5:c.G553A:
RAEATDADDPETNNAALRFSILQQG



p.D185N,
590



RPH3AL:NM_001190413:exon7:
GETGTGSADPPGRPRPGLTRRAPVK



c.G757A:p.G253R,
591



ZNF594:NM_032530:exon2:c.G451A:
SSNLIIHQRIHTRNKPYVCNECGKD



p.G151R,
592



NEURL4:NM_001005408:exon1:
GGPGPGPGGGGGLSGSGSGPGSNGG



c.C74T:p.P25L,
593



NEURL4:NM_001005408:exon1:
GGPGPGPGGGGGSSGSGSGPGSNGG



c.C73T:p.P25S,
594





TMG21
MYH2:NM_017534:exon12:c.G1039A:
TDSAIDILGFTNKEKVSIYKLTGAV



p.E347K,
595



MYOCD:NM_153604:exon10:c.G1661A:
VEELRMQLQKQKKNNCSEKKPLPFL



p.R554K,
596



PLD6:NM_178836:exon2:c.G548A:
SLNWTTQAIQNNKENVLITEDDEYV



p.R183K,
597



MRC2:NM_006039:exon6:c.C1114T:
KKKPNATAEPTPSDRWANVKVECEP



p.P372S,
598



ACE:NM_000789:exon16:c.G2383A:
LLWAWEGWRDKARRAILQFYPKYVE



p.G795R,
599



CD300LF:NM_001289083:exon3:
VQVTIDPAPVTQAETSSSPTLTGHH



c.A398C:p.E133A,
600



RNF213:NM_001256071:exon29:
HPLLEDGCIEDDLAPHKKVGFVGIS



c.C8618T:p.P2873L,
601



DSG4:NM_001134453:exon4:c.G322A:
PYGVFTINPRTGKINITSVVDREIT



p.E108K,
602



DSG4:NM_001134453:exon8:c.G901A:
RLQAIDLDEEGTNNWLAQYLILSGN



p.D301N,
603



RNF165:NM_152470:exon2:c.C286T:
PLPTLQFQDVTGSSFLPQALHQQYL



p.P96S,
604





TMG22
PLIN4:NM_001080400:exon3:
ASVVDVAKGVVQRGLDTTRSALTGT



c.G325A:p.G109R,
605



RFX2:NM_000635:exon8:c.C811T:
TRGNSKYHYYGICLKPDSPLNRLQE



p.R271C,
606



DENND1C:NM_001290331:exon20:
LSMGAKSAGSLRSSQSLDCCHRGDL



c.C1603T:p.P535S,
607



MUC16:NM_024690:exon5:c.T33443A:
EASSAVLTVSPEEPGMVTSLVTSSR



p.V11148E,
608



C19orf43:NM_024038:exon3:
YKAHQCGDDDKTWPLVK



c.C514T:p.R172W,
609



KMT2B:NM_014727:exon3:c.C1919T:
PPAPSPPPAPATFSRRPLLLRAPQF



p.S640F,
610



ZNF585A:NM_199126:exon6:c.C625T:
TQKSTLKMHQKIYTGERSYICIECG



p.H209Y,
611



SIPA1L3:NM_015073:exon3:
GECERNVSFSRAFVGSPSSGEGHLA



c.C1343T:p.5448F,
612



HNRNPL:NM_001533:exon8:c.C1004T:
GYHSHYHDEGYGLPPPHYEGRRMGP



p.P335L,
613



ZNF574:NM_022752:exon2:c.A74G:
EHRYVCSECNQLCGSLEEVLMHQNS



p.Y25C,
614





TMG23
LILRA4:NM_012276:exon6:c.G1172A:
EFPMSPVTSAHAETYRCYGSRSSNP



p.G391E,
615



NLRP11:NM_145007:exon8:c.C2336T:
CDALLHPNCTLILLVLVFCCLTENC



p.5779L,
616



NLRP5:NM_153447:exon7:c.G866A:
FRPRTVVLHGKSEIGKSALARRIVL



p.G289E,
617



ZNF256:NM_005773:exon3:c.G1383T:
DLIVHERVHTGESPYECSECGKSFT



p.R461S,
618



GCKR:NM_001486:exon12:c.C1055T:
IIAIMDGVECIHIFGADFRDVRGFL



p.T352I,
619



BIRC6:NM_016252:exon68:
ATTSLRQANQEKILGEYSKKAAMKP



c.A13508T:p.K4503I,
620



DYSF:NM_001130979:exon28:
DAGHLSFVEEVFKNQTRLPGGQWIY



c.G2956A:p.E986K,
621



DNAH6:NM_001370:exon51:c.C8413T:
ADISEIRVFTKPSDLVMTVMEAISI



p.P2805S,
622



MARCO:NM_006770:exon5:c.G533A:
GAPGPPGPPAEKEAKGAMGRDGATG



p.G178E,
623



BAZ2B:NM_001289975:exon28:
LTSNVASSKSESSVPQNEKATSAQP



c.C4933T:p.P1645S,
624





TMG24
SCN2A:NM_001040142:exon17:
ESDFENLNTEEFNSESDMEESKEKL



c.G3368A:p.S1123N,
625



SCN1A:NM_001165963:exon24:
KLGSKKPQKPIPQPGNKFQGMVFDF



c.G4574A:p.R1525Q,
626



TTN:NM_001267550:exon28:
ELKSRKKDESYEKLLRKTKDELLHW



c.G6085A:p.E2029K,
627



CERKL:NM_001030311:exon8:
FSAMFGFGGRTLVLAEKYRWMSPNQ



c.C1073T:p.A358V,
628



DNAH7:NM_018897:exon5:c.A277G:
QSHAEYMERFGKEGKLPHQVDDSYV



p.K93E,
629



SPAG16:NM_024532:exon7:c.C734T:
KHHTLLKEKMLTFLERDKVVGQISG



p.S245F,
630



PRKAG3:NM_017431:exon4:c.C329T:
PLAQADPAGVGTLPTGWDCLPSDCT



p.P110L,
631



DES:NM_001927:exon6:c.G1232A:
DVEIATYRKLLEEEESRINLPIQTY



p.G411E,
632



COL4A3:NM_000091:exon28:
IPGSLGKCGDPGFPGPDGEPGIPGI



c.C2059T:p.L687F,
633



SPHKAP:NM_030623:exon7:c.C2486T:
SHRVPDSSTATTFSKEIYLKGIAGE



p.S829F,
634





TMG25
FAM209B:NM_001013646:exon2:
SLYKDCVFNTLNKLEVELLKFVSEV



c.G352A:p.E118K,
635



KCNJ15:NM_001276437:exon4:
QLSGKLLQTHVTQEGERILLNQATV



c.A667C:p.K223Q,
636



UMODL1:NM_001199527:exon11:
VNPSQGSPSQGSFRQESTSQASPSQ



c.C1771T:p.L591F,
637



SEZ6L:NM_001184773:exon4:
YFRTFQDDGLGTVQLHYQAFMLSCN



c.T1144G:p.F382V,
638



MKL1:NM_001282662:exon15:
GEAIPEDSLWVPTGSTAITFC



c.C2368A:p.P790T,
639



DLEC1:NM_007337:exon3:c.C658T:
SPEDYYTDTVPFYSAPKGISLPGCS



p.H220Y,
640



SEMA3F:NM_004186:exon8:c.G646A:
DPKLDTASALINKELYAGVYIDFMG



p.E216K,
641



STAB1:NM_015136:exon47:c.G4819A:
ASFFSLRLLEYKKLKGDGPFTIFVP



p.E1607K,
642



LRTM1:NM020678:exon3:c.G1016A:
QTNDPGKVEEKEQFDSSPA



p.R339Q,
643



LRTM1:NM_020678:exon3:c.G1006A:
PLAQTNDPGKVEKKERFDSSPA



p.E336K,
644





TMG26
LRTM1:NM_020678:exon2:c.G158A:
LAEIPSHLPPQTQTLHLQDNQIHHL



p.R53Q,
645



ERC2:NM_015576:exon2:c.G89A:
SPRLPRSPRLGHQRTSSGGGGGTGK



p.R30Q,
646



COL8A1:NM_020351:exon4:c.G440A:
PGLPGHGIPGIKEKPGPQGYPGVGK



p.G147E,
647



USF3:NM_001009899:exon7:
NSQGSIEATMERSLEKPSCSLGIKT



c.C3625T:p.P1209S,
648



TIMMDC1:NM_016589:exon2:c.A338T:
KQQYIEQSQAEIFHNRFDAVQSAHR



p.Y113F,
649



CCDC14:NM_001308317:exon12:
DWSISSFSTFTSHDEQDFRNGLAAL



c.G2168A:p.R723H,
650



ACAD9:NM_014049:exon11:c.C1117T:
YLTAGMLDQPGFSDCSIEAAMVKVF



p.P373S,
651



FNDC3B:NM_001135095:exon5:
YISQVIEDSTGVCRVVVTPQSPECY



c.C289T:p.R97C,
652



BCL6:NM_001706:exon5:c.A496C:
IMAYRGREVVENHLPLRSAPGCESR



p.N166H,
653



UBE2K:NM_001312646:exon3:
LEIKIPETYPFNSPKYKQNPEMFKQ



c.C208T:p.P70S,
654


TMG27
BEND4:NM_001159547:exon3:
GHPSSSTLPEEEKEEDEEGYCPRCQ



c.G943A:p.E315K,
655



GABRB1:NM_000812:exon9:c.G1129A:
GNILLSTLEIRNKTSGSEVLTSVSD



p.E377K,
656



NOA1:NM_032313:exon6:c.C1807T:
MGGKERMAGFPPFVAEDIMLKEGLG



p.L603F,
657



ADGRL3:NM_015236:exon7:c.C1369T:
RTTTLSPGRSTTSSVSGRRNRSTST



p.P457S,
658



ANKRD17:NM_198889:exon27:
KKVSVPSTVISRGIGRGGCNINAIR



c.T4466G:p.V1489G,
659



FRAS1:NM_025074:exon63:c.C9566T:
HVEEVTKEGVKKFPSPGYPLVCVTP



p.S3189F,
660



ADH1A:NM_000667:exon7:c.C890T:
EACGTSVIVGVPLDSQNLSMNPMLL



p.P297L,
661



BANK1:NM_001127507:exon8:
ADGAEANEMEGEEKQNGSGMETKHS



c.G977A:p.G326E,
662



ENPEP:NM_001977:exon16:c.C2237T:
NDAGDHVTKLLRFSVLGFACKMGDR



p.S746F,
663



KIAA1109:NM_015312:exon71:
SGLGSPLGRSRHTSSQSDLTSSSSS



c.G12395C:p.S4132T,
664





TMG28
INTU:NM_015693:exon12:c.A2110G:
CRRTLFGDYSLKARKPSPSCSSGGS



p.T704A,
665



LARP1B:NM_018078:exon10:
MSRGLSTSLPDLGSEPWIEVKKRHQ



c.A1103G:p.D368G,
666



NPY5R:NM_006174:exon4:c.T998A:
SQLSSSSKFIPGDPTCFEIKPEENS



p.V333D,
667



GLRA3:NM_001042543:exon6:
LPQFLLKEEKDLQYCTKHYNTGKFT



c.G686A:p.R229Q,
668



TENM3:NM_001080477:exon27:
IHDVKDYITDVNNWLVTFGFHLHNA



c.G7223A:p.S2408N,
669



STOX2:NM_020225:exon3:c.G1091A:
HHSGRSKKSRTHQKSHGKSRSHSKT



p.R364Q,
670



PLEKHG4B:NM_052909:exon3:
QSGVVTLPGTRDGHGRAVVQVRTRS



c.C631G:p.R211G,
671



DNAH5:NM_001369:exon73:
FGALGWNIPYEFSQADFNATVQFIQ



c.A12638G:p.N4213S,
672



DNAH5:NM_001369:exon14:c.G1837A:
DMISKLYTKQKYNPPLARNQPPIAG



p.D613N,
673



DNAH5:NM_001369:exon4:c.G364T:
KPKVFVTEGNDVSLTGVCVFFIRTD



p.A122S,
674





TMG29
PRLR:NM_000949:exon10:c.A1297T:
PSQHNPRSSYHNFTDVCELAVGPAG



p.I433F,
675



RICTOR:NM_001285439:exon1:
AAIGRGRSLKNLQVRGRNDSGEENV



c.G41A:p.R14Q,
676



MROH2B:NM_173489:exon30:
FIPNEEILMFLEKMLDGLESLNPTC



c.G3031A:p.E1011K,
677



CMYA5:NM_153610:exon2:c.C9670T:
SEGDSVNSEASFSSRNSDTDDGTGI



p.P3224S,
678



VCAN:NM_001164098:exon7:
PFPSQHRTEIELLPYSGDKILVEGI



c.T1966C:p.F656L,
679



VCAN:NM_001164098:exon7:
LSTPQPPASTKFYPDINVYIIEVRE



c.C3955T:p.H1319Y,
680



VCAN:NM_001164097:exon7:
SPHVETTFSTEPIGLVLSTVMDRVV



c.C2687T:p.T896I,
681



ADGRV1:NM_032119:exon20:
FTGLEGAFGTVNSKYHPSRNNTIAN



c.C4024T:p.P1342S,
682



NR2F1:NM_005654:exon2:c.G476A:
LKVGMRREAVQRERMPPTQPNPGQY



p.G159E,
683



LIX1:NM_153234:exon4:c.C412T:
ASTSGTLDDADDSSTSVGAYHYMLE



Pp.138S,
684





TMG30
EPB41L4A:NM_022140:exon6:
TAAQLGAYAIQSKLGDYDPYKHTAG



c.G436A:p.E146K,
685



KDM3B:NM_016604:exon14:
DIRSEEPLKTDSWASNSNSELKAIR



c.C3590G:p.S1197W,
686



GRXCR2:NM_001080516:exon1:
PEKKLNQKSDGKLRKVRFKISSSYS



c.C47T:p.P16L,
687



LARS:NM_020117:exon29:c.A3033C:
NLEKMGPRILDLHLEFDEKAVLMEN



p.Q1011H,
688



GRIA1:NM_000827:exon11:c.G1496T:
DVAVAPLTITLVLEEVIDFSKPFMS



p.R499L,
689



FAXDC2:NM_032385:exon9:c.C985T:
GFTPLSESIPDSSKRME



p.P329S,
690



THVID4:NM_001146726:exon2:
SSWSHNSNSMCWEKDQCPYSGCKEA



c.G164A:p.G55E,
691



NIPAL4:NM_001099287:exon1:
SRPPAPELGSPGTVRPRVGSCAPGP



c.G148A:p.A50T,
692



GABRG2:NM_198904:exon6:c.C701T:
WKRSSVEVGDTRFWRLYQFSFVGLR



p.S234F,
693



STC2:NM_003714:exon4:c.G871A:
LGAQGPSGSSEWKDEQSEYSDIRR



p.E291K,
694





TMG31
FGFR4:NM_213647:exon13:
LRARRPPGPDLSSDGPRSSEGPLSF



c.C1720T.p.P574S,
695



LMAN2:NM_006816:exon2:c.T248G:
SMPLWDFQGSTMRTSQYVRLTPDER



p.L83R,
696



MAPK9:NM_002752:exon8:c.G834C:
YPGIKFEELFPDCIFPSESERDKIK



p.W278C,
697



DEK:NM_001134709:exon2:c.G123C:
REESEEEEDEDDDEEEEEEKGKGQK



p.E41D,
698



LRRC16A:NM_017640:exon33:
LIKSRSKSERPPKILMTEEPSSPKG



c.C3254A:p.T1085K,
699



RNF8:NM_183078:exon3:c.C664T:
LEPSKTTGAPIYSGFPKVTEVHHEQ



p.P222S,
700



RNF8:NM_183078:exon3:c.C665T:
LEPSKTTGAPIYLGFPKVTEVHHEQ



p.P222L,
701



KCNK16:NM_001135106:exon5:
SDPSGLPRPQKISISA



c.C871T:p.P291S,
702



KLHDC3:NM_057161:exon2:c.C37T:
MLRWTVHLEGGPCRVNHAAVAVGHR



p.R13C,
703



HSP90AB1:NM_001271969:exon10:
LCKLMKEILDKKAEKVTISNRLVSS



c.T1724C:p.V575A,
704





TMG32
OOEP:NM_001080507:exon1:c.C50T:
AGAAESQRGKQTLAHSLEQLRRLPL



p.P17L,
705



IMPG1:NM_001282368:exon2:
CQQETFCLFDIGNNFSNSQEHLDLL



c.A192T:p.K64N,
706



MANEA:NM_024641:exon5:c.G1253A:
HEGTQIEKAVPKKTSNTVYLDYRPH



p.R418K,
707



AK9:NM_001145128:exon23:
TVVLPEFPEDSYSDVPEMEPFKEKI



c.C2467T:p.P823S,
708



BCLAF1:NM_014739:exon5:c.G1111A:
EASKEKGSEKGRTEGEWEDQEALDY



p.A371T,
709



ADCYAP1R1:NM_001118:exon7:
FGDSNSLDLSDMRVVSRNCTEDGWS



c.G334A:p.G112R,
710



PDE1C:NM_001191058:exon16:
EKARLAAEEQQKKMEAKSQAEEGAS



c.G1825A:p.E609K,
711



ANK1B1:NM_019004:exon15:
LLKTRRILKCSYLYGFFLEPKSTKK



c.C1994T:p.P665L,
712



BRAF:NM_004333:exon15:c.T1799A:
LTVKIGDFGLATEKSRWSGSHQFEQ



p.V600E,
713



ZNF398:NM_170686:exon6:c.C1277T:
TYHLRVHNSTERLFPCPDCPKRFAD



p.P426L,
714





TMG33
ABCB8:NM_001282293:exon15:
KGGLYAELIRRQTLDAPRTAAPPPK



c.G1816A:p.A606T,
715



CSMD1:NM_033225:exon66:
REVNETVTKTPVSSDVFFVNSLWKG



c.C10042T:p.P3348S,
716



FGF20:NM_019851:exon3:c.C601T:
HFLPRPVDPERVSELYKDLLMYT



p.P201S,
717



C8orf34:NM_052958:exon9:
TLNICSRCARLQEDNLEERTEESLP



c.G1250A:p.G417E,
718



ZFHX4:NM_024721:exon10:c.A7093G:
MDATDQVVYKHCAVSGQTDAAKNAA



p.T2365A,
719



ZC2HC1A:NM_016010:exon8:c.T741G:
GANVKPRNSTPPRLARNPAPGVLTN



p.S247R,
720



ATP6V0D2:NM_152565:exon2:
ANHTNPLTVSKIYTEMRKRLCGEFE



c.G211T:p.D71Y,
721



MMP16:NM_005941:exon10:c.C1564T:
FNNQILKVEPGYSRSILKDFMGCDG



p.P522S,
722



PKHD1L1:NM_177531:exon16:
VQVITLENWETTDAINEVQKIKVTS



c.A1564G:p.N522D,
723



GML:NM_002066:exon3:c.C181T:
PYHIRRCMTISICINSRELLVYKNC



p.R61C,
724





TMG34
CEP78:NM_032171:exon11:c.T1307C:
VTVTVESPSSSEAEEVDDSSESVHE



p.V436A,
725



NAA35:NM_024635:exon17:c.C1409T:
ATLQDEAEKVDAVLHTMLLKQEPQR



p.A470V,
726



GRIN3A:NM_133445:exon1:c.C329T:
LGSTLHGRGPPGFRKPGEGARAEAL



p.S110F,
727



FKTN:NM_006731:exon10:c.G1339A:
DWKRSPPNVQPNRIWPISEWDEVIQ



p.G447R,
728



OR1J4:NM_001004452:exon1:
IFFTDLDNFLLTTMAYDRYVAICHP



c.T349A:p.S117T,
729



IER5L:NM_203434:exon1:c.G949A:
YPGQEEEEDDEENAGGLGAEPPGGA



p.D317N,
730



VCX3B:NM_001001888:exon3:
SQESQVEEPLSQKSEMEEPLSQESE



c.G679A:p.E227K,
731



ARHGAP6:NM_006125:exon4:c.C995T:
NKRQNKELSSSNLSLSSTSETPNES



p.S332L,
732



FRMPD4:NM_014728:exon2:c.C68T:
KLSSHRTKSSGWLPPSGTWGLSQVP



p.P23L,
733



BMX:NM_001721:exon15:c.G1567A:
SQLLEMCYDVCESMAFLESHQFIHR



p.G523S,
734





TMG35
PHEX:NM_000444:exon4:c.C386T:
ELLEKSISRRRDIEAIQKAKILYSS



p.T129I,
735



DMD:NM_004009:exon10:c.G1000A:
EDKSFGSSLMESKVNLDRYQTALEE



p.E334K,
736



FAM47C:NM_001013736:exon1:
CPEPPETRVSPLCQLPPEAGVSHLC



c.C1099T:p.R367C,
737



SHROOM4:NM_020717:exon4:
PKECSRMGGRRSRGTRGRSIQNRRK



c.G1714A:p.G572R,
738



IQSEC2:NM_001111125:exon7:
LDDALRKFQSHIQVQGEAQKVERLI



c.G2534A:p.R845Q,
739



PCDH11X:NM_032968:exon1:c.C277T:
TGEIFTTGARIDCEKLCAGIPRDEH



p.R93C,
740



PCDH19:NM_001105243:exon4:
TSVTSMGSQMPDYDQNEGFHCREEC



c.C2704T:p.H902Y,
741



TENM1:NM_014253:exon10:c.C1774T:
CRHGWKGPECDVSEEQCIDPTCFGH



p.P592S,
742



ZNF75D:NM_001185063:exon4:
MKIAQKTMGRENSGDTHSVQKWHRA



c.C673T:p.P225S,
743



MAGEC1:NM_005462:exon4:c.G367A:
PLEISQSPPEGENVQSPLQNPASSF



p.D123N,
744



IDS:NM_006123:exon6:c.G820A:
VAYNPWMD1RQRKDVQALNISVPYG



p.E274K,
745



ABCD1:NM_000033:exon1:c.C22G:
MPVLSRPGPWRGNTLKRTAV



p.R8G,
746









To identify MHC-I neoantigens, MHC-I expressing Cos7 cells were transfected (Cos7-A1, Cos7-B8 and Cos7-C7) cells with 35 TMGs, respectively, followed by adding CD8+ T cells isolated from 136 TILs. ELISA analysis revealed that one TMG (i.e. TMG2) was positive for activating T cell response in Cos7-A1, but not in Cos7-B8 or Cos7-C7 cells (FIG. 8D). Further experiments using individual minigene-expressing vectors identified HHAT (G>E) in TMG2 as HLA-A1 restricted neoantigen in 136 mel (FIG. 8E and FIG. 9B). To exclude the possibility that other MHC-I and II molecules could also present neoantigens for T cell recognition, fibroblasts were generated from patient #136 tumor samples and used as APCs to express all MHC I and II molecules after IFN-γ pretreatment (FIG. 9C). The IFN-γ—pretreated fibroblasts were electroporated with in IVT mRNA of 136 mel TMGs, and then co-cultured with 136TILs. TMG2, TMG19, TMG31 and TMG32 were identified as positive again for T cell recognition in the IFN-γ-pretreated fibroblasts, but did not identify new TMG using 136TIL (FIG. 8F).


As shown in 135 mel, ADIPOR2 and LAGE1b were identified as tumor antigens recognized by T cell clones that were in relatively low abundance in the bulk TIL population. Thus, T cell clones generated from 136TILs were tested for their ability to recognize the newly identified as well as unknown neoantigens. 293IMDR1/DP4 and 293IMDR11/DP4 cells were transfected with 35 TMGs, respectively, followed by co-culturing with T cells clones for overnight. It was found that CD4+136-C13 and 136-C22 T cell clones were capable of recognizing TMG13 and TMG8 presented by 293IMDR1/DP4 cells (FIG. 8G). Further experiments using individual minigenes identified C1GALT1C1 (L>F) in TMG13 and HSPA13 (P>L) in TMG8 as two MHC-II neoantigens (FIG. 8H and FIG. 9D). Titration experiments using synthesized neoantigens and their corresponding WT peptides demonstrated that CD4+136TIL and CD8+136TIL cells can recognize 4 MHC II neoantigens and one HLA-A1-restricted neoantigen, respectively, while T cell clones (136-C13 and 136-C22) recognized two MHC-II restricted neoantigens (FIG. 8I). By contrast, these T cell lines or clones showed no or little reactivity against their corresponding WT peptides (FIG. 8I), indicating that these are true neoantigens recognized by tumor-reactive T cells.


(4) Discovery of Immunodominance for Neoantigen Recognition Patterns by Tumor-Reactive T Cells

Using systemic and multiple approaches, 8 cancer neoantigens were identified in 135 mel and 7 neoantigens in 136 mel. To understand the potential mechanisms that restrain the number of neoantigens that could induce T cell response, it was asked whether these neoantigens were equally recognized by T cells. To test this possibility, 78 CD4+ tumor-reactive T-cell clones established from 135TILs were tested for their ability to recognize neoantigens, and unexpectedly found that 69 of 78 T cell clones (88%) were capable of recognizing the PCDHB16 neoantigen in 293IMDR4/DP4 cells, while 3 (3.8%) recognized SPATA13, 3 (3.8%) recognized PCDHB7, and 1 (1.3%) recognized ATG5 (FIG. 10A), indicating that PCDHB16 neoantigen is an immunodominant neoantigen that stimulates CD4+ T cell response. Similarly, 40 CD8+ tumor-reactive T-cell clones were tested for their ability to recognize MHC-I restricted neoantigens (RPN2, MPG and TXNIP), and found that 27 of 40 CD8+ T-cell clones (67.5%) recognized MPG, while 5 of 40 (12.5%) recognized RPN2 and 8 of 40 (20%) recognized TXNIP, respectively (FIG. 10B), indicating that MPG is an immunodominant neoantigen responsible for CD8+ T-cell recognition.


To exclude the possibility that T cell growth selection during T cell cloning may contribute to the observed immunodominance, primary neoantigen-specific T cell population (without long-time culture and expansion) were freshly generated and performed intracellular cytokine staining after specific neoantigen peptide stimulation, and found that tumor-reactive CD4+ T cell populations accounted for 32.7% in the total T cell population when stimulated with 135 mel tumor cells. Importantly, PCDHB16 neoantigen-specific CD4+ T cell population accounted for 30.2% in the total T cell population (FIG. 10C), which indicated that 92% of tumor-reactive CD4+ T cells are PCDHB16 neoantigen-specific T cells (FIG. 10D). By contrast, other neoantigens (SPATA13, PCDHB7 and ATG5) specific T cells accounted for very small percentage of tumor-reactive T cells (FIGS. 10C and 10D). Using a similar approach, it was shown that 18.8% of tumor-reactive CD8+ T cells were presented in the total T cell population when stimulated with 135 mel tumor cells, but 12.3% of them accounted for recognition of MPG neoantigen, while 2.95 and 1.54% accounted for recognition of TXNIP and RPN2, respectively (FIG. 10C), indicating that MPG neoantigen functions as an immunodominant antigen for CD8+ T cell response, which accounts for 65% of tumor-reactive CD8+ T cells (FIG. 10D). These results indicate that PCDHB16 and MPG neoantigens are immunodominant neoantigens recognized by a large majority of CD4+ or CD8+ tumor-reactive T cells, respectively.


Interestingly a similar recognition pattern was observed in the second patient-derived T cells (136TIL) for their neoantigen recognition. Intracellular staining of the freshly generated 136TIL population after stimulation with fibroblasts loaded with neoantigen peptide pools or individual one showed that tumor-reactive of CD4+ or CD8+ T cells accounted for 24.8% and 14.6% in the total T cell population, respectively (FIG. 11A). Among them, MAPK9 and ANKIB1 neoantigen-specific CD4+ T cells accounted for 14.3% and 5.2% in the total T cell population, which accounted for 58% and 21% in the total tumor-reactive CD4+ T cell population, respectively (FIG. 11B). By contrast, other neoantigen-specific CD4+ T cells accounted for only a small percentage of T cells in the total T cell population (FIG. 11A). Since HHAT is only one MHC-I neoantigen, it was shown that HHAT neoantigen-specific CD8+ T cells accounted for 10.6% in the total T cell population when stimulation with HHAT peptide (FIG. 11A), which accounted for 73% in the total tumor-reactive CD8+ T cell population (FIG. 11B). Taken together, these results indicate that the mutated MAPK9 and ANKIB1 antigens are MHC-II restricted immunodominant neoantigens, while HHAT is the only MHC-I restricted neoantigen.


Although T cell clones and intracellular staining analysis demonstrate immunodominant neoantigen recognition patterns by T cells, it is not clear whether immunodominant neoantigens are recognized by T cells with the same or different TCR α and β chains. To address this issue, PCDHB16 neoantigen-specific CD4+ T cells were isolated by FACS sorting of PCDHB16-neoantigen T cells from freshly generated bulk T cells after stimulated PCDHB16 neoepitope peptide-pulsed 135EBV-B cells. Using the similar approach, MPG-specific CD8+ T cells were also purified from freshly generated bulk T cells after stimulated MPG neoepitope peptide-pulsed 135EBV-B cells. Genomic DNAs were isolated from the two purified neoantigen-specific T cell populations, as well as the parental bulk T cells, and used as templates for TCR β-chain CDR3 repertoire amplification. Since there is only one copy of genomic DNA in a single T cell, specific TCR β-chain sequence abundance can represent the frequency of neoantigen-specific T cells in the total T cell population. TCRβ CDR3 were then amplified with a set of primers targeting mature V-D-J rearranged T cells. After the amplicons (from PCDHB16-, MPG-specific T cells and bulk 135TIL) were subjected to deep-sequenced via next-generation sequencing (reads>100,000 per sample), followed by bioinformatic analysis and TCR repertoire alignment. Dysfunctional TCR sequences and redundant sequences in all three groups were removed. PCDHB16 and MPG neoantigen-specific TCR sequences (CDR3β VDJ) with top frequencies (>0.001%) were identified in the sequencing result and used to determine the relative frequencies of neoantigen-specific TCRs in the total T cell population (FIG. 10E). The relative TCR sequence frequencies that were identical to those in PCDHB16-specific T cells and MPG-specific T cells were determined in total 135TIL cell population (FIG. 10F). PCDHB16-specific TCRs accounted for 36% in the total 135TIL TCRs, which was close to 32% identified by intracellular staining and FACS analysis (FIG. 10B and FIG. 10F). MPG-specific TCRs accounted for 25.5% in the total TCR sequences in 135TIL, which is higher than 12.3% identified by intracellular staining (FIG. 10B and FIG. 10F). It should be noted that intracellular staining approach identifies the functional T cells, while TCR sequence analysis might contain both functional and non-functional TCRs. These results indicate that PCDHB16- and MPG-specific TCRs are dominant in the total TCR sequences, even though some TCRs with similar sequence may not function in T cell recognition assay. Importantly, it was shown that the same neoantigen-specific TCRs exhibited the strong diversity in the TCR VP CDR3 region (V-J segment) (FIG. 10F, FIGS. 11C and 11D). These results indicate that PCDHB16- and MPG-bound T cells maintain their TCR diversities, with the same neoantigen specificity. Taken together, different approaches (T cell clone analysis, intracellular staining as well as TCR profiling analysis) indicate immunodominant neoantigens in T cell response and recognition in cancer patients.


(5) Differential Recognition of Tumor Cells In Vitro and In Vivo by Neoantigen-Specific T Cells

To further understand the cause and impact of immunodominant neoantigens on immunotherapy, it was hypothesized that immunodominant neoantigens may be highly expressed and presented by all tumor cells, while cryptic (subdominant) neoantigens are expressed and presented by some, but not all tumor cells. To test this possibility, single tumor-cell clones of 135 mel were generated by limited dilution (0.3 cells/well) in 96-well plates. Each clone was expanded from one single 135 mel cell and then tested for their recognition by different neoantigen-specific T-cell clones. As expected, the PCDHB16-specific T-cell clone 1H1 recognized every single tumor cell clone that was generated, but other subdominant neoantigen-specific T-cell clones (such as 2C10 specific for PCDHB7 and JF6 specific for SPATA13) recognized only a few of single tumor cell clones (FIG. 12A). Similarly, PCDHB16-specific T-cell clone showed significant CD4+ T cell killing activity against 135 mel bulk and clones (FIG. 12B). By contrast, subdominant PCDHB7-specific T-cell clone showed its killing activity against the tumor clone that it recognized, but much weaker activity against bulk tumor cells or no activity against tumor cell clones it failed to recognize (FIG. 12B). Surprisingly, after genomic DNA were isolated from each tumor clone and sent to Sanger sequencing, all tumor clones showed genomic mutations in all three neoantigens same as the result of whole-exome sequencing [PCDHB16 (C>T), PCDHB7 (C>T), SPATA13 (A>G)], while the same sites in 135TIL showed wildtype sequences (FIG. 12C). This result indicated that the loss of T-cell recognition in different tumor clones was not derived from the existence of wildtype genes, but the dysfunctional antigen processing and presentation of mutated antigens.


To further demonstrate in vivo recognition of single tumor cell clones by different neoantigen-specific T cells, 135 mel cells were injected in NSG mice at day 0 and followed by treatment with adoptive transfer of different neoantigen-specific T cell clones (1H1 specific for PCDHB16 and 2C10 specific for PCDHB7) on day 15. Exogenous interleukin-2 was given on day 15 for three consecutive days to maintain T cell proliferation and survival in vivo (FIG. 12D). On day 19 (4 days after T-cell injection), the serum level of IFN-γ was examined and it was found that the tumor-bearing mice treated with dominant neoantigen PCDHB16-specific T-cells produced higher amounts of IFN-γ compared to those treated with PCDHB7-specific T cells (FIG. 12E), indicating strong in vivo recognition of 135 mel tumor cells by PCDHB16-specific T-cells, but weak recognition by PCDHB7-specific T cells. Consistent with these observations, it was found that PCDHB16-specific T cells markedly inhibited 135 mel tumor growth, but PCDHB7-specific T cells showed weak antitumor activity (FIG. 12F). These results indicate that the dominant neoantigen-specific T cells exhibit much stronger in vitro and in vivo tumor recognition and antitumor immunity than those subdominant neoantigen-specific T cells, thus providing a rational design for neoantigen-specific immunotherapy.


(6) Dual Recognition of Two Neoantigens by a Single TCR

Because of large TCR sequence diversity for the same neoantigen, it was noticed that a few tumor-reactive T-cell clones for the dominant neoantigen PCGHB16 had cross-reactivity against other neoantigens (ADIPOR2) (FIG. 10A). Further experiments showed that 135-4B8 T-cell clone recognized both PCDHB16 and ADIPOR2 neoantigen peptides in a dose-dependent manner (FIG. 13A and FIG. 14A). To exclude the possibility that the 135-4B8 T-cell clone contained more than one T-cell receptor (TCR) or mixed with two antigen-specific T cells, a TCR (α and β) gene was cloned from 135-4B8 T cells. Sequencing analysis revealed that these T cells expressed TRAV8-6 and TRBV5-1, indicating that 135-4B8 is a pure T-cell clone expressing a single TCR. Its full-length α and β chain of this TCR was then cloned into the retroviral vector pMSGV (FIG. 14B) and transduced bulk T cells isolated from a healthy donor. After co-culturing of TCR-engineered T cells with PCDHB16 and ADIPOR2 neoantigen peptides, it was shown that 4B8-TCR-transduced T cells were capable of recognizing both PCDHB16 and ADIPOR2 neoantigens (FIG. 13B). To determine whether 4B8-TCR mediates antigen recognition through CD4+ or CD8+ T cells, CD4+ and CD8+ naïve T cells were transduced with 4B8-TCR, and then tested their ability to recognize the target neoantigens. It was found that 4B8 TCR-transduced CD4+ T cells recognized both the PCDHB16 and ADIPOR2 antigens, whereas the 4B8-TCR transduced CD8+ T cells failed to do so (FIG. 13C).


The CD4+ T-cell clone 135-C76 was analyzed for dual recognition of the PCDHB16 neoantigen presented by HLA-DP4 molecules and RPN2 neoantigens presented by HLA-A1 molecules (FIG. 14C). Flow cytometric analysis revealed that 135-C76 T cells were CD4 and CD8 double positive (FIG. 13D). After cloning 135-C76 TCR (TRAV29, TRBV15), it was inserted into pMSGV retroviral vector to test its activity. It was found that 135-C76 TCR-transduced T cells containing CD8+ and CD4+ T-cell populations were capable of recognizing both mutated PCDHB16 and RPN2 neoantigens (FIG. 13D). However, further experiments revealed that 135-C76 TCR-transduced CD8+ T cells recognized the HLA-A1-restricted RPN2 neoantigen, but not the PCDHB16 neoantigen (FIG. 13E). By contrast, 135-C76 TCR-transduced CD4+ T cells recognized PCDHB16 neoantigen, but not the RPN2 neoantigen (FIG. 13E). These results indicate that 135-C76 TCR can recognize an MHC-I-restricted antigen by CD8+ T cells, and MHC-II-restricted neoantigen by CD4+ T cells.


(7) Knockdown of ThPOK in CD4+ 135TIL and Other CD4+ T Cells Increases their Cytolytic Activity In Vitro and In Vivo

CD8+ and CD4+ T cells are the major components of T cell-based antitumor immunity. CD8+ T cells, also known as cytotoxic T lymphocytes (CTL), are capable of specific antigen recognition presented by MHC-I molecules and have high cytotoxicity to kill target cells. CD4+ T cells also show specific antigen recognition presented by MHC-II molecules but much lower killing ability than CD8+ T cells. Although the data of T-cell clones of 135TIL show rather high cytolytic activity when targeting PCDHB16 neoantigen, in most cases CD4+ antigen-specific T cells have rare killing ability when they recognize target cells. Thus it will be a new challenge to reprogram CD4+ T cells and improve their cytolytic activity. The transcription factor T-helper-inducing POZ/Krueppel-like factor (ThPOK) and is a master regulator of CD4+ T cell development and negatively regulates the cytolytic ability of CD4+ T cells. After immigration from the thymus, naïve CD4+ T cells maintain the expression of ThPOK, indicating that ThPOK can still play important roles for the maintenance of CD4+ T cells phenotype and function.


To address this problem, single T cell clones were generated from 135 mel cancer cell-reactive CD4+ TIL via limited dilution. Cytotoxicity analysis of those T-cell clones were performed with expanded T-cell clones, showing different cytotoxic ability in each T cell clone (FIG. 15A). Next, the expression of ThPOK was checked in both mRNA and protein level in those CD4+ T-cell clones by quantitative PCR and western blotting (FIG. 15B and FIG. 15C). The result showed a negative association of ThPOK expression level of both mRNA and protein with CD4+ T cell's killing ability. ThPOK is highly expressed in weak cytotoxic CD4+ T cell clones (2F8) and vice versa (1H1), indicating CD4+ T cell's cytolytic activity was negatively regulated by ThPOK.


To abolish the expression of ThPOK in antigen-specific CD4+ human T cells, lentiviral shRNA plasmid from the human shRNA library was applied to degrade ThPOK mRNA generation. Western blotting analysis confirmed ThPOK was effectively knockdown 72 h after lentiviral transduction in CD4+135TILs (FIG. 16A). Then CD4+135TIL with or without ThPOK knockdown were compared in the LDH assay with E/T ratio from 5:1 to 20:1. The ThPOK-knockdown CD4+ TIL significantly had higher cytotoxicity against 135 mel at every E/T point (up to 55% which exceeds most normal CD4+ T cells, FIG. 16B). In another experiment, the number of live 135 mel cells incubating with ThPOK-knockdown CD4+ TIL decreased about 20% more than normal CD4+135TIL after three-day culture (FIG. 16C). The two results demonstrated a dramatic improvement of cytolytic activity in ThPOK-knockdown CD4+135TIL in vitro. The mRNA level of gene GZMB, PRF1, EOMES, RUNX3 and IFN-γ in ThPOK-knockdown CD4+135TIL was also examined, which are positive regulators for T cell cytotoxicity. The significant up-regulation of GzmB, Prf1 stands for a remarkable enhancement of T cell cytotoxicity (FIG. 16D). Although the mRNA level of IFN-γ was also up-regulated (FIG. 16D), a significant change in protein level was not found via ELISA (FIG. 16E). Furthermore, ThPOK knockdown did not affect other CD4+ T cell cytokines, such as IL-4, IL-10 and GM-CSF (FIG. 16E). Those results indicated that knockdown of ThPOK can increase CD4+ T-cell cytotoxicity but maintains their CD4+ property in vitro.


Based on the results in vitro, the 135 mel mouse model was applied to test knockdown of ThPOK in vivo. 135 mel tumor cells were inoculated into the NSG mice. CD4+135TIL modified by ThPOK shRNA lentiviruses were adoptively transferred on the 5th day after the tumor inoculation, together with unmodified CD4+135TIL and non-specific CD4+ T cells. From day 5 to day25, 135 mel growth in each group was observed. Tumor growth treated by ThPOK-knockdown CD4+135TIL was inhibited significantly, leaving the smallest tumor size, compared to unmodified CD4+135TIL and non-specific CD4+ T cells (FIG. 17A and FIG. 17B). To further validate in vivo function of ThPOK-knockdown CD4+ T cells, a CD4+ TCR-T model besides 135TIL was applied. A TCR targeting MHC-A2-presented NY-ESO-1 epitope was retrovirally transduced in normal human CD4+ and CD8+ T cells respectively. A human prostate cancer cell line PC3 expressing NY-ESO-1 and MHC-A2 (A2ESO) was subcutaneously inoculated in NSG mice and followed by injection of TCR-transduced or untransduced CD4+ and CD8+ T cells with or without ThPOK knockdown in a similar time manner as above. Interestingly it was found that the tumor suppression effect of ThPOK-knockdown TCR-transduced CD4+ T cells was very similar as that of TCR-transduced CD8+ T cells, which inhibited the tumor growth profoundly, while none-specific CD4+ T cells and ThPOK-expressing CD4+ TCR-T cells showed minor tumor suppression effect (FIG. 17C and FIG. 17D). All in vivo experiment confirmed knockdown of ThPOK in tumor-reactive CD4+ T cells improved T-cell antitumor activity and decelerated the tumor growth in NSG mice. Taken together, ThPOK is selectively expressed in CD4+ T cells while low level of ThPOK corresponds to an improved cytolytic activity of CD4+ T cells, which has been demonstrated by in vitro and in vivo experiments.


(8) ThPOK Inhibits CD4+ Tumor Specific T Cell Cytolytic Activity by Interacting with LSD1

It was hypothesized that the improvement killing ability of CD4+ T cells due to ThPOK knockdown is procedure of T-cell reprogramming via epigenetic regulation. To distinguish how ThPOK reprogrammed CD4+ T cells to cytolytic CD4+ T cells, a screening of A2ESO TCR-transduced CD4+ T cells treated by various epigenetic inhibitors was performed. After measuring the change of cytotoxicity of CD4+ TCR-T cells against tumor, it was found that 2-PCPA, the inhibitor of Lysine-specific histone demethylase 1A (LSD1) was the only one inhibitor which dramatically enhanced the cytolytic activity of CD4+ TCR-T cells, that was similar as depletion of ThPOK (FIG. 18A). To further investigate the correlation between ThPOK and LSD1, immunoprecipitation screening was performed by overexpression tagged proteins in HEK293T host cells with various epigenetic regulators and ThPOK. Suprisingly, LSD1 showed the strongest binding with ThPOK among all epigenetic regulators (FIG. 18B). The direct interaction between LSD1 and ThPOK was further confirmed by endogenous immunoprecipitation in primary human CD4+ T cells (FIG. 18C). Since LSD1 functions in regulating target gene expression by modifying their promoter histone methylation status, CHIP-PCR was performed by pulling down bound DNAs with ThPOK and LSD1 antibodies respectively in CD4+ and CD8+ T cells. Detection of high level of promoter regions of genes GZMB and PRF1 in CD4+ T cells revealed that ThPOK and LSD1 both had strong binding with the promoters of these two genes only in CD4+ T cells (FIG. 18D). The co-occupancy of ThPOK and LSD1 at the promoters of GZMB and PRF1 can block the transcription of these two the key cytotoxic genes and deprived the cytolytic activity of CD4+ cells.


(9) Inhibition of LSD1 by 2-PCPA Positively Regulates of the Generation of Stem Cell-Like Memory T Cells

To identify key regulators of the generation and maintenance of stem cell-like memory T cells (Tscm), an in vitro small molecules screening platform was established that works directly on ex vivo primary human T cells. Briefly, T cells were activated with plate-bound OKT3 and treated them with small molecules including different metabolites and inhibitors. Half volume of culture medium was changed, and metabolites or inhibitors were added every 2 or 3 days. The memory phenotypes of both CD4+ and CD8+ T cell populations were detected at day 12 through staining with key memory markers CD45RO, CCR7 and CD62L. Tscm subset in this platform was identified as CD45RO, CCR7+ and CD62L+ population by flow cytometry. In activated T cells that don't contain naïve T cells, these three markers were able to identify three memory T cells subsets, Tscm, Tcm and Tem (Lugli et al., 2013). Small molecules that were extremely toxic and dramatically suppressed T cells growth were excluded from this platform. With this method of T cells culture, T cells differentiated significantly and very limited percentage of Tscm cells in both CD4+ and CD8+ T cells can be obtained. 2-PCPA, an inhibitor of LSD1, was identified as the only one positive regulator of Tscm generation that increased the population of Tscm in both CD4+ and CD8+ T cells (FIG. 19A). To further confirm that the CD45RO CCR7+ CD62L+ population were truly Tscm cells, the other 2 Tscm markers, CD45RA and CD95 were analyzed within CD45RO−CCR7+ CD62L+ population and found that the cells were almost 100% positive in these 2 markers (FIG. 19B), validating the purity of Tscm population. In conclusion, it was shown that 2-PCPA was effective in promoting Tscm generation in both CD4+ and CD8+ T cells.


(10) Inhibition of LSD1 by 2-PCPA in TCR-T and CAR-T Cells Increases T-Cell Antitumor Activity In Vitro and In Vivo

To test how the inhibition of LSD1 by 2-PCPA treatment affects TCR-T cells, 2-PCPA (8 uM) was directly administered at the beginning of TCR-T cell generation. Activated human T cells were transduced on day 2 and day 3 with A2ESO TCR viruses that were packaged by PG13-A2ESO TCR clone. Results showed that 2-PCPA treatment increased both Tscm and Tcm percentages in A2ESO TCR-T cells, indicating a less-differentiated phenotype (FIG. 20A). To investigate the functional change of A2ESO TCR-T cells after 2-PCPA treatment, intracellular staining (ICS) was performed to check the expression of effector cytokines, targeting IL-2, IFN-γ and TNF-α. Interestingly, although 2-PCPA treatment maintained A2ESO TCR-T cells in a less-differentiated phenotype, it didn't reduce but slightly increased the frequency of A2ESO TCR-T cells with polyfunctional cytokine secretion (FIG. 20B). As high proliferation potential is a key characteristic of Tscm cells, the in vitro proliferation assay by using CFSE staining and by stimulating A2ESO TCR-T cells with IL-2 or 231-ESO cells confirmed this feature of 2-PCPA treated A2ESO TCR-T cells (FIG. 20C). Another key feature of less-differentiated T cell is increased oxidative metabolism and mitochondrial respiratory capacity. It was also reported that LSD1-inhibition upregulated oxidative metabolism genes in myogenic cells (Anan et al., 2018). The results also showed that 2-PCPA treatment rendered A2ESO TCR-T cells with increased oxidative metabolism and elevated spare respiratory capacity (SRC) (FIG. 20D).


The results of 2-PCPA-treated A2ESO TCR-T cells in vitro showed less-differentiated memory phenotype and increased function and proliferation, an attempt was made to test to in vivo effect of 2-PCPA-treated A2ESO TCR-T cells. A breast cancer model was established using MDA-MB-231 cells expressing HLA-A2 and NY-ESO-1 antigen in 8 weeks old NOD-scid IL2Rγnull (NSG) mice. The in vivo anti-tumor function of A2ESO TCR-T cells was significantly enhanced by 2-PCPA treatment (FIG. 20E). Through determining the memory phenotype of T cells after adoptive transfer, it was found that Tscm population was barely detectable in control A2ESO TCR-T cells, while 2-PCPA treated A2ESO TCR-T cells maintained higher frequency of both Tscm and Tcm cells (FIG. 14F). These results indicated that 2-PCPA treatment promotes Tscm generation in A2ESO TCR-T cells with higher proliferation potential and self-renewal ability.


To determine whether 2-PCPA treatment was also effective on CAR-T cells as well as TCR-T cells, 2-PCPA treatment was applied when generating CAR-T cells. Activated T cells were transduced with CD19-specific chimeric antigen receptor (CAR) possessing CD28 co-stimulatory domain virus packaged by PG13 clone. Memory phenotype analysis showed that 2-PCPA treatment similarly increased the frequencies of less-differentiated memory T cells subsets (FIG. 21A). The metabolic reprogramming effect is consistent with its effect on A2ESO TCR-T cells (FIG. 21B). In vitro survival assay showed that 2-PCPA treated CAR-T cells survived better under IL-2 free condition than control CAR-T cells (FIG. 21C). However, this effect was not seen on 2-PCPA-treated A2ESO TCR-T cells. Next, in vivo proliferation of CAR-T cells was observed with or without 2-PCPA treatment. Both CAR and luciferase were transduced into activated T cells and transfer into NSG mice that have been injected with Raji lymphoma cells. In vivo imaging showed that 2-PCPA-treated CAR-T cells grew faster than control CAR-T cells (FIG. 21D). To further confirmed the results from in vivo imaging, the mice were sacrificed and checked the existence of human T cells within the spleen (FIG. 21E). These results demonstrated that 2-PCPA treatment promoted both in vitro and in vivo antitumor function of CAR-T cells.


b) DISCUSSION


The findings presented here identify a distinct pattern of T-cell reactivity against mutation-derived neoantigen; that is, a dominant neoantigen was recognized by the majority of tumor-reactive CD4+ or CD8+ T cells, while several other neoantigens were infrequently recognized by T cells. Interestingly, in the case of patient #135 it was shown that large majority of the tumor-reactive CD4+ or CD8+ TIL population/T-cell clones recognized neoantigens derived from somatic missense mutations respectively which was demonstrated via FACS and TCR sequencing analysis. Importantly, dominant neoantigen-specific T cells show better recognition and killing ability of single tumor cell clones, compared with subdominant/cryptic neoantigen-specific T-cell clones, implying that dominant neoantigen-specific T cells can play a critical role in tumor regression in checkpoint immunotherapy. Although two tumor-reactive CD4+ T-cell clones did not recognize mutation-derived neoantigens, they did respond to neoepitopes derived from the gene product of the 3′UTR region of ADIPOR2 mRNA or translated from an alternative ORF of the LAGE1b gene. Thus, gene products derived from aberrant translation of an mRNA can serve as neoantigens in cancer cells. By contrast, it was found that all tumor-reactive CD8+ T-cell clones recognized neoantigens derived from somatic missense mutations. Further investigation of neoantigens in another melanoma patient (patient #136) revealed the similar pattern of neoantigen recognition as seen with melanoma patient #135, indicating that although neoantigens capable of eliciting a specific T-cell response can be different among cancer patients, the regulation and control of neoantigen-specific response and recognition pattern appear consistent among patients within the same type of cancer.


Immunodominant and cryptic antigens have been reported in autoimmunity and infectious diseases, the study indicates that similar mechanisms can operate in response to cancer neoantigens. In this study, tumor clones growing from single cells were generated and checked the specific neoantigen presentation on tumor clones by recognition of T-cell clones. The data confirms that the presentation level of neoantigens varies in one tumor tissue and there are normally a few neoantigens presented by most of tumor cells, inducing the expansion of their specific T cells to the largest population in primary tumor-reactive T cells. These dominant neoantigens are ideal targets for T-cell based immunotherapy and contribute more in tumor suppression and elimination rather than those subdominant neoantigens, which has been demonstrated by the in vitro and in vivo study.


A single TCR has been reported to recognize multiple antigens in infectious and autoimmune diseases, but it has not demonstrated that a single TCR can recognize dual cancer neoantigens. By taking advantage of tumor-reactive single T-cell clones, provided herein is clear evidence that a single TCR can recognize two neoantigens presented by the same MHC II molecules, without amino acid sequence similarity. Surprisingly, a single TCR from 135-C76 T cells recognized the PCDHB16 neoantigen presented by HLA-DR4 molecules when transduced into CD4+ T cells, while the same TCR responded only to the RPN2 neoantigens presented by HLA-A1 molecules when expressed on CD8+ T cells. Hence, the use of TCRs with dual neoantigen specificity can enhance the potency of T cells used in TCR-based cancer immunotherapy.

    • c) Methods
    • (1) Human Samples


Fresh tumor tissues were obtained from melanoma patients (#135 and #136) under an approved Institutional Review Board (IRB) protocol of Houston Methodist. Both patients were HLA typed (#135: HLA-A1, B8, C7, DR3, DR4, DP4; #136: HLA-A1, B8, C7, DR1, DR11, DP4).


(2) Primary and Cell Lines


All cells are grown at 37° C. with 5% CO2 unless otherwise stated. Patients' melanoma cell lines (135 mel and 136 mel) were generated and expanded in RPMI 1640 medium (Fisher scientific) with 10% (v/v) fetal bovine serum (Valley Biomedical Inc.) supplemented with 1% (v/v) penicillin-streptomycin (Thermo Fisher Scientific).


All types of human T-cell containing cells, including peripheral blood mononuclear cell (PBMCs), tumor-infiltrated lymphocytes (135TIL and 136TIL), T-cell clones etc. were established and maintained in T-cell culture medium (TCM) containing RPMI 1640 medium with 10% (v/v) human AB serum (Corning) supplemented with 1% (v/v) HEPES (Thermo Fisher Scientific), 1% (v/v) GlutaMAX (Thermo Fisher Scientific) and 0.1% (v/v) 2-mercaptoethanol (Thermo Fisher Scientific)) supplemented with recombinant human interleukin-2 (IL-2, 300 IU/ml) (Peprotech).


Epstein-Barr virus-transformed B (EBV-B) cells derived from patient #135 were cultured in RPMI 1640 medium with 10% (v/v) fetal bovine serum supplemented with 1% (v/v) penicillin-streptomycin.


Fibroblasts derived from patient #136 were cultured in Dulbecco's Modified Eagle Medium (DMEM) (Fisher Scientific) with 10% (v/v) fetal bovine serum (FBS) containing 1% penicillin-streptomycin.


HEK293-based antigen presenting cells (APCs), including 293IMDR3/DP4, 293IMDR4/DP4, 293IMDR1/DP4, 293IMDR11/DP4 were generated, and maintained in Dulbecco's DMEM with 10% (v/v) FBS containing 1% (v/v) penicillin-streptomycin. HEK293T cells, 239T-HLA-A1 and 293T-HLA-B8 cells were maintained in DMEM with 10% (v/v) fetal bovine serum containing 1% (v/v) penicillin-streptomycin.


Cos-7 cells were cultured in DMEM with 10% (v/v) fetal bovine serum containing 1% (v/v) penicillin-streptomycin.


Phoenix-AMPHO cells were cultured in DMEM with 10% (v/v) fetal bovine serum containing 1% (v/v) penicillin-streptomycin.


(3) T-Cell Cloning and T-Cell Expansion


T-cell cloning and expansion were performed. Briefly, T-cell clones from 135TILs were generated by a limited dilution (0.3 cell/well) in 96-well plates. Allogeneic PBMCs from normal donors were irradiated (60 Gy) and then seeded as feeder cells (7×104/well) in TCM containing 35 ng/ml OKT3 (anti-hCD3 antibody) (R&D Systems) and 35 ng/ml anti-hCD28 antibody (R&D Systems). On day 1, recombinant IL-2 was added to the final concentration of 300 IU/ml. On day 5, half of the culture medium was replaced with fresh TCM containing recombinant IL-2, and then the medium was changed every other day or when the medium color turned yellow. On day 14, T-cell clones were harvested, counted, tested for tumor reactivity. Tumor reactive T-cell clones were further expanded.


For T-cell expansion, 2.5×105 T cells were transferred into 25 ml flasks with 20 ml of TCM. 2×107 irradiated PBMCs (60 Gy) and 5×106 irradiated EBV-B cells (210 Gy) were added as feeder cells. OKT3 and anti-hCD28 antibody were added to the final concentration of 35 ng/ml. On day 1, recombinant interleukin-2 was added to the final concentration of 300 IU/ml. Culture medium was changed, as described above, starting on day 5. On day 14, expanded T cells were harvested, counted, and restored in liquid nitrogen until use.


(4) T-Cell Reactivity Assay 96-well tissue culture plates were coated with phosphate-buffered saline (PBS, GenDEPOT) containing 50 μg/ml poly-L-lysine (Millipore Sigma) at 37° C. 10 min later, the plates were rinsed twice with PBS. Tumor cells or 293-APCs were seeded at 5×104/well in the 96-well plate and cultured at 37° C., 5% CO2 for at least 4 h. For plasmid transfection, 250 ng of plasmid DNA were transfected into 293-APCs per well with 0.5 μl of Lipofectamine 2000 (Thermo Fisher Scientific) according to the manufacturer's instructions and incubated at 37° C., in 5% CO2 overnight. For peptide presentation, synthesized peptides (GeneScript) were added to each well at different final concentrations from 102 to 10−4 μM and incubated at 37° C., 5% CO2. The cells transfected with plasmids or pulsed with peptides were washed with TCM twice, and then cocultured with T cells (1×105/well). After overnight culture, 50 μl of supernatant was collected for measuring cytokine release with an ELISA assay.


96-well microplate (Greiner Bio-one) was coated with 50 μl of PBS containing anti-human IFN-γ (Thermo Fisher Scientific, primary antibody, 1 μg/ml) per well and incubated at 4° C. overnight. After three washes with PBS, each well was blocked by 200 μl of 1% (w/v) bovine serum albumin (BSA) in PBS and incubated at room temperature for at least 1 h. After three washes with PBS, 50 μl of culture supernatant or human IFN-γ standard (Peprotech) was added to each well at different concentrations in PBS and incubated at room temperature for 1 h. After three washes, 50 μl of biotin-labeled anti-human IFN-γ antibody (Thermo Fisher Scientific, secondary antibody, 1:1000 dilution) in 2% (v/v) FBS-containing PBS buffer was added to each well, and then incubated at room temperature for 1 h. The 96-well plate was washed six times with PBS containing 1% (v/v) Tween-20. 50 μl of 1:5000 diluted poly-HRP Streptavidin (Thermo Fisher Scientific) in PBS containing 1% (w/v) BSA was then added per well, and incubated in the dark at room temperature for 30 min. After six washes with PBS containing 1% (v/v) Tween-20, 100 μl of 3,3′,5,5′-tetramethylbenzidine substrate solution (TMB, Millipore Sigma) was added per well and incubated at room temperature for 10 min. The color reaction was terminated by adding 50 μl of 2.5 M H2504 The plate was read at OD450 nm. The concentration of IFN-γ was calculated by standard reading.


(5) Whole-Exome Sequencing of Tumor and Control Cells


Whole-exome sequencing was performed by a standard protocol. Briefly, genomic DNA from 135 mel and 135TILs were extracted with the Quick-DNA™ Miniprep Kit (Zymo Research). DNA library construction, exome capture, and sequencing were performed by the standard procedure of Illumina HiSeq 3000. The average reads were around 150× of the coverage on each base. Over 8 billion bases of sequence data were obtained from each sequenced sample. Output from Illumina was processed to yield BAM files, which contained aligned reads to the NCBI Human Reference Genome Build hg19 with well-calibrated quality scores. Somatic mutations were detected referred to previous reports and determined after filtering with the following parameters: filtering out variants in TIL sequencing; read depth >10; 1000 human genome frequency <0.01; missense mutation. Finally, somatic mutations were identified for the tandem minigene construction.


(6) Construction of Tandem Minigenes (TMGs) and Screening with T Cells

The tandem minigene construct was designed to encode 10 mutations-containing peptides with 12 amino acids flanked on both sides. The codons for double-strained nucleotide synthesis were optimized to avoid EcoRI and XhoI sites for cloning. The synthesized tandem minigenes (Integrated DNA Technologies) were then cloned into pTSX expression vector with in-frame fusion with the Ii80 open reading frame.


The screening of antigen-specific T cells against TMGs were performed as described in T-cell reactivity assay above. Briefly, TMGs were transfected with Lipofectamine 2000 in the artificial APCs according to the patient's HLA typing respectively. The transfected cells were cocultured with tumor-reactive T cells overnight and the IFN-γ release in the supernatant in each well was determined by ELISA on the second day.


(7) In Vitro Transcription of TMGs

In vitro transcription (IVT) of TMGs was performed with a standard protocol of HiScribe™ T7 ARCA mRNA Kit (New England Biolabs). Briefly, each TMG construct was linearized by cutting of XhoI (New England Biolabs) and gel-purified with DNA Recovery Kit (Zymo Research). The linearized DNA (1 μg) were used as template for RNA synthesis respectively and mixed with T7 RNA polymerase and reaction buffer in a 20 μl mixture as the kit instruction. The reaction was incubated at 37° C. for at least 30 min. Then the DNA templates were removed by adding 2 μl of DNase I and incubating at 37° C. for 15 min. Next the poly (A) tailing reaction (50 μl) was completed by mixing the IVT reaction with Poly(A) polymerase and reaction buffer as the kit instruction and incubating at 37° C. for 30 min.


Each 5′-capped and 3′-tailed WT-RNAs were purified by adding 1/2 volume of LiCl to the reaction as the kit instruction. After the incubation at −20° C. for 30 min, the RNAs were spun down by at 4° C. for 15 min at top speed and then the pellets were washed with 500 μl of cold 70% ethanol and air-dried. The pellets were finally resuspended and dissolved in suitable RNA storage solution.


(8) Electroporation of EBV-B Cells and Screening with T Cells

The electroporation was performed as with a modified protocol of Amaxa® Cell Line Nucleofector® Kit V (Lonza). Briefly 1×105 EBV-B cells were spun down and resuspended in 50 μl of electroporation solution (Nucleofector® Solution:Supplement=9:2) and transferred into a kit-attached 100 μl cuvette immediately. 500 ng of IVT-RNA of TMGs were added to the cuvette respectively. The electroporation was completed by placing the cuvette in the Nucleofector™ 2b device and running a pre-set program (T-020).


The electroporated EBV-B cells were cultured in grow medium at 37° C. overnight and then cocultured with tumor-reactive T cells as described above. The IFN-γ release in the supernatant in each well was determined by ELISA on the second day.


(9) Ii80-cDNA Library Construction and Screening with T Cells

For cDNA library construction and screening, total RNAs of 135 mel tumor cells were extracted with TRIzol reagent (Thermo Fisher Scientific) according to the manufacturer's instructions. mRNA was further purified from total RNAs, and converted into cDNA using a cDNA construction kit (Thermo Fisher Scientific) with an oligo-dT primer. After ligation with adaptors, cDNAs were cloned into a pTSX vector containing an Ii80 fragment upstream of the cloning sites, and then transformed into Stbl3™ Escherichia coli bacteria (Thermo Fisher Scientific). The cDNA bacterial library was divided into pools with each consisted of approximately 100 cDNA clones. Each plasmid pool was isolated from bacteria and transfected into 293-based APCs per well with 0.5 μl of Lipofectamine 2000 and incubated at 37° C., 5% CO2 overnight. After gentle washes, transfected cells were cocultured with T-cell clone (1×105/well) overnight. The positive pools were identified based on cytokine release from T cells by ELISA. Bacterial colonies transformed with the positive pool plasmid DNA were picked and prepared for DNA isolation. Several hundreds of individual colonies were repeated for screening with T cells. The positive clones were sequenced to identify target antigens recognized by T cells.


(10) Intracellular Staining of IFN-γ

T cells were stimulated by various in vitro methods, such as tumor cells, peptides or DNA encoding neoantigens, etc. and induced to release cytokines. Stimulated T cells were resuspended in 100 μl of Cytofix/Cytoperm solution (BD Biosciences) per well for microwell plates (or 250 μl for tubes) and incubated for 20 min at 4° C. for cell fixation. Fixed cells were permeabilized by washing twice in 1× Perm/Wash buffer (BD Biosciences). Then the fixed/permeabilized cells were thoroughly resuspended in 50 μl of 1× Perm/Wash buffer containing 1 μl of phycoerythrin (PE)-conjugated IFN-γ antibody (Thermo Fisher Scientific) and incubate at 4° C. for 30 min in the dark. After that the cells were washed twice with 1× Perm/Wash buffer and ready for flow cytometric analysis.


(11) Live-Cell IFN-γ Staining

Cytokine Secretion Assay-Detection Kit (MACS Miltenyi Biotec) was used in cell-surface IFN-γ staining based on a commercial protocol. Briefly 1×106 T cells were washed with growth medium and spun down at 300×g for 10 min and then resuspended in 100 μl of grow medium containing 5% human serum. Washed cells were transferred in one well of a 96-well plate and added with antigenic peptides to the concentration of 1-10m/ml and incubated for 3-6 h at 37° C. in 5% CO2. After in vitro stimulation of T cells, they were transferred to 15 ml closable tube and washed with 1 ml of cold buffer (cold PBS containing 2% FBS (v/v) and 5 mM EDTA) and spun down. The cell pellets were resuspended in 90 μl of cold growth medium and added with 10 μl of IFN-γ Catch Reagent and incubated for 5 min on ice. After that, 1 ml of warm growth medium (37° C.) was added in the tube and the cells were incubated in closed tube for 45 min at 37° C. under slow continuous rotation. The tube was transferred on ice after incubation and the cells were washed with cold buffer and spun down. The cell pellets were resuspended in 90 μl of cold buffer and then added with 10 μl of IFN-γ Detection Antibody and incubated for 10 min on ice. Finally cells were washed with cold buffer and ready for flow cytometric analysis.


(12) Tumor Cell Cloning

135 mel tumor cells were suspended in 1× Keratinocyte-SFM and supplements (Thermo Fisher Scientific) with additional 10% (v/v) fetal bovine serum and 1% (v/v) penicillin-streptomycin at the density of 1.5 cell/ml after trypsinization, then 200 μl of mixture was added per well to 96-well plates and incubated at 37° C., 5% CO2. On day 5, wells containing a single colony were marked after observation under the microscope. On day 10, the medium was changed with fresh culture medium. The plates were incubated until day 20. The tumor clones in marked wells were trypsinized, and then expanded in larger flasks separately.


(13) Cytotoxicity Assay

CytoTox 96® Non-Radioactive Cytotoxicity Assay Kit (Promega) was applied to measure T cell —tumor cytotoxicity. Briefly, T cells (effectors) and tumor cells (targets) were mixed in TCM at the E/T ratio from 2:1 to 20:1, then cocultured at 100 μl/well in a 96-well plate at 37° C., 5% CO2 for at least 4 h. After the incubation, lactate dehydrogenase (LDH) concentration in the supernatant of each well was measured with the Kit following the manufacturer's instructions and the cytotoxicity percentage was calculated.


(14) In Vivo Adoptive T Cell-Based Antitumor Assay

NSG mice (female) were 6-8 weeks of age, which were obtained from Jackson Laboratories. All NSG mice-related experiments were performed in animal housing facilities under specific pathogen-free conditions at Houston Methodist. All animal studies were performed as instructed by the NIH guidelines and approved by the Animal Care and Use Committee of Houston Methodist.


Subcutaneous injection of 135 mel tumor lines in NSG mice was performed to construct an animal model for adoptive T cell-based therapy (4 mice in each group). Briefly, 3×106 135 mel cells were injected on day 0 per mouse. On day 15, different neoantigen-specific T cell clones were intravenously injected at 107 T cells per mouse, followed by the intraperitoneal injection of 5×104 unit of interleukin-2 per mouse on day 15, 16 and 17. The serum level of IFN-γ in each tumor-bearing mouse was collected on day 19 and measured via ELISA. The growth of 135 mel in each mouse was tracked until day 40.


(15) TCR β CDR3 Repertoire Analysis

The TCR β CDR3 analysis were referred to a previous description (Robins et al., 2009). Neoantigen-specific T cells were isolated after the flow cytometric sorting of peptide-stimulated TIL bulk by live-cell IFN-γ staining. Genomic DNAs were isolated from sorted T cells with Quick-DNA™ Miniprep Kit (Zymo Research) according to a commercial protocol. The isolated genomic DNAs were used as templates for PCR amplification of rearranged TCR β CDR3 in mature T cells. A 50 μl PCR mixture was set up with 16 ng/μl genomic DNA templates, 1 μM of forward BV primer sets and 1 μM of reverse BJ primer sets (Table 5), 10× reaction buffer and 0.2 μl of AccuPrime Taq DNA Polymerase HF (Thermo Fisher Scientific). The PCR was performed for 35 cycles (95° C. for 30 s, 59° C. for 30 s and 68° C. for 1 min). The PCR amplicons were gel-purified by DNA Recovery Kit (Zymo Research) and sent for amplicon next-generation sequencing (Genewiz). The sequencing result was analyzed via a laboratory pipeline for data filtering and TCR repertoire alignment.









TABLE 5







Primers for TCR β CDR3 amplification









PRIMER
PRIMER



SETS
NAME
SEQUENCE (5′→3′)





BV-F
TRBV2
TCAAATTTCACTCTGAAGATCCGGTCCACAA




(SEQ ID NO: 747)



TRBV3-1
GCTCACTTAAATCTTCACATCAATTCCCTGG 748



TRBV4-1
CTTAAACCTTCACCTACACGCCCTGC 749



TRBV4-2, 4-3
CTTATTCCTTCACCTACACACCCTGC 750



TRBV5-1
GCTCTGAGATGAATGTGAGCACCTTG 751



TRBV5-3
GCTCTGAGATGAATGTGAGTGCCTTG 752



TRBV5-4, 5-5, 5-6,
GCTCTGAGCTGAATGTGAACGCCTTG 753



5-7, 5-8




TRBV6-1
TCGCTCAGGCTGGAGTCGGCTG 754



TRBV6-2, 6-3
GCTGGGGTTGGAGTCGGCTG 755



TRBV6-4
CCCTCACGTTGGCGTCTGCTG 756



TRBV6-5
GCTCAGGCTGCTGTCGGCTG 757



TRBV6-6
CGCTCAGGCTGGAGTTGGCTG 758



TRBV6-7
CCCCTCAAGCTGGAGTCAGCTG 759



TRBV6-8
CACTCAGGCTGGTGTCGGCTG 760



TRBV6-9
CGCTCAGGCTGGAGTCAGCTG 761



TRBV7-1
CCACTCTGAAGTTCCAGCGCACAC 762



TRBV7-2
CACTCTGACGATCCAGCGCACAC 763



TRBV7-3
CTCTACTCTGAAGATCCAGCGCACAG 764



TRBV7-4
CCACTCTGAAGATCCAGCGCACAG 765



TRBV7-6
CACTCTGACGATCCAGCGCACAG 766



TRBV7-7
CCACTCTGACGATTCAGCGCACAG 767



TRBV7-8
CCACTCTGAAGATCCAGCGCACAC 768



TRBV7-9
CACCTTGGAGATCCAGCGCACAG 769



TRBV9
GCACTCTGAACTAAACCTGAGCTCTCTG 770



TRBV10-1
CCCCTCACTCTGGAGTCTGCTG 771



TRBV10-2
CCCCCTCACTCTGGAGTCAGCTA 772



TRBV10-3
CCTCCTCACTCTGGAGTCCGCTA 773



TRBV11-1, 11-3
CCACTCTCAAGATCCAGCCTGCAG 774



TRBV11-2
CTCCACTCTCAAGATCCAGCCTGCAA 775



TRBV12-3, 12-4,
CCACTCTGAAGATCCAGCCCTCAG 776



12-5




TRBV13
CATTCTGAACTGAACATGAGCTCCTTGG 777



TRBV14
CTACTCTGAAGGTGCAGCCTGCAG 778



TRBV15
GATAACTTCCAATCCAGGAGGCCGAACA 779



TRBV16
CTGTAGCCTTGAGATCCAGGCTACGA 780



TRBV17
CTTCCACGCTGAAGATCCATCCCG 781



TRBV18
GCATCCTGAGGATCCAGCAGGTAG 782



TRBV19
CCTCTCACTGTGACATCGGCCC 783



TRBV20-1
CTTGTCCACTCTGACAGTGACCAGTG 784



TRBV23-1
CAGCCTGGCAATCCTGTCCTCAG 785



TRBV24-1
CTCCCTGTCCCTAGAGTCTGCCAT 786



TRBV25-1
CCCTGACCCTGGAGTCTGCCA 787



TRBV27
CCCTGATCCTGGAGTCGCCCA 788



TRBV28
CTCCCTGATTCTGGAGTCCGCCA 789



TRBV29-1
CTAACATTCTCAACTCTGACTGTGAGCAACA 790



TRBV30
CGGCAGTTCATCCTGAGTTCTAAGAAGC 791





BJ-R
TRBJ1-1
TCACGAAGGTCCCCAGTATCCAACTTTACCTACAACT




GTGAGTCTGGTGCCTTGTCCAAA 792



TRBJ1-2
TCACGAAGGTCCCCAGTATCCAACTACCTACAACGGT




TAACCTGGTCCCCGAACCGAA 793



TRBJ1-3
TCACGAAGGTCCCCAGTATCCAACTACCTACAACAGT




GAGCCAACTTCCCTCTCCAAA 794



TRBJ1-4
TCACGAAGGTCCCCAGTATCCAACTCCAAGACAGAG




AGCTGGGTTCCACTGCCAAA 795



TRBJ1-6
TCACGAAGGTCCCCAGTATCCAACTCTGTCACAGTGA




GCCTGGTCCCGTTCCCAAA 796



TRBJ2-1
TCACGAAGGTCCCCAGTATCCAACTCGGTGAGCCGTG




TCCCTGGCCCGAA 797



TRBJ2-2
TCACGAAGGTCCCCAGTATCCAACTCCAGTACGGTCA




GCCTAGAGCCTTCTCCAAA 798



TRBJ2-3
TCACGAAGGTCCCCAGTATCCAACTACTGTCAGCCGG




GTGCCTGGGCCAAA 799



TRBJ2-4
TCACGAAGGTCCCCAGTATCCAACTAGAGCCGGGTCC




CGGCGCCGAA 800



TRBJ2-5
TCACGAAGGTCCCCAGTATCCAACTGGAGCCGCGTGC




CTGGCCCGAA 801



TRBJ2-6
TCACGAAGGTCCCCAGTATCCAACTGTCAGCCTGCTG




CCGGCCCCGAA 802



TRBJ2-7
TCACGAAGGTCCCCAGTATCCAACTGTGAGCCTGGTG




CCCGGCCCGAA 803
















TABLE 6







Amino acid sequences of V-J joint in TCR β CDR3 of dominant


neoantigen-specific T cells in 135TILs









TRBV
Paired V-D-J sequence
TRBJ











REPERTOIRE
V segment
Joint (-NDN-)
J segment
repertoire










Amino acid sequences (CDR3 β) of potentially functional


PCDHB16-reactive TCRs











TRBV6-2
TSVYFCAS
SYDTGSGDYT
DTQYFGPGTR
TRBJ2-3



SAVYLCAS
SLGAW
GYTFGSGTR
TRBJ1-2





TRBV7-2
SAVYLCAS
IGVIGG
NEQFFGPGTR
TRBJ2-1



SAVYLCAS
SLGP
NEQFFGPGTR
TRBJ2-1





TRBV7-6
SAMYRCAS
SLHRAD
GANVLTFGPGTR
TRBJ2-6





TRBV7-9
SAMYLCAS
SQRGTE
NEKLFFWQWNP
TRBJ1-4





TRBV10-2
TSVYFCAS
SGTLGYN
SPLHFGNGTR
TRBJ1-6





TRBV11-2
SAVYLCAS
SPTSGP
DTQYFGPGTR
TRBJ2-3





TRBV12-3
SAVYFCAS
SFPLA
GELFFGEGSR
TRBJ2-2





TRBV19
TAFYLCAS
TTTSGTY
NEQFFGPGTR
TRBJ2-1



TAFYLCAS
SSWVY
GYTFGSGTR
TRBJ1-2





TRBV25-1
TSQYLCAS
PKARTGD
YEQYFGPGTR
TRBJ2-7



TSLYFCAS
TPVGQGIW
SPLHFGNGTR
TRBJ1-6





TRBV27
TSLYFCAS
SSISTGYWT
EAFFGQGTR
TRBJ1-1



TSLYFCAS
SFRTGGGGSY
GYTFGSGTR
TRBJ1-2










Amino acid sequences (CDR3 β) of potentially functional


MPG-reactive TCRs











TRBV5-4
SALYLCAS
STRLAGYT
DTQYFGPGTR
TRBJ2-3





TRBV5-8
SALYLCAS
SPREDGT
DTQYFGPGTR
TRBJ2-3





TRBV6-1
TSVYFCAS
SDRGSG
NTIYFGEGSW
TRBJ1-3





TRBV6-3
TSVYFCAS
SYDTGSGDYT
DTQYFGPGTR
TRBJ2-3





TRBV6-4
TSVYFCAS
RNRDRDT
EAFFGQGTR
TRBJ1-1



TSVYFCAS
RNRDWDT
EAFFGQGTR
TRBJ1-1





TRBV7-9
SAMYLCAS
SPRQGDQM
NTQYFGPGTR
TRBJ2-3





TRBV19
TAFYLCAS
RDRQGDT
EAFFGQGTR
TRBJ1-1





TRBV25-1
TSQYLCAS
SENPNSG
NTIYFGEGSW
TRBJ1-3





TRBV28
TSMYLCAS
RSAPL
EAFFGQGTR
TRBJ1-1





V segment, Joint (-NDN-) and J segment are sections of a single contiguous sequence. Each sequence is depicted here to emphasize the different segments.






(16) TCRα/β Chain Variable Regions and Full-Length TCRα/β Chain Cloning

TCRα/β chain variable regions (CDR3) were amplified from a single T cell by three rounds of nested PCR. The first one-step RT-PCR was performed with the OneStep RT-PCR kit (Qiagen) in a 25 μl reaction and the following primers (Table 7): mixα1 (38 primers mixed, 10 μM each) and pCal (TCRα), mixβ1 (36 primers mixed, 10 μM each) and pCβ1 (TCRβ). 2 μl of PBS containing at least one T cell was added as the DNA template. The one-step RT-PCR was performed by reverse transcription (50° C., 30 min) followed by PCR for 25 cycles (94° C. for 30 s, 62° C. for 1 min, and 72° C. for 1 min), according to the manufacturer's instructions. The second and third PCR reactions were performed with KOD Hot Start DNA polymerase (Millipore Sigma) following the manufacturer's instructions. One μl of the first-round PCR product was added as a template for the second-round PCR in a 25 μl reaction. The second-round PCR was performed with the following primers (Table 7): mixα2 (36 primers mixed, 10 μM each) and pCα2 (TCRα), mixβ2 (36 primers mixed, 10 μM each) and pCβ2 (TCRβ) for 25 cycles (94° C. for 30 s, 64° C. for 1 min, and 70° C. for 1 min). One μl of the second-round PCR product was added as a template for the third-round PCR in a 25 μl reaction. The third-round PCR was performed with the following primers (Table 7): pF3 and pCα3 (TCRα), pF3 and pCβ3 (TCRβ), for 35 cycles (94° C. for 30 s, 64° C. for 1 min, and 70° C. for 1 min). The final PCR amplicons were gel-purified (Zymo Research) and sequenced (Genewiz). The genotypes of TCRs were determined by search with the online tools V-QUEST (IMGT) and Igblast (NCBI).









TABLE 7







Primers for TCRα/β chain variable regions (CDR3) amplification










Primer Name
Sequence (5′→3′)













1st round of
mixα1
TRAV1
CTG CAC GTA CCA GAC ATC TGG GTT


nested PCR


804




TRAV2
GGC TCA AAG CCT TCT CAG CAG G





805




TRAV3
GGA TAA CCT GGT TAA AGG CAG CTA





806




TRAV4
GGA TAC AAG ACA AAA GTT ACA AAC





GA 807




TRAV5
GCT GAC GTA TAT TTT TTC AAA TAT





GGA 808




TRAV6
GGA AGA GGC CCT GTT TTC TTG CT





809




TRAV7
GCT GGA TAT GAG AAG CAG AAA GGA





810




TRAV8
AGG ACT CCA GCT TCT CCT GAA GTA





811




TRAV9
GTA TGT CCA ATA TCC TGG AGA AGG T





812




TRAV10
CAG TCA GAA CAC AAA GTC GAA CGG





813




TRAV12.1
CCT AAG TTG CTG ATG TCC GTA TAC





814




TRAV12.2
GGG AAA AGC CCT GAG TTG ATA ATG T





815




TRAV12.3
GCT GAT GTA CAC ATA CTC CAG TGG





816




TRAV13.1
CCC TTG GTA TAA GCA AGA ACT TGG





817




TRAV13.2
CCT CAA TTC ATT ATA GAC ATT CGT TC





818




TRAV14
GCA AAA TGC AAC AGA AGG TCG CTA





819




TRAV16
TAG AGA GAG CAT CAA AGG CTT CAC





820




TRAV17
CGT TCA AAT GAA AGA GAG AAA CAC AG





821




TRAV18
CCT GAA AAG TTC AGA AAA CCA GGA G





822




TRAV19
GGT CGG TAT TCT TGG AAC TTC CAG





823




TRAV20
GCT GGG GAA GAA AAG GAG AAA GAA A





824




TRAV21
GTC AGA GAG AGC AAA CAA GTG GAA





825




TRAV22
GGA CAA AAC AGA ATG GAA GAT TAA GC





826




TRAV23
CCA GAT GTG AGT GAA AAG AAA GAA G





827




TRAV24
GAC TTT AAA TGG GGA TGA AAA GAA GA





828




TRAV25
GGA GAA GTG AAG AAG CAG AAA AGA C





829




TRAV26.1
CCA ATG AAA TGG CCT CTC TGA TCA





830




TRAV26.2
GCA ATG TGA ACA ACA GAA TGG CCT





831




TRAV27
GGT GGA GAA GTG AAG AAG CTC AAG





832




TRAV29
GGA TAA AAA TGA AGA TGG AAG ATT CAC





833




TRAV30
CCT GAT GAT ATT ACT GAA GGG TGG A





834




TRAV34
GGT GGG GAA GAG AAA AGT CAT GAA





835




TRAV35
GGT GAA TTG ACC TCA AAT GGA AGA C





836




TRAV36
GCT AAC TTC AAG TGG AAT TGA AAA GA





837




TRAV38
GAA GCT TAT AAG CAA CAG AAT GCA AC





838




TRAV39
GGA GCA GTG AAG CAG GAG GGA C





839




TRAV40
GAG AGA CAA TGG AAA ACA GCA AAA AC





840




TRAV41
GCT GAG CTC AGG GAA GAA GAA GC





841






mixβ1
TRBV2
CTG AAA TAT TCG ATG ATC AAT TCT CAG





842




TRBV3-1
TCA TTA TAA ATG AAA CAG TTC CAA ATC G





843




TRBV4
AGT GTG CCA AGT CGC TTC TCA C





844




TRBV5-4,8
CAG AGG AAA CTT CCC TCC TAG ATT





845




TRBV5-1
GAG ACA CAG AGA AAC AAA GGA AAC TTC





846




TRBV6-1
GGT ACC ACT GAC AAA GGA GAA GTC C





847




TRBV6-2,3
GAG GGT ACA ACT GCC AAA GGA





848




TRBV6-4
GGC AAA GGA GAA GTC CCT GAT GGT T





849




TRBV6-5,6
AAG GAG AAG TCC CCA ATG GCT ACA A





850




TRBV6-8
CTG ACA AAG AAG TCC CCA ATG GCT AC





851




TRBV6-9
CAC TGA CAA AGG AGA AGT CCC CGA T





852




TRBV7-2
AGA CAA ATC AGG GCT GCC CAG TGA





853




TRBV7-3
GAC TCA GGG CTG CCC AAC GAT





854




TRBV7-8
CCA GAA TGA AGC TCA ACT AGA CAA





855




TRBV7-4,6
GGT TCT CTG CAG AGA GGC CTG AG





856




TRBV7-7
GGC TGC CCA GTG ATC GGT TCT C





857




TRBV7-9
GAC TTA CTT CCA GAA TGA AGC TCA ACT





858




TRBV9
GAG CAA AAG GAA ACA TTC TTG AAC GAT T





859




TRBV10-1,3
GGC TGA TCC ATT ACT CAT ATG GTG TT





860




TRBV10-2
GAT AAA GGA GAA GTC CCC GAT GGC T





861




TRBV11
GAT TCA CAG TTG CCT AAG GAT CGA T





862




TRBV12-3,4
GAT TCA GGG ATG CCC GAG GAT CG





863




TRBV12-5
GAT TCG GGG ATG CCG AAG GAT CG





864




TRBV13
GCA GAG CGA TAA AGG AAG CAT CCC T





865




TRBV14
TCC GGT ATG CCC AAC AAT CGA TTC T





866




TRBV15
GAT TTT AAC AAT GAA GCA GAC ACC CCT





867




TRBV16
GAT GAA ACA GGT ATG CCC AAG GAA AG





868




TRBV18
TAT CAT AGA TGA GTC AGG AAT GCC AAA G





869




TRBV19
GAC TTT CAG AAA GGA GAT ATA GCT GAA





870




TRBV20-1
CAA GGC CAC ATA CGA GCA AGG CGT C





871




TRBV24-1
CAA AGA TAT AAA CAA AGG AGA GAT CTC T





872




TRBV25-1
AGA GAA GGG AGA TCT TTC CTC TGA GT





873




TRBV27-1
GAC TGA TAA GGG AGA TGT TCC TGA AG





874




TRBV28
GGC TGA TCT ATT TCT CAT ATG ATG TTA A





875




TRBV29
GCC ACA TAT GAG AGT GGA TTT GTC ATT





876




TRBV30
GGT GCC CCA GAA TCT CTC AGC CT





877




pCα1
CGG TGA ATA GGC AGA CAG ACT TGT





878




pCβ1
ACC AGT GTG GCC TTT TGG GTG TG





879





2nd round of
mixα2
2TRAV1
CCA GGG TTT TCC CAG TCA CGA CAG


nested PCR


GTC GTT TTT CTT CAT TCC TTA GTC





880




2TRAV2
CCA GGG TTT TCC CAG TCA CGA CAC





GAT ACA ACA TGA CCT ATG AAC GG





881




2TRAV3.1
CCA GGG TTT TCC CAG TCA CGA CCT





TTG AAG CTG AAT TTA ACA AGA GCC





882




2TRAV4.1
CCA GGG TTT TCC CAG TCA CGA CCT





CCC TGT TTA TCC CTG CCG AC





883




2TRAV5.1
CCA GGG TTT TCC CAG TCA CGA CAA





ACA AGA CCA AAG ACT CAC TGT TC





884




2TRAV6
CCA GGG TTT TCC CAG TCA CGA CAA





GAC TGA AGG TCA CCT TTG ATA CC





885




2TRAV7
CCA GGG TTT TCC CAG TCACGA CAC





TAA ATG CTA CAT TAC TGA AGA ATG G





886




2TRAV8
CCA GGG TTT TCC CAG TCACGA CGC





ATC AAC GGT TTT GAG GCT GAA TTT





AA 887




2TRAV9
CCA GGG TTT TCC CAG TCA CGA CGA





AAC CAC TTC TTT CCA CTT GGA GAA





888




2TRAV10
CCA GGG TTT TCC CAG TCA CGA CTA





CAG CAA CTC TGG ATG CAG ACA C 889




2TRAV12
CCA GGG TTT TCC CAG TCA CGA CGA





AGA TGG AAG GTT TAC AGC ACA 890




2TRAV13.1
CCA GGG TTT TCC CAG TCA CGA CGA





CAT TCG TTC AAA TGT GGG CGA A





891




2TRAV13.2
CCA GGG TTT TCC CAG TCA CGA CGG





CAA GGC CAA AGA GTC ACC GT





892




2TRAV14
CCA GGG TTT TCC CAG TCA CGA CTC





CAG AAG GCA AGA AAA TCC GCC A





893




2TRAV16
CCA GGG TTT TCC CAG TCA CGA CGC





TGA CCT TAA CAA AGG CGA GAC A





894




2TRAV17
CCA GGG TTT TCC CAG TCA CGA CTT





AAG AGT CAC GCT TGA CAC TTC CA





895




2TRAV18
CCA GGG TTT TCC CAG TCA CGA CGC





AGA GGT TTT CAG GCC AGT CCT





896




2TRAV19
CCA GGG TTT TCC CAG TCA CGA CTC





CAC CAG TTC CTT CAA CTT CAC C





897




2TRAV20
CCA GGG TTT TCC CAG TCA CGA CGC





CAC ATT AAC AAA GAA GGA AAG CT





898




2TRAV21
CCA GGG TTT TCC CAG TCA CGA CGC





CTC GCT GGA TAA ATC ATC AGG A





899




2TRAV22
CCA GGG TTT TCC CAG TCA CGA CAC





GAC TGT CGC TAC GGA ACG CTA





900




2TRAV23
CCA GGG TTT TCC CAG TCA CGA CCA





CAA TCT CCT TCA ATA AAA GTG CCA





901




2TRAV24
CCA GGG TTT TCC CAG TCA CGA CAC





GAA TAA GTG CCA CTC TTA ATA CCA





902




2TRAV25
CCA GGG TTT TCC CAG TCA CGA CGT





TTG GAG AAG CAA AAA AGA ACA GCT





903




2TRAV26.1
CCA GGG TTT TCC CAG TCA CGA CCA





GAA GAC AGA AAG TCC AGC ACC T





904




2TRAV26.2
CCA GGG TTT TCC CAG TCA CGA CAT





CGC TGA AGA CAG AAA GTC CAG T





905




2TRAV27
CCA GGG TTT TCC CAG TCA CGA CAC





TAA CCT TTC AGT TTG GTG ATG CAA





906




2TRAV29
CCA GGG TTT TCC CAG TCA CGA CCT





TAA ACA AAA GTG CCA AGC ACC TC





907




2TRAV30
CCA GGG TTT TCC CAG TCA CGA CAA





TAT CTG CTT CAT TTA ATG AAA AAA





AGC





908




2TRAV34
CCA GGG TTT TCC CAG TCA CGA CCC





AAG TTG GAT GAG AAA AAG CAG CA





909




2TRAV35
CCA GGG TTT TCC CAG TCA CGA CCT





CAG TTT GGT ATA ACC AGA AAG GA





910




2TRAV36
CCA GGG TTT TCC CAG TCA CGA CGG





AAG ACT AAG TAG CAT ATT AGA TAA G





911




2TRAV38
CCA GGG TTT TCC CAG TCA CGA CCT





GTG AAC TTC CAG AAA GCA GCC A





912




2TRAV39
CCA GGG TTT TCC CAG TCA CGA CCC





TCA CTT GAT ACC AAA GCC CGT





913




2TRAV40
CCA GGG TTT TCC CAG TCA CGA CAG





GCG GAA ATA TTA AAG ACA AAA ACT C





914




2TRAV41
CCA GGG TTT TCC CAG TCA CGA CGA





TTA ATT GCC ACA ATA AAC ATA CAG G





915






mixβ2
2TRBV2
CCA GGG TTT TCC CAG TCA CGA CGC





CTG ATG GAT CAA ATT TCA CTC TG





916




2TRBV3-1
CCA GGG TTT TCC CAG TCA CGA CTC





TCA CCT AAA TCT CCA GAC AAA GCT





917




2TRBV4
CCA GGG TTT TCC CAG TCA CGA CCC





TGA ATG CCC CAA CAG CTC TC





918




2TRBV5-4,8
CCA GGG TTT TCC CAG TCA CGA CCT





CTG AGC TGA ATG TGA ACG CCT





919




2TRBV5-1
CCA GGG TTT TCC CAG TCA CGA CCG





ATT CTC AGG GCG CCA GTT CTC T





920




2TRBV6-1
CCA GGG TTT TCC CAG TCA CGA CTG





GCT ACA ATG TCT CCA GAT TAA ACA A





921




2TRBV6-2,3
CCA GGG TTT TCC CAG TCA CGA CCC





CTG ATG GCT ACA ATG TCT CCA GA





922




2TRBV6-4
CCA GGG TTT TCC CAG TCA CGA CGT





GTC TCC AGA GCA AAC ACA GAT GAT T





923




2TRBV6-5,6
CCA GGG TTT TCC CAG TCA CGA CGT





CTC CAG ATC AAC CAC AGA GGA T





924




2TRBV6-8
CCA GGG TTT TCC CAG TCA CGA CGT





CTC TAG ATT AAA CAC AGA GGA TTT C





925




2TRBV6-9
CCA GGG TTT TCC CAG TCA CGA CGG





CTA CAA TGT ATC CAG ATC AAA CA





926




2TRBV7-2
CCA GGG TTT TCC CAG TCA CGA CTC





GCT TCT CTG CAG AGA GGA CTG G





927




2TRBV7-3
CCA GGG TTT TCC CAG TCA CGA CCG





GTT CTT TGC AGT CAG GCC TGA





928




2TRBV7-8
CCA GGG TTT TCC CAG TCA CGA CCC





AGT GAT CGC TTC TTT GCA GAA A





929




2TRBV7-4,6
CCA GGG TTT TCC CAG TCA CGA CTC





TCC ACT CTG AAG ATC CAG CGC A





930




2TRBV7-7
CCA GGG TTT TCC CAG TCA CGA CGC





AGA GAG GCC TGA GGG ATC CAT





931




2TRBV7-9
CCA GGG TTT TCC CAG TCA CGA CCT





GCA GAG AGG CCT AAG GGA TCT





932




2TRBV9
CCA GGG TTT TCC CAG TCA CGA CCT





CCG CAC AAC AGT TCC CTG ACT T





933




2TRBV10-1.3
CCA GGG TTT TCC CAG TCA CGA CCA





GAT GGC TAT AGT GTC TCT AGA TCA





AA





934




2TRBV10-2
CCA GGG TTT TCC CAG TCA CGA CGT





TGT CTC CAG ATC CAA GAC AGA GAA





935




2TRBV11
CCA GGG TTT TCC CAG TCA CGA CGC





AGA GAG GCT CAA AGG AGT AGA CT





936




2TRBV12-3.4
CCA GGG TTT TCC CAG TCA CGA CGC





TAA GAT GCC TAA TGC ATC ATT CTC





937




2TRBV12-5
CCA GGG TTT TCC CAG TCA CGA CCT





CAG CAG AGA TGC CTG ATG CAA CT





938




2TRBV13
CCA GGG TTT TCC CAG TCA CGA CTC





TCA GCT CAA CAG TTC AGT GAC TA





939




2TRBV14
CCA GGG TTT TCC CAG TCA CGA CGC





TGA AAG GAC TGG AGG GAC GTA T





940




2TRBV15
CCA GGG TTT TCC CAG TCA CGA CGA





TAA CTT CCA ATC CAG GAG GCC G





941




2TRBV16
CCA GGG TTT TCC CAG TCA CGA CGC





TAA GTG CCT CCC AAA TTC ACC C





942




2TRBV18
CCA GGG TTT TCC CAG TCA CGA CGG





AAC GAT TTT CTG CTG AAT TTC CCA





943




2TRBV19
CCA GGG TTT TCC CAG TCA CGA CGG





TAC AGC GTC TCT CGG GAG AAG A





944




2TRBV20-1
CCA GGG TTT TCC CAG TCA CGA CGG





ACA AGT TTC TCA TCA ACC ATG CAA





945




2TRBV24-1
CCA GGG TTT TCC CAG TCA CGA CTG





GAT ACA GTG TCT CTC GAC AGG C





946




2TRBV25-1
CCA GGG TTT TCC CAG TCA CGA CCA





ACA GTC TCC AGA ATA AGG ACG GA





947




2TRBV27-1
CCA GGG TTT TCC CAG TCA CGA CTA





CAA AGT CTC TCG AAA AGA GAA GAG





GA





948




2TRBV28
CCA GGG TTT TCC CAG TCA CGA CGG





GGT ACA GTG TCT CTA GAG AGA





949




2TRBV29
CCA GGG TTT TCC CAG TCA CGA CGT





TTC CCA TCA GCC GCC CAA ACC TA





950




2TRBV30
CCA GGG TTT TCC CAG TCA CGA CCA





GAC CCC AGG ACC GGC AGT TCA T





951




pCα2
CAG ACA GAC TTG TCA CTG GAT TTA G





952




pCβ2
CTT TTG GGT GTG GGA GAT CTC TG





953





3rd round of

pF3
CCT ACA CGA CGC TCT TCC GAT CTN


nested PCR


NGC AGA GAT AAG CCC AGG GTT TTC





CCA GTC ACG AC





(SEQ ID NO: 954)




pCα3
CTG CTG AAC CGC TCT TCC GAT CTN





NGT TCA GTC ACT GGA TTT AGA GTC





TCT CAG (SEQ ID NO: 955)




pCβ3
CTG CTG AAC CGC TCT TCC GAT CTN





NGT TCA GAG ATC TCT GCT TCT GAT





GGC TC (SEQ ID NO: 956)









Once TCRα/β chain variable regions were determined, primers specific for leader sequence and constant region were used to amplify full-length TCRα/β chains. The PCR to amplify the intact TCRα- or β-chain and the following overlap PCR were performed with KOD Hot Start DNA polymerase (Millipore Sigma) according to the manufacturer's instructions. One μl of cDNA was added as a template in a 25 μl reaction together with the primers: pLα-P2A (based on TCR genotype), pRCα for amplifying full-length a chain or pLβ (based on TCR genotype), and pRCβ-P2A for amplifying full-length β chain. The PCR was performed for 20 cycles (94° C. for 30s, 64° C. for 1 min, 70° C. for 1 min). 2 μl of TCRα and 2 μl of TCRβ chain PCR products were mixed in 25 μl of overlap PCR reaction. The primers pLβ and pRCα were also added. The overlap PCR was performed for 30 cycles (94° C. for 30 s, 60° C. for 1 min, and 70° C. for 90s). The overlap PCR products (˜1.5 kb) were gel-purified with DNA Recovery Kit (Zymo Research) and cloned into pMSGV vector with NEBuilder® HiFi DNA Assembly Master Mix (New England Biolabs).


(17) Retroviral Transduction of TCR in Naïve T Cells

The pMSGV DNA encoding full-length TCR was transfected into Phoenix-Ampho cells (ATCC) to produce retroviruses. Briefly, 6-well tissue culture plate was coated with Poly-L-lysine (50 μg/ml in PBS) for 10 min at 37° C. and rinsed with PBS once. Phoenix-Ampho cells were seeded onto the coated plate at 1.5×106 per well and cultured in DMEM with 10% (v/v) fetal bovine serum containing 1% penicillin-streptomycin at 37° C., 5% CO2. Four h later, 4 μg of pMSGV plasmids was transfected into each well with 10 μl of Lipofectamine 2000. The culture medium was changed 6 h after the transfection and the retroviral supernatant was collected at 48 or 60 h after the transfection.


For preparation of activated naïve T cells, 24-well tissue culture plates were coated with 1 ml of PBS containing 0.5 μg/ml OKT3 per well and incubated at 4° C. overnight. After the plate was rinsed with PBS once, PBMCs from normal donors or bead-purified T cells were added in (1.0×106/well) and cultured in TCM containing 300 IU/ml IL-2 at 37° C., 5% CO2 for 48 h. The OKT3-stimulated T cells were ready for transduction after washed twice with TCM. 230. For retroviral transduction of activated T cells, 24-well non-tissue culture plates (Corning) were coated with 0.5 ml of human recombinant RetroNectin per well (Takara, 10m/ml in PBS) 1 day prior to the transduction and incubated at 4° C. overnight. After RetroNectin was removed, the plate was blocked with 2% (w/v) BSA in PBS at room temperature for 30 min. The plate was then washed twice with PBS containing 2.5% (v/v) HEPES. Retroviral supernatant of Phoenix-Ampho cells was added to the RetroNectin-coated plates (2.5 ml per well). The plate was centrifuged at 2000×g, 32° C. for 2 h. After the viral supernatant was discarded, OKT3-stimulated T cells were added to the plate (5×105/well) and cultured in TCM containing 150 IU/ml IL-2 at 37° C., 5% CO2 overnight. A second round of transduction was performed with fresh retroviral supernatant and RetroNectin-coated plates, as described above. The transduced T cells were ready to assay with tumor cells or 293APCs 72 h after transduction.


(18) Statistics

Comparisons between groups were calculated by Student's t-test using GraphPad Prism (GraphPad Software). P<0.05 was considered as statistically significant.


F. REFERENCES



  • Balachandran, V. P., Luksza, M., Zhao, J. N., Makarov, V., Moral, J. A., Remark, R., Herbst, B., Askan, G., Bhanot, U., Senbabaoglu, Y., et al. (2017). Identification of unique neoantigen qualities in long-term survivors of pancreatic cancer. Nature 551, 512-516.

  • Birnbaum, M. E., Mendoza, J. L., Sethi, D. K., Dong, S., Glanville, J., Dobbins, J., Ozkan, E., Davis, M. M., Wucherpfennig, K. W., and Garcia, K. C. (2014). Deconstructing the peptide-MHC specificity of T cell recognition. Cell 157, 1073-1087.

  • Bolotin, D. A., Poslaysky, S., Davydov, A. N., Frenkel, F. E., Fanchi, L., Zolotareva, O. I., Hemmers, S., Putintseva, E. V., Obraztsova, A. S., Shugay, M., et al. (2017). Antigen receptor repertoire profiling from RNA-seq data. Nature biotechnology 35, 908-911.

  • Braumuller, H. et al. T-helper-1-cell cytokines drive cancer into senescence. Nature 494, 361-365, doi:10.1038/nature11824 (2013).

  • Carpenter, A. C., Wohlfert, E., Chopp, L. B., Vacchio, M. S., Nie, J., Zhao, Y., Shetty, J., Xiao, Q., Deng, C., Tran, B., et al. (2017). Control of Regulatory T Cell Differentiation by the Transcription Factors Thpok and LRF. Journal of immunology 199, 1716-1728.

  • Carreno, B. M., Magrini, V., Becker-Hapak, M., Kaabinejadian, S., Hundal, J., Petti, A. A., Ly, A., Lie, W. R., Hildebrand, W. H., Mardis, E. R., et al. (2015). Cancer immunotherapy. A dendritic cell vaccine increases the breadth and diversity of melanoma neoantigen-specific T cells. Science 348, 803-808.

  • Chen, L., and Han, X. (2015). Anti-PD-1/PD-L1 therapy of human cancer: past, present, and future. The Journal of clinical investigation 125, 3384-3391.

  • Chen, Y. T., Scanlan, M. J., Sahin, U., Tureci, O., Gure, A. O., Tsang, S., Williamson, B., Stockert, E., Pfreundschuh, M., and Old, L. J. (1997). A testicular antigen aberrantly expressed in human cancers detected by autologous antibody screening. ProcNatlAcadSciUSA 94, 1914-1918.

  • Chowell, D., Morris, L. G. T., Grigg, C. M., Weber, J. K., Samstein, R. M., Makarov, V., Kuo, F., Kendall, S. M., Requena, D., Riaz, N., et al. (2018). Patient HLA class I genotype influences cancer response to checkpoint blockade immunotherapy. Science 359, 582-587.

  • Cibulskis, K., Lawrence, M. S., Carter, S. L., Sivachenko, A., Jaffe, D., Sougnez, C., Gabriel, S., Meyerson, M., Lander, E. S., and Getz, G. (2013). Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nature biotechnology 31, 213-219.

  • Cole, D. K. et al. Hotspot autoimmune T cell receptor binding underlies pathogen and insulin peptide cross-reactivity. J. Clin. Invest. 126, 2191-2204, doi:10.1172/JCI85679 (2016).

  • Cole, D. K., Bulek, A. M., Dolton, G., Schauenberg, A. J., Szomolay, B., Rittase, W., Trimby, A., Jothikumar, P., Fuller, A., Skowera, A., et al. (2016). Hotspot autoimmune T cell receptor binding underlies pathogen and insulin peptide cross-reactivity. The Journal of clinical investigation 126, 3626.

  • De la Herran-Arita, A. K. et al. CD4+ T cell autoimmunity to hypocretin/orexin and cross-reactivity to a 2009 H1N1 influenza A epitope in narcolepsy. Sci Transl Med 5, 216ra176, doi:10.1126/scitranslmed.3007762 (2013).

  • Gubin, M. M., Zhang, X., Schuster, H., Caron, E., Ward, J. P., Noguchi, T., Ivanova, Y., Hundal, J., Arthur, C. D., Krebber, W. J., et al. (2014). Checkpoint blockade cancer immunotherapy targets tumour-specific mutant antigens. Nature 515, 577-581.

  • Han, A., Glanville, J., Hansmann, L., and Davis, M. M. (2014). Linking T-cell receptor sequence to functional phenotype at the single-cell level. Nature biotechnology 32, 684-692.

  • He, X., Park, K., and Kappes, D. J. (2010). The role of ThPOK in control of CD4/CD8 lineage commitment. Annual review of immunology 28, 295-320.

  • Hellmann, M. D., Callahan, M. K., Awad, M. M., Calvo, E., Ascierto, P. A., Atmaca, A., Rizvi, N. A., Hirsch, F. R., Selvaggi, G., Szustakowski, J. D., et al. (2018). Tumor Mutational Burden and Efficacy of Nivolumab Monotherapy and in Combination with Ipilimumab in Small-Cell Lung Cancer. Cancer cell 33, 853-861 e854.

  • Hung, K. et al. The Central Role of CD4(+) T Cells in the Antitumor Immune Response. J. Exp. Med. 188, 2357-2368 (1998).

  • Kang, T. W. et al. Senescence surveillance of pre-malignant hepatocytes limits liver cancer development. Nature 479, 547-551, doi:10.1038/nature10599 (2011).

  • Kastenmuller, W., Gasteiger, G., Gronau, J. H., Baier, R., Ljapoci, R., Busch, D. H., and Drexler, I. (2007). Cross-competition of CD8+ T cells shapes the immunodominance hierarchy during boost vaccination. The Journal of experimental medicine 204, 2187-2198.

  • Kreiter, S., Vormehr, M., van de Roemer, N., Diken, M., Lower, M., Diekmann, J., Boegel, S., Schrors, B., Vascotto, F., Castle, J. C., et al. (2015). Mutant MHC class II epitopes drive therapeutic immune responses to cancer. Nature 520, 692-696.

  • La Gruta, N. L., Gras, S., Daley, S. R., Thomas, P. G., and Rossjohn, J. (2018). Understanding the drivers of MHC restriction of T cell receptors. Nature reviews Immunology 18, 467-478.

  • Laumont, C. M., Vincent, K., Hesnard, L., Audemard, E., Bonneil, E., Laverdure, J. P., Gendron, P., Courcelles, M., Hardy, M. P., Cote, C., et al. (2018). Noncoding regions are the main source of targetable tumor-specific antigens. Science translational medicine 10.

  • Le, D. T., Durham, J. N., Smith, K. N., Wang, H., Bartlett, B. R., Aulakh, L. K., Lu, S., Kemberling, H., Wilt, C., Luber, B. S., et al. (2017). Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science 357, 409-413.

  • Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754-1760.

  • Linnemann, C., van Buuren, M. M., Bies, L., Verdegaal, E. M., Schotte, R., Calis, J. J., Behjati, S., Velds, A., Hilkmann, H., Atmioui, D. E., et al. (2015). High-throughput epitope discovery reveals frequent recognition of neo-antigens by CD4+ T cells in human melanoma. Nat Med 21, 81-85.

  • Lu, Y. C., Yao, X., Crystal, J. S., Li, Y. F., El-Gamil, M., Gross, C., Davis, L., Dudley, M. E., Yang, J. C., Samuels, Y., et al. (2014). Efficient identification of mutated cancer antigens recognized by T cells associated with durable tumor regressions. Clinical cancer research: an official journal of the American Association for Cancer Research 20, 3401-3410.

  • Luckey, M. A. et al. The transcription factor ThPOK suppresses Runx3 and imposes CD4(+) lineage fate by inducing the SOCS suppressors of cytokine signaling. Nature immunology 15, 638-645, doi:10.1038/ni.2917 (2014).

  • Luksza, M., Riaz, N., Makarov, V., Balachandran, V. P., Hellmann, M. D., Solovyov, A., Rizvi, N. A., Merghoub, T., Levine, A. J., Chan, T. A., et al. (2017). A neoantigen fitness model predicts tumour response to checkpoint blockade immunotherapy. Nature 551, 517-520.

  • Malandro, N., Budhu, S., Kuhn, N. F., Liu, C., Murphy, J. T., Cortez, C., Zhong, H., Yang, X., Rizzuto, G., Altan-Bonnet, G., et al. (2016). Clonal Abundance of Tumor-Specific CD4(+) T Cells Potentiates Efficacy and Alters Susceptibility to Exhaustion. Immunity 44, 179-193.

  • Manguso, R. T., Pope, H. W., Zimmer, M. D., Brown, F. D., Yates, K. B., Miller, B. C., Collins, N. B., Bi, K., LaFleur, M. W., Juneja, V. R., et al. (2017). In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target. Nature 547, 413-418.

  • Marty Pyke, R., Thompson, W. K., Salem, R. M., Font-Burgada, J., Zanetti, M., and Carter, H. (2018). Evolutionary Pressure against MHC Class II Binding Cancer Mutations. Cell 175, 1991.

  • Marty, R., Kaabinejadian, S., Rossell, D., Slifker, M. J., van de Haar, J., Engin, H. B., de Prisco, N., Ideker, T., Hildebrand, W. H., Font-Burgada, J., et al. (2017). MHC-I Genotype Restricts the Oncogenic Mutational Landscape. Cell 171, 1272-1283 e1215.

  • McGranahan, N., Rosenthal, R., Hiley, C. T., Rowan, A. J., Watkins, T. B. K., Wilson, G. A., Birkbak, N. J., Veeriah, S., Van Loo, P., Herrero, J., et al. (2017). Allele-Specific HLA Loss and Immune Escape in Lung Cancer Evolution. Cell 171, 1259-1271 e1211.

  • McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., et al. (2010). The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome research 20, 1297-1303.

  • Mucida, D. et al. Transcriptional reprogramming of mature CD4(+) helper T cells generates distinct MHC class II-restricted cytotoxic T lymphocytes. Nature immunology 14, 281-289, doi:10.1038/ni.2523 (2013).

  • Nelson, R. W., Beisang, D., Tubo, N.J., Dileepan, T., Wiesner, D. L., Nielsen, K., Wuthrich, M., Klein, B. S., Kotov, D. I., Spanier, J. A., et al. (2015). T cell receptor cross-reactivity between similar foreign and self peptides influences naïve cell population size and autoimmunity. Immunity 42, 95-107.

  • Pan, D., Kobayashi, A., Jiang, P., Ferrari de Andrade, L., Tay, R. E., Luoma, A. M., Tsoucas, D., Qiu, X., Lim, K., Rao, P., et al. (2018). A major chromatin regulator determines resistance of tumor cells to T cell-mediated killing. Science 359, 770-775.

  • Patel, S. J., Sanjana, N. E., Kishton, R. J., Eidizadeh, A., Vodnala, S. K., Cam, M., Gartner, J. J., Jia, L., Steinberg, S. M., Yamamoto, T. N., et al. (2017). Identification of essential genes for cancer immunotherapy. Nature 548, 537-542.

  • Porter, D. L., Levine, B. L., Kalos, M., Bagg, A., and June, C. H. (2011). Chimeric antigen receptor-modified T cells in chronic lymphoid leukemia. The New England journal of medicine 365, 725-733.

  • Rizvi, N. A., Hellmann, M. D., Snyder, A., Kvistborg, P., Makarov, V., Havel, J. J., Lee, W., Yuan, J., Wong, P., Ho, T. S., et al. (2015a). Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science 348, 124-128.

  • Rizvi, N. A., Hellmann, M. D., Snyder, A., Kvistborg, P., Makarov, V., Havel, J. J., Lee, W., Yuan, J., Wong, P., Ho, T. S., et al. (2015b). Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science 348, 124-128.

  • Robbins, P. F., Lu, Y. C., El-Gamil, M., Li, Y. F., Gross, C., Gartner, J., Lin, J. C., Teer, J. K., Cliften, P., Tycksen, E., et al. (2013). Mining exomic sequencing data to identify mutated antigens recognized by adoptively transferred tumor-reactive T cells. Nat Med 19, 747-752.

  • Robins, H. S., Campregher, P. V., Srivastava, S. K., Wacher, A., Turtle, C. J., Kahsai, O., Riddell, S. R., Warren, E. H., and Carlson, C. S. (2009). Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells. Blood 114, 4099-4107.

  • Rosenberg, S. A., and Restifo, N. P. (2015). Adoptive cell transfer as personalized immunotherapy for human cancer. Science 348, 62-68.

  • Rosenberg, S. A., Tran, E., and Robbins, P. F. (2017). T-Cell Transfer Therapy Targeting Mutant KRAS. The New England journal of medicine 376, ell.

  • Schumacher, T. N., and Schreiber, R. D. (2015). Neoantigens in cancer immunotherapy. Science 348, 69-74.

  • Schumacher, T. N., Scheper, W., and Kvistborg, P. (2018). Cancer Neoantigens. Annu Rev Immunol. Sercarz, E. E., Lehmann, P. V., Ametani, A., Benichou, G., Miller, A., and Moudgil, K. (1993). Dominance and crypticity of T cell antigenic determinants. Annual review of immunology 11, 729-766.



Sharma, P., and Allison, J. P. (2015). Immune checkpoint targeting in cancer therapy: toward combination strategies with curative potential. Cell 161, 205-214.

  • Shastri, N., Schwab, S., and Serwold, T. (2002). Producing nature's gene-chips: the generation of peptides for display by MHC class I molecules. Annu Rev Immunol 20, 463-493.
  • Shedlock, D. J. & Shen, H. Requirement for CD4 T cell help in generating functional CD8 T cell memory. Science 300, 337-339 (2003).
  • Starck, S. R. et al. Translation from the 5′ untranslated region shapes the integrated stress response. Science 351, aad3867, doi:10.1126/science.aad3867 (2016).
  • Stronen, E., Toebes, M., Kelderman, S., van Buuren, M. M., Yang, W., van Rooij, N., Donia, M., Boschen, M. L., Lund-Johansen, F., Olweus, J., et al. (2016). Targeting of cancer neoantigens with donor-derived T cell receptor repertoires. Science 352, 1337-1341.
  • Sun, J. C. & Bevan, M. J. Defective CD8 T cell memory following acute infection without CD4 T cell help. Science 300, 339-342 (2003).
  • Topalian, S. L., Drake, C. G., and Pardoll, D. M. (2015). Immune checkpoint blockade: a common denominator approach to cancer therapy. Cancer cell 27, 450-461.
  • Tran, E., Ahmadzadeh, M., Lu, Y. C., Gros, A., Turcotte, S., Robbins, P. F., Gartner, J. J., Zheng, Z., Li, Y. F., Ray, S., et al. (2015). Immunogenicity of somatic mutations in human gastrointestinal cancers. Science 350, 1387-1390.
  • Tran, E., Turcotte, S., Gros, A., Robbins, P. F., Lu, Y. C., Dudley, M. E., Wunderlich, J. R., Somerville, R. P., Hogan, K., Hinrichs, C. S., et al. (2014). Cancer immunotherapy based on mutation-specific CD4+ T cells in a patient with epithelial cancer. Science 344, 641-645.
  • Vacchio, M. S. et al. A ThPOK-LRF transcriptional node maintains the integrity and effector potential of post-thymic CD4+ T cells. Nature immunology 15, 947-956, doi:10.1038/ni.2960 (2014).
  • Wang, H. Y., Lee, D. A., Peng, G., Guo, Z., Li, Y., Kiniwa, Y., Shevach, E. M., and Wang, R. F. (2004). Tumor-specific human CD4+ regulatory T cells and their ligands: implications for immunotherapy. Immunity 20, 107-118.
  • Wang, H. Y., Peng, G., Guo, Z., Shevach, E. M., and Wang, R.-F. (2005). Recognition of a new ARTC1 peptide ligand uniquely expressed in tumor cells by antigen-specific CD4+ gegulatory T cells. J Immunol 174, 2661-2670.
  • Wang, H. Y., Zhou, J., Zhu, K., Riker, A. I., Marincola, F. M., and Wang, R. F. (2002). Identification of a mutated fibronectin as a tumor antigen recognized by CD4+ T cells: its role in extracellular matrix formation and tumor metastasis. The Journal of experimental medicine 195, 1397-1406.
  • Wang, R.-F. The role of MHC class II-restricted tumor antigens and CD4+ T cells in antitumor immunity. Trends in Immunology 22, 269-276 (2001).
  • Wang, R. F., and Wang, H. Y. (2017). Immune targets and neoantigens for cancer immunotherapy and precision medicine. Cell research 27, 11-37.
  • Wang, R.-F., Johnston, S. L., Zeng, G., Schwartzentruber, D. J., and Rosenberg, S. A. (1998). A breast and melanoma-shared tumor antigen: T cell responses to antigenic peptides translated from different open reading frames. J Immunol 161, 3596-3606.
  • Wang, R. F., Parkhurst, M. R., Kawakami, Y., Robbins, P. F., and Rosenberg, S. A. (1996). Utilization of an alternative open reading frame of a normal gene in generating a novel human cancer antigen. J Exp Med 183, 1131-1140.
  • Wang, R. F., Wang, X., and Rosenberg, S. A. (1999b). Identification of a novel major histocompatibility complex class II-restricted tumor antigen resulting from a chromosomal rearrangement recognized by CD4(+) T cells. J Exp Med 189, 1659-1668.
  • Wang, R. F., Wang, X., Atwood, A. C., Topalian, S. L., and Rosenberg, S. A. (1999). Cloning genes encoding MHC class II-restricted antigens: mutated CDCl27 as a tumor antigen. Science 284, 1351-1354.
  • Zacharakis, N., Chinnasamy, H., Black, M., Xu, H., Lu, Y. C., Zheng, Z., Pasetto, A., Langhan, M., Shelton, T., Prickett, T., et al. (2018). Immune recognition of somatic mutations leading to complete durable regression in metastatic breast cancer. Nature medicine 24, 724-730.

Claims
  • 1. A method of identifying neoantigens from a cancer in a human subject comprising: a. performing whole exome sequencing on a nucleic acid sample from a cancer cell;b. mapping the sequence to a reference genome sequence;c. filtering sequence variations to remove common variations in tumors and normal cells;d. creating one or more single mutation peptide constructs comprising at least one uncommon amino acid variation and one or more flanking amino acids;e. synthesizing one or more minigenes encoding one or more single mutation peptide constructs of step d;f. transfecting one or more minigenes into one or more cells or cell lines expressing MHC class I or MHC class II molecules;g. co-culturing one or more T cells or T cells lines with the transfected cells of step f);h. measuring T cell activity of the co-cultured T cells or T cell lines and identifying mutation peptides encoded by minigenes in transfected cells that induced T cell activity; andi. assaying the identified mutation peptide from each minigene in transfected cells that induced T cell activity, for the ability to induce T cell activity alone.
  • 2-12. (canceled)
  • 13. A method of stimulating an immunological response against a cancer or treating, inhibiting, and/or preventing a cancer, the method comprising administering to a subject a composition comprising a pharmaceutically acceptable carrier and a therapeutically effective amount of one or more neoantigen peptides identified by the method of claim 1.
  • 14. The method of claim 13, wherein the neoantigen is an immunodominant neoantigen.
  • 15. A composition for treating a cancer, wherein the composition comprises a pharmaceutically acceptable carrier and a therapeutically effective amount of one or more neoantigen peptides
  • 16. (canceled)
  • 17. A composition for treating a cancer comprising a therapeutically effective amount of one or more chimeric antigen receptor (CAR) T cells T cell receptor (TCR) T-cells, and/or tumor infiltrating lymphocytes (TILs); wherein the CAR T cell, TCR T cell, and/or TIL has been engineered to express a receptor for a cancer neoantigen.
  • 18. The composition of any one of claim 15 or 17, wherein the neoantigen is selected from the group of neoantigens consisting of the amino acids as set forth in SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163.
  • 19. A method of treating a cancer in a subject comprising administering to the subject the composition of any one of claim 15, 17 or 18.
  • 20-44. (canceled)
  • 45. An isolated nucleic acid encoding a neoantigen of claim 18.
  • 46. The method of claim 13 wherein the cancer is selected from the group consisting of B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers, small cell lung cancer, non-small cell lung cancer, neuroblastoma, glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancers, melanoma, basal cell carcinoma, squamous cell carcinoma, liver cancer, squamous cell carcinomas of the mouth, throat, larynx, and lung, cervical cancer, cervical carcinoma, breast cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, AIDS-related lymphomas, or AIDS-related sarcomas.
  • 47. The method of claim 13, wherein the neoantigen is selected from the group of neoantigens consisting of the amino acids as set forth in SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163.
  • 48. A composition comprising a therapeutically effective amount of one or more TCR-T cells or CAR-T cells, wherein the cells have been engineered to express one or more nucleic acid sequences encoding a shRNA targeting a gene encoding a negative signaling molecule or an epigenetic protein, whereby knock down or knockout of the negative signaling molecule or epigenetic protein enhances T cell persistence and antitumor activity of the TCR-T or CAR-T cells in vivo.
  • 49. The composition of claim 48, wherein the TCR-T and CAR-T cells are further engineered to express a T cell receptor or chimeric antigen receptor specific for cancer neoantigens.
  • 50. The composition of claim 49, wherein the cancer neoantigen is a cancer testis neoantigen.
  • 51. The composition of claim 48, wherein the negative signaling molecule is selected from a master regulator of T cell development, an immune checkpoint protein and an immune suppressor protein.
  • 52. The composition of claim 48, wherein the epigenetic protein is a lysine demethylase.
  • 53. A composition comprising a therapeutically effective amount of one or more TCR-T cells or CAR-T cells and a LSD1 inhibitor that enhances the anti-tumor activity of the T cells or CAR-T cells in vivo.
  • 54. The composition of claim 52, wherein the LSD1 inhibitor reprograms stem cell-like T cells to prolong survival of the T cells.
Parent Case Info

This application claims the benefit of U.S. Provisional Application No. 62/662,495, filed on Apr. 25, 2018, which is incorporated herein by reference in its entirety.

PCT Information
Filing Document Filing Date Country Kind
PCT/US19/29107 4/25/2019 WO
Provisional Applications (1)
Number Date Country
62662495 Apr 2018 US