The present invention relates to methods for stimulating the immune response to a tumor by implementing a two-phase vaccination strategy in which a first vaccination comprises epitopes from tumor associated antigens or epitopes which embody amino acid mutations commonly associated with the cancer that is administered before tissue diagnosis, and a second vaccination which comprises personal neoepitopes, the design of which is unique to that subject and is identified based on comparative sequencing of normal tissue and tumor tissue obtained by biopsy.
Cancer arises as the combination of an array of genomic, proteomic and immunologic events which make every cancer a unique challenge. Mutational events at the genomic level are manifest in changed protein functions that give rise to the classic features of cancer: dysregulation comprising cell proliferation, evasion of growth suppressors, invasiveness and metastasis, ongoing cell replication and resistance to apoptosis, increased vascularity and dysregulation of cell energetics [1]. However, tumors also exhibit a further array of consequences of mutations in their evasion of the immune response which could otherwise limit progression. The present invention addresses how to marshal the immune response to directly limit invasiveness by limiting the actions of some tumor associated proteins, to limit progression from initial diagnosis until more definite treatments weeks-months later in current standard paradigms, and to reduce the immune evasion but stimulating tumor specific cytotoxic T cells that can bring about the elimination of cancer cells. A goal of the present invention is to combine early stimulation of the immune response to cancer with the stimulation of a tumor specific response. In the first instance early stimulation is achieved by a combination of epitopes in tumor associated antigens and/or in proteins with mutations which are very commonly found in association with a particular clinical presentation. In the second instance a more tumor specific response is achieved by recognizing that the mutational landscape in each tumor is unique and personal to that patient. This is the result of the overlay of the mutations which have occurred and continue to occur as the tumor progresses, and the immunosurveillance which is determined by the MHC binding characteristics of the peptides comprising the mutations, the MHC alleles carried by the patient, and the particular repertoire of T cells that the individual patient carries. This results in a unique array of neoepitopes which characterize the immunomic profile of a given tumor and which serve as tumor specific targets.
As tumorigenesis progresses within a given patient the array of mutations diversifies to adopt epitope patterns unique to that individual. Targeting the immune response to such unique neoepitopes requires identification of the mutations that differentiate tumor cells form normal tissue and selection or design of peptides, or their encoding nucleic acids, which embody the tumor specific mutation and can present it to stimulate the T cells of that individual.
The present invention provides a two-phase immunotherapeutic intervention strategy for treatment of a subject with a clinical diagnosis of cancer. In one embodiment the strategy comprises a first intervention, comprising administration of a first round of vaccination in which the antigens administered to the subject are peptides, or the nucleic acids which encode them, derived from one or both of two groups of proteins. The first group of proteins are tumor associated antigens which are not mutated, but which are known to be commonly upregulated in the type of cancer with which the subject is diagnosed, or to exert an influence favoring the progression of that type of tumor. The second group of proteins from which peptides administered in the first round of vaccination may be drawn are the most commonly mutated proteins associated with the type of cancer with which the subject is diagnosed.
Peptides selected from the second group encompass the common mutations found in the proteins. Following the first round of vaccination, the second intervention comprised in the invention is a second round of vaccination. The second round of vaccination is a personalized vaccination comprising a set of peptides, or the nucleic acids which encode them, that embody the amino acids shown to be mutated in that tumor by means of comparative sequencing of the proteins in the tumor and in normal tissue. The amino acids shown to be mutated are placed within the T cell exposed positions. The amino acids in the MHC groove exposed position, which are those positions not exposed to the T cells, are then substituted with alternative amino acids to achieve a desired binding affinity to the subject's MHC alleles. Thus, an array of alternative peptides is selected which is administered as a personalized vaccination, either as peptides or the nucleic acids that encode them.
Peptides selected from the proteins known to be commonly mutated are selected for the first vaccination so as to place the commonly mutated amino acids in one of the T cell exposed positions when the protein is bound in a MHC molecular groove. The amino acids in the MHC groove exposed position of these peptides, which are those positions not exposed to the T cells, may then substituted with alternative amino acids to achieve a desired binding affinity to the subject's MHC alleles.
In a further embodiment, the alternative peptides included in the personalized vaccination are selected from those mutated proteins which are shown to be expressed in the tumor, as evidenced by the ratio of copy numbers of the gene product detected in RNA relative to the copies of that gene detected in DNA. In preferred embodiments the gene product is detected in at least 10% of the RNA transcribed from that gene locus. It is also preferred that the alternative peptides are selected from mutants that are detected in at least 10% of the tumor DNA.
In a preferred embodiment, the first round of vaccination is administered to the subject prior to any surgical intervention. In a further preferred embodiment, the HLA alleles of the subject are determined as an aid to the selection of peptides included in the first round of vaccination. In the second round of vaccination a preferred embodiment includes five or more different peptides based on the demonstrated mutations found in the tumor proteins.
The present invention includes the design of immunogens which target either CD8+ T cells responsive to peptides bound in MHC I alleles or CD4+ T cells responsive to peptides bound in MHC II alleles. In the first of these instances, for targeting CD8+ T cells, the preferred embodiment is that the peptide is 8-10 amino acids in length, whereas in the second instance peptides of 13-20 amino acids are the preferred embodiment to target CD4+ T cells.
In yet other embodiments peptides of 8-35 amino acids may be deployed to target both CD8+ and CD4+ T cells following endopeptidase processing by antigen presenting cells.
The peptides, or their encoding nucleic acids, are designed and selected to be competitive with other peptides from the same protein and to exceed the binding affinity of 85% of the peptides in the originating proteins. In yet other embodiments the peptides are selected to exceed the binding affinity of 95% of peptides in the source protein. In preferred embodiments the peptides selected have a binding affinity to the MHC alleles of the subject of less than 200 nanomolar, in yet further embodiments a higher binding affinity of less than 50 nanomolar is provided.
While the present invention provides a method to design the second round of vaccination as a personalized vaccine by inclusion of alternative amino acids to optimize the binding affinity to MHC, in some embodiments it may also comprise natural peptides that occur in the tumor proteins where the binding affinity of the peptides is appropriate.
In some instances, each round of vaccination may comprise only one application of each group of peptides, In preferred embodiments each round of vaccination comprises multiple applications of the peptides or their encoding nucleic acids.
The present invention pertains to a strategy for immunotherapeutic intervention in a tumor drawn from the group of solid tumors including, but not limited to, those affecting tissues of the nervous system, lung, breast, pancreas, liver, skin, prostate, genito-urinary tract, hematopoietic system, gastro-intestinal, endocrine system, musculoskeletal or any other tissue affected by a tumor. In some preferred embodiments the methods provided by the invention are applied to the treatment of brain tumors, including but not limited to glioma, glioblastoma, neuroblastoma, meningioma, schwannoma, and metastases to the brain from other sites in the body. The first round of vaccination is selected according to the clinical diagnosis. In the case of a brain tumor, the peptides included the first round of vaccination may be drawn from the group comprising vascular endothelial growth factor receptors, vimentin, Wilms tumor protein or any other tumor associated protein common to that type of brain tumor. These examples are illustrative and not considered limiting. The invention provides sequences of peptides from vascular endothelial growth factor receptors, vimentin and Wilms tumor protein which are examples of suitable vaccinal peptides from these proteins.
The invention further provides that the first round of vaccination may comprise peptides, or their encoding nucleic acids, that comprise the most common mutations characteristic of that clinically diagnosed tumor. To that end the present invention provides sequences for peptides that embody the T cell exposed motifs of the most common mutations of epidermal growth factor receptor (EGFR), a common deleterious splice variant thereof, EGFRvIII, isocitrate dehydrogenase (IDH) mutant R132H, and histone variants H3.3 associated with gene instability mutant K27M. In yet other embodiments, peptide sequences are provided that encompass the most common mutation of the serine/threonine-protein kinase B-raf (BRAF) V600E. For each the invention identifies the T cell exposed motifs of interest and provides examples of peptides which are designed to bind optimally to exemplar MHC alleles. As many different alternative peptides can be designed to achieve the desired MHC binding affinity while maintaining the T cell exposed motif constant, the peptides provided are considered as non-limiting illustrations of the methods.
In some embodiments, the tumor associated protein is a viral protein, where such virus is an oncogenic virus such as a papillomavirus, retrovirus, polyomavirus or herpesvirus. In other instances the viral protein maybe drawn from the group of viruses which may be upregulated in the tumor cells and thus provide a target to direct a T cell response to that tumor cell. In particular embodiments such viruses include, but are not limited to, human papillomaviruses and human cytomegalovirus.
In addition to providing a method that is a strategy for immunotherapeutic intervention, the present invention also provides particular synthetic peptides with diverse HLA binding that serve as immunogens to direct a T cell response to VEGFR1 and VEGFR2, vimentin and WT1. Of particular note here is that these peptides are selected based on their high binding affinity to a multiplicity of MHC alleles, facilitating their use in subjects which have not been HLA typed. Such peptides are provided to bind to MHC I and MHC II alleles and with a binding affinity of exceeding 50 nanomolar and exceeding 200 nanomolar. In particular embodiments the peptides derived from VEGFR1 and VEGFR2, vimentin and WT1 are shorter, being of 8-10 amino acids, whereas in yet other embodiments they are of 13-20 amino acids and in yet other instances are up to 30 amino acids in length. As is well known to those skilled in the art, the peptides may also be delivered by means of the nucleic acids which encode them. Many routes of administration are suitable. In some embodiments the peptides are administered parenterally, in some instances by intradermal or subcutaneous delivery. In yet other embodiments administration of one or more doses of vaccine is orally. The vaccine composition may comprise an adjuvant. Various delivery vehicles may be utilized such that in one embodiment the peptides or nucleic acids encoding them may be delivered by a lipid drug delivery system drawn from the group comprising, but not limited to, lipid nanoparticles, emulsions, self-emulsifying drug delivery systems, nanocapsules and liposomes or by a viral vector, which may comprise, but is not limited to, elements of a retroviral vector, a poxvirus, an adenovirus, or other virus vector delivery system. In yet other embodiments in lieu of administering the vaccine directly to the subject, antigen presenting cells are harvested for the subject and these are contacted with the vaccinal peptides of interest in vitro, prior to returning these to the subject, or exposing the antigen presenting cells to T cells also drawn from the subject, before autologous reinfusion.
It is anticipated that in some preferred embodiments the immunotherapeutic strategy described in the present invention will be accompanied by other immunotherapeutic interventions such as but not limited to the treatment with check point inhibitors.
A further embodiment of the present invention addresses the monitoring of the immune response the subject mounts to any one of the epitopes embodied in the first or second round of vaccination. Thus, a further application of the present invention is in companion diagnostics in which the peptides identified herein are embodied into a diagnostic test.
In some further preferred embodiments, the present invention provides methods of treatment of a subject clinically presenting with a tumor comprising: administering a first round of vaccination, wherein the vaccination comprises an array of one or more peptides, or nucleic acids encoding the peptides, selected from the group consisting of non-mutated peptides derived from tumor associated antigens appropriate to tumors of the type diagnosed and peptides commonly found to be mutated in tumors of the type diagnosed; and administering a second round of personalized vaccination, wherein the second round of vaccination comprises the following steps: obtaining a biopsy of the tumor and of normal tissue from the subject, and obtaining sequences for DNA, RNA and proteins in the biopsy; identifying proteins from the biopsy containing mutated amino acids and the peptide comprising each of the mutated amino acids; determining T cell exposed motifs which comprise mutated amino acids in each of the proteins; determining the predicted binding affinity to the subject's MHC alleles of peptides which comprise each of the T cell exposed motifs, or a subset thereof; generating an array of alternative peptides not present in the tumor, wherein each peptide in the array comprises the amino acids of one of the T cell exposed motifs, and in which the amino acids not within the T cell exposed motif are substituted to change the predicted MHC binding affinity; selecting a group of one or more selected peptides from the array of alternative peptides which have a desired predicted binding affinity for one or more of the subject's MHC alleles; and synthesizing the group of one or more selected peptides, or nucleic acids encoding the selected peptides and administering the selected peptides, or their encoding nucleic acids, as the second round of vaccination.
In some preferred embodiments, the methods further comprise selecting the peptides commonly found to be mutated in tumors so as to position the mutated amino acid in a T cell exposed position and substituting one or more of the amino acids not in a T cell exposed position to provide a desired binding affinity to one or more of the MHC alleles of the subject.
In some preferred embodiments, the methods further comprise: a) determining the fraction of the DNA in the tumor biopsy which encodes each of the mutated amino acids and the fraction of RNA transcribed from that gene locus and expressing the mutated amino acids; and b) selecting a sub array of the alternative peptides from the proteins in the biopsy which are present in at least 10% of the DNA in the biopsy and expressed in at least 10% of the RNA transcribed from that gene locus in the biopsy.
In some preferred embodiments, the method further comprise determining the MHC alleles of the subject prior to the first round of vaccination.
In some preferred embodiments, the first round of vaccination is administered prior to surgical intervention.
In some preferred embodiments, the MHC alleles are MHC type I alleles and the T cell response is a CD8+ response. In some preferred embodiments, the MHC alleles are MHC type II alleles and the T cell response is a CD4+ response. In some preferred embodiments, the MHC alleles are a combination of MHC type I alleles and MHC type II alleles. In some preferred embodiments, the MHC I allele in the first round of vaccination is not A0201 or A2402.
In some preferred embodiments, the peptides, or nucleic acids encoding them, administered in the first round or the second round are 8 to 10 amino acids long. In some preferred embodiments, the peptides, or nucleic acids encoding them, administered in the first round or the second round are 13-20 amino acids long. In some preferred embodiments, the peptides, or nucleic acids encoding them, administered in the first round or the second round are 8-35 amino acids long.
In some preferred embodiments, the group of one or more selected peptides, or the nucleic acids encoding them, administered in the second round of vaccination comprises at least 5 unique peptides not present in the proteins sequenced in the tumor.
In some preferred embodiments, the desired predicted binding affinity to the one or more of the subject's MHC alleles of selected peptides in the second round of vaccination exceeds 85% of the binding affinity of all peptides in the tumor protein that comprises the mutated amino acid. In some preferred embodiments, the desired predicted binding affinity to the one or more of the subject's MHC alleles of selected peptides in the second round exceeds 95% of the binding affinity of all peptides in the tumor protein that comprises the mutated amino acid. In some preferred embodiments, the desired predicted binding affinity to the one or more of the subject's MHC alleles of selected peptides in the second round is less than 50 nanomolar. In some preferred embodiments, the desired predicted binding affinity to the one or more of the subject's MHC alleles of selected peptides in the second round is less than 200 nanomolar.
In some preferred embodiments, the first and/or second rounds of vaccination further comprise administering peptides, or the nucleic acids encoding them, which occur naturally in a tumor protein.
In some preferred embodiments, the first and second rounds of vaccination each comprise more than a single application of the array of peptides, or the nucleic acids encoding them. In some preferred embodiments, each round of vaccination comprises 3 or more applications of the array of peptides, or the nucleic acids encoding them.
In some preferred embodiments, the tumor is a tumor of a tissue selected from the group consisting of nervous system, lung, breast, pancreas, liver, skin, prostate, genito-urinary tract, hematopoietic system, gastro-intestinal, endocrine system, and musculoskeletal tissues.
In some preferred embodiments, the tumor of the nervous system tissue is selected from the group consisting of glioma, glioblastoma, neuroblastoma, meningioma, schwannoma, and metastases to the brain from other sites.
In some preferred embodiments, the non-mutated peptides administered in the first round of vaccination are selected from the group consisting of VEGFR, WT-1, and vimentin. In some preferred embodiments, the first round of vaccination utilizes peptides that are selected from the group consisting of SEQ ID NOs: 1-118 and combinations thereof.
In some preferred embodiments, the first round of vaccination further includes mutated peptides that are selected from the group consisting of EGFR, H3.3, and IDH. In some preferred embodiments, the mutated peptides comprise T cell exposed motifs selected from the group consisting of SEQ ID NOs: 154-183, 284-288, 294-298, 344-348, 392-396, and 441-446 and combinations thereof. In some preferred embodiments, the EGFR is the variant EGFRvIII. In some preferred embodiments, the mutated proteins from EGFR comprise the T cell exposed motifs selected from the group consisting of SEQ ID NOs: 284-288 and combinations thereof. In some preferred embodiments, the T cell exposed motifs are flanked by groove exposed motifs that do not occur naturally and in which amino acids have been substituted to enhance MHC binding.
In some preferred embodiments, the first round of vaccination further includes mutated peptides that are drawn from the protein group comprising KIAA1549-BRAF fusion and BRAF. In some preferred embodiments, the mutated peptides comprise the T cell exposed motifs selected from the group consisting of SEQ ID NOs: 468-472 and 512-514 and combinations thereof.
In some preferred embodiments, the tumor is a lung tumor. In some preferred embodiments, the non-mutated peptides administered in the first round of vaccination are selected from the group consisting of CEA, MAGE-A1, MAGE-A3, MAGE-A4, MSLN, PRAME, TERT, HER2, MUC1, BIRC5, STEAP1, SOX2, NY-ESO-1 and combinations thereof. In some preferred embodiments, the first round of vaccination further includes mutated peptides that are selected from the group consisting of KRAS, EGFR, PIK3CA, TP53, BRAF, EML4 ALK and combinations thereof.
In some preferred embodiments, the tumor of the skin is a melanoma. In some preferred embodiments, the non-mutated peptides administered in the first round of vaccination are selected from the group consisting of PMEL, MAGE family, MART, MAGEA3, NY-ESO-1 and combinations thereof. In some preferred embodiments, the first round of vaccination further includes mutated peptides that are selected from the group consisting of BRAF, NRAS, IDH, MAK2K1 and combinations thereof.
In some preferred embodiments, the tumor is a prostate tumor. In some preferred embodiments, the non-mutated peptides administered in the first round of vaccination are selected from the group consisting of PSMA, SSEA-4, MAGE-C2, PSA, NANOG and combinations thereof. In some preferred embodiments, the first round of vaccination further includes mutated peptides that are selected from the group consisting of SPOP, TP53, PIK3CA, IDH and combinations thereof.
In some preferred embodiments, the tumor is a breast tumor. In some preferred embodiments, the non-mutated peptides administered in the first round of vaccination are selected from the group consisting of MUC 1, CEA, ERBB2, MAGE, NY-ESO1, TACA and combinations thereof. In some preferred embodiments, the first round of vaccination further includes mutated peptides that are selected from the group consisting of ST6GALNAC6, MAGEA12, KDM5B, TEX15, ERBB2, ERBB3, CEA, MYC, TP53, MKI67, NME1, PRDX6, EIF5A, PARK7, HSPB1, PIK3CA, HSPA14 proteins and combinations thereof.
In some preferred embodiments, the tumor associated antigen is a viral protein. In some preferred embodiments, the viral protein is selected from the group consisting of herpesviruses and papillomaviruses.
In some preferred embodiments, the array of one or more peptides, or nucleic acids encoding the peptides administered in the first round comprises one or more synthetic peptides or nucleic acids encoding the peptides derived from VEGFR1 or VEGFR2 wherein the synthetic peptides bind one or more MHC alleles and the synthetic peptides are selected from the group consisting of SEQ ID NOs: 1-28 and combinations thereof. In some preferred embodiments, the array of one or more peptides, or nucleic acids encoding the peptides administered in the first round comprises one or more synthetic peptides or nucleic acids encoding the peptides derived from vimentin wherein the synthetic peptides bind one or more MHC alleles and the peptides are selected from the group consisting of SEQ ID NOs: 29-103 and combinations thereof. In some preferred embodiments, the array of one or more peptides, or nucleic acids encoding the peptides administered in the first round comprises one or more synthetic peptides or nucleic acids encoding the peptides derived from WT-1 wherein the synthetic peptides bind one or more MHC alleles and the peptides are selected from the group consisting of SEQ ID NOs: 104-118 and combinations thereof. In some preferred embodiments, the peptides bind one or more MHC I alleles. In some preferred embodiments, the peptides are 8-10 amino acids. In some preferred embodiments, the MHC I allele is not A0201 or A2402. In some preferred embodiments, the peptides bind one or more MHC II alleles. In some preferred embodiments, the peptides are 15-18 amino acids. In some preferred embodiments, the predicted binding affinity of the peptides to the one or more MHC is less than 20 nanomolar. In some preferred embodiments, the predicted binding affinity of the peptides to the one or more MHC is less than 50 nanomolar. In some preferred embodiments, the predicted binding affinity of the peptides to the one or more MHC is less than 200 nanomolar.
In some preferred embodiments, at least 2 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 5 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 10 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 15 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 20 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, the allele is not A0201 or A2402. In some preferred embodiments, at least 2 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 5 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 10 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 15 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, at least 20 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, the peptides (ore nucleic acids encoding the peptides) selected for synthesis and/or coadministration to the subject comprise a combination of peptides that bind to MHC I alleles and MHC II alleles according to the foregoing ranges (e.g., at least 5 peptides that bind MHC I alleles and at least 5 peptides that bind MHC II alleles, and so on).
In some preferred embodiments, from 2 to 100 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 5 to 100 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 10 to 100 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 15 to 100 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 20 to 100 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, the allele is not A0201 or A2402. In some preferred embodiments, from 2 to 100 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 5 to 100 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 10 to 100 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 15 to 100 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 20 to 100 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, the peptides (ore nucleic acids encoding the peptides) selected for synthesis and/or coadministration to the subject comprise a combination of peptides that bind to MHC I alleles and MHC II alleles according to the foregoing ranges (e.g., from 5 to 100 peptides that bind MHC I alleles and from 5 to 100 peptides that bind MHC II alleles, and so on).
In some preferred embodiments, from 2 to 50 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 5 to 50 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 10 to 50 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 15 to 50 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 20 to 50 peptides that bind to MHC I alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, the allele is not A0201 or A2402. In some preferred embodiments, from 2 to 50 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 5 to 100 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 10 to 50 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC I alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 15 to 50 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, from 20 to 50 peptides that bind to MHC II alleles or nucleic acids encoding the peptides that bind to MHC II alleles are selected for synthesis and/or coadministration to a subject in the second round of vaccination. In some preferred embodiments, the peptides (ore nucleic acids encoding the peptides) selected for synthesis and/or coadministration to the subject comprise a combination of peptides that bind to MHC I alleles and MHC II alleles according to the foregoing ranges (e.g., from 5 to 50 peptides that bind MHC I alleles and from 5 to 50 peptides that bind MHC II alleles, and so on).
In some preferred embodiments, the peptides or nucleic acids encoding the peptides in the second round of vaccination have a combination of 2 or more or mutations selected from the group consisting of a missense mutation, an insertion mutation, a deletion mutation, an in-frame nucleotide mutation or out-of-frame nucleotide mutation.
In some preferred embodiments, the peptides or nucleic acids encoding the peptides are administered parenterally. In some preferred embodiments, the parenteral delivery is intradermal or subcutaneous. In some preferred embodiments, the peptides or nucleic acids encoding the peptides are administered orally.
In some preferred embodiments, the peptides or nucleic acids encoding the peptides are administered in a vaccine comprising an adjuvant.
In some preferred embodiments, the peptides or nucleic acids encoding the peptides are administered in vaccine via a viral vector.
In some preferred embodiments, the peptides or nucleic acids encoding the peptides are administered in a vaccine comprising a lipid drug delivery system.
In some preferred embodiments, the first or second vaccination is followed by a checkpoint inhibitor or other immunotherapeutic intervention to enhance T cell activity.
In some preferred embodiments, the methods further comprise a step of analyzing for adverse reactions based on T cell expose motif match in the human proteome.
In some preferred embodiments, the methods of the present invention further comprise harvesting antigen presenting cells and contacting the harvested cells in vitro with the one or more of the peptides, or nucleic acids encoding the same peptides identified in the methods described above. In some preferred embodiments, the methods further comprise reinfusing the antigen presenting cells into the subject from whom they were drawn. In some preferred embodiments, the methods further comprise contacting the antigen presenting cells which have been contacted by the peptides with T cells drawn from the subject and reinfusing the T cells into the subject.
In some preferred embodiments, the present invention provides a diagnostic test for monitoring the immune response to vaccination with any of the methods of claims 1 to 79 comprising a capture reagent selected from the group consisting of peptides encoded by SEQ IDs 1-537 and combinations thereof.
In some preferred embodiments, the present invention provides a vaccine comprising one or more synthetic peptides or nucleic acids encoding the synthetic peptides derived from VEGFR1 or VEGFR2 wherein the synthetic peptides bind one or more MHC alleles and the synthetic peptides are selected from the group consisting of SEQ ID NOs: 1-28 and combinations thereof. In some preferred embodiments, the present invention provides a vaccine comprising one or more synthetic peptides or nucleic acids encoding the synthetic peptides derived from vimentin wherein the synthetic peptides bind one or more MHC alleles and the peptides are selected from the group consisting of SEQ ID NOs: 29-103 and combinations thereof. In some preferred embodiments, the present invention provides a vaccine comprising one or more synthetic peptides or nucleic acids encoding the synthetic peptides derived from WT-1 wherein the synthetic peptides bind one or more MHC alleles and the peptides are selected from the group consisting of SEQ ID NOs: 104-118 and combinations thereof.
In some preferred embodiments, the peptides in the vaccine bind one or more MHC I alleles. In some preferred embodiments, the peptides do not bind MHC I alleles A0201 or A2402 at less than 200 nM. In some preferred embodiments, the peptides are 8-10 amino acids. In some preferred embodiments, the peptides bind one or more MHC II alleles. In some preferred embodiments, the peptides are 15-18 amino acids. In some preferred embodiments, the predicted binding affinity of the peptides to the one or more MHC alleles is less than 20 nanomolar. In some preferred embodiments, the predicted binding affinity of the peptides to the one or more MHC alleles is less than 50 nanomolar. In some preferred embodiments, the predicted binding affinity of the peptides to the one or more MHC alleles is less than 200 nanomolar. In some preferred embodiments, the vaccines further comprise a pharmaceutically acceptable carrier. In some preferred embodiments, the vaccines further comprise an adjuvant.
In some preferred embodiments, the present invention provides a vaccination regimen comprising administering a vaccine as described above to a subject with cancer.
In some preferred embodiments, the present invention provides a vaccine as described above for use in treating cancer or tumor in a subject. In some preferred embodiments, the tumor is a tumor of a tissue selected from the group consisting of nervous system, lung, breast, pancreas, liver, skin, prostate, genito-urinary tract, hematopoietic system, gastro-intestinal, endocrine system, and musculoskeletal tissues. In some preferred embodiments, the tumor of the nervous system tissue is selected from the group consisting of glioma, glioblastoma, neuroblastoma, meningioma, schwannoma, and metastases to the brain from other sites.
As used herein, the term “genome” refers to the genetic material (e.g., chromosomes) of an organism or a host cell.
As used herein, the term “proteome” refers to the entire set of proteins expressed by a genome, cell, tissue or organism. A “partial proteome” refers to a subset the entire set of proteins expressed by a genome, cell, tissue or organism. Examples of “partial proteomes” include, but are not limited to, transmembrane proteins, secreted proteins, and proteins with a membrane motif. Human proteome refers to all the proteins comprised in a human being. Multiple such sets of proteins have been sequenced and are accessible at the InterPro international repository (on the world wide web at ebi.ac.uk/interpro). Human proteome is also understood to include those proteins and antigens thereof which may be over-expressed in certain pathologies or expressed in a different isoforms in certain pathologies. Hence, as used herein, tumor associated antigens are considered part of the human proteome. “Proteome” may also be used to describe a large compilation or collection of proteins, such as all the proteins in an immunoglobulin collection or a T cell receptor repertoire, or the proteins which comprise a collection such as the allergome, such that the collection is a proteome which may be subject to analysis. All the proteins in a bacteria or other microorganism are considered its proteome.
As used herein, the terms “protein,” “polypeptide,” and “peptide” refer to a molecule comprising amino acids joined via peptide bonds. In general, “peptide” is used to refer to a sequence of 40 or less amino acids and “polypeptide” is used to refer to a sequence of greater than 40 amino acids.
As used herein, the term, “synthetic polypeptide,” “synthetic peptide” and “synthetic protein” refer to peptides, polypeptides, and proteins that are produced by a recombinant process (i.e., expression of exogenous nucleic acid encoding the peptide, polypeptide or protein in an organism, host cell, or cell-free system) or by chemical synthesis.
As used herein, the term “protein of interest” refers to a protein encoded by a nucleic acid of interest. It may be applied to any protein to which further analysis is applied or the properties of which are tested or examined. Similarly, as used herein, “target protein” may be used to describe a protein of interest that is subject to further analysis.
As used herein “peptidase” refers to an enzyme which cleaves a protein or peptide. The term peptidase may be used interchangeably with protease, proteinases, oligopeptidases, and proteolytic enzymes. Peptidases may be endopeptidases (endoproteases), or exopeptidases (exoproteases). The term peptidase would also include the proteasome which is a complex organelle containing different subunits each having a different type of characteristic scissile bond cleavage specificity. Similarly the term peptidase inhibitor may be used interchangeably with protease inhibitor or inhibitor of any of the other alternate terms for peptidase.
As used herein, the term “immunogen” refers to a molecule which stimulates a response from the adaptive immune system, which may include responses drawn from the group comprising an antibody response, a cytotoxic T cell response, a T helper response, and a T cell memory. An immunogen may stimulate an upregulation of the immune response with a resultant inflammatory response or may result in down regulation or immunosuppression. Thus, the T-cell response may be a T regulatory response. An immunogen also may stimulate a B-cell response and lead to an increase in antibody titer. Another term used herein to describe a molecule or combination of molecules which stimulate an immune response is “antigen”.
As used herein, the term “native” (or wild type) when used in reference to a protein refers to proteins encoded by the genome of a cell, tissue, or organism, other than one manipulated to produce synthetic proteins.
As used herein the term “epitope” refers to a peptide sequence which elicits an immune response, from either T cells or B cells or antibody
As used herein, the term “B-cell epitope” refers to a polypeptide sequence that is recognized and bound by a B-cell receptor. A B-cell epitope may be a linear peptide or may comprise several discontinuous sequences which together are folded to form a structural epitope. Such component sequences which together make up a B-cell epitope are referred to herein as B-cell epitope sequences. Hence, a B-cell epitope may comprise one or more B-cell epitope sequences. Hence, a B cell epitope may comprise one or more B-cell epitope sequences. A linear B-cell epitope may comprise as few as 2-4 amino acids or more amino acids.
“B cell core peptides” or “core pentamer” when used herein refers to the central 5 amino acid peptide in a predicted B cell epitope sequence. The B cell epitope may be evaluated by predicting the binding of across a series of 9-mer windows, the core pentamer then is the central pentamer of the 9-mer window
As used herein, the term “T-cell epitope” refers to a polypeptide sequence which when bound to a major histocompatibility protein molecule provides a configuration recognized by a T-cell receptor. Typically, T-cell epitopes are presented bound to a MHC molecule on the surface of an antigen-presenting cell.
As used herein, the term “predicted T-cell epitope” refers to a polypeptide sequence that is predicted to bind to a major histocompatibility protein molecule by the neural network algorithms described herein, by other computerized methods, or as determined experimentally.
As used herein, the term “major histocompatibility complex (MHC)” refers to the MHC Class I and MHC Class II genes and the proteins encoded thereby. Molecules of the MHC bind small peptides and present them on the surface of cells for recognition by T-cell receptor-bearing T-cells. The MHC is both polygenic (there are several MHC class I and MHC class II genes) and polyallelic or polymorphic (there are multiple alleles of each gene). The terms MHC-I, MHC-II, MHC-1 and MHC-2 are variously used herein to indicate these classes of molecules. Included are both classical and nonclassical MHC molecules. An MHC molecule is made up of multiple chains (alpha and beta chains) which associate to form a molecule. The MHC molecule contains a cleft or groove which forms a binding site for peptides. Peptides bound in the cleft or groove may then be presented to T-cell receptors. The term “MHC binding region” refers to the groove region of the MHC molecule where peptide binding occurs.
As used herein, an “MHC II binding groove” refers to the structure of an MHC molecule that binds to a peptide. The peptide that binds to the MHC II binding groove may be from about 11 amino acids to about 23 amino acids in length, but typically comprises a 15-mer. The amino acid positions in the peptide that binds to the groove are numbered based on a central core of 9 amino acids numbered 1-9, and positions outside the 9 amino acid core numbered as negative (N terminal) or positive (C terminal). Hence, in a 15mer the amino acid binding positions are numbered from −3 to +3 or as follows: −3, −2, −1, 1, 2, 3, 4, 5, 6, 7, 8, 9, +1, +2, +3.
As used herein, the term “haplotype” refers to the HLA alleles found on one chromosome and the proteins encoded thereby. Haplotype may also refer to the allele present at any one locus within the MHC. Each class of MHC-Is represented by several loci: e.g., HLA-A (Human Leukocyte Antigen-A), HLA-B, HLA-C, HLA-E, HLA-F, HLA-G, HLA-H, HLA-J, HLA-K, HLA-L, HLA-P and HLA-V for class I and HLA-DRA, HLA-DRB1-9, HLA-, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1, HLA-DMA, HLA-DMB, HLA-DOA, and HLA-DOB for class II. The terms “HLA allele” and “MHC allele” are used interchangeably herein. HLA alleles are listed at hla.alleles.org/nomenclature/naming.html, which is incorporated herein by reference.
The MHCs exhibit extreme polymorphism: within the human population there are, at each genetic locus, a great number of haplotypes comprising distinct alleles—the IMGT/HLA database release (February 2010) lists 948 class I and 633 class II molecules, many of which are represented at high frequency (>1%). MHC alleles may differ by as many as 30-aa substitutions. Different polymorphic MHC alleles, of both class I and class II, have different peptide specificities: each allele encodes proteins that bind peptides exhibiting particular sequence patterns.
The naming of new HLA genes and allele sequences and their quality control is the responsibility of the WHO Nomenclature Committee for Factors of the HLA System, which first met in 1968, and laid down the criteria for successive meetings. This committee meets regularly to discuss issues of nomenclature and has published 19 major reports documenting firstly the HLA antigens and more recently the genes and alleles. The standardization of HLA antigenic specifications has been controlled by the exchange of typing reagents and cells in the International Histocompatibility Workshops. The IMGT/HLA Database collects both new and confirmatory sequences, which are then expertly analyzed and curated before been named by the Nomenclature Committee. The resulting sequences are then included in the tools and files made available from both the IMGT/HLA Database and at hla.alleles.org.
Each HLA allele name has a unique number corresponding to up to four sets of digits separated by colons. See e.g., hla.alleles.org/nomenclature/naming.html which provides a description of standard HLA nomenclature and Marsh et al., Nomenclature for Factors of the HLA System, 2010 Tissue Antigens 2010 75:291-455. HLA-DRB1*13:01 and HLA-DRB1*13:01:01:02 are examples of standard HLA nomenclature. The length of the allele designation is dependent on the sequence of the allele and that of its nearest relative. All alleles receive at least a four digit name, which corresponds to the first two sets of digits, longer names are only assigned when necessary.
The digits before the first colon describe the type, which often corresponds to the serological antigen carried by an allele, The next set of digits are used to list the subtypes, numbers being assigned in the order in which DNA sequences have been determined. Alleles whose numbers differ in the two sets of digits must differ in one or more nucleotide substitutions that change the amino acid sequence of the encoded protein. Alleles that differ only by synonymous nucleotide substitutions (also called silent or non-coding substitutions) within the coding sequence are distinguished by the use of the third set of digits. Alleles that only differ by sequence polymorphisms in the introns or in the 5′ or 3′ untranslated regions that flank the exons and introns are distinguished by the use of the fourth set of digits. In addition to the unique allele number there are additional optional suffixes that may be added to an allele to indicate its expression status. Alleles that have been shown not to be expressed, ‘Null’ alleles have been given the suffix ‘N’. Those alleles which have been shown to be alternatively expressed may have the suffix ‘L’, ‘S’, ‘C’, ‘A’ or‘Q’. The suffix‘L’ is used to indicate an allele which has been shown to have ‘Low’ cell surface expression when compared to normal levels. The ‘S’ suffix is used to denote an allele specifying a protein which is expressed as a soluble ‘Secreted’ molecule but is not present on the cell surface. A ‘C’ suffix to indicate an allele product which is present in the ‘Cytoplasm’ but not on the cell surface. An ‘A’ suffix to indicate ‘Aberrant’ expression where there is some doubt as to whether a protein is expressed. A ‘Q’ suffix when the expression of an allele is ‘Questionable’ given that the mutation seen in the allele has previously been shown to affect normal expression levels.
In some instances, the HLA designations used herein may differ from the standard HLA nomenclature just described due to limitations in entering characters in the databases described herein. As an example, DRB1_0104, DRB1*0104, and DRB1-0104 are equivalent to the standard nomenclature of DRB1*01:04. In most instances, the asterisk is replaced with an underscore or dash and the semicolon between the two digit sets is omitted.
As used herein, the term “polypeptide sequence that binds to at least one major histocompatibility complex (MHC) binding region” refers to a polypeptide sequence that is recognized and bound by one or more particular MHC binding regions as predicted by the neural network algorithms described herein or as determined experimentally.
As used herein the terms “canonical” and “non-canonical” are used to refer to the orientation of an amino acid sequence. Canonical refers to an amino acid sequence presented or read in the N terminal to C terminal order; non-canonical is used to describe an amino acid sequence presented in the inverted or C terminal to N terminal order.
As used herein, the term “transmembrane protein” refers to proteins that span a biological membrane. There are two basic types of transmembrane proteins. Alpha-helical proteins are present in the inner membranes of bacterial cells or the plasma membrane of eukaryotes, and sometimes in the outer membranes. Beta-barrel proteins are found only in outer membranes of Gram-negative bacteria, cell wall of Gram-positive bacteria, and outer membranes of mitochondria and chloroplasts.
As used herein, the term “affinity” refers to a measure of the strength of binding between two members of a binding pair, for example, an antibody and an epitope and an epitope and a MHC-I or II haplotype. Kd is the dissociation constant and has units of molarity. The affinity constant is the inverse of the dissociation constant. An affinity constant is sometimes used as a generic term to describe this chemical entity. It is a direct measure of the energy of binding. The natural logarithm of K is linearly related to the Gibbs free energy of binding through the equation ΔG0=−RT LN(K) where R=gas constant and temperature is in degrees Kelvin. Affinity may be determined experimentally, for example by surface plasmon resonance (SPR) using commercially available Biacore SPR units (GE Healthcare) or in silico by methods such as those described herein in detail. Affinity may also be expressed as the ic50 or inhibitory concentration 50, that concentration at which 50% of the peptide is displaced. Likewise ln(ic50) refers to the natural log of the ic50.
The term “Koff”, as used herein, is intended to refer to the off-rate constant, for example, for dissociation of an antibody from the antibody/antigen complex, or for dissociation of an epitope from an MHC haplotype.
The term “Kd”, as used herein, is intended to refer to the dissociation constant (the reciprocal of the affinity constant “Ka”), for example, for a particular antibody-antigen interaction or interaction between an epitope and an MHC haplotype.
As used herein, the terms “strong binder” and “strong binding” and “High binder” and “high binding” or “high affinity” refer to a binding pair or describe a binding pair that have an affinity of greater than 2×107M−1 (equivalent to a dissociation constant of 50 nM Kd)
As used herein, the term “moderate binder” and “moderate binding” and “moderate affinity” refer to a binding pair or describe a binding pair that have an affinity of from 2×107M−1 to 2×106M−1.
As used herein, the terms “weak binder” and “weak binding” and “low affinity” refer to a binding pair or describe a binding pair that have an affinity of less than 2×106M−1 (equivalent to a dissociation constant of 500 nM Kd)
Binding affinity may also be expressed by the standard deviation from the mean binding found in the peptides making up a protein. Hence a binding affinity may be expressed as “−1σ” or <−1σ, where this refers to a binding affinity of 1 or more standard deviations below the mean. A common mathematical transformation used in statistical analysis is a process called standardization wherein the distribution is transformed from its standard units to standard deviation units where the distribution has a mean of zero and a variance (and standard deviation) of 1. Because each protein comprises unique distributions for the different MHC alleles standardization of the affinity data to zero mean and unit variance provides a numerical scale where different alleles and different proteins can be compared. Analysis of a wide range of experimental results suggest that a criterion of standard deviation units can be used to discriminate between potential immunological responses and non-responses. An affinity of 1 standard deviation below the mean was found to be a useful threshold in this regard and thus approximately 15% (16.2% to be exact) of the peptides found in any protein will fall into this category.
The terms “specific binding” or “specifically binding” when used in reference to the interaction of an antibody and a protein or peptide or an epitope and an MHC haplotype means that the interaction is dependent upon the presence of a particular structure (i.e., the antigenic determinant or epitope) on the protein; in other words the antibody is recognizing and binding to a specific protein structure rather than to proteins in general. For example, if an antibody is specific for epitope “A,” the presence of a protein containing epitope A (or free, unlabeled A) in a reaction containing labeled “A” and the antibody will reduce the amount of labeled A bound to the antibody.
As used herein, the term “antigen binding protein” refers to proteins that bind to a specific antigen. “Antigen binding proteins” include, but are not limited to, immunoglobulins, including polyclonal, monoclonal, chimeric, single chain, and humanized antibodies, Fab fragments, F(ab′)2 fragments, and Fab expression libraries. Various procedures known in the art are used for the production of polyclonal antibodies. For the production of antibody, various host animals can be immunized by injection with the peptide corresponding to the desired epitope including but not limited to rabbits, mice, rats, sheep, goats, etc.
“Adjuvant” as used herein encompasses various adjuvants that are used to enhance the immunological response, depending on the host species, including but not limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, Lipid A analogues (e.g. poly I:C), pluronic polyols, polyanions, peptides, oil emulsions, CpG, C type lectin ligands, CD1d ligands (e.g. α-galactosylceramide), squalene, squalene emulsions, liposomes, imidazoquinolines (e.g. imiquimod), keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (Bacille Calmette-Guerin) and Corynebacterium parvum. In other embodiments a cytokine may be co-administered, including but not limited to interferon gamma or stimulators thereof, interleukin 12, or granulocyte stimulating factor. In other embodiments the peptides or their encoding nucleic acids may be co-administered with a local inflammatory agent, either chemical or physical. Examples include, but are not limited to, heat, infrared light, proinflammatory drugs, including but not limited to imiquimod.
As used herein “immunoglobulin” means the distinct antibody molecule secreted by a clonal line of B cells; hence when the term “100 immunoglobulins” is used it conveys the distinct products of 100 different B-cell clones and their lineages.
As used herein, the term “vector,” when used in relation to recombinant DNA technology, refers to any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, retrovirus, virion, etc., which is capable of replication when associated with the proper control elements and which can transfer gene sequences between cells. Thus, the term includes cloning and expression vehicles, as well as viral vectors. “Viral vector” as used herein includes but is not limited to adenoviral vectors, adeno-associated viral vectors, lentiviral vectors, retroviral vectors, poliovirus vectors, measles virus vectors, flavivirus vectors, poxvirus vectors, and other viral vectors which may be used to deliver a peptide or nucleic acid sequence to a host cell.
As used herein, the term “host cell” refers to any eukaryotic cell (e.g., mammalian cells, avian cells, amphibian cells, plant cells, fish cells, insect cells, yeast cells), and bacteria cells, and the like, whether located in vitro or in vivo (e.g., in a transgenic organism).
The term “isolated” when used in relation to a nucleic acid, as in “an isolated oligonucleotide” refers to a nucleic acid sequence that is identified and separated from at least one contaminant nucleic acid with which it is ordinarily associated in its natural source. Isolated nucleic acids are nucleic acids present in a form or setting that is different from that in which they are found in nature. In contrast, non-isolated nucleic acids are nucleic acids such as DNA and RNA that are found in the state in which they exist in nature.
A “subject” is an animal such as vertebrate, preferably a mammal such as a human, a bird, or a fish. Mammals are understood to include, but are not limited to, murines, simians, humans, bovines, ovines, cervids, equines, porcines, canines, felines etc.). A “subject affected by cancer” is a cancer patient.
An “effective amount” is an amount sufficient to effect beneficial or desired results. An effective amount can be administered in one or more administrations,
As used herein, the term “purified” or “to purify” refers to the removal of undesired components from a sample. As used herein, the term “substantially purified” refers to molecules, either nucleic or amino acid sequences, that are removed from their natural environment, isolated or separated, and are at least 60% free, preferably 75% free, and most preferably 90% free from other components with which they are naturally associated. An “isolated polynucleotide” is therefore a substantially purified polynucleotide.
As used herein, the term “motif” refers to a characteristic sequence of amino acids forming a distinctive pattern.
The term “Groove Exposed Motif” (GEM) as used herein refers to a subset of amino acids within a peptide that binds to an MHC molecule; the GEM comprises those amino acids which are turned inward towards the groove formed by the MHC molecule and which play a significant role in determining the binding affinity. In the case of human MHC-I the GEM amino acids are typically (1, 2, 3, 9). In the case of MHC-II molecules two formats of GEM are most common comprising amino acids (−3, 2, −1, 1, 4, 6, 9, +1, +2, +3) and (−3, 2, 1, 2, 4, 6, 9, +1, +2, +3) based on a 15-mer peptide with a central core of 9 amino acids numbered 1-9 and positions outside the core numbered as negative (N terminal) or positive (C terminal).
“Affinity maturation” is the molecular evolution that occurs during somatic hypermutation during which unique variable region sequences generated that are the best at targeting and neutralizing and antigen become clonally expanded and dominate the responding cell populations.
“Germline motif” as used herein describes the amino acid subsets that are found in germline immunoglobulins. Germline motifs comprise both GEM and TCEM motifs found in the variable regions of immunoglobulins which have not yet undergone somatic hypermutation.
“Immunopathology” when used herein describes an abnormality of the immune system. An immunopathology may affect B-cells and their lineage causing qualitative or quantitative changes in the production of immunoglobulins. Immunopathologies may alternatively affect T-cells and result in abnormal T-cell responses. Immunopathologies may also affect the antigen presenting cells. Immunopathologies may be the result of neoplasias of the cells of the immune system. Immunopathology is also used to describe diseases mediated by the immune system such as autoimmune diseases. Illustrative examples of immunopathologies include, but are not limited to, B-cell lymphoma, T-cell lymphomas, Systemic Lupus Erythematosus (SLE), allergies, hypersensitivities, immunodeficiency syndromes, radiation exposure or chronic fatigue syndrome.
“pMHC” Is used to describe a complex of a peptide bound to an MHC molecule. In many instances a peptide bound to an MHC-I will be a 9-mer or 10-mer however other sizes of 7-11 amino acids may be thus bound. Similarly MHC-II molecules may form pMHC complexes with peptides of 15 amino acids or with peptides of other sizes from 11-23 amino acids. The term pMHC is thus understood to include any short peptide bound to a corresponding MHC.
“T-cell exposed motif” (also where abbreviated TCEM), as used herein, refers to the sub set of amino acids in a peptide bound in a MHC molecule which are directed outwards and exposed to a T-cell binding to the pMHC complex. A T-cell binds to a complex molecular space-shape made up of the outer surface MHC of the particular HLA allele and the exposed amino acids of the peptide bound within the MHC. Hence any T-cell recognizes a space shape or receptor which is specific to the combination of HLA and peptide. The amino acids which comprise the TCEM in an MHC-I binding peptide typically comprise positions 4, 5, 6, 7, 8 of a 9-mer. The amino acids which comprise the TCEM in an MHC-II binding peptide typically comprise 2, 3, 5, 7, 8 or −1, 3, 5, 7, 8 based on a 15-mer peptide with a central core of 9 amino acids numbered 1-9 and positions outside the core numbered as negative (N terminal) or positive (C terminal). As indicated under pMHC, the peptide bound to a MHC may be of other lengths and thus the numbering system here is considered a non-exclusive example of the instances of 9-mer and 15 mer peptides.
As used herein “histotope” refers to the outward facing surface of the MHC molecules which surrounds the T cell exposed motif and in combination with the T cell exposed motif serves as the binding surface for the T cell receptor.
As used herein the T cell receptor refers to the molecules exposed on the surface of a T cell which engage the histotope of the MHC and the T cell exposed motif of a peptide bound in the MHC. The T cell receptor comprises two protein chains, known as the alpha and beta chain in 95% of human T cells and as the delta and gamma chains in the remaining 5% of human T cells. Each chain comprises a variable region and a constant region. Each variable region comprises three complementarity determining regions or CDRs
“Regulatory T-cell” or “Treg” as used herein, refers to a T-cell which has an immunosuppressive or down-regulatory function. Regulatory T-cells were formerly known as suppressor T-cells. Regulatory T-cells come in many forms but typically are characterized by expression CD4+, CD25, and Foxp3. Tregs are involved in shutting down immune responses after they have successfully eliminated invading organisms, and also in preventing immune responses to self-antigens or autoimmunity.
“Isoform” as used herein refers to different forms of a protein which differ in a small number of amino acids. The isoform may be a full-length protein (i.e., by reference to a reference wild-type protein or isoform) or a modified form of a partial protein, i.e., be shorter in length than a reference wild-type protein or isoform.
“Immunostimulation” as used herein refers to the signaling that leads to activation of an immune response, whether the immune response is characterized by a recruitment of cells or the release of cytokines which lead to suppression of the immune response. Thus, immunostimulation refers to both upregulation or down regulation.
“Up-regulation” as used herein refers to an immunostimulation which leads to cytokine release and cell recruitment tending to eliminate a non self or exogenous epitope. Such responses include recruitment of T cells, including effectors such as cytotoxic T cells, and inflammation. In an adverse reaction upregulation may be directed to a self-epitope.
“Down regulation” as used herein refers to an immunostimulation which leads to cytokine release that tends to dampen or eliminate a cell response. In some instances, such elimination may include apoptosis of the responding T cells.
“Frequency class” or “frequency classification” as used herein is used to describe logarithmic based bins or subsets of amino acid motifs or cells. When applied to the counts of TCEM motifs found in a given dataset of peptides a logarithmic (log base 2) frequency categorization scheme was developed to describe the distribution of motifs in a dataset. As the cellular interactions between T-cells and antigen presenting cells displaying the motifs in MHC molecules on their surfaces are the ultimate result of the molecular interactions, using a log base 2 system implies that each adjacent frequency class would double or halve the cellular interactions with that motif. Thus, using such a frequency categorization scheme makes it possible to characterize subtle differences in motif usage as well as providing a comprehensible way of visualizing the cellular interaction dynamics with the different motifs.
Hence a Frequency Class 2, or FC 2 means 1 in 4, a Frequency class 10 or FC 10 means 1 in 210 or 1 in 1024. In other embodiments the frequency classification of the TCEM motif in the reference dataset is described by the quantile score of the TCEM in the reference dataset. Quantile scores are used, but is not limited to, applications where the reference dataset is the human proteome or a microbial proteome. “Frequency class” or “frequency classification” may also be applied to cellular clonotypic frequency where it refers to subgroups or bins defined by logarithmic based groupings, whether log base 2 or another selected log base.
A “rare TCEM” as used herein is one which is completely missing in the human proteome or present in up to only five instances in the human proteome.
“Clonotype” as used herein refers to the cell lineage arising from one unique cell. In the particular case of a B cell clonotype it refers to a clonal population of B cells that produces a unique sequence of IGV. The number of B cells that express that sequence varies from singletons to thousands in the repertoire of an individual. In the case of a T cell it refers to a cell lineage which expresses a particular TCR. A clonotype of cancer cells all arise from one cell and carry a particular mutation or mutations or the derivates thereof. The above are examples of clonotypes of cells and should not be considered limiting.
As used herein “epitope mimic” or “TCEM mimic” is used to describe a peptide which has an identical or overlapping TCEM, but may have a different GEM. Such a mimic occurring in one protein may induce an immune response directed towards another protein which carries the same TCEM motif. This may give rise to autoimmunity or inappropriate responses to the second protein.
“Cytokine” as used herein refers to a protein which is active in cell signaling and may include, among other examples, chemokines, interferons, interleukins, lymphokines, granulocyte colony-stimulating factor tumor necrosis factor and programmed death proteins.
“MHC subunit chain” as used herein refers to the alpha and beta subunits of MHC molecules. A MHC II molecule is made up of an alpha chain which is constant among each of the DR, DP, and DQ variants and a beta chain which varies by allele. The MHC I molecule is made up of a constant beta macroglobulin and a variable MHC A, B or C chain.
“Immunoglobulinome” as used herein refers to the total complement of immunoglobulins produced and carried by any one subject.
As used herein the term “repertoire” is used to describe a collection of molecules or cells making up a functional unit or whole. Thus, as one non limiting example, the entirely of the B cells or T cells in a subject comprise its repertoire of B cells or T cells. The entirety of all immunoglobulins expressed by the B cells are its immunoglobulinome or the repertoire of immunoglobulins. A collection of proteins or cell clonotypes which make up a tissue sample, an individual subject or a microorganism may be referred to as a repertoire.
As used herein “mutated amino acid” refers to the appearance of an amino acid in a protein that is the result of a nucleotide change, a missense mutation, or an insertion or deletion or fusion. Mutated amino acid as used herein thus refers to either a substitution of one amino acid for another or to the novel juxtaposition of two amino acids as the result of an insertion, deletion, splice variant or fusion. Hence, the EGFR splice variant vIII is considered a “mutated amino acid” herein.
“Splice variant” as used herein refers to different proteins that are expressed from one gene as the result of inclusion or exclusion of particular exons of a gene in the final, processed messenger RNA produced from that gene or that is the result of cutting and re-annealing of RNA or DNA.
As used here in a “receptor bearing cell” is any cell which carries a ligand binding recognition motif on its surface. In some particular instances a receptor bearing cell is a B cell and its surface receptor comprises an immunoglobulin variable region, the immunoglobulin variable region comprising both heavy and light chains which make up the receptor. In other particular instances a receptor bearing cell may be a T cell which bears a receptor made up of both alpha and beta chains or both delta and gamma chains. Other examples of a receptor bearing cell include cells which carry other ligands such as, in one particular non-limiting example, a programmed death protein of which there are multiple isoforms.
As used herein “immunotherapy intervention” is used to describe any deliberate modification of the immune system, including but not limited to, through the administration of therapeutic drugs or biopharmaceuticals, radiation, T cell therapy, application of engineered T cells, which may include T cells linked to cytotoxic, chemotherapeutic or radiosensitive moieties, checkpoint inhibitor administration, cytokine or recombinant cytokine or cytokine enhancer, including but not limited to a IL-15 agonist, microbiome manipulation, vaccination, B or T cell depletion or ablation, or surgical intervention to remove any immune related tissues.
As used herein “immunomodulatory intervention” refers to any medical or nutritional treatment or prophylaxis administered with the intent of changing the immune response or the balance of immune responsive cells. Such an intervention may be delivered parenterally or orally or via inhalation. Such intervention may include, but is not limited to, a vaccine including both prophylactic and therapeutic vaccines, a biopharmaceutical, which may be from the group comprising an immunoglobulin or part thereof, a T cell stimulator, checkpoint inhibitor, or suppressor, an adjuvant, a cytokine, a cytotoxin, receptor binder, an enhancer of NK (natural killer) cells, an interleukin including but not limited to variants of IL15, superagonists, and a nutritional or dietary supplement. The intervention may also include radiation or chemotherapy to ablate a target group of cells. The impact on the immune response may be to stimulate or to down regulate.
“Checkpoint inhibitor” or “checkpoint blockade” as used herein refers to a type of drug that blocks certain proteins made by some types of immune system cells, such as T cells, and some cancer cells. These proteins help keep immune responses in check and can keep T cells from killing cancer cells. When these proteins are blocked, the “brakes” on the immune system are released and T cells are able to kill cancer cells better. Examples of checkpoint proteins found on T cells or cancer cells include, but are not limited to, PD-1/PD-L1 and CTLA-4/B7-1/B7-2.
As used herein the “cluster of differentiation” proteins refers to cell surface molecules providing targets for immunophenotyping of cells. The cluster of differentiation is also known as cluster of designation or classification determinant and may be abbreviated as CD. Examples of CD proteins include those listed on the world wide web at uniprot.org/docs/cdlist.
As used herein “tumor associated antigens” “TAA” and “tumor associated antigen proteins” refers to proteins which are upregulated in a tumor, but not necessarily mutated relative to that protein in normal tissue. Such tumor associated antigens comprise those that are common to many cases of the same type of cancer and those that are unique to the individual subject. Some tumor associated antigens are derived from viruses such as, but not limited to, herpesviruses, papillomaviruses, retroviruses and polyomaviruses.
As used herein “presentome” refers to the peptides bound in MHC and presented on the surface of antigen presented cells. Mass spectroscopy detects some but not all peptides which are part of the presentome.
“Neoantigen” as used herein refers to a novel epitope motif or antigen created as the result of introduction of a mutation into an amino acid sequence. Thus, a neoantigen differentiates a wildtype protein from its mutant-bearing tumor protein homolog, when such mutant is presented to T cells or B cells.
“Tumor specific antigen” or “tumor specific epitope” is used herein to designate an epitope or antigen that differentiates a mutated tumor protein from its unmutated wildtype homologue. Thus, a neoantigen is one type of tumor specific antigen.
As used herein “driver mutations” are those which arise very early in tumorigenesis and are causally associated with the early steps of cell dysregulation. Driver mutations are shared by all clonal offspring arising from the initial tumor cells and offer some additional fitness benefit to the clonal line within its microenvironment. In contrast “passenger mutations” are those somatic mutations which arise during the differentiation of the tumor and which offer no particular benefit of fitness to the cell. Passengers may serve as biomarkers on tumor cells and may enable some immune evasion. Passenger mutations may differ at different time points in its development and among different parts of a tumor or among metastases. “Driver” and “passenger” are terms largely interchangeable with “trunk” and “branch” mutations.
“Bespoke peptides” or “bespoke vaccine” as used herein refers to a peptide or neoantigen or a combination of peptides, or nucleic acid encoding peptides, that are tailored or personalized specifically for an individual patient, taking into account that patient's HLA alleles and mutations. A bespoke peptide or bespoke vaccine is also referred to herein as a “personalized peptide”, “personalized peptide vaccine”, “personalized neoepitope vaccine” or “personalized vaccine”.
As used herein “TCGA” refers to The Cancer Genome Atlas (one the world wide web at cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga).
“Immunopathology” when used herein describes an abnormality of the immune system. An immunopathology may affect B-cells and their lineage causing qualitative or quantitative changes in the production of immunoglobulins. Immunopathologies may alternatively affect T-cells and result in abnormal T-cell responses. Immunopathologies may also affect the antigen presenting cells. Immunopathologies may be the result of neoplasias of the cells of the immune system. Immunopathology is also used to describe diseases mediated by the immune system such as autoimmune diseases. Representative autoimmune diseases include, but are not limited to rheumatoid arthritis, diabetes type I and type II, Ankylosing Spondylitis, Atopic allergy, Atopic Dermatitis, Autoimmune cardiomyopathy, Autoimmune enteropathy, Autoimmune hemolytic anemia, Autoimmune hepatitis, Autoimmune inner ear disease, Autoimmune lymphoproliferative syndrome, Autoimmune peripheral neuropathy, Autoimmune pancreatitis, Autoimmune polyendocrine syndrome, Autoimmune progesterone dermatitis, Autoimmune thrombocytopenic purpura, Autoimmune uveitis, Bullous Pemphigoid, Castleman's disease, Celiac disease, Cogan syndrome, Cold agglutinin disease, Crohn's Disease, Dermatomyositis, Eosinophilic fasciitis, Gastrointestinal pemphigoid, Goodpasture's syndrome, Graves' disease, Guillain-Barré syndrome, Anti-ganglioside Hashimoto's encephalitis, Hashimoto's thyroiditis, Systemic Lupus erythematosus, Miller-Fisher syndrome, Mixed Connective Tissue Disease, Myasthenia gravis, Narcolepsy, Pemphigus vulgaris, Polymyositis, Primary biliary cirrhosis, Psoriasis, Psoriatic Arthritis, Relapsing polychondritis, Sjögren's syndrome, Temporal arteritis, Ulcerative Colitis, Vasculitis, and Wegener's granulomatosis.
“Antigen presenting cell” as used herein refers to cells which are capable of presentation of peptides to T cells bound to MHC molecules. This includes but is not limited to the so called “professional” antigen presenting cells comprising, but not limited to, dendritic cells, B cells, and macrophages, but also the so called non-professional antigen presenting cells which carry MHC molecules.
“Cognate” as used herein in the context of a T cell refers to the T cell capable of binding a particular epitope.
“Vaccination round” or “round of vaccination” as used herein refers to a vaccination regimen which may comprise one or more administrations of the same one or more immunogens. Hence, a primary immunization followed by one or more boosters with the same antigen is considered a “round” of vaccination.
“Surgical intervention” as used herein in the context of a tumor refers to any intervention by a surgeon which may result in the resection of tissue. It does not refer to simple needle biopsies conducted in an outpatient clinic.
“Genome Data Commons” as used herein refers the National Cancer Institute database maintained at the University of Chicago and on the world wide web at portal.gdc.cancer.gov/ and gdc.cancer.gov/.
“Mutational landscape” is a term used herein to describe the total array of mutations found in any one individual cancer subject or characteristic of a particular type of cancer.
“Lipid drug delivery system” or LDDS as used herein is a generic term which encompasses lipid nanoparticles, emulsions, self-emulsifying drug delivery systems, nanocapsules and liposomes, wherein molecules of a drug active product is encased or partially encased in lipid.
“HUGO” as used herein refers to the Human Genome Organisation Gene Nomenclature Committee at the European Bioinformatics Institute (one the world wide web at genenames.org) which assigns a name and an approved gene symbol to each gene. Examples of HUGO gene names included herein are EGFR (Epidermal growth factor receptor), H3.3 or H33 (Histone H3.3), IDH (isocitrate dehydrogenase), BRAF (Serine/threonine-protein kinase B-raf), TP53 (Cellular tumor antigen p53), PTEN (Phosphatidylinositol 3,4,5-trisphosphate 3-phosphatase and dual-specificity protein phosphatase), ERBB2 (Receptor tyrosine-protein kinase erbB-2), PIK3CA (Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform), and KRAS (GTPase KRas). Other examples which are found in fusion proteins mentioned herein are KIAA1549-BRAF (UPF0606 protein KIAA1549 fused to Serine/threonine-protein kinase B-raf) and EML4-ALK (Echinoderm microtubule-associated protein-like 4 fused to ALK tyrosine kinase receptor).
This invention provides a method for stimulating the immune response to a tumor by implementing a two-phase vaccination strategy as shown in
The stimulation of the immune system is achieved either by means of stimulation of dendritic cells or T cells in vitro followed by administration of these cells to a patient, or by means of administration directly to the subject of a vaccine in which peptides, or their encoding nucleic acids, have been selected or designed to ensure an appropriate level of binding affinity to the particular cancer patient's MHC alleles. The first and second rounds of vaccinations can be administered with a variety of adjuvants and other immunostimulatory agents, not limited to checkpoint inhibitors, radiation, chemotherapy and targeted drugs, and may be administered by any of a number of routes.
In preferred embodiments the HLA alleles of the subject are determined prior to the first round of vaccination. Once the HLA alleles of the subject are known, in preferred embodiments the design of the peptides, or their encoding nucleic acids, from the common mutations may be modified to achieve a desired binding affinity to the alleles. This is achieved by maintaining the mutated amino acids withing the T cell exposed motif, while substituting amino acids in the groove exposed motif to achieve the desired binding affinity. The same process of optimization of binding affinity may be conducted for the personalized peptides of the second vaccination.
While cancer arises as the result of gene mutations which produce dysregulation of growth [1], it is also the consequence of immune evasion. Immune evasion may be the result of exhaustion or suppression in the tumor microenvironment [2]. Immune evasion may also arise because of the absence of cognate tumor-specific T cells in the T cell repertoire. Indeed, our prior characterization of tumor mutations indicates that mutations are often concealed from T cells by poor, or at less non-competitive, binding to MHC molecules and preferential location in pocket or groove exposed positions where they are not able to engage a T cell receptor (See, e.g., PCT US2020/037206, incorporated by reference herein in its entirety). In addition, tumor mutations often give rise to rare amino acid sequence motifs, not otherwise present in the human proteome, which are more likely to lie outside the subject's repertoire of cognate T cells. Immunotherapy interventions, such as checkpoint inhibitors and other immunotherapeutic agonists [3] which enhance and activate T cells responses have brought a significant change to the prognosis of cancer patients. However, checkpoint inhibitors are still limited in their efficacy to about 25-30% of cases [4]. Efficacy is higher in subjects and tumor types with a higher number of mutations [5, 6] where the probability of unleashing an effective tumor specific cognate T cell is higher. Checkpoint inhibitors activate the T cells that a subject already has, but if a cognate T cell clone is absent and thus cannot be upregulated, a checkpoint inhibitor may be limited in efficacy. Thus, it is a desirable goal to increase the number of T cell clones which are directed to epitopes on the tumor cells, and ideally to do this as early as possible in case management. The present invention is directed to this goal of maximizing tumor specific T cell clones, and in particular active CD8+ cytotoxic T cells, as early and rapidly as possible in the course of disease. Maturation and establishment of memory of CD8+ T cells is only completed with the help of CD4+ T helper cells [7]. Therefore, each antigen directed to stimulate CD8+ cells is ideally accompanied by an antigen that will stimulate a close companion CD4+ cell clonal line.
The approach described herein therefore addresses the phased combination of two therapeutic strategies: firstly, early intervention based on clinical diagnosis, and secondly, personalization of neoepitope vaccines following tumor exome sequencing and comparison of tumor vs normal gene mutations.
As the sophistication of sequencing of tumors has advanced, the gene mutations most commonly associated with certain cancers have become better understood. Although common driver mutational patterns are now better recognized, each cancer is different and adopts a pattern of additional mutations that is “personal” to the particular subject. This is a combination not only of the stochastic nature of mutations, particularly passenger mutations occurring as the cancer develops, but also the immune pressure created according to the subject's particular HLA alleles and the unique composition of that subject's T cell repertoire and hence the gaps in that repertoire which may facilitate the survival of a cancer cell carrying a particular mutation or array of mutations. However, full characterization of the tumor mutations, their protein coding consequences and the level of expression of such proteins, cannot be evaluated without access to a biopsy. Such a biopsy is obtained through a surgical procedure at a time point determined by clinical progression. Completing exome sequencing of DNA and RNA from a biopsy is relatively rapid with current techniques but may still lead to a lapse of a few weeks before a personalized vaccine plan can be designed following collection of a biopsy, during which time the tumor continues to grow unchecked. Thus, it is desirable to jumpstart the immune response earlier in anticipation of a complete personalized vaccine at a later date. Furthermore, the surgical intervention itself may result in greater exposure to T cells of epitopes derived from damaged tissue. Thus, priming T cells to the degree possible prior to surgery is desirable to enhance the number of T cell clones cognate to the tumor epitopes as early as possible.
Two groups of tumor epitopes are useful for early stimulation: common mutations found in the cancers of the particular clinical presentation and certain non-mutated tumor associated antigens.
While cancer mutations arise stochastically, certain mutations have emerged as characteristic of certain types of cancer. These are typically in driver genes which determine the pathway of dysregulation of the cancer. As such mutations are indeed tumor specific and are not found in normal tissue, it is possible to target them with relatively low risk of directing an autoimmune response to the corresponding normal protein homolog. The utilization of short peptide immunogens further reduces the possibility of an off-target reaction by minimizing the number of T cell exposed motifs to those essential to generate the tumor-specific response sought. This is the case both to avoid adverse targeting of the homolog normal protein, but also to avoid targeting unrelated proteins which may carry, and present, the same T cell exposed motif. As discussed in Example 11 a method has been developed (See, e.g., PCT US2020/037206, incorporated herein by reference in its entirety) for review of potential off-target autoimmune responses, allowing risk benefit analysis, and mitigating the risk of autoimmunity.
A number of mutations that are highly characteristic of certain cancer types can thus be considered for targeting upon initial clinical presentation. Selected peptides embodying such common mutations, or their encoding nucleic acids, can be applied as a vaccine with a high probability that they will be present but with minimal risk of adverse consequences if they are not present. In order to render a tumor specific response and minimize off target reactions it is necessary to select the peptide antigens so as to have the mutant amino acids exposed to the T cell receptor, i.e. with the mutant amino acids in the T cell exposed motifs. Peptide vaccines, or vaccines comprising peptides encoded in nucleic acid or vectors, directed to the most common mutated proteins, and designed for MHC binding with each HLA allele in mind, can be prepared and held “ready to go” for early intervention as cases present for initial clinical evaluation.
Examples of common mutations which are suitable for such early intervention use are as follows. These are illustrative examples and thus considered non limiting.
In glioblastoma, over half of the cases carry aberrant epidermal growth factor receptor (EGFR). Approximately 30% show upregulated EGFRvIII, a splice variant lacking exons 2-7 [3, 8]. In addition, about 25% cases have mutations in the extracellular domain of EGFR including A289V/D/T, R108K, and G598D. Conversely in lung cancer the most common mutation of EGFR is L858R. While prior efforts have been made to target the EGFRvIII by vaccination [9], these adopted a strategy designed to stimulate antibody dependent cell mediated cytotoxicity, and by linking the EGFR target peptide to keyhole limpet hemocyanin, a very strong and dominant T helper epitope which is likely to out-compete the formation of tumor specific T cells. In one embodiment therefore, described in Example 4 and 5, we provide examples of epitope peptides for exemplar HLA alleles which can target the tumor specific motifs of EGFRvIII and the most common mutations in EGFR.
Midline gliomas are frequently characterized by mutation of histone H3 variant proteins and in particular a K27M mutation of H3.3. While peptide approaches have been proposed, these have been restricted to subjects carrying HLA A0201 and one particular peptide RMSAPSTGGV [10, 11] (see also U.S. Pat. No. 10,441,644). While this peptide has a moderately high predicted binding affinity to A0201, the location of the mutant methionine at position 2 means that this amino acid is preferentially hidden in a pocket position of the MHC groove (groove exposed position) and thus unlikely to stimulate a T cell response that will effectively differentiate tumor and normal tissue. In the present invention, by using modifications of the groove exposed motif of peptides in which the mutant methionine is exposed in the T cell exposed motif, we identify other peptides which are capable of directing a CD8+ response to the tumor and for other HLA alleles.
A very common missense mutation in isocitrate dehydrogenase R132H is often found in gliomas and glioblastomas as well as several other cancers, and is associated with hypermethylation leading to genetic instability which may trigger other mutations [12]. Mutations in isocitrate dehydrogenase enzyme isoform 1 (IDH1) and, to a lesser extent in isoform 2 (IDH2) genes have been identified in a large proportion of diffuse astrocytomas (70-90%), oligodendrogliomas (69-94%), oligoastrocytomas (78-100%), and secondary glioblastomas (82-88%). Peptides have been identified in the vicinity of the mutation in IDH which can stimulate CD4+ cells and produce a tumor suppressive effect [13, 14] (see also U.S. Ser. No. 10/161,940). In the present invention we identify tumor specific epitopes in IDH R132H that can target an array of CD8+ cells in subjects of differing HLA alleles and also provide examples of personalized peptides with altered groove exposed motifs to optimize binding for particular HLAs.
Two mutations are highly characteristic of pediatric low-grade gliomas: KIAA1549-BRAF and BRAFV600E. In one study these together accounted for 68% of pediatric low-grade gliomas [15]. At present only two forms (long/short) of fusion of KIAA1549 to BRAF are described, providing unique neoepitopes at the fusion junction. BRAF mutations at position 600 are common not only in this cancer but in many others, including but not limited to thyroid, skin (melanoma) many adenocarcinomas and others. BRAF600 mutations are among the most common recorded at the Genome Data Commons, especially V600E, but also at lower frequency V600G, and V600M. The present invention provides peptides which can be utilized to target tumor specific T cell exposed motifs harboring such common BRAF mutations.
The above non-limiting examples are provided to indicate that based on clinical presentation, a probable occurrence of certain common mutations can be determined. This can be further validated by imaging studies with radiolabeled tracers (e.g. EGFR) and molecular- and immuno-diagnostics conducted on blood/CSF/urine/saliva, etc. (liquid biopsies), needle aspirates, or bone marrow samples prior to access to a tumor biopsy (for instance for IDH mutations, EGFR).
Immune responses are recognized to a number of proteins that are upregulated in association with tumors or which may control the progression of tumors. A large number of tumor associated antigens have been identified and ranked according to their utility as tumor target antigens for immunotherapy [16]. Among the most prominent tumor associated antigens with immunotherapeutic potential are WT1, MUC1, LMP2, HPV E6 E7, EGFRvIII, HER-2/neu, MAGE A3, p53, NY-ESO-1, PSMA, GD2, CEA, MelanA/MART1, Ras-mutant, gp100, Proteinase3 (PR1), Bcr-abl, Tyrosinase, Survivin, PSA, hTERT, Sarcoma translocation breakpoints, EphA2, PAP, ML-IAP, AFP, EpCAM, ERG (TMPRSS2 ETS fusion gene), NA17, PAX3, ALK, Androgen receptor, Cyclin B1, Polysialic Acid, MYCN, RhoC, TRP-2, GD3, Fucosyl GM1, Mesothelin, PSCA, MAGE A1, sLe(a), CYP1B1, PLAC1, GM3, BORIS, TBXT (brachyury), BCAN, Tn, GloboH, ETV6-AML, NY-BR-1, RGS5, SART3, STn, Carbonic anhydrase IX, PAX5, OY-TES1, Sperm protein 17, LCK, HMWMAA, AKAP-4, SSX2, XAGE 1, B7H3, Legumain, Tie 2, VEGFR2, MAD-CT-1, FAP, PDGFR-beta, MAD-CT-2, and Fos-related antigen 1. These tumor associated antigens (TAA) were among the first cancer immunogens to be explored, albeit with limited success [17-19]. In some instances, there was insufficient understanding of the precise nature of MHC allele-specific presentation. In other cases, the epitopes were not well characterized. Many trials of TAA vaccination preceded the era of checkpoint inhibitors. There are, however, examples of beneficial effects of vaccination with tumor associated antigens [18, 20-23]. In the present invention we provide a method for combining the vaccination with certain epitope peptides from tumor associated antigens with subsequent vaccination with a personalized cancer vaccine.
While any of the above listed tumor associated antigens offer opportunities to target cancer cells and in the present invention are considered targets for the first round of vaccination, some particular examples of actionable tumor associated antigens which are suitable for such early intervention use are as follows. These are illustrative examples and thus considered non limiting.
Vascular endothelial growth factor receptor 1 (VEGFR1) and Vascular endothelial growth factor receptor 2 (VEGFR2), acting as receptors for vascular endothelial growth factors A and B, play a key role in the regulation of angiogenesis, cell migration, and cancer cell invasion. The benefit of reducing vascularization of tumors has been shown by bevacizumab, a monoclonal antibody to VEGFA. Similarly inducing an immune response to VEGFR 1 and 2 has been successful in reducing tumorigenesis in multiple cancers and in mitigating choroidal vascularization [20, 21, 24] (see also U.S. Pat. No. 8,975,229). However, these studies have focused exclusively on two HLA alleles A0201 and A2402 and do not teach how to apply immunization to VEGFR to patients of other alleles. A Salmonella vectored VEGFR protein has also been successfully delivered orally to reduce progression of pancreatic cancer [25]. The present invention provides VEGFR peptides which can be applied more broadly to other subjects with other HLA alleles. In some preferred embodiments such administration is early in the clinical course of the cancer and is followed by a personalized neoepitope vaccination.
Vimentin is a filamentous cytoskeleton protein expressed in most mesenchymal cells. Autoantibodies to post translationally citrullinated vimentin have been linked to rheumatoid arthritis and sarcoidosis. Autoantibodies to vimentin are also documented in systemic lupus erythematosus [26]. In adenocarcinoma cancer cells, including gliomas, glioblastomas and melanoma, vimentin adopts a different structural configuration which exposes an ectodomain, which has served as the target of the therapeutic antibody pritumumab [27-29]. The B cell conformational epitope is not found on normal cells, although the sequence of vimentin remains unchanged. As T cell epitopes are not subject to the same structural constraints as antibody epitopes, it follows that the T cell epitopes of vimentin offer a therapeutic vaccine target. In the present invention, therefore, peptides comprising T cell epitopes in the ectodomain of vimentin are among those provided as an early intervention. In the examples cited below we provide predicted T cell epitopes in vimentin for multiple HLA alleles including both MHC I and MHC II.
Wilms tumor antigen (WT1) is expressed in many cancers, having originally being identified in a childhood renal cancer, Wilms tumor. In addition to renal cancer, WT1 is recognized in many different hematologic cancers, lung cancer, pancreatic cancer, melanoma and a variety of adenomas and adenocarcinomas. It encodes a zinc finger transcription factor that is critical to cell differentiation and it is considered an oncogene [30, 31].
WT-1 has long been recognized as a tumor associated antigen with therapeutic potential [16, 30], however while peptides from this protein have been used in vaccine studies of pancreatic and hematologic cancers, they have been restricted only to those peptides which bind to A0201, A2402 and A0206. Peptides with wider applicability across a greater diversity of the population have not been evaluated [30, 32-34]. In addition a peptide with MHC II binding capable of providing T cell help was identified [35]. In the present invention we provide examples of peptides with broader MHC binding and with suitability for vaccination of individuals with a broad diversity of MHC I and MHC II alleles. While there are multiple isoforms of WT1, the examples below show, it is possible to identify epitopes in conserved regions with broad allele binding.
Tumor associated antigens also include viral antigens. In some particular embodiments the first round of vaccination may include epitope peptides, or their encoding nucleic acids, derived from oncogenic viruses including, but not limited to, human papillomaviruses and herpesviruses. The latter may include epitopes from, but not limited to human herpesviruses 1 and 2 (herpes simplex virus), human herpesvirus 4 (Epstein Barr virus) and human herpesvirus 5 (human cytomegalovirus) [36]. Certain viruses are known to be frequently upregulated in certain cancers, for example human cytomegalovirus in glioblastoma. Whether this is a driver or a consequence of the tumorigenesis is not clear, but the presence of the CMV antigens provide a way to target cancer cells [37, 38]. While such viruses may play a role in tumorigenesis and share common antigens which can be targeted early in the clinical course as provided here in a first round of vaccination, the resultant tumors each have diverse mutations which are then targeted in a second personalized vaccination strategy in round two as described below.
There is increasing evidence that a variety of T cell immunotherapies can be successful in halting the progression of cancer [39].
Tumor-specific antigens comprise both those common to many cancers, and those which are unique to any single patient and which may change over the life of a tumor. Generally, the higher the mutational load, the more infiltrating T cells and the more inflamed a tumor, the greater probability there is of a check-point inhibitor leading to a successful T cell driven elimination of the tumor cells. Mutational load tends to differ between cancer types; some such as melanoma and colorectal cancers have a high mutational frequency. Others such as glioblastoma are notoriously low in mutational numbers.
Several recent publications have reported promising, but mixed, results in the development of personalized vaccines for melanoma [40, 41], lung cancer [42, 43] and glioblastoma [44, 45]. These have employed from 1 to 20 different neoantigens. Increasing the number of neoepitopes incorporated in a vaccine allows for a multipronged attack on the tumor using multiple alleles and multiple antigens derived from different proteins. Mutations continue to arise in tumors as they develop, with antigens gained or lost in the process. There may also be heterogeneity of mutations within a tumor and the mutational landscape may not be fully reflected in the sequencing of a biopsy. Hence a high number of cytotoxic “hits” is desirable rather than depending on only one or two antigen targets [46]. A goal of the present invention is to maximize the number of tumor specific epitopes which can be targeted by T cells responding to peptides presented by a particular patient's alleles.
The goal of T cell immunotherapy has been primarily to activate CD8+ cytotoxic T cells which will target tumor cells, but also to stimulate CD4+ T helper cells to enhance CD8+ responses. Stimulation of CD4+ T helper cells may also enhance B cell responses. Selection of peptides for use as neoepitopes has followed several paths. As a starting point, given the diversity of the human genome, it is desirable to compare sequences of proteins in tumor biopsies with a normal tissue from the same patient [47]. However, reference human genomes are frequently used as comparators to determine mutation sites. Practitioners have then used several approaches to select peptides for use, or for encoding in RNA or DNA for administration. In some instances peptides have been selected based on mass spectroscopy [48, 49]; in yet others predictive algorithms, most often NetMHC Pan [50], was used to select peptides [40, 41, 45]. In one instance, both approaches were reported, but in this particular case none of the mutated peptides were detected by mass spectroscopy [44].
Mutations in cancers include those which are unique to a specific patient. Some are patient specific driver mutations, arising as the root cause of cell dysregulation. Others arise as branch or passenger mutations, which are sequelae to an earlier trunk or driver mutation. Such mutations may continue to evolve throughout the tumor progression. There are also a number of mutations which are found commonly at the same positions in the same proteins, some of which occur repeatedly across many cancer types [51-55]. The Cancer Genome Atlas documents many proteins which are found to share mutations across multiple cancer types (on the world wide web at cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga). Some of these are simple amino acid substitutions arising from single nucleotide mutations; others involve amino acid duplications. In other cases, the mutations give rise to insertions and deletions (indels) and missense sequences. Gene fusions are another source of novel tumor specific motifs. Where these mutations are shared across many cancers, a set of peptides can be designed for each patient HLA allele which will allow stimulation of T cells to specifically target tumor cells with cytotoxic T cells and/or T helper cells.
In the case of indels and missense mutations, when these result in an in-frame downstream sequence they may provide a target-rich sequence, but every patient is unique and so selection of vaccine peptides for these must be handled as a personalized design effort. In some embodiments consistent indels are found repeatedly in many cancers.
In the present invention, a second round of vaccination is based on information gathered following collection of a biopsy of the tumor and a comparative sample of normal tissue. In some preferred embodiments, mutated proteins in the biopsy samples are identified by sequencing the genome, proteome, or transcriptome of cells from the biopsy, in other words obtaining DNA and RNA sequences and deducing protein sequences. The present invention is not limited to any particular method of obtaining sequences of proteins mutated in a biopsy. A variety of sequencing methods are readily available to those of ordinary skill in the art.
In some preferred embodiments, the present invention utilizes nucleic acid sequencing techniques. The nucleic acid sequences are preferably converted in silico to protein sequences from the identification of mutated amino acids and peptides comprising the mutated amino acids.
In some embodiments, the sequencing is Second Generation (a.k.a. Next Generation or Next-Gen), Third Generation (a.k.a. Next-Next-Gen), or Fourth Generation (a.k.a. N3-Gen) sequencing technology including, but not limited to, pyrosequencing, sequencing-by-ligation, single molecule sequencing, sequence-by-synthesis (SBS), semiconductor sequencing, massive parallel clonal, massive parallel single molecule SBS, massive parallel single molecule real-time, massive parallel single molecule real-time nanopore technology, etc. Morozova and Marra provide a review of some such technologies in Genomics, 92: 255 (2008), herein incorporated by reference in its entirety. Those of ordinary skill in the art will recognize that because RNA is less stable in the cell and more prone to nuclease attack experimentally RNA is usually reverse transcribed to DNA before sequencing.
DNA sequencing techniques include fluorescence-based sequencing methodologies (See, e.g., Birren et al., Genome Analysis: Analyzing DNA, 1, Cold Spring Harbor, N.Y.; herein incorporated by reference in its entirety). In some embodiments, the sequencing is automated sequencing. In some embodiments, the sequencing is parallel sequencing of partitioned amplicons (PCT Publication No: WO2006084132 to Kevin McKeman et al., herein incorporated by reference in its entirety). In some embodiments, the sequencing is DNA sequencing by parallel oligonucleotide extension (See, e.g., U.S. Pat. No. 5,750,341 to Macevicz et al., and U.S. Pat. No. 6,306,597 to Macevicz et al., both of which are herein incorporated by reference in their entireties). Additional examples of sequencing techniques include the Church polony technology (Mitra et al., 2003, Analytical Biochemistry 320, 55-65; Shendure et al., 2005 Science 309, 1728-1732; U.S. Pat. Nos. 6,432,360, 6,485,944, 6,511,803; herein incorporated by reference in their entireties), the 454 picotiter pyrosequencing technology (Margulies et al., 2005 Nature 437, 376-380; US 20050130173; herein incorporated by reference in their entireties), the Solexa single base addition technology (Bennett et al., 2005, Pharmacogenomics, 6, 373-382; U.S. Pat. Nos. 6,787,308; 6,833,246; herein incorporated by reference in their entireties), the Lynx massively parallel signature sequencing technology (Brenner et al. (2000). Nat. Biotechnol. 18:630-634; U.S. Pat. Nos. 5,695,934; 5,714,330; herein incorporated by reference in their entireties), and the Adessi PCR colony technology (Adessi et al. (2000). Nucleic Acid Res. 28, E87; WO 00018957; herein incorporated by reference in its entirety).
Next-generation sequencing (NGS) methods share the common feature of massively parallel, high-throughput strategies, with the goal of lower costs in comparison to older sequencing methods (see, e.g., Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; each herein incorporated by reference in their entirety). NGS methods can be broadly divided into those that typically use template amplification and those that do not. Amplification-requiring methods include pyrosequencing commercialized by Roche as the 454 technology platforms (e.g., GS 20 and GS FLX), Life Technologies/Ion Torrent, the Solexa platform commercialized by Illumina, GnuBio, and the Supported Oligonucleotide Ligation and Detection (SOLiD) platform commercialized by Applied Biosystems. Non-amplification approaches, also known as single-molecule sequencing, are exemplified by the HeliScope platform commercialized by Helicos BioSciences, and emerging platforms commercialized by VisiGen, Oxford Nanopore Technologies Ltd., and Pacific Biosciences, respectively.
In pyrosequencing (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 6,210,891; 6,258,568; each herein incorporated by reference in its entirety), template DNA is fragmented, end-repaired, ligated to adaptors, and clonally amplified in-situ by capturing single template molecules with beads bearing oligonucleotides complementary to the adaptors. Each bead bearing a single template type is compartmentalized into a water-in-oil microvesicle, and the template is clonally amplified using a technique referred to as emulsion PCR. The emulsion is disrupted after amplification and beads are deposited into individual wells of a picotiter plate functioning as a flow cell during the sequencing reactions. Ordered, iterative introduction of each of the four dNTP reagents occurs in the flow cell in the presence of sequencing enzymes and luminescent reporter such as luciferase. In the event that an appropriate dNTP is added to the 3′ end of the sequencing primer, the resulting production of ATP causes a burst of luminescence within the well, which is recorded using a CCD camera. It is possible to achieve read lengths greater than or equal to 400 bases, and 106 sequence reads can be achieved, resulting in up to 500 million base pairs (Mb) of sequence.
In the Solexa/Illumina platform (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 6,833,246; 7,115,400; 6,969,488; each herein incorporated by reference in its entirety), sequencing data are produced in the form of shorter-length reads. In this method, single-stranded fragmented DNA is end-repaired to generate 5′-phosphorylated blunt ends, followed by Klenow-mediated addition of a single A base to the 3′ end of the fragments. A-addition facilitates addition of T-overhang adaptor oligonucleotides, which are subsequently used to capture the template-adaptor molecules on the surface of a flow cell that is studded with oligonucleotide anchors. The anchor is used as a PCR primer, but because of the length of the template and its proximity to other nearby anchor oligonucleotides, extension by PCR results in the “arching over” of the molecule to hybridize with an adjacent anchor oligonucleotide to form a bridge structure on the surface of the flow cell. These loops of DNA are denatured and cleaved. Forward strands are then sequenced with reversible dye terminators. The sequence of incorporated nucleotides is determined by detection of post-incorporation fluorescence, with each fluor and block removed prior to the next cycle of dNTP addition. Sequence read length ranges from 36 nucleotides to over 250 nucleotides, with overall output exceeding 1 billion nucleotide pairs per analytical run.
Sequencing nucleic acid molecules using SOLiD technology (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 5,912,148; 6,130,073; each herein incorporated by reference in their entirety) also involves fragmentation of the template, ligation to oligonucleotide adaptors, attachment to beads, and clonal amplification by emulsion PCR. Following this, beads bearing template are immobilized on a derivatized surface of a glass flow-cell, and a primer complementary to the adaptor oligonucleotide is annealed. However, rather than utilizing this primer for 3′ extension, it is instead used to provide a 5′ phosphate group for ligation to interrogation probes containing two probe-specific bases followed by 6 degenerate bases and one of four fluorescent labels. In the SOLiD system, interrogation probes have 16 possible combinations of the two bases at the 3′ end of each probe, and one of four fluors at the 5′ end. Fluor color, and thus identity of each probe, corresponds to specified color-space coding schemes. Multiple rounds (usually 7) of probe annealing, ligation, and fluor detection are followed by denaturation, and then a second round of sequencing using a primer that is offset by one base relative to the initial primer. In this manner, the template sequence can be computationally re-constructed, and template bases are interrogated twice, resulting in increased accuracy. Sequence read length averages 35 nucleotides, and overall output exceeds 4 billion bases per sequencing run.
In certain embodiments, sequencing is nanopore sequencing (see, e.g., Astier et al., J. Am. Chem. Soc. 2006 Feb. 8; 128(5):1705-10, herein incorporated by reference). The theory behind nanopore sequencing has to do with what occurs when a nanopore is immersed in a conducting fluid and a potential (voltage) is applied across it. Under these conditions a slight electric current due to conduction of ions through the nanopore can be observed, and the amount of current is exceedingly sensitive to the size of the nanopore. As each base of a nucleic acid passes through the nanopore, this causes a change in the magnitude of the current through the nanopore that is distinct for each of the four bases, thereby allowing the sequence of the DNA molecule to be determined.
In certain embodiments, sequencing is HeliScope by Helicos BioSciences (Voelkerding et al., Clinical Chem., 55: 641-658, 2009; MacLean et al., Nature Rev. Microbiol., 7: 287-296; U.S. Pat. Nos. 7,169,560; 7,282,337; 7,482,120; 7,501,245; 6,818,395; 6,911,345; 7,501,245; each herein incorporated by reference in their entirety). Template DNA is fragmented and polyadenylated at the 3′ end, with the final adenosine bearing a fluorescent label. Denatured polyadenylated template fragments are ligated to poly(dT) oligonucleotides on the surface of a flow cell. Initial physical locations of captured template molecules are recorded by a CCD camera, and then label is cleaved and washed away. Sequencing is achieved by addition of polymerase and serial addition of fluorescently labeled dNTP reagents. Incorporation events result in fluor signal corresponding to the dNTP, and signal is captured by a CCD camera before each round of dNTP addition. Sequence read length ranges from 25-50 nucleotides, with overall output exceeding 1 billion nucleotide pairs per analytical run.
The Ion Torrent technology is a method of DNA sequencing based on the detection of hydrogen ions that are released during the polymerization of DNA (see, e.g., Science 327(5970): 1190 (2010); U.S. Pat. Appl. Pub. Nos. 20090026082, 20090127589, 20100301398, 20100197507, 20100188073, and 20100137143, incorporated by reference in their entireties for all purposes). A microwell contains a template DNA strand to be sequenced. Beneath the layer of microwells is a hypersensitive ISFET ion sensor. All layers are contained within a CMOS semiconductor chip, similar to that used in the electronics industry. When a dNTP is incorporated into the growing complementary strand a hydrogen ion is released, which triggers a hypersensitive ion sensor. If homopolymer repeats are present in the template sequence, multiple dNTP molecules will be incorporated in a single cycle. This leads to a corresponding number of released hydrogens and a proportionally higher electronic signal. This technology differs from other sequencing technologies in that no modified nucleotides or optics are used. The per-base accuracy of the Ion Torrent sequencer is ˜99.6% for 50 base reads, with ˜100 Mb to 100 Gb generated per run. The read-length is 100-300 base pairs. The accuracy for homopolymer repeats of 5 repeats in length is ˜98%. The benefits of ion semiconductor sequencing are rapid sequencing speed and low upfront and operating costs.
In some embodiments, sequencing is the technique developed by Stratos Genomics, Inc. and involves the use of Xpandomers. This sequencing process typically includes providing a daughter strand produced by a template-directed synthesis. The daughter strand generally includes a plurality of subunits coupled in a sequence corresponding to a contiguous nucleotide sequence of all or a portion of a target nucleic acid in which the individual subunits comprise a tether, at least one probe or nucleobase residue, and at least one selectively cleavable bond. The selectively cleavable bond(s) is/are cleaved to yield an Xpandomer of a length longer than the plurality of the subunits of the daughter strand. The Xpandomer typically includes the tethers and reporter elements for parsing genetic information in a sequence corresponding to the contiguous nucleotide sequence of all or a portion of the target nucleic acid. Reporter elements of the Xpandomer are then detected. Additional details relating to Xpandomer-based approaches are described in, for example, U.S. Pat. Pub No. 20090035777, entitled “High Throughput Nucleic Acid Sequencing by Expansion,” filed Jun. 19, 2008, which is incorporated herein in its entirety. Other emerging single molecule sequencing methods include real-time sequencing by synthesis using a VisiGen platform (Voelkerding et al., Clinical Chem., 55: 641-58, 2009; U.S. Pat. No. 7,329,492; U.S. patent application Ser. No. 11/671,956; U.S. patent application Ser. No. 11/781,166; each herein incorporated by reference in their entirety) in which immobilized, primed DNA template is subjected to strand extension using a fluorescently-modified polymerase and florescent acceptor molecules, resulting in detectible fluorescence resonance energy transfer (FRET) upon nucleotide addition.
In other preferred embodiments, the present invention utilizes protein sequencing techniques. In some embodiments, proteins may be sequenced by Edman degradation. See. e.g., Edman and Begg (1967). “A protein sequenator”. Eur. J. Biochem. 1 (1): 80-91; Alterman and Hunziker (2011) Amino Acid Analysis: Methods and Protocols. Humana Press. ISBN 978-1-61779-444-5. In other embodiments, mass spectrometry techniques are utilized to sequence proteins. See, e.g., Shevchenko et al., (2006) “In-gel digestion for mass spectrometric characterization of proteins and proteomes”. Nature Protocols. 1 (6): 2856-60; Gundry et al., (2009) “Preparation of proteins and peptides for mass spectrometry analysis in a bottom-up proteomics workflow” Current Protocols in Molecular Biology. Chapter 10: Unit 10.25.
Best practices in identifying tumor specific mutations have been described [56] although variations of the approach described are valid. In preferred embodiments described herein MHC binding, cathepsin cleavage patterns, and patterns of T cell exposed frequency are determined by mathematical algorithms previously described, see. e.g., PCT US2011/029192, PCT US2012/055038, US2014/01452, PCT US2020/037206, U.S. Pat. Nos. 10,706,955 and 10,755,801, each of which is incorporated by reference herein in its entirety.
Tumor Fraction and RNA Expression.
In order to effectively target an immune response to any individual tumor-specific mutation, a minimum of two initial conditions must be fulfilled: The mutation must be present in the DNA of a sufficient fraction of tumor cells and the DNA encoding the mutation must be transcribed into RNA and expressed as a protein. The tissue fraction comprising the mutant DNA can vary with the precision of resection of the biopsy and the relative composition of tumor to stromal tissue. It may also vary between metastases. In some instances, the fraction of the tumor biopsy comprising the mutation may be very high, represented in over 35% or over 50% of all cell DNA. In other instances, it may be lower, from 1-2% to 10%. In preferred embodiments the targeted mutations are selected from those mutant genes represented in at least 10% of the tumor DNA.
The RNA expression is an indicator of whether the gene is transcribed and hence actually targetable as a protein. Bulk RNA content of the tumor is enumerated and for each protein is normalized for the number of total reads of RNA detected in the biopsy sample and the length of the RNA transcript as a metric for gene expression. The number of RNA transcripts varies widely between proteins and overall the bulk RNA frequencies can be described as a log-normal distribution. If the gene is being expressed by both parental chromosomes, the relative expression of the normal and mutant allele for a mutated proteins should be correlated to the DNA mutation frequency. Allele specific expression has been shown to occur in tumors. Such a situation would be manifest with the parental chromosomes being expressed at differential rates and would lead to RNA mutant frequencies that differ from the frequency in the DNA. The mutation:normal ratio of expressed RNA compared to the DNA in the tumor fraction is an indicator of this. In some embodiments the DNA mutation occurs in only one chromosome and if expression of the protein is occurring solely or predominantly from the other chromosome, the mutant protein is not expressed or is under represented, rendering it an ineffective target.
To effectively target a tumor specific mutation it is necessary to establish that the protein is indeed being expressed from the parental chromosome containing the mutant gene. Methods for determination of the RNA fraction are described in Example 12 below. Thus in preferred embodiments peptides comprising mutant amino acids are selected from those proteins that are expressed and for which the RNA mutant: normal fraction is at least 10% of the corresponding DNA tumor:normal fraction.
Based on comparative sequencing of tumor biopsy and normal tissue a personalized set of neoepitopes can be defined that can form a personalized vaccine applied in the second round of vaccination. Some consideration in developing such a personalized vaccine are as follows.
The goal of stimulating a cytotoxic T cell response to a tumor is to specifically and differentially destroy the tumor cells while leaving normal cells intact. It follows that to drive a T cell response specific to the cancer, the T cell receptor must recognize an epitope unique to the tumor. Thus, the mutated amino acid must be located in the exposed pentameric motif exposed to the T cell receptor. When a mutated amino acid is located in a pocket or groove exposed motif, it may or may not affect binding affinity, but it is hidden from the T cell receptor and cannot elicit tumor-specific T cell responses. In some instances, the natural binding affinity of the mutated peptide and its neighboring peptides in the affected protein may give rise to better binding in positions which do not expose the mutated amino acid. In some cases, so-called neoepitope peptides have been selected which do not, in fact, differentiate tumor and normal T cell exposed motifs [42, 57]. In the present invention we seek to maximize use of the T cell exposed motifs containing mutant amino acids, and hence focus the T cell response on these differentiating epitopes, and likewise subsequent expansion of this response as the result of administration of checkpoint inhibitors.
Many investigators have considered how to identify peptides in mutated tumor proteins which bind to a patient's MHC alleles. Some have employed mass spectrometry to identify the “presentome” of peptides bound and presented to T cells [48]. However, this has the bias of identifying very high affinity peptides. In some cases, the peptides containing mutant amino acids were never detected by mass spectroscopy [44].
It is not clear that the highest binding peptides are those which will actually generate the best cytotoxic T cell response. Indeed, evidence in other settings suggests that this is not the case and that an intermediate binding affinity may be most effective in stimulating a T cell response and good memory T cells [58]. Low affinity peptides may initiate a CD8+ response but this is not sustained [59]. Furthermore, also drawing on experience in an anti-microbial setting, an active interferon gamma response is also needed to trigger the development of T memory cells [60]. Strength of T cell receptor—pMHC binding may be a factor in determining whether the T cell response to a tumor leads to T cell exhaustion and tolerance [2].
Analysis of the predicted MHC binding of peptides comprising mutations among proteins documented in the TCGA shows no statistical difference in overall predicted binding affinity between mutant and wildtype homolog. However, for TCEM I there is a significant impact when the mutant amino acid lies in positions 2 or 9 of a 9 mer. Overall, based on analysis of proteins with mutations recorded in TCGA, the MHC I binding affinity of the peptides containing the T cell exposed motif which become mutated is very low; about 22 μM, which is more than 40× lower than the 500 nM that is the consensus T cell stimulatory level. This indicates that such peptides are overall not highly likely to naturally elicit an effective and sustained cytotoxic T cell response and memory.
In one embodiment, the present invention enables the design of peptides presenting the T cell exposed motif of interest with a range of MHC binding affinities, allowing for selection of very high affinity binders or intermediate binding affinity to the alleles of a particular patient with the goal of stimulating and effective cytotoxic response. This is achieved by modification of amino acids in the groove exposed motifs, without changing the T cell exposed motifs to achieve a desired binding for one or more of a subject's MHC alleles.
Comparison of the frequency distribution of the T cell exposed motifs in peptides comprising mutations (for TCEM I cognate for MHC I molecules), among those documented in the TCGA, reveals that those comprising mutated amino acids are motifs that occur less commonly in the human proteome than their wildtype homologues. Overall the mutant peptides are biased towards those that are rare or even completely absent in the human proteome; the comparator here being all T cell exposed motif in all peptides of all isoforms of human proteins, approximately 88,000 proteins. The mutational event that inserts a new amino acid in the T cell exposed motif consistently produces T cell exposed motif that are much rarer as compared to the wildtype T cell exposed motif. (See, e.g., PCT US2020/037206, incorporated herein by reference in its entirety). It was also noted that when the frequency category of the T cell exposed motif comprising mutated acids in tumors are compared to the frequency of occurrence in the human immunoglobulinome, they correspond on average to the immunoglobulin frequency category FC20; indicating that on average the T cell exposed motif amino acid motifs would be found in 1 in 220 immunoglobulin variable regions (less than 1 in a million B cell clonal lines). This is 1000-fold below the mean frequency in immunoglobulin variable regions; another indicator that tumor T cell exposed motif are uncommon and that there may be a low frequency of cognate T cells.
While the primary focus is on stimulating a cytotoxic T cell response, driven by CD8+ T cells, such a response is enhanced and helped by the simultaneous stimulation of a CD4+ T helper response. This may be particularly important to the development of a population of memory T cells which can ensure ongoing surveillance and elimination of cancer cells. In some instances, a naturally occurring T helper response may be driven from the native mutated protein. In the present invention we also describe how a tumor specific T helper response can be stimulated by peptides designed to have a high binding affinity to the patient's MHC II alleles and to target T cell exposed motifs which comprise the mutated amino acid. Therefore, in one embodiment the invention provides for designing 15mer peptides by maintaining the TCEM II and varying the flanking sequences.
In some embodiments, to optimize the second phase personalized vaccination and similarly to maximize targeting of the selected common mutations of the first phase of vaccination, the present invention provides methods to maximize the utilization of available tumor specific antigens to generate effective cytotoxic T cell response that can bring about elimination of the tumor cells. This is achieved by identifying the T cell exposed motif containing the mutant amino acids and generating an array of peptides which combine these T cell exposed motifs with an array of different flanking amino acids of varying predicted binding affinity to enable selection of appropriate high binding peptides (See. e.g., PCT US2020/037206, which is incorporated by reference herein in its entirety).
In the case of TCEM I located in a 9-mer comprising 5 exposed amino acids flanked by 4 groove exposed amino acids, for each T cell exposed motif there is a maximum of 204 or 160,000 possible variant amino acid combinations in the groove exposed position. In some embodiments, an array of 1000 peptides is created by random amino acid substitution in the groove exposed positions, in other embodiments an array of 10,000 peptides is likewise created, and in further embodiments a 50,000 peptide array is created. In the case of TCEM II to create peptides binding differentially to MHC II, we consider a 15 mer in which exposed positions 2, 3, 5, 7, 8 or −1, 3, 5, 7, 8 are kept constant, as all other amino acids in the peptide that are presumed to be involved in the binding affinity are changed by random substitution to create arrays of 1000, 5,000 or 10,000 peptides. In both cases the array sizes cited here are examples that are considered non limiting.
In each case, both MHC I and MHC II, the TCEM is maintained identical to that in the mutated peptides in the native mutated protein and all TCEM which comprise each mutated amino acid are selected as the basis for generation of binding variants.
In further steps embodied in this invention, the initial array of peptides generated by amino acid substitution is then filtered to remove any duplicate peptides, and in some preferred embodiments peptides predicted to be of low solubility are removed by assigning a score to the polarity of their constituent amino acids. The peptides are then selected to be suitable for the specific patient and his/her combination of MHC I and MHC II alleles. In preferred embodiments all alleles are typed, including MHC I A, MHC I B, MHC I C, and MHC II DRB, DP and DQ loci. In one embodiment, the predicted affinity of the peptides in the native mutant protein is reviewed to determine the probability that a particular peptide would be bound by one or more of the patient's MHC alleles, albeit with a low affinity, and hence presented for T cell recognition. As the goal is to stimulate or “train” T cells to target the specific mutated T cell exposed motifs (TCEM) in the tumor, these must be exposed to T cell recognition to enable targeting of tumor cells. In one embodiment we identify each of the TCEM-allele combinations in each native mutant protein which binds with an affinity greater than the mean for the comprising protein. Such TCEM are targetable by T cells which are also specific to that MHC allele histotope. TCEM-allele combinations which have a predicted binding affinity above the mean are set aside as unlikely to ever be presented. For this subset of “presentable” TCEM-allele combinations, we then assess the array of randomly generated peptides, filtered for binding and solubility, and identify a peptide for each TCEM-allele combination with a desired predicted binding affinity. In some embodiments, the peptide with maximum predicted binding affinity for each allele may be chosen. This may be a peptide that binds at 2.5 or 3 or more standard deviation units below the mean for peptides in the protein (i.e., higher affinity). Such a high binding peptide would be comparable to those detected as part of the presentome by mass spectroscopy and equivalent to approximately <20 nM to 100 nM, depending on the protein context. In preferred embodiments however, peptides are chosen with high, but not excessive predicted binding affinity, keeping in mind the probability that this may be more likely to stimulate an effective cytotoxic response and memory and mitigate against T cell exhaustion. Such a binding affinity may be from 1-2 standard deviation units below the mean for peptides in the protein, typically equivalent to 100-500 nM. Overall, the invention embodies the ability to select for a desired binding affinity and can be considered “tunable” to that selected binding affinity for each patient allele.
In some embodiments the novel peptides are used in vitro to stimulate dendritic cells or T cells. In some embodiments such cells are of autologous source, in yet other embodiments they are obtained from allele-matched donors. Stimulated cells are then administered to the cancer patient to passively provide an active T cell population or to provide dendritic cells presenting the TCEM of interest which can stimulate T cells in the patient. In yet other embodiments the peptides are used as components of a peptide vaccine. In yet other embodiments the peptides are applied as a fusion with antibody sequences. In further embodiments the peptides may be encoded in RNA or DNA for administration.
In some embodiments, the frequency classification of the TCEM in the human proteome is noted. In further embodiments the frequency classification of the TCEM in the human immunoglobulinome is noted. In both cases this is achieved by reference to a precomputed reference database comprising over 88,000 human proteins including multiple isoforms and over 35 million unique human immunoglobulin variable regions. Based on this, in some embodiments peptides comprising rare TCEM are identified for priority use.
In desired embodiments, therefore, the process described above yields a unique array of peptides for a particular patient, enabling stimulation of T cells targeting the maximum possible TCEM specific to that patient's tumor-specific mutations and mutated proteins, by presentation of peptides of selected binding affinity in each of the known alleles the patient carries, and the peptides further selected to be soluble. This is a panel of peptides which can then be deployed to stimulate T cells in vivo and in vitro by application in a number of different formats.
TCEMs comprise 5 amino acids, or 205=3.2 million possible configurations. T cell receptor polyspecificity is well recognized [61]. Any neoantigen carries with it the risk of generating an off-target T cell targeting of a self-protein with potential adverse consequences, which may be magnified by immunomodulatory interventions such as checkpoint inhibitors. Prior developers of neoantigen vaccines have not addressed this aspect. In a further embodiment of the present invention therefore, TCEM are identified which comprise mutated amino acids and which are bound and presented in the patient's alleles, and are therefore identified as candidates for targeting with T cells stimulated by highly bound peptides. The stimulation of T cells targeting these peptides, when enhanced by high binding affinity neoantigens and potentially further boosted by a checkpoint inhibitor blockade could potentially give rise to self-protein targeting. In one embodiment, therefore, a “call list” of such TCEM is cross-correlated with the reference data set of the human proteome to identify all human proteins carrying the TCEM. These proteins are reviewed to determine the predicted binding affinity of the peptide in which the TCEM occurs for each of the patient's known alleles. If the human proteome carries that TCEM and the patient alleles would bind the contextual peptide at a moderate or high affinity (which may be considered to be an affinity at less than 1 standard deviation below the mean for the comprising protein, although this range is not considered limiting) then the protein carrying the TCEM is added to an advisory list. In preferred embodiments the protein is identified by its Uniprot identifier or identifiers linking it to other reference databases. In preferred embodiments the advisory list is reviewed to further identify proteins where deficiencies or blockades are associated with known pathologies, and to identify proteins which are of critical function and non-redundant. Such proteins may not be suitable for inclusion in a neoantigen vaccine and may be added to a caution list. However, the advisory and caution lists only identify potential sources of adverse reactions and must be weighed against the progression and severity of the cancer. Given the degree of inherent polyspecificity, the advisory list is typically quite extensive. Many proteins are shielded by anatomic or cellular location, some may be considered redundant, or may be considered an acceptable tradeoff to overcoming cancer. However, this embodiment allows an informed decision to be made regarding possible adverse effects in neoantigen selection.
In a preferred embodiment the present invention allows the design of a personalized immunotherapeutic intervention designed for each cancer patient based on their HLA alleles and particular set of mutations for application in the second vaccination. In some applications of this embodiment the mutations are unique to one patient. This intervention becomes feasible as soon as sequencing of a tumor biopsy and HLA typing is available and can be applied as a second round of vaccination as described herein. In some embodiments the process of sequencing a biopsy may be repeated several times in the course of treatment and the selection of peptides updated based on detection of new mutations. In some preferred embodiments the invention provides an immunotherapy solution for patients who have few proteins with known mutations, for example, but not limited to, glioblastoma patients, who would otherwise be limited to only one neoantigen per protein and possibly no neoantigens with appropriate HLA binding. The preferred embodiment of the present invention is to provide the maximum number of T cell stimulating peptides which will result in targeting of every possible TCEM in which the mutant amino acid occurs and by utilizing every possible HLA. In a further embodiment of the invention the peptides are down-selected to those which will target TCEM presented in vivo and those which are less likely to cause adverse targeting of other human proteins. In an extension of this preferred embodiment, the selected stimulatory peptides may be grouped to provide a series of vaccinations or treatments which allow the utilization of all available alleles the patient carries, while not causing competition for peptide presentation in any one group of peptides.
In some embodiments the selected peptides are applied to dendritic cells in vitro which are then administered to the patient to stimulate T cells. In yet other embodiments the selected peptides are applied in vitro to stimulate a population of T cells which are administered to the patient. In yet other embodiments the peptides, or nucleic acids encoding them are administered directly to the patient in one or more groups spaced over time.
In some embodiments the present invention will yield an array of many peptides suitable for enhancing the CD8+ response of a particular patient to his/her mutated tumor proteins and a list of many peptides suitable for enhancing a CD4+ helper response to these proteins. In some particular embodiments the number of peptides designed to bind MHC and stimulate T cells in a particular patient may be up to 5, in others it is about 20, in yet others it is over 100 and in yet others over 200 peptides. In some embodiments the peptide array will include those which bind to 1 allele, 2 alleles or up to 6 MHC I alleles and others which bind 1, 2 or up to 6 MHC II alleles. In order to optimize the application of the peptides and maximize the use of binding alleles while minimizing competition for binding at any single administration, a further embodiment of the present invention is to prioritize and group the peptides for sequential administration. In a preferred embodiment the peptides may be grouped into subgroups of about 5, in other embodiments subgroups of about 10 are preferred, and in yet other embodiments subgroups of about 20 are preferred and in further embodiments larger groups are preferred. The subgroups may combine both MHC I and MHC II binding peptides. Some peptides may be repeated in several subgroups. In some embodiments where vaccination regimens comprise sequential administration of a subset of selected peptides, each peptide administration may be followed by check point inhibitor treatment. In some embodiments, consideration is given to whether particular TCEM encompassed in the peptides in each group are rare or common TCEM in the human proteome or immunoglobulinome. In some preferred embodiments priority is given to inclusion of peptides that comprise rare TCEM. In each instance where a peptide is mentioned above, this may also refer to the application of a nucleic acid encoding the peptide. In preferred embodiments peptides that have TCEM matches in certain human proteins are excluded from consideration, where stimulating a T cell response which may target the human proteins may result in an adverse effect. In yet another embodiment, where transcription levels of the mutated proteins in a tumor are known, peptides may be prioritized based on their transcription level to increase the chance of successful targeting of tumor cells.
Many delivery formulations are suitable for administration of both the first and second rounds of vaccinations described herein. In some embodiments the delivery formulation adopted may be the same for both first and second rounds of vaccinations. In other embodiments different delivery formulations may be chosen for tumor associated antigens, for epitopes targeting common mutations of the clinical presentation, and yet different formulations and delivery routes chosen for the personalized neoepitopes of the second round of vaccination
The suitable delivery formulations for neoepitope and tumor associated antigen vaccines, include but are not limited to peptide vaccines, antibody-antigen fusion proteins, DNA or RNA encoding antigens, and particulate vaccines. Neoantigens have been administered directly to subjects or have served to prime dendritic cells or stimulate T cells in vitro for administration of such cells to the subject. The dendritic cells or T cells have included those of autologous or of donor origin. Any of these delivery formulations may be used for delivery of peptides designed by the present invention.
In some embodiments of the present invention the epitope peptides, or their encoding nucleic acids, may be administered parenterally. In yet other embodiments the peptides or their encoding nucleic acids may be delivered intradermally or subcutaneously. In some embodiments intradermal administration may be achieved by needle injection. In preferred embodiments intradermal administration may be provided by micro needle patch or array. In yet further embodiments the microneedle patch or array may deliver multiple different peptides or encoding nucleic acids thereof. In yet other embodiments some doses of each round of vaccination may be delivered orally.
In some embodiments of the present invention the peptides comprising the selected epitopes may be delivered as a nucleic acid, in some embodiments the nucleic acid may be RNA, in others DNA. In further embodiments the peptides may be delivered by a virus vector, which may comprise elements of a retroviral vector, a poxvirus, an adenovirus, or other virus vector delivery system. In further embodiments the delivery of nucleic acid or peptide may be accomplished by a lipid drug delivery system comprising, but not limited to, one of the following: nanoparticles, emulsions, self-emulsifying drug delivery systems, nanocapsules and liposomes, wherein molecules of a drug active product is encased or partially encases in lipid.
In some embodiments the selected peptides may be administered as a fusion to a moiety which favors formation of nanoparticles. Examples of such moieties include but are not limited to leucine multimers (polyleucine), unnatural hydrophobic amino acids, or liposomes. The peptide of interest may be attached to its fusion partner by a linker. In some instances, the linker is cleavable. The cleavable linker may be one or more lysine or arginine residues, or a cathepsin cleavable linker.
In some embodiments the selected peptides or their encoding nucleic acids may be delivered with an adjuvant. Various adjuvants are used to increase the immunological response, depending on the host species, including but not limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, squalene, squalene emulsions, liposomes, imiquimod, keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (Bacille Calmette-Guerin) and Corynebacterium parvum. In other embodiments a cytokine may be co-administered, including but not limited to interferon gamma or stimulators thereof, interleukin 12, or granulocyte stimulating factor. In other embodiments the peptides or their encoding nucleic acids may be co-administered with a local inflammatory agent, either chemical or physical. Examples include, but are not limited to, heat, infrared light, proinflammatory drugs, including but not limited to imiquimod.
In some embodiments epitope peptides stimulating a response to tumor associated antigens have been selected to allow binding to an array of different HLA alleles and such vaccines may be administered without determination of the HLA alleles of the subject. However, in a preferred embodiment the HLAs of the subject are determined prior to administration of the first round of vaccination. For the second round of vaccination, having knowledge of the patient's HLA alleles is a prerequisite to designing a bespoke peptide vaccine. Several approaches to HLA typing may be employed, including PCR, and such testing is widely available. The HLA can be derived from the whole exome sequence at the same time as analysis of the tumor mutations in the biopsy.
The present invention provides an immunotherapeutic strategy to maximize the immune response to a tumor by combining vaccinations comprising epitopes derived from tumor associated antigens, epitopes that are arise from mutations commonly found in cancers of a particular clinical presentation, and personalized combinations of neoepitopes designed to address the particular mutational landscape of one individual subject, based sequencing the exome of a tumor biopsy from that person. In preferred embodiments, this is a staged approach with the first round of vaccination occurring prior to surgical intervention and a second round following surgical intervention and biopsy. However, in situations where the tumor is not respectable due to its location, and in instances where the personal tumor mutational landscape can be determined by sequencing tissues accessible without surgery the two phases may proceed without a surgical intervention.
Vascular endothelial growth factor receptor 1 (VEGFR1), also known as FLT1, is characterized by the sequence P17948 (uniprot.org) and also occurs in at least 8 other isoforms produced by alternative splicing. In selecting an epitope peptide therefore, it is important to identify sequences which are conserved. VEGFR1 is a tyrosine-protein kinase that acts as a cell-surface receptor for VEGFA, VEGFB and PGF, and plays an essential role in the development of embryonic vasculature, the regulation of angiogenesis, cell survival, cell migration, macrophage function, chemotaxis, and cancer cell invasion. Vascular endothelial growth factor receptor 2 (VEGFR2), also known as KDR, is similarly a tyrosine-protein kinase, but which acts as a cell-surface receptor for VEGFA, VEGFC and VEGFD.
Vimentin is found in one dominant isoform represented by the sequence P08670 in Uniprot. Possible additional isoforms have been identified.
Wilms tumor protein WT1 is a transcription factor that plays an important role in cellular development and cell survival and is considered an oncogene. It is exemplified by sequence P19544. At least 8 additional isoforms are recognized and so epitopes are selected from the conserved regions.
Tables 5 and 6 provide epitope peptides of WT1 which are predicted to bind multiple MHC I and MHC II alleles. A combination of 2 MHC I epitopes may be applied to provide broad coverage or a peptide may be selected based on the known HLA alleles of the subject. In preferred embodiments an MHC I epitope peptide is administered contemporaneously with an MHC II peptide that overlaps it and also binds to the alleles of the individual.
EGFR is a transmembrane protein with a transmembrane domain located at positions 646-667 relative to its N terminal and a signal peptide. The EGFRvIII variant found commonly in glioblastoma has a shortened extramembrane domain (
As shown in
Table 7 shows peptides which have T cell exposed motifs spanning the common EGFR mutations and in which natural binding affinity is appropriate; it will be noted that relatively few alleles have high affinity natural binding to the mutant T cell exposed motifs, reflecting one reason tumor motifs may achieve immune evasion.
Table 8 shows bespoke peptides as examples of peptides designed for exemplar alleles to provide binding that will allow them to be presented to these alleles where natural binding is insufficient to competitively stimulate a new T cell clone. As these examples were selected from a longer list for illustrative purposes, and other peptides with differing binding affinity could have been selected from that list, these examples are considered non limiting.
Unique T cell exposed motifs span the deletion junction in EGFRvIII, however relatively few MHC I alleles bind at least one of the five possible unique T cell exposed motifs. Overall 31 of 70 MHC I alleles bound at less than −500 nM (1 SD), comprising 17 binding sites among the 31 B alleles, 9 of 31 A allele and 5 of 8 C alleles evaluated had binding less than 500 nM at any of the possible T cell exposed motifs. In particular, no binding of A0201 was predicted. In addition, A0101, B4001 and B 1542 had predicted binding in excess of 2.75 SD below the mean equivalent of approximately 20 nM which may be an affinity so high it could induce suppression or exhaustion. Therefore, EGFRvIII is a candidate for designing peptides specifically to optimize binding to a patient's alleles. Among the 70 alleles for which predicted binding was evaluated in the natural mutated EGFRvIII, 65 alleles have some probability of presentation of the native epitope based on at least a low level of binding of the natural peptide. These are candidates for using a synthetic bespoke peptide to stimulate T cells which are cognate for and can therefore target these T cell exposed motifs. Following the process laid out in the prior examples we generated a set of 10,000 peptides for each of the possible T cell exposed positions EEKKG (SEQ ID NO: 284), EKKGN (SEQ ID NO: 285), KKGNY (SEQ ID NO: 286), KGNYV (SEQ ID NO: 287), GNYVV (SEQ ID NO: 288).
Examples of peptides with selected predicted binding affinity are shown in Table 9 for a set of example alleles. These are assigned SEQ ID NOs.: 244-283. As many different peptides can be designed with flanking regions that produce the desired binding affinity and solubility, these examples shown are provided as illustrative but non-limiting examples.
Isocitrate dehydrogenase IDH1, encoded by sequence 075874, is commonly mutated in gliomas. The most frequent mutation is R132H is associated with loss of conversion of isocitrate to alpha-ketoglutarate and increased risk of hypermethylation.
For other alleles and to increase the array of alleles the generation of designed flanking regions to the above cited T cell exposed motifs is desirable. Tables 10 and 11 provide examples of such bespoke peptides for an illustrative set of alleles. These examples are considered non limiting as the same approach can be applied for other alleles and multiple peptide options exist for each allele.
Histone H3.3 (also written as H33), is encoded by sequence P84243, and is a variant of histone 3 found in non-dividing cells. Mutations of this variant are associated with hypermethylation, instability and pathogenesis of glioma and glioblastoma. The most common mutation is referred to as K27M, although in fact it is a replacement of a lysine at position 28 in the canonical isoform 1 of the protein. Position 28 is located in a region which peptides very poor or no predicted binding to either MHC I or MHC II alleles (
BRAF (Serine/threonine-protein kinase B-raf) exemplified by sequence P15056 is one of the most commonly mutated proteins in cancer. BRAF is thought to function in regulating the MAP kinase/ERKs signaling pathway, which affects cell division, differentiation, and secretion. BRAF mutations have been associated with various cancers, including non-Hodgkin lymphoma, colorectal cancer, malignant melanoma, thyroid carcinoma, non-small cell lung carcinoma, and adenocarcinoma of the lung. There are several particularly common mutations in BRAF but the most common is V600E. Mutation V600M, G and K are less common.
In order for a T cell to differentially target a tumor cell expressing a mutated protein, the mutated amino acid has to be located in a position “visible” or exposed to the T cell receptor and not hidden in the pocket or groove exposed positions that determine binding. A first step in designing a multi peptide vaccine or stimulant panel is therefore to identify those peptide positions which expose the mutated amino acid. For MHC I this means the mutant amino acid must be at positions 4, 5, 6, 7 or 8 of a 9-mer peptide and for MHC II at positions 2, 3, 5, 7, 8 of the 9-mer core of a 15 mer. This identifies TCEM IIA; TCEM IIB positions are at −1, 3, 5, 7, 8. We first calculated the predicted binding affinity of all sequential peptide positions in the mutant protein and then selected those peptides with relevant TCEM comprising mutated amino acids.
A T cell is only able to target a TCEM if that motif is presented in the host from the naturally occurring mutant peptide. Mutant TCEM that lie in peptides that are extremely unlikely to ever be presented are thus poor targets. The TCEM are therefore filtered to identify those which have some likelihood of exposure in the host, limiting to those whose predicted binding affinity is greater than the mean for the native protein. This is not an absolute requirement but maximizes the potential for a successful targeting.
For each of the selected peptides comprising a mutant TCEM, a bank of peptides is generated by varying the flanking amino acids and recalculating the new binding affinity for each allele of interest. For a 9-mer with a pentamer exposed TCEM, this implies up to 160,000 (204) different peptides could be generated, each with a different binding affinity. For practical purposes a bank of 1000 or up to 15,000 peptides is usually sufficient to provide peptides within the range of binding affinity desired. For MHC II in preferred embodiments only those amino acids outside the core 9 mer peptide comprising the TCEM are varied, as the intercalated amino acids which are in pocket (groove exposed) positions affect binding but may also influence the positioning of the exposed amino acids.
A further practical consideration is solubility of the peptide. A score was generated based on the polarity of the constituent amino acids and only peptides likely to be soluble were put forward as candidates. Sufficient peptides can be generated to prevent this from becoming a limitation. Furthermore, peptides in which cysteines are placed in the groove exposed motifs are avoided wherever possible to minimize cross linking during formulation.
For a group of 5 proteins each with one mutation and a patient with 4 known alleles therefore a maximum number of allele TCEM combinations is 5 TCEM×5 proteins×4 alleles or 100 possible ways to stimulate T cells which will uniquely target those proteins. This is then reduced by the TCEM of low probability of natural presentation.
The development of vaccines and stimulants for dendritic cells and T cells in vitro to comprise multiple peptides with a selected desired affinity for the patient's alleles builds on methods previously described to precisely predict MHC binding, identify and analyze T cell exposed motifs and generate peptides with altered binding affinity (See PCT US2011/029192, PCT US2012/055038, US2014/014523, PCT US2015/039969, PCT US2017/021781, US Publ. No. 20130330335, US Publ. No. 20160132631, US Publ. No. 20170039314, US Publ. No 20170161430, US Publ. No. 20190070255, PCT US2020/037206, U.S. Pat. Nos. 10,706,955 and 10,755,801, all of which are incorporated herein by reference in their entirety).
The process described in Example 9 generates a selection of peptides of different binding affinity for each combination of mutant-containing-TCEM and patient allele. Peptides are then selected which have a desired predicted binding affinity. As peptides of many different binding affinities are provided, the desired affinity may be selected. In the subsequent examples we have elected to focus on peptides with predicted binding affinity at about 2 standard deviations below the mean of the protein, placing them at about the 95th percentile; i.e. the top 5% binders, but not higher, because very high affinity peptide may lead to immunosuppression or exhaustion.
Peptides may be selected to use in groups that target the maximum number of combinations of allele and TCEM in any one application. A desired aspect is to ensure not all peptides administered at any one time as a multi-neoepitope vaccine target the same allele, thus competing with each other for space in MHC and presentation. When dendritic cells and T cells are targeted in vitro it may be desirable to provide as many combinations as possible.
To identify potential off-target effects of the T cells stimulated by the peptides designed to generate targeting of cancer mutations, the TCEM are compared with those in the human proteome to identify relevant matches. The entire human proteome, comprising over 88,000 proteins (including all known isoforms of each protein), was pre-analyzed to determine the binding affinity of each peptide in each protein for all MHC alleles. The TCEM comprised in the peptides selected for each cancer patient, selected as described in Example 10 are assembled into a “call list”. The human proteome reference database is searched for all TCEM on the patient call list; a subset of proteins with matching TCEM is assembled. The peptides in this subset which contain the TCEM on the call list are then examined to determine if the TCEM would be likely to be presented in the MHC corresponding to that subject's alleles. If the proteome peptide comprising the TCEM of interest is predicted to bind to any one of the patient's known alleles with an affinity <1 SD below the mean for the protein, the protein is included in an advisory list. The list is curated to remove duplicates and references to any protein fragments catalogued in UniProt (on the world wide web at uniprot.org). Individual proteins may be reviewed in UniProt and elsewhere to determine if there is evidence of pathologies arising from deficiencies or mutations in the protein. Instances in which a protein of immediate concern is targeted are flagged with a “caution” and excluded from the proposed peptides encoded in a vaccine or in vitro cell stimulation. Examples include, but are not limited to, coagulation factors, neurotransmitters, complement, other proteins with known essential and non-redundant functions. Decision on off-targeting of proteins in the advisory list may be based on a risk-benefit analysis of the subject's condition but access to such a list allows the oncologist to make an informed decision. The most complete typing of a subject's alleles enables a more complete assessment of potential off-targets. Notably, as the relevance of each target will depend on its presentation as a result of the MHC binding of the peptide in which the TCEM occurs, identifying the potential off-target impacts is as personalized as the design of the peptide array for that cancer patient.
Bulk RNA transcript enumeration is carried out using a bioinformatic process that has been designed to tally transcription of different genes. The resulting data is expressed as the FPKM (fragments per kilobase per million total reads) that normalizes the metric for both the length of the transcribed coding region and the number of total reads in the bulk sample detected by the sequencing machine. The bioinformatic software used for transcript enumeration (Magic-BLAST from NCBI) has been designed to assess gene expression and as such is not directly capable of measuring the frequency of potentially mutated codons within the transcripts. In order to compute the mutant frequency in the mRNA transcripts it is necessary to separately enumerate the normal and mutant transcripts. This is achieved by creating a version of the SAM (sequence alignment map) file of the RNA sequences with a bioinformatic software that modifies the cigar (compact idiosyncratic gapped alignment report) strings that map the alignments of the (missing) intronic sequences in the mRNA. Once this modified SAM file is created it can be processed with the standard mutation detection tool, such as mutect 2 [62] that provides the differential mutant and normal read tallies. The ratios of these read tallies are thus the mutant and normal frequency of the allele in the mRNA transcripts. If both parental chromosomes are being expressed equally then the frequency of the mutant and normal allele in the RNA will correlate with the frequency in the DNA. Allele specific differences in expression will give rise to poor correlations. In the extreme, where there is highly differential expression of the parental chromosomes, the mutant may be the only one expressed or may not be expressed at all compared to the normal.
Therefore, in preferred embodiments, the RNA fraction comprising the mutant amino acid is compared to the tumor DNA tumor fraction encoding the gene mutation. In some embodiments tumor specific mutations which can be targeted by T cells are selected from those in which the RNA/DNA ratio exceeds 10%.
All publications and patents mentioned in the above specification are herein incorporated by reference as if expressly set forth herein. Various modifications and variations of the described method and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in relevant fields are intended to be within the scope of the following claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/062147 | 12/7/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63122192 | Dec 2020 | US |