Compositions for optimized RAS peptide vaccines

Information

  • Patent Grant
  • 11058751
  • Patent Number
    11,058,751
  • Date Filed
    Friday, November 20, 2020
    4 years ago
  • Date Issued
    Tuesday, July 13, 2021
    3 years ago
  • Inventors
  • Original Assignees
    • Think Therapeutics, Inc. (Newton, MA, US)
  • Examiners
    • Stoica; Elly-Gerald
    Agents
    • Wilmer Cutler Pickering Hale and Dorr LLP
Abstract
The present disclosure provides for methods, systems, and compositions of nucleic acid and peptide sequences. The present disclosure provides for a nucleic acid sequence encoding two or more amino acid sequences selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41. The present disclosure also provides for an immunogenic peptide composition comprising two or more peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41. The present disclosure further provides for a nucleic acid sequence encoding one or more amino acid sequences selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65. The present disclosure additionally provides for an immunogenic peptide composition comprising one or more peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
Description

This patent disclosure contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction of the patent document or the patent disclosure as it appears in the U.S. Patent and Trademark Office patent file or records, but otherwise reserves any and all copyright rights.


INCORPORATION BY REFERENCE

All documents cited herein are incorporated herein by reference in their entirety.


SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. The ASCII copy, created on Nov. 20, 2020, is named 2215269_00123US1_SL.txt and is 419,529 bytes in size.


TECHNICAL FIELD

The present invention relates generally to compositions, systems, and methods of peptide vaccines. More particularly, the present invention relates to compositions, systems, and methods of designing peptide vaccines to treat or prevent disease optimized based on predicted population immunogenicity.


BACKGROUND

The goal of a peptide vaccine is to train the immune system to recognize and expand its capacity to engage cells that display foreign peptides to improve the immune response to cancerous cells or pathogens. A peptide vaccine can also be administered to someone who is already diseased to increase their immune response to a causal cancer, other diseases, or pathogen. There exists a need for compositions, systems, and methods of peptide vaccines based on prediction of the foreign peptides that will be displayed at a later time to protect a host from cancer, other disease, or pathogen infection.


SUMMARY OF THE INVENTION

In one aspect, the invention provides for a nucleic acid sequence encoding two or more amino acid sequences selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.


In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce one or more peptides that are displayed by an HLA class I molecule. In some embodiments, the one or more peptides is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, and KRAS G12R. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.


In another aspect, the invention provides for an immunogenic peptide composition comprising two or more peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.


In some embodiments, a peptide in the immunogenic peptide composition is displayed by an HLA class I molecule. In some embodiments, a peptide in the immunogenic peptide composition is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, and KRAS G12R. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least three peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.


In another aspect, the invention provides for a nucleic acid sequence encoding one or more amino acid sequences selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.


In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce one or more peptides that are displayed by an HLA class II molecule. In some embodiments, the one or more peptides is a modified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.


In another aspect, the invention provides for an immunogenic peptide composition comprising one or more peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.


In some embodiments, a peptide in the immunogenic peptide composition is displayed by an HLA class II molecule. In some embodiments, a peptide in the immunogenic peptide composition is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.





BRIEF DESCRIPTION OF THE DRAWINGS

The following figures depict illustrative embodiments of the invention.



FIG. 1 is a flow chart of a vaccine optimization method.



FIG. 2 is a flow chart of vaccine optimization method with seed set compression.



FIG. 3 shows predicted population coverage for single target MHC class I vaccines by vaccine size for KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D targets.



FIG. 4 shows predicted population coverage for single target MHC class II vaccines by vaccine size for KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D targets.



FIG. 5 shows probabilities of disease presentations for pancreas, colon/rectum, and bronchus/lung and respective probabilities of target presentations for KRAS G12D, KRAS G12V, and KRAS G12R targets.



FIG. 6 is a flow chart for multiple target (combined) vaccine optimization methods.



FIG. 7 shows predicted population coverage for pancreatic cancer multiple target (combined) MHC class I vaccines by vaccine size for KRAS G12D, KRAS G12V, and KRAS G12R targets.



FIG. 8 shows predicted population coverage for pancreatic cancer multiple target (combined) MHC class II vaccines by vaccine size for KRAS G12D, KRAS G12V, and KRAS G12R targets.



FIG. 9 shows an example Python implementation of the MERGEMULTI function for combined vaccine design procedures.



FIG. 10 shows predicated peptide-HLA hits by vaccine size for a KRAS G12V vaccine for the HLA diplotype HLA-A02:03, HLA-A11:01, HLA-B55:02, HLA-B58:01, HLA C03:02, HLA-C03:03.





DETAILED DESCRIPTION

In some embodiments, the disclosure provides for peptide vaccines that incorporate peptide sequences that will be displayed by Major Histocompatibility Complex (MHC) molecules on cells and train the immune system to recognize cancer or pathogen diseased cells. In some embodiments, a peptide vaccine is a composition that consists of one or more peptides. In some embodiments, a peptide vaccine is an mRNA or DNA construct administered for expression in vivo that encodes for one or more peptides.


Peptide display by an MHC molecule is necessary, but not sufficient, for a peptide to be immunogenic and cause the recognition of the resulting peptide-MHC complex by an individual's T cells to trigger T cell activation, expansion, and immune memory. In some embodiments, experimental data from assays such as the ELISPOT (Slota et al., 2011) or the Multiplex Identification of Antigen-Specific T Cell Receptors Using a Combination of Immune Assays and Immune Receptor Sequencing (MIRA) assay (Klinger et al., 2015) can be combined with machine learning based predictions of peptide display by an MHC molecule. In some embodiments, the MHCflurry or NetMHCpan computational methods (known in the art) are used to predict MHC class I display of a peptide by an HLA allele (see Table 1). In some embodiments, the NetMHCIIpan computational method (known in the art) is used to predict MHC class II display of a peptide by an HLA allele (see Table 2).


A peptide is displayed by an MHC molecule when it binds within the groove of the MHC molecule and is transported to the cell surface where it can be recognized by a T cell receptor. In some embodiments, a peptide that is part of the normal proteome in a healthy individual is a self-peptide, and a peptide that is not part of the normal proteome is a foreign peptide. Foreign peptides can be generated by mutations in normal self-proteins in tumor cells that create epitopes called neoantigens, or by pathogenic infections. In some embodiments, a neoantigen is any subsequence of a human protein, where the subsequence contains one or more altered amino acids or protein modifications that do not appear in a healthy individual.


For example, KRAS gene mutations are the most frequently mutated oncogenes in cancer, but they have been very difficult to treat with small molecule therapeutics. The KRAS protein is part of a signaling pathway that controls cellular growth, and point mutations in the protein can cause constitutive pathway activation and uncontrolled cell growth. Single amino acid KRAS mutations result in minor changes in protein structure, making it difficult to engineer small molecule drugs that recognize a mutant specific binding pocket and inactivate KRAS signaling. KRAS oncogenic mutations include the mutation of position 12 from glycine to aspartic acid (G12D), glycine to valine (G12V), glycine to arginine (G12R), or glycine to cystine (G12C); or the mutation of position 13 from glycine to aspartic acid (G13D). The corresponding foreign peptides contain these mutations.


A challenge for the design of peptide vaccines is the diversity of human MHC alleles that each have specific preferences for the peptide sequences they will display. The Human Leukocyte Antigen (HLA) loci, located within the MHC, encode the HLA class I and class II molecules. There are three classical class I loci (HLA-A, HLA-B, and HLA-C) and three loci that encode class II molecules (HLA-DR, HLA-DQ, and HLA-DP). An individual's HLA type describes the alleles they carry at each of these loci. Peptides of length of between about 8 and about 11 residues can bind to HLA class I (or MHC class I) molecules whereas those of length of between about 13 and about 25 bind to HLA class II (or MHC class II) molecules (Rist et al., 2013; Chicz et al., 1992). Human populations that originate from different geographies have differing frequencies of HLA alleles, and these populations exhibit linkage disequilibrium between HLA loci that result in population specific haplotype frequencies. In some embodiments, methods are disclosed for creating effective vaccines that includes consideration of the HLA allelic frequency in the target population, as well as linkage disequilibrium between HLA genes to achieve a set of peptides that is likely to be robustly displayed.


The present disclosure provides for compositions, systems, and methods of vaccine designs that produce immunity to multiple targets. In some embodiments, a target is a neoantigen protein sequence, a pathogen proteome, or any other undesired protein sequence that is non-self and is expected to be bound and displayed by an MHC molecule. When a target is present in an individual it may result in multiple peptide sequences that are displayed by a variety of HLA alleles. Therefore, in this disclosure, “foreign peptide” refers to an amino acid sequence encoding a fragment of a target protein/peptide (or a full protein/peptide), the target protein/peptide consisting of: a neoantigen protein, a pathogen proteome, or any other undesired protein that is non-self and is expected to be bound and displayed by an MHC molecule.


In some embodiments, peptide-MHC immunogenicity data or computational predictions of peptide-MHC immunogenicity can be included and combined with scores for peptide display in the methods disclosed herein. One way of combining the scores is using immunogenicity data for peptides assayed for immunogenicity in diseased or vaccinated individuals, and assigning peptides to the HLA allele that displayed them in the individual by choosing the HLA allele that computational methods predict has the highest likelihood of display. For peptides that are not experimentally assayed, computational predictions of display can be used.


Since immunogenicity may vary from individual to individual, one method to increase the probability of vaccine efficacy is to use a diverse set of foreign peptides (e.g., at least two peptides) to increase the chances that some subset of them will be immunogenic in a given individual. Prior research using mouse models has shown that most MHC displayed peptides are immunogenic, but immunogenicity varies from individual to individual as described in Croft et al. (2019). In some embodiments, experimental peptide-HLA immunogenicity data are used to determine which foreign peptides and their modifications will be effective immunogens in a vaccine.


Considerations for the design of peptide vaccines are outlined in Liu et al., Cell Systems 11, Issue 2, p. 131-146 (Liu et al., 2020) and (Liu et al., 2020b) which are incorporated by reference herein.


Certain foreign peptides may not bind with high affinity to a wide range of HLA molecules. To increase the binding of foreign peptides to HLA molecules, their amino acid composition can be altered to change one or more anchor residues or other residues. Anchor residues are amino acids that interact with an HLA molecule and have the largest influence on the affinity of a peptide for an HLA molecule. Peptides with altered anchor residues are called heteroclitic peptides. In some embodiments, heteroclitic peptides include foreign peptides with residue modifications at non-anchor positions. In some embodiments, heteroclitic peptides include foreign peptides with residue modifications that include unnatural amino acids and amino acid derivatives. Modifications to create heteroclitic peptides can improve the binding of peptides to both MHC class I and MHC class II molecules, and the modifications required can be both peptide and MHC class specific. Since peptide anchor residues face the MHC molecule groove, they are less visible than other peptide residues to T cell receptors. Thus, heteroclitic peptides have been observed to induce a T cell response where the stimulated T cells also respond to unmodified peptides. It has been observed that the use of heteroclitic peptides in a vaccine can improve a vaccine's effectiveness (Zirlik et al., 2006). In some embodiments, the immunogenicity of heteroclitic peptides are experimentally determined and their ability to activate T cells that also recognize the corresponding base (also called seed) peptide of the heteroclitic peptide is performed as is known in the art. In some embodiments, these assays of the immunogenicity and cross-reactivity of heteroclitic peptides are performed when the heteroclitic peptides are displayed by specific HLA alleles.


Peptide Vaccines to Induce Immunity to One or More Targets


In some embodiments, a method is provided for formulating peptide vaccines using a single vaccine design for a one or more targets. In some embodiments, a single target is a foreign protein with a specific mutation (e.g., KRAS G12D). In some embodiments, multiple targets can be used (e.g. both KRAS G12D and KRAS G13D).


In some embodiments, the method includes extracting peptides to construct a candidate set from all target proteome sequences (e.g., entire KRAS G12D protein) as described in Liu et al. (2020). FIGS. 1 and 2 depict flow charts for example vaccine design methods that can be used for MHC class I or MHC class II vaccine design. In some embodiments, extracted foreign peptides are of amino acid length of between about 8 and about 10 (e.g., for MHC class I binding (Rist et al., 2013)). In some embodiments, the extracted foreign peptides presented by MHC class I molecules are longer than 10 amino acid residues, such as 11 residues (Trolle et al., 2016). In some embodiments, extracted foreign peptides are of length between about 13 and about 25 (e.g., for class II binding (Chicz et al., 1992)). In some embodiments, sliding windows of various size ranges described herein are used over the entire proteome. In some embodiments, other foreign peptide lengths for MHC class I and class II sliding windows can be utilized. In some embodiments, computational predictions of proteasomal cleavage are used to filter or select peptides in the candidate set. One computational method for predicting proteasomal cleavage is described by Nielsen et al. (2005). In some embodiments, peptide mutation rates, glycosylation, cleavage sites, or other criteria can be used to filter peptides as described in Liu et al. (2020).


As shown in FIGS. 1-2, in some embodiments, the next step of the method includes scoring the foreign peptides in the candidate set for binding to all considered HLA alleles as described in Liu et al. (2020) and Liu et al. (2020b). Scoring can be accomplished for human HLA molecules, mouse H-2 molecules, swine SLA molecules, or MHC molecules of any species for which prediction algorithms are available or can be developed. Thus, vaccines targeted at non-human species can be designed with the method. Scoring metrics can include the affinity for a foreign peptide to an HLA allele in nanomolar, eluted ligand, presentation, and other scores that can be expressed as percentile rank or any other metric. The candidate set may be further filtered to exclude peptides whose predicted binding cores do not contain a particular pathogenic or neoantigen target residue of interest or whose predicted binding cores contain the target residue in an anchor position. The candidate set may also be filtered for foreign peptides of specific lengths, such as length 9 for MHC class I, for example. In some embodiments, scoring of foreign peptides is accomplished with experimental data or a combination of experimental data and computational prediction methods.


The criteria used for scoring peptide-HLA binding during the scoring procedure can accommodate different goals during the candidate identification and vaccine design phases. For example, a foreign peptide with peptide-HLA binding affinities of 500 nM may be displayed by an individual that is diseased, but at a lower frequency than a foreign peptide with a 50 nM peptide-HLA binding affinity. Thus, during the scoring of a candidate set to qualify potential immune system targets, 500 nM or other less constrained affinity criteria than 50 nM may be utilized. During the combinatorial design phase of a vaccine, a more constrained affinity criteria may be used, such a 50 nM, to increase the probability that a vaccine peptide will be found and displayed by HLA molecules. In some embodiments, peptides are selected that have peptide-HLA binding affinities of between about 50 nM and about 500 nM. Alternatively, combined models that incorporate peptide immunogenicity can be used to qualify foreign peptides for improvement and score their modified versions for vaccine inclusion. In some embodiments, experimental observations of the immunogenicity of peptides in the context of their display by HLA alleles can be used to score peptides for vaccine inclusion. In some embodiments, computational predictions of the immunogenicity of a peptide in the context of display by HLA alleles can used for scoring such as the methods of Ogishi et al. (2019).


In some embodiments, the method further includes running the OptiVax-Robust algorithm as described in Liu et al. (2020) using the HLA haplotype frequencies of a population on the scored candidate set to construct a seed set (also referred to as base set herein) of foreign peptides (FIG. 2). In some embodiments, HLA diplotype frequencies can be provided to OptiVax. OptiVax-Robust includes algorithms to eliminate peptide redundancy that arises from the sliding window approach with varying window sizes, but other redundancy elimination measures can be used to enforce minimum edit distance constraints between foreign peptides in the candidate set. The size of the seed set is determined by a point of diminishing returns of population coverage as a function of the number of foreign peptides in the seed set. Other criteria can also be used, including a minimum number of vaccine foreign peptides, maximum number of vaccine foreign peptides, and desired predicted population coverage. One alternative criterion is a minimum number of expected peptide-HLA hits in each individual, where a peptide-HLA hit is the potential immunogenic display of a peptide by a single HLA allele as described as in Liu et al. (2020b). In alternate embodiments, the method further includes running the OptiVax-Unlinked algorithm as described in Liu et al. (2020) instead of OptiVax-Robust.


The OptiVax-Robust method uses binary predictions of foreign peptide binding to HLA alleles, and these binary predictions can be generated as described in Liu et al. (2020). The OptiVax-Unlinked method uses the probability of foreign peptide binding to HLA alleles and can be generated as described in Liu et al. (2020). Either method can be used for the purposes described herein, and thus we will the term “OptiVax” refers to either the Robust or Unlinked method. In some embodiments, the observed probability of peptide-HLA immunogenicity in experimental assays can be used as the probability of peptide-HLA binding in EvalVax-Unlinked and OptiVax-Unlinked. In some embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design describe the world's population. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to a geographic region. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to an ancestry. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to a race. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to individuals with risk factors such as genetic indicators of risk, age, exposure to chemicals, alcohol use, chronic inflammation, diet, hormones, immunosuppression, infectious agents, obesity, radiation, sunlight, or tobacco use. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to individuals that carry certain HLA alleles. In alternative embodiments, the HLA diplotypes provided to OptiVax for vaccine design describe a single individual, and are used to design an individualized vaccine.


In some embodiments, the seed set of foreign peptides that results from OptiVax application to the candidate set of target peptides describes a set of unmodified foreign peptides that represent a possible compact vaccine design (Seed Set in FIG. 2). In some embodiments, the seed set is based upon filtering candidate peptides by predicted or observed affinity or immunogenicity with respect to HLA molecules (Seed Set in FIG. 1). However, to improve the display of the foreign peptides in a wide range of HLA haplotypes as possible, some embodiments include modifications of the seed (or base) set. In some embodiments, experimental assays can be used to ensure that a modified seed (or base) peptide activates T cells that also recognize the seed peptide.


For a given foreign peptide, the optimal anchor residue selection may depend upon the HLA allele that is binding to and displaying the foreign peptide and the class of the HLA allele (MHC class I or class II). A seed peptide set can become an expanded set by including anchor residue modified peptides of either MHC class I or II peptides (FIGS. 1-2). Thus, one aspect of vaccine design is considering how to select a limited set of heteroclitic peptides that derive from the same foreign peptide for vaccine inclusion given that different heteroclitic peptides will have different and potentially overlapping population coverages.


In some embodiments, all possible anchor modifications for each base set foreign peptide are considered. There are two anchor residues in peptides bound by MHC class I molecules, typically at positions 2 and 9 for 9-mer peptides. At each anchor position, 20 possible amino acids are attempted in order to select the best heteroclitic peptides. Thus, for MHC class I binding, 400 (i.e., 20 amino acids by 2 positions=202) minus 1 heteroclitic peptides are generated for each base foreign peptide. There are four anchor residues in peptides bound by MHC class II molecules, typically at positions 1, 4, 6, and 9 of the 9-mer binding core. Thus, for MHC class II binding there are 160,000 (i.e., 20 amino acids by 4 positions=204) minus 1 heteroclitic peptides generated for each base foreign peptide. Other methods, including Bayesian optimization, can be used to select optimal anchor residues to create heteroclitic peptides from each seed (or base) set peptide. Other methods are presented in “Machine learning optimization of peptides for presentation by class II MHCs” by Dai et al. (2020), incorporated in its entirety herein. In some embodiments, the anchor positions are determined by the HLA allele that presents a peptide, and thus the set of heteroclitic peptides includes for each set of HLA specific anchor positions, all possible anchor modifications.


In some embodiments, for all of the foreign peptides in the seed set, new peptide sequences with all possible anchor residue modifications (e.g., MHC class I or class II) are created resulting in a new heteroclitic base set (Expanded set in FIGS. 1-2) that includes all of the modifications. In some embodiments, the heteroclitic base set (Expanded set in FIGS. 1-2) also includes the original seed (or base) set (Seed Peptide Set in FIGS. 1-2). In some embodiments, the heteroclitic base set includes amino acid substitutions or non-natural amino acid analogs at non-anchor residues. The heteroclitic base set is scored for HLA affinity or other metrics as described herein (another round of Peptide Filtering and Scoring as shown in FIGS. 1-2). The scoring predictions may be further updated for pairs of heteroclitic peptide and HLA allele, eliminating pairs where a heteroclitic peptide is predicted to be displayed by an allele but the seed (or base) peptide from which it was derived is not predicted to be displayed by the allele. The scoring predictions may also be filtered to ensure that predicted binding cores of the heteroclitic peptide displayed by a particular HLA allele align exactly in position with the binding cores of the respective seed (or base) set foreign peptide for that HLA allele. In some embodiments the scoring predictions are filtered for an HLA allele to ensure that the heteroclitic peptides considered for that HLA allele are only modified at anchor positions determined by that HLA allele. In some embodiments, heteroclitic peptides are included in experimental assays such as MIRA (Klinger et al., 2015) to determine their immunogenicity with respect to specific HLA alleles. In some embodiments, the methods of Liu et al. (2020b), can be used to incorporate MIRA data for heteroclitic peptides into a model of peptide-HLA immunogenicity. In some embodiments, the immunogenicity of heteroclitic peptides are experimentally determined and their ability to activate T cells that also recognize the corresponding seed (or base) peptide of the heteroclitic peptide is performed as is known in the art. In some embodiments, these assays of the immunogenicity and cross-reactivity of heteroclitic peptides are performed when the heteroclitic peptides are displayed by specific HLA alleles. In some embodiments, computational predictions of the immunogenicity of a heteroclitic peptide in the context of display by HLA alleles can used for scoring such as the methods of Ogishi et al. (2019).


In some embodiments, the next step involves inputting heteroclitic base set (also referred to as Expanded set as shown in FIGS. 1-2) to OptiVax to select a compact set of vaccine peptides that maximizes vaccine performance (Vaccine Performance Optimization; FIGS. 1-2). Vaccine performance is the population coverage of a vaccine, or the expected number peptide-HLA hits produced by a vaccine, or a function of population coverage and expected number of peptide-HLA hits desired. In some embodiments, the vaccine immunogenicity metric is a metric that describes the overall immunogenic properties of a vaccine with two or more peptides In some embodiments, the methods described herein are included for running OptiVax. In some embodiments, population coverage means the proportion of a subject population that presents one or more immunogenic peptides that activate T cells responsive to a seed foreign peptide. The metric of population coverage is computed using the HLA haplotype frequency in a given population such as a representative human population. In some embodiments, the metric of population coverage is computed using marginal HLA frequencies in a population. Maximizing population coverage means selecting a foreign peptide set that collectively results in the greatest fraction of the population that has at least a minimum number (e.g., 1, 2, 3, 4, 5, 6, 7, or 8) of immunogenic peptide-HLA bindings based on proportions of HLA haplotypes in a given population (e.g., representative healthy or diseased human population). In some embodiments, this process includes the OptiVax selection of heteroclitic peptides (as described in this disclosure) that activate T cells that respond to their corresponding seed (or base) peptide and the heteroclitic base peptides to improve population coverage. In some embodiments, the seed (or base) foreign peptides are always included in the final vaccine design to guard against the possibility that heteroclitic peptides will not produce immunity that reacts with the native seed (or base) foreign peptides. In some embodiments, peptides are only considered as candidates for a vaccine design if they have been observed to be immunogenic in clinical data or animal models.


In some embodiments, a candidate vaccine peptide is eliminated from vaccine inclusion if it activates T cells that recognize self-peptides (e.g., this can be achieved at the first and/or second round of Peptide Filtering and Sorting as shown in FIGS. 1-2). Testing a vaccine peptide for its ability to activate T cells that recognize self-peptides can be experimentally accomplished by the vaccination of animal models followed by ELISPOT or other immunogenicity assay or with human tissue protocols. In both cases, models with HLA alleles that present the vaccine peptide are used. In some embodiments, human primary blood mononuclear cells (PBMCs) are stimulated with a vaccine peptide, the T cells are allowed to grow, and then T cell activation with a self-peptide is assayed as described in Tapia-Calle et al. (2019) or other methods as known in the art. In some embodiments, computational predictions of the ability of a peptide to activate T cells that also recognize self-peptides can be utilized. These predictions can be based upon the modeling of the outward facing residues from the peptide-HLA complex and their interactions with other peptide residues. In some embodiments, a candidate vaccine peptide is eliminated from vaccine inclusion or experimentally tested for cross-reactivity if it is predicted to activate T cells that also recognize self-peptides based upon the structural similarity of the peptide-MHC complex of the candidate peptide and the peptide-MHC complex of a self-peptide. One method for the prediction of peptide-MHC structure is described by Park et al. (2013).


In some embodiments, a candidate heteroclitic vaccine peptide is eliminated from vaccine inclusion if it does not activate T cells that recognize its corresponding seed foreign peptide (second round of Peptide Filtering and Scoring, FIGS. 1-2). Testing a candidate heteroclitic peptide for its ability to activate T cells that recognize its corresponding seed (or base) foreign peptide can be experimentally accomplished by the vaccination of animal models followed by ELISPOT or other immunogenicity assay or with human tissue protocols. In both cases, models with HLA alleles that present the heteroclitic peptide are used. In some embodiments, human PBMCs are stimulated with the heteroclitic peptide, the T cells are allowed to grow, and then T cell activation with the seed (or base) foreign peptide is assayed as described in Tapia-Calle et al. (2019) or using other methods known in the art. In some embodiments, computational predictions of the ability of a heteroclitic peptide to activate T cells that also recognize the corresponding seed (or base) foreign peptide can be utilized. These predictions can be based upon the modeling of the outward facing residues from the peptide-HLA complex and their interactions with other peptide residues. In some embodiments, the structural similarity of the peptide-MHC complex of a heteroclitic peptide and the peptide-MHC complex of the corresponding seed (or base) foreign is used to qualify heteroclitic peptides for vaccine inclusion or to require experimental immunogenicity testing before vaccine inclusion.



FIG. 3 (MHC class I) and FIG. 4 (MHC class II) show the predicted population coverage of OptiVax-Robust selected single target-specific vaccines with differing number of peptides designed for the KRAS mutations G12D, G12V, G12R, G12C, and G13D. FIGS. 4-5 show that as the number of peptides increases for a vaccine, its predicted population coverage increases. The population coverage shown in FIGS. 4-5 are of those individuals that have the specific mutation that the vaccine is designed to cover. An increase in peptide count will also typically cause the average number of peptide-HLA hits in each individual to increase in the population.


OptiVax can be used to design a vaccine to maximize the fraction/proportion of the population whose HLA molecules are predicted to bind to and display at least p peptides from the vaccine. In some embodiments, this prediction includes experimental immunogenicity data to directly predict at least p peptides will be immunogenic. The number p is input to OptiVax, and OptiVax can be run multiple times with varying values for p to obtain a predicted optimal foreign peptide set for different peptide counts p. Larger values of p will increase the redundancy of a vaccine at the cost of more peptides to achieve a desired population coverage. In some embodiments, it may not be possible to achieve a given population coverage given a specific heteroclitic base set. In some embodiments, the number p is a function of the desired size of a vaccine.


The methods described herein can be used to design separate vaccine formulations for MHC class I and class II based immunity.


In some embodiments, this procedure is used to create a vaccine for an individual. In some embodiments, the foreign peptides present in the individual are determined by sequencing the individual's tumor RNA or DNA, and identifying mutations that produce foreign peptides. One embodiment of this is described in U.S. Ser. No. 10/738,355B2. In some embodiments, peptide sequencing methods are used to identify foreign peptides in the individual. One embodiment of this is described in US20110257890A1. In some embodiments, the foreign peptides used for the individual's vaccine are selected when a foreign peptide or foreign peptide encoding RNA observed in a specimen from the individual is present at a predetermined level. The foreign peptides in the individual are used to construct a vaccine as described in the disclosure herein. For vaccine design OptiVax is provided a diplotype comprising the HLA type of the individual. In an alternative embodiment, the HLA type of an individual is separated into multiple diplotypes with frequencies that sum to one, where each diplotype comprises one or more HLA alleles from the individual and a notation that the other allele positions should not be evaluated. The use of multiple diplotypes will cause OptiVax's objective function to increase the chance that immunogenic peptides will be displayed by all of the constructed diplotypes.



FIG. 10 shows the vaccine performance (predicted number of peptide-HLA hits) of ten example G12V MHC class I vaccines for a single individual with the MHC class I HLA diplotype HLA-A02:03, HLA-A11:01, HLA-B55:02, HLA-B58:01, HLA-C03:02, HLA-C03:03. OptiVax was used to design ten G12V MHC class I vaccines for this HLA diplotype with peptide counts ranging from 1 to 10. For the results in FIG. 10, OptiVax was run with six synthetic diplotypes, each equally weighted, each with one HLA allele from the individual's HLA diplotype, and the other allele positions marked to not be evaluated. The 10 peptide vaccine in FIG. 10 comprises SEQ ID NO: 3 (GAVGVGKSL), SEQ ID NO: 4 (LMVVGAVGV), SEQ ID NO: 7 (VVGAVGVGK), SEQ ID NO: 14 (GPVGVGKSV), SEQ ID NO: 69 (LMVVGAVGI), SEQ ID NO: 72 (LMVVGAVGL), SEQ ID NO: 131 (GAVGVGKSM), SEQ ID NO: 138 (GPVGVGKSA), SEQ ID NO: 142 (VTGAVGVGK), and SEQ ID NO: 198 (VAGAVGVGM). Two peptides, SEQ ID NO: 3 (GAVGVGKSL) and SEQ ID NO: 131 (GAVGVGKSM), are predicted to bind two of the HLA alleles with an affinity of 50 nM or less.


MHC Class I Vaccine Design Procedure


In some embodiments, MHC class I vaccine design procedures consist of the following computations steps.


In some embodiments, the inputs for the computation are:

    • P1 . . . n: Peptide sequence (length n) containing the neoantigen or pathogenic target(s) of interest (e.g., KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, KRAS G13D). Pi denotes the amino acid at position i.
    • t: Position of target mutation in P, t∈[1, . . . , n] (e.g., t=12 for KRAS G12D).
    • τ1: Threshold for potential presentation of peptides by MHC for peptide-MHC scoring (e.g., 500 nM binding affinity)
    • τ2: Threshold for predicted display of peptides by MHC for peptide-MHC scoring (e.g., 50 nM binding affinity)
    • custom character: Set of HLA alleles (for HLA-A, HLA-B, HLA-C loci)
    • F: custom character3custom character: Population haplotype frequencies (for OptiVax optimization and coverage evaluation).
    • N: Parameter for EvalVax and OptiVax objective function. Specifies minimum number of predicted per-individual hits for population coverage objective to consider the individual covered. Default=1 (computes P(n≥1) population coverage).


In some embodiments, Peptide-HLA Scoring Functions used are:

    • ScorePotential: P×custom charactercustom character: Scoring function mapping a (peptide, HLA allele) pair to a prediction of peptide-HLA display. If predicted affinity ≤τ1, then returns 1, else returns 0. Options include MHCflurry, NetMHCpan, PUFFIN, ensembles, or alternative metrics or software may be used, including models calibrated against immunogenicity data.
    • SCOREDISPLAY: P×custom charactercustom character: Scoring function mapping a (peptide, HLA allele) pair to a prediction of peptide-HLA display. If predicted affinity ≤τ2, then returns 1, else returns 0. Options include MHCflurry, NetMHCpan, PUFFIN, ensembles, or alternative metrics or software may be used, including models calibrated against immunogenicity data.


Next, from the seed protein sequence (P), a set custom character of windowed native peptides spanning the protein sequence(s) is constructed. In some embodiments, 9-mers are produced, but this process can be performed with any desired window lengths and the resulting peptide sets combined.

custom character={Pj . . . j+8|j∈[t−8, . . . ,t],j≠{t−7,t}}


The second condition j≠{t−md 7, t} excludes peptides where the mutation at t is in positions P2 or P9 of the windowed 9-mer peptide (i.e., the anchor positions).


Next, each peptide sequence in custom character is scored against all HLA alleles in custom character for potential presentation using SCOREPOTENTIAL (with threshold τ1=500 nM) and store results in a |custom character|×|custom character| matrix S:

S[p,h]=SCOREPOTENTIAL(p,h)∀p∈custom character,h∈custom character

    • Note that S is a binary matrix where 1 indicates the HLA is predicted to potentially present the peptide, and 0 indicates no potential presentation.


      Define base set of peptides B⊆custom character:

      B={p∈custom character|∃h s.t.S[p,h]=1}


Thus, B contains the native peptides that are predicted to be potentially presented by at least 1 HLA.


Create a Set of all Heteroclitic Peptides B′ Stemming from Peptides in B:







B


=





b

B



ANCHOR

-

MODIFIED


(
b
)









    • where ANCHOR-MODIFIED(b) returns a set of all 399 anchor-modified peptides stemming from b (with all possible modifications to the amino acids at P2 and P9).





Next, all heteroclitic candidate peptides in B′ are scored against all HLA alleles in custom character for predicted display using SCOREDISPLAY (with threshold T2=50 nM), and store results in binary |B′|×|custom character| matrix S1′:

S1′[b′,h]=SCOREDISPLAY(b′,h)∀b′∈B′,h∈custom character


Next, an updated scoring matrix S2′ is computed for heteroclitic peptides conditioned on the potential presentation of the corresponding base peptides by each HLA:








S
2




[


b


,
h

]


=

{








S
1




[


b


,
h

]


,





if






S


[

b
,
h

]



=
1






0
,



otherwise








b




B





,

h











    • where each heteroclitic peptide b′∈B′ is a mutation of base peptide b∈B. This condition enforces that if h was not predicted to potentially present b, then all heteroclitic peptides b′ derived from b will not be displayed by h (even if h would otherwise be predicted to display b′).





In some embodiments, OptiVax-Robust is used to design a final peptide set from the union of base peptides and heteroclitic peptides B∪B′ (with corresponding scoring matrices S and S2′ for B and B′, respectively). Let custom characterk denote the compact set of vaccine peptides output by OptiVax containing k peptides. Note that custom characterk+1 is not necessarily a superset of custom characterk. (In alternate embodiments, OptiVax can be used to augment the base set B with peptides from B′ using scoring matrix S2′ to return set custom characterk, and the final vaccine set custom characterk+|B| consists of peptides B∪custom characterk.)


In some embodiments, this procedure is repeated independently for each target of interest, and the resulting independent vaccine sets can be merged into a combined vaccine as described below.


MHC Class II Vaccine Design Procedure


In some embodiments, MHC class II vaccine design procedures consist of the following computations steps.


In some embodiments, the inputs for the computation are:

    • P1 . . . n: Peptide sequence(s) (length n) containing the neoantigen(s) of interest (e.g., KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, KRAS G13D). Pi denotes the amino acid at position i.
    • t: Position of target mutation in P, t∈[1, . . . , n] (e.g., t=12 for KRAS G12D).
    • τ1: Threshold for potential presentation of peptides by MHC for peptide-MHC scoring (e.g., 500 nM binding affinity)
    • τ2: Threshold for predicted display of peptides by MHC for peptide-MHC scoring (e.g., 50 nM binding affinity)
    • custom character: Set of HLA alleles (for HLA-DR, HLA-DQ, HLA-DP loci)
    • F: custom character3custom character: Population haplotype frequencies (for OptiVax optimization and coverage evaluation).
    • N: Parameter for EvalVax and OptiVax objective function. Specifies minimum number of predicted per-individual hits for population coverage objective to consider the individual covered. Default=1 (computes P(n≥1) population coverage).


In some embodiments, Peptide-HLA Scoring Functions used are:

    • SCOREPOTENTIAL: P×custom charactercustom character: Scoring function mapping a (peptide, HLA allele) pair to a prediction of peptide-HLA display. If predicted affinity ≤τ1, then returns 1, else returns 0.
    • Options include NetMHCIIpan, PUFFIN, ensembles, or alternative metrics or software may be used, including models calibrated against immunogenicity data.
    • SCOREDISPLAY: P×custom charactercustom character: Scoring function mapping a (peptide, HLA allele) pair to a prediction of peptide-HLA display. If predicted affinity ≤τ2, then returns 1, else returns 0. Options include NetMHCIIpan, PUFFIN, ensembles, or alternative metrics or software may be used, including models calibrated against immunogenicity data.
    • FindCore: P×custom character→[1, . . . , n]: Function mapping a (peptide, HLA allele) pair to a prediction of the 9-mer binding core. The core may be specified as the offset position (index) into the peptide where the core begins.


Next, from the seed protein sequence (P), a set custom character of peptides spanning the protein sequence are constructed. Here, we extract all windowed peptides of length 13-25 spanning the target mutation, but this process can be performed using any desired window lengths (e.g., only 15-mers).






𝒫
=




k


[

13
,









,
25

]





𝒫
k









𝒫
k

=

{


𝒫


j











j

+

(

k
-
1

)



|

j


[


t
-

(

k
-
1

)






,





,
t

]



}







    • where custom characterk contains all sliding windows of length k, which are combined to form custom character. Note that here (unlike MHC class I), no peptides are excluded based on binding core or anchor residue positions (for MHC class II, filtering is performed in Paragraph 0063).





Next, each peptide sequence in P is scored against all HLA alleles in custom character for potential presentation using SCOREPOTENTIAL (with threshold τ1=500 nM) and store results in a |custom character|×|custom character| matrix S1:

S1[p,h]=SCOREPOTENTIAL(p,h)∀p∈custom character,h∈custom character

    • Note that S1 is a binary matrix where 1 indicates the HLA is predicted to potentially present the peptide, and 0 indicates no potential presentation.


For each (peptide, HLA allele) pair (p, h), identify/predict the 9-mer binding core using FINDCORE. The predicted binding core is recorded in a matrix C:

C[p,h]=FINDCORE(p,h)∀p∈custom character,h∈custom character


Next, an updated scoring matrix S2 is computed for native peptides in custom character:








S
2



[

p
,
h

]


=

{








S
1



[

p
,
h

]






,





if






C


[

p
,
h

]







specifies






P

t







at





a





non

-

anchor





position





inside





core







0
,



otherwise







p

𝒫



,

h











    • where Pt is the target residue of interest (e.g., the mutation site of KRAS G12D). This condition enforces the target residue to fall within the binding core at a non-anchor position for all (peptide, HLA allele) pairs with non-zero scores in S2, and allows the binding core to vary by allele per peptide (as the binding cores of a particular peptide may differ based on the HLA allele presenting the peptide). Thus, for each pair (p, h), if the predicted binding core C[p, h] specifies the target residue Pt at an anchor position (P1, P4, P6, or P9 of the 9-mer core), or if Pt is not contained within the binding core, then S2 [p, h]=0. In an alternate embodiment, Pt can be located outside of the core or inside the core in a non-anchor position.





Next, OptiVax-Robust is run with peptides custom character and scoring matrix S2 to identify a non-redundant base set of peptides B⊆custom character. (In alternate embodiments, B can be chosen as the entire set custom character rather than identifying a non-redundant base set.)


Next, a set of all heteroclitic peptides B′ is created stemming from peptides in B:







B


=




b



B





{



ANCHOR
-


MODIFIED


(

b
,
c

)





c



|



h






s
.
t
.






S
2



[

b
,
h

]






=
1

}








    • where ANCHOR-MODIFIED(b,c) returns a set of all 204−1 anchor-modified peptides stemming from b with all possible modifications to the amino acids at P1, P4, P6, and P9 of the 9-mer binding core c. Thus, for each base peptide b, the heteroclitic set B′ contains all anchor-modified peptides b′ with modifications to all unique cores of b identified for any HLA alleles that potentially present b with a valid core position as indicated by scoring matrix S2.





Next, all heteroclitic candidate peptides in B′ are scored against all HLA alleles in custom character for predicted display using SCOREDISPLAY (with threshold τ2=50 nM), and store results in binary |B′|×|custom character| matrix S1′:

S1′[b′,h]=ScoreDisplay(b′,h)∀b′∈B′,h∈custom character


For each (heteroclitic peptide, HLA allele) pair (b′,h), identify/predict the 9-mer binding core using FINDCORE. The predicted binding core is recorded in a matrix C′:

C′[b′,h]=FINDCORE(b′,h)∀b′∈B′,h∈custom character


An updated scoring matrix S2′ is computed for heteroclitic peptides conditioned on the identified binding cores of a heteroclitic and base peptides occurring at the same offset by a particular HLA:








S
2




[


b


,
h

]


=

{








S
1




[


b


,
h

]


,





if







C




[


b


,
h

]



=

C


[

b
,
h

]








0
,



otherwise








b




B





,

h











    • where each heteroclitic peptide b′∈B′ is a mutation of base peptide b∈B. This condition enforces the binding core of the heteroclitic peptide b′ to be at the same relative position as the base peptide b, and, implicitly, enforces that the target residue Pt still falls in a non-anchor position within the 9-mer binding core (Step 3).





An updated scoring matrix S3′ is computed for heteroclitic peptides conditioned on the potential presentation of the corresponding base peptides by each HLA:








S
3




[


b


,
h

]


=

{








S
2




[


b


,
h

]


,





if






S


[

b
,
h

]



=
1







0
,








otherwise








b




B





,

h











    • where each heteroclitic peptide b′∈B′ is a mutation of base peptide b∈B. This condition enforces that if h was not predicted to display b, then all heteroclitic peptides b′ derived from b will not be displayed by h (even if h would otherwise be predicted to display b′).





OptiVax-Robust is used to design a final peptide set from the union of base peptides and heteroclitic peptides B∪B′ (with corresponding scoring matrices S2 and S3′ for B and B′, respectively). Let custom characterk denote the compact set of vaccine peptides output by OptiVax containing k peptides. Note that custom characterk+1 is not necessarily a superset of custom characterk. (In alternate embodiments, OptiVax can be used to augment the base set B with peptides from B′ using scoring matrix S2′ to return set custom characterk, and the final vaccine set custom characterk+|B| consists of peptides B∪custom characterk.)


In some embodiments, this procedure is repeated independently for each single target of interest, and the resulting independent vaccine sets can be merged into a combined vaccine as described below.


Methods for Combining Multiple Vaccines


The above described methods will produce an optimized foreign peptide set for one or more targets. In some embodiments, a method is provided for designing separate vaccines for MHC class I and class II based immunity for multiple targets (e.g., two or more targets such as KARS G12D and KRAS G12V).


In some embodiments, a method is disclosed for producing a combined peptide vaccine for multiple targets by using a table of presentations for a disease that is based upon empirical data from sources such as the Cancer Genome Atlas (TCGA). FIG. 5 shows one embodiment for factoring disease presentation type probabilities (pancreatic cancer, colon/rectum cancer, and bronchus/lung cancer) by probability, for each disease presentation, of target presented for various KRAS mutation targets (KRAS G12D, KRAS G12V, and KRAS G12R). A presentation is a unique set of targets that are presented by one form of a disease (e.g., distinct type of cancer as shown in FIG. 5). For each presentation, FIG. 5 shows an example of the probability of that presentation, and the probability that a given target is observed. For a given presentation, there can be one or more targets, each having a probability. In some embodiments, the method for multi-target vaccine design will allocate peptide resources for inducing disease immunity based on the presentation and respective target probabilities as shown in FIG. 5, for example. In some embodiments, presentations correspond to the prevalence of targets in different human populations or different risk groups. The probability of a target in a population is computed by summing for each possible presentation the probability of that presentation times the probability of the target in that presentation.


Referring to FIG. 6, in some embodiments, the method first includes designing an individual peptide vaccine for each target to create a combined vaccine design for multiple targets. This initially results in sets of target-specific vaccine designs. In some embodiments, the marginal vaccine performance of each target-specific vaccine at size k is defined by vaccine performance at size k minus the vaccine performance of the vaccine at size k minus one (see FIGS. 3-4). The composition of a vaccine may change as the number of peptides used in the vaccine increases, and thus for computing contributions to a combined vaccine the marginal vaccine performance of each target-specific vaccine is used instead of a specific set of peptides.


In some embodiments, the weighted marginal vaccine performance of a target-specific vaccine design for each target specific vaccine size is computed as shown in FIG. 6. For a given target specific vaccine size, its weighted vaccine performance is computed by multiplying its vaccine performance times the probability of the target in the population (e.g., by using values as shown in FIG. 5). The marginal weighted vaccine performance for a target specific vaccine is its weighted coverage at size k minus its coverage a size k minus one (e.g., see FIGS. 3-4). The marginal weighted vaccine performance of a target specific vaccine of size one is its weighted vaccine performance. The marginal weighted vaccine performances for all vaccines are combined into a single list, and the combined list is sorted from largest to least by the weighted marginal vaccine performances of the target specific vaccines as shown in FIG. 6. The combined vaccine of size n is then determined by the first n elements of this list. The peptides for the combined vaccine are determined by the individual peptide target vaccines whose sizes add to n and whose weighted vaccine performances sums to the same sum as the first n elements of the sorted list. This maximizes the vaccine performance of the combined vaccine of size n.


In some embodiments, the combined multiple target vaccine can be designed on its overall predicted coverage for the disease described depending on the presentation table used (e.g., see FIG. 5), by its predicted coverage for a specific indication, and/or by its predicted coverage for a specific target by adjusting the weighting used for vaccine performance accordingly. Once a desired level of coverage is selected, the peptides of the combined vaccine are determined by the contributions of target-specific designs. For example, if the combined vaccine includes a target-specific vaccine of size k, then the vaccine peptides for this target at size k are used in the combined vaccine.


As an example of one embodiment, FIG. 5 shows three mutations (KRAS G12D, G12V, and G12R) and their respective probabilities of occurring in an individual with pancreatic cancer. FIG. 3 (MHC class I) and FIG. 4 (MHC class II) show the population coverage of target-specific vaccines for the KRAS G12D, G12V, G12R, G12C, and G13D targets using the methods for vaccines described herein. The marginal population coverage of each target-specific vaccine at a given vaccine size is the improvement in coverage at that size and the size less one. The coverage with no peptides is zero. The marginal coverage of each target-specific vaccine is multiplied by the probability of the target in the population as determined by the proportions as shown in FIG. 5 for the pancreas (pancreatic cancer). These weighted marginal coverages of all target-specific vaccines are sorted to determine the best target-specific compositions, and the resulting list describes the composition of a combined vaccine at each size k by taking the first k elements of the list. As an example of one embodiment, FIG. 7 (MHC Class I) and FIG. 8 (MHC Class II) show the target specific contributions at each vaccine size for a combined KRAS vaccine for the three mutations KRAS G12D, G12V, and G12R. The methods for combined vaccine protocol described herein was used to compute the examples in FIGS. 7 and 8. At each combined vaccine size, different components of the target-specific vaccines are utilized. Table 1 (below) contains the peptides present in independent (single target) and combined (multiple target) MHC class I vaccine designs for the KRAS G12D, G12V, G12R, G12C, and G13D targets. Table 2 (below) contains the contains the peptides present in independent (single target) MHC class II vaccine designs for the KRAS G12D, G12V, G12R, G12C, and G13D targets, and any subset of the individual/single target vaccines can be combined to create an MHC class II vaccine for two or more multiple targets. For alternate embodiments, Sequence Listing provides heteroclitic peptides useful in MHC class I vaccines for the KRAS G12D, G12V, G12R, G12C, and G13D targets.


Combined Vaccine Design Procedure


In some embodiments, the procedure described herein is used to combine individual compact vaccines optimized for different targets into a single optimized combined vaccine.


In some embodiments, the computational inputs for the procedure are:

    • custom character: Set of neoantigen or pathogenic targets of interest (e.g., KRAS G12D, KRAS G12V, KRAS G12R)
    • custom character: Vaccine sets optimized individually for each target. Let custom charactert,k denote the optimal vaccine set of exactly k peptides for target t∈custom character (e.g., as computed by the procedures describe above). Note that custom charactert,k+1 may not necessarily be a superset of custom charactert,k.
    • W: custom character→[0,1]: Target weighting function mapping each target t∈custom character to a probability or weight of tin a particular presentation of interest (e.g., pancreatic cancer; see Exhibit A, Table 1 for example).
    • POPULATIONCOVERAGE: custom character→[0,1]: Function mapping a peptide set into population coverage (e.g., EvalVax). This function may also take as input additional parameters, including HLA haplotype frequencies and a minimum per-individual number of peptide-HLA hits N (here, we compute coverage as P(n≥1) using EvalVax-Robust).


For each target t (individually) and vaccine size (peptide count) k, the unweighted population coverage ct,k is computed:

ct,k=PopulationCoverage(custom charactert,k)∀t∈custom characterk

    • Note that for each target t, ct,k is generally monotonically increasing and concave down for increasing values of k (each additional peptide increases coverage but with decreasing returns).


For each target t (individually), the marginal coverage mt,k is computed of the k-th peptide added to the vaccine set:







m

t
,
k


=

{






c

t
,
k






if





k

=
1








c

t
,
k


-

c

t
,

k
-
1




,



otherwise







t

𝒯



,
k








    • Note that for each target t, mt,k should be a monotonically decreasing function in k (by Step 1 above).





The weighted marginal population coverage {tilde over (m)}t,k is computed using weights of each target in W:

{tilde over (m)}t,k=W(tmt,k∀t∈custom character,k

    • The weighted marginal population coverage gives the effective marginal coverage of the k-th peptide in the vaccine weighted by the prevalence of the target in the presentation (by multiplication with the probability/weight of the target in the presentation).


The individual vaccines are combined into a combined vaccine via the MERGEMULTI procedure called on the weighted marginal population coverage lists {tilde over (m)}t=[{tilde over (m)}t,k, k∈1, 2, . . . ]. FIG. 9 shows an example Python implementation of the MERGEMULTI function. This procedure takes as input multiple sorted (descending) lists and merges them into a single sorted (descending) list. Let M indicate the output of MERGEMULTI where each element Mk contains both the marginal weighted coverage and source (target) of the k-th peptide in the combined vaccine. The combined vaccine contains peptides from different targets. In particular, the combined vaccine with k peptides contains Ct,kj≤k]]{Mk from t} peptides from target t. Note that Ct,k∈[0, . . . , k] and Σt Ct,k=k (Ct,k gives the distribution of the k peptides in the combined vaccine across the targets).


The optimal combined vaccine set custom characterk is defined as:








v
^

k

=




t

𝒯




v

t
,

C

t
,
k









Thus, the combined vaccine with k peptides is the combination of the optimal individual (Ct,k)-peptide vaccines. The marginal weighted coverage values of the combine vaccine Mk can be cumulatively summed over k to give the overall effective (target-weighted) population coverage of the combined vaccine containing k peptides as Σj≤KMk (taking into account both the probabilities/weights of the targets in the presentation and the expected population coverage of peptides based on HLA display). The final vaccine size k can vary based upon the specific population coverage goals of the vaccine.


MHC Class I Peptide Sequences


In some embodiments, a peptide vaccine (single target or combined multiple target vaccine) comprises about five, ten, or twenty MHC class I peptides with each peptide consisting of 8 or more amino acids. In some embodiments, an MHC class I peptide vaccine is intended for one or more of the KRAS G12D, G12V, and G12R targets. In some embodiments, the amino acid sequence of a first peptide in a five-peptide combined vaccine comprises SEQ ID NO: 1.











(SEQ ID NO: 1)



GADGVGKSM.







In some embodiments, the amino acid sequence of a second peptide in a five-peptide combined vaccine comprises SEQ ID NO: 2.











(SEQ ID NO: 2)



LMVVGADGV.







In some embodiments, the amino acid sequence of a third peptide in a five-peptide combined vaccine comprises SEQ ID NO: 3.











(SEQ ID NO: 3)



GAVGVGKSL.







In some embodiments, the amino acid sequence of a fourth peptide in a five-peptide combined vaccine comprises SEQ ID NO: 4.











(SEQ ID NO: 4)



LMVVGAVGV.







In some embodiments, the amino acid sequence of a fifth peptide in a five-peptide combined vaccine comprises SEQ ID NO: 5.











(SEQ ID NO: 5)



VTGARGVGK.







An example combined vaccine for the KRAS G12D, G12V, and G12R targets with five peptides (SEQ ID NO: 1 to SEQ ID NO: 5) is predicted to have a weighted population coverage of 0.3620.


In some embodiments, any one of the peptides (peptides 1-5) in the five-peptide vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 5.


In some embodiments, the amino acid sequence of peptides 1 to 5 in a ten-peptide combined vaccine comprise SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NO: 5. In some embodiments, the amino acid sequence of a sixth peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 6.











(SEQ ID NO: 6)



VMGAVGVGK.







In some embodiments, the amino acid sequence of a seventh peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 7.











(SEQ ID NO: 7)



VVGAVGVGK.







In some embodiments, the amino acid sequence of an eight peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 8.











(SEQ ID NO: 8)



GARGVGKSY.







In some embodiments, the amino acid sequence of a ninth peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 9.











(SEQ ID NO: 9)



GPRGVGKSA.







In some embodiments, the amino acid sequence of a tenth peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 10.











(SEQ ID NO: 10)



LMVVGARGV.







An example combined vaccine for the KRAS G12D, G12V, and G12R targets with ten peptides (SEQ ID NO: 1 to SEQ ID NO: 10) is predicted to have a weighted population coverage of 0.4374.


In some embodiments, any one of the peptides (peptides 1-10) in the ten-peptide vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, or SEQ ID NO: 10.


In some embodiments, the amino acid sequence of peptides 1 to 10 in a twenty-peptide combined vaccine comprise SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10. In some embodiments, the amino acid sequence of an 11th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 11.











(SEQ ID NO: 11)



GADGVGKSL.







In some embodiments, the amino acid sequence of a 12th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 12.











(SEQ ID NO: 12)



GADGVGKSY.







In some embodiments, the amino acid sequence of a 13th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 13.











(SEQ ID NO: 13)



GYDGVGKSM.







In some embodiments, the amino acid sequence of a 14th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 14.











(SEQ ID NO: 14)



GPVGVGKSV.







In some embodiments, the amino acid sequence of a 15th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 15.











(SEQ ID NO: 15)



LTVVGAVGV.







In some embodiments, the amino acid sequence of a 16th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 16.











(SEQ ID NO: 16)



VVGAVGVGR.







In some embodiments, the amino acid sequence of a 17th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 17.











(SEQ ID NO: 17)



GARGVGKSM.







In some embodiments, the amino acid sequence of an 18th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 18.











(SEQ ID NO: 18)



GPRGVGKSV.







In some embodiments, the amino acid sequence of a 19th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 19.











(SEQ ID NO: 19)



LLVVGARGV.







In some embodiments, the amino acid sequence of a 20th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 20.











(SEQ ID NO: 20)



VAGARGVGM.







An example combined vaccine for the KRAS G12D, G12V, and G12R targets with twenty peptides (SEQ ID NO: 1 to SEQ ID NO: 20) is predicted to have a weighted population coverage of 0.4604.


In some embodiments, any one of the peptides (peptides 1-20) in the twenty-peptide vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, or SEQ ID NO: 20


Table 1 shows MHC class I peptide sequences described herein including the respective SEQ ID NO, amino acid sequence corresponding to the SEQ ID NO, KRAS protein target (with specific mutation), the seed amino acid sequence (i.e., the amino acid sequence of the wild type KRAS fragment), the amino acid substitution (if any) for heteroclitic peptides at positions 2 and 9, and notes detailing embodiments in which the peptide may be included in a 5, 10, or 20 combined peptide vaccine as described herein. Table 1 also includes additional peptide sequences comprising SEQ ID NOs: 21-41. In some embodiments, any combination of peptides listed in Table 1 (SEQ ID NOs: 1-41) may be used to create a combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (peptides 1-41; SEQ ID NOs: 1-41) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to any of SEQ ID NOs: 1-41.









TABLE 1







Example KRAS Vaccine Peptides (MHC class I)













SEQ ID
Sequence corresponding


Heteroclitic
Heteroclitic



NO
to SEQ ID
Target
Seed
Modification P2
Modification P9
Note





SEQ ID
GADGVGKSM
KRAS
GADGVGKSA

A9M
Individual KRAS


NO: 1

G12D



G12D (MHCfluny);








Combined (5 peptide)








(MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
LMVVGADGV
KRAS
LVVVGADGV
V2M

Individual KRAS


NO: 2

G12D



G12D (MHCfluny);








Individual KRAS








G12D (NetMHCpan);








Combined (5 peptide)








(MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
GAVGVGKSL
KRAS
GAVGVGKSA

A9L
Individual KRAS


NO: 3

G12V



G12V (MHCfluny);








Combined (5 peptide)








(MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
LMVVGAVGV
KRAS
LVVVGAVGV
V2M

Individual KRAS


NO: 4

G12V



G12V (MHCfluny);








Individual KRAS








G12V (NetMHCpan);








Combined (5 peptide)








(MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
VTGARGVGK
KRAS
VVGARGVGK
V2T

Individual KRAS


NO: 5

G12R



G12R (MHCfluny);








Combined (5 peptide)








(MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
VMGAVGVGK
KRAS
VVGAVGVGK
V2M

Individual KRAS


NO: 6

G12V



G12V (MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
VVGAVGVGK
KRAS
VVGAVGVGK


Individual KRAS


NO: 7

G12V



G12V (NetMHCpan);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
GARGVGKSY
KRAS
GARGVGKSA

A9Y
Individual KRAS


NO: 8

G12R



G12R (MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
GPRGVGKSA
KRAS
GARGVGKSA
A2P

Individual KRAS


NO: 9

G12R



G12R (MHCfluny);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
LMVVGARGV
KRAS
LVVVGARGV
V2M

Individual KRAS


NO: 10

G12R



G12R (MHCfluny);








Individual KRAS








G12R (NetMHCpan);








Combined (10








peptide) (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
GADGVGKSL
KRAS
GADGVGKSA

A9L
Individual KRAS


NO: 11

G12D



G12D (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
GADGVGKSY
KRAS
GADGVGKSA

A9Y
Individual KRAS


NO: 12

G12D



G12D (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
GYDGVGKSM
KRAS
GADGVGKSA
A2Y
A9M
Individual KRAS


NO: 13

G12D



G12D (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
GPVGVGKSV
KRAS
GAVGVGKSA
A2P
A9V
Combined (20


NO: 14

G12V



peptide) (MHCfluny)


SEQ ID
LTVVGAVGV
KRAS
LVVVGAVGV
V2T

Individual KRAS


NO: 15

G12V



G12V (NetMHCpan);








Combined (20








peptide) (MHCfluny)


SEQ ID
VVGAVGVGR
KRAS
VVGAVGVGK

K9R
Individual KRAS


NO: 16

G12V



G12V (MHCfluny);








Individual KRAS








G12V (NetMHCpan);








Combined (20








peptide) (MHCfluny)


SEQ ID
GARGVGKSM
KRAS
GARGVGKSA

A9M
Combined (20


NO: 17

G12R



peptide) (MHCfluny)


SEQ ID
GPRGVGKSV
KRAS
GARGVGKSA
A2P
A9V
Combined (20


NO: 18

G12R



peptide) (MHCfluny)


SEQ ID
LLVVGARGV
KRAS
LVVVGARGV
V2L

Individual KRAS


NO: 19

G12R



G12R (NetMHCpan);








Combined (20








peptide) (MHCfluny)


SEQ ID
VAGARGVGM
KRAS
VVGARGVGK
V2A
K9M
Individual KRAS


NO: 20

G12R



G12R (MHCfluny);








Combined (20








peptide) (MHCfluny)


SEQ ID
LTVVGADGV
KRAS
LVVVGADGV
V2T

Individual KRAS


NO: 21

G12D



G12D (NetMHCpan)


SEQ ID
LLVVGADGV
KRAS
LVVVGADGV
V2L

Individual KRAS


NO: 22

G12D



G12D (NetMHCpan)


SEQ ID
LMVVGADGL
KRAS
LVVVGADGV
V2M
V9L
Individual KRAS


NO: 23

G12D



G12D (NetMHCpan)


SEQ ID
VMGAVGVGR
KRAS
VVGAVGVGK
V2M
K9R
Individual KRAS


NO: 24

G12V



G12V (NetMHCpan)


SEQ ID
VMGARGVGK
KRAS
VVGARGVGK
V2M

Individual KRAS


NO: 25

G12R



G12R (NetMHCpan)


SEQ ID
GACGVGKSL
KRAS
GACGVGKSA

A9L
Individual KRAS


NO: 26

G12C



G12C (MHCfluny)


SEQ ID
LMVVGACGV
KRAS
LVVVGACGV
V2M

Individual KRAS


NO: 27

G12C



G12C (MHCfluny);








Individual KRAS








G12C (NetMHCpan)


SEQ ID
LTVVGACGV
KRAS
LVVVGACGV
V2T

Individual KRAS


NO: 28

G12C



G12C (MHCfluny);








Individual KRAS








G12C (NetMHCpan)


SEQ ID
VTGACGVGK
KRAS
VVGACGVGK
V2T

Individual KRAS


NO: 29

G12C



G12C (MHCfluny)


SEQ ID
VVGACGVGR
KRAS
VVGACGVGK

K9R
Individual KRAS


NO: 30

G12C



G12C (MHCfluny)


SEQ ID
AADVGKSAM
KRAS
AGDVGKSAL
G2A
L9M
Individual KRAS


NO: 31

G13D



G13D (MHCfluny);








Individual KRAS








G13D (NetMHCpan)


SEQ ID
AEDVGKSAM
KRAS
AGDVGKSAL
G2E
L9M
Individual KRAS


NO: 32

G13D



G13D (MHCfluriy)


SEQ ID
AYDVGKSAM
KRAS
AGDVGKSAL
G2Y
L9M
Individual KRAS


NO: 33

G13D



G13D (MHCfluriy)


SEQ ID
DAGKSALTV
KRAS
DVGKSALTI
V2A
I9V
Individual KRAS


NO: 34

G13D



G13D (MHCfluny)


SEQ ID
GAGDVGKSM
KRAS
GAGDVGKSA

A9M
Individual KRAS


NO: 35

G13D



G13D (MHCfluriy)


SEQ ID
LQVVGACGV
KRAS
LVVVGACGV
V2Q

Individual KRAS


NO: 36

G12C



G12C (NetMHCpan)


SEQ ID
VMGACGVGK
KRAS
VVGACGVGK
V2M

Individual KRAS


NO: 37

G12C



G12C (NetMHCpan)


SEQ ID
VMGACGVGR
KRAS
VVGACGVGK
V2M
K9R
Individual KRAS


NO: 38

G12C



G12C (NetMHCpan)


SEQ ID
AADVGKSAL
KRAS
AGDVGKSAL
G2A

Individual KRAS


NO: 39

G13D



G13D (NetMHCpan)


SEQ ID
ASDVGKSAL
KRAS
AGDVGKSAL
G25

Individual KRAS


NO: 40

G13D



G13D (NetMHCpan)


SEQ ID
ASDVGKSAM
KRAS
AGDVGKSAL
G25
L9M
Individual KRAS


NO: 41

G13D



G13D (NetMHCpan)









Additional amino acid sequences of MHC class I heteroclitic peptides are provided in Sequence Listings (SEQ ID NOs: 67-1522). In some embodiments, any combination of MHC class I peptides disclosed herein (SEQ ID NOs: 1-41 and 67-1522) may be used to create a combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (SEQ ID NOs: 1-41 and 67-1522) in the combined vaccine comprises or contains an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to any of SEQ ID NOs: 1-41 or 67-1522.


MHC Class II Peptide Sequences


In some embodiments, a peptide vaccine (single target or combined multiple target vaccine) comprises about 2 to 40 MHC class II peptides with each peptide consisting of about 20 amino acids. In some embodiments, an MHC class II peptide vaccine is intended for one or more of the KRAS G12D, G12V, G12R, G12C, and G13D targets.


Table 2 summarizes MHC class II peptide sequences described herein including the respective SEQ ID NO, amino acid sequence corresponding to the SEQ ID NO, the amino acid sequence corresponding to the peptide's binding core, the KRAS protein target (with specific mutation), the seed amino acid sequence (i.e., the amino acid sequence of the wild type KRAS fragment), the seed amino acid sequence of the binding core, and the amino acid substitution (if any) for heteroclitic peptides at positions 1, 4, 6, and 9. Table 2 includes peptide sequences comprising SEQ ID NOs: 42-66. SEQ ID NOs: 42-65 (Table 2) encode for recombinant peptides. In some embodiments, any combination of peptides listed in Table 2 (SEQ ID NOs: 42-66) may be used to create a single target (individual) or combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (peptides 42-66; SEQ ID NOs: 42-66) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to any of SEQ ID NOs: 42-66.









TABLE 2







Example KRAS Vaccine Peptides (MHC class II)























Hetero-
Hetero-
Hetero-
Hetero-









clitic
clitic
clitic
clitic




Sequence




Modifi-
Modifi-
Modifi-
Modifi-



SEQ
corresponding to




cation
cation
cation
cation



ID NO
SEQ ID
Core
Target
Seed
Seed Core
P1
P4
P6
P9
Note





SEQ
EYKFVVFGSDGAG
FVVFG
KRAS
EYKLVVVGADGVG
LVVVGADGV
L1F
V4F
A65
V9A
Individual


ID
KS
SDGA
G12D
KS





KRAS


NO:









G12D


42









(NetMHCIIpan)


SEQ
EYKFVVIGNDGAG
FVVIG
KRAS
EYKLVVVGADGVG
LVVVGADGV
L1F
V4I
A6N
V9A
Individual


ID
KSALTIQLIQN
NDGA
G12D
KSALTIQLIQN





KRAS


NO:









G12D


43









(NetMHCIIpan)


SEQ
EYKFVVLGADGAG
FVVLG
KRAS
EYKLVVVGADGVG
LVVVGADGV
L1F
V4L

V9A
Individual


ID
KS
ADGA
G12D
KS





KRAS


NO:









G12D


44









(NetMHCIIpan)


SEQ
MTEYKFVVSGADG
FVVSG
KRAS
MTEYKLVVVGADG
LVVVGADGV
L1F
V4S

V9I
Individual


ID
IGKSALT
ADGI
G12D
VGKSALT





KRAS


NO:









G12D


45









(NetMHCIIpan)


SEQ
MTEYKFVVYGSDG
FVVYG
KRAS
MTEYKLVVVGADG
LVVVGADGV
L1F
V4Y
A6S
V9I
Individual


ID
IGKSALT
SDGI
G12D
VGKSALT





KRAS


NO:









G12D


46









(NetMHCIIpan)


SEQ
EYKFVVIGRVGHG
FVVIG
KRAS
EYKLVVVGAVGVG
LVVVGAVGV
L1F
V4I
A6R
V9H
Individual


ID
KS
RVGH
G12V
KS





KRAS


NO:









G12V


47









(NetMHCIIpan)


SEQ
EYKFVVLGTVGHG
FVVLG
KRAS
EYKLVVVGAVGVG
LVVVGAVGV
L1F
V4L
A6T
V9H
Individual


ID
KS
TVGH
G12V
KS





KRAS


NO:









G12V


48









(NetMHCIIpan)


SEQ
EYKFVVYGNVGM
FVVYG
KRAS
EYKLVVVGAVGVG
LVVVGAVGV
L1F
V4Y
A6N
V9M
Individual


ID
GKS
NVGM
G12V
KS





KRAS


NO:









G12V


49









(NetMHCIIpan)


SEQ
EYKIVVAGNVGIGK
IVVAG
KRAS
EYKLVVVGAVGVG
LVVVGAVGV
L1I
V4A
A6N
V9I
Individual


ID
S
NVGI
G12V
KS





KRAS


NO:









G12V


50









(NetMHCIIpan)


SEQ
TEYKIVVMGNVGY
IVVMG
KRAS
TEYKLVVVGAVGV
LVVVGAVGV
L1I
V4M
A6N
V9Y
Individual


ID
GK
NVGY
G12V
GK





KRAS


NO:









G12V


51









(NetMHCIIpan)


SEQ
MTEYKFVVFGSRG
FVVFG
KRAS
MTEYKLVVVGARG
LVVVGARGV
L1F
V4F
A65

Individual


ID
VGKSALT
SRGV
G12R
VGKSALT





KRAS


NO:









G12R


52









(NetMHCIIpan)


SEQ
MTEYKFVVIGNRG
FVVIG
KRAS
MTEYKLVVVGARG
LVVVGARGV
L1F
V4I
A6N

Individual


ID
VGKSALT
NRGV
G12R
VGKSALT





KRAS


NO:









G12R


53









(NetMHCIIpan)


SEQ
MTEYKFVVIGVRG
FVVIG
KRAS
MTEYKLVVVGARG
LVVVGARGV
L1F
V4I
A6V
V9D
Individual


ID
DGKSALT
VRGD
G12R
VGKSALT





KRAS


NO:









G12R


54









(NetMHCIIpan)


SEQ
MTEYKFVVMGSRG
FVVM
KRAS
MTEYKLVVVGARG
LVVVGARGV
L1F
V4M
A6S
V9A
Individual


ID
AGKSALT
GSRGA
G12R
VGKSALT





KRAS


NO:









G12R


55









(NetMHCIIpan)


SEQ
VVVIARGVPKSLLT
IARGV
KRAS
VVVGARGVGKSAL
GARGVGKSA
G1I

G6P
A9L
Individual


ID
I
PKSL
G12R
TI





KRAS


NO:









G12R


56









(NetMHCIIpan)


SEQ
EYKFVVFGNCGAG
FVVFG
KRAS
EYKLVVVGACGVG
LVVVGACGV
L1F
V4F
A6N
V9A
Individual


ID
KS
NCGA
G12C
KS





KRAS


NO:









G12C


57









(NetMHCIIpan)


SEQ
EYKFVVSGACGVG
FVVSG
KRAS
EYKLVVVGACGVG
LVVVGACGV
L1F
V4S


Individual


ID
KS
ACGV
G12C
KS





KRAS


NO:









G12C


58









(NetMHCIIpan)


SEQ
EYKFVVSGNCGLG
FVVSG
KRAS
EYKLVVVGACGVG
LVVVGACGV
L1F
V4S
A6N
V9L
Individual


ID
KS
NCGL
G12C
KS





KRAS


NO:









G12C


59









(NetMHCIIpan)


SEQ
EYKLVVMGPCGAG
LVVM
KRAS
EYKLVVVGACGVG
LVVVGACGV

V4M
A6P
V9A
Individual


ID
KS
GPCGA
G12C
KS





KRAS


NO:









G12C


60









(NetMHCIIpan)


SEQ
KLVIVGICKVGHSA
IVGICK
KRAS
KLVVVGACGVGKS
VVGACGVGK
V1I
A4I
G6K
K9H
Individual


ID
L
VGH
G12C
AL





KRAS


NO:









G12C


61









(NetMHCIIpan)


SEQ
EYKFVVFGNGDLG
FVVFG
KRAS
EYKLVVVGAGDVG
LVVVGAGDV
L1F
V4F
A6N
V9L
Individual


ID
KS
NGDL
G13D
KS





KRAS


NO:









G13D


62









(NetMHCIIpan)


SEQ
EYKFVVMGNGDSG
FVVM
KRAS
EYKLVVVGAGDVG
LVVVGAGDV
L1F
V4M
A6N
V9S
Individual


ID
KS
GNGDS
G13D
KS





KRAS


NO:









G13D


63









(NetMHCIIpan)


SEQ
EYKFVVSGSGDVG
FVVSG
KRAS
EYKLVVVGAGDVG
LVVVGAGDV
L1F
V4S
A6S

Individual


ID
KS
SGDV
G13D
KS





KRAS


NO:









G13D


64









(NetMHCIIpan)


SEQ
EYKIVVMGRGDMG
IVVMG
KRAS
EYKLVVVGAGDVG
LVVVGAGDV
L1I
V4M
A6R
V9M
Individual


ID
KS
RGDM
G13D
KS





KRAS


NO:









G13D


65









(NetMHCIIpan)


SEQ
YKLVVVGAGDVG

KRAS






Individual


ID
KSA

G13D






KRAS


NO:
KRAS








G13D


66









(NetMHCIIpan)









In some embodiments, any combination of MHC class I and/or MHC class II peptides disclosed herein (SEQ ID NOs: 1-1522) may be used to create a single target (individual) or combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (peptides 1-1522; SEQ ID NOs: 1-1522) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to any of SEQ ID NOs: 1-1522.


mRNA and DNA Vaccines


In some embodiments, vaccine peptides are encoded as mRNA or DNA molecules and are administered for expression in vivo as is known in the art. One example of the delivery of vaccines by mRNA is found in Kranz et al. (2016), incorporated herein by reference. In one embodiment, a construct comprises 10 peptides, including a five-peptide MHC class I combined pancreatic cancer vaccine (targets: KRAS G12D, G12V, G12R) and a five-peptide MHC class II combined pancreatic cancer vaccine (targets: KRAS G12D, G12V, G12R), as optimized by the procedure described herein. Peptides are prepended with a secretion signal sequence at the N-terminus and followed by an MHC class I trafficking signal (MITD) (Kreiter et al., 2008; Sahin et al., 2017). The MITD has been shown to route antigens to pathways for HLA class I and class II presentation (Kreiter et al., 2008). Here we combine all peptides of each MHC class into a single construct using non-immunogenic glycine/serine linkers from Sahin et al. (2017), though it is also plausible to construct individual constructs containing single peptides with the same secretion and MITD signals as demonstrated by Kreiter et al. (2008).


In some embodiments, the amino acid sequence encoded by the mRNA vaccine comprises SEQ ID NO: 1523. Underlined amino acids correspond to the signal peptide (or leader) sequence. Bolded amino acids correspond to MHC class I (9 amino acids in length; 5 peptides) and MHC class II (13-25 amino acids in length; 5 peptides) peptide sequences. Italicized amino acids correspond to the trafficking signal.









(SEQ ID NO: 1523)



MRVTAPRTLILLLSGALALTETWAGSGGSGGGGSGGGADGVGKSMGGSGGG






GSGGLMVVGADGVGGSGGGGSGGGAVGVGKSLGGSGGGGSGGLMVVGAVGV





GGSGGGGSGGVTGARGVGKGGSGGGGSGGEYKFVVLGTVGHGKSGGSGGGG





SGGEYKIVVAGNVGIGKSGGSGGGGSGGEYKFVVFGSDGAGKSGGSGGGGS





GGMTEYKFVVSGADGIGKSALTGGSGGGGSGGMTEYKFVVIGNRGVGKSAL






TGGSLGGGGSGIVGIVAGLAVLAVVVIGAVVATVMCRRKSSGGKGGSYSQA







ASSDSAQGSDVSLTA.







In some embodiments, the vaccine is an mRNA vaccine comprising a nucleic acids sequence encoding the amino acid sequence consisting of SEQ ID NO: 1523. In some embodiments, the nucleic acid sequence of the mRNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1523.


In some embodiments, the vaccine is a DNA vaccine comprising a nucleic acids sequence encoding the amino acid sequence consisting of SEQ ID NO: 1523. In some embodiments, the nucleic acid sequence of the DNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1523.


In some embodiments, one or more MHC class I and/or MHC class II peptides disclosed herein (SEQ ID NO: 1-1522) can be encoded in one or more mRNA or DNA molecules and administered for expression in vivo. In some embodiments between about 2 and about 40 peptide sequences are encoded in one or more mRNA constructs. In some embodiments, between about 2 and about 40 peptide sequences are encoded in one or more DNA constructs (i.e., nucleic acids encoding the amino acids sequences comprising on or more of SEQ ID NOs: 1-1522). In some embodiments, the amino acid sequence of the mRNA vaccine or the nucleic acid sequence of the DNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to any of SEQ ID NOs: 1-1522.


Non-Limiting Embodiments of the Subject Matter


In one aspect, the invention provides for a nucleic acid sequence encoding two or more amino acid sequences selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.


In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce one or more peptides that are displayed by an HLA class I molecule. In some embodiments, the one or more peptides is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, and KRAS G12R. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.


In another aspect, the invention provides for an immunogenic peptide composition comprising two or more peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.


In some embodiments, a peptide in the immunogenic peptide composition is displayed by an HLA class I molecule. In some embodiments, a peptide in the immunogenic peptide composition is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, and KRAS G12R. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least three peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.


In another aspect, the invention provides for a nucleic acid sequence encoding one or more amino acid sequences selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.


In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce one or more peptides that are displayed by an HLA class II molecule. In some embodiments, the one or more peptides is a modified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.


In another aspect, the invention provides for an immunogenic peptide composition comprising one or more peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.


In some embodiments, a peptide in the immunogenic peptide composition is displayed by an HLA class II molecule. In some embodiments, a peptide in the immunogenic peptide composition is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.


Compositions


In some embodiments, the foreign peptides (e.g., peptide vaccine) are administered in a pharmaceutical composition comprising the peptides and a pharmaceutically acceptable carrier. In some embodiments, the pharmaceutical composition is in the form of a spray, aerosol, gel, solution, emulsion, or suspension.


The composition is preferably administered to a subject with a pharmaceutically acceptable carrier. Typically, in some embodiments, an appropriate amount of a pharmaceutically acceptable salt is used in the formulation, which in some embodiments can render the formulation isotonic.


In certain embodiments, the foreign peptides are provided as an immunogenic composition comprising any one of the foreign peptides described herein and a pharmaceutically acceptable carrier. In certain embodiments, the immunogenic composition further comprises an adjuvant. In certain embodiments, the foreign peptides are conjugated with other molecules to increase their effectiveness as is known by those practiced in the art. For example, peptides can be coupled to antibodies that recognize cell surface proteins on antigen presenting cells to enhance vaccine effectiveness. One such method for increasing the effectiveness of peptide delivery is described in Woodham, et al. (2018).


In some embodiments, the pharmaceutically acceptable carrier is selected from the group consisting of saline, Ringer's solution, dextrose solution, and a combination thereof. Other suitable pharmaceutically acceptable carriers known in the art are contemplated. Suitable carriers and their formulations are described in Remington's Pharmaceutical Sciences, 2005, Mack Publishing Co. The pH of the solution is preferably from about 5 to about 8, and more preferably from about 7 to about 7.5. The formulation may also comprise a lyophilized powder. Further carriers include sustained release preparations such as semipermeable matrices of solid hydrophobic polymers, which matrices are in the form of shaped articles, e.g., films, liposomes or microparticles. It will be apparent to those persons skilled in the art that certain carriers may be more preferable depending upon, for instance, the route of administration and concentration of peptides being administered.


The phrase pharmaceutically acceptable carrier as used herein means a pharmaceutically acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, solvent or encapsulating material, involved in carrying or transporting the subject pharmaceutical agent from one organ, or portion of the body, to another organ, or portion of the body. Each carrier is acceptable in the sense of being compatible with the other ingredients of the formulation and not injurious to the patient. Some examples of materials which can serve as pharmaceutically acceptable carriers include: sugars, such as lactose, glucose and sucrose; starches, such as corn starch and potato starch; cellulose, and its derivatives, such as sodium carboxymethyl cellulose, ethyl cellulose and cellulose acetate; powdered tragacanth; malt; gelatin; talc; excipients, such as cocoa butter and suppository waxes; oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and soybean oil; glycols, such as butylene glycol; polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol; esters, such as ethyl oleate and ethyl laurate; agar; buffering agents, such as magnesium hydroxide and aluminum hydroxide; alginic acid; pyrogen-free water; isotonic saline; Ringer's solution; ethyl alcohol; phosphate buffer solutions; and other non-toxic compatible substances employed in pharmaceutical formulations. The term carrier denotes an organic or inorganic ingredient, natural or synthetic, with which the active ingredient is combined to facilitate the application. The components of the pharmaceutical compositions also are capable of being comingled with the compounds of the present invention, and with each other, in a manner such that there is no interaction which would substantially impair the desired pharmaceutical efficiency. The composition may also include additional agents such as an isotonicity agent, a preservative, a surfactant, and, a divalent cation, preferably, zinc.


The composition can also include an excipient, or an agent for stabilization of a foreign peptide composition, such as a buffer, a reducing agent, a bulk protein, amino acids (such as e.g., glycine or praline) or a carbohydrate. Bulk proteins useful in formulating foreign peptide compositions include albumin. Typical carbohydrates useful in formulating foreign peptides include but are not limited to sucrose, mannitol, lactose, trehalose, or glucose.


Surfactants may also be used to prevent soluble and insoluble aggregation and/or precipitation of foreign peptides or proteins included in the composition. Suitable surfactants include but are not limited to sorbitan trioleate, soya lecithin, and oleic acid. In certain cases, solution aerosols are preferred using solvents such as ethanol. Thus, formulations including foreign peptides can also include a surfactant that can reduce or prevent surface-induced aggregation of foreign peptides by atomization of the solution in forming an aerosol. Various conventional surfactants can be employed, such as polyoxyethylene fatty acid esters and alcohols, and polyoxyethylene sorbitol fatty acid esters. Amounts will generally range between 0.001% and 4% by weight of the formulation. In some embodiments, surfactants used with the present disclosure are polyoxyethylene sorbitan mono-oleate, polysorbate 80, polysorbate 20. Additional agents known in the art can also be included in the composition.


In some embodiments, the pharmaceutical compositions and dosage forms further comprise one or more compounds that reduce the rate by which an active ingredient will decay, or the composition will change in character. So called stabilizers or preservatives may include, but are not limited to, amino acids, antioxidants, pH buffers, or salt buffers. Nonlimiting examples of antioxidants include butylated hydroxy anisole (BHA), ascorbic acid and derivatives thereof, tocopherol and derivatives thereof, butylated hydroxy anisole and cysteine. Nonlimiting examples of preservatives include parabens, such as methyl or propyl p-hydroxybenzoate and benzalkonium chloride. Additional nonlimiting examples of amino acids include glycine or proline.


The present invention also teaches the stabilization (preventing or minimizing thermally or mechanically induced soluble or insoluble aggregation and/or precipitation of an inhibitor protein) of liquid solutions containing foreign peptides at neutral pH or less than neutral pH by the use of amino acids including proline or glycine, with or without divalent cations resulting in clear or nearly clear solutions that are stable at room temperature or preferred for pharmaceutical administration.


In one embodiment, the composition is a pharmaceutical composition of single unit or multiple unit dosage forms. Pharmaceutical compositions of single unit or multiple unit dosage forms of the invention comprise a prophylactically or therapeutically effective amount of one or more compositions (e.g., a compound of the invention, or other prophylactic or therapeutic agent), typically, one or more vehicles, carriers, or excipients, stabilizing agents, and/or preservatives. Preferably, the vehicles, carriers, excipients, stabilizing agents and preservatives are pharmaceutically acceptable.


In some embodiments, the pharmaceutical compositions and dosage forms comprise anhydrous pharmaceutical compositions and dosage forms. Anhydrous pharmaceutical compositions and dosage forms of the invention can be prepared using anhydrous or low moisture containing ingredients and low moisture or low humidity conditions. Pharmaceutical compositions and dosage forms that comprise lactose and at least one active ingredient that comprise a primary or secondary amine are preferably anhydrous if substantial contact with moisture and/or humidity during manufacturing, packaging, and/or storage is expected. An anhydrous pharmaceutical composition should be prepared and stored such that its anhydrous nature is maintained. Accordingly, anhydrous compositions are preferably packaged using materials known to prevent exposure to water such that they can be included in suitable formulary kits. Examples of suitable packaging include, but are not limited to, hermetically sealed foils, plastics, unit dose containers (e.g., vials), blister packs, and strip packs.


Suitable vehicles are well known to those skilled in the art of pharmacy, and non-limiting examples of suitable vehicles include glucose, sucrose, starch, lactose, gelatin, rice, silica gel, glycerol, talc, sodium chloride, dried skim milk, propylene glycol, water, sodium stearate, ethanol, and similar substances well known in the art. Saline solutions and aqueous dextrose and glycerol solutions can also be employed as liquid vehicles. Whether a particular vehicle is suitable for incorporation into a pharmaceutical composition or dosage form depends on a variety of factors well known in the art including, but not limited to, the way in which the dosage form will be administered to a patient and the specific active ingredients in the dosage form. Pharmaceutical vehicles can be sterile liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like.


The invention also provides that a pharmaceutical composition can be packaged in a hermetically sealed container such as an ampoule or sachette indicating the quantity. In one embodiment, the pharmaceutical composition can be supplied as a dry sterilized lyophilized powder in a delivery device suitable for administration to the lower airways of a patient. The pharmaceutical compositions can, if desired, be presented in a pack or dispenser device that can contain one or more unit dosage forms containing the active ingredient. The pack can for example comprise metal or plastic foil, such as a blister pack. The pack or dispenser device can be accompanied by instructions for administration.


Methods of preparing these formulations or compositions include the step of bringing into association a compound of the present invention with the carrier and, optionally, one or more accessory ingredients. In general, the formulations are prepared by uniformly and intimately bringing into association a compound of the present invention with liquid carriers, or finely divided solid carriers, or both, and then, if necessary, shaping the product.


Formulations of the invention suitable for administration may be in the form of powders, granules, or as a solution or a suspension in an aqueous or non-aqueous liquid, or as an oil-in-water or water-in-oil liquid emulsion, or as an elixir or syrup, or as pastilles (using an inert base, such as gelatin and glycerin, or sucrose and acacia) and/or as mouthwashes and the like, each containing a predetermined amount of a compound of the present invention (e.g., peptides) as an active ingredient.


A liquid composition herein can be used as such with a delivery device, or they can be used for the preparation of pharmaceutically acceptable formulations comprising foreign peptides that are prepared for example by the method of spray drying. The methods of spray freeze-drying foreign peptides/proteins for pharmaceutical administration disclosed in Maa et al., Curr. Pharm. Biotechnol., 2001, 1, 283-302, are incorporated herein. In another embodiment, the liquid solutions herein are freeze spray dried and the spray-dried product is collected as a dispersible foreign peptide-containing powder that is therapeutically effective when administered to an individual.


The compounds and pharmaceutical compositions of the present invention can be employed in combination therapies, that is, the compounds and pharmaceutical compositions can be administered concurrently with, prior to, or subsequent to, one or more other desired therapeutics or medical procedures (e.g., foreign peptide vaccine can be used in combination therapy with another treatment such as chemotherapy, radiation, pharmaceutical agents, and/or another treatment). The particular combination of therapies (therapeutics or procedures) to employ in a combination regimen will take into account compatibility of the desired therapeutics and/or procedures and the desired therapeutic effect to be achieved. It will also be appreciated that the therapies employed may achieve a desired effect for the same disorder (for example, the compound of the present invention may be administered concurrently with another therapeutic or prophylactic).


The invention also provides a pharmaceutical pack or kit comprising one or more containers filled with one or more of the ingredients of the pharmaceutical compositions of the invention. Optionally associated with such container(s) can be a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products, which notice reflects approval by the agency of manufacture, use or sale for human administration.


The current invention provides for dosage forms comprising foreign peptides suitable for treating cancer or other diseases. The dosage forms can be formulated, e.g., as sprays, aerosols, nanoparticles, liposomes, or other forms known to one of skill in the art. See, e.g., Remington's Pharmaceutical Sciences; Remington: The Science and Practice of Pharmacy supra; Pharmaceutical Dosage Forms and Drug Delivery Systems by Howard C., Ansel et al., Lippincott Williams & Wilkins; 7th edition (Oct. 1, 1999).


Generally, a dosage form used in the acute treatment of a disease may contain larger amounts of one or more of the active ingredients it comprises than a dosage form used in the chronic treatment of the same disease. In addition, the prophylactically and therapeutically effective dosage form may vary among different conditions. For example, a therapeutically effective dosage form may contain peptides that has an appropriate immunogenic action when intending to treat cancer or other disease. On the other hand, a different effective dosage may contain foreign peptides that has an appropriate immunogenic action when intending to use the peptides of the invention as a prophylactic (e.g., vaccine) against cancer or another disease/condition. These and other ways in which specific dosage forms encompassed by this invention will vary from one another and will be readily apparent to those skilled in the art. See, e.g., Remington's Pharmaceutical Sciences, 2005, Mack Publishing Co.; Remington: The Science and Practice of Pharmacy by Gennaro, Lippincott Williams & Wilkins; 20th edition (2003); Pharmaceutical Dosage Forms and Drug Delivery Systems by Howard C. Ansel et al., Lippincott Williams & Wilkins; 7th edition (Oct. 1, 1999); and Encyclopedia of Pharmaceutical Technology, edited by Swarbrick, J. & J. C. Boylan, Marcel Dekker, Inc., New York, 1988, which are incorporated herein by reference in their entirety.


The pH of a pharmaceutical composition or dosage form may also be adjusted to improve delivery and/or stability of one or more active ingredients. Similarly, the polarity of a solvent carrier, its ionic strength, or tonicity can be adjusted to improve delivery. Compounds such as stearates can also be added to pharmaceutical compositions or dosage forms to alter advantageously the hydrophilicity or lipophilicity of one or more active ingredients to improve delivery. In this regard, stearates can also serve as a lipid vehicle for the formulation, as an emulsifying agent or surfactant, and as a delivery enhancing or penetration-enhancing agent. Different salts, hydrates, or solvates of the active ingredients can be used to adjust further the properties of the resulting composition.


Compositions can be formulated with appropriate carriers and adjuvants using techniques to yield compositions suitable for immunization. The compositions can include an adjuvant, such as, for example but not limited to, alum, poly IC, MF-59, squalene-based adjuvants, or liposomal based adjuvants suitable for immunization.


In some embodiments, the compositions and methods comprise any suitable agent or immune modulation which could modulate mechanisms of host immune tolerance and release of the induced antibodies. In certain embodiments, an immunomodulatory agent is administered in at time and in an amount sufficient for transient modulation of the subject's immune response so as to induce an immune response which comprises antibodies against for example tumor neoantigens (i.e., tumor-specific antigens (TSA)).


Expression Systems


In certain aspects, the invention provides culturing a cell line that expresses any one of the foreign peptides of the invention in a culture medium comprising any of the foreign peptides described herein.


Various expression systems for producing recombinant proteins/peptides are known in the art, and include, prokaryotic (e.g., bacteria), plant, insect, yeast, and mammalian expression systems. Suitable cell lines, can be transformed, transduced, or transfected with nucleic acids containing coding sequences for the foreign peptides of the invention in order to produce the molecule of interest. Expression vectors containing such a nucleic acid sequence, which can be linked to at least one regulatory sequence in a manner that allows expression of the nucleotide sequence in a host cell, can be introduced via methods known in the art. Practitioners in the art understand that designing an expression vector can depend on factors, such as the choice of host cell to be transfected and/or the type and/or amount of desired protein to be expressed. Enhancer regions, which are those sequences found upstream or downstream of the promoter region in non-coding DNA regions, are also known in the art to be important in optimizing expression. If needed, origins of replication from viral sources can be employed, such as if a prokaryotic host is utilized for introduction of plasmid DNA. However, in eukaryotic organisms, chromosome integration is a common mechanism for DNA replication. For stable transfection of mammalian cells, a small fraction of cells can integrate introduced DNA into their genomes. The expression vector and transfection method utilized can be factors that contribute to a successful integration event. For stable amplification and expression of a desired protein, a vector containing DNA encoding a protein of interest is stably integrated into the genome of eukaryotic cells (for example mammalian cells), resulting in the stable expression of transfected genes. A gene that encodes a selectable marker (for example, resistance to antibiotics or drugs) can be introduced into host cells along with the gene of interest in order to identify and select clones that stably express a gene encoding a protein of interest. Cells containing the gene of interest can be identified by drug selection wherein cells that have incorporated the selectable marker gene will survive in the presence of the drug. Cells that have not incorporated the gene for the selectable marker die. Surviving cells can then be screened for the production of the desired protein molecule.


A host cell strain, which modulates the expression of the inserted sequences, or modifies and processes the nucleic acid in a specific fashion desired also may be chosen. Such modifications (for example, glycosylation and other post-translational modifications) and processing (for example, cleavage) of peptide/protein products may be important for the function of the peptide/protein. Different host cell strains have characteristic and specific mechanisms for the post-translational processing and modification of proteins and gene products. As such, appropriate host systems or cell lines can be chosen to ensure the correct modification and processing of the foreign protein expressed. Thus, eukaryotic host cells possessing the cellular machinery for proper processing of the primary transcript, glycosylation, and phosphorylation of the gene product may be used.


Various culturing parameters can be used with respect to the host cell being cultured. Appropriate culture conditions for mammalian cells are well known in the art (Cleveland W L, et al., J Immunol Methods, 1983, 56(2): 221-234) or can be determined by the skilled artisan (see, for example, Animal Cell Culture: A Practical Approach 2nd Ed., Rickwood, D. and Hames, B. D., eds. (Oxford University Press: New York, 1992)). Cell culturing conditions can vary according to the type of host cell selected. Commercially available medium can be utilized.


Foreign peptides of the invention can be purified from any human or non-human cell which expresses the polypeptide, including those which have been transfected with expression constructs that express foreign peptides of the invention. For protein recovery, isolation and/or purification, the cell culture medium or cell lysate is centrifuged to remove particulate cells and cell debris. The desired polypeptide molecule is isolated or purified away from contaminating soluble proteins and polypeptides by suitable purification techniques. Non-limiting purification methods for proteins include: size exclusion chromatography; affinity chromatography; ion exchange chromatography; ethanol precipitation; reverse phase HPLC; chromatography on a resin, such as silica, or cation exchange resin, e.g., DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, e.g., Sephadex G-75, Sepharose; protein A sepharose chromatography for removal of immunoglobulin contaminants; and the like. Other additives, such as protease inhibitors (e.g., PMSF or proteinase K) can be used to inhibit proteolytic degradation during purification. Purification procedures that can select for carbohydrates can also be used, e.g., ion-exchange soft gel chromatography, or HPLC using cation- or anionexchange resins, in which the more acidic fraction(s) is/are collected.


Methods of Treatment


In one embodiment, the subject matter disclosed herein relates to a preventive medical treatment started after following diagnosis of cancer in order to prevent the disease from worsening or curing the disease. In one embodiment, the subject matter disclosed herein relates to prophylaxis of subjects who are believed to be at risk for cancer or have previously been diagnosed with cancer (or another disease). In one embodiment, said subjects can be administered the peptide vaccine described herein or pharmaceutical compositions thereof. The invention contemplates using any of the foreign peptides produced by the systems and methods described herein. In one embodiment, the foreign peptide vaccines described herein can be administered subcutaneously via syringe or any other suitable method know in the art.


The compound(s) or combination of compounds disclosed herein, or pharmaceutical compositions may be administered to a cell, mammal, or human by any suitable means. Non-limiting examples of methods of administration include, among others, (a) administration though oral pathways, which includes administration in capsule, tablet, granule, spray, syrup, or other such forms; (b) administration through non-oral pathways such as intraocular, intranasal, intraauricular, rectal, vaginal, intraurethral, transmucosal, buccal, or transdermal, which includes administration as an aqueous suspension, an oily preparation or the like or as a drip, spray, suppository, salve, ointment or the like; (c) administration via injection, including subcutaneously, intraperitoneally, intravenously, intramuscularly, intradermally, intraorbitally, intracapsularly, intraspinally, intrasternally, or the like, including infusion pump delivery; (d) administration locally such as by injection directly in the renal or cardiac area, e.g., by depot implantation; (e) administration topically; as deemed appropriate by those of skill in the art for bringing the compound or combination of compounds disclosed herein into contact with living tissue; (f) administration via inhalation, including through aerosolized, nebulized, and powdered formulations; and (g) administration through implantation.


As will be readily apparent to one skilled in the art, the effective in vivo dose to be administered and the particular mode of administration will vary depending upon the age, weight and species treated, and the specific use for which the compound or combination of compounds disclosed herein are employed. The determination of effective dose levels, that is the dose levels necessary to achieve the desired result, can be accomplished by one skilled in the art using routine pharmacological methods. Typically, human clinical applications of products are commenced at lower dose levels, with dose level being increased until the desired effect is achieved. Alternatively, acceptable in vitro studies can be used to establish useful doses and routes of administration of the compositions identified by the present methods using established pharmacological methods. Effective animal doses from in vivo studies can be converted to appropriate human doses using conversion methods known in the art (e.g., see Nair A B, Jacob S. A simple practice guide for dose conversion between animals and human. Journal of basic and clinical pharmacy. 2016 March; 7(2):27.)


Methods of Prevention


In some embodiments, the foreign peptides prepared using methods of the invention can be used as a vaccine to promote an immune response against cancer (e.g., against tumor neoantigens). In some embodiments, the invention provides compositions and methods for induction of immune response, for example induction of antibodies to tumor neoantigens. In some embodiments, the antibodies are broadly neutralizing antibodies. In some embodiments, the foreign peptides prepared using methods of the invention can be used as a vaccine to promote an immune response against a pathogen.


The compositions, systems, and methods disclosed herein are not to be limited in scope to the specific embodiments described herein. Indeed, various modifications of the compositions, systems, and methods in addition to those described will become apparent to those of skill in the art from the foregoing description.

Claims
  • 1. A composition comprising nucleic acid sequences encoding two or more amino acid sequences selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.
  • 2. The composition of claim 1, wherein the composition is immunogenic.
  • 3. The composition of claim 1, wherein the nucleic acid sequence is administered in a construct for expression in vivo.
  • 4. The composition of claim 3, wherein the in vivo administration of the nucleic acid sequence is configured to produce one or more peptides that are displayed by an HLA class I molecule.
  • 5. The composition of claim 4, wherein the one or more peptides is a modified or unmodified fragment of a mutated KRAS protein.
  • 6. The composition of claim 5, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, and KRAS G12R.
  • 7. The composition of claim 3, wherein the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer.
  • 8. The composition of claim 3, wherein the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.
  • 9. An immunogenic peptide composition comprising two or more peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.
  • 10. The immunogenic peptide composition of claim 9, wherein a peptide in the immunogenic peptide composition is displayed by an HLA class I molecule.
  • 11. The immunogenic peptide composition of claim 9, wherein a peptide in the immunogenic peptide composition is a modified or unmodified fragment of a mutated KRAS protein.
  • 12. The immunogenic peptide composition of claim 11, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, and KRAS G12R.
  • 13. The immunogenic peptide composition of claim 9, wherein the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer.
  • 14. The immunogenic peptide composition of claim 9, wherein the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer.
  • 15. The immunogenic peptide composition of claim 9, wherein the immunogenic peptide composition comprises at least three peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.
  • 16. A nucleic acid sequence encoding one or more amino acid sequences selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
  • 17. The nucleic acid sequence of claim 16, wherein the nucleic acid sequence is an immunogenic composition.
  • 18. The nucleic acid sequence of claim 16, wherein the nucleic acid sequence is administered in a construct for expression in vivo.
  • 19. The nucleic acid sequence of claim 18, wherein the in vivo administration of the nucleic acid sequence is configured to produce one or more peptides that are displayed by an HLA class II molecule.
  • 20. The nucleic acid sequence of claim 16, wherein the one or more peptides is a modified fragment of a mutated KRAS protein.
  • 21. The nucleic acid sequence of claim 19, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D.
  • 22. The nucleic acid sequence of claim 16, wherein the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer.
  • 23. The nucleic acid sequence of claim 16, wherein the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.
  • 24. An immunogenic peptide composition comprising one or more peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
  • 25. The immunogenic peptide composition of claim 24, wherein a peptide in the immunogenic peptide composition is displayed by an HLA class II molecule.
  • 26. The immunogenic peptide composition of claim 24, wherein a peptide in the immunogenic peptide composition is a modified or unmodified fragment of a mutated KRAS protein.
  • 27. The immunogenic peptide composition of claim 26, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D.
  • 28. The immunogenic peptide composition of claim 24, wherein the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer.
  • 29. The immunogenic peptide composition of claim 24, wherein the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer.
  • 30. The immunogenic peptide composition of claim 24, wherein the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
US Referenced Citations (9)
Number Name Date Kind
8741576 Tangri et al. Jun 2014 B2
9340577 Grey et al. May 2016 B2
10738355 Sahin et al. Aug 2020 B2
20110257890 Weinschenk et al. Oct 2011 A1
20160125129 Sahin et al. May 2016 A1
20180066017 Hunt et al. Mar 2018 A1
20190322714 Petit et al. Oct 2019 A1
20200061166 Sahin et al. Feb 2020 A1
20200069782 Biskup et al. Mar 2020 A1
Foreign Referenced Citations (3)
Number Date Country
WO-9733602 Sep 1997 WO
WO-2005042698 May 2005 WO
WO-2018102585 Jun 2018 WO
Non-Patent Literature Citations (56)
Entry
Alhadj Ali et al., “Metabolic and immune effects of immunotherapy with proinsulin peptide in human new-onset type 1 diabetes,” Science Translation Medicine, Aug. 9, 2017, vol. 9;9(402):eaaf7779. 9 pages.
Candia et al., “On Peptides and Altered Peptide Ligands: From Origin, Mode of Action and Design to Clinical Application (Immunotherapy),” International Archives of Allergy and Immunology, published online Sep. 20, 2016; vol. 170(4), pp. 211-233.
Gibson et al., “Proinsulin multi-peptide immunotherapy induces antigen-specific regulatory T cells and limits autoimmunity in a humanized model,” Clinical Experimental Immunology, Dec. 2015, vol. 182(3), pp. 251-260.
Guevara-Patino, et al., “Optimization of a self antigen for presentation of multiple epitopes in cancer immunity,” The Journal of Clinical Investigation, May 2006, vol. 116, No. 5, pp. 1382-1390.
Hong et al., “Epitope-optimized alpha-fetoprotein genetic vaccines prevent carcinogen-induced murine autochthonous hepatocellular carcinoma,” Hepatology, Apr. 2014, vol. 59(4), pp. 1448-1458.
Houghton et al., “Immunological validation of the EpitOptimizer program for streamlined design of heteroclitic epitopes,” Vaccine, available online Jun. 4, 2007, vol. 25(29), pp. 5330-5342.
Reynisson et al., “NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data,” Nucleic Acids Research, published online May 14, 2020; vol. 48(W1), pp. W449-W454.
Chicz, et al., “Predominant naturally processed peptides bound to HLA-DR1 are derived from MHC-related molecules and are heterogeneous in size,” Nature, Aug. 27, 1992, vol. 358, pp. 764-768.
Cleveland et al., “Routine large-scale production of monoclonal antibodies in a protein-free culture medium,” Journal of Immunological Methods, Jan. 28, 1983, vol. 56, Issue 2, pp. 221-234.
Croft et al., “Most viral peptides displayed by class I MHC on infected cells are immunogenic,” Proceedings of the National Academy of Sciences, Feb. 19, 2019, vol. 116, No. 8, pp. 3112-3117.
Dai et al., “Machine learning optimization of peptides for presentation by class II MHCs,” bioRxiv, posted Aug. 18, 2020. 35 pages. (https://doi.org/10.1101/2020.08.18.256081).
Jaravine et al., “Assessment of cancer and virus antigens for cross-reactivity in human tissues,” Bioinformatics, Jan. 1, 2017, vol. 33, No. 1, pp. 104-111.
Jaravine et al., “Expitope 2.0: a tool to assess immunotherapeutic antigens for their potential cross-reactivity against naturally expressed proteins in human tissues,” BMC Cancer, Dec. 28, 2017, vol. 17:892. 9 pages.
Klinger et al., “Multiplex identification of antigen-specific T cell receptors using a combination of immune assays and immune receptor sequencing,” PLOS One, Oct. 28, 2015, vol. 10(10), e0141561. 21 pages.
Kranz et al., “Systemic RNA delivery to dendritic cells exploits antiviral defence for cancer immunotherapy,” Nature, Jun. 16, 2016, vol. 534(7607), pp. 396-401, and Methods. 16 pages.
Kreiter, et al., “Increased antigen presentation efficiency by coupling antigens to MHC class I trafficking signals,” The Journal of Immunology, Jan. 2008, vol. 180(1), pp. 309-318, and Corrections. 12 pages.
Liu et al,. “Predicted Cellular Immunity Population Coverage Gaps for SARS-CoV-2 Subunit Vaccines and their Augmentation by Compact Peptide Sets,”bioRxiv, posted Oct. 21, 2020, 29 pages. (https://www.biorxiv.org/content/10.1101/2020.08.04.200691v2).
Liu et al., “Computationally Optimized SARS-CoV-2 MHC Class I and II Vaccine Formulations Predicted to Target Human Haplotype Distributions,” Cell Systems, Aug. 26, 2020, vol. 11(2), pp. 131-144, and Methods. 23 pages.
Maa et al., “Biopharmaceutical Powders: Particle Formation and Formulation Considerations,” Current Pharmaceutical Biotechnology, Nov. 2000, vol. 1, No. 3, pp. 283-302.
Mösch et al., “Machine Learning for Cancer Immunotherapies Based on Epitope Recognition by T Cell Receptors,” Frontiers in Genetics, Nov. 19, 2019, vol. 10, Article 1141. 17 pages.
Nair et al., “A simple practice guide for dose conversion between animals and human,” Journal of Basic and Clinical Pharmacy, Article Review, Mar.-May 2016, vol. 7, Issue 2, pp. 27-31.
Ng et al., “In silico-guided sequence modifications of K-ras epitopes improve immunological outcome against G12V and G13D mutant KRAS antigens,” PeerJ., published Jul. 20, 2018, 6:e5056. doi: 10.7717/peerj.5056. 21 pages.
Nielsen et al., “The role of the proteasome in generating cytotoxic T-cell epitopes: insights obtained from improved predictions of proteasomal cleavage,” Immunogenetics, published online Mar. 3, 2005, vol. 57, pp. 33-41.
Ogishi et al., “Quantitative Prediction of the Landscape of T Cell Epitope Immunogenicity in Sequence Space,” Frontiers in Immunology, Apr. 16, 2019, vol. 10, Article 827. 20 pages.
Park et al., “Accurate structure prediction of peptide-MHC complexes for identifying highly immunogenic antigens,” Mol. Immunol., Nov. 2013, vol. 56(0):81-90. NIH Author Manuscript. 25 pages.
Rist et al., “HLA peptide length preferences control CD8+ T cell responses,” The Journal of Immunology, published online Jun. 7, 2013, vol. 191(2), pp. 561-571. 12 pages.
Sahin et al., “Personalized RNA mutanome vaccines mobilize poly-specific therapeutic immunity against cancer,” Nature, Jul. 13, 2017, vol. 547(7662), pp. 222-226, and Methods. 19 pages.
Slota et al., “ELISpot for measuring human immune responses to vaccines,” Expert Review of Vaccines, Mar. 2011, vol. 10(3), pp. 299-306. NIH Author Manuscript. 14 pages.
Soria-Guerra et al,. “An overview of bioinformatics tools for epitope prediction: Implications on vaccine development,” Journal of Biomedical Informatics, Feb. 2015, vol. 53, pp. 405-414.
Tapia-Calle et al., “A PBMC-Based System to Assess Human T Cell Responses to Influenza Vaccine Candidates In Vitro,” Vaccines, Nov. 13, 2019, vol. 7(4):181. 26 pages.
Trolle et al., “The length distribution of class I—restricted T cell epitopes is determined by both peptide supply and MHC allele—specific binding preference,” The Journal of Immunology, Feb. 15, 2016, vol. 196(4), 1480-1487. HSS Author Manuscript. 21 pages.
Woodham et al., “Nanobody-Antigen Conjugates Elicit HPV-Specific Antitumor Immune Responses,” Cancer Immunology Research, Jul. 2018, vol. 6(7); pp. 870-880.
Zirlik et al., “Cytotoxic T cells generated against heteroclitic peptides kill primary tumor cells independent of the binding affinity of the native tumor antigen peptide,” Blood, Dec. 1, 2006, Vo. 108, No. 12, pp. 3865-3870.
Dastagir et al., “Efficient Presentation of Multiple Endogenous Epitopes to Both CD4+ and CD8+ Diabetogenic T Cells for Tolerance,” Molecular Therapy: Methods & Clinical Development, Mar. 2017, vol. 4, pp. 27-38.
Alvarez, B. et al., “NNAlign_MA; MHC Peptidome Deconvolution for Accurate MHC Binding Motif Characterization and Improved T-cell Epitope Predictions”, Molecular & Cellular Proteomics, Dec. 2019, vol. 18(12), pp. cover, 2459-2477 (20 pages).
Asahara et al., “Phase 1/11 clinical trial using HLA-A24-restricted peptide vaccine derived from KIF20A for patients with advanced pancreatic cancer Journal of Translational Medicine,” vol. 11, article 291, 13 total pages (Year: 2013).
Bae et al., “Myeloma-Specific Multiple Peptides Able to Generate Cytotoxic T Lymphocytes: A Potential Therapeutic Application in Multiple Myeloma and Other Plasma Cell Disorders,” Clinical Cancer Research, vol. 18, pp. 4850-4860 (Year: 2012).
Bhasin, M. and Raghava, G.P.S., “Prediction of Promiscuous and High-Affinity Mutated MHC Binders”, Hybridoma and Hybridomics, Nov. 4, 2003, vol. 22, 229-234, (8 pages).
Dey et al., “A Bioinformatics approach to designing a Zika virus vaccine,” Computational Biology and Chemistry, vol. 68, pp. 143-152 (Year: 2017).
Hollingsworth et al., “Turning the corner on therapeutic cancer vaccines,” npj Vaccines, published online Feb. 8, 2019, vol. 4(7), pp. 1-10.
Hoppes et al., “Altered Peptide Ligands Revisited: Vaccine Design through Chemically Modified HLA-A2-Restricted T Cell Epitopes,” Journal of Immunology, vol. 193, pp. 4803-4813 (Year: 2014).
Jain et al., “Synthetic Tumor-Specific Breakpoint Peptide Vaccine in Patients with Chronic Myeloid Leukemia and Minimal Residual Disease,” Cancer, vol. 115, pp. 3924-3934 (Year: 2009).
Jurtz, V. et al., “NetMHCpan-4.0: Improved Peptide-MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data”, The Journal of Immunology, prepublished online Oct. 4, 2017, vol. 199, pp. 3360-3368 (9 pp.).
Krienke, C. et al., “A noninflammatory mRNA vaccine for treatment of experimental autoimmune encephalomyelitis”, Science, Jan. 8, 2021, vol. 371, pp. 145-153 (10 pages).
Longmate, et al., “Population coverage by HLA class-I restricted cytotoxic T-lymphocyte epitopes,” Immunogenetics vol. 52, pp. 165 -173 (Year: 2001).
Mahanty, et al., “Immunogenicity of infectious pathogens and vaccine antigens,” BMC Immunology, published online May 29, 2015, vol. 16(1), pp. 1-6.
Mashiba, et al., “Identification of CTL epitopes in hepatitis C virus by a genome-wide computational scanning and a rational design of peptide vaccine,” Immunogenetics, vol. 59, pp. 197-209 (Year: 2007).
Nielsen, M. and Andreatta, M., “NNAlign: a platform to construct and evaluate artificial neural network models of receptor-ligand interactions”, Nucleic Acids Research, published online Apr. 12, 2017, vol. 45, pp. W344-W349 (6 pages).
Nielsen, M. and Lund, O., “NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction”, BMC Bioinformatics, Sep. 18, 2009, vol. 10:296, pp. 1-10 (10 pages).
Nielsen, M. et al., “Quantitative Predictions of Peptide Binding to Any HLA-DR Molecule of Known Sequence: NetMHCIIpan”, PLoS Computational Biology, Jul. 4, 2008, vol. 4(7):e1000107, pp. 1-10 (10 pages).
O'Donnell, T.J. et al., “MHCflurry 2.0: Improved Pan-Allele Prediction of MHC Class I-Presented Peptides by Incorporating Antigen Processing”, Cell Systems, Jul. 22, 2020, vol. 11, pp. cover, 42-48 (15 pages).
O'Donnell, T.J. et al., “MHCflurry: Open-Source Class I MHC Binding Affinity Prediction”, Cell Systems, Jul. 25, 2018, vol. 7, pp. cover, 129-132 (9 pages).
Schipper, et al., “Minimal Phenotype Panels a Method for Achieving Maximum Population Coverage with a Minimum of HLA Antigens,” Human Immunology, vol. 51, pp. 95-98 (Year: 1996).
Sette et al., “Peptides and Methods for Creating Synthetic Peptides With Modulated Binding Affinity for HLA Molecules,” Application for utility U.S. Appl. No. 09/226,775, filed Jan. 6, 1999-not published, abandoned (133 total pages).
Shimokawa, C. et al., “CD8+ regulatory T cells are critical in prevention of autoimmune-mediated diabetes”, Nature Communications, Apr. 22, 2020, vol. 11:1922, pp. 1-9 (9 pages).
Toussaint, N. C. et al., “A Mathematical Framework for the Selection of an Optimal Set of Peptides for Epitope-Based Vaccines”, PLoS Computational Biology, Dec. 26, 2008, vol. 4(12):e1000246, pp. 1-10 (10 pages).