T CELL EPITOPES AND RELATED COMPOSITIONS USEFUL IN THE PREVENTION, DIAGNOSIS, AND TREATMENT OF COVID-19

REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICALLY

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created Feb. 10, 2021, is named “EPV0035WO Sequence Listing 1_ST25.txt” and is 1905 KB bytes in size.

FIELD

The present disclosure generally relates to novel T-cell epitope-based compounds and compositions, including vaccines, effective against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection (or a closely related virus such as Severe Acute

Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof,. Such T-cell epitope compounds and compositions include immunogenic T-cell epitope polypeptides (including concatemeric polypeptides and chimeric or fusion polypeptides), as well as nucleic acids, plasmids, vectors (including expression vectors), and cells which express the polypeptides, pharmaceutical compositions, and vaccines. The present disclosure also generally relates to methods, assays, and kits for detecting a cell-mediated immune response, including a T cell response (e.g., CD8+ and/or CD4+ T cell response), against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) or a related coronavirus, as well as methods, assays, and kits for the diagnosis of a SARS-CoV-2 infection or related coronaviruses infection.

BACKGROUND

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a positive-sense single-stranded ribonucleic (RNA) virus belonging to the Coronaviridae family. SARS-CoV-2 (which may also be referred to herein as “COVID-19 virus”) was first identified in Wuhan, China in late 2019 and is the cause of the highly contagious coronavirus disease 2019 (which has been termed and may be referred to herein as “COVID-19”, “2019 novel coronavirus”, or “2019-nCoV”). SARS-CoV-2 infection causes a broad range of disease, known as coronavirus disease 2019 (COVID-19), from mild or no symptoms to serious complications that may be rapidly fatal, often in adults over 65 years old and individuals with underlying medical conditions including cardiovascular disease, type 2 diabetes, and obesity. The global spread of COVID-19 was declared a pandemic by the World Health Organization (WHO) on Mar. 11, 2020. As of Dec. 25, 2020, the global spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in over 79 million cases of COVID-19, 1.7 million deaths and global economic disruption in less than 10 months since the first case appeared in Wuhan, China. Recovery from natural infection in non-severe disease, and resistance to severe disease in younger individuals suggests that the immune system can be harnessed to help bring an end to the COVID-19 pandemic by vaccination strategies that recapitulate protective immune responses.

While immune correlates of COVID-19 protection are not yet defined, several studies show cellular adaptive immune mechanisms contribute to SARS-CoV-2 control. Humoral immune responses also contribute to protection and have been the focus of current vaccine development efforts. Virus-specific IgM and IgG antibodies are found in nearly all infections. Seroconversion is observed 7 to 14 days after onset of symptoms and persists for weeks after virus clearance. Antibody levels wane four to five months after infection, but durable memory B cell immunity has been described in mild and severe disease. Antibodies are found against the surface spike glycoprotein and the internal nucleocapsid protein. Neutralizing antibodies target the receptor binding domain of spike, preventing cell entry via the angiotensin-converting enzyme 2 (ACE2) host receptor. Neutralizing antibodies are found in more than 90% of persons who seroconvert. In a prospective study of exposed healthcare workers, anti-COVID-19 IgG titers were correlated with protection from subsequent PCR test positivity, suggesting that either antibodies, or T cell response (responsible for driving higher Ab titers) or both were correlates of protection from subsequent infection. In other studies, spike-specific follicular helper CD4 T cells (Tfh) frequencies correlate with neutralizing antibody responses. Although much of the current COVID-19 vaccine focus has been on generating antibody responses, this latter finding identifies a critical role for T cells in generating immunity.

More recently, correlations between a wide range of T cell responses and protection from infection have begun to emerge. A large prospective study showed numbers of SARS-CoV-2-specific T cells indirectly correlate with disease risk. Individuals with low T cell responses to spike, membrane and nucleocapsid proteins develop COVID-19 while high responders do not, even if seronegative. T cell breadth is another key feature of the protected response, as patients with mild disease have higher TCR clonality in blood and bronchoalveolar lavage in comparison with severe disease.

T cell phenotype and function may also help to predict mild versus severe cases. Poor outcomes are associated with multiple signs of T cell impairment including enhanced expression of PD-1 and TIM-3 exhaustion markers, higher inhibitory molecule levels including CTLA-4 and TIGIT, and low frequencies of polyfunctional CD4 and CD8 T cells, as well as low GzmB-producing CD8 T cells. In contrast, non-severe patients present with lower levels of inhibitory molecules and higher GzmA, GzmB, and perforin effectors. Moreover, in recovered patients,

Tfh are found in the periphery at the time of viral clearance and persist into convalescence in contrast with an absence of lymph node Tfh found in patients who died of COVID-19. These findings underscore the importance of defining T cell epitope specificities to better understand COVID-19 immunity and to develop antibody- and T cell-directed vaccines that exploit T cell immunity.

There is an urgent need for the identification of CD4+ and CD8+ effector T cell epitopes contained in SARS-CoV-2 and for their use in the development of effective pharmaceuticals and vaccines. There is also an urgent need for methods, assays, and kits for detecting an immune response, including a cell-mediated immune response, such a T cell response (e.g., CD8+ and/or CD4+ T cell response), against the SARS-CoV-2 or related coronaviruses, as well as methods, assays, and kits for the diagnosis of a SARS-CoV-2 infection or related coronaviruses infection.

SUMMARY

Accordingly, the present disclosure provides novel, therapeutic T cell epitope compounds and compositions (including one or more of e.g., peptides or polypeptides as disclosed herein, including polypeptides having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein; concatemeric peptides as disclosed herein, including concatemeric polypeptides comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and variants and fragments thereof; chimeric or fusion polypeptide compositions as disclosed herein; nucleic acids as disclosed herein, expression cassettes, plasmids, expression vectors, recombinant viruses, or cells as disclosed herein; vaccine compositions or formulations as disclosed herein, and/or pharmaceutical compositions as disclosed herein), and use of the same, e.g., in methods of stimulating, inducing, and/or expanding an immune response , e.g., against coronavirus infection, including SARS-CoV-2 infection and related diseases caused by SARS-CoV-2, including COVID-19, and methods of treating and/or preventing against SARS-CoV-2 infection and related diseases caused by SARS-CoV-2, including COVID-19 in a subject. In aspects, a T-cell epitope compound or composition of the present disclosure includes one or more peptides or polypeptides a disclosed herein. In aspects, the present disclosure is directed to a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, or fragments or variants thereof. The phrase “consisting essentially of” is intended to mean that a peptide or polypeptide according to the present disclosure, in addition to the sequence according to any of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 or a fragment or variant thereof, contains additional amino acids or residues that may be present at either terminus of the peptide and/or on a side chain that are not necessarily forming part of the peptide or polypeptide that functions as an MHC ligand and provided they do not substantially impair the activity of the peptide to function as a T-cell epitope. The polypeptides of the present disclosure may be isolated, synthetic, and/or recombinant, and may comprise post-transcriptional modifications such as glycosylation, added chemical groups, etc. In aspects, the peptides or polypeptides can be either in neutral (uncharged) or salt forms, and may be either free of or include modifications such as glycosylation, side chain oxidation, or phosphorylation. In aspects, the peptides or polypeptides of the instant disclosure can be capped with an n-terminal acetyl and/or c-terminal amino group. In aspects, peptides or polypeptides of the instant disclosure having SEQ ID NOS: 1055, 1058-1060, 1062, 1065-1066, 1069-1072, 1075, 1078, 1081-1087, 1089, 1091-1094, 1100, 1106, 1110-1113, 1115 and 1366, 1369-1371, 1373, 1376-1377, 1379-1383, 1385-1386, 1389, 1391-1400, 1402-1404, 1407, 1411-1419, 1421-1424, 1426-1427, 1118-1365, and 1429-1676 are capped with an n-terminal acetyl and a c-terminal amino group. In aspects, peptides or polypeptides of the instant disclosure having SEQ ID NOS: 1068, 1074, 1080, 1088, 1096, 1101-1105, 1107-1108, and 1116 are capped with an n-terminal acetyl and are not capped at the c-terminus.

In aspects, the instant disclosure is directed to a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692. In aspects, the instant disclosure is directed to a peptide or polypeptide have a core amino acid sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments and variants thereof), and optionally having extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal of the core amino acid sequence, wherein the overall number of these flanking amino acids is 1 to 12, 1 to 3, 2 to 4, 3 to 6, 1 to 10, 1 to 8, 1 to 6, 2 to 12, 2 to 10, 2 to 8, 2 to 6, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 4 to 12, 4 to 10, 4 to 8, 4 to 6, 5 to 12, 5 to 10, 5 to 8, 5 to 6, 6 to 12, 6 to 10, 6 to 8, 7 to 12, 7 to 10, 7 to 8, 8 to 12, 8 to 10, 9 to 12, 9 to 10, or 10 to 12, wherein the flanking amino acids can be distributed in any ratio to the C-terminus and the N-terminus (for example, all flanking amino acids can be added to one terminus, or the amino acids can be added equally to both termini or in any other ratio). In aspects, the instant disclosure is directed to a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, wherein the overall number of these flanking amino acids is 1 to 12, 1 to 3, 2 to 4, 3 to 6, 1 to 10, 1 to 8, 1 to 6, 2 to 12, 2 to 10, 2 to 8, 2 to 6, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 4 to 12, 4 to 10, 4 to 8, 4 to 6, 5 to 12, 5 to 10, 5 to 8, 5 to 6, 6 to 12, 6 to 10, 6 to 8, 7 to 12, 7 to 10, 7 to 8, 8 to 12, 8 to 10, 9 to 12, 9 to 10, or 10 to 12, wherein the flanking amino acids can be distributed in any ratio to the C-terminus and the N-terminus (for example all flanking amino acids can be added to one terminus, or the amino acids can be added equally to both termini or in any other ratio), provided that the polypeptide with the flanking amino acids is still able to bind to a same HLA molecule (i.e., retain MHC binding propensity) as said polypeptide core sequence without said flanking amino acids. In aspects, said polypeptide with the flanking amino acids is still able to bind to a same HLA molecule (i.e., retain MHC binding propensity) and/or retain the same TCR specificity as said polypeptide core sequence without said flanking amino acids. In aspects, said polypeptide with the flanking amino acids is still able to bind to a same HLA molecule (i.e., retain MHC binding propensity) and/or retain the same TCR specificity, and/or retain anti-viral activity, including anti-SARS-CoV-2 activity, as said polypeptide core sequence without said flanking amino acids. In aspects, said flanking amino acid sequences are those that also flank the peptides or polypeptides included therein in the naturally occurring protein from which the peptide or polypeptide is found. For example, for a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 4-68, 1003-1005, 708-739, 1055-1059, or 1366-1370 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 4-68, 1003-1005, 708-739, 1055-1059, or 1366-1370 in the amino acid sequence of the envelope (SEQ ID NO: 1) of SARS-CoV-2. For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 69-213, 1006-1015, 740-851, 1060-1072, 1118-1119, 1371-1383, or 1429-1430 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 69-213, 1006-1015, 740-851, 1060-1072, 1118-1119, 1371-1383, or 1429-1430 in the amino acid sequence of the membrane (SEQ ID NO: 2) of SARS-CoV-2. For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 210-707, 1016-1054, 852-1002, 1073-1117, 1120-1123, 1384-1428, 1431-1434, 1782-1800, 1975-1984, 2568, 2569, 2590, 2647-2676, or 2701-2704 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 210-707, 1016-1054, 852-1002, 1073-1117, 1120-1123, 1384-1428, 1431-1434, 1782-1800, 1975-1984, 2568, 2569, 2590, 2647-2676, or 2701-2704 in the amino acid sequence of the spike (SEQ ID NO: 3) of SARS-CoV-2. Additional examples are found in the specification, below. In aspects, said flanking amino acid sequences as described herein may serve as a WIC stabilizing region. In aspects, the use of a longer peptide may allow endogenous processing by patient cells and may lead to more effective antigen presentation and induction of T cell responses. In aspects, the extension(s) may serve to improve the biochemical properties of the peptides or polypeptides (e.g., but not limited to, solubility or stability) or to improve the likelihood for efficient proteasomal processing of the peptide. In aspects, the polypeptides of the present disclosure may be islated, synthetic, and/or recombinant, and may comprise post-transcriptional modifications such as glycosylation, added chemical groups, etc. In aspects, the peptides or polypeptides can be either in neutral (uncharged) or salt forms, and may be either free of or include modifications such as glycosylation, side chain oxidation, or phosphorylation. In certain aspects, the peptides or polypeptides of the instant disclosure can be capped with an n-terminal acetyl and/or c-terminal amino group.

In aspects, the present disclosure is directed to a concatemeric polypeptide or peptide that comprises at one or more of the instantly-disclosed polypeptides or peptides (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) linked, fused, or joined together (e.g., fused in-frame, chemically-linked, or otherwise bound) to an additional peptide or polypeptide. Such additional peptide or polypeptide may be one or more of the instantly instantly-disclosed polypeptides or peptides, or may be an additional peptide or polypeptide of interest. In aspects a concatemeric peptide is composed of 3 or more, 4 or more, 5 or more 6 or more 7 or more, 8 or more, 9 or more of the instantly-disclosed peptides or polypeptides. In other aspects, the concatemeric peptides or polypeptides include 1000 or more, 1000 or less, 900 or less, 500 or less, 100 or less, 75 or less, 50 or less, 40 or less, 30 or less, 20 or less or 100 or less peptide epitopes. In yet other embodiments, a concatemeric peptide has 3-100, 5-100, 10-100, 15-100, 20-100, 25-100, 30-100, 35-100, 40-100, 45-100, 50-100, 55-100, 60-100, 65-100, 70-100, 75-100, 80-100, 90-100, 5-50, 10-50, 15-50, 20-50, 25-50, 30-50, 35-50, 40-50, 45-50, 100-150, 100-200, 100-300, 100-400, 100-500, 50-500, 50-800, 50-1,000, or 100-1,000 of the instantly-disclosed peptides or polypeptides linked, fused, or joined together. Each peptide or polypeptide of the concatemeric polypeptide may optionally have one or more linkers, which may optionally be cleavage sensitive sites, adjacent to their N and/or C terminal end. In such a concatemeric peptide, two or more of the peptide epitopes may have a cleavage sensitive site between them. Alternatively two or more of the peptide epitopes may be connected directly to one another or through a linker that is not a cleavage sensitive site. In aspects, a concatemeric polypeptide of the present disclosure comprises, consists of, or consists essentially of one or more of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and/or fragments and variants thereof. In aspects, the instantly-disclosed concatermeric polypepide or peptide sequences do not correspond to a naturally occurring sequence, i.e., each of the one or more of the instantly-disclosed polypeptides or peptides (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) are linked, fused, or joined together (e.g., fused in-frame, chemically-linked, or otherwise bound) to an additional peptide or polypeptide (which may be one or more of the instantly-disclosed peptides) in such a fashion such that the overall concatermic polypeptide does not correspond to a naturally occurring coronavirus sequence. In aspects, the concatemeric polypeptides of the present disclosure may be isolated, synthetic, and/or recombinant, and may comprise post-transcriptional modifications such as glycosylation, added chemical groups, etc. In aspects, the concatemeric polypeptides can be either in neutral (uncharged) or salt forms, and may be either free of or include modifications such as glycosylation, side chain oxidation, or phosphorylation. In certain aspects, the concatemeric polypeptides of the instant disclosure can be capped with an n-terminal acetyl and/or c-terminal amino group.

In aspects, one or more peptides or polypeptides or concatemeric polypeptides of the instant disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, or a concatemeric polypeptide comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 (and fragments or variants thereof)), is joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide. In aspects, the one or more peptides or polypeptides or concatemeric polypeptides of the instant disclosure may be joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide as a whole, although it may be made up from a joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted amino acid sequence, together with flanking amino acids of the heterologous polypeptide.

In aspects, the present disclosure is directed to a chimeric or fusion polypeptide composition (which in aspects may be isolated, synthetic, or recombinant) comprising one or more peptides, polypeptides, or concatemeric peptides of the present disclosure. In aspects, a chimeric or fusion polypeptide composition of the present disclosure comprises one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure (e.g., a peptide or polypeptide comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide In aspects, the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure may be inserted into the heterologous polypeptide, may be added to the C-terminus (with or without the use of linkers, as is known in the art), and/or added to the N-terminus (with or without the use of linkers, as is known in the art) of the heterologous polypeptide. In aspects of the above chimeric or fusion polypeptide compositions, the one or more peptide, polypeptides, or concatemeric peptides may be joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide as a whole, although it may be made up from a joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted amino acid sequence, together with flanking amino acids of the heterologous polypeptide. In aspects, a chimeric or fusion polypeptide composition of the present disclosure comprises a peptide, polypeptide, and/or concatemeric peptide of the instant disclosure, said peptide, polypeptide, and/or concatemeric peptide having a sequence that is not naturally included in the heterologous polypeptide and/or is not located at its natural position in the heterologous polypeptide. For example, in aspects, the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure may be inserted into a SARS-CoV-2 sequence in which the SARS-CoV-2 sequence does not include the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure (e.g., the SARS-CoV-2 sequence is mutated to not include the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure) or the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure is inserted into a SARS-CoV-2 sequence but not at its natural position. In aspects, the one or more of peptide, polypeptide, and/or concatemeric peptide of the present disclosure can be joined, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into the heterologous polypeptide. In aspects of above-described chimeric or fusion polypeptide compositions, the chimeric or fusion polypeptides may be isolated, synthetic, or recombinant.

In aspects, the instant disclosure is directed to a nucleic acid (e.g., DNA or RNA, including mRNA) encoding one or more peptides, polypeptides, concatemeric peptides, and/or chimeric or fusion polypeptides as described herein. For example, in aspects, the instant disclosure is directed to a nucleic acid encoding a peptide or polypeptide comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID

NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692. Additionally, the instant disclosure is directed to a nucleic acid encoding a polypeptide comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence off SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734. In aspects, the present disclosure is directed to a vector, such as an expression vector, comprising such a nucleic acid as described. In aspects, the present disclosure is directed to expression cassettes, plasmids, expression vectors, recombinant viruses, or cells comprising a nucleic acid as described herein. In aspects, the present disclosure is directed to a cell or vaccine comprising such a vector as described. In aspects, the present disclosure is directed to a cell comprising a vector of the present disclosure.

In aspects, the instant disclosure is directed to a pharmaceutical composition, the pharmaceutical composition comprising a T-cell epitope compound or composition of the instant disclosure (e.g., one or more of: polypeptides as disclosed herein; concatemeric peptides as disclosed herein; chimeric or fusion polypeptide compositions as disclosed herein; nucleic acids as disclosed herein, including nucleic acids encoding such peptides, polypeptides, concatemeric peptides, or chimeric of fusion polypeptide compositions as disclosed herein; expression cassettes, plasmids, expression vectors, recombinant viruses, cells as disclosed herein) and a pharmaceutically acceptable carrier, excipient, and/or adjuvant. In aspects, the one or more nucleic acids encoding said peptides or polypeptides are DNA, RNA, or mRNA. In aspects of the above-described pharmaceutical compositions, the composition comprises at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 25, at least 30, at least 40, at least 50, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, or at least 1000, peptides, polypeptides, and/or concatemeric peptides, as disclosed herein, including every value or range therebetween.

In aspects, the instant disclosure is directed to a vaccine comprising a T-cell epitope compound or composition of the instant disclosure (e.g., one or more of: polypeptides as disclosed herein; concatemeric peptides as disclosed herein; chimeric or fusion polypeptide compositions as disclosed herein; nucleic acids as disclosed herein, including nucleic acids encoding such peptides, polypeptides, concatemeric peptides, or chimeric of fusion polypeptide compositions as disclosed herein; expression cassettes, plasmids, expression vectors, recombinant viruses, cells as disclosed herein; pharmaceutical compositions as disclosed herein; or vaccines as described herein) and, optionally, a carrier, excipient, and/or an adjuvant.

The present disclosure also relates to methods of immunizing or inducing an immune response in a subject, said method comprising administering to said subject one more peptides, polypeptides, concatemeric peptides, chimeric or fusion polypeptides, nucleic acids, expression cassettes, plasmids, expression vectors, recombinant viruses, cells pharmaceutical compositions, or vaccines as described herein. In aspects, the subject is a human. In aspects, the present disclosure is directed to to methods of immunizing or inducing an immune response in a subject, comprising administering to said subject a T-cell epitope compound or composition of the instant disclosure (e.g., one or more of: polypeptides as disclosed herein; concatemeric peptides as disclosed herein; chimeric or fusion polypeptide compositions as disclosed herein; nucleic acids as disclosed herein, including nucleic acids encoding such peptides, polypeptides, concatemeric peptides, or chimeric or fusion polypeptide compositions as disclosed herein; expression cassettes, plasmids, expression vectors, recombinant viruses, cells as disclosed herein; pharmaceutical compositions as disclosed herein; or vaccines as described herein). In aspects, the subject is a human. In aspects, the present disclosure is directed to a method of stimulating, inducing, and/or expanding an immune response to a SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, in a subject, comprising administering to said subject a T-cell epitope compound or composition of the instant disclosure.

The present disclosure also relates to methods of treating and/or preventing a SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, in a subject, such as a human, comprising administering to said subject a T-cell epitope compound or composition of the instant disclosure (e.g., one or more of: polypeptides as disclosed herein; concatemeric peptides as disclosed herein; chimeric or fusion polypeptide compositions as disclosed herein; nucleic acids as disclosed herein, including nucleic acids encoding such peptides, polypeptides, concatemeric peptides, or chimeric or fusion polypeptide compositions as disclosed herein; expression cassettes, plasmids, expression vectors, recombinant viruses, cells as disclosed herein; pharmaceutical compositions as disclosed herein; or vaccines as described herein).

As should be understood, the T-cell epitope compounds or compositions of the instant disclosure as described herein may be used to induce an immune response and/or to vaccinate a subject. It is particularly useful to vaccinate against SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19.

In aspects, the instant disclosure provide novel methods, assays, and kits for detecting an immune response, and in aspects a cell-mediated immune (“CMI”) response, including a T cell response (e.g., CD8+ and/or CD4+ T cell response) against SARS-CoV-2 or a related coronavirus, as well as methods, assays, and kits for the diagnosis of a SARS-CoV-2 infection or related coronaviruses infection, as well as diseases caused by SARS-CoV-2, including the highly contagious coronavirus disease 2019. The instantly-disclosed assays, methods, and kits use or include one or more T-cell epitope compounds and composiitons (including peptides or polypeptides) a disclosed herein.

In aspects, the present disclosure is directed to methods of measuring a CMI response against COVID-19 or a related coronavirus infection in a subject by incubating a sample from the subject which comprises T-cells or other cells of the immune system with one or more peptides or polypeptides of the instant disclosure. In aspects, production of IFN-γ or other cytokine or immune effector molecule(s) is then detected. The presence or level of immune effector is then indicative of the level of cell mediated responsiveness of the subject. In aspects, preferably, the sample is whole blood which is collected in a suitable container comprising the antigen. Optionally, a simple sugar such as dextrose is added to the incubation mixture. Accordingly, one aspect of the present disclosure relates to a method for measuring a CMI response in a subject, said method comprising collecting a sample from said subject wherein said sample comprises cells of the immune system which are capable of producing immune effector molecules following stimulation by an antigen, incubating said sample with one or more peptides or polypeptides of the instant disclosure and then measuring the presence of or elevation in the level of an immune effector molecule wherein the presence or level of said immune effector molecule is indicative of the capacity of said subject to mount a cell-mediated immune response against SARS-CoV-2 or a related coronavirus infection. In aspects, the presence of or elevation in the level of an immune effector molecule wherein the presence or level of said immune effector molecule is indicative of the capacity of said subject to mount a cell-mediated immune response against SARS-CoV-2 or a related coronavirus infection.

In aspects, the present disclosure is directed to methods of assaying for SARS-CoV-2 or related coronavirus peptide-specific T-cells, the method comprising providing a fluid containing T-cells, adding one or more peptides or polypeptides of the instant disclosure to the fluid, incubating the fluid to cause cytokine release, and detecting the released cytokine. Preferably the method comprises providing the fluid containing T-cells in contact with a surface carrying an immobilized first antibody to the cytokine, adding the peptide or polypeptide to the fluid, incubating the resulting fluid mixture under conditions to cause any peptide or poleypeptide-specific T-cells that have been pre-sensitized in vivo to the peptide or polypeptide to secrete the cytokine, and detecting any secreted cytokine bound to the immobilized first antibody. In aspects, the cells are preferably peripheral blood mononuclear cells (PMBC). They may suitably be taken from a patient known to be suffering, or to have suffered, from COVID-19 infection or a related coronavirus infection. In aspects, the cells used are fresh. In aspects, the assay is used to identify or quantitate peptide or polypeptide-specific T-cells e.g. CD8+ or CD4+ cells that have been activated or pre-sensitized in vivo to a particular peptide or polypeptide. In aspects, these are unrestimulated T-cells, i.e. cells capable of immediate effector function without the need to effect division/differentiation by in vitro culture. When a peptide or polypeptide in question is presented to such cells, the cells secrete various cytokines, of which any one may be selected for the purposes of this assay. In aspects, the cytokine selected is interferon-γ (IFN γ).

In aspects, the present disclosure provides a method of detecting an anti-SARS-CoV-2 (or related coronavirus) T cell response (which in aspects can included CD4+ and/or CD8+ T cell response) comprising contacting a population of T cells of an individual with a peptide or polypeptide of the instant disclosure, wherein one or more of said peptides or polypeptides may be substituted by an analogue which binds a T cell receptor that recognizes the peptide, and determining whether T cells of the T cell population recognize the peptide(s).

In aspects, the present disclosure provides a method of diagnosing a SARS-CoV-2 or related coronavirus infection in a host, or exposure of a host, to SARS-CoV-2 or related corornavirus comprising (i) contacting a population of T cells from the host with one or more peptides or analogues as disclosed here, and analogues thereof which can bind a T cell receptor which recognizes any of the said peptides; and (ii) determining whether the T cells of said T cell population recognize the peptide(s) and/or analogue(s).

BRIEF DESCRIPTION OF THE FIGURES

The present disclosure may be better understood with reference to the following figures.

FIG. 1 is an overview of MHC class II cluster selection from the envelope (SEQ ID NO: 1) of SARS-CoV-2. The cluster address given the location of the peptide within the sequences that were provided for analysis. The core peptide (middle amino acids in bold, SEQ ID NO: in parentheses) defines the actual cluster that was identified during the analysis. The stabilizing flanks (N-terminal and C-terminal, not bold) are included for use with the core sequence, and are labeled by the SEQ ID NO: not listed in parentheses. The number of hits is the number of EpiMatrix Z-scores above 1.64 or top 5% found within the sequence. The EpiMatrix Cluster Score is derived from the number of hits normalized for the length of the cluster. Cluster Score is thus the excess or shortfall in predicted aggregate immunogenicity relative to a random peptide standard. Hydrophobicity scores of 2 and above are predictive of difficulty synthesizing peptides.

FIGS. 2-4 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2. Z-score indicates the potential of a 9-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figures. All scores in the Top 5% (Z-Score >1.64) are considered “Hits”. *Scores in the top 10% are considered elevated, other scores are grayed out for simplicity. Frames containing 4 or more alleles scoring above 1.64 are referred to as EpiBars and are highlighted in yellow. These frames have an increased likelihood of binding to HLA. Flanking amino acids, added to stabilize the cluster during in-vitro testing, are presented in blue type face and underlined.

FIG. 5 is an overview of MHC class II cluster selection from the membrane (SEQ ID NO: 2) of SARS-CoV-2. The cluster address given the location of the peptide within the sequences that were provided for analysis. The core peptide (middle amino acids in bold, SEQ ID NO: in parentheses) defines the actual cluster that was identified during the analysis. The stabilizing flanks (N-terminal and C-terminal, not bold) are included for use with the core sequence, and are labeled by the SEQ ID NO: not listed in parentheses. The number of hits is the number of EpiMatrix Z-scores above 1.64 or top 5% found within the sequence. The EpiMatrix Cluster Score is derived from the number of hits normalized for the length of the cluster. Cluster Score is thus the excess or shortfall in predicted aggregate immunogenicity relative to a random peptide standard. Hydrophobicity scores of 2 and above are predictive of difficulty synthesizing peptides.

FIGS. 6-15 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2. Z-score indicates the potential of a 9-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figures. All scores in the Top 5% (Z-Score >1.64) are considered “Hits”. *Scores in the top 10% are considered elevated, other scores are grayed out for simplicity. Frames containing 4 or more alleles scoring above 1.64 are referred to as EpiBars and are highlighted in yellow. These frames have an increased likelihood of binding to HLA. Flanking amino acids, added to stabilize the cluster during in-vitro testing, are presented in blue type face and underlined.

FIG. 16 is an overview of MHC class II cluster selection from the spike (SEQ ID NO: 3) of SARS-CoV-2. The cluster address given the location of the peptide within the sequences that were provided for analysis. The core peptide (middle amino acids in bold, SEQ ID NO: in parentheses) defines the actual cluster that was identified during the analysis. The stabilizing flanks (N-terminal and C-terminal, not bold) are included for use with the core sequence, and are labeled by the SEQ ID NO: not listed in parentheses. The number of hits is the number of EpiMatrix Z-scores above 1.64 or top 5% found within the sequence. The EpiMatrix Cluster Score is derived from the number of hits normalized for the length of the cluster. Cluster Score is thus the excess or shortfall in predicted aggregate immunogenicity relative to a random peptide standard. Hydrophobicity scores of 2 and above are predictive of difficulty synthesizing peptides.

FIGS. 17-55 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2. Z-score indicates the potential of a 9-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figures. All scores in the Top 5% (Z-Score >1.64) are considered “Hits”. *Scores in the top 10% are considered elevated, other scores are grayed out for simplicity. Frames containing 4 or more alleles scoring above 1.64 are referred to as EpiBars and are highlighted in yellow. These frames have an increased likelihood of binding to HLA. Flanking amino acids, added to stabilize the cluster during in-vitro testing, are presented in blue type face and underlined.

FIG. 56 is the JanusMatrix reports for identified MHC class II clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2. *Count of HUMAN JanusMatrix matches found in the search database. With respect to a given EpiMatrix Hit (a 9-mer contained within the input sequence which is predicted to bind to a specific allele), a Janus Matrix match is a 9-mer derived from the search database (e.g., the human genome) which is predicted to bind to the same allele as the EpiMatrix Hit and shares TCR facing contacts with the EpiMatrix Hit. ** Janus Homology Score represents the average depth of coverage in the search database for each EpiMatrix hit in the input sequence. For example, an input peptide with eight EpiMatrix hits, all of which have one match in the search database, has a Janus Homology Score of 1. An input peptide with four EpiMatrix Hits, all of which have two matches in the search database, has a Janus Homology Score of 2. The JanusMatrix Homology Score considers all constituent 9-mers in any given peptide, including flanks.

FIG. 57 is the JanusMatrix reports for identified MHC class II clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2. *Count of HUMAN JanusMatrix matches found in the search database. With respect to a given EpiMatrix Hit (a 9-mer contained within the input sequence which is predicted to bind to a specific allele), a Janus Matrix match is a 9-mer derived from the search database (e.g., the human genome) which is predicted to bind to the same allele as the EpiMatrix Hit and shares TCR facing contacts with the EpiMatrix Hit. ** Janus Homology Score represents the average depth of coverage in the search database for each EpiMatrix hit in the input sequence. For example, an input peptide with eight EpiMatrix hits, all of which have one match in the search database, has a Janus Homology Score of 1. An input peptide with four EpiMatrix Hits, all of which have two matches in the search database, has a Janus Homology Score of 2. The JanusMatrix Homology Score considers all constituent 9-mers in any given peptide, including flanks.

FIG. 58 is the JanusMatrix reports for identified WIC class II clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2. *Count of HUMAN JanusMatrix matches found in the search database. With respect to a given EpiMatrix Hit (a 9-mer contained within the input sequence which is predicted to bind to a specific allele), a Janus Matrix match is a 9-mer derived from the search database (e.g., the human genome) which is predicted to bind to the same allele as the

EpiMatrix Hit and shares TCR facing contacts with the EpiMatrix Hit. ** Janus Homology Score represents the average depth of coverage in the search database for each EpiMatrix hit in the input sequence. For example, an input peptide with eight EpiMatrix hits, all of which have one match in the search database, has a Janus Homology Score of 1. An input peptide with four EpiMatrix Hits, all of which have two matches in the search database, has a Janus Homology Score of 2. The JanusMatrix Homology Score considers all constituent 9-mers in any given peptide, including flanks.

FIG. 59 is an EpiMatrix staircase report for identified WIC class I clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2. Z-score indicates the potential of a 9-mer or 10-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figure. All scores in the Top 5% (Z-Score≥1.64) are considered “Hits”.

FIG. 60 is an EpiMatrix staircase report for identified MHC class I clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2. Z-score indicates the potential of a 9-mer or 10-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figure. All scores in the Top 5% (Z-Score≥1.64) are considered “Hits”.

FIG. 61 is an EpiMatrix staircase report for identified WIC class I clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2. Z-score indicates the potential of a 9-mer or 10-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figure. All scores in the Top 5% (Z-Score≥1.64) are considered “Hits”.

FIG. 62 shows the sequences of the envelope (SEQ ID NO: 1) of SARS-CoV-2, the membrane (SEQ ID NO: 2) of SARS-CoV-2, and the spike (SEQ ID NO: 3) of SARS-CoV-2.

FIG. 63A and FIG. 63B, ex vivo immune recall responses differentiate SARS-CoV-2 naïve and experienced individuals and exhibit different COVID-19 immunotypes.

FIG. 64A and FIG. 64B shows that strong ex vivo immune recall responses are found or may be found in SARS-CoV-2 experienced individuals using polypeptides of the instant disclosure.

FIG. 65 shows polypeptides of the instant disclosure stimulate ex vivo immune recall response in natural SARS-CoV-2 infection.

FIG. 66A and FIG. 66B show polypeptides of the instant disclosure stimulate or may stimulate higher IFN-γresponses in naïve and COVID-19 convalescent donors following expansion in culture.

FIG. 67A and FIG. 67B show polypeptides of the instant disclosure stimulate or may stimulate low frequency epitope-specific T cells following expansion in culture in naïve and COVID-19 convalescent donors.

FIG. 68 shows polypeptides of the instant disclosure stimulate low frequency epitope-specific T cells following expansion in culture in naïve and COVID-19 convalescent donors.

FIGS. 69A-69D shows the sequences of the nucleocapsid (SEQ ID NO: 1693), ORF3a (SEQ ID NO: 1694), ORF6 (SEQ ID NO: 1695), ORF7a (SEQ ID NO: 1696), ORF8 (SEQ ID NO: 1697), ORF10 (SEQ ID NO: 1698), ORF1ab non-structural protein 2 (NSP2) (SEQ ID NO: 1699), ORF1ab non-structural protein 3 (NSP3) (SEQ ID NO: 1700), ORF1ab non-structural protein 4 (NSP4) (SEQ ID NO: 1701), ORF1ab 3C-like proteinase (SEQ ID NO: 1702), ORF1ab non-structural protein 6 (NSP6) (SEQ ID NO: 1703), ORF1ab non-structural protein 7 (NSP7) (SEQ ID NO: 1704), ORF1ab non-structural protein 8 (NSP8) (SEQ ID NO: 1705), ORF1ab non-structural protein 9 (NSP9) (SEQ ID NO: 1706), ORF1ab non-structural protein 10 (NSP10) (SEQ ID NO: 1707), ORF1ab RNA-dependent RNA polymerase (SEQ ID NO: 1708), ORF1ab helicase (SEQ ID NO: 1709), ORF1ab 3′-5′ exonuclease (SEQ ID NO: 1710), ORF1ab endoRNase (SEQ ID NO: 1711), and ORF1ab 2′O-ribose methyltransferase proteins (SEQ ID NO: 1712) of SARS-CoV-2.

FIG. 70 is an overview of MHC class II cluster selection from the various proteins of SARS-CoV-2 with the corresponding SEQ ID NO: in parentheses. The cluster address given the location of the peptide within the sequences that were provided for analysis. The core peptide (middle amino acids in bold, SEQ ID NO: in parentheses) defines the actual cluster that was identified during the analysis. The stabilizing flanks (N-terminal and C-terminal, not bold) are included for use with the core sequence, and are labeled by the SEQ ID NO: not listed in parentheses. The number of hits is the number of EpiMatrix Z-scores above 1.64 or top 5% found within the sequence. The EpiMatrix Cluster Score is derived from the number of hits normalized for the length of the cluster. Cluster Score is thus the excess or shortfall in predicted aggregate immunogenicity relative to a random peptide standard. Hydrophobicity scores of 2 and above are predictive of difficulty synthesizing peptides.

FIGS. 71-92 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the SARS-CoV-2 peptides of FIG. 70. Z-score indicates the potential of a 9-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figures. All scores in the Top 5% (Z-Score≥1.64) are considered “Hits”. *Scores in the top 10% are considered elevated, other scores are grayed out for simplicity. Frames containing 4 or more alleles scoring above 1.64 are referred to as EpiBars and are highlighted in yellow. These frames have an increased likelihood of binding to HLA. Flanking amino acids, added to stabilize the cluster during in-vitro testing, are presented in blue type face and underlined.

FIGS. 93-101 are EpiMatrix staircase reports for identified MHC class I clusters of the SARS-CoV-2 peptides of FIG. 70. Z-score indicates the potential of a 9-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figure. All scores in the Top 5% (Z-Score≥1.64) are considered “Hits”.

FIGS. 102-110 are EpiMatrix staircase reports for identified MHC class II clusters of the SARS-CoV-2 peptides of FIG. 70. Z-score indicates the potential of a 9-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figure. All scores in the Top 5% (Z-Score≥1.64) are considered “Hits”.

FIGS. 111-119 are EpiMatrix staircase reports for identified MHC class I clusters of the SARS-CoV-2 peptides of FIG. 70. Z-score indicates the potential of a 10-mer frame to bind to a given HLA allele; the strength of the score is indicated by the blue shading as noted in the respective Figure. All scores in the Top 5% (Z-Score≥1.64) are considered “Hits”.

FIG. 120 shows the sequences of some of the concatemers disclosed herein along with their corresponding sequence identifiers.

FIGS. 121A-D show predicted SARS-CoV-2 T cell epitopes are antigenic ex vivo in COVID-19 convalescent donors but not healthy donors. (FIG. 121A) Convalescent and pre-SARS-CoV-2 donors (naive) were stimulated with the total peptide pool consisting of our 32 predicted epitopes and IFNγ producing cells were measured by Fluorospot assay. Open circles identify responses to low-dose restimulation. Horizontal line indicates positivity criteria at SFC/10⁶spenocytes=25. (FIG. 121B) IFNγresponses to individual peptides were also assessed, identifying the breadth of response in individual donors, (FIG. 121C) the frequency of responses to unique peptides within the cohort (vertical lines denote 20% of each cohort), and (FIG. 121D) the depth of response indicated by the frequency of IFNγ-producing, epitope-specific cells, vertical lines indicate positivity criteria at SFC/10⁶spenocytes=25). *=p<0.05.

FIGS. 122A-B show SARS-CoV-2 experienced individuals exhibit variable immune recall responses ex vivo. (FIG. 122A) Significant responses to individual peptides (combined per source antigen) identify three distinct immunotype cohorts within covalescent donors (Donor notations; *=pneumonia, **=hospitalized, non-ICU). (FIG. 122B) Correlation of cumulative T cell responses and age according to gender.

FIGS. 123A-D show antigen-specific T cell expansion increases responses in COVID-19 convalescents and uncovers pre-existing SARS-CoV-2 immunity in healthy donors. (FIG. 123A) PBMCs of convalescent and naive donors were restimulated with the total peptide pool following 8-day expansion culture and IFNγ producing cells were measured by Fluorospot assay. Horizontal line indicates positivity criteria at SFC/10⁶spenocytes=25. (FIG. 123B) IFNγ responses to individual peptides were also assessed, identifying the breadth of response in individual donors, (FIG. 123C) the frequency of responses to unique peptides within the cohort (vertical lines denote 20% of each cohort), (FIG. 123D) and the depth of response indicated by the frequency of IFNγ-producing, epitope-specific cells (vertical lines indicate positivity criteria at SFC/10⁶spenocytes=25).

FIGS. 124A-D show EPV-CoV-19 immunization stimulates strong type 1-skewed T cell responses in HLA-DR3 transgenic mice. Eight days post-boost, murine splenocytes were isolated and assayed for epitope-specific recall responses. Cells were plated in dual IFNγ/IL-4 fluorospot plates, and restimulated with peptide pools for 48 h. (FIG. 124A) Representative images and spot counts are shown for both. (FIG. 124B) IFNγ SFC counts were normalized to 1×10⁶cells and adjusted by background subtraction, and (FIG. 124C) IFNγ SI index was determined by calculating the fold change of individual restimulation replicates over background. (FIG. 124C) IL-4 SFC and (FIG. 124D) SI were similarly calculated. Horizontal lines denote positivity criteria of SFC>25 and SI>5, respectively. (FIG. 124D) From the reported IFNγ and IL-4 stimulation indexes, we calculated the IFNγ:IL-4 ratio of each restimulation replicate to model the overall skewing of the immune response, identifying a sharply type 1 skewed phenotype in all vaccinated animals. Horizontal lines identify 40, 100, and 1000-fold skewing of type 1/type 2 response.

(n=17, *=p<0.05, **=p<0.01, ***=p<0.001).

FIGS. 125A-E show EPV-CoV-19 immunization stimulates type 1-skewed Memory CD4 and CD8 T cells in HLA-DR3 transgenic mice. Splenocytes were restimulated with a vaccine-matched peptide pool 6 hours in the presence of brefeldin A and monensin. Following incubation, cells were stained for surface markers, fixed and permeabilized, stained for intracellular markers, and expression of markers was recorded by flow cytometry. Memory CD4+ T cells and CD8+ T cells were assessed for IFNγ, IL-4, or IL-5-production (both frequency in parent T cell population and mean fluorescence intensity (MFI) of cytokines). (FIG. 125A) Representative images of type 1 and type 2 skewed, epitope-specific memory T cell populations are shown. (FIG. 125B) The fold increase of epitope-specific responses (over CD28 stimulated controls) identify vaccine-specific induction of IFNγ, but not (FIG. 125C) IL-4, or (FIG. 125D) IL-5. (FIG. 125E) From ICS generated data, we calculated the fold increase of IFNγ or IL-4 and/or IL-5 producing cells with peptide restimulation and used the ratio of type 1-to-type 2 responses to model Th-skewing and Tc-skewing in vaccinated animals. (n=17, *=p<0.05, *=p<.01, ***=p<.001)

DETAILED DESCRIPTION OF THE INVENTION

The present disclosure generally relates to T-cell epitope-based compounds and compositions, including vaccines, for use against SARS-CoV-2 infection and related diseases caused by SARS-CoV-2, including COVID-19. The disclosure relates to immunogenic peptides, polypeptides, concatemeric peptides, and chimeric or fusion polypeptides and the uses thereof, particularly in pharmaceutical and vaccine compositions. The present disclosure also relates to nucleic acids, vectors (including expression vectors), and cells which express the peptides, polypeptides, concatemeric peptides, and chimeric or fusion polypeptides and the uses thereof. The peptides, polypeptides, concatemeric peptides, and chimeric or fusion polypeptides of the present disclosure more specifically comprise an agretope predicted to be a ligand of HLA class I and/or HLA class II MHC molecules, as well as an epitope that is predicted to be recognized by T-cells (including CD8+ and/or CD4+ T-cells) in the context of MHC class I and/or class II molecules. The instant disclosure is particularly suited to produce vaccines for humans, particularly for vaccinating against coronavirus infection, including SARS-CoV-2 infection and related diseases caused by SARS-CoV-2, including COVID-19.

It is possible to exploit epitope-specific T-cells to induce an immune response against specific antigens. This discovery has implications for the design of therapeutic regimens and antigen-specific therapies against particular pathogens and infections. The instant disclosure and data relates to identified SARS-CoV-2 T cell epitopes that are recognized in natural infection and stimulate pre-existing immunity to SARS-CoV-2. These epitopes are excellent candidates for T cell-directed vaccine development. A T cell targeting vaccine composed of conserved epitopes may provide rapid, effective and long-term immunity at sites of infection with production of tissue resident memory CD8+ T cells, as well as memory CD4+ T cells that support antibody responses. Influenza vaccination during the 2009 H1N1 pandemic demonstrated that memory CD4+ T cells are able to support naive B cell responses to a novel hemagglutinin. A T cell-directed SARS-CoV-2 vaccine could generate robust CD4+ T cell memory that would provide early control of acute infection with a novel SARS-CoV-2 virus in the absence of pre-existing cross-protective antibodies. Thus, administration of T-cell epitopes, including a T-cell epitope compound or composition of the present disclosure (including one or more of peptides or polypeptides having a sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein; concatemeric peptides as disclosed herein (including a concatemeric polypeptide of the present disclosure that comprises, consists of, or consists essentially of one or more of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and/or fragments and variants thereof); chimeric of fusion polypeptide compositions as disclosed herein; nucleic acids encoding such peptides, polypeptides, concatemeric peptides, or chimeric of fusion polypeptide compositions as disclosed herein; expression cassettes, plasmids, expression vectors, recombinant viruses, or cells which express such peptides, polypeptides, concatemeric peptides or chimeric of fusion polypeptide compositions as disclosed herein; vaccine compositions or formulations as disclosed herein; and/or pharmaceutical compositions or formulations as disclosed herein), optionally in conjunction with a drug (such as an antiviral drug), a protein, or a inactivated or live attenuated virus, can induce an immune response, e.g., against a pathogen, including SARS-CoV-2 and related diseases caused by SARS-CoV-2, including COVID-19. T-cell epitopes, including T-cell epitope compounds and compositions of the present disclosure, can be used to deliberately manipulate the immune system toward immunity.

For example, the T-cell epitope compounds and compositions of the present disclosure are useful in the selective engagement and activation of immunogenic T-cells. It is demonstrated herein that certain naturally occurring T-cells (in aspects, including CD4+ and CD8+ T-cells), can be engaged, activated, and/or applied to induce immunity or induce an immune response against pathogens such as SARS-CoV-2 and related diseases caused by SARS-CoV-2, including COVID-19. By using the T-cell epitope compounds and compositions of the present disclosure to selectively activate naturally occurring T-cells, it is herein shown that such T-cell epitope compounds and compositions can be used to stimulate, induce, and/or expand an immune response to a coronavirus, including SARS-CoV-2 and related diseases caused by SARS-CoV-2, including COVID-19 in a subject, and thus can be used in methods of treating and/or preventing SARS-CoV-2 and related diseases caused by SARS-CoV-2, including COVID-19 in a subject.

Definitions

To further facilitate an understanding of the present disclosure, a number of terms and phrases are defined below. Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. It will be further understood that terms such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and the present disclosure, and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.

Ranges provided herein are understood to be shorthand for all of the values within the range. For example, a range of 1 to 25 is understood to include any number, combination of numbers, or sub-range from the group consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25, as well as all intervening decimal values between the aforementioned integers such as, for example, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, and 1.9. With respect to sub-ranges, “nested sub-ranges” that extend from either end point of the range are specifically contemplated. For example, a nested sub-range of an exemplary range of 1 to 25 may comprise 1 to 5, 1 to 10, 1 to 15, and 1 to 20 in one direction, or 25 to 20, 25 to 15, 25 to 10, and 25 to 5 in the other direction.

As used herein, the term “biological sample” as refers to any sample of tissue, cells, or secretions from an organism.

As used herein, the term “medical condition” includes, but is not limited to, any condition or disease manifested as one or more physical and/or psychological symptoms for which treatment and/or prevention is desirable, and includes previously and newly identified diseases and other disorders.

As used herein, the term “immune response” refers to the concerted action of lymphocytes, antigen presenting cells, phagocytic cells, granulocytes, and soluble macromolecules produced by the above cells or the liver (including antibodies, cytokines, and complement) that results in selective damage to, destruction of, or elimination from the human body of cancerous cells, metastatic tumor cells, malignant melanoma, invading pathogens (including a virus), cells or tissues infected with pathogens, or, in cases of autoimmunity or pathological inflammation, normal human cells or tissues. In aspects, an immune response includes a measurable cytotoxic T lymphocyte (CTL) response (e.g., against a virus expressing an immunogenic polypeptide) or a measurable B cell response, such as the production of antibodies, (e.g., against an immunogenic polypeptide). One of ordinary skill would know various assays to determine whether an immune response against a peptide, polypeptide, or related composition was generated, including use of the experiments and assays as disclosed in the Examples herein. Various B lymphocyte and T lymphocyte assays are well known, such as ELISAs, Eli Spot assays, cytotoxic T lymphocyte CTL assays, such as chromium release assays, proliferation assays using peripheral blood lymphocytes (PBL), tetramer assays, and other cytokine production assays. See Benjamini et al. (1991), hereby incorporated by reference.

As used herein, the term “effective amount”, “therapeutically effective amount”, or the like of a composition, including a T-cell epitope compound or composition of the present disclosure (including one or more of peptides or polypeptides having a sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein; concatemeric peptides as disclosed herein, including concatemeric polypeptides comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and/or fragments and variants thereof; chimeric of fusion polypeptide compositions as disclosed herein; nucleic acids encoding such peptides, polypeptides, concatemeric peptides, or chimeric of fusion polypeptide compositions as disclosed herein; expression cassettes, plasmids, expression vectors, recombinant viruses, or cells which express such peptides, polypeptides, concatemeric peptides or chimeric of fusion polypeptide compositions as disclosed herein; vaccine compositions or formulations, and/or pharmaceutical compositions or formulations as disclosed herein) is a quantity sufficient to achieve a desired therapeutic and/or prophylactic effect, e.g., an amount that results in the prevention of, or a decrease in, the symptoms and/or underlying causes associated with a disease that is being treated, such as SARS-CoV-2 infection and related diseases caused by SARS-CoV-2, including COVID-19, or an amount to measurably to inhibit inhibit virus (for example, SARS-CoV-2) replication or infectivity. The amount of a composition of the present disclosure administered to the subject will depend on the type and severity of the disease and on the characteristics of the individual, such as general health, age, sex, body weight and tolerance to drugs. It will also depend on the degree, severity and type of disease. The skilled artisan will be able to determine appropriate dosages depending on these and other factors. The T-cell epitope compounds and compositions of the present invention can also be administered in combination with each other or with one or more additional therapeutic compounds.

As used herein, “anti-SARS-CoV-2 activity”, “anti-SARS-CoV-2 polypeptides”, “anti-SARS-CoV-2 compounds and compositions”, and the like are intended to mean that the T-cell epitope compounds and compositions of the of the present diclsoure (including polypeptides, concatemeric polypeptides, chimeric or fusion proteins, nucleic acids, plasmids, vectors, pharmaceutical compositions, vaccines, and other compositions of the instant disclosure) have anti-SARS-CoV-2 activity and thus are capable of suppressing, controlling, and/or killing an invading SARS-CoV-2 virus. For example, anti-SARS-CoV-2 activity means that the instantly-disclosed therapeutic T-cell epitope comopounds and compositions are, in aspects: capable of stimulating, inducing, and/or expanding an immune response to SARS-CoV-2 (e.g., a cellular (CD4+ and/or CD8+ T-cell response) or humoral immune response to SARS-CoV-2) and/or associated diseases in a subject; capable of stimulating, inducing, and/or expanding a SARS-CoV-2-specific IFNγ response (e.g., by lymphocytes such as PMBC, or effector CD4+ and/or CD8+ T-cells), capable of inhibiting SARS-CoV-2 viral replication or infectivity, and/or capable of inducing immunity against SARS-CoV-2. In aspects, a T-cell epitope compound or composition of the present disclosure having anti-SARS-CoV-2 activity will reduce the disease symptoms resulting from SARS-CoV-2 challenge by at least about 5% to about 50%, at least about 10% to about 60%, at least about 30% to about 70%, at least about 40% to about 80%, or at least about 50% to about 90% or greater, including any value or range therebetween. Anti-SARS-CoV-2 activity can be determined by various experiments and assays as known to those of skill in the art, including methods such as by antibody titrations of sera, e.g., by ELISA and/or seroneutralization assay analysis and/or by vaccination challenge evaluation, including use of experiments and assays as disclosed in the Examples herein.

As used herein, the term “T-cell epitope” means an MHC ligand or protein determinant, 7 to 30 amino acids in length, and capable of specific binding to human leukocyte antigen (HLA) molecules and interacting with specific T cell receptors (TCRs). As used herein, in the context of a T cell epitope that is known or determined (e.g. predicted) to engage a T cell, the terms “engage”, “engagement” or the like means that when bound to a MHC molecule (e.g. human leukocyte antigen (HLA) molecules), the T cell epitope is capable of interacting with the TCR of the T cell and activating the T cell. Generally, T-cell epitopes are linear and do not express specific three-dimensional characteristics. T-cell epitopes are not affected by the presence of denaturing solvents. The ability to interact with T-cell epitopes can be predicted by in silico methods (De Groot A S et al., (1997), AIDS Res Hum Retroviruses, 13(7):539-41; Schafer J R et al., (1998), Vaccine, 16(19):1880-4; De Groot A S et al., (2001), Vaccine, 19(31):4385-95; De Groot A R et al. ,(2003), Vaccine, 21(27-30):4486-504, all of which are herein incorporated by reference in their entirety.

As used herein, the term “T-cell epitope cluster” refers to polypeptide that contains between about 4 to about 40 MHC binding motifs. In particular embodiments, the T-cell epitope cluster contains between about 5 to about 35 MHC binding motifs, between about 8 and about 30 MHC binding motifs; and between about 10 and 20 MHC binding motifs.

As used herein, the term “immune-stimulating T-cell epitope polypeptide” refers to a molecule capable of inducing an immune response, e.g., a humoral, T cell-based, or innate immune response.

As used herein, the term “regulatory T cell”, “Treg” or the like, means a subpopulation of T cells that suppress immune effector function, including the suppression or down regulation of CD4+ and/or CD8+ effector T cell (Teff) induction, proliferation, and/or cytokine production, through a variety of different mechanisms including cell-cell contact and suppressive cytokine production. In aspects, CD4+ Tregs are characterized by the presence of certain cell surface markers including but not limited to CD4, CD25, and FoxP3. In aspects, upon activation, CD4+ regulatory T cells secrete immune suppressive cytokines and chemokines including but not limited to IL-10 and/or TGFβ. CD4+ Tregs may also exert immune suppressive effects through direct killing of target cells, characterized by the expression upon activation of effector molecules including but not limited to granzyme B and perforin. In aspects, CD8+ Tregs are characterized by the presence of certain cell surface markers including but not limited to CD8, CD25, and, upon activation, FoxP3. In aspects, upon activation, regulatory CD8+ T cells secrete immune suppressive cytokines and chemokines including but not limited to IFNγ, IL-10, and/or TGFβ. In aspects, CD8+ Tregs may also exert immune suppressive effects through direct killing of target cells, characterized by the expression upon activation of effector molecules including but not limited to granzyme B and/or perforin.

As used herein, the term “regulatory T cell epitope” (“Tregitope”) refers to a “T cell epitope” that causes a tolerogenic response (Weber CA et al., (2009), Adv Drug Deliv, 61(11):965-76) and is capable of binding to MHC molecules and engaging (i.e.interacting with and activating) circulating naturally occurring Tregs (in aspects, including natural Tregs and/or adaptive Tregs). In aspects, upon activation, CD4+ regulatory T cells secrete immune suppressive cytokines and chemokines including but not limited to IL-10 and/or TGFβ. CD4+ Tregs may also exert immune suppressive effects through direct killing of target cells, characetized by the expression upon activation of effector molecules including but not limited to granzyme B and perforin.leads to the expression of the immune suppressive cytokines including, but not limited to, IL-10 and TGF-f3 and TNF-α. In aspects, upon activation, regulatory CD8+ T cells secrete immune suppressive cytokines and chemokines including but not limited to IFNγ, IL-10, and/or TGFβ. In aspects, CD8+ Tregs may also exert immune suppressive effects through direct killing of target cells, characetized by the expression upon activation of effector molecules including but not limited to granzyme B and/or perforin.

As used herein, the term “B-cell epitope” means a protein determinant capable of specific binding to an antibody. B-cell epitopes usually consist of chemically active surface groupings of molecules such as amino acids or sugar side chains and usually have specific three-dimensional structural characteristics, as well as specific charge characteristics. Conformational and non-conformational epitopes are distinguished in that the binding to the former but not the latter is lost in the presence of denaturing solvents.

The term “subject” as used herein refers to any living organism in which an immune response is elicited. The term subject includes, but is not limited to, humans, nonhuman primates such as chimpanzees and other apes and monkey species; farm animals such as cattle, sheep, pigs, goats and horses; domestic mammals such as dogs and cats; laboratory animals including rodents such as mice, rats and guinea pigs, and the like. The term does not denote a particular age or sex. Thus, adult and newborn subjects, as well as fetuses, whether male or female, are intended to be covered.

As used herein, the terms “the major histocompatibility complex (MHC)”, “MHC molecules”, “MHC proteins” or “HLA proteins” are to be understood as meaning, in particular, proteins capable of binding peptides resulting from the proteolytic cleavage of protein antigens and representing potential T-cell epitopes, transporting them to the cell surface and presenting them there to specific cells, in particular cytotoxic T-lymphocytes or T-helper cells. The major histocompatibility complex in the genome comprises the genetic region whose gene products expressed on the cell surface are important for binding and presenting endogenous and/or foreign antigens and thus for regulating immunological processes. The major histocompatibility complex is classified into two gene groups coding for different proteins, namely molecules of MEW class I and molecules of MHC class II. The molecules of the two MHC classes are specialized for different antigen sources. The molecules of MHC class I present endogenously synthesized antigens, for example viral proteins and tumor antigens. The molecules of MHC class II present protein antigens originating from exogenous sources, for example bacterial products. The cellular biology and the expression patterns of the two MHC classes are adapted to these different roles. MHC molecules of class I consist of a heavy chain and a light chain and are capable of binding a peptide of about 8 to 11 amino acids, but usually 9 or 10 amino acids, if this peptide has suitable binding motifs, and presenting it to cytotoxic T-lymphocytes. The peptide bound by the MHC molecules of class I originates from an endogenous protein antigen. The heavy chain of the MHC molecules of class I is preferably an HLA-A, HLA-B or HLA-C monomer, and the light chain is β-2-microglobulin. MHC molecules of class II consist of an ≢-chain and a β-chain and are capable of binding a peptide of about 12 to 25 amino acids if this peptide has suitable binding motifs, and presenting it to T-helper cells. The peptide bound by the

MHC molecules of class II usually originates from an extracellular of exogenous protein antigen. The ≢-chain and the (3-chain are in particular HLA-DR, HLA-DQ and HLA-DP monomers.

As used herein, the term “MHC complex” refers to a protein complex capable of binding with a specific repertoire of polypeptides known as HLA ligands and transporting said ligands to the cell surface.

As used herein, the term “MHC Ligand” means a polypeptide capable of binding to one or more specific MHC alleles. The term “HLA ligand” is interchangeable with the term “MHC Ligand”. Cells expressing MHC/Ligand complexes on their surface are referred to as “Antigen Presenting Cells” (APCs). Similarly, as used herein, the term “MHC binding peptide” relates to a peptide which binds to an MHC class I and/or an MHC class II molecule. In the case of MHC class I/peptide complexes, the binding peptides are typically 8-10 amino acids long although longer or shorter peptides may be effective. In the case of MHC class II/peptide complexes, the binding peptides are typically 10-25 amino acids long and are in particular 13-18 amino acids long, whereas longer and shorter peptides may also be effective.

As used herein, the term “T Cell Receptor” or “TCR” refers to a protein complex expressed by T cells that is capable of engaging a specific repertoire of MHC/Ligand complexes as presented on the surface of cells, such as antigen presenting cells (APCs).

As used herein, the term “MHC Binding Motif” refers to a pattern of amino acids in a protein sequence that predicts binding to a particular MHC allele.

As used herein, the term “AAY cleavage motif” refers to the short amino acid motif consisting of the sequence “alanine-alanine-tyrosine” capable of promoting proteasome-mediated cleavage of a peptide or protein, promoting the binding of the transporter associated with antigen processing to a peptide or protein, and/or increasing proteasome degradation at specific sites within a peptide or protein.

As used herein, the term “immune synapse” means the protein complex formed by the simultaneous engagement of a given T cell epitope to both a cell surface MHC complex and TCR.

The term “polypeptide” refers to a polymer of amino acids, and not to a specific length; thus, peptides, oligopeptides and proteins are included within the definition of a polypeptide. As used herein, a polypeptide is said to be “isolated” or “purified” when it is substantially free of cellular material when it is isolated from recombinant and non-recombinant cells, or free of chemical precursors or other chemicals when it is chemically synthesized. A peptide or polypeptide (e.g., a polypeptide comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 or variants and fragments thereof, which in aspects may be isolated, synthetic, or recombinant) of the present disclosure, however, can be joined to, linked to, or inserted into another polypeptide (e.g., a heterologous polypeptide) with which it is not normally associated in a cell and still be “isolated” or “purified.” Additionally, one or more T-cell epitopes of the present disclosure can be joined to, linked to, or inserted into another polypeptide wherein said one or more T-cell epitopes of the present disclosure is not naturally included in the polypeptide and/or said one or more T-cell epitopes of the present disclosure is not located at its natural position in the polypeptide. When a polypeptide is recombinantly produced, it can also be substantially free of culture medium, for example, culture medium represents less than about 20%, less than about 10%, or less than about 5% of the volume of the polypeptide preparation.

As used herein, a “concatemeric” peptide or polypeptide refers to a series of at least two peptides or polypeptides linked together. Such linkages may form of string-of-beads design. In aspects, concatemeric polypeptides of the instant disclosure include concatemeric polypeptides comprising, consisting of, or consisting essentially of one or more of 1677-1692, 2593-2604, 2639-2646, and 2719-2734, and/or fragments and variants thereof (which in aspects may be isolated, synthetic, and/or recombinant). In aspects, earch of the peptides or polypeptides of concatermeric polypeptide may optionally be spaced by one or more linkers, and in further aspects neutral linkers. The term “linker” refers to a peptide added between two peptide domains such as epitopes or vaccine sequences to connect said peptide domains. In aspects, a linker sequence is used to reduce steric hindrance between each one or more identified peptides of the instant disclosure, is well translated, and supports or allows processing of the each one or more identified polypeptides of the instant disclosure. In aspects, the linker should have little or no immunogenic sequence elements. In aspects, each peptide or polypeptide of the concatemeric polypeptide may optionally have one or more linkers, which may optionally be cleavage sensitive sites, adjacent to their N and/or C terminal end. In such a concatemeric peptide, two or more of the peptides may have a cleavage sensitive site between them. Alternatively two or more of the peptides may be connected directly to one another or through a linker that is not a cleavage sensitive site.

As used herein, the term “pharmaceutically acceptable” refers to approved or approvable by a regulatory agency of the Federal or a state government or listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use in animals, including humans.

As used herein, the term “pharmaceutically acceptable excipient, carrier, or diluent” or the like refer to an excipient, carrier, or diluent that can be administered to a subject, together with an agent, and which does not destroy the pharmacological activity thereof and is nontoxic when administered in doses sufficient to deliver a therapeutic amount of the agent.

As used herein, the term “purpose built computer program” refers to a computer program designed to fulfill a specific purpose; typically to analyze a specific set of raw data and answer a specific scientific question.

As used herein, the term “z-score” indicates how many standard deviations an element is from the mean. A z-score can be calculated from the following formula: z=(X−μ)/σ; where z is the z-score, X is the value of the element, μ is the population mean, and σ is the standard deviation.

As used herein, the singular forms “a,” “an,” and “the” are intended to include the plural forms, including “at least one,” unless the content clearly indicates otherwise. “Or” means “and/or.” As used herein, the term “and/or” and “one or more” includes any and all combinations of the associated listed items. For example, the term “one or more” with respect to the “one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 of the present disclosure” includes any and all combinations of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692. The term “or a combination thereof” means a combination including at least one of the foregoing elements.

A “variant” peptide or polypeptide (including a variant T-cell epitope) can differ in amino acid sequence by one or more substitutions, deletions, insertions, inversions, fusions, and truncations or a combination of any of these. In aspects, a variant peptide or polypeptide (including a variant T-cell epitope) can differ in amino acid sequence by one or more substitutions, deletions, insertions, inversions, fusions, and truncations or a combination of any of these provided said variants retain MHC binding propensity and/or TCR specificity, and/or SARS-CoV-2 activity.

The present disclosure also includes fragments of the peptide or polypeptides of the invention. The disclosure also encompasses fragments of the variants of the T-cell epitopes described herein, provided said fragments and/or variants at least in part retain MHC binding propensity and/or TCR specificity, and/or retain anti-SARS-CoV-2 activity.

The present disclosure also provides chimeric or fusion polypeptides (which in aspects may be isolated, synthetic, or recombinant) wherein one or more of the instantly-disclosed peptides, polypeptides, or concatemeric peptides is a part thereof. In aspects, a chimeric or fusion polypeptide composition comprises one or more peptides, polypeptides, or concatemeric peptides of the instant disclosure linked to a heterologous polypeptide. As previously stated, the term “heterologous polypeptide” is intended to mean that the one or more T-cell epitopes (e.g., one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) are heterologous to, or not included naturally, in the heterologous polypeptide. In aspects, the one or more peptides, polypeptides, or concatemeric peptides of the instant disclosure may be inserted into the heterologous polypeptide (e.g., through mutagenesis or other known means in the art), may be added to the C-terminus (with or without the use of linkers, as is known in the art), and/or added to the N-terminus (with or without the use of linkers, as is known in the art) of the heterologous polypeptide. For example, protein engineering by mutagenesis can be performed using site-directed mutagenesis techniques, or other mutagenesis techniques known in the art (see e.g., James A. Brannigan and Anthony J. Wilkinson., 2002, Protein engineering 20 years on. Nature Reviews Molecular Cell Biology 3, 964-970; Turanli-Yildiz B. et al., 2012, Protein Engineering Methods and Applications, intechopen.com, which are herein incorporated by reference in their entirety).

In aspects, chimeric or fusion polypeptides comprise one or more peptides, polypeptides, or concatemeric peptides of the instant disclosure operatively linked to a heterologous polypeptide. “Operatively linked” indicates that the polypeptide (e.g., the one or more T-cell epitope polypeptides of the present disclosure) and the heterologous protein are fused in-frame or chemically-linked or otherwise bound. In aspects, the instantly-disclosed chimeric or fusion polypeptides may be isolated, synthetic, or recombinant

An “isolated” peptide, polypeptide, concatemeric peptide (e.g., an isolated T-cell activating T-cell epitope or T-cell epitope polypeptide), or chimeric or fusion polypeptide can be purified from cells that naturally express it, purified from cells that have been altered to express it (recombinant), or synthesized using known protein synthesis methods. In one embodiment, a peptide, polypeptide, or concatemeric peptide is produced by recombinant DNA or RNA techniques. For example, a nucleic acid molecule encoding the peptide, polypeptide, concatemeric peptide, or chimeric or fusion polypeptide is cloned into an expression vector, the expression vector introduced into a host cell and the peptide, polypeptide, concatemeric peptide, or chimeric or fusion polypeptide is expressed in the host cell. The peptide, polypeptide, concatemeric peptide, or chimeric or fusion polypeptide can then be isolated from the cells by an appropriate purification scheme using standard protein purification techniques.

For the purposes of the present disclosure, peptides, polypeptides, concatemeric peptides, or chimeric or fusion polypeptides of the instant disclosure can include, for example, modified forms of naturally occurring amino acids such as D-stereoisomers, non-naturally occurring amino acids; amino acid analogs; and mimetics. Further, in aspects, peptides, polypeptides, concatemeric peptides, or chimeric or fusion polypeptides of the instant disclosure can include retro-inverso peptides of the instantly disclosed peptides, polypeptides, concatemeric peptides, or chimeric or fusion polypeptides of the instant disclosure, provided said peptides, polypeptides, concatemeric peptides, or chimeric or fusion polypeptides of the instant disclosure at least in part retain WIC binding propensity and/or TCR specificity, and/or retain anti-SARS-CoV-2 activity.

Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, the preferred methods and materials are described. Other features, objects, and advantages of the present disclosure will be apparent from the description and the claims. In the specification and the appended claims, the singular forms include plural referents unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. All references cited herein are incorporated herein by reference in their entirety and for all purposes to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference in its entirety for all purposes.

Polypeptides, Concatemeric Polypeptides, and Chimeric or Fusion Polypeptides

In aspects, the present disclosure provides a novel class of T-cell epitopes (which may be isolated, synthetic, or recombinant), which comprise a peptide or polypeptide chain derived from SARS-CoV-2 proteins (e.g., encoded proteins from a SARS-CoV-2 genome), including the envelope, membrane, spike, nucleocapsid, ORF3a, ORF6, ORF7a, ORFS, ORF10, ORF1ab non-structural protein 2 (NSP2), ORF1ab non-structural protein 3 (NSP3), ORF1ab non-structural protein 4 (NSP4), ORF1ab 3C-like proteinase, ORF1ab non-structural protein 6 (NSP6), ORF1ab non-structural protein 7 (NSP7), ORF1ab non-structural protein 8 (NSP8), ORF1ab non-structural protein 9 (NSP9), ORF1 ab non-structural protein 10 (NSP10), ORF1ab RNA-dependent RNA polymerase, ORF1ab helicase, ORF1ab 3′-5′ exonuclease, ORF1ab endoRNase, and ORF1ab 2′O-ribose methyltransferase proteins of SARS-CoV-2. As explained in more detail in the Examples, T-cell epitopes of the present disclosure are highly conserved among known variants of their source proteins, and SARS-CoV-2 (taxid: 2697049), SARS-CoV-1 (taxid: 694009), MERS-CoV (taxid: 1335626), and human CoV (taxids: 11137, 443239, 277944 and 31631) antigen sequences isolated from human hosts were obtained from GenBank at the National Center for Biotechnology Information. SARS-CoV-2 epitopes were compared across sequences obtained from isolates with fully sequenced genomes isolated from December 2019 to December 2020 for T cell epitope mapping. SARS-CoV-2 Wuhan-Hu-1 (GenBank id: MN908947) was selected as the reference strain.

As further described in the Examples, T-cell epitopes of the present disclosure comprise at least one putative T cell epitope as identified by EpiMatrix™ analysis. EpiMatrix™ is a proprietary computer algorithm developed by EpiVax (Providence, R.I.), which is used to screen protein sequences for the presence of putative T cell epitopes. The algorithm uses matrices for prediction of 9- and 10-mer peptides binding to MHC molecules. Each matrix is based on position-specific coefficients related to amino acid binding affinities that are elucidated by a method similar to, but not identical to, the pocket profile method (Sturniolo, T. et al., Nat. Biotechnol., 17:555-561, 1999). Input sequences are, for example, parsed into overlapping 9-mer frames or 10-mer where each frame overlaps the last by 8 or 9 amino acids, respectively. Each of the resulting frames form the mutated peptide and the non-mutated peptide are then scored for predicted binding affinity with respect to MHC class I alleles (e.g., but not limited to, HLA-A and HLA-B alleles) and MHC class II alleles (e.g., but not limited to HLA-DRB1 alleles). Raw scores are normalized against the scores of a large sample of randomly generated peptides. The resulting “Z” scores are normally distributed and directly comparable across alleles. The resulting “Z” score is reported. In aspects, any 9-mer or 10-mer peptide with an allele-specific EpiMatrix™ Z-score in excess of 1.64, theoretically the top 5% of any given sample, is considered a putative T cell epitope.

As also further described in the Examples, peptides containing clusters of putative T cell epitopes are more likely to test positive in validating in vitro and in vivo assays. In aspects, the results of the initial EpiMatrix™ analysis are further screened for the presence of putative T cell epitope “clusters” using a second proprietary algorithm known as Clustimer™ algorithm. The Clustimer™ algorithm identifies sub-regions contained within any given amino acid sequence that contains a statistically unusually high number of putative T cell epitopes. Typical T-cell epitope “clusters” range from about 9 to roughly 30 amino acids in length and, considering their affinity to multiple alleles and across multiple 9-mer frames, can contain anywhere from about 4 to about 40 putative T cell epitopes. FIG. 1 is an overview of MHC class II cluster selection from the envelope (SEQ ID NO: 1) of SARS-CoV-2. FIG. 5 is an overview of MHC class II cluster selection from the membrane (SEQ ID NO: 2) of SARS-CoV-2. FIG. 16 is an overview of MHC class II cluster selection from the spike (SEQ ID NO: 3) of SARS-CoV-2. Each epitope cluster identified an aggregate EpiMatrix™ score is calculated by summing the scores of the putative T cell epitopes and subtracting a correcting factor based on the length of the candidate epitope cluster and the expected score of a randomly generated cluster of the same length. EpiMatrix™ cluster scores in excess of +10 are considered significant. In aspects, the T-cell epitopes of the instant disclosure contain several putative T-cell epitopes forming a pattern known as a T-cell epitope cluster.

FIGS. 2-4 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2and relate to SEQ ID NOS: 4-68 and 1003-1005. FIGS. 6-15 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2 and relate to SEQ ID NOS: 69-209, 1006-1015, 2255, 2561, and 8691, and 8692. FIGS. 17-55 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2 and relate to SEQ ID NOS: 210-707 and 1016-1054. FIG. 59 is an EpiMatrix staircase report for identified MHC class I clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2 and relates to SEQ ID NOS: 708-739. FIG. 60 is an EpiMatrix staircase report for identified MHC class I clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2 and relates to SEQ ID NOS: 740-851. FIG. 61 is an EpiMatrix staircase report for identified MHC class I clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2 and relates to SEQ ID NOS: 852-1002. FIGS. 71-92 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the SARS-CoV-2 peptides listed in FIG. 70 and relate to SEQ ID NOS: 1713-2010. FIGS. 93-101 are EpiMatrix staircase reports for identified MHC class I clusters of the SARS-CoV-2 peptides identified in FIG. 70 and relate to SEQ ID NOS: 1713-2158. FIGS. 102-110 are EpiMatrix staircase reports for identified MHC class II clusters of the SARS-CoV-2 peptides identified in FIG. 70 and relate to SEQ ID NOS: 1713-2158. FIGS. 111-119 are EpiMatrix staircase reports for identified MHC class I 10mer clusters of the SARS-CoV-2 peptides identified in FIG. 70 and relate to SEQ ID NOS: 2159-2569.

Putative T-cell epitopes were also screened for cross-conservation with the human proteome using JanusMatrix, as further described in more detail in the Examples. The JanusMatrix system (EpiVax, Providence, R.I.) useful for screening peptide sequences for cross-conservation with a host proteome. JanusMatrix is an algorithm that predicts the potential for cross-reactivity between peptide clusters and the host genome or proteome, based on conservation of TCR-facing residues in their putative MHC ligands. The JanusMatrix algorithm first considers all the predicted epitopes contained within a given protein sequence and divides each predicted epitope into its constituent agretope and epitope. Each sequence is then screened against a database of host proteins. Peptides with a compatible MHC-facing agretope (i.e., the agretopes of both the input peptide and its host counterparty are predicted to bind the same MHC allele) and exactly the same TCR-facing epitope are returned. The JanusMatrix Homology Score suggests a bias towards immune tolerance. In the case of a therapeutic protein, cross-conservation between autologous human epitopes and epitopes in the therapeutic may increase the likelihood that such a candidate will be tolerated by the human immune system. In the case of a vaccine, cross-conservation between human epitopes and the antigenic epitopes may indicate that such a candidate utilizes immune camouflage, thereby evading the immune response and making for an ineffective vaccine. When the host is, for example, a human, the peptide clusters are screened against human genomes and proteomes, based on conservation of TCR-facing residues in their putative HLA ligands. The peptides are then scored using the JanusMatrix Homology Score. In aspects, peptides with a JanusMatrix Homology Score below 2.5 or below 3.0 (and even below 2.0 in certain apsects) indicate low tolerogenicity potential and may be useful for pharmaceutical formulations and vaccines for the treatment/prevention of SARS-CoV-2 infection and related diseases caused by SARS-CoV-2, including COVID-19, and in aspects may be included from the T cell epitope compositions and methods of the present disclosure. In aspects, peptides with a JanusMatrix Homology Score above 3.0 indicate high tolerogenicity potential and may not be useful for pharmaceutical formulations and vaccines for the treatment/prevention of SARS-CoV-2 infection and related diseases caused by SARS-CoV-2, including COVID-19, and in aspects may be excluded from the T cell epitope compositions and methods of the present disclosure. In specific aspects, one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, and/or 1675 may be excluded from the T cell epitope compounds and compositions, methods, and assays/kitsof the present disclosure. FIG. 56 is the JanusMatrix reports for identified WIC class II clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2. FIG. 57 is the JanusMatrix reports for identified WIC class II clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2. FIG. 58 is the JanusMatrix reports for identified WIC class II clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2.

In aspects, and as also described in further detail in the Examples, T-cell epitopes of the present disclosure are highly conserved among related coronaviruses that infect humans including highly pathogenic SARS-CoV and MERS-CoV and low pathogenicity common cold coronaviruses (CCCs) OC43, HKU1, NL63, and 229E. Prior exposure to these viruses may have established T cell memory that can be recalled upon SARS-CoV-2 infection and vaccination. As well, SARS-CoV-2 infection and vaccination establishes T cell memory that can influence responses in future infections to these viruses or yet-to-emerge coronaviruses. To identify potentially cross-reactive sequences, we screened the TCR-face of select SARS-CoV-2 epitopes for homology with coronaviruses that infect humans, using the JanusMatrix algorithm (e.g., Table 58A and Table 58B). All of the selected membrane and envelope sequences shared identical TCR-face patterns with SARS-CoV. Half the selected spike clusters were unique to SARS-CoV-2 and the other half are conserved with SARS-CoV. Only three selected clusters were cross-conserved outside SARS viruses. Given reports of pre-existing T cell immunity in people with no SARS-CoV-2 experience, we relaxed the requirement for 100% identity at every TCR-face position. Fixing two positions shown to be extensively involved in TCR interactions (positions 5 and 8), JanusMatrix predicted an expanded cross-conservation landscape for selected SARS-CoV-2 spike and membrane clusters. Most selected spike clusters were conserved, by these criteria, in the subset of coronaviruses that infect humans. The remainder of the selected sequences with cross-reactivity potential are cross-conserved among highly pathogenic beta-coronaviruses or among high and low pathogenicity beta-coronaviruses. Only three clusters were unique to SARS-CoV-2 by the cristeria described above, and none are solely conserved with SARS-CoV. The single membrane selected clusters was cross-conserved in the highly pathogenic beta-coronaviruses and coronaviruses that infect humans subsets. As the vast majority of people were not exposed to SARS-CoV and MERS-CoV, we also explored cross-conservation between SARS-CoV-2 and CCCs only. Of the 32 selected peptides, only 2 are SARS-CoV-2-specific. Eighteen clusters (56.6%) are cross-conserved across OC43, HKU1, NL63, and 229E and 12 (37.5%) are cross-conserved in at least one of these four CCCs.

In aspects, T-cell epitopes of the present disclosure bind to at least one and preferably two or more common HLA class I and/or class II alleles with at least a moderate affinity (e.g., in aspects, <1000 μM IC₅₀, <500 μM IC₅₀, <400 μM IC₅₀, <300 μM IC₅₀, or <200 μM IC₅₀in HLA binding assays based on soluble HLA molecules). In aspects, T-cell epitopes of the present disclosure are capable of being presented at the cell surface by cells in the context of at least one and, in other aspects, two or more alleles of the HLA. In this context, the epitope-HLA complex can be recognized by CD4+ and/or CD8+ T-cells having TCRs that are specific for the epitope-HLA complex and circulating in subjects. In aspects, the recognition of the epitope-HLA complex can cause the matching T-cell to be activated and to secrete activating cytokines (e.g., effector cytokines such as IFNγ) and chemokines.

In aspects, a T-cell epitope compounds or compositions of the present disclosure includes one or more peptides or polypeptides a disclosed herein. In aspects, the present disclosure is directed to a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, or fragments or variants thereof. The phrase “consisting essentially of” is intended to mean that a peptide or polypeptide according to the present disclosure, in addition to the sequence according to any of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 or a fragment or variant thereof, contains additional amino acids or residues that may be present at either terminus of the peptide and/or on a side chain that are not necessarily forming part of the peptide or polypeptide that functions as an MHC ligand and provided they do not substantially impair the activity of the peptide to function as a T-cell epitope. In aspects, the peptides or polypeptides of the instant disclosure can be either in neutral (uncharged) or salt forms, and may be either free of or include modifications such as glycosylation, side chain oxidation, or phosphorylation. In aspects, the peptides or polypeptides of the instant disclosure can be capped with an n-terminal acetyl and/or c-terminal amino group. In aspects, peptides or polypeptides of the instant disclosure having SEQ ID NOS: 1055, 1058-1060, 1062, 1065-1066, 1069-1072, 1075, 1078, 1081-1087, 1089, 1091-1094, 1100, 1106, 1110-1113, 1115 and 1366, 1369-1371, 1373, 1376-1377, 1379-1383, 1385-1386, 1389, 1391-1400, 1402-1404, 1407, 1411-1419, 1421-1424, 1426-1427, 1118-1365, and 1429-1676 are capped with an n-terminal acetyl and a c-terminal amino group. In aspects, peptides or polypeptides of the instant disclosure having SEQ ID NOS: 1068, 1074, 1080, 1088, 1096, 1101-1105, 1107-1108, and 1116 are capped with an n-terminal acetyl and are not capped at the c-terminus. Table 1 describes the cluster address and gives the location of the peptide within exemplary selected sequences that were provided for analysis. Table 1 describes the core peptide (middle amino acids in bold, SEQ ID NO: in parentheses) which defines the actual cluster that was identified during the analysis. The stabilizing flanks (N-terminal and C-terminal, not bold) are included for use with the core sequence in cetain aspects, with the full sequence of the cluster and flanks being labeled in Table 1 by the SEQ ID NO: not listed in parentheses.

TABLE 1

Sequence

(cluster sequence

is bolded,

Se-
while flanks are

EpiMatrix
EpiMatrix
Number

quence
not bolded,

Hits
Cluster
of

SEQ

Address
mutations
Class
(Class II
Score
Human
Janus

ID
Input
(w/
from WT are
I
w/o
(w/o
Se-
Homology

NO:
Sequence
Flanks)
italicized)
Hits
Flanks)
Flanks)
quences
Score

1055
ENVELOPE
0001-0015
MYS:FVSEETGTL:IVN
13
5
9.73
2
1.50

(1366)
(SEQ ID NO: 1)

1056
ENVELOPE
0007-0024
EET:GTLIVNSVLLFL:AFV
11
15
26.56
22
2.72

(1367)
(SEQ ID NO: 1)

1057
ENVELOPE
0014-0036
VNS:VLLFLAFVVFLLVTL:A
26
22
40.15
46
3.48

(1368)
(SEQ ID NO: 1)

1058
ENVELOPE
0054-0069
PSF:YVYSRVKNLNS:SR
7
16
32.67
13
0.90

(1369)
(SEQ ID NO: 1)

1059
ENVELOPE
0060-0072
S:RVKNLNSSRV:PD
1
11
23.21
15
3.00

(1370)
(SEQ ID NO: 1)

1060
MEMBRANE
0013-0027
LKK:LLEQWNLVI:GFL
9
4
7.03
6
1.40

(1371)
(SEQ ID NO: 2)

1061
MEMBRANE
0017-0031
LEQ:WNLVIGFLF:LTW
17
4
8.76
3
1.00

(1372)
(SEQ ID NO: 2)

1062
MEMBRANE
0032-0048
ISL:LQFAYANRNRF:LYI
9
14
26.89
2
0.73

(1373)
(SEQ ID NO: 2)

1063
MEMBRANE
0042-0060
RNR:FLYIIKLIFLWLL:WPV
29
15
27.53
8
1.56

(1374)
(SEQ ID NO: 2)

1064
MEMBRANE
0062-0076
LAC:FVLAAVYRI:NWI
4
9
19.44
3
0.80

(1375)
(SEQ ID NO: 2)

1065
MEMBRANE
0093-0107
LSY:FIASFRLFA:RTR
10
8
18.15
5
0.56

(1376)
(SEQ ID NO: 2)

1066
MEMBRANE
0097-0113
IAS:FRLFARTRSMWS:FN
11
13
23.84
5
0.80

(1377)
(SEQ ID NO: 2)

1067
MEMBRANE
0109-0122
MWS:FNPETNILL:NV
7
4
8.21
2
1.25

(1378)
(SEQ ID NO: 2)

1068
MEMBRANE
0115-0129
ETN:ILLNVPLHG:TIL
3
8
16.48
9
2.45

(1379)
(SEQ ID NO: 2)

1069
MEMBRANE
0146-0160
RGH:LRIAGHHLG:RSD
2
6
10.96
1
0.14

(1380)
(SEQ ID NO: 2)

1070
MEMBRANE
0165-0179
PKE:ITVATSRTL:SYY
11
5
9.81
1
0.14

(1381)
(SEQ ID NO: 2)

1071
MEMBRANE
0175-0190
TLS:YYKLGASQRV:AGD
4
9
17.64
8
1.83

(1382)
(SEQ ID NO: 2)

1072
MEMBRANE
0201-0217
IGN:YKLNTDHSSSS:DNI
5
8
14.38
4
0.78

(1383)
(SEQ ID NO: 2)

1073
SPIKE
0001-0016
M:FVFLVLLPLVSS:QCV
10
26
53.39
63
4.03

(1384)
(SEQ ID NO:

3)

1074
SPIKE
0025-0038
PPA:YTNSFTRGV:YY
10
6
12.80
1
0.17

(1385)
(SEQ ID NO:

3)

1075
SPIKE
0040-0054
DKV:FRSSVLHST:QDL
3
5
9.51
11
1.92

(1386)
(SEQ ID NO:

3)

1076
SPIKE
0052-0066
QDL:FLPFFSNVT:WFH
14
7
14.42
2
0.22

(1387)
(SEQ ID NO:

3)

1077
SPIKE
0061-0076
NVT:WFHAIHVSGT:NGT
2
9
17.97
2
0.54

(1388)
(SEQ ID NO:

3)

1078
SPIKE
0087-0102
NEG:VYFASTEKSNI:IR
4
7
10.83
2
0.29

(1389)
(SEQ ID NO:

3)

1079
SPIKE
0114-0130
TQS:LLIVNNATNVVI:KV
5
20
36.68
9
0.52

(1390)
(SEQ ID NO:

3)

1080
SPIKE
0140-0156
FLG:VYYHKNNKSWM:ES
6
11
20.72
0
0.00

(1391)
(SEQ ID NO:

E

3)

1081
SPIKE
0154-0171
ESE:FRVYSSANNCTF:EYV
17
11
18.25
1
0.09

(1392)
(SEQ ID NO:

3)

1082
SPIKE
0198-0212
DGY:FKIYSKHTP:INL
8
3
5.48
1
0.08

(1393)
(SEQ ID NO:

3)

1083
SPIKE
0207-0221
HTP:INLVRDLPQ:GFS
3
5
9.53
17
4.25

(1394)
(SEQ ID NO:

3)

1084
SPIKE
0232-0248
GIN:ITRFQTLLALHR:SY
10
16
31.53
11
1.47

(1395)
(SEQ ID NO:

3)

1085
SPIKE
0239-0255
QT:LLALHRSYLTPG:DSS
10
14
25.33
7
0.53

(1396)
(SEQ ID NO:

3)

1086
SPIKE
0272-0288
PRT:FLLKYNENGTI:TDA
2
11
18.68
3
0.31

(1397)
(SEQ ID NO:

3)

1087
SPIKE
0315-0329
TSN:FRVQPTESI:VRF
1
6
10.62
0
0.00

(1398)
(SEQ ID NO:

3)

1088
SPIKE
0338-0352
FGE:VFNATRFAS:VYA
6
4
6.48
3
0.38

(1399)
(SEQ ID NO:

3)

1089
SPIKE
0347-0364
FAS:VYAWNRKRISNC:VA
5
7
11.13
3
0.63

(1400)
(SEQ ID NO:

D

3)

1090
SPIKE
0362-0378
VAD:YSVLYNSASFST:FK
14
18
33.15
11
0.95

(1401)
(SEQ ID NO:

3)

1091
SPIKE
0446-0468
GGN:YNYLYRLFRKSNLKP
14
31
58.68
14
1.10

(1402)
(SEQ ID NO:

FE:RDI

3)

1092
SPIKE
0483-0500
VEG:FNCYFPLQSYGF:QP
10
11
17.96
0
0.00

(1403)
(SEQ ID NO:

T

3)

1093
SPIKE
0492-0508
LQS:YGFQPTNGVGY:QPY
3
7
10.08
1
0.14

(1404)
(SEQ ID NO:

3)

1094
SPIKE
0508-0522
YRV:VVLSFELLH:APA
5
3
4.88
30
7.57

(1405)
(SEQ ID NO:

3)

1095
SPIKE
0512-0526
VLS:FELLHAPAT:VCG
4
7
14.88
4
0.75

(1406)
(SEQ ID NO:

3)

1096
SPIKE
0536-0549
NKS:VNFNFNGLT:GT
2
7
12.54
0
0.00

(1407)
(SEQ ID NO:

3)

1097
SPIKE
0540-0554
NFN:FNGLTGTGV:LTE
0
5
9.07
0
0.00

(1408)
(SEQ ID NO:

3)

1098
SPIKE
0715-0729
PTN:FTISVTTEI:LPV
8
6
12.63
0
0.00

(1409)
(SEQ ID NO:

3)

1099
SPIKE
0753-0774
LLQ:YGSFCTQLNRALTGI
10
18
27.90
8
0.50

(1410)
(SEQ ID NO:

A:VEQ

3)

1100
SPIKE
0797-0816
FGG:FNFSQILPDPSKPS:K
4
8
10.25
1
0.13

(1411)
(SEQ ID NO:

RS

3)

1101
SPIKE
0818-0831
IED:LLFNKVTLA:DA
3
4
7.44
13
2.22

(1412)
(SEQ ID NO:

3)

1102
SPIKE
0852-0869
AQK:FNGLTVLPPLLT:DE
4
10
15.82
8
1.64

(1413)
(SEQ ID NO:

M

3)

1103
SPIKE
0856-0869
NG:LTVLPPLLT:DEM
3
6
10.67
5
1.83

(1414)
(SEQ ID NO:

3)

1104
SPIKE
0866-0879
TDE:MIAQYTSALL:A
8
10
17.71
3
0.30

(1415)
(SEQ ID NO:

3)

1105
SPIKE
0895-0911
LQIP:FAMQMAYRFNG:IG
15
7
10.20
0
0.00

(1416)
(SEQ ID NO:

V

3)

1106
SPIKE
0920-0934
QKL:IANQFNSAI:GKI
7
5
8.58
2
0.29

(1417)
(SEQ ID NO:

3)

1107
SPIKE
0924-0938
ANQ:FNSAIGKIQ:DSL
2
7
14.98
5
2.00

(1418)
(SEQ ID NO:

3)

1108
SPIKE
0934-0949
IQD:SLSSTASALG:KLQ
6
7
11.21
17
3.11

(1419)
(SEQ ID NO:

3)

1109
SPIKE
0960-0974
NTL:VKQLSSNFG:AIS
5
8
17.90
9
1.40

(1420)
(SEQ ID NO:

3)

1110
SPIKE
0998-1012
TGR:LQSLQTYVT:QQL
3
6
12.24
1
0.14

(1421)
(SEQ ID NO:

3)

1111
SPIKE
1010-1024
QQL:IRAAEIRAS:ANL
2
4
6.75
6
1.40

(1422)
(SEQ ID NO:

3)

1112
SPIKE
1015-1029
AAE:IRASANLAA:TKM
6
8
18.74
14
3.40

(1423)
(SEQ ID NO:

3)

1113
SPIKE
1044-1058
GKG:YHLMSFPQS:APH
4
4
7.89
3
0.67

(1424)
(SEQ ID NO:

3)

1114
SPIKE
1127-1141
DV:VIGIVNNTVY:DPL
5
9
16.52
1
0.44

(1425)
(SEQ ID NO:

3)

1115
SPIKE
1152-1166
LDK:YFKNHTSPDV:DL
2
7
12.63
0
0.00

(1426)
(SEQ ID NO:

3)

1116
SPIKE
1169-1183
ISG:INASVVNIQ:KEI
4
6
12.24
5
1.00

(1427)
(SEQ ID NO:

3)

1117
SPIKE
1214-1228
WYI:WLGFIAGLI:AIV
9
7
14.24
12
2.42

(1428)
(SEQ ID NO:

3)

1118
MEMBRANE
0032-0048
ICL:LQFAYANRNRF:LYI
9
14
26.89
2
0.73

(1429)
(SEQ ID NO: 2)

1119
MEMBRANE
0146-0160
RGH:LRIAGHHLG:RCD
2
6
10.96
1
0.14

(1430)
(SEQ ID NO: 2)

1120
SPIKE
0087-0102
NDG:VYFASTEKSNI:IR
4
7
10.83
2
0.29

(1431)
(SEQ ID NO:

3)

1121
SPIKE
0347-0364
FAS:VYAWNRKRISNS:VA
5
7
11.13
3
0.63

(1432)
(SEQ ID NO:

D

3)

1122
SPIKE
0536-0549
NKC:VNFNFNGLT:GT
2
7
12.54
0
0.00

(1433)
(SEQ ID NO:

3)

1123
SPIKE
0895-0911
QIP:FAMQMAYRFNG:IG
15
7
10.20
0
0.00

(1434)
(SEQ ID NO:

V

3)

1124
NUCLEOCAPS
081-098
DDQ:IGYYRRATRRIR:GG
4
12
22.85
5
0.77

(1435)
ID

D

(SEQ ID NO:

1693)

1125
NUCLEOCAPS
154-168
NAA:IVLQLPQGT:TLP
2
6
10.05
8
4.00

(1436)
ID

(SEQ ID NO:

1693)

1126
NUCLEOCAPS
224-238
LDR:LNQLESKMS:GKG
3
7
14.25
8
0.80

(1437)
ID

(SEQ ID NO:

1693)

1127
NUCLEOCAPS
346-362
FKD:QVILLNKHIDA:YKT
7
12
20.41
3
0.58

(1438)
ID

(SEQ ID NO:

1693)

1128
NUCLEOCAPS
402-419
DFS:KQLQQSMSSADS:T
2
11
18.34
6
0.67

(1439)
ID

QA

(SEQ ID NO:

1693)

1129
ORF3A
001-015
MDL:FMRIFTIGT:VTL
13
6
12.25
3
0.33

(1440)
(SEQ ID NO:

1694)

1130
ORF3A
025-040
PSD:FVRATATIPI:QAS
7
11
19.68
4
0.91

(1441)
(SEQ ID NO:

1694)

1131
ORF3A
042-058
PFG:WLIVGVALLAV:FQS
1
13
24.28
24
4.23

(1442)
(SEQ ID NO:

1694)

1132
ORF3A
053-067
LA:VFQSASKIIT:LKK
6
13
26.09
0
0.00

(1443)
(SEQ ID NO:

1694)

1133
ORF3A
067-082
KR:WQLALSKGVHF:VCN
6
11
23.09
9
2.42

(1444)
(SEQ ID NO:

1694)

1134
ORF3A
074-106
SKG:VHFVCNLLLLFVTV
38
53
95.68
91
3.26

(1445)
(SEQ ID NO:

YSHLLLVAAGLEA:PFL

1694)

1135
ORF3A
102-119
EAP:FLYLYALVYFLQ:SIN
24
18
34.75
7
0.84

(1446)
(SEQ ID NO:

1694)

1136
ORF3A
109-126
YAL:VYFLQSINFVRI:IMR
15
16
28.23
3
0.30

(1447)
(SEQ ID NO:

1694)

1137
ORF3A
115-132
LQS:INFVRIIMRLWL:CW
16
11
17.21
3
0.27

(1448)
(SEQ ID NO:

K

1694)

1138
ORF3A
157-174
CIP:YNSVTSSIVITS:GDG
6
11
20.89
5
0.73

(1449)
(SEQ ID NO:

1694)

1139
ORF3A
208-223
TSD:YYQLYSTQLS:TDT
8
14
28.35
5
1.40

(1450)
(SEQ ID NO:

1694)

1140
ORF6
013-029
EIL:LIIMRTFKVSI:WNL
11
11
22.01
1
0.08

(1451)
(SEQ ID NO:

1695)

1141
ORF6
026-048
IWN:LDYIINLIIKNLSKSLT:
39
39
77.11
44
2.88

(1452)
(SEQ ID NO:

ENK

1695)

1142
ORF7A
001-016

MKHLFLALITLA:TCE
12
17
31.91
56
6.52

(1453)
(SEQ ID NO:

1696)

1143
ORF7A
068-086
PDG:VKHVYQLRARSVS:P
7
14
24.13
10
1.27

(1454)
(SEQ ID NO:

KL

1696)

1144
ORF7A
090-104
QEE:VQELYSPIF:LIV
10
5
8.27
1
0.80

(1455)
(SEQ ID NO:

1696)

1145
ORF7A
097-114
YSP:IFLIVAAIVFIT:LCF
12
23
41.55
19
2.09

(1456)
(SEQ ID NO:

1696)

1146
ORF8
001-017
MK:FLVFLGIITTVA:AFH
17
7
11.16
9
1.36

(1457)
(SEQ ID NO:

1697)

1147
ORF8
013-027
VAA:FHQECSLQS:CTQ
2
8
18.24
0
0.00

(1458)
(SEQ ID NO:

1697)

1148
ORF8
042-060
YSK:WYIRVGARKSAPL:IE
5
12
20.81
3
0.67

(1459)
(SEQ ID NO:

L

1697)

1149
ORF10
001-014
MG:YINVFAFPF:TIY
14
5
10.1
2
1.17

(1460)
(SEQ ID NO:

1698)

1150
ORF10
008-029
AFP:FTIYSLLLCRMNSRNY
22
13
17.72
14
1.07

(1461)
(SEQ ID NO:

:IAQ

1668)

1151
ORF1AB_
066-080
EKS:YELQTPFEI:KLA
5
4
6.37
1
0.50

(1462)
NSP2

(SEQ ID NO:

1699)

1152
ORF1AB_
072-088
QTP:FEIKLAKKFDT:FNG
7
10
18.15
8
1.10

(1463)
NSP2

(SEQ ID NO:

1699)

1153
ORF1AB_
083-097
FDT:FNGECPNFV:FPL
7
6
13.17
1
0.67

(1464)
NSP2

(SEQ ID NO:

1699)

1154
ORF1AB_
089-108
ECP:NFVFPLNSIIKTIQ:PR
9
12
18.19
3
1.33

(1465)
NSP2

V

(SEQ ID NO:

1699)

1155
ORF1AB_
117-131
MGR:IRSVYPVAS:PNE
2
9
18.76
1
0.56

(1466)
NSP2

(SEQ ID NO:

1699)

1156
ORF1AB_
204-218
LAE:YHNESGLKT:ILR
5
5
10.36
2
1.60

(1467)
NSP2

(SEQ ID NO:

1699)

1157
ORF1AB_
268-285
DNL:LEILQKEKVNIN:IVG
2
9
14.52
10
1.90

(1468)
NSP2

(SEQ ID NO:

1699)

1158
ORF1AB_
284-300
VGD:FKLNEEIAIIL:ASF
5
8
12.84
4
1.13

(1469)
NSP2

(SEQ ID NO:

1699)

1159
ORF1AB_
290-307
NEE:IAIILASFSASTS:AF
9
16
28.92
30
3.29

(1470)
NSP2

(SEQ ID NO:

1699)

1160
ORF1AB_
298-311
AS:FSASTSAFV:ETV
9
8
15.81
6
0.70

(1471)
NSP2

(SEQ ID NO:

1699)

1161
ORF1AB_
340-361
AWN:IGEQKSILSPLYAFA
9
9
10.10
3
0.23

(1472)
NSP2

S:EAA

(SEQ ID NO:

1699)

1162
ORF1AB_
361-375
ARV:VRSIFSRTL:ETA
5
7
15.58
2
0.75

(1473)
NSP2

(SEQ ID NO:

1699)

1163
ORF1AB_
376-394
QNS:VRVLQKAAITILD:GI
3
15
25.26
10
0.73

(1474)
NSP2

s

(SEQ ID NO:

1699)

1164
ORF1AB_
437-452
FGT:VYEKLKPVLD:WLE
6
6
10.63
8
2.00

(1475)
NSP2

(SEQ ID NO:

1699)

1165
ORF1AB_
465-482
GWE:IVKFISTCACEI:VGG
7
9
15.40
1
0.20

(1476)
NSP2

(SEQ ID NO:

1699)

1166
ORF1AB_
495-509
VQT:FFKLVNKFLA:LC
10
15
37.66
13
1.59

(1477)
NSP2

(SEQ ID NO:

1699)

1167
ORF1AB_
500-515
KL:VNKFLALCADSI:II
7
11
20.46
5
0.67

(1478)
NSP2

(SEQ ID NO:

1699)

1168
ORF1AB_
511-531
DSI:IIGGAKLKALNLGET:F
8
9
10.51
6
0.60

(1479)
NSP2

VT

(SEQ ID NO:

1699)

1169
ORF1AB_
616-630
TEK:YCALAPNMM:VTN
4
6
10.65
0
0.00

(1480)
NSP2

(SEQ ID NO:

1699)

1170
ORF1AB_NSP
009-025
DDT:VIEVQGYKSVNI:TF
6
7
10.17
1
0.43

(1481)
3

(SEQ ID NO:

1700)

1171
ORF1AB_NSP
203-217
QTI:EVNSFSGYLK:LT
9
4
6.16
3
0.6

(1482)
3

(SEQ ID NO:

1700)

1172
ORF1AB_NSP
211-225
SGY:LKLTDNVYI:KNA
7
5
8.14
1
0.5

(1483)
3

(SEQ ID NO:

1700)

1173
ORF1AB_NSP
236-251
PTV:VVNAANVYLK:HGG
4
9
16.93
1
0.3

(1484)
3

(SEQ ID NO:

1700)

1174
ORF1AB_NSP
254-268
AGA:LNKATNNAM:QVE
2
7
15.46
0
0

(1485)
3

(SEQ ID NO:

1700)

1175
ORF1AB_NSP
307-324
GED:IQLLKSAYENFN:QH
6
11
18.73
4
0.55

(1486)
3

E

(SEQ ID NO:

1700)

1176
ORF1AB_NSP
317-334
NQH:EVLLAPLLSAGI:FGA
8
11
17.99
81
13.17

(1487)
3

(SEQ ID NO:

1700)

1177
ORF1AB_NSP
362-376
KNL:YDKLVSSFL:EMK
8
7
14.87
3
0.5

(1488)
3

(SEQ ID NO:

1700)

1178
ORF1AB_NSP
369-383
VSS:FLEMKSEKQ:VEQ
5
8
17.97
9
2.22

(1489)
3

(SEQ ID NO:

1700)

1179
ORF1AB_NSP
427-442
ETK:FLTENLLLYI:DIN
13
4
6.31
15
3.14

(1490)
3

(SEQ ID NO:

1700)

1180
ORF1AB_NSP
432-452
TEN:LLLYIDINGNLHPDS:
10
11
14.83
3
0.64

(1491)
3

ATL

(SEQ ID NO:

1700)

1181
ORF1AB_NSP
457-472
DIT:FLKKDAPYIV:GDV
5
6
10.03
2
0.29

(1492)
3

(SEQ ID NO:

1700)

1182
ORF1AB_NSP
524-539
AKT:VLKKCKSAFY:ILP
5
8
14.57
11
2

(1493)
3

(SEQ ID NO:

1700)

1183
ORF1AB_NSP
532-546
KSA:FYILPSIIS:NEK
10
8
17.75
0
0

(1494)
3

(SEQ ID NO:

1700)

1184
ORF1AB_NSP
601-618
GAR:FYFYTSKTTVAS:LIN
12
14
24.21
6
0.73

(1495)
3

(SEQ ID NO:

1700)

1185
ORF1AB_NSP
642-659
EEA:ARYMRSLKVPAT:VS
9
15
28.80
5
0.38

(1496)
3

V

(SEQ ID NO:

1700)

1186
ORF1AB_NSP
665-683
VTA:YNGYLTSSSKTPE:EH
10
11
19.02
12
1.5

(1497)
3

F

(SEQ ID NO:

1700)

1187
ORF1AB_NSP
680-698
EEH:FIETISLAGSYKD:WS
10
11
17.23
9
1.09

(1498)
3

Y

(SEQ ID NO:

1700)

1188
ORF1AB_NSP
732-751
VIT:FDNLKTLLSLREVR:TI
9
16
29.14
21
1.95

(1499)
3

K

(SEQ ID NO:

1700)

1189
ORF1AB_NSP
742-755
LS:LREVRTIKV:FTT
7
6
11.09
0
0

(1500)
3

(SEQ ID NO:

1700)

1190
ORF1AB_NSP
783-800
GAD:VTKIKPHNSHEG:KT
2
8
11.79
1
0.13

(1501)
3

F

(SEQ ID NO:

1700)

1191
ORF1AB_NSP
822-839
PSF:LGRYMSALNHTK:K
18
9
13.90
2
0.33

(1502)
3

WK

(SEQ ID NO:

1700)

1192
ORF1AB_NSP
865-881
LQQ:IELKFNPPALQ:DAY
2
10
15.40
4
0.9

(1503)
3

(SEQ ID NO:

1700)

1193
ORF1AB_NSP
878-892
QDA:YYRARAGEA:ANF
0
8
14.97
3
0.44

(1504)
3

(SEQ ID NO:

1700)

1194
ORF1AB_NSP
892-907
FCA:LILAYCNKTV:GEL
8
8
15.65
2
0.2

(1505)
3

(SEQ ID NO:

1700)

1195
ORF1AB_NSP
907-922
TMS:YLFQHANLDS:CKR
7
10
19.45
5
2

(1506)
3

(SEQ ID NO:

1700)

1196
ORF1AB_NSP
947-963
VEA:VMYMGTLSYEQ:FK
18
8
12.44
1
0.13

(1507)
3

K

(SEQ ID NO:

1700)

1197
ORF1AB_NSP
976-999
TKY:LVQQESPFVM
18
19
31.56
4
0.25

(1508)
3

MSAPPAQY:ELK

(SEQ ID NO:

1700)

1198
ORF1AB_NSP
1015-1029
CGH:YKHITSKET:LYC
7
6
13.49
1
0.17

(1509)
3

(SEQ ID NO:

1700)

1199
ORF1AB_NSP
1046-1060
TDV:FYKENSYTT:TIK
9
6
11.90
2
0.29

(1510)
3

(SEQ ID NO:

1700)

1200
ORF1AB_NSP
1078-1094
LDN:YYKKDNSYFTE:QPI
6
8
14.43
1
0.38

(1511)
3

(SEQ ID NO:

1700)

1201
ORF1AB_NSP
1120-1140
ADD:LNQLTGYKKPASREL
8
13
19.75
3
0.62

(1512)
3

:KVT

(SEQ ID NO:

1700)

1202
ORF1AB_NSP
1151-1167
AID:YKHYTPSFKKGA:KL
13
5
6.73
0
0

(1513)
3

(SEQ ID NO:

1700)

1203
ORF1AB_NSP
1169-1187
HKP:IVWHVNNATNKAT:
4
13
21.58
3
0.46

(1514)
3

YKP

(SEQ ID NO:

1700)

1204
ORF1AB_NSP
1254-1270
VGD:IILKPANNSLK:ITE
7
18
33.29
10
2

(1515)
3

(SEQ ID NO:

1700)

1205
ORF1AB_NSP
1292-1308
PNE:LSRVLGLKTLAT:HG
2
12
21.59
23
3

(1516)
3

(SEQ ID NO:

1700)

1206
ORF1AB_NSP
1299-1312
LG:LKTLATHGL:AAV
6
4
7.29
2
0.33

(1517)
3

(SEQ ID NO:

1700)

1207
ORF1AB_NSP
1324-1342
AKP:FLNKVVSTTTNIV:TR
6
13
21.73
1
0.23

(1518)
3

C

(SEQ ID NO:

1700)

1208
ORF1AB_NSP
1340-1354
TRC:LNRVCTNYM:PYF
10
8
15.30
0
0

(1519)
3

(SEQ ID NO:

1700)

1209
ORF1AB_NSP
1350-1369
YMP:YFFTLLLQLCTFTR:ST
23
18
28.49
18
2.15

(1520)
3

N

(SEQ ID NO:

1700)

1210
ORF1AB_NSP
1361-1375
LCT:FTRSTNSRI:KAS
6
9
21.21
3
1.33

(1521)
3

(SEQ ID NO:

1700)

1211
ORF1AB_NSP
1369-1383
NSR:IKASMPTTI:AKN
5
6
13.33
0
0

(1522)
3

(SEQ ID NO:

1700)

1212
ORF1AB_NSP
1390-1411
KFC:LEASFNYLKSPNFSKL:
19
19
31.36
9
1.1

(1523)
3

INI

(SEQ ID NO:

1700)

1213
ORF1AB_NSP
1402-1416
SPN:FSKLINIII:WFL
12
9
19.04
4
0.36

(1524)
3

(SEQ ID NO:

1700)

1214
ORF1AB_NSP
1408-1428
LIN:IIIWFLLLSVCLGSL:IYS
32
27
54.71
65
6.13

(1525)
3

(SEQ ID NO:

1700)

1215
ORF1AB_NSP
1422-1446
LGS:LIYSTAALGVL
17
26
41.14
8
0.69

(1526)
3

MSNLGMPS:YCT

(SEQ ID NO:

1700)

1216
ORF1AB_NSP
1449-1464
REG:YLNSTNVTIA:TYC
10
9
18.54
3
0.89

(1527)
3

(SEQ ID NO:

1700)

1217
ORF1AB_NSP
1493-1507
ISS:FKWDLTAFG:LVA
7
5
10.55
1
0.17

(1528)
3

(SEQ ID NO:

1700)

1218
ORF1AB_NSP
1510-1526
FLA:YILFTRFFYVL:GLA
21
5
7.26
3
0.29

(1529)
3

(SEQ ID NO:

1700)

1219
ORF1AB_NSP
1517-1533
TR:FFYVLGLAAIMQ:LFF
11
19
34.54
17
1.6

(1530)
3

(SEQ ID NO:

1700)

1220
ORF1AB_NSP
1526-1544
AAI:MQLFFSYFAVHFI:SN
23
12
20.44
1
0.08

(1531)
3

S

(SEQ ID NO:

1700)

1221
ORF1AB_NSP
1535-1551
YFA:VHFISNSWLMW:LII
24
8
14.00
0
0

(1532)
3

(SEQ ID NO:

1700)

1222
ORF1AB_NSP
1543-1565
NSW:LMWLIINLV
22
23
37.35
6
0.56

(1533)
3

QMAPISAM:VRM

(SEQ ID NO:

1700)

1223
ORF1AB_NSP
1556-1579
MAP:ISAMVRMY
48
16
22.07
2
0.25

(1534)
3

IFFASFYYVW:KSY

(SEQ ID NO:

1700)

1224
ORF1AB_NSP
1569-1587
FAS:FYYVWKSYVHVVD:
16
9
13.35
0
0

(1535)
3

GCN

(SEQ ID NO:

1700)

1225
ORF1AB_NSP
1592-1606
MMC:YKRNRATRV:ECT
2
9
20.01
2
0.33

(1536)
3

(SEQ ID NO:

1700)

1226
ORF1AB_NSP
1606-1624
TTI:VNGVRRSFYVYAN:G
8
4
4.21
4
1

(1537)
3

GK

(SEQ ID NO:

1700)

1227
ORF1AB_NSP
1614-1628
RSF:YVYANGGKG:FCK
4
6
11.79
1
0.17

(1538)
3

(SEQ ID NO:

1700)

1228
ORF1AB_NSP
1648-1670
SDE:VARDLSLQFKRPINP
7
10
12.12
5
0.6

(1539)
3

TD:QSS

(SEQ ID NO:

1700)

1229
ORF1AB_NSP
1700-1720
LSH:FVNLDNLRANNTKG
3
23
39.96
3
0.21

(1540)
3

S:LPI

(SEQ ID NO:

1700)

1230
ORF1AB_NSP
1743-1757
QPI:LLLDQALVS:DVG
5
4
8.56
23
6.86

(1541)
3

(SEQ ID NO:

1700)

1231
ORF1AB_NSP
1770-1791
VKM:FDAYVNTFS
22
13
19.73
1
0.07

(1542)
3

STFNVPM:EKL

(SEQ ID NO:

1700)

1232
ORF1AB_NSP
1805-1819
LST:FISAARQGF:VDS
3
5
10.54
4
1

(1543)
3

(SEQ ID NO:

1700)

1233
ORF1AB_NSP
1878-1896
ARH:INAQVAKSHNIAL:I
3
11
17.01
3
0.23

(1544)
3

WN

(SEQ ID NO:

1700)

1234
ORF1AB_NSP
1904-1923
SEQ:LRKQIRSAAKKNNL:
7
14
23.21
4
0.29

(1555)
3

PFK

(SEQ ID NO:

1700)

1235
ORF1AB_NSP
1919-1933
NLP:FKLTCATTR:QVV
8
8
15.93
0
0

(1546)
3

(SEQ ID NO:

1700)

1236
ORF1AB_NSP
1929-1945
TRQ:VVNVVTTKIALKG:G
5
8
11.23
3
0.88

(1547)
3

(SEQ ID NO:

1700)

1237
ORF1AB_NSP
1-30
KI:VNNWLKQLIKV
32
49
81.67
28
0.96

(1548)
4

TLVFLFVAAIFYLI:TPV

(SEQ ID NO:

1701)

1238
ORF1AB_NSP
20-36
VAA:IFYLITPVHVM:SKH
10
10
18.37
1
0.18

(1549)
4

(SEQ ID NO:

1701)

1239
ORF1AB_NSP
115-137
NGD:FLHFLPRVFSAVGNI
11
18
28.25
5
0.67

(1550)
4

CY:TPS

(SEQ ID NO:

1701)

1240
ORF1AB_NSP
188-203
DTR:YVLMDGSIIQ:FPN
9
7
13.38
4
0.57

(1551)
4

(SEQ ID NO:

1701)

1241
ORF1AB_NSP
195-210
DGS:IIQFPNTYLE:GSV
9
6
9.83
1
0.33

(1552)
4

(SEQ ID NO:

1701)

1242
ORF1AB_NSP
238-259
SGR:WVLNNDYYRSLPGV
11
13
19.79
3
0.38

(1553)
4

FC:GVD

(SEQ ID NO:

1701)

1243
ORF1AB_NSP
258-279
VDA:VNLLTNMFTPLIQPI
19
18
30.45
0
0

(1554)
4

G:ALD

(SEQ ID NO:

1701)

1244
ORF1AB_NSP
296-311
CLA:YYFMRFRRAFG:EY
11
9
18.69
3
0.44

(1555)
4

(SEQ ID NO:

1701)

1245
ORF1AB_NSP
302-315
MR:FRRAFGEYS:HVV
0
8
16.53
4
1.75

(1556)
4

(SEQ ID NO:

1701)

1246
ORF1AB_NSP
315-334
VA:FNTLLFLMSFTVLCL:T
30
20
36.58
2
0.18

(1557)
4

PV

(SEQ ID NO:

1701)

1247
ORF1AB_NSP
338-364
LPG:VYSVIYLYLTF
43
23
30.57
3
0.22

(1558)
4

YLTNDVSFLA:HIQ

(SEQ ID NO:

1701)

1248
ORF1AB_NSP
362-380
HIQ:WMVMFTPLVPFWI:
28
11
19.08
0
0

(1559)
4

TIA

(SEQ ID NO:

1701)

1249
ORF1AB_NSP
372-386
LVP:FWITIAYII:CIS
15
6
11.53
1
0.11

(1560)
4

(SEQ ID NO:

1701)

1250
ORF1AB_NSP
387-405
TKH:FYWFFSNYLKRRV:V
22
22
41.58
6
0.38

(1561)
4

FN

(SEQ ID NO:

1701)

1251
ORF1AB_NSP
394-415
FSN:YLKRRVVFNGVSFST
13
11
13.16
13
1.13

(1562)
4

F:EEA

(SEQ ID NO:

1701)

1252
ORF1AB_NSP
417-433
LCT:FLLNKEMYLKL:RSD
10
8
13.04
2
0.33

(1563)
4

(SEQ ID NO:

1701)

1253
ORF1AB_NSP
424-441
KEM:YLKLRSDVLLPL:TQY
12
11
20.85
16
1.85

(1564)
4

(SEQ ID NO:

1701)

1254
ORF1AB_NSP
435-458
LLP:LTQYNRYLAL
21
22
37.02
4
0.3

(1565)
4

YNKYKYFS:GAM

(SEQ ID NO:

1701)

1255
ORF1AB_3
054-072
YED:LLIRKSNHNFLVQ:AG
6
18
35.29
1
0.05

(1566)
C-LIKE_

N

PROTEINASE

(SEQ ID NO:

1702)

1256
ORF1AB_3
063-078
NHN:FLVQAGNVQL:RVI
3
11
20.58
7
1.08

(1567)
C-LIKE_

PROTEINASE

(SEQ ID NO:

1702)

1257
ORF1AB_3
098-114
TPK:YKFVRIQPGQT:FSV
7
11
19.63
1
0.25

(1568)
C-LIKE_

PROTEINASE

(SEQ ID NO:

1702)

1258
ORF1AB_3
123-147
SGV:YQCAMRPN
13
17
23.50
2
0.18

(1569)
C-LIKE_

FTIKGSFLNGS:CGS

PROTEINASE

(SEQ ID NO:

1702)

1259
ORF1AB_3
216-231
DRW:FLNRFTTTLN:DFN
12
9
15.74
4
0.89

(1570)
C-LIKE_

PROTEINASE

(SEQ ID NO:

1702)

1260
ORF1AB_NSP
4-30
KRT:IKGTHHWLLL
27
39
71.82
85
6.41

(1571)
6

TILTSLLVLVQ:STQ

(SEQ ID NO:

1703)

1261
ORF1AB_NSP
31-49
WSL:FFFLYENAFLPFA:M
18
17
29.44
1
0.06

(1572)
6

GI

(SEQ ID NO:

1703)

1262
ORF1AB_NSP
46-66
AMG:IIAMSAFA
24
9
11.39
0
0

(1573)
6

MMFVKHK:HAF

(SEQ ID NO:

1703)

1263
ORF1AB_NSP
56-70
AMM:FVKHKHAFL:CLF
14
4
8.09
4
0.71

(1574)
6

(SEQ ID NO:

1703)

1264
ORF1AB_NSP
63-82
KHA:FLCLFLLPSLATVA:YF
19
20
37.07
29
1.9

(1575)
6

N

(SEQ ID NO:

1703)

1265
ORF1AB_NSP
77-94
TVA:YFNMVYMPASWV:
23
9
13.05
1
0.22

(1576)
6

MRI

(SEQ ID NO:

1703)

1266
ORF1AB_NSP
111-127
KDC:VMYASAVVLLI:LMT
12
18
34.15
26
2.44

(1577)
6

(SEQ ID NO:

1703)

1267
ORF1AB_NSP
119-134
AVV:LLILMTARTV:YDD
13
12
25.21
5
0.35

(1578)
6

(SEQ ID NO:

1703)

1268
ORF1AB_NSP
136-157
ARR:VWTLMNVLT
25
17
31.10
7
0.94

(1579)
6

LVYKVYY:GNA

(SEQ ID NO:

1703)

1269
ORF1AB_NSP
147-161
TLV:YKVYYGNAL:DQA
4
5
10.01
0
0

(1580)
6

(SEQ ID NO:

1703)

1270
ORF1AB_NSP
162-182
ISM:WALIISVTSNYSGVV:
16
22
41.16
3
0.18

(1581)
6

TTV

(SEQ ID NO:

1703)

1271
ORF1AB_NSP
196-211
YCP:IFFITGNTLQ:CIM
6
11
19.96
0
0

(1582)
6

(SEQ ID NO:

1703)

1272
ORF1AB_NSP
221-246
CTC:YFGLFCLLNRYF
33
23
36.55
7
0.61

(1583)
6

RLTLGVYD:YLV

(SEQ ID NO:

1703)

1273
ORF1AB_NSP
248-264
TQE:FRYMNSQGLLP:PKN
5
13
29.32
3
0.64

(1584)
6

(SEQ ID NO:

1703)

1274
ORF1AB_NSP
266-284
IDA:FKLNIKLLGVGGK:PCI
5
13
27.78
15
3.2

(1585)
6

(SEQ ID NO:

1703)

1275
ORF1AB_NSP
8-33
CTS:VVLLSVLQQLR
11
33
56.25
54
3.09

(1586)
7

VESSSKLWA:QCV

(SEQ ID NO:

1704)

1276
ORF1AB_NSP
46-70
TEA:FEKMVSLLSV
17
39
78.38
118
7.00

(1587)
7

LLSMQGAVD:INK

(SEQ ID NO:

1704)

1277
ORF1AB_NSP
003-017
ASE:FSSLPSYAA:FAT
10
7
14.84
3
0.38

(1588)
8

(SEQ ID NO:

1705)

1278
ORF1AB_NSP
030-050
DSE:VVLKKLKKSLNVAKS:
5
28
53.65
51
4

(1589)
8

EFD

(SEQ ID NO:

1705)

1279
ORF1AB_NSP
064-082
DQA:MTQMYKQARSEDK
3
9
12.98
2
0.22

(1590)
8

:RAK

(SEQ ID NO:

1705)

1280
ORF1AB_NSP
089-103
TML:FTMLRKLDN:DAL
8
5
10.14
4
1.71

(1591)
8

(SEQ ID NO:

1705)

1281
ORF1AB_NSP
100-118
NDA:LNNIINNARDGCV:P
3
8
11.35
4
0.63

(1592)
8

LN

(SEQ ID NO:

1705)

1282
ORF1AB_NSP
116-133
PLN:IIPLTTAAKLMV:VIP
9
7
10.25
2
0.57

(1593)
8

(SEQ ID NO:

1705)

1283
ORF1AB_NSP
182-198
WPL:IVTALRANSAVKL:Q
4
20
39.66
10
0.95

(1594)
8

(SEQ ID NO:

1705)

1284
ORF1AB_NSP
035-052
TKG:GRFVLALLSDLQ:DLK
5
14
25.78
18
2.21

(1595)
9

(SEQ ID NO:

1706)

1285
ORF1AB_NSP
084-097
KVK:YLYFIKGLN:NL
6
4
8.18
1
0.20

(1596)
9

(SEQ ID NO:

1706)

1286
ORF1AB_NSP
088-101
LY:FIKGLNNLNR:GM
3
16
34.47
9
1.90

(1597)
9

(SEQ ID NO:

1706)

1287
ORF1AB_NSP
092-108
KG:LNNLNRGMVLGS:LA
2
8
12.57
1
0.13

(1598)
9

A

(SEQ ID NO:

1706)

1288
ORF1AB_NSP
099-113
RGM:VLGSLAATV:RLQ
5
6
11.10
7
3.00

(1599)
9

(SEQ ID NO:

1706)

1289
ORF1AB_NSP
039-053
TNC:VKMLCTHTG:TGQ
5
6
11.8
0
0.00

(1600)
10

(SEQ ID NO:

1707)

1290
ORF1AB_RNA-
066-084
IDS:YFVVKRHTFSNYQ:HE
7
11
16.70
1
0.08

(1601)
DEPENDENT_

E

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1291
ORF1AB_RNA-
084-100
ETI:YNLLKDCPAVA:KHD
5
7
11.45
2
0.57

(1602)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1292
ORF1AB_RNA-
168-183
NPD:ILRVYANLGE:RVR
3
11
23.92
10
2.25

(1603)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1293
ORF1AB_RNA-
234-251
VDS:YYSLLMPILTLTR:AL
21
12
20.48
4
0.4

(1604)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1294
ORF1AB_RNA-
242-258
MP:ILTLTRALTAES:HVD
4
15
28.61
17
2.47

(1605)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1295
ORF1AB_RNA-
291-305
DQT:YHPNCVNCL:DDR
4
6
11.92
0
0

(1606)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1296
ORF1AB_RNA-
305-328
RCI:LHCANFNVL
23
12
14.65
1
0.08

(1607)
DEPENDENT_

FSTVFPPTS:FGP

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1297
ORF1AB_RNA-
355-371
HNQ:DVNLHSSRLSFK:EL
7
11
17.69
9
2

(1608)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1298
ORF1AB_RNA-
366-379
LS:FKELLVYAA:DPA
3
6
12.69
1
0.14

(1609)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1299
ORF1AB_RNA-
377-391
DPA:MHAASGNLL:LDK
10
6
14.08
2
1.17

(1610)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1300
ORF1AB_RNA-
539-559
ITQ:MNLKYAISAKNRART
5
15
25.08
5
0.41

(1611)
DEPENDENT_

:VAG

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1301
ORF1AB_RNA-
568-587
NRQ:FHQKLLKSIAATRG:
6
20
35.77
20
2.14

(1612)
DEPENDENT_

ATV

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1302
ORF1AB_RNA-
591-610
TSK:FYGGWHNMLKTVYS
12
11
15.98
2
0.17

(1613)
DEPENDENT_

:DVE

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1303
ORF1AB_RNA-
626-643
MPN:MLRIMASLVLAR:K
15
20
42.54
16
1.71

(1614)
DEPENDENT_

HT

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1304
ORF1AB_RNA-
633-647
MAS:LVLARKHTT:CCS
2
6
10.13
2
0.29

(1615)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1305
ORF1AB_RNA-
649-664
SHR:FYRLANECAQ:VLS
3
8
15.28
0
0

(1616)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1306
ORF1AB_RNA-
693-711
VFN:ICQAVTANVNALL:S
6
12
19.60
11
1.86

(1617)
DEPENDENT_

TD

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1307
ORF1AB_RNA-
701-718
:TANVNALLSTDGNKI:AD
8
8
9.04
7
1.75

(1618)
DEPENDENT_

K

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1308
ORF1AB_RNA-
716-731
ADK:YVRNLQHRLY:ECL
8
10
19.14
4
0.82

(1619)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1309
ORF1AB_RNA-
738-760
DTD:FVNEFYAYL
16
18
28.46
4
0.28

(1620)
DEPENDENT_

RKHFSMMI:LSD

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1310
ORF1AB_RNA-
760-777
DDA:VVCFNSTYASQG:LV
11
11
17.69
0
0

(1621)
DEPENDENT_

A

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1311
ORF1AB_RNA-
776-798
VAS:IKNFKSVLYY
18
19
32.13
5
0.39

(1622)
DEPENDENT_

QNNVFMS:EAK

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1312
ORF1AB_RNA-
809-823
PHE:FCSQHTMLV:KQG
8
6
11.70
0
0

(1623)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1313
ORF1AB_RNA-
853-867
TLM:IERFVSLAI:DAY
12
6
13.02
4
2

(1624)
DEPENDENT_

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1314
ORF1AB_RNA-
878-899
ADV:FHLYLQYIR
15
16
24.38
8
0.71

(1625)
DEPENDENT_

KLHDELT:GHM

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1315
ORF1AB_RNA-
896-918
TGH:MLDMYSVM
23
11
13.76
1
0.09

(1626)
DEPENDENT_

LTNDNTSRY:WEP

RNA_

POLYMERASE

(SEQ ID NO:

1708)

1316
ORF1AB_
030-046
CYD:HVISTSHKLVL:SVN
4
10
17.06
16
1.62

(1627)
HELICASE

(SEQ ID NO:

1709)

1317
ORF1AB_
038-053
SHK:LVLSVNPYVC:NAP
4
11
20.30
4
0.58

(1628)
HELICASE

(SEQ ID NO:

1709)

1318
ORF1AB_
086-104
NGQ:VFGLYKNTCVGSD:
4
9
12.97
3
0.33

(1629)
HELICASE

NVT

(SEQ ID NO:

1709)

1319
ORF1AB_
127-143
TER:LKLFAAETLKA:TEE
4
14
25.30
3
0.71

(1630)
HELICASE

(SEQ ID NO:

1709)

1320
ORF1AB_
142-160
EET:FKLSYGIATVREV:LSD
8
8
14.36
3
0.75

(1631)
HELICASE

(SEQ ID NO:

1709)

1321
ORF1AB_
182-198
FTG:YRVTKNSKVQI:GEY
2
10
19.77
10
2.00

(1632)
HELICASE

(SEQ ID NO:

1709)

1322
ORF1AB_
208-223
AVV:YRGTTTYKLN:VGD
12
7
13.35
2
0.38

(1633)
HELICASE

(SEQ ID NO:

1709)

1323
ORF1AB_
221-236
VGD:YFVLTSHTVM:PLS
10
8
15.19
4
0.67

(1634)
HELICASE

(SEQ ID NO:

1709)

1324
ORF1AB_
229-243
SHT:VMPLSAPTL:VPQ
10
2
3.06
1
0.67

(1635)
HELICASE

(SEQ ID NO:

1709)

1325
ORF1AB_
243-260
QEH:YVRITGLYPTLN:ISD
6
8
11.70
4
0.50

(1636)
HELICASE

(SEQ ID NO:

1709)

1326
ORF1AB_
259-273
SDE:FSSNVANYQ:KVG
5
6
12.95
0
0.00

(1637)
HELICASE

(SEQ ID NO:

1709)

1327
ORF1AB_
271-288
KVG:MQKYSTLQGPPG:T
3
7
10.02
9
1.50

(1638)
HELICASE

GK

(SEQ ID NO:

1709)

1328
ORF1AB_
340-353
VEC:FDKFKVNST:LE
2
4
6.74
0
0.00

(1639)
HELICASE

(SEQ ID NO:

1709)

1329
ORF1AB_
344-357
DK:FKVNSTLEQ:YVF
7
7
14.65
5
2.38

(1640)
HELICASE

(SEQ ID NO:

1709)

1330
ORF1AB_
379-401
ATN:YDLSVVNA
5
11
13.62
12
1.36

(1641)
HELICASE

RLRAKHYVY:IGD

(SEQ ID NO:

1709)

1331
ORF1AB_
418-432
EPE:YFNSVCRLMK:TI
5
9
17.54
1
0.11

(1642)
HELICASE

(SEQ ID NO:

1709)

1332
ORF1AB_
469-495
AQC:FKMFYKGVIT
10
20
30.69
3
0.35

(1643)
HELICASE

HDVSSAINRPQ:IGV

(SEQ ID NO:

1709)

1333
ORF1AB_
507-528
RKA:VFISPYNSQNAVASK
9
19
31.53
6
0.52

(1644)
HELICASE

I:LGL

(SEQ ID NO:

1709)

1334
ORF1AB_
555-576
SCN:VNRFNVAITRAKVGI
7
15
22.59
4
0.53

(1645)
HELICASE

L:CIM

(SEQ ID NO:

1709)

1335
ORF1AB_3-
048-065
DMT:YRRLISMMGFKM:N
21
12
26.54
0
0

(1646)
TO-

YQ

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1336
ORF1AB_3-
104-128
NLP:LQLGFSTGV
11
14
15.63
3
0.43

(1647)
TO-

NLVAVPTGYV:DTP

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1337
ORF1AB_3-
143-165
GDQ:FKHLIPLMY
16
17
28.76
2
0.24

(1648)
TO-

KGLPWNVV:RIK

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1338
ORF1AB_3-
156-177
GLP:WNVVRIK
12
11
15.57
1
0.08

(1649)
TO-

IVQMLSDTL:KNL

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1339
ORF1AB_3-
302-324
ELK:INAACRKV
5
12
15.18
8
0.71

(1650)
TO-

QHMVVKAAL:LAD

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1340
ORF1AB_3-
401-416
FDT:RVLSNLNLPG:CDG
0
9
18.48
44
11.55

(1651)
TO-

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1341
ORF1AB_3-
416-431
GGS:LYVNKHAFHT:PAF
8
7
10.8
1
0.25

(1652)
TO-

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1342
ORF1AB_3-
434-448
SA:FVNLKQLPFF:YYS
10
10
18.32
2
0.5

(1653)
TO-

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1343
ORF1AB_3-
462-479
DID:YVPLKSATCITR:CNL
5
8
12.05
1
0.13

(1654)
TO-

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1344
ORF1AB_3-
495-513
LDA:YNMMISAGFSLWV:
26
14
25.79
4
0.36

(1655)
TO-

YKQ

5_

EXONUCLEASE

(SEQ ID NO:

1710)

1345
ORF1AB_
002-018
LEN:VAFNVVNKGHF:DG
3
8
12.71
1
0.13

(1656)
ENDORNASE

Q

(SEQ ID NO:

1711)

1346
ORF1AB_
065-082
PVP:EVKILNNLGVDI:AAN
4
16
31.14
9
1.19

(1657)
ENDORNASE

(SEQ ID NO:

1711)

1347
ORF1AB_
085-099
IWD:YKRDAPAHI:STI
2
6
12.58
1
0.17

(1658)
ENDORNASE

(SEQ ID NO:

1711)

1348
ORF1AB_
130-145
QVD:LFRNARNGVL:ITE
4
12
26.64
1
0.08

(1659)
ENDORNASE

(SEQ ID NO:

1711)

1349
ORF1AB_
145-162
EGS:VKGLQPSVGPKQ:AS
4
9
13.1
5
0.78

(1660)
ENDORNASE

L

(SEQ ID NO:

1711)

1350
ORF1AB_
190-211
PET:YFTQSRNLQEFKPRS
8
14
20.99
11
1.93

(1661)
ENDORNASE

Q:MEI

(SEQ ID NO:

1711)

1351
ORF1AB_
237-263
YGD:FSHSQLGGLH
16
18
24.65
6
0.33

(1662)
ENDORNASE

LLIGLAKRFKE:SPF

(SEQ ID NO:

1711)

1352
ORF1AB_
300-316
DDF:VEIIKSQDLSV:VSK
4
8
13.64
6
1.2

(1663)
ENDORNASE

(SEQ ID NO:

1711)

1353
ORF1AB_2-O-
011-026
MPN:LYKMQRMLLE:KCD
8
8
16.48
4
0.63

(1664)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1354
ORF1AB_2-O-
044-058
VAK:YTQLCQYLN:TLT
10
5
8.87
3
0.57

(1665)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1355
ORF1AB_2-O-
054-068
LNT:LTLAVPYNM:RVI
8
6
11.14
0
0.00

(1666)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1356
ORF1AB_2-O-
081-095
GTA:VLRQWLPTGT:LL
5
6
9.52
0
0.00

(1667)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1357
ORF1AB_2-O-
147-161
EGF:FTYICGFIQ:QKL
10
4
7.26
0
0.00

(1668)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1358
ORF1AB_2-O-
153-170
ICG:FIQQKLALGGSV:AIK
5
14
26.33
20
2.67

(1669)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1359
ORF1AB_2-O-
173-187
ADL:YKLMGHFAW:WTA
16
5
9.08
1
0.29

(1670)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1360
ORF1AB_2-O-
186-200
HFA:WWTAFVTNVN:AS
7
6
10.74
1
0.50

(1671)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1361
ORF1AB_2-O-
191-208
TA:FVTNVNASSSEAF:LIG
1
18
34.78
8
1.22

(1672)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1362
ORF1AB_2-O-
202-216
SEA:FLIGCNYLG:KPR
11
7
15.41
0
0.00

(1673)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1363
ORF1AB_2-O-
226-242
ANY:IFWRNTNPIQL:SSY
11
14
28.01
1
0.07

(1674)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1364
ORF1AB_2-O-
267-285
IND:MILSLLSKGRLII:REN
11
27
50.22
38
2.80

(1675)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1365
ORF1AB_2-O-
277-293
KGR:LIIRENNRVVI:SSD
4
13
24.71
3
0.46

(1676)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1713
ORF1AB_
226-242
ANY:IFWRNTNPIQL:SSY
11

28.01

0.07

(2562)
METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1723
ORF1AB_RNA
738-760
DTD:FVNEFYAYLRKHFS
16

28.46

0.28

(2563)
_POLYMERASE

MM I:LSD

(SEQ ID NO:

1708)

1724
ORF1AB_RNA-
736-760
DVDTD:F

(2564)
_POLYMERASE

VNEFYAYLRKHFSMMI:LS

_MOD

D

(SEQ ID NO:

1712)

1742
ORF1AB_NSP
162-182
ISM:WALIISVTSNYSGVV:
16

41.16

0.18

(2565)
6

TTV

(SEQ ID NO:

1703)

1756
ORF1AB_RNA-
776-798
VAS:IKNFKSVLYYQNNVF
18

32.13

0.39

(2566)
_POLYMERASE

MS:EAK

(SEQ ID NO:

1708)

1772
ORF1AB_NSP
182-198
WPL:IVTALRANSAVKL:Q
4

39.66

0.95

(2567)
8

(SEQ ID NO:

1705)

1782
SPIKE
446-468
GGN:
14

58.68

1.10

(2568)
(SEQ ID NO:

YNYLYRLFRKSNLKPFE:

3)

RDI

1783
SPIKE
444-468
KVGGN:YNYLYRLFRKSNL

(2569)
MOD (SEQ ID

KPFE:RDI

NO: 3)

1801
ORF1AB_RNA
626-643
MPN:MLRIMASLVLAR:K
15

42.54

1.71

(2570)
_POLYMERASE

HT

(SEQ ID NO:

1708)

1812
ORF1AB_NSP
435-458
LLP:LTQYNRYLALYNKYK
21

37.02

0.3

(2571)
4

YFS:GAM

(SEQ ID NO:

1701)

1813
ORF1AB_NSP
435-459
LLP:LTQYNRYLALYNKYK

(2572)
4

YFS:GAMD

MOD (SEQ ID

NO: 1701)

1831
ORF1AB_EXO
48-65
DMT:
21

26.54

0.0

(2573)
NUCLEASE

YRRLISMMGFKM:NYQ

(SEQ ID NO:

1710)

1832
ORF1AB_
48-66
DMT:YRRLISMMGFKM:N

(2574)
EXONUCLEASE

YQV

MOD (SEQ ID

NO: 1710)

1844
ORF1AB_
143-165
GDQ:FKHLIPLMYKGLPW
16

28.76

0.24

(2575)
EXONUCLEASE

NVV:RIK

(SEQ ID NO:

1710)

1860
ORF1AB_RNA
539-559
ITQ:MNLKYAISAKNRART
5

25.08

0.41

(2576)
_POLYMERASE

:VAG

(SEQ ID NO:

1708)

1861
ORF1AB_RNA
536-559
IPTITQ:MNLKYAISAKNR

(2577)
_POLYMERASE

ART:VAG

MOD (SEQ ID

NO: 1708)

1878
ORF1AB_
191-208
TA:FVTNVNASSSEAF:LIG
1

34.78

1.22

(2578)
METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1879
ORF1AB_
191-205
TA:FVTNVNASSSEAF:

(2579)
METHYL

TRANSFERASE

MOD (SEQ ID

NO: 1712)

1890
ORF1AB_NSP
88-101
LY:FIKGLNNLNR:GM
3

34.37

1.9

(2580)
9

(SEQ ID NO:

1706)

1891
ORF1AB_NSP
88-100
LY:FIKGLNNLNR:G

(2581)
9

MOD (SEQ ID

NO: 1706)

1898
ORF1AB_
507-528
RKA:VFISPYNSQNAVASK
9

31.53

0.52

(2582)
HELICASE

I:LGL

(SEQ ID NO:

1709)

1913
ORF1AB_
258-279
VDA:VNLLTNMFTPLIQPI
19

30.35

0.52

(2583)
NSP4

G:ALD

(SEQ ID NO:

1701)

1928
ORF1AB_
277-293
KGR:LIIRENNRVVI:SSD
4

24.71

0.46

(2584)
METHYL

TRANSFERASE

(SEQ ID NO:

1712)

1929
ORF1AB_
277-292
KGR:LIIRENNRVVI:SS

(2585)
METHYL

TRANSFERASE

MOD (SEQ ID

NO: 1712)

1939
ORF1AB_
127-143
TER:LKLFAAETLKA:TEE
4

25.3

0.71

(2586)
HELICASE

(SEQ ID NO:

1709)

1949
ORF1AB_
555-576
SSN:VNRFNVAITRAKVGI
7

22.59

0.53

(2587)
HELICASE

L:SIM

(SEQ ID NO:

1709)

1964
ORF1AB_
416-431
GGS:LYVNKHAFHT:PAF
8

10.8

0.25

(2588)
EXONUCLEASE

(SEQ ID NO:

1710)

1965
ORF1AB_
416-432
GGS:LYVNKHAFHT:PAFD

(2589)
EXONUCLEASE

MOD (SEQ ID

NO: 1710)

1975
SPIKE
114-130
TQS:LLIVNNATNVVI:KV
5

36.68

0.52

(2590)
(SEQ ID NO:

3)

1985
ORF1AB_NSP
1700-1720
LSH:FVNLDNLRANNTKG
3

39.96

0.21

(2591)
3

S:LPI

(SEQ ID NO:

1700)

1999
ORF1AB_
54-72
YED:LLIRKSNHNFLVQ:AG
6

35.29

6

(2592)
PROTEINASE

N

(SEQ ID NO:

1702)

2647
SPIKE
61-78
NVT:WFHAIHVSGTNG:T

(2648)
(SEQ ID NO:

KR

3)

2649
SPIKE
61-78
NVT:WFHAISGTNG:TKR

(2650)
(SEQ ID NO:
(del69_

3)
70)

2651
SPIKE
61-156
FLG:YYHKNNKSWM:ESE

(2652)
(SEQ ID NO:
(del144)

3)

2653
SPIKE
61-156
FLG:YYHKNNKSWM:ESE

(2654)
(SEQ ID NO:
(del144)

3)

2655
SPIKE
232-247
GIN:ITRFQTLLALHR:S

(2656)
(SEQ ID NO:

3)

2657
SPIKE
232-247

custom-character

(2658)
(SEQ ID NO:

3)

2659
SPIKE
445-461
VGG:NYNYLYRLFRK:SNL

(2660)
(SEQ ID NO:

3)

2661
SPIKE
445-461

custom-character

(2662)
(SEQ ID NO:

3)

2663
SPIKE
478-494

custom-character

(2664)
(SEQ ID NO:

3)

2665
SPIKE
492-508

custom-character

(2666)
(SEQ ID NO:

3)

2667
SPIKE
492-508

custom-character

(2668)
(SEQ ID NO:

3)

2669
SPIKE
673-689
SYQ:TQTNSPRRARS:VAS

(2670)
(SEQ ID NO:

3)

2671
SPIKE
673-689

custom-character

(2672)
(SEQ ID NO:

3)

2673
SPIKE
715-729
PIN:FTISVTTEI:LPV

(2674)
(SEQ ID NO:

3)

2675
SPIKE
512-526
VLS:FELLHAPAT:VEG

(2676)
(SEQ ID NO:

3)

2677
ORF3A
157-174

SIP:YNSVTSSIVITS:GDG

(2678)
(SEQ ID NO:

1694)

2679
ORF1AB_NSP
1324-1342
AKP:FLNKVVSTTTNIV:TR

(2680)
3

S

(SEQ ID NO:

1700)

2681
ORF1AB_NSP
1422-1446
LGS:LIYSTAALGVLMSNL

(2682)
3

GMPS:YST

(SEQ ID NO:

1700)

2683
ORF1AB_NSP
1592-1606
MMS:YKRNRATRV:EST

(2684)
3

(SEQ ID NO:

1700)

2685
ORF1AB_NSP
1592-1606
MME:YKRNRATRV:EST

(2686)
3

(SEQ ID NO:

1700)

2687
ORF1AB_NSP
1449-1464
REG:YLNSTNVTIA:TYS

(2688)
3

(SEQ ID NO:

1700)

2689
ORF1AB_NSP
258-275
VDA:VNLLTNMFTPLI:QPI

(2690)
4

(SEQ ID NO:

1701)

2691
ORF1AB_NSP
100-118
NDA:LNNIINNARDGS

(2692)
8

(SEQ ID NO:

1705)

2693
ORF1AB_HELI
469-495
AQS:FKMFYKGVITHDVSS

(2694)
CASE

AINRPQ:IGV

(SEQ ID NO:

1709)

2695
ORF1AB_HELI
555-576
SSN:VNRFNVAITRAKVGI

(2696)
CASE

L:SIM

(SEQ ID NO:

1709)

2697
ORF1AB_3
054-071
YED:LLIRKSNHNFLVQ:AG

(2698)
C-LIKE_

PROTEINASE

(SEQ ID NO:

1702)

2699
ORF1AB_2-O-
011-026
MPN:LYKMQRMLLE:KDD

(2700)
RIBOSE

_METHYL

TRANSFERASE

(SEQ ID NO:

1712)

2701
SPIKE
492-508

custom-character

(2702)
(SEQ ID NO:

R

3)

2703
SPIKE
492-508

custom-character

(2704)
(SEQ ID NO:

R

3)

2705
NUCLEOCAPS
081-098
DDQ:IGYYRRATRRIR:GG

(2706)
ID

DG

(SEQ ID NO:

1693)

2707
ORF3A
102-119

KEAP:FLYLYALVYFLQ:SIN

(2708)
(SEQ ID NO:

1694)

2709
ORF3A
157-174

KIP:YNSVTSSIVITS:GDGT

(2710)
(SEQ ID NO:

1694)

2711
ORF1AB_NSP
1422-1446

KLGS:LIYSTAALGVLMSNL

(2712)
3

GMPS:YST

(SEQ ID NO:

1700)

2713
ORF1AB_NSP
338-364

KPG:VYSVIYLYLTF

(2714)
4

YLTNDVSFLA:HIQ

(SEQ ID NO:

1701)

2715
ORF1AB_NSP
162-182

KISM:WALIISVTSNYSGV

(2716)
6

V:TTV

(SEQ ID NO:

1703)

2717
ORF1AB_RNA-
539-558
ITQ:MNLKYAISAKNRART

(2718)
DEPENDENT_

:VA

RNA_

POLYMERASE

(SEQ ID NO:

1708)

- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 4-68, 1003-1005, 708-739, 1055-1059, 1366-1370, 2735-2792, or 8588 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 4-68, 1003-1005, 708-739, 1055-1059, 1366-1370, 2735-2792, or 8588 in the amino acid sequence of the envelope (SEQ ID NO: 1) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 69-209, 1006-1015, 740-851, 1060-1072, 1118-1119, 1371-1383, 1429-1430, 2255, 2561, 2793-2966, 8568, 8587, 8590, 8691, or 8692 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 69-213, 1006-1015, 740-851, 1060-1072, 1118-1119, 1371-1383, 1429-1430, 2793-2966, 8568, 8587, 8590, 8691, or 8692 in the amino acid sequence of the membrane (SEQ ID NO: 2) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 210-707, 1016-1054, 852-1002, 1073-1117, 1120-1123, 1384-1428, 1431-1434, 1782-1800, 1975-1984, 2568, 2569, 2590, 2647-2676, 2701-2704, 7810-8540, 8543, 8553, 8555, 8583, 8599, 8609, 8613, 8617, 8620, 8652, 8654, 8670, 8675, 8686, or 8690 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 210-707, 1016-1054, 852-1002, 1073-1117, 1120-1123, 1384-1428, 1431-1434, 1782-1800, 1975-1984, 2568, 2569, 2590, 2647-2676, 2701-2704, 7810-8540, 8543, 8553, 8555, 8583, 8599, 8609, 8613, 8617, 8620, 8652, 8654, 8670, 8675, 8686, or 8690 in the amino acid sequence of the spike (SEQ ID NO: 3) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1124-1128 1435-1439, 2705-2706, 2967-3140, 8556, 8594, 8607, 8642, or 8655 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1124-1128, 1435-1439, 2705-2706, 2967-3140, 8556, 8594, 8607, 8642, or 8655 in the amino acid sequence of the nucleocapsid (SEQ ID NO: 1693) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1129-1139, 1440-1450, 2677-2678, 2707-2710, 7397-7610, 8566, 8575, 8625, 8635, or 8683 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1129-1139, 1440-1450, 2677-2678, 2707-2710, 7397-7610, 8566, 8575, 8625, 8635, or 8683 in the amino acid sequence of the ORF3a protein (SEQ ID NO: 1694) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1140-1141, 1451-1452, 7611-7657, 8548, or 8787 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1140-1141, 1451-1452, 7611-7657, 8548, or 8787 in the amino acid sequence of the ORF6 protein (SEQ ID NO: 1695) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1142-1145, 1453-1456, or 7658-7746 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1142-1145, 1453-1456, or 7658-7746 in the amino acid sequence of the ORF7a protein (SEQ ID NO: 1696) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1146-1148, 1457-1459, 7747-7809, 8662, or 8668 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1146-1148, 1457-1459, 7747-7809, 8662, or 8668 in the amino acid sequence of the ORF8 protein (SEQ ID NO: 1697) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1149-1150, 1460-1461, or 3140-3169 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1149-1150, 1460-1461, or 3140-3169 in the amino acid sequence of the ORF10 protein (SEQ ID NO: 1698) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1151-1169, 1462-1480, 4447-4798, 8549, 8552, 8612, 8659, 8661, 8663, 8664, 8684,or 8685 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1151-1169, 1462-1480, 4447-4798, 8549, 8552, 8612, 8659, 8661, 8663, 8664, 8684,or 8685 in the amino acid sequence of the ORF1ab non-structural protein 2 (NSP2) (SEQ ID NO: 1699) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1170-1236, 1481-1547,1985-1998, 2591, 2679-2688, 2711-2712, 4799-5969, 8542, 8545, 8547, 8560, 8564, 8567, 8569, 8570, 8589, 8591, 8598, 8600, 8603, 8608, 8610, 8616, 8618, 8629, 8630, 8633, 8639, 8649, 8650, 8665-8668, 8671, or 8681 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1170-1236 1481-1547, 1985-1998, 2591, 2679-2688, 2711-2712, 4799-5969, 8542, 8545, 8547, 8560, 8564, 8567, 8569, 8570, 8589, 8591, 8598, 8600, 8603, 8608, 8610, 8616, 8618, 8629, 8630, 8633, 8639, 8649, 8650, 8665-8668, 8671, or 8681 in the amino acid sequence of the ORF1ab non-structural protein 3 (NSP3) (SEQ ID NO: 1700) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1237-1254, 1548-1565, 1812-1830, 1913-1927, 2571, 2572, 2583, 2689-2690, 2713-2714, 5970-6334, 8571, 8578, 8580-8582, 8606, 8632, 8634, or 8677 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1237-1254, 1548-1565, 1812-1830, 1913-1927, 2571, 2572, 2583, 2689-2690, 2713-2714, 5970-6334, 8571, 8578, 8580-8582, 8606, 8632, 8634, or 8677 in the amino acid sequence of the ORF1ab non-structural protein 4 (NSP4) (SEQ ID NO: 1701) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1255-1259, 1566-1570, 1999-2010, 2592, 2697-2698, 3170-3342, or 8574 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1255-1259, 1566-1570, 1999-2010, 2592, 2697-2698, 3170-3342, or 8574 in the amino acid sequence of the ORF1ab 3C-like proteinase (SEQ ID NO: 1702) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1260-1274, 1571-1585, 1742-1755, 2715-2716, 6335-6598, 8561, 8577, 8615, 8621, 8623, 8627, 8638, or 8647 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1260-1274, 1571-1585, 1742-1755, 2715-2716, 6335-6598, 8561, 8577, 8615, 8621, 8623, 8627, 8638, or 8647 in the amino acid sequence of the ORF1ab non-structural protein 6 (NSP6) (SEQ ID NO: 1703) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1275-1276, 1586-1587, 6599-6636, or 8682 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1275-1276, 1586-1587, 6599-6636, or 8682 in the amino acid sequence of the ORF1ab non-structural protein 7 (NSP7) (SEQ ID NO: 1704) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1277-1283, 1588-1594, 1772-1781, 2567, 2691-2692, 6637-6739, 8585, 8593, 8611, 8619, 8657, or 8669 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1277-1283, 1588-1594, 1772-1781, 2567, 2691-2692, 6637-6739, 8585, 8593, 8611, 8619, 8657, or 8669 in the amino acid sequence of the ORF1ab non-structural protein 8 (NSP8) (SEQ ID NO: 1705) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1284-1288, 1595-1599, 1890-1897, 2580, 6740-6789, 8558, or 8628 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1284-1288, 1595-1599, 1890-1897, 2580, 6740-6789, 8558, or 8628 in the amino acid sequence of the ORF1ab non-structural protein 9 (NSP9) (SEQ ID NO: 1706) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1289, 1600, or 4390-4446 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1289, 1600, or 4390-4446 in the amino acid sequence of the ORF1 ab non-structural protein 10 (NSP10) (SEQ ID NO: 1707) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1290-1315, 1601-1626, 1723-1741, 1756-1771, 1802-1811, 1860-1877, 2563, 2564, 2566, 2570, 2717-2718, 6790-7396, 8541, 8544, 8546, 8550, 8559, 8565, 8579, 8584, 8595, 8596, 8601, 8602, 8605, 8614, 8622, 8624, 8626, 8636, 8643, 8645, 8648, 8658, 8660, or 8674 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1290-1315, 1601-1626, 1723-1741, 1756-1771, 1802-1811, 1860-1877, 2563, 2564, 2566, 2570, 2717-2718, 6790-7396, 8541, 8544, 8546, 8550, 8559, 8565, 8579, 8584, 8595, 8596, 8601, 8602, 8605, 8614, 8622, 8624, 8626, 8636, 8643, 8645, 8648, 8658, 8660, or 8674 in the amino acid sequence of the ORF1ab RNA-dependent RNA polymerase protein (SEQ ID NO: 1708) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1316-1334, 1627-1645, 1898-1912, 1939-1963, 2582, 2586, 2693-2696, 3839-4179, 8572, 8641, 8646, 8651, or 8676 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1316-1334, 1627-1645, 1898-1912, 1939-1963, 2582, 2586, 2693-2696, 3839-4179, 8572, 8641, 8646, 8651, or 8676 in the amino acid sequence of the ORF1ab helicase protein (SEQ ID NO: 1709) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1335-1344, 1646-1655, 1831-1859, 1964-1974, 2573, 2574, 2575, 2588, 2589, 3343-3661, 8554, 8557, 8563, 8573, 8576, 8586, 8592, 8597, 8631, 8640, 8644, 8653, or 8672 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1335-1344, 1646-1655, 1831-1859, 1964-1974, 2573, 2574, 2575, 2588, 2589, 3343-3661, 8554, 8557, 8563, 8573, 8576, 8586, 8592, 8597, 8631, 8640, 8644, 8653, or 8672 in the amino acid sequence of the ORF1 ab 3′-5′ exonuclease protein (SEQ ID NO: 1710) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1345-1352, 1656-1663, 3362-3838, 8551, 8562, 8637, or 8680 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1345-1352, 1656-1663, 3362-3838, 8551, 8562, 8637, or 8680 in the amino acid sequence of the ORF1ab endoRNase protein (SEQ ID NO: 1711) of SARS-CoV-2.
- For a peptide or polypeptide have a core sequence comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence of SEQ ID NOS: 1353-1365, 1664-1676, 1713-1722, 1878-1889,1928-1938, 2578, 2579, 2584, 2699-2700, 4180-4389, 8604, 8656, 8673, 8678, or 8679 (and/or fragments and variants thereof), optionally with extensions of 1 to 12 amino acids on the C-terminal and/or the N-terminal, the extensions of 1 to 12 amino acids are those found flanking the amino acid sequence of SEQ ID NOS: 1353-1365, 1664-1676, 1713-1722, 1878-1889, 1928-1938, 2578, 2579, 2584, 2699-2700, 4180-4389, 8604, 8656, 8673, 8678, or 8679 in the amino acid sequence of the ORF1ab 2′O-ribose methyltransferase protein (SEQ ID NO: 1712) of SARS-CoV-2.

In aspects, said flanking amino acid sequences as described herein may serve as a WIC stabilizing region. The use of a longer peptide may allow endogenous processing by patient cells and may lead to more effective antigen presentation and induction of T cell responses. In aspects, the peptides or polypeptides of the instant disclosure can be isolated, recombinant, and/or synthetic. In aspects, the peptides or polypeptides can be either in neutral (uncharged) or salt forms, and may be either free of or include modifications such as glycosylation, side chain oxidation, or phosphorylation. In aspects, the peptides or polypeptides of the instant disclosure can be capped with an n-terminal acetyl and/or c-terminal amino group.

In aspects, the instant disclosure is directed to one or more Class II polypeptides (“clusters”) of Table 1, (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of a polypeptide of Table 1.

In aspects, the instant disclosure is directed to one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1055, 1366, 1060, 1371, 1061, 1372, 1062, 1373, 1065, 1376, 1066, 1376, 1069, 1380, 1070, 1381, 1071 1382, 1072, 1383, 1074, 1384, 1078, 1389, 1080, 1391, 1081, 1392, 1082, 1393, 1085, 1395, 1086 1397, 1087, 1398, 1088, 1399, 1089, 1400, 1091, 1401, 1092, 1403, 1093, 1404, 1096, 1407, 1100 1411, 1104, 1415, 1105, 1416, 1106, 1417, 1107, 1418, 1110, 1421, 1113, 1424, 1115, 1426, 1116, and 1427 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1055, 1366, 1060, 1371, 1061, 1372, 1062, 1373, 1065, 1376, 1066, 1376, 1069, 1380, 1070, 1381, 1071 1382, 1072, 1383, 1074, 1384, 1078, 1389, 1080, 1391, 1081, 1392, 1082, 1393, 1085, 1395, 1086 1397, 1087, 1398, 1088, 1399, 1089, 1400, 1091, 1401, 1092, 1403, 1093, 1404, 1096, 1407, 1100 1411, 1104, 1415, 1105, 1416, 1106, 1417, 1107, 1418, 1110, 1421, 1113, 1424, 1115, 1426, 1116, and 1427. Optionally, such peptides or polypeptides may be used/administered in the instantly-disclosed vaccines and related methods as “peptide pools”, as disclosed in Example 8 and Tables 61 and 62.

In aspects, the instant disclosure is directed to one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1062 1373, 1066. 1377, 1079, 1390, 1080, 1391, 1085, 1396, 1124, 1435, 1127, 1438, 1132, 1443, 1134, 1445, 1136, 1447, 2677, 2678, 1140, 1451, 1148, 1459, 1184, 1495, 1185, 1496, 1197, 1507, 1203, 1514, 2679, 2680, 2681, 2682, 1220, 1531, 1222, 1533, 1223, 1534, 2683, 2684, 1234, 1555, 2689, 3690, 1247, 1558, 1250, 1561, 1254, 1565, 2697, 2698, 1261, 1571, 1267, 1578, 1268, 1579, 1270, 1581, 1273, 1584, 1293, 1604, 1300, 1611, 1309, 1619, 1311, 1622, 1314, 1625, 1319, 1630, 2693, 2694, 1333, 1644, 2695, 2696, 1335, 1646, 1337, 1648, 1344, 1655, 1348, 1659, 1351, 1662, 1363 1674, 1365 and 1676 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1062, 1373, 1066, 1377, 1079, 1390, 1080, 1391, 1085, 1396, 1124, 1435, 1127, 1438, 1132, 1443, 1134, 1445, 1136, 1447, 2677, 2678, 1140, 1451, 1148, 1459, 1184, 1495, 1185, 1496, 1197, 1507, 1203, 1514, 2679, 2680, 2681, 2682, 1220, 1531, 1222, 1533, 1223, 1534, 2683, 2684, 1234, 1555, 2689, 3690, 1247, 1558, 1250, 1561, 1254, 1565, 2697, 2698, 1261, 1571, 1267, 1578, 1268, 1579, 1270, 1581, 1273, 1584, 1293, 1604, 1300, 1611, 1309, 1619, 1311, 1622, 1314, 1625, 1319, 1630, 2693, 2694, 1333, 1644, 2695, 2696, 1335, 1646, 1337, 1648, 1344, 1655, 1348, 1659, 1351, 1662, 1363 1674, 1365 and 1676. In aspects, such polypeptides may be used/delivered via a micro needle patch, as are known in the art.

In aspects, the instant disclosure is directed to one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 2647, 2648, 2649, 2650, 2651, 2652, 2653, 2654, 2655, 2656, 2657, 2658, 2659, 2660, 2661, 2662, 1092, 1403, 2663, 2664, 1093, 1404, 2665, 2666, 2667, 2668, 2669, 2670, 2671, 2672, 1098, 1409, 2673, and 2674 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 2647, 2648, 2649, 2650, 2651, 2652, 2653, 2654, 2655, 2656, 2657, 2658, 2659, 2660, 2661, 2662, 1092, 1403, 2663, 2664, 1093, 1404, 2665, 2666, 2667, 2668, 2669, 2670, 2671, 2672, 1098, 1409, 2673, and 2674.

In aspects, the instant disclosure is directed to one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1071, 1382, 1087, 1398, 2675, 2676, 1124, 1435, 1155, 1466, 1170, 1481, 1174, 1485, 1181, 1492, 1190, 1501, 1192, 1503, 1193, 1504, 2687, 2688, 2685, 2686, 1240, 1551, 1253, 1564, 1255, 1566, 1262, 1573, 1267, 1578, 2691, 2692, 1295, 1606, 1310, 1621, 1344, 1655, 1348, 1659, 1352, 1663, 2699, and 2700 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1071, 1382, 1087, 1398, 2675, 2676, 1124, 1435, 1155, 1466, 1170, 1481, 1174, 1485, 1181, 1492, 1190, 1501, 1192, 1503, 1193, 1504, 2687, 2688, 2685, 2686, 1240, 1551, 1253, 1564, 1255, 1566, 1262, 1573, 1267, 1578, 2691, 2692, 1295, 1606, 1310, 1621, 1344, 1655, 1348, 1659, 1352, 1663, 2699, and 2700.

In aspects, the instant disclosure is directed to one or more Class I polypeptides (9-mers or lOmers) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 2735-8540 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 2735-8540.

In aspects, the instant disclosure is directed to one or more Class I polypeptides (9-mers) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 8541-8690 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 8541-8690. In aspects, such polypeptides may be used/delivered via a micro needle patch, as are known in the art.

In aspects, the instant disclosure is directed to a polypeptide comprising an amino acid sequence having at least 75%, 80%, 85%, 90%, or 95% homology to any one of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments thereof), wherein said polypeptide is still able to bind to a same HLA molecule (i.e., retain MHC binding propensity) and/or retain the same TCR specificity, and/or retain anti-SARS-CoV-2 activity.

In aspects, the present disclosure is directed to a concatemeric polypeptide or peptide that comprises at one or more of the instantly-disclosed polypeptides or peptides (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) linked, fused, or joined together (e.g., fused in-frame, chemically-linked, or otherwise bound) to an additional peptide or polypeptide. Such additional peptide or polypeptide may be one or more of the instantly instantly-disclosed polypeptides or peptides, or may be an additional peptide or polypeptide of interest. In aspects, a concatemeric peptide is composed of 3 or more, 4 or more, 5 or more 6 or more 7 or more, 8 or more, 9 or more of the instantly-disclosed peptides or polypeptides. In other aspects, the concatemeric peptides or polypeptides include 1000 or more, 1000 or less, 900 or less, 500 or less, 100 or less, 75 or less, 50 or less, 40 or less, 30 or less, 20 or less or 100 or less peptide epitopes. In yet other embodiments, a concatemeric peptide has 3-100, 5-100, 10-100, 15-100, 20-100, 25-100, 30-100, 35-100, 40-100, 45-100, 50-100, 55-100, 60-100, 65-100, 70-100, 75-100, 80-100, 90-100, 5-50, 10-50, 15-50, 20-50, 25-50, 30-50, 35-50, 40-50, 45-50, 100-150, 100-200, 100-300, 100-400, 100-500, 50-500, 50-800, 50-1,000, or 100-1,000 of the instantly-disclosed peptides or polypeptides linked, fused, or joined together. Each peptide or polypeptide of the concatemeric polypeptide may optionally have one or more linkers, which may optionally be cleavage sensitive sites, adjacent to their N and/or C terminal end. Such suitable linkers and cleavage sensitive sites, including AAY cleavage motifs or a poly GS linker which may be include on the N terminus of the C-terminal element, are known in the art. In such a concatemeric peptide, two or more of the peptide epitopes may have a linker, which may act as a cleavage sensitive site, between them. Alternatively two or more of the peptide epitopes may be connected directly to one another or through a linker that is not a cleavage sensitive site. In aspects, such linker is antigenically neutral, and the liker is preferably less than the length of a peptidyl backbone of 9 amino acids linearly arranged. In aspects, linker length is the length of a peptidyl backbone of between 2 and 8 amino acids, linearly arranged. In aspects, the spacer is unable to hydrogen bond in any spatially distinct manner to other distinct elements of the enhancing hybrid peptide.

In aspects, and with respect to antigenically neutral linker elements, various chemical groups may be incorporated as linkers instead of amino acids. Examples are described in U.S. Pat. No. 5,910,300, the contents of which are incorporated herein by reference. In apsects, a linker may be comprised of an aliphatic chain optimally interrupted by heteroatoms, for example a C₂-C₆alkylene, or ═N—(CH₂)_2-6—N═. Alternatively, a spacer may be composed of alternating units, for example of hydrophobic, lipophilic, aliphatic and aryl-aliphatic sequences, optionally interrupted by heteroatoms such as O, N, or S. Such components of a spacer are preferably chosen from the following classes of compounds: sterols, alkyl alcohols, polyglycerides with varying alkyl functions, alkyl-phenols, alkyl-amines, amides, hydroxyphobic polyoxyalkylenes, and the like. Other examples are hydrophobic polyanhydrides, polyorthoesters, polyphosphazenes, polyhydroxy acids, polycaprolactones, polylactic, polyglycolic polyhydroxy-butyric acids. A linker may also contain repeating short aliphatic chains, such as polypropylene, isopropylene, butylene, isobutylene, pentamethlyene, and the like, separated by oxygen atoms.

Additional peptidyl sequences which can be used in as possible linkers are described in U.S. Pat. No. 5,856,456, the contents of which are incorporated herein by reference. In one embodiment, a linker has a chemical group incorporated within which is subject to cleavage. Without limitation, such a chemical group may be designed for cleavage catalyzed by a protease, by a chemical group, or by a catalytic monoclonal antibody. In the case of a protease-sensitive chemical group, tryptic targets (two amino acids with cationic side chains), chymotryptic targets (with a hydrophobic side chain), and cathepsin sensitivity (B, D or S) are favored. The term ‘tryptic target’ is used herein to describe sequences of amino acids which are recognized by trypsin and trypsin-like enzymes. The term chymotryptic target' is used herein to describe sequences of amino acids which are recognized by chymotrypsin and chymotrypsin-like enzymes. In addition, chemical targets of catalytic monoclonal antibodies, and other chemically cleaved groups are well known to persons skilled in the art of peptide synthesis, enzymatic catalysis, and organic chemistry in general, and can be designed into the hybrid structure and synthesized, using routine experimental methods.

In aspects, a concatemeric polypeptide of the instant disclosure is produced using the EpiAssembler System (EpiVax). The EpiAssembler system is useful for assembling overlapping epitopes to Immunogenic Consensus Sequences (ICS). EpiAssembler is an algorithm that optimizes the balance between pathogen and population coverage. EpiAssembler uses the information from the sequences produced by Conservatrix and EpiMatrix to form highly immunogenic consensus sequences. In aspects, the concatemeric peptides of the instant disclosure include those of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 (as well as nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such concatemeric peptides). In aspects, the concatemeric peptides of the instant disclosure include one or more of SEQ ID NOS: 1677-1681 and 2641-2646 (as well as nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such concatemeric peptides). In aspects, the concatemeric peptides of the instant disclosure include one or more of SEQ ID NOS: 2723-2734 (as well as nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such concatemeric peptides). In aspects, the concatemeric peptides of the instant disclosure include one or more of SEQ ID NOS: 2639-2640 (as well as nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such concatemeric peptides). In aspects, the concatemeric peptides of the instant disclosure include one or more of SEQ ID NOS: 1685-1692 (as well as nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such concatemeric peptides). In aspects, the concatemeric peptides of the instant disclosure include one or more of SEQ ID NOS: 2593-2604 (as well as nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such concatemeric peptides). In aspects, the concatemeric peptides of the instant disclosure include one or more of SEQ ID NOS: 2719-2722 (as well as nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such concatemeric peptides). In aspects, the present disclosure provides a concatemeric polypeptide with at least 60%, 70%, 80%, 90%, or 95% homology to each of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734. In aspects, the present disclosure provides a concatemeric polypeptide having anti-SARS-CoV-2 activity, said polyeptide having at least 60%, 70%, 80%, 90%, or 95% homology to each of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734. In aspects, the present disclosure provides concatemeric polypeptides with at least 60%, 70%, 80%, 90%, or 95% homology to those of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734. As previously described, anti-SARS-CoV-2 activity means that the instantly-disclosed therapeutic T-cell epitope compounds and compositions are, in aspects: capable of stimulating, inducing, and/or expanding an immune response to SARS-CoV-2 (e.g., a cellular (CD4+ and/or CD8+ T-cell response) or humoral immune response to SARS-CoV-2) and/or associated diseases in a subject; capable of stimulating, inducing, and/or expanding a SARS-CoV-2-specific IFNγresponse (e.g., by lymphocytes such as PMBC, or effector CD4+ and/or CD8+ T-cells), capable of inhibiting SARS-CoV-2 viral replication or infectivity, and/or capable of inducing immunity against SARS-CoV-2. In aspects, a T-cell epitope compound or composition of the present disclosure having anti-SARS-CoV-2 activity will reduce the disease symptoms resulting from SARS-CoV-2 challenge by at least about 5% to about 50%, at least about 10% to about 60%, at least about 30% to about 70%, at least about 40% to about 80%, or at least about 50% to about 90% or greater, including any value or range therebetween. Again, anti-SARS-CoV-2 activity can be determined by various experiments and assays as known to those of skill in the art, including methods such as by antibody titrations of sera, e.g., by ELISA and/or seroneutralization assay analysis and/or by vaccination challenge evaluation, including use of experiments and assays as disclosed in the Examples herein.

In aspects, the concatemeric polypeptides of the instant disclosure can be isolated, recombinant, and/or synthetic. In aspects, the concatemeric peptides or polypeptides can be either in neutral (uncharged) or salt forms, and may be either free of or include modifications such as glycosylation, side chain oxidation, or phosphorylation. In aspects, the concatemeric peptides or polypeptides of the instant disclosure can be capped with an n-terminal acetyl and/or c-terminal amino group.

In aspects, one or more peptides or polypeptides of the instant disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692; as well as the concatemeric polypeptides disclosed herein, including SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide. As previously described, with respect to the one or more T-cell epitopes of the instant disclosure, the term “heterologous polypeptide” is intended to mean that the one or more T-cell epitopes of the instant disclosure are heterologous to, or not included naturally, in the heterologous polypeptide. In aspects, a heterologous polypeptide may include, but are not limited to, e.g. monoclonal antibody, polyclonal antibody, mouse antibody, human antibody, humanized antibody, mono specific antibody, bispecific antibody, glycosylated antibody, Fc-modified antibody, or antibody-drug conjugates; an antibody of different class or subclass (e.g., IgG (e.g., IgG1, IgG2, IgG3, IgG4), IgM, IgA, IgD or IgE molecules) or antigen-specific antibody fragments thereof (including, but not limited to, a Fab, F(ab′)₂, Fv, disulfide linked Fv, scFv, single domain antibody, closed conformation multispecific antibody, disulfide-linked scFv, diabody)). In aspects, one or more of the instantly-disclosed polypeptides may be inserted into the heterologous polypeptide (e.g., through recombinant techniques, mutagenesis, or other known means in the art), may be added to the C-terminus (with or without the use of linkers, as is known in the art), and/or added to the N-terminus (with or without the use of linkers, as is known in the art) of the heterologous polypeptide. In aspects, one or more of the instantly-disclosed polypeptides may be inserted into or replace amino acids in a Fc domain as disclosed in U.S. Pat. Nos. 7,442,778, 7,645,861, 7,655,764, 7,655,765, and/or 7,750,128 (each of which are herein incorporated by reference in their entirety). For example, protein engineering by mutagenesis can be performed using site-directed mutagenesis techniques, or other mutagenesis techniques known in the art (see e.g., James A. Brannigan and Anthony J. Wilkinson., 2002, Protein engineering 20 years on. Nature Reviews Molecular Cell Biology 3, 964-970; Turanli-Yildiz B. et al., 2012, Protein Engineering Methods and Applications, intechopen.com, which are herein incorporated by reference in their entirety). In aspects, chimeric or fusion polypeptides comprise one or more of the instantly-disclosed polypeptides of the present disclosure operatively linked to a heterologous polypeptide. “Operatively linked” indicates that the one or more of the instantly-disclosed polypeptides and the heterologous protein are fused in-frame or chemically linked or otherwise bound. For example, in aspects, the one or more of the instantly-disclosed polypeptides may be covalently bound to one or more internal conjugation site(s) in an Fc domain as disclosed in U.S. Pat. Nos. 8,008,453, 9,114,175, and/or 10,188,740 (each of which are herein incorporated by reference in their entirety). In aspects, the one or more peptides or polypeptides of the instant disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) may be joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide as a whole, although it may be made up from a joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted amino acid sequence, together with flanking amino acids of the heterologous polypeptide. In aspects, the present disclosure is directed to polypeptide (which, in aspects, may be an isolated, synthetic, or recombinant) having a sequence comprising one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), wherein said one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 is not naturally included in the polypeptide and/or said one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 is not located at its natural position in the polypeptide. For example, in aspects, the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure may be inserted into a SARS-CoV-2 sequence in which the SARS-CoV-2 sequence does not include the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure (e.g., the SARS-CoV-2 sequence does not include, or is mutated to not include, the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure) or the one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure is inserted into a SARS-CoV-2 sequence but not at its natural position. In aspects, the one or more peptides or polypeptides of the instant disclosure can be joined or linked to (e.g., fused in-frame, chemically-linked, or otherwise bound) to a small molecule (e.g., albumin or other known carriers and proteins), drug, or drag fragment, for example but not limited to, a drug or drug fragment that is binds with high affinity to defined HLAs.

As used herein, two polypeptides (or a region of the polypeptides) are substantially homologous or identical when the amino acid sequences are at least about 45-55%, typically at least about 70-75%, more typically at least about 80-85%, more typically greater than about 90%, and more typically greater than 95% or more homologous or identical. To determine the percent homology or identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the sequence of one polypeptide or nucleic acid molecule for optimal alignment with the other polypeptide or nucleic acid molecule). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in one sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the other sequence, then the molecules are homologous at that position. As is known in the art, the percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. Sequence homology for polypeptides is typically measured using sequence analysis software. As used herein, amino acid or nucleic acid “homology” is equivalent to amino acid or nucleic acid “identity”. In aspects, the percent homology between the two sequences is a function of the number of identical positions shared by the sequences (e.g., percent homology equals the number of identical positions/total number of positions x 100).

In aspects, the present disclosure also encompasses polypeptides having a lower degree of identity but having sufficient similarity so as to perform one or more of the same functions performed by a polypeptide of the instant disclosure (e.g., a polypeptide having a sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692; concatemeric peptides as disclosed herein, including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734),. Similarity is determined by conserved amino acid substitution. Such substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of like characteristics. Conservative substitutions are likely to be phenotypically silent. Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu, Met, and Ile; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gln, exchange of the basic residues His, Lys and Arg and replacements among the aromatic residues Trp, Phe and Tyr. Guidance concerning which amino acid changes are likely to be phenotypically silent are found (Bowie J U et al., (1990), Science, 247(4948):130610, which is herein incorporated by reference in its entirety). In aspects, a variant polypeptide can differ in amino acid sequence by one or more substitutions, deletions, insertions, inversions, fusions, and truncations or a combination of any of these. Variant polypeptides can be fully functional (e.g., retain MHC binding propensity and/or TCR specificity, and/or retain anti-SARS-CoV-2 activity) or can lack function in one or more activities. Fully functional variants typically contain only conservative variation or variation in non-critical residues or in non-critical regions; in this case, typically MHC contact residues provided MHC binding is preserved. Functional variants can also contain substitution of similar amino acids that result in no change or an insignificant change in function (e.g., retain MHC binding propensity and/or TCR specificity, and/or retain anti-SARS-CoV-2 activity). Alternatively, such substitutions can positively or negatively affect function to some degree. Non-functional variants typically contain one or more non-conservative amino acid substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical region; in this case, typically TCR contact residues. In aspects, a variant and/or a homologous polypeptide retains the desired anti-SARS-CoV-2 activity of the instant disclsoure (e.g.: capable of stimulating, inducing, and/or expanding an immune response to SARS-CoV-2 (e.g., a cellular (CD4+ and/or CD8+ T-cell response) or humoral immune response to SARS-CoV-2) and/or associated diseases in a subject; capable of stimulating, inducing, and/or expanding a SARS-CoV-2-specific IFNγresponse (e.g., by lymphocytes such as PMBC, or effector CD4+ and/or CD8+ T-cells); and/or capable of inhibiting SARS-CoV-2 viral replication or infectivity, and/or capable of inducing immunity against SARS-CoV-2). Alternatively, such substitutions can positively or negatively affect function to some degree. Non-functional variants typically contain one or more non-conservative amino acid substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical region; in this case, typically TCR contact residues. In aspects, funcational variants of a polypeptide having a sequence (or a core sequence) comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein may contain one or more conservative substitutions, and in aspects one or more non-conservative substitutions, at amino acid residues which are not believed to be essential for functioning (with amino acid residues considered being essential for functioning, including, e.g., retain WIC binding propensity and/or TCR specificity, and/or retain anti-SARS-CoV-2 activity) of the instantly-disclosed polypeptides. For example, in aspects, a variant polypeptide having a sequence (or a core sequence) comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, or fragments thereof as disclosed herein, or a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 or a fragment thereof as disclosed herein, may contain one or more conservative substitutions (and in aspects, a nonconservative subsitution) in one or more HLA contact residues, provided HLA binding is preserved. MHC binding assays are well known in the art. In aspects, such assays may include the testing of binding affinity with respect to WIC class I and class II alleles in in vitro binding assays, with such binding assays as are known in the art. Exampels include, e.g., the soluble binding assays as disclosed in U.S. Pat. No. 7,884,184 or PCT/US2020/020089, both of which are herein incorporated by reference in their entireties. Additionally, in aspects, a fully functional variant polypeptide having a sequence (or a core sequence) comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein do not contain mutations at one or more critical residues or regions, such as TCR contact residues.

In aspects, the TCR-binding epitope (which can be referred to as TCR binding residues, TCR facing epitope, TCR facing residues, or TCR contacts) for a 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) that bind to a MHC class II molecule are at position 2, 3, 5, 7, and 8 of the identified epitope, while the MHC-binding agretope (which can be referred to as MHC contacts, MHC facing residues, WIC-binding residues, or MHC-binding face) for a 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) that bind to a WIC class II molecule are at position 1, 4, 6, and 9, both as counted from the amino terminal.

In aspects, the TCR binding epitope for a 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 or as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) that binds to a MHC class I molecule are at position 4, 5, 6, 7, and 8 of the identified epitope, while the MHC binding agretope for a 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) that bind to a WIC class I molecule are at position 1, 2, 3, and 9, both as counted from the amino terminal.

In aspects, the TCR binding epitope for a 10-mer identified epitope that bind to a MHC class I molecule are at position 4, 5, 6, 7, 8, and 9 of the identified epitope (which may be a 10-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 10-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734), while the MHC binding agretope for a 10-mer identified epitope (which may be a 10-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 10-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) that bind to a MHC class I molecule are at position 1, 2, 3, 9, and 10, both as counted from the amino terminal.

In aspects, the TCR-binding epitope for a 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) that bind to a MHC class II molecule are at any combination of residues at positions 2, 3, 5, 7, and 8 (e.g., but not limited to, positions 3, 5, 7 and 8; positions 2, 5, 7, and 8; positions 2, 3, 5, and 7, etc.) of the identified epitope, while the MHC binding agretope for a 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) is the complementary face to the TCR facing residues, both as counted from the amino terminal.

In aspects, the TCR binding epitope for 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) that bind to a MHC class I molecule are at positions 4, 5, 6, 7, and 8; 1, 4, 5, 6, 7 and 8; or 1, 3, 4, 5, 6, 7, and 8 of the identified epitope, while the MHC binding agretope for a 9-mer identified epitope (which may be a 9-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 9-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) is the complementary face to the TCR facing residues, both as counted from the amino terminal.

In aspects, the TCR-binding epitope for a 10-mer identified epitope (which may be a 10-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 10-mer fragment of a concatemeric peptide of SEQ ID NOS:

1677-1692, 2593-2604, 2639-2646, and 2719-2734) that bind to a MHC class I molecule are at any combination of residues at positions 1, 3, 4, 5, 6, 7, 8, and 9 of the identified epitope, while the MHC binding agretope for a 10-mer identified epitope (which may be a 10-mer fragment of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 as disclosed herein or a 10-mer fragment of a concatemeric peptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734) is the complementary face to the TCR facing residues, both as counted from the amino terminal.

Based on the above, it should be understood that in apsects in which one or more 9-mers and/or 10-mer epitopes are contained within a longer polypeptide and are predicted to bind one or more Class I or Class II MHC molecules and are occurring in close proximity to each other in a naturally occurring sequence (e.g., wherein position 1 of each pair of binding 9-mers and/or 10-mers fall within, e.g., 3 amino acids of each other), such epitopes may be combined to form an epitope cluster. In a given cluster, any given amino acid may be, with respect to a given 9-mer epitope or 10-mer epitope, MHC facing and, with respect to another 9-mer epitope, TCR facing.

In aspects, the present disclosure also includes fragments of the instantly-disclosed polypeptides and concatemeric polypeptides. In aspects, the present disclosure also encompasses fragments of the variants of the instantly-disclosed polypeptides and concatemeric polypeptides as described herein. In aspects, as used herein, a fragment comprises at least about nine contiguous amino acids. In aspects, the present disclosure also encompasses fragments of the variants of the T-cell epitopes described herein. Useful fragments (and fragments of the variants of the polypeptides and concatemeric polypeptides described herein) include those that retain one or more of the biological activities, particularly: MHC binding propensity and/or TCR specificity, and/or anti-SARS-CoV-2 activity. Biologically active fragments are, for example, about 9, 10, 11, 12, 1, 14, 15, 16, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100 or more amino acids in length, including any value or range therebetween. Fragments can be discrete (not fused to other amino acids or polypeptides) or can be within a larger polypeptide. Several fragments can be comprised within a single larger polypeptide. In aspects, a fragment designed for expression in a host can have heterologous pre- and pro-polypeptide regions fused to the amino terminus of the polypeptide fragment and an additional region fused to the carboxyl terminus of the fragment.

In aspects, the instantly disclosed polypeptides and concatemeric polypeptides of the present disclosure can include allelic or sequence variants (“mutants”) or analogs thereof, or can include chemical modifications (e.g., pegylation, glycosylation). In aspects, a mutant retains the same function, particularly MHC binding propensity and/or TCR specificity, and/or anti-SARS-CoV-2 activity. In aspects, a mutant can provide for enhanced binding to MHC molecules. In aspects, a mutant can lead to enhanced binding to TCRs. In another instance, a mutant can lead to a decrease in binding to MHC molecules and/or TCRs. Also contemplated is a mutant that binds, but does not allow signaling via the TCR.

The manner of producing the polypeptides of the present disclosure will vary widely, depending upon the nature of the various elements comprising the molecule. For example, an isolated polypeptide can be purified from cells that naturally express it, purified from cells that have been altered to express it (recombinant), or synthesized using known protein synthesis methods. The synthetic procedures may be selected so as to be simple, provide for high yields, and allow for a highly purified stable product. For example, polypeptides of the instant disclosure can be produced either from a nucleic acid disclosed herein, or by the use of standard molecular biology techniques, such as recombinant techniques, mutagenesis, or other known means in the art. An isolated polypeptide can be purified from cells that naturally express it, purified from cells that have been altered to express it (recombinant), or synthesized using known protein synthesis techniques. In aspects, a polypeptide of the instant disclosure is produced by recombinant DNA or RNA techniques. In aspects, a polypeptide of the instant disclosure can be produced by expression of a recombinant nucleic acid of the instant disclosure in an appropriate host cell. For example, a nucleic acid molecule encoding the polypeptide is cloned into an expression cassette or expression vector, the expression cassette or expression vector introduced into a host cell and the polypeptide expressed in the host cell. The polypeptide can then be isolated from the cells by an appropriate purification scheme using standard protein purification techniques. Alternatively a polypeptide can be produced by a combination of ex vivo procedures, such as protease digestion and purification. Further, polypeptides of the instant disclosure can be produced using site-directed mutagenesis techniques, or other mutagenesis techniques known in the art (see e.g., James A. Brannigan and Anthony J. Wilkinson., 2002, Protein engineering 20 years on. Nature Reviews Molecular Cell Biology 3, 964-970; Turanli-Yildiz B. et al., 2012, Protein Engineering Methods and Applications, intechopen.com, which are herein incorporated by reference in their entirety).

In aspects, the present disclosure also provides chimeric or fusion polypeptide compositions. In aspects, the present disclosure is directed to a chimeric or fusion polypeptide composition (which in aspects may be isolated, synthetic, or recombinant) comprising one or more peptides, polypeptides, or concatemeric peptides of the present disclosure (e.g., one or more peptides or polypeptides of the present disclosure have a sequence, e.g. but not limited to, comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and similarly may have a sequence, e.g. but not limited to, comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734). In aspects, a chimeric or fusion polypeptide composition of the present disclosure comprises one or more peptides, polypeptides, and/or concatemeric peptides of the present disclosure joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide, such as an unrelated protein. As previously described, with respect to the one or more T-cell epitopes (e.g., peptide, polypeptides, or concatemeric peptides of the instant disclosure), the term “heterologous polypeptide” is intended to mean that the one or more T-cell epitopes of the instant disclosure are heterologous to, or not included naturally, in the heterologous polypeptide. In aspects, a heterologous polypeptide may include, but are not limited to, e.g. monoclonal antibody, polyclonal antibody, mouse antibody, human antibody, humanized antibody, mono specific antibody, bispecific antibody, glycosylated antibody, Fc-modified antibody, or antibody-drug conjugates; an antibody of different class or subclass (e.g., IgG (e.g., IgG1, IgG2, IgG3, IgG4), IgM, IgA, IgD or IgE molecules) or antigen-specific antibody fragments thereof (including, but not limited to, a Fab, F(ab′)2, Fv, disulfide linked Fv, scFv, single domain antibody, closed conformation multispecific antibody, disulfide-linked scFv, diabody)). In aspects, one or more of the instantly-disclosed peptides, polypeptides, or concatemeric peptides may be inserted into the heterologous polypeptide (e.g., through recombinant techniques, mutagenesis, or other known means in the art), may be added to the C-terminus (with or without the use of linkers, as is known in the art), and/or added to the N-terminus (with or without the use of linkers, as is known in the art) of the heterologous polypeptide. In aspects, one or more of the instantly-disclosed polypeptides may be inserted into or replace amino acids in a Fc domain as disclosed in U.S. Pat. Nos. 7,442,778, 7,645,861, 7,655,764, 7,655,765, and/or 7,750,128 (each of which are herein incorporated by reference in their entirety). For example, protein engineering by mutagenesis can be performed using site-directed mutagenesis techniques, or other mutagenesis techniques known in the art (see e.g., James A. Brannigan and Anthony J. Wilkinson., 2002, Protein engineering 20 years on. Nature Reviews Molecular Cell Biology 3, 964-970; Turanli-Yildiz B. et al., 2012, Protein Engineering Methods and Applications, intechopen.com, which are herein incorporated by reference in their entirety). In aspects, chimeric or fusion polypeptides comprise one or more of the instantly-disclosed peptides, polypeptides, or concatemeric peptides operatively linked to a heterologous polypeptide. “Operatively linked” indicates that the one or more of the instantly-disclosed peptides, polypeptides, or concatemeric peptides and the heterologous polypeptide are fused in-frame or chemically-linked or otherwise bound. For example, in aspects, the one or more of the instantly-disclosed polypeptides may be covalently bound to one or more internal conjugation site(s) in an Fc domain as disclosed in U.S. Pat. Nos. 8,008,453, 9,114,175, and/or 10,188,740 (each of which are herein incorporated by reference in their entirety). In aspects, the one or more peptides, polypeptides, or concatemeric peptides of the instant disclosure may be joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into a heterologous polypeptide as a whole, although it may be made up from a joined to, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted amino acid sequence, together with flanking amino acids of the heterologous polypeptide. In aspects, a chimeric or fusion polypeptide composition comprises a peptide, polypeptide, or concatemeric peptide of the instant disclosure wherein said one or more of peptides, polypeptides, or concatemeric peptides is not naturally included in the heterologous polypeptide and/or said one or more of peptides, polypeptides, or concatemeric peptides is not located at its natural position in the heterologous polypeptide. In aspects, the one or more of peptide or polypeptides of the present disclosure can be joined, linked to (e.g., fused in-frame, chemically-linked, or otherwise bound), and/or inserted into the heterologous polypeptide. In aspects, chimeric or fusion polypeptide compositions comprise one or more of the instantly-disclosed T-cell epitopes (e.g., peptides, polypeptides, or concatemeric peptides of the instant disclosure) operatively linked to a heterologous polypeptide having an amino acid sequence not substantially homologous to the T-cell epitope. In aspects, the chimeric or fusion polypeptide does not affect function of the T-cell epitope per se. For example, the fusion polypeptide can be a GST-fusion polypeptide in which the T-cell epitope sequences are fused to the C-terminus of the GST sequences. Other types of fusion polypeptides include, but are not limited to, enzymatic fusion polypeptides, for example beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions and Ig fusions. Such fusion polypeptides, particularly poly-His fusions or affinity tag fusions, can facilitate the purification of recombinant polypeptide. In certain host cells (e.g., mammalian host cells), expression and/or secretion of a polypeptide can be increased by using a heterologous signal sequence. Therefore, in aspects, the chimeric or fusion polypeptide contains a heterologous signal sequence at its N-terminus. In aspects of the above chimeric or fusion polypeptide compositions, the heterologous polypeptide or polypeptide comprises a biologically active molecule. In aspects, the biologically active molecule is selected from the group consisting of an immunogenic molecule, a T cell epitope, a viral protein, and a bacterial protein. In aspects, the one or more peptides, polypeptides, or concatemeric peptides of the instant disclosure can be joined or linked to (e.g., fused in-frame, chemically-linked, or otherwise bound) to a small molecule, drug, or drug fragment. For example, the one or more peptides, polypeptides, or concatemeric peptides of the instant disclosure can be joined or linked to (e.g., fused in-frame, chemically-linked, or otherwise bound) to an unrelated peptide or protein, a small molecule (e.g., albumin or other known carriers and proteins), drug, or drag fragment, for example but not limited to, a drug or drug fragment that is binds with high affinity to defined HLAs. In aspects of the above-described chimeric or fusion polypeptide compositions, the chimeric or fusion polypeptide compositions can be recombinant, isolated, and/or synthetic.

A chimeric or fusion polypeptide composition can be produced by standard recombinant DNA or RNA techniques as are known in the art. For example, DNA or RNA fragments coding for the different polypeptide sequences may be ligated together in-frame in accordance with conventional techniques. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, polymerase chain reaction (PCR) amplification of nucleic acid fragments can be carried out using anchor primers which give rise to complementary overhangs between two consecutive nucleic acid fragments which can subsequently be annealed and re-amplified to generate a chimeric nucleic acid sequence(Short Protocols in Molecular Biology: A Compendium of Methods from Current Protocols in Molecular Biology, (2^ND, 1992), FM Asubel et al. (eds), Green Publication Associates, New York, N.Y. (Publ), ISBN: 9780471566355, which is herein incorporated by reference in its entirety). Further, one or more peptides, polypeptides or concatemeric of the instant disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) can be inserted into a heterologous polypeptide or inserted into a non-naturally occurring position of a polypeptide through recombinant techniques, synthetic polymerization techniques, mutagenesis, or other standard techniques known in the art. For example, protein engineering by mutagenesis can be performed using site-directed mutagenesis techniques, or other mutagenesis techniques known in the art (see e.g., James A. Brannigan and Anthony J. Wilkinson., 2002, Protein engineering 20 years on. Nature Reviews Molecular Cell Biology 3, 964-970; Turanli-Yildiz B. et al., 2012, Protein Engineering Methods and Applications, intechopen.com, which are herein incorporated by reference in their entirety).

In aspects, the polypeptides, concatemeric polypeptides, and chimeric or fusion polypeptides can be purified to homogeneity or partially purified. It is understood, however, that preparations in which the T-cell epitope compounds and compositions are not purified to homogeneity are useful. The critical feature is that the preparation allows for the desired function of the composition, even in the presence of considerable amounts of other components. Thus, the present disclosure encompasses various degrees of purity. In one embodiment, the language “substantially free of cellular material” includes preparations of the polypeptides, concatemeric polypeptides, and chimeric or fusion polypeptides having less than about 30% (by dry weight) other proteins (e.g., contaminating protein), less than about 20% other proteins, less than about 10% other proteins, less than about 5% other proteins, less than about 4% other proteins, less than about 3% other proteins, less than about 2% other proteins, less than about 1% other proteins, or any value or range therebetween.

In aspects, when a polypeptide, concatemeric polypeptide, and chimeric or fusion polypeptide of the present disclosure is recombinantly produced, the composition can also be substantially free of culture medium, for example, culture medium represents less than about 20%, less than about 10%, or less than about 5% of the volume of the the polypeptides, concatemeric polypeptides, and chimeric or fusion polypeptides preparation. The language “substantially free of chemical precursors or other chemicals” includes preparations of the the polypeptides, concatemeric polypeptides, and chimeric or fusion polypeptides in which it is separated from chemical precursors or other chemicals that are involved in the T-cell epitope's synthesis. The language “substantially free of chemical precursors or other chemicals” can include, for example, preparations of the the polypeptides, concatemeric polypeptides, and chimeric or fusion polypeptides having less than about 30% (by dry weight) chemical precursors or other chemicals, less than about 20% chemical precursors or other chemicals, less than about 10% chemical precursors or other chemicals, less than about 5% chemical precursors or other chemicals, less than about 4% chemical precursors or other chemicals, less than about 3% chemical precursors or other chemicals, less than about 2% chemical precursors or other chemicals, or less than about 1% chemical precursors or other chemicals.

In aspects, the present disclosure also includes pharmaceutically acceptable salts of the T-cell epitope compounds and compositions (including one or more of e.g., peptides or polypeptides as disclosed herein; concatemeric peptides as disclosed herein; chimeric or fusion polypeptide compositions as disclosed herein (which in aspects may be isolated, synthetic, and/or recombinant). “Pharmaceutically acceptable salt” means a salt that is pharmaceutically acceptable and that possesses the desired pharmacological activity of the parent peptide or polypeptide (e.g., peptides, polypeptides, concatemeric peptides, and/or chimeric or fusion polypeptides as disclosed herein). As used herein, “pharmaceutically acceptable salt” refers to derivative of the instantly-disclosed polypeptides, concatemeric polypeptides, and/or chimeric or fusion polypeptides, wherein such compounds are modified by making acid or base salts thereof. Examples of pharmaceutically acceptable salts include, but are not limited to, mineral or organic acid salts of basic residues such as amines, alkali or organic salts of acidic residues such as carboxylic acids, and the like. The pharmaceutically acceptable salts include the conventional non-toxic salts or the quaternary ammonium salts of the parent compound formed, for example, from non-toxic inorganic or organic acids. For example, such conventional non-toxic salts include, but are not limited to, those derived from inorganic and organic acids selected from 2-acetoxybenzoic, 2-hydroxyethane sulfonic, acetic, ascorbic, benzene sulfonic, benzoic, bicarbonic, carbonic, citric, edetic, ethane disulfonic, 1,2-ethane sulfonic, fumaric, glucoheptonic, gluconic, glutamic, glycolic, glycollyarsanilic, hexylresorcinic, hydrabamic, hydrobromic, hydrochloric, hydroiodic, hydroxymaleic, hydroxynaphthoic, isethionic, lactic, lactobionic, lauryl sulfonic, maleic, malic, mandelic, methane sulfonic, napsylic, nitric, oxalic, pamoic, pantothenic, phenylacetic, phosphoric, polygalacturonic, propionic, salicyclic, stearic, subacetic, succinic, sulfamic, sulfanilic, sulfuric, tannic, tartaric, toluene sulfonic, and the commonly occurring amine acids, e.g., glycine, alanine, phenylalanine, arginine, etc.

Nucleic Acids

In aspects, the present disclosure also provides for nucleic acids (e.g., DNAs (including cDNA, RNAs (such as, but limited to mRNA), vectors, viruses, or hybrids thereof, all of which may be isolated, synthetic, or recombinant) that encode in whole or in part one or more one or more peptides, polypeptides, concatemeric peptides, and/or chimeric or fusion polypeptides of the present disclosure as described herein. In aspects, the nucleic acid further comprises, or is contained within, an expression cassette, a plasmid, and expression vector, or recombinant virus, wherein optionally the nucleic acid, or the expression cassette, plasmid, expression vector, or recombinant virus is contained within a cell, optionally a human cell or a non-human cell, and optionally the cell is transformed with the nucleic acid, or the expression cassette, plasmid, expression vector, or recombinant virus. In aspects, cells are transduced, transfected, or otherwise engineered to contain within one or more of e.g., polypeptides of the present disclosure; isolated, synthetic, or recombinant nucleic acids, expression cassettes, plasmids, expression vectors, or recombinant viruses as disclosed herein; and/or isolated, synthetic, or recombinant chimeric or fusion polypeptide compositions as disclosed herein. In aspects, the cell can be a mammalian cell, bacterial cell, insect cell, or yeast cell. In aspects, the nucleic acid molecules of the present disclosure can be inserted into vectors and used, for example, as expression vectors or gene therapy vectors. Gene therapy vectors can be delivered to a subject by, e.g., intravenous injection, local administration (U.S. Pat. No. 5,328,470) or by stereotactic injection (Chen S H et al., (1994), Proc Natl Acad Sci USA, 91(8):3054-7, which are herein incorporated by reference in their entirety). Similarly, the nucleic acid molecules of the present disclosure can be inserted into plasmids. The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells that produce the gene delivery system. Such pharmaceutical compositions can be included in a container, pack, or dispenser together with instructions for administration. In aspects of the above nucleic acids (e.g., DNAs, RNAs, vectors, viruses, or hybrids thereof) that encode in whole or in part at least one or more peptides, polypeptides, concatemeric peptides, and/or chimeric or fusion polypeptides as described herein, the nucleic acids encode one or more peptides or polypeptides of the instant disclosure as described above (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692; as well as the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734). In aspects, the present disclosure is directed to a vector comprising a nucleic acid of the present disclosure encoding one or more polypeptides of the present disclosure or chimeric or fusion polypeptide composition of the present disclosure. In aspects, the present disclosure is directed to a cell comprising a vector of the present disclosure. In aspects, the cell can be a mammalian cell, bacterial cell, insect cell, or yeast cell.

The nucleic acid of the instant disclosure may be DNAs (including but not limited to cDNA) or RNAs (including but not limited to mRNA), single- or double-stranded. The nucleic acid is typically DNA or RNA (including mRNA). The nucleic acid may be produced by techniques well known in the art, such as synthesis, or cloning, or amplification of the sequence encoding the immunogenic polypeptide; synthesis, or cloning, or amplification of the sequence encoding the cell membrane addressing sequence; ligation of the sequences and their cloning/amplification in appropriate vectors and cells. The nucleic acids provided herein (whether RNAs, DNAs, vectors, viruses or hybrids thereof) that encode in whole or in part one or more peptides, polypeptides, concatemeric peptides, and/or chimeric or fusion polypeptides as described herein can be isolated from a variety of sources, genetically engineered, amplified, synthetically produced, and/or expressed/generated recombinantly. Recombinant polypeptides generated from these nucleic acids can be individually isolated or cloned and tested for a desired activity. Any recombinant expression system can be used, including e.g. in vitro, bacterial, fungal, mammalian, yeast, insect or plant cell expression systems. In aspects nucleic acids provided herein are synthesized in vitro by well-known chemical synthesis techniques (as described in, e.g., Adams (1983) J. Am. Chem. Soc. 105:661; Belousov (1997) Nucleic Acids Res. 25:3440-3444; Frenkel (1995) Free Radic. Biol. Med. 19:373-380; Blommers (1994) Biochemistry 33:7886-7896; Narang (1979) Meth. Enzymol. 68:90; Brown (1979) Meth. Enzymol. 68:109; Beaucage (1981) Tetra. Lett. 22:1859; U.S. Pat. No. 4,458,066, all of which are herein incorporated by reference in their entirety). Further, techniques for the manipulation of nucleic acids provided herein, such as, e.g., subcloning, labeling probes (e.g., random-primer labeling using Klenow polymerase, nick translation, amplification), sequencing, hybridization and the like are well described in the scientific and patent literature (see, e.g., Sambrook, ed., MOLECULAR CLONING: A LABORATORY MANUAL (2ND ED.), Vols. 1-3, Cold Spring Harbor Laboratory, (1989); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Ausubel, ed. John Wiley & Sons, Inc., New York (1997); LABORATORY TECHNIQUES IN BIOCHEMISTRY AND MOLECULAR BIOLOGY: HYBRIDIZATION WITH NUCLEIC ACID PROBES, Part I. Theory and Nucleic Acid Preparation, Tijssen, ed. Elsevier, N.Y. (1993), all of which are herein incorporated by reference in their entirety).

A further object of the invention relates to a nucleic acid molecule encoding one or more peptides, polypeptides, concatemeric peptides, and/or chimeric or fusion polypeptides as described herein. The nucleic acid may be used to produce the one or more peptides, polypeptides, concatemeric peptides, and/or chimeric or fusion polypeptides as described herein in vitro or in vivo, or to produce cells expressing the polypeptide on their surface, or to produce vaccines wherein the active agent is the nucleic acid or a vector containing the nucleic acid. The nucleic acid may be, e.g., DNA, cDNA, PNA, CNA, RNA, either single- and/or double-stranded, or native or stabilized forms of polynucleotides as are known in the art.

As previously mentioned, the nucleic acid molecules according to the present disclosure may be provided in the form of a nucleic acid molecule per se such as naked nucleic acid molecules; a plasmid, a vector; virus or host cell, etc., either from prokaryotic or eukaryotic origin. Vectors include expression vectors that contain a nucleic acid molecule of the invention. An expression vector capable of expressing a polypeptide can be prepared. Expression vectors for different cell types are well known in the art and can be selected without undue experimentation. Generally, the (e.g., cDNA, or RNA, including mRNA) is inserted into an expression vector, such as a plasmid, in proper orientation and correct reading frame for expression. If necessary, the DNA (e.g., cDNA, or RNA, including mRNA) may be linked to the appropriate transcriptional and translational regulatory control nucleotide sequences recognized by the desired host (e.g., bacteria), although such controls are generally available in the expression vector. The vector is then introduced into the host bacteria for cloning using standard techniques. The vectors of the present invention may, for example, comprise a transcriptional promoter, and/or a transcriptional terminator, wherein the promoter is operably linked with the nucleic acid molecule, and wherein the nucleic acid molecule is operably linked with the transcription terminator. One or more peptides or polypeptides of the present disclosure may be encoded by a single expression vector. Such nucleic acid molecules may act as vehicles for delivering peptides/polypeptides to the subject in need thereof, in vivo, in the form of, e.g., DNA/RNA vaccines.

In aspects, the vector may be a viral vector comprising a nucleic acid as defined above. The viral vector may be derived from different types of viruses, such as, Swinepox, Fowlpox, Pseudorabies, Aujezky's virus, salmonella, vaccinia virus, BHV (Bovine Herpes Virus), HVT (Herpes Virus of Turkey), adenovirus, TGEV (Transmissible Gastroenteritidis Coronavirus), Erythrovirus, and SIV (Simian Immunodeficiency Virus). Other expression systems and vectors may be used as well, such as plasmids that replicate and/or integrate in yeast cells.

The instant disclosure also relates to a method for preparing a peptide, polypeptide, concatemeric peptide, and/or chimeric or fusion polypeptide of the instant disclosure, the method comprising culturing a host cell containing a nucleic acid or vector as defined above under conditions suitable for expression of the nucleic acid and recovering the polypeptide. As indicated above, the proteins and peptides may be purified according to techniques known per se in the art.

Pharmaceutical Compositions and Formulations

In aspects, the T-cell epitope compositions of the present disclosure (including one or more of e.g., polypeptides as disclosed herein; concatemeric peptides as disclosed herein; chimeric of fusion polypeptide compositions as disclosed herein; nucleic acids as disclosed herein, including nucleic acids encoding such peptides, polypeptides, concatemeric peptides, or chimeric of fusion polypeptide compositions as disclosed herein; expression cassettes, plasmids, expression vectors, recombinant viruses, or cells as disclosed herein, and vaccines as disclosed herein; hereafter referred to as “T-cell epitope compounds and compositions of the present disclosure”) may be comprised in a pharmaceutical composition or formulation. In aspects, the instantly-disclosed pharmaceutical compositions or formulations generally comprise a T-cell epitope composition of the present disclosure and a pharmaceutically-acceptable carrier and/or excipient. In aspects, a pharmaceutical composition or formulation comprises an adjuvant. In aspects, said pharmaceutical compositions are suitable for administration. Pharmaceutically-acceptable carriers and/or excipients are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions for administering the instantly-disclosed T-cell epitope compositions (see, e.g., Remington's Pharmaceutical Sciences, (18^THEd, 1990), Mack Publishing Co., Easton, Pa. Publ)). In aspects, the pharmaceutical compositions are generally formulated as sterile, substantially isotonic, and in full compliance with all Good Manufacturing Practice (GMP) regulations of the U.S. Food and Drug Administration. Pharmaceutical compositions as disclosed herein are able for use in stimulating, inducing, and/or expanding an immune response to a SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, in a subject, and can be used in methods of treating and/or preventing SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, in a subject, such as a human.

The terms “pharmaceutically-acceptable,” “physiologically-tolerable,” and grammatical variations thereof, as they refer to compositions, carriers, excipients, and reagents, are used interchangeably and represent that the materials are capable of administration to or upon a subject without the production of undesirable physiological effects to a degree that would prohibit administration of the composition. For example, “pharmaceutically-acceptable excipient” means, for example, an excipient that is useful in preparing a pharmaceutical composition that is generally safe, non-toxic, and desirable, and includes excipients that are acceptable for veterinary use as well as for human pharmaceutical use. Such excipients can be solid, liquid, semisolid, or, in the case of an aerosol composition, gaseous. A person of ordinary skill in the art would be able to determine the appropriate timing, sequence and dosages of administration for particular T-cell epitope compositions of the present disclosure.

In aspects, preferred examples of such carriers or diluents include, but are not limited to, water, saline, Ringer's solutions, dextrose solution, and 5% human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils can also be used. The use of such media and compounds for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or compound is incompatible with the T-cell epitope compounds and compositions of the present disclosure and as previously described above, use thereof in the compositions is contemplated. Supplementary active compounds can also be incorporated into the compositions.

In aspects, T-cell epitope compounds and compositions of the present disclosure are formulated to be compatible with its intended route of administration. The T-cell epitope compounds and compositions of the present disclosure can be administered by parenteral, topical, intravenous, oral, subcutaneous, intraarterial, intradermal, transdermal, rectal, intracranial, intrathecal, intraperitoneal, intranasal; vaginally; intramuscular route or as inhalants. In aspects, T-cell epitope compounds and compositions of the present disclosure can be injected directly into a particular tissue where deposits have accumulated, e.g., intracranial injection. In other aspects, intramuscular injection or intravenous infusion may be used for administration of T-cell epitope compounds and compositions of the present disclosure. In some methods, T-cell epitope compounds and compositions of the present disclosure are administered as a sustained release composition or device, such as but not limited to a MedipadTM device. In aspects, T-cell epitope compounds and compositions of the present disclosure are administered intradermally, e.g., by using a commercial needle-free high-pressure device such as Pulse NeedleFree technology (Pulse 50TM Micro Dose Injection System, Pulse NeedleFree Systems; Lenexa, Kans., USA). In aspects, said commercial needle-free high-pressure device (e.g., Pulse NeedleFree technology) confers one or more of the following benefits: non-invasive, reduces tissue trauma, reduces pain, requires a smaller opening in the dermal layer to deposit the composition in the subject (e.g., only requires a micro skin opening), instant dispersion of the composition, better absorption of the composition, greater dermal exposure to the composition, and/or reduced risk of sharps injury.

In aspects, T-cell epitope compounds and compositions of the present disclosure can optionally be administered in combination with other agents that are at least partly effective in treating various medical conditions as described herein.

In aspects, solutions or suspensions used for parenteral, intradermal, or subcutaneous application can include, but are not limited to, the following components: a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial compounds such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfate; chelating compounds such as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, and compounds for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. Examples of excipients can include starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, water, ethanol, DMSO, glycol, propylene, dried skim milk, and the like. The composition can also contain pH buffering reagents, and wetting or emulsifying agents.

In aspects, pharmaceutical compositions or formulations suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological saline, bacteriostatic water, Cremophor ELTM (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition is sterile and should be fluid to the extent that easy syringeability exists. It is stable under the conditions of manufacture and storage and is preserved against the contaminating action of microorganisms such as bacteria and fungi. In aspects formulations including a T-cell epitope compound or composition of the present disclosure may include aggregates, fragments, breakdown products and post-translational modifications, to the extent these impurities bind HLA and present the same TCR face to cognate T cells they are expected to function in a similar fashion to pure T-cell epitopes. The carrier can be a solvent or dispersion medium containing, e.g., water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be maintained, e.g., by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial and antifungal compounds, e.g., parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic compounds, e.g., sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition a compound that delays absorption, e.g., aluminum monostearate and gelatin.

In aspects, sterile injectable solutions (e.g., sterile solutions suitable for injectable and/or intradermal needle-free high-pressure device) can be prepared by incorporating the T-cell epitope compounds and compositions of the present disclosure 4—in the required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the binding agent into a sterile vehicle that contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof. Further, T-cell epitope compounds and compositions of the present disclosure can be administered in the form of a depot injection or implant preparation that can be formulated in such a manner as to permit a sustained or pulsatile release of the active ingredient.

In aspects, oral compositions generally include an inert diluent or an edible carrier and can be enclosed in gelatin capsules or compressed into tablets. In aspects, for the purpose of oral therapeutic administration, the binding agent can be incorporated with excipients and used in the form of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and expectorated or swallowed. Pharmaceutically compatible binding compounds, and/or adjuvant materials can be included as part of the composition. In aspects, the tablets, pills, capsules, troches and the like can contain any of the following ingredients, or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a disintegrating compound such as alginic acid, Primogel or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening compound such as sucrose or saccharin; or a flavoring compound such as peppermint, methyl salicylate or orange flavoring.

For administration by inhalation, T-cell epitope compounds and compositions of the present disclosure can be delivered in the form of an aerosol spray from pressured container or dispenser that contains a suitable propellant, e.g., a gas such as carbon dioxide, or a nebulizer.

In aspects, systemic administration of the T-cell epitope compounds and compositions of the present disclosure can also be by transmucosal or transdermal means. For transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art, and include, e.g., for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays or suppositories. For transdermal administration, the T-cell epitope compounds and compositions may be formulated into ointments, salves, gels, or creams and applied either topically or through transdermal patch technology as generally known in the art.

In aspects, the T-cell epitope compounds and compositions of the present disclosure can also be prepared in the form of suppositories (e.g., with conventional suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery.

In aspects, the T-cell epitope compounds and compositions of the present are prepared with carriers that protect the T-cell epitope compounds and compositions against rapid elimination from the body, such as a controlled-release formulation, including implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as, for example, ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. The materials can also be obtained commercially, e.g., from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically-acceptable carriers. These can be prepared according to methods known to those skilled in the art (U.S. Pat. No. 4,522,811, which is herein incorporated by reference in its entirety). In aspects, the T-cell epitope compounds and compositions of the present disclosure can be implanted within or linked to a biopolymer solid support that allows for the slow release of the T-cell epitope compounds and compositions to the desired site.

In aspects, it is especially advantageous to formulate oral or parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity of binding agent calculated to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for the dosage unit forms of the instant disclosure are dictated by and directly dependent on the unique characteristics of the binding agent and the particular therapeutic effect to be achieved, and the limitations inherent in the art of compounding such T-cell epitope compounds and compositions for the treatment of a subject.

In aspects of a pharmaceutical composition as described herein, the composition may comprise at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, or at least 20 of the instantly-disclosed peptides or polypetides (including concatemeric polypeptides) or nucleic acids encoding such peptides or polypeptides (including concatemeric polypeptides). For example, in aspects, a pharmaceutical composition can comprise at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, or at least 20 peptides or polypeptides (including up to 40 peptides or polypetides), including any value or range therebetween, comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692; concatemeric peptides as disclosed herein, including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such peptides, polypeptides, or concatemeric peptides, and/or fragments and variants thereof, as described herein.

Vaccine Compositions

The term “vaccine” as used herein includes an agent which may be used to cause, stimulate or amplify the immune system of animals (e.g., humans) against a pathogen. Vaccines of the invention are able to cause or stimulate or amplify an immune response against a SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19,.

The term “immunization” includes the process of delivering an immunogen to a subject. Immunization may, for example, enable a continuing high level of antibody and/or cellular response in which T-lymphocytes can kill or suppress the pathogen in the immunized animal, such as a human, which is directed against a pathogen or antigen to which the animal has been previously exposed.

Vaccines of the instant disclosure comprise an immunologically effective amount of a T cell epitope compound or composition of the instant disclosure as described above, and in aspects in a pharmaceutically acceptable vehicle and optionally with additional excipients and/or an adjuvant. As a result of the vaccination with a composition of the present disclosure, animals, and in aspects humans, become at least partially or completely immune to SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, or resistant to developing moderate or severe SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19. The instantly disclosed vaccines may be used to elicit a humoral and/or a cellular response, including CD4+ and CD8+ T effector cell responses. In aspects, an animal subject, such as a human, is protected to an extent to which one to all of the adverse physiological symptoms or effects of SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, are significantly reduced, ameliorated or totally prevented.

In practice, the exact amount required for an immunologically effective dose may vary from subject to subject depending on factors such as the age and general condition of the subject, the nature of the formulation and the mode of administration. An appropriate “effective amount” may be determined by one of ordinary skill in the art using only routine experimentation. For instance, methods are known in the art for determining or titrating suitable dosages of a vaccine to find minimal effective dosages based on the weight of the animal subject, including human subject, concentration of the vaccine and other typical factors. The dosage of the vaccines of the present disclosure will depend on the species, breed, age, size, vaccination history, and health status of the animal (e.g., swine/pig) to be vaccinated, as well as the route of administration, e.g., subcutaneous, intradermal, oral intramuscular or intravenous administration. The vaccines of the instant disclosure can be administered as single doses or in repeated doses. The vaccines of the instant disclosure can be administered alone, or can be administered simultaneously or sequentially administered with one or more further compositions, such as other porcine immunogenic or vaccine compositions. Where the compositions are administered at different times, the administrations may be separate from one another or overlapping in time.In aspects, the vaccine comprises a unitary dose of between 0.1-3000 including any value or range therebetween of polypeptide and/or nucleic acid of the instant disclosure.

The dosage of the vaccine, concentration of components therein and timing of administering the vaccine, which elicit a suitable immune response, can be determined by methods such as by antibody titrations of sera, e.g., by ELISA and/or seroneutralization assay analysis and/or by vaccination challenge evaluation.

In aspects, the vaccine comprises a novel, therapeutic T cell epitope compounds or compositions as disclosed herein) in purified form, optionally in combination with any suitable excipient, carrier, adjuvant, and/or additional protein antigen.

In another aspect, the vaccine comprises a nucleic acid as defined above, optionally in combination with any suitable excipient, carrier, adjuvant, and/or additional protein antigen. In aspects, the vaccine comprises a viral vector containing a nucleic acid as defined above. In aspects, the vaccine comprises one or more plasmid vectors.

Vaccine constructs including a T-cell epitope compound or composition of the present disclosure upon administration to a subject may initiate a strong T-cell mediated immune response, but may not induce a humoral immune response. Therefore, aspects of a vaccine against SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, contains a combination of the putative T-cell epitopes together with either live attenuated virus (LAV, for example live attenuated SARS-CoV-2) or inactivated virus (for example inactivated SARS-CoV-2). This vaccine composition (including both the putative T-cell epitopes and an LAV or inactivated virus) upon administration to a subject may induce both cellular and humoral immune responses, thereby conferring comprehensive immunity against SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, in the animals, including humans.

Vaccines may comprise other ingredients, known per se by one of ordinary skill in the art, such as pharmaceutically acceptable carriers, excipients, diluents, adjuvants, freeze drying stabilizers, wetting or emulsifying agents, pH buffering agents, gelling or viscosity enhancing additives, and preservatives, depending on the route of administration.

Examples of pharmaceutically acceptable carriers, excipients or diluents include, but are not limited to demineralised or distilled water; saline solution; vegetable based oils such as peanut oil, arachis oil, safflower oil, olive oil, cottonseed oil, maize oil, sesame oil, or coconut oil; silicone oils, including polysiloxanes, such as methyl polysiloxane, phenyl polysiloxane and methylphenyl polysolpoxane; volatile silicones; mineral oils such as light liquid paraffin oil, or heavy liquid paraffin oil; squalene; cellulose derivatives such as methylcellulose, ethylcellulose, carboxymethylcellulose, carboxymethylcellulose sodium salt, or hydroxypropyl methylcellulose; lower alkanols, for example ethanol or isopropanol; lower aralkanols; lower polyalkylene glycols or lower alkylene glycols, for example polyethylene glycol, polypropylene glycol, ethylene glycol, propylene glycol, 1,3-butylene glycol or glycerin; fatty acid esters such as isopropyl palmitate, isopropyl myristate or ethyl oleate; polyvinylpyrrolidone; agar; carrageenan; gum tragacanth or gum acacia; and petroleum jelly. Typically, the carrier or carriers will form from 10% to 99.9% by weight of the vaccine composition and may be buffered by conventional methods using reagents known in the art, such as sodium hydrogen phosphate, sodium dihydrogen phosphate, potassium hydrogen phosphate, potassium dihydrogen phosphate, a mixture thereof, and the like.

Examples of adjuvants include, but are not limited to, oil in water emulsions, aluminum hydroxide (alum), immunostimulating complexes, non-ionic block polymers or copolymers, cytokines (like IL-1, IL-2, IL-7, IFN-a, IFN-(3, IFN-γ, etc.), saponins, monophosphoryl lipid A (MLA), muramyl dipeptides (MDP), MCA, and the like. Other suitable adjuvants include, for example, aluminum potassium sulfate, heat-labile or heat-stable enterotoxin(s) isolated from Escherichia coli, cholera toxin or the B subunit thereof, diphtheria toxin, tetanus toxin, pertussis toxin, Freund's incomplete or complete adjuvant, etc. Toxin-based adjuvants, such as diphtheria toxin, tetanus toxin and pertussis toxin may be inactivated prior to use, for example, by treatment with formaldehyde. Further adjuvants may include, but are not limited to, poly-ICLC, 1018 ISS, aluminum salts, Amplivax, AS 15, BCG, CP-870,893, CpG7909, CyaA, dSLIM, GM-CSF, IC30, IC31, Imiquimod, ImuFact IMP321, IS Patch, ISS, ISCOMATRTX, Juvlmmune, LipoVac, MF59, monophosphoryl lipid A, Montanide IMS 1312, Montanide ISA 206, Montanide ISA 50V, Montanide ISA-51, OK-432, OM-174, OM-197-MP-EC, ONTAK, PEPTEL, vector system, PLGA microparticles, resiquimod, SRL172, Virosomes and other Virus-like particles, YF-17D, VEGF trap, R848, beta-glucan, Pam3Cys, and Aquila's QS21 stimulon. In aspects of the pharmaceutical compositions or vaccines as disclosed herein, the adjuvant comprises poly-ICLC. The TLR9 agonist CpG and the synthetic double-stranded RNA (dsRNA) TLR3 ligand poly-ICLC are two of the most promising vaccine adjuvants currently in clinical development. In preclinical studies, poly-ICLC appears to be the most potent TLR adjuvant when compared to LPS and CpG. This appears due to its induction of pro-inflammatory cytokines and lack of stimulation of IL-10, as well as maintenance of high levels of co-stimulatory molecules in DCs. Poly-ICLC is a synthetically prepared double-stranded RNA consisting of polyl and polyC strands of average length of about 5000 nucleotides, which has been stabilized to thermal denaturation and hydrolysis by serum nucleases by the addition of polylysine and carboxymethylcellulose. The compound activates TLR3 and the RNA helicase-domain of MDA5, both members of the PAMP family, leading to DC and natural killer (NK) cell activation and mixed production of type I interferons, cytokines, and chemokines.

Examples of freeze-drying stabilizer may be for example carbohydrates such as sorbitol, mannitol, starch, sucrose, dextran or glucose, proteins such as albumin or casein, and derivatives thereof.

Vaccines may additionally comprise at least one immunogen from at least one additional pathogen, e.g., a pig pathogen such as Actinobacillus pleuropneunomia; Adenovirus; Alphavirus such as Eastern equine encephalomyelitis viruses; Balantidium coli; Bordetella bronchiseptica; Brachyspira spp., preferably B. hyodyentheriae, B. pilosicoli, B. innocens, Brucella suis, preferably biovars 1, 2 and 3; Classical swine fever virus, Chlamydia and Chlamydophila spp., preferably C. pecorum and C. abortus; Clostridium spp., preferably Cl. difficile, Cl. perfringens types A, B and C, Cl. novyi, Cl. septicum, Cl. tetani; Digestive and respiratory Coronavirus; Cryptosporidium parvum; Eimeria spp.; Eperythrozoonis suis currently named Mycoplasma haemosuis; Erysipelothrix rhusiopathiae; Escherichia coli; Haemophilus parasuis, preferably subtypes 1, 7 and 14; Hemagglutinating encephalomyelitis virus; lsospora suis; Japanese Encephalitis virus; Lawsonia intracellulars; Leptospira spp., preferably Leptospira australis, Leptospira canicola, Leptospira grippotyphosa, Leptospira icterohaemorrhagicae, Leptospira interrogans, Leptospira Pomona and Leptospira tarassovi; Mannheimia haemolytica; Mycobacterium spp., preferably M. avium, M. intracellular and M. bovis: Mycoplasma hyponeumoniae; Parvovirus; Pasteurella multocida; Porcine circovirus; Porcine cytomegolovirus; Porcine parovirus, Porcine reproductive and respiratory syndrome virus: Pseudorabies virus; Rotavirus; Sagiyama virus; Salmonella spp., preferably S. thyhimurium and S. choleraesuis; Staphylococcus spp., preferably S. hyicus; Streptococcus spp., preferably Strep suis; Swine cytomegalovirus; Swine herpes virus; Swine influenza virus; Swinepox virus; Toxoplasma gondii; Vesicular stomatitis virus and virus of exanthema of swine;

or other isolates and subtypes of porcine circovirus.

The vaccine compositions of the instant disclosure may be liquid formulations such as an aqueous solution, water-in-oil or oil-in-water emulsion, syrup, an elixir, a tincture, or a preparation for parenteral, subcutaneous, intradermal, intramuscular or intravenous administration (e.g., injectable administration), such as sterile suspensions or emulsions. Such formulations are known in the art and are typically prepared by dissolution of the antigen and other typical additives in the appropriate carrier or solvent systems. Liquid formulations also may include suspensions and emulsions that contain suspending or emulsifying agents.

The route of administration can be percutaneous, via mucosal administration, or via a parenteral route (intradermal, intramuscular, subcutaneous, intravenous, or intraperitoneal).

Vaccine compositions according to the present disclosure may be administered alone, or can be co-administered or sequentially administered with other treatments or therapies. A vaccine of the present disclosure can conveniently be administered intranasally, transdermally (i.e., applied on or at the skin surface for systemic absorption), parenterally, ocularly, etc. The parenteral route of administration includes, but is not limited to, intramuscular, intravenous, intradermal, and intraperitoneal routes and the like. In aspects, vaccines of the present disclosure are administered intradermally, e.g., by using a micro needle patch as is known in the art or by using a commercial needle-free high-pressure device such as Pulse NeedleFree technology (Pulse 50TM Micro Dose Injection System, Pulse NeedleFree Systems; Lenexa, Kans., USA).

The present disclosure also relates to methods of immunizing or inducing an immune response in animals (e.g., humans) comprising administering to said animal a peptide, polypeptide, concatemeric peptide, chimeric or fusion polypeptide, nucleic acid, cell, vector, pharmaceutical, or vaccine as described above.

The present disclosure also relates to methods of treating and/or preventing SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, diseases in animals (e.g., humans) comprising administering to said animal a peptide, polypeptide, concatemeric peptide, chimeric or fusion polypeptide, nucleic acid, cell, vector, pharmaceutical, or vaccine as disclosed herein.

A vaccine of the present disclosure can conveniently be administered intranasally, transdermally (i.e., applied on or at the skin surface for systemic absorption), parenterally, ocularly, etc. The parenteral route of administration includes, but is not limited to, intramuscular, intravenous, and intraperitoneal routes and the like.

The dosage of the vaccines of the present disclosure will depend on the species, breed, age, size, vaccination history, and health status of the animal (e.g., human) to be vaccinated, as well as the route of administration, e.g., subcutaneous, intradermal, oral intramuscular or intravenous administration. The vaccines of the instant disclosure can be administered as single doses or in repeated doses. The vaccines of the instant disclosure can be administered alone, or can be administered simultaneously or sequentially administered with one or more further compositions, such as other porcine immunogenic or vaccine compositions. Where the compositions are administered at different times, the administrations may be separate from one another or overlapping in time.

In aspects, the present disclosure includes multiple rounds of administration of the instantly-disclosed vaccine compositions. For example, the vaccine can be boosted at one, two, three, and/or four week intervals.. Such are known in the art to improve or boost the immune system to improve protection against the pathogen. Additionally, the present disclosure may also include assessing a subject's immune system to determine if further administrations of the instantly-disclosed vaccine compositions is warranted. In some aspects, multiple administrations may include the development of a prime boosting strategy of vaccination using the instantly-discloed vaccines (e.g.., polypeptide based or nucleic acid based as disclosed herein). Such may provide an opportunity to produce sequential immunogenic responses against SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19. In some aspects, the vaccine can be boosted at 1, 2, 3, 4, 5, or 6 week intervals. In some aspects, the vaccine is boosted at 2 week intervals. In some apsects, the vaccine is boosted at 3 week intervals. In some aspects, peptide based vaccine and nucleice acid (e.g., RNA or DNA) vaccinations can be achieved in an alternative manner to provide a regimen of immunization with the same immunogen presented in different fashions to the subject's immune system.

In one aspect, the vaccine compositions of the present disclosure are administered to a subject susceptible to or otherwise at risk for SARS-CoV-2 infection and/or related diseases caused by SARS-CoV-2, including COVID-19, to enhance the subject own immune response capabilities. The subject to which the vaccine is administered is, in one aspect, a human. The animal may be susceptible to infection by SARS-CoV-2 infection (or a closely related virus) and/or related diseases caused by SARS-CoV-2, including COVID-19.

In aspects of a vaccine as described herein, the vaccine may comprise at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, or at least 20 of the instantly-disclosed peptides or polypetides (including concatemeric polypeptides) or nucleic acids encoding such peptides or polypeptides (including concatemeric polypeptides). For example, in aspects, a vaccine can comprise at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, or at least 20 peptides or polypeptides (including up to 40 peptides or polypetides), including any value or range therebetween, comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692; concatemeric peptides as disclosed herein, including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and nucleic acids (e.g., RNA mRNA, DNA, cDNA) encoding such peptides, polypeptides, or concatemeric peptides, and/or fragments and variants thereof, as described herein.

The present disclosure also provides a container comprising an immunologically effective amount of a polypeptide, nucleic acid or vaccine as described above. The present disclosure also provides vaccination kits comprising an optionally sterile container comprising an immunologically effective amount of the vaccine, means for administering the vaccine to animals, and optionally an instruction manual including information for the administration of the immunologically effective amount of the composition for treating and/or preventing SARS-CoV-2 infection (or a closely related virus) and/or related diseases caused by SARS-CoV-2, including COVID-19.

Methods of Treatment

Stimulating T-cells with T-cell epitope compounds and compositions of the present disclosure can stimulate, induce, and/or expand a corresponding naturally occurring immune response, e.g., stimulating, inducing, and/or expanding a corresponding naturally occurring immune response to a SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19, including CD4+ and/or CD8+ T cell responses, and in aspects results in increased secretion of one or more cytokines and chemokines. In aspects, T-cells activated by the T-cell epitope compounds and compositions of the present disclosure stimulate cell-mediated immunity against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19, in a subject.

In aspects, T cells activated by the T-cell epitope compounds and compositions of the present disclosure stimulate cell-mediated immunity against SARS-CoV-2 infection (or a closely related virus) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject.

In aspects, the present disclosure is directed to a method of stimulating, inducing, and/or expanding an immune response, e.g., against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19, in a subject in need thereof by administering to the subject a therapeutically effect amount of a T-cell epitope composition (compound or composition of the present disclosure.

In aspects, the present disclosure is directed to a method of preventing, treating, or ameliorating a disease by SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) such as COVID-19, in a subject in need thereof by administering to the subject a therapeutically effect amount of a T-cell epitope compound or composition of the present disclosure.

Assays, Methods, and Kits of the Instant Disclosure

In aspects, the present disclosure is directed to methods of measuring an immune response, including a CMI response, in a subject by incubating a sample from the subject which comprises T-cells or other cells of the immune system with one or more peptides or polypeptides of the instant disclosure. In aspects, production of IFN-γor other cytokine or immune effector molecule(s) is then detected. The presence or level of immune effector is then indicative of the level of cell mediated responsiveness of the subject. In aspects, preferably, the sample is whole blood which is collected in a suitable container comprising the antigen. Optionally, a simple sugar such as dextrose is added to the incubation mixture. Accordingly, one aspect of the present disclosure relates to a method for measuring a CMI response in a subject, preferably a human subject and more preferably a human subject potentially infected with SARS-CoV-2 or a related coronavirus (such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)), said method comprising collecting a sample from said subject wherein said sample comprises cells of the immune system which are capable of producing immune effector molecules following stimulation by an antigen, incubating said sample with one or more peptides of the instant disclosure and then measuring the presence of or elevation in the level of an immune effector molecule wherein the presence or level of said immune effector molecule is indicative of the capacity of said subject to mount a cell-mediated immune response. In aspects, the presence of or elevation in the level of an immune effector molecule wherein the presence or level of said immune effector molecule is indicative of the capacity of said subject to mount a cell-mediated immune response against SARS-CoV-2 or a related coronavirus infection.

Reference to a “subject” includes a human or non-human species including primates, livestock animals (e.g. sheep, cows, pigs, horses, donkey, goats), laboratory test animals (e.g. mice, rats, rabbits, guinea pigs, hamsters), companion animals (e.g. dogs, cats), avian species (e.g. poultry birds, aviary birds), reptiles and amphibians. The present disclosure has applicability, therefore, in human medicine as well as having livestock and veterinary and wild life applications. Most preferably, however, the subject is a human and the CMI response assay has applications in screening for responsiveness to COVID-19 or related coronavirus infections.

Reference to “immune cells” includes cells such as lymphocytes including natural killer (NK) cells, T-cells, (CD4+ and/or CD8+ cells), B-cells, macrophages and monocytes, dendritic cells or any other cell which is capable of producing an effector molecule in response to direct or indirect antigen stimulation. Conveniently, the immune cells are lymphocytes and more particularly T-lymphocytes.

The immune effector molecules may be any of a range of molecules which are produced in response to cell activation or stimulation by an antigen. Although an interferon (IFN) such as IFN-γ is a particularly useful immune effector molecule, others include a range of cytokines such as interleukins (IL), e.g. IL-2, IL-4, IL-10 or IL-12, tumor necrosis factor alpha (TNF-α), a colony stimulating factor (CSF) such as granulocyte (G)-CSF or granulocyte macrophage (GM)-CSF amongst many others such as complement or components in the complement pathway.

Accordingly, in aspects, the present disclosure provides a method for measuring a CMI response in a subject, said method comprising collecting a sample from said subject wherein said sample comprises cells of the immune system which are capable of producing IFN-γ molecules following stimulation by one or more peptides of the instant disclosure, incubating said sample with one or more peptides of the instant disclosure and then measuring the presence of or elevation in the level of an IFN-γ molecule wherein the presence or level of said IFN-γ molecule is indicative of the capacity of said subject to mount a cell-mediated immune response.

The sample collected from the subject is generally deposited into a blood collection tube. A blood collection tube includes a blood draw tube or other similar vessel. Conveniently, when the sample is whole blood, the blood collection tube is heparinized. Alternatively, heparin is added to the tube after the blood is collected. Notwithstanding that whole blood is the preferred and most convenient sample, the present invention extends to other samples containing immune cells such as lymph fluid, cerebral fluid, tissue fluid and respiratory fluid including nasal and pulmonary fluid. The use of blood collection tubes is compatible with standard automated laboratory systems and these are amenable to analysis in large-scale and random access sampling. Blood collection tubes also minimize handling costs and reduce laboratory exposure to whole blood and plasma and, hence, reduce the risk of laboratory personnel from contracting a pathogenic agent.

Combining the incubation step with the collection tube is particularly efficacious and enhances the sensitivity of the assay as does the optional feature of incubating the cells in the presence of a sample sugar such as dextrose.

The incubation step may be from 5 to 72 hours, more preferably 5 to 40 hours and even more preferably 8 to 24 hours or any value or range therebetween. In aspects, the incubation step is conducted in the presence of a simple sugar such as dextrose.

Detection of the immune effector molecules may be made at the protein or nucleic acid levels. Consequently, reference to “presence or level of said immune effector molecule” includes direct and indirect data. For example, high levels of IFN-γ mRNA is indirect data showing increased levels of IFN-γ.

Ligands to the immune effectors are particularly useful in detecting and/or quantitating these molecules. Antibodies to the immune effectors are particularly useful. Techniques for the assays contemplated herein are known in the art and include, for example, sandwich assays, ELISA and ELISpot. Reference to “antibodies” includes parts of antibodies, mammalianized (e.g. humanized) antibodies, recombinant or synthetic antibodies and hybrid and single chain antibodies. It should be understood that a wide range of immunoassay techniques as are known in the art are compatible with the instant disclosure, such as those disclosed in U.S. Pat. No. 7,608,392 (herein incorporated by reference in its entirety).

In aspects, the present disclosure is directed to methods of assaying for SARS-CoV-2 or a related coronavirus (such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) peptide-specific T-cells, the method comprising providing a fluid containing T-cells, adding one or more peptides or polypeptides of the instant disclosure to the fluid, incubating the fluid to cause cytokine release, and detecting the released cytokine. Preferably the method comprises providing the fluid containing T-cells in contact with a surface carrying an immobilized first antibody to the cytokine, adding the peptide to the fluid, incubating the resulting fluid mixture under conditions to cause any peptide-specific T-cells that have been pre-sensitized in vivo to the peptide to secrete the cytokine, and detecting any secreted cytokine bound to the immobilized first antibody.

In aspects, the cells are preferably peripheral blood mononuclear cells (PMBC). They may suitably be taken from a patient known to be suffering, or to have suffered, from COVID-19 infection or a related coronavirus infection. In aspects, the cells used are fresh. In aspects, the assay is used to identify or quantitate peptide-specific T-cells e.g. CD8+ or CD4+ cells that have been activated or pre-sensitized in vivo to a particular peptide. In aspects, these are unrestimulated T-cells, i.e. cells capable of immediate effector function without the need to effect division/differentiation by in vitro culture. When a peptide in question is presented to such cells, the cells secrete various cytokines, of which any one may be selected for the purposes of this assay. In aspects, the cytokine selected is interferon-γ (IFN γ). However, although an interferon (IFN) such as IFN-γ is a particularly useful immune effector molecule, others include a range of cytokines such as interleukins (IL), e.g. IL-2, IL-4, IL-10 or IL-12, tumor necrosis factor alpha (TNF-α), a colony stimulating factor (CSF) such as granulocyte (G)-CSF or granulocyte macrophage (GM)-CSF amongst many others.

The secreted cytokine can be detected by any of a variety of methods known in the literature. Preferably the assay method involves providing a surface carrying an immobilized first antibody to the IFN-γ or other cytokine. A fluid containing the PBMC or other fresh cells is placed in contact with that immobilized antibody. About 30% of the PBMC are CD8+ cells.

In aspects, the method comprises adding a peptide or polypeptide of the instant disclosure to the fluid. If activated or pre-sensitized peptide-specific T-cells (CD4+ and/or CD8+ T cells) are present in the test fluid, they respond by secreting appropriate effector cytokines, such as IFN-γ or other cytokine, which then becomes bound to the immobilized antibody. In aspects, the one or more peptides or polypeptides of the instant disclosure may be added in uncombined form to the fresh cells. While it is possible to add cultured cells that have been pulsed with such peptides or polypeptides, this is not necessary when using defined peptide/polypeptide epitopes. The peptides/polypeptides should be added in an amount sufficient to generate an observable signal; in aspects a preferred concentration range in the fluid is 0.01 up to 100 μM particularly 0.5-5.0 μM.

Incubation should be continued for a time sufficient to permit CD8+ and/or CD4+ cells that have been pre-sensitized in vivo to the particular peptide/polypeptide chosen to secrete the IFN-γ or other cytokine. In aspects, the incubation should not continue for so long that quiescent CD8+ and/or CD4+ cells have time to differentiate and become activated by the peptide and start to secrete cytokines. This suggests an incubation time of 4-24 hours, more particularly 6-16 hours. It is an advantage of the invention that the incubation part of the test can be performed in a single working day or overnight, and without the use of sterile conditions required for cell culture in vitro.

In aspects, during the incubation, any IFN-γor other cytokine secreted by CD8+ and/or CD4+ cells becomes bound to the first antibody immobilized on the surface. After incubation, the surface may be washed to remove unbound material. For detection, in aspects a labelled second antibody to the cytokine is used. When this is applied to the surface it becomes bound to any cytokine present. In aspects, the second antibody should recognize a different epitope from the first antibody. In aspects, one or both of the first and second antibodies may be monoclonal. The label may be any that is conventionally used in the field, including radioisotopes, enzymes to generate color or chemiluminescence, fluorescent groups or groups for detection by mass spectrometry or refractive index (e.g. by surface plasmon resonance). It is convenient but not necessary to use a labelled antibody, any reagent that binds specifically to the cytokine could be labelled and used. Detection and perhaps quantitation of the label is effected by means well known in the field and appropriate to the nature of the label used, and may be those as disclosed in U.S. Pat. No. 7,608,392 (herein incorporated by reference in its entirety).

In aspects, the assay may conveniently be carried out in a multi-well plate. Each well of the plate has a surface carrying a bound first antibody. To each well is added a fluid containing an appropriate number, e.g. 103-106 of cells. Different peptides and/or controls are added to individual wells of the plate. Cells that secrete a cytokine during incubation show up as spots (spot forming cells or SFCs) and the number or density of these in each well can readily be determined.

In aspects, the present disclosure provides a method of detecting an anti-SARS-CoV-2 (or related coronavirus, such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) T cell response (which in aspects can included CD4+ and/or CD8+ T cell response) comprising contacting a population of T cells of an individual with a peptide of the instant disclosure, wherein one or more of said peptides may be substituted by an analogue which binds a T cell receptor that recognizes the peptide, and determining whether T cells of the T cell population recognize the peptide(s). Further, in aspects, the present disclosure provides a method of diagnosing a SARS-CoV-2 or related coronavirus infection (such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19, in a host, or exposure of a host, to SARS-CoV-2 or related coronavirus, the method comprising (i) contacting a population of T cells from the host with one or more peptides or analogues as disclosed here, and analogues thereof which can bind a T cell receptor which recognizes any of the said peptides; and (ii) determining whether the T cells of said T cell population recognize the peptide(s) and/or analogue(s).

In aspects, the host is generally a human but may be an animal, typically one which can be naturally or artificially infected by a mycobacterium. The host may be a mammal, such as a primate, cow, sheep, pig, badger or rodent, e.g. a mouse or rat. The host typically has an active or latent SARS-CoV-2 infection or related coronavirus infection, or has had such an infection recently. The host may be a healthy contact who has been exposed to SARS-CoV-2 infection or related coronavirus infection. Thus the method may be used to trace the healthy contacts of individuals with such SARS-CoV-2 infection or related coronavirus infection. The method may also be used to carry out population surveys to measure the number of individuals in a population who have a SARS-CoV-2 infection or related coronavirus infection or are healthy contacts.

In aspects, the T cells which recognize the peptide in the method are generally T cells which have been pre-sensitized in vivo to antigen from SARS-CoV-2 or related coronavirus. These antigen-experienced T cells are generally present in the peripheral blood of a host which has been exposed to the SARS-CoV-2 infection or related coronavirus infection. The T cells may be CD4+ and/or CD8+ T cells.

It is understood that the term ‘peptide’ or ‘polypeptide’ also includes the analogue of that peptide (which may not be a peptide as defined by the ordinary use of the term) unless the context requires otherwise.

In aspects, the T cells can be contacted with the peptides in vitro or in vivo, and determining whether the T cells recognize the peptide can be done in vitro or in vivo. In aspects, determination of whether the T cells recognize the peptide or polypeptide of the instant disclosure is generally done by detecting a change in the state of the T cells in the presence of the peptide or polypeptide or determining whether the T cells bind the peptide or polypeptide. The change in state is generally caused by antigen specific functional activity of the T cell after the T cell receptor binds the peptide or polypeptide. Generally when binding the T cell receptor the peptide is bound to an MHC class II or MHC class I molecule, which is typically present on the surface of an antigen presenting cell (APC).

In aspects, the change in state of the T cell may be the start of or increase in secretion of a substance from the T cell, such as a cytokine, especially IFN-γ, IL-2 or TNF-α. In aspects, the cytokine selected is interferon-γ (IFN γ). However, although an interferon (IFN) such as IFN-γis a particularly useful immune effector molecule, others include a range of cytokines such as interleukins (IL), e.g. IL-2, IL-4, IL-10 or IL-12, tumor necrosis factor alpha (TNF-α), a colony stimulating factor (CSF) such as granulocyte (G)-CSF or granulocyte macrophage (GM)-CSF amongst many others. The substance can typically be detected by allowing it to bind to a specific binding agent and then measuring the presence of the specific binding agent/substance complex. The specific binding agent is typically an antibody, such as polyclonal or monoclonal antibodies. Antibodies to cytokines are commercially available, or can be made using standard techniques.

In aspects, the specific binding agent is immobilized on a solid support. After the substance is allowed to bind the solid support can optionally be washed to remove material which is not specifically bound to the agent. In aspects, the agent/substance complex may be detected by using a second binding agent which will bind the complex. Typically the second agent binds the substance at a site which is different from the site which binds the first agent. In aspects, the second agent is preferably an antibody and is labelled directly or indirectly by a detectable label.

Thus the second agent may be detected by a third agent which is typically labelled directly or indirectly by a detectable label. For example the second agent may comprise a biotin moiety, allowing detection by a third agent which comprises a streptavidin moiety and typically alkaline phosphatase as a detectable label.

In aspects the detection system which is used is the ex-vivo ELISPOT assay as is known in the art. In such an assay, IFN-γ secreted from the T cell is bound by a first IFN-γ specific antibody which is immobilized on a solid support. The bound IFN-γ is then detected using a second IFN-γ specific antibody which is labelled with a detectable label.

In aspects, the change in state of the T cell which can be measured may be the increase in the uptake of substances by the T cell, such as the uptake of thymidine. The change in state may be an increase in the size of the T cells, or proliferation of the T cells, or a change in cell surface markers on the T cell.

In aspects, the T cells which are contacted in the assay/methods are taken from the host in a blood sample, although other types of samples which contain T cells can be used. The sample may be added directly to the assay or may be processed first. Typically the processing may comprise diluting of the sample, for example with water or buffer. Typically the sample is diluted from 1.5 to 100 fold, for example 2 to 50 or 5 to 10 fold. In aspects, the processing may comprise separation of components of the sample. Typically mononuclear cells (MCs) are separated from the samples. The MCs will comprise the T cells and APCs. Thus in the method the APCs present in the separated MCs can present the peptide to the T cells. In aspects, only T cells, such as only CD4+ and/or only CD8+ T cells, can be purified from the sample. PBMCs, MCs and T cells can be separated from the sample using techniques known in the art.

In aspects, the T cells used in the assay/methods are in the form of unprocessed or diluted samples, or are freshly isolated T cells (such as in the form of freshly isolated MCs or PBMCs) which are used directly ex vivo, i.e. they are not cultured before being used in the method. However the T cells can be cultured before use, for example in the presence of one or more of the peptides, and generally also exogenous growth promoting cytokines. During culturing the peptides are typically present on the surface of APCs, such as the APC used in the method. Pre-culturing of the T cells may lead to an increase in the sensitivity of the method. Thus the T cells can be converted into cell lines, such as short term cell lines.

In aspects, the APC which is typically present in the assays/methods may from the same host as the T cell or from a different host. In aspects, the APC may be a naturally occurring APC or an artificial APC. In aspects, the APC is a cell which is capable of presenting the peptide to a T cell. It is typically a B cell, dendritic cell or macrophage. It is typically separated from the same sample as the T cell and is typically co-purified with the T cell. Thus the APC may be present in MCs or PBMCs. In aspects, the APC is typically a freshly isolated ex vivo cell or a cultured cell.

It may be in the form of a cell line, such as a short term or immortalized cell line. The APC may express empty MHC class II molecules or MHC class I molecules on its surface.

In aspects of the assays/methods, the T cells derived from the sample can be placed into an assay with all the peptides (i.e. a pool of the peptides) which it is intended to test (the relevant panel) or the T cells can be divided and placed into separate assays each of which contain one or more of the peptides. In aspects, in the in vitro or in vivo forms of the methods/assays at least one or more of the instantly disclosed peptides/polypeptides as described herein or analogues thereof are used.

In aspects, one or more peptides or polypeptides as disclosed herein is added directly to an assay comprising T cells and APCs. As discussed above the T cells and APCs in such an assay could be in the form of MCs. When peptides which can be recognized by the T cell without the need for presentation by APCs are used then APCs are not required. Analogues which mimic the original peptide bound to a MHC molecule are an example of such a peptide. In aspects, the peptide or polypeptide is provided to the APC in the absence of the T cell. The APC is then provided to the T cell, typically after being allowed to present the peptide on its surface. The peptide may have been taken up inside the APC and presented, or simply be taken up onto the surface without entering inside the APC.

In aspects, the duration for which the peptide or polypeptide is contacted with the T cells will vary depending on the method used for determining recognition of the peptide. Typically 10⁵to 10⁷, preferably 5×10⁵to 10⁶PBMCs are added to each assay. In the case where a peptide is added directly to the assay its concentration is from 10⁻to 10³μg/ml, preferably 0.5 to 50 g/ml or 1 to 10 μg/ml. In aspects, the length of time for which the T cells are incubated with a peptide or polypeptide is from 4 to 24 hours, preferably 6 to 16 hours.

In aspects, the determination of the recognition of a peptide or polypeptide of the instant disclosure by the T cells may be done by measuring the binding of the peptide to the T cells. Typically T cells which bind the peptide can be sorted based on this binding, for example using a FACS machine. The presence of T cells which recognize the peptide will be deemed to occur if the frequency of cells sorted using the peptide is above a ‘control’ value. The frequency of antigen-experienced T cells is generally 1 in 10⁶to 1 in 10³, and therefore whether or not the sorted cells are antigen-experienced T cells can be determined.

In aspects, the determination of the recognition of the peptide by the T cells may be measured in vivo. In aspects, a peptide or polypeptide of the instant disclosure is administered to the host and then a response which indicates recognition of the peptide or polypeptide may be measured. In aspects, the peptide is administered intradermally, typically in a similar manner to the Mantoux test. In aspects, the peptide may be administered epidermally. In aspects, peptide is administered by needle, such as by injection, but can be administered by other methods such as ballistics, for example the ballistics techniques which have been used to deliver nucleic acids. EP-A-0693119 describes techniques which can typically be used to administer the peptide. In aspects, from 0.001 to 1000 μg, for example from 0.01 to 100 μg or 0.1 to 10 μg of peptide is administered.

Alternatively an agent can be administered which is capable of providing the peptides in vivo. Thus a polynucleotide capable of expressing the peptide can be administered, typically in any of the ways described above for the administration of the peptide. In aspects, the polynucleotide has any of the characteristics of the polynucleotide provided by the invention which is discussed below. In aspects, the peptide is expressed from the polynucleotide in vivo and recognition of the peptide in vivo is measured. In aspects, from 0.001 to 1000 μg, for example from 0.01 to 100 μg or 0.1 to 10 μg of polynucleotide is administered. Recognition of the peptide in vivo is typically indicated by the occurrence of a response.

The analogue which can be used in the assays/methods can bind to a T cell receptor which recognizes the equivalent peptide or polypeptide of the instant disclosure. Therefore generally when the analogue is added to T cells in the presence of the equivalent said peptide or polypeptide, typically also in the presence of an APC, the analogue inhibits the recognition of the equivalent peptide or polypeptide. In aspects, the binding of the analogue to the said T cell receptors can be tested by standard techniques. For example T cell receptors can be isolated from T cells which have been shown to recognize the peptide or polypeptide (e.g. using the assays/methods of the instant disclosure). In aspects, demonstration of the binding of the analogue to the T cell receptors can then shown by determining whether the T cell receptors inhibit the binding of the analogue to a substance that binds the analogue, e.g. an antibody to the analogue. In aspects, the analogue is bound in an MHC molecule in such an inhibition of binding assay.

In aspects, the analogue inhibits the binding of the peptide to a T cell receptor. In this case the amount of peptide which can bind the T cell receptor in the presence of the analogue is decreased. This is because the analogue is able to bind the T cell receptor and therefore competes with the peptide for binding to the T cell receptor.

T cells for use in the above binding experiments can be isolated from patients with COVID-19 or related coronavirus infection, for example with the aid of the method of the instant disclosure.

Other binding characteristics of the analogue are also the same as the corresponding peptide or polypeptide of the instant disclosure, and thus typically the analogue binds to the same MHC class II molecule or MHC class I molecule which the peptide or polypeptide of the instant disclosure binds.

The analogue is typically a peptide or polypeptide. It may have homology with the equivalent original peptide or polypeptide of the instant disclosure. A peptide or polypeptide which is homologous to another peptide or polypeptide is typically at least 70% homologous to the peptide, preferably at least 80 or 90% and more preferably at least 95%, 97% or 99% homologous thereto, for example over a region of at least 15, preferably at least 30, for instance at least 40, 60 or 100 or more contiguous amino acids. Methods of measuring protein homology are well known in the art and it will be understood by those of skill in the art that in the present context, homology is calculated on the basis of amino acid identity (sometimes referred to as “hard homology”). For example the UWGCG Package provides the BESTFIT program which can be used to calculate homology (for example used on its default settings) (Devereux et al (1984) Nucleic Acids Research 12, p 387-395).

The homologous peptides or polypeptides may differ by substitution, insertion or deletion, for example from 1, 2, 3, 4, 5, 6, 7, 8 or more substitutions, deletions or insertions, which can be at the N or C terminal or at any other position in the sequence. The substitutions are preferably conservative. Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu, Met, and Ile; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gln, exchange of the basic residues His, Lys and Arg and replacements among the aromatic residues Trp, Phe and Tyr.

The analogue is typically from 8 to 80 amino acids in length, such as 10 to 60 or 12 to 50, preferably 15 to 30 or 20 to 25. In aspects, the amino acids in the analogue at the equivalent positions to amino acids in the original peptide or polypeptide which contribute to binding the MHC molecule or are responsible for the recognition by the T cell receptor, are the same or are conserved.

In aspects, the analogue peptide comprises one or more modifications, which may be natural post-translation modifications or artificial modifications. The modification may provide a chemical moiety (typically by substitution of a hydrogen, e.g. of a C—H bond), such as an amino, acetyl, hydroxy or halogen (e.g. fluorine) group or carbohydrate group. In aspects, the modification is present on the N or C terminus. In aspects, the analogue may comprise one or more non-natural amino acids, for example amino acids with a side chain different from natural amino acids. Generally, the non-natural amino acid will have an N terminus and/or a C terminus. The non-natural amino acid may be an L-amino acid. In aspects, the analogue has a shape, size, flexibility or electronic configuration which is substantially similar to the original peptide or polypeptide. It is typically a derivative of the original peptide or polypeptide.

In aspects, the analogue is or mimics the original peptide bound to a MHC class II molecule or a MHC class I molecule. In aspects, the analogue may be or may mimic the original peptide bound to 2, 3, 4 or more MHC class II molecules or MHC class I molecules associated or bound to each other. In aspects, these MHC molecules may be bound together using a biotin/streptavidin based system, in which typically 2, 3 or 4 biotin labelled MHC molecules bind to a streptavidin moiety. This analogue typically inhibits the binding of the peptides or polypeptides. In aspects, class II or class I complex to a T cell receptor or antibody which is specific for the complex. In aspects, the analogue is an antibody or a fragment of an antibody, such as a Fab or (Fab)2 fragment.

In aspects, the analogue may be immobilized on a solid support, particularly an analogue which mimics peptide bound to a MHC molecule.

In aspects, the analogue is designed by computational means and then synthesized using methods known in the art. Alternatively the analogue can be selected from a library of compound. The library may be a combinatorial library or a display library, such as a phage display library. The library of compounds may be expressed in the display library in the form of being bound to a MHC class II molecule or MHC class I molecule, such as the MHC molecule which the original peptide binds. Analogues are generally selected from the library based on their ability to mimic the binding characteristics of the original peptides. Thus they may be selected based on ability to bind a T cell receptor or antibody which recognizes the original peptide.

The present disclosure also provides a kit for carrying out the above-methods and assays comprising one or more of the peptides or analogues as disclosed herein, and optionally a means to detect the recognition of the peptide by the cells of the immune system, such as T cells. In aspects, the means to detect recognition allows or aids detection based on the techniques discussed above, however other detection means in the art may used, such as those disclosed in U.S. Pat. No. 7,608,392 (herein incorporated by reference in its entirety). Thus the means may allow detection of a substance secreted by the T cells after recognition. The kit may thus additionally include a specific binding agent for the substance, such as an antibody. In aspects, the agent is specific for IFN-γ, however agents listed for other cytokines as described above may be used. In aspects, the agent is immobilized on a solid support, which means that after binding the agent the substance will remain in the vicinity of the T cell which secreted it. Thus ‘spots’ of substance/agent complex are formed on the support, each spot representing a T cell which is secreting the substance. Quantifying the spots, and typically comparing against a control, allows determination of recognition of the peptide.

In aspects, the kit may also comprise a means to detect the substance/agent complex. A detectable change may occur in the agent itself after binding the substance, such as a color change.

Alternatively a second agent directly or indirectly labelled for detection may be allowed to bind the substance/agent complex to allow the determination of the spots. As discussed above the second agent may be specific for the substance, but binds a different site on the substance than the first agent. In aspects, the means to detect recognition allows or aids detection based on the techniques discussed above, however other detection means in the art may used, such as those disclosed in U.S. Pat. No. 7,608,392 (herein incorporated by reference in its entirety). In aspects, the immobilized support may be a plate with wells, such as a microtitre plate. Each assay can therefore be carried out in a separate well in the plate.

In aspects, the kit may additionally comprise medium for the cells of the immune system, such as T cells, detection agents, and/or washing buffers to be used in the detection steps. In aspects, the kit may additionally comprise reagents suitable for the separation from the sample, such as the separation of PBMCs or T cells from the sample. In aspects, the kit may be designed to allow detection of the T cells directly in the sample without requiring any separation of the components of the sample.

In aspects, the kit may comprise an instrument which allows administration of the peptide, such as intradermal or epidermal administration. Typically such an instrument comprises one or more needles. The instrument may allow ballistic delivery of the peptide. The peptides or polypeptides in the kit may be in the form of a pharmaceutical composition.

In aspects, the kit may also comprise controls, such as positive or negative controls. In aspects, the positive control may allow the detection system to be tested. Thus the positive control typically mimics recognition of the peptides or polypeptides in any of the above assays or methods. In aspects of the kit designed to determine recognition in vitro the positive control is a cytokine. In aspects of the kit designed to detect in vivo recognition of the peptide the positive control may be antigen to which most individuals should response.

In aspects, the kit may also comprise a means to take a sample containing immune cells, such as T cells, from the host/subject, such as a blood sample. In aspects, the kit may comprise a means to separate mononuclear cells or T cells from a sample from the host. In aspect, the kit is conveniently in compartmental form with one or more compartments adapted to receive a sample from a subject such as whole blood. That compartment or another compartment may also be adapted to contain heparin where the sample is whole blood with or without a simple sugar such as dextrose. The simple sugar may also be maintained in a separate container.

In aspects, the kit is in a form which is packaged for sale with a set of instructions. The instructions would generally be in the form to conduct the assays and/or methods as disclosed herein.

Although any assay, methods, and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, the preferred methods and materials are described. Other features, objects, and advantages of the present disclosure will be apparent from the description and the claims. In the specification and the appended claims, the singular forms include plural referents unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. All references cited herein are incorporated herein by reference in their entirety and for all purposes to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference in its entirety for all purposes.

Aspects

A 1st aspect is directed to a polypeptide consisting of an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692.

A 2nd aspect is directed to a polypeptide consisting essentially of an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692.

A 3rd aspect is directed to a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692.

A 4th aspect is directed to a polypeptide according to any one of apects 1-3, wherein said variant or fragment of an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 retains MHC binding propensity and TCR specificity, and/or retains anti-SARS-CoV-2 activity.

A 5^thaspect is directed to a polypeptide consisting of an amino acid sequence having at least 75%, 80%, 85%, 90%, or 95% homology to any one of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and fragments thereof, wherein said polypeptide retains MHC binding propensity and the same TCR specificity, and/or retains anti-SARS-CoV-2 activity.

A 6^thaspect is directed to a polypeptide consisting essentially of an amino acid sequence having at least 75%, 80%, 85%, 90%, or 95% homology to any one of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and fragments thereof, wherein said polypeptide retains MHC binding propensity and the same TCR specificity, and/or retains anti-SARS-CoV-2 activity.

A 7^thaspect is directed to a polypeptide comprising an amino acid sequence having at least 75%, 80%, 85%, 90%, or 95% homology to any one of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and fragments thereof, wherein said polypeptide retains MEW binding propensity and the same TCR specificity, and/or retains anti-SARS-CoV-2 activity.

An 8^thaspect is directed to a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734, and fragments or variants thereof.

A 9^thaspect is directed to a polypeptide according to aspect 8, wherein said fragment or variant of a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 retains anti-SARS-CoV-2 activity.

A 10^thaspect is directed to a nucleic acid encoding a polypeptide consisting of an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692.

A 11^thaspect is directed to a nucleic acid encoding a polypeptide consisting essentially of an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692.

A 12^thaspect is directed to a nucleic acid encoding a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692, and/or fragments and variants thereof, and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692.

A 13^thaspect is directed to a nucleic acid encoding a polypeptide comprising an amino acid selected from the group consisting of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and/or fragments and variants thereof.

A 14^thaspect is directed to a nucleic acid consisting of a sequence selected from the group consisting of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 and fragments or variants thereof.

A 15^thaspect is directed to a nucleic acid consisting essentially of a sequence selected from the group consisting of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734, and fragments or variants thereof.

A 16^thaspect is directed to a nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734, and fragments or variants thereof.

A 17^thaspect is directed to a nucleic acid of any one of aspects 10-12, wherein said fragment or variant of the nucleic acid encoding a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 encodes a polypeptide that retains anti-SARS-CoV-2 activity.

A 18^thaspect is directed to a nucleic acid of aspect 13, wherein said fragment or variant of the nucleic acid encoding a polypeptide comprising an amino acid selected from the group consisting of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734 encodes a polypeptide that retains anti-SARS-CoV-2 activity.

A 19^thaspect is directed to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class II polypeptides (“clusters”) of Table 1, (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of a polypeptide of Table 1.

A 20^thaspect is directed to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1091, 1401, 1062, 1373, 1085, 1395, 1066, 1376, 1080, 1391, 1081, 1392, 1065, 1376, 1092, 1403, 1104, 1415, 1071, 1382, 1107, 1418, 1072, 1383, 1074, 1384, 1115, 1426, 1096, 1407, 1110, 1421, 1116, 1427, 1105, 1416, 1055, 1366, 1070, and 1381 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1091, 1401, 1062, 1373, 1085, 1395, 1066, 1376, 1080, 1391, 1081, 1392, 1065, 1376, 1092, 1403, 1104, 1415, 1071, 1382, 1107, 1418, 1072, 1383, 1074, 1384, 1115, 1426, 1096, 1407, 1110, 1421, 1116, 1427, 1105, 1416, 1055, 1366, 1070, and 1381.

A 21^staspect is directed to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1055, 1366, 1060, 1371, 1061, 1372, 1062, 1373, 1065, 1376, 1066, 1376, 1069, 1380, 1070, 1381, 1071 1382, 1072, 1383, 1074, 1384, 1078, 1389, 1080, 1391, 1081, 1392, 1082, 1393, 1085, 1395, 1086 1397, 1087, 1398, 1088, 1399, 1089, 1400, 1091, 1401, 1092, 1403, 1093, 1404, 1096, 1407, 1100 1411, 1104, 1415, 1105, 1416, 1106, 1417, 1107, 1418, 1110, 1421, 1113, 1424, 1115, 1426, 1116, and 1427 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1055, 1366, 1060, 1371, 1061, 1372, 1062, 1373, 1065, 1376, 1066, 1376, 1069, 1380, 1070, 1381, 1071 1382, 1072, 1383, 1074, 1384, 1078, 1389, 1080, 1391, 1081, 1392, 1082, 1393, 1085, 1395, 1086 1397, 1087, 1398, 1088, 1399, 1089, 1400, 1091, 1401, 1092, 1403, 1093, 1404, 1096, 1407, 1100 1411, 1104, 1415, 1105, 1416, 1106, 1417, 1107, 1418, 1110, 1421, 1113, 1424, 1115, 1426, 1116, and 1427.

A 22^ndaspect is directed to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1062 1373, 1066. 1377, 1079, 1390, 1080, 1391, 1085, 1396, 1124, 1435, 1127, 1438, 1132, 1443, 1134, 1445, 1136, 1447, 2677, 2678, 1140, 1451, 1148, 1459, 1184, 1495, 1185, 1496, 1197, 1507, 1203, 1514, 2679, 2680, 2681, 2682, 1220, 1531, 1222, 1533, 1223, 1534, 2683, 2684, 1234, 1555, 2689, 3690, 1247, 1558, 1250, 1561, 1254, 1565, 2697, 2698, 1261, 1571, 1267, 1578, 1268, 1579, 1270, 1581, 1273, 1584, 1293, 1604, 1300, 1611, 1309, 1619, 1311, 1622, 1314, 1625, 1319, 1630, 2693, 2694, 1333, 1644, 2695, 2696, 1335, 1646, 1337, 1648, 1344, 1655, 1348, 1659, 1351, 1662, 1363 1674, 1365 and 1676 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1062 1373, 1066. 1377, 1079, 1390, 1080, 1391, 1085, 1396, 1124, 1435, 1127, 1438, 1132, 1443, 1134, 1445, 1136, 1447, 2677, 2678, 1140, 1451, 1148, 1459, 1184, 1495, 1185, 1496, 1197, 1507, 1203, 1514, 2679, 2680, 2681, 2682, 1220, 1531, 1222, 1533, 1223, 1534, 2683, 2684, 1234, 1555, 2689, 3690, 1247, 1558, 1250, 1561, 1254, 1565, 2697, 2698, 1261, 1571, 1267, 1578, 1268, 1579, 1270, 1581, 1273, 1584, 1293, 1604, 1300, 1611, 1309, 1619, 1311, 1622, 1314, 1625, 1319, 1630, 2693, 2694, 1333, 1644, 2695, 2696, 1335, 1646, 1337, 1648, 1344, 1655, 1348, 1659, 1351, 1662, 1363 1674, 1365 and 1676.

A 23^rdaspect is directed to to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 2647, 2648, 2649, 2650, 2651, 2652, 2653, 2654, 2655, 2656, 2657, 2658, 2659, 2660, 2661, 2662, 1092, 1403, 2663, 2664, 1093, 1404, 2665, 2666, 2667, 2668, 2669, 2670, 2671, 2672, 1098, 1409, 2673, and 2674 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 2647, 2648, 2649, 2650, 2651, 2652, 2653, 2654, 2655, 2656, 2657, 2658, 2659, 2660, 2661, 2662, 1092, 1403, 2663, 2664, 1093, 1404, 2665, 2666, 2667, 2668, 2669, 2670, 2671, 2672, 1098, 1409, 2673, and 2674.

A 24^thaspect is directed to to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class II polypeptides (“clusters”) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 1071, 1382, 1087, 1398, 2675, 2676, 1124, 1435, 1155, 1466, 1170, 1481, 1174, 1485, 1181, 1492, 1190, 1501, 1192, 1503, 1193, 1504, 2687, 2688, 2685, 2686, 1240, 1551, 1253, 1564, 1255, 1566, 1262, 1573, 1267, 1578, 2691, 2692, 1295, 1606, 1310, 1621, 1344, 1655, 1348, 1659, 1352, 1663, 2699, and 2700 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 1071, 1382, 1087, 1398, 2675, 2676, 1124, 1435, 1155, 1466, 1170, 1481, 1174, 1485, 1181, 1492, 1190, 1501, 1192, 1503, 1193, 1504, 2687, 2688, 2685, 2686, 1240, 1551, 1253, 1564, 1255, 1566, 1262, 1573, 1267, 1578, 2691, 2692, 1295, 1606, 1310, 1621, 1344, 1655, 1348, 1659, 1352, 1663, 2699, and 2700.

A 25^thaspect is directed to to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class I polypeptides (9-mers or 1 Omers) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 2735-8540 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 2735-8540.

A 26^thaspect is directed to to a polypeptide or nucleic acid of any one of aspects 1-7, 10-12, or 17, wherein said polypeptide is one or more Class I polypeptides (9-mers) comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 8541-8690 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 8541-8690.

A 27^thaspect is directed to a polypeptide or nucleic acid of any one of aspects 8-9, or 13-16, or 18, wherein said polypeptide is one or more polypeptides comprising, consisting, or consisting essentially of an amino acid sequence of one or more of SEQ ID NOS: 1677-1681 and 2641-2646.

A 28^thaspect is directed to a polypeptide or nucleic acid of any one of aspects 8-9, or 13-16, or 18, wherein said polypeptide is one or more polypeptides comprising, consisting, or consisting essentially of an amino acid sequence of one or more of SEQ ID NOS: 2723-2734.

A 29^thaspect is directed to a polypeptide or nucleic acid of any one of aspects 8-9, or 13-16, or 18, wherein said polypeptide is one or more polypeptides comprising, consisting, or consisting essentially of an amino acid sequence of one or more of SEQ ID NOS: 2639-2640.

A 30^thaspect is directed to a polypeptide or nucleic acid of any one of aspects 8-9, or 13-16, or 18, wherein said polypeptide is one or more polypeptides comprising, consisting, or consisting essentially of an amino acid sequence of one or more of SEQ ID NOS: 1685-1692.

A 31^staspect is directed to a polypeptide or nucleic acid of any one of aspects 8-9, or 13-16, or 18, wherein said polypeptide is one or more polypeptides comprising, consisting, or consisting essentially of an amino acid sequence of one or more of SEQ ID NOS: 2593-2604.

A 32^ndaspect is directed to a polypeptide or nucleic acid of any one of aspects 8-9, or 13-16, or 18, wherein said polypeptide is one or more polypeptides comprising, consisting, or consisting essentially of an amino acid sequence of one or more of SEQ ID NOS: 2719-2722.

A 33^rdaspect is directed to a plasmid comprising a nucleic acid of any one of aspects 10-32.

A 34^thaspect is directed to a vector comprising a nucleic acid according to any one of aspects 10-32.

A 35^thaspect is directed to a pharmaceutical composition comprising a polypeptide according to any one of aspects 1-9 or 19-32 and a pharmaceutically-acceptable carrier and/or excipient.

A 36^thaspect is directed to a pharmaceutical composition comprising a nucleic acid according to any one of aspects 10-32 and a pharmaceutically-acceptable carrier and/or excipient.

A 37^thaspect is directed to a pharmaceutical composition comprising a plasmid according to aspect 33 and a pharmaceutically-acceptable carrier and/or excipient.

A 38^thaspect is directed to a pharmaceutical composition comprising a vector according to aspect 34 and a pharmaceutically-acceptable carrier and/or excipient.

A 39^thaspect is directed to a vaccine comprising a polypeptide according to any one of aspects 1-9 or 19-32 and a pharmaceutically-acceptable excipient, carrier, and/or adjuvant.

A 40^thaspect is directed to a vaccine comprising a nucleic acid according to any one of aspects 10-32 and a pharmaceutically-acceptable excipient, carrier, and/or adjuvant.

A 41^staspect is directed to a vaccine comprising a plasmid according to aspect 33 and a pharmaceutically-acceptable excipient, carrier, and/or adjuvant.

A 42^ndaspect is directed to a vaccine comprising a vector according to aspect 34 and a pharmaceutically-acceptable excipient, carrier, and/or adjuvant.

A 43^rdaspect is directed to a method for inducing immunity against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a polypeptide according to any one of aspects 1-9 or 19-32.

A 44^thaspect is directed to a method for inducing immunity against a SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a nucleic acid according to any one of aspects 10-32.

A 45^thaspect is directed to a method for inducing immunity against a SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a plasmid according to aspect 33.

A 46^thaspect is directed to a method for inducing immunity SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a vector according to aspect 34.

A 47^thaspect is directed to a method for inducing immunity against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a pharmaceutical composition according to any one of aspects 35-38.

A 48^thaspect is directed to a method for inducing immunity against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a vaccine composition according to any one of aspects 39-42.

A 49^thaspect is directed to a method according to any one of aspects 43-48, wherein the step of administration additionally includes administration of an SARS-CoV-2 virus, wherein the virus is a live attenuated virus or inactivated virus.

A 50^thaspect is directed to a method for inducing an immune response against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of one or more of a polypeptide according to any one of aspects 1-9 or 10-32.

A 51st aspect is directed to a method for inducing an immune response against against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a nucleic acid according to any one of aspects 10-32.

A 52^ndaspect is directed to a method for inducing an immune response against against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a plasmid according to aspect 33.

A 53^rdaspect is directed to a method for inducing an immune response against against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a vector according to aspect 34.

A 54^thaspect is directed to a method for inducing an immune response against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a pharmaceutical composition according to any one of aspects 35-38.

A 55^thaspect is directed to a method for inducing an immune response against against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19, in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a vaccine composition according to any one of aspects 39-42.

A 56^thaspect is directed to a method according to any one of aspects 50-55, wherein the step of administration additionally includes administration of a SARS-CoV-2 virus, wherein the virus is a live attenuated virus or inactivated virus. A 57^thaspect is directed to a chimeric or fusion polypeptide comprising a polypeptide of any one of aspects 1-9 or 10-32, wherein said polypeptide is joined, linked, or inserted into a heterologous polypeptide.

A 58^thaspect is directed to a method for measuring a CMI response against SARS-CoV-2 or a related coronavirus in a subject, said method comprising; collecting a whole blood sample from said subject wherein said whole blood sample comprises cells of the immune system which are capable of producing immune effector molecules following stimulation by an antigen, incubating a mixture comprising the whole blood sample, at least one polypeptide according to any one of aspects 1-9 or 10-32, optionally an amount of an isolated simple sugar effective to enhance the stimulation by the antigen, and optionally heparin, and measuring the presence of, or elevation in, the level of an immune effector molecule wherein the presence or level of said immune effector molecule is indicative of the capacity of said subject to mount a cell-mediated immune response.

A 59^thaspect is directed to a method of aspect 58 wherein the subject is a human.

A 60^thaspect is directed to a method of aspect 58 wherein the whole blood is collected in a tube comprising the at least one polypeptide.

A 61^staspect is directed to a method of aspect 58 wherein the whole blood is collected in a tube comprising heparin.

A 62^ndaspect is directed to a method of aspect 60 wherein the tube comprises heparin.

A 63^rdaspect is directed to a method of aspect 58 wherein the whole blood sample is incubated with the at least one polypeptide for from about 5 to about 50 hours.

A 64^thaspect is directed to a method of aspect 58 wherein the immune effector molecule is a cytokine.

A 65^thaspect is directed to a method of aspect 64 wherein the cytokine is IFN-γ.

A 66^thaspect is directed to a method of aspect 64 wherein the cytokine is GM-CSF.

A 67^thaspect is directed to a method of aspect 64 wherein the cytokine is an interleukin.

A 68^thaspect is directed to a method of aspect 64 wherein the cytokine is a TNF-α.

A 69^thaspect is directed to a method of either aspect 58 or 59 wherein the subject is infected by SARS-CoV-2 or a related coronavirus.

A 70^thaspect is directed to a method of aspect 58 wherein the immune cells are selected from NK cells, T-cells, B-cells, dendritic cells, macrophages or monocytes.

A 71st aspect is directed to a method of aspect 60 wherein the immune cells are T-cells.

A 72^ndaspect is directed to a method of aspect 58 wherein the simple sugar is dextrose.

A 73^rdaspect is directed to a method of aspect 58 wherein the immune effectors are detected with antibodies specific for same.

A 74^thaspect is directed to a method of aspect 73 wherein the immune effectors are detected using ELISA.

A 75^thaspect is directed to a method of aspect 73 wherein the immune effectors are detected using ELISpot.

A 76^thaspect is directed to a assay for identifying SARS-CoV-2 or a related coronavirus-specific immediate effector T cells in a subject, comprising: (a) providing a sample from said subject containing T cells; (b) exposing said T cells to an immunogenic amount of at least one polypeptide according to any one of aspects 1-9 or 10-32; and (c) prior to the generation of new immediate effector T cells in the sample, determining whether said T cells are activated by said polypeptide by measuring secretion of a cytokine from said T cells; wherein activation of said T cells identifies the presence of SARS-CoV-2 or a related coronavirus -specific immediate effector T cells that were present in the original sample, in said subject.

A 77^thaspect is directed to a method of aspect 76, wherein said T cells are peripheral blood mononuclear cells.

A 78^thaspect is directed to a method of aspect 76, wherein the activation of said T cells is determined by measuring secretion of interferon-γ from said T cells.

A 79^thaspect is directed to a method of aspect 74, wherein said subject is known to be suffering, or to have suffered from, infection with SARS-CoV-2 or a related coronavirus.

A 80^thaspect is directed to a method of aspect 79, wherein said infection is monitored.

A 81^staspect is directed to an assay for identifying SARS-CoV-2 or a related coronavirus-specific immediate effector T cells in a subject, comprising: (a) providing a sample from said subject containing T cells; (b) exposing said T cells to an immunogenic amount of at least one polypeptide according to any one of aspects 1-9 or 10-32; (c) incubating said T cells for a period of time which is not sufficient to effect differentiation of quiescent T cells to immediate effector T cells; and (d) determining whether said T cells are activated by said polypeptide by measuring secretion of a cytokine from said T cells, wherein activation of said T cells identifies the presence of SARS-CoV-2 or a related coronavirus-specific immediate effector T cells in said subject.

A 82^ndaspect is directed to a method of either aspect 80 or 81, wherein said T cells are exposed to said polypeptide at around 37° C.

A 83^thaspect is directed to a method of aspect 81, wherein said incubation time is from 4 hours to 24 hours.

A 84^thaspect is directed to a method of aspect 81, wherein said incubation time is from 6 hours to 16 hours.

A 85^thaspect is directed to a method of detecting an anti-SARS-CoV-2 or a related coronavirus CD8+ and/or CD4+ T cell response comprising contacting a population of CD8+ and/or CD4+ T cells of a human individual with one or more polypeptides according to any one of aspects 1-9 or 10-32, wherein one or more polypeptides may be substituted by an analogue which binds a T cell receptor that recognizes the peptide, and determining whether CD8+ and/or CD4+ T cells of the CD8+ and/or CD4+ T cell population recognize the peptide(s).

A 86^thaspect is directed to a method according to aspect 85 wherein a peptide panel is employed, wherein said panel includes said one or more polypeptide or said analogues thereof.

A 87th aspect is directed to a method according to aspect 85 wherein any analogue which is used is (i) at least 70% homologous, preferably at least 80% homologous, more preferably at least 90% homologous, to the entire polypeptide, and/or (ii) has one or more deletions at the N-terminus and/or C-terminus in comparison to the polypeptide, and/or (iii) has one or more conservative substitutions compared to the polypeptide.

A 88^thaspect is directed to a method according to aspect 85 in which the recognition of the polypeptide(s) by the CD8+ and/or CD4+ T cells is determined by measuring secretion of a cytokine from the CD8+ and/or CD4+ T cells.

89^thaspect is directed to a method according to aspect 88 in which IFN-γ secretion from the T cells is measured.

A 90^thaspect is directed to a method according to aspect 89 in which IFN-γ secretion from the CD8+ and/or CD4+ T cells is determined by allowing secreted IFN-γ to bind an immobilized antibody specific to the cytokine and then determining the presence of antibody/cytokine complex.

A 91^staspect is directed to a method according to aspect 85 in which the CD8+ and/or CD4+ T cells are freshly isolated ex vivo cells from peripheral blood.

A 92^ndaspect is directed to a method according to aspect 85 in which CD8+ and/or CD4+ T cells are pre-cultured in vitro with the peptide(s).

A 93^rdaspect is directed to a method according to aspect 85 wherein the population of CD8+ and/or CD4+ T cells is from an individual to whom an anti-SARS-CoV-2 or related coronavirus vaccine has been administered.

A 94^thaspect is directed to a method according to aspect 85 which is carried out in vitro.

A 95th aspect is directed to a method of diagnosing infection in a human host by, or exposure of a human host to, a SARS-CoV-2 or a related coronavirus, which method comprises the steps of: (i) contacting a population of T cells from the host with one or more polypeptides according to any one of aspects 1-9 or 10-32; and (ii) determining in vitro whether the T cells of said T cell population show a recognition response to said polypeptide.

A 96^thaspect is directed to a method of aspect 95, wherein the T cells are freshly isolated.

A 97^thaspect is directed to a method of aspect 95, wherein the T cells are isolated from blood.

A 98^thaspect is directed to a method of aspect 95, wherein the T cell population comprises CD4+ and/or CD8+ T cells.

A 99^thaspect is directed to a method of aspect 95, wherein the host is a healthy human host who has been exposed to SARS-CoV-2 or a related coronavirus.

A 100^thaspect is directed to a kit comprising one or more polypeptides according to any one of aspects 1-9 or 10-32, wherein said one or more polypeptides may be substituted by an analogue which binds a T cell receptor which recognizes the polypeptide, and optionally a means to detect recognition of the polypeptide(s) by CD8+ and/or CD4+ T cells.

A 101st aspect is directed to a kit according to aspect 100 which includes an antibody to IFN-γ.

A 102^ndaspect is directed to a kit according to aspect 100 wherein said antibody is immobilized on a solid support and which optionally also includes a means to detect any antibody/IFN-γ complex.

A 103^rdaspect is directed to a kit according to aspect 100 which includes the means to detect recognition of the peptide(s) by CD8+ and/or CD4+ T cells.

A 104^thaspect is directed to a kit according to aspect 102 which includes the means to detect any antibody/IFN-γ complex.

A 105^thaspect is directed to a method of preventing, treating, or ameliorating a disease by SARS-CoV-2 infection, such as COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of one or more of a polypeptide according to any one of aspects 1-9 or 10-32.

A 106^thaspect is directed to a method of preventing, treating, or ameliorating a disease by SARS-CoV-2 infection, such as COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a nucleic acid according to any one of aspects 10-32.

A 107^thaspect is directed to a method of preventing, treating, or ameliorating a disease by SARS-CoV-2 infection, such as COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a plasmid according to aspect 33.

A 106^thaspect is directed to a method of preventing, treating, or ameliorating a disease by SARS-CoV-2 infection, such as COVID-19 in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a pharmaceutical composition according to any one of aspects 35-38.

A 108^thaspect is directed to a method according to any one of aspects 50-55, wherein the step of administration additionally includes administration of a SARS-CoV-2 virus, wherein the virus is a live attenuated virus or inactivated virus.

A 109^thaspect is directed to a polypeptide according to any one of aspects 1-9 or 10-32, wherein said polypeptide has one or more conservative substitutions compared to the polypeptide.

A 110^thaspect is directed to a polypeptide according to aspect 109, wherein said polypeptide retains MHC binding propensity and TCR specificity, and/or retains anti-SARS-CoV-2 activity.

Further aspects and advantages of the instant disclosure are provided in the following section, which should be considered as illustrative only.

EXAMPLES

The examples that follow are not to be construed as limiting the scope of the invention in any manner. In light of the present disclosure, numerous embodiments within the scope of the claims will be apparent to those of ordinary skill in the art.

Example 1: In-Silico Identification of Potential Epitopes for HLA

T-cells specifically recognize epitopes presented by cells in the context of MHC (Major Histocompatibility Complex) Class I and II molecules. These T-cell epitopes can be represented as linear sequences comprising 7 to 30 contiguous amino acids that fit into the MHC Class I or II binding groove. A number of computer algorithms have been developed and used for detecting Class I and II epitopes within protein molecules of various origins (De Groot A S et al., (1997), AIDS Res Hum Retroviruses,13(7):539-41; Schafer J R et al., (1998), Vaccine,16(19):1880-4; De Groot A S et al., (2001), Vaccine, 19(31):4385-95; De Groot A S et al., (2003), Vaccine, 21(27-30):4486-504). These “in silico” predictions of T-cell epitopes have been successfully applied to the design of vaccines and the de-immunization of therapeutic proteins, i.e. antibody-based drugs, Fc fusion proteins, anticoagulants, blood factors, bone morphogenetic proteins, engineered protein scaffolds, enzymes, growth factors, hormones, interferons, interleukins, and thrombolytics (Dimitrov D S, (2012), Methods Mol Biol, 899:1-26).

The Conservatrix system (EpiVax, Providence, R.I.) is an algorithm useful for identifying 9-mer polypeptide sequences from a larger set of data. The Conservatrix system parses input sequences into 9-mer sequences that are conserved amongst multiple inputted whole sequences, such as multiple strains of the same pathogen, for even the most mutable of potential vaccine targets. These 9-mer sequences may be searched for identically matched 9-mer sequences across data sets.

The EpiMatrix™ system (EpiVax, Providence, R.I.) is a set of predictive algorithms encoded into computer programs useful for predicting class I and class II HLA ligands and T cell epitopes. The EpiMatrix™ system uses matrices in order to model the interaction between specific amino acids and binding positions within the HLA molecule. In order to identify putative epitopes resident within any given input protein, the EpiMatrix™ System first parses the input protein into a set of overlapping n-mer frames (n=length of amino acids of epitope peptide being screen; e.g., n=9 or n=10) where each frame overlaps the last by n-1 amino acids. Each frame is then scored for predicted affinity to one or more common alleles of the HLA molecules. Briefly, for any given n-mer peptide specific amino acid codes (one for each of 20 naturally occurring amino acids) and relative binding positions (1 to n) are used to select coefficients from the predictive matrix. Individual coefficients are derived using a proprietary method similar to, but not identical to, the pocket profile method first developed by Sturniolo (Sturniolo T et al., 1999, Nat Biotechnol, 17(6):555-61). Individual coefficients are then summed to produce a raw score. EpiMatrix™ raw scores are then normalized with respect to a score distribution derived from a very large set of randomly generated peptide sequences. The resulting “Z” scores are normally distributed and directly comparable across alleles. It was determined that any peptide scoring above 1.64 on the EpiMatrix™ “Z” scale (approximately the top 5% of any given peptide set) has a significant chance of binding to the MHC molecule for which it was predicted. Peptides scoring above 2.32 on the scale (the top 1%) are extremely likely to bind. FIGS. 2-4 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2. FIGS. 6-15 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2. FIGS. 17-55 are EpiMatrix Cluster detail reports for identified MHC class II clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2. FIG. 59 is an EpiMatrix staircase report for identified MHC class I clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2. FIG. 60 is an EpiMatrix staircase report for identified MHC class I clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2. FIG. 61 is an EpiMatrix staircase report for identified MHC class I clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2.

Peptides containing clusters of putative T cell epitopes are more likely to test positive in validating in vitro and in vivo assays. In aspects, the results of the initial EpiMatrix™ analysis is further screened for the presence of putative T cell epitope “clusters” using a second proprietary algorithm known as Clustimer™ algorithm. The Clustimer™ algorithm identifies sub-regions contained within any given amino acid sequence that contains a statistically unusually high number of putative T cell epitopes. Typical T-cell epitope “clusters” range from about 9 to roughly 30 amino acids in length and, considering their affinity to multiple alleles and across multiple 9-mer frames, can contain anywhere from about 4 to about 40 putative T cell epitopes. FIG. 1 is an overview of MHC class II cluster selection from the envelope (SEQ ID NO: 1) of SARS-CoV-2. FIG. 5 is an overview of MHC class II cluster selection from the membrane (SEQ ID NO: 2) of SARS-CoV-2. FIG. 16 is an overview of MHC class II cluster selection from the spike (SEQ ID NO: 3) of SARS-CoV-2. Each epitope cluster identified an aggregate EpiMatrix™ score is calculated by summing the scores of the putative T cell epitopes and subtracting a correcting factor based on the length of the candidate epitope cluster and the expected score of a randomly generated cluster of the same length. EpiMatrix™ cluster scores in excess of +10 are considered significant. In aspects, the T-cell epitopes of the instant disclosure contain several putative T-cell epitopes forming a pattern known as a T-cell epitope cluster.

The JanusMatrix system (EpiVax, Providence, Rhode Island) useful for screening peptide sequences for cross-conservation with a host proteome. JanusMatrix is an algorithm that predicts the potential for cross-reactivity between peptide clusters and the host genome or proteome, based on conservation of TCR-facing residues in their putative MHC ligands. The JanusMatrix algorithm first considers all the predicted epitopes contained within a given protein sequence and divides each predicted epitope into its constituent agretope and epitope. Each sequence is then screened against a database of host proteins. Peptides with a compatible MHC-facing agretope (i.e., the agretopes of both the input peptide and its host counterparty are predicted to bind the same MHC allele) and exactly the same TCR-facing epitope are returned. The JanusMatrix

Homology Score suggests a bias towards immune tolerance. In the case of a therapeutic protein, cross-conservation between autologous human epitopes and epitopes in the therapeutic may increase the likelihood that such a candidate will be tolerated by the human immune system. In the case of a vaccine, cross-conservation between human epitopes and the antigenic epitopes may indicate that such a candidate utilizes immune camouflage, thereby evading the immune response and making for an ineffective vaccine. When the host is, for example, a human, the peptide clusters are screened against human genomes and proteomes, based on conservation of TCR-facing residues in their putative HLA ligands. The peptides are then scored using the JanusMatrix Homology Score. In aspects, peptides with a JanusMatrix Homology Score below 2.5 or below 3.0 indicate low tolerogenicity potential and may be useful for vaccines. In aspects, peptides with a JanusMatrix Homology Score above 3.0 indicate high tolerogenicity potential and may not be useful for vaccines, and in aspects may be excluded from the T cell epitope compositions of the present disclosure. FIG. 56 is the JanusMatrix reports for identified MHC class II clusters of the envelope (SEQ ID NO: 1) of SARS-CoV-2. FIG. 57 is the JanusMatrix reports for identified MHC class II clusters of the membrane (SEQ ID NO: 2) of SARS-CoV-2. FIG. 58 is the JanusMatrix reports for identified MHC class II clusters of the spike (SEQ ID NO: 3) of SARS-CoV-2.

In aspects, the VaccineCAD system is useful for arranging potential epitopic vaccine candidates into a string to avoid creation of novel epitopes upon joining of the vaccine candidate sequences. Specifically, VaccineCAD designs potential vaccine candidates into a string-of-beads vaccine while minimizing any deleterious junctional epitopes that may appear in the joining process. VaccineCAD may use EpiMatrix to predict junctional epitopes. Particularly concatemeric peptides of interest developed using VaccineCad are those of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734, and in aspects SEQ ID NOS: 1677-1684. Similarly, particular nucleic acids of interest (including RNA, mRNA, DNA, etc.) are those encoding a peptide or polypeptide comprising, consisting of, or consisting essentially of one or more peptides or polypeptides having an amino acid sequence off SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734.

Example 2: Concatemeric Peptides

2.1—A total of 22-34 class I and class II peptides were selected for inclusion in a first set of the instantly-disclosed vaccine constructs, following immunoinformatic predictions. Selection was based on at least, high binding likelihood to HLA class I and class II alleles, and low tolerogenicity potential. Putative class I epitopes were in the top 1% of predicted ligands, and had Janus Matrix Homology Scores below 2. Putative class II epitopes, were predicted to bind to four or more HLA alleles, and had JanusMatrix Homology Scores below 2. The selected epitope clusters for HLA Class I and Class II used to produce the concatemeric peptides of SEQ ID NOS: 1677-1681, 2593-2604, 2641-2646, and 2719-2722 (and associated nucleic acid constructs encoding such) include the below sequences in TABLE 2. The selected epitope clusters for HLA

Class I and Class II used to produce the concatemeric peptides of SEQ ID NOS: 1682-1684 (and associated nucleic acid constructs encoding such) include the below sequences in TABLE 3.

TABLE 2

>ENVELOPE: 0001 Surface

MYSFVSEETGTLIVN (SEQ ID NO: 1055)

>ENVELOPE: 0060 Intracellular

SRVKNLNSSRVPD (SEQ ID NO: 1058)

>MEMBRANE: 0013 Surface

LKKLLEQWNLVIGFLFLTW (SEQ ID NO: 1061)

>MEMBRANE: 0032 Transmembrane

ICLLQFAYANRNRFLYI (SEQ ID NO: 1118)

>MEMBRANE: 0062 Transmembrane

LACFVLAAVYRINWI (SEQ ID NO: 1064)

>MEMBRANE: 0093 Transmembrane

LSYFIASFRLFARTRSMWSFNPETNILLNV (SEQ ID 2638)

>MEMBRANE: 0146 Intracellular

RGHLRIAGHHLGRCD (SEQ ID NO: 1119)

>MEMBRANE: 0165 Intracellular

PKEITVATSRTLSYY (SEQ ID NO: 1070)

>MEMBRANE: 0201 Intracellular

IGNYKLNTDHSSSSDNI (SEQ ID NO: 1072)

>SPIKE: 0025 Extracellular

PPAYTNSFTRGVYY (SEQ ID NO: 1074)

>SPIKE: 0052 Extracellular

QDLFLPFFSNVTWFHAIHVSGTNGT (SEQ ID NO: 2617)

>SPIKE: 0087 Extracellular

NDGVYFASTEKSNIIR (SEQ ID NO: 1120)

>SPIKE: 0114 Extracellular

TQSLLIVNNATNVVIKV (SEQ ID NO: 1975)

>SPIKE: 0140 Extracellular

FLGVYYHKNNKSWMESEFRVYSSANNCTFEYV (SEQ ID

NO: 2621)

>SPIKE: 0198 Extracellular

DGYFKIYSKHTPINL (SEQ ID NO: 1082)

>SPIKE: 0239 Extracellular

QTLLALHRSYLTPGDSS (SEQ ID NO: 1085)

>SPIKE: 0272 Extracellular

PRTFLLKYNENGTITDA (SEQ ID NO: 1086)

>SPIKE: 0315 Extracellular

TSNFRVQPTESIVRF (SEQ ID NO: 1086)

>SPIKE: 0338 Extracellular

FGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFK

(SEQ ID NO: 2637)

>SPIKE: 0446 Extracellular

GGNYNYLYRLFRKSNLKPFERDI (SEQ ID NO: 1091)

>SPIKE: 0483 Extracellular

VEGFNCYFPLQSYGFQPTNGVGYQPY (SEQ ID NO: 2620)

>SPIKE: 0512 Extracellular

VLSFELLHAPATVCG (SEQ ID NO: 1095)

>SPIKE: 0536 Extracellular

NKCVNFNFNGLTGTGVLTE (SEQ ID NO: 2616)

>SPIKE: 0715 Extracellular

PTNFTISVTTEILPV (SEQ ID NO: 1098)

>SPIKE: 0753 Extracellular

LLQYGSFCTQLNRALTGIAVEQ (SEQ ID NO: 1099)

>SPIKE: 0797 Extracellular

FGGFNFSQILPDPSKPSKRS (SEQ ID NO: 1100)

>SPIKE: 0866 Extracellular

TDEMIAQYTSALLA (SEQ ID NO: 1104)

>SPIKE: 0895 Extracellular

QIPFAMQMAYRFNGIGV (SEQ ID NO: 1123_

>SPIKE: 0920 Extracellular

QKLIANQFNSAIGKIQDSL (SEQ ID NO: 2609)

>SPIKE: 0998 Extracellular

TGRLQSLQTYVTQQL (SEQ ID NO: 1110)

>SPIKE: 1044 Extracellular

GKGYHLMSFPQSAPH (SEQ ID NO: 1113)

>SPIKE: 1127 Extracellular

DVVIGIVNNTVYDPL (SEQ ID NO: 1114)

>SPIKE: 1152 Extracellular

LDKYFKNHTSPDVDL (SEQ ID NO: 1115)

>SPIKE: 1169 Extracellular

ISGINASVVNIQKEI (SEQ ID NO: 1116)

TABLE 3

>PENTAMER1: 0140

FLGVYYHKNNKSWMESE (SEQ ID NO: 1080)

>PENTAMER1: 0483

VEGFNCYFPLQSYGFQPT (SEQ ID NO: 1092)

>PENTAMER1: 1152

LDKYFKNHTSPDVDL (SEQ ID NO: 1115)

>PENTAMER1: 0536

NKCVNFNFNGLTGT (SEQ ID NO: 1122)

>PENTAMER1: 0315

TSNFRVQPTESIVRF (SEQ ID NO: 1089)

>PENTAMER2: 0154

ESEFRVYSSANNCTFEYV (SEQ ID NO: 1081)

>PENTAMER2: 0797

FGGFNFSQILPDPSKPSKRS (SEQ ID NO: 1100)

>PENTAMER2: 0998

TGRLQSLQTYVTQQL (SEQ ID NO: 1110)

>PENTAMER2: 0492

LQSYGFQPTNGVGYQPY (SEQ ID NO: 1093)

>PENTAMER2: 0025

PPAYTNSFTRGVYY (SEQ ID NO: 3)

>PENTAMER3: 0446

GGNYNYLYRLFRKSNLKPFERDI (SEQ ID NO: 1091)

>PENTAMER3: 0239

QTLLALHRSYLTPGDSS (SEQ ID NO: 1085)

>PENTAMER3: 0140

FLGVYYHKNNKSWMESE (SEQ ID NO: 1080)

>PENTAMER3: 0272

PRTFLLKYNENGTITDA (SEQ ID NO: 1086)

>PENTAMER3: 0154

ESEFRVYSSANNCTFEYV (SEQ ID NO: 1081)

>PENTAMER1: 0140

FLGVYYHKNNKSWMESE (SEQ ID NO: 1080)

>PENTAMER1: 0483

VEGFNCYFPLQSYGFQPT (SEQ ID NO: 1092)

>PENTAMER1: 1152

LDKYFKNHTSPDVDL (SEQ ID NO: 1115)

>PENTAMER1: 0536

NKCVNFNFNGLTGT (SEQ ID NO: 1122)

>PENTAMER1: 0315

TSNFRVQPTESIVRF (SEQ ID NO: 1087)

>PENTAMER2: 0154

ESEFRVYSSANNCTFEYV (SEQ ID NO: 1081)

>PENTAMER2: 0797

FGGFNFSQILPDPSKPSKRS (SEQ ID NO: 1100)

>PENTAMER2: 0998

TGRLQSLQTYVTQQL (SEQ ID NO: 1110)

>PENTAMER2: 0492

LQSYGFQPTNGVGYQPY (SEQ ID NO: 1093)

>PENTAMER2: 0025

PPAYTNSFTRGVYY (SEQ ID NO: 1074)

>PENTAMER3: 0446

GGNYNYLYRLFRKSNLKPFERDI (SEQ ID NO: 1091)

>PENTAMER3: 0239

QTLLALHRSYLTPGDSS (SEQ ID NO: 1085)

>PENTAMER3: 0140

FLGVYYHKNNKSWMESE (SEQ ID NO: 1080)

>PENTAMER3: 0272

PRTFLLKYNENGTITDA (SEQ ID NO: 1086)

>PENTAMER3: 0154

ESEFRVYSSANNCTFEYV (SEQ ID NO 1081)

Predicted epitope sequences were concatenated to form 22 multi-epitope pseudo-proteins. Vaccine constructs predicted to have no junctional epitopes were designed. VaccineCAD was used to rearrange the peptides to avoid creation of novel epitopes at peptide junctions, and used JanusMatrix to predict junctional epitopes. Where reordering did not sufficiently reduce the potential for junctional immunogenicity, spacers (e.g., Gly-Pro-Gly-Pro-Gly) were introduced. In aspects, a cleavage promoting motif or a binding inhibiting ‘breaker’ sequence could be introduced between peptides to optimize epitope processing. In aspects, T cell epitope cluster flanking residues were extended or removed to further minimize junctional T cell epitope content. These post-VaccineCAD, optimized multi-epitope constructs are represented by sequences SEQ ID NOS: 1677-1684,2593-2604, 2641-2646, and 2719-2722. These constructs are located below in Tables 4-11, Tables 35-40, and Tables 41-44. Additional epitope concatemers containing coronavirus cross-conserved sequences are represented by SEQ ID NOS: 1685-1692 and 2723-2734, and are found in Tables 12-19 and 45-56.

TABLE 4

SEQ ID NO: 1677

TQSLLIVNNATNVVIKVFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVP

KEITVATSRTLSYYGSGSGRGHLRIAGHHLGRCDNKCVNFNFNGLTGTGV

LTELLQYGSFCTQLNRALTGIAVEQLSYFIASFRLFARTRSMWSFNPETN

ILLNVGGNYNYLYRLFRKSNLKPFERDILKKLLEQWNLVIGFLFLTWGSG

SGFGGFNFSQILPDPSKPSKRSVLSFELLHAPATVCGQTLLALHRSYLTP

GDSSICLLQFAYANRNRFLYIVEGFNCYFPLQSYGFQPTNGVGYQPYDVV

IGIVNNTVYDPLQIPFAMQMAYRFNGIGVTGRLQSLQTYVTQQLTDEMIA

QYTSALLANDGVYFASTEKSNIIRPPAYTNSFTRGVYYLDKYFKNHTSPD

VDLPTNFTISVTTEILPVGSGSGTSNFRVQPTESIVRFDGYFKIYSKHTP

INLQDLFLPFFSNVTWFHAIHVSGTNGTSRVKNLNSSRVPDPRTFLLKYN

ENGTITDAISGINASVVNIQKEIFGEVFNATRFASVYAWNRKRISNCVAD

YSVLYNSASFSTFKGKGYHLMSFPQSAPHQKLIANQFNSAIGKIQDSLGS

GSGIGNYKLNTDHSSSSDNIMYSFVSEETGTLIVNLACFVLAAVYRINWI

TABLE 5

SEQ ID NO: 1678

FGGFNFSQILPDPSKPSKRSGGNYNYLYRLFRKSNLKPFERDIFLGVYYH

KNNKSWMESEFRVYSSANNCTFEYVPRTFLLKYNENGTITDASRVKNLNS

SRVPDTQSLLIVNNATNVVIKVGSGSGGKGYHLMSFPQSAPHISGINASV

VNIQKEIGSGSGLKKLLEQWNLVIGFLFLTWQIPFAMQMAYRFNGIGVTG

RLQSLQTYVTQQLGSGSGIGNYKLNTDHSSSSDNIRGHLRIAGHHLGRCD

VLSFELLHAPATVCGLLQYGSFCTQLNRALTGIAVEQLSYFIASFRLFAR

TRSMWSFNPETNILLNVNDGVYFASTEKSNIIRGSGSGQKLIANQFNSAI

GKIQDSLMYSFVSEETGTLIVNFGEVFNATRFASVYAWNRKRISNCVADY

SVLYNSASFSTFKQTLLALHRSYLTPGDSSPPAYTNSFTRGVYYVEGFNC

YFPLQSYGFQPTNGVGYQPYDVVIGIVNNTVYDPLGSGSGLACFVLAAVY

RINWIPKEITVATSRTLSYYPTNFTISVTTEILPVNKCVNFNFNGLTGTG

VLTEDGYFKIYSKHTPINLQDLFLPFFSNVTWFHAIHVSGTNGTTSNFRV

QPTESIVRFTDEMIAQYTSALLALDKYFKNHTSPDVDLICLLQFAYANRN

RFLYI

TABLE 6

SEQ ID NO: 1679

GKGYHLMSFPQSAPHQTLLALHRSYLTPGDSSVLSFELLHAPATVCGQKL

IANQFNSAIGKIQDSLLKKLLEQWNLVIGFLFLTWQIPFAMQMAYRFNGI

GVPKEITVATSRTLSYYPTNFTISVTTEILPVICLLQFAYANRNRFLYIN

DGVYFASTEKSNIIRTDEMIAQYTSALLANKCVNFNFNGLTGTGVLTEDG

YFKIYSKHTPINLQDLFLPFFSNVTWFHAIHVSGTNGTTGRLQSLQTYVT

QQLGSGSGLACFVLAAVYRINWIVEGFNCYFPLQSYGFQPTNGVGYQPYF

LGVYYHKNNKSWMESEFRVYSSANNCTFEYVLDKYFKNHTSPDVDLPPAY

TNSFTRGVYYPRTFLLKYNENGTITDASRVKNLNSSRVPDTQSLLIVNNA

TNVVIKVGGNYNYLYRLFRKSNLKPFERDIMYSFVSEETGTLIVNFGGFN

FSQILPDPSKPSKRSLLQYGSFCTQLNRALTGIAVEQISGINASVVNIQK

EIDVVIGIVNNTVYDPLRGHLRIAGHHLGRCDTSNFRVQPTESIVRFFGE

VFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKIGNYKLNTDHSS

SSDNILSYFIASFRLFARTRSMWSFNPETNILLNV

TABLE 35

SEQ ID NO: 2641

GKGYHLMSFPQSAPHQTLLALHRSYLTPGDSSVLSFELLHAPATVCGQKL

IANQFNSAIGKIQDSLLKKLLEQWNLVIGFLFLTWQIPFAMQMAYRFNGI

GVPKEITVATSRTLSYYPTNFTISVTTEILPVICLLQFAYANRNRFLYIN

DGVYFASTEKSNIIRTDEMIAQYTSALLANKCVNFNFNGLTGTGVLTEDG

YFKIYSKHTPINLQDLFLPFFSNVTWFHAIHVSGTNGTTGRLQSLQTYVT

QQLSSDYTIVLPVIGAIVVGLCLMGMGVYKIRLRCQSSGYQRI

TABLE 36

SEQ ID NO: 2642

LACFVLAAVYRINWIVEGFNCYFPLQSYGFQPTNGVGYQPYFLGVYYHKN

NKSWMESEFRVYSSANNCTFEYVLDKYFKNHTSPDVDLPPAYTNSFTRGV

YYPRTFLLKYNENGTITDASRVKNLNSSRVPDTQSLLIVNNATNVVIKVG

GNYNYLYRLFRKSNLKPFERDIMYSFVSEETGTLIVNFGGFNFSQILPDP

SKPSKRSLLQYGSFCTQLNRALTGIAVEQISGINASVVNIQKEIDVVIGI

VNNTVYDPLRGHLRIAGHHLGRCDTSNFRVQPTESIVRFFGEVFNATRFA

SVYAWNRKRISNCVADYSVLYNSASFSTFKIGNYKLNTDHSSSSDNILSY

FIASFRLFARTRSMWSFNPETNILLNV

TABLE 7

SEQ ID NO: 1680

TSNFRVQPTESIVRFGSGSGDVVIGIVNNTVYDPLGSGSGISGINASVVN

IQKEIGSGSGNDGVYFASTEKSNIIRGSGSGGKGYHLMSFPQSAPHGSGS

GRGHLRIAGHHLGRCDGSGSGQDLFLPFFSNVTWFHAIHVSGTNGTGSGS

GLACFVLAAVYRINWIGSGSGVEGFNCYFPLQSYGFQPTNGVGYQPYGSG

SGNKCVNFNFNGLTGTGVLTEGSGSGFGGFNFSQILPDPSKPSKRSGSGS

GTDEMIAQYTSALLAGSGSGGGNYNYLYRLFRKSNLKPFERDIGSGSGQT

LLALHRSYLTPGDSSGSGSGFLGVYYHKNNKSWMESEFRVYSSANNCTFE

YVGSGSGQKLIANQFNSAIGKIQDSLGSGSGPRTFLLKYNENGTITDAGS

GSGVLSFELLHAPATVCGGSGSGTQSLLIVNNATNVVIKVGSGSGIGNYK

LNTDHSSSSDNIGSGSGSRVKNLNSSRVPDGSGSGLLQYGSFCTQLNRAL

TGIAVEQGSGSGMYSFVSEETGTLIVNGSGSGPKEITVATSRTLSYYGSG

SGICLLQFAYANRNRFLYIGSGSGLKKLLEQWNLVIGFLFLTWGSGSGFG

EVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKGSGSGLDKYFK

NHTSPDVDLGSGSGQIPFAMQMAYRFNGIGVGSGSGPPAYTNSFTRGVYY

GSGSGPTNFTISVTTEILPVGSGSGTGRLQSLQTYVTQQLGSGSGLSYFI

ASFRLFARTRSMGSGSGDGYFKIYSKHTPINLWSFNPETNILLNV

TABLE 37

SEQ ID NO: 2643

TSNFRVQPTESIVRFGSGSGDVVIGIVNNTVYDPLGSGSGISGINASVVN

IQKEIGSGSGNDGVYFASTEKSNIIRGSGSGGKGYHLMSFPQSAPHGSGS

GRGHLRIAGHHLGRCDGSGSGQDLFLPFFSNVTWFHAIHVSGTNGTGSGS

GLACFVLAAVYRINWIGSGSGVEGFNCYFPLQSYGFQPTNGVGYQPYGSG

SGNKCVNFNFNGLTGTGVLTEGSGSGFGGFNFSQILPDPSKPSKRSGSGS

FGTDEMIAQYTSALLAGSGSGGGNYNYLYRLRKSNLKPFERDIGSGSGQT

LLALHRSYLTPGDSSGSGSGFLGVYYHKNNKSWMESEFRVYSSANNCTFE

YVGSGSGQKLIANQFNSAIGKIQDSL

TABLE 38

SEQ ID NO: 2644

PRTFLLKYNENGTITDAGSGSGVLSFELLHAPATVCGGSGSGTQSLLIVN

NATNVVIKVGSGSGIGNYKLNTDHSSSSDNIGSGSGSRVKNLNSSRVPDG

SGSGLLQYGSFCTQLNRALTGIAVEQGSGSGMYSFVSEETGTLIVNGSGS

GPKEITVATSRTLSYYGSGSGICLLQFAYANRNRFLYIGSGSGLKKLLEQ

WNLVIGFLFLTWGSGSGFGEVFNATRFASVYAWNRKRISNCVADYSVLYN

SASFSTFKGSGSGLDKYFKNHTSPDVDLGSGSGQIPFAMQMAYRFNGIGV

GSGSGPPAYTNSFTRGVYYGSGSGDGYFKIYSKHTPINLGSGSGPTNFTI

SVTTEILPVGSGSGTGRLQSLQTYVTQQLGSGSGLSYFIASFRLFARTRS

MWSFNPETNILLNV

TABLE 8

SEQ ID NO: 1681

RGHLRIAGHHLGRCDGSGSGSGSLKKLLEQWNLVIGFLFLTWGSGSGSGS

TQSLLIVNNATNVVIKVGSGSGSGSDVVIGIVNNTVYDPLGSGSGSGSIG

NYKLNTDHSSSSDNIGSGSGSGSTGRLQSLQTYVTQQLGSGSGSGSDGYF

KIYSKHTPINLGSGSGSGSPTNFTISVTTEILPVGSGSGSGSQIPFAMQM

AYRFNGIGVGSGSGSGSSRVKNLNSSRVPDGSGSGSGSTSNFRVQPTESI

VRFGSGSGSGSISGINASVVNIQKEIGSGSGSGSTDEMIAQYTSALLAGS

GSGSGSFGGFNFSQILPDPSKPSKRSGSGSGSGSLACFVLAAVYRINWIG

SGSGSGSQDLFLPFFSNVTWFHAIHVSGTNGTGSGSGSGSNDGVYFASTE

KSNIIRGSGSGSGSNKCVNFNFNGLTGTGVLTEGSGSGSGSGGNYNYLYR

LFRKSNLKPFERDIGSGSGSGSPPAYTNSFTRGVYYGSGSGSGSQTLLAL

HRSYLTPGDSSGSGSGSGSVLSFELLHAPATVCGGSGSGSGSFLGVYYHK

NNKSWMESEFRVYSSANNCTFEYVGSGSGSGSQKLIANQFNSAIGKIQDS

LGSGSGSGSLLQYGSFCTQLNRALTGIAVEQGSGSGSGSPRTFLLKYNEN

GTITDAGSGSGSGSVEGFNCYFPLQSYGFQPTNGVGYQPYGSGSGSGSGK

GYHLMSFPQSAPHGSGSGSGSLSYFIASFRLFARTRSMWSFNPETNILLN

VGSGSGSGSLDKYFKNHTSPDVDLGSGSGSGSMYSFVSEETGTLIVNGSG

SGSGSFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKGSGS

GSGSPKEITVATSRTLSYYGSGSGSGSICLLQFAYANRNRFLYI

TABLE 39

SEQ ID NO: 2645

RGHLRIAGHHLGRCDGSGSGSGSLKKLLEQWNLVIGFLFLTWGSGSGSGS

TQSLLIVNNATNVVIKVGSGSGSGSDVVIGIVNNTVYDPLGSGSGSGSIG

NYKLNTDHSSSSDNIGSGSGSGSTGRLQSLQTYVTQQLGSGSGSGSDGYF

KIYSKHTPINLGSGSGSGSPTNFTISVTTEILPVGSGSGSGSQIPFAMQM

AYRFNGIGVGSGSGSGSSRVKNLNSSRVPDGSGSGSGSTSNFRVQPTESI

VRFGSGSGSGSISGINASVVNIQKEIGSGSGSGSTDEMIAQYTSALLAGS

GSGSGSFGGFNFSQILPDPSKPSKRSGSGSGSGSLACFVLAAVYRINWIG

SGSGSGSQDLFLPFFSNVTWFHAIHVSGTNGT

TABLE 40

SEQ ID NO: 2646

NDGVYFASTEKSNIIRGSGSGSGSNKCVNFNFNGLTGTGVLTEGSGSGSG

SGGNYNYLYRLFRKSNLKPFERDIGSGSGSGSPPAYTNSFTRGVYYGSGS

GSGSQTLLALHRSYLTPGDSSGSGSGSGSVLSFELLHAPATVCGGSGSGS

GSFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVGSGSGSGSQKLIANQF

NSAIGKIQDSLGSGSGSGSLLQYGSFCTQLNRALTGIAVEQGSGSGSGSP

RTFLLKYNENGTITDAGSGSGSGSVEGFNCYFPLQSYGFQPTNGVGYQPY

GSGSGSGSGKGYHLMSFPQSAPHGSGSGSGSLSYFIASFRLFARTRSMWS

FNPETNILLNVGSGSGSGSLDKYFKNHTSPDVDLGSGSGSGSMYSFVSEE

TGTLIVNGSGSGSGSFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSA

SFSTFKGSGSGSGSPKEITVATSRTLSYYGSGSGSGSICLLQFAYANRNR

FLYI

TABLE 9

SEQ ID NO: 1682

FLGVYYHKNNKSWMESEGSGSGTSNFRVQPTESIVRFLDKYFKNHTSPD

VDLNKCVNFNFNGLTGTVEGFNCYFPLQSYGFQPT

TABLE 10

SEQ ID NO: 1683

ESEFRVYSSANNCTFEYVPPAYTNSFTRGVYYTGRLQSLQTYVTQQLFG

GFNFSQILPDPSKPSKRSLQSYGFQPTNGVGYQPY

TABLE 11

SEQ ID NO: 1684

PRTFLLKYNENGTITDAQTLLALHRSYLTPGDSSGGNYNYLYRLFRKSN

LKPFERDIFLGVYYHKNNKSWMESEESEFRVYSSANNCTFEYV

TABLE 12

SEQ ID NO: 1685

LSHFVNLDNLRANNTKGSLPIANYIFWRNTNPIQLSSYGPGPGDVDTDF

VNEFYAYLRKHFSMMILSD

TABLE 13

SEQ ID NO: 1686

LGSLIYSTAALGVLMSNLGMPSYGPGPGDQAISMWALIISVTSNYSGVV

TTVGPGPGVASIKNFKSVLYYQNNVFMSEAK

TABLE 14

SEQ ID NO: 1687

WPLIVTALRANSAVKLQKVGGNYNYLYRLFRKSNLKPFERDIMPNMLRI

MASLVLARKHT

TABLE 15

SEQ ID NO: 1688

LLPLTQYNRYLALYNKYKYFSGAMDGPGPGDMTYRRLISMMGFKMNYQV

GPGPGYEDLLIRKSNHNFLVQAGN

TABLE 16

SEQ ID NO: 1689

GDQFKHLIPLMYKGLPWNVVRIKGPGPGIPTITQMNLKYAISAKNRART

VAGGPGPGNSWLMWLIINLVQMAPISAMVRM

TABLE 17

SEQ ID NO: 1690

TAFVTNVNASSSEAFGPGPGLYFIKGLNNLNRGGPGPGRKAVFISPYNS

QNAVASKILGL

TABLE 18

SEQ ID NO: 1691

VDAVNLLTNMFTPLIQPIGALDKGRLIIRENNRVVISSGPGPGTERLKL

FAAETLKATEE

TABLE 19

SEQ ID NO: 1692

SSNVNRFNVAITRAKVGILSIMGPGPGGSLYVNKHAFHTPAFDTQSLLI

VNNATNWIKV

TABLE 41

SEQ ID NO: 2719

NSSFSRGETRCEQDLEGKGYHLMSFPQSAPHQTLLALHRSYLTPGDSSV

LSFELLHAPATVCGQKLIANQFNSAIGKIQDSLLKKLLEQWNLVIGFLF

LTWQIPFAMQMAYRFNGIGVPKEITVATSRTLSYYPTNFTISVTTEILP

VICLLQFAYANRNRFLYINDGVYFASTEKSNIIRTDEMIAQYTSALLAN

KCVNFNFNGLTGTGVLTEDGYFKIYSKHTPINLQDLFLPFFSNVTWFHA

IHVSGTNGTTGRLQSLQTYVTQQLGSGSGLACFVLAAVYRINWIVEGFN

CYFPLQSYGFQPTNGVGYQPYFLGVYYHKNNKSWMESEFRVYSSANNCT

FEYVLDKYFKNHTSPDVDLPPAYTNSFTRGVYYPRTFLLKYNENGTITD

ASRVKNLNSSRVPDTQSLLIVNNATNVVIKVGGNYNYLYRLFRKSNLKP

FERDIMYSFVSEETGTLIVNFGGFNFSQILPDPSKPSKRSLLQYGSFCT

QLNRALTGIAVEQISGINASVVNIQKEIDVVIGIVNNTVYDPLRGHLRI

AGHHLGRCDTSNFRVQPTESIVRFFGEVFNATRFASVYAWNRKRISNCV

ADYSVLYNSASFSTFKIGNYKLNTDHSSSSDNILSYFIASFRLFARTRS

MWSFNPETNILLNV

TABLE 42

SEQ ID NO: 2720

NSSFSRGETRCEQDLERGHLRIAGHHLGRCDGSGSGSGSLKKLLEQWNL

VIGFLFLTWGSGSGSGSTQSLLIVNNATNVVIKVGSGSGSGSDVVIGIV

NNTVYDPLGSGSGSGSIGNYKLNTDHSSSSDNIGSGSGSGSTGRLQSLQ

TYVTQQLGSGSGSGSDGYFKIYSKHTPINLGSGSGSGSPTNFTISVTTE

ILPVGSGSGSGSQIPFAMQMAYRFNGIGVGSGSGSGSSRVKNLNSSRVP

DGSGSGSGSTSNFRVQPTESIVRFGSGSGSGSISGINASVVNIQKEIGS

GSGSGSTDEMIAQYTSALLAGSGSGSGSFGGFNFSQILPDPSKPSKRSG

SGSGSGSLACFVLAAVYRINWIGSGSGSGSQDLFLPFFSNVTWFHAIHV

SGTNGTGSGSGSGSNDGVYFASTEKSNIIRGSGSGSGSNKCVNFNFNGL

TGTGVLTEGSGSGSGSGGNYNYLYRLFRKSNLKPFERDIGSGSGSGSPP

AYTNSFTRGVYYGSGSGSGSQTLLALHRSYLTPGDSSGSGSGSGSVLSF

ELLHAPATVCGGSGSGSGSFLGVYYHKNNKSWMESEFRVYSSANNCTFE

YVGSGSGSGSQKLIANQFNSAIGKIQDSLGSGSGSGSLLQYGSFCTQLN

RALTGIAVEQGSGSGSGSPRTFLLKYNENGTITDAGSGSGSGSVEGFNC

YFPLQSYGFQPTNGVGYQPYGSGSGSGSGKGYHLMSFPQSAPHGSGSGS

GSLSYFIASFRLFARTRSMWSFNPETNILLNVGSGSGSGSLDKYFKNHT

SPDVDLGSGSGSGSMYSFVSEETGTLIVNGSGSGSGSFGEVFNATRFAS

VYAWNRKRISNCVADYSVLYNSASFSTFKGSGSGSGSPKEITVATSRTL

SYYGSGSGSGSICLLQFAYANRNRFLYI

TABLE 43

SEQ ID NO: 2721

NSSFSRGETRCEQDLEGKGYHLMSFPQSAPHQTLLALHRSYLTPGDSSV

LSFELLHAPATVCGQKLIANQFNSAIGKIQDSLQIPFAMQMAYRFNGIG

VPKEITVATSRTLSYYPTNFTISVTTEILPVICLLQFAYANRNRFLYIN

DGVYFASTEKSNIIRTDEMIAQYTSALLANKCVNFNFNGLTGTGVLTED

GYFKIYSKHTPINLQDLFLPFFSNVTWFHAIHVSGTNGTTGRLQSLQTY

VTQQLVEGFNCYFPLQSYGFQPTNGVGYQPYFLGVYYHKNNKSWMESEF

RVYSSANNCTFEYVLDKYFKNHTSPDVDLPPAYTNSFTRGVYYPRTFLL

KYNENGTITDASRVKNLNSSRVPDTQSLLIVNNATNVVIKVGGNYNYLY

RLFRKSNLKPFERDIMYSFVSEETGTLIVNFGGFNFSQILPDPSKPSKR

SLACFVLAAVYRINWILLQYGSFCTQLNRALTGIAVEQISGINASVVNI

QKEIDVVIGIVNNTVYDPLRGHLRIAGHHLGRCDLKKLLEQWNLVIGFL

FLTWTSNFRVQPTESIVRFFGEVFNATRFASVYAWNRKRISNCVADYSV

LYNSASFSTFKIGNYKLNTDHSSSSDNILSYFIASFRLFARTRSMWSFN

PETNILLNV

TABLE 44

SEQ ID NO: 2722

NSSFSRGETRCEQDLEGKGYHLMSFPQSAPHQTLLALHRSYLTPGDSSV

LSFELLHAPATVCGQKLIANQFNSAIGKIQDSLQIPFAMQMAYRFNGIG

VPKEITVATSRTLSYYPTNFTISVTTEILPVICLLQFAYANRNRFLYIN

DGVYFASTEKSNIIRTDEMIAQYTSALLANKCVNFNFNGLTGTGVLTED

GYFKIYSKHTPINLQDLFLPFFSNVTWFHAIHVSGTNGTTGRLQSLQTY

VTQQLVEGFNCYFPLQSYGFQPTNGVGYQPYFLGVYYHKNNKSWMESEF

RVYSSANNCTFEYVLDKYFKNHTSPDVDLPPAYTNSFTRGVYYPRTFLL

KYNENGTITDASRVKNLNSSRVPDTQSLLIVNNATNVVIKVGGNYNYLY

RLFRKSNLKPFERDIMYSFVSEETGTLIVNFGGFNFSQILPDPSKPSKR

SLACFVLAAVYRINWIGPGPGLLQYGSFCTQLNRALTGIAVEQISGINA

SVVNIQKEIDVVIGIVNNTVYDPLRGHLRIAGHHLGRCDLKKLLEQWNL

VIGFLFLTWGPGPGTSNFRVQPTESIVRFFGEVFNATRFASVYAWNRKR

ISNCVADYSVLYNSASFSTFKIGNYKLNTDHSSSSDNILSYFIASFRLF

ARTRSMWSFNPETNILLNV

TABLE 45

SEQ ID NO: 2723

FLGVYYHKNNKSWMESEGSGSGTSNFRVQPTESIV

RFLDKYFKNHTSPDVDLNKCVNFNFNGLTGTVEGF

NCYFPLQSYGFQPTGPGPGSSDYTIVLPVIGAIVV

GLCLMGMGVYKIRLRCQSSGYQRI

TABLE 46

SEQ ID NO: 2724

ESEFRVYSSANNCTFEYVPPAYTNSFTRGVYYTGR

LQSLQTYVTQQLFGGFNFSQILPDPSKPSKRSLQS

YGFQPTNGVGYQPYGPGPGSSDYTIVLPVIGAIVV

GLCLMGMGVYKIRLRCQSSGYQRI

TABLE 47

SEQ ID NO: 2725

PRTFLLKYNENGTITDAQTLLALHRSYLTPGDSSG

GNYNYLYRLFRKSNLKPFERDIFLGVYYHKNNKSW

MESEESEFRVYSSANNCTFEYVGPGPGSSDYTIVL

PVIGAIVVGLCLMGMGVYKIRLRCQSSGYQRI

TABLE 48

SEQ ID NO: 2726

FLGVYYHKNNKSWMESEGSGSGTSNFRVQPTESIV

RFLDKYFKNHTSPDVDLNKCVNFNFNGLTGTVEGF

NCYFPLQSYGFQPTSSDYTIVLPVIGAIVVGLCLM

GMGVYKIRLRCQSSGYQRI

TABLE 49

SEQ ID NO: 2727

ESEFRVYSSANNCTFEYVPPAYTNSFTRGVYYTGR

LQSLQTYVTQQLFGGFNFSQILPDPSKPSKRSLQS

YGFQPTNGVGYQPYSSDYTIVLPVIGAIVVGLCLM

GMGVYKIRLRCQSSGYQRI

TABLE 50

SEQ ID NO: 2728

PRTFLLKYNENGTITDAQTLLALHRSYLTPGDSSG

GNYNYLYRLFRKSNLKPFERDIFLGVYYHKNNKSW

MESEESEFRVYSSANNCTFEYVSSDYTIVLPVIGA

IVVGLCLMGMGVYKIRLRCQSSGYQRI

TABLE 51

SEQ ID NO: 2729

MAAPGSARRPLLLLLLLLLLGLMHCASAAFLGVYY

HKNNKSWMESEGSGSGTSNFRVQPTESIVRFLDKY

FKNHTSPDVDLNKCVNFNFNGLTGTVEGFNCYFPL

QSYGFQPTGPGPGSSDYTIVLPVIGAIVVGLCLMG

MGVYKIRLRCQSSGYQRI

TABLE 52

SEQ ID NO: 2730

MAAPGSARRPLLLLLLLLLLGLMHCASAAESEFRV

YSSANNCTFEYVPPAYTNSFTRGVYYTGRLQSLQT

YVTQQLFGGFNFSQILPDPSKPSKRSLQSYGFQPT

NGVGYQPYGPGPGSSDYTIVLPVIGAIVVGLCLMG

MGVYKIRLRCQSSGYQRI

TABLE 53

SEQ ID NO: 2731

MAAPGSARRPLLLLLLLLLLGLMHCASAAPRTFLL

KYNENGTITDAQTLLALHRSYLTPGDSSGGNYNYL

YRLFRKSNLKPFERDIFLGVYYHKNNKSWMESEES

EFRVYSSANNCTFEYVGPGPGSSDYTIVLPVIGAI

VVGLCLMGMGVYKIRLRCQSSGYQRI

TABLE 54

SEQ ID NO: 2732

MAAPGSARRPLLLLLLLLLLGLMHCASAAGKGYHL

MSFPQSAPHQTLLALHRSYLTPGDSSVLSFELLHA

PATVCGQKLIANQFNSAIGKIQDSLQIPFAMQMAY

RFNGIGVPKEITVATSRTLSYYPTNFTISVTTEIL

PVICLLQFAYANRNRFLYINDGVYFASTEKSNIIR

TDEMIAQYTSALLANKCVNFNFNGLTGTGVLTEDG

YFKIYSKHTPINLQDLFLPFFSNVTWFHAIHVSGT

NGTTGRLQSLQTYVTQQLVEGFNCYFPLQSYGFQP

TNGVGYQPYFLGVYYHKNNKSWMESEFRVYSSANN

CTFEYVLDKYFKNHTSPDVDLPPAYTNSFTRGVYY

PRTFLLKYNENGTITDASRVKNLNSSRVPDTQSLL

IVNNATNVVIKVGGNYNYLYRLFRKSNLKPFERDI

MYSFVSEETGTLIVNFGGFNFSQILPDPSKPSKRS

LACFVLAAVYRINWIGPGPGLLQYGSFCTQLNRAL

TGIAVEQISGINASVVNIQKEIDVVIGIVNNTVYD

PLRGHLRIAGHHLGRCDLKKLLEQWNLVIGFLFLT

WGPGPGTSNFRVQPTESIVRFFGEVFNATRFASVY

AWNRKRISNCVADYSVLYNSASFSTFKIGNYKLNT

DHSSSSDNILSYFIASFRLFARTRSMWSFNPETNI

LLNVGPGPGSSDYTIVLPVIGAIVVGLCLMGMGVY

KIRLRCQSSGYQRI

TABLE 55

SEQ ID NO: 2733

MAAPGSARRPLLLLLLLLLLGLMHCASAAGKGYHL

MSFPQSAPHQTLLALHRSYLTPGDSSVLSFELLHA

PATVCGQKLIANQFNSAIGKIQDSLQIPFAMQMAY

RFNGIGVPKEITVATSRTLSYYPTNFTISVTTEIL

PVICLLQFAYANRNRFLYINDGVYFASTEKSNIIR

TDEMIAQYTSALLANKCVNFNFNGLTGTGVLTEDG

YFKIYSKHTPINLQDLFLPFFSNVTWFHAIHVSGT

NGTTGRLQSLQTYVTQQLVEGFNCYFPLQSYGFQP

TNGVGYQPYFLGVYYHKNNKSWMESEFRVYSSANN

CTFEYVLDKYFKNHTSPDVDLPPAYTNSFTRGVYY

GPGPGSSDYTIVLPVIGAIVVGLCLMGMGVYKIRL

RCQSSGYQRI

TABLE 56

SEQ ID NO: 2734

MAAPGSARRPLLLLLLLLLLGLMHCASAAPRTFLL

KYNENGTITDASRVKNLNSSRVPDTQSLLIVNNAT

NVVIKVGGNYNYLYRLFRKSNLKPFERDIMYSFVS

EETGTLIVNFGGFNFSQILPDPSKPSKRSLACFVL

AAVYRINWIGPGPGLLQYGSFCTQLNRALTGIAVE

QISGINASVVNIQKEIDVVIGIVNNTVYDPLRGHL

RIAGHHLGRCDLKKLLEQWNLVIGFLFLTWGPGPG

TSNFRVQPTESIVRFFGEVFNATRFASVYAWNRKR

ISNCVADYSVLYNSASFSTFKIGNYKLNTDHSSSS

DNILSYFIASFRLFARTRSMWSFNPETNILLNVGP

GPGSSDYTIVLPVIGAIVVGLCLMGMGVYKIRLRC

QSSGYQR

2.2.—Reference datasets for individual antigens were compiled from SARS-CoV-2, SARS-CoV-1, MERS-CoV, and human CoV (HKU1, 0C43, NL63, 229E). All the protein sequences were downloaded from GenBank. As a first step, each of the protein sequences from the SARS-CoV-2 reference strain (Wuhan 1_2020; GenBank MN908947) were parsed into overlapping 9-mers. The binding potential of each 9-mer was predicted against a set of nine class II HLA supertype alleles using the EpiMatrix algorithm. To define regions that have an unusually high potential for immunogenicity (T cell epitope clusters), each SARS-CoV-2 protein was screened with the ClustiMer algorithm. EpiMatrix Cluster Scores were calculated for each identified T cell epitope cluster. T cell epitope clusters were then screened for cross-conservation against the human proteome using the JanusMatrix algorithm. T cell epitope clusters with JanusMatrix Human Homology Scores above two were considered as potentially tolerogenic (Tregitopes). To identify T cell epitope cluster cross-conserved with other coronaviruses, both the standard and a less stringent version of the JanusMatrix algorithm were applied. Finally, the Class I T cell epitope content for 9-mer and 10-mer frames of each T cell epitope clusters was identified for a set of six class I HLA supertype alleles using EpiMatrix.

SARS-CoV-2 T cell epitope clusters were next ranked based on Cluster Scores, JanusMatrix Human Homology Scores, and JanusMatrix Coronavirus Homology Scores. Peptides with Cysteines in their cores were excluded. The top 22 T cell epitope clusters were selected for vaccine design.

The selected T cell epitope clusters were then aggregated into epitope concatemers using the VaxCAD algorithm. VaxCAD also optimized the sequence arrangement to minimize Class I and II junctional epitope content. Gly-Pro-Gly-Pro-Gly spacer sequences were introduced between epitopes to remove junctional epitopes where reordering did not sufficiently reduce potential junctional immunogenicity. T cell epitope cluster flanking residues were extended or removed to further minimize junctional T cell epitope content. Nine concatemers were constructed with two or three T cell epitope clusters per concatemer. The peptides with identified originating protein and starting position utilized to generate the concatemers of SEQ ID NOS: 2593-2604 (FIG. 120) are presented in Tables 20-28. Table 29 also presents 9 concatemer constructs.

TABLE 20

Cocatemer 1

>ORF1AB_METHYLTRANSFERASE_0226

ANYIFWRNTNPIQLSSY (SEQ ID NO: 1713)

>ORF1AB_RNA_POLYMERASE_0738

DVDTDFVNEFYAYLRKHFSMMILSD

(SEQ ID NO: 1724)

TABLE 21

Concatemer 2

>ORF1AB_NSP60162

ISMWALIISVTSNYSGVVTTV

(SEQ ID NO: 172)

>ORF1AB_RNA_POLYMERASE_0776

VASIKNFKSVLYYQNNVFMSEAK

(SEQ ID NO: 1756)

TABLE 22

Concatemer 3

> ORF1AB NSP8 0182

WPLIVTALRANSAVKLQ

(SEQ ID NO: 1772)

> SPIKE_0446

KVGGNYNYLYRLFRKSNLKPFERDI

(SEQ ID NO: 1783)

> ORFIAB_RNA_POLYMERASE_0626

MPNMLRIMASLVLARKHT (SEQ ID NO 1801)

TABLE 23

Concatemer 4

>ORF1AB_NSP4_0435

LLPLTQYNRYLALYNKYKYFSGAMD (SEQ ID NO: 1812)

>ORF1AB_EXONUCLEASE_0048

DMTYRRLISMMGFKMNYQV (SEQ ID NO: 1831)

TABLE 24

Concatemer 5

>ORF1AB_EXONUCLEASE_0143

GDQFKHLIPLMYKGLPWNVVRIK (SEQ ID NO: 1844)

>ORF1AB_RNA_POLYMERASE_0539

IPTITQMNLKYAISAKNRARTVAG (SEQ ID NO: 1861)

TABLE 25

Concatemer 6

>ORF1AB_METHYLTRANSFERASE 0191

TAFVTNVNASSSEAF (SEQ ID NO: 1879)

>ORF1AB_NSP9_0088

LYFIKGLNNLNRG (SEQ ID NO: 1890)

>ORF1AB_HELICASE_0507

RKAVFISPYNSQNAVASKILGL (SEQ ID NO: 1898)

TABLE 26

Concatemer 7

>ORF1AB_NSP4_0258

VDAVNLLTNMFTPLIQPIGALD (SEQ ID NO: 1913)

>ORF1AB_METHYLTRANSFERASE_0277

KGRLIIRENNRVVISS (SEQ ID NO: 1929)

>ORF1AB_HELICASE_0127

TERLKLFAAETLKATEE (SEQ ID NO: 1939)

TABLE 27

Concatemer 8

>ORF1AB_HELICASE_0555

SSNVNRFNVAITRAKVGILSIM (SEQ ID NO: 1949)

>ORF1AB_EXONUCLEASE_0416

GGSLYVNKHAFHTPAFD (SEQ ID NO: 1965)

>SPIKE_0114

TQSLLIVNNATNVVIKV (SEQ ID NO: 1975)

TABLE 28

Concatemer 9

>ORF1AB_NSP3_1700

LSHFVNLDNLRANNTKGSLPI (SEQ ID NO: 1985)

>ORF1AB_PROTEINASE_0054

YEDLLIRKSNHNFLVQAGN (SEQ ID NO 1999)

TABLE 29

Concatemer
Sequence

1
ANYIFWRNTNPIQLSSYGPGPGDVDTDFVNEFYAYLRK

HFSMMILSD (SEQ ID NO: 2593)

2
ISMWALIISVTSNYSGVVTTVGPGPGVASIKNFKSVLY

YQNNVFMSEAK (SEQ ID NO: 2594)

3
WPLIVTALRANSAVKLQKVGGNYNYLYRLFRKSNLKPF

ERDIMPNMLRIMASLVLARKHT (SEQ ID NO:

2595)

4
LLPLTQYNRYLALYNKYKYFSGAMDGPGPGDMTYRRLI

SMMGFKMNYQV (SEQ ID NO: 2596)

5
GDQFKHLIPLMYKGLPWNVVRIKGPGPGIPTITQMNLK

YAISAKNRARTVAG (SEQ ID NO: 2597)

6
TAFVTNVNASSSEAFGPGPGLYFIKGLNNLNRGGPGPG

RKAVFISPYNSQNAVASKILGL (SEQ ID NO:

2598)

7
VDAVNLLTNMFTPLIQPIGALDKGRLIIRENNRVVISS

GPGPGTERLKLFAAETLKATEE (SEQ ID NO:

2599)

8
SSNVNRFNVAITRAKVGILSIMGPGPGGSLYVNKHAFH

TPAFDTQSLLIVNNATNVVIKV (SEQ ID NO:

2600)

9
LSHFVNLDNLRANNTKGSLPIGPGPGYEDLLIRKSNHN

FLVQAGN (SEQID NO: 2601)

2.3—34 class I and class II peptides were selected for inclusion in a third set of the instantly-disclosed vaccine constructs, following further immunoinformatic predictions. Peptides assessed in 2.1 above were further modified and assembled to produce the concatemeric peptides of SEQ ID NOS: 2602-2604 (and associated nucleic acid constructs encoding such) include the below sequences in TABLES 30-32 (SEQ ID NOS: 2605-2638—with original peptide source sequences also identified). The final concatemer sequences are presented in FIG. 120 (SEQ ID NOS: 2602-2604) and in Tables 33 and 34.

TABLE 30

>Spike 1044-SEQ ID NO: 2605-from SEQ ID NO: 655

GKGYHLMSFPQSAPH

>Spike 239-SEQ ID NO: 2606-from SEQ ID NO: 342

QTLLALHRSYLTPGDSS

>Spike 512-SEQ ID NO: 2607-from SEQ ID NO: 465

VLSFELLHAPATVCG

>Spike 920-SEQ ID NO: 2608-from SEQ ID NO: 578

QKLIANQFNS AIGKIQD SL

>Spike 895-SEQ ID NO: 2609-from SEQ ID NO: 568

QIPFAMQMAYRFNGIGV

>Membrane 165-SEQ ID NO: 2610-from SEQ ID NO: 178

PKEITVATSRTLSYY

>Spike 715-SEQ ID NO: 2611-from SEQ ID NO: 492

PTNFTISVTTEILPV

>Membrane 32-SEQ ID NO: 2612-from SEQ ID NO: 81

ICLLQFAYANRNRFLYI

>Spike 87-SEQ ID NO: 2613-from SEQ ID NO: 274

NDGVYFASTEKSNIIR

>Spike 866-SEQ ID NO: 2614-from SEQ ID NO: 556

TDEMIAQYTSALLA

>Spike 536-SEQ ID NO: 2615-from SEQ ID NO: 480

NKCVNFNFNGLTGTGVLTE

>Spike 198-SEQ ID NO: 2616-from SEQ ID NO: 320

DGYFKIYSKHTPINL

>Spike 52-SEQ ID NO: 2617-from SEQ ID NO: 251

QDLFLPFFSNVTWFHAIHVSGTNGT

TABLE 31

>SPIKE 998-SEQ ID NO: 2618-from SEQ ID NO: 626

TGRLQSLQTYVTQQL

>SPIKE 483-SEQ ID NO: 2619-from SEQ ID NO: 444

VEGFNCYFPLQSYGFQPTNGVGYQPY

>SPIKE 140-SEQ ID NO: 2620-from SEQ ID NO: 296

FLGVYYHKNNKSWMESEFRVYSSANNCTFEYV

>SPIKE 1152-SEQ ID NO: 2621-from SEQ ID NO: 677

LDKYFKNHTSPDVDL

>SPIKE 25-SEQ ID NO: 2622-from SEQ ID NO: 229

PPAYTNSFTRGVYY

>SPIKE 272-SEQ ID NO: 2623-from SEQ ID NO: 359

PRTFLLKYNENGTITDA

>ENV 60-SEQ ID NO: 2624-from SEQ ID NO: 46

SRVKNLNSSRVPD

>SPIKE 114-SEQ ID NO: 2625-from SEQ ID NO: 284

TQSLLIVNNATNVVIKV

>SPIKE 446-SEQ ID NO: 2626-from SEQ ID NO: 424

GGNYNYLYRLFRKSNLKPFERDI

>ENV 1-SEQ ID NO: 2627-from SEQ ID NO: 4

MYSFVSEETGTLIVN

>SPIKE 797-SEQ ID NO: 2628-from SEQ ID NO: 519

FGGFNFSQILPDPSKPSKRS

TABLE 32

MEMBRANE 60-SEQ ID NO: 2629-from SEQ ID NO: 116

SRVKNLNSSRVPD

>SPIKE 753-SEQ ID NO: 2630-from SEQ ID NO: 500

LLQYGSFCTQLNRALTGIAVEQ

>SPIKE 1169-SEQ ID NO: 2631-from SEQ ID NO: 686

ISGINASVVNIQKEI

>SPIKE 1127-SEQ ID NO: 2632-from SEQ ID NO: 668

DVVIGIVNNTVYDPL

>MEMBRANE 146-SEQ ID NO: 2633-from SEQ ID NO: 164

RGHLRIAGHHLGRCD

>MEMBRANE 13-SEQ ID NO: 2634-from SEQ ID NO: 69

LKKLLEQWNLVIGFLFLTW

>SPIKE 315-SEQ ID NO: 2635-from SEQ ID NO: 375

TSNFRVQPTESIVRF

>SPIKE 338-SEQ ID NO: 2636-from SEQ ID NO: 389

and 401

FGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFK

>MEMBRANE 201-SEQ ID NO: 2637-from SEQ ID NO: 199

IGNYKLNTDHSSSSDNI

>MEMBRANE 93-SEQ ID NO: 2638-from SEQ ID NO: 129

LSYFIASFRLFARTRSMWSFNPETNILLNV

TABLE 33

SEQ ID NO: 2639

GKGYHLMSFPQSAPHQTLLALHRSYLTPGDSSVLSFELLHAPATVCGQKL

IANQFNSAIGKIQDSLQIPFAMQMAYRFNGIGVPKEITVATSRTLSYYPT

NFTISVTTEILPVICLLQFAYANRNRFLYINDGVYFASTEKSNIIRTDEM

IAQYTSALLANKCVNFNFNGLTGTGVLTEDGYFKIYSKHTPINLQDLFLP

FFSNVTWFHAIHVSGTNGTTGRLQSLQTYVTQQLVEGFNCYFPLQSYGFQ

PTNGVGYQPYFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVLDKYFKNH

TSPDVDLPPAYTNSFTRGVYYPRTFLLKYNENGTITDASRVKNLNSSRVP

DTQSLLIVNNATNVVIKVGGNYNYLYRLFRKSNLKPFERDIMYSFVSEET

GTLIVNFGGFNFSQILPDPSKPSKRSLACFVLAAVYRINWIGPGPGLLQY

GSFCTQLNRALTGIAVEQISGINASVVNIQKEIDVVIGIVNNTVYDPLRG

HLRIAGHHLGRCDLKKLLEQWNLVIGFLFLTWGPGPGTSNFRVQPTESIV

RFFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKIGNYKLN

TDHSSSSDNILSYFIASFRLFARTRSMWSFNPETNILLNV

TABLE 34

SEQ ID NO: 2640

GKGYHLMSFPQSAPHGSGSGQTLLALHRSYLTPGDSSGSGSGVLSFELLH

APATVCGGSGSGQKLIANQFNSAIGKIQDSLGSGSGQIPFAMQMAYRFNG

IGVGSGSGPKEITVATSRTLSYYGSGSGPTNFTISVTTEILPVGSGSGIC

LLQFAYANRNRFLYIGSGSGNDGVYFASTEKSNIIRGSGSGTDEMIAQYT

SALLAGSGSGNKCVNFNFNGLTGTGVLTEGSGSGDGYFKIYSKHTPINLG

SGSGQDLFLPFFSNVTWFHAIHVSGTNGTGSGSGTGRLQSLQTYVTQQLG

SGSGVEGFNCYFPLQSYGFQPTNGVGYQPYGSGSGFLGVYYHKNNKSWME

SEFRVYSSANNCTFEYVGSGSGLDKYFKNHTSPDVDLGSGSGPPAYTNSF

TRGVYYGSGSGPRTFLLKYNENGTITDAGSGSGSRVKNLNSSRVPDGSGS

GTQSLLIVNNATNVVIKVGSGSGGGNYNYLYRLFRKSNLKPFERDIGSGS

GMYSFVSEETGTLIVNGSGSGFGGFNFSQILPDPSKPSKRSGSGSGLACF

VLAAVYRINWIGPGPGLLQYGSFCTQLNRALTGIAVEQGSGSGISGINAS

VVNIQKEIGSGSGDVVIGIVNNTVYDPLGSGSGRGHLRIAGHHLGRCDGS

GSGLKKLLEQWNLVIGFLFLTWGPGPGTSNFRVQPTESIVRFGSGSGFGE

VFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKGSGSGIGNYKLN

TDHSSSSDNIGSGSGLSYFIASFRLFARTRSMWSFNPETNILLNV

Example 3: Administration of SARS-CoV-2 Vaccine

Vaccine construct designs are developed, such as is demonstrated in the specification, and in aspects as specifically exemplified in Example 2. In aspects, this results in a concatemeric polypeptide vaccine or an “epistring” that consists of overlapping T-cell epitopes. As described throughout, such vaccines may be used for stimulating, inducing, and/or expanding an immune response against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19. In aspects, such vaccines initiate a strong T-cell mediated immune response, and has potential of inducing a humoral immune response.

Therefore, a vaccine containing a combination of the epistring together with either live attenuated virus (LAV) or inactivated virus is administered in an immunization trial in an appropriate animal model , e.g., mice, rats, rabbits, hamsters, etc., or even humans, as are known in the art. Data from administration of this combination vaccine provides positive results on the safety and effectiveness of the vaccine. This vaccination approach is expected to induce both cellular and humoral immune responses, thereby stimulating, inducing, and/or expanding an immune response against SARS-CoV-2 infection (or a closely related virus such as Severe Acute Respiratory Syndrome (SARS) or Middle East respiratory syndrome coronavirus (MERS-CoV)) and/or related diseases caused by SARS-CoV-2, including COVID-19, in humans.

Example 4: Polypeptide Binding to MHC
Methods for the Assessment Polypeptides as Disclosed Herein Binding to Soluble MHC.

Synthesis of peptides. The polypeptides of the present disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) are produced by direct chemical synthesis or by recombinant methods (J Sambrook et al., Molecular Cloning: A Laboratory Manual, (2ED, 1989), Cold Spring Harbor Laboratory Press, Cold Springs Harbor, N.Y. (Publ)). Every peptide undergoes rigorous quality control characterization before release to determine purity, mass, and correct sequence. Peptides are assessed for purity by reversed phase high-pressure liquid chromatography (RP-HPLC). Peptides are >90% pure, and each preparation will undergo Amino

Acid Analysis to ensure that the equivalent molar amounts are used in assays for consistency and reproducibility between different lots of peptides, and will also allow for reliable comparison studies between peptide efficacy. Peptides are assessed for mass and correct sequence using tandem mass spectrometry and MS CheckT analysis. In certain aspects, the polypeptides as disclosed herein may be capped with an n-terminal acetyl and/or c-terminal amino group. HPLC, mass spectrometry and UV scan (ensuring purity, mass and spectrum, respectively) analysis of the selected polypeptides will indicate ≥80% purity.

HLA Binding Assay. Binding activity is analyzed at EpiVax (Providence, Rhode Island) and is conducted for any polypeptides of the present disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692). The binding assay used (Steere AC et al., (2006), J Exp Med, 2003(4):961-71) yields an indirect measure of peptide-MHC affinity. Soluble HLA molecules are loaded onto a 96-well plate with the unlabeled experimental polypeptides and labeled control peptide. Once the binding mixture reaches steady equilibrium (at 24 hours), the HLA-polypeptide complexes are captured on an ELISA plate coated with anti-human DR antibody and detected with a Europium-linked probe for the label (PerkinElmer, Waltham, Mass.). Time-resolved fluorescence measuring bound labeled control peptide is assessed by a SpectraMax® M5 unit (Spectramax, Radnor, Pa.). Binding of experimental polypeptides is expressed as the percent inhibition of the labeled control peptide (experimental fluorescence/control fluorescence multiplied by 100). The percent inhibition values for each experimental polypeptide (across a range of molar concentrations) is used to calculate the concentration at which it inhibits 50% of the labeled control polypeptide's specific binding, i.e., the polypeptides's IC_50.

Select experimental polypeptides are solvated in DMSO. The diluted polypeptide is mixed with binding reagents in aqueous buffering solution, yielding a range of final concentrations from 100,000 nM down to 100 nM. The select polypeptides are assayed against a panel of eight common Class II HLA alleles: DRB1*0101, DRB1*0301, DRB1*0401, DRB1*0701, DRB1*0801, DRB1*1101, DRB1*1301, and DRB1*1501. From the percent inhibition of labeled control peptide at each concentration, IC₅₀values are derived for each polypeptide/allele combination using linear regression analysis.

In this assay, the experimental polypeptides are considered to bind with very high affinity if they inhibit 50% of control peptide binding at a concentration of 100 nM or less, high affinity if they inhibit 50% of control peptide binding at a concentration between 100 nM and 1,000 nM, and moderate affinity if they inhibit 50% of control peptide binding at a concentration between 1,000 nM and 10,000 nM. Low affinity peptides inhibit 50% of control peptide binding at concentrations between 10,000 nM and 100,000 nM. Peptides that fail to inhibit at least 50% of control peptide binding at any concentration below 100,000 nM and do not show a dose response are considered non-binders (NB).

Peptide Characterization by Binding to HLA Class II Molecules

Soluble MHC binding assays are performed on any of the instantly disclosed polypeptides (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692). Soluble MHC binding assays are performed on selected polypeptides as disclosed herein according to the methods described previously. IC₅₀values (nM) will be derived from a six-point inhibition curve. EpiMatrix™ Predictions, calculated IC₅₀values, and results classifications are reported for each polypeptide and HLA allele. Binding curves are generated for certain polypeptides against the selected Class II HLA alleles, such as for the HLA DRB1 *0801 assay and the HLA DRB1 *1501 assay.

Example 5: Peptide Exposed APCs
Methods for Assessing the Phenotype of PeptideExposed APC

Surface expression of Class II HLA (HLA-DR) and CD86 by professional antigen presenting cells (APCs) is one way APCs modulate T cell response. In this assay, candidate polypeptides are tested for their ability to effect (e.g., upregulate) the expression of Class II HLA and the co-stimulatory molecule CD86 on the surface of professional APCs, specifically dendritic cells.

Polypeptides of the present disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692) are individually tested for effector potential using a proprietary APC phenotyping assay previously developed at EpiVax (EpiVax, Providence, R.I.). Previously harvested and frozen PBMC are thawed and suspended in chRPMI by conventional means. HLA typing is conducted on small, extracted samples of cellular material, provided by EpiVax, by Hartford Hospital (Hartford, Conn.). On assay day 0, 0.5×106 cells are extracted, screened for the presence of surface marker CD11c (a marker specific to dendritic cells) and are analyzed for the presence of surface markers HLA-DR and CD86 by flow cytometry. The remaining cells are plated (4.0×106 cell per ml in chRPMI plus 800u1 media) and are stimulated (50 μg/mL) with one of the selected peptides or positive and negative controls including buffer only (negative control), Tregitope 167 (negative control for effector activity) (21st Century Biochemicals, Marlboro, Mass.), Flu-HA 306-318 (positive control) (21ST Century Biochemicals, Marlboro, Mass.) and Ova 323-339 (negative control) (21st Century Biochemicals, Marlboro, Mass.). Plated cells are incubated for seven days at 37° C. On assay day 7, incubated cells are screened by flow cytometry for the presence of surface marker CD11c. CD11c positive cells are then analyzed for the presence of surface markers HLA-DR and CD86. The experimental peptides are tested in samples drawn from five different human donors.

Leukocyte Reduction Filters are obtained from the Rhode Island Blood Center (Providence, R.I.) to filter white blood cells from whole blood obtained from healthy donors. After the whole blood is run through the filters, the filters are flushed in the opposite direction to push collected white blood cells out of the filter. The white blood cells are isolated using a conventional

Ficoll™ separation gradient (GE Healthcare). The collected white blood cells are thereafter frozen for future use. When needed for use in an assay, the frozen white blood cells are thawed using conventional methods. For the GvHD studies discussed below, PBMCs are obtained (e.g., from HemaCare, Van Nuys, Calif.).

Exposure to the instantly disclosed polypeptides as disclosed herein on the phenotypes of dendritic cells is measured by multiple means. First, for each experimental condition, dot-plots, contrasting surface expression of CD11c and HLA-DR, is produced. Dot-plots of cells exposed to all control and experimental peptides are overlaid onto dot-plots produced from control cells exposed to only the culture media. The overlay provides an effective method to visually observe shifts in HLA-DR distribution between polypeptide stimulated, and unstimulated CD11c-high cells (data not shown). Observed shifts in the distribution of HLA-DR are reported as a qualitative measure. Next, the change in intensity of HLA-DR expression for the CD11c-high segment of each dot-plot is calculated. Percent change in intensity of HLA-DR expression equals Mean Florescence Index (MFI) of HLA-DR expression for peptide exposed cells minus MFI of HLA-DR expression for media exposed cells divided by 1MFI of HLA-DR expression for media exposed cells, times 100 (HLA-DRMFIpeptide−HLA-DRMFImedia/HLA-DRMFImedia*100). Next, the percent change in the percentage of HLA-DR-low cells present among the CD11c high population is calculated for each peptide relative to media control. Percent change in the percentage of HLA-DR-low cells is calculated, and equals the percent of HLA-DR-low for peptide exposed cells minus the percent of HLA-DR-low for media exposed cells divided by percent of HLA-DR-low for media exposed cells times 100 (HLA-DR-low % peptide−HLA-DR-low %media/HLA-DR-low % media*100). In this assay, a negative change in observed HLA-DR MFI and a positive change in percentage of HLA-DR-low cells present in the CD11c-high population indicates reduced expression of HLA and a shift to a regulatory APC phenotype. In this assay, a positive change in observed HLA-DR 1MFI and a negative change in percentage of HLA-DR-low cells present in the CD11c-high population indicates increased expression of HLA and a shift to an effector APC phenotype.

A similar process will be used to assess the impact of the instantly-disclosed polypeptides exposure on surface expression of CD86, which is a costimulatory molecule known to promote T cell activation. First, for each experimental condition, dot plots contrasting surface expression of CD11c and CD86 are produced. Dot plots of cells exposed to all control and experimental

Tregitopes are overlaid onto dots plots produced from control cells exposed to only the culture media. The overlay provides an effective method to visually observe shifts in CD86 distribution between polypeptide stimulated and un-stimulated CD11c-high cells. Observed shifts in the distribution of CD86 are reported as a qualitative measure. Next, the change in intensity of CD86-high expression for the CD11c-high segment of each dot plot is calculated. Percent change in intensity of CD86-high expression equals Mean Florescence Index (MFI) of CD86 expression for peptide exposed cells minus 1MFI of CD86-high expression for media exposed cells divided by MFI of CD86 expression for media exposed cells, times 100 (CD86-highMFIpeptide−CD86-highMFImedia/CD86-highMFImedia*100). Next, the percent change in the percentage of CD86-low cells present among the CD11c high population is calculated. Percent change in the percentage of CD86-high cells equals the percent of CD86-high for peptide exposed cells minus the percent of CD86-high for media exposed cells divided by percent of CD86-high for media exposed cells, times 100 (CD86-low % Ipeptide−CD86-low %media/CD86-low % media*100). In this assay, a negative change in observed CD86 1MFI and a positive change in percentage of CD86-low cells present in the CD11c-high population indicates reduced expression of CD86 and a shift to a regulatory APC phenotype. In this assay, a positive change in observed CD86 MFI and a negative change in percentage of CD86-low cells present in the CD1 is-high population indicates increased expression of CD86 and a shift to an effector APC phenotype.

Characterization of Peptide Exposed APC

Dendritic cell phenotyping assays are performed on the polypeptides of the instant disclosure according to the methods described previously.

Dot plots representing the surface expression of CD11 vs HLA-DR will be analyzed on assay day 7 across the five donors in the presence of various peptide stimulants. In aspects, it is expected that upward movement of the CD11c+/HLA-DR+ population will apparent in the samples treated with the effector polypeptides of the instant disclosure (e.g., select polypeptides of the instant disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675) as compared to media control indicating an acquired effector phenotype. In aspects, it is expected that downward movement of the CD11c+/HLA-DR+ population will apparent in the samples treated with the Tregitopes of the instant disclosure (one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675) as compared to media control indicating an acquired regulatory phenotype.

Dot plots representing the surface expression of CD11c vs CD86 will be analyzed on assay day 7 across the five donors in the presence of various peptide stimulants. It is expected that an increase in CD86-hi cells present in the samples treated with effector polypeptides of the instant disclosure (e.g., a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675)) as compared to media control, which indicates a shift to the acquired effector phenotype.

Example 6: Memory T cell Responses to SARS-CoV-2 in COVID-19 Convalescents Materials and Methods

Peptide synthesis. Synthetic peptides were manufactured using 9-fluoronylmethoxycarbonyl (Fmoc) chemistry by 21st Century Biochemicals (Marlboro, Mass.). Peptide purity was >90% as ascertained by analytical reversed phase HPLC. Peptide mass was confirmed by tandem mass spectrometry.

SARS-CoV-2 convalescent donors. Convalescent patients were recruited by Sanguine Biosciences, a clinical services group that identified, consented and enrolled participants. Inclusion criteria included subjects (i) willing and able to provide written informed consent and photo identification, (ii) aged 18-60, both male or female, (iii) confirmed COVID-19 diagnosis (recovered) with date of diagnosis a minimum of 30 days from blood collection, and (iv) positive COVID-19 PCR based-kit documented by time-stamped medical record and/or diagnostic test report and test kit used identified. Exclusion criteria included subjects who (i) are pregnant or nursing, (ii) have a known history of HIV, hepatitis or other infectious diseases, (iii) have autoimmune diseases, (iv) in vulnerable patient population (prisoners, mentally impaired), (v) have medical conditions impacting their ability to donate blood (i.e. anemia, acute illness) (vi) received immunosuppressive therapy or steroids within the last 6 months , (vii) received an investigational product in the last 30 days, (viii) experienced excess blood loss including blood donation defined as 250 mL in the last month or 500 mL in the last two months, or (ix) had a positive COVID-19 PCR test, but were asymptomatic. Samples were collected in accordance with NIH regulations and with IRB approval.

Healthy unexposed donors. Samples were obtained from leukocyte reduction filters from the Rhode Island Blood Center for unrelated studies prior to the SARS-CoV-2 outbreak in December 2019. Samples were collected in accordance with NIH regulations and with IRB approval.

PBMC culture. Thawed whole PBMCs (normal healthy donors) were rested overnight and expanded by antigen stimulation (including select polypeptides of the instant disclosure (e.g., but not limited to, a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS:

4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)) over nine days at 37° C. under a 5% CO2 atmosphere. In a 48-well plate, 5×10¹\6 cells in 150 _Ill RPMI medium supplemented with human AB serum were stimulated with pools of peptides at 10 μg/ml on Day 1. Three days later, IL-2 was added to 10 ng/ml and the culture volume raised to 300 _Ill. On Day 7, cells were supplemented with 10 ng/ml IL-2 by half media replacement. Two days later, PBMCs were collected and washed in preparation to measure immune recall responses.

FluoroSpot Assay. Interferon-gamma (IFNg) Fluorospot assays were performed ex vivo and following culture using kits purchased from Mabtech and performed according to the manufacturer's specifications. Peptides were added individually at 10 μg/ml and pooled at 10 μg/ml (8 peptides, 1.25 μg/mL) to triplicate wells containing 250,000 PBMCs (ex vivo) or 100,000 PBMCs (cultured) in RPMI medium supplemented with 10% human AB serum. Triplicate wells were plated with ConA (10 μg/ml) as a positive control, and six wells containing no antigen stimulus were used for background determination. Cells were incubated for 40-48 hours at 37° C. under a 5% CO2 atmosphere. Plates were developed according to the manufacturer's directions using FITC-labeled anti-IFN-γ detection antibody.

Raw spot counts were recorded by ZellNet Consulting, Inc. using a FluoroSpot reader system (iSpot Spectrum, AID, Strassberg, Germany) with software version 7.0, build 14790, where fluorescent spots were counted utilizing separate filters for FITC, Cy3, and Cy5. Camera exposure and gain settings were adapted for each filter to obtain high quality spot images preventing over- or underexposure. Fluorophore-specific spot parameters were defined using spot size, spot intensity and spot gradient (fading of staining intensity from center to periphery of spot), and a spot separation algorithm was applied for optimal spot detection.

Results were calculated as the average number of spots in the peptide wells, adjusted to spots per one million cells. Responses meeting the following criteria are positive when the number of spots is (i) at least twice background, (ii) greater than 50 spot forming cells per well above background (1 response per 20,000 PBMCs), and (iii) statistically different (p<0.05) from the media-only control by the Student's t test.

Results:

For FIG. 63A-68, peptide 1 (or rank 1) is SEQ ID NO: 1091 (cluster SEQ ID NO: 1401); peptide 2 (or rank 2) is SEQ ID NO: 1062 (cluster SEQ ID NO: 1373); peptide 3 (or rank 3) is SEQ ID NO: 1085 (cluster SEQ ID NO: 1395); peptide 4 (or rank 4) is SEQ ID NO: 1066 (cluster SEQ ID NO: 1377); peptide 5 (or rank 5) is SEQ ID NO: 1080 (cluster SEQ ID NO: 1391); peptide 6 (or rank 6) is SEQ ID NO: 1081 (cluster SEQ ID NO: 1392); peptide 7 (or rank 7) is SEQ ID NO: 1065 (cluster SEQ ID NO: 1376); peptide 8 (or rank 8) is SEQ ID NO: 1092 (cluster SEQ ID NO: 1403); peptide 9 (or rank 9) is SEQ ID NO: 1104 (cluster SEQ ID NO: 1415); peptide 10 (or rank 10) is SEQ ID NO: 1071 (cluster SEQ ID NO: 1382); peptide 11 (or rank 11) is SEQ ID NO: 1107 (cluster SEQ ID NO: 1418); peptide 12 (or rank 12) is SEQ ID NO: 1072 (cluster SEQ ID NO: 1383); peptide 13 (or rank 13) is SEQ ID NO: 1074 (cluster SEQ ID NO: 1384); peptide 14 (or rank 14) is SEQ ID NO: 1115 (cluster SEQ ID NO: 1426); peptide 15 (or rank 15) is SEQ ID NO: 1096 (cluster SEQ ID NO: 1407); peptide 16 (or rank 16) is SEQ ID NO: 1110 (cluster SEQ ID NO: 1421); peptide 17 (or rank 17) is SEQ ID NO: 1116 (cluster SEQ ID NO: 1427); peptide 18 (or rank 18) is SEQ ID NO: 1105 (cluster SEQ ID NO: 1416); peptide 19 (or rank 19) is SEQ ID NO: 1055 (cluster SEQ ID NO: 1366); peptide 20 (or rank 20) is SEQ ID NO: 1070 (cluster SEQ ID NO: 1381); peptide 21 (or rank 21) is SEQ ID NO: 1086 (cluster SEQ ID NO: 1397); peptide 22 (or rank 22) is SEQ ID NO: 1089 (cluster SEQ ID NO: 1400); peptide 23 (or rank 23) is SEQ ID NO: 1069 (cluster SEQ ID NO: 1380); peptide 24 (or rank 24) is SEQ ID NO: 1120 (cluster SEQ ID NO: 1430); peptide 25 (or rank 25) is SEQ ID NO: 1087 (cluster SEQ ID NO: 1398); peptide 26 (or rank 26) is SEQ ID NO: 1100 (cluster SEQ ID NO: 1411); peptide 27 (or rank 27) is SEQ ID NO: 1093 (cluster SEQ ID NO: 1404); peptide 28 (or rank 28) is SEQ ID NO: 1106 (cluster SEQ ID NO: 1417); peptide 29 (or rank 29) is SEQ ID NO: 1113 (cluster SEQ ID NO: 1424); peptide 30 (or rank 30) is SEQ ID NO: 1060 (cluster SEQ ID NO: 1371); peptide 31 (or rank 31) is SEQ ID NO: 1088 (cluster SEQ ID NO: 1399); and peptide 32 (or rank 32) is SEQ ID NO: 1082 (cluster SEQ ID NO: 1393). Additionally, pool A includes the following: SEQ ID NO: 1091 (cluster SEQ ID NO: 1401); SEQ ID NO: 1062 (cluster SEQ ID NO: 1373); SEQ ID NO: 1085 (cluster SEQ ID NO: 1395); SEQ ID NO: 1066 (cluster SEQ ID NO: 1377); SEQ ID NO: 1080 (cluster SEQ ID NO: 1391); SEQ ID NO: 1081 (cluster SEQ ID NO: 1392); SEQ ID NO: 1065 (cluster SEQ ID NO: 1376); SEQ ID NO: 1092 (cluster SEQ ID NO: 1403). Pool B includes the following: SEQ ID NO: 1104 (cluster SEQ ID NO: 1415); SEQ ID NO: 1071 (cluster SEQ ID NO: 1382); SEQ ID NO: 1107 (cluster SEQ ID NO: 1418); SEQ ID NO: 1072 (cluster SEQ ID NO: 1383); SEQ ID NO: 1074 (cluster SEQ ID NO: 1384); SEQ ID NO: 1115 (cluster SEQ ID NO: 1426); SEQ ID NO: 1096 (cluster SEQ ID NO: 1407); SEQ ID NO: 1110 (cluster SEQ ID NO: 1421). Pool C includes the following: peptide SEQ ID NO: 1116 (cluster SEQ ID NO: 1427); SEQ ID NO: 1105 (cluster SEQ ID NO: 1416); SEQ ID NO: 1055 (cluster SEQ ID NO: 1366); SEQ ID NO: 1070 (cluster SEQ ID NO: 1381); SEQ ID NO: 1086 (cluster SEQ ID NO: 1397); SEQ ID NO: 1089 (cluster SEQ ID NO: 1400); SEQ ID NO: 1069 (cluster SEQ ID NO: 1380); SEQ ID NO: 1120 (cluster SEQ ID NO: 1430) Pool D includes the following: SEQ ID NO: 1087 (cluster SEQ ID NO: 1398); SEQ ID NO: 1100 (cluster SEQ ID NO: 1411); SEQ ID NO: 1093 (cluster SEQ ID NO: 1404); SEQ ID NO: 1106 (cluster SEQ ID NO: 1417); SEQ ID NO: 1113 (cluster SEQ ID NO: 1424); SEQ ID NO: 1060 (cluster SEQ ID NO: 1371); SEQ ID NO: 1088 (cluster SEQ ID NO: 1399); and SEQ ID NO: 1082 (cluster SEQ ID NO: 1393).

As shown in FIG. 63A and FIG. 63B, ex vivo immune recall responses differentiate SARS-CoV-2 naïve and experienced individuals and exhibit different COVID-19 immunotypes. Robust and failed immune responses in convalescent donors may represent different immunotypes characterized in a deep immune profiling study of SARS-CoV-2 experienced humans (Giles et al. Deep immune profiling of COVID-19 patients reveals distinct immunotypes with therapeutic implications. Science. 2020 Jul. 15:eabc8511. doi: 10.1126/science.abc8511. PMID: 32669297, herein incorporated by reference in its entirety). In aspects, a polypeptide of the instant disclosure (e.g., a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675)) are expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ). In aspects, one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675 are not expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ, but rather will produce a regulatory response (e.g., production of regulatory cytokines, e.g., IL-10).

As shown in FIG. 64A and FIG. 64B, strong ex vivo immune recall responses are found found in SARS-CoV-2 experienced individuals using polypeptides of the instant disclosure. In aspects, a polypeptide of the instant disclosure (e.g., a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675)) are expected to elicit a strong ex vivo immune recall responses. 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86924-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86921677-1692, 2593-2604, 2639-2646, and 2719-2734

As shown in FIG. 65, polypeptides of the instant disclosure 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86924-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86921677-1692, 2593-2604, 2639-2646, and 2719-2734 stimulate ex vivo immune recall response in natural SARS-CoV-2 infection. In this exemplary study, of 32 tested peptides, 15 testes peptides demonstrated positive responses, as shown in green, in at least one donor. External data is shown from the preprint of Peng et al. bioRxiv [Preprint]. 2020 Jun. 8 PMID: 32577665; PMCID: PMC7302222. In aspects, a polypeptide of the instant disclosure (e.g., a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675)) are expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ). In aspects, one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675 are not expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ, but rather will produce a regulatory response (e.g., production of regulatory cytokines, e.g., IL-10).

As shown in FIG. 66A and FIG. 66B, polypeptides of the instant disclosure 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86924-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86921677-1692, 2593-2604, 2639-2646, and 2719-2734 stimulate higher IFN-γresponses in naïve and COVID-19 convalescent donors following expansion in culture. Response in naïve donors suggest such polypeptides of the instant disclosure expand low frequency cold coronavirus cross-reactive T cells. Further, differences between responses by pool in ex vivo and cultured assay may reflect variable phenotypes and/or proliferative capacities of epitope-specific T cells when they are put into culture. In aspects, a polypeptide of the instant disclosure (e.g., a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675)) are expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ). In aspects, one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675 are not expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ, but rather will produce a regulatory response (e.g., production of regulatory cytokines, e.g., IL-10).

As shown in FIG. 67A and FIG. 67B, polypeptides of the instant disclosure 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86924-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86921677-1692, 2593-2604, 2639-2646, and 2719-2734 stimulate low frequency epitope-specific T cells following expansion in culture in naive and COVID-19 convalescent donors. Differences between responses to spike and membrane peptides in ex vivo and cultured assay may reflect variable phenotypes and/or proliferative capacities of epitope-specific T cells when they are put into culture. In aspects, a polypeptide of the instant disclosure (e.g., a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675)) are expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ). Again, in aspects, one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675 are not expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ, but rather will produce a regulatory response (e.g., production of regulatory cytokines, e.g., IL-10).

As shown in FIG. 68, polypeptides of the instant disclosure 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86924-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86921677-1692, 2593-2604, 2639-2646, and 2719-2734stimulate low frequency epitope-specific T cells following expansion in culture in naïve and COVID-19 convalescent donors. Of 32 tested peptides, 27 testes peptides demonstrated positive responses, as shown in green, in at least one donor. Further, predicted spike epitope cross-conservation with common cold coronaviruses were confirmed in naïve donors. External data is shown from the preprint of Nelde et al. Research Square [preprint]: 2020 Jun. 17 doi: 10.21203/rs.3.rs-35331/v1.

As such, the data in FIG. 63A-FIG. 68 demonstrates that polypeptides of the instant disclosure 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86924-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-86921677-1692, 2593-2604, 2639-2646, and 2719-2734are recognized by T cells raised in natural infection, stimulate Thl cytokine production, may stimulate pre-existing immunity to common cold coronaviruses, and memory may boost immunity in clinical trials. Again, in aspects, a polypeptide of the instant disclosure (e.g., a peptide or polypeptide comprising, consisting, or consisting essentially of an amino acid sequence of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692 (and/or fragments or variants thereof), and optionally 1 to 12 additional amino acids distributed in any ratio on the N terminus and/or C-terminus of the polypeptide of SEQ ID NOS: 4-1676, 1713-2595, 2605-2638, 2647-2718, and 2735-8692), including the concatemeric peptides of SEQ ID NOS: 1677-1692, 2593-2604, 2639-2646, and 2719-2734)), excluding one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675)) are expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ). In aspects, one or more of a peptide or polypeptide having an amino acid sequence comprising, consisting of, or consisting essentially of one or more of SEQ ID NOS: 1056-1057, 1059, 1367-1368, 1370, 1063, 1068, 101, 1374, 1379, 1382, 1073, 1075, 1083-1084, 1094, 1101, 1103, 1108, 1111, 1112, 1117, 1384, 1386, 1394-1395, 1405, 1412, 1414, 1419, 1422, 1423, 1428, 1125, 1436, 1131, 1133, 1134, 1442, 1444, 1445, 1141, 1452, 1142, 1145, 1453, 1456, 1152, 1157, 1159, 1164, 1463, 1468, 1470, 1475,1176, 1178, 1179, 1182, 1204, 1205, 1209, 1214, 1230, 1487, 1489 1490, 1493, 1499, 1515, 1516, 1520, 1525, 1541, 1237, 1245, 1251, 1253, 1548, 1556, 1562, 1564, 1260, 1264, 1266, 1274, 1571, 1575, 1577, 1585, 1275, 1276, 1586, 1587, 1278, 1589, 1284, 1286, 1595, 1597, 1292, 1301, 1303, 1603, 1612, 1614, 1316, 1321, 1329, 1330, 1627, 1632, 1640, 1641, 1340, 1651, 1350, 1352, 1661, 1663, 1358, 1361, 1364, 1669, 1672, and/or 1675 are not expected to elicit a strong effector immune recall response (such as production of effector cytokines, e.g., IFN-γ, but rather will produce a regulatory response (e.g., production of regulatory cytokines, e.g., IL-10).

Example 7—T cell epitope mapping in humans with and without SARS-CoV-2 experience: Identification of Candidate Epitopes for Rational COVID-19 vaccine design
Materials and Methods

SARS-CoV-2 sequences. SARS-CoV-2 (taxid: 2697049), SARS-CoV-1 (taxid: 694009), MERS-CoV (taxid: 1335626), and human CoV (taxids: 11137, 443239, 277944 and 31631) antigen sequences isolated from human hosts were obtained from GenBank at the National Center for Biotechnology Information. SARS-CoV-2 epitopes were compared across sequences obtained from isolates with fully sequenced genomes isolated from December 2019 to December 2020. SARS-CoV-2 Wuhan-Hu-1 (GenBank id: MN908947) was selected as the reference strain.

T cell epitope mapping. Each antigen sequence was parsed into all possible linear 9-mer sequences. Each 9-mer was scored for likelihood of binding to panels of class I and class II HLA alleles using EpiMatrix version 1.3, a matrix-based algorithm for mapping T cell epitopes. Class II epitopes were identified for nine supertype alleles: DRB1*0101, DRB1*0301, DRB1*0401, DRB1*0701, DRB1*0801, DRB1*0901, DRB1*1101, DRB1*1301 and DRB1*1501. Class I epitope 9-mers (and 10-mers) were identified for potential binding to six supertype alleles: A*0101, A*0201, A*0301, A*2402, B*0702, B*4401. Each allele set covers >95% of the human population. (Southwood S, Sidney J, Kondo A, et al. Several common HLA-DR types share largely overlapping peptide binding repertoires. J Immunol. 1998;160(7):3363-337; and Sette A, Sidney J. Nine major HLA class I supertypes account for the vast preponderance of HLA-A and -B polymorphism. Immunogenetics. 1999;50(3-4):201-212, both incorporated by reference in their entireties). Promiscuous class II HLA epitopes representing regions of a protein with a high density of T cell immunogenicity potential were identified using the ClustiMer algorithm. Clusters scoring an overall EpiMatrix score >10 after subtracting the average expected sum of scores for a random sequence of equal length have significant immunogenicity potential (e.g., Moise et al. iVAX: An integrated toolkit for the selection and optimization of antigens and the design of epitope-driven vaccines. Human Vaccines & Immunotherapeutics 11:9, 2312-2321 (2015), herein incorporated by reference in its entirety.

T cell epitope homology analysis. The JanusMatrix algorithm (Moise L, Gutierrez A H, Bailey-Kellogg C, et al. The two-faced T cell epitope: Examining the host-microbe interface with JanusMatrix. Hum Vaccines Immunother. 2013 ; 9(7): 1577-1586. doi:10.4161/hv.24615, herein incorporated by reference in its entirety) was used to identify SARS-CoV-2 epitopes that share T cell receptor (TCR)-face conservation (positions 2, 3, 5, 7, and 8) with epitopes restricted by the same alleles, but found in the human proteome, and other (alpha- and beta-) coronavirus epitopes. Epitopes with identical TCR-facing residues, which are also predicted to bind to the same MHC allele, are more likely to induce cross-reactive T cells. A JanusMatrix score threshold of two (cross-conserved HLA-allele-specific epitopes averaged over the length of the sequence) for cross-conservation with human (self) proteins was applied to identify epitopes with elevated potential to be tolerated or actively regulatory. To investigate the conservation of SARS-CoV-2 epitopes with related highly pathogenic coronaviruses (SARS-CoV and MERS-CoV) and low pathogenicity common cold coronaviruses (CCCs; OC43, HKU1, NL62, and 229E), JanusMatrix was applied requiring identical TCR-facing residues at five positions and at positions 2 and 8 only. A separate homology analysis of iVAX-predicted SARS-CoV-2 T cell epitopes with published coronavirus epitopes deposited in the Immune Epitope Database was performed using BLAST and a cutoff of 80% similarity.

Peptide synthesis. Synthetic peptides were manufactured using 9-fluoronylmethoxycarbonyl (Fmoc) chemistry by 21^stCentury Biochemicals (Marlboro, Mass.). Peptide purity was >90% as ascertained by analytical reversed phase HPLC. Peptide mass was confirmed by tandem mass spectrometry.

SARS-CoV-2 convalescent donors. Convalescent patients were recruited by Sanguine Biosciences, a clinical services group that identified, consented and enrolled participants. Inclusion criteria included subjects (i) willing and able to provide written informed consent, (ii) aged 18-80 years of age, both male or female, and (iii) PCR-confirmed COVID-19 diagnosis (recovered) with date of diagnosis a minimum of 30 days from blood collection. Exclusion criteria included subjects who (i) were pregnant or nursing, (ii) had a known history of HIV, hepatitis or other infectious diseases, (iii) had autoimmune diseases, (iv) were members of vulnerable patient population (prisoners, mentally impaired), (v) had medical conditions impacting their ability to donate blood (i.e. anemia, acute illness) (vi) had received immunosuppressive therapy or steroids within the last 6 months, (vii) had received an investigational product in the last 30 days, (viii) had experienced excess blood loss including blood donation defined as 250 mL in the last month or 500 mL in the last two months, or (ix) had a positive COVID-19 PCR test, but were asymptomatic. Samples were collected in accordance with NIH regulations and with the approval of an independent external institutional review board.

Healthy unexposed donors. Deidentified samples were obtained from leukocyte reduction filters from the Rhode Island Blood Center for unrelated studies prior to the SARS-CoV-2 outbreak in December 2019 (Date of samples: February 2016-November 2019). Samples were obtained in accordance with NIH regulations and with the approval of an independent external institutional review board (Ethical & Independent Review Services, Independence, Mo.).

PBMC culture. PBMCs from COVID-19 convalescents were put into culture directly following Ficoll separation. Cells from normal healthy donors were thawed and rested overnight before placing into culture. Samples were allocated for ex vivo assay and antigen-specific cell expansion followed by cultured assay. Antigen stimulated cells were cultured over eight days at 37° C. under a 5% CO₂atmosphere. In a 48-well plate, 5×10⁶cells in 150 μl RPMI medium supplemented with human AB serum were stimulated with pools of peptides at 10 μg/ml on Day 1. Three days later, IL-2 was added to 10 ng/ml and the culture volume raised to 300 μl. On Day 7, cells were supplemented with 10 ng/ml IL-2 by half media replacement. Two days later, PBMCs were collected and washed in preparation for measurement of immune recall responses.

Ex vivo and cultured human Fluorospot assay. Interferon-gamma (IFNγ) Fluorospot assays were performed using ex vivo and cultured PBMC using assay kits purchased from Mabtech and executed according to the manufacturer's specifications. For ex vivo assays, unless otherwise noted, peptides were added individually at 20 μg/ml or pooled at 10 μg/mL per peptide in triplicate wells containing 250,000 PBMCs in RPMI medium supplemented with 10% human AB serum. For cultured assays, peptides were added individually at 10 μg/ml or pooled at a total peptide concentration of 1 Oug/mL (32 peptides, 0.313 μg/mL) and added to triplicate wells containing 100,000 PBMCs. Triplicate wells were plated with ConA (5 μg/ml) as a positive control, and six wells containing no antigen stimulus (0.2-0.4% DMSO) were used for background determination. Cells were incubated for 40-48 hours at 37° C. under a 5% CO2 atmosphere. Plates were developed according to the manufacturer's directions using FITC-labeled anti-IFN-γ detection antibody. Raw spot counts were recorded by ZellNet Consulting, Inc. using a FluoroSpot reader system (iSpot Spectrum, AID, Strassberg, Germany) with software version 7.0, build 14790, where fluorescent spots were counted utilizing FITC and Cy3 filters. Camera exposure and gain settings were adapted for each filter to obtain high quality spot images preventing over- or underexposure. Fluorophore-specific spot parameters were defined using spot size, spot intensity and spot gradient (fading of staining intensity from center to periphery of spot), and a spot separation algorithm was applied for optimal spot detection.

Results were calculated as the average number of spots in the peptide wells, adjusted to spots per one million cells. Responses meeting the following criteria were considered to be positive when the number of spots was (i) at least five times background, (ii) greater than 25 spot forming cells per well above background (1 response per 40,000 PBMCs), and (iii) statistically different (p<0.05) from the media-only control by the Student's t test.

HLA typing. Donor HLA Class II types were determined using the One Lambda Micro SSPTM High Resolution HLA Class II kit at the Hartford Hospital Transplant Immunology Laboratory.

Mice. HLA-DR3 transgenic mice were obtained from Dr. Chella David (Mayo Clinic) under commercial license. The mice express the HLA-DRA and DRB1*0301 genes on a B.10-Ab⁰mouse class II-negative background. Animal research protocols for mouse studies were reviewed and approved by the Absorption Systems Inc. Institutional Animal Care and Use Committee.

Peptide vaccine preparation. Per dose, a pool of 20 peptides at 1.25 or 5.0 μg/peptide was admixed with 50 μg poly-ICLC (Hiltonol™; Oncovir) in 50 μL.

Vaccinations. Vaccine- and sham-treated HLA-DR3 mice (N=5/group) were female and 6-8 weeks old at the start of immunizations. Mice were primed and boosted two weeks later by intradermal immunization with peptide/poly-ICLC vaccine. Control groups received sterile water (N=3) or poly-ICLC alone (N=5). Mice were sacrificed nine days after the boost immunization. Blood at baseline and termination and spleens were harvested for immune monitoring. One mouse in the group that received 5μg/peptide was excluded following splenocyte isolation due to insufficient recovery and poor viability for reasons thought to be unrelated to vaccination.

Ex vivo FluoroSpot assay in mouse splenocytes. The frequency of vaccine-specific splenocytes was determined by dual cytokine IFNγ/IL-4 FluoroSpot assay using the Mabtech mouse IFNγ/IL-4 FluoroSpot Kit with pre-coated plates according to the manufacturer's protocol. Washed splenocytes in RPMI 1640 (Gibco) supplemented with 10% fetal calf serum (FCS, Atlanta Biologicals) were added at 250,000 cells per well. Antigen stimulations included pools of all 20 vaccine peptides, as well as spike-derived vaccine peptides, and membrane-derived vaccine peptides. Peptide pools were added at 0.5 μg/ml per peptide. Triplicate wells were stimulated with 2 μg/ml Concanavalin A (ConA; Sigma Aldrich) as a positive control, and six replicate wells with medium containing 0.2% DMSO were used for background determination. Raw spot counts were recorded by ZellNet Consulting, Inc. and results were calculated as described above for human FluoroSpot assays. Flow cytometry in mouse splenocytes. Splenocytes were plated at 300,000 cells per well and stimulated in triplicate over six hours with a pool of all 20 vaccine peptides at 0.5 μg/mL per peptide and 4μg/mL co-stimulatory anti-CD28 antibody. Triplicate wells were stimulated with PMA (50 ng/mL)+ionomycin (1 μg/mL) as a positive control. For background determinations, triplicate wells were treated with medium containing 0.2% DMSO only and medium containing 0.2% DMSO and 4μg/mL co-stimulatory anti-CD28 antibody. Brefeldin A (5 ng/μL) and 2μM monensin were added with stimulations to enable detection of intracellular cytokines. Following stimulation, cells were incubated with fixable viability stain 450 to discriminate dead from live cells, and then stained with the following surface marker antibody panel: CD3e-AF700 (clone 500A2), CD4-APC/Fire750 (clone GK1.5), CD8a-FITC (clone 53-6.7), CD62L-APC (clone MEL-14) (BioLegend), CD44-eFLuor506 (clone IM7) (Thermo). To detect intracellular cytokine expression, cells were fixed and permeabilized and immunostained using IFNO -BV605 (clone XMG1.2), IL-4-PerCP/Cy5.5 (clone 11B11) and IL-5-PE (clone TRFKS) (BioLegend) antibodies. Flow cytometry measurements were made on an Invitrogen Attune cytometer and collected data analyzed using FlowJo software (Version 10.6.2). Cells were gated on lymphocyte/singlet/live events. Recalled Thl and Th2 cells were defined, respectively, as IFNγ-producing CD3⁺CD4⁺ CD44⁺ T cells and IL-4- and/or IL-5-producing CD3⁺CD4⁺ CD44⁺ T cells. Tc1 and Tc2 cells were defined, respectively, as IFNγ-producing CD3⁺CD8⁺ CD44⁺ T cells and IL-4- and/or IL-5-producing CD3⁺CD8⁺ CD44⁺ T cells.

Results
In Silico Prediction of SARS-CoV-2 T Cell Targets

To identify SARS-CoV-2 sequences recognized by T cells capable of inducing protective responses in natural infection, we analyzed the T cell immunogenicity potential of the SARS-CoV-2 surface antigens, spike, membrane and envelope, using immunoinformatic tools. As a group, these antigens are structural proteins, potential antibody targets, and estimated to be produced at higher abundance than other antigens in infected cells. Initially, we predicted CD4+ T cell immunogenicity potential using the EpiMatrix T cell epitope mapping algorithm and the Wuhan-Hu-1 strain as a reference sequence. Predictions were made using nine HLA class II and six HLA class I supertype alleles representing >95% of the human population. For each antigen, we identified regions of high class II HLA epitope density, called clusters, across multiple supertype alleles. A total of 52 epitope clusters containing between 6 and 60 binding motifs each were identified (Table 57). These epitope clusters contain more than 100 total individual binding motifs for each of the nine supertype alleles, ranging from 109 for DRB1*0301 to 180 for DRB1*0901.

The CD8⁺ T cell immunogenicity potential of these clusters was evaluated and showed that multiple putative class I HLA epitopes overlap in regions of high class II HLA epitope density (Table 57). These epitope clusters are therefore expected to recall both CD4⁺ and CD8⁺ T cell responses in individuals with SARS-CoV-2 history and may stimulate both CD4⁺ and CD8⁺ T cell immunity in a T cell-directed vaccine. We also investigated the conservation of these clusters in other SARS-CoV-2 isolates and determined that the clusters are identical to clusters found in >98.38% of strains isolated between January and September 2020 (Table 57). Highly conserved peptides such as these are useful for vaccines and as reagents for assays that interrogate T cell responses using samples from natural infection and immunization.

HLA class II ligands may stimulate effector or regulatory CD4⁺ T cells leading to divergent immunological outcomes. HLA binding predictions do not distinguish between these possibilities. They account for peptide interactions with the HLA binding groove and overlook potential interactions with the T cell receptor (TCR).

As the T cell repertoire is shaped by training on human T cell epitopes, we routinely assess potential for regulatory T cell induction by screening the TCR-face of epitopes for homology with self antigens using the JanusMatrix algorithm. For each cluster, the average depth of coverage in the human proteome was calculated holding all TCR-facing positions fixed and allowing HLA-facing positions to vary while requiring human sequences to bind to the same HLA alleles as the SARS-CoV-2 sequences. JanusMatrix analysis revealed that each SARS-COV-2 protein contains clusters with significant human homology scores (>2). This was also true for some of the 52 selected clusters. 17 (32.7%) were found to have elevated regulatory T cell induction potential based on high JanusMatrix homology with the human proteome, and in contrast, 35 (67.3%) were considered more likely to induce effector T cell responses. The results suggest that different CD4+ T cell subsets may be activated by these epitopes in the course of SARS-CoV-2 infection and measurement of recall responses in vitro.

We designed 32 peptides for synthesis by manually editing clusters to center effector epitopes and remove epitopes with significant human homology. We used these 32 peptides to probe T cell recognition of the putative effector T cell epitopes. The peptides included one envelope sequence, eight sourced from membrane, and 23 from spike (Table 58A and Table 58B). Spike peptides comprise 14 in the 51 domain, including five RBD sequences, and nine in the S2 domain.

TABLE 57

Immunogenicity potential and human and coronavirus cross-conservation

features of SARS-CoV-2 surface antigen epitope clusters

Janus

Antigen/
SEQ

EpiMatrix
EpiMatrix
EpiMatrix
Human
IEDB
SARS-

Cluster
ID
Cluster
Hits
Hits
Cluster
Homology
Corona
CoV-2

Address
NO:
Sequence
Class I^a
Class II^b
Score^c
Score^d
virus^e
Conserv.^f

ENVELOPE/
4
MYSFVSEETGTLI
25
23
36.95
3.03
1
98.97%

1-25

VNSVLLFLAFVV

ENVELOPE/
22
VNSVLLFLAFVVFL
44
60
110.24
3.78
0
99.81%

14-44

LVTLAILTALRLCA

YCC

ENVELOPE/
46
IVNVSLVKPSFYVY
20
37
63.92
1.15
0
99.40%

46-75

SRVKNLNSSRVPD

LLV

MEMBRANE/
69
LKKLLEQWNLVIGF
22
8
13.01
9.00
1
99.54%

13-31

LFLTW

MEMBRANE/
81
GFLFLTWICLLQFA
28
18
28.42
16.00
0
99.82%

25-48

YANRNRFLYI

MEMBRANE/
98
RNRFLYIIKLIFLWL
41
24
39.94
22.00
3
99.84%

42-66

LWPVTLACFV

MEMBRANE/
116
LWPVTLACFVLAA
13
12
20.32
9.00
1
99.65%

57-76

VYRINWI

MEMBRANE/
129
LVGLMWLSYFIASF
37
34
58.30
16.00
0
99.76%

87-14

RLFARTRSMWSF

NP

MEMBRANE/
150
MWSFNPETNILLN
10
15
26.00
11.00
0
99.79%

109-129

VPLHGTIL

MEMBRANE/
164
IGAVILRGHLRIAG
4
12
18.30
21.00
0
99.87%

140-160

HHLGRCD

MEMBRANE/
178
PKEITVATSRTLSY
13
7
11.79
1.00
2
99.79%

165-181

YKL

MEMBRANE/
188
TLSYYKLGASQRV
4
11
19.56
8.00
2
99.78%

175-192

AGDSG

MEMBRANE/
199
YSRYRIGNYKLNT
13
11
15.31
10.0
0
99.69%

196-217

DHSSSSDNI

SPIKE/
210
MFVFLVLLPLVSS
13
41
71.61
3.19
0
98.38%

1-26

QCVNLTTRTQLPP

SPIKE/
229
PPAYTNSFTRGVY
10
6
12.80
0.17
0
99.64%

25-39

YP

SPIKE/
237
PDKVFRSSVLHST
14
15
23.82
1.94
0
98.77%

39-59

QDLFLPFF

SPIKE/
251
QDLFLPFFSNVTW
18
9
16.27
0.22
1
98.91%

52-69

FHAIH

SPIKE/
262
NVTWFHAIHVSGT
7
12
21.41
0.54
0
99.43%

61-79

NGTKRF

SPIKE/
274
NDGVYFASTEKS
6
7
10.83
0.29
0
99.64%

87-103

NIIRG

SPIKE/
284
KTQSLLIVNNATN
5
23
40.90
0.52
0
99.85%

113-131

VVIKVC

SPIKE/
296
NDPFLGVYYHKN
8
14
23.78
0.00
0
98.88%

137-156

NKSWMESE

SPIKE/
309
ESEFRVYSSANN
17
11
18.25
0.09
0
99.78%

154-171

CTFEYV

SPIKE/
320
IDGYFKIYSKHTP
10
12
22.25
0.21
0
99.50%

197-215

INLVRD

SPIKE/
332
HTPINLVRDLPQ
3
8
14.08
4.25
0
99.26%

207-223

GFSAL

SPIKE/
342
GINITRFQTLLAL
16
30
55.01
1.03
0
99.31%

232-255

HRSYLTPGDSS

SPIKE/
359
YVGYLQPRTFLL
12
16
23.08
0.84
0
98.50%

266-288

KYNENGTITDA

SPIKE/
375
EKGIYQTSNFRV
7
10
13.16
0.00
1
98.78%

309-329

QPTESIVRF

SPIKE/
389
LCPFGEVFNATR
7
8
10.23
0.38
0
99.76%

335-353

FASVYAW

SPIKE/
401
FASVYAWNRKR
5
7
11.13
0.63
0
99.88%

347-364

ISNCVAD

SPIKE/
412
VADYSVLYNSAS
19
20
36.57
0.95
0
99.83%

362-380

FSTFKCY

SPIKE/
424
DSKVGGNYNYLY
20
33
58.42
1.03
0
99.13%

442-468

RLFRKSNLKPFER

DI

SPIKE
444
VEGFNCYFPLQS
10
11
17.96
0.00
0
99.34%

483-500

YGFQPT

SPIKE/
455
LQSYGFQPTNGV
3
7
10.08
0.14
0
99.33%

492-508

GYQPY

SPIKE/
465
YRVVVLSFELLHA
12
15
24.27
3.35
3
98.85%

508-529

PATVCGPKK

SPIKE/
480
NKCVNFNFNGLT
2
12
18.82
0.00
2
99.79%

536-554

GTGVLTE

SPIKE/
492
PTNFTISVTTEILPV
8
6
12.63
0.00
1
99.91%

715-729

SPIKE/
500
CSNLLLQYGSFCT
13
21
30.52
0.64
1
99.76%

749-774

QLNRALTGIAVEQ

SPIKE/
519
FGGFNFSQILPDP
4
8
10.25
0.13
1
99.68%

797-816

SKPSKRS

SPIKE
532
IEDLLFNKVTLAD
7
12
18.64
1.69
2
99.81%

818-837

AGFIKQY

SPIKE/
545
AQKFNGLTVLPP
4
10
15.82
1.64
3
99.70%

852-869

LLTDEM

SPIKE/
556
TDEMIAQYTSALL
11
13
21.49
1.00
0
99.86%

866-884

AGTITS

SPIKE/
568
QIPFAMQMAYRF
15
7
10.20
0.00
3
99.88%

895-911

NGIGV

SPIKE/
578
NQKLIANQFNSAI
12
18
24.88
1.35
0
99.36%

919-945

GKIQDSLSSTASAL

SPIKE/
598
IQDSLSSTASALG
6
7
11.21
3.11
1
99.56%

934-949

KLQ

SPIKE/
607
AQALNTLVKQLSS
14
30
52.49
1.50
2
99.87%

956-981

NFGAISSVLNDIL

SPIKE/
626
EVQIDRLITGRLQ
8
12
16.19
0.29
2
99.79%

990-1012

SLQTYVTQQL

SPIKE/
642
QQLIRAAEIRASA
7
14
25.75
2.73
2
99.95%

1010-1029

NLAATKM

SPIKE/
655
GKGYHLMSFPQS
9
8
11.27
0.50
3
99.91%

1044-1063

APHGWFL

SPIKE/
668
CDVVIGIVNNTVY
5
9
16.52
0.44
3
99.45%

1126-1141

DPL

SPIKE/
677
LDKYFKNHTSPD
3
7
12.63
0.00
2
99.82%

1152-1167

VDLG

SPIKE/
686
LGDISGINASVVN
5
9
15.12
1.00
2
99.87%

1166-1183

IQKEI

SPIKE/
697
WYIWLGFIAGLIA
13
11
18.34
2.42
2
99.67%

1214-1231

IVMVT

^aNumber of Class I epitope 9-mers and 10-mers ic entified for potential binding to six supertype alleles.

^bNumber of Class I epitope 9-mers identified for potential binding to nine supertype alleles.

^cT cell immunogenicity potential considering deviation in predicted T cell epitope content from random expectation. T cell epitope clusters scoring above 10 are considered potentiallyimmunogenic.

^dAverage depth of coverage within the human proteome for the HLA binding peptides contained within the T cell epitope cluster.

^eNumber of coronavirus T cell epitopes in IEDB with more than 80% similarity to T cell epitope clusters.

^fPercentage of strains isolated between January and December 2020 (n = 16450) that contain identical epitope clusters

TABLE 58A

Coronavirus cross-conservation features of effector T cell

epitope peptides screened in immunoassays

Scaled Class II

SEQ

Janus CoV Homology Score^a

Input
ID
Cluster
SARS-
SARS-
MERS-
hCoV-
hCoV-
hCoV-
hCoV-

Sequence
NO:
Sequence
CoV-2
CoV-1
CoV
HKU1
OC43
NL63
229E

ENVELOPE
1055
MYSFVSEETGTLIVN
1.00
1.00
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1060
LKKLLEQWNLVIGFL
1.00
1.09
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1118
ICLLQFAYANRNRFLYI
1.00
0.89
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1065
LSYFIASFRLFARTR
1.00
0.22
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1066
IASFRLFARTRSMWSFN
1.00
1.02
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1119
RGHLRIAGHHLGRCD
1.00
0.09
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1070
PKEITVATSRTLSYY
1.00
0.73
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1071
TLSYYKLGASQRVAGD
1.00
0.92
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1072
IGNYKLNTDHSSSSDNI
1.00
0.20
0.00
0.00
0.00
0.00
0.00

SPIKE
1074
PPAYTNSFTRGVYY
0.97
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1120
NDGVYFASTEKSNIIR
1.00
0.26
0.00
0.00
0.00
0.00
0.00

SPIKE
1080
FLGVYYHKNNKSWMESE
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1081
ESEFRVYSSANNCTFEYV
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1082
DGYFKIYSKHTPINL
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1085
QTLLALHRSYLTPGDSS
0.98
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1086
PRTFLLKYNENGTITDA
1.00
0.14
0.00
0.00
0.00
0.00
0.00

SPIKE
1087
TSNFRVQPTESIVRF
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1088
FGEVFNATRFASVYA
1.00
0.11
0.00
0.00
0.00
0.00
0.00

SPIKE
1089
FASVYAWNRKRISNCVAD
1.00
0.11
0.00
0.00
0.00
0.00
0.00

SPIKE
1091
GGNYNYLYRLFRKSNLKP
1.00
0.00
0.00
0.00
0.00
0.00
0.00

FERDI

SPIKE
1092
VEGFNCYFPLQSYGFQPT
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1093
LQSYGFQPTNGVGYQPY
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1122
NKCVNFNFNGLTGT
1.00
0.92
0.00
0.00
0.00
0.00
0.00

SPIKE
1100
FGGFNFSQILPDPSKPSKRS
1.00
0.46
0.00
0.00
0.00
0.00
0.00

SPIKE
1104
TDEMIAQYTSALLA
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1123
QIPFAMQMAYRFNGIGV
1.00
0.92
0.00
0.00
0.00
0.00
0.00

SPIKE
1106
QKLIANQFNSAIGKI
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1107
ANQFNSAIGKIQDSL
1.00
0.00
0.00
0.00
0.00
0.00
0.00

SPIKE
1108
TGRLQSLQTYVTQQL
1.00
0.92
0.00
0.00
0.00
0.00
0.00

SPIKE
1113
GKGYHLMSFPQSAPH
1.00
0.46
0.00
0.00
0.00
0.00
0.00

SPIKE
1115
LDKYFKNHTSPDVDL
1.00
0.92
0.00
0.00
0.00
0.00
0.00

SPIKE
1116
ISGINASVVNIQKEI
1.00
0.92
0.00
0.00
0.00
0.00
0.00

TABLE 58B

Coronavirus cross-conservation features of effector T cell

epitope peptides screened in immunoassays

Scaled Class II

SEQ

Janus CoV TCRf 5,8 Homology Score^b

Input
ID
Cluster
SARS-
SARS-
MERS-
hCoV-
hCoV-
hCoV-
hCoV-

Sequence
NO:
Sequence
CoV-2
CoV-1
CoV
HKU1
OC43
NL63
229E

ENVELOPE
1055
MYSFVSEETGTLIVN
1.00
1.00
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1060
LKKLLEQWNLVIGFL
1.00
1.09
0.00
0.00
0.00
0.20
0.80

MEMBRANE
1118
ICLLQFAYANRNRFLYI
1.00
1.09
0.20
0.47
0.34
0.13
0.27

MEMBRANE
1065
LSYFIASFRLFARTR
1.19
0.97
0.25
0.87
0.26
0.18
0.44

MEMBRANE
1066
IASFRLFARTRSMWSFN
2.00
1.99
0.00
0.00
0.00
0.00
0.27

MEMBRANE
1119
RGHLRIAGHHLGRCD
1.00
0.95
0.00
0.29
0.00
0.00
0.14

MEMBRANE
1070
PKEITVATSRTLSYY
0.43
0.36
0.00
0.43
0.00
0.29
0.29

MEMBRANE
1071
TLSYYKLGASQRVAGD
2.00
1.92
0.00
0.00
0.00
0.00
0.00

MEMBRANE
1072
IGNYKLNTDHSSSSDNI
1.00
0.20
0.00
0.11
0.03
0.00
0.00

SPIKE
1074
PPAYTNSFTRGVYY
0.97
0.00
0.17
0.83
0.67
0.00
0.00

SPIKE
1120
NDGVYFASTEKSNIIR
1.14
0.39
0.00
1.12
0.71
1.14
0.57

SPIKE
1080
FLGVYYHKNNKSWMESE
1.00
0.13
0.36
0.65
0.57
0.58
0.57

SPIKE
1081
ESEFRVYSSANNCTFEYV
1.18
0.33
0.18
0.53
0.38
0.70
0.36

SPIKE
1082
DGYFKIYSKHTPINL
1.08
0.08
0.25
0.02
0.00
0.08
0.08

SPIKE
1085
QTLLALHRSYLTPGDSS
1.39
0.43
0.13
0.28
0.13
0.21
0.20

SPIKE
1086
PRTFLLKYNENGTITDA
1.31
0.49
0.16
0.53
0.75
0.61
0.38

SPIKE
1087
TSNFRVQPTESIVRF
1.50
0.15
1.16
0.19
0.00
0.00
0.07

SPIKE
1088
FGEVFNATRFASVYA
1.00
0.46
0.62
0.81
0.49
0.25
0.13

SPIKE
1089
FASVYAWNRKRISNCVAD
1.25
0.23
0.25
0.27
0.13
0.39
0.13

SPIKE
1091
GGNYNYLYRLFRKSNLKPF
1.19
0.21
0.26
0.38
0.19
0.30
0.53

ERDI

SPIKE
1092
VEGFNCYFPLQSYGFQPT
1.00
0.08
0.36
0.10
0.18
0.26
0.00

SPIKE
1093
LQSYGFQPTNGVGYQPY
1.14
0.13
0.44
0.12
0.13
0.43
0.29

SPIKE
1122
NKCVNFNFNGLTGT
1.57
1.57
0.15
0.06
0.01
0.57
0.57

SPIKE
1100
FGGFNFSQILPDPSKPSKRS
1.00
0.46
0.88
0.52
0.13
1.47
0.38

SPIKE
1104
TDEMIAQYTSALLA
1.40
0.92
0.21
0.29
0.23
1.00
0.89

SPIKE
1123
QIPFAMQMAYRFNGIGV
1.25
0.92
0.00
0.14
0.24
0.13
0.00

SPIKE
1106
QKLIANQFNSAIGKI
1.00
0.79
0.14
0.58
0.57
0.29
0.71

SPIKE
1107
ANQFNSAIGKIQDSL
1.00
0.92
0.14
0.26
0.00
1.30
0.57

SPIKE
1108
TGRLQSLQTYVTQQL
1.14
0.92
1.28
0.14
1.29
0.86
0.71

SPIKE
1113
GKGYHLMSFPQSAPH
1.00
0.76
0.00
0.30
0.67
0.01
0.70

SPIKE
1115
LDKYFKNHTSPDVDL
1.33
1.02
0.00
0.33
0.12
0.45
0.00

SPIKE
1116
ISGINASWNIQKEI
1.11
1.02
0.22
0.43
0.22
0.53
0.67

SARS-CoV-2 Epitope Conservation with Related Coronaviruses at the TCR-Face

We also investigated the conservation of the selected peptides with related coronaviruses that infect humans including highly pathogenic SARS-CoV and MERS-CoV and low pathogenicity common cold coronaviruses (CCCs) OC43, HKU1, NL63, and 229E. Prior exposure to these viruses may have established T cell memory that can be recalled upon SARS-CoV-2 infection and vaccination. As well, SARS-CoV-2 infection and vaccination establishes T cell memory that can influence responses in future infections to these viruses or yet-to-emerge coronaviruses. To identify potentially cross-reactive sequences, we screened the TCR-face of SARS-CoV-2 epitopes for homology with coronaviruses that infect humans, using the JanusMatrix algorithm (Table 58A and Table 58B). All of the membrane and envelope sequences shared identical TCR-face patterns with SARS-CoV. Half the spike clusters were unique to SARS-CoV-2 and the other half are conserved with SARS-CoV. Only three clusters were cross-conserved outside SARS viruses. Given reports of pre-existing T cell immunity in people with no SARS-CoV-2 experience, we relaxed the requirement for 100% identity at every TCR-face position. Fixing two positions shown to be extensively involved in TCR interactions (positions 5 and 8), JanusMatrix predicted an expanded cross-conservation landscape for SARS-CoV-2 spike and membrane clusters. Most spike clusters were conserved, by these criteria, in the subset of coronaviruses that infect humans. The remainder of the sequences with cross-reactivity potential are cross-conserved among highly pathogenic beta-coronaviruses or among high and low pathogenicity beta-coronaviruses. Only three clusters were unique to SARS-CoV-2 by the cristeria described above, and none are solely conserved with SARS-CoV. The single membrane clusters was cross-conserved in the highly pathogenic beta-coronaviruses and coronaviruses that infect humans subsets.

As the vast majority of people were not exposed to SARS-CoV and MERS-CoV, we also explored cross-conservation between SARS-CoV-2 and CCCs only. Of the 32 peptides, only 2 are SARS-CoV-2-specific. Eighteen clusters (56.6%) are cross-conserved across OC43, HKU1, NL63, and 229E and 12 (37.5%) are cross-conserved in at least one of these four CCCs.

Clinical Cohort

Persons with and without SARS-CoV-2 infection history were selected to provide PBMC samples for validation of predicted T cell epitopes. COVID-19 convalescents with PCR-confirmed SARS-CoV-2 infection (N=15) over March-June 2020 were recruited between 30 and 180 days after their most recent positive test and a minimum of 14 days after symptoms resolved (Table 59, Table 60). Donors exhibited a wide range of COVID-19 symptoms and experienced either mild or moderate disease according to WHO criteria. Blood draws from convalescents ranged from approximately one to six months from diagnosis. Healthy individuals (N=10) provided cell samples from February 2016 up to November 2019 and had no opportunity for SARS-CoV-2 exposure. Both cohorts contain a balanced proportion of females and males and similar average age and age range. 60% of convalescent donors are from racial and ethnic minorities. No ethnicity information is available for the healthy cohort.

TABLE 59

COVID-19 convalescent and healthy donor cohort characteristics

Donor Characteristics

Naïve
Convalescent

(N = 10)
(N = 15)

Age (years)

Average
48.5
44.8

Range
26-79
23-71

Gender

Female
40% (4/10)
53.33% (8/15)

Male
60% (6/10)
46.67% (7/15)

Ethnicity

Caucasian
N/A
40% (6/15)

Hispanic/Latino
N/A
33.33% (5/15)

African American
N/A
6.67% (1/15)

Asian/Pacific Islander
N/A
20% (3/15)

Disease Severity

Mild
N/A
73.33% (11/15)

Moderate
N/A
26.67% (4/15)

Severe
N/A
0

Critical
N/A
0

Recruitment

Sample Collection Date
February 2016-November 2019
May-October 2020

Date Tested Positive
N/A
March-June 2020

Time Since Last Positive Test (Days)
N/A
33-158

Time Since Symptoms Resolved (Days)
N/A
26-116

Date Last Tested Positive
N/A
March-June 2020

Date Symptoms Resolved
N/A
March-June 2020

PCR Positivity
N/A
100% (15/15)

TABLE 60

Additional COVID-19 convalescent and healthy

donor cohort characteristics

Donor Characteristics

Naïve
Convalescent

(N = 10)
(N = 15)

BMI

Range
N/A
21.1-43.4

Obese
N/A
13.33%
(2/15)

Overweight
N/A
46.67%
(7/15)

Normal
N/A
40%
(6/15)

Underweight
N/A
0

Symptoms

Fever
N/A
73.33%
(11/15)

Fever Duration (days)
N/A
1.5-14

Cough
N/A
80%
(12/15)

Shortness of breath
N/A
46.67%
(7/15)

Muscle pain
N/A
80%
(12/15)

Headache
N/A
93.33%
(14/15)

Sore throat
N/A
53.33%
(8/15)

Loss of taste or smell
N/A
73.33%
(11/15)

Nausea
N/A
26.67%
(4/15)

Weight Loss
N/A
6.67%
(1/15)

Loss of appetite
N/A
20%
(3/15)

Diarrhea
N/A
40%
(6/15)

Urinary Incontinence
N/A
6.67%
(1/15)

Fatigue
N/A
20%
(3/15)

Malaise
N/A
6.67%
(1/15)

Sweats
N/A
20%
(3/15)

Chills
N/A
20%
(3/15)

Dizziness
N/A
13.33%
(2/15)

Pneumonia
N/A
13.33%
(2/15)

Anemia
N/A
6.67%
(1/15)

Sensitivity to light
N/A
6.67%
(1/15)

Peeling Skin
N/A
6.67%
(1/15)

Chest Pain
N/A
6.67%
(1/15)

Back Pain
N/A
6.67%
(1/15)

Numbing of foot
N/A
6.67%
(1/15)

Disease Severity

Mild
N/A
73.33%
(11/15)

Moderate
N/A
26.67%
(4/15)

Severe
N/A
0

Critical
N/A
0

Past Medical History

Asthma

20%
(3/15)

Fibroids

6.67%
(1/15)

Myopia

20%
(3/15)

Hypertension

13.33%
(2/15)

GERD

20%
(3/15)

Hyperopia

13.33%
(2/15)

Arthritis

6.67%
(1/15)

Hypothyroidism

13.33%
(2/15)

Cataract

6.67%
(1/15)

Osteopenia

6.67%
(1/15)

Breast cancer (in remission)

6.67%
(1/15)

Osteoarthritis

6.67%
(1/15)

Allergies

Seasonal

6.67%
(1/15)

Dust

6.67%
(1/15)

Sulfa

13.33%
(2/15)

Ceclor

6.67%
(1/15)

Dairy

6.67%
(1/15)

Unknown medications

6.67%
(1/15)

Current Medications

Metoprolol

6.67%
(1/15)

Inhaler

6.67%
(1/15)

Natural D-Hist

6.67%
(1/15)

Vitamin C

6.67%
(1/15)

Aspirin

6.67%
(1/15)

Losartin

6.67%
(1/15)

Protonix

6.67%
(1/15)

Amlodipine

6.67%
(1/15)

Meprazole

6.67%
(1/15)

Naprosyn

6.67%
(1/15)

Vitamin B

6.67%
(1/15)

Multivitamin

6.67%
(1/15)

Fish Oil

6.67%
(1/15)

Zinc

6.67%
(1/15)

Magnesium

6.67%
(1/15)

Tumeric

6.67%
(1/15)

Atenolol

6.67%
(1/15)

Levothryoxine

6.67%
(1/15)

Pantoprazole

6.67%
(1/15)

Lisinopril

6.67%
(1/15)

Vitamin D3

6.67%
(1/15)

Lifestyle Habits

Tobacco Use (used to smoke)

40%
(6/15)

Light Caffeine Use

6.67%
(1/15)

Heavy Caffeine Use

66.67%
(10/15)

Light Alcohol Use

26.67%
(4/15)

Moderate Alcohol Use

26.67%
(4/15)

Recreational Drug Use

6.67%
(1/15)

Marijuana Use

33.33%
(5/15)

Predicted Epitopes are Recognized in Natural SARS-CoV-2 Infection

To determine what predicted effector CD4⁺ T cell epitope clusters are recognized by T cells raised in SARS-CoV-2 infection, we conducted ex vivo IFNγFluorospot assays using whole PBMC preparations from convalescent donors. Immune recall in the Fluorospot assay was stimulated using either individual or pooled peptides. Responses that were both >25 spot forming cells over background and >5-fold over background were considered positive. In preliminary studies, a small cohort of donors were stimulated with lower concentrations of peptide (0.313 μg/mL per pooled peptide, 10 μg/mL for individual peptide stimulations). These concentrations were increased to potentially heighten assay sensitivity, however, we found that there was no significant difference in the frequency of epitope-specific clones nor in the number of peptides detected per donor between the assay conditions, thus both data sets are combined here.

Overall, we found that the majority of COVID-19 convalescent donors (9/15; 60%) responded to a pool of the 32 peptides (FIG. 121A). In contrast, only a single healthy control donor out of ten demonstrated immune recall, just above the threshold, to this pool. Individual peptide stimulations showed that COVID-19 convalescent donors had positive responses to envelope, membrane, and spike peptides, including responses in the Si and S2 domains and RBD (FIG. 121D).

For 21/32 (66%) peptides, a recall response was observed in at least one convalescent donor, (corresponding to 1/1 envelope-derived epitope, 7/8 (87.5%) membrane-derived epitopes, 13/23 (56.5%) spike-derived epitopes); 14/32 (44%) peptides were confirmed in at least 20% of convalescent donors (FIG. 121C).

The number of peptides confirmed per donor ranged from 0 to 18 with an average response of 4-5 peptides per convalescent donor (FIG. 121B). Of the healthy donors, only two recognized any of the peptides ex vivo: the aforementioned donor responding to the total peptide pool recalled the sole envelope epitope (peptide 1) and an S2-derived sequence (peptide 25), while another donor mounted a response to an S2 domain sequence (peptide 26). Although rare, these findings suggest that there may be a memory response that cross-reacts with prior common cold coronavirus infection.

Given the variable patterns of epitope-specific responses, we also evaluated the cumulative response to specific antigens by batching the response to individual peptides (FIG. 122A). As expected from the individual donor peptide analysis, T cells specific for membrane-derived epitopes were most frequently recalled with 11/15 donors demonstrating significant anti-membrane responses and membrane epitope-specific T cells making up >40% of the total recall response in 10/11 donors. We also found that despite greater variability in responses to individual S1-derived epitopes, the magnitude of the response to these peptides was similar to the magnitude of membrane-specific T cell responses in individual donors.

Interestingly, the quality of each donor's cumulative response could be used to distinguish three distinct immunophenotypes within the cohort. These cohorts were defined by whether individuals mounted a (1) robust, (2) weak, or (3) no T cell response. Furthermore, divergent trends in terms of responses could be identified for males and females (FIG. 122B).

While similar T cell response levels were observed in the younger individuals evaluated in this study, regardless of sex, they diverge with age with higher effector T cell responses in males only (R=0.757, p=0.011, n=7); conversely, older women (>50 years) primarily exhibit decreased T cell activity. Correlation analysis in the complete cohort of women did not achieve significance (R<0.0001, p=0.993, n=8) due to two outliers (both moderate cases; pneumonia without requiring supplemental oxygen or hospitalization) and possibly because of the small size of the study. However, in the absence of these outliers, the cohort of women with mild cases shows a significant correlation between effector T cell responses and age (R=0.704, p=0.037,n=6)

Collectively, these findings highlight the high prevalence of membrane-targeted T cell responses in most COVID-19 cases and demonstrates the importance of targeting spike in SARS-CoV-2 vaccination to hasten and focus T cell responses to this antigen upon infection.

Pre-Existing SARS-CoV-2 Immunity in Healthy Donors is Stimulated by Predicted Epitopes Following Antigen-Specific Cell Expansion

While convalescent donors as a group recognized a majority of predicted epitopes ex vivo, as individuals, each recognized only a smaller subset. We hypothesized that SARS-CoV induced T cell clones underwent variable expansion and contraction over the course of disease resulting in memory T cell populations resulting in frequencies of T cell responses that were both detectable and undetectable ex vivo. To uncover low frequency epitope-specific T cells, PBMCs were stimulated with SARS-CoV-2 peptides and expanded in short-term culture and then re-stimulated with individual or pools of peptides in an IFNγ Fluorospot assay. Although changes to cellular phenotypes over the course of in vitro expansion do not represent the natural immune response to infection, expansion of epitope-specific T cells present ex vivo and their detection by cultured Fluorospot assay may augment the repertoire of immunogenic SARS-CoV-2 T cell epitopes and suggest greater T cell memory is generated than thought from ex vivo recall alone.

Using the same criteria defining an IFNγ response as above, we found that 11/15 (73.3%) convalescent donors mounted a response to the pool of 32 peptides (FIG. 123A). Like the ex vivo assay, individual peptide stimulations showed positive responses to envelope and membrane peptides, as well as spike peptides covering the S1 and S2 domains and RBD (FIG. 123D). Only a single S2-derived epitope was not recalled. Convalescent donors recognized between 0 and 27 peptides with an average of 9-10 peptide responses per donor. (FIG. 123B). At least one convalescent donor recognized 31/32 (96.9%) peptides, with the majority of peptides (26/32; including 0/1 envelope epitope, 6/8 membrane epitopes, and 20/23 spike epitopes) being confirmed in ≥20% of our cohort. (FIG. 123C).

Changes in IFNγ responses between ex vivo and cultured assay measurements identify broader patterns of peptide recognition associated with source antigen (Data not shown). For example, responses identified in convalescent donors to envelope and membrane-derived antigens were rarely replicated following culture in donors that exhibited positive responses ex vivo. In contrast, responses to spike-derived peptides were primarily identified following culture, however those that did response ex vivo were much more likely to maintain positive responses following culture as well. These differences may reflect variable memory phenotypes (effector vs. central), proliferative capacities, and/or activation/exhaustion profiles of epitope-specific T cell populations following natural infection.

Healthy donors mounted strong recall responses to SARS-CoV-2 T cell epitopes following expansion culture in sharp contrast to the ex vivo response. To the pool of 32 peptides, 8/10 (80%) healthy donors elicited detectable IFNγ responses (FIG. 123A). To individual peptide restimulations, cross-reactive T cell responses were found to envelope, membrane, and spike Si, S2, and RBD-derived peptides (FIG. 123D). On average individual control donors responded to 15 peptides, ranging from 0-27 recognized peptides per donor (FIG. 123B). Following culture, 31/32 (96.9%) peptides stimulated responses in at least one control donor. Of these, only a single peptide was not confirmed in ≥20% of the healthy cohort (FIG. 123C). The 30 peptides correspond to 1/1 envelope peptide, 6/8 membrane peptides, and 23/23 spike peptides. Overall, the magnitude and prevalence of cross-reactive T cell responses in donors with no SARS-CoV-2 history suggests T cell memory to common cold coronavirus infection may contribute to immune response to SARS-CoV-2 infection.

SARS-CoV-2 Peptide Vaccination Stimulates Type 1 Immunity

Confirmation that these predicted SARS-CoV-2 peptides are recognized by T cells raised in infection suggested that they may elicit de novo immune responses by vaccination. Peptide vaccination may prime CD4 and CD8 T cells and generate immune memory that is recalled upon infection to support protective humoral and cellular mechanisms of immunity. Of the 32 peptides, we selected the highest EpiMatrix scoring peptides (above 20) for vaccination. (One peptide was removed to due to poor solubility in the pool and was replaced with the 21^stranked peptide.) Twenty peptides is in range with what is used in peptide vaccine clinical trials. Peptides were formulated with poly-ICLC (Hiltonol), an adjuvant composed of carboxymethylcellulose, poly-inosinic-poly-cytidylic acid, and poly-L-lysine double-stranded RNA, that stimulates the TLR-3 and MDA-5 innate immune pathways and Thl -skewed CD4 T cell responses.

HLA-DR3 transgenic mice were primed and boosted with the peptide vaccine (EPV-CoV-19) to assess in vivo vaccine immunogenicity in the context of human MHC restriction, which is not feasible using wild-type mice. MHC Class II-mediated cellular immunity in the mouse MHC-II knockout/HLA-DR3 knock-in transgenic strain is completely restricted by human MHC, not by its murine ortholog, and presents peptides that are also recognized by human T cells. Thus, vaccine immunogenicity in the HLA-DR3 mouse model may support human application of SARS-CoV-2 peptides. Mice were immunized by the intradermal route. A published peptide vaccine clinical trial utilizing poly-ICLC adjuvant and intradermal route of immunization reported minimal injection site reactions and no toxicity associated with the immunizations.

The type 1/type 2 T cell balance stimulated by vaccination was assessed by measuring cytokine production. IFNγ production was measured as a marker of type 1 responses and IL-4 and IL-5 as markers of type 2 responses. In an IFNγ/IL-4 dual cytokine Fluorospot assay (FIG. 124A), splenocytes from individual mice were stimulated with pools of all vaccine peptides, as well as membrane-derived peptides or spike-derived peptides in the vaccine. All low- and high-dose vaccinated mice mounted an IFNγ response to each pool, as defined by SFC (FIG. 124B) and SI (FIG. 124C) criteria. Although the magnitude of response to each pool was statistically no different between the low- and high-dose groups, the ratio of spike-specific:membrane-specific IFNγ responses was significantly higher in animals that received the high-dose vaccine. Control mice that received either poly-ICLC only or saline did not respond. IL-4 secretion meeting both positivity criteria was not detected in either immunized or mock-immunized mice (FIG. 124D-E). A ratio of the IFNγ and IL-4 responses was used as an indicator of the type 1/type 2 T cell balance. For both the low-dose and high-dose vaccine groups, the IFNγ/IL-4 ratio skews strongly toward type 1 (FIG. 124F).

To assess cytokine production in T cell subpopulations, we used intracellular cytokine staining and flow cytometry to measure frequency, mean fluorescence intensity, and differentiation status of splenic CD4⁺ and CD8⁺ T cells responding to EPV-CoV-19 peptides. Representative flow cytometry data, inlduing gating, is shown in FIG. 125A. EPV-CoV-19 vaccination at both low and high doses stimulated statistically significant increases in numbers of IFNγ-producing memory T cells that were recalled specifically by vaccine peptides, as well as increased average amounts of IFNγ produced per cell compared to control mice (FIG. 125B and data not shown). In contrast, neither IL-4 nor IL-5 production from memory CD4⁺ and CD8⁺ T cells could be restimulated by vaccine peptides in vitro, above controls (FIG. 125C-D). Although we do note a minimal increase in the frequency of IL-5-producing CD4⁺ T cells from vaccinated animals (Data not shown) there is no concurrent increase in IL-5 production per cell, nor is the magnitude of this response comparable to the induction of Th1 subsets, thus minimizing the functional significance of this observation.

Overall, both the low-dose and high-dose vaccine groups exhibit IFNγ/(IL-4+IL-5) ratios that sharply skew toward Th1/Tc1 (FIG. 125E). Taken altogether, these results show EPV-CoV-19 stimulates a strong T cell response that can be recalled to elicit a potent type 1-skewed response in a dose-independent manner while also critically avoiding significant type 2 induction that is associated with enhanced respiratory disease.

Example 8
Overall Design:

A total of 28 healthy participants who meet the eligibility criteria will be enrolled in this study. The participants will be divided into 4 sequential cohorts of 7 participants each, with randomization of treatment within cohorts so that 5 participants receive active vaccine and 2 receive saline placebo. The first 3 cohorts will consist of participants seronegative for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) antibody who will receive either placebo or vaccine with 10 μg/peptide if in the first cohort, 50 μg/peptide if in the second cohort and 150 μg/peptide if in the third. The fourth cohort will consist of participants seropositive for SARS-CoV-2 antibody who will receive either saline placebo or a vaccine dose of 10 μg/peptide.

The dose levels for dose escalation are planned as 10 μg, 50 μg, and 150 μg per peptide, administered intradermally. The starting dose for EPV-CoV-19 Vaccine will be 10 μg per peptide (for up to 20 peptides). The maximum dose, 150 μg per peptide, is lower than the subcutaneous or intramuscular doses (300 μg per peptide) used in Phase 2/3 studies reported by Ott et al for their neoantigen peptide vaccine clinical study. The total peptide dose proposed for the FIH study will be well below those administered safely in previous clinical studies using synthetic peptides.

A staggered dosing will be used for each cohort:

On Day 1, 1 participant will receive EPV-CoV-19 Vaccine and 1 participant will receive saline placebo.

The remaining participants will be dosed at least 48 hours (h) later provided satisfactory safety and tolerability is demonstrated for the participants dose on Day 1.

All participants in the dose ranging part of the study (Cohort 1 to Cohort 3) will go through a Screening Period of up to 14 days (Day −14 to Day −1), while those with positive serum antibodies for SARS-CoV-2 (Cohort 4) will have a Screening Period of up to 30 days (Day −30 to Day −1). Study intervention (including saline placebo and EPV-CoV-19 Vaccine) will be given to all participants at the study site on Day 1 and Day 15. Participants will be released at the discretion of the Investigator following completion of the assessment at 60 minutes post each vaccination provided there are no safety concerns identified from the review of the clinical safety data. All participants will return to the study site on Day 3 (2 days after the first vaccination), Day 8 (one week after the first vaccination), Day 22 (one week after the second vaccination), Day 43 (28 days after the second vaccination), and at 2, 6, and 12 months after the second vaccination for safety, tolerability, and immunogenicity follow-up. Participants will receive monthly phone calls from 3 months after the second vaccination until the end of study (EOS) for safety and efficacy follow-up, except for the months (M6 and M12) where site visit is required. An unscheduled visit for safety and coronavirus disease 2019 (COVID-19) symptom follow-up may be needed as judged by the Investigator.

Safety, tolerability, and immunogenicity data for each cohort will be reviewed by the Safety Monitoring Committee (SMC) before the dose escalation to the next cohort and before involving SARS-CoV-2 antibody positive participants. Dose escalation will only take place after one week of safety follow-up for the second vaccination of the last participant in the previous cohort is completed and provided EPV-CoV-19 is well tolerated following review of all safety, tolerability, and immunogenicity data from the previous cohort by the SMC. Similarly, the cohort of SARS-CoV-2 antibody positive participants will only receive investigational product after one week of safety follow-up for the second vaccination of the last participant from the dose ranging groups is completed and provided EPV-CoV-19 is well tolerated following review of all safety, tolerability, and immunogenicity data from the previous cohort by the SMC. Due to the logistics of the bioanalytical immunogenicity analysis, immunogenicity data for the most recently enrolled cohort will not be included in the safety review for the cohort, but will be included when available in subsequent reviews. Screening procedures may continue between dose escalations to facilitate enrollment of remaining participants.

Brief Summary:

The purpose of this study is to assess the safety, tolerability, and immunogenicity of EPV-CoV-19 Vaccine (T cell epitope-driven) compared with saline placebo in healthy participants. Study details include:

The study duration will be up to: 13 months for Cohort 1 to Cohort 3 and 13.5 months for Cohort 4

The treatment duration will be up to: 15 days

The visit frequency will be: Day 1, Day 3, weekly from Day 8 to Day 22, followed by Day 43, and at 2, 6, and 12 months after the second vaccination; follow-up by phone calls will be performed monthly from 3 months after the second vaccination till the EOS, except for the months (M6 and M12) where site visit is required.

Number of Participants:

A maximum of 28 healthy participants will be randomly assigned to study intervention.

Note: “Enrolled” means a participant's, or their legally acceptable representative's, agreement to participate in a clinical study following completion of the informed consent process and screening. Potential participants who are screened for the purpose of determining eligibility for the study, but do not participate in the study, are not considered enrolled, unless otherwise specified by the protocol. A participant will be considered enrolled if the informed consent is not withdrawn prior to participating in any study activity after screening.

Intervention Groups and Duration:

Total duration of study participation for participants in the dose ranging part of the study (Cohort 1 to Cohort 3) will be up to 13 months and that for participants with positive serum antibodies for SARS-CoV-2 (Cohort 4) will be up to 13.5 months: a Screening Period of up to 14 days, a Treatment and Follow-up Period of 75 days (from first dosing to 60 days after the second dosing), and a Long-term Follow-up Period up to 12 months after the second vaccination.

Participants will receive study intervention (including saline placebo and EPV-CoV-19 Vaccine [10 μg, 50 μg, and 150 μg per peptide for up to 20 peptides]) administered by the Investigator/authorized study personnel at the study site on Day 1 and Day 15. Exemplary vaccines are found in Table 61 using the peptide pools found in Table 62.

TABLE 61

Vaccine
Pool 1
Pool 2
Pool 3
Pool 4

1
2
50
57
107

2
2
50
58
108

3
3
51
57
107

4
3
51
58
108

5
6
53
57
107

6
6
53
58
108

7
7
54
57
107

8
7
54
58
108

9
9
43
57
107

10
9
43
58
108

11
10
44
57
107

12
10
44
58
108

13
13
49
57
107

14
13
49
58
108

15
14
47
57
107

16
14
47
58
108

17
15
37
57
107

18
15
37
58
108

19
16
38
57
107

20
16
38
58
108

21
19
42
57
107

22
19
42
58
108

23
21
41
57
107

24
21
41
58
108

25
22
30
57
107

26
22
30
58
108

27
23
31
57
107

28
23
31
58
108

29
25
34
57
107

30
25
34
58
108

31
26
35
57
107

32
26
35
58
108

TABLE 62

Peptide 1
Peptide 2
Peptide 3
Peptide 4
Peptide 5

Pool
(SEQ ID NO:)
(SEQ ID NO:)
(SEQ ID NO:)
(SEQ ID NO:)
(SEQ ID NO:)

2
1088
1115
1070
1087
1080

3
1092
1115
1070
1087
1080

6
1092
1089
1070
1087
1080

7
1088
1089
1070
1087
1080

9
1088
1115
1070
1085
1080

10
1092
1115
1070
1085
1080

13
1092
1089
1070
1085
1080

14
1088
1089
1070
1085
1080

15
1092
1089
1078
1087
1080

16
1088
1089
1078
1087
1080

19
1092
1115
1078
1087
1080

21
1088
1115
1078
1087
1080

22
1092
1089
1078
1085
1080

23
1088
1089
1078
1085
1080

25
1088
1115
1078
1085
1080

26
1092
1115
1078
1085
10®

30
1088
1115
1070
1087
1066

31
1092
1115
1070
1087
1066

34
1092
1089
1070
1087
1066

35
1088
1089
1070
1087
1066

37
1088
1115
1070
1085
1066

38
1092
1115
1070
1085
1066

41
1092
1089
1070
1085
1066

42
1088
1089
1070
1085
1066

43
1092
1089
1078
1087
1066

44
1088
1089
1078
1087
1066

47
1092
1115
1078
1087
1066

49
1088
1115
1078
1087
1066

51
1088
1089
1078
1085
1066

53
1088
1115
1078
1085
1066

54
1092
1115
1078
1085
1066

57
1104
1107
1110
1116
1105

58
1081
1107
1110
1116
1105

107
1081
1055
1093
1100
1066

108
1104
1055
1093
1100
1066

On each dosing day, the study intervention will be given as four separate i.d. injections (0.5 mL per injection) to each of the four extremities by Mantoux technique. As EPV-CoV-19 Vaccine is composed of four peptide pools (five peptides per pool), each pool will be injected at one injection site. The injection sites at each time will be schematically documented in the case report form (CRF). Participants will be released at the discretion of the Investigator following completion of the assessment at 60 minutes post each vaccination provided there are no safety concerns identified from the review of the clinical safety data.

Stopping/Halting Rules:

For each participant, the booster (second dosing) will be halted if one of the following occurs: a Grade 2 adverse event (AE) lasting for more than two weeks or, a Grade 3 AE, a hypersensitivity reaction of any grade, or positive pregnancy test.

For each dosing cohort, study intervention will be halted if one of the following occurs: two Grade 2 AEs lasting for more than two weeks, two Grade 3AEs, one Grade 2 AEs lasting for more than two weeks combined with one Grade 3 AE, or one Grade 4 AE. The study/study enrollment will be halted at the same time if study intervention is halted for a dosing cohort.

Adverse events will be graded using FDA Guidance for Industry: Toxicity Grading Scale for Healthy Adult and Adolescent Volunteers Enrolled in Preventive Vaccine Clinical Trials (September 2007).

Safety Monitoring Committee:

A SMC has been appointed for this study. The SMC includes Principal Investigator (PI), medical monitor, and Sponsor medical representative and is appointed to monitor the safety and scientific integrity of a human research intervention, and to make recommendations to the Sponsor regarding the stopping of a study for efficacy, for harms, or for futility. The composition of the committee is dependent upon the scientific skills and knowledge required for monitoring the study.

Study Population:
Inclusion Criteria

Participants are eligible to be included in the study only if all of the following criteria apply:
- 1. Ability to provide written informed consent prior to initiation of any study procedures.
- 2. Be able to understand and agrees to comply with planned study procedures and be available for all study visits.
- 3. Agrees to the collection of venous blood per protocol.
- 4. Male or non-pregnant female, 18 to 55 years of age, inclusive, at time of enrollment.
- 5. Body Mass Index 18-35 kg/m², inclusive, at Screening.
- 6. Women of childbearing potential must have a negative urine or serum pregnancy test within 24 hours prior to each vaccination.
- 7. Oral temperature is less than 100.4 degrees Fahrenheit (38.0 degrees Celsius).
- 8. Pulse no greater than 100 beats per minute.
- 9. Systolic blood pressure is 85 to 150 mmHg, inclusive.
- 10. Clinical screening laboratory evaluations (white blood cell [WBC], hemoglobin [Hgb], platelets [PLTs], alanine transaminase [ALT], aspartate transaminase [AST], creatinine [Cr], alkaline phosphatase [ALP], total bilirubin [TBL], Lipase, prothrombin time [PT], and partial thromboplastin time [PTT]) are within acceptable normal reference ranges at the clinical laboratory being used.
- 11. Must agree to have samples stored for secondary research.
- 12. The participant must agree to refrain from donating blood or plasma during the study (outside of this study).
- 13. Negative human immunodeficiency virus (HIV) diagnostic test.
- 14. Contraceptive use by men and women must be consistent with local regulations regarding the methods of contraception for those participating in clinical studies.
  
  NOTE: The reliability of sexual abstinence for male and/or female enrollment eligibility needs to be evaluated in relation to the duration of the clinical study and the preferred and usual lifestyle of the participant. Periodic abstinence (e.g., calendar, ovulation, symptothermal, or post ovulation methods) and withdrawal are not acceptable methods of contraception.

a. Male Participants:

A male participant must agree to use a highly effective contraception as detailed in Appendix 4 of this protocol during the intervention period and an additional 90 days (a spermatogenesis cycle) after the last dose of study intervention and refrain from donating sperm during this period.

b. Female Participants:

A female participant is eligible to participate if she is not pregnant (see Appendix 4), not breastfeeding, and at least one of the following conditions applies:
Not a woman of childbearing potential (WOCBP) as defined in Appendix 4.
OR
A WOCBP who agrees to follow the contraceptive guidance in Appendix 4 during the intervention period and an additional 90 days after the last dose of study intervention.
Additional criteria applicable only to Cohorts 1-3:
- 15. Negative test for SARS-CoV-2 in nasopharyngeal swabs/sputum swabs through real-time reverse transcriptase polymerase chain reaction (RT-PCR) and BinaxNow™ COVID-19 test on Day-1.
- 16. Negative test for serum IgG antibodies to SARS-CoV-2 (titer <1:80) measured by antibody assay (per standardized assay at central laboratory, Mt. Sinai School of Medicine).
Additional criteria applicable only to Cohort 4:
- 17. Negative test for SARS-CoV-2 in nasopharyngeal swabs/sputum swabs through RT-PCR and BinaxNow™ COVID-19 test on Day −30 through Day −1.
- 18. Positive test for serum IgG antibodies to SARS-CoV-2 (titer≥1:80) measured by antibody assay (per standardized assay at central laboratory, Mt. Sinai School of Medicine).
- 19. Fulfills either one of the following criteria with regard to prior COVID-19 symptoms:
  - a. Never had symptoms of COVID-19.

- - b. Experienced mild to moderate symptoms of COVID-19 that resolved at least 30 days prior to vaccination and never required hospitalization.

Exclusion Criteria

Participants are excluded from the study if any of the following criteria apply:
- 1. Positive pregnancy test either at Screening or just prior to each vaccine administration.
- 2. Female participant who is breastfeeding or plans to breastfeed from the time of the first vaccination through 60 days after the last vaccination.
- 3. Has any medical disease or condition that, in the opinion of the site PI or appropriate sub-investigator, precludes study participation.
  
  Note: including acute, subacute, intermittent or chronic medical disease or condition that would place the participant at an unacceptable risk of injury, render the participant unable to meet the requirements of the protocol, or may interfere with the evaluation of responses or the participant's successful completion of this study.
- 4. Presence of self-reported or medically documented significant medical or psychiatric condition(s).
  
  Note: significant medical or psychiatric conditions include but are not limited to:
- (1) respiratory disease (e.g., chronic obstructive pulmonary disease [COPD], asthma) requiring daily medications currently or any treatment of respiratory disease exacerbations (e.g., asthma exacerbation) in the last 5 years. Asthma medications: inhaled, oral, or intravenous corticosteroids, leukotriene modifiers, long and short acting beta agonists, theophylline, ipratropium, biologics.
- (2) significant cardiovascular disease (e.g., congestive heart failure, cardiomyopathy, ischemic heart disease) or history of myocarditis or pericarditis as an adult.
- (3) neurological or neurodevelopmental conditions (e.g., history of migraines in the past 5 years, epilepsy, stroke, seizures in the last 3 years, encephalopathy, focal neurologic deficits, Guillain-Barré syndrome, encephalomyelitis or transverse myelitis).
- (4) ongoing malignancy or recent diagnosis of malignancy in the last five years excluding basal cell and squamous cell carcinoma of the skin, which are allowed.
- (5) an autoimmune disease, including hypothyroidism without a defined non-autoimmune cause, localized, or history of psoriasis.
- (6) An immunodeficiency of any cause.
- 5. Has an acute illness, as determined by the site PI or appropriate sub-investigator, with or without fever (oral temperature≥38.0 degrees Celsius [100.4 degrees Fahrenheit]) within 72 h prior to each vaccination.
  
  Note: an acute illness which is nearly resolved with only minor residual symptoms remaining is allowable if, in the opinion of the site PI or appropriate sub-investigator, the residual symptoms will not interfere with the ability to assess safety parameters as required by the protocol.
- 6. Has a positive test result for hepatitis B surface antigen, hepatitis C virus antibody, or HIV types 1 or 2 antibodies at Screening.
- 7. Has participated in another investigational study involving any investigational product (including study drug, biologic, or device) within 60 days, or 5 half-lives, whichever is longer, before the first vaccine administration.
- 8. Currently enrolled in or plans to participate in another clinical trial with an investigational agent (including licensed or unlicensed vaccine, drug, biologic, device, blood product, or medication) that will be received during the study reporting period (or 13 months after the first vaccination).
- 9. Has a history of hypersensitivity or severe allergic reaction (e.g., anaphylaxis, generalized urticaria, angioedema, other significant reaction) to any previous licensed or unlicensed vaccines.
- 10. Chronic use (more than 14 continuous days) of any medications that may be associated with impaired immune responsiveness. This includes, but not limited to, systemic corticosteroids exceeding 10 mg/day of prednisone equivalent, allergy injections, immunoglobulin, interferon, immunomodulators, cytotoxic drugs, or other similar or toxic drugs during the preceding 6-month period prior to the first vaccine administration. The use of low dose topical, ophthalmic, inhaled and intranasal steroid preparations will be permitted.
- 11. Received immunoglobulins and/or any blood or blood products within the 4 months before the first vaccine administration or at any time during the study.
- 12. Has any blood dyscrasias or significant disorder of coagulation.
- 13. Has a history of alcohol abuse or other recreational drug (excluding cannabis) use within 6 months before the first vaccine administration.
- 14. Has any abnormality or permanent body art (e.g., tattoo) that will interfere with the ability to observe local reactions at the injection site (deltoid/thigh region).
- 15. Received or plans to receive a licensed, live vaccine within 4 weeks before or after each vaccination.
- 16. Received or plans to receive a licensed, inactivated vaccine within 2 weeks before or after each vaccination.
- 17. Receipt of any other SARS-CoV-2 or other experimental coronavirus vaccine at any time prior to or during the study.
- 18. On current treatment with investigational agents for prophylaxis of COVID-19.
- 19. Current use of any prescription or over-the-counter medications within 7 days prior to vaccination, unless approved by the Investigator.
- 20. Plan to travel outside the US (continental US, Hawaii, and Alaska) from enrollment through 28 days after the second vaccination.
- 21. Participant allergic to any component of the investigational vaccine, or a more severe allergic reaction and history of allergies in the past.
- 22. Direct contact with a person who has tested positive for SARS-CoV-2 within 30 days of enrollment.

Additional criteria applicable only to Cohorts 1-3:
- 23. A history of laboratory-confirmed SARS-CoV-2 infection.
- 24. Participant has had clinical signs and symptoms consistent with SARS-CoV-2 infection (including stuffy or runny nose, score throat, shortness of breath, cough, low energy or tiredness, muscle or body aches, headache, chills or shivering, feeling hot or feverish, nausea, vomit, diarrhea, reduced sense of smell and taste, as specified in the FDA guidance) within 2 weeks prior to the first vaccination.

Additional criterion applicable only to Cohort 4:

Participant has had clinical signs and symptoms consistent with SARS-CoV-2 infection (including stuffy or runny nose, score throat, shortness of breath, cough, low energy or tiredness, muscle or body aches, headache, chills or shivering, feeling hot or feverish, nausea, vomit, diarrhea, reduced sense of smell and taste, as specified in the FDA guidance) and a positive SARS-CoV-2 test result within 30 days prior to the first vaccination.

Number	Date	Country
62976715	Feb 2020	US
62983012	Feb 2020	US
62991790	Mar 2020	US
62991814	Mar 2020	US
63001632	Mar 2020	US
63001624	Mar 2020	US
63004729	Apr 2020	US
63006962	Apr 2020	US
63065135	Aug 2020	US
63065161	Aug 2020	US
63065152	Aug 2020	US
63065129	Aug 2020	US
63065163	Aug 2020	US
63073161	Sep 2020	US
63073156	Sep 2020	US
63081062	Sep 2020	US
63081055	Sep 2020	US
63083389	Sep 2020	US
63092229	Oct 2020	US

T CELL EPITOPES AND RELATED COMPOSITIONS USEFUL IN THE PREVENTION, DIAGNOSIS, AND TREATMENT OF COVID-19

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATION

PCT Information

Provisional Applications (19)