CANCER VACCINES

Information

  • Patent Application
  • 20220323558
  • Publication Number
    20220323558
  • Date Filed
    May 13, 2021
    3 years ago
  • Date Published
    October 13, 2022
    2 years ago
Abstract
The present disclosure provides (i) isolated immunogenic TAA polypeptides (i.e., an immunogenic MUC1 polypeptides, an immunogenic MSLN polypeptides, and an immunogenic TERT polypeptides), (ii) isolated nucleic acid molecules encoding one or more immunogenic TAA polypeptides, (iii) compositions comprising an immunogenic TAA polypeptide or an isolated nucleic acid molecule encoding an immunogenic TAA polypeptide, and (iv) methods relating to uses of the polypeptides, nucleic acid molecules, and compositions.
Description
REFERENCE TO SEQUENCE LISTING

This application is being filed along with a sequence listing in electronic format. The sequence listing is provided as a file in .txt format entitled “PC71855A_SeqList_ST25.txt”, created on Nov. 8, 2016, and having a size of 751 KB. The sequence listing contained in the .txt file is part of the specification and is herein incorporated by reference in its entity.


FIELD OF THE INVENTION

The present invention relates generally to immunotherapy and specifically to vaccines and methods for treating or preventing neoplastic disorders.


BACKGROUND OF THE INVENTION

Cancers are a leading cause of mortality worldwide. They may occur in a variety of organs, such as pancreas, ovaries, breasts, lung, colon, and rectum. Pancreatic cancers are the fourth most common cause of cancer deaths in the United States. Pancreatic cancers may occur in the exocrine or endocrine component of the pancreas. Exocrine cancers include (1) pancreatic adenocarcinoma, which is by far the most common type, (2) acinar cell carcinoma, which represents 5% of exocrine pancreatic cancers, (3) cystadenocarcinomas, which account for 1% of pancreatic cancers, and (4) other rare forms of cancers, such as pancreatoblastoma, adenosquamous carcinomas, signet ring cell carcinomas, hepatoid carcinomas, colloid carcinomas, undifferentiated carcinomas, and undifferentiated carcinomas with osteoclast-like giant cells.


Ovarian cancer accounts for about 3% of cancers among women, but it causes more deaths than any other cancer of the female reproductive system. Ovarian cancers include (1) epithelial cancers, such as epithelial ovarian carcinomas, (2) germ cell cancers, such as immature teratomas, and (3) stromal cancers, such as granulosa cell tumors.


Breast cancer is the second most common cancer among American women and the second leading cause of cancer death in women. Breast cancers can be classified based on the hormone receptors and HER2/neu status, such as (1) hormone receptor-positive cancers (where the cancer cells contain either estrogen receptors or progesterone receptors), (2) hormone receptor-negative cancers (where the cancer cells don't have either estrogen or progesterone receptors), (3) HER2/neu positive (wherein cancers that have excessive HER2/neu protein or extra copies of the HER2/neu gene), (4) HER2/neu negative cancers (where the cancers don't have excess HER2/neu), (5) triple-negative cancers (wherein the breast cancer cells have neither estrogen receptors, nor progesterone receptors, nor excessive HER2), and (6) triple-positive cancers (where the cancers are estrogen receptor-positive, progesterone receptor-positive, and have too much HER2).


Lung cancer accounts for more than a quarter of all cancer deaths and is by far the leading cause of cancer death among both men and women. The most common type of lung cancers is non-small cell lung cancers (NSCLC), which account for about 85% to 90% of lung cancers. NSCLC may be further classified into several subtypes, such as squamous cell (epidermoid) carcinoma, adenocarcinoma, large cell (undifferentiated) carcinoma, adenosquamous carcinoma, and sarcomatoid carcinoma. The second common type of lung cancer is small cell lung cancer (SCLC), which accounts for about 10% to 15% of all lung cancers.


Colorectal cancer (CRC) is the second leading cause of cancer-related deaths in the United States when both men and women are combined. Adenocarcinoma is the most common type of CRC, which accounts for more than 95% of colorectal cancers. Other less common types of CRC include Carcinoid tumors, gastrointestinal stromal tumors (GISTs), lymphomas, and sarcomas.


Gastric cancer is the third most common cause of cancer-related death in the world. It remains difficult to cure, primarily because most patients present with advanced disease. In the United States, gastric cancer is currently the 15th most common cancer. About 90-95% of gastric cancers are adenocarcinomas; other less common types include lymphoma (4%), GISTs, and carcinoid tumors (3%).


Traditional regimens of cancer management have been successful in the management of a selective group of circulating and solid cancers. However, many types of cancers are resistant to traditional approaches. In recent years, immunotherapy for cancers has been explored, particularly cancer vaccines and antibody therapies. One approach of cancer immunotherapy involves the administering an immunogen to generate an active systemic immune response towards a tumor-associated antigen (TAA) on the target cancer cell. While a large number of tumor-associated antigens have been identified and many of these antigens have been explored as viral-, bacterial-, protein-, peptide-, or DNA-based vaccines for the treatment or prevention of cancers, most clinical trials so far have failed to produce a therapeutic product. Therefore, there exists a need for immunogens that may be used in the treatment or prevention of cancers.


The present disclosure relates to immunogens derived from the tumor-associated antigens MUC1, mesothelin, and TERT, nucleic acid molecules encoding the immunogens, and compositions comprising such immunogens or nucleic acids.


The human mucin 1 (MUC1; also known as episialin, PEM, H23Ag, EMA, CA15-3, and MCA) is a polymorphic transmembrane glycoprotein expressed on the apical surfaces of simple and glandular epithelia. The MUC1 gene encodes a single polypeptide chain precursor that includes a signal peptide sequence. Immediately after translation the signal peptide sequence is removed and the remaining portion of the MUC1 precursor is further cleaved into two peptide fragments: the longer N-terminal subunit (MUC1-N or MUC1a) and the shorter C-terminal subunit (MUC1-C or MUC1P). The mature MUC1 comprises a MUC1-N and a MUC1-C associated through stable hydrogen bonds. MUC1-N, which is an extracellular domain, contains 25 to 125 variable number tandem repeats (VNTR) of 20 amino acid residues. MUC1-C contains a short extracellular region (approximately 53 amino acids), a transmembrane domain (approximately 28 amino acid), and a cytoplasmic tail (approximately 72 amino acids). The cytoplasmic tail of MUC1 (MUC1-CT) contains highly conserved serine and tyrosine residues that are phosphorylated by growth factor receptors and intracellular kinases. Human MUC1 exists in multiple isoforms resulting from different types of MUC1 RNA alternative splicing. The amino acid sequence of full length human MUC1 isoform 1 protein precursor (isoform 1, Uniprot P15941-1) is provided in SEQ ID NO: 1 (“MUC1 Isoform 1 Reference Polypeptide”). At least 16 other isoforms of human MUC-1 have been reported so far (Uniprot P15941-2 through P15941-17), which include various insertions, deletions, or substitutions as compared to the sequence of isoform 1. These isoforms are known as isoform 2, 3, 4, 5, 6, Y, 8, 9, F, Y-LSP, S2, M6, ZD, T10, E2, and J13 (Uniprot P15941-2 through P15941-17, respectively). The full length human MUC1 isoform 1 precursor protein consists of 1255 amino acids, which includes a signal peptide sequence at amino acids 1-23. The MUC1-N and MUC1-C domains of the mature MUC1 protein consist of amino acids 24-1097 and 1098-1255, respectively.


Mesothelin (also known as MSLN) is a membrane-bound glycoprotein present on the surface of cells lining the pleura, peritoneum and pericardium, and is overexpressed in several human tumors, including mesothelioma, ovarian, and pancreatic adenocarcinoma. The Mesothelin gene encodes a 71-kilodalton (kDa) precursor protein that is processed to a 40-kDa Mesothelin protein and a secreted megakaryocyte potentiating factor (MPF) protein (Chang, et al, Proc Natl Acad Sci USA (1996) 93:136-40). Alternative splicing of MSLN gene results in at least four mesothelin isoforms. The amino acid sequences of isoform 1 (Uniprot Q13421-1), isoform 2 (Uniprot Q13421-3), isoform 3 (Uniprot Q13421-2), and isoform 4 (Uniprot Q13421-4) are available at Uniprot (www.uniprot.org). The amino acid sequence of full length human MSLN isoform 2 precursor protein (Uniprot identifier Q13421-3), which consists of 622 amino acids, is provided in SEQ ID N0:2 (“Mesothelin Precursor Isoform 2 Reference Polypeptide”). The cytoplasmic portion of MSLN comprises amino acid residues 37 to 597 of SEQ ID N0:2 Isoform 2 is the major form of MSLN. Isoform 1, which consists of 630 amino acids, differs from isoform 2 by having an insertion of 8 amino acids (PQAPRRPL) at position 409 of the isoform 2 sequence. Isoform 3 has an alternative C terminus (at positions 593-622 of isoform 2) while isoform 4 has a deletion of amino acid 44, as compared with isoform 2. Isoform 2 is initially translated as a 622-amino acid precursor, which comprises a signal peptide sequence (amino acids 1-36) at the N-terminus and a GPI-anchor sequence at the C-terminus. The signal peptide sequence and the GPI-anchor sequence may be cleaved off in the mature mesothelin.


Telomerase reverse transcriptase (or TERT) is the catalytic component of the telomerase, which is a ribonucleoprotein polymerase responsible for maintaining telomere ends by addition of the telomere repeat TTAGGG. In addition to TERT, telomerase also includes an RNA component which serves as a template for the telomere repeat. Human TERT gene encodes an 1132 amino acid protein. Several isoforms of human TERT exist, which result from alternative splicing. The amino acid sequences of isoform 1, isoform 2, isoform 3, and isoform 4 are available at Uniprot (<www.uniprot.org>; Uniprot identifiers 014746-1, 014746-2, 014746-3, and 014746-4, respectively). The amino acid sequence of human full length TERT isoform 1 protein (isoform 1, Genbank AAD30037, Uniprot 014746-1) is also provided herein in SEQ ID NO:3 (“TERT Isoform 1 Reference Polypeptide”). As compared with TERT isoform 1 (014746-1), isoform 2 (014746-2) has replacement of amino acids 764-807 (STLTDLQPYM . . . LNEASSGLFD→LRPVPGDPAG . . . AGRAAPAFGG) and deletion of C-terminal amino acids 808-1132), isoform 3 (014746-3) has deletion of amino acids 885-947, and isoform 4 (014746-4) has deletions of amino acids 711-722 and 808-1132, and replacement of amino acids 764-807 (STLTDLQPYM . . . LNEASSGLFD→LRPVPGDPAG . . . AGRAAPAFGG).


SUMMARY OF THE INVENTION

In some aspects, the present disclosure provides isolated immunogenic polypeptides which comprise amino acid sequences of one or more human TAA selected from MUC1, MSLN, and TERT. The immunogenic polypeptides are useful, for example, in eliciting an immune response in vivo in a subject or for use as a component in vaccines for treating cancer.


In other aspects, the present disclosure provides nucleic acid molecules that encode an immunogenic polypeptide provided by the present disclosure. In some embodiments, the present disclosure provides multi-antigen nucleic acid constructs that each encode two, three, or more immunogenic polypeptides.


The disclosure also provides vectors containing one or more nucleic acid molecules of the invention. The vectors are useful for cloning or expressing the immunogenic TAA polypeptides encoded by the nucleic acid molecules, or for delivering the nucleic acid molecules in a composition, such as a vaccine, to a host cell or to a host animal or a human.


In some further aspects, the present disclosure provides compositions comprising one or more immunogenic TAA polypeptides, isolated nucleic acid molecules encoding immunogenic TAA polypeptides, or vectors or plasmids containing nucleic acid molecules encoding immunogenic TAA polypeptides. In some embodiments, the composition is an immunogenic composition useful for eliciting an immune response against a TAA in a subject, such as a mouse, dog, monkey, or human. In some embodiments, the composition is a vaccine composition useful for immunization of a mammal, such as a human, for inhibiting abnormal cell proliferation, for providing protection against the development of cancer (used as a prophylactic), or for treatment of disorders (used as a therapeutic) associated with TAA over-expression, such as cancer, particularly pancreatic, ovarian, and triple-negative breast cancer. In still other aspects, the present disclosure provides methods of using the immunogenic TAA polypeptides, isolated nucleic acid molecules, and compositions comprising an immunogenic TAA polypeptide or isolated nucleic acid molecules described herein above. In some embodiments, the present disclosure provides a method of eliciting an immune response against a TAA in a subject, particularly a human, comprising administering to the subject an effective amount of a polypeptide provided by the invention that is immunogenic against the target TAA, an effective amount of an isolated nucleic acid molecule encoding such an immunogenic polypeptide, or a composition comprising such an immunogenic TAA polypeptide or an isolated nucleic acid molecule encoding such an immunogenic TAA polypeptide. The polypeptides, nucleic acids, or compositions comprising the polypeptide or nucleic acid may be used together with one or more adjuvants or immune modulators.







DETAILED DESCRIPTION OF THE INVENTION
A. Definitions

The term “adjuvant” refers to a substance that is capable of enhancing, accelerating, or prolonging an immune response elicited by an immunogen.


The term “agonist” refers to a substance which promotes (induces, causes, enhances or increases) the activity of another molecule (such as a receptor). The term “agonist” encompasses substances which bind a receptor and substances which promote receptor function without binding thereto.


The term “antagonist” or “inhibitor” refers to a substance that partially or fully blocks, inhibits, or neutralizes a biological activity of another molecule or a receptor.


The term “co-administration” refers to administration of two or more agents to the same subject during a treatment period. The two or more agents may be encompassed in a single formulation and thus be administered simultaneously. Alternatively, the two or more agents may be in separate physical formulations and administered separately, either sequentially or simultaneously to the subject. The term “administered simultaneously” or “simultaneous administration” means that the administration of the first agent and that of a second agent overlap in time with each other, while the term “administered sequentially” or “sequential administration” means that the administration of the first agent and that of a second agent do not overlap in time with each other.


The term “cytosolic” or “cytoplasmic” means that after a nucleotide sequence encoding a particular polypeptide is expressed by a host cell, the expressed polypeptide is expected to be retained inside the host cell.


The term “degenerate variant” refers to a polynucleotide that differs in the nucleotide sequence from the reference polynucleotide but encodes the same polypeptidesequence as encoded by the reference polynucleotide. Most of the 20 natural amino acids that are components of proteins or peptides are specified by more than one codon. For instance, the codons CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino acid arginine. Thus, at every position where an arginine is specified within a protein-encoding sequence, the codon can be altered to any of the corresponding codons described without altering the amino acid sequence of the encoded protein. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given polypeptide.


The term “effective amount” refers to an amount administered to a subject that is sufficient to cause a desired effect in the subject.


The term “fragment” of a given polypeptide refers to a polypeptide that is shorter than the given polypeptide and shares 100% identity with the sequence of the given polypeptide.


The term “functional variant” of an immunogenic TAA polypeptide refers to a polypeptide that comprises from 90% to 110% of the number of amino acids of the reference immunogenic TAA polypeptide, has lower than 100% but higher than 95% identity to the amino acid sequence of the reference TAA polypeptide, and possess the same or similar immunogenic properties of the reference immunogenic TAA polypeptide.


The term “identical” refers to two or more nucleic acids, or two or more polypeptides, that share the exact same sequence of nucleotides or amino acids, respectively. The term “percent identity” describes the level of similarity between two or more nucleic acids or polypeptides. When two sequences are aligned by bioinformatics software, “percent identity” is calculated by multiplying the number of exact nucleotide/amino acid matches between the sequences by 100, and dividing by the length of the aligned region, including gaps. For example, two 100-amino acid long polypeptides that exhibit 10 mismatches when aligned would be 90% identical.


The term “immune-effector-cell enhancer” or “IEC enhancer” refers to a substance capable of increasing or enhancing the number, quality, and/or function of one or more types of immune effector cells of a subject. Examples of immune effector cells include cytolytic CD8 T cells, CD4 T cells, NK cells, and B cells.


The term “immune modulator” refers to a substance capable of altering (e.g., inhibiting, decreasing, increasing, enhancing or stimulating) the working or function of any component of the innate, humoral, or cellular immune system of a subject. Thus, the term “immune modulator” encompasses the “immune-effector-cell enhancer” as defined herein and the “immune-suppressive-cell inhibitor” as defined herein, as well as substance that affects any other components of the immune system of a subject.


The term “immune response” refers to any detectable response to a particular substance (such as an antigen or immunogen) by the immune system of a host vertebrate animal, including, but not limited to, innate immune responses (e.g., activation of Toll-like receptor signaling cascade), cell-mediated immune responses (e.g., responses mediated by T cells, such as antigen-specific T cells, and non-specific cells of the immune system), and humoral immune responses (e.g., responses mediated by B cells, such as generation and secretion of antibodies into the plasma, lymph, and/or tissue fluids). Examples of immune responses include an alteration (e.g., increase) in Toll-like receptor activation, lymphokine (e.g., cytokine (e.g., Th1, Th2 or Th17 type cytokines) or chemokine) expression or secretion, macrophage activation, dendritic cell activation, T cell (e.g., CD4+ or CD8+ T cell) activation, NK cell activation, B cell activation (e.g., antibody generation and/or secretion), binding of an immunogen (e.g., antigen, immunogenic polypeptide) to an MHC molecule, induction of a cytotoxic T lymphocyte (“CTL”) response, induction of a B cell response (e.g., antibody production), and expansion (e.g., growth of a population of cells) of cells of the immune system (e.g., T cells and B cells), and increased processing and presentation of antigen by antigen presenting cells. The term “immune response” also encompasses any detectable response to a particular substance (such as an antigen or immunogen) by one or more components of the immune system of a vertebrate animal in vitro.


The term “immunogen” refers to a substance that is immunogenic.


The term “immunogenic” refers to the ability of a substance upon administration to a subject (such as a human) to cause, elicit, stimulate, or induce an immune response, or to improve, enhance, increase or prolong a pre-existing immune response, against a particular antigen in the subject, whether alone or when linked to a carrier, in the presence or absence of an adjuvant.


The term “immunogenic composition” refers to a composition that is immunogenic.


The term “immunogenic MUC1 polypeptide” refers to a polypeptide that is immunogenic against a human native MUC1 protein or against cells expressing the human native MUC1 protein. The polypeptide may have the same amino acid sequence as that of a human native MUC1 protein or display one or more mutations as compared to the amino acid sequence of a human native MUC1 protein.


The term “immunogenic MSLN polypeptide” refers to a polypeptide that is immunogenic against a human native MSLN protein or against cells expressing human native MSLN protein. The polypeptide may have the same amino acid sequence as that of a human native MSLN protein or displays one or more mutations as compared to the amino acid sequence of a human native MSLN protein.


The term “immunogenic TERT polypeptide” refers to a polypeptide that is immunogenic against a human native TERT protein or against cells expressing a human native TERT protein. The polypeptide may have the same amino acid sequence as that of a human native TERT protein or displays one or more mutations as compared to the amino acid sequence of a human native TERT protein.


The term “immunogenic TAA polypeptide” refers to an “immunogenic MSLN polypeptide,” an “immunogenic MUC1 polypeptide, or an “immunogenic TERT polypeptide,” each as defined herein above.


The term “immunogenic MUC1 nucleic acid molecule” refers to a nucleic acid molecule that encodes an “immunogenic MUC1 polypeptide” as defined herein.


The term “immunogenic MSLN nucleic acid molecule” refers to a nucleic acid molecule that encodes an “immunogenic MSLN polypeptide” as defined herein.


The term “immunogenic TERT nucleic acid molecule” refers to a nucleic acid molecule that encodes an “immunogenic TERT polypeptide” as defined herein.


The term “immunogenic TAA nucleic acid molecule” refers to a nucleic acid molecule that encodes an “immunogenic MUC1 polypeptide,” an “immunogenic MSLN polypeptide, or an “immunogenic TERT polypeptide” as defined herein above.


The term “immune-suppressive-cell inhibitor” or “ISC inhibitor” refers to a substance capable of reducing and/or suppressing the number and/or function of immune suppressive cells of a subject. Examples of immune suppressive cells include regulatory T cells (“Tregs”), myeloid-derived suppressor cells, and tumor-associated macrophages.


The term “subject” refers to either a human or a non-human mammal. The term “mammal” refers to any animal species of the Mammalia class. Examples of mammals include: humans; non-human primates such as monkeys; laboratory animals such as rats, mice, guinea pigs; domestic animals such as cats, dogs, rabbits, cattle, sheep, goats, horses, and pigs; and captive wild animals such as lions, tigers, elephants, and the like.


The term “membrane-bound” means that after a nucleotide sequence encoding a particular polypeptide is expressed by a host cell, the expressed polypeptide is bound to, attached to, or otherwise associated with, the membrane of the cell.


The term “neoplastic disorder” refers to a condition in which cells proliferate at an abnormally high and uncontrolled rate, the rate exceeding and uncoordinated with that of the surrounding normal tissues. It usually results in a solid lesion or lump known as “tumor.” This term encompasses benign and malignant neoplastic disorders. The term “malignant neoplastic disorder”, which is used interchangeably with the term “cancer” in the present disclosure, refers to a neoplastic disorder characterized by the ability of the tumor cells to spread to other locations in the body (known as “metastasis”). The term “benign neoplastic disorder” refers to a neoplastic disorder in which the tumor cells lack the ability to metastasize.


The term “mutation” refers to deletion, addition, or substitution of amino acid residues in the amino acid sequence of a protein or polypeptide as compared to the amino acid sequence of a reference protein or polypeptide.


The term “operably linked” refers to a juxtaposition wherein the components described are in a relationship permitting them to function in their intended manner. A control sequence “operably linked” to a transgene is ligated in such a way that expression of the transgene is achieved under conditions compatible with the control sequences.


The term “pharmaceutically composition” refers to a solid or liquid composition suitable for administration to a subject (e.g. a human patient) for eliciting a desired physiological, pharmacological, or therapeutic effect. In addition to containing one or more active ingredients, a pharmaceutical composition may contain one or more pharmaceutically acceptable excipients.


The term “pharmaceutically acceptable excipient” refers to a substance in an immunogenic, pharmaceutical, or vaccine composition, other than the active ingredients (e.g., the antigen, antigen-coding nucleic acid, immune modulator, or adjuvant) that is compatible with the active ingredients and does not cause significant untoward effect in subjects to whom it is administered.


The terms “peptide,” “polypeptide,” and “protein” are used interchangeably herein, and refer to a polymeric form of amino acids of any length, which can include coded and non-coded amino acids, chemically, or biochemically modified or derivatized amino acids, and polypeptides having modified polypeptide backbones.


The term “preventing” or “prevent” refers to (a) keeping a disorder from occurring or (b) delaying the onset of a disorder or onset of symptoms of a disorder.


The term “secreted” in the context of a polypeptide means that after a nucleotide sequence encoding the polypeptide is expressed by a host cell, the expressed polypeptide is secreted outside of the host cell.


The term “suboptimal dose” when used to describe the amount of an immune modulator, such as a protein kinase inhibitor, refers to a dose of the immune modulator that is below the minimum amount required to produce the desired therapeutic effect for the disease being treated when the immune modulator is administered alone to a patient. The term “treating,” “treatment,” or “treat” refers to abrogating a disorder, reducing the severity of a disorder, or reducing the severity or occurrence frequency of a symptom of a disorder.


The term “tumor-associated antigen” or “TAA refers to an antigen which is specifically expressed by tumor cells or expressed at a higher frequency or density by tumor cells than by non-tumor cells of the same tissue type. Tumor-associated antigens may be antigens not normally expressed by the host; they may be mutated, truncated, misfolded, or otherwise abnormal manifestations of molecules normally expressed by the host; they may be identical to molecules normally expressed but expressed at abnormally high levels; or they may be expressed in a context or milieu that is abnormal. Tumor-associated antigens may be, for example, proteins or protein fragments, complex carbohydrates, gangliosides, haptens, nucleic acids, or any combination of these or other biological molecules.


The term “vaccine” refers to an immunogenic composition for administration to a mammal (such as a human) for eliciting a protective immune response against a particular antigen or antigens. The primary active ingredient of a vaccine is the immunogen(s).


The term “vector” refers to a nucleic acid molecule, or a modified microorganism, that is capable of transporting or transferring a foreign nucleic acid molecule into a host cell. The foreign nucleic acid molecule is referred to as “insert” or “transgene.” A vector generally consists of an insert and a larger sequence that serves as the backbone of the vector. Based on the structure or origin of vectors, major types of vectors include plasmid vectors, cosmid vectors, phage vectors (such as lambda phage), viral vectors (such as adenovirus vectors), artificial chromosomes, and bacterial vectors.


B. Immunogenic Tumor-Associated-Antigen (TAA) Polypeptides

In some aspects, the present disclosure provides isolated immunogenic MUC1 polypeptides, TERT polypeptides, and MSLN polypeptides, which are useful, for example, for eliciting an immune response in a subject against MUC1, TERT, and MSLN, respectively, or for use as a component in vaccines for treating cancer, such as pancreatic, ovarian, and breast cancer, particularly triple-negative breast cancer.


These immunogenic TAA polypeptides can be prepared by methods known in the art in light of the present disclosure. The capability of the polypeptides to elicit an immune response can be measured in in vitro assays or in vivo assays. In vitro assays for determining the capability of a polypeptide or DNA construct to elicit immune responses are known in the art. One example of such in vitro assays is to measure the capability of the polypeptide or nucleic acid expressing a polypeptide to stimulate T cell response as described in U.S. Pat. No. 7,387,882, the disclosure of which is incorporated in this application. The assay method comprises the steps of: (1) contacting antigen presenting cells in culture with an antigen thereby the antigen can be taken up and processed by the antigen presenting cells, producing one or more processed antigens; (2) contacting the antigen presenting cells with T cells under conditions sufficient for the T cells to respond to one or more of the processed antigens; (3) determining whether the T cells respond to one or more of the processed antigens. The T cells used may be CD8+ T cells or CD4+ T cells. T cell response may be determined by measuring the release of one of more of cytokines, such as interferon-gamma and interleukin-2, and lysis of the antigen presenting cells (tumor cells). B cell response may be determined by measuring the production of antibodies.


B-1. Immunogenic MUC1 Polypeptides


In one aspect, the present disclosure provides isolated immunogenic MUC1 polypeptides derived from a human native MUC1, wherein the MUC1 polypeptides display one or more introduced mutations relative to the human native MUC1 protein. Examples of mutations include deletion of some, but not all, of the tandem repeats of 20 amino acids in the VNTR region of the MUC1 protein, deletion of the signal peptide sequence in whole or in part, and deletion of amino acids of non-consensus amino acid sequences found in the MUC1 isoforms. Thus, in some embodiments, the immunogenic MUC1 polypeptides provided by the present disclosure comprise (1) the amino acid sequence of 3 to 30 tandem repeats of 20 amino acids of a human MUC1 protein and (2) the amino acid sequences of the human MUC1 protein that flank the VNTR region. In some particular embodiments, the immunogenic MUC1 polypeptides comprise (1) the amino acid sequence of 5 to 25 tandem repeats of the human MUC1 and (2) the amino acid sequences of the human MUC1 protein that flank the VNTR region. In some further embodiments, the immunogenic MUC1 polypeptides are in cytoplasmic form (or “cMUC1”). The term “cytoplasmic form” refers to an immunogenic MUC1 polypeptide that lacks in whole or in part the secretory sequence (amino acids 1-23; also known as “signal peptide sequence”) of the human native MUC1 protein. The deletion of amino acids of the secretory sequence is expected to prevent the polypeptide from entering the secretory pathway as it is expressed in the cells. In some other embodiments, the immunogenic MUC1 polypeptides comprise the amino acid sequence of a membrane-bond form of the MUC1.


The immunogenic MUC1 polypeptides provided by the present disclosure may be derived, constructed, or prepared from the amino acid sequence of any of the human MUC1 isoforms known in the art or discovered in the future, including, for example, Uniprot isoforms 1, 2, 3, 4, 5, 6, Y, 8, 9, F, Y-LSP, S2, M6, ZD, T10, E2, and J13 (Uniprot P15941-1 through P15941-17, respectively). In some embodiments, the immunogenic MUC1 polypeptides comprise an amino acid sequence that is part of human MUC1 isoform 1 wherein the amino acid sequence of the human MUC1 isoform 1 is set forth in SEQ ID NO:1. In a specific embodiment, the immunogenic MUC1 polypeptide comprises amino acids 24-225 and 1098-1255 of the amino acid sequence of SEQ ID NO:1. In another specific embodiment, the immunogenic MUC1 polypeptide comprises amino acids 22-225 and 946-1255 of the amino acid sequence of SEQ ID NO:1. In some other specific embodiments, the immunogenic MUC1 polypeptide comprises, or consists of, the amino acid sequence selected from the group consisting of:


(1) the amino acid sequence of SEQ ID NO:8 (Plasmid 1027 polypeptide);


(2) an amino acid sequence comprising amino acids 4-537 of SEQ ID NO:8;


(3) an amino acid sequence comprising amino acids 24-537 of SEQ ID NO:8;


(4) the amino acid sequence of SEQ ID NO:16 (Plasmid 1197 polypeptide);


(5) an amino acid sequence comprising amino acids 4-517 of SEQ ID NO:16; and


(6) an amino acid sequence comprising amino acids 4-517 of SEQ ID NO:16, wherein in SEQ ID NO:16 the amino acid at positon 513 is T.


In some specific embodiments, the immunogenic MUC1 polypeptides comprise the amino acid sequence of SEQ ID NO:8 (Plasmid 1027 polypeptide) or SEQ ID NO:16 (Plasmid 1197 polypeptide).


B-2. Immunogenic MSLN Polypeptides


In one aspect, the present disclosure provides isolated immunogenic MSLN polypeptides derived from a human MSLN precursor by deletion of a portion or the entire signal peptide sequence of the MSLN precursor. Thus, the immunogenic MSLN polypeptides comprise the amino acid sequence of a native human MSLN precursor, wherein part or the entire signal peptide sequence of the MSLN precursor is absent. In some embodiments, part of, or the entire, GPI anchor sequence of the native human MSLN (i.e., amino acids 598-622 of SEQ ID NO:2) is also absent in the immunogenic MSLN polypeptide. As used herein, the term “human MSLN” encompasses any human MSLN isoform, such as isoform 1, 2, 3, or 4. In some particular embodiments, the human MSLN is human MSLN isoform 2.


In some particular embodiments, the isolated immunogenic MSLN polypeptide is selected from the group consisting of:


1) a polypeptide comprising, or consisting of, amino acids 37-597 of the amino acid sequence of SEQ ID NO:2;


2) a polypeptide comprising an amino acid sequence that is at least 90%, 95%, 98%, or 99% identical to the amino acid sequence consisting of amino acids 37-597 of the amino acid sequence of SEQ ID NO:2;


3) a polypeptide comprising, or consisting of, the amino acid sequence of SEQ ID NO:6, or amino acids 4-564 of the amino acid sequence of SEQ ID NO:6; and


4) a polypeptide comprising an amino acid sequence that has at least 93%-99%, 94%-98%, or 94%-97% identity to the amino acid sequence of SEQ ID NO:6 (“Plasmid 1103 Polypeptide”).


B-3. Immunogenic TERT Polypeptides


In another aspect, the present disclosure provides isolated immunogenic TERT polypeptides derived from a human TERT protein by deletion of up to 600 of the N-terminal amino acids of the TERT protein. Thus, in some embodiments, the immunogenic TERT polypeptides comprise the amino acid sequence of TERT isoform 1 set forth in SEQ ID NO:3, wherein up to about 600 amino acids from the N-terminus (amino terminus) of the amino acid sequence of TERT isoform 1 are absent. Any number of amino acids up to 600 from the N-terminus of the TERT isoform 1 may be absent in the immunogenic TERT polypeptide. For example, the N-terminal amino acids from position 1 through position 50, 100, 50, 200, 245, 300, 350, 400, 450, 500, 550, or 600 of the TERT isoform 1 of SEQ ID NO:3 may be absent from the immunogenic TERT polypeptide. Thus, an immunogenic TERT polypeptide provided by the present disclosure may comprise amino acids 51-1132, 101-1132, 151-1132, 201-1132, 251-1132, 301-1132, 351-1132, 401-1132, 451-1132, 501-1132, or 551-1132 of SEQ ID NO:3. The immunogenic TERT polypeptides may also be constructed from other TERT isoforms. Where the polypeptides are constructed from TERT isoforms with C-terminal truncations, however, it is preferred that fewer amino acids may be deleted from the N-terminus.


In some further embodiments, the immunogenic TERT polypeptide further comprises one or more amino acid mutations that inactivate the TERT catalytic domain. Examples of such amino acid mutations include substitution of aspartic acid with alanine at position 712 of SEQ ID NO:3 (D712A) and substitution of valine with isoleucine at position 713 of SEQ ID NO:3 (V7131). In some embodiments the immunogenic TERT polypeptide comprises both mutations D712A and V7131.


In some specific embodiments, the present disclosure provides an immunogenic TERT polypeptide selected from the group consisting of:


1) a polypeptide comprising an amino acid sequence of SEQ ID NO:10 or amino acids 2-892 of SEQ ID NO:10 (“Plasmid 1112 Polypeptide”); or a functional variant of the polypeptide;


2), a polypeptide comprising an amino acid sequence of SEQ ID NO:14 or amino acids 3-789 of SEQ ID NO:14 (“Plasmid 1326 Polypeptide”), or a functional variant of the polypeptide; and


3) a polypeptide comprising an amino acid sequence of SEQ ID NO:12 or amino acids 4-591 of SEQ ID NO:12 (“Plasmid 1330 Polypeptide”), or a functional variant of the polypeptide.


C. Nucleic Acid Molecules Encoding Immunogenic TAA Polypeptides

In some aspects, the present disclosure provides nucleic acid molecules that each encode one, two, three, or more separate immunogenic TAA polypeptides that are provided by the present disclosure. The nucleic acid molecules can be deoxyribonucleotides (DNA) or ribonucleotides (RNA). Thus, a nucleic acid molecule can comprise a nucleotide sequence disclosed herein wherein thymidine (T) can also be uracil (U), which reflects the differences between the chemical structures of DNA and RNA. The nucleic acid molecules can be modified forms, single or double stranded forms, or linear or circular forms. The nucleic acid molecules can be prepared using methods known in the art light of the present disclosure.


C-1. Single-Antigen Constructs


In one aspect, the present disclosure provides an isolated nucleic acid molecule, which comprises a nucleotide sequence encoding a single immunogenic MUC1 polypeptide, a single immunogenic MSLN polypeptide, or a single immunogenic TERT polypeptide provided by the present disclosure. A nucleic acid molecule that encodes only one immunogenic TAA polypeptide, such as an immunogenic MUC1 polypeptide, an immunogenic MSLN polypeptide, or an immunogenic TERT, is also referred to herein as “single-antigen construct.”


C-1a. MUC1 Single Antigen Constructs


In some embodiments, the present disclosure provides isolated nucleic acid molecules that encode an immunogenic MUC1 polypeptide provided in the present disclosure. The immunogenic MUC1 polypeptide encoded by a nucleic acid molecule may be in cytoplasmic form (or cMUC1) or “membrane-bound form (or mMUC1). The term “membrane-bound form” refers to an immunogenic MUC1 polypeptide that, after being expressed from the coding nucleic acid by a host cell, is bound to, attached to, or otherwise associated with, the membrane of the host cell.


In some specific embodiments, the isolated nucleic acid molecules provided by the present disclosure comprise a nucleotide sequence that encodes an immunogenic MUC1 polypeptide selected from the group consisting of:


(1) an immunogenic MUC1 polypeptide comprising the amino acid sequence of SEQ ID NO:8 (Plasmid 1027 polypeptide);


(2) an immunogenic MUC1 polypeptide comprising amino acids 4-537 of SEQ ID NO:8;


(3) an immunogenic MUC1 polypeptide comprising amino acids 24-537 of SEQ ID NO:8;


(4) an immunogenic MUC1 polypeptide comprising the amino acid sequence of SEQ ID NO:16 (Plasmid 1197 polypeptide);


(5) an immunogenic MUC1 polypeptide comprising amino acids 4-517 of SEQ ID NO:16;


(6) an immunogenic MUC1 polypeptide comprising amino acids 4-517 of SEQ ID NO:16, with the proviso that the amino acid at positon 513 is T; and


(7) an immunogenic MUC1 polypeptide comprising amino acids 24-225 and 946-1255 of SEQ ID NO:1.


In some other specific embodiments, the isolated nucleic acid molecules provided by the present disclosure comprise a nucleotide sequence, or a degenerate variant thereof, selected from the group consisting of:


(1) the nucleotide sequence of SEQ ID NO:7 (Plasmid 1027);


(2) a nucleotide sequence comprising nucleotides 10-1611 of SEQ ID NO:7; (3) the nucleotide sequence of SEQ ID NO:15 (Plasmid 1197); and


(4) a nucleotide sequence comprising nucleotides 10-1551 of SEQ ID NO:15;


C-1b. MSLN Single Antigen Constructs


In some embodiments, the present disclosure provides isolated nucleic acid molecules that encode an immunogenic MSLN polypeptide provided in the present disclosure.


In some particular embodiments, the isolated nucleic acid molecule encodes an immunogenic MSLN polypeptide selected from the group consisting of:


1) an immunogenic MSLN polypeptide comprising, or consisting of, amino acids 37-597 of the amino acid sequence of SEQ ID NO:2;


2) an immunogenic MSLN polypeptide comprising an amino acid sequence that is at least 90%, 95%, 98%, or 99% identical to the amino acid sequence consisting of amino acids 37-597 of the amino acid sequence of SEQ ID NO:2;


3) an immunogenic MSLN polypeptide comprising, or consisting of, the amino acid sequence of SEQ ID NO:6; and


4) an immunogenic MSLN polypeptide comprising an amino acid sequence that has at least 93%-99%, 94%-98%, or 94%-97% identity to the amino acid sequence of SEQ ID NO:6 (“Plasmid 1103 Polypeptide”).


In some other specific embodiments, the isolated nucleic acid molecules provided by the present disclosure comprise a nucleotide sequence, or a degenerate variant thereof, selected from the group consisting of:


(1) the nucleotide sequence of SEQ ID NO:5; and


(2) a nucleotide sequence comprising nucleotides 10-1692 of SEQ ID NO:5.


C-1c. TERT Single Antigen Constructs


In some other embodiments, the present disclosure provides isolated nucleic acid molecules that encode an immunogenic TERT polypeptide provided in the present disclosure.


An immunogenic TERT polypeptide encoded by a nucleic acid provided by the represent disclosure may contain a deletion of maximum of 600 amino acids from the N-terminus of the amino acid sequence of TERT isoform 1. Generally, an immunogenic TERT polypeptide may be expected to possess stronger immunogenicity if it has deletion of fewer amino acids from the N-terminus of the TERT protein. The number of N-terminal amino acids that can be deleted from the TERT protein may be determined based on how the nucleic acid molecule encoding the polypeptide is intended to be used or delivered. For example, where the nucleic acid molecule is to be delivered using a particular viral vector, the deletion may be determined based on the capacity of the vector used.


In some embodiments, the immunogenic TERT polypeptides encoded by the nucleic acid molecules comprise one or more amino acid mutations that inactivate the TERT catalytic domain. Examples of such amino acid mutations include substitution of aspartic acid with alanine at position 712 of SEQ ID NO:3 (D712A) and substitution of valine with isoleucine at position 713 of SEQ ID NO:3 (V7131). In some embodiments the immunogenic TERT polypeptide comprises both mutations D712A and V7131.


In some specific embodiments, the isolated nucleic acid molecules encode an immunogenic TERT polypeptide selected from the group consisting of:


(1) an immunogenic TERT polypeptide comprising an amino acid sequence of SEQ ID NO:10 or amino acids 2-892 of SEQ ID NO:10 (“Plasmid 1112 Polypeptide”), or a functional variant of the polypeptide;


(2), an immunogenic TERT polypeptide comprising an amino acid sequence of SEQ ID NO:14 or amino acids 3-789 of SEQ ID NO:14 (“Plasmid 1326 Polypeptide” or a functional variant of the polypeptide; and


(3) an immunogenic TERT polypeptide comprising an amino acid sequence of SEQ ID NO:12 or amino acids 4-591 of SEQ ID NO:12 (“Plasmid 1330 Polypeptide”), or a functional variant of the polypeptide.


In some particular embodiments, the isolated nucleic acid molecules comprise a nucleotide sequence, or a degenerate variant thereof, selected from the group consisting of:


(1) the nucleotide sequence of SEQ ID NO:9 (TERT240);


(2) a nucleotide sequence comprising nucleotides 4-2679 of SEQ ID NO:9;


(3) the nucleotide sequence of SEQ ID NO:11 (TERT541);


(4) a nucleotide sequence comprising nucleotides 10-1782 of SEQ ID NO:11;


(5) the nucleotide sequence of SEQ ID NO:13 (TERT342); and


(6) a nucleotide sequence comprising nucleotides 7-2373 of SEQ ID NO:13.


C-2. Multi-Antigen Constructs


In another aspect, the present disclosure provides nucleic acid molecules that each encode two, three, or more different immunogenic TAA polypeptides. A nucleic acid molecule that encodes more than one immunogenic TAA polypeptide is also referred to as “multi-antigen construct,” “multi-antigen vaccine,” “multi-antigen plasmid,” and the like, in the present disclosure. A nucleic acid molecule that encodes two different immunogenic TAA polypeptides is also referred to as a “dual-antigen construct,” “dual antigen vaccine,” or “dual antigen plasmid,” etc., in this disclosure. A nucleic acid molecule that encodes three different immunogenic TAA polypeptides is also referred to as a “triple-antigen construct,” “triple-antigen vaccine,” or “triple-antigen plasmid” in this disclosure.


Multi-antigen constructs provided by the present disclosure can be prepared using various techniques known in the art in light of the disclosure. For example, a multi-antigen construct can be constructed by incorporating multiple independent promoters into a single plasmid (Huang, Y., Z. Chen, et al. (2008). “Design, construction, and characterization of a dual-promoter multigenic DNA vaccine directed against an HIV-1 subtype C/B’ recombinant.” J Acquir Immune Defic Syndr 47(4): 403-411; Xu, K., Z. Y. Ling, et al. (2011). “Broad humoral and cellular immunity elicited by a bivalent DNA vaccine encoding HA and NP genes from an H5N1 virus.” Viral Immunol 24(1): 45-56). The plasmid can be engineered to carry multiple expression cassettes, each consisting of a) a eukaryotic promoter for initiating RNA polymerase dependent transcription, with or without an enhancer element, b) a gene encoding a target antigen, and c) a transcription terminator sequence. Upon delivery of the plasmid to the transfected cell nucleus, transcription will be initiated from each promoter, resulting in the production of separate mRNAs, each encoding one of the target antigens. The mRNAs will be independently translated, thereby producing the desired antigens.


Multi-antigen constructs provided by the present disclosure can also be constructed through the use of viral 2A peptides (Szymczak, A. L. and D. A. Vignali (2005). “Development of 2A peptide-based strategies in the design of multicistronic vectors.” Expert Opin Biol Ther 5(5): 627-638; de Felipe, P., G. A. Luke, et al. (2006). “E unum pluribus: multiple proteins from a self-processing polyprotein.” Trends Biotechnol 24(2): 68-75; Luke, G. A., P. de Felipe, et al. (2008). “Occurrence, function and evolutionary origins of ‘2A-like’ sequences in virus genomes.” J Gen Virol 89(Pt 4): 1036-1042; Ibrahimi, A., G. Vande Velde, et al. (2009). “Highly efficient multicistronic lentiviral vectors with peptide 2A sequences.” Hum Gene Ther 20(8): 845-860; Kim, J. H., S. R. Lee, et al. (2011). “High cleavage efficiency of a 2A peptide derived from porcine teschovirus-1 in human cell lines, zebrafish and mice.” PLoS One 6(4): e18556). These peptides, also called cleavage cassettes or CHYSELs (cis-acting hydrolase elements), are approximately 20 amino acids long with a highly conserved carboxy terminal D-V/I-EXNPGP motif (Table 19). These peptides are rare in nature, most commonly found in viruses such as Foot-and-mouth disease virus (FMDV), Equine rhinitis A virus (ERAV), Equine rhinitis B virus (ERBV), Encephalomyocarditis virus (EMCV), Porcine teschovirus (PTV), and Thosea asigna virus (TAV) (Luke, G. A., P. de Felipe, et al. (2008). “Occurrence, function and evolutionary origins of ‘2A-like’ sequences in virus genomes.” J Gen Virol 89(Pt 4): 1036-1042). With a 2A-based multi-antigen expression strategy, genes encoding multiple target antigens are linked together in a single open reading frame, separated by sequences encoding viral 2A peptides. The entire open reading frame can be cloned into a vector with a single promoter and terminator. Upon delivery of the constructs to a host cell, mRNA encoding the multiple antigens will be transcribed and translated as a single polyprotein. During translation of the 2A peptides, ribosomes skip the bond between the C-terminal glycine and proline. The ribosomal skipping acts like a cotranslational autocatalytic “cleavage” that releases the peptide sequences upstream of the 2A peptide from those downstream. The incorporation of a 2A peptide between two protein antigens may result in the addition of ˜20 amino acids onto the C-terminus of the upstream polypeptide and 1 amino acid (proline) to the N-terminus of downstream protein. In an adaptation of this methodology, protease cleavage sites can be incorporated at the N terminus of the 2A cassette such that ubiquitous proteases will cleave the cassette from the upstream protein (Fang, J., S. Yi, et al. (2007). “An antibody delivery system for regulated expression of therapeutic levels of monoclonal antibodies in vivo.” Mol Ther 15(6): 1153-1159).


Another strategy for constructing the multi-antigen constructs provided by the present disclosure involves the use of an internal ribosomal entry site, or IRES. Internal ribosomal entry sites are RNA elements found in the 5′ untranslated regions of certain RNA molecules (Bonnal, S., C. Boutonnet, et al. (2003). “IRESdb: the Internal Ribosome Entry Site database.” Nucleic Acids Res 31(1): 427-428). They attract eukaryotic ribosomes to the RNA to facilitate translation of downstream open reading frames. Unlike normal cellular 7-methylguanosine cap-dependent translation, IRES-mediated translation can initiate at AUG codons far within an RNA molecule. The highly efficient process can be exploited for use in multi-cistronic expression vectors (Bochkov, Y. A. and A. C. Palmenberg (2006). “Translational efficiency of EMCV IRES in bicistronic vectors is dependent upon IRES sequence and gene location.” Biotechniques 41(3): 283-284, 286, 288). Typically, two transgenes are inserted into a vector between a promoter and transcription terminator as two separate open reading frames separated by an IRES. Upon delivery of the constructs to a host cell, a single long transcript encoding both transgenes will be transcribed. The first open reading frame (ORF) will be translated in the traditional cap-dependent manner, terminating at a stop codon upstream of the IRES. The second ORF will be translated in a cap-independent manner using the IRES. In this way, two independent proteins can be produced from a single mRNA transcribed from a vector with a single expression cassette.


In some aspects, the present disclosure provides a dual-antigen construct comprising two coding nucleotide sequences, wherein each of the coding nucleotide sequences encodes an individual immunogenic TAA polypeptide. The structure of such a dual-antigen construct is shown in formula (I):





TAA1-SPACER1-TAA2  (1),


wherein in formula (I):


(i) TAA1 and TAA2 are nucleotide sequences each encoding an immunogenic TAA polypeptides selected from the group consisting of an immunogenic MUC1 polypeptide, an immunogenic MSLN polypeptide, and an immunogenic TERT polypeptide, wherein TAA1 and TAA 2 encode different immunogenic TAA polypeptides; and


(ii) SPACER1 is a spacer nucleotide sequence, or may be absent.


In some embodiments, the present disclosure provides a dual-antigen construct of formula (I), wherein in formula (I) TAA1 is a nucleotide sequence encoding an immunogenic MUC1 polypeptide, and TAA2 is a nucleotide sequence encoding an immunogenic MSLN polypeptide or immunogenic TERT polypeptide.


In some other embodiments, the present disclosure provides a dual-antigen construct of formula (I), wherein in formula (I) TAA1 is a nucleotide sequence encoding an immunogenic MSLN polypeptide, and TAA2 is a nucleotide sequence encoding an immunogenic MUC1 polypeptide or immunogenic TERT polypeptide.


In some further embodiments, the present disclosure provides a dual-antigen construct of formula (I), wherein in formula (I) TAA1 is a nucleotide sequence encoding an immunogenic TERT polypeptide, and TAA2 is a nucleotide sequence encoding an immunogenic MUC1 polypeptide or immunogenic MSLN polypeptide.


In some specific embodiments, the present disclosure provides a dual-antigen construct of a formula selected from a group consisting of:





(1) MUC1-2A-TERT  (II)





(2) MUC1-2A-MSLN  (III)





(3) MSLN-2A-TERT  (IV)





(4) MSLN-2A-MUC1  (V)





(5) TERT-2A-MSLN  (VI)





(6) TERT-2A-MUC1  (VII)


wherein in each of formulas (II)-(VII): (i) MUC1, MSLN, and TERT represent a nucleotide sequence encoding an immunogenic MUC1 polypeptide, an immunogenic MSLN polypeptide, and an immunogenic TERT polypeptide, respectively, and (ii) 2A is a nucleotide sequence encoding a 2A peptide.


In some other aspects, the present disclosure provides a triple-antigen construct comprising three coding nucleotide sequences wherein each of the coding nucleotide sequences expresses a different individual immunogenic TAA polypeptide. The structure of a triple-antigen construct is shown in formula (VIII):





TAA1-SPACER1-TAA2-SPACER2-TAA3  (VIII)


wherein in formula (VIII):


(i) TAA1, TAA2, and TAA3 are each a nucleotide sequence encoding an immunogenic TAA polypeptide selected from the group consisting of an immunogenic MUC1 polypeptide, an immunogenic MSLN polypeptide, and an immunogenic TERT polypeptide, wherein TAA1, TAA2, and TAA3 encode different immunogenic TAA polypeptides; and


(ii) SPACER1 and SPACER2 are each a spacer nucleotide sequence, wherein (a) SPACER1 and SPACER2 may be the same or different and (b) either SPACER1 or SPACER2 or both SPACER1 and SPACER2 may be absent.


The term “spacer nucleotide sequence” as used in the present disclosure refers to a nucleotide sequence that is inserted between two coding sequences or transgenes in an open reading frame of a nucleic acid molecule and functions to allow co-expression or translation of two separate gene products from the nucleic acid molecule. Examples of spacer nucleotide sequences that may be used in the multi-antigen constructs provided by the present disclosure include eukaryotic promoters, nucleotide sequences encoding a 2A peptide, and internal ribosomal entry sites (IRES). Examples of 2A peptides include foot-and-mouth disease virus 2A peptide (FMD2A), equine rhinitis A virus 2A peptide (ERA2A), Equine rhinitis B virus 2A peptide (ERB2A), encephalomyocarditis virus 2A peptide (EMC2A), porcine teschovirus 2A peptide (PT2A), and Thosea asigna virus 2A peptide (T2A). The sequences of these 2A peptides are provided in Table 19.


In some embodiments, SPACER1 and SPACER2 are, independently, a nucleotide sequence encoding a 2A peptide, or a nucleotide sequence encoding GGSGG.


In some embodiments, the present disclosure provides a triple-antigen construct of formula (VIII), wherein in formula (VIII) (i) TAA1 is a nucleotide sequence encoding an immunogenic MUC1 polypeptide, (ii) TAA2 is a nucleotide sequence encoding an immunogenic MSLN polypeptide, and (iii) TAA3 is a nucleotide sequence encoding an immunogenic TERT polypeptide.


In some other embodiments, the present disclosure provides a triple-antigen construct of formula (VIII), wherein in formula (VIII) (i) TAA1 is a nucleotide sequence encoding an immunogenic MUC1 polypeptide, (ii) TAA2 is a nucleotide sequence encoding an immunogenic TERT polypeptide, and (iii) TAA3 is a nucleotide sequence encoding an immunogenic MSLN polypeptide.


In some other embodiments, the present disclosure provides a triple-antigen construct of formula (VIII), wherein in formula (VIII) (i) TAA1 is a nucleotide sequence encoding an immunogenic MSLN polypeptide, (ii) TAA2 is a nucleotide sequence encoding an immunogenic TERT polypeptide, and (iii) TAA3 is a nucleotide sequence encoding an immunogenic MUC1 polypeptide.


In some other embodiments, the present disclosure provides a triple-antigen construct of formula (VIII), wherein in formula (VIII) (i) TAA1 is a nucleotide sequence encoding an immunogenic MSLN polypeptide, (ii) TAA2 is a nucleotide sequence encoding an immunogenic MUC1 polypeptide, and (iii) TAA3 is a nucleotide sequence encoding an immunogenic TERT polypeptide.


In some specific embodiments, the present disclosure provides a triple-antigen construct of a formula selected from the group consisting of:





(1) MUC1-2A-MSLN-2A-TERT  (IX)





(2) MUC1-2A-TERT-2A-MSLN  (X)





(3) MSLN-2A-MUC1-2A-TERT  (XI)





(4) MSLN-2A-TERT-2A-MUC1  (XII)





(5) TERT-2A-MUC1-2A-MSLN  (XIII)





(6) TERT-2A-MSLN-2A-MUC1  (XIV)


wherein in each of formulas (IX)-(XIV: (i) MUC1, MSLN, and TERT represent a nucleotide sequence encoding an immunogenic MUC1 polypeptide, an immunogenic MSLN polypeptide, and an immunogenic TERT polypeptide, respectively, and (ii) 2A is a nucleotide sequence encoding a 2A peptide.


The immunogenic MSLN polypeptide encoded by a multi-antigen construct may be a full length MSLN protein or a fragment thereof, such as a cytoplasmic, secreted, or membrane-bound fragment. In some embodiments the multi-antigen construct comprises a nucleotide sequence encoding an immunogenic MSLN polypeptide selected from the group consisting of:


1) a polypeptide comprising, or consisting of, amino acids 37-597 of the amino acid sequence of SEQ ID NO:2;


2) a polypeptide comprising an amino acid sequence that is at least 90%, 95%, 98%, or 99% identical to the amino acid sequence consisting of amino acids 37-597 of the amino acid sequence of SEQ ID NO:2;


3) a polypeptide comprising, or consisting of, the amino acid sequence of SEQ ID NO:6, or amino acids 4-564 of the amino acid sequence of SEQ ID NO:6; and


4) polypeptide comprising an amino acid sequence that has at least 93%-99%, 94%-98%, or 94%-97% identity to the amino acid sequence of SEQ ID NO:8 (“Plasmid 1103 Polypeptide”).


In some particular embodiments the multi-antigen construct comprises a nucleotide sequence of SEQ ID NO:5 or a degenerate variant thereof.


The immunogenic MUC1 polypeptide encoded by a multi-antigen construct may comprise (1) an amino acid sequence of 3 to 30 tandem repeats of 20 amino acids of a human MUC1 protein and (2) the amino acid sequences of the human MUC1 protein that flank the VNTR region. In some embodiments the multi-antigen construct comprises a nucleotide sequence encoding an immunogenic MUC1 polypeptide, wherein the immunogenic MUC1 polypeptide comprises, or consists of, the amino acid sequence selected from the group consisting of:


(1) the amino acid sequence of SEQ ID NO:8 (Plasmid 1027 polypeptide);


(2) an amino acid sequence comprising amino acids 4-537 of SEQ ID NO:8;


(3) an amino acid sequence comprising amino acids 24-537 of SEQ ID NO:8;


(4) the amino acid sequence of SEQ ID NO:16 (Plasmid 1197 polypeptide);


(5) an amino acid sequence comprising amino acids 4-517 of SEQ ID NO:16; and


(6) an amino acid sequence comprising amino acids 4-517 of SEQ ID NO:16, with the proviso that the amino acid at positon 513 is T.


In some particular embodiments, the multi-antigen construct comprises a nucleotide sequence of SEQ ID NO:7, a nucleotide sequence of SEQ ID NO:15, or a degenerate variant of the nucleotide sequence of SEQ ID NO:7 or 15.


The immunogenic TERT polypeptide encoded by a multi-antigen construct may be the full length protein or any truncated form. The full length TERT protein is expected to generate stronger immune responses than a truncated form. However, depending on the specific vector chosen to deliver the construct, the vector may not have the capacity to carry the gene encoding the full TERT protein. Therefore, deletions of some amino acids from the protein may be made such that the transgenes would fit into a particular vector. The deletions of amino acids can be made from the N-terminus, C-terminus, or anywhere in the sequence of the TERT protein. Additional deletions may be made in order to remove the nuclear localization signal, thereby rendering the polypeptides cytoplasmic, increasing access to cellular antigen processing/presentation machinery.


In some embodiments, the amino acids up to position 200, 300, 400, 500, or 600 of the N-terminus of the TERT protein are absent from the immunogenic TERT polypeptides. Mutations of additional amino acids may be introduced in order to inactivate the TERT catalytic domain. Examples of such mutations include D712A and V713T.


In some further embodiments, the multi-antigen construct comprises a nucleotide sequence encoding an immunogenic TERT polypeptide, wherein the immunogenic TERT polypeptide comprises, or consist of, an amino acid sequence selected from the group consisting of;


1) the amino acid sequence of SEQ ID NO:10 (“Plasmid 1112 Polypeptide”; TERT 240);


2) the amino acid sequence of SEQ ID NO:12 (“Plasmid 1330 Polypeptide”; TERT 541); and


3) the amino acid sequence of SEQ ID NO: 14 (“Plasmid 1326 Polypeptide”; TERT 343).


In some particular embodiments, the multi-antigen construct comprises the nucleotide sequence of SEQ ID NO:9, 11, or 13, or a degenerate variant of the nucleotide sequence of SEQ ID NO:9, 11, or 13.


In some particular embodiments, the present disclosure provides a dual antigen construct encoding an immunogenic MUC1 polypeptide and an immunogenic MSLN polypeptide, which comprises a nucleotide sequence selected from the group consisting of:


(1) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:18, 20, 22, or 24;


(2) the nucleotide sequence of SEQ ID NO:17, 19, 21, or 23; and


(3) a degenerate variant of the nucleotide sequence of SEQ ID NO:17, 19, 21, or 23.


In some other particular embodiments, the present disclosure provides a dual antigen construct encoding an immunogenic MUC1 polypeptide and an immunogenic TERT polypeptide, which comprises a nucleotide sequence selected from the group consisting of:


(1) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:26, 28, 30, 32, or 34;


(2) a nucleotide sequence of SEQ ID NO:25, 27, 29, 31, or 33; and


(3) a degenerate variant of the nucleotide sequence of SEQ ID NO:25, 27, 29, 31, or 33.


In some other particular embodiments, the present disclosure provides a dual antigen construct encoding an immunogenic MSLN polypeptide and an immunogenic TERT polypeptide, which comprises a nucleotide sequence selected from the group consisting of:


(1) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:36, 38, 40, or 42;


(2) the nucleotide sequence of SEQ ID NO:35, 37, 39, or 41; and


(3) a degenerate variant of the nucleotide sequence of SEQ ID NO:35, 37, 39, or 41.


In some other particular embodiments, the present disclosure provides a triple-antigen construct encoding an immunogenic MUC1 polypeptide, an immunogenic MSLN polypeptide, and an immunogenic TERT polypeptide, which comprises a nucleotide sequence selected from the group consisting of:


(1) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, or 66;


(2) the nucleotide sequence of SEQ ID NO:43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, or 65; and


(3) a degenerate variant of the nucleotide sequence of SEQ ID NO: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, or 65.


D. Vectors Containing a Nucleic Acid Molecule Encoding an Immunogenic TAA Polypeptide

Another aspect of the invention relates to vectors containing one or more of any of the nucleic acid molecules provided by the present disclosure, including single antigen constructs, dual-antigen constructs, triple-antigen constructs, and other multi-antigen constructs. The vectors are useful for cloning or expressing the immunogenic TAA polypeptides encoded by the nucleic acid molecules, or for delivering the nucleic acid molecule in a composition, such as a vaccine, to a host cell or to a host subject, such as a human. In some particular embodiments, the vector comprises a triple-antigen construct encoding an immunogenic MUC1 polypeptide, an immunogenic MSLN polypeptide, and an immunogenic TERT polypeptide, wherein the triple-antigen construct which comprises a nucleotide sequence selected from the group consisting of:


(1) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, or 66;


(2) the nucleotide sequence of SEQ ID NO:43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, or 65; and


(3) a degenerate variant of the nucleotide sequence of SEQ ID NO: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, or 65.


A wide variety of vectors may be prepared to contain and express a nucleic acid molecule of the invention, such as plasmid vectors, cosmid vectors, phage vectors, and viral vectors.


In some embodiments, the disclosure provides a plasmid-based vector containing a nucleic acid molecule of the invention. Examples of suitable plasmid vectors include pBR325, pUC18, pSKF, pET23D, and pGB-2. Other examples of plasmid vectors, as well as method of constructing such vectors, are described in U.S. Pat. Nos. 5,580,859, 5,589,466, 5,688,688, 5,814,482, and 5,580,859.


In other embodiments, the present invention provides vectors that are constructed from viruses, such as retroviruses, alphaviruses, and adenoviruses. Examples of retroviral vectors are described in U.S. Pat. Nos. 5,219,740, 5,716,613, 5,851,529, 5,591,624, 5,716,826, 5,716,832, and 5,817,491. Representative examples of vectors that can be generated from alphaviruses are described in U.S. Pat. Nos. 5,091,309 and 5,217,879, 5,843,723, and 5,789,245.


In some particular embodiments, the present disclosure provides adenoviral vectors that comprise a nucleic acid sequence of non-human primate adenoviruses, such as simian adenoviruses. Examples of such adenoviral vectors, as well as their preparation, are described in PCT application publications WO2005/071093 and WO 2010/086189, and include non-replicating vectors constructed from simian adenoviruses, such as ChAd3, ChAd4, ChAd5, ChAd7, ChAd8, ChAd9, ChAd10, ChAd11, ChAd16, ChAd17, ChAd19, ChAd20, ChAd22, ChAd24, ChAd26, ChAd30, ChAd31, ChAd37, ChAd38, ChAd44, ChAd63, ChAd68, ChAd82, ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, Pan Ad2, and Pan Ad3, and replication-competent vectors constructed simian adenoviruses Ad4 or Ad7. It is preferred that in constructing the adenoviral vectors from the simian adenoviruses one or more of the early genes from the genomic region of the virus selected from E1A, E1B, E2A, E2B, E3, and E4 are either deleted or rendered non-functional by deletion or mutation. In a particular embodiment, the vector is constructed from ChAd3 or ChAd68. Suitable vectors can also be generated from other viruses such as: (1) pox viruses, such as canary pox virus or vaccinia virus (Fisher-Hoch et al., PNAS 86:317-321, 1989; Flexner et al., Ann. N.Y. Acad. Sci. 569:86-103, 1989; Flexner et al., Vaccine 8:17-21, 1990; U.S. Pat. Nos. 4,603,112, 4,769,330 and 5,017,487; WO 89/01973); (2) SV40 (Mulligan et al., Nature 277:108-114, 1979); (3) herpes (Kit, Adv. Exp. Med. Biol. 215:219-236, 1989; U.S. Pat. No. 5,288,641); and (4) lentivirus such as HIV (Poznansky, J. Virol. 65:532-536, 1991).


Methods of constructing vectors are well known in the art. Expression vectors typically include one or more control elements that are operatively linked to the nucleic acid sequence to be expressed. The term “control elements” refers collectively to promoter regions, polyadenylation signals, transcription termination sequences, upstream regulatory domains, origins of replication, internal ribosome entry sites (“IRES”), enhancers, and the like, which collectively provide for the replication, transcription, and translation of a coding sequence in a recipient cell. Not all of these control elements need always be present so long as the selected coding sequence is capable of being replicated, transcribed, and translated in an appropriate host cell. The control elements are selected based on a number of factors known to those skilled in that art, such as the specific host cells and source or structures of other vector components. For enhancing the expression of an immunogenic TAA polypeptide, a Kozak sequence may be provided upstream of the sequence encoding the immunogenic TAA polypeptide. For vertebrates, a known Kozak sequence is (GCC)NCCATGG, wherein N is A or G and GCC is less conserved. Exemplary Kozak sequences that may be used include GAACATGG, ACCAUGG and ACCATGG.


E. Compositions Comprising an Immunogenic TAA Polypeptide (Polypeptide Compositions)

In another aspect, the present disclosure provides polypeptide compositions, which comprise one or more isolated immunogenic TAA polypeptides provided by the present disclosure (“polypeptide composition”). In some embodiments, the polypeptide composition is an immunogenic composition useful for eliciting an immune response against a TAA protein in a subject, such as a mouse, dog, nonhuman primates or human. In some other embodiments the polypeptide composition is a pharmaceutical composition for administration to a subject, such as a human. In still other embodiments, the polypeptide composition is a vaccine composition useful for immunization of a mammal, such as a human, for inhibiting abnormal cell proliferation, for providing protection against the development of cancer (used as a prophylactic), or for treatment of disorders (used as a therapeutic) associated with TAA over expression, such as cancers.


A polypeptide composition provided by the present disclosure may contain a single type of immunogenic TAA polypeptide, such an immunogenic MSLN polypeptide, an immunogenic MUC1 polypeptide, or an immunogenic TERT polypeptide. A composition may also contain a combination of two or more different types of immunogenic TAA polypeptides. For example, a polypeptide composition may contain immunogenic TAA polypeptides in any of the following combinations:


1) an immunogenic MSLN polypeptide and an immunogenic MUC1 polypeptide;


2) an immunogenic MSLN polypeptide and a TERT polypeptide; or


3) an immunogenic MSLN polypeptide, an immunogenic MUC1 polypeptide, and a TERT polypeptide.


In some embodiments, a polypeptide composition provided by the present disclosure, such as an immunogenic composition, a pharmaceutical composition, or a vaccine composition, further comprises a pharmaceutically acceptable excipient. Pharmaceutically acceptable excipients suitable for immunogenic, pharmaceutical, or vaccine compositions are known in the art. Examples of suitable excipients that may be used in the compositions include biocompatible oils, such as rape seed oil, sunflower oil, peanut oil, cotton seed oil, jojoba oil, squalan, squalene, physiological saline solution, preservatives and osmotic pressure controlling agents, carrier gases, pH-controlling agents, organic solvents, hydrophobic agents, enzyme inhibitors, water absorbing polymers, surfactants, absorption promoters, pH modifiers, and anti-oxidative agents.


The immunogenic TAA polypeptide in a composition, particularly an immunogenic composition or a vaccine composition, may be linked to, conjugated to, or otherwise incorporated into a carrier for administration to a subject. The term “carrier” refers to a substance or structure that an immunogenic polypeptide can be attached to or otherwise associated with for delivery of the immunogenic polypeptide to the subject. The carrier itself may be immunogenic. Examples of carriers include immunogenic polypeptides, immune CpG islands, limpet hemocyanin (KLH), tetanus toxoid (TT), cholera toxin subunit B (CTB), bacteria or bacterial ghosts, liposome, chitosome, virosomes, microspheres, dendritic cells, or their like. One or more immunogenic TAA polypeptide molecules may be linked to a single carrier molecule. Methods for linking an immunogenic polypeptide to a carrier are known in the art,


A vaccine composition or immunogenic composition provided by the present disclosure may be used in conjunction or combination with one or more immune modulators or adjuvants. The immune modulators or adjuvants may be formulated separately from the vaccine composition or immunogenic composition, or they may be part of the same composition formulation. Thus, in some embodiments, the present disclosure provides a vaccine composition that further comprises one or more immune modulators or adjuvants. Examples of immune modulators and adjuvants are provided herein below.


The polypeptide compositions, including the immunogenic and vaccine compositions, can be prepared in any suitable dosage forms, such as liquid forms (e.g., solutions, suspensions, or emulsions) and solid forms (e.g., capsules, tablets, or powder), and by methods known to one skilled in the art.


F. Compositions Comprising an Immunogenic TAA Nucleic Acid Molecule (Nucleic Acid Compositions)

The present disclosure also provides nucleic acid compositions, which comprise an isolated nucleic acid molecule or vector provided by the present disclosure (“nucleic acid composition”). The nucleic acid compositions are useful for eliciting an immune response against a TAA protein in vitro or in vivo in a subject, including a human. In some embodiments, the nucleic acid compositions are immunogenic compositions or pharmaceutical compositions.


In some particular embodiments, the nucleic acid composition is a DNA vaccine composition for administration to a subject, such as a human for (1) inhibiting abnormal cell proliferation, providing protection against the development of cancer (used as a prophylactic), (2) treatment of cancer (used as a therapeutic) associated with TAA over-expression, or (3) eliciting an immune response against a particular human TAA, such as MSLN, MUC1, or TERT. The nucleic acid molecule in the composition may be a “naked” nucleic acid molecule, i.e., simply in the form of an isolated DNA free from elements that promote transfection or expression. Alternatively, the nucleic acid molecule in the composition is incorporated into a vector, such as a plasmid vector or a viral vector.


A nucleic acid composition provided by the present disclosure may comprise individual isolated nucleic acid molecules that each encode only one type of immunogenic TAA polypeptide, such as an immunogenic MSLN polypeptide, an immunogenic MUC1 polypeptide, or an immunogenic TERT polypeptide.


A nucleic acid composition may comprise a multi-antigen construct that encodes two or more types of immunogenic TAA polypeptides. For example, a multi-antigen construct may encode two or more immunogenic TAA polypeptides in any of the following combinations:


(1) an immunogenic MSLN polypeptide and an immunogenic MUC1 polypeptide;


(2) an immunogenic MSLN polypeptide and an immunogenic TERT polypeptide;


(3) an immunogenic MUC1 polypeptide and an immunogenic TERT polypeptide; and


(4) an immunogenic MSLN polypeptide, an immunogenic MUC1 polypeptide, and an immunogenic TERT polypeptide.


In some particular embodiments, the compositions provided by the present disclosure comprise a dual antigen construct comprising a nucleotide sequence selected from the group consisting of:


(1) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:18, 20, 22, or 24, 26, 28, 30, 32, or 34, 36, 38, 30, 40, or 42;


(2) the nucleotide sequence of SEQ ID NO:17, 19, 21, or 23, 25, 27, 29, 31, or 33, 35, 37, 39, or 41; and


(3) a degenerate variant of the nucleotide sequence of SEQ ID NO:17, 19, 21, or 23, 25, 27, 29, 31, or 33, 35, 37, 39, or 41.


In some other particular embodiments, the compositions provided by the present disclosure comprise a triple-antigen construct comprising a nucleotide sequence selected from the group consisting of:


(1) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, or 66;


(2) the nucleotide sequence of SEQ ID NO:43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, or 65; and


(3) a degenerate variant of the nucleotide sequence of SEQ ID NO: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, or 65.


The nucleic acid compositions, such as a pharmaceutical composition or a DNA vaccine composition, may further comprise a pharmaceutically acceptable excipient. Pharmaceutical acceptable excipients suitable for nucleic acid compositions, including DNA vaccine compositions, are well known to those skilled in the art. Such excipients may be aqueous or nonaqueous solutions, suspensions, and emulsions. Examples of non-aqueous excipients include propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Examples of aqueous excipient include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Suitable excipients also include agents that assist in cellular uptake of the polynucleotide molecule. Examples of such agents are (i) chemicals that modify cellular permeability, such as bupivacaine, (ii) liposomes or viral particles for encapsulation of the polynucleotide, or (iii) cationic lipids or silica, gold, or tungsten microparticles which associate themselves with the polynucleotides. Anionic and neutral liposomes are well-known in the art (see, e.g., Liposomes: A Practical Approach, RPC New Ed, IRL press (1990), for a detailed description of methods for making liposomes) and are useful for delivering a large range of products, including polynucleotides. Cationic lipids are also known in the art and are commonly used for gene delivery. Such lipids include Lipofectin™ also known as DOTMA (N-[I-(2,3-dioleyloxy) propyls N,N, N-trimethylammonium chloride), DOTAP (1,2-bis (oleyloxy)-3 (trimethylammonio) propane), DDAB (dimethyldioctadecyl-ammonium bromide), DOGS (dioctadecylamidologlycyl spermine) and cholesterol derivatives such as DCChol (3 beta-(N-(N′,N′-dimethyl aminomethane)-carbamoyl) cholesterol). A description of these cationic lipids can be found in EP 187,702, WO 90/11092, U.S. Pat. No. 5,283,185, WO 91/15501, WO 95/26356, and U.S. Pat. No. 5,527,928. A particular useful cationic lipid formulation that may be used with the nucleic acid compositions provided by the disclosure is VAXFECTIN, which is a commixture of a cationic lipid (GAP-DMORIE) and a neutral phospholipid (DPyPE) which, when combined in an aqueous vehicle, self-assemble to form liposomes. Cationic lipids for gene delivery are preferably used in association with a neutral lipid such as DOPE (dioleyl phosphatidylethanolamine), as described in WO 90/11092 as an example. In addition, a nucleic acid construct, such as a DNA construct, can also be formulated with a nonionic block copolymer such as CRL1005.


A nucleic acid composition provided by the present disclosure, such as a pharmaceutical composition or immunogenic composition, may be used in conjunction or combination with one or more immune modulators. The nucleic acid composition, such as a pharmaceutical composition or immunogenic composition, may also be used in conjunction or combination with one or more adjuvants. Further, the nucleic acid composition may be used in conjunction or combination with one or more immune modulators and one or more adjuvants. The immune modulators or adjuvants may be formulated separately from the nucleic composition, or they may be part of the same composition formulation. Thus, in some embodiments, the present disclosure provides a nucleic acid vaccine composition that further comprises one or more immune modulators and/or one or more adjuvants. Examples of immune modulators and adjuvants are provided herein below.


The nucleic acid compositions, including vaccine compositions, can be prepared in any suitable dosage forms, such as liquid forms (e.g., solutions, suspensions, or emulsions) and solid forms (e.g., capsules, tablets, or powder), and by methods known to one skilled in the art.


G. Uses of the Immunogenic TAA Polypeptides, Nucleic Acid Molecules, and Compositions

In other aspects, the present disclosure provides methods of using the immunogenic TAA polypeptides, isolated nucleic acid molecules, and compositions described herein above. In one aspect, the present disclosure provides a method of eliciting an immune response against a TAA in a subject, particularly a human, comprising administering to the subject an effective amount of (1) an immunogenic TAA polypeptide that is immunogenic against the target TAA, (2) an isolated nucleic acid molecule encoding one or more immunogenic TAA polypeptides, (3) a composition comprising one or more immunogenic TAA polypeptides, or (4) a composition comprising an isolated nucleic acid molecule encoding one or more immunogenic TAA polypeptides. In some embodiments, the disclosure provides a method of eliciting an immune response against MSLN in a subject, comprising administering to the subject an effective amount of an immunogenic MSLN composition provided by the present disclosure, wherein the immunogenic MSLN composition is selected from: (1) an immunogenic MSLN polypeptide, (2) an isolated nucleic acid molecule encoding an immunogenic MSLN polypeptide, (3) a composition comprising an immunogenic MSLN polypeptide, or (4) a composition comprising an isolated nucleic acid molecule encoding an immunogenic MSLN polypeptide. In some other embodiments, the disclosure provides a method of eliciting an immune response against MUC1 in a subject, comprising administering to the subject an effective amount of an immunogenic MUC1 composition provided by the present disclosure, wherein the immunogenic MUC1 composition is selected from: (1) an immunogenic MUC1 polypeptide, (2) an isolated nucleic acid molecule encoding an immunogenic MUC1 polypeptide, (3) a composition comprising an immunogenic MUC1 polypeptide, or (4) a composition comprising an isolated nucleic acid molecule encoding an immunogenic MUC1 polypeptide. In some embodiments, the disclosure provides a method of eliciting an immune response against TERT in a subject, comprising administering to the subject an effective amount of an immunogenic TERT composition provided by the present disclosure, wherein the immunogenic TERT composition is selected from: (1) an immunogenic TERT polypeptide, (2) an isolated nucleic acid molecule encoding an immunogenic TERT polypeptide, (3) a composition comprising an immunogenic TERT polypeptide, or (4) a composition comprising an isolated nucleic acid molecule encoding an immunogenic TERT polypeptide.


In another aspect, the present disclosure provides a method of inhibiting abnormal cell proliferation in a human, wherein the abnormal cell proliferation is associated with over-expression of a TAA. The method comprises administering to the human an effective amount of immunogenic TAA composition provided by the present disclosure that is immunogenic against the over-expressed TAA. The immunogenic TAA composition may be (1) an immunogenic TAA polypeptide, (2) an isolated nucleic acid molecule encoding one or more immunogenic TAA polypeptides, (3) a composition comprising an immunogenic TAA polypeptide, or (4) a composition comprising an isolated nucleic acid molecule encoding one or more immunogenic TAA polypeptides. The abnormal cell proliferation may be in any organ or tissues of a human, such as breast, stomach, ovaries, lungs, bladder, large intestine (e.g., colon and rectum), kidneys, pancreas, and prostate. In some embodiments, the method is for inhibiting abnormal cell proliferation in the breast, ovaries, pancreas, colon, lung, stomach, and rectum.


In another aspect, the present disclosure provides a method of treating cancer in a human wherein the cancer is associated with over-expression of a TAA. The method comprises administering to the human an effective amount of immunogenic TAA composition capable of eliciting an immune response against the over-expressed TAA. The immunogenic TAA composition may be (1) an immunogenic TAA polypeptide, (2) an isolated nucleic acid molecule encoding one or more immunogenic TAA polypeptides, (3) a composition comprising an immunogenic TAA polypeptide, or (4) a composition comprising an isolated nucleic acid molecule encoding one or more immunogenic TAA polypeptides.


In some embodiments, the disclosure provides a method of treating a cancer in a human, comprising administering to the human an effective amount of a nucleic acid composition provided herein above. The nucleic acids in the composition may be a single-antigen construct encoding only one particular immunogenic TAA polypeptide, such as an immunogenic MSLN polypeptide, an immunogenic MUC1 polypeptide, or an immunogenic TERT polypeptide. The nucleic acids in the composition may also be a multi-antigen construct encoding two, three, or more different immunogenic TAA polypeptides. In some specific embodiments, the disclosure provides a method of treating a cancer in a human, comprising administering to the human an effective amount of a composition comprising a dual-antigen construct. The dual-antigen construct may encode any two different immunogenic TAA polypeptides selected from: (1) an immunogenic MSLN polypeptide and an immunogenic MUC1 polypeptide; (2) an immunogenic MSLN polypeptide and an immunogenic TERT polypeptide; (3) an immunogenic TERT polypeptide and an immunogenic MUC1 polypeptide.


In some other specific embodiments, the disclosure provides a method of treating a cancer in a human, wherein the cancer is associated with over-expression of one or more TAAs selected from MUC1, MSLN, and TERT, which method comprises administering to the human an effective amount of a composition comprising a triple-antigen construct encoding an immunogenic MSLN polypeptide, an immunogenic MUC1 polypeptide, and an immunogenic TERT polypeptide.


Any cancer that over-expresses the tumor-associate antigen MUC1, MSLN, and/or TERT may be treated by a method provided by the present disclosure. Examples of cancers include breast cancer, ovarian cancer, lung cancer (such as small cell lung cancer and non-small cell lung cancer), colorectal cancer, gastric cancer, and pancreatic cancer. In some particular embodiments, the present disclosure provide a method of treating cancer in a human, which comprises administering to the human an effective amount of a composition comprising a triple-antigen construct, wherein the cancer is (1) breast cancer, such as triple-negative breast cancer, (2) pancreatic cancer, such as pancreatic ductal adenocarcinoma, or (3) ovarian cancer, such as ovarian adenocarcinoma.


The polypeptide and nucleic acid compositions can be administered to a subject, including human (such as a human patient), by a number of suitable methods known in the art. Examples of suitable methods include: (1) intramuscular, intradermal, intraepidermal, or subcutaneous administration, (2) oral administration, and (3) topical application (such as ocular, intranasal, and intravaginal application). One particular method of intradermal or intraepidermal administration of a nucleic acid composition that may be used is gene gun delivery using the Particle Mediated Epidermal Delivery (PMED™) DNA delivery device marketed by PowderMed. PMED is a needle-free method of administering DNAs to animals or humans. The PMED system involves the precipitation of DNA onto microscopic gold particles that are then propelled by helium gas into the epidermis. The DNA-coated gold particles are delivered to the APCs and keratinocytes of the epidermis, and once inside the nuclei of these cells, the DNA elutes off the gold and becomes transcriptionally active, producing encoded protein. One particular method for intramuscular administration of a nucleic acid composition is electroporation. Electroporation uses controlled electrical pulses to create temporary pores in the cell membrane, which facilitates cellular uptake of the nucleic acid composition injected into the muscle. Where a CpG is used in combination with a nucleic acid composition, the CpG and nucleic acid composition may be co-formulated in one formulation and the formulation is administered intramuscularly by electroporation.


The effective amount of the immunogenic TAA polypeptide or nucleic acid encoding an immunogenic TAA polypeptide in the composition to be administered to a subject, such as human patient, a given method provided by the present disclosure can be readily determined by a person skilled in the art and will depend on a number of factors. In a method of treating cancer, such as pancreatic cancer, ovarian cancer, and breast cancer, factors that may be considered in determining the effective amount of the immunogenic TAA polypeptide or nucleic acid include, but not limited: (1) the subject to be treated, including the subject's immune status and health, (2) the severity or stage of the cancer to be treated, (3) the specific immunogenic TAA polypeptides used or expressed, (4) the degree of protection or treatment desired, (5) the administration method and schedule, and (6) other therapeutic agents (such as adjuvants or immune modulators) used. In the case of nucleic acid vaccine compositions, including the multi-antigen vaccine compositions, the method of formulation and delivery are among the key factors for determining the dose of the nucleic acid required to elicit an effective immune response. For example, the effective amounts of the nucleic acid may be in the range of 2 μg/dose-10 mg/dose when the nucleic acid vaccine composition is formulated as an aqueous solution and administered by hypodermic needle injection or pneumatic injection, whereas only 16 ng/dose-16 μg/dose may be required when the nucleic acid is prepared as coated gold beads and delivered using a gene gun technology. The dose range for a nucleic acid vaccine by electroporation is generally in the range of 0.5-10 mg/dose. In the case where the nucleic acid vaccine is administered together with a CpG by electroporation in a co-formulation, the dose of the nucleic acid vaccine may be in the range of 0.5-5 mg/dose and the dose of CpG is typically in the range of 0.05 mg-5 mg/dose, such as 0.05, 0.2, 0.6, or 1.2 mg/dose per person. The nucleic acid or polypeptide vaccine compositions of the present invention can be used in a prime-boost strategy to induce robust and long-lasting immune response. Priming and boosting vaccination protocols based on repeated injections of the same immunogenic construct are well known. In general, the first dose may not produce protective immunity, but only “primes” the immune system. A protective immune response develops after the second or third dose (the “boosts”). The boosts are performed according to conventional techniques, and can be further optimized empirically in terms of schedule of administration, route of administration, choice of adjuvant, dose, and potential sequence when administered with another vaccine. In one embodiment, the nucleic acid or polypeptide vaccines of the present invention are used in a conventional homologous prime-boost strategy, in which the same vaccine is administered to the animal in multiple doses. In another embodiment, the nucleic acid or polypeptide vaccine compositions are used in a heterologous prime-boost vaccination, in which different types of vaccines containing the same antigens are administered at predetermined time intervals. For example, a nucleic acid construct may be administered in the form of a plasmid in the initial dose (“prime”) and as part of a vector in the subsequent doses (“boosts”), or vice versa.


The polypeptide or nucleic acid immunogenic compositions of the present disclosure may be used together with one or more adjuvants. Examples of suitable adjuvants include: (1) oil-in-water emulsion formulations (with or without other specific immunostimulating agents such as muramyl polypeptides or bacterial cell wall components), such as (a) MF59™ (PCT Publication No. WO 90/14837; Chapter 10 in Vaccine design: the subunit and adjuvant approach, eds. Powell & Newman, Plenum Press 1995), containing 5% Squalene, 0.5% Tween 80 (polyoxyethylene sorbitan mono-oleate), and 0.5% Span 85 (sorbitan trioleate) formulated into submicron particles using a microfluidizer, (b) SAF, containing 10% Squalene, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP either microfluidized into a submicron emulsion or vortexed to generate a larger particle size emulsion, and (c) RIBI™ adjuvant system (RAS) (Ribi Immunochem, Hamilton, Mont.) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall components such as monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS); (2) saponin adjuvants, such as QS21, STIMULON™ (Cambridge Bioscience, Worcester, Mass.), Abisco® (Isconova, Sweden), or Iscomatrix® (Commonwealth Serum Laboratories, Australia); (3) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IFA); (4) cytokines, such as interleukins (e.g. IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12 (PCT Publication No. WO 99/44636), etc.), interferons (e.g. gamma interferon), macrophage colony stimulating factor (M-CSF), and tumor necrosis factor (TNF); (5) monophosphoryl lipid A (MPL) or 3-O-deacylated MPL (3dMPL), (WO 00/56358); (6) combinations of 3dMPL with QS21 and/or oil-in-water emulsions (EP-A-0835318, EP-A-0735898, EP-A-0761231); (7) oligonucleotides comprising CpG motifs, i.e. containing at least one CG dinucleotide, where the cytosine is unmethylated (WO 98/40100, WO 98/55495, WO 98/37919 and WO 98/52581); (8) a polyoxyethylene ether or a polyoxyethylene ester (WO 99/52549); (9) a polyoxyethylene sorbitan ester surfactant in combination with an octoxynol (WO 01/21207) or a polyoxyethylene alkyl ether or ester surfactant in combination with at least one additional non-ionic surfactant such as an octoxynol (WO 01/21152); (10) a saponin and an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) (WO 00/62800); (11) metal salt, including aluminum salts (also known as alum), such as aluminum phosphate and aluminum hydroxide; (12) a saponin and an oil-in-water emulsion (WO 99/11241); and (13) a combination of saponin (e.g. QS21), 3dMPL, and 1M2 (WO 98/57659).


Further, for the treatment of a neoplastic disorder, including a cancer, in a subject, such as a human patient, the polypeptide or nucleic acid compositions, including vaccine compositions, provided by the present disclosure may be administered in combination with one or more immune modulators. The immune modulator may be an immune-suppressive-cell inhibitor (ISC inhibitor) or an immune-effector-cell enhancer (IEC enhancer). Further, one or more ISC inhibitors may be used in combination with one or more IEC enhancers. The immune modulators may be administered by any suitable methods and routes, including (1) systemic administration such as intravenous, intramuscular, or oral administration, and (2) local administration such intradermal and subcutaneous administration. Where appropriate or suitable, local administration is generally preferred over systemic administration. Local administration of any immune modulators can be carried out at any location of the body of the subject that is suitable for local administration of pharmaceuticals; however, it is more preferable that these immune modulators are administered locally at close proximity to the vaccine draining lymph node.


The compositions, such as a vaccine, may be administered simultaneously or sequentially with any or all of the immune modulators used. Similarly, when two or more immune modulators are used, they may be administered simultaneously or sequentially with respect to each other. In some embodiments, a vaccine is administered simultaneously (e.g., in a mixture) with respect to one immune modulator, but sequentially with respect to one or more additional immune modulators. Co-administration of the vaccine and the immune modulators can include cases in which the vaccine and at least one immune modulator are administered so that each is present at the administration site, such as vaccine draining lymph node, at the same time, even though the antigen and the immune modulators are not administered simultaneously. Co-administration of the vaccine and the immune modulators also can include cases in which the vaccine or the immune modulator is cleared from the administration site, but at least one cellular effect of the cleared vaccine or immune modulator persists at the administration site, such as vaccine draining lymph node, at least until one or more additional immune modulators are administered to the administration site. In cases where a nucleic acid vaccine is administered in combination with a CpG, the vaccine and CpG may be contained in a single formulation and administered together by any suitable method. In some embodiments, the nucleic acid vaccine and CpG in a co-formulation (mixture) is administered by intramuscular injection in combination with electroporation.


In some embodiments, the immune modulator that is used in combination with the polypeptide or nucleic acid composition is an ISC inhibitor. Examples of SIC inhibitors include (1) protein kinase inhibitors, such as imatinib, sorafenib, lapatinib, BIRB-796, and AZD-1152, AMG706, Zactima (ZD6474), MP-412, sorafenib (BAY 43-9006), dasatinib, CEP-701 (lestaurtinib), XL647, XL999, Tykerb (lapatinib), MLN518, (formerly known as CT53518), PKC412, ST1571, AEE 788, OSI-930, OSI-817, sunitinib malate (SUTENT), axitinib (AG-013736), erlotinib, gefitinib, axitinib, bosutinib, temsirolismus and nilotinib (AMN107). In some particular embodiments, the tyrosine kinase inhibitor is sunitinib, sorafenib, or a pharmaceutically acceptable salt or derivative (such as a malate or a tosylate) of sunitinib or sorafenib; (2) cyclooxygenase-2 (COX-2) inhibitors, such as celecoxib and rofecoxib; (3) phosphodiesterase type 5 (PDE5) inhibitors, such as Examples of PDE5 inhibitors include avanafil, lodenafil, mirodenafil, sildenafil, tadalafil, vardenafil, udenafil, and zaprinast, and (4) DNA crosslinkers, such as cyclophosphamide.


In some embodiments, the immune modulator that is used in combination with the polypeptide or nucleic acid composition is an IEC enhancer. Two or more IEC enhancers may be used together. Examples of IEC enhancers that may be used include: (1) TNFR agonists, such as agonists of OX40, 4-1BB (such as BMS-663513), GITR (such as TRX518), and CD40 (such as CD40 agonistic antibodies); (2) CTLA-4 inhibitors, such as is Ipilimumab and Tremelimumab; (3) TLR agonists, such as CpG 7909 (5′ TCGTCGTTTTGTCGTTTTGTCGTT3′), CpG 24555 (5′ TCGTCGTTTTTCGGTGCTTTT3′ (CpG 24555); and CpG 10103 (5′ TCGTCGTTTTTCGGTCGTTTT3′); (4) programmed cell death protein 1 (PD-1) inhibitors, such as nivolumab and pembrolizumab; and (5) PD-L1 inhibitors, such as atezolizumab, durvalumab, and velumab; and (6) IDO1 inhibitors.


In some embodiments, the IEC enhancer is CD40 agonist antibody, which may be a human, humanized or part-human chimeric anti-CD40 antibody. Examples of specific CD40 agonist antibodies include the G28-5, mAb89, EA-5 or S2C6 monoclonal antibody, and CP870,893. CP-870,893 is a fully human agonistic CD40 monoclonal antibody (mAb) that has been investigated clinically as an anti-tumor therapy. The structure and preparation of CP870,893 is disclosed in WO2003041070 (where the antibody is identified by the internal identified “21.4.1” and the amino acid sequences of the heavy chain and light chain of the antibody are set forth in SEQ ID NO: 40 and SEQ ID NO: 41, respectively). For use in combination with a composition present disclosure, CP-870,893 may be administered by any suitable route, such as intradermal, subcutaneous, or intramuscular injection. The effective amount of CP870893 is generally in the range of 0.01-0.25 mg/kg. In some embodiment, CP870893 is administered at an amount of 0.05-0.1 mg/kg.


In some other embodiments, the IEC enhancer is a CTLA-4 inhibitor, such as Ipilimumab and Tremelimumab. Ipilimumab (also known as MEX-010 or MDX-101), marketed as YERVOY, is a human anti-human CTLA-4 antibody. Ipilimumab can also be referred to by its CAS Registry No. 477202-00-9, and is disclosed as antibody 10DI in PCT Publication No. WO 01/14424. Tremelimumab (also known as CP-675,206) is a fully human IgG2 monoclonal antibody and has the CAS number 745013-59-6. Tremelimumab is disclosed in U.S. Pat. No. 6,682,736, incorporated herein by reference in its entirety, where it is identified as antibody 11.2.1 and the amino acid sequences of its heavy chain and light chain are set forth in SEQ ID NOs:42 and 43, respectively. For use in combination with a composition provided by the present disclosure, Tremelimumab may be administered locally, particularly intradermally or subcutaneously. The effective amount of Tremelimumab administered intradermally or subcutaneously is typically in the range of 5-200 mg/dose per person. In some embodiments, the effective amount of Tremelimumab is in the range of 10-150 mg/dose per person per dose. In some particular embodiments, the effective amount of Tremelimumab is about 10, 25, 50, 75, 100, 125, 150, 175, or 200 mg/dose per person.


In some other embodiments, the immune modulator is a PD-1 inhibitor or PD-L1 inhibitor, such as nivolumab, pembrolizumab, RN888 (anti-PD-1 antibody), Atezolizumab (PD-L1-specific mAbs from Roche), Durvalumab (PD-L1-specific mAbs from Astra Zeneca), and Avelumab (PD-L1-specific mAbs from Merck). (Okazaki T et al., International Immunology (2007); 19, 7:813-824, Sunshine J et al., Curr Opin Pharmacol. 2015 August; 23:32-8).


In other embodiments, the present disclosure provides use of an immune modulator with a vaccine, including anti-cancer vaccines, wherein the immune modulator is an inhibitor of indoleamine 2,3-dioxygenase 1 (also known as “IDO1”). IDO1 was found to modulate immune cell function to a suppressive phenotype and was, therefore, believed to partially account for tumor escape from host immune surveillance. The enzyme degrades the essential amino acid tryptophan into kynurenine and other metabolites. It was found that these metabolites and the paucity of tryptophan leads to suppression of effector T-cell function and augmented differentiation of regulatory T cells. The IDO1 inhibitors may be large molecules, such as an antibody, or a small molecule, such as a chemical compound.


In some particular embodiments, the polypeptide or nucleic acid composition provided by the present disclosure is used in combination with a 1,2,5-oxadiazole derivative IDO1 inhibitor disclosed in WO2010/005958. Examples of specific 1,2,5-oxadiazole derivative IDO1 inhibitors include the following compounds:

  • 4-({2-[(aminosulfonyl)amino]ethyl}amino)-N-(3-bromo-4-fluorophenyl)-N′-hydroxy-1,2,5-oxadiazole-3-carboximidamide;
  • 4-({2 [(aminosulfonyl)amino]ethyl} amino)-N-(3-chloro-4-fluorophenyl)-N′-hydroxy-1,2,5-oxadiazole 3-carboximidamide;
  • 4-({2 [(aminosulfonyl)amino]ethyl} amino)-N-[4-fluoro-3-(trifluoromethyl)phenyl]-N′-hydroxy-1,2,5 oxadiazole-3-carboximidamide;
  • 4-({2 [(aminosulfonyl)amino]ethyl} amino)-N′-hydroxy-N-[3-(trifluoromethyl)phenyl]-1,2,5 oxadiazole-3-carboximidamide;
  • 4-({2 [(aminosulfonyl)amino]ethyl} amino)-N-(3-cyano-4-fluorophenyl)-N′-hydroxy-1,2,5-oxadiazole 3-carboximidamide;
  • 4-({2 [(aminosulfonyl)amino] ethyl} amino)-N-[(4-bromo-2-furyl)methyl]-N′-hydroxy-1,2,5 oxadiazole-3-carboximidamide; or
  • 4-({2 [(aminosulfonyl)amino] ethyl} amino)-N-[(4-chloro-2-furyl)methyl]-N′-hydroxy-1,2,5 oxadiazole-3-carboximidamide.


The 1,2,5-oxadiazole derivative IDO1 inhibitors are typically administered orally once or twice per day and effective amount by oral administration is generally in the range of 25 mg-1000 mg per dose per patient, such as 25 mg, 50 mg, 100 mg, 200 mg, 300 mg, 400 mg, 500 mg, 600 mg, 700 mg, 800 mg, or 1000 mg. In a particular embodiment, the polypeptide or nucleic acid composition provided by the present disclosure is used in combination with 4-({2-[(aminosulfonyl)amino]ethyl}amino)-N-(3-bromo-4-fluorophenyl)-N′-hydroxy-1,2,5-oxadiazole-3-carboximidamide administered orally twice per day at 25 mg or 50 mg per dose. The 1,2,5-oxadiazole derivatives may be synthesized as described in U.S. Pat. No. 8,088,803, which is incorporated herein by reference in its entirety.


In some other specific embodiments, the polypeptide or nucleic acid composition provided by the present disclosure is used in combination with a pyrrolidine-2,5-dione derivative IDO1 inhibitor disclosed in WO2015/173764. Examples of specific pyrrolidine-2,5-dione derivative inhibitors include the following compounds:

  • 3-(5-fluoro-1H-indol-3-yl)pyrrolidine-2,5-dione;
  • (3-2H)-3-(5-fluoro-1H-indol-3-yl)pyrrolidine-2,5-dione;
  • (−)-(R)-3-(5-fluoro-1H-indol-3-yl)pyrrolidine-2,5-dione;
  • 3-(1H-indol-3-yl)pyrrolidine-2,5-dione;
  • (−)-(R)-3-(1H-indol-3-yl)pyrrolidine-2,5-dione;
  • 3-(5-chloro-1H-indol-3-yl)pyrrolidine-2,5-dione;
  • (−)-(R)-3-(5-chloro-1H-indol-3-yl)pyrrolidine-2,5-dione;
  • 3-(5-bromo-1H-indol-3-yl)pyrrolidine-2,5-dione;
  • 3-(5,6-difluoro-1H-indol-3-yl)pyrrolidine-2,5-dione; and
  • 3-(6-chloro-1H-indol-3-yl)pyrrolidine-2,5-dione.


The pyrrolidine-2,5-dione derivative IDO1 inhibitors are typically administered orally once or twice per day and the effective amount by oral administration is generally in the range of 50 mg-1000 mg per dose per patient, such as 125 mg, 250 mg, 500 mg, 750 mg, or 1000 mg. In a particular embodiment, the polypeptide or nucleic acid composition provided by the present disclosure is used in combination with 3-(5-fluoro-1H-indol-3-yl)pyrrolidine-2,5-dione administered orally once per day at 125-100 mg per dose per patient. The pyrrolidine-2,5-dione derivatives may be synthesized as described in U.S. patent application publication US2015329525, which is incorporated herein by reference in its entirety.


H. Examples

The following examples are provided to illustrate certain embodiments of the invention. They should not be construed to limit the scope of the invention in any way. From the above description and these examples, one skilled in the art can ascertain the essential characteristics of the invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usage and conditions.


Example 1. Construction of Single-Antigen, Dual-Antigen, and Triple-Antigen Constructs

Example 1 illustrates the construction of single antigen constructs, dual-antigen constructs, and triple antigen constructs. Unless as otherwise noted, reference to amino acid positions or residues of MUC1, MSLN, and TERT protein refers to the amino acid sequence of human MUC1 isoform 1 precursor protein as set forth in SEQ ID NO:1, amino acid sequence of human mesothelin (MSLN) isoform 2 precursor protein as set forth in SEQ ID NO:2, and the amino acid sequence of human TERT isoform 1 precursor protein as set forth in SEQ ID NO:3, respectively.


1A. Single-Antigen Constructs


Plasmid 1027 (MUC1). Plasmid 1027 was generated using the techniques of gene synthesis and restriction fragment exchange. The amino acid sequence of human MUC1 with a 5× tandem repeat VNTR region was submitted to GeneArt for gene optimization and synthesis. The gene encoding the polypeptide was optimized for expression, synthesized, and cloned. The MUC-1 open reading frame was excised from the GeneArt vector by digestion with NheI and BgIII and inserted into similarly digested plasmid pPJV7563. The open reading frame (ORF) nucleotide sequence of Plasmid 1027 is set forth in SEQ ID NO:7. The amino acid sequence encoded by Plasmid 1027 is set for in SEQ ID NO:8.


Plasmid 1103 (cMSLN). Plasmid 1103 was constructed using the techniques of PCR and restriction fragment exchange. First, the gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1084 with primers MSLN34 and MSLN598, resulting in the addition of NheI and BgIII restriction sites at the 5′ and 3′ ends of the amplicon, respectively. The amplicon was digested with NheI and Bgl II and inserted into similarly digested plasmid pPJV7563. The open reading frame nucleotide sequence of Plasmid 1103 is set forth in SEQ ID NO:5. The amino acid sequence encoded by Plasmid 1103 is set for in SEQ ID NO:6.


Plasmid 1112 (TERT240). Plasmid 1112 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding TERT amino acids 241-1132 was amplified by PCR from plasmid 1065 with primers f pmed TERT 241G and r TERT co#pMed. The amplicon was cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid1112 is set forth in SEQ ID NO:9. The amino acid sequence encoded by Plasmid 1112 is set for in SEQ ID NO:10.


Plasmid 1197 (cMUC1). Plasmid 1197 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding MUC1 amino acids 22-225, 946-1255 was amplified by PCR from plasmid 1027 with primers ID1197F and ID1197R. The amplicon was cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1197 is set forth in SEQ ID NO:15. The amino acid sequence encoded by Plasmid 1197 is set for in SEQ ID NO:16.


Plasmid 1326 (TERT343). Plasmid 1326 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding TERT amino acids 344-1132 was amplified by PCR from plasmid 1112 with primers TertA343-F and Tert-R. The amplicon was cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid1326 is set forth in SEQ ID NO:13. The amino acid sequence encoded by Plasmid 1326 is set for in SEQ ID NO:14.


Plasmid 1330 (TERT541). Plasmid 1330 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding TERT amino acids 542-1132 was amplified by PCR from plasmid 1112 with primers TertA541-F and Tert-R. The amplicon was cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1330 is set forth in SEQ ID NO:11. The amino acid sequence encoded by Plasmid 1330 is set for in SEQ ID NO:12.


1B. Dual-Antigen Constructs


Plasmid 1158 (cMSLN-PT2A-Muc1). Plasmid 1158 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f pmed Nhe cMSLN and r PTV2A Bamh cMSLN. The gene encoding human Mucin-1 amino acids 2-225, 946-1255 was amplified by PCR from plasmid 1027 with primers f1 PTV2A Muc, f2 PTV2A, and r pmed Bgl Muc. PCR resulted in the addition of overlapping PTV 2A sequences at the 3′ end of cMSLN and 5′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1158 is set forth in SEQ ID NO:23. The amino acid sequence encoded by Plasmid 1158 is set for in SEQ ID NO:24.


Plasmid 1159 (Muc1-PT2A-cMSLN). Plasmid 1159 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f1 PTV2A cMSLN, f2 PTV2A, and r pmed Bgl cMSLN. The gene encoding human Mucin-1 amino acids 2-225, 946-1255 was amplified by PCR from plasmid 1027 with primers f pmed Nhe Muc and r PTV2A Bamh Muc. PCR resulted in the addition of overlapping PTV 2A sequences at the 5′ end of cMSLN and 3′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1159 is set forth in SEQ ID NO:21. The amino acid sequence encoded by Plasmid 1159 is set for in SEQ ID NO:22.


Plasmid 1269 (Muc1-Ter240). Plasmid 1269 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f tg link Ter240 and r pmed Bgl Ter240. The gene encoding human Mucin-1 amino acids 2-225, 946-1255 was amplified by PCR from plasmid 1027 with primers f pmed Nhe Muc and r link muc. PCR resulted in the addition of an overlapping GGSGG linker at the 5′ end of Tert and 3′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1269 is set forth in SEQ ID NO:25. The amino acid sequence encoded by Plasmid 1269 is set for in SEQ ID NO:26.


Plasmid 1270 (Muc1-ERB2A-Ter240). Plasmid 1270 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f2 ERBV2A, f1 ERBV2A Ter240, and r pmed Bgl Ter240. The gene encoding human Mucin-1 amino acids 2-225, 946-1255 was amplified by PCR from plasmid 1027 with primers f pmed Nhe Muc and r ERB2A Bamh Muc. PCR resulted in the addition of overlapping ERBV 2A sequences at the 5′ end of Tert and 3′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1270 is set forth in SEQ ID NO:27. The amino acid sequence encoded by Plasmid 1270 is set for in SEQ ID NO:28.


Plasmid 1271 (Ter240-ERB2A-Muc1). Plasmid 1271 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f pmed Nhe Ter240 and r ERB2A Bamh Ter240. The gene encoding human Mucin-1 amino acids 2-225, 946-1255 was amplified by PCR from plasmid 1027 with primers f2 ERBV2A, f1 ERBV2A Muc, and r pmed Bgl Muc. PCR resulted in the addition of overlapping ERBV 2A sequences at the 3′ end of Tert and 5′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1271 is set forth in SEQ ID NO:29. The amino acid sequence encoded by Plasmid 1271 is set for in SEQ ID NO:30.


Plasmid 1272 (Ter240-T2A-cMSLN). Plasmid 1272 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f pmed Nhe Ter240 and r T2A Tert240. The gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f2 T2A, f1 T2A cMSLN, and r pmed Bgl cMSLN. PCR resulted in the addition of overlapping TAV 2A sequences at the 3′ end of Tert and 5′ end of cMSLN. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1272 is set forth in SEQ ID NO:35. The amino acid sequence encoded by Plasmid 1272 is set for in SEQ ID NO:36.


Plasmid 1273 (Tert240-cMSLN). Plasmid 1273 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f pmed Nhe Ter240 and r link Tert240. The gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f tert ink cMSLN and r pmed Bgl cMSLN. PCR resulted in the addition of an overlapping GGSGG linker at the 3′ end of Tert and 5′ end of cMSLN. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1273 is set forth in SEQ ID NO:37. The amino acid sequence encoded by Plasmid 1273 is set for in SEQ ID NO:38.


Plasmid 1274 (cMSLN-T2A-Tert240). Plasmid 1274 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f2 T2A, f1 T2A Tert240 and r pmed Bgl Ter240. The gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f pmed Nhe cMSLN and r T2A Bamh cMSLN. PCR resulted in the addition of overlapping TAV 2A sequences at the 5′ end of Tert and 3′ end of cMSLN. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1274 is set forth in SEQ ID NO:39. The amino acid sequence encoded by Plasmid 1274 is set for in SEQ ID NO:40.


Plasmid 1275 (cMSLN-Tert240). Plasmid 1275 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f tg link Ter240 and r pmed Bgl Ter240. The gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f pmed Nhe cMSLN and r link cMSLN. PCR resulted in the addition of an overlapping GGSGG linker at the 5′ end of Tert and 3′ end of cMSLN. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1275 is set forth in SEQ ID NO:41. The amino acid sequence encoded by Plasmid 1275 is set for in SEQ ID NO:42.


Plasmid 1286 (cMuc1-ERB2A-Tert240). Plasmid 1286 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f2 ERBV2A, f1 ERBV2A Ter240, and r pmed Bgl Ter240. The gene encoding human Mucin-1 amino acids 22-225, 946-1255 was amplified by PCR from plasmid 1197 with primers f pmed Nhe cytMuc and r ERB2A Bamh Muc. PCR resulted in the addition of overlapping ERBV 2A sequences at the 5′ end of Tert and 3′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1286 is set forth in SEQ ID NO:31. The amino acid sequence encoded by Plasmid 1286 is set for in SEQ ID NO:32.


Plasmid 1287 (Tert240-ERB2A-cMuc1). Plasmid 1287 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human telomerase amino acids 241-1132 was amplified by PCR from plasmid 1112 with primers f pmed Nhe Ter240 and r ERB2A Bamh Ter240. The gene encoding human Mucin-1 amino acids 22-225, 946-1255 was amplified by PCR from plasmid 1197 with primers f2 ERBV2A, f1 ERBV2A cMuc, and r pmed Bgl Muc. PCR resulted in the addition of overlapping ERBV 2A sequences at the 3′ end of Tert and 5′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1287 is set forth in SEQ ID NO:33. The amino acid sequence encoded by Plasmid 1287 is set for in SEQ ID NO: 34.


Plasmid 1313 (Muc1-EMC2A-cMSLN). Plasmid 1313 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers EMCV_cMSLN_F—33, EMCV2A_F—34 and pMED_cMSLN_R—37. The gene encoding human Mucin-1 amino acids 2-225, 946-1255 was amplified by PCR from plasmid 1027 with primers pMED_MUC1_F—31, EMCV2A_R—36, and EMCV_Muc1_R—35. PCR resulted in the addition of overlapping EMCV 2A sequences at the 5′ end of cMSLN and 3′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1313 is set forth in SEQ ID NO:19. The amino acid sequence encoded by Plasmid 1313 is set for in SEQ ID NO:20.


Plasmid 1316 (cMSLN-EMC2A-Muc1). Plasmid 1316 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the human mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f pmed Nhe cMSLN and r EM2A Bamh cMSLN. The gene encoding human Mucin-1 amino acids 2-225, 946-1255 was amplified by PCR from plasmid 1027 with primers f1 EM2A Muc, f2 EMCV2A, and r pmed Bgl Muc. PCR resulted in the addition of overlapping EMCV 2A sequences at the 3′ end of cMSLN and 5′ end of Muc1. The amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1316 is set forth in SEQ ID NO:17. The amino acid sequence encoded by Plasmid 1316 is set for in SEQ ID NO:18.


1C. Triple-Antigen Constructs


Plasmid 1317 (Muc1-EMC2A-cMSLN-T2A-Tert240). Plasmid 1317 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human Mucin-1 amino acids 2-225, 946-1255, an EMCV 2A peptide, and the amino terminal half of the mesothelin precursor were amplified by PCR from plasmid 1313 with primers f pmed Nhe Muc and r MSLN 1051-1033. The genes encoding the carboxy terminal half of the mesothelin precursor, a TAV 2A peptide, and human telomerase amino acids 241-1132 were amplified by PCR from plasmid 1274 with primers f MSLN 1028-1051 and r pmed Bgl Ter240. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1317 is set forth in SEQ ID NO:43. The amino acid sequence encoded by Plasmid 1317 is set for in SEQ ID NO:44.


Plasmid 1318 (Muc1-ERB2A-Tert240-T2A-cMSLN). Plasmid 1318 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human Mucin-1 amino acids 2-225, 946-1255, an ERBV 2A peptide, and the amino terminal half of human telomerase were amplified by PCR from plasmid 1270 with primers f pmed Nhe Muc and r tert 1602-1579. The genes encoding the carboxy terminal half of telomerase, a TAV 2A peptide, and human mesothelin precursor amino acids 37-597 were amplified by PCR from plasmid 1272 with primers f tert 1584-1607 and r pmed Bgl cMSLN. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1318 is set forth in SEQ ID NO:45. The amino acid sequence encoded by Plasmid 1318 is set for in SEQ ID NO:46.


Plasmid 1319 (cMSLN-EMC2A-Muc1-ERB2A-Tert240). Plasmid 1319 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human mesothelin precursor amino acids 37-597, an EMCV 2A peptide, and the amino terminal half of human Mucin-1 were amplified by PCR from plasmid 1316 with primers f pmed Nhe cMSLN and r muc 986-963. The genes encoding the carboxy terminal half of Mucin-1, an ERBV 2A peptide, and human telomerase amino acids 241-1132 were amplified by PCR from plasmid 1270 with primers f Muc 960-983 and r pmed Bgl Ter240. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1319 is set forth in SEQ ID NO:47. The amino acid sequence encoded by Plasmid 1319 is set for in SEQ ID NO:48.


Plasmid 1320 (cMSLN-T2A-Tert240-ERB2A-Muc1). Plasmid 1320 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human mesothelin precursor amino acids 37-597, a TAV 2A peptide, and the amino terminal half of human telomerase were amplified by PCR from plasmid 1274 with primers f pmed Nhe cMSLN and r tert 1602-1579. The genes encoding the carboxy terminal half of telomerase, an ERBV 2A peptide, and human Mucin-1 amino acids 2-225, 946-1255 were amplified by PCR from plasmid 1271 with primers f tert 1584-1607 and r pmed Bgl Muc. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1320 is set forth in SEQ ID NO:49. The amino acid sequence encoded by Plasmid 1320 is set for in SEQ ID NO:50.


Plasmid 1321 (Tert240-T2A-cMSLN-EMC2A-Muc1). Plasmid 1321 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding the amino terminal half of human telomerase was amplified by PCR from plasmid 1112 with primers f pmed Nhe Ter240 and r tert 1602-1579. The genes encoding the carboxy terminal half of telomerase, a TAV 2A peptide, and the amino terminal half of human mesothelin precursor were amplified by PCR from plasmid 1272 with primers f tert 1584-1607 and r MSLN 1051-1033. The genes encoding the carboxy terminal half of human mesothelin precursor, an EMCV 2A peptide, and human Mucin-1 amino acids 2-225, 946-1255 were amplified by PCR from plasmid 1316 with primers f MSLN 1028-1051 and r pmed Bgl Muc. The three partially overlapping amplicons were mixed together and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1321 is set forth in SEQ ID NO:51. The amino acid sequence encoded by Plasmid 1321 is set for in SEQ ID NO:52.


Plasmid 1322 (Tert240-ERB2A-Muc1-EMC2A-cMSLN). Plasmid 1322 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human telomerase amino acids 241-1132, an ERBV 2A peptide, and the amino terminal half of human Mucin-1 were amplified by PCR from plasmid 1271 with primers f pmed Nhe Ter240 and r muc 986-963. The genes encoding the carboxy terminal half of Mucin-1, an EMCV 2A peptide, and human mesothelin precursor amino acids 37-597 were amplified by PCR from plasmid 1313 with primers f Muc 960-983 and r pmed Bgl cMSLN. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1322 is set forth in SEQ ID NO:53. The amino acid sequence encoded by Plasmid 1322 is set for in SEQ ID NO:54.


Plasmid 1351 (Muc1-EMC2A-cMSLN-T2A-Tert541). Plasmid 1351 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human Mucin-1 amino acids 2-225, 946-1255, an EMCV 2A peptide, and the human mesothelin precursor were amplified by PCR from plasmid 1313 with primers f pmed Nhe Muc and r T2A Bamh cMSLN. The genes encoding a TAV 2A peptide and human telomerase amino acids 541-1132 were amplified by PCR from plasmid 1330 with primers f1 T2A Tert d541, f2 T2A, and r pmed Bgl Ter240. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1351 is set forth in SEQ ID NO:55. The amino acid sequence encoded by Plasmid 1351 is set for in SEQ ID NO:56.


Plasmid 1352 (cMSLN-EMC2A-Muc1-ERB2A-Tert541). Plasmid 1352 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human mesothelin precursor amino acids 37-597, an EMCV 2A peptide, and human Mucin-1 were amplified by PCR from plasmid 1316 with primers f pmed Nhe cMSLN and r ERB2A Bamh Muc. The genes encoding an ERBV 2A peptide and human telomerase amino acids 541-1132 were amplified by PCR from plasmid 1330 with primers f1 ERBV2A Tert d541, f2 ERBV2A, and r pmed Bgl Ter240. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1352 is set forth in SEQ ID NO:57. The amino acid sequence encoded by Plasmid 1352 is set for in SEQ ID NO:58.


Plasmid 1353 (cMSLN-T2A-Tert541-ERB2A-Muc1). Plasmid 1353 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding human mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f pmed Nhe cMSLN, r2 T2A, and r T2A Bamh cMSLN. The genes encoding a TAV 2A peptide, human telomerase amino acids 541-1132, an ERBV 2A peptide, and human Mucin-1 amino acids 2-225, 946-1255 were amplified by PCR from plasmid 1271 with primers f1 T2A Tert d541 and r pmed Bgl Muc. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1353 is set forth in SEQ ID NO:59. The amino acid sequence encoded by Plasmid 1353 is set for in SEQ ID NO:60.


Plasmid 1354 (Muc1-EMC2A-cMSLN-T2A-Tert342). Plasmid 1354 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human Mucin-1 amino acids 2-225, 946-1255, an EMCV 2A peptide, and the human mesothelin precursor were amplified by PCR from plasmid 1313 with primers f pmed Nhe Muc and r T2A Bamh cMSLN. The genes encoding a TAV 2A peptide and human telomerase amino acids 342-1132 were amplified by PCR from plasmid 1326 with primers f1 T2A Tert d342, f2 T2A, and r pmed Bgl Ter240. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1354 is set forth in SEQ ID NO:61. The amino acid sequence encoded by Plasmid 1354 is set for in SEQ ID NO:62.


Plasmid 1355 (cMSLN-EMC2A-Muc1-ERB2A-Tert342). Plasmid 1355 was constructed using the techniques of PCR and Seamless cloning. First, the genes encoding human mesothelin precursor amino acids 37-597, an EMCV 2A peptide, and human Mucin-1 were amplified by PCR from plasmid 1316 with primers f pmed Nhe cMSLN and r ERB2A Bamh Muc. The genes encoding an ERBV 2A peptide, and human telomerase amino acids 342-1132 were amplified by PCR from plasmid 1326 with primers f1 ERBV2A Ter d342, f2 ERBV2A, and r pmed Bgl Ter240. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1355 is set forth in SEQ ID NO:63. The amino acid sequence encoded by Plasmid 1355 is set for in SEQ ID NO:64.


Plasmid 1356 (cMSLN-T2A-Tert342-ERB2A-Muc1). Plasmid 1356 was constructed using the techniques of PCR and Seamless cloning. First, the gene encoding human mesothelin precursor amino acids 37-597 was amplified by PCR from plasmid 1103 with primers f pmed Nhe cMSLN, r2 T2A, and r T2A Bamh cMSLN. The genes encoding a TAV 2A peptide, human telomerase amino acids 342-1132, an ERBV 2A peptide, and human Mucin-1 amino acids 2-225, 946-1255 were amplified by PCR from plasmid 1271 with primers f1 T2A Tert d342 and r pmed Bgl Muc. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The resulting clone #3 contained an unintended single base mutation. To correct the mutation, PCR and Seamless cloning were repeated using clone #3 as the template. The genes encoding human mesothelin precursor amino acids 37-597, a TAV 2A peptide, and the amino terminal half of human telomerase were amplified by PCR from clone #3 with primers f pmed Nhe cMSLN and r tert 1602-1579. The genes encoding the carboxy terminal half of telomerase, an ERBV 2A peptide, and human Mucin-1 amino acids 2-225, 946-1255 were amplified by PCR from clone #3 with primers f tert 1584-1607 and r pmed Bgl Muc. The partially overlapping amplicons were digested with Dpn I, mixed together, and cloned into the Nhe I/Bgl II sites of pPJV7563 by Seamless cloning. The open reading frame nucleotide sequence of Plasmid 1356 is set forth in SEQ ID NO:65. The amino acid sequence encoded by Plasmid 1356 is set for in SEQ ID NO:66.


1D. Vector Construction


Vectors for expressing single or multi-antigen constructs were constructed from chimpanzee adenovirus Ad68 genomic sequences. Three versions of the AdC68 backbone without transgenes (called “empty vectors”) were designed in silico. The vectors differed only in the extent of the E1 and E3 deletions that were engineered into the viruses to render them replication incompetent and create space for transgene insertion. Vectors AdC68W and AdC68× were described in international patent application WO2015/063647A1. Vector AdC68Y, carrying deletions of bases 456-3256 and 27476-31831, was engineered to have improved growth properties over AdC68X and a greater transgene carrying capacity than AdC68W. All three empty vectors were biochemically synthesized in a multi-stage process utilizing in vitro oligo synthesis and subsequent recombination-mediated intermediate assembly in Escherichia coli (E. coli) and yeast. Open reading frames (ORF) encoding the various immunogenic TAA polypeptides were amplified by PCR from the plasmids described in the Examples. Open reading frames were then inserted into the empty vector bacmids. Recombinant viral genomes were released from the bacmids by digestion with PacI and the linearized nucleic acids were transfected into an E1 complimenting adherent HEK293 cell line. Upon visible cytopathic effects and adenovirus foci formation, cultures were harvested by multiple rounds of freezing/thawing to release virus from the cells. Viruses were amplified and purified by standard techniques.


Example 2. Immunogenicity of Immunogenic MUC1 Single-Antigen

Constructs


Study in HLA-A2/DR1 Mice


Study design. Twelve mixed gender HLA-A2/DR1 mice were primed on day 0 and boosted on day 14 with DNA construct Plasmid 1027 (which encodes the membrane-bound immunogenic MUC1 polypeptide of SEQ ID NO:8) or Plasmid 1197 (which encodes the cytosolic immunogenic MUC1 polypeptide of SEQ ID NO:16) using the PMED method. On day 21, mice were sacrificed and splenocytes assessed for MUC1-specific cellular immunogenicity in an interferon-gamma (IFN-γ) ELISpot and intracellular cytokine staining (ICS) assay.


Particle Mediated Epidermal Delivery (PMED). PMED is a needle-free method of administering DNAs to a subject. The PMED system involves the precipitation of DNA onto microscopic gold particles that are then propelled by helium gas into the epidermis. The ND10, a single use device, uses pressurized helium from an internal cylinder to deliver gold particles and the X15, a repeater delivery device, uses an external helium tank which is connected to the X15 via high pressure hose to deliver the gold particles. Both of these devices were used in studies to deliver the MUC1 DNA plasmids. The gold particle was usually 1-3 μm in diameter and the particles were formulated to contain 2 μg of antigen DNA plasmids per 1 mg of gold particles. (Sharpe, M. et al.: P. Protection of mice from H5N1 influenza challenge by prophylactic DNA vaccination using particle mediated epidermal delivery. Vaccine, 2007, 25(34): 6392-98: Roberts L K, et al.: Clinical safety and efficacy of a powdered Hepatitis B nucleic acid vaccine delivered to the epidermis by a commercial prototype device. Vaccine, 2005; 23(40):4867-78).


IFN-γ ELISpot assay. Splenocytes from individual animals were co-incubated in triplicate with individual Ag-specific peptides (each peptide at 2-10 ug/ml, 2.5-5e5 cells per well) or pools of 15mer Ag-specific peptides (overlapping by 11 amino acids, covering the entire Ag-specific amino acid sequence; each peptide at 2-5 ug/ml, 1.25-5e5 cells per well) in IFN-γ ELISPOT plates (see also Peptide Pools Table (Table 18), and Tables 15-17). The plates were incubated for ˜16 hours at 37° C., 5% CO2, then washed and developed, as per manufacturer's instruction. The number of IFN-γ spot forming cells (SFC) was counted with a CTL reader. The average of the triplicates was calculated and the response of the negative control wells, which contained no peptides, subtracted. The SFC counts were then normalized to describe the response per 1e6 splenocytes. The antigen-specific responses in the tables represent the sum of the responses to the Ag-specific peptides or peptide pools.


ICS assay. Splenocytes from individual animals were co-incubated with H-2b-, HLA-A2-, or HLA-A24-restricted Ag-specific peptides (each peptide at 5-10 ug/ml, 1-2e6 splenocytes per well) or pools of 15mer Ag-specific peptides (overlapping by 11 amino acids, covering the entire Ag-specific amino acid sequence; each peptide at 2-5 ug/ml, 1-2e6 splenocytes per well) in U-bottom 96-well-plate tissue culture plates (see also Peptide Pools Table (Table 18) and Tables 15-17). The plates were incubated ˜16 hours at 37° C., 5% CO2. The cells were then stained to detect intracellular IFN-γ expression from CD8+ T cells and fixed. Cells were acquired on a flow cytometer. The data was presented per animal as frequency of peptide(s) Ag- or peptide pool Ag-specific IFN-γ+ CD8+ T cells after subtraction of the responses obtained in the negative control wells, which contained no peptide.


Sandwich ELISA assay. The standard sandwich ELISA assay was done using the Tecan Evo, Biomek FxP, and BioTek 405 Select TS automation instruments. The 384 well microplates (flat-well, high binding) were coated at 25 μl/well with 1.0 μg/mL human MUC1 or human MSLN protein (antigen) in 1×PBS, and incubated overnight at 4° C. The next morning, plates were blocked for one hour at RT with 5% FBS in PBS with 0.05% Tween 20 (PBS-T). Mouse sera was prepared at a 1/100 starting dilution in PBS-T in 96 U-bottom well plates. The Tecan Evo performed ½ log serial dilutions in PBS-T over 9 dilution increment points, followed by stamping of 25 μl/well of diluted serum from the 96 well plates to 384 well plates. The 384 well plates were incubated for 1 hour at RT on a shaker at 600 RPM, then, using the BioTek EL 405 Select TS plate washer, the plates were washed 4 times in PBS-T. Secondary mouse anti-IgG-HRP antibody was diluted to an appropriate dilution and stamped by Biomek FxP at 25 μl/well into 384 well plates, and incubated for 1 hour at RT on a shaker at 600 RPM, followed by 5 repeated washes. Using the Biomek FxP, plates were stamped at 25 μl/well of RT TMB substrate and incubated in the dark at RT for 30 minutes, followed by 25 μl/well stamping of 1 N H2SO4 acid to stop the enzymatic reaction. Plates were read on the Molecular Devices, Spectramax 340PC/384 Plus at 450 nm wavelength. Data were reported as calculated titers at OD of 1.0 with a limit of detection of 99.0. The antigen-specific commercial monoclonal antibody was used in each plate as a positive control to track plate-to-plate variation performance, irrelevant vaccinated mouse serum was used as a negative control, and PBS-T only wells were used to monitor non-specific binding background. Titers in the tables represent antigen-specific IgG titers elicited from individual animals.


Results. Table 1 shows ELISpot and ICS data from HLA-A2/DR1 splenocytes cultured with peptide pools derived from the MUC1 peptide library (see also tables 15 and 18) or MUC1 peptide aa516-530, respectively. Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with MUC1 peptide pools, and background subtraction. Numbers in column 4 represent the frequency of CD8+ T cells being IFN-γ+ after restimulation with MUC1 peptide aa516-530 and background subtraction. A positive response is defined as having SFC>100 and a frequency of IFN-γ+ CD8+ T cells >0.05%. As shown in Table 1, the immunogenic MUC1 polypeptides made with the full-length membrane-bound (Plasmid 1027) and cytosolic (Plasmid 1197) MUC1 constructs described in Example 1A above are capable of inducing MUC1-specific T cell responses including HLA-A2-restricted MUC1 peptide aa516-530-specific CD8+ T cell responses. The cytosolic MUC1 antigen format induced the highest magnitude of T cell responses. Importantly, T cell responses derived from cancer patients against the MUC1 peptide aa516-530 have been shown to correlate with anti-tumor efficacy in vitro (Jochems C et al., Cancer Immunol Immunother (2014) 63:161-174) demonstrating the importance of raising cellular responses against this specific epitope.









TABLE 1







T cell response induced by the single-antigen MUC1 DNA


constructs (Plasmid 1027 and Plasmid 1197)


in HLA-A2/DR1 mice














# IFN-γ
% CD8+ T




Animal
spots/106
cells being



Construct ID
#
splenocytes
IFN-γ+
















Plasmid 1027
31
494
2.25




32
277
1.44




33
475
0.10




34
1096
0.84




35
282
1.45




36
649
1.36



Plasmid 1197
43
569
4.69




44
1131
2.15




45
122
2.81




46
373
1.73




47
503
1.80




48
2114
5.52










Study in HLA-A24 Mice


Study design. Mixed gender HLA-A24 mice were primed on day 0 and boosted on days 14, 28 and 42 with DNA construct Plasmid 1027 by PMED administration. On day 21, mice were sacrificed and splenocytes assessed for MUC1-specific cellular immunogenicity (ELISpot).


Results. Table 2 shows ELISpot data from HLA-A24 splenocytes cultured with peptide pools derived from the MUC1 peptide library (see also Peptide Pools Table (Table 18) and Table 15). Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with MUC1 peptide pools and background subtraction. The number in bold font indicates that at least 1 peptide pool tested was too numerous to count, therefore the true figure is at least the value stated. A positive response is defined as having SFC>100. As shown in Table 2, membrane-bound MUC1 construct is capable of inducing MUC1-specific cellular responses.









TABLE 2







T cell response induced by the single-antigen DNA


construct Plasmid 1027 encoding human native full-length


membrane-bound MUC1 antigen in HLA-A24 mice











# IFN-γ spots/106


Construct ID
Animal #
splenocytes












Plasmid 1027
8
3341



9
3181



10
6207



11
3112



12
3346



13
3699









Study in Monkeys


Study design. 14 Chinese cynomolgus macaques were primed with an AdC68W adenovirus vector encoding the cytosolic (Plasmid 1197) or full-length membrane-bound MUC1 antigen (Plasmid 1027) at 2e11 viral particles by bilateral intramuscular injection (1 mL total). 29 days later, animals were boosted with DNA encoding cytosolic or full-length membrane-bound MUC1 antigen delivered intramuscularly bilaterally via electroporation (2 mL total). Anti-CTLA-4 was administered subcutaneously on days 1 (32 mg) and 29 (50 mg). 14 days after the last immunization, animals were bled and PBMCs and sera isolated to assess MUC1-specific cellular (ELISpot, ICS) and humoral (ELISA) responses, respectively.


NHP-Specific Immune Assays.


ELISpot assay. PBMCs from individual animals were co-incubated in duplicate with pools of 15mer Ag-specific peptides (overlapping by 11 amino acids, covering the entire Ag-specific amino acid sequence), each peptide at 2 ug/ml, 4e5 cells per well, in IFN-γ ELISPOT plates (see also Peptide Pools Table (Table 18) and Tables 15-17). The plates were incubated for ˜16 hours at 37° C., 5% CO2, then washed and developed, as per manufacturer's instruction. The number of IFN-γ spot forming cells (SFC) was counted with a CTL reader. The average of the duplicates was calculated and the response of the negative control wells, which contained no peptides, subtracted. The SFC counts were then normalized to describe the response per 1e6 PBMCs. The antigen-specific responses in the tables represent the sum of the responses to the Ag-specific peptide pools.


ICS assay. PBMCs from individual animals were co-incubated with pools of 15mer MUC1 peptides (overlapping by 11 amino acids, covering the entire native full-length MUC1 amino acid sequence, see Table 15), each peptide at 2 ug/mL, 1.5-2e6 PBMCs per well, in U-bottom 96-well-plate tissue culture plates. The plates were incubated for ˜16 hours at 37° C., 5% CO2, and then stained to detect intracellular IFN-γ expression from CD8 T cells. After fixation, the cells were acquired on a flow cytometer. The results are presented per individual animal as number of MUC1, MSLN, or TERT-specific IFN-γ+ CD8+ T cells after subtraction of the responses obtained in the negative control wells, which contained no peptide, and normalized to 1e6 CD8+ T cells.


Sandwich ELISA assay. The standard sandwich ELISA assay was done using the Tecan Evo, Biomek FxP, and BioTek 405 Select TS automation instruments. The 384 well microplates (flat-well, high binding) were coated at 25 μl/well with 1.0 μg/mL human MUC1 or human MSLN protein (antigen) in 1×PBS, and incubated overnight at 4° C. The next morning, plates were blocked for one hour at RT with 5% FBS in PBS with 0.05% Tween 20 (PBS-T). Sera from Chinese cynomolgus macaques was prepared at a 1/100 starting dilution in PBS-T in 96 U-bottom well plates. The Tecan Evo performed ½ log serial dilutions in PBS-T over 9 dilution increment points, followed by stamping of 25 μl/well of diluted serum from the 96 well plates to 384 well plates. The 384 well plates were incubated for 1 hour at RT on a shaker at 600 RPM, then, using the BioTek EL 405 Select TS plate washer, the plates were washed 4 times in PBS-T. Secondary rhesus anti-IgG-HRP antibody, which cross-reacts with cynomolgus IgG, was diluted to an appropriate dilution and stamped by Biomek FxP at 25 μl/well into 384 well plates, and incubated for 1 hour at RT on a shaker at 600 RPM, followed by 5 repeated washes. Using the Biomek FxP, plates were stamped at 25 μl/well of RT TMB substrate and incubated in the dark at RT for 30 minutes, followed by 25 μl/well stamping of 1 N H2SO4 acid to stop the enzymatic reaction. Plates were read on the Molecular Devices, Spectramax 340PC/384 Plus at 450 nm wavelength. Data were reported as calculated titers at OD of 1.0 with a limit of detection of 99.0. The antigen-specific commercial monoclonal antibody was used in each plate as a positive control to track plate-to-plate variation performance, irrelevant vaccinated mouse serum was used as a negative control, and PBS-T only wells were used to monitor non-specific binding background. Titers in the tables represent antigen-specific IgG titers elicited from individual animals.


Results. Table 3 shows the ELISpot and ICS data from Chinese cynomolgus macaques' PBMCs cultured with peptide pools derived from the MUC1 peptide library (see also Peptide Pools Table (Table 18) and Table 15), and the ELISA data from Chinese cynomolgus macaques' sera. Numbers in column 3 represent #IFN-γ spots/106 PBMCs after restimulation with MUC1 peptide pools and background subtraction. Numbers in column 4 represent #IFN-γ+ CD8+ T cells/106 CD8+ T cells after restimulation with MUC1 peptide pools and background subtraction. Numbers in column 5 represent the anti-MUC1 IgG titer (Optical Density (O.D)=1, Limit of Detection (L.O.D)=99.0). A positive response is defined as having SFC>50, IFN-γ+ CD8+ T cells/1e6 CD8+ T cells >50, and IgG titers >99. As shown in Table 3, the immunogenic MUC1 polypeptides made with the cytosolic (1197) and native full-length membrane-bound (1027) MUC1 constructs are capable of inducing MUC1-specific T and B cell responses. The native full-length membrane-bound MUC1 construct (1027) was shown to induce the overall best MUC1-specific cellular and humoral response.









TABLE 3







T and B cell responses induced by the single-antigen adenoviral


AdC68W and single-antigen DNA constructs (Plasmid


1197; Plasmid 1027) in Chinese cynomolgus macaques













# IFN-γ
# IFN-γ+ CD8+ T



Construct
Animal
spots/106
cells/1e6 CD8+ T
IgG


ID #
#
splenocytes
cells
titer














Plasmid
4001
0
0.0
8589.7


1197
4002
38
1549.0
4245.9



4003
17
0.0
2631.9



4501
165
4792.3
614.6



4502
1703
47727.4
1882.8



4503
0
802.8
4366.4



4504
373
1857.0
4419.3


Plasmid
5001
797
813.5
5332.2


1027
5002
1013
312.9
16233.5



5003
1011
9496.9
6885.8



5004
175
170.2
48759.0



5501
214
4803.3
13010.4



5502
306
8367.6
13115.3



5503
405
0.0
89423.0









Example 3. Immunogenicity of MSLN Single-Antigen Constructs

Immune Response Study in Pasteur (HLA-A2/DR1) Mice


Study design. Twelve female HLA-A2/DR1 mice were primed with an AdC68W adenovirus vector encoding the membrane-bound (Plasmid 1084) or cytosolic MSLN antigen (Plasmid 1103) at 1e10 viral particles by intramuscular injection (50 ul). 28 days later, animals were boosted with DNA single-antigen construct encoding an immunogenic MSLN polypeptide using PMED method as described in Example 2. The antigen-specific T cell response was measured seven days later in an IFN-γ ELISPOT and ICS assay.


Results. Table 4 shows ELISpot and ICS data from HLA-A2/DR1 splenocytes cultured with peptide pools derived from the MSLN peptide library (see also Peptide Pools Table (Table 18) and Table 16) or MSLN peptides aa50-64, aa102-116, and aa542-556, respectively. Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with MSLN peptide pools and background subtraction. Numbers in column 4 represent the frequency of CD8+ T cells being IFN-γ+ after restimulation with MSLN peptides aa50-64, aa102-116 and aa542-556, and background subtraction. A positive response is defined as having SFC>100 and a frequency of IFN-γ+ CD8+ T cells >0.05%. As shown in Table 4, the immunogenic MSLN polypeptides made with the membrane-bound (1084) and cytosolic (1103) MSLN constructs described in Example 1A above are capable of inducing MSLN-specific T cell responses. The cytosolic MSLN antigen format induced the highest magnitude of MSLN-specific T cell responses.









TABLE 4







T cell response induced by the single-


antigen adenoviral AdC68W and single-


antigen DNA constructs in HLA-A2/DR1 mice















% CD8+





# IFN-γ
T cells




Animal
spots/106
being



Construct ID
#
splenocytes
IFN-γ+







Plasmid 1084
37
1744
 1.07




38
3488
 3.13




39
1905
 0.19




40
1649
 2.47




41
1900
 0.09




42
1108
 1.87



Plasmid 1103
49
4839
 2.34




50
4685
13.49




51
2508
 3.69




52
1865
 2.09




53
 708
 0.38




54
2525
 4.41










Immune Response Study in HLA A24 Mice


Study designs. Twelve mixed-gender HLA-A24 mice were immunized with membrane-bound (1084) or cytosolic MSLN (1103) DNA constructs using the PMED method in a prime/boost/boost/boost regimen, two weeks apart between each vaccination. MSLN-specific T cell responses were measured 7 days after the last immunization in an IFN-γ ELISpot and ICS assay.


Results. Table 5 shows ELISpot and ICS data from HLA-A24 splenocytes cultured with peptide pools derived from the MSLN peptide library (see also Peptide Pools Table (Table 18) and Table 16) or MSLN peptides aa130-144 and aa230-244, respectively. Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with MSLN peptide pools and background subtraction. Numbers in column 4 represent the frequency of CD8+ T cells being IFN-γ+ after restimulation with MSLN peptides aa130-144 and aa230-244, and background subtraction. A positive response is defined as having SFC>100 and a frequency of IFN-γ+ CD8+ T cells >0.05%. As shown in Table 5, the immunogenic MSLN polypeptides made with the membrane-bound (1084) and cytosolic MSLN (1103) constructs are capable of inducing MSLN-specific T cell responses. The cytosolic MSLN antigen format induced the highest magnitude of MSLN-specific T cell responses.









TABLE 5







T cell response induced by the single-


antigen DNA constructs in HLA-A24 mice















% CD8+





# IFN-γ
T cells




Animal
spots/106
being



Construct ID
#
splenocytes
IFN-γ+







Plasmid 1084
 1
 47
Not determined




 2
 161
Not determined




 3
 13
Not determined




 7
 105
Not determined




 8
 232
Not determined




 9
 151
Not determined



Plasmid 1103
13
2440
0.00




14
2345
0.17




15
1789
0.00




19
3184
0.64




21
5463
1.62




22
2324
0.39










Immune Response Study in Monkeys


Study design. 14 Chinese cynomolgus macaques were primed with an AdC68W adenovirus vector encoding the membrane-bound (Plasmid 1084) or cytosolic MSLN antigen (Plasmid 1103) at 2e11 viral particles by bilateral intramuscular injection (1 mL total). 29 days later, animals were boosted with DNA encoding membrane-bound (1084) or cytosolic MSLN antigen (1103) delivered intramuscularly bilaterally via electroporation (2 mL total). Anti-CTLA-4 was administered subcutaneously on days 1 (32 mg) and 29 (50 mg). 14 days after the last immunization, animals were bled and PBMCs and serum isolated to assess MSLN-specific cellular (ELISpot, ICS) and humoral (ELISA) responses, respectively.


Results. Table 6 shows the ELISpot and ICS data from Chinese cynomolgus macaques' PBMCs cultured with peptide pools derived from the MSLN peptide library (see also Peptide Pools Table (Table 18) and Table 16), and the ELISA data from Chinese cynomolgus macaques' sera. Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with MSLN peptide pools and background subtraction. Numbers in column 4 represent #IFN-γ+ CD8+ T cells/106 CD8+ T cells after restimulation with MSLN peptide pools and background subtraction. Numbers in column 5 represent the anti-MSLN IgG titer (Optical Density (O.D)=1, Limit of Detection (L.O.D)=99.0). A positive response is defined as having SFC>50, IFN-γ+ CD8+ T cells/1e6 CD8+ T cells >50, and IgG titers >99. As shown in Table 6, the immunogenic MSLN polypeptides made with the membrane-bound (1084) and cytosolic (1103) MSLN constructs are capable of inducing MSLN-specific T and B cell responses. The cytoplasmic MSLN construct (Plasmid 1103) was shown to induce the strongest MSLN-specific cellular response; in contrast, the membrane-bound MSLN construct (Plasmid 1084) was shown to induce the strongest MSLN-specific humoral response.









TABLE 6







T and B cell responses induced by the single-


antigen adenoviral AdC68W and single-antigen


DNA constructs in Chinese cynomolgus macaques














# IFN-γ+






CD8+





# IFN-γ
T cells/




Animal
spots/106
1e6 CD8+



Construct ID #
#
splenocytes
T cells
IgG titer














Plasmid 1084
1001
390
181.4
40886.6



1002
787
512.0
41476.1



1003
2083
5642.6
11948.1



1501
894
1083.7
41248.3



1502
1789
6501.0
42668.3



1503
2358
37238.3
42026.5



1504
269
1340.9
43023.6


Plasmid 1103
2001
2131
15318.5
1459.3



2002
2818
7163.4
99.0



2003
1115
2291.0
2393.2



2004
948
3602.6
1948.0



2501
2477
13741.4
1751.7



2502
2082
9318.7
15412.5



2503
831
1797.8
99.0









Example 4. Immunogenicity of Tert Single-Antigen Constructs

Immune Responses Study in Pasteur Mice


Study design. Six mixed gender HLA-A2/DR1 mice were primed with an AdC68W adenovirus vector encoding the truncated (A240) cytosolic immunogenic TERT polypeptide (Plasmid 1112) at 1e10 viral particles by intramuscular injection (50 ul). 28 days later, animals were boosted intramuscularly with 50 ug DNA delivered bilaterally via electroporation (2×20 ul) encoding the truncated (A240) cytosolic TERT antigen (Plasmid 1112). The antigen-specific T cell response was measured seven days later in an IFN-γ ELISPOT and ICS assay.


Results. Table 7 shows ELISpot and ICS data from HLA-A2/DR1 splenocytes cultured with peptide pools derived from the TERT peptide library (see also Peptide Pools Table (Table 18) and Table 17) or TERT peptide aa861-875, respectively. Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with TERT peptide pools and background subtraction. Numbers in column 4 represent the frequency of CD8+ T cells being IFN-γ+ after restimulation with TERT peptide aa861-875 and background subtraction. A positive response is defined as having SFC>100 and a frequency of IFN-γ+ CD8+ T cells >0.05%. As shown in Table 7, the immunogenic TERT polypeptide made with the truncated (A240) cytosolic TERT construct described in Example 1A above is capable of inducing HLA-A2-restricted TERT-specific CD8 T cell responses.









TABLE 7







T cell response induced by the single-antigen adenoviral


AdC68W and single-antigen DNA constructs


(Plasmid 1112) encoding human truncated (Δ240)


cytosolic TERT antigen in HLA-A2/DR1 mice















% CD8+





# IFN-γ
T cells




Animal
spots/106
being



Construct ID
#
splenocytes
IFN-γ+







Plasmid 1112
13
2851
32.79




14
2691
13.60




15
3697
 7.87




16
2984
21.30




17
1832
26.40




18
1385
 3.16










Immune Responses Study in HLA A24 Mice


Study designs. Eight mixed gender HLA-A24 mice were primed with an AdC68W adenovirus vector encoding the truncated (A240) cytosolic TERT antigen (Plasmid 1112) at 1e10 viral particles total by bilateral intramuscular injection (50 ul into each tibialis anterior muscle). 14 days later, animals were boosted intramuscularly with 50 ug DNA delivered bilaterally via electroporation (2×20 ul) encoding the truncated (A240) cytosolic TERT antigen (Plasmid 1112). The antigen-specific T cell response was measured seven days later in an IFN-γ ELISPOT and ICS assay.


Results. Table 8 shows IFN-γ ELISpot and ICS data from HLA-A24 splenocytes cultured with peptide pools derived from the TERT peptide library (see also Peptide Pools Table (Table 18) and Table 17) or TERT peptide aa841-855), respectively. Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with TERT peptide pools and background subtraction. Numbers in column 4 represent the frequency of CD8+ T cells being IFN-γ+ after restimulation with TERT peptides aa841-855, and background subtraction. The number in bold font indicates that at least 1 peptide pool tested was too numerous to count, therefore the true figure is at least the value stated. A positive response is defined as having SFC>100 and a frequency of IFN-γ+ CD8+ T cells >0.1%. As shown in Table 8, the immunogenic TERT polypeptide made with the truncated (Δ240) cytosolic TERT (1112) construct is capable of inducing HLA-A24-restricted TERT-specific CD8+ T cell responses.









TABLE 8







T cell response induced by the single-antigen


adenoviral AdC68W single-(Δ240) cytosolic


antigen DNA constructs (Plasmid 1112)


encoding human truncated


TERT antigen in HLA-A24 mice















% CD8+





# IFN-γ
T cells




Animal
spots/106
being



Construct ID
#
splenocytes
IFN-γ+
















Plasmid 1112
17
4233
41.5




18
2643
3.34




19
1741
31.5




20
3407
3.05




21
3213
0.0903




22
 596
0




23
1875
13.8




24
2011
19.8










Immune Responses Study in Monkeys


Study design. Eight Chinese cynomolgus macaques were primed with an AdC68W adenovirus vector encoding the truncated (Δ240) cytosolic TERT antigen (Plasmid 1112) at 2e11 viral particles by bilateral intramuscular injection (1 mL total). 30 and 64 days later, animals were boosted with DNA (Plasmid 1112) encoding truncated (Δ240) cytosolic TERT antigen delivered intramuscularly bilaterally via electroporation (2 mL total). Anti-CTLA-4 was administered subcutaneously on days 1 (32 mg), 31 (50 mg) and 65 (75 mg). 14 days after the last immunization, animals were bled and PBMCs isolated to assess TERT-specific cellular (ELISpot, ICS) responses.


Results. Table 9 shows the ELISpot and ICS data from Chinese cynomolgus macaques' PBMCs cultured with peptide pools derived from the TERT peptide library (see also Peptide Pools Table (table 18) and Table 17). Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with TERT peptide pools and background subtraction. Numbers in column 4 represent #IFN-γ+ CD8+ T cells/106 CD8+ T cells after restimulation with TERT peptide pools and background subtraction. A positive response is defined as having SFC>50 and IFN-γ+ CD8+ T cells/1e6 CD8+ T cells >50. As shown in Table 9, the immunogenic TERT polypeptide made with the truncated (Δ240) cytosolic (Plasmid 1112) TERT construct is capable of inducing TERT-specific T cell responses.









TABLE 9







T cell response induced by the TERT single-antigen


adenoviral AdC68W and TERT single-antigen DNA


constructs in Chinese cynomolgus macaques















# IFN-γ+






CD8+





# IFN-γ
T cells/




Animal
spots/106
1e6 CD8+



Construct ID #
#
splenocytes
T cells
















Plasmid 1112
1001
3487
29472.2




1002
1130
4906.6




1003
2077
2984.2




1004
 133
337.8




1501
3157
5325.1




1502
2037
653.2




1503
2697
16953.4




1504
1208
1178.9










Example 5. Immunogenicity of Dual-Antigen Constructs

Immune Response Study in Monkeys


Study design. 24 Chinese cynomolgus macaques were primed with dual-antigen adenoviral AdC68W vectors encoding human native full-length membrane-bound MUC1 (MUC1) and human truncated (Δ240) cytosolic TERT (TERTΔ240) antigens at 2e11 viral particles by bilateral intramuscular injection (1 mL total). 30 and 64 days later, animals were boosted with dual-antigen DNA constructs (Plasmids 1270, 1271, and 1269) encoding the same two antigens delivered intramuscularly bilaterally via electroporation (2 mL total). Anti-CTLA-4 was administered subcutaneously on days 1 (32 mg), 31 (50 mg) and 65 (75 mg). 14 days after the last immunization, animals were bled and PBMCs and serum isolated to assess MUC1- and TERT-specific cellular (ELISpot, ICS) and MUC1-specific humoral (ELISA) responses, respectively. In total, three different dual-antigen constructs, which co-expressed both antigens, were evaluated: a) MUC1-2A-TERTΔ240 (Plasmid 1270), an AdC68W vector and DNA plasmid encoding MUC1 and TERT linked by a 2A peptide; b) TERTΔ240-2A-MUC1 (Plasmid 1271), an AdC68W vector and DNA plasmid encoding TERT and MUC1 linked by a 2A peptide; c) MUC1-TERTΔ240 (Plasmid 1269), an AdC68W vector and DNA plasmid encoding the MUC1-TERT fusion protein (see also Example 1B).


Results. Table 10 shows the ELISpot and ICS data from Chinese cynomolgus macaques' PBMCs cultured with peptide pools derived from the MUC1 and TERT peptide libraries (see also Peptide Pools Table (Table 18) and Tables 15 and 17), and the ELISA data from Chinese cynomolgus macaques' sera. A positive response is defined as having SFC>50, IFN-γ+ CD8+ T cells/1e6 CD8+ T cells >50, and IgG titers >99. Numbers in columns 3 and 6 represent #IFN-γ spots/106 splenocytes after restimulation with MUC1 and TERT peptide pools and background subtraction, respectively. Numbers in bold font indicates that at least 1 peptide pool tested was too numerous to count, therefore the true figure is at least the value stated. Numbers in columns 4 and 7 represent #IFN-γ+ CD8+ T cells/106 CD8 #T cells after restimulation with MUC1 peptide pools and TERT peptide pools, respectively, and background subtraction. Numbers in column 5 represent the ani-MUC1 IgG titer (Optical Density (O.D)=1, Limit of Detection (L.O.D)=99.0). As shown in Table 10, the immunogenic MUC1 and TERT polypeptides made with the MUC1- and TERT-expressing dual-antigen constructs (Plasmids 1270, 1271, and 1269) are capable of inducing MUC1- and TERT-specific T cell responses, and MUC1-specific B cell responses. The dual-antigen construct 1269 encoding a MUC1-TERT fusion protein was shown to induce the strongest overall MUC1-specific cellular response; in contrast, dual-antigen construct Plasmid 1271 (TERT-2A-MUC1) was shown to induce the strongest overall TERT-specific cellular response. All three dual-antigen constructs were shown to induce a comparable MUC1-specific humoral response.









TABLE 10







T and B cell responses induced by the dual-antigen adenoviral


AdC68W and single-antigen DNA constructs (Plasmid 1270,


1271, and 1269) encoding an immunogenic MUC1 and/or TERT


polypeptide in Chinese cynomolgus macaques










MUC1
TERT















# IFN-γ
# IFN-γ+

# IFN-γ
# IFN-γ+




spots/
CD8+ T

spots/
CD8+ T




106
cells/1e6

106
cells/1e6


Construct
Animal
spleno-
CD8+ T
IgG
spleno-
CD8+ T


ID
#
cytes
cells
titer
cytes
cells
















Plasmid
5001
813
1024.4
10725.8
307
436.9


1270
5002
2778
14740.6
27090.7
1573
423.0



5003
217
1198.7
19339.6
1687
40680.3



5004
298
Excluded
3980.3
252
805.3



5501
2287
6255.7
16278.9
692
0.0



5502
760
0.0
6496.2
3010
13302.0



5503
1315
199.8
6446.4
3702
7259.3



5504
500
281.8
39868.0
2005
13727.8


Plasmid
6001
1037
0.0
11770.3

2937

63106.1


1271
6002
185
0.0
13925.4
1295
194.8



6003
372
267.4
15439.7

2138

46023.2



6004
203
97.1
10530.7
1562
8424.0



6501
1315
2137.3
43487.3

3794

20358.2



6502
1008
179.2
8742.0

2955

1503.5



6503
552
226.4
35183.4
1797
50008.6



6504
2200
162.8
35539.9

4402

24058.6


Plasmid
7001
193
0.0
14868.3
3320
7321.5


1269
7002
1353
2153.2
7546.6
870
736.2



7003
1253
133.5
21277.4
2750
25827.7



7004
1858
20846.7
10359.9
3230
19664.0



7501
2138
773.6
31272.8
927
332.0



7502
2177
10547.7
16635.5
2640
7527.3



7503
1460
5086.2
5465.1
2362
938.6



7504
922
0.0
38530.4

2875

2949.3









Example 6. Immunogenicity of Triple-Antigen Constructs

Example 6 illustrates the capability of triple-antigen adenoviral and nucleic acid constructs expressing the human native full-length membrane-bound MUC1 antigen (MUC1), human cytosolic MSLN antigen (cMSLN), and human truncated (Δ240) cytosolic TERT antigen (TERTΔ240 or TERTΔ541) to elicit Ag-specific T and B cell responses to all three encoded cancer antigens.


Immune Response Study in C57BL16J Mice Using Electroporation


Study Design. 48 female C57BL/6J mice were immunized with triple-antigen DNA constructs encoding human MUC1, cMSLN, and TERTΔ240. The triple-antigen DNA construct (100 ug) was delivered intramuscularly bilaterally (20 ul total into each tibialis anterior muscle) with concomitant electroporation in a prime/boost regimen, two weeks apart between each vaccination. MUC1-, MSLN-, and TERT-specific cellular responses, and MUC1- and MSLN-specific humoral responses were measured 7 days after the last immunization in an IFN-γ ELISpot assay and ELISA assay, respectively. In total, six different triple-antigen DNA constructs encoding all three antigens linked by 2A peptides were used as follows: MUC1-2A-cMSLN-2A-TERTΔ240 (Plasmid 1317), MUC1-2A-TERTΔ240-2A-cMSLN (Plasmid 1318), cMSLN-2A-MUC1-2A-TERTΔ240 (Plasmid 1319), cMSLN-2A-TERTΔ240-2A-MUC1 (Plasmid 1320), TERTΔ240-2A-cMSLN-2A-MUC1 (Plasmid 1321), TERTΔ240-2A-MUC1-2A-cMSLN (Plasmid 1322) (see also Example 1C). Results. Table 11 shows the ELISpot data from C57BL/6J splenocytes cultured with peptide pools derived from the MUC1, MSLN, and TERT peptide libraries (see also Peptide Pools Table (Table 18) and Tables 15-17), and the ELISA data from C57BL/6J mouse sera. A positive response is defined as having SFC>100 and IgG titers >99. Numbers in columns 3, 5 and 7 represent #IFN-γ spots/106 splenocytes after restimulation with MUC1, MSLN and TERT peptide pools and background subtraction, respectively. Numbers in bold font indicates that at least 1 peptide pool tested was too numerous to count, therefore the true figure is at least the value stated. Numbers in columns 4 and 6 represent the anti-MUC1 and MSLN IgG titer, respectively (Optical Density (O.D)=1, Limit of Detection (L.O.D)=99.0). As shown in Table 11, the immunogenic MUC1, MSLN, and TERT polypeptides made with the MUC1-, MSLN-, and TERT-expressing triple-antigen constructs are capable of inducing T cell responses against all three antigens, and B cell responses against MUC1; in contrast, only triple-antigen constructs Plasmids 1317, 1318, and 1322 are capable of inducing B cell responses against MSLN.









TABLE 11







T and B cell responses induced by the triple-antigen DNA


constructs (1317-1322) encoding human native full-length


membrane-bound MUC1, human cytosolic MSLN, and human truncated


(Δ240) cytosolic TERT antigens in C57BL/6J mice











MUC1
MSLN
TERT















# IFN-γ

# IFN-γ

# IFN-γ




spots/

spots/

spots/




106

106

106


Construct

spleno-
IgG
spleno-
IgG
spleno-


ID
Animal
cytes
titer
cytes
titer
cytes
















Plasmid
1
1433
1772.7
369
3069.8

2920



1317
2
1979
5214.6
2764
9420.3

3133




3
1729
3229.9
464
6205.6

2413




4
1570
3220.1
1108
3892.8

3255




5
1023
3837.1
497
11621.6

2293




6
1509
5573.0
898
2804.0

2817




7
1095
3905.2
163
1745.6
2311



8
1778
5147.2
2140
7709.5

3233



Plasmid
9
842
7873.1
652
99.0

2875



1319
10
1443
8987.3
760
99.0

3652




11
2832
7789.4
343
99.0

3510




12
1797
13430.0
603
99.0

3863




13
1351
9923.4
901
99.0

3443




14
1626
3242.3
917
99.0

3541




15
829
7361.0
563
99.0

3003




16
1165
6143.4
871
99.0

3080



Plasmid
17
475
1352.7
160
194.3
704


1318
18
1027
6933.6
188
99.0

2413




19
1424
1886.9
557
213.2
2244



20
2241
3864.1
597
326.3

2799




21
1447
5095.6
240
1926.4

2787




22
789
3992.6
116
1198.2

2455




23
700
4968.0
195
3040.2
2221



24
1584
5403.9
231
3017.3

3310



Plasmid
25
2043
4173.3
908
99.0

4896



1320
26
2307
4158.6
1609
99.0

4532




27
2271
10258.5
1281
99.0

3807




28
829
6768.5
243
99.0

2420




29
1355
7163.9
624
99.0

2993




30
1938
7404.1
673
99.0

3214




31
1373
3941.5
386
99.0

3139




32
1581
7843.7
393
99.0

3745



Plasmid
33
964
5579.2
225
99.0
2500


1321
34
690
6364.0
141
99.0

2674




35
923
8861.3
99
99.0

2492




36
767
10270.5
573
99.0

2467




37
1039
3211.9
148
99.0
1785



38
1283
8614.10
308
99.0
2042



39
1929
15147.2
276
99.0

2805




40
529
3581.12
199
99.0
1412


Plasmid
41
1017
5933.07
281
7430.2

2702



1322
42
1936
5333.3
271
112.5

3317




43
1719
3113.3
484
7054.2

3711




44
994
4422.0
254
4499.5

2797




45
1824
3902.0
1710
3246.3

5541




46
1435
1189.9
416
1122.6

4654




47
2430
686.7
613
99.0

4548




48
1931
7288.6
1665
2088.1

4408










Immune Response Study in C57BL/6J Mice Using Adenoviral Vectors Study Design. 36 female C57BL/6J mice were primed with triple-antigen adenoviral vectors encoding human MUC1, cMSLN, and TERTΔ240 or TERTΔ541, at 1e10 viral particles by intramuscular injection (50 ul). 28 days later, animals were boosted with triple-antigen DNA constructs (50 ug) delivered intramuscularly bilaterally (20 ul total into each tibialis anterior muscle) with concomitant electroporation. MUC1-, MSLN-, and TERT-specific cellular responses, and MUC1- and MSLN-specific humoral responses were measured 7 days after the last immunization in an IFN-γ ELISpot and ICS assay, and an ELISA assay, respectively. In total, three triple-antigen adenoviral and DNA constructs encoding MUC1, cMSLN, and TERTΔ240 linked by 2A peptides, and three triple-antigen adenoviral and DNA constructs encoding MUC1, cMSLN, and TERTΔ541 linked by 2A peptides were used as follows: MUC1-2A-cMSLN-2A-TERTΔ240 (Plasmid 1317), cMSLN-2A-MUC1-2A-TERTΔ240 (Plasmid 1319), cMSLN-2A-TERTΔ240-2A-MUC1 (Plasmid 1320), and MUC1-2A-cMSLN-2A-TERTΔ541 (Plasmid 1351), cMSLN-2A-MUC1-2A-TERTΔ541 (Plasmid 1352), cMSLN-2A-TERTΔ541-2A-MUC1 (Plasmid 1353) (see also Example 1C).


Results. Table 12 shows the ELISpot data from C57BL/6J splenocytes cultured with peptide pools derived from the MUC1, MSLN, and TERT peptide libraries (see also Peptide Pools Table (Table 18) and Tables 15-17), the ICS data from C57BL/6J splenocytes cultured with TERT peptide aa1025-1039, and the ELISA data from C57BL/6J mouse sera. A positive response is defined as having SFC>100, a frequency of IFN-γ+ CD8+ T cells >0.1%, and IgG titers >99. Numbers in columns 3, 5, and 7 represent #IFN-γ spots/106 splenocytes after restimulation with MUC1, MSLN and TERT peptide pools, and background subtraction, respectively. Numbers in bold font indicate that at least 1 peptide pool tested was too numerous to count, therefore the true figure is at least the value stated. Numbers in column 8 represent #IFN-γ+ CD8+ T cells/106 CD8+ T cells after restimulation with TERT-specific peptide TERT aa1025-1039, and background subtraction. Numbers in columns 4 and 6 represent the anti-MUC1 and anti-MSLN IgG titer, respectively (Optical Density (O.D)=1, Limit of Detection (L.O.D)=99.0). As shown in Table 12, the immunogenic MUC1, MSLN, and TERT polypeptides made with MUC1-, MSLN-, and TERT-expressing triple-antigen constructs are capable of inducing T cell responses against all three antigens, and B cell responses against MUC1; in contrast, only triple-antigen constructs 1317 and 1351 are capable of inducing B cell responses against MSLN.









TABLE 12A







MUC1-specific T and B cell responses induced


by the triple-antigen adenoviral AdC68Y and


DNA constructs (Plasmids 1317, 1319, and 1320)


encoding human native full-length membrane-bound


MUC1, human cytosolic MSLN, and human


truncated (Δ240) cytosolic TERT antigens,


and by the triple-antigen adenoviral AdC68Y


and DNA constructs (Plasmids 1351-1353)


encoding human native full-length


membrane-bound MUC1, human cytosolic


MSLN, and human truncated (Δ541)


cytosolic TERT antigens in C57BL/6J mice













MUC1














# IFN-γ





Animal
spots/106
IgG



Construct ID
#
splenocytes
titer
















Plasmid 1317
19
3119
11653.4




20
3347
11941.0




21
1712
7287.2




22
3604
14391.7




23
2349
12599.0




24
2457
12969.1



Plasmid 1319
25
1865
15018.2




26
1661
8836.8




27
1657
13335.1




28
1933
17854.1




29
1293
10560.2




30
2035
10477.6



Plasmid 1320
31
2377
2667.4




32
1629
11322.4




33
1632
9562.9




34
1259
7092.0




35
2024
11306.8




36
 861
1785.1



Plasmid 1351
37
2615
10253.1




38
1595
13535.4




39
1889
14557.4




40
1869
15470.1




41
1979
11944.4




42
1892
18093.0



Plasmid 1352
43
1593
22002.4




44
2133
11821.6




45
1341
48297.5




46
1673
8682.2




47
1933
11621.7




48
1767
19318.1



Plasmid 1353
49
1859
4826.7




50
1845
3060.0




51
1784
4499.9




52
2209
2940.9




53
2177
7738.32




54
1821
2985.5

















TABLE 12B







MSLN-specific T and B cell responses


induced by the triple-antigen adenoviral


AdC68Y and DNA constructs (Plasmids


1317, 1319, and 1320) encoding human native


full-length membrane-bound MUC1, human


cytosolic MSLN, and human truncated (Δ240)


cytosolic TERT antigens, and by the triple-


antigen adenoviral AdC68Y full-length and DNA


constructs (Plasmids 1351-1353) encoding


human native membrane-bound MUC1, human


cytosolic MSLN, and human truncated (Δ541)


cytosolic TERT antigens in C57BL/6J mice













MSLN














# IFN-γ





Animal
spots/106
IgG



Construct ID
#
splenocytes
titer
















Plasmid 1317
19
856
99.0




20
911
1581.9




21
336
1401.2




22
820
767.3




23
721
99.0




24
1067
99.0



Plasmid 1319
25
708
99.0




26
368
99.0




27
769
99.0




28
1620
99.0




29
880
99.0




30
427
99.0



Plasmid 1320
31
424
99.0




32
399
99.0




33
289
99.0




34
321
99.0




35
540
99.0




36
316
99.0



Plasmid 1351
37
685
99.0




38
804
281.3




39
505
155.8




40
333
99.0




41
285
2186.7




42
444
99.0



Plasmid 1352
43
1504
99.0




44
421
99.0




45
1293
99.0




46
581
99.0




47
747
99.0




48
821
99.0



Plasmid 1353
49
984
99.0




50
740
99.0




51
412
99.0




52
1266
99.0




53
764
99.0




54
432
99.0

















TABLE 12 C







TERT-specific T cell responses


induced by the triple-antigen adenoviral


AdC68Y and DNA constructs (Plasmids


1317, 1319, and 1320) encoding human native


full-length membrane-bound MUC1, human


cytosolic MSLN, and human truncated (Δ240)


cytosolic TERT antigens, and by the triple-


antigen adenoviral AdC68Y and DNA


constructs (Plasmids 1351-1353) encoding


human native full-length membrane-bound


MUC1, human cytosolic MSLN, and human


truncated (Δ541) cytosolic TERT antigens in


C57BL/6J mice













TERT















% CD8+





# IFN-γ
T cells




Animal
spots/106
being



Construct ID
#
splenocytes
IFN-γ+







Plasmid 1317
19
5730
4.1




20
4119
2.0




21
4587
4.9




22
5522
4.3




23
5120
3.6




24
4383
4.5



Plasmid 1319
25
4995
3.1




26
4628
7.1




27
2892
2.7




28
4977
4.7




29
3913
5.2




30
3153
2.9



Plasmid 1320
31
3732
3.6




32
4308
4.3




33
4153
1.4




34
5067
5.2




35
5351
5.1




36
3268
5.0



Plasmid 1351
37
3766
2.4




38
5805
7.7




39
4391
4.7




40
3401
2.7




41
3874
4.0




42
3260
2.5



Plasmid 1352
43
5235
5.0




44
2853
3.4




45
2876
3.5




46
2610
3.3




47
3275
2.8




48
3009
3.3



Plasmid 1353
49
5806
9.1




50
6114
6.1




51
4759
6.5




52
5157
4.8




53
3999
2.9




54
4719
3.3










Immune Response Study in HLA-A24 Mice


Study Design. Eight mixed gender HLA-A24 mice were primed with an adenoviral AdC68Y triple-antigen construct (Plasmid 1317; MUC1-2A-cMSLN-2A-TERTΔ240) encoding human MUC1, cMSLN, and TERTΔ240 at 1e10 viral particles by intramuscular injection (50 ul into each tibialis anterior muscle). 14 days later, animals were boosted intramuscularly with 50 ug triple-antigen DNA construct (Plasmid 1317) encoding the same three antigens (20 ul delivered into each tibialis anterior muscle with concomitant electroporation). HLA-A24-restricted MUC1-specific cellular responses were measured 7 days after the last immunization in an IFN-γ ELISpot assay.


Results. Table 13 shows the ELISpot data from HLA-A24 splenocytes cultured with the MUC1 peptide aa524-532. A positive response is defined as having SFC>50. Numbers in column 3 represent #IFN-γ spots/106 splenocytes after restimulation with MUC1 peptide aa524-532 and background subtraction. As shown in Table 13, the immunogenic MUC1 polypeptides made with the MUC1-, MSLN-, and TERT-expressing triple-antigen construct 1317 are capable of inducing HLA-A24-restricted MUC1 peptide aa524-532-specific CD8′ T cell responses. Importantly, T cell responses derived from cancer patients against this specific MUC1 peptide have been shown to correlate with anti-tumor efficacy in vitro (Jochems C et al., Cancer Immunol Immunother (2014) 63:161-174) demonstrating the importance of raising cellular responses against this specific epitope.









TABLE 13







HLA-A24-restricted MUC1 peptide aa524-532-


specific T cell responses induced by the triple-


antigen adenoviral and DNA constructs Plasmid 1317


(MUC1-2A-cMSLN-2A-TERTΔ240) encoding


human native full-length membrane-bound MUC1,


human cytosolic MSLN, and human truncated


(Δ240) cytosolic TERT antigens in HLA-A24 mice











# IFN-γ



Animal
spots/106


Construct ID
#
splenocytes





Plasmid 1317
89
 89



90
289



91
291



92
207



93
 83



94
295



95
 82



96
100









Immune Response Study in Monkeys


Study design. 24 Chinese cynomolgus macaques were primed with AdC68Y adenoviral vectors encoding human native full-length membrane-bound MUC1 (MUC1), human cytoplasmic MSLN (cMSLN), and human truncated (Δ240) cytosolic TERT (TERTΔ240) antigens at 2e1 l viral particles by bilateral intramuscular injection (1 mL total). 28 and 56 days later, animals were boosted with DNA encoding the same three antigens delivered intramuscularly bilaterally via electroporation (2 mL total). Anti-CTLA-4 was administered subcutaneously on days 1 (32 mg), 29 (50 mg) and 57 (75 mg). 21 days after the last immunization, animals were bled and PBMCs and serum isolated to assess MUC1-, MSLN-, and TERT-specific cellular (ELISpot, ICS) and MUC1- and MSLN-specific humoral (ELISA) responses, respectively. In total, three triple-antigen adenoviral and DNA constructs encoding MUC1, cMSLN, and TERTΔ240 linked by 2A peptides were evaluated: MUC1-2A-cMSLN-2A-TERTΔ240 (Plasmid 1317), cMSLN-2A-MUC1-2A-TERTΔ240 (Plasmid 1319), and cMSLN-2A-TERTΔ240-2A-MUC1 (Plasmid 1320).


Results. Tables 14A, 14B, and 14C show the ELISpot and ICS data from Chinese cynomolgus macaques' PBMCs cultured with peptide pools derived from the MUC1, MSLN, and TERT peptide libraries (see also Peptide Pools Table (Table 18) and Tables 15-17), and the ELISA data from Chinese cynomolgus macaques' sera. A positive response is defined as having SFC>50, IFN-γ+ CD8+ T cells/1e6 CD8+ T cells >50, and IgG titers >99. Numbers in columns 3, 6, and 9 represent #IFN-γ spots/106 splenocytes after restimulation with MUC1, MSLN, and TERT peptide pools, and background subtraction, respectively. Numbers in bold font indicate that at least 1 peptide pool tested was too numerous to count, therefore the true figure is at least the value stated. Numbers in columns 4, 7, and 10 represent #IFN-γ+ CD8+ T cells/106 CD8+ T cells after restimulation with MUC1, MSLN, and TERT peptide pools, respectively, and background subtraction. Numbers in column 5 and 8 represent the anti-MUC1 and anti-MSLN IgG titer (Optical Density (O.D)=1, Limit of Detection (L.O.D)=99.0), respectively. As shown in Table 14, the immunogenic MUC1, MSLN, and TERT polypeptides made with MUC1-, MSLN-, and TERT-expressing triple-Ag constructs are capable of inducing cellular responses against all three antigens, and humoral responses against MUC1. However, only triple-antigen construct 1317 is able to induce significant MSLN-specific B cell responses.









TABLE 14A







MUC1-specific T and B cell responses


induced by the triple-antigen


adenoviral AdC68Y and DNA constructs


(Plasmids 1317, 1319, and 1320) encoding


human native full-length membrane-bound


MUC1, human cytoplasmic MSLN, and


human truncated (Δ240) cytosolic TERT


antigens in Chinese cynomolgus macaques











MUC1














# IFN-γ+






CD8+





# IFN-γ
T cells/




Animal
spots/106
1e6 CD8+
IgG


Construct ID
#
splenocytes
T cells
titer














Plasmid 1317
4001
1319
0.0
27565.9



4002
2664
48690.6
55784.5



4003
373
322.3
16151.0



4004
1617
8476.8
29970.0



4501
2341
1359.0
24289.1



4502
1157
0.0
21841.4



4503
2286
3071.1
63872.6



4504
1638
2172.4
45515.2


Plasmid 1319
5001
88
0.0
22857.2



5002
1308
0.0
29024.8



5003
294
0.0
13356.0



5004
527
468.8
15029.1



5501
1296
2088.2
44573.6



5502
1377
6624.2
23185.5



5503
1302
0.0
25699.1



5504
2499
10403.1
14456.8


Plasmid 1320
6001
486
0.0
24454.1



6002
1742
412.3
31986.3



6003
1369
1154.9
23966.8



6004
1129
561.6
39738.0



6501
1673
447.4
21119.6



6502
1215
0.0
18092.2



6503
1817
3332.4
16364.6



6504
1212
1157.1
17340.2
















TABLE 14B







MSLN-specific T and B cell responses


induced by the triple-antigen


adenoviral AdC68Y and DNA constructs


(Plasmids 1317, 1319, and 1320) encoding


human native full-length membrane-bound


MUC1, human cytoplasmic MSLN, and


human truncated (Δ240) cytosolic TERT


antigens in Chinese cynomolgus macaques













MSLN
















# IFN-γ+







CD8+






# IFN-γ
T cells/





Animal
spots/106
1e6 CD8+
IgG



Construct ID
#
splenocytes
T cells
titer







Plasmid 1317
4001
1479
3732.4
7683.9




4002
1587
1795.3
6147.4




4003
648
884.7
3197.3




4004
164
0.0
4561.3




4501
2279
15469.0
6350.0




4502
1930
22480.2
11699.5




4503
1234
865.1
19065.6




4504
1543
2348.1
4492.7



Plasmid 1319
5001
258
426.6
99.0




5002
1855
2030.9
232.0




5003
1505
642.8
99.0




5004
1275
2410.4
243.3




5501
282
0.0
99.0




5502
732
558.6
418.4




5503
2070
4529.3
130.9




5504
871
3466.9
99.0



Plasmid 1320
6001
2446
6723.2
1381




6002
1953
3185.0
184.8




6003
2045
4053.7
99.0




6004
395
0.0
419.3




6501
1742
5813.1
322.7




6502
1617
12311.5
99.0




6503
448
0.0
285.6




6504
338
0.0
168.8

















TABLE 14C







TERT-specific T cell responses


induced by the triple-antigen


adenoviral AdC68Y and DNA constructs


(Plasmids 1317, 1319, and 1320) encoding


human native full-length membrane-bound


MUC1, human cytoplasmic MSLN, and


human truncated (Δ240) cytosolic TERT


antigens in Chinese cynomolgus macaques













TERT















# IFN-γ+






CD8+





# IFN-γ
T cells/




Animal
spots/106
1e6 CD8+



Construct ID
#
splenocytes
T cells
















Plasmid 1317
4001
1723
8843.8




4002
870
658.1




4003
2128
5976.1




4004
420
0.0




4501
2136
999.1




4502
2342
1195.6




4503
1966
6701.1




4504
2436
6985.5



Plasmid 1319
5001
1018
1724.4




5002
2121
713.8




5003
2184
324.3




5004
822
714.4




5501
462
1851.4




5502
325
692.9




5503
401
0.0




5504
517
0.0



Plasmid 1320
6001
3011
8615.5




6002
2825
2002.0




6003
1489
1235.8




6004
2272
2462.2




6501
2428
1362.2




6502
1875
4649.5




6503
2515
8493.2




6504
2584
5171.0

















TABLE 15







Human MUC1 Peptide Library peptide pools


and corresponding amino acid sequences









Amino Acid Sequence
Peptide #
SEQ ID NO





MASTPGTQSPFFLLL
  1aAS
132





TPGTQSPFFLLLLLT
  1bAS
133





TQSPFFLLLLLTVLT
  2
134





FFLLLLLTVLTVVTG
  3
135





LLLTVLTVVTGSGHA
  4
136





VLTVVTGSGHASSTP
  5
137





VTGSGHASSTPGGEK
  6
138





GHASSTPGGEKETSA
  7
139





STPGGEKETSATQRS
  8
140





GEKETSATQRSSVPS
  9
141





TSATQRSSVPSSTEK
 10
142





QRSSVPSSTEKNAVS
 11
143





VPSSTEKNAVSMTSS
 12
144





TEKNAVSMTSSVLSS
 13
145





AVSMTSSVLSSHSPG
 14
146





TSSVLSSHSPGSGSS
 15
147





LSSHSPGSGSSTTQG
 16
148





SPGSGSSTTQGQDVT
 17
149





GSSTTQGQDVTLAPA
 18
150





TQGQDVTLAPATEPA
 19
151





DVTLAPATEPASGSA
 20
152





APATEPASGSAATWG
 21
153





EPASGSAATWGQDVT
 22
154





GSAATWGQDVTSVPV
 23
155





TWGQDVTSVPVTRPA
 24
156





DVTSVPVTRPALGST
 25
157





VPVTRPALGSTTPPA
 26
158





RPALGSTTPPAHDVT
 27
159





GSTTPPAHDVTSAPD
 28
160





PPAHDVTSAPDNKPA
 29
161





DVTSAPDNKPAPGST
 30
162





APDNKPAPGSTAPPA
 31
163





KPAPGSTAPPAHGVT
 32
164





GSTAPPAHGVTSAPD
 33
165





PPAHGVTSAPDTRPA
 34
166





GVTSAPDTRPAPGST
 35
167





APDTRPAPGSTAPPA
 36
168





RPAPGSTAPPAHGVT
 37
169





GVTSAPDTRPALGST
 55
170





APDTRPALGSTAPPV
 56
171





RPALGSTAPPVHNVT
 57
172





GSTAPPVHNVTSASG
 58
173





PPVHNVTSASGSASG
 59
174





NVTSASGSASGSAST
 60
175





ASGSASGSASTLVHN
 61
176





ASGSASTLVHNGTSA
 62
177





ASTLVHNGTSARATT
 63
178





VHNGTSARATTTPAS
 64
179





TSARATTTPASKSTP
 65
180





ATTTPASKSTPFSIP
 66
181





PASKSTPFSIPSHHS
 67
182





STPFSIPSHHSDTPT
 68
183





SIPSHHSDTPTTLAS
 69
184





HHSDTPTTLASHSTK
 70
185





TPTTLASHSTKTDAS
 71
186





LASHSTKTDASSTHH
 72
187





STKTDASSTHHSSVP
 73
188





DASSTHHSSVPPLTS
 74
189





THHSSVPPLTSSNHS
 75
190





SVPPLTSSNHSTSPQ
 76
191





LTSSNHSTSPQLSTG
 77
192





NHSTSPQLSTGVSFF
 78
193





SPQLSTGVSFFFLSF
 79
194





STGVSFFFLSFHISN
 80
195





SFFFLSFHISNLQFN
 81
196





LSFHISNLQFNSSLE
 82
197





ISNLQFNSSLEDPST
 83
198





QFNSSLEDPSTDYYQ
 84
199





SLEDPSTDYYQELQR
 85
200





PSTDYYQELQRDISE
 86
201





YYQELQRDISEMFLQ
 87
202





LQRDISEMFLQIYKQ
 88
203





ISEMFLQIYKQGGFL
 89
204





FLQIYKQGGFLGLSN
 90
205





YKQGGFLGLSNIKFR
 91
206





GFLGLSNIKFRPGSV
 92X
207





LSNIKFRPGSVVVQL
 93X
208





KFRPGSVVVQLTLAF
 94X
209





GSVVVQLTLAFREGT
 95X
210





VVVQLTLAFREGTIN
 95XX
211





QLTLAFREGTINVHD
 96
212





AFREGTINVHDVETQ
 97
213





GTINVHDVETQFNQY
 98
214





VHDVETQFNQYKTEA
 99
215





ETQFNQYKTEAASRY
100
216





NQYKTEAASRYNLTI
101
217





TEAASRYNLTISDVS
102
218





SRYNLTISDVSVSDV
103
219





LTISDVSVSDVPFPF
104
220





DVSVSDVPFPFSAQS
105
221





SDVPFPFSAQSGAGV
106
222





FPFSAQSGAGVPGWG
107
223





AQSGAGVPGWGIALL
108
224





AGVPGWGIALLVLVC
109
225





GWGIALLVLVCVLVA
110
226





ALLVLVCVLVALAIV
111
227





LVCVLVALAIVYLIA
112
228





LVALAIVYLIALAVC
113
229





AIVYLIALAVCQCRR
114
230





LIALAVCQCRRKNYG
115
231





AVCQCRRKNYGQLDI
116
232





CRRKNYGQLDIFPAR
117
233





NYGQLDIFPARDTYH
118
234





LDIFPARDTYHPMSE
119
235





PARDTYHPMSEYPTY
120
236





TYHPMSEYPTYHTHG
121
237





MSEYPTYHTHGRYVP
122
238





PTYHTHGRYVPPSST
123
239





THGRYVPPSSTDRSP
124
240





YVPPSSTDRSPYEKV
125
241





SSTDRSPYEKVSAGN
126
242





RSPYEKVSAGNGGSS
127
243





EKVSAGNGGSSLSYT
128
244





AGNGGSSLSYTNPAV
129
245





GSSLSYTNPAVAAAS
130
246





LSYTNPAVAAASANL
131
247
















TABLE 16







Human MSLN Peptide Library peptide pools


and corresponding amino acid sequences









Amino Acid Sequence
Peptide #
SEQ ID NO





MASLPTARPLLGSCG
  1aS
248





TARPLLGSCGTPALG
  2
249





LLGSCGTPALGSLLF
  3
250





CGTPALGSLLFLLFS
  4
251





ALGSLLFLLFSLGWV
  5
252





LLFLLFSLGWVQPSR
  6
253





LFSLGWVQPSRTLAG
  7
254





GWVQPSRTLAGETGQ
  8
255





PSRTLAGETGQEAAP
  9
256





TLAGETGQEAAPLDG
 10X
257





TGQEAAPLDGVLANP
 11
258





AAPLDGVLANPPNIS
 12
259





DGVLANPPNISSLSP
 13
260





ANPPNISSLSPRQLL
 14
261





NISSLSPRQLLGFPC
 15
262





LSPRQLLGFPCAEVS
 16
263





QLLGFPCAEVSGLST
 17
264





FPCAEVSGLSTERVR
 18
265





EVSGLSTERVRELAV
 19
266





LSTERVRELAVALAQ
 20
267





RVRELAVALAQKNVK
 21
268





LAVALAQKNVKLSTE
 22
269





LAQKNVKLSTEQLRC
 23
270





NVKLSTEQLRCLAHR
 24
271





STEQLRCLAHRLSEP
 25
272





LRCLAHRLSEPPEDL
 26
273





AHRLSEPPEDLDALP
 27
274





SEPPEDLDALPLDLL
 28
275





EDLDALPLDLLLFLN
 29
276





ALPLDLLLFLNPDAF
 30
277





DLLLFLNPDAFSGPQ
 31
278





FLNPDAFSGPQACTR
 32
279





DAFSGPQACTRFFSR
 33
280





GPQACTRFFSRITKA
 34
281





CTRFFSRITKANVDL
 35
282





FSRITKANVDLLPRG
 36
283





TKANVDLLPRGAPER
 37
284





VDLLPRGAPERQRLL
 38
285





PRGAPERQRLLPAAL
 39
286





PERQRLLPAALACWG
 40
287





RLLPAALACWGVRGS
 41
288





AALACWGVRGSLLSE
 42
289





CWGVRGSLLSEADVR
 43
290





RGSLLSEADVRALGG
 44
291





LSEADVRALGGLACD
 45
292





DVRALGGLACDLPGR
 46
293





LGGLACDLPGRFVAE
 47
294





ACDLPGRFVAESAEV
 48
295





PGRFVAESAEVLLPR
 49
296





VAESAEVLLPRLVSC
 50
297





AEVLLPRLVSCPGPL
 51
298





LPRLVSCPGPLDQDQ
 52
299





VSCPGPLDQDQQEAA
 53
300





GPLDQDQQEAARAAL
 54
301





QDQQEAARAALQGGG
 55
302





EAARAALQGGGPPYG
 56
303





AALQGGGPPYGPPST
 57
304





GGGPPYGPPSTWSVS
 58
305





PYGPPSTWSVSTMDA
 59
306





PSTWSVSTMDALRGL
 60
307





SVSTMDALRGLLPVL
 61
308





MDALRGLLPVLGQPI
 62
309





RGLLPVLGQPIIRSI
 63
310





PVLGQPIIRSIPQGI
 64
311





QPIIRSIPQGIVAAW
 65
312





RSIPQGIVAAWRQRS
 66
313





QGIVAAWRQRSSRDP
 67
314





AAWRQRSSRDPSWRQ
 68
315





QRSSRDPSWRQPERT
 69
316





RDPSWRQPERTILRP
 70
317





WRQPERTILRPRFRR
 71
318





ERTILRPRFRREVEK
 72
319





LRPRFRREVEKTACP
 73
320





FRREVEKTACPSGKK
 74
321





VEKTACPSGKKAREI
 75
322





ACPSGKKAREIDESL
 76
323





GKKAREIDESLIFYK
 77
324





REIDESLIFYKKWEL
 78
325





ESLIFYKKWELEACV
 79
326





FYKKWELEACVDAAL
 80
327





WELEACVDAALLATQ
 81
328





ACVDAALLATQMDRV
 82
329





AALLATQMDRVNAIP
 83
330





ATQMDRVNAIPFTYE
 84
331





DRVNAIPFTYEQLDV
 85
332





AIPFTYEQLDVLKHK
 86
333





TYEQLDVLKHKLDEL
 87
334





LDVLKHKLDELYPQG
 88
335





KHKLDELYPQGYPES
 89
336





DELYPQGYPESVIQH
 90
337





PQGYPESVIQHLGYL
 91
338





PESVIQHLGYLFLKM
 92
339





IQHLGYLFLKMSPED
 93
340





GYLFLKMSPEDIRKW
 94
341





LKMSPEDIRKWNVTS
 95
342





PEDIRKWNVTSLETL
 96
343





RKWNVTSLETLKALL
 97
344





VTSLETLKALLEVNK
 98
345





ETLKALLEVNKGHEM
 99
346





ALLEVNKGHEMSPQV
100
347





VNKGHEMSPQVATLI
101
348





HEMSPQVATLIDRFV
102
349





PQVATLIDRFVKGRG
103
350





TLIDRFVKGRGQLDK
104
351





RFVKGRGQLDKDTLD
105
352





GRGQLDKDTLDTLTA
106
353





LDKDTLDTLTAFYPG
107
354





TLDTLTAFYPGYLCS
108
355





LTAFYPGYLCSLSPE
109
356





YPGYLCSLSPEELSS
110
357





LCSLSPEELSSVPPS
111
358





SPEELSSVPPSSIWA
112
359





LSSVPPSSIWAVRPQ
113
360





PPSSIWAVRPQDLDT
114
361





IWAVRPQDLDTCDPR
115
362





RPQDLDTCDPRQLDV
116
363





LDTCDPRQLDVLYPK
117
364





DPRQLDVLYPKARLA
118
365





LDVLYPKARLAFQNM
119
366





YPKARLAFQNMNGSE
120
367





RLAFQNMNGSEYFVK
121
368





QNMNGSEYFVKIQSF
122
369





GSEYFVKIQSFLGGA
123
370





FVKIQSFLGGAPTED
124
371





QSFLGGAPTEDLKAL
125
372





GGAPTEDLKALSQQN
126
373





TEDLKALSQQNVSMD
127
374





KALSQQNVSMDLATF
128
375





QQNVSMDLATFMKLR
129
376





SMDLATFMKLRTDAV
130
377





ATFMKLRTDAVLPLT
131
378





KLRTDAVLPLTVAEV
132
379





DAVLPLTVAEVQKLL
133
380





PLTVAEVQKLLGPHV
134
381





AEVQKLLGPHVEGLK
135
382





KLLGPHVEGLKAEER
136
383





PHVEGLKAEERHRPV
137
384





GLKAEERHRPVRDWI
138
385





EERHRPVRDWILRQR
139
386





RPVRDWILRQRQDDL
140
387





DWILRQRQDDLDTLG
141
388





RQRQDDLDTLGLGLQ
142
389





DDLDTLGLGLQGGIP
143
390





TLGLGLQGGIPNGYL
144
391





GLQGGIPNGYLVLDL
145
392





GIPNGYLVLDLSMQE
146
393





YLVLDLSMQEALSGT
147XX
394





LDLSMQEALSGTPCL
148
395





MQEALSGTPCLLGPG
149
396





LSGTPCLLGPGPVLT
150
397





PCLLGPGPVLTVLAL
151
398





GPGPVLTVLALLLAS
152
399





PVLTVLALLLASTLA
153
400
















TABLE 17







Human TERT Peptide Library peptide pools


and corresponding amino acid sequences









Amino Acid Sequence
Peptide #
SEQ ID NO












RRGAAPEPERTPVGQ
61
401





APEPERTPVGQGSWA
62
402





ERTPVGQGSWAHPGR
63
403





VGQGSWAHPGRTRGP
64
404





SWAHPGRTRGPSDRG
65
405





PGRTRGPSDRGFCVV
66
406





RGPSDRGFCVVSPAR
67
407





DRGFCVVSPARPAEE
68
408





CVVSPARPAEEATSL
69
409





PARPAEEATSLEGAL
70
410





AEEATSLEGALSGTR
71
411





TSLEGALSGTRHSHP
72
412





GALSGTRHSHPSVGR
73
413





GTRHSHPSVGRQHHA
74
414





SHPSVGRQHHAGPPS
75
415





VGRQHHAGPPSTSRP
76
416





HHAGPPSTSRPPRPW
77
417





PPSTSRPPRPWDTPC
78
418





SRPPRPWDTPCPPVY
79
419





RPWDTPCPPVYAETK
80
420





TPCPPVYAETKHFLY
81
421





PVYAETKHFLYSSGD
82
422





ETKHFLYSSGDKEQL
83
423





FLYSSGDKEQLRPSF
84
424





SGDKEQLRPSFLLSS
85
425





EQLRPSFLLSSLRPS
86
426





PSFLLSSLRPSLTGA
87
427





LSSLRPSLTGARRLV
88
428





RPSLTGARRLVETIF
89
429





TGARRLVETIFLGSR
90
430





RLVETIFLGSRPWMP
91
431





TIFLGSRPWMPGTPR
92
432





GSRPWMPGTPRRLPR
93
433





WMPGTPRRLPRLPQR
94
434





TPRRLPRLPQRYWQM
95
435





LPRLPQRYWQMRPLF
96
436





PQRYWQMRPLFLELL
97
437





WQMRPLFLELLGNHA
98
438





PLFLELLGNHAQCPY
99
439





ELLGNHAQCPYGVLL
100
440





NHAQCPYGVLLKTHC
101
441





CPYGVLLKTHCPLRA
102
442





VLLKTHCPLRAAVTP
103
443





THCPLRAAVTPAAGV
104
444





LRAAVTPAAGVCARE
105
445





VTPAAGVCAREKPQG
106
446





AGVCAREKPQGSVAA
107
447





AREKPQGSVAAPEEE
108
448





PQGSVAAPEEEDTDP
109
449





VAAPEEEDTDPRRLV
110
450





EEEDTDPRRLVQLLR
111
451





TDPRRLVQLLRQHSS
112
452





RLVQLLRQHSSPWQV
113
453





LLRQHSSPWQVYGFV
114
454





HSSPWQVYGFVRACL
115
455





WQVYGFVRACLRRLV
116
456





GFVRACLRRLVPPGL
117
457





ACLRRLVPPGLWGSR
118
458





RLVPPGLWGSRHNER
119
459





PGLWGSRHNERRFLR
120
460





GSRHNERRFLRNTKK
121
461





NERRFLRNTKKFISL
122
462





FLRNTKKFISLGKHA
123
463





TKKFISLGKHAKLSL
124
464





ISLGKHAKLSLQELT
125
465





KHAKLSLQELTWKMS
126
466





LSLQELTWKMSVRDC
127
467





ELTWKMSVRDCAWLR
128
468





KMSVRDCAWLRRSPG
129
469





RDCAWLRRSPGVGCV
130
470





WLRRSPGVGCVPAAE
131
471





SPGVGCVPAAEHRLR
132
472





GCVPAAEHRLREEIL
133
473





AAEHRLREEILAKFL
134
474





RLREEILAKFLHWLM
135
475





EILAKFLHWLMSVYV
136
476





KFLHWLMSVYVVELL
137
477





WLMSVYVVELLRSFF
138
478





VYVVELLRSFFYVTE
139
479





ELLRSFFYVTETTFQ
140
480





SFFYVTETTFQKNRL
141
481





VTETTFQKNRLFFYR
142
482





TFQKNRLFFYRKSVW
143
483





NRLFFYRKSVWSKLQ
144
484





FYRKSVWSKLQSIGI
145
485





SVWSKLQSIGIRQHL
146
486





KLQSIGIRQHLKRVQ
147
487





IGIRQHLKRVQLREL
148
488





QHLKRVQLRELSEAE
149
489





RVQLRELSEAEVRQH
150
490





RELSEAEVRQHREAR
151
491





EAEVRQHREARPALL
152
492





RQHREARPALLTSRL
153
493





EARPALLTSRLRFIP
154
494





ALLTSRLRFIPKPDG
155
495





SRLRFIPKPDGLRPI
156
496





FIPKPDGLRPIVNMD
157
497





PDGLRPIVNMDYVVG
158
498





RPIVNMDYVVGARTF
159
499





NMDYVVGARTFRREK
160
500





VVGARTFRREKRAER
161
501





RTFRREKRAERLTSR
162
502





REKRAERLTSRVKAL
163
503





AERLTSRVKALFSVL
164
504





TSRVKALFSVLNYER
165
505





KALFSVLNYERARRP
166
506





SVLNYERARRPGLLG
167
507





YERARRPGLLGASVL
168
508





RRPGLLGASVLGLDD
169
509





LLGASVLGLDDIHRA
170
510





SVLGLDDIHRAWRTF
171
511





LDDIHRAWRTFVLRV
172
512





HRAWRTFVLRVRAQD
173
513





RTFVLRVRAQDPPPE
174
514





LRVRAQDPPPELYFV
175
515





AQDPPPELYFVKVDV
176
516





PPELYFVKVDVTGAY
177
517





YFVKVDVTGAYDTIP
178
518





VDVTGAYDTIPQDRL
179
519





GAYDTIPQDRLTEVI
180
520





TIPQDRLTEVIASII
181
521





DRLTEVIASIIKPQN
182
522





EVIASIIKPQNTYCV
183
523





SIIKPQNTYCVRRYA
184
524





PQNTYCVRRYAVVQK
185
525





YCVRRYAVVQKAAHG
186
526





RYAVVQKAAHGHVRK
187
527





VQKAAHGHVRKAFKS
188
528





AHGHVRKAFKSHVST
189
529





VRKAFKSHVSTLTDL
190
530





FKSHVSTLTDLQPYM
191
531





VSTLTDLQPYMRQFV
192
532





TDLQPYMRQFVAHLQ
193
533





PYMRQFVAHLQETSP
194
534





QFVAHLQETSPLRDA
195
535





HLQETSPLRDAVVIE
196
536





TSPLRDAVVIEQSSS
197
537





RDAVVIEQSSSLNEA
198
538





VIEQSSSLNEASSGL
199
539





SSSLNEASSGLFDVF
200
540





NEASSGLFDVFLRFM
201
541





SGLFDVFLRFMCHHA
202
542





DVFLRFMCHHAVRIR
203
543





RFMCHHAVRIRGKSY
204
544





HHAVRIRGKSYVQCQ
205
545





RIRGKSYVQCQGIPQ
206
546





KSYVQCQGIPQGSIL
207
547





QCQGIPQGSILSTLL
208
548





IPQGSILSTLLCSLC
209
549





SILSTLLCSLCYGDM
210
550





TLLCSLCYGDMENKL
211
551





SLCYGDMENKLFAGI
212
552





GDMENKLFAGIRRDG
213
553





NKLFAGIRRDGLLLR
214
554





AGIRRDGLLLRLVDD
215
555





RDGLLLRLVDDFLLV
216
556





LLRLVDDFLLVTPHL
217
557





VDDFLLVTPHLTHAK
218
558





LLVTPHLTHAKTFLR
219
559





PHLTHAKTFLRTLVR
220
560





HAKTFLRTLVRGVPE
221
561





FLRTLVRGVPEYGCV
222
562





LVRGVPEYGCVVNLR
223
563





VPEYGCVVNLRKTVV
224
564





GCVVNLRKTVVNFPV
225
565





NLRKTVVNFPVEDEA
226
566





TVVNFPVEDEALGGT
227
567





FPVEDEALGGTAFVQ
228
568





DEALGGTAFVQMPAH
229
569





GGTAFVQMPANGLFP
230
570





FVQMPAHGLFPWCGL
231
571





PAHGLFPWCGLLLDT
232
572





LFPWCGLLLDTRTLE
233
573





CGLLLDTRTLEVQSD
234
574





LDTRTLEVQSDYSSY
235
575





TLEVQSDYSSYARTS
236
576





QSDYSSYARTSIRAS
237
577





SSYARTSIRASLTFN
238
578





RTSIRASLTFNRGFK
239
579





RASLTFNRGFKAGRN
240
580





TFNRGFKAGRNMRRK
241
581





GFKAGRNMRRKLFGV
242
582





GRNMRRKLFGVLRLK
243
583





RRKLFGVLRLKCHSL
244
584





FGVLRLKCHSLFLDL
245
585





RLKCHSLFLDLQVNS
246
586





HSLFLDLQVNSLQTV
247
587





LDLQVNSLQTVCTNI
248
588





VNSLQTVCTNIYKIL
249
589





QTVCTNIYKILLLQA
250
590





TNIYKILLLQAYRFH
251
591





KILLLQAYRFHACVL
252
592





LQAYRFHACVLQLPF
253
593





RFHACVLQLPFHQQV
254
594





CVLQLPFHQQVWKNP
255
595





LPFHQQVWKNPTFFL
256
596





QQVWKNPTFFLRVIS
257
597





KNPTFFLRVISDTAS
258
598





FFLRVISDTASLCYS
259
599





VISDTASLCYSILKA
260
600





TASLCYSILKAKNAG
261
601





CYSILKAKNAGMSLG
262
602





LKAKNAGMSLGAKGA
263
603





NAGMSLGAKGAAGPL
264
604





SLGAKGAAGPLPSEA
265
605





KGAAGPLPSEAVQWL
266
606





GPLPSEAVQWLCHQA
267
607





SEAVQWLCHQAFLLK
268
608





QWLCHQAFLLKLTRH
269
609





HQAFLLKLTRHRVTY
270
610





LLKLTRHRVTYVPLL
271
611





TRHRVTYVPLLGSLR
272
612





VTYVPLLGSLRTAQT
273
613





PLLGSLRTAQTQLSR
274
614





SLRTAQTQLSRKLPG
275
615





AQTQLSRKLPGTTLT
276
616





LSRKLPGTTLTALEA
277
617





LPGTTLTALEAAANP
278
618





TLTALEAAANPALPS
279
619





LEAAANPALPSDFKT
280
620





AANPALPSDFKTILD
281
621
















TABLE 18







Peptide Pools








Antigen
Peptide Pools





MUC1
116 sequential 15-mer peptides, overlapping by



11 amino acids, covering amino acids 1-224 and



945-1255 of the MUC1 precursor protein of SEQ



ID NO:1 (amino acid sequence of SEQ ID NO:8)


MSLN
153 sequential 15-mer peptides, overlapping by



11 amino acids, covering the entire MSLN



precursor protein sequence of SEQ ID NO:2.


TERT
221 sequential 15-mer peptides, overlapping by



11 amino acids, covering the TERTΔ240 protein



sequence of SEQ ID NO:10 (amino acids 239-1132



of SEQ ID NO:3 (total 894 amino acids,



(excluding the first 238 amino acids of the native



full-length TERT recursor protein of SEQ ID NO:3)
















TABLE 19







2A Peptides










2A Peptide
Amino Acid Sequence







FMD2A
QTLNFDLLKLAGDVESNPGP







T2A
EGRGSLLTCGDVEENPGP







EMC2A
HYAGYFADLLIHDIETNPGP







ERA2A
QCTNYALLKLAGDVESNPGP







ERB2A
TILSEGATNFSLLKLAGDVELNPGP







PT2A
ATNFSLLKQAGDVEENPGP










Example 7. Combination of Vaccines with Immune Modulators

The following example is provided to illustrate enhanced tumor growth inhibition effects when an anti-cancer vaccine was administered in combination with an anti-Cytotoxic T-Lymphocyte Antigen (CTLA4) antibody and/or an indoleamine 2,3-dioxygenase 1 (IDO1) inhibitor.


Study Procedures.


BALB-neuT mice were implanted on study day 0 with TUBO tumor cells by subcutaneous injection. Mice were dosed with 200 mg/Kg of 3-(5-fluoro-1H-indol-3-yl)pyrrolidine-2,5-dione (IDO1 inhibitor) or vehicle twice daily from study day 7 using oral gavage. Comparator groups were sham dosed with vehicle from study day 7 onwards. Appropriated mice were immunized on study day 10 with 1e10 Viral Particles of an adenovirus vector engineered to express rat HER2 (rHER2) (rHER2 vaccine) or vector lacking the rHER2 transgene (control vaccine), by intramuscular injection. Subsequently, 250 ug of an anti-CTLA4 antibody (murine monoclonal antibody to CTLA-4, clone 9D9) or an IgG2 isotype control monoclonal antibody was injected subcutaneously in close proximity to lymph nodes draining the site of adenovirus vector injection. Every two weeks thereafter, mice were immunized with 100 ug of a DNA plasmid encoding rHER2 (rHER2 vaccine) or a DNA plasmid lacking the rHER2 transgene (control vaccine) by electroporation. Subsequent to the DNA plasmid administration, 250 ug of the anti-CTLA4 antibody was injected subcutaneously in close proximity to lymph nodes draining the site of DNA plasmid injection. To track tumor progression, subcutaneous tumor volumes were measured twice a week throughout the study. Animals with subcutaneous tumor volumes that reached 2000 mm3 or displaying irreversible signs of disease were euthanized.


Results.


Subcutaneous tumor volumes of individual animals in each treatment group are presented in Tables 20-A-20-H.


No effect on tumor growth rates was observed in mice treated with the anti-CTLA4 antibody alone or with the IDO1 inhibitor alone. However, slower growth rates were observed in some of the animals treated with the rHER2 vaccine alone. Mice treated with the rHER2 vaccine in combination with the anti-CTLA4 antibody and mice treated with the rHER2 vaccine in combination with the IDO1 inhibitor had reduced tumor growth rates compared to the corresponding control animals. Tumor growth inhibition was most pronounced in mice treated with the rHER2 vaccine, the anti-CTLA4 antibody, and the IDO1 inhibitor.









TABLE 20-A







Subcutaneous tumor volumes from BALB-neuT mice treated with rHER2 vaccine, isotype control antibody, and vehicle








Study
Animal ID




















Day
001
002
003
004
005
006
007
008
009
010
011
012
013























7
15.28
24.88
25.22
43.22
20.92
23.31
54.61
18.97
15.63
7.26
34.97
23.85
26.51


11
59.85
51.25
32.16
70.17
53.95
33.47
58.64
27.65
23.43
24.93
52.01
30.46
64.37


14
69.49
58.15
44.48
92.14
77.00
48.03
94.39
35.07
28.64
28.73
95.93
60.86
76.06


18
121.53
105.11
69.57
162.26
147.15
89.85
200.97
64.56
54.34
48.57
268.43
62.34
99.72


21
177.93
109.81
78.17
182.61
145.82
106.58
194.34
63.14
71.46
88.39
254.23
83.27
137.39


24
209.82
89.80
80.60
186.71
130.91
120.51
309.21
70.57
101.02
90.27
340.71
80.33
151.06


27
251.78
178.06
145.48
172.65
203.23
132.37
304.55
129.14
107.72
127.13
324.79
113.27
147.59


32
288.46
299.49
182.91
299.93
228.06
119.13
357.37
132.57
171.17
155.00
466.10
139.30
163.84


35
442.65
518.22
233.63
307.12
283.16
209.64
434.25
208.03
213.44
233.02
481.62
260.75
261.80


39
419.12
503.33
442.52
345.36
355.59
231.06
432.68
318.63
315.93
286.47
572.77
298.59
303.23


42
379.48
513.54
449.02
340.25
362.51
254.14
487.55
294.58
349.26
379.28
626.35
286.86
319.48


46
601.65
778.43
637.73
453.39
899.49
292.45
519.25
294.40
531.22
342.83
642.31
445.75
300.56


49
525.83
682.34
768.94
337.45
594.31
291.11
632.67
388.48
639.75
491.05
631.72
408.40
308.73


53
618.09
893.01
932.23
391.25
576.25
280.96
657.04
503.44
829.63
456.57
606.13
491.55
447.34


56
793.23
1309.26
1085.82
411.50
412.62
350.51
750.48
685.26
1125.76
612.29
700.58
616.91
526.88


60
739.94
1422.57
1373.49
551.40
804.04
337.95
707.31
785.59
1195.66
563.75
843.39
638.94
693.70


63
741.90
1467.32
1450.32
446.17
1078.52
366.30
677.67
875.47
1369.64
687.52
845.94
700.93
563.40


66
866.83
1933.07
1695.44
407.94
1033.35
329.52
871.66
1274.41
1664.40
748.11
844.09
755.00
658.18


70
906.91

2055.70
454.26
1128.39
377.46
857.93
1429.06
1902.09
899.02
977.86
1151.34
739.87


74
1050.44


510.24
1176.17
431.46
953.57
1316.47

1008.84
1082.74
1132.80
737.69


77
1053.86


487.54
1454.97
504.43
974.43


1218.43
1062.30
1010.54
809.49


80
1195.52


560.59
1461.63
527.31
1298.82


1316.89
1165.74
1123.06



83
1211.15


591.58
1883.74
529.70
1530.85


1405.59
1132.02
1269.96



88
1999.58


680.13

489.05
1515.67


1704.43
1117.78




91



676.45

468.02



1731.76
1139.45




94



742.06

547.24




1340.71




98



848.97

778.30




1455.98




102



878.51

1299.14




1594.26




105



941.87

1052.06




1687.50




109



1033.39






1954.73




112















116















119















123















130
















TABLE 20-B







Subcutaneous tumor volumes from BALB-neuT mice treated with rHER2 vaccine, anti-CTLA4 antibody, and vehicle








Study
Animal ID




















Day
014
015
016
017
018
019
020
021
022
023
024
025
026























7
13.95
22.56
18.32
15.62
11.30
23.49
18.30
31.84
9.95
19.57
33.34
16.69
65.80


11
34.59
36.30
43.55
30.54
62.36
47.97
41.74
74.32
25.47
36.62
50.96
29.98
154.10


14
41.04
48.08
62.76
42.47
80.69
57.69
51.46
96.98
43.76
43.28
47.76
38.12
130.87


18
67.89
80.31
110.34
86.72
183.17
111.21
105.15
128.14
61.38
44.66
65.32
95.87
166.62


21
99.74
87.70
116.80
63.01
202.53
131.95
170.80
144.47
74.50
81.06
95.35
96.24
225.45


24
100.18
104.47
126.72
123.72
199.19
174.90
181.60
189.93
79.15
104.51
107.09
138.34
229.64


27
138.24
115.05
170.33
106.01
207.56
164.46
196.44
218.62
82.23
134.48
146.91
157.49
324.63


32
196.50
135.98
189.16
163.10
293.78
208.00
248.90
280.19
114.59
185.61
191.56
183.81
337.93


35
300.50
169.60
305.77
181.56
291.73
245.74
290.40
320.25
111.99
184.88
184.57
176.67
380.74


39
348.00
183.57
256.74
228.53
263.61
223.27
360.65
295.43
100.52
194.95
192.31
190.70
367.56


42
390.91
204.84
371.25
210.94
300.94
254.67
476.59
322.83
133.90
191.45
219.12
210.83
422.25


46
421.06
239.56
459.18
283.40
311.32
342.97
627.22
297.13
153.38
228.26
252.46
338.26
514.20


49
570.42
242.71
444.89
285.69
254.99
300.41
686.74
284.73
156.78
285.33
230.83
351.06
418.01


53
564.06
227.19
491.62
296.54
257.35
357.26
800.42
310.23
193.53
335.75
222.12
356.37
601.40


56
733.33
228.06
627.11
472.36
259.93
418.71
1013.00
302.14
219.62
383.69
241.56
449.13
609.87


60
897.14
267.39
607.90
517.19
312.72
420.79
1308.77
320.64
239.16
515.83
299.24
489.26
749.84


63
1057.26
268.83
660.87
445.35
316.86
483.64
1291.15
287.14
232.50
662.34
282.33
535.65
896.13


66
1300.92
322.12
896.63
481.50
348.28
488.58
1429.48
306.39
233.64
847.54
266.11
657.11
1007.19


70
1405.80
390.93
904.47
478.25
348.24
601.13
1420.89
382.32
315.81
804.92
268.72
760.97
977.72


74
1663.99
530.06
1051.68
520.03
404.21
658.56

367.96
440.99
955.16
344.38
794.70
1421.67


77
1926.01
573.89
1219.67
601.49
470.28
749.73

412.46
464.76
1194.80
329.63
901.75
1329.51


80

739.80
1349.40
718.31
394.95
752.93

420.98
495.99
1263.58
373.06
946.52
1232.01


83

877.75
1653.19
910.62
466.02
820.70

448.59
566.16
1553.21
438.83
942.35
1298.75


88

954.88

1265.55
846.03
937.01

414.87
788.55
1916.96
495.65
1301.75
2002.26


91

961.42

1174.80
866.62
954.49

491.20
846.32

581.42
1283.15



94

1053.93

1399.91
1002.14
1078.80

408.20
933.39

495.83
1539.79



98

1477.19

1785.93
1094.65
1355.24

480.75
1020.62

695.49




102

2005.53

2455.90
1132.60
1506.85

617.31
1196.80

1049.34




105




1137.28
1646.70

558.65
1519.06

973.46




109




1629.53
2411.79

567.21
1927.56

1376.50




112




1610.74


659.07


1331.93




116




1903.32


736.53


2020.97




119







843.09







123







812.58







130
















TABLE 20-C







Subcutaneous tumor volumes from BALB-neuT mice treated with rHER2


vaccine, isotype monoclonal antibody, and IDO1 inhibitor








Study
Animal ID




















Day
027
028
029
030
031
032
033
034
035
036
037
038
039























7
22.57
18.54
24.25
23.74
62.87
47.26
26.06
19.89
10.02
28.07
9.21
19.87
26.44


11
27.87
26.90
25.69
35.75
109.91
55.24
54.27
30.68
16.48
75.34
18.47
66.24
42.82


14
32.46
29.47
31.95
40.32
144.26
47.57
54.29
58.07
25.83
92.84
31.83
58.95
71.17


18
37.10
57.13
44.48
94.60
278.87
90.96
63.91
74.80
44.97
129.98
43.36
94.67
123.44


21
50.62
96.30
64.33
124.60
392.48
143.92
101.50
73.61
71.45
153.08
63.58
123.83
111.42


24
55.76
109.72
75.91
174.47
438.38
161.19
115.11
79.60
109.91
174.26
64.08
93.15
128.18


27
55.49
118.93
95.32
178.96
542.65
202.58
154.54
105.80
127.26
195.71
79.97
106.00
144.12


32
113.02
157.60
160.49
235.16
717.70
252.81
233.30
127.84
188.83
260.21
93.05
177.97
137.99


35
92.45
185.58
176.42
257.51
786.70
368.98
292.35
142.96
309.08
262.68
119.74
194.95
127.66


39
128.29
276.68
279.74
333.07
937.96
457.17
284.18
216.33
363.62
340.70
113.68
234.56
162.13


42
200.60
308.88
309.27
411.98
1141.65
546.41
378.60
193.55
445.98
279.28
139.47
238.77
171.63


46
245.58
362.11
390.14
554.66
1129.43
699.15
522.13
211.14
579.92
446.04
163.31
271.10
171.35


49
185.53
407.07
389.34
678.29
1357.08
663.42
435.48
199.16
548.74
496.65
256.92
327.15
158.18


53
234.92
572.92
472.69
760.44
1657.89
764.79
576.68
195.14
749.50
403.22
271.69
340.39
179.89


56
315.08
654.90
527.02
970.81
1830.37
918.21
811.53
215.44
1080.73
535.72
398.94
394.64
240.12


60
358.46
802.56
733.00
1126.99
2337.11
943.99
973.45
235.27
1169.89
727.24
431.20
437.25
219.66


63
329.23
988.22
686.07
1326.18

1114.22
1180.40
205.89
1491.79
749.53
706.03
443.52
228.26


66
419.20
1116.22
720.64
1550.51

1367.74
2093.28
183.53
1747.57
1500.11
948.51
536.57
249.72


70
474.17
1374.23
967.99
1760.87



227.05

1478.26
1065.41
623.91
248.73


74
624.62
1772.89
1197.73
2006.03



233.91

1494.91
1316.37
622.88
374.49


77
647.51
1989.96
1262.15




253.71

1990.94
1897.98
714.20
486.35


80


1539.37




247.06



746.30
361.49


83


2002.66




221.28



947.06
470.06


88







302.35



1049.34
607.71


91







283.62



1094.29
584.53


94







240.95



1223.56
707.49


98







267.69



1157.88
819.76


102







332.05



1588.42
1166.09


105















109















112















116















119















123















130
















TABLE 20-D







Subcutaneous tumor volumes from BALB-neuT mice treated with rHER2 vaccine, anti-CTLA4 antibody, and IDO1 inhibitor








Study
Animal ID




















Day
040
041
042
043
044
045
046
047
048
049
050
051
052























7
54.10
39.35
23.64
21.18
12.84
21.67
20.25
19.96
25.33
36.98
36.19
31.76
23.13


11
44.01
62.51
25.95
22.00
20.61
29.61
22.93
31.30
54.95
60.31
40.28
64.26
35.48


14
82.71
61.84
44.03
41.17
27.61
39.84
31.52
50.27
53.59
167.13
39.13
71.77
46.37


18
109.42
104.01
70.45
47.24
39.37
43.45
46.45
64.50
96.05
118.82
86.60
117.11
52.67


21
156.97
122.98
122.03
88.69
39.10
79.80
74.00
59.76
126.21
150.23
67.16
106.50
64.01


24
161.80
181.51
136.55
66.17
83.81
80.21
101.01
78.26
212.63
154.56
83.75
155.85
83.60


27
193.79
191.62
257.40
93.98
102.01
129.31
84.05
104.26
160.51
139.11
77.42
167.41
92.72


32
243.54
263.07
273.35
158.04
101.16
150.00
98.57
156.07
255.04
162.11
101.56
203.30
114.82


35
312.75
361.78
504.87
164.62
144.05
120.50
122.05
142.97
316.54
172.90
114.17
218.18
132.22


39
396.82
323.31
582.32
242.63
157.89
232.03
95.33
154.30
425.09
257.83
149.53
267.35
168.35


42
413.28
367.59
663.21
254.92
250.62
281.01
169.40
159.04
427.33
259.24
151.88
259.86
147.14


46
442.03
400.06
833.78
245.54
247.14
265.42
196.92
188.31
582.82
304.99
146.35
227.25
171.42


49
499.68
458.65
692.49
269.68
303.80
298.85
239.11
199.54
582.63
363.70
147.05
184.65
192.15


53
602.85
388.55
832.63
319.74
338.88
350.68
147.19
189.14
683.19
425.27
141.44
180.96
175.06


56
678.49
583.14
1172.40
313.88
375.36
490.44
121.54
250.75
1015.63
421.92
193.75
223.80
167.34


60
716.23
566.05
1993.58
297.92
405.37
488.16
168.02
276.09
1016.18
568.77
192.25
253.13
176.65


63
763.88
694.35

360.21
477.42
576.05
214.59
394.42
1118.06
623.54
160.79
259.78
142.51


66
903.52
896.37

398.70
639.62
743.61
272.91
395.65
1444.51

200.21
264.16
219.92


70
1067.20
981.05

432.21
590.19
768.14
239.03
427.52
1594.41

193.97
320.82
183.68


74
991.59
1190.31

573.68
743.70
903.33
222.29
428.53
1656.59

188.48
308.71
167.55


77
1018.46
1567.97

556.19
716.08
967.81
309.27
484.64
1917.82

194.87
253.57
162.01


80
1195.74
1390.97

574.12

1102.62
277.63
627.19


261.80
367.28
201.87


83
1331.93
1884.11

579.14

1695.16
256.90
690.39


292.88
325.23
199.87


88



772.39

1995.92
276.57
645.27


363.61
379.12
210.81


91



751.29


320.68
626.20


350.28
428.39
224.19


94



1288.49


335.59
627.27


402.96
462.28
238.84


98



1164.73


337.65
830.60


438.33
581.69
298.47


102



1324.12


409.66
1014.27


505.42
602.90
427.80


105



1202.44


467.05
1140.43


521.30
712.86
411.28


109



2079.90


483.78
1218.84


757.14
707.01
544.77


112






579.36
1346.57


607.66
873.67
598.32


116






814.25
1570.94


721.33
1148.33
658.27


119






782.56
1999.79


784.41
1318.46
601.06


123






661.23



664.85
1320.43
626.48


130






1027.75



883.59
1979.35
671.05
















TABLE 20-E







Subcutaneous tumor volumes from BALB-neuT mice treated with control vaccine, isotype monoclonal antibody, and vehicle








Study
Animal ID




















Day
053
054
055
056
057
058
059
060
061
062
063
064
065























7
15.08
18.70
72.45
17.86
31.00
18.49
33.40
29.51
67.11
24.58
10.81
23.92
19.49


11
58.60
54.25
123.35
30.28
58.33
33.39
50.68
123.50
101.88
40.82
37.88
46.17
54.79


14
66.13
57.25
141.53
59.92
51.27
38.54
69.03
149.25
115.84
59.04
60.55
47.41
60.04


18
100.35
127.83
169.74
108.08
98.62
74.59
93.79
221.58
216.32
66.67
150.77
88.44
96.81


21
104.51
155.77
207.70
135.72
129.89
107.63
104.90
323.75
280.31
81.26
154.01
106.39
153.56


24
164.46
178.17
273.86
194.10
166.70
130.99
108.08
428.86
388.16
121.42
204.26
240.02
179.89


27
173.12
266.11
433.12
274.20
221.81
175.65
208.91
501.03
393.66
143.97
228.35
196.57
262.10


32
240.12
374.31
702.18
390.43
326.57
241.87
243.68
603.91
567.21
223.65
309.24
290.32
450.62


35
372.09
483.08
708.07
543.61
542.74
318.46
343.70
890.20
705.62
251.46
424.33
286.92
397.85


39
455.22
657.38
939.28
588.96
567.05
467.47
473.88
956.11
993.67
395.46
526.97
308.71
620.57


42
585.12
765.03
1120.45
666.99
688.37
555.97
607.03
951.75
1173.89
463.83
672.59
469.69
773.08


46
791.60
1105.75
1323.69
1128.15
1155.17
702.22
789.24
1616.22
1451.45
639.83
934.86
479.77
927.62


49
1097.81
1189.35
2028.49
1236.32
1244.71
1014.51
1016.09
1914.25
2034.67
749.54
1173.78
707.56
1212.93


53
1363.43
1631.61

1657.80
1743.67
1081.25
1316.46


1274.45
1667.48
790.81
1474.56


56
1483.62
1904.26

1771.67
1688.79
1183.68
1311.02


1098.40
1953.44
960.80
1659.31


60
1901.83


2068.50
2061.22
1286.21
2034.92


1705.47

1061.59
1779.08


63





1517.04



1642.50

1308.21



66





1902.74



1940.74

1450.05



70















74















77















80















83















88















91















94















98















102















105















109















112















116















119















123















130
















TABLE 20-F







Subcutaneous tumor volumes from BALB-neuT mice treated with control vaccine, anti-CTLA4 antibody, and vehicle








Study
Animal ID




















Day
066
067
068
069
070
071
072
073
074
075
076
077
078























7
31.57
16.81
19.84
26.53
31.95
45.30
30.22
15.04
28.27
24.27
18.27
23.86
26.78


11
65.01
42.45
77.71
42.97
36.94
69.07
53.78
18.79
28.90
60.85
35.20
33.73
35.30


14
67.75
52.64
58.52
59.81
51.64
133.54
50.06
18.81
54.38
56.90
38.19
43.61
42.69


18
107.43
80.43
24.77
75.41
120.27
138.58
113.91
32.23
72.21
86.65
63.99
61.86
79.41


21
108.33
122.66
58.44
99.93
115.49
169.67
108.74
33.66
68.85
81.49
66.19
88.88
92.80


24
135.87
142.73
205.72
138.58
195.34
245.58
199.84
38.82
78.26
111.41
115.74
81.24
114.15


27
202.52
136.92
233.44
218.55
257.60
249.39
215.05
76.57
102.07
177.24
146.52
118.63
158.62


32
265.16
246.28
392.99
523.97
289.01
453.52
389.26
119.16
173.75
215.29
168.01
195.78
237.85


35
268.11
307.75
523.86
498.69
338.58
411.31
536.87
158.61
254.89
319.28
282.80
305.00
330.57


39
409.74
488.72
621.93
678.35
518.57
665.59
568.49
234.62
508.87
394.05
315.21
347.86
518.94


42
497.76
579.50
613.71
650.46
604.07
786.51
635.44
267.01
515.71
498.47
474.74
425.11
661.10


46
568.07
779.30
807.17
846.95
842.44
866.45
856.31
300.02
602.44
740.77
583.16
507.16
874.70


49
870.94
998.56
1070.92
1642.07
1027.26
1066.57
957.73
354.92
833.74
770.76
792.40
839.43
1103.46


53
924.06
1547.25
1372.06
2026.49
1295.16
1430.84
1522.16
498.99
1122.50
971.82
967.85
1050.27
1374.42


56
1119.84
1615.70
1971.09

1602.07

1567.94
598.49
1087.10
1101.63
1179.51
1148.98
1930.34


60
1734.81
2275.56


1953.31

2130.99
821.52
1460.04
1371.70
1568.95
1543.35



63
2187.87






881.14

1613.94
1944.88
1672.95



66







1097.85

2080.38

2213.55



70







1476.50







74







1925.13







77















80















83















88















91















94















98















102















105















109















112















116















119















123















130
















TABLE 20-G







Subcutaneous tumor volumes from BALB-neuT mice treated with control


vaccine, isotype control monoclonal antibody, and IDO1 inhibitor








Study
Animal ID




















Day
079
080
081
082
083
084
085
086
087
088
089
090
091























7
27.80
36.46
21.11
15.78
34.61
12.22
14.78
20.72
28.62
21.87
32.40
18.45
21.99


11
50.27
46.02
29.32
42.34
66.59
21.18
19.13
51.22
33.59
28.63
52.20
24.65
50.36


14
66.16
39.75
31.22
43.83
99.08
27.42
36.76
53.21
62.59
35.08
54.92
44.31
85.94


18
87.17
73.84
62.25
84.25
115.90
47.77
40.28
81.38
130.43
43.33
77.07
67.44
136.82


21
91.03
75.81
78.22
89.04
182.85
58.23
54.88
135.90
127.97
64.93
113.76
113.10
163.91


24
161.46
100.08
101.79
140.26
284.27
88.35
55.22
110.30
155.81
92.35
169.45
127.09
198.49


27
163.11
125.82
123.19
186.49
361.01
112.48
80.13
147.33
241.15
110.42
171.37
129.44
240.05


32
252.04
251.57
194.14
275.98
541.91
153.38
110.90
184.76
321.37
173.28
301.94
202.21
337.79


35
324.53
262.56
246.60
364.82
598.92
209.97
141.15
244.33
521.30
260.28
306.58
377.96
401.70


39
414.72
434.13
389.39
471.60
671.98
338.06
192.15
328.32
572.30
343.13
512.15
430.30
596.09


42
603.00
551.64
463.99
601.50
820.44
340.52
268.88
441.62
676.89
408.33
574.86
574.25
680.54


46
660.63
696.77
782.22
933.81
997.91
431.11
345.63
682.81
1060.91
604.23
818.67
719.57
909.66


49
685.30
917.68
1138.52
1124.52
1219.32
609.28
470.15
807.07
1164.50
629.55
940.30
942.51
1045.76


53
864.42
1073.56
1288.73
1449.44
1275.84
735.98
547.89
1167.81
1618.54
792.50
1373.25
1139.13
1614.31


56
943.28
1323.69
1631.81
1937.45
2064.17
952.77
893.91
1714.64
1754.98
1128.21
1630.88
1431.28
1471.68


60
1384.35
1653.35
1673.55


1107.86
928.10
1923.35
1918.84
1450.47

1946.29



63
1600.66
2089.13
1682.01


1426.67
938.92


1652.48





66
1776.61

1982.89


1416.50
1198.20


1985.40





70
2186.37




1974.53
1804.46








74






1816.96








77






2039.55








80















83















88















91















94















98















102















105















109















112















116















119















123















130
















TABLE 20-H







Subcutaneous tumor volumes from BALB-neuT mice treated with control vaccine, anti-CTLA4 antibody, and IDO1 inhibitor








Study
Animal ID




















Day
092
093
094
095
096
097
098
099
100
101
102
103
104























7
23.50
79.61
37.58
33.69
19.24
51.28
54.39
19.99
17.96
31.15
41.65
32.98
14.52


11
45.95
175.07
60.28
42.51
34.51
127.99
62.57
55.07
51.65
88.69
90.89
44.64
17.86


14
63.89
163.67
77.18
67.42
37.76
116.30
79.39
64.35
48.63
82.68
82.10
61.33
31.37


18
97.30
243.40
197.71
102.33
112.32
153.27
92.24
113.19
68.81
140.37
217.09
87.60
42.00


21
160.23
249.28
155.64
109.24
159.77
171.07
124.87
141.01
104.12
184.89
223.66
124.77
52.28


24
214.44
358.20
178.52
146.05
155.75
189.29
160.00
185.05
133.44
222.78
308.93
149.93
65.39


27
240.41
415.24
198.00
191.28
267.39
298.20
231.41
157.93
191.15
238.17
416.37
211.67
87.13


32
513.57
601.62
385.73
344.44
444.06
376.94
324.54
244.96
328.38
365.22
635.31
358.92
99.41


35
616.99
692.22
389.96
455.32
417.99
484.35
441.24
264.56
333.93
437.71
813.28
385.04
177.23


39
715.16
1023.24
500.83
638.78
601.10
775.30
639.05
308.73
509.92
543.97
905.65
530.66
235.25


42
717.28
1165.74
503.20
815.34
596.80
798.96
795.51
361.27
438.96
638.64
1106.64
673.19
239.49


46
1123.80
1329.85
768.11
1034.27
895.57
1266.07
1001.88
500.68
781.21
813.68
1270.88
827.87
352.48


49
1401.34
1734.62
1016.27
1222.00
945.71
1346.31
1044.92
712.92
1214.73
954.08
1780.32
843.85
571.40


53
1589.06
2021.70
1176.32
1559.52
1296.01
1620.80
1558.21
958.70
1154.16

2036.34
931.60
673.63


56
2311.25

1343.46
1818.39
1465.17
2067.77
1760.94
1195.69
1541.24


1296.52
901.88


60


1631.83
2068.78
1667.99

1626.18
1630.40
1783.77


1468.55
1185.11


63


1969.65

1571.44


1651.73
2028.86


1737.96
1296.69


66







1927.56




1929.80


70















74















77















80















83















88















91















94















98















102















105















109















112















116















119















123















130

















RAW SEQUENCE LISTING



MUC1 Isoform 1 protein (Reference Polypeptide; Uniprot P15941-1)


(human)


SEQ ID NO: 1



MTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTS






SVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPA





HDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTA





PPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPG





STAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRP





APGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPD





TRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTS





APDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHG





VTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPP





AHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGST





APPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAP





GSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTR





PAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAP





DTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVT





SAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAH





GVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAP





PAHGVTSAPDTRPAPGSTAPPAHGVTSAPDNRPALGSTAPPVHNVTSASGSASGSAS





TLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSN





HSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGL





SNIKFRPGSWVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFS





AQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSE





YPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAATSANL





Mesothelin Isoform 2 precursor protein (Reference Polypeptide;


Uniprot Q13421-3) (human)


SEQ ID NO: 2



MALPTARPLLGSCGTPALGSLLFLLFSLGWVQPSRTLAGETGQEAAPLDGVLANPPNIS






SLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPEDLDAL





PLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGVRGSLL





SEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQGGGPPY





GPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERTILRPRF





RREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYEQLDVL





KHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHEMSPQ





VATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDLDTCD





PRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATFMKLR





TDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQGGIPNG





YLVLDLSMQEALSGTPCLLGPGPVLTVLALLLASTLA





TERT Isoform 1 protein (Reference Polypeptide; Genbank


AAD30037, Uniprot 014746-1) (human)


SEQ ID NO: 3



MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDPAAFRALVAQCLVC






VPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGARGGPPEA





FTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLARCALFVLVAPSCAYQVCG





PPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPAPGARRRGGSA





SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATS





LEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQL





RPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGN





HAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHS





SPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMS





VRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKN





RLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGL





RPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHR





AWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVV





QKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGL





FDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLL





RLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQM





PAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRL





KCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTA





SLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRT





AQTQLSRKLPGTTLTALEAAANPALPSDFKTILD





AdC68Y Empty


SEQ ID NO: 4



ccatcttcaataatatacctcaaactttttgtgcgcgttaatatgcaaatgaggcgtttgaatttggggaggaagggcggtgatt






ggtcgagggatgagcgaccgttaggggcggggcgagtgacgttttgatgacgtggttgcgaggaggagccagtttgcaa





gttctcgtgggaaaagtgacgtcaaacgaggtgtggtttgaacacggaaatactcaattttcccgcgctctctgacaggaaa





tgaggtgtttctgggcggatgcaagtgaaaacgggccattttcgcgcgaaaactgaatgaggaagtgaaaatctgagtaa





tttcgcgtttatggcagggaggagtatttgccgagggccgagtagactttgaccgattacgtgggggtttcgattaccgtgttttt





cacctaaatttccgcgtacggtgtcaaagtccggtgthttactactgtaatagtaatcaattacggggtcattagttcatagccc





atatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtc





aataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgccc





acttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattat





gcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttg





gcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgt





tttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtac





ggtgggaggtctatataagcagagctgtccctatcagtgatagagatctccctatcagtgatagagagtttagtgaaccgtc





agatccgctagggtaccgcgatcgcacctcgagctgatcataatcagccataccacatttgtagaggttttacttgctttaaa





aaacctcccacacctccccctgaacctgaaacataaaatgaatgcaattgttgttgttaacttgtttattgcagcttataatggtt





acaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaat





gtatcttaccaggtgccgagcctgcgagtgcggagggaagcatgccaggttccagcccgtgtgtgtggatgtgacggagg





acctgcgacccgatcatttggtgttgccctgcaccgggacggagttcggttccagcggggaagaatctgactagagtgagt





agtgttctggggcgggggaggacctgcatgagggccagaataactgaaatctgtgcttttctgtgtgttgcagcagcatgag





cggaagcggctcctttgagggaggggtattcagcccttatctgacggggcgtctcccctcctgggcgggagtgcgtcagaa





tgtgatgggatccacggtggacggccggcccgtgcagcccgcgaactcttcaaccctgacctatgcaaccctgagctcttc





gtcgttggacgcagctgccgccgcagctgctgcatctgccgccagcgccgtgcgcggaatggccatgggcgccggctac





tacggcactctggtggccaactcgagttccaccaataatcccgccagcctgaacgaggagaagctgttgctgctgatggc





ccagctcgaggccttgacccagcgcctgggcgagctgacccagcaggtggctcagctgcaggagcagacgcgggccg





cggttgccacggtgaaatccaaataaaaaatgaatcaataaataaacggagacggttgttgattttaacacagagtctgaa





tctttatttgatttttcgcgcgcggtaggccctggaccaccggtctcgatcattgagcacccggtggatcttttccaggacccgg





tagaggtgggcttggatgttgaggtacatgggcatgagcccgtcccgggggtggaggtagctccattgcagggcctcgtgc





tcgggggtggtgttgtaaatcacccagtcatagcaggggcgcagggcatggtgttgcacaatatctttgaggaggagactg





atggccacgggcagccctttggtgtaggtgtttacaaatctgttgagctgggagggatgcatgcggggggagatgaggtgc





atcttggcctggatcttgagattggcgatgttaccgcccagatcccgcctggggttcatgttgtgcaggaccaccagcacggt





gtatccggtgcacttggggaatttatcatgcaacttggaagggaaggcgtgaaagaatttggcgacgcctttgtgcccgccc





aggttttccatgcactcatccatgatgatggcgatgggcccgtgggcggcggcctgggcaaagacgtttcgggggtcgga





cacatcatagttgtggtcctgggtgaggtcatcataggccattttaatgaatttggggcggagggtgccggactgggggaca





aaggtaccctcgatcccgggggcgtagttcccctcacagatctgcatctcccaggctttgagctcggagggggggatcatg





tccacctgcggggcgataaagaacacggtttccggggcgggggagatgagctgggccgaaagcaagttccggagcag





ctgggacttgccgcagccggtggggccgtagatgaccccgatgaccggctgcaggtggtagttgagggagagacagct





gccgtcctcccggaggaggggggccacctcgttcatcatctcgcgcacgtgcatgttctcgcgcaccagttccgccagga





ggcgctctccccccagggataggagctcctggagcgaggcgaagtttttcagcggcttgagtccgtcggccatgggcatttt





ggagagggtttgttgcaagagttccaggcggtcccagagctcggtgatgtgctctacggcatctcgatccagcagacctcct





cgtttcgcgggttgggacggctgcgggagtagggcaccagacgatgggcgtccagcgcagccagggtccggtccttcca





gggtcgcagcgtccgcgtcagggtggtctccgtcacggtgaaggggtgcgcgccgggctgggcgcttgcgagggtgcgc





ttcaggctcatccggctggtcgaaaaccgctcccgatcggcgccctgcgcgtcggccaggtagcaattgaccatgagttcg





tagttgagcgcctcggccgcgtggcctttggcgcggagcttacctttggaagtctgcccgcaggcgggacagaggaggg





acttgagggcgtagagcttgggggcgaggaagacggactcgggggcgtaggcgtccgcgccgcagtgggcgcagac





ggtctcgcactccacgagccaggtgaggtcgggctggtcggggtcaaaaaccagtttcccgccgttctttttgatgcgtttctt





acctttggtctccatgagctcgtgtccccgctgggtgacaaagaggctgtccgtgtccccgtagaccgactttatgggccggt





cctcgagcggtgtgccgcggtcctcctcgtagaggaaccccgcccactccgagacgaaagcccgggtccaggccagc





acgaaggaggccacgtgggacgggtagcggtcgttgtccaccagcgggtccaccttttccagggtatgcaaacacatgtc





cccctcgtccacatccaggaaggtgattggcttgtaagtgtaggccacgtgaccgggggtcccggccgggggggtataa





aagggtgcgggtccctgctcgtcctcactgtcttccggatcgctgtccaggagcgccagctgttggggtaggtattccctctc





gaaggcgggcatgacctcggcactcaggttgtcagtttctagaaacgaggaggatttgatattgacggtgccggcggaga





tgcctttcaagagcccctcgtccatctggtcagaaaagacgatctttttgttgtcgagcttggtggcgaaggagccgtagagg





gcgttggagaggagcttggcgatggagcgcatggtctggtttttttccttgtcggcgcgctccttggcggcgatgttgagctgc





acgtactcgcgcgccacgcacttccattcggggaagacggtggtcagctcgtcgggcacgattctgacctgccagccccg





attatgcagggtgatgaggtccacactggtggccacctcgccgcgcaggggctcattagtccagcagaggcgtccgccct





tgcgcgagcagaaggggggcagggggtccagcatgacctcgtcgggggggtcggcatcgatggtgaagatgccgggc





aggaggtcggggtcaaagtagctgatggaagtggccagatcgtccagggcagcttgccattcgcgcacggccagcgcg





cgctcgtagggactgaggggcgtgccccagggcatgggatgggtaagcgcggaggcgtacatgccgcagatgtcgtag





acgtagaggggctcctcgaggatgccgatgtaggtggggtagcagcgccccccgcggatgctggcgcgcacgtagtcat





acagctcgtgcgagggggcgaggagccccgggcccaggttggtgcgactgggcttttcggcgcggtagacgatctggc





ggaaaatggcatgcgagttggaggagatggtgggcctttggaagatgttgaagtgggcgtggggcagtccgaccgagtc





gcggatgaagtgggcgtaggagtcttgcagcttggcgacgagctcggcggtgactaggacgtccagagcgcagtagtcg





agggtctcctggatgatgtcatacttgagctgtcccttttgtttccacagctcgcggttgagaaggaactcttcgcggtccttcca





gtactcttcgagggggaacccgtcctgatctgcacggtaagagcctagcatgtagaactggttgacggccttgtaggcgca





gcagcccttctccacggggagggcgtaggcctgggcggccttgcgcagggaggtgtgcgtgagggcgaaagtgtccct





gaccatgaccttgaggaactggtgcttgaagtcgatatcgtcgcagcccccctgctcccagagctggaagtccgtgcgctt





cttgtaggcggggttgggcaaagcgaaagtaacatcgttgaagaggatcttgcccgcgcggggcataaagttgcgagtg





atgcggaaaggttggggcacctcggcccggttgttgatgacctgggcggcgagcacgatctcgtcgaagccgttgatgttg





tggcccacgatgtagagttccacgaatcgcggacggcccttgacgtggggcagtttcttgagctcctcgtaggtgagctcgt





cggggtcgctgagcccgtgctgctcgagcgcccagtcggcgagatgggggttggcgcggaggaaggaagtccagaga





tccacggccagggcggtttgcagacggtcccggtactgacggaactgctgcccgacggccattttttcgggggtgacgca





gtagaaggtgcgggggtccccgtgccagcgatcccatttgagctggagggcgagatcgagggcgagctcgacgagcc





ggtcgtccccggagagtttcatgaccagcatgaaggggacgagctgcttgccgaaggaccccatccaggtgtaggtttcc





acatcgtaggtgaggaagagcctttcggtgcgaggatgcgagccgatggggaagaactggatctcctgccaccaattgg





aggaatggctgttgatgtgatggaagtagaaatgccgacggcgcgccgaacactcgtgcttgtgtttatacaagcggccac





agtgctcgcaacgctgcacgggatgcacgtgctgcacgagctgtacctgagttcctttgacgaggaatttcagtgggaagt





ggagtcgtggcgcctgcatctcgtgctgtactacgtcgtggtggtcggcctggccctcttctgcctcgatggtggtcatgctga





cgagcccgcgcgggaggcaggtccagacctcggcgcgagcgggtcggagagcgaggacgagggcgcgcaggccg





gagctgtccagggtcctgagacgctgcggagtcaggtcagtgggcagcggcggcgcgcggttgacttgcaggagtttttc





cagggcgcgcgggaggtccagatggtacttgatctccaccgcgccattggtggcgacgtcgatggcttgcagggtcccgt





gcccctggggtgtgaccaccgtcccccgtttcttcttgggcggctggggcgacgggggcggtgcctcttccatggttagaag





cggcggcgaggacgcgcgccgggcggcaggggcggctcggggcccggaggcaggggcggcaggggcacgtcgg





cgccgcgcgcgggtaggttctggtactgcgcccggagaagactggcgtgagcgacgacgcgacggttgacgtcctggat





ctgacgcctctgggtgaaggccacgggacccgtgagtttgaacctgaaagagagttcgacagaatcaatctcggtatcgtt





gacggcggcctgccgcaggatctcttgcacgtcgcccgagttgtcctggtaggcgatctcggtcatgaactgctcgatctcct





cctcttgaaggtctccgcggccggcgcgctccacggtggccgcgaggtcgttggagatgcggcccatgagctgcgagaa





ggcgttcatgcccgcctcgttccagacgcggctgtagaccacgacgccctcgggatcgcGggcgcgcatgaccacctg





ggcgaggttgagctccacgtggcgcgtgaagaccgcgtagttgcagaggcgctggtagaggtagttgagcgtggtggcg





atgtgctcggtgacgaagaaatacatgatccagcggcggagcggcatctcgctgacgtcgcccagcgcctccaaacgtt





ccatggcctcgtaaaagtccacggcgaagttgaaaaactgggagttgcgcgccgagacggtcaactcctcctccagaag





acggatgagctcggcgatggtggcgcgcacctcgcgctcgaaggcccccgggagttcctccacttcctcttcttcctcctcc





actaacatctcttctacttcctcctcaggcggcagtggtggcgggggagggggcctgcgtcgccggcggcgcacgggca





gacggtcgatgaagcgctcgatggtctcgccgcgccggcgtcgcatggtctcggtgacggcgcgcccgtcctcgcgggg





ccgcagcgtgaagacgccgccgcgcatctccaggtggccgggggggtccccgttgggcagggagagggcgctgacg





atgcatcttatcaattgccccgtagggactccgcgcaaggacctgagcgtctcgagatccacgggatctgaaaaccgctg





aacgaaggcttcgagccagtcgcagtcgcaaggtaggctgagcacggtttcttctggcgggtcatgttggttgggagcggg





gcgggcgatgctgctggtgatgaagttgaaataggcggttctgagacggcggatggtggcgaggagcaccaggtctttgg





gcccggcttgctggatgcgcagacggtcggccatgccccaggcgtggtcctgacacctggccaggtccttgtagtagtcct





gcatgagccgctccacgggcacctcctcctcgcccgcgcggccgtgcatgcgcgtgagcccgaagccgcgctggggct





ggacgagcgccaggtcggcgacgacgcgctcggcgaggatggcttgctggatctgggtgagggtggtctggaagtcatc





aaagtcgacgaagcggtggtaggctccggtgttgatggtgtaggagcagttggccatgacggaccagttgacggtctggt





ggcccggacgcacgagctcgtggtacttgaggcgcgagtaggcgcgcgtgtcgaagatgtagtcgttgcaggtgcgcac





caggtactggtagccgatgaggaagtgcggcggcggctggcggtagagcggccatcgctcggtggcgggggcgccgg





gcgcgaggtcctcgagcatggtgcggtggtagccgtagatgtacctggacatccaggtgatgccggcggcggtggtgga





ggcgcgcgggaactcgcggacgcggttccagatgttgcgcagcggcaggaagtagttcatggtgggcacggtctggcc





cgtgaggcgcgcgcagtcgtggatgctctatacgggcaaaaacgaaagcggtcagcggctcgactccgtggcctggag





gctaagcgaacgggttgggctgcgcgtgtaccccggttcgaatctcgaatcaggctggagccgcagctaacgtggtattg





gcactcccgtctcgacccaagcctgcaccaaccctccaggatacggaggcgggtcgttttgcaacttttttttggaggccgg





atgagactagtaagcgcggaaagcggccgaccgcgatggctcgctgccgtagtctggagaagaatcgccagggttgcg





ttgcggtgtgccccggttcgaggccggccggattccgcggctaacgagggcgtggctgccccgtcgtttccaagaccccat





agccagccgacttctccagttacggagcgagcccctcttttgttttgtttgtttttgccagatgcatcccgtactgcggcagatgc





gcccccaccaccctccaccgcaacaacagccccctccacagccggcgcttctgcccccgccccagcagcaacttccag





ccacgaccgccgcggccgccgtgagcggggctggacagagttatgatcaccagctggccttggaagagggcgagggg





ctggcgcgcctgggggcgtcgtcgccggagcggcacccgcgcgtgcagatgaaaagggacgctcgcgaggcctacgt





gcccaagcagaacctgttcagagacaggagcggcgaggagcccgaggagatgcgcgcggcccggttccacgcggg





gcgggagctgcggcgcggcctggaccgaaagagggtgctgagggacgaggatttcgaggcggacgagctgacggg





gatcagccccgcgcgcgcgcacgtggccgcggccaacctggtcacggcgtacgagcagaccgtgaaggaggagag





caacttccaaaaatccttcaacaaccacgtgcgcaccctgatcgcgcgcgaggaggtgaccctgggcctgatgcacctgt





gggacctgctggaggccatcgtgcagaaccccaccagcaagccgctgacggcgcagctgttcctggtggtgcagcata





gtcgggacaacgaagcgttcagggaggcgctgctgaatatcaccgagcccgagggccgctggctcctggacctggtga





acattctgcagagcatcgtggtgcaggagcgcgggctgccgctgtccgagaagctggcggccatcaacttctcggtgctg





agtttgggcaagtactacgctaggaagatctacaagaccccgtacgtgcccatagacaaggaggtgaagatcgacgggt





tttacatgcgcatgaccctgaaagtgctgaccctgagcgacgatctgggggtgtaccgcaacgacaggatgcaccgtgcg





gtgagcgccagcaggcggcgcgagctgagcgaccaggagctgatgcatagtctgcagcgggccctgaccggggccg





ggaccgagggggagagctactttgacatgggcgcggacctgcactggcagcccagccgccgggccttggaggcggcg





gcaggaccctacgtagaagaggtggacgatgaggtggacgaggagggcgagtacctggaagactgatggcgcgacc





gtatttttgctagatgcaacaacaacagccacctcctgatcccgcgatgcgggcggcgctgcagagccagccgtccggca





ttaactcctcggacgattggacccaggccatgcaacgcatcatggcgctgacgacccgcaaccccgaagcctttagaca





gcagccccaggccaaccggctctcggccatcctggaggccgtggtgccctcgcgctccaaccccacgcacgagaaggt





cctggccatcgtgaacgcgctggtggagaacaaggccatccgcggcgacgaggccggcctggtgtacaacgcgctgct





ggagcgcgtggcccgctacaacagcaccaacgtgcagaccaacctggaccgcatggtgaccgacgtgcgcgaggcc





gtggcccagcgcgagcggttccaccgcgagtccaacctgggatccatggtggcgctgaacgccttcctcagcacccagc





ccgccaacgtgccccggggccaggaggactacaccaacttcatcagcgccctgcgcctgatggtgaccgaggtgcccc





agagcgaggtgtaccagtccgggccggactacttcttccagaccagtcgccagggcttgcagaccgtgaacctgagcca





ggctttcaagaacttgcagggcctgtggggcgtgcaggccccggtcggggaccgcgcgacggtgtcgagcctgctgacg





ccgaactcgcgcctgctgctgctgctggtggcccccttcacggacagcggcagcatcaaccgcaactcgtacctgggcta





cctgattaacctgtaccgcgaggccatcggccaggcgcacgtggacgagcagacctaccaggagatcacccacgtga





gccgcgccctgggccaggacgacccgggcaacctggaagccaccctgaactttttgctgaccaaccggtcgcagaaga





tcccgccccagtacgcgctcagcaccgaggaggagcgcatcctgcgttacgtgcagcagagcgtgggcctgttcctgatg





caggagggggccacccccagcgccgcgctcgacatgaccgcgcgcaacatggagcccagcatgtacgccagcaac





cgcccgttcatcaataaactgatggactacttgcatcgggcggccgccatgaactctgactatttcaccaacgccatcctga





atccccactggctcccgccgccggggttctacacgggcgagtacgacatgcccgaccccaatgacgggttcctgtggga





cgatgtggacagcagcgtgttctccccccgaccgggtgctaacgagcgccccttgtggaagaaggaaggcagcgaccg





acgcccgtcctcggcgctgtccggccgcgagggtgctgccgcggcggtgcccgaggccgccagtcctttcccgagcttgc





ccttctcgctgaacagtatccgcagcagcgagctgggcaggatcacgcgcccgcgcttgctgggcgaagaggagtactt





gaatgactcgctgttgagacccgagcgggagaagaacttccccaataacgggatagaaagcctggtggacaagatga





gccgctggaagacgtatgcgcaggagcacagggacgatccccgggcgtcgcagggggccacgagccggggcagcg





ccgcccgtaaacgccggtggcacgacaggcagcggggacagatgtgggacgatgaggactccgccgacgacagca





gcgtgttggacttgggtgggagtggtaacccgttcgctcacctgcgcccccgtatcgggcgcatgatgtaagagaaaccg





aaaataaatgatactcaccaaggccatggcgaccagcgtgcgttcgtttcttctctgttgttgttgtatctagtatgatgaggcgt





gcgtacccggagggtcctcctccctcgtacgagagcgtgatgcagcaggcgatggcggcggcggcgatgcagcccccg





ctggaggctccttacgtgcccccgcggtacctggcgcctacggaggggcggaacagcattcgttactcggagctggcacc





cttgtacgataccacccggttgtacctggtggacaacaagtcggcggacatcgcctcgctgaactaccagaacgaccac





agcaacttcctgaccaccgtggtgcagaacaatgacttcacccccacggaggccagcacccagaccatcaactttgacg





agcgctcgcggtggggcggccagctgaaaaccatcatgcacaccaacatgcccaacgtgaacgagttcatgtacagca





acaagttcaaggcgcgggtgatggtctcccgcaagacccccaatggggtgacagtgacagaggattatgatggtagtca





ggatgagctgaagtatgaatgggtggaatttgagctgcccgaaggcaacttctcggtgaccatgaccatcgacctgatga





acaacgccatcatcgacaattacttggcggtggggcggcagaacggggtgctggagagcgacatcggcgtgaagttcg





acactaggaacttcaggctgggctgggaccccgtgaccgagctggtcatgcccggggtgtacaccaacgaggctttccat





cccgatattgtcttgctgcccggctgcggggtggacttcaccgagagccgcctcagcaacctgctgggcattcgcaagag





gcagcccttccaggaaggcttccagatcatgtacgaggatctggaggggggcaacatccccgcgctcctggatgtcgac





gcctatgagaaaagcaaggaggatgcagcagctgaagcaactgcagccgtagctaccgcctctaccgaggtcagggg





cgataattttgcaagcgccgcagcagtggcagcggccgaggcggctgaaaccgaaagtaagatagtcattcagccggt





ggagaaggatagcaagaacaggagctacaacgtactaccggacaagataaacaccgcctaccgcagctggtaccta





gcctacaactatggcgaccccgagaagggcgtgcgctcctggacgctgctcaccacctcggacgtcacctgcggcgtgg





agcaagtctactggtcgctgcccgacatgatgcaagacccggtcaccttccgctccacgcgtcaagttagcaactacccg





gtggtgggcgccgagctcctgcccgtctactccaagagcttcttcaacgagcaggccgtctactcgcagcagctgcgcgc





cttcacctcgcttacgcacgtcttcaaccgcttccccgagaaccagatcctcgtccgcccgcccgcgcccaccattaccac





cgtcagtgaaaacgttcctgctctcacagatcacgggaccctgccgctgcgcagcagtatccggggagtccagcgcgtg





accgttactgacgccagacgccgcacctgcccctacgtctacaaggccctgggcatagtcgcgccgcgcgtcctctcgag





ccgcaccttctaaatgtccattctcatctcgcccagtaataacaccggttggggcctgcgcgcgcccagcaagatgtacgg





aggcgctcgccaacgctccacgcaacaccccgtgcgcgtgcgcgggcacttccgcgctccctggggcgccctcaaggg





ccgcgtgcggtcgcgcaccaccgtcgacgacgtgatcgaccaggtggtggccgacgcgcgcaactacacccccgccg





ccgcgcccgtctccaccgtggacgccgtcatcgacagcgtggtggcCgacgcgcgccggtacgcccgcgccaagagc





cggcggcggcgcatcgcccggcggcaccggagcacccccgccatgcgcgcggcgcgagccttgctgcgcagggcca





ggcgcacgggacgcagggccatgctcagggcggccagacgcgcggcttcaggcgccagcgccggcaggacccgga





gacgcgcggccacggcggcggcagcggccatcgccagcatgtcccgcccgcggcgagggaacgtgtactgggtgcg





cgacgccgccaccggtgtgcgcgtgcccgtgcgcacccgcccccctcgcacttgaagatgttcacttcgcgatgttgatgt





gtcccagcggcgaggaggatgtccaagcgcaaattcaaggaagagatgctccaggtcatcgcgcctgagatctacggc





cctgcggtggtgaaggaggaaagaaagccccgcaaaatcaagcgggtcaaaaaggacaaaaaggaagaagaaag





tgatgtggacggattggtggagtttgtgcgcgagttcgccccccggcggcgcgtgcagtggcgcgggcggaaggtgcaa





ccggtgctgagacccggcaccaccgtggtcttcacgcccggcgagcgctccggcaccgcttccaagcgctcctacgacg





aggtgtacggggatgatgatattctggagcaggcggccgagcgcctgggcgagtttgcttacggcaagcgcagccgttcc





gcaccgaaggaagaggcggtgtccatcccgctggaccacggcaaccccacgccgagcctcaagcccgtgaccttgca





gcaggtgctgccgaccgcggcgccgcgccgggggttcaagcgcgagggcgaggatctgtaccccaccatgcagctga





tggtgcccaagcgccagaagctggaagacgtgctggagaccatgaaggtggacccggacgtgcagcccgaggtcaa





ggtgcggcccatcaagcaggtggccccgggcctgggcgtgcagaccgtggacatcaagattcccacggagcccatgg





aaacgcagaccgagcccatgatcaagcccagcaccagcaccatggaggtgcagacggatccctggatgccatcggct





cctagtcgaagaccccggcgcaagtacggcgcggccagcctgctgatgcccaactacgcgctgcatccttccatcatccc





cacgccgggctaccgcggcacgcgcttctaccgcggtcataccagcagccgccgccgcaagaccaccactcgccgcc





gccgtcgccgcaccgccgctgcaaccacccctgccgccctggtgcggagagtgtaccgccgcggccgcgcacctctga





ccctgccgcgcgcgcgctaccacccgagcatcgccatttaaactttcgccTgctttgcagatcaatggccctcacatgccg





ccttcgcgttcccattacgggctaccgaggaagaaaaccgcgccgtagaaggctggcggggaacgggatgcgtcgcca





ccaccaccggcggcggcgcgccatcagcaagcggttggggggaggcttcctgcccgcgctgatccccatcatcgccgc





ggcgatcggggcgatccccggcattgcttccgtggcggtgcaggcctctcagcgccactgagacacacttggaaacatct





tgtaataaaccAatggactctgacgctcctggtcctgtgatgtgttttcgtagacagatggaagacatcaatttttcgtccctgg





ctccgcgacacggcacgcggccgttcatgggcacctggagcgacatcggcaccagccaactgaacgggggcgccttc





aattggagcagtctctggagcgggcttaagaatttcgggtccacgcttaaaacctatggcagcaaggcgtggaacagcac





cacagggcaggcgctgagggataagctgaaagagcagaacttccagcagaaggtggtcgatgggctcgcctcgggca





tcaacggggtggtggacctggccaaccaggccgtgcagcggcagatcaacagccgcctggacccggtgccgcccgcc





ggctccgtggagatgccgcaggtggaggaggagctgcctcccctggacaagcggggcgagaagcgaccccgccccg





atgcggaggagacgctgctgacgcacacggacgagccgcccccgtacgaggaggcggtgaaactgggtctgcccac





cacgcggcccatcgcgcccctggccaccggggtgctgaaacccgaaaagcccgcgaccctggacttgcctcctcccca





gccttcccgcccctctacagtggctaagcccctgccgccggtggccgtggcccgcgcgcgacccgggggcaccgcccg





ccctcatgcgaactggcagagcactctgaacagcatcgtgggtctgggagtgcagagtgtgaagcgccgccgctgctatt





aaacctaccgtagcgcttaacttgcttgtctgtgtgtgtatgtattatgtcgccgccgccgctgtccaccagaaggaggagtg





aagaggcgcgtcgccgagttgcaagatggccaccccatcgatgctgccccagtgggcgtacatgcacatcgccggaca





ggacgcttcggagtacctgagtccgggtctggtgcagtttgcccgcgccacagacacctacttcagtctggggaacaagttt





aggaaccccacggtggcgcccacgcacgatgtgaccaccgaccgcagccagcggctgacgctgcgcttcgtgcccgt





ggaccgcgaggacaacacctactcgtacaaagtgcgctacacgctggccgtgggcgacaaccgcgtgctggacatgg





ccagcacctactttgacatccgcggcgtgctggatcggggccctagcttcaaaccctactccggcaccgcctacaacagtc





tggcccccaagggagcacccaacacttgtcagtggacatataaagccgatggtgaaactgccacagaaaaaacctata





catatggaaatgcacccgtgcagggcattaacatcacaaaagatggtattcaacttggaactgacaccgatgatcagcca





atctacgcagataaaacctatcagcctgaacctcaagtgggtgatgctgaatggcatgacatcactggtactgatgaaaag





tatggaggcagagctcttaagcctgataccaaaatgaagccttgttatggttcttttgccaagcctactaataaagaaggag





gtcaggcaaatgtgaaaacaggaacaggcactactaaagaatatgacatagacatggctttctttgacaacagaagtgc





ggctgctgctggcctagctccagaaattgttttgtatactgaaaatgtggatttggaaactccagatacccatattgtatacaa





agcaggcacagatgacagcagctcttctattaatttgggtcagcaagccatgcccaacagacctaactacattggtttcag





agacaactttatcgggctcatgtactacaacagcactggcaatatgggggtgctggccggtcaggcttctcagctgaatgct





gtggttgacttgcaagacagaaacaccgagctgtcctaccagctcttgcttgactctctgggtgacagaacccggtatttcag





tatgtggaatcaggcggtggacagctatgatcctgatgtgcgcattattgaaaatcatggtgtggaggatgaacttcccaact





attgtttccctctggatgctgttggcagaacagatacttatcagggaattaaggctaatggaactgatcaaaccacatggacc





aaagatgacagtgtcaatgatgctaatgagataggcaagggtaatccattcgccatggaaatcaacatccaagccaacct





gtggaggaacttcctctacgccaacgtggccctgtacctgcccgactcttacaagtacacgccggccaatgttaccctgcc





caccaacaccaacacctacgattacatgaacggccgggtggtggcgccctcgctggtggactcctacatcaacatcggg





gcgcgctggtcgctggatcccatggacaacgtgaaccccttcaaccaccaccgcaatgcggggctgcgctaccgctcca





tgctcctgggcaacgggcgctacgtgcccttccacatccaggtgccccagaaatttttcgccatcaagagcctcctgctcct





gcccgggtcctacacctacgagtggaacttccgcaaggacgtcaacatgatcctgcagagctccctcggcaacgacctg





cgcacggacggggcctccatctccttcaccagcatcaacctctacgccaccttcttccccatggcgcacaacacggcctcc





acgctcgaggccatgctgcgcaacgacaccaacgaccagtccttcaacgactacctctcggcggccaacatgctctacc





ccatcccggccaacgccaccaacgtgcccatctccatcccctcgcgcaactgggccgccttccgcggctggtccttcacg





cgtctcaagaccaaggagacgccctcgctgggctccgggttcgacccctacttcgtctactcgggctccatcccctacctcg





acggcaccttctacctcaaccacaccttcaagaaggtctccatcaccttcgactcctccgtcagctggcccggcaacgacc





ggctcctgacgcccaacgagttcgaaatcaagcgcaccgtcgacggcgagggctacaacgtggcccagtgcaacatg





accaaggactggttcctggtccagatgctggcccactacaacatcggctaccagggcttctacgtgcccgagggctacaa





ggaccgcatgtactccttcttccgcaacttccagcccatgagccgccaggtggtggacgaggtcaactacaaggactacc





aggccgtcaccctggcctaccagcacaacaactcgggcttcgtcggctacctcgcgcccaccatgcgccagggccagc





cctaccccgccaactacccctacccgctcatcggcaagagcgccgtcaccagcgtcacccagaaaaagttcctctgcga





cagggtcatgtggcgcatccccttctccagcaacttcatgtccatgggcgcgctcaccgacctcggccagaacatgctctat





gccaactccgcccacgcgctagacatgaatttcgaagtcgaccccatggatgagtccacccttctctatgttgtcttcgaagt





cttcgacgtcgtccgagtgcaccagccccaccgcggcgtcatcgaggccgtctacctgcgcacccccttctcggccggta





acgccaccacctaagctcttgcttcttgcaagccatggccgcgggctccggcgagcaggagctcagggccatcatccgc





gacctgggctgcgggccctacttcctgggcaccttcgataagcgcttcccgggattcatggccccgcacaagctggcctgc





gccatcgtcaacacggccggccgcgagaccgggggcgagcactggctggccttcgcctggaacccgcgctcgaacac





ctgctacctcttcgaccccttcgggttctcggacgagcgcctcaagcagatctaccagttcgagtacgagggcctgctgcgc





cgcagcgccctggccaccgaggaccgctgcgtcaccctggaaaagtccacccagaccgtgcagggtccgcgctcggc





cgcctgcgggctcttctgctgcatgttcctgcacgccttcgtgcactggcccgaccgccccatggacaagaaccccaccat





gaacttgctgacgggggtgcccaacggcatgctccagtcgccccaggtggaacccaccctgcgccgcaaccaggagg





cgctctaccgcttcctcaactcccactccgcctactttcgctcccaccgcgcgcgcatcgagaaggccaccgccttcgacc





gcatgaatcaagacatgtaaaccgtgtgtgtatgttaaatgtctttaataaacagcactttcatgttacacatgcatctgagatg





atttatttagaaatcgaaagggttctgccgggtctcggcatggcccgcgggcagggacacgttgcggaactggtacttggc





cagccacttgaactcggggatcagcagtttgggcagcggggtgtcggggaaggagtcggtccacagcttccgcgtcagtt





gcagggcgcccagcaggtcgggcgcggagatcttgaaatcgcagttgggacccgcgttctgcgcgcgggagttgcggt





acacggggttgcagcactggaacaccatcagggccgggtgcttcacgctcgccagcaccgtcgcgtcggtgatgctctcc





acgtcgaggtcctcggcgttggccatcccgaagggggtcatcttgcaggtctgccttcccatggtgggcacgcacccgggc





ttgtggttgcaatcgcagtgcagggggatcagcatcatctgggcctggtcggcgttcatccccgggtacatggccttcatga





aagcctccaattgcctgaacgcctgctgggccttggctccctcggtgaagaagaccccgcaggacttgctagagaactgg





ttggtggcgcacccggcgtcgtgcacgcagcagcgcgcgtcgttgttggccagctgcaccacgctgcgcccccagcggtt





ctgggtgatcttggcccggtcggggttctccttcagcgcgcgctgcccgttctcgctcgccacatccatctcgatcatgtgctcc





ttctggatcatggtggtcccgtgcaggcaccgcagcttgccctcggcctcggtgcacccgtgcagccacagcgcgcaccc





ggtgcactcccagttcttgtgggcgatctgggaatgcgcgtgcacgaagccctgcaggaagcggcccatcatggtggtca





gggtcttgttgctagtgaaggtcagcggaatgccgcggtgctcctcgttgatgtacaggtggcagatgcggcggtacacctc





gccctgctcgggcatcagctggaagttggctttcaggtcggtctccacgcggtagcggtccatcagcatagtcatgatttcca





tacccttctcccaggccgagacgatgggcaggctcatagggttcttcaccatcatcttagcgctagcagccgcggccaggg





ggtcgctctcgtccagggtctcaaagctccgcttgccgtccttctcggtgatccgcaccggggggtagctgaagcccacgg





ccgccagctcctcctcggcctgtctttcgtcctcgctgtcctggctgacgtcctgcaggaccacatgcttggtcttgcggggtttc





ttcttgggcggcagcggcggcggagatgttggagatggcgagggggagcgcgagttctcgctcaccactactatctcttcc





tcttcttggtccgaggccacgcggcggtaggtatgtctcttcgggggcagaggcggaggcgacgggctctcgccgccgcg





acttggcggatggctggcagagccccttccgcgttcgggggtgcgctcccggcggcgctctgactgacttcctccgcggcc





ggccattgtgttctcctagggaggaacaacaagcatggagactcagccatcgccaacctcgccatctgcccccaccgcc





gacgagaagcagcagcagcagaatgaaagcttaaccgccccgccgcccagccccgccacctccgacgcggccgtcc





cagacatgcaagagatggaggaatccatcgagattgacctgggctatgtgacgcccgcggagcacgaggaggagctg





gcagtgcgcttttcacaagaagagatacaccaagaacagccagagcaggaagcagagaatgagcagagtcaggctg





ggctcgagcatgacggcgactacctccacctgagcgggggggaggacgcgctcatcaagcatctggcccggcaggcc





accatcgtcaaggatgcgctgctcgaccgcaccgaggtgcccctcagcgtggaggagctcagccgcgcctacgagttga





acctcttctcgccgcgcgtgccccccaagcgccagcccaatggcacctgcgagcccaacccgcgcctcaacttctaccc





ggtcttcgcggtgcccgaggccctggccacctaccacatctttttcaagaaccaaaagatccccgtctcctgccgcgccaa





ccgcacccgcgccgacgcccttttcaacctgggtcccggcgcccgcctacctgatatcgcctccttggaagaggttcccaa





gatcttcgagggtctgggcagcgacgagactcgggccgcgaacgctctgcaaggagaaggaggagagcatgagcac





cacagcgccctggtcgagttggaaggcgacaacgcgcggctggcggtgctcaaacgcacggtcgagctgacccatttc





gcctacccggctctgaacctgccccccaaagtcatgagcgcggtcatggaccaggtgctcatcaagcgcgcgtcgccca





tctccgaggacgagggcatgcaagactccgaggagggcaagcccgtggtcagcgacgagcagctggcccggtggctg





ggtcctaatgctagtccccagagtttggaagagcggcgcaaactcatgatggccgtggtcctggtgaccgtggagctgga





gtgcctgcgccgcttcttcgccgacgcggagaccctgcgcaaggtcgaggagaacctgcactacctcttcaggcacgggt





tcgtgcgccaggcctgcaagatctccaacgtggagctgaccaacctggtctcctacatgggcatcttgcacgagaaccgc





ctggggcagaacgtgctgcacaccaccctgcgcggggaggcccggcgcgactacatccgcgactgcgtctacctctac





ctctgccacacctggcagacgggcatgggcgtgtggcagcagtgtctggaggagcagaacctgaaagagctctgcaag





ctcctgcagaagaacctcaagggtctgtggaccgggttcgacgagcgcaccaccgcctcggacctggccgacctcatttt





ccccgagcgcctcaggctgacgctgcgcaacggcctgcccgactttatgagccaaagcatgttgcaaaactttcgctctttc





atcctcgaacgctccggaatcctgcccgccacctgctccgcgctgccctcggacttcgtgccgctgaccttccgcgagtgcc





ccccgccgctgtggagccactgctacctgctgcgcctggccaactacctggcctaccactcggacgtgatcgaggacgtc





agcggcgagggcctgctcgagtgccactgccgctgcaacctctgcacgccgcaccgctccctggcctgcaacccccag





ctgctgagcgagacccagatcatcggcaccttcgagttgcaagggcccagcgaaggcgagggttcagccgccaaggg





gggtctgaaactcaccccggggctgtggacctcggcctacttgcgcaagttcgtgcccgaggactaccatcccttcgagat





caggttctacgaggaccaatcccatccgcccaaggccgagctgtcggcctgcgtcatcacccagggggcgatcctggcc





caattgcaagccatccagaaatcccgccaagaattcttgctgaaaaagggccgcggggtctacctcgacccccagaccg





gtgaggagctcaaccccggcttcccccaggatgccccgaggaaacaagaagctgaaagtggagctgccgcccgtgga





ggatttggaggaagactgggagaacagcagtcaggcagaggaggaggagatggaggaagactgggacagcactca





ggcagaggaggacagcctgcaagacagtctggaggaagacgaggaggaggcagaggaggaggtggaagaagca





gccgccgccagaccgtcgtcctcggcgggggagaaagcaagcagcacggataccatctccgctccgggtcggggtcc





cgctcgaccacacagtagatgggacgagaccggacgattcccgaaccccaccacccagaccggtaagaaggagcg





gcagggatacaagtcctggcgggggcacaaaaacgccatcgtctcctgcttgcaggcctgcgggggcaacatctccttc





acccggcgctacctgctcttccaccgcggggtgaactttccccgcaacatcttgcattactaccgtcacctccacagcccct





actacttccaagaagaggcagcagcagcagaaaaagaccagcagaaaaccagcagctagaaaatccacagcggc





ggcagcaggtggactgaggatcgcggcgaacgagccggcgcaaacccgggagctgaggaaccggatctttcccacc





ctctatgccatcttccagcagagtcgggggcaggagcaggaactgaaagtcaagaaccgttctctgcgctcgctcacccg





cagttgtctgtatcacaagagcgaagaccaacttcagcgcactctcgaggacgccgaggctctcttcaacaagtactgcg





cgctcactcttaaagagtagcccgcgcccgcccagtcgcagaaaaaggcgggaattacgtcacctgtgcccttcgcccta





gccgcctccacccatcatcatgagcaaagagattcccacgccttacatgtggagctaccagccccagatgggcctggcc





gccggtgccgcccaggactactccacccgcatgaattggctcagcgccgggcccgcgatgatctcacgggtgaatgaca





tccgcgcccaccgaaaccagatactcctagaacagtcagcgctcaccgccacgccccgcaatcacctcaatccgcgta





attggcccgccgccctggtgtaccaggaaattccccagcccacgaccgtactacttccgcgagacgcccaggccgaagt





ccagctgactaactcaggtgtccagctggcgggcggcgccaccctgtgtcgtcaccgccccgctcagggtataaagcgg





ctggtgatccggggcagaggcacacagctcaacgacgaggtggtgagctcttcgctgggtctgcgacctgacggagtctt





ccaactcgccggatcggggagatcttccttcacgcctcgtcaggccgtcctgactliggagagttcgtcctcgcagccccgc





tcgggtggcatcggcactctccagttcgtggaggagttcactccctcggtctacttcaaccccttctccggctcccccggcca





ctacccggacgagttcatcccgaacttcgacgccatcagcgagtcggtggacggctacgattgaatgtcccatggtggcg





cagctgacctagctcggcttcgacacctggaccactgccgccgcttccgctgcttcgctcgggatctcgccgagtttgcctac





tttgagctgcccgaggagcaccctcagggcccggcccacggagtgcggatcgtcgtcgaagggggcctcgactcccac





ctgcttcggatcttcagccagcgtccgatcctggtcgagcgcgagcaaggacagacccttctgactctgtactgcatctgca





accaccccggcctgcatgaaagtctttgttgtctgctgtgtactgagtataataaaagctgagatcagcgactactccggact





tccgtgtgtTTAAACtcacccccttatccagtgaaataaagatcatattgatgatgattttacagaaataaaaaataatcatt





tgatttgaaataaagatacaatcatattgatgatttgagtttaacaaaaaaataaagaatcacttacttgaaatctgataccag





gtctctgtccatgttttctgccaacaccacttcactcccctcttcccagctctggtactgcaggccccggcgggctgcaaacttc





ctccacacgctgaaggggatgtcaaattcctcctgtccctcaatcttcattttatcttctatcagatgtccaaaaagcgcgtccg





ggtggatgatgacttcgaccccgtctacccctacgatgcagacaacgcaccgaccgtgcccttcatcaacccccccttcgt





ctcttcagatggattccaagagaagcccctgggggtgttgtccctgcgactggccgaccccgtcaccaccaagaacggg





gaaatcaccctcaagctgggagagggggtggacctcgattcctcgggaaaactcatctccaacacggccaccaaggcc





gccgcccctctcagtttttccaacaacaccatttcccttaacatggatcaccccttttacactaaagatggaaaattatccttac





aagtttctccaccattaaatatactgagaacaagcattctaaacacactagctttaggttttggatcaggtttaggactccgtgg





ctctgccttggcagtacagttagtctctccacttacatttgatactgatggaaacataaagcttaccttagacagaggtttgcat





gttacaacaggagatgcaattgaaagcaacataagctgggctaaaggtttaaaatttgaagatggagccatagcaacca





acattggaaatgggttagagtttggaagcagtagtacagaaacaggtgttgatgatgcttacccaatccaagttaaacttgg





atctggccttagctttgacagtacaggagccataatggctggtaacaaagaagacgataaactcactttgtggacaacacc





tgatccatcaccaaactgtcaaatactcgcagaaaatgatgcaaaactaacactttgcttgactaaatgtggtagtcaaata





ctggccactgtgtcagtcttagttgtaggaagtggaaacctaaaccccattactggcaccgtaagcagtgctcaggtgtttct





acglittgatgcaaacggtgttcttttaacagaacattctacactaaaaaaatactgggggtataggcagggagatagcata





gatggcactccatataccaatgctgtaggattcatgcccaatttaaaagcttatccaaagtcacaaagttctactactaaaaa





taatatagtagggcaagtatacatgaatggagatgtttcaaaacctatgcttctcactataaccctcaatggtactgatgaca





gcaacagtacatattcaatgtcattttcatacacctggactaatggaagctatgttggagcaacatttggggctaactcttatac





cttctcatacatcgcccaagaatgaacactgtatcccaccctgcatgccaacccttcccaccccactctgtggaacaaactc





tgaaacacaaaataaaataaagttcaagtgttttattgattcaacagttttacaggattcgagcagttatttttcctccaccctcc





caggacatggaatacaccaccctctccccccgcacagccttgaacatctgaatgccattggtgatggacatgcttttggtctc





cacgttccacacagtttcagagcgagccagtctcgggtcggtcagggagatgaaaccctccgggcactcccgcatctgca





cctcacagctcaacagctgaggattgtcctcggtggtcgggatcacggttatctggaagaagcagaagagcggcggtgg





gaatcatagtccgcgaacgggatcggccggtggtgtcgcatcaggccccgcagcagtcgctgccgccgccgctccgtca





agctgctgctcagggggtccgggtccagggactccctcagcatgatgcccacggccctcagcatcagtcgtctggtgcgg





cgggcgcagcagcgcatgcggatctcgctcaggtcgctgcagtacgtgcaacacagaaccaccaggttgttcaacagtc





catagttcaacacgctccagccgaaactcatcgcgggaaggatgctacccacgtggccgtcgtaccagatcctcaggta





aatcaagtggtgccccctccagaacacgctgcccacgtacatgatctccttgggcatgtggcggttcaccacctcccggta





ccacatcaccctctggttgaacatgcagccccggatgatcctgcggaaccacagggccagcaccgccccgcccgccat





gcagcgaagagaccccgggtcccggcaatggcaatggaggacccaccgctcgtacccgtggatcatctgggagctga





acaagtctatgttggcacagcacaggcatatgctcatgcatctcttcagcactctcaactcctcgggggtcaaaaccatatc





ccagggcacggggaactcttgcaggacagcgaaccccgcagaacagggcaatcctcgcacagaacttacattgtgcat





ggacagggtatcgcaatcaggcagcaccgggtgatcctccaccagagaagcgcgggtctcggtctcctcacagcgtggt





aagggggccggccgatacgggtgatggcgggacgcggctgatcgtgttcgcgaccgtgtcatgatgcagttgctttcgga





cattttcgtacttgctgtagcagaacctggtccgggcgctgcacaccgatcgccggcggcggtctcggcgcttggaacgctc





ggtgttgaaattgtaaaacagccactctctcagaccgtgcagcagatctagggcctcaggagtgatgaagatcccatcatg





cctgatggctctgatcacatcgaccaccgtggaatgggccagacccagccagatgatgcaattttgttgggtttcggtgacg





gcgggggagggaagaacaggaagaaccatgattaacttttaatccaaacggtctcggagtacttcaaaatgaagatcgc





ggagatggcacctctcgcccccgctgtgttggtggaaaataacagccaggtcaaaggtgatacggttctcgagatgttcca





cggtggcttccagcaaagcctccacgcgcacatccagaaacaagacaatagcgaaagcgggagggttctctaattcctc





aatcatcatgttacactcctgcaccatccccagataattttcatttttccagccttgaatgattcgaactagttcCtgaggtaaat





ccaagccagccatgataaagagctcgcgcagagcgccctccaccggcattcttaagcacaccctcataattccaagatat





tctgctcctggttcacctgcagcagattgacaagcggaatatcaaaatctctgccgcgatccctgagctcctccctcagcaat





aactgtaagtactctttcatatcctctccgaaatttttagccataggaccaccaggaataagattagggcaagccacagtac





agataaaccgaagtcctccccagtgagcattgccaaatgcaagactgctataagcatgctggctagacccggtgatatctt





ccagataactggacagaaaatcgcccaggcaatttttaagaaaatcaacaaaagaaaaatcctccaggtggacgtttag





agcctcgggaacaacgatgaagtaaatgcaagcggtgcgttccagcatggttagttagctgatctgtagaaaaaacaaa





aatgaacattaaaccatgctagcctggcgaacaggtgggtaaatcgttctctccagcaccaggcaggccacggggtctcc





ggcgcgaccctcgtaaaaattgtcgctatgattgaaaaccatcacagagagacgttcccggtggccggcgtgaatgattc





gacaagatgaatacacccccggaacattggcgtccgcgagtgaaaaaaagcgcccgaggaagcaataaggcactac





aatgctcagtctcaagtccagcaaagcgatgccatgcggatgaagcacaaaattctcaggtgcgtacaaaatgtaattact





cccctcctgcacaggcagcaaagcccccgatccctccaggtacacatacaaagcctcagcgtccatagcttaccgagca





gcagcacacaacaggcgcaagagtcagagaaaggctgagctctaacctgtccacccgctctctgctcaatatatagccc





agatctacactgacgtaaaggccaaagtctaaaaatacccgccaaataatcacacacgcccagcacacgcccagaaa





ccggtgacacactcaaaaaaatacgcgcacttcctcaaacgcccaaaactgccgtcatttccgggttcccacgctacgtc





atcaaaacacgactttcaaattccgtcgaccgttaaaaacgtcacccgccccgcccctaacggtcgcccgtctctcagcca





atcagcgccccgcatccccaaattcaaacacctcatttgcatattaacgcgcacaaaaagtttgaggtatattattgatgatg





g





Plasmid 1103 ORF (cMSLN)


SEQ ID NO: 5



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctg





Plasmid 1103 Polypeptide (cMSLN)


SEQ ID NO: 6



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEAL





Plasmid 1027 ORF (MUC1)


SEQ ID NO: 7



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctg





Plasmid 1027 Polypeptide (537 aa) (MUC1)


SEQ ID NO: 8



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANL





Plasmid 1112 ORF (TERT240)


SEQ ID NO: 9



atgggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggac






catccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctg





gaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggcca





tgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccg





tccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtg





gatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgg





gaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccgg





agtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtg





caacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctggg





ctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgt





cgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttcca





gctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgc





gctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtc





aatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggc





ccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgt





cgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaa





ctacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacc





tttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatacta





ttccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtg





gtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacat





gaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacg





aagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgca





gtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcg





ctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaa





acctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgt





cgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctgga





cacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcg





gctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaa





gtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttca





gctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaat





cctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagt





ggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactg





cacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgt





cagatttcaagaccatcttggac





Plasmid 1112 Polypeptide (TERT240)


SEQ ID NO: 10



MGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHS






HPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRP





SLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLL





KTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVR





ACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRS





PGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVW





SKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVV





GARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVR





AQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKA





FKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHH





AVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPH





LTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPANGLFPWCGLL





LDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNS





LQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAG





MSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTT





LTALEAAANPALPSDFKTILD





Plasmid 1330 ORF (TERT541)


SEQ ID NO: 11



atggctagcgccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactac






ctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaag





agggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgt





ctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgt





gaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcct





ggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaa





gaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccga





agtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggc





cacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgc





aagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttga





cgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaagg





cagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggtt





gctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgagg





ggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggagg





aaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcag





tccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacat





gcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtg





cacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgt





ggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccgg





aatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcct





gaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaa





actccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1330 Polypeptide (TERT541)


SEQ ID NO: 12



MASAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRV






QLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLT





SRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAIT





GAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYM





RQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIP





QGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVP





EYGCVVNLRKTVVNFPVEDEALGGTAFVQMPANGLFPWCGLLLDTRTLEVQSDYSSY





ARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQA





YRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPS





EAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSD





FKTILD





Plasmid 1326 ORF (TERT343)


SEQ ID NO: 13



atggctagcttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcac






gtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaa





ttgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggc





ggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccg





cctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccg





cctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgcca





agttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtg





ttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggac





Plasmid 1326 Polypeptide (TERT343)


SEQ ID NO: 14



MASFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGN






HAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHS





SPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMS





VRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKN





RLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGL





RPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHR





AWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQ





KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLF





DVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLR





LVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMP





AHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLK





CHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTAS





LCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTA





QTQLSRKLPGTTLTALEAAANPALPSDFKTILD





Plasmid 1197 ORF (cMUC1)


SEQ ID NO: 15



atggctagcacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagca






gcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggca





gcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggac





aggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgccc





ctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagcccc





aggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcct





gcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatca





gctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctg





ctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgc





acaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccaca





gcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctct





gaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctg





cagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgc





aaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccct





ggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagcc





ggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccagg





atggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgc





cggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacatac





cacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcgg





cagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1197 Polypeptide) (cMUC1)


SEQ ID NO: 16



MASTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQG






QDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTA





PPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPG





STAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSAS





GSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPL





TSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQG





GFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSD





VPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTY





HPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANL





Plasmid 1316 ORF


SEQ ID NO: 17



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacct





gctgatccacgacatcgagacaaaccctggccccacccctggaacccagagccccttcttccttctgctgctgctgaccgt





gctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaa





gcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcg





gcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggg





gacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcg





cccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagc





cccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcct





cctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgaca





tcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagac





ctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggt





gcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccacca





cagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccc





tctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacc





tgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcct





gcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgacc





ctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccag





ccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccag





gatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagt





gccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacat





accacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggc





ggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1316 Polypeptide


SEQ ID NO: 18



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGRIFNAHYAGYFADLLIH





DIETNPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKN





AVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALG





STTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRP





APGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPD





TRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTP





TTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPS





TDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQF





NQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIAL





AVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGG





SSLSYTNPAVAAASANL





Plasmid 1313 ORF


SEQ ID NO: 19



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctg





ctgatccacgacatcgagacaaaccctggccccctggctggcgagacaggacaggaagccgctcctctggacggcgtg





ctggccaaccctcccaatatcagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgag





cacagagagagtgcgggaactggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgc





ctggcccacagactgtctgagcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgcctt





cagcggacctcaggcctgcacccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccc





tgagagacagagactgctgcctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggcc





ctgggaggcctggcttgtgatctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtc





ccggccctctggaccaggatcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctag





cacttggagcgtgtccaccatggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccaca





gggcatcgtggccgcctggcggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcc





caggtttcggagagaggtggaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatctt





ctacaagaagtgggagctggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccc





cttcacctatgagcagctggacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatc





cagcacctgggctacctgtttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctga





aggccctgctggaagtgaacaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggca





gaggccagctggacaaggacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgagga





actgagcagcgtgccacctagctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgt





gctgtatcccaaggcccggctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcg





gagcccctaccgaggacctgaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggac





cgacgccgtgctgcctctgacagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaaga





acggcacagacccgtgcgcgactggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcagg





ggggcatccctaatggctacctggtgctggacctgagcatgcaggaagccctg





Plasmid 1313 Polypeptide


SEQ ID NO: 20



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGSGRIFNAHYAGYFADLLIHDIETNPGPLAGETGQEAAPLDGVLAN





PPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPE





DLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGV





RGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQG





GGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERT





ILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYE





QLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHE





MSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDL





DTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATF





MKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQG





GIPNGYLVLDLSMQEAL





Plasmid 1159 ORF


SEQ ID NO: 21



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggatccggcgccaccaatttcagcctgctgaaacaggccggcgacgtgga





agagaaccctggccctctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaa





tatcagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgg





gaactggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtct





gagcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcct





gcacccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgc





tgcctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgt





gatctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccag





gatcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccac





catggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcct





ggcggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagagg





tggaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggag





ctggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagct





ggacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacct





gtttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagt





gaacaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggaca





aggacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgcca





cctagctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggccc





ggctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgagga





cctgaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctct





gacagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtg





cgcgactggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatgg





ctacctggtgctggacctgagcatgcaggaagccctg





Plasmid 1159 Polypeptide


SEQ ID NO: 22



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGSGATNFSLLKQAGDVEENPGPLAGETGQEAAPLDGVLANPPNIS





SLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPEDLDAL





PLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGVRGSLL





SEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQGGGPPY





GPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERTILRPRF





RREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYEQLDVL





KHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHEMSPQ





VATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDLDTCD





PRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATFMKLR





TDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQGGIPNG





YLVLDLSMQEAL





Plasmid 1158 ORF


SEQ ID NO: 23



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcgccaccaatttcagcctgctgaaacaggccggcgacgtg





gaagagaaccctggccctacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacag





gctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagca





gcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaaca





cagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgaca





agcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagc





ctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagc





cccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtg





acaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacact





agacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagc





accgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcac





cagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccct





accacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagc





aaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacag





cagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaag





cagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccggga





aggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctg





accatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgc





tctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaaga





attacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacg





gcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctga





gctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1158 Polypeptide


SEQ ID NO: 24



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGATNFSLLKQAGDVEEN





PGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM





TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANL





Plasmid 1269 ORF


SEQ ID NO: 25



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggaggctccggcggaggagctgccccggagccggagaggacccccgtt





ggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccagg





ccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagca





ccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaa





cacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccgga





gcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctccca





cagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaag





actcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggc





agctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctac





gggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgaga





aatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgatt





gcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaa





atttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaacc





gcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcg





ggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatccca





aagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccg





aacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagct





tcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccgga





actgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatc





atcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggc





gttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccc





tgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttca





tgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgact





ctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggt





ggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaata





cggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaa





atgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagct





atgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttc





ggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaa





gatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgacc





ttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcg





aaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggc





acagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccac





cctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1269 Polypeptide


SEQ ID NO: 26



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGGSGGGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARP





AEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSS





GDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLF





LELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQL





LRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELT





WKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTET





TFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIP





KPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGL





DDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRR





YAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEA





SSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRD





GLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAF





VQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFG





VLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVI





SDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLG





SLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD





Plasmid 1270 ORF


SEQ ID NO: 27



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaa





ctggccggcgacgtggaactgaaccctggccctggagctgccccggagccggagaggacccccgttggccagggatc





gtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaag





aggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcggga





ccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtac





tcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagatt





ggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactg





gcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccct





ctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaag





aggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgc





gcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaa





gtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctg





cgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattgg





ctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttcta





ccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccg





aggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacg





ggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacct





cacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctggga





ctggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtg





aaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgca





gaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgca





cgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcg





gtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacg





cggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccct





ttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcct





gctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggt





caatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcac





atggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggac





gagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagctlitcggagtcctcc





ggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgc





tccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgg





gtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagcc





gcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgac





ctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctc





tggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1270 Polypeptide


SEQ ID NO: 28



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGSGTILSEGATNFSLLKLAGDVELNPGPGAAPEPERTPVGQGSWA





HPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRP





PRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPW





MPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR





EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNE





RRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILA





KFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLREL





SEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKA





LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDT





IPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVA





HLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSIL





STLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGC





VVNLRKTVVNFPVEDEALGGTAFVQMPANGLFPWCGLLLDTRTLEVQSDYSSYARTSI





RASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFH





ACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQ





WLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTIL





D





Plasmid 1271 ORF


SEQ ID NO: 29



atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcacca






ggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagc





gttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgcca





cggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagctt





cggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcac





gtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaa





ttgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggc





ggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccg





cctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccg





cctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgcca





agttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtg





ttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctligttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggacggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaact





ggccggcgacgtggaactgaaccctggccctacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctg





actgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagca





gcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggca





gcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggac





aggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgccc





ctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagcccc





aggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcct





gcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatca





gctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctg





ctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgc





acaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccaca





gcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctct





gaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctlictgtccttccacatcagcaacctg





cagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgc





aaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccct





ggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagcc





ggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccagg





atggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgc





cggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacatac





cacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcgg





cagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1271 Polypeptide


SEQ ID NO: 30



MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR






HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILDGSGTILSEGATNFSLLKLAGDVELNPGPTPGTQSPF





FLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPG





SGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDN





KPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSA





PDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNV





TSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASST





HHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEM





FLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTI





SDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLD





IFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASA





NL





Plasmid 1286 ORF


SEQ ID NO: 31



atggctagcacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagca






gcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggca





gcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggac





aggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgccc





ctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagcccc





aggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcct





gcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatca





gctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctg





ctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgc





acaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccaca





gcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctct





gaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctg





cagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgc





aaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccct





ggclitccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagcc





ggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccagg





atggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgc





cggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacatac





cacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcgg





cagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgagggcg





ccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctggagctgccccggagccggaga





ggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtc





accggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgg





gccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatg





ccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccga





gcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcc





cgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacgga





gtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccag





ggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccc





tggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgc





cgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgt





cagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaaga





aattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactaccttt





caaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagg





gtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctga





gattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaa





agcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcct





gctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccc





tccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcat





cgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtg





agaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagag





acttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttc





ctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcat





tctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgct





cagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtg





ccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgc





atttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgact





actccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcag





aaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaa





catctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaaga





acccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcg





ctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagct





gaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactcccc





ggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1286 Polypeptide


SEQ ID NO: 32



MASTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQG






QDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTA





PPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPG





STAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSAS





GSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPL





TSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQG





GFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSD





VPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTY





HPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANLGSGTIL





SEGATNFSLLKLAGDVELNPGPGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVS





PARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHF





LYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQM





RPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRR





LVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSL





QELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFY





VTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRL





RFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASV





LGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCV





RRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLN





EASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIR





RDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGT





AFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLF





GVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLR





VISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLL





GSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD





Plasmid 1287 ORF


SEQ ID NO: 33



atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcacca






ggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagc





gttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgcca





cggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagctt





cggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcac





gtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaa





ttgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggc





ggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccg





cctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccg





cctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgcca





agttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtg





ttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggacggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaact





ggccggcgacgtggaactgaaccctggccctacaggctctggccacgccagctctacacctggcggcgagaaagaga





caagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgag





cagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcct





ctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacacccc





ctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctct





gccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagac





ccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagca





ccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtga





ccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgcc





agcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcaccc





ccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcac





ccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttcttt





ctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagc





gggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggc





agcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagt





acaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgccc





agtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctg





attgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccacc





ccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgag





aaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1287 Polypeptide


SEQ ID NO: 34



MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR






HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPANGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILDGSGTILSEGATNFSLLKLAGDVELNPGPTGSGHASS





TPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEP





ASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPD





TRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTS





APDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHNGT





SARATTTPASKSTPFSIFSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQL





STGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRP





GSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAG





VPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTH





GRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANL





Plasmid 1272 ORF


SEQ ID NO: 35



atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcacca






ggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagc





gttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgcca





cggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagctt





cggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcac





gtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaa





ttgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggc





ggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccg





cctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccg





cctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgcca





agttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtg





ttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggacggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaaga





gaaccctggccccctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatat





cagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcggga





actggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctga





gcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgc





acccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctg





cctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtga





tctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccagga





tcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccacca





tggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctgg





cggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtg





gaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagct





ggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctg





gacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgt





ttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtga





acaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaag





gacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacct





agctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccgg





ctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacct





gaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctga





cagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcg





cgactggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggcta





cctggtgctggacctgagcatgcaggaagccctg





Plasmid 1272 Polypeptide


SEQ ID NO: 36



MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR






HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILDGSGEGRGSLLTCGDVEENPGPLAGETGQEAAPLD





GVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRL





SEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAAL





ACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAAR





AALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWR





QPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAI





PFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEV





NKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAV





RPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVS





MDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLG





LGLQGGIPNGYLVLDLSMQEAL





Plasmid 1273 ORF


SEQ ID NO: 37



atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcacca






ggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagc





gttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgcca





cggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagctt





cggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcac





gtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaa





ttgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggc





ggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccg





cctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccg





cctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgcca





agttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtg





ttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctttglictccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggacggaggctccggcggactggctggcgagacaggacaggaagccgctcctctg





gacggcgtgctggccaaccctcccaatatcagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtc





cggcctgagcacagagagagtgcgggaactggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcag





ctgcggtgcctggcccacagactgtctgagcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaacc





ccgacgccttcagcggacctcaggcctgcacccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccag





aggcgcccctgagagacagagactgctgcctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgat





gtgcgggccctgggaggcctggcttgtgatctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagact





ggtgtcctgtcccggccctctggaccaggatcagcaggaagctgccagagctgctctgcagggcggaggccctccttatg





gacctcctagcacttggagcgtgtccaccatggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagat





ccatcccacagggcatcgtggccgcctggcggcagagaagctctagagatccctcttggcggcagcccgagcggacaa





tcctgcggcccaggtttcggagagaggtggaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgag





agcctgatcttctacaagaagtgggagctggaagcctgcgtggacgccgctctgctggccacccagatggacagagtga





acgccatccccttcacctatgagcagctggacgtgctgaagcacaagctggatgagctgtacccccagggctaccccga





gagcgtgatccagcacctgggctacctgtttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctg





gaaaccctgaaggccctgctggaagtgaacaagggccacgagatgtccccccaggtggccacactgatcgacagattc





gtgaagggcagaggccagctggacaaggacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgt





cccccgaggaactgagcagcgtgccacctagctctatctgggctgtgcggccccaggacctggatacctgcgatcctaga





cagctggatgtgctgtatcccaaggcccggctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtc





cttcctgggcggagcccctaccgaggacctgaaagctctgagccagcagaacgtgtccatggatctggccacctttatga





agctgcggaccgacgccgtgctgcctctgacagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaa





ggccgaagaacggcacagacccgtgcgcgactggatcctgaggcagagacaggatgacctggacacactgggcctg





ggactgcaggggggcatccctaatggctacctggtgctggacctgagcatgcaggaagccctg





Plasmid 1273 Polypeptide


SEQ ID NO: 38



MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR






HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILDGGSGGLAGETGQEAAPLDGVLANPPNISSLSPRQL





LGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLF





LNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGVRGSLLSEADVR





ALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQGGGPPYGPPSTW





SVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERTILRPRFRREVEK





TACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYEQLDVLKHKLDEL





YPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHEMSPQVATLIDRF





VKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVL





YPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPL





TVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLS





MQEAL





Plasmid 1274 ORF


SEQ ID NO: 39



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcgagggcagaggcagcctgctgacatgtggcgacgtgga





agagaaccctggccccggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccggga





cgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcga





gggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccag





accgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaagg





aacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttcctt





gggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgtt





cctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtca





ctccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatc





cgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcct





ggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaa





acatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcg





tcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtg





gtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtgga





gcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgcca





gcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtc





aacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccct





cttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccacc





gggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccg





gagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtca





ggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccg





acctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcaga





gctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcagggg





aaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatgg





aaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacc





tcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgt





ggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgc





ggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcct





cactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctct





ttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccac





gcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctcc





ctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcga





agcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctc





gctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaaccc





agcattgccgtcagatttcaagaccatcttggac





Plasmid 1274 Polypeptide


SEQ ID NO: 40



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGEGRGSLLTCGDVEEN





PGPGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR





HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILD





Plasmid 1275 ORF


SEQ ID NO: 41



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggaggctccggcggaggagctgccccggagccggagaggacccccg





ttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccag





gccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagc





accacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaa





acacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccgg





agcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctccc





acagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaa





gactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtgg





cagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtcta





cgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgag





aaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgat





tgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggcca





aatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaac





cgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgc





gggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatccc





aaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggcc





gaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggag





cttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccgg





aactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgat





catcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaagg





cgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgccc





ctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttc





atgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcga





ctctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactg





gtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaat





acggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtcca





aatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagc





tatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagctttt





cggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaa





gatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgacc





ttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcg





aaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggc





acagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccac





cctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1275 Polypeptide


SEQ ID NO: 42



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGGSGGGAAPEPERTPVGQ





GSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPS





TSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGS





RPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGV





CAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSR





HNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLRE





EILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQL





RELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSR





VKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGA





YDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQ





FVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQ





GSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPE





YGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYA





RTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAY





RFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSE





AVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDF





KTILD





Plasmid 1317 ORF


SEQ ID NO 43



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctg





ctgatccacgacatcgagacaaaccctggccccctggctggcgagacaggacaggaagccgctcctctggacggcgtg





ctggccaaccctcccaatatcagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgag





cacagagagagtgcgggaactggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgc





ctggcccacagactgtctgagcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgcctt





cagcggacctcaggcctgcacccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccc





tgagagacagagactgctgcctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggcc





ctgggaggcctggcttgtgatctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtc





ccggccctctggaccaggatcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctag





cacttggagcgtgtccaccatggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccaca





gggcatcgtggccgcctggcggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcc





caggtttcggagagaggtggaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatctt





ctacaagaagtgggagctggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccc





cttcacctatgagcagctggacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatc





cagcacctgggctacctgtttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctga





aggccctgctggaagtgaacaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggca





gaggccagctggacaaggacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgagga





actgagcagcgtgccacctagctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgt





gctgtatcccaaggcccggctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcg





gagcccctaccgaggacctgaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggac





cgacgccgtgctgcctctgacagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaaga





acggcacagacccgtgcgcgactggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcagg





ggggcatccctaatggctacctggtgctggacctgagcatgcaggaagccctgggatccggcgagggcagaggcagcc





tgctgacatgtggcgacgtggaagagaaccctggccccggagctgccccggagccggagaggacccccgttggccag





ggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagca





gaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgc





gggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcc





tgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgc





agattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagaga





tactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcact





gccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctcc





ggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttc





gtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatact





aagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcct





ggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctg





cattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgtt





cttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaac





tttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcc





cgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgct





tgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtg





ctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgta





cttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaa





accgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaa





gtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgaga





gatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtc





atcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttg





tgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacg





acttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctg





tgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgcca





gcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgccc





ggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtc





ctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctg





ctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctg





cgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaagg





agccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagag





tgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgacc





gctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1317 Polypeptide


SEQ ID NO: 44



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGSGRIFNAHYAGYFADLLIHDIETNPGPLAGETGQEAAPLDGVLAN





PPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPE





DLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGV





RGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQG





GGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERT





ILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYE





QLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHE





MSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDL





DTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATF





MKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQG





GIPNGYLVLDLSMQEALGSGEGRGSLLTCGDVEENPGPGAAPEPERTPVGQGSWAH





PGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPP





RPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWM





PGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCARE





KPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNER





RFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAK





FLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELS





EAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKAL





FSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTI





PQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAH





LQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILS





TLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCV





VNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIR





ASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHA





CVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQW





LCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD





Plasmid 1318 ORF


SEQ ID NO: 45



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaa





ctggccggcgacgtggaactgaaccctggccctggagctgccccggagccggagaggacccccgttggccagggatc





gtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaag





aggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcggga





ccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtac





tcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagatt





ggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactg





gcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccct





ctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaag





aggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgc





gcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaa





gtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctg





cgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattgg





ctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttcta





ccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccg





aggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacg





ggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacct





cacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctggga





ctggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtg





aaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgca





gaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgca





cgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcg





gtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacg





cggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccct





ttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcct





gctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggt





caatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcac





atggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggac





gagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctcc





ggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgc





tccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgg





gtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagcc





gcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgac





ctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctc





tggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggacggatccggcgagggcagaggcagcc





tgctgacatgtggcgacgtggaagagaaccctggccccctggctggcgagacaggacaggaagccgctcctctggacg





gcgtgctggccaaccctcccaatatcagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggc





ctgagcacagagagagtgcgggaactggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgc





ggtgcctggcccacagactgtctgagcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccga





cgccttcagcggacctcaggcctgcacccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggc





gcccctgagagacagagactgctgcctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgc





gggccctgggaggcctggcttgtgatctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgt





cctgtcccggccctctggaccaggatcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacct





cctagcacttggagcgtgtccaccatggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatc





ccacagggcatcgtggccgcctggcggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctg





cggcccaggtttcggagagaggtggaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcc





tgatcttctacaagaagtgggagctggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgc





catccccttcacctatgagcagctggacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagc





gtgatccagcacctgggctacctgtttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaa





ccctgaaggccctgctggaagtgaacaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtga





agggcagaggccagctggacaaggacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccc





cgaggaactgagcagcgtgccacctagctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacag





ctggatgtgctgtatcccaaggcccggctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttc





ctgggcggagcccctaccgaggacctgaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagct





gcggaccgacgccgtgctgcctctgacagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggc





cgaagaacggcacagacccgtgcgcgactggatcctgaggcagagacaggatgacctggacacactgggcctggga





ctgcaggggggcatccctaatggctacctggtgctggacctgagcatgcaggaagccctg





Plasmid 1318 Polypeptide


SEQ ID NO: 46



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGSGTILSEGATNFSLLKLAGDVELNPGPGAAPEPERTPVGQGSWA





HPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRP





PRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPW





MPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR





EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNE





RRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILA





KFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLREL





SEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKA





LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDT





IPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVA





HLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSIL





STLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGC





VVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSI





RASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFH





ACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQ





WLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTIL





DGSGEGRGSLLTCGDVEENPGPLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPC





AEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDA





FSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGL





ACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVST





MDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACP





SGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQ





GYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKG





RGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPK





ARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVA





EVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQ





EAL





Plasmid 1319 ORF


SEQ ID NO: 47



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacct





gctgatccacgacatcgagacaaaccctggccccacccctggaacccagagccccttcttccttctgctgctgctgaccgt





gctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaa





gcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcg





gcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggg





gacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcg





cccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagc





cccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcct





cctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgaca





tcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagac





ctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggt





gcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccacca





cagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccc





tctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacc





tgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcct





gcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgacc





ctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccag





ccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccag





gatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagt





gccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacat





accacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggc





ggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgaggg





cgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctggagctgccccggagccgga





gaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggt





gtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcgg





tgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgt





atgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagacc





gagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcc





tcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacg





gagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagcccc





agggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgc





cctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagc





gccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagat





gtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaa





gaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactac





ctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaag





agggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgt





ctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgt





gaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcct





ggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaa





gaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccga





agtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggc





cacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgc





aagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttga





cgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaagg





cagcattctgtcgactctcttgtgliccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggtt





gctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgagg





ggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggagg





aaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcag





tccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacat





gcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtg





cacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgt





ggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccgg





aatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcct





gaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaa





actccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1319 Polypeptide


SEQ ID NO: 48



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGRIFNAHYAGYFADLLIH





DIETNPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKN





AVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALG





STTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRP





APGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPD





TRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTP





TTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPS





TDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQF





NQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIAL





AVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGG





SSLSYTNPAVAAASANLGSGTILSEGATNFSLLKLAGDVELNPGPGAAPEPERTPVGQ





GSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPS





TSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGS





RPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGV





CAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSR





HNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLRE





EILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQL





RELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSR





VKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGA





YDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQ





FVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQ





GSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPE





YGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYA





RTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAY





RFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSE





AVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDF





KTILD





Plasmid 1320 ORF


SEQ ID NO: 49



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcgagggcagaggcagcctgctgacatgtggcgacgtgga





agagaaccctggccccggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccggga





cgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcga





gggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccag





accgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaagg





aacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttcctt





gggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgtt





cctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtca





ctccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatc





cgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcct





ggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaa





acatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcg





tcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtg





gtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtgga





gcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgcca





gcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtc





aacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccct





cttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccacc





gggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccg





gagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtca





ggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccg





acctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcaga





gctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcagggg





aaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatgg





aaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacc





tcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgt





ggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgc





ggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcct





cactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctct





ttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccac





gcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctcc





ctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcga





agcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctc





gctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaaccc





agcattgccgtcagatttcaagaccatcttggacggatccggcacaatcctgtctgagggcgccaccaacttcagcctgct





gaaactggccggcgacgtggaactgaaccctggccctacccctggaacccagagccccttcttccttctgctgctgctgac





cgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccaga





gaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggca





gcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacct





ggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgacca





gcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagacc





agccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgc





tcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtg





acatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgatacca





gacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacac





tggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagcca





ccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgcc





ccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagca





acctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgtt





cctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctg





accctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgcc





agccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgcc





aggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgcca





gtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccac





ataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacg





gcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1320 Polypeptide


SEQ ID NO: 50



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGEGRGSLLTCGDVEEN





PGPGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR





HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILDGSGTILSEGATNFSLLKLAGDVELNPGPTPGTQSPF





FLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPG





SGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDN





KPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSA





PDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNV





TSASGSASGSASTLVHNGTSARATTTPASKSTPFSIFSHHSDTPTTLASHSTKTDASST





HHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEM





FLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTI





SDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLD





IFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASA





NL





Plasmid 1321 ORF


SEQ ID NO: 51



atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcacca






ggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagc





gttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgcca





cggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagctt





cggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcac





gtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaa





ttgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggc





ggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccg





cctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccg





cctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgcca





agttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtg





ttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctligttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctlictgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggacggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaaga





gaaccctggccccctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatat





cagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcggga





actggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctga





gcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgc





acccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctg





cctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtga





tctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccagga





tcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccacca





tggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctgg





cggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtg





gaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagct





ggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctg





gacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgt





ttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtga





acaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaag





gacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacct





agctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccgg





ctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacct





gaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctga





cagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcg





cgactggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggcta





cctggtgctggacctgagcatgcaggaagccctgggatccggcagaatcttcaacgcccactacgccggctacttcgccg





acctgctgatccacgacatcgagacaaaccctggccccacccctggaacccagagccccttcttccttctgctgctgctga





ccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccaga





gaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggca





gcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacct





ggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgacca





gcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagacc





agccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgc





tcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtg





acatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgatacca





gacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacac





tggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagcca





ccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgcc





ccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagca





acctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgtt





cctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctg





accctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgcc





agccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgcc





aggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgcca





gtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccac





ataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacg





gcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1321 Polypeptide


SEQ ID NO: 52



MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR






HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILDGSGEGRGSLLTCGDVEENPGPLAGETGQEAAPLD





GVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRL





SEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAAL





ACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAAR





AALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWR





QPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAI





PFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEV





NKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAV





RPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVS





MDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLG





LGLQGGIPNGYLVLDLSMQEALGSGRIFNAHYAGYFADLLIHDIETNPGPTPGTQSPFFL





LLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSG





SSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKP





APGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPD





TRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSA





SGSASGSASTLVHNGTSARATTTPASKSTPFSIFSHHSDTPTTLASHSTKTDASSTHHS





SVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQI





YKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDV





SVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPA





RDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANL





Plasmid 1322 ORF


SEQ ID NO: 53



atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcacca






ggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagc





gttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgcca





cggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagctt





cggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcac





gtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaa





ttgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggc





ggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccg





cctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccg





cctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgcca





agttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtg





ttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggacggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaact





ggccggcgacgtggaactgaaccctggccctacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctg





actgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagca





gcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggca





gcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggac





aggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgccc





ctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagcccc





aggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcct





gcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatca





gctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctg





ctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgc





acaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccaca





gcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctct





gaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctg





cagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgc





aaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccct





ggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagcc





ggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccagg





atggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgc





cggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacatac





cacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcgg





cagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccact





acgccggctacttcgccgacctgctgatccacgacatcgagacaaaccctggccccctggctggcgagacaggacagg





aagccgctcctctggacggcgtgctggccaaccctcccaatatcagcagcctgagccccagacagctgctgggattccct





tgtgccgaggtgtccggcctgagcacagagagagtgcgggaactggctgtggccctggcccagaaaaacgtgaagctg





agcaccgagcagctgcggtgcctggcccacagactgtctgagcctcccgaggatctggacgccctgcctctggatctgct





gctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccggttcttcagcagaatcaccaaggccaacgtgg





acctgctgcccagaggcgcccctgagagacagagactgctgcctgctgctctggcctgttggggagtgcggggctctctg





ctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcctggaagattcgtggccgagagcgccgaagtg





ctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagcaggaagctgccagagctgctctgcagggcgg





aggccctccttatggacctcctagcacttggagcgtgtccaccatggatgccctgaggggcctgctgccagtgctgggcca





gcctatcatcagatccatcccacagggcatcgtggccgcctggcggcagagaagctctagagatccctcttggcggcagc





ccgagcggacaatcctgcggcccaggtttcggagagaggtggaaaagaccgcctgcccctctggcaagaaggccaga





gagatcgacgagagcctgatcttctacaagaagtgggagctggaagcctgcgtggacgccgctctgctggccacccaga





tggacagagtgaacgccatccccttcacctatgagcagctggacgtgctgaagcacaagctggatgagctgtaccccca





gggctaccccgagagcgtgatccagcacctgggctacctgtttctgaagatgagccccgaggacatccggaagtggaac





gtgaccagcctggaaaccctgaaggccctgctggaagtgaacaagggccacgagatgtccccccaggtggccacact





gatcgacagattcgtgaagggcagaggccagctggacaaggacaccctggatacactgaccgccttctaccccggctat





ctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagctctatctgggctgtgcggccccaggacctggatac





ctgcgatcctagacagctggatgtgctgtatcccaaggcccggctggccttccagaacatgaacggcagcgagtacttcgt





gaagatccagtccttcctgggcggagcccctaccgaggacctgaaagctctgagccagcagaacgtgtccatggatctg





gccacctttatgaagctgcggaccgacgccgtgctgcctctgacagtggctgaggtgcagaaactgctgggcccccatgt





ggaagggctgaaggccgaagaacggcacagacccgtgcgcgactggatcctgaggcagagacaggatgacctgga





cacactgggcctgggactgcaggggggcatccctaatggctacctggtgctggacctgagcatgcaggaagccctg





Plasmid 1322 Polypeptide


SEQ ID NO: 54



MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTR






HSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSL





RPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGV





LLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF





VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLR





RSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSV





WSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYV





VGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRV





RAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRK





AFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCH





HAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTP





HLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCG





LLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQV





NSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKN





AGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP





GTTLTALEAAANPALPSDFKTILDGSGTILSEGATNFSLLKLAGDVELNPGPTPGTQSPF





FLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPG





SGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDN





KPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSA





PDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNV





TSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASST





HHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEM





FLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTI





SDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLD





IFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASA





NLGSGRIFNAHYAGYFADLLIHDIETNPGPLAGETGQEAAPLDGVLANPPNISSLSPRQL





LGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLF





LNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGVRGSLLSEADVR





ALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQGGGPPYGPPSTW





SVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERTILRPRFRREVEK





TACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYEQLDVLKHKLDEL





YPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHEMSPQVATLIDRF





VKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVL





YPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPL





TVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLS





MQEAL





Plasmid 1351 ORF


SEQ ID NO: 55



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctg





ctgatccacgacatcgagacaaaccctggccccctggctggcgagacaggacaggaagccgctcctctggacggcgtg





ctggccaaccctcccaatatcagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgag





cacagagagagtgcgggaactggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgc





ctggcccacagactgtctgagcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgcctt





cagcggacctcaggcctgcacccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccc





tgagagacagagactgctgcctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggcc





ctgggaggcctggcttgtgatctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtc





ccggccctctggaccaggatcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctag





cacttggagcgtgtccaccatggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccaca





gggcatcgtggccgcctggcggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcc





caggtttcggagagaggtggaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatctt





ctacaagaagtgggagctggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccc





cttcacctatgagcagctggacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatc





cagcacctgggctacctgtttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctga





aggccctgctggaagtgaacaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggca





gaggccagctggacaaggacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgagga





actgagcagcgtgccacctagctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgt





gctgtatcccaaggcccggctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcg





gagcccctaccgaggacctgaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggac





cgacgccgtgctgcctctgacagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaaga





acggcacagacccgtgcgcgactggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcagg





ggggcatccctaatggctacctggtgctggacctgagcatgcaggaagccctgggatccggcgagggcagaggcagcc





tgctgacatgtggcgacgtggaagagaaccctggccccgccaaatttctgcattggctgatgtcagtgtacgtggtcgagct





gctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctg





cagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgg





gaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatgga





ttacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtg





ctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggc





ggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatg





atactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtac





gccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagc





cttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcct





gaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcata





cgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaag





ctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacg





ccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaattt





ccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgct





gctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttca





atcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgat





ctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcg





tgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgtta





ctcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggt





gcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcg





cactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcatt





gccgtcagatttcaagaccatcttggac





Plasmid 1351 Polypeptide


SEQ ID NO: 56



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGSGRIFNAHYAGYFADLLIHDIETNPGPLAGETGQEAAPLDGVLAN





PPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPE





DLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGV





RGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQG





GGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERT





ILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYE





QLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHE





MSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDL





DTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATF





MKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQG





GIPNGYLVLDLSMQEALGSGEGRGSLLTCGDVEENPGPAKFLHWLMSVYVVELLRSFF





YVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSR





LRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGAS





VLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYC





VRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSL





NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGI





RRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGG





TAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKL





FGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFL





RVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPL





LGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD





Plasmid 1352 ORF


SEQ ID NO: 57



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacct





gctgatccacgacatcgagacaaaccctggccccacccctggaacccagagccccttcttccttctgctgctgctgaccgt





gctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaa





gcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcg





gcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggg





gacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcg





cccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagc





cccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcct





cctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgaca





tcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagac





ctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggt





gcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccacca





cagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccc





tctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacc





tgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcct





gcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgacc





ctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccag





ccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccag





gatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagt





gccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacat





accacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggc





ggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgaggg





cgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctgccaaatttctgcattggctgat





gtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgc





aaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggc





agaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggct





gaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcac





gggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactg





gacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaag





gtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaa





cacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtg





tccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtgg





tcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggt





gcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgct





acggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctg





gtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaat





ctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatgg





cctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagc





atccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggctt





aaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaa





gcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcatt





agcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgg





gacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacg





tcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaa





gccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1352 Polypeptide


SEQ ID NO: 58



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGRIFNAHYAGYFADLLIH





DIETNPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKN





AVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALG





STTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRP





APGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPD





TRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIFSHHSDTP





TTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPS





TDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQF





NQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIAL





AVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSRYEKVSAGNGG





SSLSYTNPAVAAASANLGSGTILSEGATNFSLLKLAGDVELNPGPAKFLHWLMSVYVVE





LLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARP





ALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRP





GLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIK





PQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVI





EQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDME





NKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPV





EDEALGGTAFVQMPANGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAG





RNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVW





KNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRH





RVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD





Plasmid 1353 ORF


SEQ ID NO: 59



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcgagggcagaggcagcctgctgacatgtggcgacgtgga





agagaaccctggccccgccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcact





gagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagc





atctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctca





cgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacct





ttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaag





acggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgaga





gcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgact





caccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcg





catggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcg





catttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtc





tgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatccc





acaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacggg





acgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctg





gtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactc





ggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaa





gtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacg





aaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagac





cgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaac





aggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaac





gccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggcttt





cctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtcta





gaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttg





gacggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaac





cctggccctacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca





cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctg





Plasmid 1353 Polypeptide


SEQ ID NO: 60



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGEGRGSLLTCGDVEEN





PGPAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQ





LRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTS





RVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITG





AYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMR





QFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIP





QGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVP





EYGCVVNLRKTVVNFPVEDEALGGTAFVQMPANGLFPWCGLLLDTRTLEVQSDYSSY





ARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQA





YRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPS





EAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSD





FKTILDGSGTILSEGATNFSLLKLAGDVELNPGPTPGTQSPFFLLLLLTVLTVVTGSGHA





SSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPAT





EPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSA





PDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGV





TSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHN





GTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSP





QLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKF





RPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSG





AGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTY





HTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANL





Plasmid 1354 ORF


SEQ ID NO: 61



atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggcca






cgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgaga





agaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccag





gatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccag





tgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctgg





aagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgc





acacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctc





ccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcc





cggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcc





cgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccaga





gccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggc





cagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcac





aagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaag





atcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggctt





cctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatc





aacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgat





gtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctc





gtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagc





tggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgc





cacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatc





ctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctg





ctgatccacgacatcgagacaaaccctggccccctggctggcgagacaggacaggaagccgctcctctggacggcgtg





ctggccaaccctcccaatatcagcagcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgag





cacagagagagtgcgggaactggctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgc





ctggcccacagactgtctgagcctcccgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgcctt





cagcggacctcaggcctgcacccggttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccc





tgagagacagagactgctgcctgctgctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggcc





ctgggaggcctggcttgtgatctgcctggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtc





ccggccctctggaccaggatcagcaggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctag





cacttggagcgtgtccaccatggatgccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccaca





gggcatcgtggccgcctggcggcagagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcc





caggtttcggagagaggtggaaaagaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatctt





ctacaagaagtgggagctggaagcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccc





cttcacctatgagcagctggacgtgctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatc





cagcacctgggctacctgtttctgaagatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctga





aggccctgctggaagtgaacaagggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggca





gaggccagctggacaaggacaccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgagga





actgagcagcgtgccacctagctctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgt





gctgtatcccaaggcccggctggccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcg





gagcccctaccgaggacctgaaagctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggac





cgacgccgtgctgcctctgacagtggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaaga





acggcacagacccgtgcgcgactggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcagg





ggggcatccctaatggctacctggtgctggacctgagcatgcaggaagccctgggatccggcgagggcagaggcagcc





tgctgacatgtggcgacgtggaagagaaccctggccccagcttcctcctgtcgtcgctcagaccgagcctgaccggagca





cgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacag





agatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagact





cactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcag





ctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgg





gttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaa





tactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgc





gcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatt





tctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgc





ctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgg





gaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaa





agcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccga





acgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagctt





cggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaa





ctgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatca





tcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgt





tcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctg





agagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatg





tgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactct





cttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtgg





acgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacg





gctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaat





gccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctat





gcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcg





gagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaag





atcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgacctt





ctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcga





aaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcac





agagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccct





gaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac





Plasmid 1354 Polypeptide


SEQ ID NO: 62



MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSM






TSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTP





PAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGS





TAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAL





GSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLAS





HSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYY





QELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYK





TEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQ





CRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLS





YTNPAVAAASANLGSGRIFNAHYAGYFADLLIHDIETNPGPLAGETGQEAAPLDGVLAN





PPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCLAHRLSEPPE





DLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRLLPAALACWGV





RGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQEAARAALQG





GGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRDPSWRQPERT





ILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQMDRVNAIPFTYE





QLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLETLKALLEVNKGHE





MSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSLSPEELSSVPPSSIWAVRPQDL





DTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGGAPTEDLKALSQQNVSMDLATF





MKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVRDWILRQRQDDLDTLGLGLQG





GIPNGYLVLDLSMQEALGSGEGRGSLLTCGDVEENPGPSFLLSSLRPSLTGARRLVETI





FLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTP





AAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGL





WGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEH





RLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVVVSKLQSIGIRQHLK





RVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAE





RLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVK





VAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQ





PYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQC





QGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLV





RGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSD





YSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKIL





LLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAG





PLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPA





LPSDFKTILD





Plasmid 1355 ORF


SEQ ID NO: 63



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacct





gctgatccacgacatcgagacaaaccctggccccacccctggaacccagagccccttcttccttctgctgctgctgaccgt





gctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaa





gcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcg





gcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggg





gacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcg





cccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagc





cccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcct





cctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgaca





tcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagac





ctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggt





gcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccacca





cagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccc





tctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacc





tgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcct





gcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgacc





ctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccag





ccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccag





gatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagt





gccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacat





accacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggc





ggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgaggg





cgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctagcttcctcctgtcgtcgctcag





accgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggc





gcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgt





acggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagc





cccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcct





cgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacg





agcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtgga





agatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgaga





gaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgaga





ctacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctg





aagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtc





gcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcg





ccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacg





gcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcc





caagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcac





cgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcat





ggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcat





ttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgt





ttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccaca





aggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacg





ggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtg





aggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcgg





aggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtg





cagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaa





catgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgt





gtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacag





gtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgc





cggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcc





tcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctaga





aaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttgga





C





Plasmid 1355 Polypeptide


SEQ ID NO: 64



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGRIFNAHYAGYFADLLIH





DIETNPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKN





AVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALG





STTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRP





APGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPD





TRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTP





TTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPS





TDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQF





NQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIAL





AVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGG





SSLSYTNPAVAAASANLGSGTILSEGATNFSLLKLAGDVELNPGPSFLLSSLRPSLTGA





RRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCP





LRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRR





LVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGC





VPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSI





GIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFR





REKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPP





ELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVS





TLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGK





SYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTF





LRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLE





VQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCT





NIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAK





GAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEA





AANPALPSDFKTILD





Plasmid 1356 ORF


SEQ ID NO: 65



atggctagcctggctggcgagacaggacaggaagccgctcctctggacggcgtgctggccaaccctcccaatatcagc






agcctgagccccagacagctgctgggattcccttgtgccgaggtgtccggcctgagcacagagagagtgcgggaactgg





ctgtggccctggcccagaaaaacgtgaagctgagcaccgagcagctgcggtgcctggcccacagactgtctgagcctcc





cgaggatctggacgccctgcctctggatctgctgctgttcctgaaccccgacgccttcagcggacctcaggcctgcacccg





gttcttcagcagaatcaccaaggccaacgtggacctgctgcccagaggcgcccctgagagacagagactgctgcctgct





gctctggcctgttggggagtgcggggctctctgctgtctgaagctgatgtgcgggccctgggaggcctggcttgtgatctgcc





tggaagattcgtggccgagagcgccgaagtgctgctgcctagactggtgtcctgtcccggccctctggaccaggatcagc





aggaagctgccagagctgctctgcagggcggaggccctccttatggacctcctagcacttggagcgtgtccaccatggat





gccctgaggggcctgctgccagtgctgggccagcctatcatcagatccatcccacagggcatcgtggccgcctggcggc





agagaagctctagagatccctcttggcggcagcccgagcggacaatcctgcggcccaggtttcggagagaggtggaaa





agaccgcctgcccctctggcaagaaggccagagagatcgacgagagcctgatcttctacaagaagtgggagctggaa





gcctgcgtggacgccgctctgctggccacccagatggacagagtgaacgccatccccttcacctatgagcagctggacgt





gctgaagcacaagctggatgagctgtacccccagggctaccccgagagcgtgatccagcacctgggctacctgtttctga





agatgagccccgaggacatccggaagtggaacgtgaccagcctggaaaccctgaaggccctgctggaagtgaacaa





gggccacgagatgtccccccaggtggccacactgatcgacagattcgtgaagggcagaggccagctggacaaggaca





ccctggatacactgaccgccttctaccccggctatctgtgcagcctgtcccccgaggaactgagcagcgtgccacctagct





ctatctgggctgtgcggccccaggacctggatacctgcgatcctagacagctggatgtgctgtatcccaaggcccggctgg





ccttccagaacatgaacggcagcgagtacttcgtgaagatccagtccttcctgggcggagcccctaccgaggacctgaa





agctctgagccagcagaacgtgtccatggatctggccacctttatgaagctgcggaccgacgccgtgctgcctctgacagt





ggctgaggtgcagaaactgctgggcccccatgtggaagggctgaaggccgaagaacggcacagacccgtgcgcgac





tggatcctgaggcagagacaggatgacctggacacactgggcctgggactgcaggggggcatccctaatggctacctg





gtgctggacctgagcatgcaggaagccctgggatccggcgagggcagaggcagcctgctgacatgtggcgacgtgga





agagaaccctggccccagcttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatctt





ccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctc





tgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcgg





tcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccg





atccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgcc





gcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttgg





aaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgg





gcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgta





cgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgt





ggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtcc





gccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctat





cgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaagg





ccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatcc





accgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatca





ccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcg





tcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcac





cgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagca





gagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcagg





ggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatat





ggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgc





acctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaa





ctgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatg





gtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgcca





gcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattc





gctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggtt





ccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactg





cctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcct





agcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgct





gggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgc





caacccagcattgccgtcagatttcaagaccatcttggacggatccggcacaatcctgtctgagggcgccaccaacttcag





cctgctgaaactggccggcgacgtggaactgaaccctggccctacccctggaacccagagccccttcttccttctgctgctg





ctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacc





cagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcct





ggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgc





cacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgt





gaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagatacc





agaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctc





tactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatg





gcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctga





taccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctc





tacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatcccta





gccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagc





gtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacat





cagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcg





agatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtg





cagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgag





gccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcag





gcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccg





tgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagta





ccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccgg





caacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg





Plasmid 1356 Polypeptide


SEQ ID NO: 66



MASLAGETGQEAAPLDGVLANPPNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQ






KNVKLSTEQLRCLAHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDL





LPRGAPERQRLLPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLV





SCPGPLDQDQQEAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGI





VAAWRQRSSRDPSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEA





CVDAALLATQMDRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDI





RKWNVTSLETLKALLEVNKGHEMSPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLC





SLSPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFL





GGAPTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRP





VRDWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSMQEALGSGEGRGSLLTCGDVEEN





PGPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYVVQMRPLFLELLG





NHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQH





SSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKM





SVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQK





NRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDG





LRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHR





AWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQ





KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLF





DVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLR





LVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMP





AHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLK





CHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTAS





LCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTA





QTQLSRKLPGTTLTALEAAANPALPSDFKTILDGSGTILSEGATNFSLLKLAGDVELNPG





PTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTS





SVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPA





HDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTA





PPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALG





STAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASH





STKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQ





ELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKT





EAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQC





RRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSY





TNPAVAAASANL





2A PEPTIDES


The amino acid sequence of the 2A peptides set forth in SEQ ID NOs: 67-74 includes


a glycine-serine-glycine (GSG) linker encoded by the nucleic acid sequence


(SEQ ID NOs: 67-74)



GGATCCGGC.






Encephalomyocarditis Virus (EMCV) 2A Nucleotide sequence:


SEQ ID NO: 67



ggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctgctgatccacgacatcgagacaaaccctg






gcccc





Encephalomyocarditis Virus (EMCV) 2A Amino acid sequence:


SEQ ID NO: 68



GSGRIFNAHYAGYFADLLIHDIETNPGP






Thosea Asigna Virus (TAV) 2A Nucleotide sequence:


SEQ ID NO: 69



ggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaagagaaccctggcccc






Thosea Asigna Virus (TAV) 2A Amino acid sequence:


SEQ ID NO: 70



GSGEGRGSLLTCGDVEENPGP






Equine Rhinitis B Virus (ERBV) 2A Nucleotide sequence:


SEQ ID NO: 71



ggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctg






gccct





Equine Rhinitis B Virus (ERBV) 2A Amino acid sequence:


SEQ ID NO: 72



GSGTILSEGATNFSLLKLAGDVELNPGP






Porcine teschovirus (PTV) 2A Nucleotide sequence:


SEQ ID NO: 73



ggatccggcgccaccaatttcagcctgctgaaacaggccggcgacgtggaagagaaccctggccct






Porcine teschovirus (PTV) 2A Amino acid sequence:


SEQ ID NO: 74



GSGATNFSLLKQAGDVEENPGP























SEQ


Primer
SEQUENCE (5′ TO 3′)
Strand
ID NO


















EMCV_cMSLN_F-
GAGACAAACCCTGGCCCCCTGGCTGGCGAGACAGGAC
Sense
75


33
AGGAAG







EMCV_Muc1_R-
GTTGAAGATTCTGCCGGATCCCAGGTTGGCGGAGGCA
Antisense
76


35
GCGGCCACG







EMCV2A_F-34
GCTACTTCGCCGACCTGCTGATCCACGACATCGAGACA
Sense
77



AACCCTGGC







EMCV2A_R-36
GGTCGGCGAAGTAGCCGGCGTAGTGGGCGTTGAAGAT
Antisense
78



TCTGCCGGAT







f MSLN 1028-
TTCTGAAGATGAGCCCCGAGGACA
Sense
79


1051








f Muc 960-983
CGGCGTCTCATTCTTCTTTCTGTC
Sense
80





f pmed Nhe
ACCCTGTGACGAACATGGCTAGCCTGGCTGGCGAGAC
Sense
81


cMSLN
AGGACAGGA







f pmed Nhe
ACCCTGTGACGAACATGGCTAGCACAGGCTCTGGCCAC
Sense
82


cytMuc
GCCAG







f pmed Nhe Muc
ACCCTGTGACGAACATGGCTAGCACCCCTGGAACCCAG
Sense
83



AGCC







f pmed Nhe
ACCCTGTGACGAACATGGCTAGCGGAGCTGCCCCGGA
Sense
84


Ter240
GCCGG







f tert 1584-1607
TCTCACCGACCTCCAGCCTTACAT
Sense
85





f tert ink cMSLN
ACGGAGGCTCCGGCGGACTGGCTGGCGAGACAGGACA
Sense
86





f tg link Ter240
TGGGAGGCTCCGGCGGAGGAGCTGCCCCGGAGCCGG
Sense
87





f1 EM2A Muc
CCTGCTGATCCACGACATCGAGACAAACCCTGGCCCCA
Sense
88



CCCCTGGAACCCAGAGCC







f1 ERBV2A cMuc
TGGCCGGCGACGTGGAACTGAACCCTGGCCCTACAGG
Sense
89



CTCTGGCCACGCCAG







f1 ERBV2A Muc
TGGCCGGCGACGTGGAACTGAACCCTGGCCCTACCCCT
Sense
90



GGAACCCAGAGCC







f1 ERBV2A Ter
TGGCCGGCGACGTGGAACTGAACCCTGGCCCTAGCTTC
Sense
91


d342
CTCCTGTCGTCGCTCA







f1 ERBV2A Ter240
TGGCCGGCGACGTGGAACTGAACCCTGGCCCTGGAGC
Sense
92



TGCCCCGGAGCCGG







f1 ERBV2A Tert
TGGCCGGCGACGTGGAACTGAACCCTGGCCCTGCCAA
Sense
93


d541
ATTTCTGCATTGGCTGATG







f1 PTV2A cMSLN
TGGAAGAGAACCCTGGCCCTCTGGCTGGCGAGACAGG
Sense
94



ACAGGA







f1 PTV2A Muc
TGGAAGAGAACCCTGGCCCTACCCCTGGAACCCAGAGC
Sense
95



C







f1 T2A cMSLN
GCGACGTGGAAGAGAACCCTGGCCCCCTGGCTGGCGA
Sense
96



GACAGGACAGGA







f1 T2A Tert d342
GCGACGTGGAAGAGAACCCTGGCCCCAGCTTCCTCCTG
Sense
97



TCGTCGCTCA







f1 T2A Tert d541
GCGACGTGGAAGAGAACCCTGGCCCCGCCAAATTTCTG
Sense
98



CATTGGCTGATG







f1 T2A Tert240
GCGACGTGGAAGAGAACCCTGGCCCCGGAGCTGCCCC
Sense
99



GGAGCCGG







f2 EMCV2A
AGAATCTTCAACGCCCACTACGCCGGCTACTTCGCCGA
Sense
100



CCTGCTGATCCACGACATCGA







f2 ERBV2A
TGTCTGAGGGCGCCACCAACTTCAGCCTGCTGAAACTG
Sense
101



GCCGGCGACGTGGAACTG







f2 PTV2A
TTCAGCCTGCTGAAACAGGCCGGCGACGTGGAAGAGA
Sense
102



ACCCTGGCCCT







f2 T2A
CCGGCGAGGGCAGAGGCAGCCTGCTGACATGTGGCGA
Sense
103



CGTGGAAGAGAACCCTG







pMED_cMSLN_R-
GGGCCCAGATCTTCACAGGGCTTCCTGCATGCTCAGGT
Antisense
104


37
CCAGCAC







pMED_MUC1_F-
ACGAACATGGCTAGCACCCCTGGAACCCAGAGCCCCTT
Sense
105


31
C







r EM2A Bamh
GTGGGCGTTGAAGATTCTGCCGGATCCCAGGGCTTCCT
Antisense
106


cMSLN
GCATGCTCAGGT







r ERB2A Bamh
TGGTGGCGCCCTCAGACAGGATTGTGCCGGATCCCAG
Antisense
107


Muc
GTTGGCGGAGGCAGCG







r ERB2A Bamh
TGGTGGCGCCCTCAGACAGGATTGTGCCGGATCCGTCC
Antisense
108


Ter240
AAGATGGTCTTGAAATCTGA







r link cMSLN
TCCGCCGGAGCCTCCCAGGGCTTCCTGCATGCTCAGGT
Antisense
109





r link muc
TCCGCCGGAGCCTCCCAGGTTGGCGGAGGCAGCG
Antisense
110





r link Tert240
TCCGCCGGAGCCTCCGTCCAAGATGGTCTTGAAATCTG
Antisense
111



A







r MSLN 1051-
TGTCCTCGGGGCTCATCTT
Antisense
112


1033








r muc 986-963
AAGGACAGAAAGAAGAATGAGACG
Antisense
113





r pmed Bgl
TTGTTTTGTTAGGGCCCAGATCTTCACAGGGCTTCCTGC
Antisense
114


cMSLN
ATGCTCAGG







r pmed Bgl Muc
TTGTTTTGTTAGGGCCCAGATCTTCACAGGTTGGCGGA
Antisense
115



GGCAGCG







r pmed Bgl
TTGTTTTGTTAGGGCCCAGATCTTCAGTCCAAGATGGTC
Antisense
116


Ter240
TTGAAATCTGA







r PTV2A Bamh
CTGTTTCAGCAGGCTGAAATTGGTGGCGCCGGATCCCA
Antisense
117


cMSLN
GGGCTTCCTGCATGCTCAGGT







r PTV2A Bamh
CTGTTTCAGCAGGCTGAAATTGGTGGCGCCGGATCCCA
Antisense
118


Muc
GGTTGGCGGAGGCAGCG







r T2A Bamh
TGCCTCTGCCCTCGCCGGATCCCAGGGCTTCCTGCATGC
Antisense
119


cMSLN
TCAGGT







r T2A Tert240
TGCCTCTGCCCTCGCCGGATCCGTCCAAGATGGTCTTGA
Antisense
120



AATCTGA







r tert 1602-1579
AGGCTGGAGGTCGGTGAGAGTGGA
Antisense
121





r2 T2A
AGGGTTCTCTTCCACGTCGCCACATGTCAGCAGGCTGC
Antisense
122



CTCTGCCCTCGCCGGATCC







TertΔ343-F
ACGAACATGGCTAGCTTCCTCCTGTCGTCGCTCAGACC
Sense
123



GAG







Tert-R
TTGTTTTGTTAGGGCCCAGATCTTCAGTCCAAGATGGTC
Antisense
124



TTGAAATC







TertΔ541-F
ACGAACATGGCTAGCGCCAAATTTCTGCATTGGCTGAT
Sense
125



GTC







r TERT co# pMed
TTGTTTTGTTAGGGCCCAGATCTTCAGTCCAAGATGGTC
Antisense
126



TTGAAATC







f pmed TERT
ACCCTGTGACGAACATGGGAGCTGCCCCGGAGCCGGA
Sense
127


241G
GA







MSLN34
CAACAAGCTAGCCTGGCTGGCGAGACAGGACA
Sense
128





MSLN598
CAACAAAGATCTTTACAGGGCTTCCTGCATGCACAG
Antisense
129





ID1197F
ACCCTGTGACGAACATGGCTAGC
Sense
130





ID1197R
AGATCTGGGCCCTAACA
Antisense
131








Claims
  • 1-20. (canceled)
  • 21. An antigen construct, which comprises a nucleotide sequence encoding an immunogenic TERT polypeptide of SEQ ID NO:3, wherein about 200 to about 600 amino acids of the N-terminus of the sequence of SEQ ID NO:3 are absent.
  • 22. The antigen construct of claim 21, where in the immunogenic TERT polypeptide comprises amino acids 501-1132 of SEQ ID NO:3.
  • 23. The antigen construct of claim 21, where in the immunogenic TERT polypeptide comprises amino acids 200-1132 of SEQ ID NO:3.
  • 24. The antigen construct of claim 21, wherein the immunogenic TERT polypeptide is selected from the group consisting of: (1) a polypeptide comprising the amino acid sequence of SEQ ID NO:10 or amino acids 20892 of SEQ ID NO:10;(2) a polypeptide comprising the amino acid sequence of SEQ ID NO:12 or amino acids 4-591 of SEQ ID NO:12; and(3) a polypeptide comprising the amino acid sequence of SEQ ID NO:14 or amino acids 3-789 of SEQ ID NO:14.
  • 25. The antigen construct of claim 21, wherein the nucleotide sequence encoding the immunogenic TERT polypeptide is selected from the group consisting of: (1) the nucleotide sequence of SEQ ID NO:9;(2) a) the nucleotide sequence of SEQ ID NO:11;(3) the nucleotide sequence of SEQ ID NO:13; and(4) a degenerate variant of the nucleotide sequence of SEQ ID NO:9, SEQ ID NO:11, or SEQ ID NO:13.
  • 26. The antigen construct of claim 21, which is a DNA.
  • 27. The antigen construct of claim 21, which is an RNA
  • 28. A vector, comprising the antigen construct of claim 21.
  • 29. The vector of claim 28, where the vector is a plasmid vector.
  • 30. The vector of claim 28, wherein the vector is a viral vector.
  • 31. A composition, comprising the antigen construct of claim 21.
  • 32. The composition of claim 31, where in the antigen construct is an RNA.
  • 33. The composition of claim 31, further comprising a pharmaceutically acceptable carrier.
  • 34. The composition of claim 33, wherein where in the antigen construct is an RNA.
  • 35. A method of treating cancer in a patient, comprising administering to the patient an effective amount of the composition of claim 33.
  • 36. The method of claim 35, wherein the cancer over-expresses tumor-associated antigen TERT.
  • 37. The method of claim 35, wherein the cancer is breast cancer, pancreatic cancer, and ovarian cancer.
  • 38. The method of claim 35, further comprising administering to the patient an immune modulator.
  • 39. The method of claim 38, wherein the immune modulator is a PD-1 inhibitor or PD-L1 inhibitor.
  • 40. The method of claim 38, wherein the immune modulator RN888.
REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Provisional Application No. 62/280,636 filed Jan. 19, 2016 and U.S. Provisional Application No. 62/419,190 filed Nov. 8, 2016. The entire content of each of the foregoing applications is incorporated herein by reference.

Provisional Applications (2)
Number Date Country
62419190 Nov 2016 US
62280636 Jan 2016 US
Divisions (2)
Number Date Country
Parent 16252239 Jan 2019 US
Child 17319395 US
Parent 15407890 Jan 2017 US
Child 16252239 US