Tuberculosis Compositions And Methods Of Treating Or Preventing Tuberculosis

Information

  • Patent Application
  • 20230365631
  • Publication Number
    20230365631
  • Date Filed
    May 12, 2023
    a year ago
  • Date Published
    November 16, 2023
    6 months ago
Abstract
The present disclosure provides fusion proteins comprising Mycobacterium tuberculosis (Mtb) antigens, nucleic acid molecules encoding the same, vectors comprising nucleic acid molecules, compositions comprising the same, and methods of eliciting an immune response against tuberculosis.
Description
REFERENCE TO SEQUENCE LISTING

This application includes a Sequence Listing filed electronically as an XML file named 901513170SEQ, created on May 12, 2023, with a size of 170 kilobytes. The Sequence Listing is incorporated herein by reference.


FIELD

The present disclosure is directed, in part, to fusion proteins comprising Mycobacterium tuberculosis (Mtb) antigens, nucleic acid molecules encoding the same, vectors comprising nucleic acid molecules, compositions comprising the same, and methods of eliciting an immune response against tuberculosis.


BACKGROUND

Tuberculosis (TB) is a global health problem resulting in 8 million new cases and 2 million deaths each year. The emergence of multi-drug and totally-drug resistant strains of TB only makes this problem more severe. The life cycle of Mtb has 3 stages. In the acute phase following initial infection the bacteria replicate in the host and virulence factors are expressed, leading to the generation of an immune response by the host. As the immune response begins to control the infection, the Mtb enters a latent, asymptomatic state in which the bacteria become non-replicating and are encased in granulomas. The bacterium can persist in this latent state in infected individuals for many years, making diagnosis and treatment of disease difficult. In some cases, the bacteria are reactivated and begin replicating again, leading back to the disease state. Reactivation can occur for numerous reasons, including immune suppression caused by diseases such as HIV, treatments such as chemotherapy, or the weakening of the immune system due to aging. An estimated 2 billion people are latently infected with Mtb worldwide, and reactivation of latent Mtb accounts for most new cases of active TB disease. Reactivation is associated with inflammation, necrosis and cavitation of the lung, a process that results in draining of the lesions into the bronchus. Aerosols generated when individuals with bronchial lesions cough causes dissemination of the Mtb organism to uninfected, susceptible persons, and the transmission cycle is thus maintained.


The only currently available vaccine against TB, Mycobacterium bovis (Bacille Calmette-Guerin) (BCG), was first introduced in 1921. BCG has been widely utilized and while studies show that for some purposes BCG is effective (e.g. against disseminated TB), it is known to be ineffective with respect to preventing the development, persistence and reactivation of latent TB. There is an ongoing need to develop improved, more effective vaccines against TB. In particular, there is a need to develop vaccines that provide protection against the development, maintenance and/or reactivation of latent tuberculosis infection. With the availability of the entire genomic sequence of Mtb, and the tools for bioinformatic and experimental analysis of Mtb antigens, many new potential Mtb vaccine candidates have been identified in recent years. These include antigens that are involved in acute infection, maintenance of latency, or reactivation of Mtb. There are a range of delivery strategies in clinical development that are comprised of combinations of these and other antigens that have been tested in animal models and are currently or will soon be in clinical trials.


While vaccines are often effective to immunize individuals prophylactically or therapeutically against pathogen infection or human diseases, there is a need for improved vaccines. There is also a need for compositions and methods that produce an enhanced immune response. Likewise, while some immunotherapeutics are useful to modulate immune response in a patient, there remains a need for improved immunotherapeutic compositions and methods.


SUMMARY

This disclosure describes antigen cassettes (and specified variants) that can be used to create tuberculosis vaccines comprising specified Mycobacterium tuberculosis (Mtb) antigens. The disclosure also describes the strategic combination of antigens which are incorporated into a variety of delivery platforms in such a way as to provide pathways to a matrix of matched combinations of antigen delivery to obtain an optimized immune response. The subject matter described herein can be used as a prophylactic or therapeutic TB vaccine. The initial selection of antigens for inclusion into a usable cassette was based on a number of parameters including, for example, a thorough review of the literature, expression data, responses by human T cells, inclusion of human immunogenic regions, mouse protection studies, and conservation in sequence across most strains of TB with full genome sequences. Specific antigens were then probed to be sure they were able to be expressed in a variety of systems (BCG, protein, viral vectors, nucleic acids), that they were immunogenic, and they could be made as fusions in proteins or other vectors to simplify downstream manufacturing concerns. All of the selected antigens were then shown to be immunogenic in mice, either when used alone, or in a variety of combinations, to arrive at the present application.


The constructs described herein have been integrated into a specified range of delivery platforms that include the following classes (but not exhaustive) of representative delivery platforms: 1) viral vector delivery systems, 2) recombinant BCG, 3) recombinant purified protein fusions, 4) DNA plasmid vector systems, and 5) RNA vector systems. These delivery platforms can be used either in a single platform alone or in combinations as matched antigen prime-boost approaches. In addition, the use of these antigens in a single rBCG vector system, which is envisioned to be used as an antigen matched prime for a boost with any of the modalities above, including protein, viral vectors, nucleic acids, or others.


The present disclosure provides fusion proteins that comprise at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides nucleic acid molecules encoding fusion proteins that comprise at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides: compositions comprising the fusion proteins and a pharmaceutically acceptable carrier; vectors encoding the fusion proteins; compositions comprising the vectors and a pharmaceutically acceptable carrier; cells comprising the vectors; compositions comprising the cells and a pharmaceutically acceptable carrier.


The present disclosure also provides compositions that comprise at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides compositions that comprise at least two or three Mycobacterium tuberculosis (Mtb) antigens, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides methods of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of one or more fusion proteins comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides methods of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of a composition comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides methods of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of a composition comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides fusion proteins for use in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides fusion proteins for use in treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides uses of a fusion protein in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides uses of a fusion protein in treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides compositions for use in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides composition for use in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides uses of a composition in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides uses of a composition in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides compositions for use in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier; wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides compositions for use in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides uses of a composition in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides uses of a composition in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides fusion proteins, compositions, cells, vectors, methods, and uses, as described herein, substantially as described with reference to the accompanying examples and/or figures.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows mouse immunogenicity data for a recombinant protein form of the Ag85B, Ag85A, Rv3407 construct and a recombinant BCG encoding the same.



FIG. 2 shows mouse efficacy data for a recombinant protein form of the Ag85B, Ag85A, Rv3407 construct and a recombinant BCG encoding the same.



FIG. 3 shows mouse immunogenicity data for an Ad5 encoding the Ag85B, Ag85A, Rv3407 antigens and a with a recombinant BCG.



FIG. 4 shows mouse immunogenicity data for several Mtb antigens generated using DNA vaccine constructs.





DESCRIPTION OF EMBODIMENTS

The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.


As used herein, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise.


For recitation of numeric ranges herein, each intervening number there between with the same degree of precision is explicitly contemplated. For example, for the range of 6-9, the numbers 7 and 8 are contemplated in addition to 6 and 9, and for the range 6.0-7.0, the numbers 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, and 7.0 are explicitly contemplated.


As used herein, “adjuvant” means any molecule added to any composition described herein to enhance the immunogenicity of the Mtb antigens.


As used herein, “coding sequence” or “encoding nucleic acid” means the nucleic acids (RNA or DNA molecule) that comprise a nucleotide sequence which encodes an Mtb antigen. The coding sequence can further include initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of an individual or mammal to which the nucleic acid is administered.


As used herein, “consensus” or “consensus sequence” means a polypeptide sequence based on analysis of an alignment of multiple subtypes of a particular Mtb antigen. Nucleic acid sequences that encode a consensus polypeptide sequence can be prepared. Vaccines comprising Mtb antigens that comprise consensus sequences and/or nucleic acid molecules that encode such antigens can be used to induce broad immunity against multiple subtypes or serotypes of a particular antigen.


As used herein, “electroporation” means the use of a transmembrane electric field pulse to induce microscopic pathways (pores) in a bio-membrane; their presence allows biomolecules such as plasmids, oligonucleotides, siRNA, drugs, ions, and water to pass from one side of the cellular membrane to the other.


As used herein, “fragment” with respect to nucleic acid sequences, means a nucleic acid sequence or a portion thereof, that encodes a portion of an Mtb antigen capable of eliciting an immune response in a mammal that cross reacts with a full length wild type Mtb antigen. The fragments can be DNA fragments selected from at least one of the various nucleotide sequences that encode protein fragments set forth below.


As used herein, “fragment” or “immunogenic fragment” with respect to polypeptide sequences, means a portion of an MTB antigen capable of eliciting an immune response in a mammal that cross reacts with a full length wild type strain Mtb antigen. Fragments of consensus or wild type Mtb antigens can comprise at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% or at least 95% of a consensus or wild type Mtb antigen. In some embodiments, fragments of consensus proteins can comprise at least 20 amino acids or more, at least 30 amino acids or more, at least 40 amino acids or more, at least 50 amino acids or more, at least 60 amino acids or more, at least 70 amino acids or more, at least 80 amino acids or more, at least 90 amino acids or more, at least 100 amino acids or more, at least 110 amino acids or more, at least 120 amino acids or more, at least 130 amino acids or more, at least 140 amino acids or more, at least 150 amino acids or more, at least 160 amino acids or more, at least 170 amino acids or more, at least 180 amino acids or more of a consensus or wild type protein.


As used herein, “genetic construct” refers to the DNA or RNA molecules that comprise a nucleotide sequence which encodes an Mtb antigen. The coding sequence includes initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of the individual to whom the nucleic acid molecule is administered.


As used herein, “expressible form” refers to gene constructs that contain the necessary regulatory elements operable linked to a coding sequence that encodes an Mtb antigen such that when present in the cell of the individual, the coding sequence will be expressed.


As used herein, “homology” refers to a degree of complementarity. There can be partial homology or complete homology (i.e., identity). A partially complementary sequence that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid is referred to using the functional term “substantially homologous.” When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic clone, the term “substantially homologous” refers to a probe that can hybridize to a strand of the double-stranded nucleic acid sequence under conditions of low stringency. When used in reference to a single-stranded nucleic acid sequence, the term “substantially homologous” refers to a probe that can hybridize to (i.e., is the complement of) the single-stranded nucleic acid template sequence under conditions of low stringency.


As used herein, “identical” or “identity” in the context of two or more nucleic acids or polypeptide sequences, means that the sequences have a specified percentage of residues that are the same over a specified region. The percentage can be calculated by optimally aligning the two sequences, comparing the two sequences over the specified region, determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the specified region, and multiplying the result by 100 to yield the percentage of sequence identity. In cases where the two sequences are of different lengths or the alignment produces one or more staggered ends and the specified region of comparison includes only a single sequence, the residues of single sequence are included in the denominator but not the numerator of the calculation. When comparing DNA and RNA, thymine (T) and uracil (U) residues can be considered equivalent. Identity can be performed manually or by using a computer sequence algorithm such as BLAST or BLAST 2.0.


As used herein, “immune response” means the activation of a host's immune system, e.g., that of a mammal, in response to the introduction of an Mtb antigen. The immune response can be in the form of a cellular or humoral response, or both.


As used herein, “Mtb antigen” means an antigen from Mycobacterium tuberculosis, which may be an isolated antigen, or an antigen that forms part of a fusion protein with other antigen(s).


As used herein, “nucleic acid” or “oligonucleotide” or “polynucleotide” means at least two nucleotides covalently linked together. The depiction of a single strand also defines the sequence of the complementary strand. Thus, a nucleic acid also encompasses the complementary strand of a depicted single strand. Many variants of a nucleic acid can be used for the same purpose as a given nucleic acid. Thus, a nucleic acid also encompasses substantially identical nucleic acids and complements thereof. A single strand provides a probe that can hybridize to a target sequence under stringent hybridization conditions. Thus, a nucleic acid also encompasses a probe that hybridizes under stringent hybridization conditions. Nucleic acids can be single stranded or double stranded, or can contain portions of both double stranded and single stranded sequence. The nucleic acid can be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid can contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine and isoguanine. Nucleic acids can be obtained by chemical synthesis methods or by recombinant methods.


As used herein, “operably linked” means that expression of a gene is under the control of a promoter with which it is spatially connected. A promoter can be positioned 5′ (upstream) or 3′ (downstream) of a gene under its control. The distance between the promoter and a gene can be approximately the same as the distance between that promoter and the gene it controls in the gene from which the promoter is derived. As is known in the art, variation in this distance can be accommodated without loss of promoter function.


As used herein, “promoter” means a synthetic or naturally-derived molecule which is capable of conferring, activating or enhancing expression of a nucleic acid in a cell. A promoter can comprise one or more specific transcriptional regulatory sequences to further enhance expression and/or to alter the spatial expression and/or temporal expression of same. A promoter can also comprise distal enhancer or repressor elements, which can be located as much as several thousand base pairs from the start site of transcription. A promoter can be derived from sources including viral, bacterial, fungal, plants, insects, and animals. A promoter can regulate the expression of a gene component constitutively, or differentially with respect to cell, the tissue or organ in which expression occurs or, with respect to the developmental stage at which expression occurs, or in response to external stimuli such as physiological stresses, pathogens, metal ions, or inducing agents.


As used herein, “signal peptide” and “leader sequence”, used interchangeably, refer to an amino acid sequence that can be linked at the amino terminus of an Mtb antigenic protein set forth herein. Signal peptides/leader sequences typically direct localization of a protein. Signal peptides/leader sequences used herein can facilitate secretion of the protein from the cell in which it is produced or anchor it in the membrane. Signal peptides/leader sequences are often cleaved from the remainder of the protein, often referred to as the mature protein, upon secretion from the cell. Signal peptides/leader sequences are linked at the N terminus of the protein.


As used herein, “stringent hybridization conditions” means conditions under which a first nucleic acid sequence (e.g., probe) will hybridize to a second nucleic acid sequence (e.g., target), such as in a complex mixture of nucleic acids. Stringent conditions are sequence-dependent and will be different in different circumstances. Stringent conditions can be selected to be about 5 to 10° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The Tm can be the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are occupied at equilibrium). Stringent conditions can be those in which the salt concentration is less than about 1.0 M sodium ion, such as about 0.01 to 1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., about 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than about 50 nucleotides). Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. For selective or specific hybridization, a positive signal can be at least 2 to 10 times background hybridization. Exemplary stringent hybridization conditions include the following: 50% formamide, 5×SSC, and 1% SDS, incubating at 42° C., or, 5×SSC, 1% SDS, incubating at 65° C., with wash in 0.2×SSC, and 0.1% SDS at 65° C.


As used herein, “substantially complementary” means that a first sequence is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identical to the complement of a second sequence over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 180, 270, 360, 450, 540, or more nucleotides or amino acids, or that the two sequences hybridize under stringent hybridization conditions.


As used herein, “substantially identical” means that a first and second sequence are at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identical over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 180, 270, 360, 450, 540 or more nucleotides or amino acids, or with respect to nucleic acids, if the first sequence is substantially complementary to the complement of the second sequence.


As used herein, “variant” with respect to a nucleic acid means: i) a portion or fragment of a referenced nucleotide sequence; ii) the complement of a referenced nucleotide sequence or portion thereof, iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof, or iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequences substantially identical thereto.


As used herein, “variant” with respect to a peptide or polypeptide means that it differs in amino acid sequence by the insertion, deletion, or conservative substitution of amino acids, but retains at least one biological activity. Variant can also mean a protein with an amino acid sequence that is substantially identical to a referenced protein with an amino acid sequence that retains at least one biological activity. A conservative substitution of an amino acid, i.e., replacing an amino acid with a different amino acid of similar properties (e.g., hydrophilicity, degree and distribution of charged regions) is recognized in the art as typically involving a minor change. Amino acid substitutions that are compatible with biological function are understood to depend on the relative similarity of the amino acids, and particularly the side chains of those amino acids, as revealed by the hydrophobicity, hydrophilicity, charge, size, and other properties.


As used herein, “vector” means a nucleic acid sequence containing an origin of replication. A vector can be a viral vector, bacteriophage, bacterial artificial chromosome or yeast artificial chromosome. A vector can be a DNA or RNA vector. A vector can be a self-replicating extrachromosomal vector.


The present disclosure provides fusion proteins comprising at least two Mycobacterium tuberculosis (Mtb) antigens. In some embodiments, the fusion protein comprises at least three Mycobacterium tuberculosis (Mtb) antigens. In some embodiments, the fusion protein comprises at least four Mycobacterium tuberculosis (Mtb) antigens. In some embodiments, the fusion protein comprises at least five Mycobacterium tuberculosis (Mtb) antigens.


In some embodiments, the nucleic acid molecule encoding any particular Mtb antigen can be a mycobacterial sequence, a bacterial codon optimized sequence (such as an E. coli optimized sequence), or a mammalian optimized sequence (such as a human optimized sequence). The E. coli optimized sequences can be used, for example, to produce fusion proteins. The human optimized sequences can be used in, for example, viral vectors. Methods of codon optimization (whether for bacterial or mammalian) are well known to the skilled artisan.


In some embodiments, the Mtb antigen is Rv1009 (also known as RpfB), Rv3136 (also known as PPE51), Rv3615c (also known as EspC), Rv2628, Rv2034, Rv3136 N-terminus, Ag85A, Ag85B (also known as Rv1886c), Rv3407, Rv1733, Rv2626c, RpfA, RpfC, or RpfD.


A nucleotide sequence encoding Rv1009 (including the signal sequence) is shown in Table 1 as SEQ ID NO: 1, and an amino acid sequence of Rv1009 (including the signal sequence) is shown in Table 1 as SEQ ID NO:2. A nucleotide sequence encoding Rv1009 (without the signal sequence) is shown in Table 1 as SEQ ID NO:3, and an amino acid sequence of Rv1009 (without the signal sequence) is shown in Table 1 as SEQ ID NO:4.


A nucleotide sequence encoding Rv3136 is shown in Table 1 as SEQ ID NO:5 (mycobacterial sequence; not codon optimized), SEQ ID NO:6 (E. coli optimized), and as SEQ ID NO:7 (human optimized), and an amino acid sequence of Rv3136 is shown in Table 1 as SEQ ID NO:8.


A nucleotide sequence encoding Rv3615c is shown in Table 1 as SEQ ID NO:9 (mycobacterial sequence; not codon optimized) and as SEQ ID NO:10 (human optimized), and an amino acid sequence of Rv3615c is shown in Table 1 as SEQ ID NO: 11.


A nucleotide sequence encoding Rv2628 is shown in Table 1 as SEQ ID NO: 12 (mycobacterial sequence; not codon optimized) and as SEQ ID NO:13 (human optimized), and an amino acid sequence of Rv2628 is shown in Table 1 as SEQ ID NO: 14.


A nucleotide sequence encoding Rv2034 is shown in Table 1 as SEQ ID NO: 15, and an amino acid sequence of Rv2034 is shown in Table 1 as SEQ ID NO:16.


A nucleotide sequence encoding Rv3136 N-terminus (Rv3136Nt) is shown in Table 1 as SEQ ID NO:17, and an amino acid sequence of Rv3136 N-terminus (Rv3136Nt) is shown in Table 1 as SEQ ID NO: 18.


A nucleotide sequence encoding Ag85A is shown in Table 1 as SEQ ID NO:19, and an amino acid sequence of Ag85A is shown in Table 1 as SEQ ID NO:20.


A nucleotide sequence encoding Ag85B is shown in Table 1 as SEQ ID NO:21 (mycobacterial sequence; not codon optimized), SEQ ID NO:22 (E. coli optimized), and SEQ ID NO:23 (human optimized), and an amino acid sequence of Ag85B is shown in Table 1 as SEQ ID NO:24 (mycobacterial sequence) and SEQ ID NO:25 (E. coli and human optimized).


A nucleotide sequence encoding Rv3407 is shown in Table 1 as SEQ ID NO:26, and an amino acid sequence of Rv3407 is shown in Table 1 as SEQ ID NO:27.


A nucleotide sequence encoding Rv1733 is shown in Table 1 as SEQ ID NO:28, and an amino acid sequence of Rv1733 is shown in Table 1 as SEQ ID NO:29.


A nucleotide sequence encoding Rv2626c is shown in Table 1 as SEQ ID NO:30, and an amino acid sequence of Rv2626c is shown in Table 1 as SEQ ID NO:31.


A nucleotide sequence encoding RpfA is shown in Table 1 as SEQ ID NO:32, and an amino acid sequence of RpfA is shown in Table 1 as SEQ ID NO:33.


A nucleotide sequence encoding RpfC is shown in Table 1 as SEQ ID NO:34, and an amino acid sequence of RpfC is shown in Table 1 as SEQ ID NO:35.


A nucleotide sequence encoding RpfD is shown in Table 1 as SEQ ID NO:36, and an amino acid sequence of RpfD is shown in Table 1 as SEQ ID NO:37.


The amino acid and nucleotide sequences shown in Table 1 are derived from H37Rv.










TABLE 1






nucleotide sequence


Construct
amino acid sequence







Rv1009
atgttgcgcctggtagtcggtgcgctgctgctggtgttggcgttcgccggtggctatgcggtcgccgcat



gcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcgggtgatcga



catcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgtgcaggtc



catgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcacgacgcta



agcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgacacggcgcc



ggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgccaagacg



gtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggctgctgag



tgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgtcgaaggc



atgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccgaacgcgc



gtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccggggaccca



ggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaacgtcgtg



gtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccggtgatcg



acggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccggcaacgg



gtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgcaccccgc



gctgacctgtatgtgctgcacgagcgggtgcgcgctga 



(SEQ ID NO: 1)






MLRLVVGALLLVLAFAGGYAVAACKTVTLTVDGTAMRVTTMKSRV



IDIVEENGFSVDDRDDLYPAAGVQVHDADTIVLRRSRPLQISLDGHDA



KQVWTTASTVDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSA



KTVQLNDGGLVRTVHLPAPNVAGLLSAAGVPLLQSDHVVPAATAPI



VEGMQIQVTRNRIKKVTERLPLPPNARRVEDPEMNMSREVVEDPGVP



GTQDVTFAVAEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEVP



PVIDGSIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANGGL



RYAPRADLATREEQIAVAEVTRLRQGWGAWPVCAARAGAR 



(SEQ ID NO: 2)





Rv1009
gcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcgggtga


No signal
tcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgtgca


sequence
ggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcacgac



gctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgacacgg



cgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgccaa



gacggtgcagctcaacgacggcgggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggctg



ctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgtcg



aaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccgaa



cgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccgggg



acccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaacg



tcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccggt



gatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccggc



aacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgcac



ccccgcctggccggtatgtgctgcacgagcgggtgcgcgctga



(SEQ ID NO: 3)






ACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAGV



QVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMTD



TAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNVA



GLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLPP



NARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGAR 



(SEQ ID NO: 4)





Rv3136
atggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggccctggggcaggatcgc



tgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacagccgaggcatatggatc



ggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgatggcggtgacggccgct



ccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatccaagccagggcggcag



cgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggccaaccggatacagct



gctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggccaccgaggcacagtac



gccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcagcggctgcggccctgc



tgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgctcaggccgccgcggtcag



ccaggccaccgacccactgtcgctgctgattgagacggtgacccaagcgctgcaagcgctgacgattccg



agcttcatccctgaggacttcaccttccttgacgccatattcgctggatatgccacggtaggtgtgacgc



aggatgtcgagtcctttgttgccgggaccatcggggccgagagcaacctaggccttttgaacgtcggcga



cgagaatcccgcggaggtgacaccgggcgactttgggatcggcgagttggtttccgcgaccagtcccggc



ggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgtcggcaacacggtgctcgcgagtgtcggcc



gggcaaactcgattgggcaactatcggtcccaccgagctgggccgcgccctcgacgcgccctgtctcggc



attgtcgcccgccggcctgaccacactcccggggaccgacgtggccgagcacgggatgccaggtgtaccg



ggggtgccagtggcagcagggcgagcctccggcgtcctacctcgatacggggttcggctcacggtgatgg



cccacccacccgcggcagggtaa 



(SEQ ID NO: 5)






atggattttgcgctgctgccgccggaagtgaacagcgcgcgcatgtataccggcccgggcgcgggcagcc



tgctggcggcggcgggcggctgggatagcctggcggcggaactggcgaccaccgcggaagcgtatggcag



cgtgctgagcggcctggcggcgctgcattggcgcggcccggcggcggaaagcatggcggtgaccgcggcg



ccgtatattggctggctgtataccaccgcggaaaaaacccagcagaccgcgattcaggcgcgcgcggcgg



cgctggcgtttgaacaggcgtatgcgatgaccctgccgccgccggtggtggcggcgaaccgcattcagct



gctggcgctgattgcgaccaacttttttggccagaacaccgcggcgattgcggcgaccgaagcgcagtat



gcggaaatgtgggcgcaggatgcggcggcgatgtatggctatgcgaccgcgagcgcggcggcggcgctgc



tgaccccgtttagcccgccgcgccagaccaccaacccggcgggcctgaccgcgcaggcggcggcggtgag



ccaggcgaccgatccgctgagcctgctgattgaaaccgtgacccaggcgctgcaggcgctgaccattccg



agctttattccggaagattttacctttctggatgcgatttttgcgggctatgcgaccgtgggcgtgaccc



aggatgtggaaagctttgtggcgggcaccattggcgcggaaagcaacctgggcctgctgaacgtgggcga



tgaaaacccggcggaagtgaccccgggcgattttggcattggcgaactggtgagcgcgaccagcccgggc



ggcggcgtgagcgcgagcggcgcgggcggcgcggcgagcgtgggcaacaccgtgctggcgagcgtgggcc



gcgcgaacagcattggccagctgagcgtgccgccgagctgggcggcgccgagcacccgcccggtgagcgc



gctgagcccggcgggcctgaccaccctgccgggcaccgatgtggcggaacatggcatgccgggcgtgccg



ggcgtgccggtggcgggggccgcgcgagcggcgtgctgccgcgctatggcgtgcgcctgaccgtgatggc



gcatccgccggcggcgggcgaattt 



(SEQ ID NO: 6)






atggatttcgctctgctcccccccgaggtgaatagcgctaggatgtacacaggacccggagctggaagcc



tcctggctgctgctggaggatgggactccctggctgccgagctcgctacaaccgctgaggcttacggaag



cgtgctctccggcctggctgctctccattggagaggccctgctgccgagtccatggctgtcacagccgct



ccctacattggatggctgtacaccaccgccgagaagacccagcaaaccgctattcaggccagagctgccg



ccctggccttcgaacaggcctacgctatgacactccccccccctgtcgtggctgccaataggatccagct



cctggccctcatcgccaccaacttcttcggccaaaacaccgctgccatcgctgccaccgaagcccagtac



gccgaaatgtgggcccaggatgccgctgctatgtacggctatgccacagctagcgctgccgctgctctgc



tcacacccttcagcccccccaggcaaacaaccaaccctgccggactgacagcccaagctgctgccgtcag



ccaagctaccgaccccctgagcctcctgatcgaaaccgtgacacaggccctgcaggccctgaccattccc



agctttatccccgaggacttcacctttctggacgctatcttcgctggctacgccaccgtgggcgtgacac



aagacgtcgagtccttcgtcgccggcacaatcggagccgagtccaacctcggactcctcaacgtcggcga



cgaaaatcccgccgaagtgacacctggagacttcggcattggagaactcgtcagcgccacatcccctggc



ggaggagtgagcgcttccggagctggaggagctgcttccgtgggcaataccgtgctggccagcgtgggaa



gggccaactccattggccagctcagcgtccccccttcctgggctgccccttccacaaggcctgtgtccgc



tctcagccctgctggactgaccacactccctggcacagacgtggctgagcatggcatgcccggagtgcct



ggagtccctgtggctgctggcagagcttccggagtcctccctaggtatggcgtgaggctgacagtgatgg



ctcatccccccgctgccggataa 



(SEQ ID NO: 7)






MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAY



GSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQAR



AAALAFEQAYAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEA



QYAEMWAQDAAAMYGYATASAAAALLTPFSPPRQTTNPAGLTAQA



AAVSQATDPLSLLIETVTQALQALTIPSFIPEDFTFLDAIFAGYATVGV



TQDVESFVAGTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSATSPG



GGVSASGAGGAASVGNTVLASVGRANSIGQLSVPPSWAAPSTRPVSA



LSPAGLTTLPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVRLTVM



AHPPAAG 



(SEQ ID NO: 8)





Rv3615c
atgacggaaaacttgaccgtccagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcgg



tcgatgcctcctcgggcgtcgaagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgta



ctgctcacagttcaacgacacgttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcat



acggccggtgtcgatctcgccaaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgt



ggcgcaaggctatcgacgggttgtttacctga 



(SEQ ID NO: 9)






atgaccgagaacctgaccgtgcagcctgagaggctgggagtgctggccagccaccacgacaacgctgccg



tggacgcttccagcggagtggaggctgctgctggactgggagagagcgtggccatcacccacggacccta



ctgcagccagttcaacgacaccctgaacgtgtacctgacagcccacaacgccctgggaagcagcctgcat



acagccggcgtggacctggctaagtccctgaggatcgccgccaagatctacagcgaggccgacgaggcct



ggaggaaagccatcgacggcctgttcacctaa 



(SEQ ID NO: 10)






MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGP



YCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEA



WRKAIDGLFT 



(SEQ ID NO: 11)





Rv2628
atgtccacgcaacgaccgaggcactccggtattcgggctgttggcccctacgcatgggccggccgatgtg



gtcggataggcaggtggggggtgcaccaggaggcgatgatgaatctagcgatatggcacccgcgcaaggt



gcaatccgccaccatctatcaggtgaccgatcgctcgcacgacgggcgcacagcacgggtgcctggtgac



gagatcactagcaccgtgtccggttggttgtcggagttgggcacccaaagcccgttggccgatgagcttg



cgcgtgcggtgcggatcggcgactggcccgctgcgtacgcaatcggtgagcacctgtccgttgagattgc



cgttgcggtctaa 



(SEQ ID NO: 12)






atgagcacccagagacccaggcacagcggcattagggccgtgggaccttatgcttgggccggcagatgcg



gaaggatcggcagatggggcgtgcaccaagaggccatgatgaacctggccatctggcaccccaggaaggt



gcagagcgccaccatctaccaggtgaccgacaggagccatgacggaaggaccgccagagtgcccggcgat



gagatcaccagcaccgtgagcggctggctgagcgaactgggcacccaatcccccctggctgatgaactgg



ccagggctgtgaggatcggcgattggcctgccgcctatgccatcggcgagcatctgagcgtggagatcgc



cgtggccgtgtaa 



(SEQ ID NO: 13)






MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPR



KVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADE



LARAVRIGDWPAAYAIGEHLSVEIAVAV 



(SEQ ID NO: 14)





Rv2034
gtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcgtgg



gcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgcggta



gtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgcgtc



taccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccctga



ctggctacgcgcagctcatcgactccgaaggagacgacacatga 



(SEQ ID NO: 15)






VSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSRPA



VSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWTRA



LTGYAQLIDSEGDDT 



(SEQ ID NO: 16)





Rv3136
atggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggccctggggcaggatcgc


N-terminus
tgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacagccgaggcatatggatc


(Rv3136Nt)
ggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgatggcggtgacggccgct



ccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatccaagccagggcggcag



cgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggccaaccggatacagct



gctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggccaccgaggcacagtac



gccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcagcggctgcggccctgc



tgacaccgttctccccgccgcggcagaccaccaacccggccggcctgacc 



(SEQ ID NO: 17)






MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAY



GSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQAR



AAALAFEQAYAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEA



QYAEMWAQDAAAMYGYATASAAAALLTPFSPPRQTTNPAGLT 



(SEQ ID NO: 18)





Ag85A
atgcagcttgttgacagggttcgtggcgccgtcacgggtatgtcgcgtcgactcgtggtcggggccgtcg



gcgcggccctagtgtcgggtctggtcggcgccgtcggtggcacggcgaccgcgggggcattttcccggcc



gggcttgccggtggagtacctgcaggtgccgtcgccgtcgatgggccgtgacatcaaggtccaattccaa



agtggtggtgccaactcgcccgccctgtacctgctcgacggcctgcgcgcgcaggacgacttcagcggct



gggacatcaacaccccggcgttcgagtggtacgaccagtcgggcctgtcggtggtcatgccggtgggtgg



ccagtcaagcttctactccgactggtaccagcccgcctgcggcaaggccggttgccagacttacaagtgg



gagaccttcctgaccagcgagctgccggggtggctgcaggccaacaggcacgtcaagcccaccggaagcg



ccgtcgtcggtctttcgatggctgcttcttcggcgctgacgctggcgatctatcacccccagcagttcgt



ctacgcgggagcgatgtcgggcctgttggacccctcccaggcgatgggtcccaccctgatcggcctggcg



atgggtgacgctggcggctacaaggcctccgacatgtggggcccgaaggaggacccggcgtggcagcgca



acgacccgctgttgaacgtcgggaagctgatcgccaacaacacccgcgtctgggtgtactgcggcaacgg



caagccgtcggatctgggtggcaacaacctgccggccaagttcctcgagggcttcgtgcggaccagcaac



atcaagttccaagacgcctacaacgccggtggcggccacaacggcgtgttcgacttcccggacagcggta



cgcacagctgggagtactggggcgcgcagctcaacgctatgaagcccgacctgcaacgggcactgggtgc



cacgcccaacaccgggcccgcgccccagggcgcctag 



(SEQ ID NO: 19)






MQLVDRVRGAVTGMSRRLVVGAVGAALVSGLVGAVGGTATAGAF



SRPGLPVEYLQVPSPSMGRDIKVQFQSGGANSPALYLLDGLRAQDDF



SGWDINTPAFEWYDQSGLSVVMPVGGQSSFYSDWYQPACGKAGCQ



TYKWETFLTSELPGWLQANRHVKPTGSAVVGLSMAASSALTLAIYH



PQQFVYAGAMSGLLDPSQAMGPTLIGLAMGDAGGYKASDMWGPKE



DPAWQRNDPLLNVGKLIANNTRVWVYCGNGKPSDLGGNNLPAKFL



EGFVRTSNIKFQDAYNAGGGHNGVFDFPDSGTHSWEYWGAQLNAM



KPDLQRALGATPNTGPAPQGA 



(SEQ ID NO: 20)





Ag85B
atgacagacgtgagccgaaagattcgagcttggggacgccgattgatgatcggcacggcagcggctgtag



tccttccgggcctggtggggcttgccgcggagcggcaaccgcgggcgcgttctcccggccggggctgccg



gtcgagtacctgcaggtgccgtcgccgtcgatgggccgcgacatcaaggttcagttccagagcggtggga



acaactcacctgcggtttatctgctcgacggcctgcgcgcccaagacgactacaacggctgggatatcaa



caccccggcgttcgagtggtactaccagtcgggactgtcgatagtcatgccggtcggcgggcagtccagc



ttctacagcgactggtacagcccggcctgcggtaaggctggctgccagacttacaagtgggaaaccttcc



tgaccagcgagctgccgcaatggttgtccgccaacagggccgtgaagcccaccggcagcgctgcaatcgg



cttgtcgatggccggctcgtcggcaatgatcttggccgcctaccacccccagcagttcatctacgccggc



tcgctgtcggccctgctggacccctctcaggggatggggcctagcctgatcggcctcgcgatgggtgacg



ccggcggttacaaggccgcagacatgtggggtccctcgagtgacccggcatgggagcgcaacgaccctac



gcagcagatccccaagctggtcgcaaacaacacccggctatgggtttattgcgggaacggcaccccgaac



gagttgggcggtgccaacatacccgccgagttcttggagaacttcgttcgtagcagcaacctgaagttcc



aggatgcgtacaacgccgcgggcgggcacaacgccgtgttcaacttcccgcccaacggcacgcacagctg



ggagtactggggcgctcagctcaacgccatgaagggtgacctgcagagttcgttaggcgccggctga 



(SEQ ID NO: 21)






atgtttagccgtcctggcctgccagttgaatacctgcaagttccgagcccgtccatgggtcgtgacatta



aggtgcagttccagagcggcggtaacaatagcccggctgtgtacctgctggacggtctgcgtgcgcagga



tgattacaacggctgggacatcaataccccggcatttgagtggtattaccagtcgggtctgagcattgtg



atgccggttggcggtcaaagcagcttctatagcgattggtacagcccggcatgcggcaaggctggttgcc



aaacctacaagtgggaaactttcttgaccagcgagctgccgcaatggttgagcgccaaccgtgcggtcaa



accgaccggtagcgctgctattggcctgtccatggccggcagcagcgcgatgatcttggcggcataccat



ccgcagcagtttatctacgccggtagcctgagcgcattgctggacccgagccaaggcatgggtccgagcc



tgattggtctggcaatgggtgacgcaggtggttacaaagcggccgatatgtggggcccatctagcgaccc



ggcatgggagcgtaatgacccgacccagcaaattccgaaactggtggcgaataacacgcgcctgtgggtc



tactgtggcaatggtacgccgaacgagctgggtggcgcgaatatccctgcggagtttctggaaaactttg



ttcgcagcagcaacctgaaattccaggacgcgtataacgcagccggtggtcacaatgcggttttcaattt



cccgccaaatggcactcatagctgggagtactggggtgcgcagttgaacgcaatgaaaggcgatctgcaa



tcctctctgggtgcgggc 



(SEQ ID NO: 22)






atgttctccaggcccggcctgcctgtcgagtatctgcaggtcccctccccctccatgggcagagacatca



aggtgcagttccaatccggaggcaacaacagccccgccgtgtatctcctcgacggcctgagggctcagga



cgactacaacggctgggacatcaacacccccgccttcgagtggtactaccagtccggactgagcatcgtc



atgcccgtgggcggccagagctccttctacagcgactggtatagccctgcctgcggcaaagccggatgcc



agacctacaagtgggagacctttctgaccagcgaactgccccagtggctgtccgccaatagggccgtcaa



acctaccggctccgctgccatcggactcagcatggccggaagctccgctatgatcctggccgcctaccac



ccccagcaatttatctacgctggcagcctgtccgctctgctggatcctagccaaggcatgggccctagcc



tcattggcctggccatgggcgatgctggcggctataaggccgccgatatgtggggccctagctccgatcc



tgcctgggagaggaatgaccccacccagcagatccccaagctggtggccaacaacacaaggctctgggtg



tactgcggcaatggcacccccaacgaactgggcggagccaacattcccgccgagttcctggagaacttcg



tcaggagcagcaacctgaagttccaggacgcctacaatgccgccggaggccacaacgctgtgttcaactt



ccctcccaacggcacccacagctgggagtattggggcgctcagctgaacgccatgaaaggcgacctccag



agctccctgggagctgga



(SEQ ID NO: 23)






MTDVSRKIRAWGRRLMIGTAAAVVLPGLVGLAGGAATAGAFSRPGL



PVEYLQVPSPSMGRDIKVQFQSGGNNSPAVYLLDGLRAQDDYNGWD



INTPAFEWYYQSGLSIVMPVGGQSSFYSDWYSPACGKAGCQTYKWE



TFLTSELPQWLSANRAVKPTGSAAIGLSMAGSSAMILAAYHPQQFIY



AGSLSALLDPSQGMGPSLIGLAMGDAGGYKAADMWGPSSDPAWER



NDPTQQIPKLVANNTRLWVYCGNGTPNELGGANIPAEFLENFVRSSN



LKFQDAYNAAGGHNAVFNFPPNGTHSWEYWGAQLNAMKGDLQSSL



GAG 



(SEQ ID NO: 24)






MFSRPGLPVEYLQVPSPSMGRDIKVQFQSGGNNSPAVYLLDGLRAQD



DYNGWDINTPAFEWYYQSGLSIVMPVGGQSSFYSDWYSPACGKAGC



QTYKWETFLTSELPQWLSANRAVKPTGSAAIGLSMAGSSAMILAAY



HPQQFIYAGSLSALLDPSQGMGPSLIGLAMGDAGGYKAADMWGPSS



DPAWERNDPTQQIPKLVANNTRLWVYCGNGTPNELGGANIPAEFLE



NFVRSSNLKFQDAYNAAGGHNAVFNFPPNGTHSWEYWGAQLNAMK



GDLQSSLGAG 



(SEQ ID NO: 25)





Rv3407
atgcgtgctaccgttgggcttgtggaggcaatcggaatccgagaactaagacagcacgcatcgcgatacc



tcgcccgggttgaagccggcgaggaacttggcgtcaccaacaaaggaagacttgtggcccgactcatccc



ggtgcaggccgcggagcgttctcgcgaagccctgattgaatcaggtgtcctgattccggctcgtcgtcca



caaaaccttctcgacgtcaccgccgaaccggcgcgcggccgcaagcgcaccctgtccgatgttctcaacg



aaatgcgcgacgagcagtga 



(SEQ ID NO: 26)






MRATVGLVEAIGIRELRQHASRYLARVEAGEELGVTNKGRLVARLIP



VQAAERSREALIESGVLIPARRPQNLLDVTAEPARGRKRTLSDVLNE



MRDEQ 



(SEQ ID NO: 27)





Rv1733
atgatcgccacaacccgcgatcgtgaaggagccaccatgatcacgtttaggctgcgcttgccgtgccgga



cgatactgcgggtgttcagccgcaatccgctggtgcgtgggacggatcgactcgaggcggtcgtcatgct



gctggccgtcacggtctcgctgctgactatcccgttcgccgccgcggccggcaccgcagtccaggattcc



cgcagccacgtctatgcccaccaggcccagacccgccatcccgcaaccgcgaccgtgatcgatcacgagg



gggtgatcgacagcaacacgaccgccacgtcagcgccgccgcgcacgaagatcaccgtgcctgcccgatg



ggtcgtgaacggaatagaacgcagcggtgaggtcaacgcgaagccgggaaccaaatccggtgaccgcgtc



ggcatttgggtcgacagtgccggtcagctggtcgatgaaccagctccgccggcccgtgccattgcggatg



cggccctggccgccttgggactctggttgagcgtcgccgcggttgcgggcgccctgctggcgctcactcg



ggcgattctgatccgcgttcgcaacgccagttggcaacacgacatcgacagcctgttctgcacgcagcgg



tga 



(SEQ ID NO: 28)






MIATTRDREGATMITFRLRLPCRTILRVFSRNPLVRGTDRLEAVVMLL



AVTVSLLTIPFAAAAGTAVQDSRSHVYAHQAQTRHPATATVIDHEGV



IDSNTTATSAPPRTKITVPARWVVNGIERSGEVNAKPGTKSGDRVGIW



VDSAGQLVDEPAPPARAIADAALAALGLWLSVAAVAGALLALTRAI



LIRVRNASWQHDIDSLFCTQR 



(SEQ ID NO: 29)





Rv2626c
atgaccaccgcacgcgacatcatgaacgcaggtgtgacctgtgttggcgaacacgagacgctaaccgctg



ccgctcaatacatgcgtgagcacgacatcggcgcgttgccgatctgcggggacgacgaccggctgcacgg



catgctcaccgaccgcgacattgtgatcaaaggcctggctgcgggcctagacccgaataccgccacggct



ggcgagttggcccgggacagcatctactacgtcgatgcgaacgcaagcatccaggagatgctcaacgtca



tggaagaacatcaggtccgccgtgttccggtcatctcagagcaccgcttggtcggaatcgtcaccgaagc



cgacatcgcccgacacctgcccgagcacgccattgtgcagttcgtcaaggcaatctgctcgcccatggcc



ctcgccagctag 



(SEQ ID NO: 30)






MTTARDIMNAGVTCVGEHETLTAAAQYMREHDIGALPICGDDDRLH



GMLTDRDIVIKGLAAGLDPNTATAGELARDSIYYVDANASIQEMLNV



MEEHQVRRVPVISEHRLVGIVTEADIARHLPEHAIVQFVKAICSPMAL



AS 



(SEQ ID NO: 31)





RpfA
atgagtggacgccaccgtaagcccaccacatccaacgtcagcgtcgccaagatcgcctttaccggcgcag



tactcggtggcggcggcatcgccatggccgctcaggcgaccgcggccaccgacggggaatgggatcaggt



ggcccgctgcgagtcgggcggcaactggtcgatcaacaccggcaacggttacctcggtggcttgcagttc



actcaaagcacctgggccgcacatggtggcggcgagttcgccccgtcggctcagctggccagccgggagc



agcagattgccgtcggtgagcgggtgctggccacccagggtcgcggcgcctggccggtgtgcggccgcgg



gttatcgaacgcaacaccccgcgaagtgcttcccgcttcggcagcgatggacgctccgttggacgcggcc



gcggtcaacggcgaaccagcaccgctggccccgccgcccgccgacccggcgccacccgtggaacttgccg



ctaacgacctgcccgcaccgctgggtgaacccctcccggcagctcccgccgacccggcaccacccgccga



cctggcaccacccgcgcccgccgacgtcgcgccacccgtggaacttgccgtaaacgacctgcccgcaccg



ctgggtgaacccctcccggcagctcccgccgacccggcaccacccgccgacctggcaccacccgcgcccg



ccgacctggcgccacccgcgcccgccgacctggcgccacccgcgcccgccgacctggcaccacccgtgga



acttgccgtaaacgacctgcccgcgccgctgggtgaacccctcccggcagctcccgccgaactggcgcca



cccgccgatctggcacccgcgtccgccgacctggcgccacccgcgcccgccgacctggcgccacccgcgc



ccgccgaactggcgccacccgcgcccgccgacctggcaccacccgctgcggtgaacgagcaaaccgcgcc



gggcgatcagcccgccacagctccaggcggcccggttggccttgccaccgatttggaactccccgagccc



gacccccaaccagctgacgcaccgccgcccggcgacgtcaccgaggcgcccgccgaaacgccccaagtct



cgaacatcgcctatacgaagaagctgtggcaggcgattcgggcccaggacgtctgcggcaacgatgcgct



ggactcgctcgcacagccgtacgtcatcggctga 



(SEQ ID NO: 32)






MSGRHRKPTTSNVSVAKIAFTGAVLGGGGIAMAAQATAATDGEWD



QVARCESGGNWSINTGNGYLGGLQFTQSTWAAHGGGEFAPSAQLAS



REQQIAVGERVLATQGRGAWPVCGRGLSNATPREVLPASAAMDAPL



DAAAVNGEPAPLAPPPADPAPPVELAANDLPAPLGEPLPAAPADPAPP



ADLAPPAPADVAPPVELAVNDLPAPLGEPLPAAPADPAPPADLAPPAP



ADLAPPAPADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPAELAPP



ADLAPASADLAPPAPADLAPPAPAELAPPAPADLAPPAAVNEQTAPG



DQPATAPGGPVGLATDLELPEPDPQPADAPPPGDVTEAPAETPQVSNI



AYTKKLWQAIRAQDVCGNDALDSLAQPYVIG 



(SEQ ID NO: 33)





RpfC
gtgcatcctttgccggccgaccacggccggtcgcggtgcaatagacacccgatctcaccactctctctaa



tcggtaacgcttcggccacttccggcgatatgtcgagcatgacaagaatcgccaagccgctcatcaagtc



cgccatggccgcaggactcgtcacggcatccatgtcgctctccaccgccgttgcccacgccggtcccagc



ccgaactgggacgccgtcgcgcagtgcgaatccgggggcaactgggcggccaacaccggaaacggcaaat



acggcggactgcagttcaagccggccacctgggccgcattcggcggtgtcggcaacccagcagctgcctc



tcgggaacaacaaatcgcagttgccaatcgggttctcgccgaacagggattggacgcgtggccgacgtgc



ggcgccgcctctggccttccgatcgcactgtggtcgaaacccgcgcagggcatcaagcaaatcatcaacg



agatcatttgggcaggcattcaggcaagtattccgcgctga



(SEQ ID NO: 34)






VHPLPADHGRSRCNRHPISPLSLIGNASATSGDMSSMTRIAKPLIKSA



MAAGLVTASMSLSTAVAHAGPSPNWDAVAQCESGGNWAANTGNG



KYGGLQFKPATWAAFGGVGNPAAASREQQIAVANRVLAEQGLDAW



PTCGAASGLPIALWSKPAQGIKQIINEIIWAGIQASIPR 



(SEQ ID NO: 35)





RpfD
atgacaccgggtttgcttactactgcgggtgctggccgaccacgtgacaggtgcgccaggatcgtatgca



cggtgttcatcgaaaccgccgttgtcgcgaccatgtttgtcgcgttgttgggtctgtccaccatcagctc



gaaagccgacgacatcgattgggacgccatcgcgcaatgcgaatccggcggcaattgggcggccaacacc



ggtaacgggttatacggtggtctgcagatcagccaggcgacgtgggattccaacggtggtgtcgggtcgc



cggcggccgcgagtccccagcaacagatcgaggtcgcagacaacattatgaaaacccaaggcccgggtgc



gtggccgaaatgtagttcttgtagtcagggagacgcaccgctgggctcgctcacccacatcctgacgttc



ctcgcggccgagactggaggttgttcggggagcagggacgattga 



(SEQ ID NO: 36)






MTPGLLTTAGAGRPRDRCARIVCTVFIETAVVATMFVALLGLSTISSK



ADDIDWDAIAQCESGGNWAANTGNGLYGGLQISQATWDSNGGVGS



PAAASPQQQIEVADNIMKTQGPGAWPKCSSCSQGDAPLGSLTHILTFL



AAETGGCSGSRDD 



(SEQ ID NO: 37)









In some embodiments, the fusion protein comprises at least two Mycobacterium tuberculosis (Mtb) antigens. In some embodiments, the fusion protein comprises at least three Mycobacterium tuberculosis (Mtb) antigens. In some embodiments, the fusion protein comprises at least four Mycobacterium tuberculosis (Mtb) antigens. In some embodiments, the fusion protein comprises at least five Mtb antigens. In some embodiments, the fusion protein comprises at least six Mtb antigens. In some embodiments, the fusion protein comprises from at least two to at least six Mtb antigens. In some embodiments, the fusion protein comprises from at least three to at least six Mtb antigens. In some embodiments, the fusion protein comprises from at least three to at least five Mtb antigens. In some embodiments, the fusion protein comprises at least three or at least four Mtb antigens. In some embodiments, the fusion protein comprises from at least four to at least six Mtb antigens. In some embodiments, the fusion protein comprises at least four or at least five Mtb antigens.


In some embodiments, the fusion protein comprises Rv1009, Rv3615c, and Rv3136 Mtb antigens. In some embodiments, the fusion protein comprises Rv1009, Rv2034, and Rv3136 Mtb antigens. In some embodiments, the fusion protein comprises Rv1009, Rv2628, Rv3615c, and Rv3136 Mtb antigens. In some embodiments, the fusion protein comprises Rv1009, Rv3615c, Rv2034, and Rv2628 Mtb antigens. In some embodiments, the fusion protein comprises Rv2034, Rv3615c, Rv2628, and Rv3136 Mtb antigens. In some embodiments, the fusion protein comprises Rv1009, Rv2034, Rv2628, Rv3615c, and Rv3136 Mtb antigens. In some embodiments, the fusion protein comprises Rv1009, Rv3136Nt, Rv2628, Rv2034, and Rv3615c Mtb antigens. In some embodiments, the fusion protein comprises Ag85A, Ag85B, and Rv3407 Mtb antigens. In some embodiments, the fusion protein comprises Rv1733 and Rv2626c Mtb antigens. In some embodiments, the fusion protein comprises RfpA, RpfC, and RpfD Mtb antigens.


In any of the embodiments of fusion proteins set forth herein, the individual Mtb antigens can be present in any order. For example, for a fusion protein comprising Rv3615c, Rv2034, Rv2628, and Rv1009 Mtb antigens, the first (or N-terminal) antigen may be Rv3615c, Rv2034, Rv2628, or Rv1009; the second antigen may be Rv3615c, Rv2034, Rv2628, or Rv1009 (whichever one is not the first Mtb antigen); the third antigen may be Rv3615c, Rv2034, Rv2628, or Rv1009 (whichever one is not the first or second Mtb antigen); and the fourth (or C-terminal) antigen may be Rv3615c, Rv2034, Rv2628, or Rv1009 (whichever one is not the first, second, or third Mtb antigen). Likewise for every fusion protein disclosed herein.


Individual Mtb antigens may be linked together in a C-terminus to N-terminus or N-terminus to C-terminus manner without any linker. Alternately, a linker may be present between any two Mtb antigens within any of the fusion proteins disclosed herein. In some embodiments, the linker is a segment of DNA or RNA optionally containing one or more restrictions sites, wherein the linker is inserted between nucleic acid molecules encoding two Mtb antigens of any of the fusion proteins disclosed herein.


In some embodiments, the fusion protein comprises Rv1009-Rv3615c-Rv3136 (Construct A; see Table 2). The nucleotide sequence is SEQ ID NO:38 (inserted EcoRI and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:39.


In some embodiments, the fusion protein comprises Rv1009-Rv2034-Rv3136 (Construct B; see Table 2). The nucleotide sequence is SEQ ID NO:40 (inserted Sac and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:41.


In some embodiments, the fusion protein comprises Rv1009-Rv2628-Rv3615c-Rv3136 (Construct C; see Table 2). The nucleotide sequence is SEQ ID NO:42 (inserted EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:43.


In some embodiments, the fusion protein comprises Rv1009-Rv2034-Rv2628-Rv3615c-Rv3136 (Construct D; see Table 2). The nucleotide sequence is SEQ ID NO:44 (inserted BamHI, EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:45.


In some embodiments, the fusion protein comprises Rv2034-Rv1009-Rv3136 (Construct E; see Table 2). The nucleotide sequence is SEQ ID NO:46 (inserted EcoRI and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:47.


In some embodiments, the fusion protein comprises Rv3136-Rv2034-Rv1009 (Construct F; see Table 2). The nucleotide sequence is SEQ ID NO:48 (inserted Sac and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:49.


In some embodiments, the fusion protein comprises Rv1009-Rv3615c-Rv2034-Rv2628 (Construct G; see Table 2). The nucleotide sequence is SEQ ID NO:50 (inserted EcoRI, Sac and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:51.


In some embodiments, the fusion protein comprises Rv3615c-Rv2034-Rv2628-Rv1009 (Construct H; see Table 2). The nucleotide sequence is SEQ ID NO:52 (inserted BamHI, EcoRI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:53.


In some embodiments, the fusion protein comprises Rv2034-Rv3615c-Rv2628-Rv3136 (Construct I; see Table 2). The nucleotide sequence is SEQ ID NO:54 (inserted EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:55.


In some embodiments, the fusion protein comprises Rv3136-Rv2628-Rv3615c-Rv2034 (Construct J; see Table 2). The nucleotide sequence is SEQ ID NO:56 (inserted EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:57.


In some embodiments, the fusion protein comprises Rv1009-Rv3136Nt-Rv2628-Rv2034-Rv3615c (Construct K; see Table 2). The nucleotide sequence is SEQ ID NO:58 (inserted BamHI, EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:59.


In some embodiments, the fusion protein comprises Rv2034-Rv3615c-Rv3136Nt-Rv2628-Rv1009 (Construct L; see Table 2). The nucleotide sequence is SEQ ID NO:60 (inserted BamHI, EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:61.


In some embodiments, the fusion protein comprises Rv3615c-Rv2628-Rv1009-Rv3136Nt-Rv2034 (Construct M; see Table 2). The nucleotide sequence is SEQ ID NO:62 (inserted BamHI, EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:63.


In some embodiments, the fusion protein comprises Ag85A-Ag85B-Rv3407 (Construct N; see Table 2). The nucleotide sequence is SEQ ID NO:64, and the corresponding amino acid sequence is SEQ ID NO:65.


In some embodiments, the fusion protein comprises Rv1733-Rv2626c (Construct O; see Table 2). The nucleotide sequence is SEQ ID NO:66, and the corresponding amino acid sequence is SEQ ID NO:67.


In some embodiments, the fusion protein comprises RpfA-RpfC-RpfD (Construct P; see Table 2). The nucleotide sequence is SEQ ID NO:68, and the corresponding amino acid sequence is SEQ ID NO:69.


In some embodiments, the fusion protein comprises Rv1009-Rv2628-Rv3136Nt-Rv2034-Rv3615c (Construct Q; see Table 2). The nucleotide sequence is SEQ ID NO:70 (inserted BamHI, EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:71.


In some embodiments, the fusion protein comprises Rv2628-Rv3136Nt-Rv1009-Rv2034-Rv3615c (Construct R; see Table 2). The nucleotide sequence is SEQ ID NO:72 (inserted BamHI, EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:73.


In some embodiments, the fusion protein comprises Rv2628-Rv3136Nt-Rv2034-Rv3615c-Rv1009 (Construct S; see Table 2). The nucleotide sequence is SEQ ID NO:74 (inserted BamHI, EcoRI, SacI, and HindIII restriction sites are bolded), and the corresponding amino acid sequence is SEQ ID NO:75.










TABLE 2





Con-
nucleotide sequence


struct
amino acid sequence







A
atggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg



tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacggcgggttggtgcgcacggtgcacttgccggcccccaatgtcgcgggg



ctgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcg



tcgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgcc



gaacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccg



gggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgcca



acgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgccccc



ggtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacacc



ggcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatg



caccccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaagg



ttggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcgaattcatgacggaaaacttgaccgtc



cagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcg



aagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgtactgctcacagttcaacgacac



gttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgcc



aaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggt



tgtttaccaagcttatggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggccc



tggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacagcc



gaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgatgg



cggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatcca



agccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggcc



aaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggcca



ccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcagc



ggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgctcag



gccgccgcggtcagccaggccaccgacccactgtcgctgctgattgagacggtgacccaagcgctgcaag



cgctgacgattccgagcttcatccctgaggacttcaccttccttgacgccatattcgctggatatgccac



ggtaggtgtgacgcaggatgtcgagtcctttgttgccgggaccatcggggccgagagcaacctaggcctt



ttgaacgtcggcgacgagaatcccgcggaggtgacaccgggcgactttgggatcggcgagttggtttccg



cgaccagtcccggcggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgtcggcaacacggtgct



cgcgagtgtcggccgggcaaactcgattgggcaactatcggtcccaccgagctgggccgcgccctcgacg



cgccctgtctcggcattgtcgcccgccggcctgaccacactcccggggaccgacgtggccgagcacggga



tgccaggtgtaccgggggtgccagtggcagcagggcgagcctccggcgtcctacctcgatacggggttcg



gctcacggtgatggcccacccacccgcggcagggtaa 



(SEQ ID NO: 38)






MACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAG



VQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMT



DTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNV



AGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLP



PNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGAREFMTENLTVQPERLGVLASHHDNAAV



DASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHT



AGVDLAKSLRIAAKIYSEADEAWRKAIDGLFTKLMDFALLPPEVNSA



RMYTGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLAALHWR



GPAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQAYAMT



LPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAA



MYGYATASAAAALLTPFSPPRQTTNPAGLTAQAAAVSQATDPLSLLI



ETVTQALQALTIPSFIPEDFTFLDAIFAGYATVGVTQDVESFVAGTIGA



ESNLGLLNVGDENPAEVTPGDFGIGELVSATSPGGGVSASGAGGAAS



VGNTVLASVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLTTLPGTDV



AEHGMPGVPGVPVAAGRASGVLPRYGVRLTVMAHPPAAG 



(SEQ ID NO: 39)





B
atggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg



tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggc



tgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgt



cgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccg



aacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccgg



ggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaa



cgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccg



gtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccg



gcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgc



accccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggt



tggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcgaattcgtgtccacttacagatcaccgg



atcgcgcttggcaggcgctggcggacggcactcgccgggccatcgtggagcggctggcgcacggcccgct



ggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgcggtgtcacagcacctcaaagtgctc



aagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgcgtctaccagctcgacccgacaggcc



ttgcggcattgcgcaccgacctcgaccggttctggacacgcgccctgactggctacgcgcagctcatcga



ctccgaaggagacgacacaaagcttatggatttcgcactgttaccaccggaagtcaactccgcccggatg



tacaccggccctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttgg



ccaccacagccgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccggcagc



ggaatcgatggcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacacagcaa



acagcgatccaagccagggggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccggt



ggtagcggccaaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactgcggcg



atcgcggccaccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgcca



ccgcctcagcggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggcct



gaccgctcaggccgccgcggtcagccaggccaccgacccactgtcgctgctgattgagacggtgacccaa



gcgctgcaagcgctgacgattccgagcttcatccctgaggacttcaccttccttgacgccatattcgctg



gatatgccacggtaggtgtgacgcaggatgtcgagtcctttgttgccgggaccatcggggccgagagcaa



cctaggccttttgaacgtcggcgacgagaatcccgcggaggtgacaccgggcgactttgggatcggcgag



ttggtttccgcgaccagtcccggcggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgtcggca



acacggtgctcgcgagtgtcggccgggcaaactcgattgggcaactatcggtcccaccgagctgggccgc



gccctcgacgcgccctgtctcggcattgtcgcccgccggcctgaccacactcccggggaccgacgtggcc



gagcacgggatgccaggtgtaccgggggtgccagtggcagcagggcgagcctccggcgtcctacctcgat



acggggttcggctcacggtgatggcccacccacccgcggcagggtaa 



(SEQ ID NO: 40)






MACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAG



VQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMT



DTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNV



AGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLP



PNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGAREFVSTYRSPDRAWQALADGTRRAIVER



LAHGPLAVGELARDLPVSRPAVSQHLKVLKTARLVCDRPAGTRRVY



QLDPTGLAALRTDLDRFWTRALTGYAQLIDSEGDDTKLMDFALLPPE



VNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLAA



LHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQA



YAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQ



DAAAMYGYATASAAAALLTPFSPPRQTTNPAGLTAQAAAVSQATDP



LSLLIETVTQALQALTIPSFIPEDFTFLDAIFAGYATVGVTQDVESFVA



GTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSATSPGGGVSASGA



GGAASVGNTVLASVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLTT



LPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVRLTVMAHPPAAG



(SEQ ID NO: 41)





C
atggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg



tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggc



tgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgt



cgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccg



aacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccgg



ggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaa



cgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccg



gtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccg



gcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgc



accccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggt



tggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcgaattcatgtccacgcaacgaccgaggc



actccggtattcgggctgttggcccctacgcatgggccggccgatgtggtcggataggcaggtggggggt



gcaccaggaggcgatgatgaatctagcgatatggcacccgcgcaaggtgcaatccgccaccatctatcag



gtgaccgatcgctcgcacgacgggcgcacagcacgggtgcctggtgacgagatcactagcaccgtgtccg



gttggttgtcggagttgggcacccaaagcccgttggccgatgagcttgcgcgtgcggtgcggatcggcga



ctggcccgctgcgtacgcaatcggtgagcacctgtccgttgagattgccgttgcggtcgagctcatgacg



gaaaacttgaccgtccagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatg



cctcctcgggcgtcgaagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgtactgctc



acagttcaacgacacgttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcatacggcc



ggtgtcgatctcgccaaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgca



aggctatcgacgggttgtttaccaagcttatggatttcgcactgttaccaccggaagtcaactccgcccg



gatgtacaccggccctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgag



ttggccaccacagccgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccgg



cagcggaatcgatggcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacaca



gcaaacagcgatccaagccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgcca



ccggtggtagcggccaaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactg



cggcgatcgcggccaccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggtta



cgccaccgcctcagcggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggcc



ggcctgaccgctcaggccgccgcggtcagccaggccaccgacccactgtcgctgctgattgagacggtga



cccaagcgctgcaagcgctgacgattccgagcttcatccctgaggacttcaccttccttgacgccatatt



cgctggatatgccacggtaggtgtgacgcaggatgtcgagtcctttgttgccgggaccatcggggccgag



agcaacctaggccttttgaacgtcggcgacgagaatcccgcggaggtgacaccgggcgactttgggatcg



gcgagttggtttccgcgaccagtcccggcggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgt



cggcaacacggtgctcgcgagtgtcggccgggcaaactcgattgggcaactatcggtcccaccgagctgg



gccgcgccctcgacgcgccctgtctcggcattgtcgcccgccggcctgaccacactcccggggaccgacg



tggccgagcacgggatgccaggtgtaccgggggtgccagtggcagcagggcgagcctccggcgtcctacc



tcgatacggggttcggctcacggtgatggcccacccacccgcggcagggtaa 



(SEQ ID NO: 42)






MACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAG



VQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMT



DTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNV



AGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLP



PNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGAREFMSTQRPRHSGIRAVGPYAWAGRCGR



IGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGD



EITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAV



AVELMTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAI



THGPYCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSE



ADEAWRKAIDGLFTKLMDFALLPPEVNSARMYTGPGAGSLLAAAGG



WDSLAAELATTAEAYGSVLSGLAALHWRGPAAESMAVTAAPYIGW



LYTTAEKTQQTAIQARAAALAFEQAYAMTLPPPVVAANRIQLLALIA



TNFFGQNTAAIAATEAQYAEMWAQDAAAMYGYATASAAAALLTPF



SPPRQTTNPAGLTAQAAAVSQATDPLSLLIETVTQALQALTIPSFIPED



FTFLDAIFAGYATVGVTQDVESFVAGTIGAESNLGLLNVGDENPAEV



TPGDFGIGELVSATSPGGGVSASGAGGAASVGNTVLASVGRANSIGQ



LSVPPSWAAPSTRPVSALSPAGLTTLPGTDVAEHGMPGVPGVPVAAG



RASGVLPRYGVRLTVMAHPPAAG 



(SEQ ID NO: 43)





D
atggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg



tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggc



tgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgt



cgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccg



aacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccgg



ggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaa



cgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccg



gtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccg



gcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgc



accccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggt



tggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcggatccgtgtccacttacagatcaccgg



atcgcgcttggcaggcgctggcggacggcactcgccgggccatcgtggagcggctggcgcacggcccgct



ggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgcggtgtcacagcacctcaaagtgctc



aagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgcgtctaccagctcgacccgacaggcc



ttgcggcattgcgcaccgacctcgaccggttctggacacgcgccctgactggctacgcgcagctcatcga



ctccgaaggagacgacacagaattcatgtccacgcaacgaccgaggcactccggtattcgggctgttggc



ccctacgcatgggccggccgatgtggtcggataggcaggtggggggtgcaccaggaggcgatgatgaatc



tagcgatatggcacccgcgcaaggtgcaatccgccaccatctatcaggtgaccgatcgctcgcacgacgg



gcgcacagcacgggtgcctggtgacgagatcactagcaccgtgtccggttggttgtcggagttgggcacc



caaagcccgttggccgatgagcttgcgcgtgcggtgcggatcggcgactggcccgctgcgtacgcaatcg



gtgagcacctgtccgttgagattgccgttgcggtcgagctcatgacggaaaacttgaccgtccagcccga



gcgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcgaagctgcc



gctggcctaggcgaatctgtggcgatcactcacggtccgtactgctcacagttcaacgacacgttaaatg



tgtacttgactgcccacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgccaaaagtct



tcgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggttgtttacc




aagcttatggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggccctggggcag




gatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacagccgaggcata



tggatcggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgatggcggtgacg



gccgctccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatccaagccaggg



cggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggccaaccggat



acagctgctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggccaccgaggca



cagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcagcggctgcgg



ccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgctcaggccgccgc



ggtcagccaggccaccgacccactgtcgctgctgattgagacggtgacccaagcgctgcaagcgctgacg



attccgagcttcatccctgaggacttcaccttccttgacgccatattcgctggatatgccacggtaggtg



tgacgcaggatgtcgagtcctttgttgccgggaccatcggggccgagagcaacctaggccttttgaacgt



cggcgacgagaatcccgcggaggtgacaccgggcgactttgggatcggcgagttggtttccgcgaccagt



cccggcggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgtcggcaacacggtgctcgcgagtg



tcggccgggcaaactcgattgggcaactatcggtcccaccgagctgggccgcgccctcgacgcgccctgt



ctcggcattgtcgcccgccggcctgaccacactcccggggaccgacgtggccgagcacgggatgccaggt



gtaccgggggtgccagtggcagcagggcgagcctccggcgtcctacctcgatacggggttcggctcacgg



tgatggcccacccacccgcggcagggtaa 



(SEQ ID NO: 44)






MACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAG



VQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMT



DTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNV



AGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLP



PNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGARGSVSTYRSPDRAWQALADGTRRAIVER



LAHGPLAVGELARDLPVSRPAVSQHLKVLKTARLVCDRPAGTRRVY



QLDPTGLAALRTDLDRFWTRALTGYAQLIDSEGDDTEFMSTQRPRHS



GIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQ



VTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGD



WPAAYAIGEHLSVEIAVAVELMTENLTVQPERLGVLASHHDNAAVD



ASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTA



GVDLAKSLRIAAKIYSEADEAWRKAIDGLFTKLMDFALLPPEVNSAR



MYTGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLAALHWRG



PAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQAYAMTL



PPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAAM



YGYATASAAAALLTPFSPPRQTTNPAGLTAQAAAVSQATDPLSLLIET



VTQALQALTIPSFIPEDFTFLDAIFAGYATVGVTQDVESFVAGTIGAES



NLGLLNVGDENPAEVTPGDFGIGELVSATSPGGGVSASGAGGAASVG



NTVLASVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLTTLPGTDV



AEHGMPGVPGVPVAAGRASGVLPRYGVRLTVMAHPPAAG 



(SEQ ID NO: 45)





E
atggtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcg



tggagcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgc



ggtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgc



gtctaccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccc



tgactggctacgcgcagctcatcgactccgaaggagacgacacagaattcgcatgcaaaacggtgacgtt



gaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcgggtgatcgacatcgtcgaagagaac



gggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgtgcaggtccatgacgccgacacca



tcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcacgacgctaagcaggtgtggacgac



cgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgacacggcgccggccgcggcttctcgc



gccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgccaagacggtgcagctcaacgacg



gcgggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggctgctgagtgcggccggcgtgcc



gctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgtcgaaggcatgcagatccaggtg



acccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccgaacgcgcgtcgtgtcgaggacc



cggagatgaacatgagccgggaggtcgtcgaagacccgggggttccggggacccaggatgtgacgttcgc



ggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaacgtcgtggtgaccccggcccac



gaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccggtgatcgacggaagcatctggg



acgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccggcaacgggtattacggtggtgt



gcagtttgaccagggcacctgggaggccaacgggggctgcggtatgcaccccgcgctgacctcgccaccc



gcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggttggggcgcctggccggtatgtgc



tgcacgagcgggtgcgcgcaagcttatggatttcgcactgttaccaccggaagtcaactccgcccggatg



tacaccggccctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttgg



ccaccacagccgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccggcagc



ggaatcgatggcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacacagcaa



acagcgatccaagccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccgg



tggtagcggccaaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactgcggc



gatcgcggccaccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgcc



accgcctcagcggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggcc



tgaccgctcaggccgccgcggtcagccaggccaccgacccactgtcgctgctgattgagacggtgaccca



agcgctgcaagcgctgacgattccgagcttcatccctgaggacttcaccttccttgacgccatattcgct



ggatatgccacggtaggtgtgacgcaggatgtcgagtcctttgttgccgggaccatcggggccgagagca



acctaggccttttgaacgtcggcgacgagaatcccgcggaggtgacaccgggcgactttgggatcggcga



gttggtttccgcgaccagtcccggcggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgtcggc



aacacggtgctcgcgagtgtcggccgggcaaactcgattgggcaactatcggtcccaccgagctgggccg



cgccctcgacgcgccctgtctcggcattgtcgcccgccggcctgaccacactcccggggaccgacgtggc



cgagcacgggatgccaggtgtaccgggggtgccagtggcagcagggcgagcctccggcgtcctacctcga



tacggggttcggctcacggtgatggcccacccacccgcggcagggtaa 



(SEQ ID NO: 46)






MVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSRP



AVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWTR



ALTGYAQLIDSEGDDTEFACKTVTLTVDGTAMRVTTMKSRVIDIVEE



NGFSVDDRDDLYPAAGVQVHDADTIVLRRSRPLQISLDGHDAKQVW



TTASTVDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQL



NDGGLVRTVHLPAPNVAGLLSAAGVPLLQSDHVVPAATAPIVEGMQI



QVTRNRIKKVTERLPLPPNARRVEDPEMNMSREVVEDPGVPGTQDV



TFAVAEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEVPPVIDG



SIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANGGLRYAPR



ADLATREEQIAVAEVTRLRQGWGAWPVCAARAGARKLMDFALLPP



EVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLA



ALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQ



AYAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWA



QDAAAMYGYATASAAAALLTPFSPPRQTTNPAGLTAQAAAVSQATD



PLSLLIETVTQALQALTIPSFIPEDFTFLDAIFAGYATVGVTQDVESFVA



GTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSATSPGGGVSASGA



GGAASVGNTVLASVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLTT



LPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVRLTVMAHPPAAG



(SEQ ID NO: 47)





F
atggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggccctggggcaggatcgc



tgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacagccgaggcatatggatc



ggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgatggcggtgacggccgct



ccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatccaagccagggcggcag



cgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggccaaccggatacagct



gctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggccaccgaggcacagtac



gccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcagcggctgcggccctgc



tgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgctcaggccgccgcggtcag



ccaggccaccgacccactgtcgctgctgattgagacggtgacccaagcgctgcaagcgctgacgattccg



agcttcatccctgaggacttcaccttccttgacgccatattcgctggatatgccacggtaggtgtgacgc



aggatgtcgagtcctttgttgccgggaccatcggggccgagagcaacctaggccttttgaacgtcggcga



cgagaatcccgcggaggtgacaccgggcgactttgggatcggcgagttggtttccgcgaccagtcccggc



ggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgtcggcaacacggtgctcgcgagtgtcggcc



gggcaaactcgattgggcaactatcggtcccaccgagctgggccgcgccctcgacgcgccctgtctcggc



attgtcgcccgccggcctgaccacactcccggggaccgacgtggccgagcacgggatgccaggtgtaccg



ggggtgccagtggcagcagggcgagcctccggcgtcctacctcgatacggggttcggctcacggtgatgg



cccacccacccgcggcaggggagctcgtgtccacttacagatcaccggatcgcgcttggcaggcgctggc



ggacggcactcgccgggccatcgtggagcggctggcgcacggcccgctggccgtcggcgagttggcccgc



gacctgcccgtcagccgacccgcggtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcg



accgccccgcgggaacacgccgcgtctaccagctcgacccgacaggccttgcggcattgcgcaccgacct



cgaccggttctggacacgcgccctgactggctacgcgcagctcatcgactccgaaggagacgacacaaag




cttgcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg




tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacggcgggttggtgcgcacggtgcacttgccggcccccaatgtcgcgggg



ctgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcg



tcgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgcc



gaacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccg



gggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgcca



acgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgccccc



ggtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacacc



ggcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatg



caccccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaagg



ttggggcgcctggccggtatgtgctgcacgagcgggtgcgcgctga 



(SEQ ID NO: 48)






MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAY



GSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQAR



AAALAFEQAYAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEA



QYAEMWAQDAAAMYGYATASAAAALLTPFSPPRQTTNPAGLTAQA



AAVSQATDPLSLLIETVTQALQALTIPSFIPEDFTFLDAIFAGYATVGV



TQDVESFVAGTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSATSPG



GGVSASGAGGAASVGNTVLASVGRANSIGQLSVPPSWAAPSTRPVSA



LSPAGLTTLPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVRLTVM



AHPPAAGELVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELA



RDLPVSRPAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRT



DLDRFWTRALTGYAQLIDSEGDDTKLACKTVTLTVDGTAMRVTTM



KSRVIDIVEENGFSVDDRDDLYPAAGVQVHDADTIVLRRSRPLQISLD



GHDAKQVWTTASTVDEALAQLAMTDTAPAAASRASRVPLSGMALP



VVSAKTVQLNDGGLVRTVHLPAPNVAGLLSAAGVPLLQSDHVVPAA



TAPIVEGMQIQVTRNRIKKVTERLPLPPNARRVEDPEMNMSREVVED



PGVPGTQDVTFAVAEVNGVETGRLPVANVVVTPAHEAVVRVGTKPG



TEVPPVIDGSIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTWEAN



GGLRYAPRADLATREEQIAVAEVTRLRQGWGAWPVCAARAGAR



(SEQ ID NO: 49)





G
atggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg



tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggc



tgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgt



cgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccg



aacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccgg



ggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaa



cgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccg



gtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccg



gcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgc



accccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggt



tggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcgaattcatgacggaaaacttgaccgtcc



agcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcga



agctgccgctggcctaggcgaatctgtggcgatcactcacggtccgtactgctcacagttcaacgacacg



ttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgcca



aaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggtt



gtttaccgagctcgtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactcgc



cgggccatcgtggagcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgtca



gccgacccgcggtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcggg



aacacgccgcgtctaccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttctgg



acacgcgccctgactggctacgcgcagctcatcgactccgaaggagacgacacaaagcttatgtccacgc



aacgaccgaggcactccggtattcgggctgttggcccctacgcatgggccggccgatgtggtcggatagg



caggtggggggtgcaccaggaggcgatgatgaatctagcgatatggcacccgcgcaaggtgcaatccgcc



accatctatcaggtgaccgatcgctcgcacgacgggcgcacagcacgggtgcctggtgacgagatcacta



gcaccgtgtccggttggttgtcggagttgggcacccaaagcccgttggccgatgagcttgcgcgtgcggt



gcggatcggcgactggcccgctgcgtacgcaatcggtgagcacctgtccgttgagattgccgttgcggtc



taa 



(SEQ ID NO: 50)






MACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAG



VQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMT



DTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNV



AGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLP



PNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGAREFMTENLTVQPERLGVLASHHDNAAV



DASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHT



AGVDLAKSLRIAAKIYSEADEAWRKAIDGLFTELVSTYRSPDRAWQA



LADGTRRAIVERLAHGPLAVGELARDLPVSRPAVSQHLKVLKTARLV



CDRPAGTRRVYQLDPTGLAALRTDLDRFWTRALTGYAQLIDSEGDD



TKLMSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIW



HPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPL



ADELARAVRIGDWPAAYAIGEHLSVEIAVAV 



(SEQ ID NO: 51)





H
atgacggaaaacttgaccgtccagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcgg



tcgatgcctcctcgggcgtcgaagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgta



ctgctcacagttcaacgacacgttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcat



acggccggtgtcgatctcgccaaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgt



ggcgcaaggctatcgacgggttgtttaccggatccgtgtccacttacagatcaccggatcgcgcttggca



ggcgctggcggacggcactcgccgggccatcgtggagcggctggcgcacggcccgctggccgtcggcgag



ttggcccgcgacctgcccgtcagccgacccgcggtgtcacagcacctcaaagtgctcaagaccgccaggc



tggtgtgcgaccgccccgcgggaacacgccgcgtctaccagctcgacccgacaggccttgcggcattgcg



caccgacctcgaccggttctggacacgcgccctgactggctacgcgcagctcatcgactccgaaggagac



gacacagaattcatgtccacgcaacgaccgaggcactccggtattcgggctgttggcccctacgcatggg



ccggccgatgtggtcggataggcaggtggggggtgcaccaggaggcgatgatgaatctagcgatatggca



cccgcgcaaggtgcaatccgccaccatctatcaggtgaccgatcgctcgcacgacgggcgcacagcacgg



gtgcctggtgacgagatcactagcaccgtgtccggttggttgtcggagttgggcacccaaagcccgttgc



cgatgagcttgcgcgtggcggtgcggatcggcgactggcccgctgcgtacgcaatcggtgagcacctgtc



cgttgagattgccgttgcggtcaagcttatggcatgcaaaacggtgacgttgaccgtcgacggaaccgcg



atgcgggtgaccacgatgaaatcgcgggtgatcgacatcgtcgaagagaacgggttctcagtcgacgacc



gcgacgacctgtatcccgcggccggcgtgcaggtccatgacgccgacaccatcgtgctgcggcgtagccg



tccgctgcagatctcgctggatggtcacgacgctaagcaggtgtggacgaccgcgtcgacggtggacgag



gcgctggcccaactcgcgatgaccgacacggcgccggccgcggcttctcgcgccagccgcgtcccgctgt



ccgggatggcgctaccggtcgtcagcgccaagacggtgcagctcaacgacgggggttggtgcgcacggtg



cacttgccggcccccaatgtcgcggggctgctgagtgcggccggcgtgccgctgttgcaaagcgaccacg



tggtgcccgccgcgacggccccgatcgtcgaaggcatgcagatccaggtgacccgcaatcggatcaagaa



ggtcaccgagcggctgccgctgccgccgaacgcgcgtcgtgtcgaggacccggagatgaacatgagccgg



gaggtcgtcgaagacccgggggttccggggacccaggatgtgacgttcgcggtagctgaggtcaacggcg



tcgagaccggccgtttgcccgtcgccaacgtcgtggtgaccccggcccacgaagccgtggtgcgggtggg



caccaagcccggtaccgaggtgcccccggtgatcgacggaagcatctgggacgcgatcgccggctgtgag



gccggtggcaactgggcgatcaacaccggcaacgggtattacggtggtgtgcagtttgaccagggcacct



gggaggccaacggcgggctgcggtatgcaccccgcgctgacctcgccacccgcgaagagcagatcgccgt



tgccgaggtgacccgactgcgtcaaggttggggcgcctggccggtatgtgctgcacgagcgggtgcgcgc



tga 



(SEQ ID NO: 52)






MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGP



YCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEA



WRKAIDGLFTGSVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVG



ELARDLPVSRPAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAA



LRTDLDRFWTRALTGYAQLIDSEGDDTEFMSTQRPRHSGIRAVGPYA



WAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGR



TARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGE



HLSVEIAVAVKLMACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSV



DDRDDLYPAAGVQVHDADTIVLRRSRPLQISLDGHDAKQVWTTAST



VDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQLNDGG



LVRTVHLPAPNVAGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTR



NRIKKVTERLPLPPNARRVEDPEMNMSREVVEDPGVPGTQDVTFAV



AEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWD



AIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANGGLRYAPRADLA



TREEQIAVAEVTRLRQGWGAWPVCAARAGAR 



(SEQ ID NO: 53)





I
atggtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcg



tggagcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgc



ggtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgc



gtctaccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccc



tgactggctacgcgcagctcatcgactccgaaggagacgacacagaattcatgacggaaaacttgaccgt



ccagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtc



gaagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgtactgctcacagttcaacgaca



cgttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgc



caaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgcaaggctatcgacggg



ttgtttaccgagctcatgtccacgcaacgaccgaggcactccggtattcgggctgttggcccctacgcat



gggccggccgatgtggtcggataggcaggtggggggtgcaccaggaggcgatgatgaatctagcgatggc



acccgcgcaaggtgcaatccgccaccatctatcaggtgaccgatcgctcgcacgacgggcgcacagcacg



ggtgcctggtgacgagatcactagcaccgtgtccggttggttgtcggagttgggcacccaaagcccgttg



gccgatgagcttgcgcgtgcggtgcggatcggcgactggcccgctgcgtacgcaatcggtgagcacctgt



ccgttgagattgccgttgcggtcaagcttatggatttcgcactgttaccaccggaagtcaactccgcccg



gatgtacaccggccctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgag



ttggccaccacagccgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccgg



cagcggaatcgatggcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacaca



gcaaacagcgatccaagccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgcca



ccggtggtagcggccaaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactg



cggcgatcgcggccaccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggtta



cgccaccgcctcagcggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggcc



ggcctgaccgctcaggccgccgcggtcagccaggccaccgacccactgtcgctgctgattgagacggtga



cccaagcgctgcaagcgctgacgattccgagcttcatccctgaggacttcaccttccttgacgccatatt



cgctggatatgccacggtaggtgtgacgcaggatgtcgagtcctttgttgccgggaccatcggggccgag



agcaacctaggccttttgaacgtcggcgacgagaatcccgcggaggtgacaccgggcgactttgggatcg



gcgagttggtttccgcgaccagtcccggcggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgt



cggcaacacggtgctcgcgagtgtcggccgggcaaactcgattgggcaactatcggtcccaccgagctgg



gccgcgccctcgacgcgccctgtctcggcattgtcgcccgccggcctgaccacactcccggggaccgacg



tggccgagcacgggatgccaggtgtaccgggggtgccagtggcagcagggcgagcctccggcgtcctacc



tcgatacggggttcggctcacggtgatggcccacccacccgcggcagggtag 



(SEQ ID NO: 54)






MVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSRP



AVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWTR



ALTGYAQLIDSEGDDTEFMTENLTVQPERLGVLASHHDNAAVDASS



GVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAGV



DLAKSLRIAAKIYSEADEAWRKAIDGLFTELMSTQRPRHSGIRAVGPY



AWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDG



RTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIG



EHLSVEIAVAVKLMDFALLPPEVNSARMYTGPGAGSLLAAAGGWDS



LAAELATTAEAYGSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTT



AEKTQQTAIQARAAALAFEQAYAMTLPPPVVAANRIQLLALIATNFF



GQNTAAIAATEAQYAEMWAQDAAAMYGYATASAAAALLTPFSPPR



QTTNPAGLTAQAAAVSQATDPLSLLIETVTQALQALTIPSFIPEDFTFL



DAIFAGYATVGVTQDVESFVAGTIGAESNLGLLNVGDENPAEVTPGD



FGIGELVSATSPGGGVSASGAGGAASVGNTVLASVGRANSIGQLSVP



PSWAAPSTRPVSALSPAGLTTLPGTDVAEHGMPGVPGVPVAAGRAS



GVLPRYGVRLTVMAHPPAAG 



(SEQ ID NO: 55)





J
atggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggccctggggcaggatcgc



tgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacagccgaggcatatggatc



ggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgatggcggtgacggccgct



ccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatccaagccagggcggcag



cgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggccaaccggatacagct



gctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggccaccgaggcacagtac



gccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcagcggctgcggccctgc



tgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgctcaggccgccgcggtcag



ccaggccaccgacccactgtcgctgctgattgagacggtgacccaagcgctgcaagcgctgacgattccg



agcttcatccctgaggacttcaccttccttgacgccatattcgctggatatgccacggtaggtgtgacgc



aggatgtcgagtcctttgttgccgggaccatcggggccgagagcaacctaggccttttgaacgtcggcga



cgagaatcccgcggaggtgacaccgggcgactttgggatcggcgagttggtttccgcgaccagtcccggc



ggtggggtgtctgcgtcgggtgccggcggtgcggcgagcgtcggcaacacggtgctcgcgagtgtcggcc



gggcaaactcgattgggcaactatcggtcccaccgagctgggccgcgccctcgacgcgccctgtctcggc



attgtcgcccgccggcctgaccacactcccggggaccgacgtggccgagcacgggatgccaggtgtaccg



ggggtgccagtggcagcagggcgagcctccggcgtcctacctcgatacggggttcggctcacggtgatgg



cccacccacccgcggcaggggaattcatgtccacgcaacgaccgaggcactccggtattcgggctgttgg



cccctacgcatgggccggccgatgtggtcggataggcaggtggggggtgcaccaggaggcgatgatgaat



ctagcgatatggcacccgcgcaaggtgcaatccgccaccatctatcaggtgaccgatcgctcgcaccggg



cgcacagcacgggtgcctggtgacgagatcactagcaccgtgtccggttggttgtcggagttgggcaccc



aaagcccgttggccgatgagcttgcgcgtgcggtgcggatcggcgactggcccgctgcgtacgcaatcgg



tgagcacctgtccgttgagattgccgttgcggtcgagctcatgacggaaaacttgaccgtccagcccgag



cgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcgaagctgccg



ctggcctaggcgaatctgtggcgatcactcacggtccgtactgctcacagttcaacgacacgttaaatgt



gtacttgactgcccacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgccaaaagtctt



cgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggttgtttacca




agcttatggtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactcgccgggc




catcgtggagcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgtcagccga



cccgcggtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcgggaacac



gccgcgtctaccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttctggacacg



cgccctgactggctacgcgcagctcatcgactccgaaggagacgacacatag 



(SEQ ID NO: 56)






MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAY



GSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQAR



AAALAFEQAYAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEA



QYAEMWAQDAAAMYGYATASAAAALLTPFSPPRQTTNPAGLT



AQAAAVSQATDPLSLLIETVTQALQALTIPSFIPEDFTFLDAIFAGYAT



VGVTQDVESFVAGTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSA



TSPGGGVSASGAGGAASVGNTVLASVGRANSIGQLSVPPSWAAPSTR



PVSALSPAGLTTLPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVR



LTVMAHPPAAGEFMSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVH



QEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSG



WLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAVELMTE



NLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCS



QFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWR



KAIDGLFTKLMVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGE



LARDLPVSRPAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAAL



RTDLDRFWTRALTGYAQLIDSEGDDT 



(SEQ ID NO: 57)





K
atggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg



tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggc



tgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgt



cgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccg



aacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccgg



ggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaa



cgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccg



gtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccg



gcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgc



accccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggt



tggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcggattcatggatttcgcactgttaccac



cggaagtcaactccgcccggatgtacaccggccctggggcaggatcgctgttggctgccgcgggcggctg



ggattcgctggccgccgagttggccaccacagccgaggcatatggatcggtgctgtccggactggccgcc



ttgcattggcgtggaccggcagcggaatcgatggcggtgacggccgctccctatatcggttggctgtaca



cgaccgccgaaaagacacagcaaacagcgatccaagccagggcggcagcgctggccttcgagcaagcata



cgcaatgaccctgccgccaccggtggtagcggccaaccggatacagctgctagcactgatcgcgacgaac



ttcttcggccagaacactgcggcgatcgcggccaccgaggcacagtacgccgagatgtgggcccaggacg



ccgccgcgatgtacggttacgccaccgcctcagcggctgcggccctgctgacaccgttctccccgccgcg



gcagaccaccaacccggccggcctgaccgaattcatgtccacgcaacgaccgaggcactccggtattcgg



gctgttggcccctacgcatgggccggccgatgtggtcggataggcaggtggggggtgcaccaggaggcga



tgatgaatctagcgatatggcacccgcgcaaggtgcaatccgccaccatctatcaggtgaccgatcgctc



gcacgacgggcgcacagcacgggtgcctggtgacgagatcactagcaccgtgtccggttggttgtcggag



ttgggcacccaaagcccgttggccgatgagcttgcgcgtgcggtgcggatcggcgactggcccgctgcgt



acgcaatcggtgagcacctgtccgttgagattgccgttgcggtcgagctcgtgtccacttacagatcacc



ggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcgtggagcggctggcgcacggcccg



ctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgcggtgtcacagcacctcaaagtgc



tcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgcgtctaccagctcgacccgacagg



ccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccctgactggctacgcgcagctcatc



gactccgaaggagacgacacaaagcttatgacggaaaacttgaccgtccagcccgagcgtctcggtgtac



tggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcgaagctgccgctggcctaggcga



atctgtggcgatcactcacggtccgtactgctcacagttcaacgacacgttaaatgtgtacttgactgcc



cacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgccaaaagtcttcgaattgcggcga



agatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggttgtttacctga 



(SEQ ID NO: 58)






MACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAG



VQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMT



DTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNV



AGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLP



PNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGARGFMDFALLPPEVNSARMYTGPGAGSLL



AAAGGWDSLAAELATTAEAYGSVLSGLAALHWRGPAAESMAVTAA



PYIGWLYTTAEKTQQTAIQARAAALAFEQAYAMTLPPPVVAANRIQL



LALIATNFFGQNTAAIAATEAQYAEMWAQDAAAMYGYATASAAAA



LLTPFSPPRQTTNPAGLTEFMSTQRPRHSGIRAVGPYAWAGRCGRIGR



WGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITS



TVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAVE



LVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSRPA



VSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWTRA



LTGYAQLIDSEGDDTKLMTENLTVQPERLGVLASHHDNAAVDASSG



VEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAGVD



LAKSLRIAAKIYSEADEAWRKAIDGLFT 



(SEQ ID NO: 59)





L
atggtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcg



tggagcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgc



ggtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgc



gtctaccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccc



tgactggctacgcgcagctcatcgactccgaaggagacgacacaggatccatgacggaaaacttgaccgt



ccagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtc



gaagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgtactgctcacagttcaacgaca



cgttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgc



caaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgcaaggctatcgacggg



ttgtttaccgaattcatggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggcc



ctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacagc



cgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgatg



gcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatcc



aagccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggc



caaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggcc



accgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcag



cggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgagct




catgtccacgcaacgaccgaggcactccggtattcgggctgttggcccctacgcatgggccggccgatgt




ggtcggataggcaggtggggggtgcaccaggaggcgatgatgaatctagcgatatggcacccgcgcaagg



tgcaatccgccaccatctatcaggtgaccgatcgctcgcacgacgggcgcacagcacgggtgcctggtga



cgagatcactagcaccgtgtccggttggttgtcggagttgggcacccaaagcccgttggccgatgagctt



gcgcgtgcggtgcggatcggcgactggcccgctgcgtacgcaatcggtgagcacctgtccgttgagattg



ccgttgcggtcaagcttgcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccac



gatgaaatcgcgggtgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtat



cccgcggccggcgtgcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatct



cgctggatggtcacgacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaact



cgcgatgaccgacacggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgcta



ccggtcgtcagcgccaagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccc



caatgtcgcggggctgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcg



acggccccgatcgtcgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggc



tgccgctgccgccgaacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaaga



cccgggggttccggggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgt



ttgcccgtcgccaacgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggta



ccgaggtgcccccggtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactg



ggcgatcaacaccggcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggc



gggctgcggtatgcaccccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgaccc



gactgcgtcaaggttggggcgcctggccggtatgtgctgcacgagcgggtgcgcgctga 



(SEQ ID NO: 60)






MVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSRP



AVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWTR



ALTGYAQLIDSEGDDTGSMTENLTVQPERLGVLASHHDNAAVDASS



GVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAGV



DLAKSLRIAAKIYSEADEAWRKAIDGLFTEFMDFALLPPEVNSARMY



TGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLAALHWRGPA



AESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQAYAMTLPP



PVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAAMY



GYATASAAAALLTPFSPPRQTTNPAGLTELMSTQRPRHSGIRAVGPY



AWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDG



RTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIG



EHLSVEIAVAVKLACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSV



DDRDDLYPAAGVQVHDADTIVLRRSRPLQISLDGHDAKQVWTTAST



VDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQLNDGG



LVRTVHLPAPNVAGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTR



NRIKKVTERLPLPPNARRVEDPEMNMSREVVEDPGVPGTQDVTFAV



AEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWD



AIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANGGLRYAPRADLA



TREEQIAVAEVTRLRQGWGAWPVCAARAGAR 



(SEQ ID NO: 61)





M
atgacggaaaacttgaccgtccagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcgg



tcgatgcctcctcgggcgtcgaagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgta



ctgctcacagttcaacgacacgttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcat



acggccggtgtcgatctcgccaaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgt



ggcgcaaggctatcgacgggttgtttaccggattcatgtccacgcaacgaccgaggcactccggtattcg



ggctgttggcccctacgcatgggccggccgatgtggtcggataggcaggtggggggtgcaccaggaggcg



atgatgaatctagcgatatggcacccgcgcaaggtgcaatccgccaccatctatcaggtgaccgatcgct



cgcacgacgggcgcacagcacgggtgcctggtgacgagatcactagcaccgtgtccggttggttgtcgga



gttgggcacccaaagcccgttggccgatgagcttgcgcgtgcggtgcggatcggcgactggcccgctgcg



tacgcaatcggtgagcacctgtccgttgagattgccgttgcggtcgaattcgcatgcaaaacggtgacgt



tgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcgggtgatcgacatcgtcgaagagaa



cgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgtgcaggtccatgacgccgacacc



atcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcacgacgctaagcaggtgtggacga



ccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgacacggcgccggccgcggcttctcg



cgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgccaagacggtgcagctcaacgac



ggcgggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggctgctgagtgcggccggcgtgc



cgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgtcgaaggcatgcagatccaggt



gacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccgaacgcgcgtcgtgtcgaggac



ccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccggggacccaggatgtgacgttcg



cggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaacgtcgtggtgaccccggccca



cgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccggtgatcgacggaagcatctgg



gacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccggcaacgggtattacggtggtg



tgcagtttgaccagggcacctgggaggccaacgggggctgcggtatgcaccccgcgctgacctcgccacc



cgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggttggggcgcctggccggtatgtg



ctgcacgagcgggtgcgcgcgagctcatggatttcgcactgttaccaccggaagtcaactccgcccggat



gtacaccggccctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttg



gccaccacagccgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccggcag



cggaatcgatggcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacacagca



aacagcgatccaagccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccg



gtggtagcggccaaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactgcgg



cgatcgcggccaccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgc



caccgcctcagcggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggc



ctgaccaagcttatggtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactc



gccgggccatcgtggagcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgt



cagccgacccgcggtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcg



ggaacacgccgcgtctaccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttct



ggacacgcgccctgactggctacgcgcagctcatcgactccgaaggagacgacacataa 



(SEQ ID NO: 62)






MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGP



YCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEA



WRKAIDGLFTGFMSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQE



AMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWL



SELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAVEFACKTV



TLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAGVQVHDA



DTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMTDTAPAA



ASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNVAGLLSA



AGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLPPNARR



VEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPVANVV



VTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAINTGNG



YYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTRLRQGW



GAWPVCAARAGARELMDFALLPPEVNSARMYTGPGAGSLLAAAGG



WDSLAAELATTAEAYGSVLSGLAALHWRGPAAESMAVTAAPYIGW



LYTTAEKTQQTAIQARAAALAFEQAYAMTLPPPVVAANRIQLLALIA



TNFFGQNTAAIAATEAQYAEMWAQDAAAMYGYATASAAAALLTPF



SPPRQTTNPAGLTKLMVSTYRSPDRAWQALADGTRRAIVERLAHGPL



AVGELARDLPVSRPAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTG



LAALRTDLDRFWTRALTGYAQLIDSEGDDT 



(SEQ ID NO: 63)





N
atggcattttcccggccgggcttgccggtggagtacctgcaggtgccgtcgccgtcgatgggccgtgaca



tcaaggtccaattccaaagtggtggtgccaactcgcccgccctgtacctgctcgacggcctgcgcgcgca



ggacgacttcagcggctgggacatcaacaccccggcgttcgagtggtacgaccagtcgggcctgtcggtg



gtcatgccggtgggtggccagtcaagcttctactccgactggtaccagcccgcctgcggcaaggccggtt



gccagacttacaagtgggagaccttcctgaccagcgagctgccggggtggctgcaggccaacaggcacgt



caagcccaccggaagcgccgtcgtcggtctttcgatggctgcttcttcggcgctgacgctggcgatctat



cacccccagcagttcgtctacgcgggagcgatgtcgggcctgttggacccctcccaggcgatgggtccca



ccctgatcggcctggcgatgggtgacgctggcggctacaaggcctccgacatgtggggcccgaaggagga



cccggcgtggcagcgcaacgacccgctgttgaacgtcgggaagctgatcgccaacaacacccgcgtctgg



gtgtactgcggcaacggcaagccgtcggatctgggtggcaacaacctgccggccaagttcctcgagggct



tcgtgcggaccagcaacatcaagttccaagacgcctacaacgccggtggcggccacaacggcgtgttcga



cttcccggacagcggtacgcacagctgggagtactggggcgcgcagctcaacgctatgaagcccgacctg



caacgggcactgggtgccacgcccaacaccgggcccgcgccccagggcgccatgttctcccggccggggc



tgccggtcgagtacctgcaggtgccgtcgccgtcgatgggccgcgacatcaaggttcagttccagagcgg



tgggaacaactcacctgcggtttatctgctcgacggcctgcgcgcccaagacgactacaacggctgggat



atcaacaccccggcgttcgagtggtactaccagtcgggactgtcgatagtcatgccggtcggcgggcagt



ccagcttctacagcgactggtacagcccggcctgcggtaaggctggctgccagacttacaagtgggaaac



cttcctgaccagcgagctgccgcaatggttgtccgccaacagggccgtgaagcccaccggcagcgctgca



atcggcttgtcgatggccggctcgtcggcaatgatcttggccgcctaccacccccagcagttcatctacg



ccggctcgctgtcggccctgctggacccctctcaggggatggggcctagcctgatcggcctcgcgatggg



tgacgccggcggttacaaggccgcagacatgtggggtccctcgagtgacccggcatgggagcgcaacgac



cctacgcagcagatccccaagctggtcgcaaacaacacccggctatgggtttattgcgggaacggcaccc



cgaacgagttgggcggtgccaacatacccgccgagttcttggagaacttcgttcgtagcagcaacctgaa



gttccaggatgcgtacaacgccgcgggcgggcacaacgccgtgttcaacttcccgcccaacggcacgcac



agctgggagtactggggcgctcagctcaacgccatgaagggtgacctgcagagttcgttaggcgccggca



tgcgtgctaccgttgggcttgtggaggcaatcggaatccgagaactaagacagcacgcatcgcgatacct



cgcccgggttgaagccggcgaggaacttggcgtcaccaacaaaggaagacttgtggcccgactcatcccg



gtgcaggccgcggagcgttctcgcgaagccctgattgaatcaggtgtcctgattccggctcgtcgtccac



aaaaccttctcgacgtcaccgccgaaccggcgcgcggccgcaagcgcaccctgtccgatgttctcaacga



aatgcgcgacgagcagtga 



(SEQ ID NO: 64)






MAFSRPGLPVEYLQVPSPSMGRDIKVQFQSGGANSPALYLLDGLRAQ



DDFSGWDINTPAFEWYDQSGLSVVMPVGGQSSFYSDWYQPACGKA



GCQTYKWETFLTSELPGWLQANRHVKPTGSAVVGLSMAASSALTLA



IYHPQQFVYAGAMSGLLDPSQAMGPTLIGLAMGDAGGYKASDMWG



PKEDPAWQRNDPLLNVGKLIANNTRVWVYCGNGKPSDLGGNNLPA



KFLEGFVRTSNIKFQDAYNAGGGHNGVFDFPDSGTHSWEYWGAQLN



AMKPDLQRALGATPNTGPAPQGAFSRPGLPVEYLQVPSPSMGRDIKV



QFQSGGNNSPAVYLLDGLRAQDDYNGWDINTPAFEWYYQSGLSIVM



PVGGQSSFYSDWYSPACGKAGCQTYKWETFLTSELPQWLSANRAVK



PTGSAAIGLSMAGSSAMILAAYHPQQFIYAGSLSALLDPSQGMGPSLI



GLAMGDAGGYKAADMWGPSSDPAWERNDPTQQIPKLVANNTRLW



VYCGNGTPNELGGANIPAEFLENFVRSSNLKFQDAYNAAGGHNAVF



NFPPNGTHSWEYWGAQLNAMKGDLQSSLGAGAAARATVGLVEAIGI



RELRQHASRYLARVEAGEELGVTNKGRLVARLIPVQAAERSREALIE



SGVLIPARRPQNLLDVTAEPARGRKRTLSDVLNEMRDEQ 



(SEQ ID NO: 65)





O
atgaccaccgcacgcgacatcatgaacgcaggtgtgacctgtgttggcgaacacgagacgctaaccgctg



ccgctcaatacatgcgtgagcacgacatcggcgcgttgccgatctgcggggacgacgaccggctgcacgg



catgctcaccgaccgcgacattgtgatcaaaggcctggctgcgggcctagacccgaataccgccacggct



ggcgagttggcccgggacagcatctactacgtcgatgcgaacgcaagcatccaggagatgctcaacgtca



tggaagaacatcaggtccgccgtgttccggtcatctcagagcaccgcttggtcggaatcgtcaccgaagc



cgacatcgcccgacacctgcccgagcacgccattgtgcagttcgtcaaggcaatctgctcgcccatggcc



ctcgccagcatgatcgccacaacccgcgatcgtgaaggagccaccatgatcacgtttaggctgcgcttgc



cgtgccggacgatactgcgggtgttcagccgcaatccgctggtgcgtgggacggatcgactcgaggcggt



cgtcatgctgctggccgtcacggtctcgctgctgactatcccgttcgccgccgcggccggcaccgcagtc



caggattcccgcagccacgtctatgcccaccaggcccagacccgccatcccgcaaccgcgaccgtgatcg



atcacgagggggtgatcgacagcaacacgaccgccacgtcagcgccgccgcgcacgaagatcaccgtgcc



tgcccgatgggtcgtgaacggaatagaacgcagcggtgaggtcaacgcgaagccgggaaccaaatccggt



gaccgcgtcggcatttgggtcgacagtgccggtcagctggtcgatgaaccagctccgccggcccgtgcca



ttgcggatgcggccctggccgccttgggactctggttgagcgtcgccgcggttgcgggcgccctgctggc



gctcactcgggcgattctgatccgcgttcgcaacgccagttggcaacacgacatcgacagcctgttctgc



acgcagcggtga 



(SEQ ID NO: 66)






MTTARDIMNAGVTCVGEHETLTAAAQYMREHDIGALPICGDDDRLH



GMLTDRDIVIKGLAAGLDPNTATAGELARDSIYYVDANASIQEMLNV



MEEHQVRRVPVISEHRLVGIVTEADIARHLPEHAIVQFVKAICSPMAL



ASMIATTRDREGATMITFRLRLPCRTILRVFSRNPLVRGTDRLEAVVM



LLAVTVSLLTIPFAAAAGTAVQDSRSHVYAHQAQTRHPATATVIDHE



GVIDSNTTATSAPPRTKITVPARWVVNGIERSGEVNAKPGTKSGDRV



GIWVDSAGQLVDEPAPPARAIADAALAALGLWLSVAAVAGALLALT



RAILIRVRNASWQHDIDSLFCTQR 



(SEQ ID NO: 67)





P
atggcgtcagggaggcatcggaaaccaactacaagcaatgtatctgttgccaagattgctttcaccggcg



cagttcttggaggtggcggaattgccatggctgcccaggcaacagccgctacagatggagagtgggatca



ggtggctcgatgtgagtctggtggcaactggtctatcaacactgggaacgggtatcttggcggcttgcaa



tttactcagagcacttgggctgcccacggagggggtgaatttgctcctagcgcgcagctggcctcccgcg



agcagcagatcgctgtgggagagagggtgttggccacacagggaagaggtgcctggcctgtctgtggccg



cggactcagtaatgctacccctagggaggtgctgcccgcctcagccgctatggacgctccactggatgct



gccgccgtgaatggcgagccagctccgctggcacccccacctgcagaccccgctcccccagtcgagctgg



cggcaaacgacctgcccgcacctctcggagaaccacttcctgcagcgcctgccgatccagctccacctgc



tgatttggctccccccgctcccgccgatgtagcccctccggtcgagttggctgtgaatgacctgccggca



cctctgggcgagcccctcccagccgctccggccgaccctgcccctcctgctgatctggcaccacccgctc



ctgccgacctcgccccacccgccccagcagacctggctccaccagcgcctgcggatcttgccccgcctgt



tgagctggctgtcaacgatcttcctgcgcctcttggagagcccctgcccgctgctccagccgaactcgca



ccaccggcagatctggctcccgcctctgccgatcttgcacctcccgcaccggcggacttggcacctccag



caccagcagaactggctccccctgcgccggctgacctggcccctccagcagccgttaatgagcaaaccgc



accaggggaccagccggctacggcaccaggtggaccggtggggctggccaccgacctggagctgcctgag



ccggatccccaaccagctgatgctcccccacctggcgacgtaactgaggccccagctgaaacgccccagg



tcagtaacatcgcttacacaaagaaactgtggcaggcaattagggctcaggacgtgtgtgggaacgacgc



cctggacagcttggcccaaccgtacgtgatcggtatgcaccccctccccgctgatcatggtcgcagtcgc



tgtaaccgccaccccatttcacctctcagccttattgggaatgcgtctgctacaagtggcgacatgtcta



gtatgacaaggattgctaagcccctcatcaaaagtgcgatggctgccggtctggtaacagcatccatgag



cttgtccaccgcagtggctcacgctgggccttccccgaactgggatgccgtcgcccagtgcgagtcaggc



ggcaattgggccgcaaataccggtaacggtaagtatggaggactgcagtttaaacctgcaacttgggccg



cctttggaggagtgggtaatcctgcagctgcttctagagaacagcagattgccgtggctaaccgcgttct



cgcggagcagggtctggacgcctggccgacctgtggcgccgcatcaggtttgccgatcgcgttgtggtca



aagcccgcccagggaatcaagcagattatcaatgagatcatctgggccggaatacaggcaagcatcccta



gaatgactcctgggcttctgacaaccgctggcgctgggaggcccagggataggtgcgcccggatcgtttg



taccgtattcatagagaccgccgtggtcgcgacaatgttcgtggctctcttgggcttgagcaccattagc



tctaaggccgatgatatagattgggatgctattgctcaatgcgaatccggtgggaactgggccgctaata



ccggaaatgggctctacggcggactgcagatcagccaggctacatgggatagcaacggaggagtcgggtc



ccctgccgctgcatccccgcaacagcaaatcgaggtggccgataacatcatgaaaacccagggacccgga



gcctggcccaaatgtagctcatgtagccaaggagatgcgcccctcggttcactgacgcacatcctcacct



tcctcgccgcggaaaccggagggtgctctggcagccgggacgactga 



(SEQ ID NO: 68)






MSGRHRKPTTSNVSVAKIAFTGAVLGGGGIAMAAQATAATDGEWD



QVARCESGGNWSINTGNGYLGGLQFTQSTWAAHGGGEFAPSAQLAS



REQQIAVGERVLATQGRGAWPVCGRGLSNATPREVLPASAAMDAPL



DAAAVNGEPAPLAPPPADPAPPVELAANDLPAPLGEPLPAAPADPAPP



ADLAPPAPADVAPPVELAVNDLPAPLGEPLPAAPADPAPPADLAPPAP



ADLAPPAPADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPAELAPP



ADLAPASADLAPPAPADLAPPAPAELAPPAPADLAPPAAVNEQTAPG



DQPATAPGGPVGLATDLELPEPDPQPADAPPPGDVTEAPAETPQVSNI



AYTKKLWQAIRAQDVCGNDALDSLAQPYVIGVHPLPADHGRSRCNR



HPISPLSLIGNASATSGDMSSMTRIAKPLIKSAMAAGLVTASMSLSTA



VAHAGPSPNWDAVAQCESGGNWAANTGNGKYGGLQFKPATWAAF



GGVGNPAAASREQQIAVANRVLAEQGLDAWPTCGAASGLPIALWSK



PAQGIKQIINEIIWAGIQASIPRMTPGLLTTAGAGRPRDRCARIVCTVFI



ETAVVATMFVALLGLSTISSKADDIDWDAIAQCESGGNWAANTGNG



LYGGLQISQATWDSNGGVGSPAAASPQQQIEVADNIMKTQGPGAWP



KCSSCSQGDAPLGSLTHILTFLAAETGGCSGSRDD 



(SEQ ID NO: 69)





Q
atggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcggg



tgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggcgt



gcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtcac



gacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccgaca



cggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagcgc



caagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggggc



tgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatcgt



cgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgccg



aacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttccgg



ggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgccaa



cgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccccg



gtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacaccg



gcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtatgc



accccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaaggt



tggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcggatccatgtccacgcaacgaccgaggc



actccggtattcgggctgttggcccctacgcatgggccggccgatgtggtcggataggcaggtggggggt



gcaccaggaggcgatgatgaatctagcgatatggcacccgcgcaaggtgcaatccgccaccatctatcag



gtgaccgatcgctcgcacgacgggcgcacagcacgggtgcctggtgacgagatcactagcaccgtgtccg



gttggttgtcggagttgggcacccaaagcccgttggccgatgagcttgcgcgtgcggtgcggatcggcga



ctggcccgctgcgtacgcaatcggtgagcacctgtccgttgagattgccgttgcggtcgaattcatggat



ttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggccctggggcaggatcgctgttgg



ctgccgcgggcggctgggattcgctggccgccgagttggccaccacagccgaggcatatggatcggtgct



gtccggactggccgccttgcattggcgtggaccggcagcggaatcgatggcggtgacggccgctccctat



atcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatccaagccagggcggcagcgctgg



ccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcggccaaccggatacagctgctagc



actgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggccaccgaggcacagtacgccgag



atgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctcagcggctgcggccctgctgacac



cgttctccccgccgcggcagaccaccaacccggccggcctgaccgagctcgtgtccacttacagatcacc



ggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcgtggagcggctggcgcacggcccg



ctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgcggtgtcacagcacctcaaagtgc



tcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgcgtctaccagctcgacccgacagg



ccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccctgactggctacgcgcagctcatc



gactccgaaggagacgacacaaagcttatgacggaaaacttgaccgtccagcccgagcgtctcggtgtac



tggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcgaagctgccgctggcctaggcga



atctgtggcgatcactcacggtccgtactgctcacagttcaacgacacgttaaatgtgtacttgactgcc



cacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgccaaaagtcttcgaattgcggcga



agatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggttgtttacctga 



(SEQ ID NO: 70)






MACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAG



VQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVDEALAQLAMT



DTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNV



AGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLP



PNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPV



ANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAI



NTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREEQIAVAEVTR



LRQGWGAWPVCAARAGARGSMSTQRPRHSGIRAVGPYAWAGRCG



RIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPG



DEITSTVSGWLSELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIA



VAVEFMDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATT



AEAYGSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQT



AIQARAAALAFEQAYAMTLPPPVVAANRIQLLALIATNFFGQNTAAI



AATEAQYAEMWAQDAAAMYGYATASAAAALLTPFSPPRQTTNPAG



LTELVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVS



RPAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFW



TRALTGYAQLIDSEGDDTKLMTENLTVQPERLGVLASHHDNAAVDA



SSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAG



VDLAKSLRIAAKIYSEADEAWRKAIDGLFT 



(SEQ ID NO: 71)





R
atgtccacgcaacgaccgaggcactccggtattcgggctgttggcccctacgcatgggccggccgatgtg



gtcggataggcaggtggggggtgcaccaggaggcgatgatgaatctagcgatatggcacccgcgcaaggt



gcaatccgccaccatctatcaggtgaccgatcgctcgcacgacgggcgcacagcacgggtgcctggtgac



gagatcactagcaccgtgtccggttggttgtcggagttgggcacccaaagcccgttggccgatgagcttg



cgcgtgcggtgcggatcggcgactggcccgctgcgtacgcaatcggtgagcacctgtccgttgagattgc



cgttgcggtcggattcatggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggc



cctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacag



ccgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgat



ggcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatc



caagccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcgg



ccaaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggc



caccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctca



gcggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgaat




tcatggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccacgatgaaatcgcg




ggtgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtatcccgcggccggc



gtgcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatctcgctggatggtc



acgacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaactcgcgatgaccga



cacggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgctaccggtcgtcagc



gccaagacggtgcagctcaacgacgggggttggtgcgcacggtgcacttgccggcccccaatgtcgcggg



gctgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgcgacggccccgatc



gtcgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcggctgccgctgccgc



cgaacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaagacccgggggttcc



ggggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccgtttgcccgtcgcc



aacgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggtaccgaggtgcccc



cggtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaactgggcgatcaacac



cggcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacggcgggctgcggtat



gcaccccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacccgactgcgtcaag



gttggggcgcctggccggtatgtgctgcacgagcgggtgcgcgcgagctcgtgtccacttacagatcacc



ggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcgtggagcggctggcgcacggcccg



ctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgcggtgtcacagcacctcaaagtgc



tcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgcgtctaccagctcgacccgacagg



ccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccctgactggctacgcgcagctcatc



gactccgaaggagacgacacaaagcttatgacggaaaacttgaccgtccagcccgagcgtctcggtgtac



tggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcgaagctgccgctggcctaggcga



atctgtggcgatcactcacggtccgtactgctcacagttcaacgacacgttaaatgtgtacttgactgcc



cacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgccaaaagtcttcgaattgcggcga



agatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggttgtttacctga 



(SEQ ID NO: 72)






MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPR



KVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADE



LARAVRIGDWPAAYAIGEHLSVEIAVAVGFMDFALLPPEVNSARMYT



GPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLAALHWRGPAA



ESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQAYAMTLPPP



VVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAAMYG



YATASAAAALLTPFSPPRQTTNPAGLTEFMACKTVTLTVDGTAMRVT



TMKSRVIDIVEENGFSVDDRDDLYPAAGVQVHDADTIVLRRSRPLQIS



LDGHDAKQVWTTASTVDEALAQLAMTDTAPAAASRASRVPLSGMA



LPVVSAKTVQLNDGGLVRTVHLPAPNVAGLLSAAGVPLLQSDHVVP



AATAPIVEGMQIQVTRNRIKKVTERLPLPPNARRVEDPEMNMSREVV



EDPGVPGTQDVTFAVAEVNGVETGRLPVANVVVTPAHEAVVRVGT



KPGTEVPPVIDGSIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTW



EANGGLRYAPRADLATREEQIAVAEVTRLRQGWGAWPVCAARAGA



RELVSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSR



PAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWT



RALTGYAQLIDSEGDDTKLMTENLTVQPERLGVLASHHDNAAVDAS



SGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAGV



DLAKSLRIAAKIYSEADEAWRKAIDGLFT 



(SEQ ID NO: 73)





S
atgtccacgcaacgaccgaggcactccggtattcgggctgttggcccctacgcatgggccggccgatgtg



gtcggataggcaggtggggggtgcaccaggaggcgatgatgaatctagcgatatggcacccgcgcaaggt



gcaatccgccaccatctatcaggtgaccgatcgctcgcacgacgggcgcacagcacgggtgcctggtgac



gagatcactagcaccgtgtccggttggttgtcggagttgggcacccaaagcccgttggccgatgagcttg



cgcgtgcggtgcggatcggcgactggcccgctgcgtacgcaatcggtgagcacctgtccgttgagattgc



cgttgcggtcggattcatggatttcgcactgttaccaccggaagtcaactccgcccggatgtacaccggc



cctggggcaggatcgctgttggctgccgcgggcggctgggattcgctggccgccgagttggccaccacag



ccgaggcatatggatcggtgctgtccggactggccgccttgcattggcgtggaccggcagcggaatcgat



ggcggtgacggccgctccctatatcggttggctgtacacgaccgccgaaaagacacagcaaacagcgatc



caagccagggcggcagcgctggccttcgagcaagcatacgcaatgaccctgccgccaccggtggtagcgg



ccaaccggatacagctgctagcactgatcgcgacgaacttcttcggccagaacactgcggcgatcgcggc



caccgaggcacagtacgccgagatgtgggcccaggacgccgccgcgatgtacggttacgccaccgcctca



gcggctgcggccctgctgacaccgttctccccgccgcggcagaccaccaacccggccggcctgaccgaat




tcgtgtccacttacagatcaccggatcgcgcttggcaggcgctggcggacggcactcgccgggccatcgt




ggagcggctggcgcacggcccgctggccgtcggcgagttggcccgcgacctgcccgtcagccgacccgcg



gtgtcacagcacctcaaagtgctcaagaccgccaggctggtgtgcgaccgccccgcgggaacacgccgcg



tctaccagctcgacccgacaggccttgcggcattgcgcaccgacctcgaccggttctggacacgcgccct



gactggctacgcgcagctcatcgactccgaaggagacgacacagagctcatgacggaaaacttgaccgtc



cagcccgagcgtctcggtgtactggcgtcgcaccatgacaacgcggcggtcgatgcctcctcgggcgtcg



aagctgccgctggcctaggcgaatctgtggcgatcactcacggtccgtactgctcacagttcaacgacac



gttaaatgtgtacttgactgcccacaatgccctgggctcgtccttgcatacggccggtgtcgatctcgcc



aaaagtcttcgaattgcggcgaagatatatagcgaggccgacgaagcgtggcgcaaggctatcgacgggt



tgtttaccaagcttatggcatgcaaaacggtgacgttgaccgtcgacggaaccgcgatgcgggtgaccac



gatgaaatcgcgggtgatcgacatcgtcgaagagaacgggttctcagtcgacgaccgcgacgacctgtat



cccgcggccggcgtgcaggtccatgacgccgacaccatcgtgctgcggcgtagccgtccgctgcagatct



cgctggatggtcacgacgctaagcaggtgtggacgaccgcgtcgacggtggacgaggcgctggcccaact



cgcgatgaccgacacggcgccggccgcggcttctcgcgccagccgcgtcccgctgtccgggatggcgcta



ccggtcgtcagcgccaagacggtgcagctcaacgacggcgggttggtgcgcacggtgcacttgccggccc



ccaatgtcgcggggctgctgagtgcggccggcgtgccgctgttgcaaagcgaccacgtggtgcccgccgc



gacggccccgatcgtcgaaggcatgcagatccaggtgacccgcaatcggatcaagaaggtcaccgagcgg



ctgccgctgccgccgaacgcgcgtcgtgtcgaggacccggagatgaacatgagccgggaggtcgtcgaag



acccgggggttccggggacccaggatgtgacgttcgcggtagctgaggtcaacggcgtcgagaccggccg



tttgcccgtcgccaacgtcgtggtgaccccggcccacgaagccgtggtgcgggtgggcaccaagcccggt



accgaggtgcccccggtgatcgacggaagcatctgggacgcgatcgccggctgtgaggccggtggcaact



gggcgatcaacaccggcaacgggtattacggtggtgtgcagtttgaccagggcacctgggaggccaacgg



cgggctgcggtatgcaccccgcgctgacctcgccacccgcgaagagcagatcgccgttgccgaggtgacc



cgactgcgtcaaggttggggcgcctggccggtatgtgctgcacgagcgggtgcgcgctga 



(SEQ ID NO: 74)






MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPR



KVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADE



LARAVRIGDWPAAYAIGEHLSVEIAVAVGFMDFALLPPEVNSARMYT



GPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLAALHWRGPAA



ESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQAYAMTLPPP



VVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAAMYG



YATASAAAALLTPFSPPRQTTNPAGLTEFVSTYRSPDRAWQALADGT



RRAIVERLAHGPLAVGELARDLPVSRPAVSQHLKVLKTARLVCDRPA



GTRRVYQLDPTGLAALRTDLDRFWTRALTGYAQLIDSEGDDTELMT



ENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYC



SQFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWR



KAIDGLFTKLMACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDD



RDDLYPAAGVQVHDADTIVLRRSRPLQISLDGHDAKQVWTTASTVD



EALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVR



TVHLPAPNVAGLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRI



KKVTERLPLPPNARRVEDPEMNMSREVVEDPGVPGTQDVTFAVAEV



NGVETGRLPVANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAG



CEAGGNWAINTGNGYYGGVQFDQGTWEANGGLRYAPRADLATREE



QIAVAEVTRLRQGWGAWPVCAARAGAR 



(SEQ ID NO: 75)









Any Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, can have an amino acid sequence that is 100%, or from 70% to 99.9%, identical to the particular amino acid sequence listed in Tables 1 and 2. The amino acid sequence of any individual Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, can be at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the particular amino acid sequence listed in Tables 1 and 2. Identity or similarity with respect to an amino acid or nucleotide sequence is defined herein as the percentage of amino acid residues (or nucleotide residues as the case may be) in the particular Mtb antigen that are identical (i.e., same residue) with the amino acid or nucleotide sequence for the Mtb antigen shown in Tables 1 and 2, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Percent sequence identity can be determined by, for example, the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, Madison WI), using default settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489). Any amino acid number calculated as a % identity can be rounded up or down, as the case may be, to the closest whole number.


Any Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, can be fragments of the particular amino acid sequence listed in Table 1. The amino acid sequence of any individual Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, can be missing consecutive amino acids constituting at least 20%, at least 15%, at least 10%, at least 5%, at least 4%, at least 3%, at least 2%, or at least 1%, of the particular amino acid sequence listed in Table 1. The omitted consecutive amino acids may be from the C-terminus or N-terminus portion of the antigen. Alternately, the omitted consecutive amino acids may be from the internal portion of the antigen, thus retaining at least its C-terminus and N-terminus amino acids of the antigen.


Any Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, can have one or more amino acid additions, deletions, or substitutions compared to the particular amino acid sequence listed in Table 1. Any individual Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, can have at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, or at least twelve amino acid additions, deletions, or substitutions compared to the particular amino acid sequence listed in Table 1. Any individual Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, can have one, two, three, four, five, six, seven, eight, nine, ten, eleven, or twelve amino acid additions, deletions, or substitutions compared to the particular amino acid sequence listed in Table 1. The amino acid additions, deletions, or substitutions can take place at any amino acid position within the Mtb antigen.


Where a particular Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, comprises at least one or more substitutions, the substituted amino acid(s) can each be, independently, any naturally occurring amino acid or any non-naturally occurring amino acid. Thus, a particular Mtb antigen may comprise one or more amino acid substitutions that are naturally occurring amino acids and/or one or more amino acid substitutions that are non-naturally occurring amino acids. Individual amino acid substitutions are selected from any one of the following: 1) the set of amino acids with nonpolar sidechains, for example, Ala, Cys, Ile, Leu, Met, Phe, Pro, Val; 2) the set of amino acids with negatively charged side chains, for example, Asp, Glu; 3) the set of amino acids with positively charged sidechains, for example, Arg, His, Lys; and 4) the set of amino acids with uncharged polar sidechains, for example, Asn, Cys, Gln, Gly, His, Met, Phe, Ser, Thr, Trp, Tyr, to which are added Cys, Gly, Met and Phe. Substitutions of a member of one class with another member of the same class are contemplated herein. Naturally occurring amino acids include, for example, alanine (Ala), arginine (Arg), asparagine (Asn), aspartic acid (Asp), cysteine (Cys), glutamine (Gln), glutamic acid (Glu), glycine (Gly), histidine (His), isoleucine (Ile), leucine (Leu), lysine (Lys), methionine (Met), phenylalanine (Phe), proline (Pro), serine (Ser), threonine (Thr), tryptophan (Trp), tyrosine (Tyr), and valine (Val). Non-naturally occurring amino acids include, for example, norleucine, omithine, norvaline, homoserine, and other amino acid residue analogues such as those described in Ellman et al., Meth. Enzym., 1991, 202, 301-336. To generate such non-naturally occurring amino acid residues, the procedures of Noren et al., Science, 1989, 244, 182 and Ellman et al., supra, can be used. Briefly, these procedures involve chemically activating a suppressor tRNA with a non-naturally occurring amino acid residue followed by in vitro transcription and translation of the RNA.


The Mtb antigens, including any Mtb antigen within any of the fusion proteins described herein, which are modified as described herein retain their ability to elicit an immune response against Mycobacterium tuberculosis. That is, modification of a particular Mtb antigen, including any Mtb antigen within any of the fusion proteins described herein, will still allow the resultant Mtb antigen, or fusion protein comprising the same, to elicit an immune response against Mycobacterium tuberculosis.


The present disclosure also provides nucleic acid molecules encoding any of the fusion proteins described herein that comprise at least three Mycobacterium tuberculosis (Mtb) antigens. The nucleic acid molecules described herein and in Tables 1 and 2 are representative. The specific sequences recited in Table 1 are simply one example of a nucleic acid molecule that can encode a particular Mtb antigen within a fusion protein. One skilled in the art having knowledge of the genetic code can routinely prepare and design a plethora of nucleic acid molecules encoding the same Mtb antigen. The length and nucleotide content of any particular nucleic acid molecule is dictated by the desired amino acid sequence of the encoded Mtb antigen. The nucleic acid molecule sequences shown in Tables 1 and 2 are DNA, although RNA nucleic acid molecules are also contemplated.


The present disclosure also provides vectors encoding any of the Mtb antigens, including Mtb antigens within any of the fusion proteins described herein, including any of the modified versions described herein. The vector can be capable of expressing an Mtb antigen in the cell of a mammal in a quantity effective to elicit an immune response in the mammal. The vector can be recombinant. The vector can comprise heterologous nucleic acid encoding the antigen. The vector can be a plasmid. In some embodiments, the plasmid is a DNA plasmid, such as a pVAX backbone vector. The vector can be useful for transfecting cells with nucleic acid encoding an Mtb antigen, which the transformed host cell is cultured and maintained under conditions wherein expression of the antigen takes place.


In some embodiments, coding sequences can be optimized for stability and high levels of expression. In some instances, codons are selected to reduce secondary structure formation of the RNA such as that formed due to intramolecular bonding.


In some embodiments, the vectors can comprise regulatory elements for gene expression of the coding sequences of the nucleic acid. The regulatory elements can be a promoter, an enhancer an initiation codon, a stop codon, or a polyadenylation signal. In some embodiments, the vector can comprise heterologous nucleic acid encoding an Mtb antigen and can further comprise an initiation codon, which is upstream of the antigen coding sequence, and a stop codon, which is downstream of the antigen coding sequence. The initiation and termination codon are in frame with the antigen coding sequence.


The vector can also comprise a promoter that is operably linked to the antigen coding sequence. The promoter operably linked to the Mtb antigen coding sequence can be a promoter from simian virus 40 (SV40), a mouse mammary tumor virus (MMTV) promoter, a human immunodeficiency virus (HIV) promoter such as the bovine immunodeficiency virus (BIV) long terminal repeat (LTR) promoter, a Moloney virus promoter, an avian leukosis virus (ALV) promoter, a cytomegalovirus (CMV) promoter such as the CMV immediate early promoter, Epstein Barr virus (EBV) promoter, or a Rous sarcoma virus (RSV) promoter, or the like. The promoter can also be a promoter from a human gene such as human actin, human myosin, human hemoglobin, human muscle creatine, or human metalothionein. The promoter can also be a tissue specific promoter, such as a muscle or skin specific promoter, natural or synthetic. Representative examples of promoters include the bacteriophage T7 promoter, bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, mycobacterial Hsp60 promoter, SV40 late promoter, SV40 early promoter, RSV-LTR promoter, CMV IE promoter, SV40 early promoter or SV40 late promoter and the CMV IE promoter.


The vector can also comprise a polyadenylation signal, which can be downstream of the antigen coding sequence. The polyadenylation signal can be a SV40 polyadenylation signal, LTR polyadenylation signal, CMV polyadeylation signal, bovine growth hormone (bGH) polyadenylation signal, human growth hormone (hGH) polyadenylation signal, or human β-globin polyadenylation signal. The SV40 polyadenylation signal can be a polyadenylation signal from a pCEP4 vector (Invitrogen, San Diego, CA).


The vector can also comprise an enhancer upstream of the consensus BoNT-A, BoNT-B, BoNT-E, and BoNT-F antigen sequences. The enhancer can be necessary for DNA expression. The enhancer can be human actin, human myosin, human hemoglobin, human muscle creatine or a viral enhancer such as one from CMV, HA, RSV or EBV. Polynucleotide function enhancers are described in U.S. Pat. Nos. 5,593,972, 5,962,428, and WO94/016737.


The vector can also comprise a mammalian origin of replication in order to maintain the vector extrachromosomally and produce multiple copies of the vector in a cell. The vector can be pVAX1, pCEP4 or pREP4 from Invitrogen (San Diego, CA), which can comprise the Epstein Barr virus origin of replication and nuclear antigen EBNA-1 coding region, which can produce high copy episomal replication without integration. The vector can be pVAX1 or a pVax1 variant with changes such as the variant plasmid described herein. The variant pVax1 plasmid is a 2998 basepair variant of the backbone vector plasmid pVAX1 (Invitrogen, Carlsbad CA). The CMV promoter is located at bases 137-724. The T7 promoter/priming site is at bases 664-683. Multiple cloning sites are at bases 696-811. Bovine GH polyadenylation signal is at bases 829-1053. The Kanamycin resistance gene is at bases 1226-2020. The pUC origin is at bases 2320-2993.


The vector can also comprise a regulatory sequence, which can be well suited for gene expression in a mammalian or human cell into which the vector is administered. The consensus coding sequence can comprise a codon, which can allow more efficient transcription of the coding sequence in the host cell.


The vector can be pSE420 (Invitrogen, San Diego, Calif) or pET28b (EMD Millipore, Billerca, Mass.), which can be used for protein production in Escherichia coli (E. coli). The vector can also be pYES2 (Invitrogen, San Diego, Calif), which can be used for protein production in Saccharomyces cerevisiae strains of yeast. The vector can also be of the MAXBAC™ complete baculovirus expression system (Invitrogen, San Diego, Calif), which can be used for protein production in insect cells. The vector can also be pcDNA I or pcDNA3 (Invitrogen, San Diego, Calif.), which may be used for protein production in mammalian cells such as Chinese hamster ovary (CHO) cells. The vector can be expression vectors or systems to produce protein by routine techniques and readily available starting materials including Sambrook et al., Molecular Cloning and Laboratory Manual, Second Ed., Cold Spring Harbor (1989).


In some embodiments, the vector is a viral vector. Suitable viral vectors include, but are not limited to, an adenovirus vector, an adeno-associated virus vector, a poxvirus vector (such as, for example, vaccinia virus vector), a paramyxovirus vector, a fowlpox virus vector, an attenuated yellow fever vectors (such as, for example, YFV-17D), an alphavirus vector, a retrovirus vector (such as, for example, lentivirus vector), a Sendai virus vector, and cytomegalovirus (CMV) vector. Suitable adenovirus vectors include, but are not limited to, adenovirus 4, adenovirus 5, chimpanzee adenovirus 3, chimpanzee adenovirus 63, and chimpanzee adenovirus 68. A suitable vaccinia virus vector includes, but is not limited to, modified vaccinia Ankara (MVA). Suitable paramyxovirus vectors include, but are not limited to, modified parainfluenza virus (PIV2) and recombinant human parainfluenza virus (rHPIV2). Suitable CMV vectors include, but are not limited to, Rhesus Macaque CMV (RhCMV) vectors and Human CMV (HCMV) vectors. In some embodiments, the vector is present within a composition comprising a pharmaceutically acceptable carrier. One skilled in the art is readily familiar with numerous vectors, many of which are commercially available.


In some embodiments, the vector is a non-viral vector. In some embodiments, the non-viral vector is RNA, such as mRNA. In some embodiments, the mRNA is protamine-complexed mRNA, wherein the Mtb antigen or fusion protein is encoded by the mRNA, and the protamine complexes contribute a strong immunostimulatory signal. An exemplary mRNA vector platform is RNActive® (CureVac Inc).


The present disclosure also provides host cells comprising any of the nucleic acid molecules or vectors disclosed herein. The host cells can be used, for example, to express the Mtb antigens, or fragments of thereof. The Mtb antigens, or fragments thereof, can also be expressed in cells in vivo. The host cell that is transformed (for example, transfected) to produce the Mtb antigens, or fragments of thereof can be an immortalised mammalian cell line, such as those of lymphoid origin (for example, a myeloma, hybridoma, trioma or quadroma cell line). The host cell can also include normal lymphoid cells, such as B-cells, that have been immortalized by transformation with a virus (for example, the Epstein-Barr virus).


In some embodiments, the host cells include, but are not limited to: bacterial cells, such as E. coli, Caulobacter crescentus, Streptomyces species, and Salmonella typhimurium; yeast cells, such as Saccharomyces cerevisiae, Schizosaccharomyces pombe, Pichia pastoris, Pichia methanolica; insect cell lines, such as those from Spodoptera frugiperda (for example, Sf9 and Sf21 cell lines, and expresSF™ cells (Protein Sciences Corp., Meriden, CT, USA)), Drosophila S2 cells, and Trichoplusia in High Five® Cells (Invitrogen, Carlsbad, CA, USA); and mammalian cells, such as COS1 and COS7 cells, Chinese hamster ovary (CHO) cells, NS0 myeloma cells, NIH 3T3 cells, 293 cells, Procell92S, perC6, HEPG2 cells, HeLa cells, L cells, HeLa, MDCK, HEK293, WI38, murine ES cell lines (for example, from strains 129/SV, C57/BL6, DBA-1, 129/SVJ), K562, Jurkat cells, and BW5147. Other useful mammalian cell lines are well known and readily available from the American Type Culture Collection (“ATCC”) (Manassas, VA, USA) and the National Institute of General Medical Sciences (NIGMS) Human Genetic Cell Repository at the Coriell Cell Repositories (Camden, NJ, USA). In some embodiments, the cell is a recombinant BCG. These cell types are only representative and are not meant to be an exhaustive list.


Among other considerations, some of which are described above, a host cell strain may be chosen for its ability to process the expressed Mtb antigens, or fragment thereof, in the desired fashion. Post-translational modifications of the polypeptide include, but are not limited to, glycosylation, acetylation, carboxylation, phosphorylation, lipidation, and acylation, and it is an aspect of the present disclosure to provide Mtb antigens thereof with one or more of these post-translational modifications.


In some embodiments, the recombinant BCG has been genetically engineered to express a functional endosomalytic protein that is bioactive at pH values near neutrality (e.g. about pH 6-8 or about 6.5 to 7.5). The endosomalytic protein is active within Mycobacteria-containing endosomes, which typically have an internal pH near neutrality. The activity of the endosomalytic protein produced by the rBCG results in disruption of the endosome, permitting the rBCG to escape from the endosome and into the cytoplasm of the cell. In some embodiments, the endosomalytic protein that is introduced into the rBCG by genetic engineering is Perfringolysin O (PfoA) from Clostridium perfringens or a mutant thereof, such as PfoAG137Q, as described in WO 2007/058663.


In some embodiments, the Mycobacteria are attenuated, as exemplified by BCG. However, those of skill in the art will recognize that other attenuated and nonattenuated Mycobacteria exist which would also be suitable for use herein. Examples of additional types of Mycobacteria include, but are not limited to, M. tuberculosis strain CDC1551, M. tuberculosis strain Beijing, M. tuberculosis strain H37Ra (ATCC #: 25177), M. tuberculosis strain H37Rv (ATCC #: 25618), M. bovis (ATCC #: 19211 and 27291), M. fortuitum (ATCC #: 15073), M. smegmatis (ATCC #: 12051 and 12549), M. intracellulare (ATCC #: 35772 and 13209), M. kansasii(ATCC #: 21982 and 35775) M. avium(ATCC #: 19421 and 25291), M. gallinarum (ATCC #: 19711), M. vaccae (ATCC #: 15483 and 23024), M. leprae(ATCC #), M. marinarum (ATCC #: 11566 and 11567), and M. microtti (ATCC #: 11152).


Examples of attenuated Mycobacterium strains include, but are not restricted To, M. tuberculosis pantothenate auxotroph strain, M. tuberculosis rpoV mutant strain, M. tuberculosis leucine auxotroph strain, BCG Danish strain (ATCC #35733), BCG Japanese strain (ATCC #35737), BCG Chicago strain (ATCC #27289), BCG Copenhagen strain (ATCC #: 27290), BCG Pasteur strain (ATCC #: 35734), BCG Glaxo strain (ATCC #: 35741), BCG Connaught strain (ATCC #35745), BCG Montreal (ATCC #35746), BCG1331 strain, BCG Tokyo strain, BCG Moreau strain, BCG-Pasteur Aeras, and BCG Moscow strain.


In some embodiments, the cell comprising the one or more vector(s) is present within a composition comprising a pharmaceutically acceptable carrier.


In some embodiments, the Mtb antigen, or fragment thereof, is labeled with a detectable marker. Detectable markers include, but are not limited to, radioactive isotopes (such as P32 and S35), enzymes (such as horseradish peroxidase, chloramphenicol acetyltransferase (CAT), β-galactosidase (β-gal), and the like), fluorochromes, chromophores, colloidal gold, dyes, and biotin. The labeled Mtb antigens, or fragments thereof, can be used to carry out diagnostic procedures in a variety of cell or tissue types. For imaging procedures, in vitro or in vivo, the Mtb antigens can be labeled with additional agents, such as NMR contrasting agents, X-ray contrasting agents, or quantum dots. Methods for attaching a detectable agent to polypeptides are known in the art. The Mtb antigens can also be attached to an insoluble support (such as a bead, a glass or plastic slide, or the like).


In some embodiments, the Mtb antigens, or fragment thereof, can be conjugated to a therapeutic agent including, but not limited to, radioisotopes (such as 111In or 90Y), toxins (such as tetanus toxoid or ricin), toxoids, and chemotherapeutic agents.


In some embodiments, the Mtb antigens, or fragments thereof, can be conjugated to an imaging agent. Imaging agents include, for example, a labeling moiety (such as biotin, fluorescent moieties, radioactive moieties, histidine tag or other peptide tags) for easy isolation or detection.


The present disclosure also provides compositions comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens. In some embodiments, the at least two or three Mtb antigens are not present in a fusion protein. In some embodiments, the at least two or three Mtb antigens are in the form of a protein and not nucleic acid molecules encoding the Mtb antigens.


In some embodiments, the Mtb antigen is Rv1009, Rv3136, Rv3615c, Rv2628, Rv2034, Rv3136 N-terminus, Ag85A, Ag85B (also known as Rv1886c), Rv3407, Rv1733, Rv2626c, RpfA, RpfC, or RpfD. In some embodiments, the composition comprises at least two Mtb antigens. In some embodiments, the composition comprises Rv1733 and Rv2626c Mtb antigens. In some embodiments, the composition comprises at least three Mtb antigens. In some embodiments, the composition comprises Rv1009, Rv3615c, and Rv3136 Mtb antigens. In some embodiments, the composition comprises Rv1009, Rv2034, and Rv3136 Mtb antigens. In some embodiments, the composition comprises Ag85A, Ag85B, and Rv3407 Mtb antigens. In some embodiments, the composition comprises RpfA, RpfC, and RpfD Mtb antigens. In some embodiments, the composition comprises at least four Mtb antigens. In some embodiments, the composition comprises Rv1009, Rv2628, Rv3615c, and Rv3136 Mtb antigens. In some embodiments, the composition comprises Rv1009, Rv3615c, Rv2034, and Rv2628 Mtb antigens. In some embodiments, the composition comprises Rv2034, Rv3615c, Rv2628, and Rv3136 Mtb antigens. In some embodiments, the composition comprises at least five Mtb antigens. In some embodiments, the composition comprises Rv1009, Rv2034, Rv2628, Rv3615c, and Rv3136 Mtb antigens. In some embodiments, the composition comprises Rv1009, Rv3136Nt, Rv2628, Rv2034, and Rv3615c Mtb antigens. In some embodiments, the composition further comprises a pharmaceutically acceptable carrier.


The present disclosure also provides compositions comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens. In some embodiments, the composition comprises one Mtb antigen in protein form and one or two nucleic acid molecules encoding two Mtb antigens. In some embodiments, the composition comprises two Mtb antigens in protein form, optionally as a fusion protein, and one nucleic acid molecule encoding one Mtb antigen. Thus, the present composition is a mixture of a protein Mtb antigen(s) and nucleic acid molecule(s) encoding an Mtb antigen(s).


In some embodiments, at least two Mtb antigens are encoded by one or more nucleic acid molecules within one or more vectors. In some embodiments, the one or more vectors is one or more viral vectors. In some embodiments, the one or more viral vectors are any one or more of an adenovirus vector, an adeno-associated virus vector, a poxvirus vector (such as, for example, vaccinia virus vector), a paramyxovirus vector, a fowlpox virus vector, an attenuated yellow fever vectors (such as, for example, YFV-17D), an alphavirus vector, a retrovirus vector (such as, for example, lentivirus vector), a Sendai virus vector, and CMV vector. In some embodiments, the adenovirus vector is adenovirus 4, adenovirus 5, chimpanzee adenovirus 3, chimpanzee adenovirus 63, or chimpanzee adenovirus 68. In some embodiments, the vaccinia virus vector is MVA. In some embodiments, the paramyxovirus vector is PIV2 or rHPIV2. In some embodiments, the CMV vector is a RhCMV vector of an HCMV vector. In some embodiments, the vector is present within a composition comprising a pharmaceutically acceptable carrier. In some embodiments, the at least two Mtb antigens are encoded by a single nucleic acid molecule within the same expression vector as a fusion protein. In some embodiments, the one or more vectors is a non-viral vector. In some embodiments, the non-viral vector is RNA, such as mRNA. In some embodiments, the mRNA is protamine-complexed mRNA. An exemplary mRNA vector platform is RNActive® (CureVac Inc).


In some embodiments, where a rBCG is used as the vehicle to deliver the Mtb antigens, or fusion proteins, or nucleic acids and or vectors comprising or encoding the same, expression of all or part of the Dos R regulon is not up-regulated in the rBCG. In some embodiments, one or more of the following Dos R regulon antigens are not up-regulated in the rBCG: Rv1738, Rv2623, Rv2031c, Rv2032, Rv2626c, Rv2005c, Rv3127, Rv1733c, Rv1996, Rv2628c, Rv0079, Rv3130c, Rv3131, Rv1813c, Rv2006, Rv2029c, Rv2627c, Rv2030c, Rv3132c, and Rv2629. In some embodiments, the rBCG does not comprise up-regulation of: 1) one or more Mtb antigens, including “classical” Mtb antigens such as 85A, 85B and TB 10.4; and 2) at least one Mtb resuscitation antigen selected from Rv0867c, Rv1009, Rv1884c, Rv2389c, Rv2450c, Rv0288, Rv1009, Rv0685, Rv0824c, Rv1349, Rv2744c, Rv3347c, Rv1130, and Rv1169c. In some embodiments, the rBCG does not include the expression of the following combinations: classical antigens Rv1886c, Rv3804c; resuscitation antigens Rv0867c, Rv1884c, Rv2389c; and Mtb-specific antigen Rv3407. In some embodiments, the rBCG does not include the expression of the following combination: Rv3804c, Rv1886c, and Rv3407, or in addition with Rv3133c, and with the combination of Rv0867c, Rv1884c, and Rv2389c. In some embodiments, the rBCG does not include the expression of the following combination: TB10.4, Ag85B, Ag85A, and Rv3407. In some embodiments, the cell is not a rBCG.


The present disclosure also provides compositions comprising any one or more of the fusion proteins, Mtb antigens, nucleic acid molecules encoding Mtb antigens, including fusion proteins thereof, cells, and/or vectors and a pharmaceutically acceptable carrier. Compositions include, for example, pharmaceutical compositions. A pharmaceutically acceptable carrier refers to at least one component of a pharmaceutical preparation that is normally used for administration of active ingredients. As such, a carrier can contain any pharmaceutical excipient used in the art and any form of vehicle for administration. Carriers include, but are not limited to, phosphate buffered saline, physiological saline, water, citrate/sucrose/Tween formulations and emulsions such as, for example, oil/water emulsions.


The compositions can also include an active therapeutic agent and a variety of other pharmaceutically acceptable components. See Remington's Pharmaceutical Science (15th ed., Mack Publishing Company, Easton, Pennsylvania (1980)). The desired form depends on the intended mode of administration and therapeutic application. The compositions can also include, depending on the formulation desired, pharmaceutically acceptable, non-toxic carriers or diluents, which are defined as vehicles commonly used to formulate pharmaceutical compositions for animal or human administration. The diluent is selected so as not to affect the biological activity of the combination. Examples of such diluents include, but are not limited to, distilled water, physiological phosphate-buffered saline, Ringer's solutions, dextrose solution, and Hank's solution. In addition, the pharmaceutical composition or formulation may also include other carriers, adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like.


Solid formulations of the compositions for oral administration can contain suitable carriers or excipients, such as corn starch, gelatin, lactose, acacia, sucrose, microcrystalline cellulose, kaolin, mannitol, dicalcium phosphate, calcium carbonate, sodium chloride, or alginic acid. Disintegrators that can be used include, without limitation, microcrystalline cellulose, corn starch, sodium starch glycolate, and alginic acid. Tablet binders that can be used include acacia, methylcellulose, sodium carboxymethylcellulose, polyvinylpyrrolidone (Povidone™), hydroxypropyl methylcellulose, sucrose, starch, and ethylcellulose. Lubricants that can be used include magnesium stearates, stearic acid, silicone fluid, talc, waxes, oils, and colloidal silica. Additional excipients include, for example, colorants, taste-masking agents, solubility aids, suspension agents, compressing agents, enteric coatings, sustained release aids, and the like.


In some embodiments, the compositions can be administered in the form of a depot injection or implant preparation, which can be formulated in such a manner as to permit a sustained release. An exemplary composition comprises any one or more of the compositions described herein formulated in aqueous buffer.


In some embodiments, liquid formulations of a pharmaceutical composition for oral administration prepared in water or other aqueous vehicles can contain various suspending agents such as methylcellulose, alginates, tragacanth, pectin, kelgin, carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl alcohol. Liquid formulations of pharmaceutical compositions can also include solutions, emulsions, syrups and elixirs containing, together with the active compound(s), wetting agents, sweeteners, and coloring and flavoring agents. Various liquid and powder formulations of the pharmaceutical compositions can be prepared by conventional methods for inhalation into the lungs of the mammal to be treated.


In some embodiments, liquid formulations of a pharmaceutical composition for injection can comprise various carriers such as vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate, isopropyl myristate, ethanol, polyols such as, for example, glycerol, propylene glycol, liquid polyethylene glycol, and the like. In some embodiments, the composition includes a citrate/sucrose/tween carrier. For intravenous injections, water soluble versions of the compositions can be administered by the drip method, whereby a pharmaceutical formulation containing the antifungal agent and a physiologically acceptable excipient is infused. Physiologically acceptable excipients can include, for example, 5% dextrose, 0.9% saline, Ringer's solution or other suitable excipients. A suitable insoluble form of the composition can be prepared and administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base, such as an ester of a long chain fatty acid such as, for example, ethyl oleate.


The compositions can be, for example, injectable solutions, aqueous suspensions or solutions, non-aqueous suspensions or solutions, solid and liquid oral formulations, salves, gels, ointments, intradermal patches, creams, aerosols, lotions, tablets, capsules, sustained release formulations, and the like. In some embodiments, for topical applications, the pharmaceutical compositions can be formulated in a suitable ointment. In some embodiments, a topical semi-solid ointment formulation typically comprises a concentration of the active ingredient from about 1 to 20%, or from 5 to 10%, in a carrier, such as a pharmaceutical cream base. Some examples of formulations of a composition for topical use include, but are not limited to, drops, tinctures, lotions, creams, solutions, and ointments containing the active ingredient and various supports and vehicles.


Typically, compositions are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection can also be prepared. The preparation also can be emulsified or encapsulated in liposomes or microparticles such as polylactide, polyglycolide, or copolymer for enhanced adjuvant effect (see Langer, Science, 1990, 249, 1527 and Hanes, Advanced Drug Delivery Reviews, 1997, 28, 97). A sterile injectable preparation such as, for example, a sterile injectable aqueous or oleaginous suspension can also be prepared. This suspension may be formulated according to techniques known in the art using suitable dispersing, wetting, and suspending agents. In some embodiments, the pharmaceutical composition can be delivered in a microencapsulation device so as to reduce or prevent a host immune response against the protein.


In some embodiments, any of the Mtb antigens, constructs, vectors, or cells described herein, or compositions comprising the same, can be combined into a single therapeutic or prophylactic regimen. For example, in some embodiments, an antigen matched BCG can be combined or used with a recombinant protein vaccine.


In some embodiments, any of the Mtb antigens, constructs, vectors, or cells described herein, or compositions comprising the same, can be administered to a mammal as an aerosol. In some embodiments, the aerosol inocula comprises saline. Conventional aerosol delivery devices include, but are not limited to, a pressurized metered dose inhaler (pMDI) and a dry power inhaler (DPI), both of which deliver a dry powder formulation, and nebulizers such as the PARI eFlow device, which delivers an aqueous dose as a fine mist. In some embodiments, the aerosol delivery device is a Pari eFlow portable electronic aerosol delivery platform attached to a delivery mask. In some embodiments, the average particle size is from about 1 μm to about 10 μm, from about 1 μm to about 5 μm, from about 3 μm to about 5 μm, from about 4 μm to about 5 μm, or from about 3.9 μm to about 4.9 μm. In some embodiments, the aerosol is in a volume from about 0.1 ml to about 5 ml, from about 0.1 ml to about 2 ml, from about 0.1 ml to about 1.5 ml, from about 0.5 ml to about 1.5 ml, from about 0.5 ml to about 1.2 ml, from about 0.7 ml to about 1.2 ml, or about 1 ml.


Effective doses of the compositions of the present disclosure, for the treatment of a condition vary depending upon many different factors, including means of administration, target site, physiological state of the subject, whether the subject is human or an animal, other medications administered, and whether treatment is prophylactic or therapeutic. Usually, the subject is a human but non-human mammals including transgenic mammals can also be treated.


In some embodiments, the compositions can be administered to a subject by injection intravenously, subcutaneously, intraperitoneally, intramuscularly, intramedullarily, intraventricularly, intraepidurally, intraarterially, intravascularly, intraarticularly, intrasynovially, intrasternally, intrathecally, intrahepatically, intraspinally, intratumorly, intracranially, enteral, intrapulmonary, transmucosal, intrauterine, sublingual, or locally at sites of inflammation or tumor growth by using standard methods. Alternately, the compositions can be administered to a subject by routes including oral, nasal, ophthalmic, rectal, or topical. The most typical route of administration is intravascular, subcutaneous, or intramuscular, although other routes can be effective. In some embodiments, compositions are administered as a sustained release composition or device, such as a Medipad™ device. The composition can also be administered via the respiratory tract, for example, using a dry powder inhalation device, nebulizer, or a metered dose inhaler. The composition can also be administered by traditional syringes, needleless injection devices, “microprojectile bombardment gone guns,” or other physical methods such as electroporation (“EP”), “hydrodynamic method”, or ultrasound.


In some embodiments, the composition can be administered to a subject by sustained release administration, by such means as depot injections of erodible implants directly applied during surgery or by implantation of an infusion pump or a biocompatible sustained release implant into the subject. Alternately, the composition can be administered to a subject by injectable depot routes of administration, such as by using 1-, 3-, or 6-month depot injectable or biodegradable materials and methods, or by applying to the skin of the subject a transdermal patch containing the composition, and leaving the patch in contact with the subject's skin, generally for 1 to 5 hours per patch.


In some embodiments, the compositions comprise about 1 nanogram to about 10 mg of nucleic acid. In some embodiments, the compositions comprise: 1) at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or 100 nanograms, or at least 1, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, 255, 260, 265, 270, 275, 280, 285, 290, 295, 300, 305, 310, 315, 320, 325, 330, 335, 340, 345, 350, 355, 360, 365, 370, 375, 380, 385, 390, 395, 400, 405, 410, 415, 420, 425, 430, 435, 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 745, 750, 755, 760, 765, 770, 775, 780, 785, 790, 795, 800, 805, 810, 815, 820, 825, 830, 835, 840, 845, 850, 855, 860, 865, 870, 875, 880, 885, 890, 895. 900, 905, 910, 915, 920, 925, 930, 935, 940, 945, 950, 955, 960, 965, 970, 975, 980, 985, 990, 995 or 1000 micrograms, or at least 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5 or 10 mg or more; and 2) up to and including 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or 100 nanograms, or up to and including 1, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, 255, 260, 265, 270, 275, 280, 285, 290, 295, 300, 305, 310, 315, 320, 325, 330, 335, 340, 345, 350, 355, 360, 365, 370, 375, 380, 385, 390, 395, 400, 405, 410, 415, 420, 425, 430, 435, 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 745, 750, 755, 760, 765, 770, 775, 780, 785, 790, 795, 800, 805, 810, 815, 820, 825, 830, 835, 840, 845, 850, 855, 860, 865, 870, 875, 880, 885, 890, 895. 900, 905, 910, 915, 920, 925, 930, 935, 940, 945, 950, 955, 960, 965, 970, 975, 980, 985, 990, 995, or 1000 micrograms, or up to and including 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5 or 10 mg.


In some embodiments, the compositions comprise about 5 nanograms to about 10 mg of nucleic acid molecule. In some embodiments, the compositions comprise about 25 nanograms to about 5 mg of nucleic acid molecule. In some embodiments, the compositions contain about 50 nanograms to about 1 mg of nucleic acid molecule. In some embodiments, the compositions contain about 0.1 to about 500 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 1 to about 350 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 5 to about 250 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 10 to about 200 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 15 to about 150 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 20 to about 100 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 25 to about 75 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 30 to about 50 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 35 to about 40 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 100 to about 200 micrograms of nucleic acid molecule. In some embodiments, the compositions comprise about 10 to about 100 micrograms of nucleic acid molecule. In some embodiments, the compositions comprise about 20 to about 80 micrograms of nucleic acid molecule. In some embodiments, the compositions comprise about 25 to about 60 micrograms of nucleic acid molecule. In some embodiments, the compositions comprise about 30 nanograms to about 50 micrograms of nucleic acid molecule. In some embodiments, the compositions comprise about 35 nanograms to about 45 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 0.1 to about 500 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 1 to about 350 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 25 to about 250 micrograms of nucleic acid molecule. In some embodiments, the compositions contain about 100 to about 200 micrograms of nucleic acid molecule.


In some embodiments, the delivery platforms described herein can be used either in a single administration alone or in combinations as matched antigen prime-boost approaches. In addition, the use of these antigens in a single vector system, which is envisioned to be used as an antigen matched prime for a boost with any of the modalities above, including protein, viral vectors, nucleic acids, or others. For example, the same Mtb antigen construct can be used as both the prime and the boost. In other embodiments, a first Mtb antigen construct can be used as the prime and a second different Mtb antigen construct can be used as the boost (i.e., heterologous prime-boost). In some embodiments, the prime is a DNA or RNA (such as mRNA) prime and the boost is a viral vector boost. In some embodiments, the prime is a viral vector prime and the boost is a DNA or RNA (such as mRNA) boost.


The compositions can be formulated according to the mode of administration to be used. In cases where compositions are injectable pharmaceutical compositions, they are sterile, pyrogen free and particulate free. An isotonic formulation can be used. Generally, additives for isotonicity can include sodium chloride, dextrose, mannitol, sorbitol and lactose. In some cases, isotonic solutions such as phosphate buffered saline are suitable. Stabilizers include gelatin and albumin. In some embodiments, a vasoconstriction agent is added to the formulation.


The compositions can further comprise a pharmaceutically acceptable excipient. The pharmaceutically acceptable excipient can be functional molecules as vehicles, adjuvants, carriers, or diluents. The pharmaceutically acceptable excipient can be a transfection facilitating agent, which can include surface active agents, such as immune-stimulating complexes (ISCOMS), Freund's incomplete adjuvant, LPS analog including monophosphoryl lipid A, muramyl peptides, quinone analogs, vesicles such as squalene and squalane, hyaluronic acid, lipids, liposomes, calcium ions, viral proteins, polyanions, polycations, or nanoparticles, or other known transfection facilitating agents.


The transfection facilitating agent is a polyanion, polycation, including poly-L-glutamate (LGS), or lipid. The transfection facilitating agent is poly-L-glutamate, and more suitably, the poly-L-glutamate is present in the composition at a concentration less than 6 mg/ml. The transfection facilitating agent can also include surface active agents such as immune-stimulating complexes (ISCOMS), Freunds incomplete adjuvant, LPS analog including monophosphoryl lipid A, muramyl peptides, quinone analogs and vesicles such as squalene and squalane, and hyaluronic acid can also be used administered in conjunction with the genetic construct. In some embodiments, the plasmid compositions can also include a transfection facilitating agent such as lipids, liposomes, including lecithin liposomes or other liposomes known in the art, as a DNA-liposome mixture (see for example WO9324640), calcium ions, viral proteins, polyanions, polycations, or nanoparticles, or other known transfection facilitating agents. In some embodiments, the transfection facilitating agent is a polyanion, polycation, including poly-L-glutamate (LGS), or lipid. Concentration of the transfection agent in the composition is less than 4 mg/ml, less than 2 mg/ml, less than 1 mg/ml, less than 0.750 mg/ml, less than 0.500 mg/ml, less than 0.250 mg/ml, less than 0.100 mg/ml, less than 0.050 mg/ml, or less than 0.010 mg/ml.


The pharmaceutically acceptable excipient may be an adjuvant. The adjuvant may be other genes that are expressed in alternative plasmid or are delivered as proteins in combination with the plasmid above. The adjuvant may be selected from the group consisting of: α-interferon (IFN-α), β-interferon (IFN-−), γ-interferon, platelet derived growth factor (PDGF), TNFα, TNFβ, GM-CSF, epidermal growth factor (EGF), cutaneous T cell-attracting chemokine (CTACK), epithelial thymus-expressed chemokine (TECK), mucosae-associated epithelial chemokine (MEC), IL-12, IL-15, MHC, CD80, CD86 including IL-15 having the signal sequence deleted and optionally including the signal peptide from IgE. The adjuvant may be IL-12, IL-15, IL-28, CTACK, TECK, platelet derived growth factor (PDGF), TNFα, TNFβ, GM-CSF, epidermal growth factor (EGF), IL-1, IL-2, IL-4, IL-5, IL-6, IL-10, IL-12, IL-18, or a combination thereof.


Other genes which may be useful adjuvants include those encoding: MCP-1, MIP-la, MIP-1p, IL-8, L-selectin, P-selectin, E-selectin, CD34, GlyCAM-1, MadCAM-1, LFA-1, VLA-1, Mac-1, p150.95, PECAM, ICAM-1, ICAM-2, ICAM-3, CD2, LFA-3, M-CSF, G-CSF, IL-4, mutant forms of IL-18, CD40, CD40L, vascular growth factor, fibroblast growth factor, IL-7, nerve growth factor, vascular endothelial growth factor, Fas, TNF receptor, Flt, Apo-1, p55, WSL-1, DR3, TRAMP, Apo-3, AIR, LARD, NGRF, DR4, DR5, KILLER, TRAIL-R2, TRICK2, DR6, Caspase ICE, Fos, c-jun, Sp-1, Ap-1, Ap-2, p38, p65Rel, MyD88, IRAK, TRAF6, IkB, Inactive NIK, SAP K, SAP-1, JNK, interferon response genes, NFkB, Bax, TRAIL, TRAILrec, TRAILrecDRC5, TRAIL-R3, TRAIL-R4, RANK, RANK LIGAND, Ox40, Ox40 LIGAND, NKG2D, MICA, MICB, NKG2A, NKG2B, NKG2C, NKG2E, NKG2F, TAP1, TAP2 and functional fragments thereof.


The present disclosure also provides kits comprising any of the Mtb antigens, fragments thereof, fusion proteins, nucleic acid molecules, vectors, or cells, described herein. The kit can include, for example, container(s), package(s) or dispenser(s) along with labels and instructions for administration or use.


The present disclosure also provides methods of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of one or more fusion proteins comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens. Any of the fusion proteins described herein can be administered. In some embodiments, the fusion protein comprises at least three Mtb antigens. In some embodiments, the fusion protein comprises at least four Mtb antigens. In some embodiments, the fusion protein comprises at least five Mtb antigens.


The present disclosure also provides methods of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of a composition comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier. Any of the compositions comprising two or more Mtb antigens can be administered. In some embodiments, the composition comprises at least three Mtb antigens. In some embodiments, the composition comprises at least four Mtb antigens. In some embodiments, the composition comprises at least five Mtb antigens.


The present disclosure also provides methods of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of a composition comprising at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier; wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens. Any of the compositions comprising a mixture of one or more Mtb antigen proteins and one of more nucleic acid molecules encoding one or more Mtb antigens described herein can be administered.


The fusion proteins and compositions described herein can be used to treat or prevent tuberculosis. In some embodiments, the method comprises administering to a human a therapeutically- or prophylactically-effective amount of any of the fusion proteins or compositions described herein such that the tuberculosis infection is diminished or prevented.


In some embodiments, the subject being treated will have been previously diagnosed as having tuberculosis. Such subjects will, thus, have been diagnosed as being in need of such treatment. Alternately, the treatment may be intended to prevent a tuberculosis infection in a subject that does not yet have tuberculosis or to a subject that is travelling to an area where tuberculosis is prevalent.


Treatment of a subject suffering from tuberculosis can be monitored using standard methods. Some methods entail determining a baseline value, for example, of an antibody level or profile in a subject, before administering a dosage of agent, and comparing this with a value for the profile or level after treatment. A significant increase such as, for example, greater than the typical margin of experimental error in repeat measurements of the same sample, expressed as one standard deviation from the mean of such measurements in value of the level or profile signals a positive treatment outcome (i.e., that administration of the agent has achieved a desired response). If the value for immune response does not change significantly, or decreases, a negative treatment outcome is indicated.


In other embodiments, a control value such as a mean and standard deviation, of level or profile is determined for a control population. Typically the individuals in the control population have not received prior treatment. Measured values of the level or profile in a subject after administering a therapeutic agent are then compared with the control value. A significant increase relative to the control value, such as greater than one standard deviation from the mean, signals a positive or sufficient treatment outcome. A lack of significant increase or a decrease signals a negative or insufficient treatment outcome. Administration of the therapeutic is generally continued while the level is increasing relative to the control value. As before, attainment of a plateau relative to control values is an indicator that the administration of treatment can be discontinued or reduced in dosage and/or frequency.


In other embodiments, a control value of the level or profile, such as a mean and standard deviation, is determined from a control population of individuals who have undergone treatment with a therapeutic agent and whose levels or profiles have plateaued in response to treatment. Measured values of levels or profiles in a subject are compared with the control value. If the measured level in a subject is not significantly different, such as by more than one standard deviation, from the control value, treatment can be discontinued. If the level in a subject is significantly below the control value, continued administration of agent is warranted. If the level in the subject persists below the control value, then a change in treatment may be indicated.


In other embodiments, a subject who is not presently receiving treatment but has undergone a previous course of treatment is monitored for antibody levels or profiles to determine whether a resumption of treatment is required. The measured level or profile in the subject can be compared with a value previously achieved in the subject after a previous course of treatment. A significant decrease relative to the previous measurement, such as greater than a typical margin of error in repeat measurements of the same sample, is an indication that treatment can be resumed. Alternately, the value measured in a subject can be compared with a control value (mean plus standard deviation) determined in a population of subjects after undergoing a course of treatment. Alternately, the measured value in a subject can be compared with a control value in populations of prophylactically treated subjects who remain free of symptoms of disease, or populations of therapeutically treated subjects who show amelioration of disease characteristics. In all of these cases, a significant decrease relative to the control level, such as more than a standard deviation, is an indicator that treatment should be resumed in a subject.


In some methods, a baseline measurement of antibody to a given antigen in the subject is made before administration, a second measurement is made soon thereafter to determine the peak antibody level, and one or more further measurements are made at intervals to monitor decay of antibody levels. When the level of antibody has declined to baseline or a predetermined percentage of the peak less baseline, such as 50%, 25% or 10%, administration of a further dosage of antigen is administered. In some embodiments, peak or subsequent measured levels less background are compared with reference levels previously determined to constitute a beneficial prophylactic or therapeutic treatment regime in other subjects. If the measured antibody level is significantly less than a reference level, such as less than the mean minus one standard deviation of the reference value in population of subjects benefiting from treatment, administration of an additional dosage of antigen is indicated.


In some embodiments, the subject(s) that can be treated by the above-described methods is an animal, including mammals and non-mammals. Suitable mammals, include, but are not limited to, humans, non-human primates, rodents (including rats, mice, hamsters and guinea pigs) cow, horse, sheep, badger, opossum, goat, pig, dog and cat. In most instances, the mammal is a human. In some embodiments, the non-mammal is a fish. Immunization of animals with any one or more of the vaccines described herein can prevent zoonotic transmission (i.e., transition of a disease, such as TB, from an animal to a human).


The present disclosure also provides fusion proteins for use in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides fusion proteins for use in treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides uses of a fusion protein in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides uses of a fusion protein in treating or preventing a Mycobacterium tuberculosis infection, wherein the fusion protein comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens.


The present disclosure also provides compositions for use in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides compositions for use in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides uses of a composition in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides uses of a composition in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier.


The present disclosure also provides compositions for use in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides compositions for use in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides uses of a composition in the preparation of a medicament for treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides uses of a composition in treating or preventing a Mycobacterium tuberculosis infection, wherein the composition comprises at least two or three Mycobacterium tuberculosis (Mtb) antigens, and a pharmaceutically acceptable carrier, wherein the composition comprises at least one nucleic acid molecule encoding at least one of the Mtb antigens.


The present disclosure also provides any of the fusion proteins described herein, or any of the compositions described herein, or any of the cells described herein, or any of the vectors described herein, or any of the methods described herein, or any of the uses described herein, substantially as described with reference to the accompanying examples and/or figures.


Embodiment 1. A composition or fusion protein comprising at least three Mycobacterium tuberculosis (Mtb) antigens, wherein the Mtb antigens are chosen from Rv1009, Rv3136, Rv3615c, Rv2628, Rv2034, and Rv3136 N-terminus.


Embodiment 2. The composition or fusion protein according to embodiment 1 wherein Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:4.


Embodiment 3. The composition or fusion protein according to embodiment 1 wherein Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO:3.


Embodiment 4. The composition or fusion protein according to embodiment 1 wherein Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:8.


Embodiment 5. The composition or fusion protein according to embodiment 1 wherein Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:6, or SEQ ID NO:7.


Embodiment 6. The composition or fusion protein according to embodiment 1 wherein Rv3615c comprises the amino acid sequence set forth in SEQ ID NO:11.


Embodiment 7. The composition or fusion protein according to embodiment 1 wherein Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:9 or SEQ ID NO:10.


Embodiment 8. The composition or fusion protein according to embodiment 1 wherein Rv2628 comprises the amino acid sequence set forth in SEQ ID NO: 14.


Embodiment 9. The composition or fusion protein according to embodiment 1 wherein Rv2628 is encoded by a nucleotide sequence set forth in SEQ ID NO:12 or SEQ ID NO:13.


Embodiment 10. The composition or fusion protein according to embodiment 1 wherein Rv2034 comprises the amino acid sequence set forth in SEQ ID NO: 16.


Embodiment 11. The composition or fusion protein according to embodiment 1 wherein Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO:15.


Embodiment 12. The composition or fusion protein according to embodiment 1 wherein Rv3136Nt comprises the amino acid sequence set forth in SEQ ID NO:18.


Embodiment 13. The composition or fusion protein according to embodiment 1 wherein Rv3136Nt is encoded by a nucleotide sequence set forth in SEQ ID NO:17.


Embodiment 14. The composition or fusion protein according to any one of embodiments 1 to 13 comprising at least four Mycobacterium tuberculosis (Mtb) antigens.


Embodiment 15. The composition or fusion protein according to any one of embodiments 1 to 13 comprising at least five Mycobacterium tuberculosis (Mtb) antigens.


Embodiment 16. The composition or fusion protein according to embodiment 1 comprising: Rv1009, Rv3615c, and Rv3136 Mtb antigens; Rv1009, Rv2034, and Rv3136 Mtb antigens; Rv1009, Rv3615c, and Rv3136 Mtb antigens; Rv1009, Rv2628, Rv3615c, and Rv3136 Mtb antigens; Rv1009, Rv3615c, Rv2034, and Rv2628 Mtb antigens; Rv2034, Rv3615c, Rv2628, and Rv3136 Mtb antigens; Rv1009, Rv2034, Rv2628, Rv3615c, and Rv3136 Mtb antigens; Rv1009, Rv3136Nt, Rv2628, Rv2034, and Rv3615c Mtb antigens; or Rv2034, Rv3615c, Rv3136Nt, Rv2628, and Rv1009 Mtb antigens.


Embodiment 17. The composition or fusion protein according to embodiment 16 wherein: Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:4; Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:8; Rv3615c comprises the amino acid sequence set forth in SEQ ID NO:11; Rv2628 comprises the amino acid sequence set forth in SEQ ID NO:14; Rv2034 comprises the amino acid sequence set forth in SEQ ID NO:16; and Rv3136Nt comprises the amino acid sequence set forth in SEQ ID NO:18.


Embodiment 18. The composition or fusion protein according to embodiment 16 wherein: Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO: 3; Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:6, or SEQ ID NO:7; Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:9 or SEQ ID NO: 10; Rv2628 is encoded by a nucleotide sequence set forth in SEQ ID NO:12 or SEQ ID NO: 13; Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO: 15; and Rv3136Nt is encoded by a nucleotide sequence set forth in SEQ ID NO:17.


Embodiment 19. The fusion protein according to embodiment 1 comprising: Rv1009-Rv3615c-Rv3136; Rv1009-Rv2034-Rv3136; Rv1009-Rv2628-Rv3615c-Rv3136; Rv1009-Rv2034-Rv2628-Rv3615c-Rv3136; Rv2034-Rv1009-Rv3136; Rv3136-Rv2034-Rv1009; Rv1009-Rv3615c-Rv2034-Rv2628; Rv3615c-Rv2034-Rv2628-Rv1009; Rv2034-Rv3615c-Rv2628-Rv3136; Rv3136-Rv2628-Rv3615c-Rv2034; Rv1009-Rv3136Nt-Rv2628-Rv2034-Rv3615c; Rv2034-Rv3615c-Rv3136Nt-Rv2628-Rv1009; Rv3615c-Rv2628-Rv1009-Rv3136Nt-Rv2034; Rv1009-Rv2628-Rv3136Nt-Rv2034-Rv3615c; Rv2628-Rv3136Nt-Rv1009-Rv2034-Rv3615c; or Rv2628-Rv3136Nt-Rv2034-Rv3615c-Rv1009.


Embodiment 20. The fusion protein according to embodiment 19 wherein: Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:4; Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:8; Rv3615c comprises the amino acid sequence set forth in SEQ ID NO:11; Rv2628 comprises the amino acid sequence set forth in SEQ ID NO: 14; Rv2034 comprises the amino acid sequence set forth in SEQ ID NO:16; and Rv3136Nt comprises the amino acid sequence set forth in SEQ ID NO:18.


Embodiment 21. The fusion protein according to embodiment 19 wherein: Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO: 3; Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:6, or SEQ ID NO:7; Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:9 or SEQ ID NO: 10; Rv2628 is encoded by a nucleotide sequence set forth in SEQ ID NO: 12 or SEQ ID NO: 13; Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO: 15; and Rv3136Nt is encoded by a nucleotide sequence set forth in SEQ ID NO:17.


Embodiment 22. The fusion protein according to embodiment 19 wherein: Rv1009-Rv3615c-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:39; Rv1009-Rv2034-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:41; Rv1009-Rv2628-Rv3615c-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:43; Rv1009-Rv2034-Rv2628-Rv3615c-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:45; Rv2034-Rv1009-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:47; Rv3136-Rv2034-Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:49; Rv1009-Rv3615c-Rv2034-Rv2628 comprises the amino acid sequence set forth in SEQ ID NO:51; Rv3615c-Rv2034-Rv2628-Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:53; Rv2034-Rv3615c-Rv2628-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:55; Rv3136-Rv2628-Rv3615c-Rv2034 comprises the amino acid sequence set forth in SEQ ID NO:57; Rv1009-Rv3136Nt-Rv2628-Rv2034-Rv3615c comprises the amino acid sequence set forth in SEQ ID NO:59; Rv2034-Rv3615c-Rv3136Nt-Rv2628-Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:61; Rv3615c-Rv2628-Rv1009-Rv3136Nt-Rv2034 comprises the amino acid sequence set forth in SEQ ID NO:63; Rv1009-Rv2628-Rv3136Nt-Rv2034-Rv3615c comprises the amino acid sequence set forth in SEQ ID NO:71; Rv2628-Rv3136Nt-Rv1009-Rv2034-Rv3615c comprises the amino acid sequence set forth in SEQ ID NO:73; and Rv2628-Rv3136Nt-Rv2034-Rv3615c-Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:75.


Embodiment 23. The fusion protein according to embodiment 19 wherein: Rv1009-Rv3615c-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:38; Rv1009-Rv2034-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:40; Rv1009-Rv2628-Rv3615c-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:42; Rv1009-Rv2034-Rv2628-Rv3615c-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:44; Rv2034-Rv1009-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:46; Rv3136-Rv2034-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:48; Rv1009-Rv3615c-Rv2034-Rv2628 is encoded by a nucleotide sequence set forth in SEQ ID NO:50; Rv3615c-Rv2034-Rv2628-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:52; Rv2034-Rv3615c-Rv2628-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:54; Rv3136-Rv2628-Rv3615c-Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO:56; Rv1009-Rv3136Nt-Rv2628-Rv2034-Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:58; Rv2034-Rv3615c-Rv3136Nt-Rv2628-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:60; Rv3615c-Rv2628-Rv1009-Rv3136Nt-Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO:62; Rv1009-Rv2628-Rv3136Nt-Rv2034-Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:70; Rv2628-Rv3136Nt-Rv1009-Rv2034-Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:72; and Rv2628-Rv3136Nt-Rv2034-Rv3615c-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:74.


Embodiment 24. A composition or fusion protein comprising at least two Mycobacterium tuberculosis (Mtb) antigens, wherein the Mtb antigens are Rv1733 and Rv2626c.


Embodiment 25. The composition or fusion protein according to embodiment 24 wherein Rv1733 comprises the amino acid sequence set forth in SEQ ID NO:29.


Embodiment 26. The composition or fusion protein according to embodiment 24 wherein Rv1733 is encoded by a nucleotide sequence set forth in SEQ ID NO:28.


Embodiment 27. The composition or fusion protein according to embodiment 24 wherein Rv2626c comprises the amino acid sequence set forth in SEQ ID NO:31.


Embodiment 28. The composition or fusion protein according to embodiment 24 wherein Rv2626c is encoded by a nucleotide sequence set forth in SEQ ID NO:30.


Embodiment 29. The fusion protein according to claim 24 comprising Rv1733-Rv2626c.


Embodiment 30. The fusion protein according to embodiment 29 wherein: Rv1733 comprises the amino acid sequence set forth in SEQ ID NO:29; and Rv2626c comprises the amino acid sequence set forth in SEQ ID NO:31.


Embodiment 31. The fusion protein according to embodiment 29 wherein: Rv1733 is encoded by a nucleotide sequence set forth in SEQ ID NO:28; and Rv2626c is encoded by a nucleotide sequence set forth in SEQ ID NO:30.


Embodiment 32. The fusion protein according to embodiment 29 wherein Rv1733-Rv2626c comprises the amino acid sequence set forth in SEQ ID NO:67 Embodiment 33. The fusion protein according to embodiment 29 wherein Rv1733-Rv2626c is encoded by a nucleotide sequence set forth in SEQ ID NO:66.


Embodiment 34. A composition or fusion protein comprising at least three Mycobacterium tuberculosis (Mtb) antigens, wherein the Mtb antigens are chosen from Ag85A, Ag85B, Rv3407, RpfA, RpfC, and RpfD.


Embodiment 35. The composition or fusion protein according to embodiment 34 wherein Ag85A comprises the amino acid sequence set forth in SEQ ID NO:20.


Embodiment 36. The composition or fusion protein according to embodiment 34 wherein Ag85A is encoded by a nucleotide sequence set forth in SEQ ID NO: 19.


Embodiment 37. The composition or fusion protein according to embodiment 34 wherein Ag85B comprises the amino acid sequence set forth in SEQ ID NO:24 or SEQ ID NO:25.


Embodiment 38. The composition or fusion protein according to embodiment 34 wherein Ag85B is encoded by a nucleotide sequence set forth in SEQ ID NO:21, SEQ ID NO:22, or SEQ ID NO:23.


Embodiment 39. The composition or fusion protein according to embodiment 34 wherein Rv3407 comprises the amino acid sequence set forth in SEQ ID NO:27.


Embodiment 40. The composition or fusion protein according to embodiment 34 wherein Rv3407 is encoded by a nucleotide sequence set forth in SEQ ID NO:26.


Embodiment 41. The composition or fusion protein according to embodiment 34 wherein RpfA comprises the amino acid sequence set forth in SEQ ID NO:33.


Embodiment 42. The composition or fusion protein according to embodiment 34 wherein RpfA is encoded by a nucleotide sequence set forth in SEQ ID NO:32.


Embodiment 43. The composition or fusion protein according to embodiment 34 wherein RpfC comprises the amino acid sequence set forth in SEQ ID NO:35.


Embodiment 44. The composition or fusion protein according to embodiment 34 wherein RpfC is encoded by a nucleotide sequence set forth in SEQ ID NO:34.


Embodiment 45. The composition or fusion protein according to embodiment 34 wherein RpfD comprises the amino acid sequence set forth in SEQ ID NO:37.


Embodiment 46. The composition or fusion protein according to embodiment 34 wherein RpfD is encoded by a nucleotide sequence set forth in SEQ ID NO:36.


Embodiment 47. The composition or fusion protein according to embodiment 34 comprising: Ag85A, Ag85B, and Rv3407 Mtb antigens; or RpfA, RpfC, and RpfD Mtb antigens.


Embodiment 48. The composition or fusion protein according to embodiment 47 wherein: Ag85A comprises the amino acid sequence set forth in SEQ ID NO:20; Ag85B comprises the amino acid sequence set forth in SEQ ID NO:24 or SEQ ID NO:25; Rv3407 comprises the amino acid sequence set forth in SEQ ID NO:27; RpfA comprises the amino acid sequence set forth in SEQ ID NO:33; RpfC comprises the amino acid sequence set forth in SEQ ID NO:35; and RpfD comprises the amino acid sequence set forth in SEQ ID NO:37.


Embodiment 49. The composition or fusion protein according to embodiment 47 wherein: Ag85A is encoded by a nucleotide sequence set forth in SEQ ID NO:19; Ag85B is encoded by a nucleotide sequence set forth in SEQ ID NO:21, SEQ ID NO:22, or SEQ ID NO:23; Rv3407 is encoded by a nucleotide sequence set forth in SEQ ID NO:26; RpfA is encoded by a nucleotide sequence set forth in SEQ ID NO:32; RpfC is encoded by a nucleotide sequence set forth in SEQ ID NO:34; and RpfD is encoded by a nucleotide sequence set forth in SEQ ID NO:36.


Embodiment 50. The fusion protein according to embodiment 34 comprising: Ag85A-Ag85B-Rv3407; or RpfA-RpfC-RpfD.


Embodiment 51. The fusion protein according to embodiment 50 wherein: Ag85A comprises the amino acid sequence set forth in SEQ ID NO:20; Ag85B comprises the amino acid sequence set forth in SEQ ID NO:24 or SEQ ID NO:25; Rv3407 comprises the amino acid sequence set forth in SEQ ID NO:27; RpfA comprises the amino acid sequence set forth in SEQ ID NO:33; RpfC comprises the amino acid sequence set forth in SEQ ID NO:35; and RpfD comprises the amino acid sequence set forth in SEQ ID NO:37.


Embodiment 52. The fusion protein according to embodiment 50 wherein: Ag85A is encoded by a nucleotide sequence set forth in SEQ ID NO:19; Ag85B is encoded by a nucleotide sequence set forth in SEQ ID NO:21, SEQ ID NO:22, or SEQ ID NO:23; Rv3407 is encoded by a nucleotide sequence set forth in SEQ ID NO:26; RpfA is encoded by a nucleotide sequence set forth in SEQ ID NO:32; RpfC is encoded by a nucleotide sequence set forth in SEQ ID NO:34; and RpfD is encoded by a nucleotide sequence set forth in SEQ ID NO:36.


Embodiment 53. The fusion protein according to embodiment 50 wherein: Ag85A-Ag85B-Rv3407 comprises the amino acid sequence set forth in SEQ ID NO:65; and RpfA-RpfC-RpfD comprises the amino acid sequence set forth in SEQ ID NO:69.


Embodiment 54. The fusion protein according to embodiment 50 wherein: Ag85A-Ag85B-Rv3407 is encoded by a nucleotide sequence set forth in SEQ ID NO:64; RpfA-RpfC-RpfD is encoded by a nucleotide sequence set forth in SEQ ID NO:68.


Embodiment 55. A pharmaceutical composition comprising the composition or fusion protein according to any one of embodiments 1 to 54 and a pharmaceutically acceptable carrier.


Embodiment 56. A vector encoding the fusion protein according to any one of embodiments 1 to 54.


Embodiment 57. The vector according to embodiment 56 which is a viral vector.


Embodiment 58. The vector according to embodiment 57 wherein the viral vector is chosen from an adenovirus vector, an adeno-associated virus vector, a poxvirus vector, a paramyxovirus vector, a fowlpox virus vector, an attenuated yellow fever vector, an alphavirus vector, a retrovirus vector, a Sendai virus vector, and a CMV vector.


Embodiment 59. The vector according to embodiment 58 wherein the adenovirus vector is adenovirus 4, adenovirus 5, chimpanzee adenovirus 3, chimpanzee adenovirus 63, or chimpanzee adenovirus 68.


Embodiment 60. The vector according to embodiment 58 wherein the poxvirus vector is vaccinia virus vector.


Embodiment 61. The vector according to embodiment 60 wherein the vaccinia virus vector is a modified vaccinia Ankara (MVA).


Embodiment 62. The vector according to embodiment 58 wherein the retrovirus vector is a lentivirus vector.


Embodiment 63. The vector according to embodiment 58 wherein the CMV vector is a RhCMV vector or a HCMV vector.


Embodiment 64. The vector according to embodiment 58 wherein the paramyxovirus vector is PIV2 or rHPIV2.


Embodiment 65. The vector according to embodiment 56 which is a non-viral vector.


Embodiment 66. The vector according to embodiment 65 wherein the non-viral vector is mRNA.


Embodiment 67. A cell comprising the vector according to any one of embodiments 56 to 66.


Embodiment 68. The cell according to embodiment 67 which is a recombinant BCG.


Embodiment 69. A method of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of the composition or fusion protein according to any one of embodiments 1 to 54.


Embodiment 70. A method of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of the composition according to embodiment 55.


Embodiment 71. A method of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of the vector according to any one of embodiments 56 to 66.


In order that the subject matter disclosed herein may be more efficiently understood, examples are provided below. It should be understood that these examples are for illustrative purposes only and are not to be construed as limiting the claimed subject matter in any manner. Throughout these examples, molecular cloning reactions, and other standard recombinant DNA techniques, were carried out according to methods described in Maniatis et al., Molecular Cloning—A Laboratory Manual, 2nd ed., Cold Spring Harbor Press (1989), using commercially available reagents, except where otherwise noted.


EXAMPLES
Example 1: Cloning, Overexpression and Purification of Fusion Proteins
Preparation of the Antigen Cassette and its Variants as Fusion Proteins

Cloning: To clone Rv1009-Rv3615c-Rv3136, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv1009-Rv2034-Rv3136, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv1009-Rv2628-Rv3615c-Rv3136, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv1009-Rv2034-Rv2628-Rv3615c-Rv3136, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv2034-Rv1009-Rv3136, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv3136-Rv2034-Rv1009, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv1009-Rv3615c-Rv2034-Rv2628, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv3615c-Rv2034-Rv2628-Rv1009, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv2034-Rv3615c-Rv2628-Rv3136, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct.


To clone Rv3136-Rv2628-Rv3615c-Rv2034, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv1009-Rv3136Nt-Rv2628-Rv2034-Rv3615c, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv1009-Rv3136Nt-Rv2628-Rv2034-Rv3615c, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv2034-Rv3615c-Rv3136Nt-Rv2628-Rv1009, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv3615c-Rv2628-Rv1009-Rv3136Nt-Rv2034, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv1009-Rv2628-Rv3136Nt-Rv2034-Rv3615c, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv2628-Rv3136N-Rv1009-Rv2034-Rv3615c, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


To clone Rv2628-Rv3136Nt-Rv2034-Rv3615c-Rv1009, the genes encoding the protein antigens were PCR amplified from Mtb H37Rv using primers that added restriction sites. Rv1009 was amplified without the N-terminal signal sequence. The amplified genes were cloned into the pET28b vector (Novagen) via the indicated restriction enzyme sites, creating N-terminal 6×His-tagged fusion proteins. The genes were cloned with no spacer sequences, only the restriction enzyme sites between each gene. The pET28b constructs were cloned in E. coli cloning strains, screened by restriction digest and sequenced to verify each construct. The DNA and amino acid sequences of fusions show Rv1009 minus its N-terminal signal sequence.


Expression: The plasmids encoding the fusion proteins were transformed into E. coli BL21 (Novagen) and T7 Express (New England Biolabs). Multiple colonies of each fusion construct were picked and grown overnight shaking at 37° C. in Tryptic Soy Broth (TSB) (Sigma). Overnight cultures were diluted 1:100 in TSB and grown shaking at 37° C. to OD600=0.6. Cultures were induced with 1 mM IPTG and grown shaking at 37° C. for 3 hours. Induced and uninduced aliquots of each culture were run on 4-12% Bis/Tris SDS-PAGE gels to verify induction of the fusion proteins. Colonies expressing each of the fusion proteins were frozen in TSB+20% glycerol at −80° C. as research stocks.


Purification of fusion proteins: Ten ml cultures were inoculated from glycerol stocks of the fusion constructs and grown overnight shaking at 37° C. The overnight cultures were diluted 1:100 in 250 ml TSB and grown shaking at 37° C. to OD600=0.6. Cultures were induced with 1 mM IPTG and grown shaking at 37° C. for 3 hours. An aliquot of the induced sample was run on a 4-12% Bis/Tris SDS-PAGE gel to confirm induction of the protein. The induced culture was centrifuged at 6,000×g for 10 m and pellets were frozen at −80° C. Pellets were thawed and resuspended in 10 ml BPER buffer (Thermo Scientific), and an aliquot was taken for testing (lysate). Lysozyme (20 U/ml) and DNase I (25 U/ml) were added to help complete cell lysis. The lysed cells were centrifuged at 12,000×g for 10 minutes and the supernatant was collected (soluble fraction). The insoluble pellet was resuspended in 10 ml BPER buffer and a 100 μl aliquot was removed (insoluble fraction). The cells in the resuspended pellet were diluted with 10 ml 10% BPER buffer and the suspension was centrifuged at 12,000×g for 10 minutes. The supernatant was discarded and the pellet was washed again with 10 ml 10% BPER buffer 3 more times. The lysate, soluble and insoluble fractions and washes were run on a 4-12% Bis/Tris SDS-PAGE gel to confirm expression and determine the subcellular localization of the protein. The fusion proteins were found localized to the insoluble pellet in inclusion bodies. The insoluble pellet was resuspended in 10 ml denaturing binding buffer (DBB) (8 M urea, 92 mM Na2HPO4, 7 mM NaH2PO4, 10 mM Tris) pH 7.8. The inclusion bodies were lysed by sonication, and the lysate was cleared of debris by centrifugation at 12,000×g for 20 minutes.


Proteins were purified by column purification. Five (5) ml of HisPur Cobalt resin (Thermo Scientific) was equilibrated with DBB and incubated with 5 ml of cleared lysate. The mixture was rocked at room temperature for 90 minutes. The lysate/resin mixture was then loaded on a 30 ml column and washed with 25 volumes of denaturing wash buffer (8 M urea, 25 mM Na2HPO4, 75 mM NaH2PO4, 10 mM Tris, 12 mM sodium deoxycholate, pH 7.8). His-tagged protein was eluted from the Co+ column by eluting with elution buffer (8 M urea, 10 mM Tris, 5% glycerol) pH 8.0 with 50, 100, 350, 500, and 1000 mM imidazole. Eluted proteins were run on a4-12% Bis/Tris SDS-PAGE gel and clean fractions were dialyzed stepwise from 8M urea, 10 mM Tris, 5% glycerol to 10 mM Tris, 5% glycerol. Dialyzed protein was analyzed by SDS-PAGE for purity, western blot for the presence of each antigen, and was assayed for the presence of residual endotoxin. Pure samples with <0.25 U endotoxin/ml were aliquoted and frozen at −80° C.


Example 2: Mouse Immunogenicity and Efficacy

The immunogenicity and efficacy of an antigen-matched recombinant BCG (422M, which overexpresses Ag85B, Ag85A, and Rv3407) with a recombinant protein vaccine (rBARv; comprised of Ag85B, Ag85A, and Rv3407) combined with the adjuvant PIKA was examined. At Week 0, mice were immunized in the scruff of the neck with approximately 2×106 CFU of BCG or 422M. At Weeks 6 and 8, mice received booster immunizations of 1 μg of rBARv with 100 μg of PIKA adjuvant in PBS, also administered subcutaneously in the scruff of the neck.


Immunogenicity: Two weeks following the final vaccination, a subset of mice were sacrificed and splenocytes were harvested for use in an ELISpot assay. As shown in FIG. 1, vaccination with both rBARv and 422M induced moderate immune responses against both Ag85B and Ag85A. In addition, a combination of 422M and BARv induced a potent immune response.


Efficacy: Four weeks after the final immunization, a subset of mice (15) from the naïve, BCG, 422M, BCG+BARv and 422M+BARv groups were challenged with 50 to 100 CFU of Mtb HN878. At Weeks 4, 12, and 20 post-challenge, five mice per group were sacrificed and the bacterial burden was evaluated in both lungs and spleen (see, FIG. 2). At each time point, vaccinated mice showed a significant degree of protection compared to naïve mice. However, the only group that demonstrated a significant increase in protection relative to BCG was the BCG+BARv group at week 12.


Example 3: Mouse Immunogenicity

The immunogenicity of an antigen-matched recombinant BCG (422M; which overexpresses Ag85B, Ag85A, and Rv3407) with a recombinant adenovirus serotype 5 (Ad5-C; encoding Ag85B, Ag85A, and Rv3407) was examined. This study was performed in two strains of mice commonly used in TB research (C57BL/6 mice (BL6) and CB6F1 mice). At Week 0, mice were immunized in the scruff of the neck with approximately 2×106 CFU of BCG or 422M. At week 6, mice were vaccinated with 1×109 viral particles of Ad5-C. At week 8, mice were sacrificed and the splenocytes were harvested for use in an intracellular cytokine staining assay. As shown above in FIG. 3, immunization with Ad5-C induced potent T cell responses that can be significantly enhanced with a BCG or rBCG prime. Similar immunogenicity data has been observed for PanAd, Ad4, and ChAd63 (data not shown).


Example 4: Mouse Immunogenicity

The immunogenicity of several Mtb antigens delivered as plasmid DNA (pVAX backbone) were examined in two studies two studies (NBL12-11 and NBL12-12). NBL12-12 included modifications of certain antigens, including addition of a secretory leader sequence (sec) or removal of a transmembrane domain (mod). Mice were immunized with individual antigens at weeks 0, 2, and 4. At week 6, mice were sacrificed and the splenocytes were harvested for use in ELISpot assays. In some instances a) where peptide was not available, protein was used for stimulation, and b) if both were available, both were used. The strongest responders included Ag85B, PPE51, and Rv2626 (see, FIG. 4). Modest responses were also detected against ESAT6, Rv3407, Rv2628, PPE15, and PE3 (see, FIG. 4).

Claims
  • 1. A composition or fusion protein comprising at least three Mycobacterium tuberculosis (Mtb) antigens, wherein the Mtb antigens are chosen from Rv1009, Rv3136, Rv3615c, Rv2628, Rv2034, and Rv3136 N-terminus.
  • 2-7. (canceled)
  • 8. The fusion protein according to claim 1 comprising: Rv1009-Rv3615c-Rv3136;Rv1009-Rv2034-Rv3136;Rv2034-Rv1009-Rv3136;Rv3136-Rv2034-Rv1009.
  • 9. The fusion protein according to claim 8 wherein: Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:4;Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:8;Rv3615c comprises the amino acid sequence set forth in SEQ ID NO: 11;Rv2628 comprises the amino acid sequence set forth in SEQ ID NO: 14;Rv2034 comprises the amino acid sequence set forth in SEQ ID NO: 16; andRv3136Nt comprises the amino acid sequence set forth in SEQ ID NO: 18.
  • 10. The fusion protein according to claim 8 wherein: Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:1 or SEQ ID NO:3;Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:6, or SEQ ID NO:7;Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:9 or SEQ ID NO:10;Rv2628 is encoded by a nucleotide sequence set forth in SEQ ID NO: 12 or SEQ ID NO:13;Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO: 15; andRv3136Nt is encoded by a nucleotide sequence set forth in SEQ ID NO:17.
  • 11. The fusion protein according to claim 8 wherein: Rv1009-Rv3615c-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:39;Rv1009-Rv2034-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:41;Rv2034-Rv1009-Rv3136 comprises the amino acid sequence set forth in SEQ ID NO:47;Rv3136-Rv2034-Rv1009 comprises the amino acid sequence set forth in SEQ ID NO:49.
  • 12. The fusion protein according to claim 8 wherein: Rv1009-Rv3615c-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:38;Rv1009-Rv2034-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:40;Rv1009-Rv2628-Rv3615c-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:42;Rv1009-Rv2034-Rv2628-Rv3615c-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:44;Rv2034-Rv1009-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO:46;Rv3136-Rv2034-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:48;Rv1009-Rv3615c-Rv2034-Rv2628 is encoded by a nucleotide sequence set forth in SEQ ID NO:50;Rv3615c-Rv2034-Rv2628-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO: 52;Rv2034-Rv3615c-Rv2628-Rv3136 is encoded by a nucleotide sequence set forth in SEQ ID NO: 54;Rv3136-Rv2628-Rv3615c-Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO:56;Rv1009-Rv3136Nt-Rv2628-Rv2034-Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:58;Rv2034-Rv3615c-Rv3136Nt-Rv2628-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:60;Rv3615c-Rv2628-Rv1009-Rv3136Nt-Rv2034 is encoded by a nucleotide sequence set forth in SEQ ID NO:62;Rv1009-Rv2628-Rv3136Nt-Rv2034-Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:70;Rv2628-Rv3136Nt-Rv1009-Rv2034-Rv3615c is encoded by a nucleotide sequence set forth in SEQ ID NO:72; andRv2628-Rv3136Nt-Rv2034-Rv3615c-Rv1009 is encoded by a nucleotide sequence set forth in SEQ ID NO:74.
  • 13. A composition or fusion protein comprising at least two Mycobacterium tuberculosis (Mtb) antigens, wherein the Mtb antigens are Rv1733 and Rv2626c.
  • 14. The fusion protein according to claim 13 comprising Rv1733-Rv2626c.
  • 15. A composition or fusion protein comprising at least three Mycobacterium tuberculosis (Mtb) antigens, wherein the Mtb antigens are chosen from Ag85A, Ag85B, Rv3407, RpfA, RpfC, and RpfD.
  • 16. The composition or fusion protein according to claim 15 comprising: Ag85A, Ag85B, and Rv3407 Mtb antigens; orRpfA, RpfC, and RpfD Mtb antigens.
  • 17. The fusion protein according to claim 16 comprising: Ag85A-Ag85B-Rv3407; orRpfA-RpfC-RpfD.
  • 18. A pharmaceutical composition comprising the composition or fusion protein according to claim 1 and a pharmaceutically acceptable carrier.
  • 19. A vector comprising a nucleic acid molecule encoding the fusion protein according to claim 1.
  • 20. A cell comprising the vector according to claim 19.
  • 21. A method of eliciting an immune response against Mycobacterium tuberculosis in a mammal comprising administering to the mammal an immunologically sufficient amount of the composition or fusion protein according to claim 1.
  • 22. The method of claim 21 wherein the composition or fusion protein is administered to the mammal in the form of an aerosol.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. provisional application Ser. No. 62/350,837 filed Jun. 16, 2016, which is incorporated herein by reference in its entirety.

Provisional Applications (1)
Number Date Country
62350837 Jun 2016 US
Continuations (2)
Number Date Country
Parent 16541518 Aug 2019 US
Child 18316631 US
Parent 15624853 Jun 2017 US
Child 16541518 US