Comparative mycobacterial genomics as a tool for identifying targets for the diagnosis, prophylaxis or treatment of mycobacterioses

Abstract
The present invention is directed to a method of selection of purified nucleotidic sequences or polynucleotides encoding proteins or part of proteins carrying at least an essential function for the survival or the virulence of mycobacterium species by a comparative genomic analysis of the sequence of the genome of M. tuberculosis aligned on the genome sequence of M. leprae and M. tuberculosis and M. leprae marker polypeptides of nucleotides encoding the polypeptides, and methods for using the nucleotides and the encoded polypeptides are disclosed.
Description


BACKGROUND OF THE INVENTION

[0002] The present invention is directed to a method of selection of purified nucleotidic sequences or polynucleotides encoding proteins or part of proteins carrying at least an essential function for the survival or the virulence of mycobacterium species by a comparative genomic analysis of the sequence of the genome of M. tuberculosis aligned on the genome sequence of M. leprae. The selection by the method of the invention of these nucleotidic or peptidic sequences of interest which are encoding said essential functions of mycobacterium leads to identify and characterize specific antigens or regulator sequences, said antigens being chosen as potential candidates for an immunogenic or vaccine composition, but also useful to determine novel potential drug targets for the pharmaceutical industry. The molecules having essential functions encoded by these genes or corresponding to regulatory elements represent also new highly specific targets for chemotherapy. The sequence of the polynucleotides according to the invention have the particularity to be maintained during the evolution of the mycobacterium and therefore have been highly conserved in pathogenic mycobacterium species. The invention is directed to purified nucleic acid selected by the method of the invention as well as the purified polypeptides with essential functions for the survival or the virulence of mycobacterium species encoded by these sequences. In a preferred embodiment, the invention is directed to genes that code for essential proteins for which the functions have been attributed. The invention is also directed to a process for the production of recombinant polypeptides and chimeric polypeptides comprising them, antibodies generated against these polypeptides, immunogenic or vaccine compositions comprising at least one polypeptide useful as protective antigens or capable to induce a protective response in vivo or in vitro against mycobacterium infections, immunotherapeutic compositions comprising at least such a polypeptide according to the invention, and the use of such nucleic acids and polypeptides in diagnostic methods, vaccines, kits, or antimicrobial therapy.


[0003] To illustrate the new approach of comparative mycobacterial genomics for identifying essential molecules as regulator nucleotidic sequences and proteins for the survival or the virulence of mycobacterium species, the inventors made several examples which will not limit the scope of the present invention. A comparative genomic analysis, which permitted the inventors to select the sequences encoding essential molecules as regulatory nucleotidic sequences and proteins for the survival or the virulence of mycobacterium species, has been made by analysis of the complete genome sequence of both Mycobacterium tuberculosis and Mycobacterium leprae. The whole genome comparisons led also to the identification of genes that are present in both M. tuberculosis and M. leprae but have no counterparts elsewhere. The polypeptides having essential functions for the survival or the virulence mycobacterium species are characterized by at least 40% identity at the protein level and at least 70% identity at the gene level between both genomic sequences. The amino acid sequences have been compared using the program GAP, “GCG” (Genetic Computer Group) from Program Manual (UNIX), Wisconsin Sequence Analysis Package™, Algorithm of Needleman and Wunsch. (J.Mol.Biol.48:443, 1970) The parameters are chosen as follows:


[0004] For amino acid comparisons:


[0005] Gap penalty: 5


[0006] Gap extension penalty: 0.30


[0007] Length: the sequence to be compared are the following XXX SED ID NO:XXX and having XXX amino acids.


[0008] For nucleotide comparisons:


[0009] Gap penalty: 50


[0010] Gap extension penalty: 3


[0011] Also the parameters could be adapted case by case.


[0012] Other techniques are known by the man of the art for the comparison of sequences. We can refer to the algorithm of Smith and Wateman (Ad. App. Math. 2: 482, 1982), the method of search of similarities of Pearson and Lipman (Proc. NatI. Acad. Sci. USA 85: 2444, 1988), Zhang et al. “a greedy algorithm for aligning DNA sequences” (J. Comp. Biol. 2000, Feb-Apr. 7 (1-2) p203-214), these algorithms are used by the way of informatic tools (GAP, BLASTP, BLASTN, BLASTX, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Sciences Dr., Madison Wis.).


[0013] The recombinant clones carrying DNA from Mycobacterium tuberculosis and Mycobacterium leprae strains containing genomic sequences of said bacteria, have been deposited at the Collection Nationale de Cultures de Microoganismes (C.N.C.M.), of Institut Pasteur, 28, rue du Docteur Roux, F-75724 Paris cedex 15, France, and are designated as following.


[0014] HRV37 genomic library, deposited on Nov. 19, 1997 . . . under the accession number I-1945;


[0015] A recombinant BAC containing a fragment of M. tuberculosis genome deposited at the C.N.C.M. under the accession number I-2625.


[0016] A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001, at the C.N.C.M. under the accession number I-2626.


[0017] A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001, at the C.N.C.M. under the accession number I-2627.


[0018] A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001, at the C.N.C.M. under the accession number I-2628.


[0019] A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001, at the C.N.C.M. under the accession number I-2629.


[0020] A recombinant cosmid containing a fragment of M. leprae genome deposited on Feb. 21, 2001, at the C.N.C.M. under the accession number I-2632.


[0021] A recombinant cosmid containing a fragment of M. leprae genome deposited on Feb. 21, 2001, at the C.N.C.M. under the accession number I-2633.


[0022] Leprosy, one of the oldest recorded diseases, remains a major public health problem. Although prevalence has been reduced extensively by WHO multidrug therapy and vaccination with BCG1,2, the incidence of the disease remains worrying with more than 690,000 new cases annually3 in the world. Leprosy was common in Europe in the middle ages but gradually disappeared.


[0023] In 1873, in the first convincing association of a microorganism with a human disease, Armauer Hansen4 discovered the leprosy bacillus in skin biopsies but failed to culture Mycobacterium leprae. A century later, the nine-banded armadillo5 was used as a surrogate host, enabling large quantities of the bacillus to be isolated for biochemical and physiological studies. Subsequent efforts to demonstrate multiplication in synthetic media have been equally fruitless altough metabolic activity can be detected6. The exceptionally slow growth of the bacillus, which has a doubling time of ˜14 days 7, may contribute to these failures.


[0024] The means of transmission of leprosy is uncertain but, like tuberculosis, it is believed to be spread by the respiratory route since lepromatous patients harbour bacilli in their nasal passages. The bacterium accumulates principally in the extremities of the body where it resides with macrophages and infects the Schwann cells of the peripheral nervous system. Lack of myelin production by infected Schwann cells, and their desctruction by host-mediated immune reactions, leads to nerve damage, sensory loss and the disfiguration that, sadly, are hallmarks of leprosy.


[0025] There is no data or technical information in the prior art which permit to select specifically potential new targets and protective antigens for new drugs and vaccine compositions to treat and prevent mycobacterial diseases, particularly tuberculosis and leprosy. Furthermore, there is a need for the development of new tools for the selection of genes which are encoding for essential proteins or regulatory nucleotidic sequences in the survival or infection of mycobacterium species and useful for the design of antituberculosis drugs and vaccines based on the knowledge of comparative mycobactertial genomics.



SUMMARY OF THE INVENTION

[0026] The invention aids in fulfilling these needs in the art. The method according to the invention has the advantage to reduce drastically the number of potential new targets and protective antigens by giving for the first time an exhaustive description of conserved proteins in the tuberculosis and leprae bacilli. The isolated polynucleotides and proteins described in the present invention, which are highly conserved in both genomic sequences of M. tuberculosis and M. leprae, are by this characteristic essential for the survival or the virulence of these mycobacteria in the host. The identification of antigens and potentially therapeutic targets has been made on an evolutionary basis by a method of comparative genomic analysis.


[0027] This invention provides a method for the identification and the selection of essential genes for the survival or the virulence of mycobacterium species which comprises:


[0028] a. Aligning the genomic sequence of a first mycobacterium species on a genomic sequence of the genomic sequence of a second mycobacterium species,


[0029] b. Selecting a polypolynucleotide sequence highly conserved in both genomes with no counterparts in other bacterial genomic sequences and which corresponds to an essential gene for the survival or the virulence of mycobacterium species, and


[0030] c. Optionaly, testing the polypolynucleotide selected in step b) for its capacity of virulence or involved in the survival of a mycobacterium species, said testing being based on the activation or inactivation of said polyucleotide in a bacterial host or said testing being based on the activity of the product of expression of said polynucleotide in vivo or in vitro.


[0031] This invention provides also a method for the identification and the selection in silico of essential genes for the survival or the virulence of mycobacterium species which comprises the following steps:


[0032] a. Aligning the genomic sequence of a first mycobacterium specie on a genomic sequence of the genomic sequence of a second mycobacterium specie, and


[0033] b. Selecting a polynucleotide sequence highly conserved in both genomes with no counterparts in other bacterial genomic sequences and which corresponds to an essential gene for the survival or the virulence of mycobacterium species.


[0034] Optionally, testing the polypolynucleotide selected in step b) for its capacity of virulence or involved in the survival of a mycobacterium species can be carried out, said testing being based on the activation or inactivation of said polynucleotide in a bacterial host or said testing being based on the activity of the product of expression of said polynucleotide in vivo or in vitro.


[0035] The method according to the invention permits also to determine the polynucleotidic sequences, which encode for polypeptides and regulatory sequences essential for the virulence and/or the survival of mycobacterium which are, in one hand, specific to Mycobacterium tuberculosis and, in the other hand, specific to Mycobacterium leprae, that is to say, said polynucleotidic sequences are not found in publicly accessible banks of non-Mycobacterium tuberculosis and non-Mycobacterium leprae genome.


[0036] A gene according to the invention is a defined nucleotidic sequence, which contains an open reading frame with base composition, codon usage, GC skew and other features typical of a microorganism, preferably a mycobacterium. The definition of gene according to the invention comprises nucleotidic sequences, which encode an antigen or a fragment thereof, or nucleotidic sequences, which encode for essential polypeptide with essential function in the host, or nucleotidic sequence, which encodes polypeptide with regulation function in the bacteria, by example, in the DNA expression or in the transcription. An essential function for a polypeptide in bacteria according to the invention comprises functions implicated in the survival or in the virulence of the bacteria.


[0037] In a preferred embodiment the first genomic sequence of mycobacterium belongs to Mycobacterium tuberculosis. The Mycobacterium microti is a Mycobacterium which infect the vole. It has a genome sequence close to the sequence of Mycobacterium tuberculosis (Cole et al. (1998, Nature, 393, 537-544)) and therefore in a second preferred embodiment, the first genomic sequence of Mycobacterium microti belongs to Mycobacterium genus.


[0038] In another preferred embodiment the second genomic sequence of mycobacterium belongs to Mycobacterium leprae.


[0039] In a preferred embodiment, the method according to the invention comprises the complete genomic sequence of said mycobacterium species which is analysed. This invention provides purified polypolynucleotide molecule obtained by the method according to the invention.


[0040] Further, this invention provides a purified polynucleotide molecule according to the invention which encodes essential proteins or fragments of proteins of Mycobacterium species.


[0041] The invention also encompasses a purified polynucleotide molecule of a formula selected from the group consisting of polynucleotidic sequences, which encode for polypeptides and regulatory sequences essential for the virulence and/or the survival of mycobacterium which are, in one hand, specific to Mycobacterium tuberculosis and, in the other hand, specific to Mycobacterium leprae, that is to say, said polynucleotidic sequences are not found in publicly accessible banks of non-Mycobacterium tuberculosis and non-Mycobacterium leprae genome. In a preferred embodiment, this purified polynucleotide is obtained by the method according to the invention.


[0042] The invention emcompasses a purified polypolynucleotide molecule that hybridizes to either stand of a denatured, double-stranded DNA comprising the purified polynucleotide sequence according to the invention under conditions of moderate stringency in 50% formamide and 6× SSC at 42° C. with washing conditions of 60° C., 0.5×SSC, 0.1% SDS.


[0043] This invention provides a purified polypeptide of a formula selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO: 644.


[0044] This invention also provides a purified nucleic acid molecule encoding a polypeptide of a formula selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:644.


[0045] The nucleic acid molecules of the invention, which include DNA and RNA, are referred to herein as “M. tuberculosis and M. leprae marker nucleic acids” or “M. tuberculosis and M. leprae marker DNA”. The polypeptides encoded by these molecules, which are referred to herein as “M. tuberculosis and M. leprae marker polypeptides,” have formulas selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:644.


[0046] Further, this invention provides a purified nucleic acid molecule that hybridizes to either strand of a denatured, double-stranded DNA comprising the nucleic acid molecule encoding the polypeptide of a formula selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO:644 under conditions of moderate stringency in 50% formamide and 6× SSC, at 42° C. with washing conditions of 60° C., 0.5×SSC, 0.1% SDS. This nucleic acid molecule that hybridizes under the stated conditions can be derived by in vitro mutagenesis of a M. tuberculosis and M. leprae marker nucleic acid of the invention.


[0047] The invention also encompasses purified nucleic acid molecules degenerate from M. tuberculosis and M. leprae marker nucleic acids as a result of the genetic code, purified nucleic acid molecules that are allelic variants of M. tuberculosis and M. leprae marker nucleic acids, and a species homolog of M. tuberculosis and M. leprae marker nucleic acids. The invention also encompasses recombinant vectors that direct the expression of these nucleic acid molecules and host cells transformed or transfected with these vectors.


[0048] The invention further encompasses methods for the production of M. tuberculosis and M. leprae marker polypeptides, including culturing a host cell under conditions promoting expression, and recovering the polypeptide from the culture medium. Especially, the expression of M. tuberculosis and M. leprae marker polypeptides in bacteria, yeast, plant, and animal cells is encompassed by the invention.


[0049] This invention also provides labeled M. tuberculosis and M. leprae marker polypeptides. Preferably, the labeled polypeptides are in purified form. It is also preferred that the unlabeled or labeled polypeptide is capable of being immunologically recognized by human body fluid containing antibodies to a mycobacterium. The polypeptides can be labeled, for example, with an immunoassay label selected from the group consisting of radioactive, enzymatic, fluorescent, chemiluminescent labels, and chromophores.


[0050] Immunological complexes between the M. tuberculosis and M. leprae marker polypeptides of the invention and antibodies recognizing the polypeptides are also provided. The immunological complexes can be labeled with an immunoassay label selected from the group consisting of radioactive, enzymatic, fluorescent, chemiluminescent labels, and chromophores.


[0051] Furthermore, this invention provides a method for detecting infection by mycobacteria. The method comprises providing a composition comprising a biological material suspected of being infected with a mycobacteria, and assaying for the presence of M. tuberculosis and M. leprae marker polypeptide of the mycobacteria. The polypeptides are typically assayed by electrophoresis or by immunoassay with antibodies that are immunologically reactive with M. tuberculosis and M. leprae marker polypeptides of the invention.


[0052] This invention also provides an in vitro diagnostic method for the detection of the presence or absence of antibodies, which bind to an antigen comprising a M. tuberculosis and M. leprae marker polypeptide of the invention or mixtures of the polypeptides. The method comprises contacting the antigen with a biological fluid for a time and under conditions sufficient for the antigen and antibodies in the biological fluid to form an antigen-antibody complex, and then detecting the formation of the complex. The detection step can further comprise measuring the formation of the antigen-antibody complex. The formation of the antigen-antibody complex is preferably measured by immunoassay based on Western blot technique, ELISA (enzyme linked immunosorbent assay), indirect immunofluorescent assay, or immunoprecipitation assay.


[0053] The polypeptides of this invention are thus useful as a portion of a diagnostic composition for detecting the presence of antibodies to antigenic proteins associated with mycobacteria. Thus, a diagnostic kit for the detection of the presence or absence of antibodies, which bind to the M. tuberculosis and M. leprae marker polypeptide of the invention or mixtures of the polypeptides, contains antigen comprising the M. tuberculosis and M. leprae marker polypeptide, or mixtures thereof, and means for detecting the formation of immune complex between the antigen and antibodies. The antigens and the means are present in an amount sufficient to perform the detection.


[0054] This invention also provides an immunogenic composition comprising a M. tuberculosis and M. leprae marker polypeptide of the invention or a mixture thereof in an amount sufficient to induce an immunogenic or protective response in vivo, in association with a pharmaceutically acceptable carrier therefor. A vaccine composition of the invention comprises a neutralizing amount of the M. tuberculosis and M. leprae marker polypeptide and a pharmaceutically acceptable carrier therefor.


[0055] In addition, the M. tuberculosis and M. leprae marker polypeptides can be used to raise antibodies for detecting the presence of antigenic proteins associated with a mycobacterium. Purified polyclonal or monoclonal antibodies that bind to M. tuberculosis and M. leprae marker polypeptides are encompassed by the invention.


[0056] The polypeptides of the invention can be also employed to raise neutralizing antibodies that either inactivate the mycobacteria, reduce the viability of a mycobacterium in vivo, or inhibit or prevent bacterial replication. The ability to elicit mycobacteria-neutralizing antibodies is especially important when the proteins and polypeptides of the invention are used in immunizing or vaccinating compositions to activate the B-cell arm of the immune response or induce a cytotoxic T lymphocyte response (CTL) in the recipient host, or other T cell mediated response.


[0057] Further, this invention provides a method for detecting the presence or absence of a mycobacterium comprising:


[0058] (1) contacting a sample suspected of containing bacterial genetic material of a mycobacterium with at least one nucleotide probe, and


[0059] (2) detecting hybridization between the nucleotide probe and the bacterial genetic material in the sample,


[0060] wherein said nucleotide probe is complementary to the full-length sequence of a purified M. tuberculosis and M. leprae marker nucleic acid of the invention.


[0061] Also, this invention provides a method of comparing genetic complements of different types of organisms, wherein the method comprises:


[0062] (a) providing a database including sequence libraries for a plurality of types of organisms, said libraries having multiple genomic sequences;


[0063] (b) providing one or more probe sequences encoding a polypeptide of a formula selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO:644;


[0064] (c) determining homologous matches between one or more of said probe sequences and one or more sequences of said sequences in said genomic libraries; and


[0065] (d) displaying the results of said determination.


[0066] The method can be carried out using a computer system comprising a database including sequence libraries for a plurality of types of organisms, wherein the libraries have multiple genomic sequences, and providing a database including the one or more probe sequences encoding a polypeptide of a formula selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO:644. The computer system includes a user interface capable of receiving sequence information from the sequence libraries and the probe sequence information for comparison and displaying the results of the comparison.







BRIEF DESCRIPTION OF THE DRAWINGS

[0067] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee. This invention will be described with reference to the drawings in which:


[0068]
FIG. 1 is a circular genome map. From the outside, circles 1, 2, clockwise and anticlockwise, genes on the − and + strands, respectively; circles 3 and 4, pseudogenes; 5 and 6, M. leprae specific genes; 7, repeat sequences; 8, G+C content; 9, G/C bias (G+C)/(G−C). See legend to FIG. 2 for colour code.


[0069]
FIG. 2 is a comparison of the proS loci of M. leprae and M. tuberculosis . A. The M. leprae proS region is shown above that of M. tuberculosis. Genes or operons are depicted by arrows while crosses denote pseudogenes. Note the absence of ugpAEBC and dinF from M. leprae and the presence of proS at this site. B. Domain structures of prolyl-tRNA synthetases of bacterial (M. tuberculosis ) or eukaryotic (M. leprae ) types after19.


[0070]
FIG. 3 shows distribution of genes by functional category. The number of complete (blue) and pseudogenes (red) within each category for M. leprae is shown. Data for M. tuberculosis (green) were taken from the published genome sequence8. Functional categories: 1. Small-molecule catabolism, 2. Energy Metabolism, 3. Central intermediary metabolism, 4. Amino acid biosynthesis, 5. Nucleosides and nucleotide biosynthesis and metabolism, 6. Biosynthesis of cofactors, prosthetic groups and carriers, 7. Lipid biosynthesis, 8. Polyketide and nonribosomal peptide synthesis, 9. Proteins performing regulatory functions, 10. Synthesis and modification of macromolecules, 11. Degradation of macromolecules, 12. Cell envelope constituents, 13. Transport/binding proteins, 14. Chaperones/Heat shock proteins, 15. Cell division proteins, 16. Protein and peptide secretion, 17. Adaptations and atypical conditions, 18. Detoxification, 19. Virulence determinants, 20. IS elements and phage derived proteins, 21. PE and PPE families, 22. Antibiotic production and resistance, 23. Cytochrome P450 enzymes, 24. Coenzyme F420-dependent enzymes, 25. Miscellaneous transferases, 26. Miscellaneous phosphatases, lyases and hydrolases, 27. Cyclases, and 28. Chelatases. Inset graph: The Y axis shows the number of genes within each functional category. The X axis shows the functional categories: 29. Conserved hypothetical proteins, 30. Hypothetical proteins which share no significant similarity with any protein currently in the databases.


[0071]
FIG. 4: Polynucleotidic sequence of the Mycobacterium tuberculosis H37Rv BAC clone, BAC-Rv221, deposited at the C.N.C.M. under the accession number I-2625, which corresponds to pBelo BACII with HindIII partial digest fragment from the genome of M. tuberculosis H37 Rv that starts at position 2,115,612 and extends to position 2,198,604 according to Cole et al. (1998, Nature, 393, 537-544). All of the M. tuberculosis genes contained herein are of interest.


[0072]
FIG. 5: Polynucleotidic sequence of the Mycobacterium tuberculosis H37Rv BAC clone, BAC-Rv230, deposited at the C.N.C.M. under the accession number I-2626, which corresponds to pBelo BACII with HindIII partial digest fragment from the genome of M. tuberculosis H37 Rv that starts at position 1,336,764 and extends to position 1,411,979 according to Cole et al. (1998, Nature, 393, 537-544). All of the M. tuberculosis genes contained herein are of interest.


[0073]
FIG. 6: Polynucleotidic sequence of the Mycobacterium tuberculosis H37Rv BAC clone, BAC-Rv234, deposited at the C.N.C.M. under the accession number I-2627, which corresponds to pBelo BACII with HindIII partial digest fragment from the genome of M. tuberculosis H37 Rv that starts at position 2,847,864 and extends to position 2,928,420 according to Cole et al. (1998, Nature, 393, 537-544). All of the M. tuberculosis genes contained herein are of interest.


[0074]
FIG. 7: Polynucleotidic sequence of the Mycobacterium tuberculosis H37Rv BAC clone, BAC-Rv265, deposited at the C.N.C.M. under the accession number I-2628, which corresponds to pBelo BACII with HindIII partial digest fragment from the genome of M. tuberculosis H37 Rv that starts at position 514,402 and extends to position 599,515 according to Cole et al. (1998, Nature, 393, 537-544). All of the M. tuberculosis genes contained herein are of interest.


[0075]
FIG. 8: Polynucleotidic sequence of the Mycobacterium tuberculosis H37Rv BAC clone, BAC-Rv267, deposited at the C.N.C.M. under the accession number I-2629, which corresponds to pBelo BACII with HindIII partial digest fragment from the genome of M. tuberculosis H37 Rv that starts at position 1,124,621 and extends to position 1,169,811 according to Cole et al. (1998, Nature, 393, 537-544). All of the M. tuberculosis genes contained herein are of interest.


[0076]
FIG. 9: Polynucleotidic sequence of the Mycobacterium leprae cosmid which corresponds to pYUB18 with Sau3A partial digest fragment from the genome of M. leprae that starts at position 1,373,705 and extends to position 1,403,746. This sequence comprises the sequence of the Mycobacterium leprae cosmid MLCY 811 which corresponds to pYUB18 with Sau3A partial digest fragment of the genome of M. leprae deposited at the C.N.C.M. under the accession number I-2633 that starts at position 1,363,759 and extends to position 1,403,737 according to Cole et al. (2001, Nature, 409, 1007-1011).


[0077]
FIG. 10: Polynucleotidic sequence of the Mycobacterium leprae cosmid which corresponds to pYUB18 with Sau3A partial digest fragment of the genome of M. leprae that starts at position 3,160,443 and extends to position 3,194,161. This sequence comprises the sequence of the Mycobacterium leprae cosmid MLCY 047 which corresponds to pYUB18 with Sau3A partial digest fragment of the genome of M. leprae deposited at the C.N.C.M. under the accession number I-2632 that starts at position 3,160,458 and extends to position 3,194,087 according to Cole et al. (2001, Nature, 409, 1007-1011).


[0078]
FIG. 4 to 10 can be found in the APPENDIX hereto.







DETAILED DESCRIPTION OF THE INVENTION


Sequence Analysis of M. leprae

[0079] The complete genome sequence of M. leprae contains 3,268,203 bp, and has an average G+C content of 57.8%. These values are much lower than those reported for the M. tuberculosis genome, comprising ˜4,000 genes, 4,411,529 bp and 65.6% G+C8. On detailed pairwise comparisons of both genome and proteome sequences8, 9, it was immediately apparent that only 49.5% of the genome was occupied by protein-coding genes, while 27% contained recognisable pseudogenes, inactive reading frames with functional counterparts in the tubercle bacillus. The remaining 23.5% of the genome did not appear to be coding, and probably contains gene remnants mutated beyond recognition. The distribution of the 1,114 pseudogenes was essentially random (FIG. 1) and, after their exclusion, 1604 potentially active genes remained, of which 1,439 were common to both pathogens. Among the remaining 165 genes, with no ortholog in M. tuberculosis , were 29 for which functions could be attributed. Many of the 136 residual CDS in M. leprae , showing no similarity to known genes, may also represent pseudogenes as they are shorter than average and occur in regions of low gene density (FIGS. 1).



Reductive evolution

[0080] Assuming that the genome of M. leprae was once topologically equivalent and similar in size to those of all other mycobacteria (˜4.4 Mb)10-12, then extensive downsizing and rearrangement must have occurred during evolution. Loss of 1.1 Mb would eliminate ˜1100 CDS, and M. leprae would, therefore, be expected to produce 3000 proteins compared to the 4000 predicted in M. tuberculosis. On analysis of the proteome only 391 soluble protein species were detected13, compared to nearly 1800 in M. tuberculosis14, consistent with there being many pseudogenes. In conclusion, since diverging from the last common mycobacterial ancestor, the leprosy bacillus may have lost >2000 genes.


[0081] Reductive evolution is documented in obligate intracellular parasites, such as Rickettsia and Chlamydia spp., and in some endosymbionts15, as genes become inactivated once their functions are no longer required in highly specialised niches. This process may have naturally defined the minimal gene set for a pathogenic mycobacterium. The most extensive genome degradation reported previously was in Rickettsia prowazekii, the typhus agent, where only 76% of the potential coding capacity was used and 12 pseudogenes identified16. In comparison with M. leprae, the level of gene loss detected was modest, and it is striking that elimination of pseudogenes by deletion lags far behind gene inactivation in both pathogens. Intriguingly, the G+C content of M. leprae genes (60.1%) is higher than that of the pseudogenes (56.5%), and the remainder of the genome (54.5%). The high G+C content of M. leprae, and other mycobacteria, is apparently driven by the codon preference of active genes, while random mutation within non-coding regions results in drift towards a more neutral G+C content, closer to that of the host.



Mosaic Organisation and Horizontal Transfer

[0082] While the precise mechanism behind pseudogene formation in M. leprae is unclear, loss of dnaQ-mediated proof-reading activities of DNA polymerase III17 may have contributed. By contrast, there is extensive evidence for large scale rearrangements and deletions arising from homologous recombination events. Comparison with M. tuberculosis delineated ˜65 segments that show synteny but differ in their relative order and distribution in the M. leprae genome. Breaks in synteny generally correspond to dispersed repeats, tRNA genes or gene-poor regions. Copies of all three repeats, RLEP, REPLEP, and LEPREP, occur at the junctions of discontinuity suggesting that the mosaic arrangement of the M. leprae genome reflects multiple recombination events between related repetitive sequences. In some cases, aberrant recombination may have occurred as truncated repeats exist.


[0083] While there is little sequence similarity indicating that they are insertion sequences, RLEP is clearly capable of transposition since it exists within sequences corresponding to known genes. Unlike M. tuberculosis H37Rv, which contains at least two prophages and 56 intact or truncated IS elements8, 18, M. leprae has only three phage-like genes, all with M. tuberculosis orthologs, and 26 transposase gene fragments. However, some signs of horizontal transfer of genetic material were detected when the aminoacyl-tRNA synthetase genes were examined. With one exception, all of these are more closely related to M. tuberculosis enzymes than to those of any other organism. Surprisingly, prolyl-tRNA synthetase, encoded by proS, is more similar to the enzymes of Borrelia burgdorfieri and eukaryotes such as Drosophila, humans and yeast. It has been proposed that horizontal transfer of tRNA synthetase genes occurs frequently, and that the pathogen B. burgdorferi may have acquired proS from its host19. Comparison of the genetic context provides further support for this hypothesis as the M. leprae proS is both displaced and inverted with respect to the M. tuberculosis genome (FIG. 4), consistent with recent acquisition.



Multigene Families

[0084] Half of the genes (52%) present in M. tuberculosis arose from gene duplication events leading to extensive functional redundancy9. Many of these are involved in lipid metabolism or belong to the novel PE and PPE families, encoding unusual glycine-rich proteins of repetitive structure and unknown function. The latter are confined to certain mycobacterial species20, and represent sources of genetic and, possibly antigenic, variation8. The corresponding 167 genes are exceptionally GC-rich and occupy >8% of the M. tuberculosis genome21. By contrast, only 9 intact PE and PPE genes were found in M. leprae although 30 pseudogenes were present. No intact members of the PE-PGRS subfamily were found. This reduction partly contributes to the smaller genome size and the lower GC content of M. leprae . Recently, some PE-PGRS proteins were shown to be upregulated in Mycobacterium marinum during granuloma formation in frogs22. However, this effect is probably not mediated directly by the PE-PGRS, as granulomas are a prominent cytological feature of all forms of leprosy. Essentially all of the gene families9 have undergone extensive retraction in M. leprae and now encode “just-enough” activity to permit intracellular growth. Selected examples of this are given in Table 1, whereas the comprehensive comparison presented in FIG. 3 shows that all functional categories have shrunk.
1TABLE 1Selected examples of metabolic streamlining.M.tuberculosisM. lepraeM. lepraeFunctionPathwayGeneGenePseudogenegltA1, gltA2,gltA2citACitrate synthaseKrebs cyclecit, 4icd1, icd2icd2icd1IsocitrateKrebs cycledehydrogenaseic1, aceAaceAic1Isocitrate lyaseGlyoxy-latecyclegnd1, gnd2gnd1gnd26-PentosephosphogluconatedphosphateehydrogenasepathwaypfkA, pjkBpfkApfkBPhospho-GlycolysisfructokinaseaceE, lpdA,aceE, lpdlpdA, pdhA,Pyruvate,EnergypdhA, pdhB,(Rv0462)pdhB, pdhCdehydrogenasemetabolismpdhC, lpdcomplex(Rv0462)lldD1, lldD2lldD2lldD1L-lactateRespirationdehydrogenasemmaA1, mmaA2,mmaA1,mmaA2,MethyltransferaseMycolic acidmmaA3, mmaA4,mmaA4,nmaA3,modificationcmaA1, cmaA2,cmaA2, umaA2cmaA1, umaA1umaAl, umaA2glnA1, glnA2,glnA1, glnA2glnA3, glnA4GlutamineGlutamineglnA3, glnA4synthasebiosynthesismetA, metB,met4, metB,metCVariousMethioninemetC, metE,metE,biosynthesismetH, metK, metZmetH, metK,metZbfrA, bfrBbfrAbfrBBacterioferritinIron storageligA, ligB, ligCligAligBDNA ligaseDNA metabolism



Metabolic clues

[0085] Successive generations of microbiologists have failed to grow M. leprae in axenic culture leading to the notion that the bacterium lacks certain biosynthetic pathways. Complete genome comparisons shed new light on this. Lipid metabolism is prominent in the biochemical repertoire of M. leprae but to a lesser extent than in the tubercle bacillus whose cell envelope has a greater diversity of lipids, glycolipids and carbohydrates23.



Envelope Biogenesis

[0086] Mycolic acids, structural components of all mycobacteria, include the alpha mycolates, lacking oxygen functions, and the oxygenated keto- and methoxy- forms. Reappraisal of mycolic acid modification is now possible in the light of the reduced cmaA, mmaA and umaA gene-sets encoding the effector methyltransferases. M. leprae contains no methoxy-mycolates23, probably due to the loss of the MmaA2 and MmaA3 enzymes that attach the methoxy group in M. tuberculosis10, 24. However, the mycolic acids do contain cyclopropane functions25 that in M. tuberculosis are introduced by MmaA2 and CmaAI. Since both the mmaA2 and cmaA1 genes have decayed in M. leprae , cyclopropanation must be encoded by one of the related umaA genes. Recently, both umaA2 and cmaA2 were shown to be essential for the cyclopropanation function in M. tuberculosis26, The same enzymes also catalyse cyclopropanation in M. leprae as their duplicate copies are both inactive (Table 1).


[0087] Foremost among the outer lipids of the leprosy bacillus is phenolic glycolipid 1 (PGL1), an envelope component not found in M. tuberculosis27. PGL1 is derived from phthiocerol-dimycocerosate (PDIM), an esterified compound lipid generated by mycocerosic acid synthase and a type I polyketide synthase (PKS), by addition of three o-methylated deoxy sugars23. However, the genes for the glycosyltransferases, that modify PDIM to produce PGLI, could not be detected despite extensive comparisons. PDIM, a virulence factor in M. tuberculosis, requires the RND protein, MmpL7, for its transport across the cytoplasmic membrane9,28,29. Of the 18 PKS systems identifiable in M. tuberculosis8, only six were predicted in M. leprae and the number of mmpL genes (often linked to PKS genes) has decreased from 16 to five, presumably because they are no longer required for polyketide or lipid export. Deletion of such systems may be reflected in the lack of mycolipenic and hydroxylipenic acids, polyketides esterified to trehalose in M. tuberculosis. Further PKS missing from M. leprae include the mbt operon required for production of the salicylate-based mycobactin siderophores. Lipids, polyketides and aromatic compounds are often substrates, for cytochrome-P450 monooxygenases30, enzymes that are exceptionally abundant in M. tuberculosis8. Astonishingly, none of these is functional in M. leprae although a novel enzyme is predicted.



Lipolysis

[0088] Intracellular mycobacteria probably derive much of their energy from degradation of host-derived lipids31, a process initiated by lipases. In remarkable contrast to the 22 lip genes of M. tuberculosis, M. leprae has only two lipase genes, of which, lipG, clusters with mmaA genes and could, therefore, effect fatty acid remodelling. This appears to leave just one lipase for scavenging fatty acids. The enzyme LipE (ML1190) or its counterpart in M. tuberculosis (Rv3775) could represent an attractive drug target. In addition to the multifunctional FadA and FadB enzymes, which catalyse β-oxidation, M. tuberculosis has numerous alternative systems for fatty acid degradation8. Once again, M. leprae has roughly one third as many potential enzymes; however, there are three-times more FadD acyl-CoA synthases than FadE acyl-CoA dehydrogenases, whereas these are expected in equal amounts in M. tuberculosis. This may be explained by the dual role of FadD in β-oxidative and anabolic processes while FadE only participates catabolically.


[0089] The acetyl-CoA produced by β-oxidation, or glycolysis, flows into the central pathways of carbon metabolism in M. leprae . However, the pattern of “just enough” genes for each step is firmly established, so that the redundancy seen in M. tuberculosis almost never occurs. For instance, there is only one isocitrate lyase (with low predicted activity) capable of participating in the glyoxylate shunt (Table 1)32, 33, and one enzyme complex that oxidatively decarboxylates pyruvate to acetyl-CoA, compared to two such systems in M. tuberculosis. In the Krebs cycle, as in glycolysis, replicate genes for the same activity are deleted although differences in expression levels might compensate for some missing copies. Thus, while lack of pdh genes is reflected in a low rate of oxidative decarboxylation of pyruvate, isocitrate dehydrogenase activity is comparable in host-grown leprosy and tubercle bacilli34 even though a duplicate icd gene is inactivated in M. leprae.



Central and Energy Metabolism

[0090] Despite an active glyoxylate cycle, there appear to be fundamental differences elsewhere in anaplerotic pathways between M. leprae and M. tuberculosis. Here, phospho-enol-pyruvate (PEP) carboxylase replaces the pyruvate carboxylase of M. tuberculosis, and the malic enzyme, associated with fast growth in mycobacteria35, is missing. The metabolic implications are that flux between C3 and C4 compounds and the balance between glycolysis and gluconeogenesis will be very different. Another missing link between by-products of lipid metabolism and the Krebs cycle is the production of succinyl-CoA by catabolic Acc carboxylases predicted for M. tuberculosis8. Other carbon sources lost to M. leprae are acetate, as ackA, pta and acs are all inactive, and galactose, so the cell wall galactan can only be produced from glucose since the galK, T genes are missing. This might imply that M. leprae is limited to growth on very few carbon sources, or even a limited and rather specialised combination, on which it can maintain a balanced carbon metabolism. Though a similar range of potential substrates is available to both M. leprae and M. tuberculosis in the host, marked differences in their ability to exploit them are apparent on examination of the systems involved in carbon and nitrogen compound degradation: there are fewer oxidoreductases, oxygenases and short-chain alcohol dehydrogenases, and their probable regulatory genes, (FIG. 3). The inescapable conclusion is that catabolism in M. leprae is severely limited.


[0091] In the same vein, the leprosy bacillus has lost anaerobic and microacrophilic electron transfer systems, such as formate dehydrogenase, nitrate, and fumarate reductase together with the biosynthetic and transport systems required to produce the cognate prosthetic groups. Likewise, the aerobic respiratory chain of M. leprae is truncated as only the 3′-end of the NADH oxidase operon, nuoA-N, remains. The consequences of this event are far-reaching, for not only has the potential to produce ATP from the oxidation of NADH been lost, but also regeneration of NAD+ may be limited, relying heavily on ndh, which is involved only in recycling NAD+. Alternatively, M. leprae may regenerate NAD+ from NADH by (1) diverting pyruvate to acetate and CO2 using lactate dehydrogenase and lactate oxidase; (2) diverting PEP to malate or fumarate via oxaloacetate, using its PEP carboxylase (an enzyme not found in the tubercle bacillus) that only catalyses the reaction in this direction. Given the loss of genes reviewed above, the acids produced by (1) and (2) cannot be recycled and must be excreted.



Anabolism

[0092] In surprising contrast, all the anabolic pathways seem to be relatively intact. With few exceptions, complete enzyme systems are predicted for synthesis of amino acids, purines, pyrimidines, nucleosides, nucleotides, most vitamins and cofactors. This suggests that the availability of these metabolites in phagosomes is either highly limited or that M. leprae cannot transport them efficiently. It also sets the biology of the leprosy bacillus apart from that of the other obligate parasites for which genomes have been sequenced15, 16. M., leprae may, however, be auxotrophic for methionine as metC, encoding cystathionine β-lyase, is a pseudogene, whereas the other counterparts of M. tuberculosis met genes are all intact. This requirement for methionine may be dictated by the inactivation of the sulphate transport operon, cysYWA, and in turn this implies that M. leprae depends upon an organic source of sulphur. A second auxotrophy that is predicted is for cobinamide, as examination of the cob genes shows selective deletion of those to make cobinamide, while the genes needed to produce vitamin B12 from cobinamide are retained.



Pathogenesis and Disease Control

[0093] Central to a successful pathogenic lifestyle is the ability to obtain iron. M. leprae has many genes for haem and iron-based proteins and employs the iron regulatory systems, ideR and furB, yet may be severely handicapped compared to M. tuberculosis as it appears to have lost the mbt operon, encoding the non-ribosomal peptide synthase required for production of the iron-scavenging siderophores, mycobactin/exochelin8, 36, 37. However, part of the iron uptake system is functional in M. leprae, as it transports exochelinMN, from M. neoaurum but not those of M. smegmatis or M. tuberculosis38. The genes for exochelinMN are unknown and seem unlikely to occur in M. leprae.


[0094] As might be expected given the differences in their respective pathologies, M. leprae contains several enzymes that have no counterparts in the tubercle bacillus, including a eukaryotic-like uridine phosphorylase and adenylate cyclase. In addition, there are two transport systems that may play significant physiological roles: an ABC-transporter for sugars, and a second member of the Nrampl family, involved in divalent metal ion uptake. M. leprae may have acquired another Nrampl gene39 to ensure adequate intracellular iron concentrations resulting from its lack of mycobactin siderophores.


[0095]

M. leprae,
shows a marked tropism for myelin-producing Schwann cells, and a surface-exposed 21 kDa laminin-binding protein (LBP) may be an important virulence factor40-42. Inspection of the genome sequence revealed a single LBP gene and this also occurs in M. tuberculosis . No further candidates for virulence genes were detected, and many of those present in M. tuberculosis have been inactivated or lost, including three of the Mce operons encoding putative invasins9, 43. Although the leprosy and tubercle bacilli both survive within macrophages, M. leprae has no catalase-peroxidase44, and fewer peroxidoxins and epoxide hydrolases to combat reactive oxygen species. It has retained both superoxide dismutases suggesting that these may contribute to its survival.



Comparative Mycobacterial Genomics

[0096] Comparative genomics is a powerful new tool for exploring micobial evolution and identifying those genes that might encode new drug targets or protective antigens. Coupled with knowledge derived from bioinformatic analysis of the proteome, and understanding of the underlying microbiology, it is possible to reduce the number of potential new targets within a pathogen to a more tangible level.


[0097] This invention includes discoveries resulting from the findings of a comparative analysis in which gene and protein sets of the leprosy and tubercle bacilli have been compared pairwise, and against the completed genome sequences of various prokaryotes and eukaryotes.


[0098] The genome of M. leprae , an exceptionally slow growing bacterium, is substantially smaller than that of M. tuberculosis and contains numerous pseudogenes. While the genome of M. tuberculosis comprises 4.41 Mb and contains around 4,000 genes, the genome of M. leprae is only 3.27 Mb and a mere 49.5% is occupied by protein-coding genes. About 27% of the M. leprae genome contains recognizable pseudogenes, inactive reading frames with functional counterparts in the tubercle bacillus. The remaining 23.5% of the genome does not appear to be coding, and probably contains gene remnants mutated beyond recognition. The distribution of the 1,114 pseudogenes was essentially random throughout the chromosome. 1,604 potentially active genes were identified, of which 1,439 were common to both pathogens. Among the remaining 165 genes, with no ortholog in M. tuberculosis, were 29 for which functions could be attributed. Many of the 136 residual CDS in M. leprae, showing no similarity to known genes, may also represent pseudogenes as they are shorter than average and occur in regions of low gene density. In summary, assuming that all mycobacteria are descendants of a common ancestor, M. leprae has probably lost around 2,000 genes during evolution and the minimal gene set required by a pathogenic mycobacterium has been naturally defined.


[0099] Whole genome comparisons led to the identification of genes that are present in both M. tuberculosis and M. leprae but have no counterparts elsewhere. Since these pathogenic mycobacteria occupy similar niches in the human body where they encounter the same physiological stresses and immune responses, it is conceivable that the products of some of these genes may conduct highly specialized functions that could be essential for intracellular growth of mycobacteria. If this were the case, the corresponding proteins or enzymes might represent novel drug targets. In addition to those proteins that are confined to the species or the genus, there is a second group of polypeptides that also occur in Streptomyces spp., related members of the Actinomycetales kingdom, but not in other prokaryotes. It is reasonable to assume that such proteins confer specific properties on actinomycetes.


[0100] Knowledge of the subcellular location of proteins is particularly valuable for the design of new tuberculosis vaccines since it is widely believed that surface-exposed or secreted proteins correspond to those antigenic components that are first encountered by the immune system during infection51. Bioinformatics has also been used to identify proteins that localize to the cell envelope and these include transmembrane proteins with hydrophobic domains, and lipoproteins with N-terminal cysteine residues that are modified by addition of lipid groups. Proteins that are secreted via the general secretory pathway52 are readily identifiable by their characteristic signal peptides, whereas those metallo-enzymes that are secreted by the twin arginine transporter system, Tat, can be recognized by the presence at the N-terminus of the cognate motif, S/TRRXFLK preceeding the signal peptide53, 54. This will be discussed further below.


[0101] Other proteins that lack signal peptides and are secreted from mycobacteria in a Sec-independent manner, include those belonging to the ESAT-6 family61. ESAT-6 is a potent T-cell antigen that induces strong Th1-type responses55 and has been extensively studied as a potential diagnostic reagent for infection56, since its gene is missing from BCG57, 58, 59, and as a component of a subunit vaccine60. The comparative genomic analysis identified several ESAT-6 proteins, and their potential secretion machinery, that were common to both M. tuberculosis and M. leprae (Table 2).


[0102] Several examples are given in Table 2 of proteins of limited distribution with potential drug targets, diagnostic antigens or subunit vaccine components.


[0103] Legend of Table 2:


[0104] The reading of the first example, by instance,


[0105]

M. leprae



[0106] ML0048: Name of an identified ORF in the genome of M. leprae.


[0107]

M. tub.:



[0108] Rv3876: Name of Equivalent ORF in the genome of M. tuberculosis published in 1998.


[0109] BLASTP:


[0110] Method of comparing protein sequences for establishing their degree of similarity or identity.


[0111] 1,00E−79:


[0112] BLASTP score, which indicates how similar the protein sequences are. The analyses of the results are described in Cole et al. for the comparisons between the genome of M. tuberculosis and the genome of BCG (Analysis of the proteome of Mycobacterium tuberculosis in silico, tuber Lung. Dis. 1999; 79(6):329-42).


[0113] Description:


[0114] Description of the protein, identified from all publically accessible databases, with highest similarity for the M. leprae protein ML0048.


[0115] Sc3C3.03C: Nomenclature of the streptomyces protein.


[0116] EMB : AL031231: Accession number in EMBL databank for the sequence of the Streptomyces protein found to be most similar to ML0048.


[0117] 1083: length of the sequence in the EMBL databank.


[0118] FASTA score: Different method, like BLAST, for comparing sequences for their similarity.


[0119] Score denotes the degree of similarity.


[0120] 31.6%: Percentage of identity between C terminal part of the Streptomyces protein and the amino acid sequence of ML0048. This 31.6% identity is found in an overlapping region of 580 amino acids between the two sequences. The other examples should be read similarly.
2TABLE 2Proteins of limited distribution with potential as drug targets,diagnostic antigens or subunit vaccine componentsM.GrouplepraeM. tub.BLASTPDescriptionAML0048Rv38761,00E-C-terminal half of Streptomyces coelicolor SC3C3.03C,79hypothetical protein, TR:O86637 (EMBL:AL031231) (1083aa); Fasta score E( ): 5.9e-27, 31.6% identity in 580aa overlap, which contains Pro-Gln repeatsAML0115Rv37802,00E-S. coelicolor SCGD3.23C, hypothetical protein,44TR:Q9XA56 (EMBL:AL096822)AML0124Rv01642,00E-S. coelicolor SC6G10.02C, hypothetical protein,40TR:Q9X7Y8 (EMBL:AL049497) (144 aa); Fasta score E( ):7e-05, 21.9% identity in 137 aa overlap.AML0151Rv0948c2,00E-S. coelicolor SCD63.16C, hypothetical protein,25TR:CAB82023 (EMBL:AL161755)AML0169Rv0966c7,00E-S. coelicolor SCE6.30C, hypothetical protein,66TR:CAB88834 (EMBL:AL353832) (277 aa); Fasta score E( ):3.3e-20, 41.0% identity in 205 aa overlap.AML0229Rv3603c2,00E-N-terminal half of S. coelicolor SCE126.02C,60hypothetical protein, TR:Q9X845 (ENBL):AL049630) (420aa); Fasta score E( ): 4.1e-24, 36.7% identity in 294aa overlapAlsr2Rv3597c1,00E-S. coelicolor SCE94.26C, putative lsr2-like protein,27TR:Q9X8N1 (EMBL:AL049628) (111 aa); Fasta score E( ):7.3e-18, 56.3% identity in 112 aa overlapAML0284Rv0360c3,00E-S. coelicolor SCH10.25C, hypothetical protein,23TR:Q9X8R4 (EMBL:AL049754)AwhiB3Rv34166,00E-Transcriptional regulator38AlppSRv2518ce-135many predicted lipoproteins from S. coelicolor.AML0451Rv2609c2,00E-S. coelicolor e.g. SC2E1.17, hypothetical protein,85TR:O69888 (EMBL:AL023797) (172 aa); Fasta score E( ):2e-13, 43.3% identity in 150 aa overlap.AML0486Rv2588c2,00E-S. coelicolor SCL2.07C, putative secreted protein,19TR:CAB70919 (EMBL:AL137778) (169 aa); Fasta score E( )7.3e-08, 35.8% identity in 120 aa overlapAML0542Rv13906,00E-S. coelicolor SC9C5.02C, hypothetical protein,51TR:CAB93358 (EMBL:AL357523) (90 aa); Fasta score E( ):2e-18, 71.3% identity in 80 aa overlap.AML0561Rv14173,00E-Corynebacterium ammoniagenes ribX, hypothetical38protein, TR:O24754 (EMBL:AB003693) (184 aa); Fastascore E( ): 2.1e-15, 34.5% identity in 148 aa overlap.Contains hydrophobic, possible membrane-spanningregionsAML0580Rv1446c2,00E-hypothetical proteins from S. coelicolor e.g.64SCC22.20, hypothetical protein, TR:Q9XAB8(EMBL:AL096839) (351 aa); Fasta score E( ): 7.1e-21,36.0% identity in 203 aa overlap, although these havea short N-terminal extension relative to thishomologue.AML0603Rv2413c3,00E-S. coelicolor SCC123.02C, putative DNA-binding77protein, TR:Q9RDM2 (EMBL:AL136518) (336 aa); Fastascore E( ): 0, 39.3% identity in 326 aa overlap.AML0630Rv2365c2,00E-S. coelicolor SCC77.05, hypothetical protein,15TR:Q9RDF3 (EMBL:AL136503) (132 aa); Fasta score E( ):3.3e-06, 39.4% identity in 99 aa overlap.AML0642Rv3195e-143S. coelicolor SCE9.14C, hypothetical protein,TR:Q9X8I7 (EMBL:AL049841) (375 aa) ; Fasta score E( )4.9e-12, 24.9% identity in 305 aa overlap.AwhiB2Rv3260c9,00E-Transcription factor31AML0762Rv3258c4,00E-S. coelicolor hypothetical 15.0 kDa protein SCE34.11C41TR:CAB88914 (EMBL:AL353862) fasta scores: E( ): 4.8e-16, 47.0% id in 151 aa.AlpqBRv3244c0.0S. coelicolor putative lipoprotein SCE33.13CTR:CAB90922 (EMBL:AL355774) fasta scores: E( ):0.00039, 24.4% id in 624 aaAwhiB1Rv32196,00E-Transcription factor31AML0814Rv3208c3,00E-S. coelicolor hypothetical protein32gp|AL390975|AL390975_32 (94 aa) E( ): 2.5e-09; 47.945%identityAML0816Rv3207ce-101S. coelicolor putative membrane protein SC2H12.28c(314 aa) TR:CAB94652 (EMBL:AL359215) fasta scores:E( ): 1e-13, 30.2% id in 331 aaAML0857Rv22192,00E-S. coelicolor putative integral membrane protein59SC3H12.04 TR:CAB90843 (EMBL:AL355740) (234 aa) fastascores: E( ): 1.2e-26, 39.6% id in 230 aaAML0869Rv22064,00E-S. coelicolor putative integral membrane protein40SC5F7.32 TR:Q9S2R7 (EMBL:AL096872)AML0876Rv2199c2,00E-S. coelicolor hypothetical proteins e.g. putative43integral membrane protein SC6G10.27C TR:Q9X812(EMBL:AL049497) (132 aa) fasta scores: E( ) : 6.2e-15,38.8% id in 139 aaAML0920Rv2147c3,00E-pir||T34949 hypothetical protein SC4A10.12c -89Streptomyces coelicolorAML0921Rv2146c5,00E-S. coelicolor TR:Q9S2X3 (EMBL:AL109663) (94 aa); Fasta32score E( ): 2.9e-12, 40.7% identity in 86 aa overlap.Contains possible membrane spanning hydrophobicdomains.AML0986Rv2738c3,00E-S. coelicolor TR:O50484 (EMBL:AL020958) (64 aa); Fasta21score E( ) : 2.5e-08, 44.4% identity in 63 aa overlapAML0994Rv2728c1,00E-S. coelicolor TR:O69964 (EMBL:AL022268) (237 aa);56Fasta score E( ): 1.3e-13, 32.9% identity in 243 aaoverlapAML1009Rv2714e-106pir||T35742 hypothetical protein SC7H2.11c S.coelicolorAML1016Rv2708c1,00E-emb|CAB72193.1| (AL138851) hypothetical protein25SCE59.06c [S. coelicolor A3(2)] Length = 97AML1026Rv2699c2,00E-T34816 hypothetical protein SC2E9.05 SC2E9.05 - S.32coelicolor 144 2e-34AML1027Rv26981,00E-membrane protein, S. coelicolor TR:O5413233(EMBL:AL021530) (154 aa); Fasta score E( ) : 1.1e-08,33.6% identity in 149 aa overlap.AML1029Rv2696c7,00E-pir||T34821 hypothetical protein SC2E9.10 SC2E9.10 -69S. coelicolor 86 4e-16AML1041Rv26802,00E-pir||T34710 hypothetical protein SC1C3.18c SC1C3.18c -62S. coelicolor 158 5e-38AML1067Rv12119,00E-emb|CAC01346.1| (AL390975) conserved hypothetical23protein S. coelicolor 101 1e-21AML1093Rv12445,00E-lipoprotein, pir||T35857 probable secreted substrate-78binding protein - S. coelicolor 67 3e-10AML1105Rv1259e-115S. coelicolor TR:Q9S2L3 (EMBL:AL109732) (237 aa);Fasta score E( ): 0, 54.5% identity in 231 aa overlap.AML1117Rv1276c3,00E-pir||T36773 hypothetical protein SCI28.03c - S.53coelicolor 115 4e-25AML1147Rv13123,00E-possible secreted protein, emb|CAB94546.1| (AL359152)42putative secreted/membrane protein S. coelicolor 662e-10AML1166Rv13327,00E-S. coelicolor TR:Q9S2G6 (EMBL:AL096852) (202 aa);54Fasta score E( ) 1.5e-05, 34.6% identity in 188 aaoverlap.AML1230Rv1182e-149papA3, emb|CAC08383.1|(AL392176) hypothetical proteinS.coelicolor 132 8e-30AML1306Rv21255,00E-S. coelicolor TR:Q9S2K6 (EMBL:AL109732) (312 aa);87Fasta score E( ) 1.6e-07, 23.4% identity in 278 aaoverlapAML1321Rv2111c3,00E-upstream of bacterial proteasome beta subunits07including: Mycobacterium smegmatis TR:O30517(EMBL:AF009645) (64 aa); Fasta score E( ): 6.2e-18,82.8% identity in 64 aa overlap, RhodococcusAML1338Rv2673e-150conserved integral membrane protein, S. coelicolorTR:Q53873 (EMBL:AL031317) (411 aa); Fasta score E( )1.1e-12, 28.3% identity in 410 aa overlapAML1439Rv20504,00E-emb|CAB61670.1| (AL133213) hypothetical protein31SC6D7.18c. S. coelicolor 101 4e-21AML1485Rv2466c2,00E-S. coelicolor TR:CAB71809 (EMBL:AL138662) (216 aa);66Fasta score E( ): 0, 52.3% identity in 214 aa overlapAML1508Rv11552,00E-S. coelicolor TR:Q9XAG1 (EMBL:AL079356) (144 aa);48Fasta score E( ): 5.6e-25, 54.3% identity in 140 aaoverlap.AML1525Rv2771c8,00E-S. coelicolor TR:Q9RD46 (EMBL:AL133424) (151 aa);27Fasta score E( ): 1.3e-28, 56.1% identity in 148 aaoverlapAML1548Rv2795ce-132S. coelicolor TR:O88028 (EMBL:AL031107) (295 aa);Fasta score E( ): 0, 54.4% identity in 285 aa overlapAML1557Rv2840c2,00E-emb|CAB91141.1| (AL355913) hypothetical protein S.27coelicolor 46 7e-05AML1561Rv28441,00E-S. coelicolor TR:CAB91137 (EMBL:AL355913) (167 aa);39Fasta score E( ): 1.4e-07, 35.8% identity in 137 aaAML1624Rv29170.0S. coelicolor TR:Q9S3Y6 (EMBL:AF170560) (597 aa);Fasta score E( ): 0, 55.5% identity in 566 aa overlapAML1644Rv2235e-113N-terminal signal sequence plus membrane spanninghydrophobic domain; emb|CAB59445.1| (AL132644)putative membrane protein [Streptomyc . . . 109 4e-23AML1649Rv2239c3,00E-emb|CAB92846.1| (AL356892) hypothetical protein36[Streptomyces Co . . . 137 6e-32AML1652Rv22420.0S. coelicolor TR:Q9RDP8 (EMBL:AL133423) (401 aa);Fasta score E( ) : 4.3e-26, 42.0% identityAML1666Rv2968c9,00E-S. coelicolor putative integral membrane protein59TR:CAB93387 (EMBL:AL357523) (240 aa); Fasta score E( ):3.6e-25, 36.1% identity in 191 aa overlapAML1698Rv3005c4,00E-conserved membrane protein, emb|CAB61735.1| (AL133220)54putative membrane protein. S. coelicolor 99 5e-20AML1706Rv3015c1,00E-S. coelicolor TR:Q9Z586 (EMBL:AL035569) (331 aa);91Fasta score E( ): 0, 38.6% identity in 337 aa overlap,AML1781Rv2256c4,00E-4pir||T11215 hypothetical protein 5 - Streptomyces62glaucescens >g . . . 153 1e-36AML1782Rv2257ce-121S. coelicolor SC4A7.08 TR:Q9RDQ4 (EMBL:AL133423) (273aa); Fasta score E( ): 0, 53.2% identity in 269 aaoverlapAML1791Rv1976c8,00E-S. coelicolor hypothetical protein SC1C3.03C TR:O6984525(EMBL:AL023702) (125 aa); Fasta score E( ): 4.3e-06,36.6% identity in 112 aa overlap.AML1908Rv06373,00E-S. coelicolor SCD82.07 TR:CAB77410 (EMBL:AL160431)62(150 aa); Fasta score E( ): 4.7e-11, 29.3% identity in150 aa overlap.AML1910Rv06359,00E-emb|CAB77410.1| (AL160431) hypothetical protein49SCD82.07 S. coelicolor 83 1e-15AML1926Rv04316,00E-S. coelicolor hypothetical protein SCD95A.2024TR:CAB93047 (EMBL:AL357432) (84 aa); Fasta score E( ):4.1e-11AML1927Rv04302,00E-S. coelicolor hypothetical protein SCD95A.2025TR:CAB93047 (EMBL:AL357432) (84 aa) ; Fasta score E( )4.1e-11, 52.8% identity in 72 aa overlap.AML1997Rv09707,00E-S. coelicolor putative integral membrane protein39SCM2.15CAML2030Rv1884c1,00E-Rpf, emb|CAC09538.1| (AL442120) putative secreted34protein S. coelicolor 108 5e-23AML2031Rv1883c1,00E-Streptomyces actuosus NSH-OrfB TR:P72384 (EMBL:U75434)44fasta scores: E( ): 2.5e-08, 34.4% in 125 aaAML2048Rv1871c2,00E-S. coelicolor hypothetical protein TR:CAB8843414(EMBL:AL353815) fasta scores: E( ): 0.0092, 39.3% in 61aa; truncated at C-terminus; may represent apseudogeneAML2063Rv1846c3,00E-possible regulator, pir||T36388 hypothetical protein35SCE94.28c - S. coelicolor 64 6e-10AML2064Rv1845c3,00E-S. coelicolor putative integral membrane protein82SC10A7.04 TR:Q9XAS1 (EMBL:AL078618) fasta scores: E( ):1.8e-19, 32.6% in 328 aaAML2073Rv18302,00E-S. coelicolor hypothetical 19.1 kda protein74TR:CAB88877 (EMBL:AL353861) fasta scores: E( ): 3.7e-30, 64.8% in 145 aaAML2075Rv18287,00E-S. coelicolor hypothetical 26.5 kda protein71TR:CAB88879 (EMBL:AL353861) fasta scores: E( ): 1.1e-14, 41.4% in 237 aa.AML2114Rv09097,00E-S. coelicolor hypothetical 9.9 kda protein TR:O6996507(EMBL:AL022268) fasta scores: E( ): 0.038, 41.3% in 46aaAML2135Rv0885e-123S. coelicolor putative membrane protein TR:Q9XAE8(EMBL:AL079356) fasta scores: E( ) : 1.5e-13, 27.1% in255 aaAML2137Rv0883c1,00E-S. coelicolor hypothetical 39.0 kda protein TR:O5052976(EMBL:AL009204) fasta scores: E( ): 2.2e-19, 36.0% in247 aaAML2142Rv08778,00E-S. coelicolor hypothetical 32.2 kda protein91TR:CAB93404 (EMBL:AL357524) fasta scores: E( ): 2.5e-19, 43.3% in 270 aa.AML2143Rv0876ce-172S. coelicolor putative integral membrane proteinTR:CAB93403 (EMBL:AL357524) fasta scores: E( ): 5.3e-16, 38.8% in 448 aa.AML2151Rv0867c1,00E-Probable resusicitation-promoting factors, exported35proteinAML2156Rv0862c0.0S. coelicolor hypothetical 90.4 kda proteinTR:CAB93395 (EMBL:AL357524) fasta scores: E( ): 3.9e-27, 34.6% in 856 aaAML2193Rv08192,00E-Acetyltransferase (GNAT) family, emb|CAB88484.1|87(AL353816) putative acetyltransferase S. coelicolor216 3e-55AML2199Rv31181,00E-Saccharopolyspora erythraea hypothetical 10.2 kda28protein TR:Q54084 (EMBL:M29612) fasta scores: E( ):2.7e-16, 53.0% in 100 aaAML2200Rv0813c3,00E-S. coelicolor hypothetical 21.7 kda protein59TR:CAB94083 (EMBL:AL358692) fasta scores: E( ): 4.4e-11, 30.5% in 167 aaAML2204Rv0810c2,00E-S. coelicolor hypothetical 9.3 kda protein SCD25.24C13TR:Q9RKJ8 (EMBL:AL118514) fasta scores: E( ): 1.3e-06,46.8% id in 62 aa.AML2207Rv08078,00E-S. coelicolor hypothetical protein SCD25.20 TR:Q9RKK036(EMBL:AL118514) (202 aa) fasta scores: E( ): 6.6e-16,52.5% id in 101 aa.AML2219ARv0787A1,00E-S. coelicolor hypothetical protein SCD25.13 (AL118514)33AML2253Rv2145c1,00E-antigen 84 homolog, also in S. coelicolor, etc.06AML2261Rv0546c1,00E-emb|CAB95979.1| (AL360034) conserved hypothetical43protein S. coelicolor 119 1e-26AML2289Rv3662c7,00E-S. coelicolor putative oxidoreductase SCH5.22C64TR:Q9X924 (EMBL:AL035636) (274 aa) fasta scores: E( )1e-11, 40.9% id in 269 aaAML2295Rv3668c7,00E-emb|CAB61552.1| (AL133171) protease precursor S.67coelicolor 53 2e-06AML2296Rv36692,00E-Similar_to S. coelicolor putative integral membrane43transport protein SCH5.28 TR:Q9X930 (EMBL:AL035636)(162 aa) fasta scores: E( ): 3.3e-10, 37.3% idAML2306Rv3680e-110S. coelicolor putative ion-transporting ATPaseTR:Q9XA35 (EMBL:AL079353) (481 aa) fasta scores: E( ):0, 48.6% id in 432 aaAML2307Rv3681c4,00E-whiB428AML2330Rv3716c6,00E-pir||T35387 hypothetical protein SC66T3.30c - S.10coelicolor 47 6e-05AML2332Rv3718c1,00E-S. coelicolor conserved hypothetical protein TR:Q9ZBJ239(EMBL:AL035161) (147 aa) fasta scores: E( ) : 1.4e-22,47.6% id in 147 aa.AML2410Rv0528e-160conserved membrane protein, emb|CAC08381.1| (AL392176)putative integral membrane protein S. coelicolor 2212e-56AML2425Rv0504c7,00E-emb|CAB77410.1| (AL160431) hypothetical protein52SCD82.07 [Strept . . . 73 2e-12AML2428ARv0500B6,00E-Small, strongly basic, S. coelicolor SCE68.25c,17gp|AL079345|AL079345_25 S. coelicolor (32 aa) E( ):1.7e-07; 93.103%AML2432Rv0498e-101S. coelicolor hypothetical protein TR:Q9X8H0(EMBL:AL049819) (285 aa) fasta scores: E( ): 3.2e-30,51.6% id in 273 aaAML2435Rv0495c7,00E-S. coelicolor hypothetical protein TR:Q9X8H294(EMBL:AL049819) (271 aa) fasta scores: E( ): 0, 48.4%id in 250 aaAML2442Rv04871,00E-emb|CAC04041.1| (AL391406) conserved hypothetical47protein S. coelicolor 142 2e-33AML2446Rv0483e-137possible lipoprotein, S. coelicolor putativelipoprotein TR:CAB76012 (EMBL:AL157916) fasta scores:E( ): 2.5e-24, 28.6% id in 405 aa.AML2453Rv04769,00E-conserved membrane protein, emb|CAC04036.1| (AL391406)22putative membrane protein S. coelicolor 57 3e-08AML2522Rv03095,00E-S. coelicolor putative secreted protein SCL24.0865TR:CAB76092 (EMBL:AL157956)AML2529Rv0290e-116S. coelicolor putative integral membrane proteinSC3C3.21 TR:O86654 (EMBL:AL031231) fasta scores: E( ):1.9e-05, 23.8% id in 483 aaAML2566Rv0241ce-125S. coelicolor putative dehydratase TR:CAB77291(EMBL:AL160312)AML2630Rv00074,00E-emb|CAB92992.1| (AL357152) putative integral membrane06protein S. coelicolor 69 5e-11AML2640Rv01463,00E-pir||T35930 hypothetical protein SC9B5.10 - S.93coelicolor 141 1e-32AML2664Rv0116c1,00E-possible secreted protein, pir||T35535 probable72secreted protein - S. coelicolor 154 7e-37AML2687Rv0051e-150conserved membrane protein, pir||T36589 probabletransmembrane protein - S. coelicolor 185 1e-45AML2699Rv39090.0putative secreted protein, pir||T36582 hypotheticalprotein SCH24.17c - S. coelicolor 90 8e-17MML0007Rv00076,00E-Putative membrane protein51MML0012Rv0010c4,00E-Contains hydrophobic, possible membrane-spanning30regions.MML0013Rv0011c3,00E-Contains hydrophobic, possible membrane-spanning36regions.MML0022Rv0020ce-114MML0030Rv0039c9,00E-putative membrane protein06MML0031Rv0040c3,00E-Contains a probable N-terminal signal sequence48MML0042Rv3882ce-138putative membrane proteinMML0044Rv3880c2,00E-19MML0047Rv3877e-146putative membrane proteinMML0049Rv38755,00E-possible secreted protein, ESAT-614MML0050Rv38744,00E-possible secreted protein, ESAT-612MML0051Rv38731,00E-PPE-family protein30MML0054Rv3869e-151putative membrane proteinMML0056Rv38672,00E-13MML0068Rv38508,00E-71MML0069Rv38494,00E-41MML0071Rv38472,00E-65MML0073Rv3843c3,00E-putative membrane protein51MML0081Rv3835e-107putative membrane protein, possible membrane-spanningregion near the N-terminus.MML0091Rv38101,00E-erp, pirG, exported repetitive protein precursor39MML0093Rv3808c0.0MML0094Rv3807c6,00E-putative membrane protein30MML0096Rv3805c0.0putative membrane proteinMML0099Rv3802c8,00E-96MembBRv37950.0arabinosyl transferaseMembARv37940.0arabinosyl transferaseMembCRv37930.0arabinosyl transferaseMML0107Rv37920.0Nycobacterium smegmatis ORF3, hypothetical membraneproteinMML0116Rv3779e-179putative membrane proteinMML0133Rv2949c3,00E-Pfam match to entry PF01947 DUF98, Protein of unknown64functionMlppXRv2945c6,00E-putative lipoprotein60MML0158Rv09544,00E-34 kDa antigen, membrane protein20MML0159Rv09552,00E-putative membrane protein74MML0185Rv09962,00E-possible membrane-spanning regions74MML0187Rv0998e-124Cyclic nucleotide-binding domain.MML0199Rv3647c2,00E-52MML0208Rv36322,00E-putative membrane protein38MML0227Rv3605c3,00E-putative membrane protein36MMML0228Rv3604c2,00E-putative membrane protein51MlpqTRv1016c1,00E-putative lipoprotein52MML0256Rv10242,00E-Contains hydrophobic, possible membrane-spanning42regionMML0271Rv04011,00E-putative membrane protein23MML0279Rv0356c9,00E-63MML0281Rv03582,00E-36MML0285Rv03611,00E-putative membrane protein50MML0298Rv04165,00E-possibly thiamine_biosynthesis10MlpqERv35843,00E-putative lipoprotein40MML0370Rv34382,00E-Contains PS00107 Protein kinases ATP-binding region78signatureMNL0383Rv3415c5,00E-59MML0386Rv34124,00E-45MML0405Rv3616c1,00E-71MML0406Rv3615c2,00E-14MML0407Rv3614c2,00E-45MML0410Rv21078,00E-PE-family protein08MML0411Rv21081,00E-PPE-family protein, serine-rich antigen22MML0425Rv2520c2,00E-putative membrane protein10MML0431Rv25071,00E-putative membrane protein41MML0520Rv25361,00E-putative membrane protein40MPERv13861,00E-PE protein family21MPPE.0Rv13873,00E-PPE protein family99MmihFRv13884,00E-integration host factor24MlprGRv1411c1,00E-putative lipoprotein50Mmtb12Rv2376c2,00E-putative secreted protein28MML0676Rv33542,00E-15MML0703Rv3311e-125MML0730Rv32811,00E-Contains Pfam match to entry PF01039 Carboxyl_trans,20Carboxyl transferase domainMML0733Rv3278c4,00E-putative membrane protein53MML0734Rv32772,00E-putative membrane protein64MML0748Rv32691,00E-irpA15MML0761Rv32592,00E-Mycobacterium smegmatis hypothetical 6.0 kDa protein48(partial CDS) TR:Q9S425 (EMBL:AF164439) fasta scores:E( ): 1e-10, 75.5% id in 53 aaMML0764Rv3256c1,00E-79MML0806Rv3217c5,00E-putative membrane protein25MML0810Rv3212e-104putative membrane proteinMML0813Rv32092,00E-putative membrane protein24MML0818Rv3205ce-102MML0834Rv23421,00E-21MML0872Rv22039,00E-putative membrane protein43MmmpS3Rv2198c3,00E-putative membrane protein49MML0878Rv2197c1,00E-putative membrane protein55MML0888Rv2186c8,00E-41MML0889Rv2185c8,00E-41MML0891Rv2183c2,00E-27MML0895Rv2179c1,00E-60MML0898Rv2175c1,00E-putative DNA-binding protein41MMML0901Rv2172ce-102MML0902Rv21713,00E-probable lipoprotein57MML0903Rv21709,00E-55MML0904Rv2169c7,00E-putative membrane protein32MML0907Rv2164c2,00E-putative conserved membrane protein50MML0923Rv2144c3,00E-possible membrane protein07MML0984Rv27403,00E-31MML0990Rv2732c9,00E-possible conserved membrane protein46MML1001Rv27227,00E-06MML1004Rv2719c1,00E-possible conserved membrane protein17MML1015Rv27097,00E-possible conserved membrane protein26MML1025Rv27001,00E-possible secreted protein62MML1030Rv26951,00E-47MML1053Rv21078,00E-PE protein11MML1055Rv2347c1,00E-—, family19MML1056Rv3619c6,00E-—, family18MML1065Rv12096,00E-membrane protein21MML1077Rv12223,00E-Mycobacterium avium TR:O05736 (EMBL:U87308) (133 aa);34Fasta score E( ): 0, 71.7% identity in 138 aa overlapMML1079Rv12242,00E-possible secreted protein29MML1096Rv1249c2,00E-putative membrane protein48MML1098Rv1251c0.0some_similarity_to_GTP-binding_proteinsMML1099Rv1252c5,00E-putative lipoprotein41MML1115Rv12743,00E-lipoprotein, lprB58MML1116Rv12758,00E-lipoprotein, lprC54MML1120Rv12780.0Contains multiple possible coiled-coils. ContainsPS00017 ATP/GTP-binding site motif A (P-loop)MML1138Rv13033,00E-integral membrane protein20MML1176Rv1342c3,00E-possible conserved membrane protein34MML1177Rv1343c5,00E-possible lipoprotein, membrane protein43MML1180Rv3619c6,00E-ESAT-6 family18MML1181Rv2347c1,00E-QILSS family19MML1182Rv1361c2,00E-PPE family47MML1183Rv21078,00E-PE family11MML1190Rv2525c3,00E-—, twin-Arginine secreted protein70MML1221Rv15902,00E-18MML1222Rv15911,00E-membrane protein29MML1232Rv1184c2,00E-Possibly secreted PE protein, Contains PS0001777ATP/GTP-binding site motif A (P-loop)MML1233Rv38219,00E-conserved membrane protein33MML1244Rv2484ce-130conserved membrane proteinMML1255Rv2468c1,00E-41MML1270Rv16108,00E-conserved membrane protein, Contains Pfam match to36entry PF00218 IGPS,MML1296Rv2137c1,00E-25MML1299Rv2134c9,00E-60MML1300Rv2133c4,00E-90MML1315Rv21161,00E-lipoprotein, LppK35MML1334Rv2091c6,00E-conserved membrane protein, calcium-binding28MML1357Rv16937,00E-09MML1361Rv1697e-114conserved membrane proteinMML1362Rv16986,00E-conserved secreted protein58MML1389Rv1635ce-144conserved membrane proteinMML1446Rv2061c5,00E-35MML1470Rv2446c2,00E-conserved membrane protein16MML1505Rv1158c7,00E-conserved hypothetical Proline rich protein, possibly17secretedMML1506Rv1157c4,00E-62MML1526Rv2772c2,00E-conserved membrane protein43MML1537Rv17971,00E-possible secreted protein98MML1540Rv1794e-101MML1544Rv1782e-155conserved membrane proteinMML1560Rv28438,00E-24MML1584Rv28763,00E-conserved membrane protein25MML1607Rv2898c2,00E-Contains Pfam match to entry PF02021 UPF0102,17Uncharacterised protein family UPF0102,sp|O83883|Y913_TREPA HYPOTHETICAL PROTEIN TP0913>gi|7514634|pir.MML1610Rv2901c2,00E-39MML1638Rv2229c2,00E-63MML1677Rv29803,00E-possible secreted protein33MML1704Rv30136,00E-71MML1720Rv3035e-107MML1813Rv14763,00E-39MPPE.1Rv0256c3,00E-PPE-family protein93MML1828ARv02571,00E-Probably pseudogene as Rv0257 is longer15MML1911ARv0634A—, May be pseudogene as Rv0634A is predicted to be 13aa longerMML1918Rv3587c5,00E-conserved hypothetical membrane protein69MML1937Rv1111c9,00E-probable integral membrane protein39MMML1939Rv1109c9,00E-49MML1945Rv11006,00E-possible membrane protein57MML1991Rv00964,00E-PPE90MML1988Rv0093c1,00E-Contains possible membrane spanning hydrophobic52domains. Note lacks the N-terminal 46 aa of the M.tuberculosis proteinMML1993Rv00983,00E-50MML1995Rv01001,00E-18MML2010Rv1906c4,00E-putative lipoprotein (secreted in Mt)31MML2022Rv18932,00E-13MML2023Rv18912.00E-Contains probable N-terminal signal sequence.46MML2054Rv18611,00E-integral membrane protein07MML2070Rv1836ce-171MML2111Rv09121,00E-membrane protein35MML2113Rv09106,00E-49MML2141Rv0879c9,00E-22MML2144Rv0875c2,00E-possible exported protein45MML2155Rv08632,00E-18MML2195Rv0817c4,00E-probable exported protein68MML2228Rv0779c3,00E-probable membrane protein50MML2258Rv0543c2,00E-28MML2259Rv0544c2,00E-possible membrane protein16MML2271Rv05566,00E-putative membrane protein46MML2274Rv0559c9,00E-putative secreted protein23MML2320Rv3705c8,00E-64MML2337Rv37234,00E-possible membrane spanning hydrophobic domains57MML2377Rv0451c1,00E-mmpS4, Mycobacterium avium TmtpA TR:Q9XCF435(EMBL:AF143772) (221 aa) fasta scores: E( ): 0, 58.9%id in 146 aaMML2380Rv0455c2,00E-possible secreted protein37MML2388Rv04639,00E-possible membrane protein18MML2390Rv10831,00E-possible secreted/membrane protein10MML2392Rv1081c6,00E-conserved membrane protein,34hydrophobic_stretch_from_aa_26-48MML2407Rv05315,00E-putative membrane protein06MML2433Rv04975,00E-conserved membrane protein39MML2450Rv0479c7,00E-possible secreted protein, >gb|AAF74996.1|AF143402_157(AF143402) putative multicopper oxidase[Mycobacterium avium]MML2452Rv04772,00E-23MML2454Rv04756,00E-possible hemagglutinin40MML2465Rv0464c7,00E-53MML2473Rv3753c2,00E-53MML2489Rv0383c5,00E-possible secreted protein, hydrophobic N-terminus and91Pro-rich C-terminusMML2491Rv1754ce-109MML2518Rv03131,00E-39MML2527Rv02922,00E-conserved membrane protein,69MML2530Rv02892,00E-92MML2531Rv02885,00E-ESAT-6 family, possible cell surface protein27MML2532Rv3020c9,00E-PE-family protein10MML2534Rv02859,00E-PE-family protein13MML2536Rv0283e-156conserved membrane proteinMML2557Rv0250c2,00E-26MmceRv0169e-107Mce proteinMML2569ARv0236A2,00E-Small secreted protein with typical N-terminal signal24peptideMML2570Rv0236c0.0possible integral membrane proteinMML2581Rv0227ce-116putative integral membrane proteinMML2582Rv0226ce-132conserved membrane proteinMML2595Rv01752,00E-possible membrane protein41MML2596Rv01761,00E-conserved membrane protein73MML2597Rv01771,00E-conserved membrane protein42MML2598Rv01782,00E-conserved membrane protein43MML2604Rv01848,00E-64MML2605Rv01853,00E-47MML2614Rv01993,00E-conserved membrane protein47MML2615Rv02005,00E-probable membrane protein55MML2616Rv0201c5,00E-36MML2621Rv0207c2,00E-43MML2627Rv0216e-103MML2629Rv01646,00E-44MML2689Rv00491,00E-45XML0190Rv10007,00E-gp|AL357613|AL357613_12 S. coelicolor cosmid (210 aa)53E( ): 2.4e-44; 55.122% identity in 205 aa overlap;AE003963|AE003963_5 Xylella fastidiosa, E( ) 9.7e-14; 3 9.894% identity in 188 aa overlap. Weaksimilarity to proteins involved in DNA repairXML0257Rv10254,00E-Also hypothetical proteins from Thermotoga maritima72e.g. TN1078, hypothetical protein, TR:Q9X0G7(EMBL:AE001768) (170 aa)XML0418Rv3368c2,00E-weak similarity Thermus aquaticus nox, NADH76dehydrogenase, SW:NOX_THETH (X60110) (205 aa); Fastascore E( ) 0.00023, 28.8% identity in 212 aa overlap.XML0577Rv14409,00E-putative protein-export membrane protein, secG12XML0776Rv3242c3,00E-probable competence protein ComF - Deinococcus11radio . . . 77 2e-13XML1037Rv26832,00E-Contains 2 Pfam matches to entry PF00571 CBS, CBS42domain.XML1119Rv1277e-105possibly phosphoesteraseXML1159Rv1324e-116probable thioredoxinXML1249Rv2476c0.0Rickettsia prowazekii TR:Q9ZCI2 (EMBL:AJ235273) (1581aa); Fasta score E( ): 0, 32.9% identity in 1494 aaoverlapXML1399Rv16471,00E-weakly adenylate cyclases76XML1444Rv20543,00E-Weakly several carboxymethylenebutenolidases (EC943.1.1.45) involved in 3-chlorocatechol degradatione.g. Pseudomonas putida SW:CLCD_PSEPU (P11453) (236aa)XML1494Rv11718,00E-conserved membrane protein, pir||PH0210 hypothetical19protein 133 (fdxA 5′ region) - Saccharo . . . 74 5e-13XML1503ARv1159A9,00E-S. coelicolor (SC5C7.25) gp|AL03151335AL031515|AL031515_25 (101 aa) E( ): 1.9e-06; 34.831%identity in 89 aa overlap; and archaebacteria.XML1660Rv2926c2,00E-—, pir||E72412 conserved hypothetical protein -69Thermotoga manitima . . . 66 4e-10XML1723Rv3038ce-152—, gb|AAC01738.1| (AF040571) methyltransferase[Amycolatopsis medit . . . 59 1e-07XML1909Rv06369,00E-Contains Pfam match to entry PF01575 MaoC_dehydratas,72MaoC like domain. ML2566XdesA2Rv10947,00E-Gossypium hirsutum (Upland cotton) acyl-[acyl-carrier85protein) desaturase precursor SW:STAD_GOSHI (X95988)(397 aa); Fasta score E( ): 5.6e-05, 23.9% identity in293 aa overlap.XML1983Rv1919c8,00E-weakly similar pollen allergen45XML2366Rv37601,00E-Deinococcus radiodurans conserved hypothetical12protein TR:Q9RU17XML2463Rv0466e-102weakly similar acyl-ACP thioesteraseA, Actinomycete-specific; M, mycobacterial-specific; X, limited distribution; —, no information available.


[0121]

3





TABLE 3










Possible twin arginine secreted proteins










M. tuberculosis
M. leprae
Gene
Predicted function





Rv0203
del

unknown


Rv0265c
NF
fecB2









iron_transport_protein_FeIII_dicitrate_transporter










Rv0846c
ML2171 ps

similar_to_several_L-ascorbate_oxidases


Rv1755c
del
plcD
phospholipase_C_precursor


Rv2349c
NF
plcC
phospholipase_C_precursor


Rv2350c
del
plcB
phospholipase_C_precursor


Rv2351c
NF
plcA
phospholipase_C_precursor


Rv2525c
ML1190

unknown


Rv2577
ML0497 ps

similarity_to_G755244_acid_phosphatase


Rv2833c
del
ugpB
sn-glycerol-3-phosphate transport


Rv3353c
del

unknown


NF
ML2649

unknown






del, deleted; NF, not found; ps, pseudogene.








[0122] The implications for this invention are widespread. M. tuberculosis and M. leprae marker polypeptides are disclosed in SEQ ID NO: 1 to SEQ ID NO:644. The discovery of the M. tuberculosis and M. leprae marker polypeptides and DNA encoding the polypeptides enables construction of expression vectors comprising nucleic acid sequences encoding M. tuberculosis and M. leprae marker polypeptides; host cells transfected or transformed with the expression vectors; biologically active M. tuberculosis and M. leprae marker polypeptides and M. tuberculosis and M. leprae marker polypeptides as isolated or purified peptides; and antibodies immunoreactive with M. tuberculosis and M. leprae marker polypeptides. In addition, understanding of the mechanism by which M. tuberculosis and M. leprae marker polypeptides function enables the design of assays to detect inhibitors of M. tuberculosis and M. leprae marker polypeptide activity.


[0123] As used herein, the term “M. tuberculosis and M. leprae marker polypeptides” refers to a genus of polypeptides that encompasses polypeptides of a formula selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO:644, as well as those polypeptides having a high degree of similarity (at least 90% homology) with such amino acid sequences and which polypeptides are immunoreactive or biologically active.


[0124] The term “purified” as used herein, means that the M. tuberculosis and M. leprae marker polypeptides are essentially free of association with other proteins or polypeptides, for example, as a purification product of recombinant host cell culture or as a purified product from a non-recombinant source. The term “substantially purified” as used herein, refers to a mixture that contains M. tuberculosis and M. leprae marker polypeptides and is essentially free of association with other proteins or polypeptides, but for the presence of known proteins that can be removed using a specific antibody, and which substantially purified M. tuberculosis and M. leprae marker polypeptides can be used as antigens.


[0125] A M. tuberculosis and M. leprae marker polypeptide “variant” as referred to herein means a polypeptide substantially homologous to native M. tuberculosis and M. leprae marker polypeptides, but which has an amino acid sequence different from that of native M. tuberculosis and M. leprae marker polypeptides because of one or more deletions, insertions, or substitutions. The variant amino acid sequence preferably is at least 80% identical to a native M. tuberculosis and M. leprae marker polypeptide amino acid sequence, most preferably at least 90% identical. The percent identity can be determined, for example by comparing sequence information using the GAP computer program, version 6.0 described by Devereux et al. (Nucl. Acids Res. 12:387, 1984) and available from the University of Wisconsin Genetics Computer Group (UWGCG). The GAP program utilizes the alignment method of Needleman and Wunsch (J. Mol. Biol. 48:443, 1970), as revised by Smith and Waterman (Adv. Appl. Math 2:482, 1981). The preferred default parameters for the GAP program include: (1) a unary comparison matrix (containing a value of 1 for identities and 0 for non-identities) for nucleotides, and the weighted comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745, 1986, as described by Schwartz and Dayhoff, eds., Atlas of Protein Sequence and Structure, National Biomedical Research Foundation, pp. 353-358, 1979; (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no penalty for end gaps.


[0126] Variants can comprise conservatively substituted sequences, meaning that a given amino acid residue is replaced by a residue having similar physicochemical characteristics. Examples of conservative substitutions include substitution of one aliphatic residue for another, such as Ile, Val, Leu, or Ala for one another, or substitutions of one polar residue for another, such as between Lys and Arg; Glu and Asp; or Gln and Asn. Other such conservative substitutions, for example, substitutions of entire regions having similar hydrophobicity characteristics, are well known. Naturally occurring M. tuberculosis and M. leprae marker polypeptide variants are also encompassed by the invention. Examples of such variants are proteins that result from alternate mRNA splicing events or from proteolytic cleavage of the M. tuberculosis and M. leprae marker polypeptides. Variations attributable to proteolysis include, for example, differences in the termini upon expression in different types of host cells, due to proteolytic removal of one or more terminal amino acids from the M. tuberculosis and M. leprae marker polypeptides. Variations attributable to frameshifting include, for example, differences in the termini upon expression in different types of host cells due to different amino acids.


[0127] As stated above, the invention provides isolated and purified, or homogeneous, M. tuberculosis and M. leprae marker polypeptides, both recombinant and non-recombinant. Variants and derivatives of native M. tuberculosis and M. leprae marker polypeptides that can be used as antigens can be obtained by mutations of nucleotide sequences coding for native M. tuberculosis and M. leprae marker polypeptides. Alterations of the native amino acid sequence can be accomplished by any of a number of conventional methods. Mutations can be introduced at particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction sites enabling ligation to fragments of the native sequence. Following ligation, the resulting reconstructed sequence encodes an analog having the desired amino acid insertion, substitution, or deletion.


[0128] Alternatively, oligonucleotide directed, site specific mutagenesis procedures can be employed to provide an altered gene wherein predetermined codons can be altered by substitution, deletion, or insertion. Exemplary methods of making the alterations set forth above are disclosed by Walder et al. (Gene 42:133, 1986); Bauer et al. (Gene 37:73, 1985); Craik (BioTechniques, January 1985, 12-19); Smith et al. (Genetic Engineering: Principles and Methods, Plenum Press, 1981); Kunkel (Proc. Natl. Acad. Sci. USA 82:488, 1985); Kunkel et al. (Methods in Enzymol. 154:367, 1987); and U.S. Pat. Nos. 4,518,584 and 4,737,462, all of which are incorporated by reference.


[0129] Within an aspect of the invention, M. tuberculosis and M. leprae marker polypeptides can be utilized to prepare antibodies that specifically bind to M. tuberculosis and M. leprae marker polypeptides. The term “antibodies” is meant to include polyclonal antibodies, monoclonal antibodies, fragments thereof such as F(ab′)2 and Fab fragments, as well as any recombinantly produced binding partners. Antibodies are defined to be specifically binding if they bind M. tuberculosis and M. leprae marker polypeptides with a Ka of greater than or equal to about 107 M−1. Affinities of binding partners or antibodies can be readily determined using conventional techniques, for example, those described by Scatchard et al., Ann. N. Y Acad. Sci., 51:660 (1949). Polyclonal antibodies can be readily generated from a variety of sources, for example, horses, cows, goats, sheep, dogs, chickens, rabbits, mice, or rats, using procedures that are well known in the art.


[0130] The invention further encompasses isolated fragments and oligonucleotides derived from the nucleotide sequences of the invention. The invention also encompasses polypeptides encoded by these fragments and oligonucleotides.


[0131] Nucleic acid sequences within the scope of the invention include isolated DNA and RNA sequences that hybridize to the native M. tuberculosis and M. leprae marker nucleic acids disclosed herein under conditions of moderate or severe stringency, and which encode M. tuberculosis and M. leprae marker polypeptides. As used herein, conditions of moderate stringency, as known to those having ordinary skill in the art, and as defined by Sambrook et al. Molecular Cloning: A Laboratory Manual, 2 ed. Vol. 1, pp. 1.101-104, Cold Spring Harbor Laboratory Press, (1989), include use of a prewashing solution for the nitrocellulose filters 5× SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0), hybridization conditions of 50% formamide, 6× SSC at 42° C. (or other similar hybridization solution, such as Stark's solution, in 50% formamide at 42° C.), and washing conditions of about 60° C., 0.5× SSC, 0.1% SDS. Conditions of high stringency are defined as hybridization conditions as above, and with washing at 68° C., 0.2× SSC, 0.1% SDS. The skilled artisan will recognize that the temperature and wash solution salt concentration can be adjusted as necessary according to factors such as the length of the probe.


[0132] Due to the known degeneracy of the genetic code, wherein more than one codon can encode the same amino acid, a DNA sequence can vary and still encode a M. tuberculosis and M. leprae marker polypeptide of a formula selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO:644. Such variant DNA sequences can result from silent mutations (e.g., occurring during PCR amplification), or can be the product of deliberate mutagenesis of a native sequence.


[0133] The invention thus provides equivalent isolated DNA sequences, encoding M. tuberculosis and M. leprae marker polypeptides, selected from: (a) DNA derived from the coding region of a native M. tuberculosis and M. leprae marker nucleic acid; (b) cDNA comprising the nucleotide sequence of the invention; (c) DNA capable of hybridization to a DNA of (a) under conditions of moderate stringency and which encode M. tuberculosis and M. leprae marker polypeptides; and (d) DNA which is degenerate as a result of the genetic code to a DNA defined in (a), (b) or (c) and which encodes M. tuberculosis and M. leprae marker polypeptides. M. tuberculosis and M. leprae marker polypeptides encoded by such DNA equivalent sequences are encompassed by the invention.


[0134] DNA that is equivalent to the DNA sequence of the invention will hybridize under moderately stringent conditions to the double-stranded native DNA sequence that encodes polypeptides of a formula selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO:644. Examples of M. tuberculosis and M. leprae marker polypeptides encoded by such DNA, include, but are not limited to, M. tuberculosis and M. leprae marker polypeptide fragments and M. tuberculosis and M. leprae marker polypeptides comprising inactivated N-glycosylation site(s), inactivated protease processing site(s), or conservative amino acid substitution(s), as described above. M. tuberculosis and M. leprae marker polypeptides encoded by DNA derived from other species, wherein the DNA will hybridize to the complement of the DNA of the invention are also encompassed.


[0135] Recombinant expression vectors containing a nucleic acid sequence encoding M. tuberculosis and M. leprae marker polypeptides can be prepared using well known methods. The expression vectors include a M. tuberculosis and M. leprae marker DNA sequence operably linked to suitable transcriptional or translational regulatory nucleotide sequences, such as those derived from a mammalian, microbial, viral, or insect gene. Examples of regulatory sequences include transcriptional promoters, operators, or enhancers, an mRNA ribosomal binding site, and appropriate sequences which control transcription and translation initiation and termination. Nucleotide sequences are “operably linked” when the regulatory sequence functionally relates to the M. tuberculosis and M. leprae marker DNA sequence. Thus, a promoter nucleotide sequence is operably linked to a M. tuberculosis and M. leprae marker DNA sequence if the promoter nucleotide sequence controls the transcription of the M. tuberculosis and M. leprae marker DNA sequence. The ability to replicate in the desired host cells, usually conferred by an origin of replication, and a selection gene by which transformants are identified can additionally be incorporated into the expression vector.


[0136] In addition, sequences encoding appropriate signal peptides that are not naturally associated with M. tuberculosis and M. leprae marker polypeptides can be incorporated into expression vectors. For example, a DNA sequence for a signal peptide (secretory leader) can be fused in-frame to the M. tuberculosis and M. leprae marker nucleotide sequence so that the M. tuberculosis and M. leprae marker polypeptide is initially translated as a fusion protein comprising the signal peptide. A signal peptide that is functional in the intended host cells enhances extracellular secretion of the M. tuberculosis and M. leprae marker polypeptide. The signal peptide can be cleaved from the M. tuberculosis and M. leprae marker polypeptide upon secretion of the marker polypeptide from the cell.


[0137] Expression vectors fdr use in prokaryotic host cells generally comprise one or more phenotypic selectable marker genes. A phenotypic selectable marker gene is, for example, a gene encoding a protein that confers antibiotic resistance or that supplies an autotrophic requirement. Examples of useful expression vectors for prokaryotic host cells include those derived from commercially available plasmids. Commercially available vectors include those that are specifically designed for the expression of proteins. These include pMAL-p2 and pMAL-c2 vectors, which are used for the expression of proteins fused to maltose binding protein (New England Biolabs, Beverly, Mass., USA).


[0138] Promoter sequences commonly used for recombinant prokaryotic host cell expression vectors include β-lactamase (penicillinase), lactose promoter system (Chang et al., Nature 275:615, 1978; and Goeddel et al., Nature 281:544, 1979), tryptophan (trp) promoter system (Goeddel et al., Nucl. Acids Res. 8:4057, 1980; and EP-A-36776), and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, p. 412, 1982).


[0139] Suitable host cells for expression of M. tuberculosis and M. leprae marker polypeptides include prokaryotes, yeast or higher eukaryotic cells. Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and mammalian cellular hosts are described, for example, in Pouwels et al. Cloning Vectors: A Laboratory Manual, Elsevier, N.Y., (1985). Cell-free translation systems could also be employed to produce M. tuberculosis and M. leprae marker polypeptides using RNAs derived from DNA constructs disclosed herein.


[0140] It will be understood that the present invention is intended to encompass the previously described proteins in isolated or purified form, whether obtained using the techniques described herein or other methods. In a preferred embodiment of this invention, the M. tuberculosis and M. leprae marker polypeptides are substantially free of human tissue and human tissue components, nucleic acids, extraneous proteins and lipids, and adventitious microorganisms, such as bacteria and a mycoplasma. It will also be understood that the invention encompasses equivalent proteins having substantially the same biological and immunogenic properties. Thus, this invention is intended to cover serotypic variants of the proteins of the invention.


[0141] Depending on the use to be made of the M. tuberculosis and M. leprae marker polypeptides of the invention, it may be desirable to label them. Examples of suitable labels are radioactive labels, enzymatic labels, fluorescent labels, chemiluminescent labels, and chromophores. The methods for labeling proteins and glycoproteins of the invention do not differ in essence from those widely used for labeling immunoglobulin. The need to label may be avoided by using labeled antibody to the antigen of the invention or anti- immunoglobulin to the antibodies to the antigen as an indirect marker.


[0142] Once the M. tuberculosis and M. leprae marker polypeptides of the invention have been obtained, they can be used to produce polyclonal and monoclonal antibodies reactive therewith. Thus, a polypeptide of the invention can be used to immunize an animal host by techniques known in the art. Such techniques usually involve inoculation, but they may involve other modes of administration. A sufficient amount of the polypeptide is administered to create an immunogenic response in the animal host. Any host that produces antibodies to the antigen of the invention can be used. Once the animal has been immunized and sufficient time has passed for it to begin producing antibodies to the antigen, polyclonal antibodies can be recovered. The general method comprises removing blood from the animal and separating the serum from the blood. The serum, which contains antibodies to the antigen, can be used as an antiserum to the antigen. Alternatively, the antibodies can be recovered from the serum. Affinity purification is a preferred technique for recovering purified polyclonal antibodies to the antigen from the serum.


[0143] Monoclonal antibodies to the antigens of the invention can also be prepared. One method for producing monoclonal antibodies reactive with the antigens comprises the steps of immunizing a host with the antigen; recovering antibody producing cells from the spleen of the host; fusing the antibody producing cells with myeloma cells deficient in the enzyme hypoxanthine-guanine phosphoribosyl transferase to form hybridomas; select at least one of the hybridomas by growth in a medium comprising hypoxanthine, aminopterin, and thymidine; identifying at least one of the hybridomas that produces an antibody to the antigen, culturing the identified hybridoma to produce antibody in a recoverable quantity; and recovering the antibodies produced by the cultured hybridoma.


[0144] These polyclonal or monoclonal antibodies can be used in a variety of applications. Among these is the neutralization of corresponding proteins. They can also be used to detect viral antigens in biological preparations or in purifying corresponding proteins, glycoproteins, or mixtures thereof, for example, when used in a affinity chromatographic columns.


[0145] The M. tuberculosis and M. leprae marker polypeptides can be used as antigens to identify antibodies to a mycobacteria in materials and to determine the concentration of the antibodies in those materials. Thus, the antigens can be used for qualitative or quantitative determination of a mycobacteria in a material. Such materials of course include human tissue and human cells, as well as biological fluids, such as human body fluids, including human sera. When used as a reagent in an immunoassay for determining the presence or concentration of the antibodies to a mycobacteria, the antigens of the present invention provide an assay that is convenient, rapid, sensitive, and specific.


[0146] More particularly, the antigens of the invention can be employed for the detection of a mycobacterium by means of immunoassays that are well known for use in detecting or quantifying humoral components in fluids. Thus, antigen-antibody interactions can be directly observed or determined by secondary reactions, such as precipitation or agglutination. In addition, immunoelectrophoresis techniques can also be employed. For example, the classic combination of electrophoresis in agar followed by reaction with anti-serum can be utilized, as well as two-dimensional electrophoresis, rocket electrophoresis, and immunolabeling of polyacrylamide gel patterns (Western Blot or immunoblot). Other immunoassays in which the antigens of the present invention can be employed include, but are not limited to, radioimmunoassay, competitive immunoprecipitation assay, enzyme immunoassay, and immunofluorescence assay. It will be understood that turbidimetric, colorimetric, and nephelometric techniques can be employed. An immunoassay based on Western Blot technique is preferred.


[0147] Immunoassays can be carried out by immobilizing one of the immunoreagents, either an antigen of the invention or an antibody of the invention to the antigen, on a carrier surface while retaining immunoreactivity of the reagent. The reciprocal immunoreagent can be unlabeled or labeled in such a manner that immunoreactivity is also retained. These techniques are especially suitable for use in enzyme immunoassays, such as enzyme linked immunosorbent assay (ELISA) and competitive inhibition enzyme immunoassay (CIEIA).


[0148] When either the antigen of the invention or antibody to the antigen is attached to a solid support, the support is usually a glass or plastic material. Plastic materials molded in the form of plates, tubes, beads, or disks are preferred. Examples of suitable plastic materials are polystyrene and polyvinyl chloride. If the immunoreagent does not readily bind to the solid support, a carrier material can be interposed between the reagent and the support. Examples of suitable carrier materials are proteins, such as bovine serum albumin, or chemical reagents, such as gluteraldehyde or urea. Coating of the solid phase can be carried out using conventional techniques.


[0149] The invention provides immunogenic M. tuberculosis and M. leprae marker polypeptides, and more particularly, protective polypeptides for use in the preparation of vaccine compositions against a mycobacterium. These polypeptides can thus be employed as viral vaccines by administering the polypeptides to a mammal susceptible to a mycobacteria infection. Conventional modes of administration can be employed. For example, administration can be carried out by oral, respiratory, or parenteral routes. Intradermal, subcutaneous, and intramuscular routes of administration are preferred when the vaccine is administered parenterally.


[0150] The major purpose of the immune response in a mycobacteria-infected mammal is to inactivate the free mycobacteria and to eliminate mycobacteria infected cells that have the potential to release infectious mycobacteria. The B-cell arm of the immune response has some responsibility for inactivating free mycobacteria. The principal manner in which this is achieved is by neutralization of infectivity. Another major mechanism for destruction of the a mycobacteria-infected cells is provided by cytotoxic T lymphocytes (CTL) that recognize M. tuberculosis and M. leprae marker antigens expressed in combination with class I histocompatibility antigens at the cell surface. The CTLs recognize M. tuberculosis and M. leprae marker polypeptides processed within cells from a M. tuberculosis and M. leprae marker polypeptide that is produced, for example, by the infected cell or that is internalized by a phagocytic cell. Thus, this invention can be employed to stimulate a B-cell response to M. tuberculosis and M. leprae marker polypeptides, as well as immunity mediated by a CTL response following infection. The CTL response can play an important role in mediating recovery from primary mycobacterial infection and in accelerating recovery during subsequent infections.


[0151] The ability of the M. tuberculosis and M. leprae marker polypeptides and vaccines of the invention to induce protective levels of neutralizing antibody in a host can be enhanced by emulsification with an adjuvant, incorporating in a liposome, coupling to a suitable carrier, or by combinations of these techniques. For example, the M. tuberculosis and M. leprae marker polypeptides of the invention can be administered with a conventional adjuvant, such as aluminum phosphate and aluminum hydroxide gel, in an amount sufficient to potentiate humoral or cell-mediated immune responses in the host. Similarly, the M. tuberculosis and M. leprae marker polypeptides can be bound to lipid membranes or incorporated in lipid membranes to form liposomes. The use of nonpyrogenic lipids free of nucleic acids and other extraneous matter can be employed for this purpose.


[0152] The immunization schedule will depend upon several factors, such as the susceptibility of the host to infection and the age of the host. A single dose of the vaccine of the invention can be administered to the host or a primary course of immunization can be followed in which several doses at intervals of time are administered. Subsequent doses used as boosters can be administered as needed following the primary course.


[0153] The M. tuberculosis and M. leprae marker polypeptides and vaccines of the invention can be administered to the host in an amount sufficient to prevent or inhibit a mycobacteria infection or replication in vivo. In any event, the amount administered should be at least sufficient to protect the host against substantial immunosuppression, even though a mycobacterial infection may not be entirely prevented. An immunogenic response can be obtained by administering the polypeptides of the invention to the host in an amount of about 10 to about 500 micrograms antigen per kilogram of body weight, preferably about 50 to about 100 micrograms antigen per kilogram of body weight. The polypeptides and vaccines of the invention can be administered together with a physiologically acceptable carrier. For example, a diluent, such as water or a saline solution, can be employed.


[0154] Another aspect of the invention provides a method of DNA vaccination. The method also includes administering any combination of the nucleic acids encoding M. tuberculosis and M. leprae marker polypeptides, with or without carrier molecules, to an individual. In embodiments, the individual is an animal, and is preferably a mammal. More preferably, the mammal is selected from the group consisting of a human, a dog, a cat, a bovine, a pig, and a horse. In an especially preferred embodiment, the mammal is a human.


[0155] The methods of treating include administering immunogenic compositions comprising M. tuberculosis and M. leprae marker polypeptides, but compositions comprising nucleic acids encoding M. tuberculosis and M. leprae marker polypeptides as well. Those of skill in the art are cognizant of the concept, application, and effectiveness of nucleic acid vaccines (e.g., DNA vaccines) and nucleic acid vaccine technology as well as protein and polypeptide based technologies. The nucleic acid based technology allows the administration of nucleic acids encoding M. tuberculosis and M. leprae marker polypeptides, naked or encapsulated, directly to tissues and cells without the need for production of encoded proteins prior to administration. The technology is based on the ability of these nucleic acids to be taken up by cells of the recipient organism and expressed to produce an immunogenic determinant to which the recipient's immune system responds. Typically, the expressed antigens are displayed on the surface of cells that have taken up and expressed the nucleic acids, but expression and export of the encoded antigens into the circulatory system of the recipient individual is also within the scope of the present invention. Such nucleic acid vaccine technology includes, but is not limited to, delivery of naked DNA and RNA and delivery of expression vectors encoding M. tuberculosis and M. leprae marker polypeptides. Although the technology is termed “vaccine”, it is equally applicable to immunogenic compositions that do not result in a protective response. Such non-protection inducing compositions and methods are encompassed within the present invention.


[0156] Although it is within the present invention to deliver nucleic acids encoding M. tuberculosis and M. leprae marker polypeptides and carrier molecules as naked nucleic acid, the present invention also encompasses delivery of nucleic acids as part of larger or more complex compositions. Included among these delivery systems are mycobacterium, mycobacteria-like particles, or bacteria containing the nucleic acid encoding M. tuberculosis and M. leprae marker polypeptides. Also, complexes of the invention's nucleic acids and carrier molecules with cell permeabilizing compounds, such as liposomes, are included within the scope of the invention. Other compounds, such as molecular vectors (EP 696,191, Samain et al.) and delivery systems for nucleic acid vaccines are known to the skilled artisan and exemplified in, for example, WO 93 06223 and WO 90 11092, U.S. Pat. Nos. 5,580,859, and U.S. 5,589,466 (Vical's patents), which are incorporated by reference herein, and can be made and used without undue or excessive experimentation.


[0157] To further achieve the objective and in accordance with the purposes of the present invention, a kit capable of diagnosing mycobacteria infection is described. This kit, in one embodiment, contains the DNA sequences of this invention, which are capable of hybridizing to RNA or analogous DNA sequences to indicate the presence of a mycobacteria infection. Different diagnostic techniques can be used which include, but are not limited to: (I) Southern blot procedures to identify cellular DNA which may or may not be digested with restriction enzymes; (2) Northern blot techniques to identify RNA extracted from cells; and (3) dot blot techniques, i.e., direct filtration of the sample through an ad hoc membrane, such as nitrocellulose or nylon, without previous separation on agarose gel. Suitable material for dot blot technique could be obtained from body fluids including, but not limited to, serum and plasma, supernatants from culture cells, or cytoplasmic extracts obtained after cell lysis and removal of membranes and nuclei of the cells by centrifugation.


[0158] The invention also provides screening assays for identifying agents that modulate (e.g. augment or inhibit) the activity of M. tuberculosis and M. leprae marker polypeptides. Assays for detecting the ability of agents to inhibit or augment the activity of M. tuberculosis and M. leprae marker polypeptides provide for facile high-throughput screening of agent banks (e.g., compound libraries, peptide libraries, and the like) to identify antagonists or agonists of these marker polypeptides. Such M. tuberculosis and M. leprae marker polypeptide antagonists and agonists may modulate marker polypeptide activity and thereby modulate, inhibit, or even prevent infection of a host by M. tuberculosis and M. leprae


[0159] For example, yeast comprising (1) an expression cassette encoding a GAL4 DNA binding domain (or GAL4 activator domain) fused to a binding fragment of M. tuberculosis or M. leprae marker polypeptide, (2) an expression cassette encoding a GAL4 DNA activator domain (or GAL4 binding domain, respectively) fused to a binding fragment of a test polypeptide, and (3) a reporter gene (e.g., β-galactosidase) comprising a cis-linked GAL4 transcriptional response element can be used for agent screening. Such yeast are incubated, and expression of the reporter gene (e.g., β-galactosidase) is determined by the capacity of the agent to affect expression of the reporter gene and thereby identify the test polypeptide as a candidate modulatory agent for M. tuberculosis or M. leprae marker polypeptides.


[0160] Yeast two-hybrid systems can be used to screen a mammalian (typically human) cDNA expression library, wherein cDNA is fused to a GAL4 DNA binding domain or activator domain, and either a M. tuberculosis or M. leprae marker polypeptide sequence is fused to a GAL4 activator domain or DNA binding domain, respectively. Such a yeast two-hybrid system can screen for cDNAs that encode proteins that interact with M. tuberculosis or M. leprae marker polypeptides.


[0161] Polypeptides that interact with M. tuberculosis or M. leprae marker polypeptides can also be identified by immunoprecipitation of M. tuberculosis or M. leprae marker polypeptides with antibody, and identification of co-precipitating species. Further, polypeptides that interact with M. tuberculosis or M. leprae marker polypeptides can be identified by screening a peptide library (e.g., a bacteriophage peptide display library) with a M. tuberculosis or M. leprae marker polypeptide.


[0162] Additional embodiments of the invention are directed to methods that employ specific antisense polynucleotides complementary to all or part of M. tuberculosis or M. leprae marker nucleic acids. Such complementary antisense polynucleotides may include nucleotide substitutions, additions, deletions, or transpositions, so long as specific hybridization to the relevant target sequence corresponding to M. tuberculosis or M. leprae marker nucleic acids is retained as a functional property of the polynucleotide. Complementary antisense polynucleotides include soluble antisense RNA or DNA oligonucleotides that can hybridize specifically to M. tuberculosis and M. leprae marker nucleic acid species and prevent transcription of the mRNA species and/or translation of the encoded polypeptide. See (Ching et al. (1989) Proc. Natl. Acad. Sci. U.S.A. 86:10006; Broder et al. (1990) Ann. Int. Med. 113:604; Loreau et al. (1990) FEBS Letters 274:53; Holcenberg et al., WO91/11535; U.S. Ser. No. 07/530,165; WO91/09865; WO91/04753; WO90/13641; and EP 386563). The antisense polynucleotides, therefore, inhibit production of M. tuberculosis or M. leprae marker polypeptides. Antisense polynucleotides that prevent transcription and/or translation of mRNA corresponding to M. tuberculosis or M. leprae marker polypeptides may inhibit or prevent infection by M. tuberculosis or M. leprae. Antisense polynucleotides of various lengths may be produced, although such antisense polynucleotides typically comprise a sequence of about at least 25 consecutive nucleotides, which are substantially identical to a naturally-occurring M. tuberculosis or M. leprae marker nucleic acids, and typically are identical to a M. tuberculosis or M. leprae marker nucleic acid. For general methods relating to antisense polynucleotides, see Antisense RNA and DNA, (1988) D.A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.).


[0163] Polypeptides with similar sequence should have similar function. Thus, the functions of M. tuberculosis and M. leprae marker polypeptides can be assessed by a database search. One method by which structural and functional domains can be identified is by comparison of the nucleotide and/or amino acid sequence data for M. tuberculosis and M. leprae marker polypeptides, or M. tuberculosis or M. leprae marker nucleic acids, to public or proprietary sequence databases. Preferably, computerized comparison methods are used to identify sequence motifs or predict polypeptide conformation domains that occur in other polypeptides of known structure and/or function. For example, methods to identify protein sequences that fold into a known three-dimensional structure are known (Bowie et al. (1991) Science 253:164).


[0164] As other examples, but not for limitation, the programs GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package (Genetics Computer Group, 575 Science Dr., Madison, Wis.) can be used to identify sequences in databases, such as GenBank/EMBL, that have regions of homology with M. tuberculosis or M. leprae marker polypeptides or M. tuberculosis or M. leprae marker nucleic acids. Such homologous regions are candidate structural or functional domains. Alternatively, other algorithms are provided for identifying such domains from sequence data. Further, network methods, whether implemented in hardware or software, can be used to: (1) identify related protein sequences and nucleotide sequences, and (2) define structural or functional domains in M. tuberculosis and M. leprae marker polypeptides.


[0165] Thus, those of skill in the art can recognize sequence motifs and structural conformations that may be used to define structural and functional domains in the M. tuberculosis and M. leprae marker nucleic acids of the invention. Hydrophobicity profiles can be generated and displayed graphically using the ProtScale utility at ExPASy (http://expasy.hcuge.ch/cgi-bin/protscale.pl). There are a number of different ways of predicting transmembrane helices in sequences, the simplest being merely to look for regions of the protein containing a run of 20 hydrophobic residues. However, there are also a number of more sophisticated, and accurate, algorithms which can be used not only to predict the location of transmembrane helices but also their orientation in the membrane.


[0166] Proteins can contain signals within their sequence which assist in their processing within the cell, for example leader sequences or signals which target proteins to specific compartments within cells. Web resources are available to help predict both these types of sites. Different regions of a polypeptide evolve at different rates; some parts of a polypeptide must retain a certain pattern of residues for the polypeptide to function. By identifying such conserved regions, it is possible to make predictions about the polypeptide function. Examples of conserved sequences can be found around the active sites of enzymes, sites of post-translational modification, binding sites for co-factors, protein sorting signals, etc. A number of bioinformatics resources have been developed both to build databases of conserved patterns and to search for instances of such patterns in sequences. One of the best known motif databases is PROSITE, which can be employed in this invention.


[0167] This invention will be described in greater detail in the following Examples.



EXAMPLE 1

[0168] The whole genome sequence was obtained from a combination of sequenced cosmids45 and 54,000 end sequences (giving 7.1× coverage) from a pUC18 genomic shotgun library using dye terminator chemistry on ABI373 or 377 automated sequencers The sequences of 42 cosmids previously generated by multiplex sequencing46 were used for scaffolding purposes only. The sequence was assembled using Phrap (P. Green, unpublished), finished using GAP447 then compared with sequences present in public databases using FASTA, BLASTN and BLASTX48. Potential CDS were predicted, and gene and protein sequences analysed as described previously8, 49, using Artemis50 to collate data and facilitate annotation. The genome and proteome sequences of M. leprae and M. tuberculosis H37Rv were compared pairwise to identify conserved genes using the Artemis Comparison Tool (ACT) (K. Rutherford; unpublished; http://www.sanger.ac.uk/Software/ACT/). Pseudogenes had one or more mutations that would ablate expression and were pinpointed by direct comparison with M. tuberculosis



EXAMPLE 2


Target Discovery

[0169] To illustrate the usefulness of comparative mycobacterial genomics for identifying potentially important proteins, a precise example will now be given. Preproteins transported by the TAT pathway generally bind redox cofactors and fold or oligomerize before crossing the membrane54, 62. After removal of the signal peptide, these proteins usually function in extracytoplasmic electron transfer chains. The specialized machinery that recognizes the twin-arginine motif, and translocates the preprotein across the membrane, is composed of several different Tat proteins. In Escherichia coli, TatA and TatE are 50% identical and share weak similarity with TatB. All three proteins are predicted to be anchored to the cytoplasmic membrane via an N-terminal hydrophobic alpha-helix, and to have cytoplasmic amphipathic helices followed by variable regions. The TatC protein is predicted to be an integral membrane protein with six transmembrane segments. M. tuberculosis and M. leprae both contain clearly identifiable tatA, tatB, tatC, and tatD genes and must, therefore, produce a functional Tat system.


[0170] On examination of the proteome of M. tuberculosis, eleven potential substrates for the Tat export system were recognized on the basis of their signal peptides containing potential twin arginine motifs (Table 3). During the extensive reductive evolution of the genome of M. leprae only one of the corresponding genes, ML1190, has escaped inactivation. It is orthologous to Rv2525c of M. tuberculosis but shows no similarity to proteins present in sequence databases. The 240 amino acid long precursor protein encoded by Rv2525c (or its counterpart ML1190 contains five histidines and one cysteine residue that may be important for coordinating divalent metal ions. The conservation of this coding sequence by M. leprae , in the face of massive gene loss, is a strong indication that it must play an important biological role. Given the many parallels with Tat systems elsewhere, it is likely to be in electron transport. These indirect arguments suggest on the one hand that, if this function were essential, the ML1190/Rv2525c gene product might represent a novel drug target or, on the other, since it is likely to be located extracellularly it may, therefore, be an important sentinel protein antigen.


[0171] The Mycobacterium tuberculosis strain HRV37 genomic library has been deposited at the Collection Nationale de Cultures de Microorganismes (C.N.C.M.), of Institut Pasteur, 28, rue du Docteur Roux, F-75724 Paris, Cedex 15, France, on Nov. 19, 1997, under the Accession Number I-1945. This genomic DNA library is disclosed in International patent application No. WO 9954487 (Institut Pasteur).


[0172] In summary, Leprosy, a chronic human neurological disease, results from infection with the obligate intracellular pathogen Mycobacterium leprae , a close relative of the tubercle bacillus. M. leprae has the longest doubling time of all known bacteria and has thwarted every effort at axenic culture. Comparison of the 3.27 Mb genome sequence of an armadillo-derived Indian isolate of the leprosy bacillus with that of Mycobacterium tuberculosis (4.41 Mb) provides clear explanations for these properties and reveals an extreme case of reductive evolution. Less than half of the genome contains functional genes while pseudogenes, with intact counterparts in M. tuberculosis, abound. Genome downsizing and the current mosaic arrangement appear to have resulted from extensive recombination events between dispersed repetitive sequences. Gene deletion and decay have eliminated many important metabolic activities including siderophore production, part of the oxidative, and all of the microaerophilic and anaerobic respiratory chains, together with numerous catabolic systems and their regulatory circuits.



REFERENCES

[0173] The entire disclosure of each of the following publications is relied upon and incorporated by reference herein.


[0174] 1. Anon. Randomised controlled trial of single BCG, repeated BCG, or combined BCG and killed Mycobacterium leprae vaccine for prevention of leprosy and tuberculosis in Malawi. Karonga Prevention Trial Group. Lancet 348, 17-24 (1996).


[0175] 2. Nordeen, S. K. & Hombach, J. M. in Tropical disease resarch: Progress 1991-1992: eleventh programme report of the UNDP/Warld Bank/WHO Special Programme for Research and Training in Tropical Diseases. (eds. Walgate, R. & Simpson, K.) 47-55 (World Health Organisation, Geneva, 1993).


[0176] 3. Anon. WHO Weekly Epidemiological Record 73, 40 (1998).


[0177] 4. Hansen, G. H. A. Undersogelser angaende spedalskhedens aasager. Norsk Magazin for Laegervidenskaben (supplement) 4, 1-88 (1874).


[0178] 5. Kirchheimer, W. K. &. Storrs, E. E. Attempts to establish the armadillo (Dasypus novemcinctus Linn.) as a model for the study of leprosy. 1. Report of lepromatoid leprosy in an experimentally infected armadillo. Int J Lep 39,693-702 (197 1).


[0179] 6. Franzblau, S. Drug susceptibility testing of Mycobacterium leprae in the BACTEC 460 system. Antimicrobial Agents and Chemotherapy 33, 2115-2117 (1989).


[0180] 7. Shephard, C. C. in Leprosy (eds. Hastings, R. C.) 269-286 (Churchill Livingstone, Edinburgh, 1985).


[0181] 8. Cole, S. T., et al. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393,537-544 (1998).


[0182] 9. Tekaia, F., et al. Analysis of the proteome of Mycobacterium. tuberculosis in silico. Tubercle Lung Disease 79,329-342 (1999).


[0183] 10. Brosch, R., Gordon, S. V., Eigimeier, K., Gamier, T. & Cole, S. T. Comparative genomics of the leprosy and tubercle bacilli. Res. Microbiol. 151, 135-142 (2000).


[0184] 11. Philipp, W., Schwartz, D. C., Telenti, A. & Cole, S. T. Mycobacterial genome structure. Electrophoresis 19, 573-576(1998).


[0185] 12. Stinear, T. P., Jenkin, G. A., Johnson, P. D. R. & Davies, J. K. Comparative genetic analysis of Mycobacterium ulcerans and Mycobacterium marinum reveals evidence of recent divergence. J. Bacteriol. 182, 6322-6330 (2000).


[0186] 13. Marques, M. A. M., Chitale, S., Brennan, P. J. & Pessolani, M. C. V. Mapping and identification of the major cell-wall associated components of Mycobacterium leprae. Infection Immunity 66, 2625-2631 (1998).


[0187] 14. Jungblut, P. R., et al. Comparative proteome analysis of Mycobacterium tuberculosis and Mycobacterium bovis BCG strains: towards functional genomics of microbial pathogens. Mol Microbiol 33, 1103-1117 (1999).


[0188] 15. Andersson, J. O. & Andersson, S. G. E. Insights into the evolutionary process of genome degradation. Curr. Opin. Genetics & Development 9, 664-671 (1999).


[0189] 16. Anderssen, S. G. E., et al. The complete genome sequence of the obligate intracellular parasite Rickettsia prowazekii. Nature 396,133-140(1998).


[0190] 17. Mizrahi, V., Dawes, S. S. & Rubin, H, in Molecular genetics of mycobacteria (eds. Haffull, G. F. & Jacobs, W. R., Jr.) 159-172(ASM Press, Washington, D.C., 2000).


[0191] 18. Gordon, S. V., Heym, B., Parkhill, J., Barrell, B. & Cole, S. T. New insertion sequences and a novel repeated sequence in the genome of Mycobacterium tuberculosis H37Rv. Microbiology 145, 881-892 (1999).


[0192] 19. Wolf, Y. I., Aravind, L., Grishin, N. V. & Koonin, E. V. Evolution of amino-acyl-tRNA synthetases —analysis of unique domain architectures and phylogenetic trees reveals a complex history of horizontal gene transfer events. Genome Research 9, 689-710. (1999).


[0193] 20. Poulet, S. & Cole, S. T. Repeated DNA sequences in mycobacteria. Arch. Microbiol. 163, 79-86 (1995).


[0194] 21. Cole, S. T. Learning from the genome sequence of Mycobacterium tuberculosis H37Rv. FEBS Letters 452, 7-10(1999).


[0195] 22. Ramakrishnan, L., Federspiel, N. A. & Falkow, S. Granuloma-specific expression of mycobacteriurm virulence proteins from the glycine-rich PE-PGRS family. Science 288, 1436-1439 (2000).


[0196] 23. Daffe, M. & Draper, P. The envelope layers of mycobacteria with reference to their pathogenicity. Advances in Microbial Physiology 39, 131-203 (1998).


[0197] 24. Yuan, Y. & Barry, C. E., 3rd. A common mechanism for the biosynthesis of methoxy and cyclopropyl mycolic acids in Mycobacterium tuberculosis. Proceedings of the National Academy of Sciences of the United States of America 93, 12828-33 (1996).


[0198] 25. Draper, P., Dobson, G., Minnikin, D. E. & Minnikin, S. M. The mycolic acids of Mycobacterium leprae harvested from experimentally infected nine-banded armadillos. Ann Microbiol (Paris) 133,39-47 (1982).


[0199] 26. Glickman, M. S., Cox, J. S. & Jacobs, W. R., Jr. A novel mycolic acid cyclopropane synthetase is required for coding, persistence, and virulence of Mycobacterium tuberculosis. Mol Cell 5, 717-27 (2000).


[0200] 27. Melancon-Kaplan, J., et al. Immunological significance of the cell wall of Mycobacterium leprae. Proc. Nad. Acad. Sci. USA 85,1917-1921 (1988).


[0201] 28. Cox, J. S., Chen, B., McNeil, M. & Jacobs, W. R., Jr. Complex lipid determines tissue specific replication of Mycobacterium tuberculosis in mice. Nature 402, 79-83 (1999).


[0202] 29. Camacho, L. R., Ensergueix, D., Perez, E., Gicquel, B. & Guilhot, C. Identification of a virulence gene cluster of Mycobacterium tuberculosis by signature-tagged transposon mutagenesis. Mol. Microbiol. 34, 257-267 (1999).


[0203] 30. Peterson, J. A. & Graham, S. E. A close family resemblance: the importance of structure in understanding cytochromes P450. Structure 6,1079-1085 (1998).


[0204] 31. Wheeler, P. R. & Ratledge, C. in Tuberculosis: Pathogenesis, protection, and control. (eds. Bloom, B. R.) 353-385 (American Society for Microbiology, Washington DC 20005, 1994).


[0205] 32. Honer Zu Bentrup, K., Miczak, A., Swenson, D. L. & Russell, D. G. Characterization of activity and expression of isocitrate lyase in Mycobacterium avium and Mycobacterium tuberculosis. Journal of Bacteriology 181,7161-7167 (1999).


[0206] 33. McKinney, J. D., et al. Persistence of Mycobacterium tuberculosis in macrophages and mice requires the glyoxylate shunt enzyme isocitrate lyase. Nature 406, 735-738 (2000).


[0207] 34. Wheeler, P. R. Oxidation of carbon sources through the tricarboxylic acid cycle in Mycobacterium leprae grown in armadillo liver. J Gen Microbiol 130, 381-9 (1984).


[0208] 35. Ratledge, C. R. in The Biology of the Mycobacteria (eds. Ratledge, C. & Stanford, J.) 53-94 (Academic Press Limited, San Diego, Calif., 1982).


[0209] 36. De Voss, J. J., et al. The salicylate-derived mycobactin siderophores of Mycobacterium tuberculosis are essential for growth in macrophages. Proc. Natl.Acad. Sci. USA 97, 1252-7, (2000).


[0210] 37. Quadri, L. E., Sello, J., Keating, T. A., Weinreb, P. H. & Walsh, C. T. Identification of a Mycobacterium tuberculosis gene cluster encoding the biosynthetic enzymes for assembly of the virulence-conferring siderophore mycobactin. Chemistry & Biology 5, 631-45 (1998).


[0211] 38. Hall, R. M. & Wheeler, P. R. Exochelin-mediated iron uptake into Mycobacteriurn leprae. Int J Lepr Other Mycobact Dis 51, 490-4 (1983).


[0212] 39. Makui, H., et al. Identification of the Escherichia coli k-12 Nramp orthologue (MntH) as a selective divalent metal ion transporter. Molecular Microbiology 35, 1065-1078 (2000).


[0213] 40. Shimoji, Y., Ng, V., Matsumura, K., Fischetti, V. A. & Rambukkana, A. A 21-kDa surface protein of Mycobacterium leprae binds peripheral nerve laminin-2 and mediates Schwann cell invasion. Proceedings of the National Academy of Sciences of the United States ofAmerica 96,9857-62 (1999).


[0214] 41. Rambukkana, A., et al. Role of alpha-dystroglycan as a Schwann cell receptor for Mycobacterium leprae. 282 2076-9 (1998).


[0215] 42. Rambukkana, A., Salzer, J. L., Yurchenco, P. D. & Tuomanen, E. I. Neural targeting of Mycobacterium leprae mediated by the G domain of the laminin-α2 chain. Cell 88,811-821 (1997).


[0216] 43. Arruda, S., Bomfim, G., Knights, R., Huima-Byron, T. & Riley, L. W. Cloning of an M. tuberculosis DNA fragment associated with entry and survival inside cells. Science 261, 1454-1457 (1993).


[0217] 44. Eigimeier, K., Fsihi, H., Heym, B. & Cole, S. T. On the catalase-peroxidase gene, katG,of Mycobacterium leprae and the implications for treatment of leprosy with isoniazid. FEMS Microbiol. Lett. 149, 273-278 (1997).


[0218] 45. Eiglmeier, K., Honoré, N., Woods, S. A., Caudron, B. & Cole, S. T. Use of an ordered cosmid library to deduce the genomic organisation of Mycobacterium leprae Mol. Microbiol. 7,197-206 (1993).


[0219] 46. Smith, D. R., et al. Multiplex sequencing of 1.5 Mb of the Mycobacterium leprae genome. Genome Research 7, 802-819 (1997).


[0220] 47. Bonfield, J. K., Smith, K. F. & Staden, R. A new DNA sequence assembly program. Nucleic Acids Res. 24, 4992-4999 (1995).


[0221] 48. Altschul, S. F., Boguski, M. S., Gish, W. & Wooton, J. C. Issues in searching molecular sequence databases. Nature Genet. 6,119-129 (1994).


[0222] 49. Parkhill, J., et al. Complete DNA sequence of a serogroup A strain of Neisseria meningitidis Z249 1. Nature 404,502-505 (2000).


[0223] 50. Rutherford, K, et al Artemis: sequence visualization and annotation. Bioinformatics 16, 944-945 (2000).


[0224] 51. Andersen, P. Host responses and antigens involved in protective immunity to Mycobacterium tuberculosis. Scandinavian Journal of Immunology, 45, 115-31 (1997).


[0225] 52. Pugsley, A. P. The complete general secretory pathway in gram-negative bacteria. Microbiological Reviews, 57, 50-108 (1993).


[0226] 53. Stanley, N. R., Palmer, T. & Berks, B. C. The twin arginine consensus motif of Tat signal peptides is involved in Sec-independent protein targeting in Escherichia coli. J. Biol. Chem., 275, 11591-61 (2000).


[0227] 54. Berks, B. C., Sargent, F. & Palmer, T. The Tat protein export pathway. Mol. Microbiol., 35, 260-74 (2000).


[0228] 55. Lalvani, A., et al. Human cytolytic and interferon gamma-secreting CD8+ T lymphocytes specific for Mycobacterium tuberculosis. Proceedings of the National Academy of Sciences of the United States of America, 95, 270-5 (1998).


[0229] 56. Pollock, J. M. & Andersen, P. The potential of the ESAT-6 antigen secreted by virulent mycobacteria for specific diagnosis of tuberculosis. J. Inf. Dis., 175, 1251-1254 (1997).


[0230] 57. Gordon, S. V., et al. Identification of variable regions in the genomes of tubercle bacilli using bacterial artificial chromosome arrays. Molec. Microbiol., 32, 643-656 (1999).


[0231] 58. Harboe, M., Oettinger, T., Wiker, H. G, Rosenkrands, I & Andersen, P. Evidence for occurrence of the ESAT-6 protein in Mycobacterium tuberculosis and virulent Mycobacterium bovis and for its absence in Mycobacterium bovid BVG. Infection Immun., 64 16-22 (1996).


[0232] 59. Mahairas, G. G., Sabo, P. J., Hickey, M. J., Singh, D.C> & Stover, C. K. Molecular analysis of genetic differences between Mycobacterium bovis BCG and virulent M. bovis, J. Bacteriol., 178, 1274-1282 (1996).


[0233] 60. Brandt, L., Elhay, M., Rosenkrands, I., Lindblad, E. B. & Andersen, P. ESAT-6 subunit vaccination against Mycobacterium tuberculosis. Infect Immun., 68, 791-795 (2000).


[0234] 61. Tekaia, F., et al. Analysis of the proteome of Mycobacterium tuberculosis in silico. Tubercle Lung Disease, 79, 329-342 (1999).


[0235] 62. Berks, B. C., et al. A novel protein transport involved in the biogenesis of bacterial electron transfer chains. Biochim. Biophys. Acta., 1459, 325-330 (2000).
4M. leprae proteins that are potential targets for the diagnosis,prophylaxis or treatment of mycobacterioses.>BL;ML0007, ML.tab 11195:12106 forward MW:32204VTSPNESRAFNAADDLIGDGSVERAGLHRATSVPGESSEGLQRGHSPEPNDSPPWQRGSARASQSGYRPSDPLTTTRQSN(SEQ ID NO:1)PAPGANVRSNRFISGMTAPALSGQLPKKNNSTQALEPVLMSNEVPFTESYASELPDLSGPVQRTVPCKPSPDRGSSTPRMGRLEITKVRGTGEIRSQISRRSHGPVRASMQIRRIDPWSMLKVSLLLSVALFFVWMIAVAFLYLLLGGMGVWAKLNSNVGDLLNNTGGNSGELVSNSTIFGCAVLVGLVNIVLMTTMAAIAAFVYNLSSDLVGGVEVTLADLD>BL;ML0012, ML.tab 16566:16979 reverse MW:14690MQQTAWGPRTARIAGCGGAGIVIAIACSTLDIDTPGFMLTGIAALGLILFAGLSWRARPKLAINPDGLAVQGWFRTRLFG(SEQ ID NO:2)PADIKIIRITEFRRFGRKVRLLEIEAINGDLVILSRWDLGTGPLEVLDALITAGYAG>BL;ML0013, ML.tab 17134:17415 reverse MW:10464MPKSKVRKKNDFTITSVSRTPVKVKVGPSSVWFVTLFVGLMLIGLVWLMVFQLAALGTQAPTALHWMAQLGPWNYAIAFA(SEQ ID NO:3)FMITGLLLTMRWH>BL;ML0022, ML.tab 27173:28639 reverse MW:52748MDNQKELIQRIERKLESSIDDAFARMFGGSIVPQEVEALLRREASDGVRSLQGNRLLAPNEYIITLGVHDLEKNKADPDL(SEQ ID NO:4)TSSAFASDLADYINEQGWQTYGDVVVRFDQSSSLHTGQIRARSVVNPDVEPRPTVNDPVRTQSNQAFSAEPGVPPMTDNSSYRGGQGQGRPSDDYYGRPQDDPRGADPQGGQDPRGCYPPKPGSYPQQAGHPPLHRPDQGGYPGQGGYEDQRAYHDQGQGGYPSPYEQRPATPGGYGSQGHDQGYRPGSYGPPSGGQPGYGGYGDYGRGPARPDEGSYTPSGFPAPPEQRVAYPDQGGGYDQGYQHSGLGYGREDYGRQEYTQYAENLPGGVYAPSSGGYAEPAGRDYDYGQPGAANDYSQPVIGGYGGYGALGSAVILQLDDGSGRTYQLREGSNIVGRGQDAQFRLPDTGVSRRHLEIRWDGQVALLSDLNSTNGTTVNNAPVQEWQLADGDVIRLGHSEIIVRIH>BL;ML0030, ML.tab 34750:35091 reverse MW:11383MLIAGTLCVCAAVISAVFGTWALIHNQTVDPTQLAMRAMAPPQLAAAIMLAAGGVVALVAVAHTALIVVAVCVTGAVGTL(SEQ ID NO:5)AAGSWQSARYTLRRRATATSCGKNCAGCILSCR>BL;ML0031, ML.tab 35287:36123 reverse MW:28788MIQSTQTWRVLAGGLAATAMGVTVFAGGTAAADPSPPAPPPAIPGVLPPASLPPIQSVTAVPGGITTNNRFVATPQAPGP(SEQ ID NO:6)AALGQPPLAVAAPVSESLHDYFKAKNIKLVAQKPHGFKALDITLPVPTRWTQVPDPNVPDAFAVIADRLGNSLYTSNAQLVVYNLVGNFDPKEAITHGFVDTQQLSAWQTTNASKADFDGFPSSIIEGTYRENGMTLNTSRRHVIASSGPDKYLVSLSVTTALSQAVADAPATNAIVNGFRVSSPTVSAPVPPQLGTR>BL;ML0042, ML.tab 51993:53396 forward MW:51453MRNPVWLRFSMGRALLVTALVPPCIILFFHTQYWWAGIALVVLVVILTLVEFSGRWLSGWLMALYSFFRRSSKPLDTPSE(SEQ ID NO:7)PVIGATVKPADQVIMRWQDGFLVSVVELIPRPFTPTVIVDGEAQTDDLLETQLLEHLLSVHCPDLEAVVVSAGYRVGHVASLDVVNLYQQVIGADPAPAHRRTWIMLRADPVRTRKSAQRRDAGVAGLARYLIASTTRIADQLASHGVDAVCGHSFESVDHATDVGFMQEKWSMMRGQNAYSVAYTAPAGPDAWWSARADHTITRVWVAPGKTPQATVVLTTLGKPKTPCGFYRLHGAQQPALLGRSFVAYQHCQMPIGSAGVLVGETVNRCSVYMPFDDVDVSVSLGDVQTFTQFVVRAAAAGGIVTLGQQFEKFARMIGGQIGSVAKVAWPNATTYLDPYPGSERVILKHDIIGTPRHRKLPIRRISPPEEGHYQMVLPKSSYEL>BL;ML0044, ML.tab 54698:55039 forward MW:12157MDLDPTQAQTMALLGQFQSALDEQCNRMTDGVFKASDQEKTVEVTINGYQWLTGIRIESGALREFGHAVVADRINEALQN(SEQ ID NO:8)AQGVATAYNEVSGEQLAARLSALSCSIGEPPPT>BL;ML0047, ML.tab 58020:59558 reverse MW:54486LSAPAVTAGPATAGITPARPSATRVTILTGKRMTDLVLPSTVSIEAYIDETVAVLSDLLEDAPADVLAGFDFSAQGVWTF(SEQ ID NO:9)ARPGSPPMKLDQSLDDAGVVDGSLLTLVSTSRTERYRPLVEDVIDAIAVLNESPEFNRKAVDRFIGVAIPVLSLPITAVAVWAWWVTGRSPFWSLAIGILSIVALTGSIVAEKFYKNLDLSESLLLTSYPLIASAAALVTPLPNGVDSLGPPQVAAAAAAVLFLTLLTRGGARRHSGYASFTAITTIAIVVIAIAYGFGYQHWVPTGAVAFGLFIVTNAAKLTVAVARIALPPIPVPGETVDNEELLDPVVTPHEATHEETPTWQAIIASVPDSAVRLTERSSLAKRLLIGYVISGTLILCSGAIAVIVRGHFFAHSLVVAFLLTVVCTFRSRLYAERWCAWALLAAAVVIPTGLTVKLCIWYTQIAWLLLTSYLVAAIIALMVFGATVRVRRVSPVTKRIMELIDGAVVASIIPLLLWIAGVYDMVRNLSF>BL;ML0048, ML.tab 59555:61315 reverse MW:63226MAADYDKLFRLDDGAYASPDQAAEQLFDDAPLYPPPIIPTCTTTPNGEVASPMPDWSEQLPPNPPAASKSPLPPMPIGSS(SEQ ID NO:10)VQPPPASSESPRAPMPVSAPPRSPAASLMPISEPPQWPPAEAPEHQFAKAEPPSVPIPINEPSPAKPATPMPMTPIDGSQRTPVTSPEPSLAEFEAQPPATPKPSLLPRPMSSPPEAPRPSANQHSRHARRGHHHRDETQQANPASATEPMIAPRARTAELRQAPHAAAEPAPTQHLTRPDGLVSHRTALHDSTATSAIGVQTGRSTGAKKPSKVVAKRGWRHWVHTVTRINLGLSPDERYELDLRTRVRRPPRGSYQIGILGLKGGAGKTTVTVTLGSMFARVRNDRILVVDADTSCGNLADRAGRFSEANIADLLADKDVKSYNDIRTHTSVNAVNLEVLPAAEYSTAQHALSGEDWNFAAATVSKYYNVMLADCGVGLFDPVTRGVLSTASGVVIVTSTSVDAARQAAIALDWLRHNGYQDLLSRACVVINHVMPKEPNIASKDLVQQFEQQIQPGRVVVLPWDKHIAAGTEIRLORLDPLYRRRILELAAALSDDFERAGRH>BL;ML0049, ML.tab 61406:61693 reverse MW:10464MIQAWHFPALQGAVNELQGSQSRIDALLEQCQESLTKLQSSWHGSGNESYSSVQRRFNQNTEGINHALGDLVQAINHSAE(SEQ ID NO:11)TMQQTEAGVMSMFTG>BL;ML0050, ML.tab 61720:62022 reverse MW:10964MAEMITEAAILTQQAAQFDQIASGLSQERNFVDSIGQSFQNTWEGQAASAALGALGRFDEAMQDQIRQLESIVDKLNRSG(SEQ ID NO:12)GNYTKTDDEANQLLSSKNNF>BL;ML0051, ML.tab 62201:63109 reverse MW:32135MTWPMLWPASVPSECPPNYWHTPAPSAKCEPEQAAVAPIAAAKPMITWLQSAAEQTTTQAEAHRQAMASTPGMAVITENH(SEQ ID NO:13)ITQAILATINFFGINMAPIAFTEAGDFICMRTQTALAMNSYQAETLLNTAFQKLEPMAAILNPSSYSPPSALTSQVNQFTQMISGFSAALPSTQVLQQTVGQVAELARPMQQVKSLFTSIDSTGVYTSAQRGDTESAHRIGLFGASTLSSHPLVGITGTTTDTRLLCAESLPSASGSLAWTPLMTQFQLIDKSIAPEPRQRVMLPPWAAGSPGHNAQDGGTT>BL;ML00S4, ML.tab 67417:68862 reverse MW:51875MGLRLTTKVQVSGWRFLLRRVEHAIVRRDTRMFDDPLQFYSRSIALGIVVAVALLIGAVLLAYFKPQGKLGASTLLTDRA(SEQ ID NO:14)TNQLYVLLSGHLYPVYNLTSARLALGKPANPAAVKSSELTKLPIGQTIGIPGAPYATPVSGDSSSTWTLCDTVAKIESESPAVQTSVIARSLQIDPAINPLQPNEALLASYRDKTWLVNSKGRHSIDLADRALTSAIGISVNVKPTPLSEGLFNALPDVGRWELPTIPDTGAPNSLGLSPDLVIGSVFQIQMEKSPQYYVVLSDGIAQVNATTADALRATQSHGLVAPPSLVPNVVVQIPERVYDSPLPDEPLKMVARDDYPTLCWAWERKASDQAPKRTMLIGQHLPLQPSANSTGIKQIRGTATVYIDGGRYVALQSPDPRYSESLYYVDPEGVRYGLANSEAVKALGLASPQTAPWVIVRLLVEGPVLSRDAALLEHDTLIADPSPRKVPAGYSGVRP>BL;ML0056, ML.tab 70584:71093 reverse MW:18452VNLLGNDDDNHLASLDFYSANRYYEESLFFDELDGYAPTTPVGIEANDLDVFQSLTEPEEELEVELLAVTNPAKSVSALM(SEQ ID NO:15)NGRVHQVELTDQVTRIGEKKPATEAFVLASLARQKARTSQGTCILDSLQGGGENTTARCELVGLTLNLPTSEQAAAEAEMFSNNILRQK>BL;ML0068, ML.tab 89620:90336 reverse MW:26088MGLFHKRRSRAMRRAEARAIKARAKLEARLAAKNEARRLNSAQRATNKALKAQLKAKRNSDRVALKVAETELKAAKECKL(SEQ ID NO:16)LSPTRIRRVLTVSRLLAPIVVPLIYRAAIATRALIDQRRADQLGIPLAQIGQFSGPSARLSARIARSEQSVLLVQEKKPKDAETKQFVSTITERLIDLSAAVAAVENMPATRRRAVHSTISSQLDGIEADLMARLGVDLTTSMADNRSVADSTRKAAT>BL;ML0069, ML.tab 90521:90919 reverse MW:14807MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPYLSQLRSGNRTNPSSATISALANFFRIKPAYFTNDEY(SEQ ID NO:17)YEKLDQELAWLATMRDEGVRRIAMRTIGLSAQAQQDIVDRVDELRRAEHLDV>BL;ML0071, ML.tab 91913:92446 reverse MW:18446METGSGLPIGVVPFHARGALKGFVISGRWPDSTKEWAQLLMVAVRIASLPGLLSTTTVFGAREELPDEPEPGTVGLVLAE(SEQ ID NO:18)GTVFGESAIQPGYFADHQPPALLMLHPPSETMPSLPECTGAASGCVLLPGLPYLGLEHRAAWVEAEADGTITSMVSRVGVDPISHPDTAILAMLLAA>BL;ML0073, ML.tab 94092:95126 forward MW:38124VIQVCSQCGTRWNVRERRREWCLRCRGALIAPQAEMPTAVKQWSSHVGLPAGIVPEALGWQRTPPGFRWIAVRPGAAPPN(SEQ ID NO:19)RRRQRHHVPTPRYAVMPRWGLADRVDQDSTWIQAPLKPGPSSAKVRTTLFVAVLVFTLAALVYVVRYVLLVINRNTLLNFGVAAIADWLGVIASLAAIAATLACVMALSRWLIARRAAAYAHHAVPEQRSSWELRAGCLLPLVNLLWAPVYVIELALMENHYTQLQKPIFMWWIVWVFSYVISVVAVVTSWAKDAQGIANNTVAMVFAYLFAAAAVAAVARVFEGFECKSIKRPVHRWVVVHLGGSVVHPSPGSVELEGQEPAA>BL;ML0081, ML.tab 100648:102000 reverse MW:48150MLDAPEEEPALADDLTGEDEQPPVEFQWPSTLQARATRRGLLLTALGGLLIGGLVTAIPTVGTGSGLLATYIDSNPVPST(SEQ ID NO:20)GAKSNVAFNRATNGDCLMWPDSTPHTAVIVNCADDHRFEVAESIDMRTFPGSEYGPNAAPPSPARIQQISEEQCETAVRRYLGTKFDPNSKYTISMLWSGDRAWRQSGERRMLCGLQLPGVNNQQVAFKGKVANIDQSKVWPAGTCLSIDLTTNQPIDIPVDCVAPHAMEVTGTVNLADKFPNALPAASEQDTFIKDACTRLTDIYLAAIELRTTTLTLIYPTLSLSSWAAGSRKVACSIGATMGNGGWATLVNSAKGPLLINNQPPTPPPDIPEERLGMPPIPLHHLQVPNSQSNVPVNPIPPGNQQHRKQQPIVTVPQSPASTAPAASVSPAKHLPKARHTRQMSKRRGRPISPHPWAGRASPGGLAE>BL;ML0091, ML.tab 113153:113863 reverse MW:23752VPNRRRCKLSTAISTVATLAIASPCAYFLVYEPTASAKPAAKHYEFKQAASIADLPGEVLDAISQGLSQFGINLPPVPSL(SEQ ID NO:21)TGTDDPGNGLRTPGLTSPDLTNQELGTPVLTAPGTGLTPPVTGSPICTAPDLNLGGTCPSEVPITTPISLDPGTDGTYPILGDPSTLGGTSPISTSSGELVNDLLKVANQLGASQVMDLIKGVVMPAVMQGVQNGNVAGDLSGSVTPAAISLIPVT>BL;ML0093, ML.tab 115371:117302 forward MW:72371MTAVSLLARVILPRPGDPLDVRKLYLVESITNARRAHALSPTTLQIGAESEVSFATYFNAFPASYWRRWTICKSVVLRVE(SEQ ID NO:22)VTGAGRVDVYRTKANGARIFVEGREFAGVEDDASKAQVVELEVGLQPFEDGGWIWFDITAETRVTLCSGGWYATSPAPGRANIAVGIPTFNRPADCTNALAELTADPLVDEVIGAVIVPDQGVRKVRDHPDFPEAAARLGDRLSIHDQPNLGGSGGYSRVMYEALKNTDCQQILFMDDDIRIEPDSILRVLAMHRFAKSPMLVGGQMLSLQEPSHLHIMGEVVNRSNFIWTAAPHAEYDHDFVEYPLNDKEDKSKLLHRRIDVDYNGWWTCMIPRQVAEELGQPLPLFIKWDDADYGLRAAEHGYPTVTLPGAAIWHMAWSDKDDAIDWQAYFHLRNRLVVAAMHWDGDVTGLVRSHLKATLKHLACLEYSTVAIQNKAIDDFLAGPDHIFSILESALPEVHRMRKEYPDAVVLPAATELPPPVHKNKVMKPPENPLSIVYRLLRGIFHNLTAADPECHKRPEFNIPTQDARWFRLCTVDGTTVTTADGCGVVYRQRDRAKNFLLLFSSLHRQLQLARRFDELRKIYRDALPVLSSKQKWEMALLPLPDSPTRFPAEQEPEHA>BL;ML0094, ML.tab 117295:117873 forward MW:19928MPEDKAPTGELAAIAAVQSVLVDRPGVLPTARGMSHFGEHSIGWLAISLLGAILVPCRRRYWLVAGAGVFAAHVAAVLIK(SEQ ID NO:23)RMVRRIRPNHPAVTVNVGTPSPLSFPSAHATSTAAAAILIGRASRLPKGIVAAVLVAPMALSRIVLGVHYPSDVAFGVVLGAAVAGTTARFDSRLSRRWTVQHGLSSGSAVK>BL;ML0096, ML.tab 118819:120768 forward MW:71091MRRRMVSRQVGWPLFPYHIVVRVSLWASVLLVAALFGWGAWQRRWIADDGLIVLRTVRNLLAGNGPVFNQGERVEANTST(SEQ ID NO:24)VWTYLLYAGSWVGGPMRFEYVALAAALVLSVLGMVLLMLGTGRLYAPSLQGRQAIMLPAGALVYVALPPARDFATSGLESGLVLTYLGLLWWMMVCWAQPLRNRSQSRRFIGALAFVAGCSVLVRPELALMGGSALIMLMIAARTCWLRALIVVAGGSLPVAYQLFRMGYYGLLVPGTALAKDAAGDKWSQGIIYLSNFNQPYVLWVPLVLLVLLGLLLMLIHRWPSFMHPLETPDSGRVARAVQSPPAVVVFVVFSGLLQAFYWIRQGGDFMHGRVLLAPLFCLLAPVVVIPVVISEGADFSRQTGNWLAGVTSLLWLGVAGWSLWAANSPGMGDDATNVSYSGIVDERRFYAQATGHAHPLTAADYLGYPRMAAVLVALNNTPDGALLLPSGNYIKWDLVPMIQLSPSSPGSPPDSLVSQKPQHTVFFTNLGMLGMNVGLDVRVIDQIGLANPLAQHTERLQHGRIGHDKNLFPDWVIADGPWVKWYPGIPGYLDQAWIAQAVAALQCSGTQAVLSSVRAPMALHRFISNLLNSFEFTRYRFDRVPLYELVRCGLPVPDVLPATPPE>BL;ML0099, ML.tab 123370:124380 forward MW:35611MVEKVPRKRHRVLAWTAALSMAAVVALAIVAVVILLRSAESPRSSLPPGVLPIPSTAPHPRKPRPAFQDVSCPDVQLLVV(SEQ ID NO:25)PGTWESSLQDNPLDPVQFPDALLRNSTMTIGQQFPTSRVQTYTIPYTAQFHNPLSGDKQMTYNDSRAEGTRAMVQEMINVNNKCPLTSYVLVGFSQGAVIAGDITSDIGNGHGPVDDDLVLGVTLIADGRRQQGVGNDIGPNPPGEGAEVTLHEVPVLSGLGMTMTCARPGGFGVLHSRTNEICAPGDLICAAPAEAFSVANLPATLNTLASGAGQPIHANYATAQFWDLDGAPATVWTLNWVHRLIEGAPHPKHG>BL;ML0107, ML.tab 147460:149358 reverse MW:68650MRNALASFGQIVLAAVVASGVAAVSLIAIARVHWPAFPSSNQLHALTTVGQVGCLTGLLAVGGVWQAGRFRRLAQLGGLV(SEQ ID NO:26)FVSAFTVVTLGMPLGATKLYLFGISVDQQFRTEYLTRLTDSAALQDMTYLGLPPFYPPGWFWIGGRVAALTGTPAWEIFKPWAITSITIAVAITLVLWWQMIRFEYALLVTIATAAVTLVYSSPEPYAAMITVLLPPALVLTWSGLRAAEREADRTLGNKRGWATVVGAGIFLGFAATWYTLLLAYTAFTVVLMTLLLATALCRRAGFRATFDPLRRLAGIVVIAAAIGAITWLPFLARAAHDPVSDTGSAQHYLPADGAELAFPMLQFSLLGMICMLGTLWLIVRTSSSVRASALMISVLAVYLWSLLSILTTLARTTLLSFRLQPTLTVLLVTAGVFGFIETAQSLAKHNRAVLSVASAIGLAGAIAFSQDIPNVLRPDLTIAYTDTDGHGQRGDRRPPGSEKYYWAIDEAVLHITGKPRDQTVVLTADYSFLAYYPYWGFQGLTSHYANPLAQFDLRAAQIQQWSRLTTASELIHALDTLPWPPPTVFVMRHGAGNTYTLRLAKNVYPNQPNVRRYTVDLPAALFADQRFAVQDIGPFVLAIRKPMGNA>BL;ML0115, ML.tab 155413:155937 reverse MW:19053LNNDNDDSIEIIGGVDPRTMATRGEDESRDSDEPSLTDLVEQPAKVMRIGTMIKQLLEEVRAAPLDEASRNQLREIHATS(SEQ ID NO:27)IRELEDGLAPELREELDRLTLPFNESTAPSNAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARAQLEQMRNSALPPGMGKPGQAGGQGTGQYL>BL;ML0116, ML.tab 155965:157929 reverse MW:70767VGLCCGTLIALFLLIVPETIVARFAALTWPIAIAVSPALTYGVIALVIIPFGAVGIPWNSWTALAALVAVSMLMIAFRLL(SEQ ID NO:28)LVRYRDTAAETRGISGWPAVTVAVGVLLGALLIGWAAYRGLLHWQSIPSTWDAVWHANTVRFILDTGQASPTHMGELRNVETHSVLYYPSVLHALAGVYCQLTGAAPTTGYTVSSLAVAVWLFPVSAATLTWHLLRPVTTQKRAAGASATAAALSAAFTSVPYVEFGVAANPNLAAYGVAVPTMVLITSTLRHRDRIPVAILALVGTFSVHLTGGIVVSLFLLGWWLMNALLHPVRSRAADARTLAAVVMPTALILAPQFIAVLNQADIIAGHSFPSFKSVKQGVIDALLLHTRHLNDFPIQYGLVVLAAIGMAILLYQKIWWPSIAWLVLTVATVYSAAPFRGPIGSAIESFSQFFYNDPRRLSAVVTMLLTPMAGIALFAGVLLLVVGARRVTARFTALPRPVWTTATVVLLVAATVLTAWHYLFRHLVLFGDKYDSVMVNQKDLDAMSYLATLPGAHNTIIGNSNTDGSSWMYAVADLHPLWTHYDFPQQTGPGYFRYAFWAYARTGNPWVVEAVRVFNIRYILTTSPTVQGFAIPDGLVSLEESKSWTKIYDNGAARIFEWSGNATATRA>BL;ML0124, ML.tab 166949:167389 reverse MW:16448MPVLSKTVEIDTDTATIMAIVTDFESYPQWHEWIKGVWVLAHYDDGRPSQLRIDINFQGMQGTYIQAVYYPGVNQIQTVM(SEQ ID NO:29)QOGOLYSKQEQLFSVTQAEGGSVLTVDLDVELTMPVPAPMVKNLLNTALDRLAEKLKLYAEHLAPS>BL;ML0133, ML.tab 179256:179888 forward MW:24045MTNRTLSREEIRKLDRDLRILVATNGTLTRVLNVVANEEIVVDIINQQLLDVAPKIPELENLKIGRILQRDILLKGQKSG(SEQ ID NO:30)ILFVAAESLIVIDLLPTAITTYLTKTHHPIGEIMAASRIETYKEDAQVWIGDLPCWLADYGYWDLPKRAVGRRYRIIAGGQPVIITTEYFLRSVFQDTPREELDRCQYSNDIDTRSGDRFVLHGRVFKNL>BL;ML0151, ML.tab 213175:213492 reverse MW:11830MRPEPPHHENAELTEMNTEVVEAPLLTDIEELREEIDRLDAQILATVKRRAEVSQAIGKVRMASGGTRLVHSREMKVIER(SEQ ID NO:31)YSELGPDGKDLAILLLRLGRGRLGH>BL;ML0158, ML.tab 221657:222601 forward MW:31355LHCSSGAVTALEITGGVNTYLPGSPGYPLVQPAGSYPGATPSFVKSDVGESQLYHYLTIAVVVLGLAVYLGNFGPTFTSS(SEQ ID NO:32)SDIGPGSGGFAGDAGTAVVVALLAALLAGLDLLPKAKSSAGVVGAIAVLGALLAISEMINMPAGFSIGWANWFILVCSVLQAIAAVAALLLEAGIITAPAPRLSYDPYLQYGQYGAQSYYGQPNRQLQVGLNAHSPQQSPAGYGAQYGAYTSSPTQIQAGMPATGGFSAQHSAQQGPSTPPTGFPSFSPPPSVGAAAGSQAGSAPVSYSNPTDSKQGFGQGRESTSSSSGSAPV>BL;ML0159, ML.tab 222650:223942 forward MW:44093VGDNRAAGVRQARDLVKVAFGPAVVALAIIAAITLLQLLIANSDMTGALGAIASMWLGVHQVPIAIGGRELSIMPLLPVL(SEQ ID NO:33)LMVWATAHSTSQATSAYSSWLVIRWVVASALGGPLLIAAISLAVIHDASSVLTELQTPKALRAFTGVLVVHAIGAAIGVNSRVGRRVLTASRLPDWVGDSVHAATAGVLALLGLSGLVTAGSLVVHWATMQEFYGITDSIFGQFSLTVLSVLYAPNVIVGTSAVAVGSSAHLGFATFSSFTVFGGDIPALPVLAAAPTPPLAPVWVALLIVGAASGVAVGQQCTRHPLPLLAALAKLLVAAATGALMMALLGYAGSGRLGNFGDIDVDQGALVVGVFFWFAVVGWVTVVVACGIKRFPRHLKPPPALSSEEHADASSKDHEAYFGVDLNVPFDLSGEDEIPKAEPGEAAD>BL;ML0169, ML.tab 234932:235534 forward MW:22082MSNSAQRDAKGARDEPLRAADTDRIQIAQLLAYAAEQGRLELKDYEDRLAKAYAATTYQELEQLRDDLPGSQVSARRGGN(SEQ ID NO:34)PNPAPSTLLLALMSGFERRGRWNVPRKLTTFSLWGSGVLDLRYADFTSTEVELHAYSVMGVQTILLPPEVNVEISGHGVMGSFDRQVRGQGTPGAPTVKIRGFSLWGGVGIKRKARRPRR>BL;ML0185, ML.tab 250221:251249 forward MW:38250MPSIPQSLLWISLVVLWLFVLVPMLISKRDVVRRISDVALATRVLNGVAGARLLKRGGPATGHRSDHNWELDEDWRQNPV(SEQ ID NO:35)DGEFADADQDIGEEQDQNVDDTQRTRPVVMEVAVAELTGTDYLDVDVVEDSVALPIEDSADVTESVLLAVGEGGSPGEEAEAEQRQSDRYGYVDASSGLGLEQKDDKSPVPVAPTVSRQRRYDTKTATAVSARKYAFRKRVLMVMAIILVGSAAAAFEVDSNAWWICGSSTTVTVLYLAYLRRQTRIEEKVRSRRMHRIARARVDVENAHDREFDVVPSRLRRPGAVVLEIDDEDPIFEHLDYEMPIRTFGWPRDLPRAVGQ>BL;ML0187, ML.tab 252658:253719 forward MW:38375VAGSWQCGHCESCASPLGPRDIAVVELIADRAEEFAAMDIFRGLPAEDLMSVAVSVEPVLAAAGEVLMQQGEQAVSFLLI(SEQ ID NO:36)SSGNVEVRRVDDDGAVIVGQASHGMIIGEIALLRDGRRTATVITTEPLTGWVGDIDAFAQMVQIPSITRRLLLTVRQRLAAFITPIPVQLRDGTHLMLRPVLPGDTERSLRGHVRFSRETLYLRFMSARAPSDELMHYLSEVDYVDHFVWVVTDGGDPVADARFVRDESDPTLAEIAFTVADAYQGRGVGNFLISALSIAAHVNGVNRFSARMLTDNGPMRAIMDHHGAVWRRYDVGVITTVIDVPRQRDLIIGRAMADQIAGVVRQVIGAVG>BL;ML0190, ML.tab 255125:255742 reverse MW:22988MCDMLVDVGIAFQGSLFEYHERRQLGDGAFIELRSGWLTDGVELLDTLLSEVPWRIERRRMYDKVVNVPRLVSFHDLTTD(SEQ ID NO:37)DPPHPLLTRLRRRLNDIYAGELGEPFTSVGLCCYRDGSDSIAWHGDTIGRNSSEDTMVAIISLGATRVFALRKRGGGPSLRLPLTHGDLLVMGGSCQRTWEHSVPKTSASTGPRVSIQFRPRNVH>BL;ML0199, ML.tab 265213:265815 forward MW:21217VSQLSFFTAESLLPAIADLAGVLAASGQIVVVSASGQSPAPAARLSVVVDQLWRASALAEMISEAGLVPEISRTEEDTPL(SEQ ID NO:38)VRTAVDPLLCPIAAEWTRGAVKTVPPRWLPGPRELRAWILAAGVPEAANRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRSGRPALRISGRRRLSRLLENVGEPPDWAEALALWPRV>BL;ML0208, ML.tab 279784:280125 forward MW:12884MNWIQVLLIGSIIVLLIYLLRSRRNVRSRAWVKVGYIAFVLGGVYAVLRPNDTTVVAHWFGVCRGTDLMLYALIMAFSFT(SEQ ID NO:39)TLSIYIRFKDLELRYACLARVVALEGARAPEPF>BL;ML0227, ML.tab 298516:298992 forward MW:17121MGPTRKRDLTAANIGAAVVGYLLVLVLYRWFPPITVWTGLSLLAVAIPEALWARYVRTKISDGEIGDGPGWLHPLAVAHS(SEQ ID NO:40)LMVAKASAWVGALVLGWWVGVLVYFLPRWPWLRVADKDTSGTVVAALSALALLVAALWLQHCCKSPQDPTEHGEGAEN>BL;ML0229, ML.tab 300537:301466 forward MW:32734MVQFDGLRSARLNIAILSTGRVGVALERADQVVVACSAVSHASRQWVQFRLPETSVASPPEVASSAELLLLAVPDCEFAG(SEQ ID NO:41)LMSGVAVTSVPRPGTIVAHTSWANGVGILAQLGKDGCIPLAIHPANMFSGSDEDLSQCQLRDTYFGITKTDDVGYAIAQSLVLEMGGEPFCVVEYARILYHSVSPHVGNNIVTVLADALEVRRSALRGSELLGLGVPPACRGEVVDDQLDVIVERIVGSLARAACENTLQRGQAGLTKLVARGDLDALAGHLVALMRIGPELAQAYRVNALRKTQRAHAPYDVVEALAP>BL;ML0256, ML.tab 335129:335812 forward MW:24402MSEAKRLDPKRRSPASRPGKAGDSVRGRRSTKPVAKLSVKPSRTTPASSHSGRNSTRMLTQHVVEPIRQSIIESRERRSD(SEQ ID NO:42)QQLGFTARRAAVLAAVVCVLTLTIAGPVRTYFAQHAEIEQLAATEATLRRQIADLEQQKGKLADSAYIAARARERLGFVMPGDVPFQVQLPSTAAVSSQPGGRAAKPANNDPWYTSLWHNIADAPHLPPGAGTPPFSLTPLSTTSGG>BL;ML0257, ML.tab 335805:336308 forward MW:17960VVDRADLEAVARHLGREPRGVLEIAYRCPSGEPGVVKTAPKLDDGTPFPTLYYLTHPVLIAAASRLESTGLMREMTERLG(SEQ ID NO:43)QDPELAAGYRRAHESYLTERDAIESLGTTFSAGGMPDRVKCLHVLIAHSLAKGPGLNSLGDEVLALLAADPKTAATLVAGQWKECDR>BL;ML0271, ML.tab 354849:355220 reverse MW:12992MRLGQALAWLATDIVAVSVFCAVGRCSHAEGLTVADLAVTLWPFLTGTAIGWLASRGWQRPTAVVPTGVVVWLCTVVVGV(SEQ ID NO:44)ALRKASSAGVVANFMVVAASTTAALFLGWRAVVELILRRRSTR>BL;ML0279, ML.tab 361297:361953 reverse MW:24062VTYYSSLRPEDLPPERPKHEHYSGFPEYELANPGVGFRRFVATMRRLQDLAVSADPSDEVWYAAADRAVALVELLGPFAT(SEQ ID NO:45)DEGKAPAGRVPDMPGMGSLLLPPWTLTRSGPDSVEMTGYFTRFHVGFNHAVIIGGVLPLVFDHLFGMISYTAGRSISRTAFLHVDYRKITPIDEPLVMRGRVTRTEGRKAFVSAELVDGDEMLLAEGNGMMVRLLAGQP>BL;ML0281, ML.tab 363432:364121 forward MW:25240LNTSDSAPGVAVLLFGDDRTRQRWNTLTALSTYRAGGPDDIDSIDATIGPYRRLVVVGGDGDLAAVLGRLLRADRLDIEV(SEQ ID NO:46)AYVPHQRTAATRVYRLPTGRRAARRARRGYATRVPLIRDETGSVIVGRADWLPVVDRQPLNGEAIVDDIPLFDGDVAGVRIAPTLANPGLRARLHTSRTGIGIWSRWLTGRAVQLGSTGVAVVRDGVPTRRRERRSTFYRNVEGWMLVR>BL;ML0284, ML.tab 365860:366273 reverse MW:14541MTSMGDLLGPDPILLPDDSAAEVELRANKDPGTVAAAHIPSASVAWAALAEGALADDKATTAYAYARTGYHRGLDQLRCNG(SEQ ID NO:47)WKGFGPVPYSHEPNRGFLRCVAALARAANAIGETDEYRRCLNLLDDCDPAARNELGL>BL;ML0285, ML.tab 366385:367263 forward MW:31593MPNASEPERGITLNRPGLAPRTLDKNVVSNLPEAKDNPANTHGEEIAAGYPLAHSDSETEAMVLTKTEPDQDPGADRQHH(SEQ ID NO:48)ERRFTAPGFDARATAIMATAPDPATEAIHPPLSSSDPPGHLGISPKAAVPQSIPPVLGTKLRSARHFHWGWVVALLMMVLALAAIAILGTVLLTRGKHVKASPAEQVRHAIQSFDVAVQTGNLTALRSITCGTTRDGYVEYDESSWDETYHRVSAAKQYPVIASIDQVVVNGQHAEANITTFMAYDPQVRSTRSLDLQFCDDQWKICQSPSG>BL;ML0298, ML.tab 381423:381647 reverse MW:7851MIVXTVNERPVEVNEQTTVAALLESLGFPASGIAVAVEFSVLPRSYWATKISELPAVTGRSEPIRLEVVTAVQGG(SEQ ID NO:49)>BL;ML0370, ML.tab 461945:462814 reverse MW:30055VRYRRQVAHTRKLLAALSRRGPHRVLRGDLSFAGLPGVVYTPAGGLNLPGVAFGHDWLTGTARYAGLLEHLASWGIVTGA(SEQ ID NO:50)PDTQRGLTPSVLNLAFDLGSALDIVAGVRLGPGNISVHPAKLGLVGHGFGGSAAVLAAAGLPGLAGLPAKSAVAIFPTVTSPAPEQPAATCKVPGLILTAPGDPKTLNSNALSLYRAWDDATLRIVSKAKAGGLVEGWRMTKVVGLAGPHRATQKAVRSLLTGYLLYALGGDKEYRDFADPDMHLPHTVPVDPEAPLVTPEQKIVTLLK>BL;ML0383, ML.tab 476440:477285 forward MW:29301VTSVSIVVEIGHTSAAEPMLAAAAFGNQPGRWPLPTATTPHQLWLRAVAAGGQGHYSSAYRDLAVLRRSVPAGRLASLAH(SEQ ID NO:51)STEGSFLRQLGWHSLARGWDGRALVLAGTDSEARADALIGLAADALGVGRLAAAATLLRRVGSALAPAQLPAQVADRLAVRRRWVAAELAMAVGDGATAVRNAREAVELAQVGRVSVRHQVKSDVVLALCSAATEPRVVAEAALAATGRLGLIPLRWALACLLIDIGSVTFSEPELSELRDVCADQVRRAGGTWRTA>BL;ML0386, ML.tab 480093:480506 reverse MW:15294VRDHLPPGLPPDPFADDPCDPSAALDAVEPGQPLDQQERIAVEADLADLAVYEALLAHKGIRGLVVCCDECQQDHYHDWD(SEQ ID NO:52)MLRANLLQLLIDGTVRPHEPAYDPEPDSYVTWDYCRGYADASLNQATSDADGYRRRH>BL;ML0405, ML.tab 503217:504401 forward MW:40754MSGAFIIDPTLKAIEAWHALLGIGVPNDGGVLYSSLSFFEKALEHLAAAFPGDGWLGSAADKYAGQNRKRVDIFQELAEL(SEQ ID NO:53)DKELIELIHNQANSVQTTRGILDGAKKALLFVRPVAIDLNYIPLVGSVMSASIQAQACAAAMAAVSGGLAYLLVQTAIHTAKFVALLARLAHLLASAVADVVSDGVAIIKGIVDHLWHFIAGALTGLKDIVEKIIHWFFGLFSHWWSRLHSFFGGIPGLSGATSGLSQVTGLFGVPGLAGSSGLLSGESLLSTENLPSLAGVGAGLGLGSLPQLAQLHAASTRQGTRSQAGVSAELSTEQFGGQQEPVSAQGSQGMGGSQGMGGMTPASTKSKKDERKKKKYSEGAAAGTDDAERAPIEVQSGGGKRALAQHVV>BL;ML0406, ML.tab 504459:504779 forward MW:11110MRSMIDNLTVQSEHLNSLASQHENEAACASSGVSAAAGLANAVSTSHGSYCAQFNDTLKMYEDAHRTLGESLHTGGIDLA(SEQ ID NO:54)RVLRVAAANYCDADEICGSDIKSAFG>BL;ML0407, ML.tab 504793:505443 forward MW:24227MGSRRRINRRLLPMSTFPAWQEFRRDVVVVFPGNDFDRDDCDTVDPWGVGGAAHWTIDPIVGFASSSAPQDRGTDVDNTR(SEQ ID NO:55)GQAEEDEKQKEPEVAIFTVTNPPRTVSVSVLMDGRIDHVELSKRVTWMSESQVASEILVLADLARQKAQSAQYTFILDKLSQLADGDEHRVALLRESVGNTWNLPSPEQAAEAEAEVFATRYSDYCPAQDTENDQW>BL;ML0410, ML.tab 508327:508629 forward MW:10951MSFFLRVEVGGLMMAAGRLERITSESMACNAKLTPVTTKVVPPAADQVSKLVSQVFSSYGKQYEGYAAQGVDQSRLFVQS(SEQ ID NO:56)LKDAAGDYMDSDHMYLNTED>BL;ML0411, ML.tab 508755:509981 forward MW:42466MFDFMVYSPEVNAFLMSRGPGSTPLWGAAEAWISLAEQLMEAAQEVSDTIVVAVPASFAGETSDMLASRVSTFVAWLDGN(SEQ ID NO:57)AENAGLIARVLHAVAYAFEEARAGMVPLLTVLGNIIHTMALKAINWFGQVSTTVAALEADYDLMWVQNSTAMTTYRDTVLRETGKMENFEPAPQLVSRYCMDRRDSVNSFHSSSSSDSLYESIDNLYDSVAQSEEHGSDSMSQSYNTCGSVAQSELCDSPFGTPSQSSQSNDLSATSLTQQLGGLDSIISSASASILLTTNSISSSTASSIMPIVASQVTETLGRSQVAVEKMIQSISSTAVSVDVAASKVVAGVGQAVSVGALRVPENWATASQPVMATAHSVPAGCSAITTAVSGPLEGVTQPAEEVLTASVAGGSGTGGPAFNEAV>BL;ML0418, ML.tab 517644:518276 forward MW:23558LLSVDEVLTTTRSVRKRLDFDKPVPRDVLMECLQLALQAPTGSNSQGWHWVFVEDAEKRKAIGDIYLVNARCYLSQPAPEY(SEQ ID NO:58)PEGDTRGERMRLVRDSATYLAEHMHEVPVLLIPCLLGRAEESPLGAVSYWASLFPAVWSFCLALRSRGLGTCWTSLHLLGDGEQRAAEVLGIPSDKYSQGGLFPIAYTKGTDFRPANRLPAENVTHWDIW>BL;ML0425, ML.tab 524416:524643 forward MW:8231VADRHPDTIKLEIDVAREQFAATVDSLAERANPRRLAGDLKARVVEFGRRPAVIAALVSCAVLTVIVVVRKVKNR(SEQ ID NO:59)>BL;ML0431, ML.tab 530916:531695 reverse MW:27233MNNPRRSEWLGPSLAGSGPIEPQVHQYPPLTDPAYAEQAPYAPAYGASLPPWTPKKPPQQLPRYWQQDQPPPTDIPPEGL(SEQ ID NO:60)TLPPPHEPKSPHWFLWVVAGASVVLVVGLVMALIIANGAIKTQTAVPPLPAITESSSATPTPTTKTSPTPTAGPAPSTTGSGTLTQTIGPSAMLDVVYSITGQGRAISVTYMDTGDVIQTEFNVVLPWSKQVSLSKSAVHPASVTIVNIGHDVTCSVTVAGVQIRQHTGVGLTICDAPR>BL;ML0451, ML.tab 551191:552240 reverse MW:37930MLTLLVLLVALATLAGGWGYQTANRLNRLHVRYDLSWQALDGALARRAVVARAVAIDAYSGTSPGRRLAALADAAECAPR(SEQ ID NO:61)HTRENAENELSAALAMVDPASLPTALIAELADAEARVLLARRFHNDAVRDTLALGEQRLVRTLRLRGTASVPTYFEIVERPHALTHGDHGVPNQRTSARVVLLDETGAVLLLCGSDPAITNGHAPRWWITVGGEVRPGERLAAAAARELAEETGLRVIPTHMVGPIWRRDAIFEFNGSVIDSEEFYLVYRTRRFEPSTVGWTELEHLCLHGSRWCDANDIAELVASGEQVYPRQLGELLPVANQLADASTGTARGTAAARNTYVLLSIC>BL;ML0486, ML.tab 589652:589996 forward MW:12749MESLVLLLLFLLIMGGFMFFASRRQRRSMQATIDLYNSLQPGDRVNTTSGLQATIIVVGDDTVDLEIAPGVVTTWMKLAI(SEQ ID NO:62)RDRILPDDAYMDEHEAEPGDFVYCDELEESDGSS>BL;ML0520, ML.tab 631179:631787 reverse MW:21872MNNWMLRGLVFAALMIVVRLMQGTMINVWQAQSVLISVVLLAVFIIAVVVWAARDGRADAIANPDPDRRRDLAMTWLLTG(SEQ ID NO:63)ILVGVLSDAVAWVISLLYNGIYTGGLVSELTTFSAFTALIVFLTGIIGVACGRWRVDRRSPPVPEHSRSGQNRADSNVFAAVCTDDDTPTGELSAAQTKEQTAAVATAESEAPTEIIYIIQRA>BL;ML0542, ML.tab 657980:658312 forward MW:11942VSIPQSNTSLSAVIAVDQFDPSSGGQGVYDTPLGITNPPIDELLDRVSSKYALVIYAAKRARQINDHYNQLGEGILEYVG(SEQ ID NO:64)PLVEPGLQEKPLSIANREIHADLLEHTEGE>BL;ML0561, ML.tab 678085:678555 forward MW:17254MTAQLDRDDWDVELRPYWTPLFAYAAAFLIAAAHITVGLLLRIKSSGVVFRTADQVAIGALGLVIASAVLLLTRPRLRVG(SEQ ID NO:65)AAGLLVRNIMFYRIIPWSHVVDVSFPLGSHWARIDLPDDEYIPLMAIQAVDKERAVEAMDAVRALLARYRAGPYGP>BL;ML0577, ML.tab 699950:700183 forward MW:8150MELALQITLVVTSILVVLLVLLHRAKGGGLSTLFGGGVQSSLSGSTVVEKNLDRLTLFVTGIWLVSIIGVALLTKYR(SEQ ID NO:66)>BL;ML0580, ML.tab 704001:704798 reverse MW:29039MSRVLTLVITPYSKANLKESIEAANGASHKYPNRIIIANRVNSYANKARLDAQLWVGADTGAGVVVSRTLAVYAHSVVIS(SEQ ID NO:67)ILLPDIPMVAWWPNIAPTMSGQDSLGKLAIQRITNATNSIDPLATIKSRLSDYTADDTHLAWDLITYWRALLTSAVNLPPHEPIDLALVSGMKTEPALDVLAGWLANRINRPLRRAVADLKVELIRNSETIVLSRPQTWVTSTLIRTVKPDALVPWGAQGSRGVPSRKSATTGSRQVLLQCLRRH>BL;ML0603, ML.tab 735002:736117 forward MW:39051VLVLWRCFRVANISALMVAVACLPDWLSGFLTGGLIAGSSARRATIYGVSNKFSSLHLVLGNEELLVERAVGEVLRSARQ(SEQ ID NO:68)RAGTQDVPVSRMRAGDVGTYELTELLSPSLFADERIVVLEAAAEAGKEAAALIVSAAADIPQGTVLVVVHSGGGRAKALANELQSLGATVHPCARITKLSERTDFVRKELRSLRVKVDEGAVTALLNAVGSDVRELASACSQLVADTAGDVDADAVQRYHSGKAEVKGFDIADKAVGGDVSGAVEALRWANMRGEPLVVLADALAEAVHTIGRVGPLSGDSYRLASRLGMPPWRVQKAQQQARRWSRDTVAAAMRVVAALNADVKGAAADAYYALESAVRKVAELAADGSR>BL;ML0630, ML.tab 763032:763358 forward MW:11114MAELLNTEDAKLVVLVRAAMARTEAGSGAVVRDFDGRTYAAAPVTLSTLELIGLQEEAAAAFSASSVVSGLEVGVLVAGS(SEQ ID NO:69)VDEPDIAMVRELASTAVVILIDRNGNRV>BL;ML0642, ML.tab 774506:775945 reverse MW:50246VAHSGSPVSTVDGVANPPFGFSSGNDSPNDESGRDKHGKNGPDSGSSGSDPLASFGMSGDFGMSDLGQIFTHLGQMFTNA(SEQ ID NO:70)GTAMTADKQLGPVNYELARRVASSSIGFVAPIPATTSSAIGDAVHLAETWLDGVTALPAGTTKAEGWTPDDWVNNTLETWKRLCDPMAQQISTVWAASLPEEAKSMASPLLSMMSQMGGMAFGSQLGQAFGQLSREVLTSTDIGLPLGPRGVAAIMPQAVESFADGLEQPRCEILTFLATREAAHHRLFSHVPWLASQLLGAVEAYAAGMKIDMNGIEELARDFNPASLSDPTAIEELLGQGVFEPQATPAQTQALERLEALLALIEGWVQVVVTAALGDRIPGAAALGETLRRRRASGGPAEQTFATLVGLELRPRKLREAAVLWERLTQAAGVDARDAVWQHPDLLPSGKDLDDPASFIDRIIGGDTSGIDEAIAKLDLDRGNSDGRTPGSGGPVDN>BL;ML0676, ML.tab 811815:812291 reverse MW:16889MFLRLMLSALKALQRLGAVMNSLARIDHWIWLFRCQPLTIRLLVATAALFTAATAFEVPAEADAIDDTFIKALNHAGVNF(SEQ ID NO:71)GEPRSAMTMGHYVCPILAKSGGNFAAAVQRIRGNSDMSPQMAETFAKIAISIYCPTMMANVASGNLPSLPPGPGIPGI>BL;ML0703, ML.tab 840882:842153 reverse MW:46028MVADLVPICLSLPAGDRYTVWAPRWRDGGDEWEAFLGKDDNLYACETVADLVAFVRTDSDNDLVDHPAWKDLTSVHAHKL(SEQ ID NO:72)DPSEDNQFDLVVVEELVAEKPTAESVTTLAATLAIVASIGSVCELPAVSKFFNGNPSLGAVSGGIEHFTGRAGQRRWNSIAEIIGRSWDDVLSAIDKVISTPRVNAAMSAKAADELAEEPVEPEVEPDDEDGADSATAQANNSDDTEDESRTAGDTVVLGSDKDFWLQVGIDPVRIMTGAGTFYTLRCYLDDHPIFLGRNGRISVFSSERALARYLADEHDHDLSYLSTYDDIRTAATDGSLAIDITDDNIYVLSGLSDDLADGPDAVDRDQLDLAVELLRDIGQYSEESAVDTALETNRPLGKLVAHVLSPSAVDKPVAPYSAAVREWEKLEQFVESRLRLE>BL;ML0730, ML.tab 871757:872011 reverse MW:9373VSAANDGSETNKLPTTQNPHIQITKGQPTDQELAALIVVLSSIGGASQVKQPEPTRWGLPVDKLRYPVFSWQRITLHEMT(SEQ ID NO:73)HMRR>BL;ML0733, ML.tab 874677:875195 forward MW:19913MRYPGNTLVAGEQVVLHRHPHWKRLIWPAVVLILATGLVSFGSGYVNSTHWAQVAKNVIYGVLWGVWLVIVGWLTLWPFL(SEQ ID NO:74)NWLTTHFVVTNRRVMFRQGTLTRSGVDIPLARINSVEFRDRLFERMFRTGTLIIESASQDPVEFYNIPRLRQMYALLYHEVFDTLGSEESPS>BL;ML0734, ML.tab 875150:875836 reverse MW:25515VSFPDATITRLPTVLQPYAQRYHELIKFAIVGGTTFIIDSAIFYTLKLTILEPKPVTAKVVAGIVAVIASYVLNREWSFR(SEQ ID NO:75)DRGGRERHNEALLFFAFSGIGVLLSNAPLWFSSYVLQLRAPTVSLTVENLADFLSAYIIGNLLQMAFRFWAFRRWVFPDAFARNPEKTLESALTAGGIAEVFEDAIDGVFEDFGDALLRAWRNRSRRLDLSPASQLGDSSEPRVSKTS>BL;ML0748, ML.tab 890981:891259 reverse MW:9830MVQGLLAKAATMVITGLTGVTAYEMLRKAVTKVPLHQIAVSALELGLRGSRKAEEAAESARLKLADVMAEARERIGKETT(SEQ ID NO:76)APAVSDIHQHDH>BL;ML0761, ML.tab 903525:904028 reverse MW:18775VSNSCSSSRHGQWSRRFSSRRAAKRGRDIRGPLLPPTVPGWRSRAERFDMAVLEAYEPIEQRWQGRVSELDVAVDEIPRI(SEQ ID NO:77)AARNPENVQWPPEVIADGPIALARLIPAGVDVRSNATRARIVLFRKPIERRAHDTVELGELLHDILVAQVAIYLDVEPSAIDPTMDD>BL;ML0762, ML.tab 904068:904565 forward MW:17248MRVSGASATFSHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATVREPHSWDLCVDHAARITAPRGWELVR(SEQ ID NO:78)HAGPLPSNPDEDDLVALADAVREGPGGEHGSYGNGARASLGGFADPQLQSAGAHATVPSGGLLAPSELRSGRRRGHLRVLPDPSD>BL;ML0764, ML.tab 906078:907175 forward MW:37464VNATRLIDLEDTKGLIAADRDGLLRAASSAGAQVRAIAAAAEEGALETLRAHDRPRTVIWVAGRGTAETAGAMLAATSGG(SEQ ID NO:79)ATTEPIVVASEAPPWVGPLDVLIVAGDDPGDPALVGAAATAVGRGARVVVVAPYEGPLRDATAGRVAVLEPRLRIPNEFGLCRYLAAGLAALQTVDPRLRLDLANLADELDSEALHNSVGYEVFTNPAKTLAASVSGHRVALAGDCAATLALARHGSSVLLRIAHQVTSATGLSDAVVAVRSSVDVADYAPTSVDVLFHDEEIDGSLPERLRVLALTLASERTVVAARVVGLDDVYLVAAEDVPDGPSGLAGLPVSGGADRAEQELANLAVRLEMAAVYLRLVRG>BL;ML0776, ML.tab 920258:920515 forward MW:8946VAGCGVFATRWSDACTAELSVAAGEPRVVSLCVDPLVPVVVLGRYVGARRQAILAMKEHGRRNLVALPTRQCVSRLGSCT(SEQ ID NO:80)LPGRG>BL;ML0806, ML.tab 955206:955727 forward MW:17808VDRVIALLSSGAIVGPCDYADVVTLPHKRAVFSRAPAAVRGAGLIVVVQGAVALVVAAALVVRGLTGADQRIVNGLGTAI(SEQ ID NO:81)WFVVVGVAVLAAGCALLVGKRWGRGLAVFTQLLLLPVAWYLVVGSHQSAFGFPMGIVALIALILLFSPPAVRWSAGAYQRSVASSANRKADSR>BL;ML0810, ML.tab 958882:960105 reverse MW:42979MVRPERRTKADTIAAMTITVVMAAMVSLIWWTSDAQATHSRPATIPAPNPTPAREVPTAFNQLWAAASPATTAPVVVGGA(SEQ ID NO:82)VITGDGHQIDGRNPVTGESRWSYARDSDLCGVSWVYHYAVAVYRDDRGCGQVSTIDGSTGRREAARSSYADPNVRLSSDGTAVLSAGDTRLELWRSDMVRMLAYGEIDARVKPPARGLHSGCTLESTAASSSAVAVLEACANQDDLQLVLLRPGKEDDEPQQHLVAEPRVRSGSGARVLTVSDTHTAVYLPGEAGTQPRVDVIDETGTTVASTLLTKPPSSSAVVSQAGNLVTWWTGDTLMVFNQSNLTLRYTIAAGETTAPVGPGVMMAGQLLVPVTGKIGVYDLFSGANNRYIPVRRPPSSSAVIPAVSGSTVFEQRGDTLVALG>BL;ML0813, ML.tab 962707:963294 reverse MW:20353MRLTETTSIRRTTTTSYSGHPIVDGRQVAALLGSVAALCAIATAVIINSGDNATTKAIVGAPTPRPVLTTPSIPLPATPS(SEQ ID NO:83)STPPLLLLFDTATATIPHKAAPPALHPRTVVYNVTGMKELLDLVTVVYTDARGYPKTEFNVVLPWTKAVVLNLGVKTQSVVATSFHSQLHCSIVNAEGQPVVASTNNAVIATCTR>BL;ML0814, ML.tab 963593:963841 forward MW:8652VEVKIGITDSPRELTFSSAQTPGEIEELVSAALREGLGLLVLTDERGRRFLIHGAKIAYVEIGVADARRVGFGIGAESAT(SEQ ID NO:84)NG>BL;MLO816, ML.tab 964701:965726 forward MW:37443VSSPGPVGSPGRVPVLREEWRAPLRAQREPLARGEGRVRVNRGRSRRWRKQTRLGRFVSVFGWRAYALPFLMALTAVVLY(SEQ ID NO:85)QTVTGTNAPEPAASEPITEPPVIGAVGTAIMDVPPRGLAAFDANLPAGTLPDGGAFTEAGDKTWHVVPGTMPQISQSATKVFKYSIEIENGLDPTMFGGDGAFAQMVDQTLANPKGWTHNPQFAFTRIDTGMPDFRISLVSPLTIRAGCGYEFRLETSCYNPSFGPDRQARVLINEARWLRGALPFEGDVGSYRQYVINHEVGHAIGYVRHEPCDKQGGLAPVMMQQTFSTSNNDGAKFDPEWVKPDGKTCRFNPWPYPIA>BL;ML0818, ML.tab 967172:968065 forward MW:32374LRSPRSGSTRLTRVTVEPPPEHVLSAFGLTGVQPVPLGASWEGGWRCGEVVLSMVADNARAAWSARVRETLFVDGIRLAR(SEQ ID NO:86)PVRSTDGRYVVSCWRANTFVAGTPEARHDEVVSAAVRLHEATGKLERPRFLTQGPTARWADVDIFIAADRAAWEGRPLQSVPSGVWAAPMTTDGQRSVDLINQLAGLRKPTRSPNQLVHGDLYGTMLFVGTAAPGITDITPYWRPASWAAGVVVVDALSWGEADDGLIERWNALPEWPQMLLRALMFRLAVHALHPRSTAEAFPGLARTAALVRLVL>BL;ML0834, ML.tab 990401:990703 reverse MW:10909VRQDGASRTVVGGTALIRYVIVLGLGYVLGAKAGRRRYEQIVGIYRTLTGSPMAKSMIAEGRRKVANRISPDEGFVTLAE(SEQ ID NO:87)IDNQTTVIERSAEWRENGGN>BL;ML0857, ML.tab 1019690:1020442 reverse MW:26808MAKPRNAAAHKAARAEAKAARKAASRQRRLQLWQAFTIQRTEDKRLIPYMIAAFSLMVSASVTAGVLVGGLTMITLILLG(SEQ ID NO:88)VVLGALVAFIIFGRRTQQSVYHKAEGQTGGAAWALDNLRGKWRVSPGVAANGHFDAVHRVIGRPGVIFVAEGSAARVKPLLAQEKKRTARLVGDVPIYDI IVGNGDGEVALVKLERHLARLPANISVKQVDILESRLAALGSRAGASLIPKCPLPNACKNRGVQRTVRRK>BL;ML0869, ML.tab 1033201:1033575 reverse MW:13804MMAGEEAYLPPRDQGPVRRYIRDLVDARRNALGLFTPSALVLLFITFGVPQLQLYMSPAMLVLLSVMGIDGIILGRKISK(SEQ ID NO:89)LVDVKFPSNTESHWRLGLYAAGRASQMRRLRVPRPQVEHGSSVG>BL;ML0872, ML.tab 1035615:1036130 reverse MW:19047MLPPAVSYPRRRSKRLI ISVLVAIALVAAMTAVI IYGVRTNGSKTGGTFSEVTAKTAI EDYLKALEQSNINTIARNALCG(SEQ ID NO:90)MYDSVRDQRPDQALAQLSSDAFRKQFSQVELTS IDQIVYWSPYQAQVLFTMRTSPATGGPKRRQIQGIAQLLYRRNQVLVCSYMLRTADSH>BL;ML0876, ML.tab 1040896:1041315 forward MW:14969MHI EARLFEFVAVFFVIMAVLYGVLTSMFATGGVDWVGTTALALTGGLALIVATFFRFVARRLDI RPEDYEGAEISDGAG(SEQ ID NO:91)ELGFFSPHSWWPVLVALSGSVAAVGIALWLPWLIVAGVVFVLASAAGLVFEYYVGPEKH>BL;ML0878, ML.tab 1042383:1043021 forward MW:22756MMNRYSPY2RGSDTIASDVIDRILVGVCAAVWLVLIGVSVAAAVALEDLGRGFHKIASDPHTTWVLYGIIVVSVLIIAGA(SEQ ID NO:92)VPVLLWARRVARVEPPIRPAGVPERGGVRQLVSAGRSTARIEVERVCAEERVQSVAQPGEWFDAAVDRIWLRGTVGLTGTMGAALVAVAASTYLMAVGRDGASWVGYVLAGIVTAVMPVIEWIYVRQLRRVG>BL;ML0888, ML.tab 1055260:1055667 forward MW:15125MNSTNS IQIADETYVAADRALIGAAVADRSSWHRWWPDLRLQVVEDRAEKGIRWAVTGTLTGTMEIWLEPLTEELDGVVL(SEQ ID NO:93)HYFLHAEPAGVAAWQLAKMNMAKVTHRRRVAGKAMAFEVKKTLERSRSIGVSPVI>BL;ML0889, ML.tab 1055786:1056220 forward MW:16483VADKTTQTFYIDANPGEVMKTIADIESYPQWISEYKEVEVLEVDDEDFPKRARMLMDAKIFKDTLIMSYDWTADHQSVSW(SEQ ID NO:94)ILESSSLLKSLEGSYRLVPKGSTTEVTYELAVDFAIPMIGMLKRKAEHRLIDGALKDLKKRVEG>BL;ML0891, ML.tab 1057485:1057877 forward MW:13550MSGGYADIGPELRKLAQMTLDGIGPAVRSAAALVAGARGTGKCQQAWCPVCALTALVIGEQHPLLTVIADHSVALLDVIR(SEQ ID NO:95)AIVDDIDQSNKIPPDSPHGGGLDTETPAQTNTSNGTVRRRYQPIPVSVED>BL;ML0895, ML.tab 1061008:1061523 forward MW:19954MRYPYDTEFIEDGRTIELVSIGVVAEDGREYYAVSNEFDPERAGNWVRVNVLSKLPPLASQLWRSRRQIRLDLEEFFGVD(SEQ ID NO:96)GSEPTEPIELWAWVGAYDHVALCQLWGPMPDLPEALPRFTREIRQLWEDRGCPRMPPRPRDLHDALVDARDQLRRFRIIMSADDVGSLPTH>BL;ML0898, ML.tab 1064343:1064747 forward MW:14698VGSIPAGDDVLDPDEPTYDLTQVAELLGIPVSRVHRKLCEGYLVAVRRGDSLVVPQIFFTNSGAVVKSLPGLLTILHDGS(SEQ ID NO:97)FHETEIVRWLFTPDPSLTLTRDGSRDVVSNARPVDALHTHQAREVVRRAQAMAY>BL;ML0902, ML.tab 1068511:1069230 reverse MW:25542VRARFPPLFTRGCTAQRRRTLTIALLLVAMVPLATGCLRVTASITISPDNLVSGKIIAAAKPKNKNDAGPQLNDNLPFSQ(SEQ ID NO:98)KIAVSNYNSDGYVGSQAVFSDLTFAELPQLANMNSSTTDVTLSLRRNGNLVILESRADLTSVTDPDADVELTVAFPGVVTSTNGDRIETKVVAWKLKPGVVSTMSARARYTDPDTRSFTGAAVWLGIASFSAASVVVLLAWNERKSSARLQIPRDSSSS>BL;ML0903, ML.tab 1069302:1069934 reverse MW:23864LAIFLINLSPNEMERRLNEALEVYVDAMRYPRNTENLRAGIWLEHIRRPGWQAVAAVEVRIEVADVADMADGPAHPAPSA(SEQ ID NO:99)DELNNAPLRGVAYGYPGAPGQWWQQQVVQGLQRSGLSTLEIARLMNSYFELTELHIHPHTQGRGIGEALTRRLLAHRRENNVLLSTPETNGETNRAWRLYRRLGFMDIIRRHYFAGDPRAFAILGRTLPL>BL;ML0904, ML.tab 1070251:1070655 forward MW:14614MPLSDHEQRMLDQIESALYAEDPKFVSSVRGGGLRVPTARRRTQGAALFVIGLGMLVCGVAFKATMIGSFPILSVFGFVV(SEQ ID NO:100)MFGGVLFAITGSRLSGREDHPGLAPGTSRQRRSKGAAGSFTSRMEDRFRRRFDE>BL;ML0907, ML.tab 1072702:1073835 forward MW:39543MKAQRDTPIRRGDSGRPGGRDGAARSGKRTANEAGSRRLRTHAGKISASAREVGVPKSGPRTSPMSRPVERPARPRNTTQ(SEQ ID NO:101)AKARAKARKAKAPKVVRPRLGECLIARLALIDLRPRTLVNKVPFVVLVISSLGVGLGLTLWLSTDSAERSYQLGDAREQARMLQQQKEALERDVREAESAPALAETARKQGMIPTRDTAHLVQGPGGNWVVVGTPKPADGVPPPPLNTKLPDAGPPSLKPPEIPLEVPVRVVPGPGGPPPPARSGPQMWLRVPDGATTLGGQHLPQELPQLPGMLNGPAAAQVQVPGFMPTPGLPIPGSTMRVPVPAPAPTEVPVRLQPGLVSPAVTSPVISTSPVPTPVNSEQFGPVTATAPGTSR>BL;ML0920, ML.tab 1090054:1090686 forward MW:24016MSTLHKVKAYFGMAPMEDYDDEYYDDRSPTHGYGRSRFEEGYGRYEGRDYSDLRGDPTGYLPLGYRGGYGDEHRFRPREF(SEQ ID NO:102)DRPDLSRPRLGSWLRNSTRGALAMDPRRMAMLFDEGSPLSKITTLRPKDYSEARTIGERFRDGTPVIIDLVSMDNADAKRLVDFAAGLAFALRGSFDKVATKVFLLSPADVDVSPEERRRIAETGFYAYQ>BL;ML0921, ML.tab 1090811:1091101 forward MW:10769LALFYQILGLALFVFWLLLIARVVVEFIRSFSRDWRPNGVTVVILETIMSITDPPVKLLRRLIPQLTIGAVRFDLSIMVL(SEQ ID NO:103)LLVAFIGMQLALSAAA>BL;ML0923, ML.tab 1092221:1092613 forward MW:13650MLIIALVLALIGLVVLVFAVATSNLLMAWVCIGASVLGVLLLIVDAVREHQCIDAANNEDKEDTDQDDGAVYVDYLDEVP(SEQ ID NO:104)AGTSTEAPDAGSQEGDTNSGELSGYWGRLTIDTGEQSAVAADDHDNDRAT>BL;ML0984, ML.tab 1151024:1151473 reverse MW:16902VAIAELTEASVQGVENIRTVEVFLAALQDAGLRNRIRDVGRQPRVYQNVGLPTIHGRSKTITLWRKMADCIGFEIKIHRI(SEQ ID NO:105)AAVAIAVLCERADAVIVGPLWMQFWVCGTFEVQNKRIMLWPNYFDLFDLFKATMRSLVALRIPSLNAAF>BL;ML0986, ML.tab 1152702:1152905 forward MW:7514LAGVRLTEFHERVVLRFGAAYGASVLVDHVLTGFDGRTVAQAIEDGVELRDVWRALCVDFDVPRDQW(SEQ ID NO:106)>BL;ML0990, ML.tab 1157281:1157910 forward MW:22743MNEEPNIVDFPDSNPLQAALEAEELRVVREIDSGAKIFVLIAVLVFMLLGSFILPHTGQVRGWDVLFDSHGAGAAAVALP(SEQ ID NO:107)LRIFAWLSLVFGVGFSMLALMTRRWVLAWIALAGAANASIVGLLAVWSRQTVAVGQPGPGVGLIVAWITLILLTFHWARVVWSSTIVQLATEEQRRRVVAQQQSKTLLDGLYSAGNRDTRTRSDPQVGS>BL;ML0994, ML.tab 1162620:1163318 forward MW:23576VLSAIAIVPSAPVLVPELTGAAAAEVADLRSAVLAVAACLPPCWIVVGTGRADDVVGPGGCLGTFAGFGADVRVRLSPQV(SEQ ID NO:108)GGEAELLVDFPVCALIAAWVRGQSQLDASAQVRVYCGDHDPDMALACGRQLRVEIEQAPDPIGVLVVADGATTLTSSSPGGYDPSAADAELVLDDALASGDVAALTRLSCQISGRVAFQVLAGLVEPGPRLAKELYRGAPYGVGYFVGVWQP>BL;ML1001, ML.tab 1172599:1172874 reverse MW:10150MPDPVVMPVPCPTSGFTQYSPYYRGAQITLLQQTILAKLNQKYYNNRYRVDVEMVLSHTGVEADSAASHTILGLSSSILP(SEQ ID NO:109)PYGCRTKKQRS>BL;ML1004, ML.tab 1176008:1176502 forward MW:17108MLVIYAVPPLIGNVRHPMSRPILGPRCGSGESAGSRRPAPSRSASAPMRYSGASVANLVAPHRGRTVSLAKTIGLALLAG(SEQ ID NO:110)MITLWLGLMADVSQVIDGDATGFVTHVPNRLAVVRVEAGESLQDVAARVAPDAPVRQVSERIRELNVLDSSMLVAGQTLIAPVG>BL;ML1009, ML.tab 1179519:1180499 reverse MW:36153MSHSHYQDPDDEQHYQPGQPGMYVLEFPAPQLLASDGRGPVLIHALEGFSDAGHAIRLAATHLKAALNTELVASFAIDEL(SEQ ID NO:111)LDYRSRRPLMTFKTDHFTHYDDPELSLYALRDSVGTPFLLLAGMEPDLKWERFITAVRLLAERLGVRQTISLGTVPMAVPHTRPITLTAHSNNGELIADFTPWITEIQVPGSASNLLEYRMGQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGALQLPLSALAEAAAEIRAKIDEQVQASTEVAQVVAALERQYDAFIDAQENRSLLRRDEDLPSGDELGAEFERFLAQQAEKKRDDDLT>BL;ML1015, ML.tab 1185456:1185875 reverse MW:15761VGKLSTAPNRGTTDTFDDNSRPVLITTAAPSYEEERRTRVRKYMTLMAFRIPALMLTTVAYSAWHNGLISLLIVAASVPL(SEQ ID NO:112)PWMAVLIANDRPLRRTEEPRRFDSRRRRTPLLLTTEQPAFKSLRRPPPKPTSLATDSRS>BL;ML1016, ML.tab 1185903:1186226 forward MW:11848VSGFCFSVGRVRRHNVTMLGRHNVTMLGMQTQTIEHTYTDEHVDDGTGSDTPKYFHYVKKDKIVESAVMGSNVVALCGEV(SEQ ID NO:113)FPVTRAAKPGSPVCSDCKRVYDMLKKG>BL;ML1O2S, ML.tab 1192722:1193372 reverse MW:22931VVTQITEGTAFDKHGRPFRRRNARPAIFVVVFLVIVAGVSWTIALTRPAKVREPEVCNPPTQSTGSVPTQLGKQVPRTEM(SEQ ID NO:114)TDVTPAKLSDTKVHVLNASGRDGQAADIAGALRDLGFAQPTAANDPMYADTLLNCQGQLRFGTAGQATVAAVWLVAPCTELLHDNRTDDSVDLALGTDFTALAHNDDIDAVLASLRPGATEPSDPALLQKIHANSC>BL;ML1026, ML.tab 1193568:1193870 forward MW:10969MPTDYDAPRRTETDNVPEDSLEELKARRNEAASAVVDVDESESAESFELPGADLSGEELSVRVIPKQADEFTCSSCFLVQ(SEQ ID NO:115)NRSRLASEKNGVMICTDCTA>BL;ML1027, ML.tab 1193875:1194348 reverse MW:17131VSGTPVAPHNVRYRERLWVPWWWWPLAFALASLIAFEVNLSGATLPSWLPFAVAAGTLLWLGRVEIQVIADAPLGGSVEL(SEQ ID NO:116)WAGNAMLPITAIAQSAAISRSAKSAALGRQLDPAAYVLHRAWVGPMILVVLDDPDDPTPYWLVSCRHPERVLSALRS>BL;ML1029, ML.tab 1194835:1195656 forward MW:29018VTADDDAERSDEDGAAVMATFGKRTGKDGASRTLTEPADPEATELPAASEPDSEEVDELEGPFDIDDFEDPAVAVLARLD(SEQ ID NO:117)LGSVLIPLPEGSQLQVELTDVGVPNAVWVVTANGRFTITAYAAPKTGGLWREVAGELADSLRNDSAKVTVKDGPWGREVVGTNTGVVRFIGVDGYRWMIRCVVNGPLETIDVLSEEARAALADTVVRRGDTPLPVRTPLPVQLPEQMAEQLREAAVAQQSAQHADARQQSRELAPRRGAAGSAMQQLHNTTGG>BL;ML1030, ML.tab 1195701:1196399 reverse MW:23579VAVDLRNVTTVLLPGTGSDDDYVYRAFSGPLHRVGAAVLTPPPQPNRLIDGYLSALDDAARAGPIGVGGVSIGAAVAAAW(SEQ ID NO:118)ALAPERAVAVLAALPAWNGAPESAPAALAARYSASHLRRDGLAATTLQMQASSPPWLADELARSWCGQWPLLPDANEEAAAYIAPSCAELARLATPLGVAAAVDDPIHPLQVGVDWVTAAPHAALQTVTLNQIGTNAAALGTACLAALALT>BL;ML1037, ML.tab 1200579:1201133 reverse MW:19903VSHEYWSTAVSCNPGIHHIVRLDVAJ4TPRTTQTYRHSMLAEDIAEDFPAISINSSALDAARMLAEHGLPGLLVTDMSDKP(SEQ ID NO:119)YAVLPASQVVRFIVPRYIQDDPSLAGVLNESTADQAAEKLSSKKVRDVLPDHLVNVSPVNADDTIIEVAATMSRQRSPLLAVVKGGQLLGVITASRLLRAALKH>BL;ML1O41, ML.tab 1206538:1207128 reverse MW:21153VAPVTAAEPTPFREAVAAMNAFTVRPEIELGPIRPPQRLAPYSYALGAQVKHPELDIVPEQSEDNAFGRLILLYDPDGSD(SEQ ID NO:120)AWDGTIRLVAYIQSDLDSREAIDPLLPEVAWSWLIEALESRIDHVTALGGTVTATTSVRYGDISGPPRAHQLELRASWTATTPEVGVNVKAFCEVLENAAGLPPAGVIDLGSRSRS>BL;ML1053, ML.tab 1220374:1220673 forward MW:10117MPLFLNAEPQALTAAANTLEGLSAATVASNAAAAQLTTEIAPPAADDVSILLAHFFSGHGRQYQAHASQGATNHQDLIQS(SEQ ID NO:121)LLTSSSAYAGTETANIHDSL>BL;ML1055, ML.tab 1221433:1221735 forward MW:11298MTAAHFMTDPQAMRDMARKFDMHAQNVRDESHKNFMSSMDIAGAGWSGTAQLTSHDTMGQINQAFRHIVTLLQDVRDQLG(SEQ ID NO:122)TAADRYEHQEENSRKILSGS>BL;ML1056, ML.tab 1221787:1222074 forward MW:10260MGNINYQFGEIDAHGAAIRAQAAALETTHQAILATVRDAAEFWGGQGSTAHEMFIADLGRNFQMIYEQANSHGQKVQRAS(SEQ ID NO:123)SSMADTDRSVSSAWS>BL;ML1065, ML.tab 1230644:1230988 forward MW:11933VALVLLYLVVLVLVAIVLFGAASLLFGRGERLPPLPRGTTATVLPAHGVTGADVDAVKFTQVLRGYKPSEVDWVLDRLGR(SEQ ID NO:124)ELEALRGQLAAIADAEADADVSNVPSGDDGQDVT>BL;ML1067, ML.tab 1231777:1232004 forward MW:7998MLGAEWVREGGPARVWREHTMAAMKPRTGDGPLEATKEGRGIVMRVPLEGGGRLVVELTPDEAAALSDELKGVTS(SEQ ID NO:125)>BL;ML1077, ML.tab 1240278:1240697 forward MW:15219VMADRGQVFRRVFSWLPAQFASQNDAPVGAPRRFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHISLCAQCAAEVDDQSR(SEQ ID NO:126)TRAALRDSHPIRIPSTLFGLLTAIPRCSPDYTSPVSEPFSEGSVSDRFVDGVAREQGKR>BL;ML1079, ML.tab 1242373:1242735 forward MW:13348VFANIGWGEMLVLVVVGLVVLGPERFPGAIRWTLGALRQTRDYLSGVTNQLREDIGPEFDDLRGQFGELQKLRGMTPRAA(SEQ ID NO:127)LTKHLLDGDDSLFTGNFDRPAAAKQQDRDNHQTPFDTDAT>BL;ML1093, ML.tab 1257729:1258586 forward MW:29736MRVGRLAALLLAGVGVFVAGGCATDRGDRHPELVVGSKPDSESTLLAAIYVAALRSYGFGARGETGADPMAMLDSGGFTV(SEQ ID NO:128)VPGFTGKVLQILQPRAAVLSAARVYRANVSALPEGIAAGDYTTAAEDKPTLVVTPDTAKAWSGSOLSLVLSHCNELVVGIVAGTHTPSAVGSCRLPAAREFPDYPTMFAALRAGQLTAGWTTTANPDLPADLIVLTDGKATLIQAENVVPLYRRNVLTDRQMLAINEVAGVLDTAALIEMRRQVIRGADSQAIADGWLAEHPMGR>BL;ML1096, ML.tab 1263932:1264606 reverse MW:23505LAQVIERSVWIQGPAAEAYVARLRRTHPSASPTEIVAKLEKHYLAALTASGAVVGSVATLPGIGTLAAVSANAGETVFFL(SEQ ID NO:129)EATAVFVLTIASVYGIPANHRERRRALVLAVLAGDDTRLTIGELIGPGRTNGGWLLEGMASLPLSTWSQLHTRMLRYAAKRCTVRRGALMFGKILPIGIGAAVGGAGNRVVGKKIISNTRNAFGTAPSRWPATLILLPTVHNAG>BL;ML1098, ML.tab 1266648:1270106 reverse MW:126012VFVTDETIVYSASDLAAASRCEYALLRDFDARLGRGPVVATTEDELFARTSALGADHEQRHLDQLRHEFGDAVAVIGRPA(SEQ ID NO:130)YTYAGFAAAAEATQRAIANRAPAVYQAANFDGRFVGFIDFLVRDGEQYRVVDTKLARSPKVTALLQLAAYADALAHSGVPVAPEAELRLGDGMVVSYRICDLIPVYRSQRSLLQRLLDRHYTAGTAVRWQDDEVRSCFRCPQCTEQLRATDDLLLIAGMRISQRSKLLNVGITTIAELADHSGPVPDLSSSALSELTAQAKLQVQQRNTGTPQFEIVDPQPLALLPDPDPGDLFFDFEGDPLWTVDGQEWGLEYLFGVLDSEISGTFRPLWAHNRVEERKALTEFLKMVTKRRKQRPHMHVYHYAPYEKTALLRLAGRYGVCEDEVDELLRSGTLVDLYPLVSKSIRVGAESFSLKALEPLYMGKQLRSGDVTTATDSITCYGRYCELLSAGNFDEAATVLKEIEDYNHYDCRSTRELRNWLLLQAYEAGVVPVGAQPVPEGNTVKDDDELSAILSALSGFTGDVAVGDRTPEQTAIALVAAARGYHRREDKPFWWGHFDRLNFPVGEWADNTDVFVADDASIIIDWHTPPRARKPQRRVRLRGRLARGNLGSAVFALYDPPAPLANDVHPGRRAAGRAEVVEADDLSIPTEVVIVERVGNDGNTFHQLPFALTPGPPIATTALRDSIESTATTLAASLPQLPRTALIDILLRRIPRTHSGATLPRGTDTVADITAAVLDLDSSYLAVHGPPGTGKTHTAAHVITQLVSNHSWRIGVVAQSHAAVENLLDGVITAGLDARQVAKKRHDRSAPPWQEIDGNDYPTFIADPMGCVIGGTAWDFANRNRVPPGSLDLLVIDEAGQFCLANTIAVAPAAANLMLLGDPQQLPQVSQGTHPEPVNTSALDWLVEGQRTLPNERGYFLDRSYRMHPAICAAVSTLSYEGKLHAHTEYTAARRLNEYQPGVHVLAVHHQGNSTESPEEAGAITAEIERLLGTPWTDEHGTRPLDVSDILVLAPYNAQVALVRQQLMSAGFSGVRVGTVDKFQGGQAPVVFISMTSSSVEVVPRGISFLLNRNRLNVAVSRAQYAAVIVRSETLTEYLPATPVGLIDLGAFLTLTTFNGTGRLESRLDKP>BL;ML1099, ML.tab 1270157:1270765 reverse MW:20773VRDVWSLPCRKSLLGVAAVVLVSGTLTGCSSGDSTVAKTPVPPSTTTGTISTIISSAPSPPFATAAPPTSNTPPDDPCAV(SEQ ID NO:131)NLASPTIARVVSELPRDPRSAQPWNPEPLAGNYNECAQLSAVIIKANTNAVNPTTRAVLFHLGRFIPQGVPDTYGFNGIDPAQTTGDTVALTYPSSIDGLATAVRFHWNGNAVELISNIAGG>BL;ML1105, ML.tab 1279262:1279951 forward MW:24905VLSGGAGSIPELNAQISVCRACPRLVDWREEVAVVKRRAFADQPYWGRPVPGWGSEQPRLLIVGLAPAAhGANRTGRMFT(SEQ ID NO:132)GDRSGDQLYAALHRAGLVNLPISMDAADGLQANQIRITAPVRCAPPGNAPTQAEWVTCSPWLEAEWRLVSEYVRAIVALGGFAWQIVLRLPGVSANRKPRFSHGVVAQLYAGVRLLGCYHPSQQNMFTGRLTPANLDDIFRDAKKLAGI>BL;ML1115, ML.tab 1291563:1292129 forward MW:19913VRCDVRALALAARGLIELMIVIPMVAGCSNAGSNKSVGTISSTPGNTEGHHGPMFPRCGGISDQTMSQLTKVTGLTNTAR(SEQ ID NO:133)NSVGCQWLAGGGIVGPHFSFSWYRGSPIGRERKTEELSRASVDDININGHSGFIAVGNEPSLGDSLCEVGIQFQDDFIEWSVSFSQKPFPSPCGIAKELTRQSIANSK>BL;ML1116, ML.tab 1292138:1292701 forward MW:19839MRLSVRGRRSVFAGVAVLVSAALVVTGCSRSIGGTAVKAGSHDVPRNNNSQQQYPNLLKECEVLTTDILAKTVGADSLDI(SEQ ID NO:134)QSTFVGAICRWQAANPASLIDITRFWFEQGSLANERKVADFLKYKVENRSIAGVDSIVMRPDDPNGACGVASDAAGVVGWWINPQASGIDACGQAIKLMELTLATNS>BL;ML1117, ML.tab 1292709:1293194 reverse MW:17617MRHAKSSYPRGFPDHIADHDRRLAPRGVREASLAGGWLRTNVPAIEKVLCSTAMRARETLTHSGIEAPVRYTERLYRADP(SEQ ID NO:135)DTVIKEIKAISDEVTTSLIVSHEPTISAVALALTGSGTNNDAAQRISTKFPTSGIAVLNVAGRWQHLELESAELVAFHVPR>BL;ML1119, ML.tab 1294763:1295914 forward MW:41281MRFLHTADWQLGMTRHFLAGDAQPRYSAARRDAVAGLGALAAEVGAEFVVVAGDVFEHNQLAPQVVSQSLEAIRAIGIPV(SEQ ID NO:136)YLLPGNHDPLDASSVYTSALFTAECFDNINVLDRAGVHQVRPGLEIVAAPWRSKAPTTDLVAEMLGGLTADIVTRVLVAHGSIDVFDPDRDKPSLIRLAGIDDALAGGAVHYVALGDRHSLTQVGSSGRVWYSGSPEVTNFDDIESNSGNVLVVEIDENDPRRPVTVTARHVGHWRFFTLHWQVDNGRDIADLDMNLDQMMDKDRSVVRLALTGSLTITDRAVLDACLDRYARLFAWLGLWERRSDLAVIPADGEFTDIGIGGFAAAAVDELVATAREGDTESAIDAQAALALLLRLADRGVA>BL;ML1120, ML.tab 1295911:1298532 forward MW:94011MKLHRLALTNYRGTARREIEFPDRGVILVCGANEIGKSSMIEALDLLLEFRDRSTKKEVKQVKPANADIGSEVCAEISSG(SEQ ID NO:137)TYRFVYRKRFNKKCETALTVLAPHREQLTGDEAHERVRAMLAETVDNDLWHAQRVLQAASTAAVDLSGCDALSRALDLAAGDHAELSGTEPLLIERIEAEYRRYFTSTGRPTGEWAVAISRLSDAETAVGECAAAVAEVDDRVRRHAVLTERVAGLAQQRFAAGPRLAVAQAAADKVAVLTRQAREAELVAAAATATNAAAVAAHTSRLRLLAEIDTRAVVLAATQDEAQEAVDAVSTMRADAEASDAAVEESTEALMAAQQRADIARSTVDQLVDRQEADRLSTRLAKIDAIQGERDLICAELSAVTLTEQLLQRIENAAAIVDRTGEQLKLISAAVEFTATADIEISVGQQRVSLEAGQSWSTTATGPTEVEVLGVLTACVIPGATALDAQSKYVAAQEELSVALADGGVVDLAAARCANQRRRELQSSLDQLSAALAGVCGDDQIDQLRARLEQLRDGYPGEPDLLAVDIGSARAELEAAETVRAAVDFEREVCRRTAAAANCRLVETSARANFLLKKAETQRAELDQDIDQLAQQRASVSDEDLASAAEAGLRAVQIAEQRVAKLTEELVAAEPEAVTAELVAATAAAESLRDQHEDAAGALREISIELSVFGTEGRQGKLDTAEAEREHAISQNTQIGRRARAAQLLRSVMARHRDTTRLRYVEPYRTELQRLGRPVFGPTFEVDIDSDLRIRSRTLDGITVPFESLSGGAKEQLGILARFAGATLVAKEDNVPVVVDDALGFTDPDRLAKMGEMFDTVGAHGQVIVLTCSPDRYDGFTGAHRIDLNV>BL;ML1138, ML.tab 1329962:1330423 forward MW:16168VTTPAQDAPLVLPAVAFRPVRLFIINIVLTGLANLAAGLSGHLMVGVFFGIGLLLGLLNALLVRCSVESITAQGHPLKRS(SEQ ID NO:138)MALNSASRLAIITVFGLIIAYAFPLAGLGVVFGLALFQVLLVLSTMLPVWRKFRFGEADGGVLKGSEGEEQQR>BL;ML1147, ML.tab 1337937:1338380 forward MW:16518MSAPMVGMVVLVVTLGAAVLALSYRLWKLRQGGTAGIMRDIPAVGGHGWRHGVIRYRGEAAAFYRLSSLRLWPDRRLSRR(SEQ ID NO:139)GVEIVARRGPRGDEFDIMTDKIVVLELRDTTQDRRSGYEIALDQGALAAFLSWLESRPSPRTRRRSV>BL;ML1159, ML.tab 1354280:1355185 forward MW:31923MAGAVDLSGLKQRARQKASTSDPASRATLGARGTGSSENTSVIEITEANFEDEVLVRSHEVPILVLVWSPRSDACVKLLE(SEQ ID NO:140)TLSGLAVADSGTWLLATVNVDAVPRVAQIFGVDAVPTVVALAAGQPLSSFQGMQSVDQLRRWLDSLLSVTAGKLRGPTRSEDSAEIDPAIAQARQQLEAGDFLTAKQSYHAILDADPASVEAKAAIRQIDFLTRATAQHPDAVAVADAAPGDIAAAFAAADVQILNQDVTAAFERLIVLVRSTSGDERSSVRTRLIELFELFDPDDPNVIVGRRNLANALY>BL;ML1166, ML.tab 1364964:1365551 forward MW:21484VRKWKRVETANGPRFRSVVAPHEVALLKHLVGALLGLLNERESSSPLDELEVITGIKAGNAQRPEDPTLRRLLPDFYTPD(SEQ ID NO:141)DKDQLDPAALDAVDSLNAALRSLHEPEIVDAKRSAAQQLLDTLPESDGRLELTEASANAWIAAVNDLRLALGVILEIDRPAPERVPAGHPLSVIIFDVYQWLTVLQEYLVLALMAT>BL;ML1176, ML.tab 1372485:1372844 reverse MW:13792MTRPETPQAPDFDFEKSRTALLGYRIMAWTTGIWLIALCYEIVSNLVFHHEIRWIEVVHGWVYFVYVLTAFNLAIKVRWP(SEQ ID NO:142)IGKTVGVLLAGTVPLLGIIVEHFQTKflVKTRFGLRRSRT>BL;ML1177, ML.tab 1372841:1373221 reverse MW:14447VSTTRRRRPALVALVTIAACGCLALGWWQWTRFQSASGTFQNLGYALQWPLFAGFCLYTYHNFVRYEESPPQPRHMNCIA(SEQ ID NO:143)EIPPELLPARPKPEQQPPDDPALRKYNTYLAELAKQDAENHNRTTT>BL;ML1180, ML.tab 1380300:1380587 reverse MW:10260MGNINYQFGEIDAHGAAIRAQAAALETTHQAILATVRDAAEFWGGQGSTAHEMFIADLGRNFQMIYEQANSHGQKVQRAS(SEQ ID NO:144)SSMADTDRSVSSAWS>BL;ML1181, ML.tab 1380639:1380941 reverse MW:11298MTAAHFMTDPQAMRDMARKFDMHAQNVRDESHKMFMSSMDIAGAGWSGTAQLTSHDTMGQINQAFRHIVTLLQDVRDQLG(SEQ ID NO:145)TAADRYEHQEENSRKILSGS>BL;ML1182, ML.tab 1381004:1382269 reverse MW:43120MFDFAALSPETNSTRMYLGPGSSPILTAAAAWVVLAKELTAAAQGLQSAVEALLTTFEGESAAALAERVTPYEKWLTQNA(SEQ ID NO:146)ASAELTATQLTVAANAYETAFTMTVPPLMVFVNRAQACLLIMSNIFGQNSTAIAEKEAEYTEMWIQDAAAMTSYQASVLEAVGATKAFTAPPLGVNEVGLAQEVVEEVVEEVVEEVVEEVVEAEQAISQAALDQAVNEGMEATVVPQVDQQVNVDVATPQTAVPDSSSAAAPQLWGGFAQHLSPINDTLSMINNHAGMANAGLSLVNGMGSANKSLAPTTTKAAESAFKANGSAVQSTGRGLLGSSSGGHVTAQLGRAASIGSLRVPQTWTTASQPVTAATRALSPARVAVATESESAPLLGGGLPMAPMVPGGGSGTGGVNTALRLQPRAFVMPRNPAAG>BL;ML1183, ML.tab 1382326:1382625 reverse MW:10117MPLFLNAEPQALTAAANTLEGLSAATVASNAAAAQLTTEIAPPAADDVSILLARFFSCHGRQYQAHASQCATNHQDLIQS(SEQ ID NO:147)LLTSSSAYAGTETANHDSL>BL;ML1190, ML.tab 1395291:1396010 forward MW:25342MSVSRRDVLKFATVTPGLLGLGVAAAALCAVPASTAGSLGTLLDYAAGVIPASQIRATGAVGAIRYVSDPRGTWAVGKPI(SEQ ID NO:148)QVTEARDLINNGLKIVSCYQYGKGNTADWLGGATAGLRHAQRGVQLHTAAGGPVSAPIYASIDSNPTYEQYKQQVAPYLRSWESVIIGHQRTGVYANSRTIAWALQDGLASYFWQHNWGSPKGYTHPAANLHQVEIDRRTVGGVGVDVNTILKPQFGQWA>BL;ML1221, ML.tab 1443565:1443807 forward MW:8793VVQDLPTPIGAGIYNIYTGVHRDELAGASMPTVAQLGLEPPRFCAECGRRMVVQVRPDGWRAKCSRHGQVDSVDMEAKR(SEQ ID NO:149)>BL;ML1222, ML.tab 1443804:1444400 forward MW:20601VTEHCASDISDVSCPPRGRVIVGVVLGLAGTGALIGGLWAWIAPPIHAVVGLTRTGERGHDYLGNESEHFFVAPCLMLGL(SEQ ID NO:150)LTVLAVTASVLAWQLRQHRGPGMVIGLAIGLMICAATAAAVGALLVWMRYGALNFDAVPLSYDHKVAHVIQAPPVFFAHGLLQVAATVLWPAGIAALVYAVLAAANGRDDLGGRLCSR>BL;ML1232, ML.tab 1465612:1466688 reverse MW:38687MKRLLATTLAALTVGTVSGFGSTVASSEPGEPWLPPSPVPVRENSPAKIVYALGGARPPTFDWDYYTIRAGDEFFPDVNR(SEQ ID NO:151)KLIDYPARAPFRYVPTFLVPGPRDEVTIGEAIAVATKNLNQAIHRGTEPAAVVGLSQGSLALDTEQEQLATDPTAPPPDQLTFNTFGDPSGYHGFGKSVLASIFRPGDYIPLIDYTMPQRMDSQYDSNRVVAAYDGLSEFPDRADNLLAQLNCFAGGAISHTPSGFFNPEDVPPQNIRTTVNSRGAKTTTYLIPVNHLPLTLPLRYLGWSDALVDQIDAVLQPKIDAAYAYNDNPLNKPISVDPVNGMDPIAGIDAELRDSILNVFAQLRSILPPPPG>BL;ML1233, ML.tab 1466853:1467545 reverse MW:24400MWITILLMAIAISLEPFRIGMTVLMLNRPRPTLQLLVFLCSGFTMGMTVGFVVLFVFRRRLMASMQLTLPKVQILIGVLA(SEQ ID NO:152)LLVAAVLTVQVCISSEPPAESPVDSASGPPKPSKWAPRPLARLLNGDSLWVAGVAGLGIALPSVDYLAALAVILTSGAAATTQVGALLMFNVVAFALIEIPLAAYLLAPDTTRAWTAALNNWIRSRRRLEVATLLAGVSCLLLAVGIAGL>BL;ML1244, ML.tab 1482817:1484292 forward MW:52449MADSVEGIGPFDELGALDYLLHRGEANPRTRAGIMAVELLDTTPDWNRFRSRIEDVSQRVLRLRQKVVVPTLPTAAPRWV(SEQ ID NO:153)VDPDFELDFHVRRVRVPDPGTLREVFDLAEVIQQSPMDVSRPLWTATLVEGLAAGRAANLLQISHAITDGVGSVEMFAENIYDLERDPPSRPRSPQPIPQDLSRNDLMLQGINHLPVALAGGVXTGGLSGVASVAGRAILRPASTVSGVVGYVRSGIRVLSQAAEPSPLLRQRSLATRTEAIEIQLSDLHKAAKAGDGSINDAYLAGLVGALRRYNEALGVSISTLPMAVPVNVRTEADVVGSNRFVGVTLAAPLGTNDPAARMQKIRSQMTQRRDEPAMNIIGSLAPLMTVLPASVLDFIVDSVASSDVNASNIPAYPGDTYFAGAKILRQYGIGPRPGVAMMAVLMSRGGFCTVTVRYDRASVKSEALFARCLLEGFDEVLALAGDPTPHAVPASFAARSSGSPAGWLSSS>BL;ML1249, ML.tab 1490899:1495767 forward MW:177878MTIDPGATHVAELCTTFTQGADVPDWISKAYIDSYRGSHGDVREAPETSRVNPNALVTPAMLSAHYRLGQCRPNGRNCVR(SEQ ID NO:154)VYPADDPAGFGPALQIVTDHGGMVMDSITVLLHRLGVTYTAMMTPVFMVLRSPTGELLGVEPRASSTSHSIEGTWVGEVWIYIQLLPAVDSKSLAEVEQLLPRTLVDVQRVAADAAALNATLSGLAADVKTNKEGHFSASDRDDVAALLHWLGNGNFLLLGYQRCRVHYGLVSCDRSTGLGVLRARTGSRPRLTDDNELLVLAQAAVGNYLRYGAYPYAIAVREYDDGGDCGIIEHRFVGLFTVAAMNADVLEIPSISHRVRAALAMANSDPIYPGQLLLDVIQTVPRSELFTLSAERLFTMAKEVVDLGSGRRALLFLRADRLQYFVSCLVYVPRDRYTTGVRLQIEDILVREFGGTQVEFTARVSESPWALMHFMVRLSEGAATGSVDVSEGNRIRIQAMLSEAARTWSDRLIAAAASFSEGSVSYAEAEHYAATFSETYKQAVTPADAIDHIAIIKELADDSVKLVFFERKADGFAQLTWFLGGRSASLSQLLPMLQSMGVVVLEERPFTVARTDGLPVWIYQFKISPHPTIPLASTANERELTAKRFSDAVTAIWQGRVEIDRFNELVMRARLTWQQVVLLRAYAKYLRQAGFNYSQSYIESVLNEHPSTARSLVALFEALFDPSPLSSSTNCDAQAAAAAVAADIDALVSLDTDRILRAFASLVQATLRTNYFVTQKFSARSKGVLVLKLDAQLINELPLPRPKFEIFVYSPRVEGVHLRFGAVARGGLRWSDRLDDFRTEILGLVKAQAVKNAVIVPVGAKGGFVLKRPPLPTGDAAADRDAMRAEGIACYQLFISGLLDITDNVDHATGKVNAPPQVVRRDSDDAYLVVAADKGTATFSDIANDVAKSYGFWLGDAFASGGSVGYDHKAMGITAKGAWEAVKRHFREMGVDTQNEDFTVVGIGDMSGDVFGNGMLLSKHIRLIAAFDHRHVFLDPDPDAAVSWAERQRMFDLPRSSWDDYNKSLISEGGGVYSREQKAIPTSPQVRTALGIDGEVTEMAPPNLIRAILQAPVDLLFNGGIGTYIKAETESVADVGDRANDPVRVNANQVRAKVIGEGGNLGVTALGRVEFDLSGGRINTDAMDNSAGVDCSDHEVNIKILIDSLVTAGKVKVEERKHLLESMTDEVARLVLTDNEDQNDLIGTSRANAANMLSVHAMQIKYLVDERGVNRELEALPSEKEIQRRSEAGIGLTSPELSTLMAHVKLALKEQMLATELPDQOVFVSRLPRYFPKPLRERFTPEIRSHQLRREIVTTMLINDLVDTAGISYAFRIAEDIGVGPIOAIRTYVATDAIFGVGOVLRRIRAANLSVVLSDRMTLDTRRLIORAGRWLLNYRPQPLAVGAEINRFAAKVKALTPRMSEWLRGODQAIVEQQATEFVSQGAPEDLAYRVAVGLYRYSLLDIIDIADITELDPAEVADTYFSLMDRLGTDGLLTAVSKLPQNDRWHSLARLAIRDDIYASLRSLCFDVLAVGEPDESGEEKIAEWEHISASRVERARLMLAEIHASGEKDLATLSVAARQIRRMTRTSGRGSSG>BL;ML1255, ML.tab 1499768:1500259 forward MW:16842MGNQSSQSEVAPVVRGDVVTELPKGWVITTSGRVSGVTEPGDRSVHYPFPIKDLVALDDALTYSSRASHARFAVYLGDLG(SEQ ID NO:155)NDTAALAREILAQVPTPDDAVLVAVSPNQCAIEVVYGSQVRGRGAESAAPLGVAAASSAFEQGNLIDGLISAVRVLSAGISRS>BL;ML1270, ML.tab 1513725:1514522 forward MW:27833VMAPDIKSARAGRLTIQIAQLLLVVAAGALWMAARLPWVVIRSFDGLGPPKEVALSGASWSAVLLPLALLMLAATVAAIA(SEQ ID NO:156)VRGWPLRVLAGLLAVASFLVGYLGVSLWVLPDVTVRGAVLAHVSLLSLVGSQRHHLGAGAAVAASGCTLIAAVLLMRSASVIGSARQGTSKYVVPAQRRSIARRDGAATAISQMSERMIWDALDEDRDPTDRLREPDTEGRWWTACRRSLPFMNVVEIGGCTGSVAGRWVTSGKGNDTHVSGNCA>BL;ML1296, ML.tab 1545740:1546075 forward MW:12349MYRFGMRYLDSMTVDRHVAGNEFTVEEISTGIFASGYGQVGDGRSFSFHIEHWSLWEIYRTRLAGLVPQTEEVVPRA(SEQ ID NO:157)IRGLVNIDLTDERSLAAAVRDLVARTLTVSG>BL;ML1299, ML.tab 1547791:1548381 forward MW:21454MARAIHVFRTPDRFVAGTVGQPGNRTFYIQAVHDSRVVSVVLEKQQVAVLAERIGALLLEVHRRFGTPVPPEPAEINDLN(SEQ ID NO:158)PLVMPVDAEFRVGTMGLGWDSEAQTVVVELLAVTDAEFDASVVLDDTDEGPDAVRVFLTPESARQFATRSNRVILAGRPPCPLCDEPLDPEGHVCARTNGYKRSALLGPKDDDTEW>BL;ML1300, ML.tab 1548392:1549180 forward MW:28681VLRNGELTVLGRIRSASNATFLCESTLDQRSVHCVYKPVSGEQPLWDFPEGTLAGRELSAYLVSTDLGWNIVPYTVIRDG(SEQ ID NO:159)PAGPGMLQLWVQQPGDVADSAPRSGPDMVDLFPADKLQSGYLPVLRSYNYAGDEVILMHADDTQLRRLAVFDVLINNADRKGGHILYGLDGHVYGVDHGVSLHVEDKLRTVLWGWAGKPIDNQTLEEVAGLADALSGPLADTLAGQITWAEIIALRRRAYANLDNPVMPGPNRDRAIPWPAF>BL;ML1306, ML.tab 1555819:1556643 reverse MW:30377VAAFEGWNDASDAASGALEHLNAVWEADPIVEIDDEAYYDYQVNRPVIRQVDGVTRELVWPAMRISYCRPPGSDRNVVLM(SEQ ID NO:160)HGVEPNMRWRTFCTELLTIADRLNVDTVVILGALLADTPHTRPVPVSGAAYSPESARRFGLEETRYEGPTGIAGVFQDACVAARIPAVMFWAAVPHYVSHPPNPKATVALLRRVEDVLDVEVPLADLPTQAEDWEQAITEIAAEDDELAEYVHSLEQRGDAEVDVNDALGKIDGDALAAEFERYLRRRRPGFGR>BL;ML1315, ML.tab 1567372:1567956 reverse MW:20428VSRWTHRTFFIALSAIVTTAGFGSSGCAHGNSSTSESAVPSTFPGISSSITAPPATGLPAPEVLTNVLSRLADPNIPGID(SEQ ID NO:161)KLPLIESATPDSAVTLDKFSNALRONGYLPMTFTANNIAWSNKNPSDVLATISVNIAQTNNSVFSFPMEFTPFPPPQQSWQLSKRTADMLLEFGNSSGLTNPAPIKAPTPTPSH>BL;ML1321, ML,tab 1575493:1575684 forward MW:7067MAQEQTRRGGGGDDDEFTSSTSVGQERREKLTEETDDLLDEIDDVLEENAEDFVRAYVQKGGQ(SEQ ID NO:162)>BL;ML1334, ML.tab 1587322:1588140 forward MW:29047MSEPQGSDPGKQWQSPGEGVENHSSDQPTQAASPWQQQPSTQDSTWHPPAYASPECYNYPQLTEPVYPHQYPSATPGYGQ(SEQ ID NO:163)PGYFGAQFSQCGIPGQYPQSGSPGQYGSPGQYGPPGQYGPPGQYGPPGQYGPPGQYGPPGQYGPPGQYSQQFQPYEQPGTKGFVALIGSIAGVIGVLIFAAILVTGFLWPAWLVTTKLDVNKAQASVQQVLTDETNGYGAKNVKDVKCNNGADPTVKKGDTFDCSVSIDGMQKRVTVTFQDDKGTYEVGRPQ>BL;ML1338, ML.tab 1595449:1596771 reverse MW:50163VYGALVTAADSTQTGLRNWLLAVFHPRTHTPSTATIVRSALWPAAILSVLHRSTVITTNGNITDDFKPVYRAVLNFRHGW(SEQ ID NO:164)DIYNEHFDYVDPHYLYPPGGTLLMAPFGYLPFAPSRYLFILINTGAILIAWYLILRLFKYTLSSVAAPTLLLAMFCTETVTSTLVFTNINGCIMLLEVLFLWWLINGSEPKTVSQQWWAGGAIGLTLVLKPLLGPLLCLPLLNRQWQALVPAIALPVVINLAALPLVSHPMDFFTRTVPYILGTRDYFNSSIEGNGVYFGLPTWLIVFLRLLFTVLAICSLWLLNRYYRTRDPLFWFTCSTGVLLLWSWLVLPLAQGYYSMMLFPFLMTVVLPNSLIRNWPAWLGIYGFLTLDRWLLFNWMRYGRALEYLKITYGWSLLLIVVSTVLCFRYLDAKAENRLDHGIDPAWLTAERERASVNA>BL;ML1357, ML.tab 1617916:1618101 forward MW:6609MTIDPDQIRAEIDALLAQLPDFADLEDSVSGLSLAQLEEVALRFSEVHSVLLQALESAEKG(SEQ ID NO:165)>BL;ML1361, ML.tab 1621823:1623004 forward MW:42405MRMSALLSRNNSRPGLVGTARVDRNIDRLLRRICPGDIVVLDVLDLDRITADALVEADIVAVVNASPSVSGRYPNLGPEV(SEQ ID NO:166)LVNNGVTLIDETGPEVFKKIKDGAKIRLHEGGVYSGDRRLICGTERTDHDIADLMREAKSGLATHLEAFAGNTIEFIKSESPLLIDGIGIPDIDVDLRRRHVVIVADEPSAADDLKSLKPFIKEYQPVLVGVSGGADVLRKAGYRPQLIVGDPEQISTEALRCGAHVVLPADADGHAPGLERIQDLGVGAMTFPAAGSATDLALLLADHHGAALLVTAGHTANIETFFDRTRTQSNPSTFLTRLRVGEKLVDAKAVATLYRNHISFGAIALLALIMLIAVIVALWVSRTDGVVLNGVIDYWNRFSLWIQRLIA>BL;ML1362, ML.tab 1623026:1623979 forward MW:32360MISLRQHAFSLAAVFLALAVGVVLGSGFLSDTLLSSLRDEKRDLYTQISGLNDQKNMLNEKVSAANNFDNQLLGRIVHDV(SEQ ID NO:167)LGGTSVVVFRTPDAKDDDVAAVSKIVVQAGGTVTGTVSLTQEFVDANSTEKLRSVVNSSILPAGAQLSTKLVDQGSQAGDLLGITLLVNANPAVPNVGDAQRSTVLVALRDTGFITYQTYNRNDHLGAANAALVITGGLLPQDAGNQGVSVARFSAALAPHGSGTLLAGRDGSATGVAAVAVARADAGMAATISTVDNVDAEPGRITAILGLHDLLSGGHTGQYGVGHGATSITVPQ>BL;ML1389, ML.tab 1665174:1666757 reverse MW:57894VTSIMSVSALEQSAADVGDNSARQHAHGALPDSLAIAMLATVISGAWASRPSLWFDEAATISASASRTVPELWRLLSHID(SEQ ID NO:168)AVHGLYYLLMHGWFAIFPSTEFWSRVPSCLAIGAAAAGVTVFTRQFATRTTAVYAGIVFAILPRITWAGIEARSSALSVAAANWLTVLLVASVQRNRPRLWLCYALTLMLSILLNLTLATLVLVYAVILPWLAPNKFRNSPFIWWAVTSVVALGTITPFILFAHGQVWQVDWIFRVSWHYVFDITQRQYFDHSVSFAIATAVIIVPAIATRLAGLRAPAGDLRSLVIICTAWIVIPTTLMVGYSAVIEPVYYPRYLILTAPAAAIVIAVCIVTVARKPWPIAGVLVLFAVAAFPNYLFTQRGRYAKEGWDYSQVADVISSQAAPGDCLIVDNTVPWRPGPIRALLAARPAAFRSLIDIERGFYGPTVGTLWDGHVPVWLVTAKINKCSTVWTVSDRDTSLPDHQAGQLLSPGLILGRAPAYQFPSYLGFRIVERWQFHYSQVIKSTR>BL;ML1399, ML.tab 1679214:1680188 forward MW:34653LPPSAKSTYPGQVEGAPHDGTPSAPQEPDEDTVTVPAPALIRRSSVSMPNAAQWLHTTNRSPRLVANVRRARRLLPGDPD(SEQ ID NO:169)FGDPLSTAGEGGPRAAARAADRLLGDRGAASREVSLSVLQVWQALTEAIARRPVNPEVTLVFTDLVGFSGWSLQAGDEATLALLRQVARAVESPLLDAGGHIVKRMGDGIMAVFRDPSVAVQAVLAATEAMKSVEVGGYTPRIRVGIHTGRPQRLAADWLGVDVNIAARVMERATKGGIMISGPTLDLIPQSDLKELGIITRRVRKPMFTSKFTGIPPDMVIYRIKARRELTASDETAQTNSLT>BL;ML1439, ML.tab 1733347:1733682 forward MW:12946MADRVLRGSRLGAVSYETDRNHDLAPRQIARYRTDNGEEFEVPFADDADIPGTWLCRNCMEGTLIEGDLPEPKMVKPPRT(SEQ ID NO:170)HWDMLLERRSIEELEELLKERLDLIRSRRRG>BL;ML1444, ML.tab 1738595:1739293 forward MW:25220MTKIQIDAPDGPIDALLSVPPGPEPWPGVVVIHDAIGYEPDKESTNNHIAMAGYVAITPNLYSRGSRARCITRVMRELLT(SEQ ID NO:171)KRGRAFDDILATRDYLLAMPKCSGRVGIAGFCMGGRFALVMSPKGFDASAPFYGTPLPRNLSETLNGACPIVASFGGRDPLGIGAPKRLRQATQTRHITTDIKVYPDAGHSFANKLPGQPLMRITGFGYNQAATEDAWSRVFAFFDKHLRTD>BL;ML1446, ML.tab 1740098:1740484 reverse MW:13769VTPTFADLAKAKYILLTTFTKDGRPKPTPIWAAADGDRLLAISAGKAWKVKRIRNNPRITLATCNVRGCATSAGVQGNAT(SEQ ID NO:172)ILDKLQTGSVYDAICKQYGIQGRLFNFVSKLRGGMQNNVGLELRVSGS>BL;ML1470, ML.tab 1768618:1768989 reverse MW:12909MTDSPGEHGPQKPLPSADPWKSFAGVMAAILFLEAIVVLLALPVLGASGGLTLSALSFLIGLAGLLIVLVGLQRKAWAIW(SEQ ID NO:173)VNLGVQVVVLVGCAVYPVLGFVGVLFAGLWALIVYFRAEVSRR>BL;ML1485, ML.tab 1787915:1788538 reverse MW:23064MPEKASVKYQADLWFDPLCPWCWITSRWILEVEKVRDVEVHFHVMSLAILNENREDLTENYLENITKAWGPARVVIAAEQ(SEQ ID NO:174)ANGASVLDPLYTAMGIRIHNEDNKNLDEVIKRSLADTGLPAELAAAAQSNAYDDALRESHHAGMDPVGDDVGTPTVHVNGVAFFGPVLSKIPRGEEAGKLWDASLTLAAYPHFFELKRTRTEPPQFA>BL;ML1494, ML.tab 1801370:1801723 reverse MW:12353MRIVVLVLFAADGVLSAVVGANLMPLYIGSVPFPISGLISGLVNAVLVWAARLWTRSPWLVALPLWVWLLTVGLLSLGGP(SEQ ID NO:175)GVDVVSGQSVMASGALWLILLGVSPPACVLWRCNRYG>BL;ML1503A, ML.tab 1815091:1815375 forward MW:10624MTVLTDEQVDAALLDLNDWKHTDGALCRSIKFPTFLAGIDAVCRVAEHAETKDHHPDIDIRYRTVTFILVTHYADGITKK(SEQ ID NO:176)DITMARDIDRLIGD>BL;ML15OS, ML.tab 1816758:1817306 forward MW:17516MATIWTYLRATAIVVGSSAALLTGGIAHADTAPAPAPAPAPAPAPALNIPQQLISSAANAPQILQNLATALGATPPVTPS(SEQ ID NO:177)APGISFPGLTPAAATVPTSSAATLPSLPGIMAPAISNTPTTPGLPASTPGFPQARVDMPAMPPLPVSVPPQISLPGDLQALASSASAAAAPVAAPTLLSALP>BL;ML1506, ML.tab 1817334:1818380 forward MW:34620LKTGGFVTSAWNLPKGLVAVVTASTAAFGLCQNAAADPANPYGTPPNPTQQLPGLPALAQLSPIVQQAANNPQQATQLLM(SEQ ID NO:178)EAVSALTQNPTAPIASKNLATSVSQFMQEPNNPNPGASALDIPTSGVPAPAANGITPLDVVLVPHLPSAGAEPGAQAHLPTGIDPVHAAGPATAATPTPGSPTNRTAAPPTPVASPAPTTPELPATTPGFGPDAPPTQDFIYPSISTNCLADGSSSIATALSVAGPAKIPLPGPGPGQAAYVFTAVGTPGPADVQKLPLNVTWVNLTTGKSGSATLKPRPDINPEGPTTLTVIADTGSGSIMSTIFGQVTTKEKQCQFMPTIGSTVVP>BL;ML1508, ML.tab 1819557:1820048 reverse MW:18221MRFGASLLRVRRRLYGMGSQVFDDKLLAVISGNSIGVLATIKRDGRPQLSNVQYHFDPRDLAVRVSITEPGVKTRNLRRD(SEQ ID NO:179)PRASILVDVDDGWSYAVAEGTAELTPPAAAPDDDTVEALIVLYRNIVGEHLDWDEYRQAMVTDRRVLLTLPILHVYGLPPGIR>SL;ML1525, ML.tab 1840011:1840466 reverse MW:15892MTKTLLVVHHTPSPTTRELLEAVLAGANDSEIDGVEVVSRPALAATIPDMLNADGYLFGTTANFGYMSGALKHFFDTVYY(SEQ ID NO:180)PILDNVSGRPYGLWVHGMNDTVGAAVAVGKLATGLSLTHAADVLEVVGPVDAMVCERAHELGGTLAAMLME>BL;ML1526, ML.tab 1840474:1840956 reverse MW:17670MNDTVMTKRSLYILYIQLLIAALCFANLAYLVLLGRMAVANIGSGQAAAVSLGLALLIMPVIGLWANIATLRAGFAHQKL(SEQ ID NO:181)ARLIAADGMELDTSVLLRRPSGRIQRDAADALFATVRAELAGDPDNWRCWYRLARAYDYAGDRRRAREANKTALELHGRD>BL;ML1537, ML.tab 1853559:1854764 reverse MW:43686MKAQRRLGLALSWSRVTTVFVVAIVVLVLASHVPELWQAGHHVAWCVGVGITVVVMVLVLVSYHGITLMSGLATWVWDCF(SEQ ID NO:182)ADPRAALAAGCTPAIDHQRRFGRDVVGVREYKGRLVTVIAVDDGEDDPVGRHRHRTTPAGLPVQAVADGLRQFDIRLDSIDIVSVKIRRGGNAAKFSALDNWGSEEWGLVCACPPTYQRCTWLVLRMNPQHNVVAVAARDSLASTLVAATERLAQDLDGQNCAARPLAAGELAEVDSAVLADLEPTWSYPGWRHIKYVNGFATSFWVTPSDIDSETLNELWLSDLPDIKATVITIRLASRSCQTRLSAWVRHHSETRLSKEACRGLNQLTGRQLAAVRASLPAPATRPVLVVPSRELSDHDELVLPVGQGREGSASLFAGQ>BL;ML1540, ML.tab 1858295:1859197 reverse MW:32551MDHQSTRTDITVNVDGFWLLQALLDIRHVAPELRCRPFVSTDSNDWLHEHPGMAVMREQGIVVNGNVNEHVATRMRVFAA(SEQ ID NO:183)PDLEVVALLSRGKLLYGAVNDENQPPGSRDIPENEFRVVLVRRGSHWASAVRVGNDITIDDVAIADSASITSLVMDGLESIHHADPAAINAVNVPLEEMLEATKSWQDAGFNVFSGRDLRRMGISAATVAALGQALSDPAAEAAVYARQYRDDAKGPSASVLSLKDSSSGRIAIYQQARTAGLGDAWLAICPATPQLVQLGLKTVLDTLPYGDWKTHSRV>BL;ML1544, ML.tab 1866369:1867889 reverse MW:53849VAEESSGRRGSEYGLGLSTRTQVTGYQFLARRTAMALTRWRVRMEVEPGRRQTLAVVASVSAALLGCLGALLWSFISPSG(SEQ ID NO:184)QLNESPIIIDRDSGALYVRVGDRLYPALNLASARLITGRPDNPHAVRSSQIATLPHGPLVGIPGAPSELSPETPQTSSWLVCDTVATSTGIGSSSGVTVTVIDGSPDLSGHRRVLTGSDAVVLHYGGDAWVIRQGRRSRIDAMNRSVLLPLGLTPEQVSQARPMSRALFDALPVGPDLVVPDVPNEGNPASFPGAPGPVGTVIVTPQISGPQQYSLVLADGVETVSPLVAQIMQNAGKPGNTKLITVAPSVLVKMPVVNKLDLSVYPDTPLQVVDLRENPSTCWWWERTAGESRSRIQVITGSTIPINSADVKKVVSLVKADTTGREADQVYFGSNYANFVAVTGNNPAAQTTESLWWVTRTGARFGVEDTKDVRDALGLNGTPNLAPWVALRLLSQGPTLSRADALMEHDTLPMDMTPAELVVPK>BL;ML1548, ML.tab 1872638:1873603 reverse MW:37427VAKQEPMCGVEPTLWAISDLHTGHVGNKPVAESLYPLSPDDWLIVAGDVAECTDEIRWTLELLRRRFAKVIWVPGNHELW(SEQ ID NO:185)TTNRDPMQIFGRARYDYLINMCDQMGVVTSEHPFPLWTERGGPATIVPMFLLYDYTFLPTGADSKAKGLAIARERNVVATDEYLLSSEPYATREAWCRDRLDVTRSRLEQLDWMTPTVLVNNFPLVREPCDALFYPEFSLWCGTTKTADWHIRYNAVCSVYGHLHIPRTTWYNEVRFEEVSVGYPREWRRRKPYSWLRQVLPDPQYAPGYLNDFGGHFVITSEMRAQAVQFRERLRQKQSR>BL;ML1557, ML.tab 1883016:1883336 reverse MW:11540VRTCVGCRKRELAVELLRVVAPSTGKGSYAVIVDTASSLSGRGAWLHPDMQCVQQAIRRRAFTGALRIAGSPDTSAVVEH(SEQ ID NO:186)IEFLSELDRPGNRTGSKEHEHTVKSR>BL;ML1560, ML.tab 1885239:1885775 forward MW:17494MLLVPSAVPVINRRGVLAGGATLTVLGVCFSACSKSPSKTPEIEELLGPLDQARHDSALASAAAAAIGNLPQITAALAVV(SEQ ID NO:187)ATQFAAHARALVTEISRATGKLASSSSDTTSPGLSPASPPSKPPPPVSDVIDALRTSAEGAGRLVSTASGYRAGLLASIAASCTASYTVALVSGGPSI>BL;ML1561, ML.tab 1885772:1886269 forward MW:17495MTSIEPSAPTPVATPKRTPVSQDSDNAGLSEALVVEHSTIYGYGIVLALSPPNANSLVVDALIQHRQRRDDIIVMLTARR(SEQ ID NO:188)VSPPVAASGYQLPMLVGSAADAARLAVRMENDGATAWRAVAEHAETADDRTFAAMALAQSAVMAARWNKMLGAWPITTTFPGSNE>BL;ML1584, ML.tab 1909590:1909844 forward MW:9465VASTEGEHNAGVDPAEVPSVAWGWSRINHHTWHIVGLFAIGLLLANLRGNHIGHVENWYLIGFAALVFFVLIRDLLGRRR(SEQ ID NO:189)GWIR>BL;ML1607, ML.tab 1932700:1932990 reverse MW:11018MTTHKAMTRVQLEANGEVFAVDNLTRMGLRGLHCNWRCRYGECDVIASETAHRTVVSRLRSIAATVMEGSRRSAPEQKVR(SEQ ID NO:190)WLRWLAGLWPANQDEF>BL;ML1610, ML.tab 1935020:1935325 reverse MW:12258MSAEDLEKYETEMELSLYREYKDIVGQFSYVVETERRFYLANSVEMMPRNTDGEVYFELRLADAWVWDMYRPARFVKQVR(SEQ ID NO:191)VVTFKDVNIEEVEKPELRLPE>BL;ML1624, ML.tab 1947637:1949427 forward MW:65207MTQSDHRHTRPSSYGGTVLTQRVSQATVEDSRPLRGWQRRAMVKYLASQPRDFLAVATPGSGKTTFALRVMTELLNSHAV(SEQ ID NO:192)EQVTVVVPTEHLKVQWTRAAATHGLALDPKFSNTNPRTSPEYHGVTMTYAQVAAHPTLHRVRTEGRRTLVIFDEIHHCGNAKAWGDAIREAFSDATRRLALTGTPFRSDDKPIPFVTYALDADGLMHSQADHTYSYAEGLADGVVRPVVFLVYSCQARWRNSAGEEHAARLGEPLSAEQTARAWRTALDPSGEWMPAVISAADQRLRQLRTHVPDAGGMIIASDQTAARAYANLLAQMTSETPTLVLSDDPGSSARITEFAKNTSQWLIAVRMVSEGVDIPRLSVGIYATSASTPLFFAQAIGRFVRSRHPGETASIFVPSVPNLLQLASELETQRNHVLGKPHRESTDNPLGGNPATMTQTEQDDTEKYFTAIGADAELDQIIFDGSSFGTATPAGSEEEAYYLGIPGLLDADQMRALLHRRQNEQLQKRTAAQQASSTPDRTSGAPASVHGQLRELRRELNSLVSIAHHHTGKPHGWIHNELRRRCGGPPIAAATHDQLKARIDAVRQLNAEPS>BL;ML1638, ML.tab 1975122:1975820 reverse MW:25842LSKLDDELSRIAHRANYLPQREAYERMRVERTGANDRLVAVQIALEDVDTQVFLLESEIDAMRQREDRDRLLLNSGATDA(SEQ ID NO:193)KQLSDLQPEFGTWQRRKNSLEDSLREVMKRRGELQDQLTAELGAIERMQTDLVGARQTLDVAPAEIDQVGQPHSSQCDVLIAELAPALSAPYERLCAGGGLGVGQLQGHRCGACRSEIGRGELSCISVDVDDEVVKYPESGAIQLLDKGFFQ>BL;ML1644, ML.tab 1978961:1979773 forward MW:30049MRHLALLLRPGWIALTLVVTAFTYLCFMVLAPWQLGKNTRMSRENNQIEYSLNTPPVPVKTLLSHQDLSTSKSQWRQVTA(SEQ ID NO:194)TGRYLPDVQVLARLRVVDSGQAFEVLAPFVVDDGPTVLVDRGYVRPEPGSHVPPIPRPPNEAVSITARLRDSEPVMKDKEPFSRDGVQQVYSINIEQVARLTKIPLAGSYLQLVDNQPGGLGVIDIPHLDAGPFLSYGIQWISFGIIAPIGLGYLAYAEIRTHRQEKLAKPSQAPMAVEEKLADRYGHPR>BL;ML1649, ML.tab 1987829:1988251 reverse MW:15144VVAADHTPSFARKLGIQQGQVVQEWGWDEDTDDDIRASVEEACGGELLDEDTDEVVDVVLLWWRDGDGDLVDTLMDAITP(SEQ ID NO:195)RAEDGVIWVLTPKTGKPGHVPPAEIAEAAPTAGLMLTSSVNLGDWSASRLVQPKSRIGKR>BL;ML1652, ML.tab 1992060:1993304 forward MW:44873VNNNHFVPAPFRRLPLELLDTVPDSLLRRLKQYSGRLATEAVTAMQERLPFFADLEASQRASVTLVVQTAVVNFVQWMQN(SEQ ID NO:196)PHSDVSYTAQAFELVPQDLARRIALRHTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFTAATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAPATVVVGTPTPNHDGPNGQVSSERASQEVREIAARHGRAALTDVHGTWLVAIISGQLAPTDKFFSDLLHAFSDGPVVIGPTAPMLTAAYHSASEAVSGMNAVAGWSGAPRPVQARELLPERALMGDASAIVALNTDVMLPLADAGPTLIETLDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGHDPTLPRDAYVLRVAATVGKLNYPTHH>BL;ML1660, ML.tab 2001607:2002260 reverse MW:23449VTVALALRDGCRRISTMTRQYGVTTQRHLISPVVFDITPLGRRPGAIIALQKTVPSLARIGLELVVIEWGAPINLDLRVE(SEQ ID NO:197)SVSEDVLVAGTVTAPTVSECVRCLTAVHGHVQVTLNQLFAYPYSATKVTTEEDAVGHVVDGTIDLEQSIIDAVGIELPFAPMCRSDCPGLCAECGTSLVVEPGHPHDRIDPWWAKLTDMLAPDVPQTSETDGSRSEW>BL;ML1666, ML.tab 2009601:2010245 reverse MW:23370VSAQPVERPGDLKPAPASVLPMPVPTAWWVLIAGVIGLVASMMLTVEKIRILLNSAYVPSCNVNPIVACGSVMSTPQASV(SEQ ID NO:198)LGFPNPLLGIVGFTLVTVTGVLSVAEVSLPQWYWIGLAVGTLAGVGFVHWLIFQSLYRIGALCAYCMVIWAVSVSLLVVVTAIVFRPLLEVLPGRTSAIARGIYQWRWSIATLWFITVFLLIMVRFWNYWQTLL>BL;ML1677, ML.tab 2021235:2021810 forward MW:19668VTSNSDAGSAVDAGGPPRTVIIAAVVLTAATIGTILVLAATLHEPPQPVVITAVPAPQATTAACRSLTQALPQRLGDYER(SEQ ID NO:199)APVAQPAPDAVAAWRTGSDTEPVVLRCGLDGPAEFVVGSPIQAVDQVQWFEVDAKPKPAIDAGRSTWYTVDRPVYVALTLPSGSGPTPIQELSDVIDRTLAAIPISPAQSH>BL;ML1698, ML.tab 2046984:2047817 reverse MW:29424VTGQSHDEHWRRPGECPEPIPGRPASASLVDPEDDLTPVGYPGDFGTTTVIPYSDPDHLKGPGGTGYNLLDQQEPLPYVQ(SEQ ID NO:200)PQARHAVAEPTEIDSDQDNERLHTVGRRGTQHLGLLVLRVGLGVVLAAQGLHKLFGWWGGQGLTGFKNSLTQVGYQHADILAYVSAGGEAVAGVLLVLGLFAPVVAAGALAFLINGLLAAWPHSPLFSFFLPDGNEHQITLIVMDVTVILCGPGRYGLDAGRRWAYRPFIGSFIVLIAGIAAGIAVWVLLNGVNPLA>BL;ML1704, ML.tab 2055043:2055741 forward MW:24598VEMPLTQRYATVLAVPSYLLRIELADRPGSLGSLAVALGSVGGDILSLDVVERSGGYAIDDLVIEVPSGAMPDKLITAFE(SEQ ID NO:201)SLPGVRVDSVRPHSGLLEAHRELELLDQVAAADDNASKLQVLADEAPKVLRVSWCTVLRSSQGKLLRLAASVGAPETRANSAPWLPIERAAPLDGTAEWVPQSLRDMNTTMIAAPLGDPHTAVVLGRPGPEFRPSEVARLGYLAGIVATMLR>BL;ML1706, ML.tab 2057887:2058900 reverse MW:34564MSVFATASGVGSWPGSSPYPAAKVVVGELAGALAHIVELPARGVGADMLGRAGALLIDVAIDTVPRGYRIAARPGAVTRR(SEQ ID NO:202)AFSLLDEDMDAFEAAWEMAGLRGRGRVVKVQAPGPITLAAGLELANGHRAITDSCAVRDLAESLAEGVAAHRAALARRLDTQVVVQFDEVSLPAALGGLLTGVTAFSPVAPLDETLAATLFDSCVATVGADVLLHSCAAELPWNFLQRSAIRAVSVDVNVLRTGDLDGIAAFVESGRTVVLGVVPATAPQRLPSVEQVAAAVVGVTDRLGFGRSALRDRIGVSPACGLAGATPHWACTAIELARKTAEAFAQDPDAI>BL;ML1720, ML.tab 2076101:2077195 forward MW:38298LAAGPALSARGYLAMNAQTQAGCSLMEWENNDNGRQRWCVRLVQGGGFGGPLFDGFENLYVGEPGTIFSFPMTQWTRWRQ(SEQ ID NO:203)PVIGMPSTPRFLGNGQLLVTTHLGQVLVFDAHRGMVVGSPLDLVDGVNPTDPTRGLADCVSAQRGCPVASAPAYSPASOTVVLDIWQPGAPTAGLIGLKYHSRQTPLLARAWTSDAIGAGVLASPVLSADGSTVYVNGRDQHLWALHAANGKPKWSVPLEFLAQTPPTVTPQGLIVSGGGPDTRLAAFKDAGDHAQQIWRRDDLTPLSTSSLAGVSVGYTVVSSSPAADAPGMSLLAFDPRNGHTLNSYPLPAATGYPVGVSIANNRRVITVTSDGQLYSFAPT>BL;ML1723, ML.tab 2079775:2080758 reverse MW:36615MTRSTEISANAVPNPHATAEQVAAARKDSKLAQVLYHDWEAENYDEKWSISYDQRCVDYVRHCFDAIVPDELFTELPYDC(SEQ ID NO:204)ALELGCGTGFFLLNLIQSGVARRGSVTDLSPGMVKIATRNGQSLGLDINGRVADAEGIPYDDNTFDLVVGHAVLHHIPDVELSLREVLRVLKPGGRFVFAGEPTDVGNRYARVFADLTWKVTIRVVQLPGLSAWRRTQVEIDENSRAAALEWVVDLHTFEPKVLETMATNAGAVQVKTVTEEFTAAMLGWPLRTFESTMPPSKLGWGWARFAFTSWKTLSWVDANVWRRVIPKSWFYNIIITGVKPS>BL;ML1781, ML.tab 2157673:2158185 reverse MW:18049MRASNQFADATAGVVYVHASPAAVCPHVEWALSSTLQTKAINLVWTPQPAMPGQLRAVTNWTGPVGTGARLANALRSWSVL(SEQ ID NO:205)RFEVTEDPSPGVDGQRFSHTPQLGLWSGSMSANGDIMVGEMRLRANMAHGADTLAAELDSVLGTAWDDALEVYRDGGDVGEVTWLSRGVG>BL;ML1782, ML.tab 2158324:2159145 reverse MW:29017MVALDALDDWPVPTTAAAVVGPTGVLAARGDTEQVFPLASVTKPLVARAVQIAIEEGVVDLDTVAGPPGSTIRHLLAHAS(SEQ ID NO:206)GLAMHSKYVMAPPGTRRIYSNYGFRVLAETVQREAGIGFSRYLTEAVLEPLAMTATKLEDGTWAAGFGATSTVADLAAFANDLLRPATVSAQIHAEATSVQFPDLDGVLPGHGVQRPNDWGLGYEIKNSKLPHWTGTLNSARTYGHFGQSGGFIWADPEAELALVVLTDRDFGEWALQLWPAISDAVISEYAR>BL;ML1791, ML.tab 2165620:2166063 reverse MW:16468LGKIFIMLPTVTHQAITCTVRRVRWIVDAMNVIGTRPDCWWKDRRGAMVRLVGKLERWASTERNHVTVVFERPPSPSIRS(SEQ ID NO:207)SVIVIAHAPKAFPDSADDEIVRLVQADPEPQGICVVTSDSALTDRVQEVGALAYPAARFRKHIDSID>BL;ML1813, ML.tab 2194277:2194849 reverse MW:20227MTGQDVVVQQPQTIPMLPIYIPQDVDMTVVKSEVAAAGVSASPAAMPGLLEVVSHAQAEGINLKIVLLDHNLPNDTPLRD(SEQ ID NO:208)IATVVGADYPDVTVLTLSPNYVGSYSTHYPRVTLEAGEDISKTGNPVQSAQNFLGELNVPEFPWTVLTIVLLIGVLVAAIGTRFMQLRSKRLATSLDAAGILAEDVNRAD>BL;ML1908, ML.tab 2293144:2293644 reverse MW:18864VALKTDIRGMVWRYSDYFIVGREQCREFARAIKCDHPAYFSEDAAAELGYDAIVAPLTFVTIFAKYVQLDFFRNVDVGME(SEQ ID NO:209)TNQIVQVDQRFVFHKPVLVGDKLWARMDIHSVSERFGADIVVTKNSCTSDDGELVMEAYTTLMGQQGDNSSQLKWDKESGQVIRSA>BL;ML1909, ML.tab 2293648:2294076 reverse MW:14875MARREFSSVKVGDLLPEKTYPLTRQDLVNYAGVSGDLNPIHWDDEIAKVVGLDTAIAHGMLTMGIGGGYVTSWVGDPGAV(SEQ ID NO:210)IEYNVRFTAVVPVPNDGQGAVLVFSGKVKSVDPDTKSVTIALSATTGGKKIFGRAIASAKLA>BL;ML1910, ML.tab 2294063:2294542 reverse MW:17608VGLTTNIVGMHYRYPDHYEVEREKIREYAVAVQNEDTSYFEEDAAAELGYKGLLAPLTFICLFGYKAQSAFFKHANIAVT(SEQ ID NO:211)DQQIVQIDQVLKFVKPIVAGDKLYCDVYVDSMREAHGTQIIVTKNVVTNEVGDIVQETYTTLAGRVGEGGEEGFSDGAA>BL;ML1911A, ML.tab 2295156:2295371 reverse MW:7998MLKKVEIEVDDDLVQEVIRRYGLLGRREAVHLALKALLGEPGVGGLSEQDPEYDEFSNPDAWRTRRSSDTG(SEQ ID NO:212)>BL;ML1918, ML.tab 2301835:2302626 reverse MW:27133VLDLEPRGPLPTEIYWRRRGLAVGITVIVIVIAAIVAVVGNGAAAQPANVDKPGSSQNHPGSATPKVLPPNGHEGNLAPA(SEQ ID NO:213)PPQGRNPESSTSTAAVQPPPVLREGDDCPDSTLAVKGLTNVPQYFLGDQPKFTMVVTNIGLVSCKRDVGAAVLAAYVYSLDNKRLWSNLDCAPSNETLVKTFTPGEQVTTAVTWTGMGSSPHCPLPRPAIGPGTYNLVVQLGNLHSMPVPFILNQPPPPPGSSGPALAPSPEAPPANVPVSGG>BL;ML1926, ML.tab 2311558:2312061 reverse MW:17296LVDTVGLVNKCVPDSSGLPLRAMVMVLLFLGVIFLLLGWQALGSSGNSDDYSALPMSSMPNTPTTPAATSTSSTSAANQA(SEQ ID NO:214)EVRVYNISSKEGIAARTRDQLTTAGFKVTEVDNLVVSDVSATTVYYSDAEGEYATADAVGQKLGAPVEPRIAAITNQPPGVIVLVTS>BL;ML1927, ML.tab 2312092:2312400 reverse MW:11799MDGAMARAHRAGDDVEIVDGLTRREHDILAFERQWWKFAGVKEEAIKELFSMSATRYYQVLNALVDRPEALAVDPMLVKR(SEQ ID NO:215)LRRLRASRQKARAARRLGFEVT>BL;ML1937, ML.tab 2323320:2324552 forward MW:46582MERPNEYMARQRSILNKVFTGFSAYCRYGQHVSKQQARSTVESTYRSILPNIPGVPWWAALLIATTASAIGYAIDAGNTD(SEQ ID NO:216)LTTVFTGFYITGCVAAVLAVRQSGLFTAVIQPPLILFCAVPAAYWLFHGSKIKSLKDLLINCGYPLIERFPLMLGTAGSVLLIGLVRWFFQLMYRTTASSNSEDDTVSKPNSLISGITAVLNSILCIYSNDQNHRANQDTADTELMHSDPPLRTRQTARDNRSARTHSGQARRVVEYTSEPLVEPWQHSSQRSSEQARDFDAGESPRRSRRQPTPQSDPELRSQPPREIRRDAYGHRSGPYEWPTNHSSHLEPYRRYKQPGPPEHFTEHGQLYERYKQPRRRATPPRASSVNPISQVRYRGSTARDDPRVDRQRSQTPRRPVTESWEFDV>BL;ML1945, ML.tab 2331776:2332549 reverse MW:27316MVNDCPRSRSATWSWALAVNQQGWDTRRVTFEWQPNSEVANKPGSVSWSAKPRLLQDGRDMFWSLVPLVVGCILLAGMAG(SEQ ID NO:217)TCTFAPGGTTKSTVSSYDAAAALRADARALGFPVRLPELPTGWRSNSGSRGSIEDGRMDLSTGKRLPAVTSTVGYITPTGMYLSLTQSNADEDRLVASIHPSSHPTGTVDVAGTKWVVYQGSDQKKDHSGAAAEPVWTSRFASPVGAAQIAITGAGCSSQFRMLASATQSQQPLPAR>BL;ML1983, ML.tab 2367418:2367885 forward MW:16955MSNRKFSFEVTRTSSASAAVLFRLVTDGANWAKWAKPIVLQSSWARLGDLRPGGIGAVRKIGIWPILVREETVEYEQDRR(SEQ ID NO:218)HLYKLIGPPNPAKDYLGEVLLAPNAAGGTNIHWSGSFTENIPGTGPVMRAALKGAVRCLTVRLVKTAERESDGVQ>BL;ML1988, ML.tab 2375705:2376418 reverse MW:25568MDCEVAREALSARLDGEREAVPSVRVDEHLGECSACRAWFDQVADQARDLRQLVDSRPAITPVDALGIVAPPRRRRPLMT(SEQ ID NO:219)WQRWALLCVGIAQIVLATVQGLGVSVGLTHDRAMSFGSYLLHESTAWSASLGVIMMAAVLWPGVAAGLAGVLTVFVGLLTVYVVVDVVAGATITLRILMHLPVAIGAVLAIAVWRHSSTPRPTFDDEVDVDLDIVLPRNASRGRRRGHLWPTDGSAV>8L;ML1993, ML.tab 2380915:2381466 forward MW:20576MSAELAPSLQNAAESTNTFPMAEDLLGSILEPYSYKGCRYLIDAQYRASPDSVFAYGNFGIEESAYIRGTGHFNAVELML(SEQ ID NO:220)CFNQLGYSAYAQSVVNKDISALRGWSIADYCRNQLSGILIKNTSSRFKKLINPQKFSARLHVYDLRIVERTWRYLQLSNTIEFWDDNGGSAIGEFEVAILNIP>BL;ML1995, ML.tab 2383087:2383323 forward MW:8810VTETARERILTAVCEVLYIAESDLVDGDETDLRDLGLDSVRFTLLMKQLGLSQEAEMQSKLMDNFSIANWVRQLESST(SEQ ID NO:221)>BL;ML1997, ML.tab 2387532:2388164 reverse MW:23261MIDDLLLRWVTGLFVLSAAECGFACLACRRPWTLVPSNGLHFAMAIAMAVMAWPRGAQLPTTGTVVFCGLVGAWFVILA(SEQ ID NO:222)TVSSRRIAERAAYAYPALMMLANVWMYVIMDSHLHDHWATGHHTSPHTSMLGVDMTTTWPASGIPGWISIINWLWFAFFCIATVFWTYRSFATSRRNAGFSRYCSLHPSGQAMMSTGMAIMFGVLLFHV>BL;ML2010, ML.tab 2402973:2403434 forward MW:15613MWLTAAPAAARFVVASVIAASCAASTGVAGADPQSPSAPKTTIDHDGTYAVGTDIAPGTYSSAGPVGNGTCYWKRIDNPD(SEQ ID NO:223)GPIDNAMSKKPKIVQIEASNKAFKTTGCQPWQQTSNTTVSTDLPGPIAGIQLESNLGILNGLLASNGQQVPRS>BL;ML2022, ML.tab 2413644:2414168 reverse MW:18955MSGPSSSCRCLPCCPHDRSSQPRCGESLGWQGYGRVYLESPWPFGLKQILQILRYGAEYAGKILAQPSKPTDVSSGPRRK(SEQ ID NO:224)TQVFAKATVVPKILQPKETQMTFDPKNAVNAARDIATNFVEKASDIVENVSDIIKGDIAGGANDIVQNSIDIATHAVDKAKEIFIGKGEYDELE>BL;ML2023, ML.tab 2414264:2414668 reverse MW:14091MIRELLAISAIASGAVNSAPRAAANPHYDGDVPGMSYDASLSAPCFSWERYIFGRGPSGQAEACHFPPPNQFPPANTGYW(SEQ ID NO:225)VISYPLYGVQQAGAPCPKTQAAAQSADGLPMLCLGGQGWQPGWFTDKGFFPPAG>BL;ML2030, ML.tab 2420647:2421120 forward MW:16613LDMPGEMLDVRKLCKLFVKSAVVSGIVTASMALSTSTGMANAVPREPNWDAVAQCESGRNWRANTGNGFYGGLQFKPTIW(SEQ ID NO:226)ARYGGVGNPAGASREQQITVANRVLADQGLDAWPKCGAASDLPITLWSHPAQGVKQIINDIIQMGDTTLAAIALNGL>BL;ML2031, ML.tab 2421151:2421606 forward MW:17306MEGSVTVHMAAPADKVWNLIADVRNTGRFSPETFEAEWLDDVTGPALNAKFRGHVRRNEIGPVYWTTCKVTACEPGREFG(SEQ ID NO:227)FTVLLGNKPVNNWHYRLVTSGDGTYVTESFRFNRSPLLTVYWLLGGFLRKRRNIRDMTKTLRRIKDVVEAE>BL;ML2048, ML.tab 2437805:2438062 forward MW:9877MTNLWDLKQKVIYSIQWLWNSLGHQLPRTMLETIRRKTGQPWRTAAGVHPVDNQFGMVCENSEYSDYVYNIKANTAVRT(SEQ ID NO:228)HKSKI>BL;ML2054, ML.tab 2442182:2442481 reverse MW:10350MDVMAATEYLARSTTLTSVGWIGYIIIGGIAGWIAGKIVQGGGSGILMNIVIGVVGALIGGFLLSFFVNTAAGGWWFTLF(SEQ ID NO:229)TSILGSVILLWVVGRVRKT>BL;ML2063, ML.tab 2450656:2451084 forward MW:16160MAKLTRLGDLERAVMDHLWSRQEPQTVRQVHEALSARRDLAYTTVMTVLQRLAKKNLVSQIRNNRAHRYAPVHGRDELVA(SEQ ID NO:230)GLMVDALDQAEDSDSRQAALVHFVERVGADEADALRRALAELETHQRHVRHLVALQRTTEGD>BL;ML2064, ML.tab 2451089:2452039 forward MW:32995VSALAFTILAVLLVGPTPTLVARSTWPLRAPRAAMVLWQTIALAAALSTFSAGIAIASRVLMPGPDGRLLTASVIGAAGRL(SEQ ID NO:231)GWPLWAAYVAAFALTVLVGARLIVAIVRVAIATRRRRAHHRMVVDLVGVGHNAALAQPCARARDLRVLEVAQPLAYCLPGVRSRVVVSEGALTKLNDTEVTAILTHERAHLRARHDLVLEAFTAVHAAFPRLVRSSAALSAVRLLVELLADDAAVRAAGRTPLARALVACASGQAPSGALAAGGNTTVLRVRRLSGRSNSAVVSAAAYLAAAIVLLVPTVALAVPWLTELQRLFNI>BL;ML2070, ML.tab 2460317:2462518 forward MW:77018MTKLGAEFKPDQTRINRKAANTGMGRHSMPDPEDSIDQPSNQFAASGPDQSDEIDHGYQSRMGYPEPVFEPAATGSPSYR(SEQ ID NO:232)SYPHGAEHPADSTPEALDETIDYQSYWAEDRNEDLFVDGAADDHPDFPPRPAGSSTSSQAPTSLSHLFKASHRSVGKWQGGHRSDGGRRGVSIGVIATLVAVVVLVGAVIMWSFLGHILNNRKHQAAARCVGGHQTVAVVADPSIADYLQEFAQSYNASARPVGDHCMMVTVKPVGSEAALTGFNDSWPANLGDKPALWIPGSSISAARLAVTADQKTISESHSLVTSPVLLAVRPEFEQALANKGWAALPGLQTNPNSLADLNLPAWGSLRLALPMNGNSDATFLAGEAVATASVPAGAPAIQGVGAVRTLMSAQPKLADSTWAEAMSTLLKPGDVATAPVHAVITTEQQLFQRGQSLSDAKSALGSWLPHGPAPVADYPAVLLNGSWLTQEQAAAASEFARFVQKPDQLAKLAKAGFRVNGVTPPSSSVTSFAAVPSTVSVGDDGMRATLVEEMIQPSSGVAATIMLDQSMPTDEGGKTRLANVVAALDDKINAMPPTSVMGLWTFDGHKGQTEVTTGQLADPVNGQPRSAALTAALDKQYSSNGGAVSFTTLRMIYQEMLANYHVGQTNSVLVITAGPHTDQTLDGARLQDFIRTSADPAKPIAVNVIDFGTDPDQATWKAVAQISGGSYQNLSTSASLDLATAINTFLS>BL;ML2073, ML.tab 2466445:2467140 reverse MW:24857MTHLVTRARSARGNTVSEQPRQGQLDLADYRDTPTATTHGDIGLNGPTAVSGPAQPGLFPDDSVPDELVGYRGPSACQIA(SEQ ID NO:233)GITYRQLDYWARTSLVVPSIRGAAGSGSQRLYSFKDILVLKIVKRLLDTGISLHNIRVAVDHLRQRGVQDLANITLFSDGTTVYECTSAEEVVDLLQGGQGVFGIAVSGANRELTGVIDDFRGERADGGESIAAPEDELASRRKHRDRKIG>BL;ML2075, ML.tab 2467984:2468739 reverse MW:26531VSAPDSPALVOMSIGAVLDLLRSDFPDVTISKIRFLEAEGLVTPQRSASGYRRFTAYDCARLRFILTAQRDHYLPLKVIR(SEQ ID NO:234)AQLDAQPDGELPSFGSPYVAPRLFSVTGGPGAGVGSGVGSDIPAVSPAGVRLSREDLLDRSGVADDLLTALLKAGVITTGPGGFFDEHAIVILQCARALSEYGVEPRHLRAFRSAADRQSDLIVQIAGPIVKAGKAGARDRADDLAREVAALAITLHTSLIKSAVRGVLHR>BL;ML2111, ML.tab 2512539:2512973 reverse MW:15031MIHRLRLGWLVAFFATIVLISVWIPWLTTKVNDEGWANAIGGTNGNLELPSGFGAGQLIVLLSSTLLVAGAMMGRGLSVK(SEQ ID NO:235)VASIAALIISLLIVVIVVWYYQLNVNPPWAECGLYLGAAGAVCAMVCSVWAVIAAVAAGRSSL>BL;ML2113, ML.tab 2513870:2514415 reverse MW:19748VTSRIKRMAKLTGSIDVPLPPNEAWMCASDLARFGEWLTIHRAWRSKLPEVVEKGTVIESYVEVKGMPNRIRWTVVRYKA(SEQ ID NO:236)PEANTLNGOGVGGVKVKLIAKVSPKDDGSVVSFDVNLGGSALFGPIGMIVAVALRTDIRESLQNFVTAFSRPEPGLIRSRALVVDQHSRAVVEQRSASAGL>BL;ML2114, ML.tab 2514412:2514582 reverse MW:6182VLDKAKDILVQNADKVETVLDKAGEFVDEKAKGKYTDTIYQVAEETKKAASFDTFC(SEQ ID NO:237)>SL;ML2135, ML.tab 2537496:2538521 reverse MW:40179MARTRMVRRWRNNMEVRDDTDYVGMLTTLSEGSVRRNFNPYTDIDWESPAFAVKDNDPRWILPDTDPLGRHPWYLAQPEQ(SEQ ID NO:238)RKIEIGMWRQANVAKVGLHFESILVRGLMNYTFWMPNGSPEYRYCLHECVEECNHTMMFQEMVNRVGADVPGMPRRLRWLSPLVPLVAGPLPVAFFIGVLAGEEPIDHMQKNVLREGKSLHPIMERVMAIHVAEEARHISFAHEFLRKRLTHLTKRQLFWVSLYYPLTMRVLCNAIMVPPKAFWQEFNIPREVKKELFFRSPESRKLLRDIFADVRMLAYDTRLMETRSARLMWHICKIDGQPSRYRGEPQRQHPATTSAA>BL;ML2137, ML.tab 2539983:2540738 forward MW:27365MRELKVVGLDPDSKTILCEGDIPGERFKLPADDRLRAAVRGDTTLLDQPQLDIQVTNMLSPKEIQARIRAGASVEQVANA(SEQ ID NO:239)SGSDIARIRRFAHPVLLERSRAAELATAAHPVLADGPAVLTLLETVSAALVKRGLEPDKLSWDAWRNEDSRWTVQLMWHAGRSDNLAHFCFTPGAHGGTVTASDDAANELIDPDFNRPLRPLARVAHLDFVEPATPATVTPDNQLVSSRRGKPTIPAWEDVLLGVRSAGQP>BL;ML2141, ML.tab 2542620:2542895 forward MW:9766MLVEPDRSREPPPLPSMLLEVWPVIMVGALAWLIAVVAAFVVSSLQSWRPVALAGLVVGLFGTGIFVWQLAAARRGARGA(SEQ ID NO:240)QGGLETYLDPR>BL;ML2142, ML.tab 2543007:2543816 reverse MW:28190VTGPVEDSAVATVAEWPEELAAVLTNAADDARAAIEEFSGSVTVGDYLGVGYEDPNAATHRFLAHLPGYQGWQWAVVVAA(SEQ ID NO:241)YPGAEHATVSEVVLVPGPTALLAPEWVPWEQRVRPGDLGPGDLLAPTSEDLRLVPGYNASGDPAVDEIAAEIGLGRRWVMSVWGRSAAAERWHGGDYGPDSPMARSTKRVCRDCGFFLSLVGSLGANFGVCGNEMSADGHVVDKLYGCGAHSDTPAPAGSGSSVYEPYDDGVLDILEPSTVQLPESPAD>BL;ML2143, ML.tab 2543926:2545665 forward MW:61675VSGRRRNHPGRLASIPGLRTRTGSRNQHPGIANYPADSSDFRPAQQRRALEQREQQAGQLDPARRSPSMPSANRYLPPLG(SEQ ID NO:242)QQQSEPQHNSAPPCGPYPGERIKVTRAAAQRSREMGYRMYWMVQRAATADGADKSGLTALTWPVVTNFAVDSAMAVALANTLFFAAASGESKSKVALYLLITIAPFAVIAPLIGPALDRLQHGRRVALATSFVLRTGLATLLIMNYDGATGSYPSMVLYPCALAMMVLSKSFSVLRSAVTPRVMPPSIDLVRVNSRLTVFGLLGGTIAGGAIAAGVEFVCAHLFKLPGALFVVAAITISGALLSMRIPRWVEVTAGEIPATLSYRLHRKPLRQSWPEEVKKVSGRLRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKEHQANGWVQLGMLGLIGTAAAIGNFAGNFTSARLQLGRPAVLVVRCTIAVTVLALAASVAGNLLMTTIATLITSGSSAIAKASLDASLQNDLPEESRASGFGRSESTLQLAWVLGGALGVMVYTDLWVGFTAVSALLILGLAQTVVSFRGDSLIPGLGGNRPVMIEQESMRRAAAVSPQ>BL;ML2144, ML.tab 2545673:2546161 forward MW:18046VQRRVTVLLPAVVMLLAAAAGFGVWLLVREPDPQRPEISVYSHGHLTRVGPYLHCNMLNLDECQTPQAQGVLSADEHNPV(SEQ ID NO:243)QLSVPETISRAPWRLLRIYEDPTGTTSTLYRPNTRLAVTIPTLDPHRGRLTGIVVQLLTLVVDPAGEVHDVPHTEWLVRLTF>BL;ML21S1, ML.tab 2554745:2555269 forward MW:17666MSESYRKLTTSSIIVAKITFTGAMLDGSIALAGQASPATDSEWDQVARCESGGNWSINTGNGYLGGLQFSQGTWASHGGG(SEQ ID NO:244)EYAPSAQLATREQQIAVAERVLATQGSGAWPACGHGLSGPSLQEVLPAGMGAPWINGAPAPLAPPPPAEPAPPQPPADNFPPTPGDVPSPLARP>BL;ML2155, ML.tab 2556716:2556940 reverse MW:8095MKSVNYYVVADIAKSKAHKSRYVDNGWPTTDPDHHAVSELVTDCAGALSPFGDLVFPVPADDLPYVHPVTVVNR(SEQ ID NO:245)>BL;ML2156, ML.tab 2557038:2559299 forward MW:80254MTDNTPDIPLGSWLADLSDERLIQLLELRPDLAQPPPGSIAALAARAQARQSIKAATDELNFLQFAVIDALLVLQADSTP(SEQ ID NO:246)VPTTKLKALIGDRAPQADVVSALDNLRQRALAWGETTVRIAAAAAAALPWHPGQVTLEDISSTGEQIAELIARLSQTQRDVLQKLLEGSPLGRTRDAAPGAPPDRPVPQLLAMGLLRRIDADTVILPRRVGQVLRGEQSGPTQLTQPYPVVSVTTPNDADAEAAGAVIEALHELDVLLETLGSAPVYELRNGGLGVREFKRLAKATGINEPRLGLLLEVTAAAGLIASGIPDPEPATGDSPYWAPTVATDRFTANPPAERWHLLASTWLDLQCRPALIGSRGPDAKSYGALSNSLYSTAAPLDRRLLLGMLAELPAGVGVEAAEASAALIWRRPRWARRLQPGPVADMLAEGHTMGLVGRGAISTPGRALLDEAIASADPAIAVAAMTRALPEPIDHFLVQADLTVVVPGPLQRNLAKELGTVATVESAGTANVYRISEQSIRHALDIGKTRDWMHALFTNHSKTPVPQRLTYLINDVARRHGQLRIGMAASFVRCEDPALLTQTVAAAEELQLRALAPTVAVSPAPIAEVLVTLQSAGFAPAAEDSSGSIVDVRPRRARLPTPQHRRPYRPLQRPNLETLNAVIAVLRKVAATPFGGIRVDPTVTMSLLQRATKEKTTLVIGYLDAAGIATQRMVSPIAIRGGQLVAFDPGTGRLRDFVIHRITSVVSSDSQ>BL;ML2193, ML.tab 2608113:2609048 reverse MW:33819MVLNWRFALSADEQRLVREIISAATEFDEVSPVGEQVLRELGYDRTEHLLVTDSRPYAPIIGYLNLSSPRDAGVANAELV(SEQ ID NO:247)VHPRERRRGVGAANVRAALAKTGGRNRFWAHGTLASARATASVLGLVPVRELVQMQRSLRTIPDPMVPDQLGVWVRTYVGTVDDAELLRVNNAAFAGHPEQGGWTATQLAERRSEPWFDPAGLFLAFGDSSSNQPGKLLGFHWTKVHAAHPGLGEVYVLGVDPSAQGRGLGQMLTSIGIASLAQRLVGPSAEPTVMLYVESDNVAAARTYERLGFTTYSVDTAYALARIDD>BL;ML2195, ML.tab 2609965:2610816 forward MW:30382MVKDATCIKVRHNMRMRKVLTSVIATVVTMAVLVVGVLGIDYGTSIYAEYLLSVNVRNAANLGSDPFVAILAFPFIPQAM(SEQ ID NO:248)RDHYTELEIKANAVDHANVGKASLEATMYSIDLTYASWLIKPDAKLPVDRLESRIIIDSMHLGQYLGISDLMVEAPHRETNNATGGTTESGISSGHGLVLSGTPKSANFDHRVSVLVDLSIAPEDQATLVFTPTGIATGPDTANQPVPDDKRNPVLRAFSARMSDQRLPFGVAPTSEGARGSDVIIEGITQGVTITLDGFKQS>BL;ML2199, ML.tab 2612896:2613198 forward MW:10570MCSEPKQRLALPANVNLEKETVITGRVVDRKGQAVGGAFVRLLDSSNEFTAEVVTSATGDFRFFAAPGSWTLRALSSVGN(SEQ ID NO:249)GDAMMSPSGTGIHKVDVKIT>BL;ML2200, ML.tab 2613457:2614143 forward MW:24489VTSDEVRDGAGSPADSSKGNKCTAAGMFQAAKRSTVSAARNIPAFDDLPVPSDTANLREGANLNSTLLALLPLVGVWRGE(SEQ ID NO:250)GEGRGPNGDYHFGQQIVVSHDGGNYLNWEARSWRLNDAGEYQETSLRETGFWRFVSDPYDPTESQAIELLLAHSAGYVELFYGRPRNASSWELVTDALACSKSGVLVGGAKRLYGIVEGGDLAYVEERVDADGGLVPNLSARLYRFAG>BL;ML2204, ML.tab 2619139:2619327 forward MW:7054MGRGRAKAKQTKVARELKYSSPQTDFQRLQRELSSTGAADPGQLDGDDRVSEDSWDEDAWRR(SEQ ID NO:251)>BL;ML2207, ML.tab 2622297:2622692 reverse MW:13959MAVCDRADPAKTRQAVLALADWLKDRTLPAPDRDAVATAVRLTVRTLATLAPGASVEVRIPPYVAVQCVSGPSHTRGTPP(SEQ ID NO:252)NVVETDSRTWLLVATGLMQLVEAVATGALRMSGSRAGDIEVWMPLINLRCT>BL;ML2219A, ML.tab 2636676:2636915 reverse MW:8613VARAVVMVMLRAEILDPQGQAIAGALGRLGHTGISDVRQGKRFELEIDDTVDDSELAMIAESLLANTVIEDWTITRESQ(SEQ ID NO:253)>BL;ML2228, ML.tab 2645990:2646610 forward MW:21778MNSRFLPYATSPGRLTIQLLSDIAVVMWTTFWVLVGIAVYNTISTIADTGAQVESGAHGIADNLASASHGMQLIPVLGNA(SEQ ID NO:254)VSKPLTLTSAAALDIADAGHSLNTTASWLAVLLALAVIALPILVAVVPWLVLRFWFFRHKWTVTTLAATPAGKQLLALRALTLRSPSKLAAVSADPVGGWRREDPGTIRGLAALELQSSGITLRVH>BL;ML2253, ML.tab 2676319:2676534 reverse MW:8295MPNNIAFSRSYIGKRCYSEQEVGVFIDLAEQERTRHIEEDVEFRNRNAELRNQDGAPQPRSCADRARNCHV(SEQ ID NO:255)>BL;ML2258, ML.tab 2682528:2682830 reverse MW:11299VNRFLTSIVSWLRAGYPEGIPATDTFAVLALLARRLTNDEVKIVARELIRRGEFDKIDIGVMISHLTDELPSPQDIERVR(SEQ ID NO:256)TRLNAKGWLLDNARDNGEPT>BL;ML2259, ML.tab 2682860:2683150 reverse MW:10541LSPWFNYEATLKILLFSTLAGAALPGLFALGIRLQVNDAGDASTNNATPNRKPILVTLAWVIYALVLMVVILGVLYIVSR(SEQ ID NO:257)DFIAHHTHYPFLGIKP>BL;ML2261, ML.tab 2684489:2684905 reverse MW:15612MEILASRMLLRPTDYQRSLSFYRDQIGLAIAREYGTGTVFFAGQSLLELAGYVIQGAPDHSRGAFPGALWLQVRDIAVTQ(SEQ ID NO:258)ADLEGRGVSITREPRREPWGLHEMHVTDPDEITLIFVEVPANHPLRRDTRSERPRTPD>BL;ML2271, ML.tab 2695503:2696030 forward MW:19222VILSKSLLRILIHGRSDEPPSTRARVIMRWGRIAVLIVTGLITLQSVLLVAGAWRNDLTIQHNMGVAQAEVLSAGPRRST(SEQ ID NO:259)IEFVTPERVTYRPELGVLYLSELSTGMRIYVEYNKNDPNLVRVRHRNAGLAIIPAGSIAVVCWLAATVVLVALAVLDKRLDRHTESSAVPSQPTS>BL;ML2274, ML.tab 2698017:2698355 reverse MW:11815MKGTGLAANVAMAAAATVLAAPALADDYDAPFNAQLHSYGIYGAQDYNAWLGKIACQRLAKGVDGDVNKSATF IQRNLPL(SEQ ID NO:260)YTTEGQSLQFLGAAINHYCPNQIGILQRAGAR>BL;ML2289, ML.tab 2713408:2714178 reverse MW:26191VTADLLAPLMELPGVATASDRARDALGRAHRHRANLRGWPVTAAEAALRAARASSVLDGGPVRLDHLPGASPQAGGVSDP(SEQ ID NO:261)VFGGALRVAQALEGGAGPLVGVWQRAPLQALARLHVLAAADQVGDEWLGRPRMDAEVGLRLGLLVDVVSGRTFAAAPVVAAVVHGELLTLQPFGSADGVVARAVSRLVTIATGLDPHGLGVPEVSWMRRAAVYRDAACGFAGGTPKGVAAWLVLCCRALHAGAQEALSIAESLPSR>BL;ML2295, ML.tab 2720556:2721260 reverse MW:23836LVLRRGHWCILVALVAVLLAVVSMPAKTVFADGRLPMGGGAGIVINGDTMCTLTTIGTDSAAELIGFTSAHCGGPGAQVA(SEQ ID NO:262)AEGAENRGPMGTMIAGNDNLDYAVIKFDPAKVMPVAAYNGFVISGIGQDPAFGQIACKQGRTTGNSCGVAWGMGETSGTLVMQVCGRPGDSGAPVTVNNLLVGMIHGAFTDNLPVCITKFIPLHTPAVVMSMNAILADVNKNNRPGAGFVPQPV>BL;ML2296, ML.tab 2721464:2722009 forward MW:20041MSKGDRKNGVPSTLTTIPLVDPHAEPTEPSIGDLIKDATTQVSTLVRAEVELARAEIIRDVKKGLTGSVFFIAALVVLFY(SEQ ID NO:263)STFFFFLFLAELLDTWLWRWVALLIVFAIMVMVTAALALLGYVKIRRIRGPHQTIESVKETRTALTPGHDKAQARPRKLTGSGTNPENNPDRRTPADPSGW>BL;ML2306, ML.tab 2730213:2731358 forward MW:40804MTAPTSHRPPTLDMAAILADTSNRVVVCCGAGGVGKTTTAAAMALQAAEYGRTVVVLTIDPAKRLAQALGVNDLGNTPQR(SEQ ID NO:264)VPLAPEVPGELHAMMLDMRRTFDEMVVQYSGPERAQAILNSEFYQTVATSLAGTQEYMAMEKLGQLLSQDRWDLVVVDTPPSRNALDFLDAPKRLGNFMNSRLGRLLLTPGRGIGRLVTGANGLAMRALSTVLGSQMLADAATFVQSLDATFGGFRGKADRTYALLKQRGTQFVVVSAAEPDALREASFFVDRLSQENMPLAGLMLNRTHPTLCALPVEQAIDASKTLQDEITNSAAASLATAVLRINDRGQTAKREARLLSRFTGANPHVPVIGIPLLPFDVSDLEALRAIADRLTASH>BL;ML2307, ML.tab 2731463:2731708 reverse MW:9325LCRATDPDELFVRGAAQRKAAVICRHCPVMQECRADALDNKVEFGVWGGMTERQRRALLKQHPEVVSWADFFDTRKHRNV(SEQ ID NO:265)S>BL;ML2320, ML.tab 2747152:2747799 reverse MW:22771VSWVIRISAVVMSVGLGLGVPVASARPSEPGVVSYAVLGKGSVGNIIGRPMGWESLLTEPLQAYSVDLPMCNNWADIGLP(SEQ ID NO:266)EVYHDVDLASFNGAITQTSANDQTHFVKQAVGVFATNDAAVRAFHRVVDRTVGCSGQTTAMHLDNGTTQVWSFVGGTPTYADANWTKQEAGTDRRCFVQTRLRENVLLQTKVCQPGNAGPAVNVLAGAMQNALGQ>BL;ML2330, MLtab 2761253:2761603 reverse MW:11931MQPGGDMSALLAQAQQMQQKLLETQQQLANAQVHGQGGGGLVEVVVKGSGEVVSVAIDPKVVDPGDIETLQDLIVGAMAD(SEQ ID NO:267)ASKQVTKLAQERLGALTSAMRPTAPPPTPPTYMAGT>BL;ML2332, ML.tab 2762489:2762926 reverse MW:15496MGPVSAVSTILVNVEPVATLAAVADYQKNRPKILSPQYNEYQVVQGGQGPGTVVKWKLQVTRSRVRDVQVNVDVAGHTVI(SEQ ID NO:268)EKDANSSMVTSWTVAPAGPGSSVTMKTAWTGAGGVKGFFEKTFAPLGLKKIQAEVLANLKNELER>BL;ML2337, ML.tab 2769442:2770194 forward MW:27248MGYKVATLWHASFSIGAGVLYFYFVLPRWPELIGETTHTLGTTLRIVTGVLFGLAALPVVFTLLRSQTSELGIPQLALSI(SEQ ID NO:269)RTWSIVAHVLSGVLIVSTAIGEIWLSSDTAVQCLFGIYGAAAASAVLGFVAFYLSFVAALPPPSPKPVKLKKANQRRIRRRKGRKDNEAEDEEADAGEHNEAETPAQAEEVTSENPQPAPESGAQKEPDELLANKTEETDEPRRGLHNRRPTGKVSHQRRRARSGIAVEN>BL;ML2366, ML.tab 2834617:2834958 reverse MW:11956MAFMTSNPSGPPSQPAPAVGLTPGERAVPVTRAGALWSALFAGFLILILLLIFIAQNTTSTPFTFFGWHWSLPLGVAIML(SEQ ID NO:270)SAVGGGLLAVAVGTARILQLRRTVKKRYVAAHR>BL;ML2377, ML.tab 2846366:2846830 forward MW:16863MSKRQRGKGISIFKLLSRIWIPLVILVVLVVGGFVVYRVHSYFASEKRESYADSNLGSSKPFNPKQIVYEVFGPPGTVAD(SEQ ID NO:271)ISYFDANSDPQRIDGAQLPWSLLMTTTLAAVMGNLVAQGNTDSIGCRIIVDGVVKAERVSNEVNAYTYCLVKSA>BL;ML2380, ML.tab 2850483:2850944 reverse MW:17491MSRLSTSLCKGAVFLVFGIIPVAFPTTAVADGSTEDFPIPRRQIATTCDAEQYLAAVRDTSPIYYQRYMIDMHNKPTDIQ(SEQ ID NO:272)QAAVNRIHWFYSLSPTDRRQYSEDTATNVYYEQMATHWGNWAKIFFNNKGVVAKATEVCNQYQAGDMSVWNWP>BL;ML2388, ML.tab 2858302:2858607 forward MW:10942VAAGDNRLIGNDTPEIAVGAIGGVATGYVLWLVVMSIGYNITTVSQWSLIVLILSMVSVFCSGMCGWWLRQRRKHAWGAF(SEQ ID NO:273)TFGLPVFPVVLTLAVLAKIYL>BL;ML2390, ML.tab 2861300:2861605 reverse MW:10840VSHIFLTLIADGELQYHGPDFGKASPMGLLVIVLLLVATLLLLWSMNRQLKKIPASFDSEHPELDQAADEGTELGGYLDE(SEQ ID NO:274)EPSDTDRNGPSLPPEPGADSG>BL;ML2392, ML.tab 2862579:2863013 forward MW:15303MNQAFTPLTETRYGRSRLPGRSRHRGVIALTVLAVAASTGIAVIGYQRLGTSDVAGSLASYRVLDDETVSVTISVMRSDP(SEQ ID NO:275)SRPVDCIVRVRAKDGSETGRREVLVAPAEATTVQVVTTVKSARPPVMADIYGCGTDVPGYLRPA>BL;ML2407, ML.tab 2878350:2878685 reverse MW:12141VSKGPNSICAADHNRDHLVVNVLLYAAARLLLVILLGSVIYGVARLLGVTQLPIVVAALFALIIAMPLGIWLFSPLRRA(SEQ ID NO:276)TAALAVVGERRRSEREQLRARLRGEPLPDEE>BL;ML2410, ML.tab 2881021:2882649 reverse MW:59195LMQVLMRLLAWVRNTWRALTSMGTALVLLFLLALGAIPGALLPQRDLNVGKVDDYVAAHPVIGSWLDQLQAFNVFSSVWF(SEQ ID NO:277)TAIYTLLFVSLVGCLTPRMIEHARSLRAMPVIAPRNLARLPKHASFQVIGDDKTLAGTIAGRLRGWRTVIRTQDGVVPETVVSAEKGYLREFGNLVFHFSLLGLLVAVAVDKLFGYEGNVVVIADGGPGFCSASPAAFDAFRAGNTVDGTSLHPICIRVNDFAAHYLPSGQAMSFTADIDYQDGHDLTVNSWRPYRLEVNHPLRVAGNRVYLQGHGYAPTFTVTFPDGQTRTSTVQWRPENQQTLLSSGVVRIDPPAGSYPNMSERRQHEIAIQGLLAPTEQLDGTLLLSRFPALNAPAVAIDIYRGDTGLDSGRPQSLFTLDPRLINQGRLTKEKRVNLQAGEQVRLGQGPGAGTVVRFDGAVPFVNLQVSHDPGQAVFVFAIAMMVGLVVSLMVRRRRVWVRLTPAAGTVNVELGGLARTDNSGWGDEFERLTERLLAGLVAADRTLAAARRSSQMDVK>BL;ML2425, ML.tab 2901005:2901505 forward MW:18654MTVPLEAEGLIGKYRQLDHFQVGREKIREFAIAVKDDHPTHYNETAAFEAGYPALVAPLTFLAIACRRVQLEIFTKFNI(SEQ ID NO:278)PINVARVFHRDQKFRFYRTILAQDKLYFDTYLDSVIESHGTVIAEVRSEVTDTEGKAVVTSIVTMLCELARQDATAEETVAAIASI>BL;ML2428A, ML.tab 2904745:2904846 reverse MW:4145MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK(SEQ ID NO:279)>BL;ML2432, ML.tab 2907135:2907977 reverse MW:30442LRPVIKVGLSTASVYPLRAEAAFEYAAKLGYDGVELMVWGESVSQDIDAVKGLSRRYRVPVLSVHAPCLLISQRVWGANP(SEQ ID NO:280)VPKLERSVRAAEQLGAQTVVVHPPFRWQRRYAEGFSDQVAELEAASDVMIAVENPFPFRADRFFGADQSRERMRKRGGGPGPAISVFAPSFDPLAGNHAHYTLDLSHTATAGSDSLEMVRRMGSGLVHLHLCDGSGLPADEHLVPGRGTQPTAAVCQLLAGADFAGHVVLEVSTSSVRSATERETMLTESLQFARTYLLR>BL;ML2433, ML.tab 2908008:2909075 reverse MW:37938MTGPHNDTESPHARPISVAELLARNGTIGAPAVSRRRRRRTDSDAVTVAELTCDIPIIHDDHADEQHLAATHAHRANIGV(SEQ ID NO:281)RVVEPAAQSPLEPVCEGIVAEPPVDDHGHVPPGCWSAPEPRWPKSPPLTHLRTGLQRSACSRPLPHLGDVRHPVAPDSIAQKQSDAEGMSPDPVEPFADIPVDVMGSEVRAAELVAEESAYARYNLQMSAGALFSGHTLTNELAERRGDEHAAGGLLAVGIDLDEDHLDLHTDLAGITSPARGWQSRFEALWRGSLIVLQSILAVVFGAGLFVAFDQLWRWNSIVALVLSVLVILGLVVGVRVVRRTEDIASTLIAVVVGALITLGPLALSLQSG>BL;ML2435, ML.tab 2910195:2911028 forward MW:31007VSSCPESGSTFEYVANSHLEPVHPGEEVDLDFTREWVEFYDPDNSEQLIAADLTWLLSRWTCVFGTPACRGTVAGRPDDG(SEQ ID NO:282)CCSHGAFLSDDADRTRLDDAVKKLSHDDWQFREKGLGRKGYLELDEHDGQSQFRTRKHKNACIFLNRPGFPIGAGCALHSKALKLGVPPRTMKPDICWQLPIRHSQEWVTRPDGTEILKTTVTEYDRRSWGSGGADLHWYCTGDPASHVDSKQLWESLADELTELLGAKAYAKLAAICKRRNRLGIIAVHPATQEAK>BL;ML2442, ML.tab 2917372:2917926 reverse MW:20820MSTAWDTVWHACSVIEHALQASHLTYSEFSGVPDGLLRLVVELPGERRLKTNAILSIGEHSVNVEAFVCRKPDENHEGVY(SEQ ID NO:283)RFLLKRNRRLFCVSYTLDNVGDIYLVGRMSLASVDTDEIDRVLGQVLEAVESDFNTLLELGFRSSIQKEWDWRISRGESLNNLQAFAHLIDDEGDGDASIYARP>BL;ML2446, ML.tab 2921383:2922708 reverse MW:46863VIASTPRQNRESINRRVALTALGVGVFAPSVFVACAGSAIKPSEKKTTPAPHLTFQPATATDDVIPVAPISVQIADGWFQ(SEQ ID NO:284)RVTLTNPVGKVVAGVFNQDRTVYTITEPLGYDTTYTWNGSAVGHDGKAVPVTGKFSTVTPVKKVNGGFQLADGQTVGVAAPITIQFDAPISDKSAVEKALTVTTTPPVEGSWAWLPDEAKGARIHYRPREYYPAGTTVNVDAKLYGLPFGDGAYGLQDMSLNIQIGRRQVVKAEVSSHRIQVVTDAGVIMDFPCSYGEADQARNVTRNGIHVVTEKYSDFYMSNPAAGYSNVYERWAVRISNNGEFIHANPASVGAQGNTNVTNGCINLSTGDAEQFYRSAIYGDPVEVTGSSIQLSYSDGDIWDWAVDWDTWVGMSALLSFPTVHQPATQIPVTAPVTPPGAPILSGTPTSGSGTARPGG>BL;ML2450, ML.tab 2925762:2926499 forward MW:26174LLRDPLAIVLILIIVVALVISGLIGAELFARHTANSKVARVVTCEIKDQATAKFGVTPLLLWQFATQHFTNISVETAGNQ(SEQ ID NO:285)IRDAKGMKIAIDIQNVQIRDTPTSRGTIGVLDAIITWSSDGIRQSVQNSIPVLGGVVTTSVTTHPTNGTIELKGMLNDIVAKPVVSNGGLQLQIVSFNTLGFSLPKETVQFTLDDFTTNLTKNYPLGIHADNVEVTSTGVTSHFSARNTNIPNSTGGQDPCFANL>BL;ML2452, ML.tab 2927236:2927607 reverse MW:13492MSSPLSPLYVLPFVDHTKWTRWRSLISLQAYSNLFGRTSAMQPDVAAGDEAWGDVLTLSPDADTADMHAQFICHGQFAEF(SEQ ID NO:286)VQPSNTSSNLEPWRPVVDDSEIFAAGCHPGISEGIQQADEGPR>BL;ML2453, ML.tab 2927710:2927997 reverse MW:10004MSHLVGTVMLVLQLAVLVTAVYAFVHAALQRPDAYTAAEKLTKPVWLVILGAAVSLTSILGFVFGVLGIVIAACAAGVYL(SEQ ID NO:287)VDVRPKLLDIQGKSR>BL;ML2454, ML.tab 2928117:2928683 reverse MW:20421MAENPNVDDLRAPLLAALGAADLALTTVNELVGNMRERAEETRIDTRSRVEESRARVAKLQEVLPEHLSELREKFTADEL(SEQ ID NO:288)RKAAEGYLEAATNRYNELVERGEAALERLRSRPVFEDASARAEGYVDQAVELTQEALGTVASQTRAVGGRAAKLVGIELPKKAAAPARKAPAKKAPAKKAPAKKVTQK>BL;ML2463, ML.tab 2936751:2937545 reverse MW:30332VSLDKIMMPVPEGHPDVFDREWPLRVGDIDRTGRLRLDAGVRHIQDIGQDQLREMGFEETHPLWIVRRTMVDLIRPVEFQ(SEQ ID NO:289)EMLRLRRWCSGTSNRWCEMRVRIDGRKGGLIESEAFWININRETQMPARIADDFLAGLHRTTSVDRLRWRGYLQPGSRDDASEIHEFPVRVTDIDLFDHMNNSVYWSVIEDYLVSHSELLKGPLRTTIEHEAPVALGDKLEIVLHVHPAGSTDQFGPGLVDRSVITLTYTVGDETKAIAAIFAL>BL;ML2465, ML.tab 2939177:2939743 forward MW:21174MSGGTGTGPVGRIPPGSLRQLGPINWVIAKLAASLLRTSEMHLFTILGQRQLLFWAWLIYGGRLLRGKLPRVDTELVILR(SEQ ID NO:290)VAHLRTCEYELQHHRRMARKRGLDTKIQAMIPAWPDVPTGAGLSVRQQALLAATDEFVKDRKITSSSWQQLETHLDRRRLIEFCMLISQYDGLAATISSLDIPLDNSC>BL;ML2473, ML.tab 2948272:2948751 forward MW:17160MPDGFGVAVVREEGQWRCSAMASKSLTSLTAAETELRELRSVGAVFGLLDVDDEFFIILRPAPSGTRLLLSDATAALDYD(SEQ ID NO:291)IAAEILDSLDAEIDPEDLEDAYPFEEGDLGLLSDVGLPEATLGVILDQTDLYADEQLGHIAREMGFAEQLSAVINRLGR>BL;ML2489, ML.tab 2962275:2963135 reverse MW:32001MVPLWFTLSALCFVGAVVLLYVDIDRRRGRSRRRKSWARSHGFDYERESTEILQRWKRGVMSTVGDISAHNVVLGQIRGE(SEQ ID NO:292)AVYIFDLEEVATVIALHRKVGTNIVVDLRLKGLKEPRESDIWLLGAIGPRMVYSTNLDAARRACDRRMVTFAHTAPDCAEVMWNEQNWTLVSMPIGSTRVQWDEGLRTVRQFNDLLRVLPPLPADTSQEAGASARNAAPSRPLASVGRAELPPDRGVESDVAGLLGSGVQAGRSAEPISRDEGRWDGIRRPPSVERNGHQTTNYQY>BL;ML2491, ML.tab 2966697:2967698 forward MW:36545VTGPNSPNKTLQRFGISGTDLGIPWDNGDPTNHQVLMAFGDSFGYCSVKGQQRRYNILFRSSNQDLSHGIRIADGVPNDK(SEQ ID NO:293)YSGSPVWTTGLAKQVVNTIHRAPHETGIIPTTAISIGKTQYMNYMSIKKWGRDGEWTTNYSAIARSIDNGQSWGTYPGTIRTASPDAIPGTHFVPGNQNFQMGAFMRGNDGYLYSFGTPSGRSGAAYLARVPQNLVPDLSKYQYWNGNWVPNNPGAATPLFSGPVGEMSAQYNDYLKQYIILYSNGDSNDVVARTAPAPQGPWSPEQPLVSSFQMPGGIYAPMIHPWSSGRDLYFNLSLWSAYDVVLMHTVLP>BL;ML2518, ML.tab 2999816:3000208 reverse MW:14353VGEYSAFGFDPDDFDRLIKEGSEGLRDAFERISRFVGGPGVRTAWSAIFEDLSRRARPAQETADEAGDGVWAIYTVTGDG(SEQ ID NO:294)AARVEQVYATELDALRANKNNVDPKRKVRFLPYGIAVSVLDSHQESTQQL>BL;ML2522, ML.tab 3004590:3005246 reverse MW:22735MCRLIALLSAVVCAAWATLILAPIGAAAGAAWFANKVGNATQVVSVVSTGGSNAKMDIYQRTGTGWQPLKTGIPTYVGSA(SEQ ID NO:295)GLVAQAKSGYPATPMGVYSLDSAFGTAPNPGSQLPYTQVGPNHWWSGDDHSPTFNTMQVCQKSHCRFNTAESENLQIPSYKHAVVMGVNKAKVPGSGSAFFLHTTGGGPTEGCVAIDDVTLVQILRWLRPGAVIAITK>BL;ML2527, ML.tab 3009256:3010275 reverse MW:36773VSAHRSRPAAMWPGSSRITLALLAVMPALMAYPWWFTRSYWLLGIVALVVVVLFGWWRGLYLTTILRRRLAMMWRRGRPV(SEQ ID NO:296)SASGSATRTTVLLRLGPPVGGSDVIPLPLITRYLNCYGIRADSIRITSRNNESDGALCETWIGLTVSAAKNLAALQARSSRIPLQETAQVAARRLADHLREIGWEVSLAVPDGIPRLITAAGDETWRGMQQGSDYLTAYRVNVDDELPGTLDAVRLYPARETWTALEIACPDSSSNRNTIAAACAFLTDTAPQGAAPLAGLTPQHGNHRPALAVLDLFSAQRLDGHTDTDADLLTRLRWPAPAAGVSSCTATRSPAVSA>BL;ML2529, ML.tab 3011710:3013167 reverse MW:49866MVYVFAEEFCEGPVTSGAVMPIVRVAILALSRLIEMALPTELPLREILPAVKRLVVPAASDNDSPLAANASLHLSLAPIG(SEQ ID NO:297)GAPFSLDASLDTVGVVDGDLLALQPVPVGPAAPGIVEDIADAAMIFSTARLQSWGPTHIRRGALAATTAVTFAATGLGVTYRAVTGALTGLLVVIVIAVLIALGGLVLRSRAARTGLVLSIAALVPIGAVFALAVPGIFGPAQVLLASAGVTAWSLIALIVPGPERVRIVAFFTATVVIGVAVMLEAGAALLWQLTPLTIGCGLILAALLVTVEAAQLSALWARFPLPVIPAPGDPTPSAPSFQVLEDLPRRVRISSAHQSGFIAAATLLSMLGSVAIALRPEAVSSVGWYLVAATAVAATLRARVWDSVACKAWLLAQPYLVASVLLGLYTATGRYVAASAALLVLVVVVLAWAVVALNPRIASSDSYSLPLRRLLGMVASGLDASLIPAMAYLVSLFSWVLNR>BL;ML2530, ML.tab 3013309:3014178 reverse MW:31518MKVDPNAVELTVDHAWFIAEVIGAGSFPWVLAITTPYRDAGERSAFVERQVDELTRMGLLVAENSVDPTVADWIRVVCFP(SEQ ID NO:298)DRWLDLRYLRSTSAVGDSELLRGMVAQRAGVSDKTVVALRSAQLITFTAMDIDDPLRLVPILGAGLAQRPPARFDEFSMPMRVGVRADERLRSGTSLAEVADYLGIPKSAQPVVESVFSGPRSYVEIVAGCRRDGKHATTEVGMSIVDTTTGRVLVNPSRAFDGEWVSTFSPGTPFAIAVGIEQLTATLPEGQWFPGQRLCRDFSGQTS>BL;ML2531, ML.tab 3014189:3014479 reverse MW:10392MTQIMYNYPAMLDHAGNMSACAGALQGVGIDIAAEQAALQACWGGDTGISYQAWQVQWNQATEEMVRAYHAMANTHQNNT(SEQ ID NO:299)LAMLTRDQAEAAKWGG>BL;ML2532, ML.tab 3014518:3014814 reverse MW:10191VSLLDVHIPQLVASESAFAAKAALMRSQINQAECEAISAQAFHQGESSAAFQSAHAQFVTAAEKINALLDIAQQHLGEAA(SEQ ID NO:300)ETYVATDATAASTYTTGL>BL;ML2534, ML.tab 3016103:3016411 reverse MW:10243MTLRVVPEKLAATSEAMKALTARLEAAHAAAFPCLVAVVPPAADPVSLQTAAGFSARGQEHALVAAQGVEELGRAGIGVG(SEQ ID NO:301)QSSTHYAISDALAASTYGIVES>BL;ML2536, ML.tab 3020432:3022090 reverse MW:57975MTSNELPGEWSGERRSFFSRTPVNDNPDKVVYRRGFVTRHQVTGWRFVMRRIASGIALHDTRMLVDPLRTQSRSVLVGAL(SEQ ID NO:302)LVITGLIGCFVFSFIRPNGAAGNNAVLADRSTAALYVRVGDELHPVLNLTSARLIVGRSVNPITVKSSELDRFPRGNLIGIPGAPERMVQNTTHDANWTVCDVVSGEGGHAAHSMGVTVIAGPPDSHGMRAAVLGSAHGVLVDAPSERGGsTWLLWDGKRSEIDLADHAVTDALGFGVGFAEVPAPRPIGAOLFNAIPEAPPLKAPVIPNAGATPSFGVRAPIGAVVVSFGLAALGKNPYDSVRYYAVLPDGLQPISPVLAAILRNINSYGLQQPPRLGADEVDKLPVSRMLDTERYPEQQISLIDAGYAPVSCAYWSKPAGAATSFLSLMSGAALPVPDAARAVELVSAPSRGDSSTASRVVLTPGTGYFAQTVGVGSAAPATASLFWVSDTGVRYGIDTEADARSEATAGPGKIVEALGLKLPAVPIPWSILSLFAVGPTLSRADALLEHDGLAPDTRAGRTTTAYGEHR>BL;ML2557, ML.tab 3049294:3049590 forward MW:10997LYATTELAELHDLIGRMRRSVASFKARYGDSPNRRIAIDADRILSDIELLDADISELDLARATVQQSNEKIAIPDTQYD(SEQ ID NO:303)SDFWRDVDDEGVGGHSRS>BL;ML2566, ML.tab 3061369:3062271 forward MW:32889LTFGFVLRRLRRHFSVKENTVNQPSGLKNILRAIVGALPLLPRTDQLPSRTVTIEELPIDHTNVSAYASVTGLRYGNHVP(SEQ ID NO:304)LTYPFALTFPANMSLVTGFDFPFAANGSVHTENHITQYRPIAVTDVVGVQVHAENLREHRKGLLVDLVTDVSVGNDTAWHQVTTFLHLQRTSLSDEPKPPSQKQPKLLPPSAVLQITPRQIRRYAAVGGDHNPIHTNPIVAKLFGFPTTIAHGMFSAAAVLANIEARLPDAVHYSARFVKPVVLPATTGLYVDESAGNWDLTLRNIAKGYPHLAGTVQGV>BL;ML2569A, ML.tab 3065268:3065441 forward MW:5896MSRIVAPAAASVVVGLLLGAATIFGMTLMVQQDTKPPLPGGDPQSSVLNRVEYGNRT>BL;ML2570, ML.tab 3065557:3069774 forward MW:147681VAAMSRWWLVLVGVVAVALTFAQSPGQISPDTKLDLTTNPLRFLARATNLWNSDLPFGQVQNQAYGYLFPHGTFFLIGQL(SEQ ID NO:305)LGSPGWITQRLWWALLLTAGFWGLLRVAETLSIGSPTSRAIGAVAFALSPRVLTTLGSISSETLPMMLSPWVLLPTILALQGAPGRSVRTRAAQAGLAVALMGAVNAIATLAGCLPAVIWWACHRPNRLWWRYTGWWLLALCLATLWWVVALVLLHGVSPPFLDFIESSGVTTQWSSLVEMLRGTDSWTPFVAQTATAGTPLVTESVAILGTCLVAAAGLAGLASTGMPARGRLVTMLVIGVVLLSAGYSGGLGSPLAQAVQAFLDSSGAALRNVHKLESVIRTPFALGIAGLLGRIPLPGSAPVLVWLSSFAHPERDKRVAATVAVLTALLVSTSSAWTGRLTPPGTFSAIPQYWNDTSDWLSEHNTGIPTPGRVLVVPGAPFATQVWGTSHDEPLQVLGNSPWGVRDSIPLTPPQAIRALDSVQRLFASGRPSVGLADTLARQGISYVVLRNDLDPDTSRSARPILVHRAIAGSPQLEKVAQFGAPVGTNMLKGFVADSELRPWYPAVEIYRVAVSDGTNPGKPYFADTDQLPRIDGGPEVLLRLDERRRLLGQPALGPALMTADAQFAGLPLPSRAEVTITDTPVARETDYGRVDQNSSAIRAVNDARHTFNRVPDYPVPGAEMVFGGWSGGRITASSSSSDATSMPDVAPATSPAAAIDGDPATSWVSNALQPAVGQWLQVDFDHPVNNAVITVTPSATAVGAQVRRIEIETVNGTTNLRVDEAGKPLAVALPYGETPWVRITAAATDDGSSGVQFGITDLTITQYDASGFAHPVNLRHTALVPGSPPGWAVAGWDLGSELLGRPGCAPAPDNVRCAASMTLAPEEPVNFSRTLTVPYPISVTAMLWVRPRQGPKLADLIAEPKTTRAYGEADTVDILGSAYAATDGNPATSWTAPQRVVQHKTPPTLTLVLPRPTEVNGLRLAPSRSALPARPTLVAVNLGNGPQVRELQAGEPQALSLKPRITDTVTISLLDWHDVIDRNALGFDQLKPPGLAEVTVLGTDGNPTAPANASENRIREVTVDCDHGPIIAVAGRFVHTSIRTTAAALLDGEPVAAVPCERAPIVLPAGQQELLISPGAAFIVDGAQLSTQDGTELPSARTISADTGKWGPSRREVRAPGSATSQVLVMPDSINPGWVAHTSTGVRLMPVAVNGWQQGWLVPAGNPGTITLTFTANSLYRPGLAAGLALLPLLALLALWGRRNERAADAAAQPWTPGAWSAVAVLSAGAVIAGAAGVVVVGAALSLRYALRHQQRWRNGLTVGLSAGGLVLAGAALSRQPWRSVDGYSGHSANVQLLALISLAALAASVVSPRCGSTGVAT>BL;ML2581, ML.tab 3081538:3082821 forward MW:46681VNRAVILRFTACGIIGLGAAFLIAALLLATYTSSRITKIPLDIDATLVSEGNGTALDSSSLSSEHIIVNQNVPLVSQQQI(SEQ ID NO:306)TVESPANVDVVTFQVGVSIRRTDKQKDTGLLLAVVDTVTLNRKTANAVSDDTHTGGSIQKPRGFTDENPPTAIPLRHDGLSYRFPFHTEKKTYPYFDPVAQKTFDVNYQNQEDINGLTTYRFTQNVGYDADGKLVAPITYPSLYASDEDGKITTTAJUWGLSGDPSEQITMTRYYAAQRTFWVDPVSGTIVKETEHVNIYFARDTLKPEVTLADYKVTSTEETIESQVNSARDERDRLALWSRVLPITFTAVGLITLLSGGFLASFSLRTESALTESGLDRANRDAFGHCRTEEPVPGAEAETEKLPTQRPELRDSSILSVSAHRRRSSESSPPNSGPADPGHPERG>BL;ML2582, ML.tab 3082923:3084725 forward MW:61888VSRELYHRSMSWSRPSYALGMALLVVGPLMRPGYLLLRDAVSTPRSYLSDAALGLTSAPRSTPQDFAVAMASHLVDGGIV(SEQ ID NO:307)VKSALVLGLWLAGWGAARLVVTALPSAGVAGQFVASTLAVWNPYVAERLLQGHWSLLVGYGCLPWVAEAMLMLRSSDNASRPGLLGFFALACWIALAGLTPTGLMLAATVALICVAVPVEGPGEPRPRWLCAAATLGSALGAALPWLTASAVGTSLTAHTVANTLGVTVFAPRAEPGLGTLASLASLGGIWNGEAVPTSRTTLFAVLSATVLLGVVVAGLPVAVRRPAVVPLLVLAAVAVATPAALATGPGIDMLKAVVNAVPGLGVLRDGQKWVALAVPGYSLAGAGAVVTLGRWLRPSRPLSPVVTALACCLALILALPDLAWGVWGKVQPVHYPSGWAAVAATINDRGEGPGWVAVLPAGTMRRFSWSGTAPVLDPLPRWVRDDVLTTGDLIISGVMVAGEGNHARAAQDLLLSGPNPSALTAAGVAWLVVESDTAGDMGASARTLAALQPTYRDDAIGLYRIGGSNAKTAPSPYRGLLIAAHLTWLVILVMAVVGMQITRHTHFRDVSSVALLSRR>BL;ML2595, ML.tab 3100808:3101356 forward MW:19619LPESQVAADDSGVTLNRSRLGRGWLTGIAVALLLAGCGIGTGGYCMLRYHQDSQAMARNDNAALKTALDCVAATQAPDTN(SEQ ID NO:308)TMAASEQKIIDCGTDAFHAQALLYTNMLVQAYQAANVHVQVSDMRAAVERHNNDGSIDVLVALRVKLSNDRAHNQETGYRLRVKNALAEGQYKISKLDQVTK>BL;ML2596, ML.tab 3101353:3102330 forward MW:35970VTVVVAKSQTAAVIPEPLSNRLAPWHLRLVALAVDVLPGLVAVSTMTLVVFTVPLRSAWWWLCMAVGGIVILSMLVNRLL(SEQ ID NO:309)LPTIIGWSLGRALCGISVIMRDGVAIGPWRLLLRDLTNLLDTAAVFAGWLWPLWDSRRRTFADILLRTEVRCVQAVERQRTIRWWASVALLTAAGVSLGGASVSWAVVYSHDRAIDQTRSEIAIQGPKMVAQMLTYNPKSLRDDFTHAQSLASDKYRRQLAAQQDVVKKGHPVINEYWPTAGAIQSATRDRATMLLFMQGRRGAAPGERYISATVRVSFAKGEHNHWLVDDLTVLTKPKTTGNGR>BL;ML2597, ML.tab 3102327:3102881 forward MW:20340MSPRRKFQAGEGLLLVSHTVASQRRWGLPLAATFAALVMVAAITASTLMSISHASRELAVAKDQQVLSYVKWFMTQFTTL(SEQ ID NO:310)DPYHANDYVARILAQATGDFAKQYNEKVNEILLQVAQAEPATGTVLDAGVERWNDDGSANVLVATEVTSKYPDEKQVLENTNRWTATATRECNQWKISNLLQVI>BL;ML2598, ML.tab 3102971:3103525 forward MW:19619VAAAEGGGSWRNRRAGQRTAIAVVVAAVLFVGSAAFAGAAVQPYLADRATVAVKLEVARTAANAITVLWTYTPENMDTLA(SEQ ID NO:311)DRAATYLSGDFGAQYRKFVDAIVGPNKQAKITNSTEVTGVAVESLDASNAIAIVYTNTTSTSPLTKNIPALKYLSYRLFMKRSAVRWLVTRMTTITSLDLTPQL>BL;ML2604, ML.tab 3108446:3109195 forward MW:26859MTDNKMLARIAALLRQAEGTDNAHEADAFMATAQRLATAASIDLAVARSHVANRSTAQAPTQRTITIGTAGTRGLRTYVQ(SEQ ID NO:312)LFVLIAAANDVRCDVASNSTFLYAYGFAEDIDATHALYASLVVQMVRESDAYLASGAYRPTPTITARLNFQLGFGMRVGQRLTEARDHIRSAVTEAWDRPTATAIALRDKEIELIDYYRSASKARGTWQAARASAGYSSAARNAGDQAGRRAWIDNSTELPGARAALGR>BL;ML2605, ML.tab 3109192:3109698 forward MW:18602MNLLDSVRDAQRSKVYAAEEFIRTLFDRAVEHGSPAVEFFGAQLSLPPEARFGSVAAVQRYVDDVLALQAVRQRWPRMLP(SEQ ID NO:313)LTVRARRAATAAHYENLDGAGVIAVPGNNADWAMRELVVLHEVAHHLCKDPPPHGPEFVATICALTELVMGPELGYVFRVVYAKEGVR>BL;ML2614, ML.tab 3120828:3121502 forward MW:24144VNWPKTPGTLAAMPDEEQTELPVHKEFAGIADYSDPGLSDGSVFSQYGIASTVLAVLSAAAVVFGVVIWRAHHDNSAERA(SEQ ID NO:314)YLTHVMQTAFDWTGVLINMNTSNVDASLQRLHAGTVGELNTDFDAAVQPYRKVVEKLQTQSRGQIEAVAIESVHHDLDTQPGVAHPVVTTKLLPPAARTDSVMLVATSVSENVGGKPTTVHWSLRLDVSDVDGKLMISHLESIR>BL;ML2615, ML.tab 3121499:3122188 forward MW:24416MRNRWRLLAFDVVAPLVAIAALVMIGVVLDWPRWWVSACSVLVLLIVEGVGVNFWLLRRDSVTIGTDDDAPGLRLAXIXISV(SEQ ID NO:315)CTAALCAAVLIGYMHWTSPDRDFSLDSREVVQIATGMAEAFVIASFTPSAPTSSIDRAAAMIMPDQAGVFKEQYRKSSADLARRNVTAQASVLAAGVEAIGSSAASVAVILRVTQNTPGQPPSQAAPAVRVTLIKRGSDWLVTDVLSINAR>BL;ML2616, ML.tab 3122267:3122779 reverse MW:18682VTLADDHRRPAAPPEQPAADQGRYDPDQPVEFWSTAAIRSALHAGSIEIWKLITAAVKHDPYGRTAHQVEEVLEGTRPYG(SEQ ID NO:316)ICKALGEVLQRARTHLEINERAEVARHVRLLIDRSGLGHQEFASRIGVAPEGLASYLDGSTSPSAALMVRMRRVSDRFVKVKAARSANSD>BL;ML2621, ML.tab 3131780:3132367 reverse MW:21956MLIWDAPNLDMGLDAIVDHHHRNALERPCFDALGRWLFTCNTEVAVGYPDSTIGLKGTMFTNIAQASADVVRLWVDTLRN(SEQ ID NO:317)VEFVIFVKPKIDEDSHMLGRIKGRYNEGLAVQVVVSAYSQALRQTLERTAHAVIDVQMIGFREHTSWALASAILEFADLEDIAGVFRESLPRISLDSLPAQGEWCAPFPGRWLRY>BL;ML2627, ML.tab 3140261:3141280 forward MW:36081VTANGHAGRREGGPYFDDLSIGQVFDWAPAVTLTSGMAAVHQAILGDRMRLALDAELSTTVIGTHAMLAHPGLVDDVAIG(SEQ ID NO:318)QSTLVTQRVKANLFYRGLTFHRFPVIGDTLYTSTEVVGLQANSAKPGRPPTGMAALRITTTDQHDRLVLDFYRCAMLPASAAWHPHDALNRNDDLANVGADAVASASDPTGQWDAAAFRERVPGPHFDAGITGAVLRSTGDIVSSAPDLARLTLNIASTHHDSRVRGLRLVYGGHTIGLALAQAGRMLPNLATVLNWRSCDHTGPVHEGDTLYSELHVESAEATEQGGVLGLRSLVYAVSAAGGSDHLVLDWRFTVLQF>BL;ML2629, ML.tab 3144572:3145042 reverse MW:17022LGSVPGYASPMPVMSKTVEVRATAASIMAIVTDFEAYPQWNDGVKGVWVLARYDDGRPSQLRLDTEIQGTKCTYIQAVYY(SEQ ID NO:319)PATNQIQTIMQQGDLFTKQEQLFSAVEIGAASLLTVDIDVESSMPVPAPMVKALLNNVLDNLAENLKLRAEQLAAN>BL;ML2630, ML.tab 3145226:3145597 reverse MW:12982LLTDGVLLPELLFGYLNKCCLLPQLFDTAINTSVGVTSPNESRAFNAADDLIGDGSVERAGLHRATSVPGESPEGLQRGH(SEQ ID NO:320)SPEPNDSPPWQRGSAQASQSGYRPSDPLTTTRQSNPAPGANVR>BL;ML2640, ML.tab 3159055:3159987 reverse MW:34454MRTHDDTWDIKTSVGTTAVMVAAARAAETDRPDALIRDPYAKLLVTNTGAGALWEAMLDPSMVAKVEAIDAEAAAMVEHM(SEQ ID NO:321)RSYQAVRTNFFDTYFNNAVIDGIRQFVILASGLDSRAYRLDWPTGTTVYEIDQPKVLAYKSTTLAEHGVTPTADRREVPIDLRQDWPPALRSAGFDPSARTAWLAEGLLMYLPATAQDGLFTEIGGLSAVGSRIAVETSPLHGDEWREQMQLRFRRVSDALGFEQAVDVQELIYHDENRAVVADWLNRHGWRATAQSAPDEMRRVGRWGDGVPMADDKDAFAEFVTAHRL>BL;ML2664, ML.tab 3190775:3191530 forward MW:26919MYQAVRYLLVMAAIILMAVAESGSPSVAAIPALKPTPEVASVLPTNGAVVGVAHLVVVTFTAPVTDRSAAERSIRITSPN(SEQ ID NO:322)NMTGHFEWLDGDVVQWIPTKYWPAYTHVSVEVQALTTGFETGDALLGVASLSTHTFTVSRNGEVLRTMPASMGKPTRPTPIGKFTALSKERTVVMDSRTIGIPLNSPEGYLITAQYAVRVTWSGVYVHSAPWSVNSQGYTNVSHGCINLSPDDATWYFNTVNVGDPIEVVA>BL;ML2687, ML.tab 3234968:3236662 reverse MW:61392VINAQTHSTTISPRPLAADRQSADNRDCPSRTDYLGAALADAIGGPVGCHALIGRSWLMTPLRVMFLIGLVFLALGWSTK(SEQ ID NO:323)AACLQTTGTGPGGQRVPNWDNQRAYYELCYSDIVPLYGTELLSQGKFPYKSSWIETDSSGTPRTRYDGRLAVRYMEYPVLTGIYQYVSMAVAKSYTALSEPVSLPAVAEVVMFFDVVAFGLALAWLATIWATAGLAGLRIWDAALVAASPLVIFQVFTNFDALAIAFATGGLLAWSRCRPISAGVLIGLGAAAKLYPLLFLVPLFVLGVRTGRLGGVACAAVTAATTWLLVNLPVLLLFPRGWSEFFRFNTRRGDDMDSLYNVVKSLTGWRGFDTKLGFCELPLVLNTVVTVLFALCCAAVAYIALTAAQRPRVVQLAFLLVAVFLLTNKVWSPQFSLWLVPLAVLALPHRRVLLAWMTIDALVWVPRMYYLYGNPSRSLPEQWFTATVLLRDIAVVALCALVIRQIYRPDEDPVRLGGRVDDPAGGPFDRAPYAPPSWLPDWLHPAGMRRVVTLAASSVTETELAAAATPSGPMHHPHAPSSI>BL;ML2689, ML.tab 3239183:3239599 reverse MW:15278VVNYSLRRRFLLAEVYSGRTGVSEVCDANPYLLRAAKFHGKPSQVMCPICRKEQLTLVSWVFGNQLGAISGSARTAEELV(SEQ ID NO:324)LLATRYEEFAVYVVEVCRTCSWNYLVRSYVLGAARSAPPPRGTPVTRTACNGARMAIE>BL;ML2699, ML.tab 3250667:3253060 forward MW:84667VTASRLRLAGSLSIALVVDIVASFAVLLVAPTATPHAAADEPRATSFVRVRIDKVTPDVVTTSSEPVVTVSGVVTNIGDR(SEQ ID NO:325)PVRDLMVRLEHESAVISSAVLRTYLDDGADQFQTAADFVTVAEELQRGQEAGFTLVAPIRSTTKPSMAIDQPGIYPVLVNVNGTPDYGTPARLDNARFLLPVAGVPPAKSDAMDSAVAPDITKPVWITMLWPLADRPRLSPGAPGGTIPVRLVDDDLASSLAPGGRLDILLTAAETATGRDVDPDGAVSRALCLAVDPDLLVTVNAMTGGYIVSNSPDGPAQQPGTPTHPGTGQDAAVIWLNRLRALAHRMCVASLPYAQADLDALQRINDTELSTTATTSVGDIVDHILDVTSIRGVTMLPDSPLTNRVVDLLNDNNSTVAIAAAAFSAQDSTSGSLVDIDTEPRRLSPRVVVAPFDPAVGAALAAAGTDPIVPTYLDSSLNIRIVHDSDTARRQDALSSILWRALERDAAPRSQILVPPTSWHLQADDARVMLTTLSTVIRSGLAVARPLPTVIADALARTKLSDTVGSYTSARGRFNDDIIADIASQVGRLWGLTSALTADGRTGLTGVQYTAPLREDMLRALSQLEPPATRNGLAQQRLAVVSKTIKDLIGAVTIVNPGGSYTLATEHSPLPLALHNGLAVPIRVRLQVDAPPGMTVTDVSQIELPPGYLPLRVPIEVNFTQRVAVDVALQTPEGIQLGEPVRLLVHSNAYGKVLFEITLTAATILIVLAGRRLWHRFRIQTEGADSNRPDPLIVDAHPQHQYDDWVDEENRI>BL;PE, ML.tab 654129:654437 forward MW:10295MTLGVIPEGLEGASAVIEALTAHLATVHAEAAPFIMEVIPPGSGSVSVQNQVGFNVHGCQYVAMTAHGAEELGRWGVGVA(SEQ ID NO:326)ESGVSYALRDAFAVASYLGGGL>BL;desA2, ML.tab 2339270:2340097 reverse MW:31139MAQKPVPNALILQLEPVVKDNMARHFANEELWFAHDYVPFDRGENFAFLGGRDWDPSQATLAKAVTDACEILLILKDNLA(SEQ ID NO:327)GYHRELVEHFILEGWWGRWLGRWTAEEHLHAIALREYLVVTREVDPVANEQVRVEHVMKGYRVNSYTQIETLVYMAFLERSYAFFCGSLAAQIKEPALFGLINQIVKDEVRHEEFFANLVAHCLECNRDETVAAIAARAAGLDVLGADIDAYHDKVENISAAGIFGSVELRQVISDRITAWGLINEPQLAQFVTS>BL;embA, ML.tab 139821:143156 reverse MW:118457VPHDGHEPPQRIIRLIAVGAGITGLLLCAVVPLLPVKQTTATIRWPQSATRDGWVTQITAPLVSGTPRALDISIPCSAMA(SEQ ID NO:328)TLPDSVGLVVSTLPSGGVDTGKSGLFVRANKNAVVVAFRDSVAAVAPRPAVAAGNCSVLHIWANTRGAGANFVGIPGAAGILTAEKKPQVGGIFTDLKVPVQPGLSAHIDIDTRFITAPTAIKKIAVGVGAAAVLIAILALSALDRRNRNGHRLINWRVSMAWLAQWRVILATPPRAGGASRIADGGVLATLLLWHIIGATSSDDGYNLTVARVSSEAGYLANYYRYFGATEAPFDWYFTVLAKLASVSTAGVWMRIPATLAGIACWLIINHWVLRRLGPGTGGLSTNRVAVLTAGAMFLAAWLPFNNGLRPEPLIALGVLFTWVLVERAIALRRLASAATAAVVAILTATLAPQGLIAIAALLTGARAITQTIRRRRTTDGLLAPLLVLAASLSLITLVVFHSQTLATVGESARIKYKVGPTIACYQDFLRYYFLTVESNADGSMTRRFPVLVLLLCMFGVLVVLLRRSRVPGLASGPTWRLIGTTATSLLLLTFTPTKWAIQFGALAGLTGTFGAIAAFAFARISLHTRRNLTVYITALLFVLAWATAGINGWFGVSNYGVPWFDIQPVIAGHPVTSIFLTLSILTGLLAGGQHFRLDYAKHTEVKDTRRNRFLATTPLVVVATTMVLCEVGSLAKGAVARYPLYTTAKANLAALRSGLAPSVCAMADDVLTEPDPNAGMLQPVPGQIFGPTGPLGGMNPIGFKPEGVNDDLKSDPVVSKPGLVNSDASPNKPNVTFSDSAGTAGGKGPVGVNGSHVALPFGLDPDRTPVMGSYGENTLAASATSAWYQLPLHWKESIADRPLVVVSAAGAIWSYKEDGNFIYGQSLKLQWGVTRPDGIIQPLAQVMPIDIGPQPAWRNLRFPLTWAPPEANVARVVAYDPNLSPDQWLAFTPPRVPVLQTLQQLLGSQTPVLMDIATAANFPCQRPFSEHLGIAELPQYRILPDHKQTAASSNLWQSSEAGGPFLFLQALLRTSTISTYLRDDWYRDWGSVEQYYRLVPADQAPEAVVKQGMITVPGWIRRGPIRALP>BL;embB, ML.tab 136573:139824 reverse MW:117159MSVIYRAHRVAIANRTASRNVRVARWVAAIAGLIGFVSSVVTPLLPVVQTTATLNWPQNGQLNSVTAPLISLTPVDITAT(SEQ ID NO:329)VPCAVVAALPPSGGVVLGTAPKQGKDANLNALFIDVNSQRVDVTDRNVVILSVPRNQVAGDAGAPGCSSIEVTSTHAGTFATFVGVTDSAGNPLRGGFPDPNLRPQIVGVFTDLTGGAP5GLRLSATIDTRFSSTPTTLKRFAMMLAIITTVGALVALWRLDQLDGRRMRRLIPARWSMFTLVDVAVIFGFLLWHVIGANSSDDGYQMQMARTADHSGYMANYFRWFGSPEDPFGWYYNLLALMIHVSDASMWIRLPDLICGVACWLLLSREVLPRLGPAIVGFKPALWAAGLVLLAAWMPFNNGLRPEGQIALGALITYVLIERAITYGRMTPVALATLTAAFTIGIQPTGLIAVAALLAGGRPMLYILVRRHRAVGAWPLVAPLLAAGTVVLTVVFAEQTLSTVLEATKVRTAIGPAQAWYTENLRYYYLILPTVDGSLSRRFGFLITALCLFTAVLITLRRKQIPGVARGPAWRLIGTILGTMFFLTFAPTKWVHHFGLFAALGAAVAALTTVLVSHEVLRWSRNRMAFLAALLFVMTLCFATTNGWWYVSSYGVPFNSAMPRIDGITFSTIFFILFAIVALYAYYLHFTNTGHGEGRLIRTLTVSFWAPIPFAAGLMTLVFIGSMVAGIVRQYPTYSNGWANIRALTGGCGLADDVLVEPDSNAGYMTALPSNYGPLGPLGOVNAIGFTANGVPEHTVAEAIRITPNQPGTDYDWEAPTKLKAPGINGSVVPLPYGLNPNKVPIAGTYTTGAQQQSRLTSAWYQLPKPDDRHPLVVVTAAGKITGNSVLHGHTYGQTVVLEYGDPGPNGGLVPAGRLVPDDLYGEQPKAWRNLRFARSQMPFDAVAVRVVAENLSLTPEDWIAVTPPRVPELRSLQEYVGSSQPVLLDWEVGLAFPCQQPMLHANGVTDIPKFRITPDYSAKKIDTDTWEDGANGGLLGITDLLLRAHVMSTYLARDWGRDWGSLRKFDPLVDTHPAQLDLDTATRSGWWSPGKIRIKP>BL;embC, ML.tab 144115:147327 reverse MW:114723VSGAGANYWIARLLAVIAGLLGALLAMATPFLPVNQNTAQLNWPQNSTFESVEAPLIGYVATGLNVTVPCAAAAGLTGPQ(SEQ ID NO:330)SAGQTVLLSTVPKQAPKAVDRGLLIQRANDDLVLVVRNVPVVSAPMSQVLSPACQRLTFAAYFDKITAEFVGLTYGPNAEHPGVPLRGERSGYDFRPQIVGVFTDLSGPIPTGLNFSATIDTRYSSSPTLLKTIAMILGVVLTIVALVALHLLDTADGTQHRRLLPSRWWSIGCLDGLVITILAWWHFVGANTSDDGYILTMARVSEHAGYMANYYRWFGTPEAPFGWYYDLLALWAHVTTTSAWMRVPTLAMALTCWWLISREVIPRLGHAAKASRAAAWTAAGMFLAVWLPLDNGLRPEPIIALGILLTWCSVERAVATSRLLPVAVACIVGALTLFSGPTGIASIGALLVAVGPLLTILQRRSKQFGAVPLVAPILAASTVTAILIFRDQTFAGESQASLLKRAVGPSLKWFDEHIRYERLFMASPDGSVARRFAVLALLVALSVAVAMSLRKGRIPGLAAGPSRRIIGITVTSFLAMMFTPTKWTHHFGVFAGLAGSLGALAAVAVASAALRSRRNRTVFAAVVLFVVALSFASVNGWWYVSNFGVPWSNSFPKLRWSLTTALLELTVIVLLLAAWFHFVATTNGSAKTRFGVRIDRIVQSPIAIATWSLVIFEVASLTMAMIGQYPAWTVGKSNLQALTGQTCGLAEEVLVEQDPNAGMLLPVSTPVADALGSSLAEAFTANGIPADVSADPVMEPPGDRSFVKENGMTTGGEAGNEGGTNATPGINGSRAQLPYNLDPARTPVLGSWQSGIQVVARLRSGWYRLPARDKAGPLLVVSAAGRFDHHEVKLQWATDSGAASGQPGGAFQFSDVGASPAWRNLRLPLSAIPSMATQIRLVADDEDLAPQHWIALTPPRIPQLRTLQDVVGYQDPVFLDWLVGLAFPCQRPFDHQYGVDETPKWRILPDRFGAEANSPVMDNNGGGPLGVTELLLKATTVASYLKDDWSRDWGALQRLTPYYPNAQPARLSLGTTTRSGLWNPAPLRH>BL;1ppS, ML.tab 524929:526143 forward MW:43242VGTATRRVQPKAWRALLTLLVISVVMPGVACNRGGGNVPVNVIGDKGTPFADLLVPKLTASVTDGAVGVNVDMPVTVTVA(SEQ ID NO:331)DGVLAAVTMINDNGRMINGQFSPDGLRWSTTEPLGFNRCYTLSAKALGLGGVVNRQMTFQTSSPAHLTMPYVNPGNGEIVGIGEPVAIRFDENIANRLAAQKAITITANPPVEGAFYWLNNREVRWRPEHFWKSGTAVDVAVNTYGVDLGEGMFGEDNVKTHFTIGDEVFATADDATKMLTVRVNGEVVKIMPTSMGKDSTPTANGIYIVGARFKHIIMDSSTYGVPVNSPNGYRADVDWATQLSYSGVFLHSAPWSVGAQGHTNTSHGCLNVSPSNAQWFYDHVKRGDIVEVVNTVGDTLPGAEGLGDWNIPWEQWKAGNANI>BL;1ppX, ML.tab 188551:189252 forward MW:24411MNDRKWVTSSVMLVTLSACLALGLSGCSSTKPDAQEQSSSSSPASSDPALTAEIKQSLETTKALSSVHVVVQTTGKVDAL(SEQ ID NO:332)LGISNADVDVQANPLAVKGTCTYNDQPGVPFRVLGDNISVKLFDDWSNLGSISDLSTSHVLDPNTGITQVLSGVINLQAQGTEVVDRIPTNKITGTVPTSSVKMLDPKAKGSKLATVWIAQDGSHHLVRASIDLGSGSIQLTQSKWNEPVNTN>BL;1pqB, ML.tab 918368:920137 forward MW:61768VMRGVLVIMRLLCLGMLFTGCAGVPNSSAPQAIGTVERPVPSNLPKPTPGMDPDVLLREFFKATADPANRHLAARQFLTQ(SEQ ID NO:333)SASNAWDDAGRALLIDHVVFVETRGAERVSATMRADILGSLSDMGVFETAEGVLPDPGPVELIKTSGCWRIDRLPNGVFLDWQQFQATYKRNTLYFADPTGKTVVPDPRYVAVLGHDQLATELVSKLLAGPRPEMAHAVRNLLAPPLRLRGPVTRADGSKSGIGRGYGGARIDLEKLSTTDPHSRQLLAAQIIWTLARADIRGPYVINADGAPLDDRFADGWTTSDVAATDPGVADGAGAGLHALVGGALVSLIGQNTTTVLGAFGRMGYQTGAALSRSGRQVASVVTLRRGAPDMAASLWIGDLGGEAVQSADGHSLSRPSWSLDDAVWVVVDTNNVLRAIPEPASGQPARIPVDSAAVASRFPGPITDLQLSRDGTRAAMVIGGQVILAGVEQTQAGQFALTYPRRLGFGLGSSVVSLSWRTGDDIVVTRTDATHPVSYVNLDGVNSDAPARGLQVPLSVIAANPSTVYVAGPQGVLQYSASVAESQQGWSEVAGLTVMGAEPVLPG>BL;1pqE, ML.tab 409442:409993 reverse MW:19242VSRFKISLPALATRVAVLGFLTLMASVLGGCGAGQISQTATQEPAVNGNRVTLNNLALRDIRIQAAQTGDFLQSGRTVDL(SEQ ID NO:334)MLVAINNSPYVTDRLVSITSDIGTVALNGYTQLPTNGMLFIGTSEGQRIKPPPLQSNNIAKAIVTLAKPITNGLTYNFTFNFEKAGQANVAVPVSAGLAPRQT>BL;1pqT, ML.tab 322399:323055 reverse MW:23455MQAIRLGLHTAAAVVTLSISAVSCGTKTPDYQLILSKSSTTTTTTPDKPIPLPQYLESIGVTGQQVAPSSLPGLTVSIPT(SEQ ID NO:335)PPGWSPYSNPNITPETLIIAKSGKYPTARLVAFKLRGDFDPTQVIKHGNDDAQLFENFRQLDVSTANYNGFPSAMIQGSYDLEGRRLHAWNRIVIPTGPPPSKQQYLVQLTITSLANEAVAQSNDIEAIIRGFVVAPK>BL;1prG, ML.tab 674679:675395 reverse MW:24874MQAPKHHRRLFAVLATLNTATAVIAGCSSGSNLSSGPLPDATTWVKQATDITKNVTSAHLVLSVNGKITGLPVKTLTGDL(SEQ ID NO:336)TTHPNTVASGNATITLDGADLNANFVVVDGELYATLTPSKWSDFGKASDIYDVASILNPDAGLANVLANFTGAKTEGRDSINGQSAVRISGNVSADAVNKIAPPFNATQPMPATVWIQETGDHQLAQIRIDNKSSGNSVQMTLSNWDEPVQVTKPQVS>BL;1sr2, ML.tab 305368:305706 forward MW:12164MAKKVTVTLVDDFDGAGAADETVEFGLDGVTYEIDLTNKNAAKLRGDLRQWVSAGRRVGGRRRGRSNSGRGRGAIDREQS(SEQ ID NO:337)AAIREWARRNGHNVSTRGRIPADVIDAFHAAT>BL;mihF, ML.tab 656895:657212 forward MW:11474VALPQLTDEQRAAALEKAAAARRARAELKDRLKRGGTNLTQVLKDAESDEVLGKMKVSALLEALPKVGKVKAQEIMTELD(SEQ ID NO:338)IAPTRRLRGLGERQRKALLEKFGSA>BL;mmpS3, ML.tab 1041502:1042383 forward MW:30891MSGPNPPGRENEESDSGNELSGELDPHNGVESVDELVPVPDSDLVTASDHTSETEVYSQAYSAPEAEHFTAVPYVPADLR(SEQ ID NO:339)LYDYDESSVYDEPGAAPRWPWVVGVAAILAAISLVVSVSLLFTRTDTSKLSTPTTGRSTPPVQDEITTVKPPPPSTETSTATETQTVTVTPLPPPSATSTAVPPSSVVPPPPTTPTTTVTTLTGPRQVTYSVTGTKAPGDIISVTYVDASCRRRTQHMVYIPWSMTVTPISQSDVGSVEAFSLFRVSKLNCLITTSDGTVLSSNSNDAPQTSC>BL;mtb12, ML.tab 753532:754035 forward MW:17130MTMKSIATYAALAIIGAAVDGLTSMAIPTGPAASHIQPVAFGVPLPQDPAPAADVPTAAELTSLLNKIVDPDVSFMHKSQ(SEQ ID NO:340)LVEGGIGSAEAHIGDRELKNAAQKGELPLLFSVTNIRPGTSGSATADVSVSGPKLNPPVTQNITFINKGSWVLSRHSAMELLQAAGR>BL;whiB1, ML.tab 953665:953919 reverse MW:9318MDWRHKAVCRDEDPELFFPVGNSGPAIAQIADAKLVCNRCPVTTECLAWALNTGQDSGVWGGMSEDERRALKRRNTRTKA(SEQ ID NO:341)RSGV>BL;whiB2, ML.tab 903227:903496 forward MW:10119VVPKALVAFEVESEPESSDQWQDRALCAQTDPEAFFPEKGGSTREAKKICLGCEVRHECLEYALAHDERFGIWGGLSERE(SEQ ID NO:342)RRRLKRGVI>BL;whiB3, ML.tab 475771:476079 reverse MW:11576MPQPKQLPGPNATIWNWQLQGLCRGVDSSMFFHPDGERGRARMQREQRAKEMCRRCPVIEECRAHALDVGEPYGVWGGLS(SEQ ID NO:343)ESERDLLLKGDLARSRSIPRSAH. tuberculosis proteins that are potential targets for the diagnosis,prophylaxis or treatment of mycobacterioses.>BL;RV0007, H37RV2.tab 9914:10825 forward MW:31041VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQPPPVSHPEGRPTNPPAAADA(SEQ ID NO:344)RLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAPLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD>BL;Rv0010C, H37RV2.tab 13136:13558 reverse MW:15166MQQTAWAPRTSGIAGCGAGGVVMAIASVTLVTDTPGRVLTGVAALGLILFASATWRARPRLAITPDGLAIRGWFRTQLLR(SEQ ID NO:345)NSNIKIIRIDEFRRYGRLVRLLEIETVSGGLLILSRWDLGTDPVEVLDALTAAGYAGRGQR>BL;Rv0011c, H37RV2.tab 13717:13995 reverse MW:10429MPKSKVRKKNDFTVSAVSRTPMKVKVGPSSVWFVSLFIGLMLIGLIWLMVFQLAAIGSQAPTALNWMAQLGPWNYAIAFA(SEQ ID NO:346)FMITGLLLTMRWH>BL;RV0020C, H37RV2.tab 23864:25444 reverse MW:56880MGSQKRLVQRVERKLEQTVGDAFARIFGGSIVPQEVEALLRREAADGIQSLQGNRLLAPNEYIITLGVHDFEKLGADPEL(SEQ ID NO:347)KSTGFARDLADYIQEQGWQTYGDVVVRFEQSSNLHTGQFRARGTVNPDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNSSYRGGQGQGRPDEYYDDRYARPQEDPRGGPDPQGGSDPRGGYPPETGGYPPQPGYPRPRHPDQGDYPEQIGYPDQGGYPEQRGYPEQRGYPDQRGYQDQGRGYPDQGQGGYPPPYEQRPPVSPGPAAGYGAPGYDQGYRQSGGYGPSPGGGQPGYGGYGEYGRGPARHEEGSYVPSGPPGPPEQRPAYPDQGGYDQGYQQGATTYGRQDYGGGADYTRYTESPRVPGYAPQGGGYAEPAGRDYDYGQSGAPDYGQPAPGGYSGYGQGGYGSAGTSVTLQLDDGSGRTYQLREGSNIIGRGQDAQFRLPDTGVSRRHLEIRWDGQVALLADLNSTNGTTVNNAPVQEWQLADGDVIRLGHSEIIVRMH>BL;Rv0039C, H37RV2.tab 42007:42351 reverse MW:11292MFLAGVLCMCAAAASALFGSWSLCHTPTADPTALALRAMAPTQLAAAVMLAAGGVVAVAAPGHTALMVVIVCIAGAVGTL(SEQ ID NO:348)AAGSWQSAQYALRRETASPTANCVGSCAVCTQACH>BL;Rv0040c, H37RV2.tab 42434:43365 reverse MW:31923MIQIARTWRVFAGGMATGFIGVVLVTAGKASADPLLPPPPIPAPVSAPATVPPVQNLTALPGGSSNRFSPAPAPAPIASP(SEQ ID NO:349)IPVGAPGSTAVPPLPPPVTPAISGTLRDHLREKGVKLEAQRPHGFKALDITLPMPPRWTQVPDPNVPDAFVVIADRLGNSVYTSNAQLVVYRLIGDFDPAEAITHGYIDSQKLLAWQTTNASMANFDGFPSSIIEGTYRENDMTLNTSRRHVIATSGADKYLVSLSVTTALSQAVTDGPATDAIVNGFQVVAHAAPAQAPAPAPGSAPVGLPGQAPGYPPAGTLTPVPPR>BL;Rv0049, H37RV2.tab 52831:53241 forward MW:15000VDYTLRRRSLLAEVYSGRTGVSEVCDANPYLLRAAKFHGKPSRVICPICRKEQLTLVSWVFGEHLGAVSGSARTAEELIL(SEQ ID NO:350)LATRFSEFAVHVVEVCRTCSWNHLVKSYVLGAARPARPPRGSGGTRTARNGARTASE>BL;Rv0051, H37RV2.tab 55696:57375 forward MW:61209VTGALSQSSNISPLPLAADLRSADNRDCPSRTDVLGAALANVVGGPVGRHALIGRTRLMTPLRVMFAIALVFLALGWSTK(SEQ ID NO:351)AACLQSTGTGPGDQRVANWDNQRAYYQLCYSDTVPLYGAELLSQGKFPYKSSWIETDSNGTPQLRYDGQIAVRYMEYPVLTGIYQYLSMAIAKTYTALSKVAPLPVVAEVVMFFNVAAFGLALAWLTTVWATSGLAGRRIWDAALVAASPLVIFQIFTNFDALATGLATSGLLAWARRRPVLAGVLIGLGSAAKLYPLLFLYPLLLLGIRAGRLNALARTMAAAAATWLLVNLPVMLLFPRGWSEFFRLNTRRGDDMDSLYNVVKSFTGWRGFDPTLGFWEPPLVLNTVVTLLFVLCCAAIAYIALTAPHRPRVAQLTFLTVASFLLVNKVWSPQFSLWLVPLAVLALPHRRILLAWMTIDALVWVPRMYYLYGNPSRSLPEQWFTTTVLLRDIAVMVLCGLVWQIYRPGRDLVRTGGPGALPACGGVDDPVGGVFANAAPPGRLPSWLRPRLGDEHRERTPDAGRDRTFSGQHRA>BL;RV0093C, H37RV2.tab 102818:103663 reverse MW:29599VLAQATTAGSFNHHASTVLQGCRGVPAAMWSEPAGAIRRHCATIDGMDCEVAREALSARLDGERAPVPSARVDEHLGECS(SEQ ID NO:352)ACRAWFTQVASQAGDLRRLAESRPVVPPVGRLGIRRAPRRQHSPMTWRRWALLCVGIAQIALGTVQGFGLDVGLTHQHPTGAGTHLLNESTSWSIALGVIMVGAALWPSAAAGLAGVLTAFVAILTGYVIVDALSGAVSTTRILTHLPVVIGAVLAIMVWRSASGPRPRPDAVAAEPDIVLPDNASRGRRRGHLWPTDGSAA>BL;RV0098, H37RV2.tab 107600:108148 forward MW:20528MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTGHFNAVELIL(SEQ ID NO:353)CFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP>BL;Rv0100, H37RV2.tab 109783:110016 forward MW:8660VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAANPSIAGWLRELEAVCTEFG(SEQ ID NO:354)>BL;Rv0116c, H37Rv2.tab 140270:141022 reverse MW:26915MRRVVRYLSVVVAITLMLTAESVSIATAAVPPLQPIPGVASVSPANGAVVGVAHPVVVTFTTPVTDRRAVERSIRISTPH(SEQ ID NO:355)NTTGHFEWVASNVVRWVPHRYWPPHTRVSVGVQELTEGFETGDALIGVASISAHTFTVSRNGEVLRTMPASLGKPSRPTPIGSFHAMSKERTVVMDSRTIGIPLNSSDGYLLTAHYAVRVTWSGVYVHSAPWSVNSQGYANVSHGCINLSPDNAAWYFDAVTVGDPIEVVG>BL;Rv0146, H37Rv2.tab 172211:173140 forward MW:34016MRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLLVTNAGAGAIWEAMLDPTLVAKAAAIDAETAAIVAYL(SEQ ID NO:356)RSYQAVRTNFFDTYFASAVAAGIRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAENGVTPSAGRREVPADLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQDRLFTQVGAVSVAGSRIAAETAPVHGEERRAEMRARFKKVADVLGIEQTIDVQELVYHDQDRASVADWLTDHGWRARSQRAPDEMRRvGRWVEGVPMADDPTAFAEFVTAERL>BL;Rv0164, H37Rv2.tab 193626:194180 forward MW:20165MTAISCSPRPRYASRMPVLSKTVEVTADAASIMAIVADIERYPEWNEGVKGAWVLARYDDGRPSQVRLDTAVQGIEGTYI(SEQ ID NO:357)HAVYYPGENQIQTVMQQGELFAKQEQLFSVVATGAASLLTVDMDVQVTMPVPEPMVKNLLNNVLEHLAENLKQPAEQLAAS#GMCGLSRRLRSQPGPPSACVPHR>BL;Rv0175, H37Rv2.tab 206814:207452 forward MW:22324VKAADSAESDAGADQTGPQVKAADSAESDAGELGEDACPEQALVERRPSRLRRGWLVGIAATLLALAGGLGAAGYFALRS(SEQ ID NO:358)HQESQSIAREDLAAIEAAKDCVAATQAPDAGANSASMQKIIECGTGDFGAQASLYTSMLVEAYQAASVHVQVTDMRAAVERNNNDGSVDVLVALRVKVSNTDSDAHEVGYRLRVRMALDEGRYKIAKLDQVTK>BL;Rv0176, H37Rv2.tab 207452:208417 forward MW:35405VTVVVEKTPTTLPQATPNGAAPWHVRAGAFAIDVLPGLAVAATMALTALTVPPGSAWRWLCACLLGLTILLLAVNRLLLP(SEQ ID NO:359)TITGWSLGRALTGIRVVRRDGSAIGPWRLLVRDLAHLVDTLSLFVGWLWPLWDSRRRTFADLLLRTEVRRVEPVQRPAVIRRLTAAVALAAAGACASATAVGAAVVYVNEWQTDHTRAQLATRGPKLVVDVLSYDPETVQRDFERARSLATDRYRPQLSIQQDSVRESGPVRNQYWVTDSAVLSATPAQATMLLFMQGERGTPPNQRYIQSTVRAIFQKSRGQWRLDDLAVVMKPRQPTGEK>BL;Rv0177, H37Rv2.tab 208417:208968 forward MW:20164MSPRRKFEPGEGALLAPQSIEPSRRWGLPLALTASAVVMAAAISACALMRISHESHQRAAHKDIVMLSDVRSFMTMFTSP(SEQ ID NO:360)DPFHANEYAERVLSHATGDFAKQYHERANDILIRISGVEPTTGTVLDAGVQRWNEDGSANVLVVTQITSKSADGKRVVSNANRWLVTAKQEGNEWKISSLLPVI>BL;Rv0178, H37Rv2.tab 208938:209669 forward MW:25879VEDQQSASGDLTQKSVANGESTDTASAATEGHRGEIDAAGEPDERGAAVADSQADEDDSAATAARGGKTRARRSRGRRLA(SEQ ID NO:361)ITVGVAAALFVGSAAFAGATVEPYLSERAVVATKLMVARTAANAITTLWTYTPENMDTLADRAANYLSGDFAAQYRRFVDQIAAANKQAXITNDTEVTGAAVESLSGRDAVAIVYTNTTTTSPVTKNIPALKYLSYRLFMKRYDARWLVTRMTTITSLDLTPQV>BL;Rv0184, H37Rv2.tab 214969:215715 forward MW:26826MTNDKMLARIAALLRQAEGTDNPHEADAFMSTAQRLATAASIDLAVARSHAGNRSPAQAPTQRTITIGAAGTRGLRTYVQ(SEQ ID NO:362)LFVLIAAANDVRCDVASNSTFVYAYGFAEDIDTSHALYASLVVQMVRASDAYLASGAHRPTPTITARLNFQLAFGARVGQRLADAREQTRQEATKDRDRPPGTAIALRDKDIELHEYYRRSSKARGAWRASRATAGYSSAARRAGDRAGRQARLGNNPELPGARAALGR>BL;Rv0185, H37Rv2.tab 215715:216221 forward MW:18365VIGADVPRDSQRARVYAAEAFVRTLFDRvTAHGSPTVEFFGTQLTLPPEGRFGSVASVQRYVDDVLALPAVGQNWPTVSP(SEQ ID NO:363)VRVRARRAATAAHYENHGGTGTIAVPDRHTAGWANRELVVLHEVAHHLCQVPPPHGPEFVATVCTLTELVMGPEVGHVFRVVYAQEGVR>BL;Rv0199, H37Rv2.tab 236550:237206 forward MW:23520MPDGEQSQPPAQEDAEDDSRPDAAEAAAAEPKSSAGPMFSTYGIASTLLGVLSVAAVVLGAMIWSAHRDDSGERTYLTRV(SEQ ID NO:364)MLTAAEWTAVLINMNADNIDASLQRLHDGTVGQLNTDFDAVVQPYRQVVEKLRTHSSGRIEAVAIDTVHRELDTQSGAARPVVTTKLPPFATRTDSVLLVATSVSENAGAKPQTVHWNLRLDVSDVDGKLMISRLESIR>BL;Rv0200, H37Rv2.tab 237206:237892 forward MW:24029MRNAWRLVVFDVLAPLATIAALAAIGVLLGWPLWWVSTCSVLVLLVVEGVAINFWLLRRDSVTVGTDDDAPGLRLAVVFL(SEQ ID NO:365)CAAAISAAVVTGYLRWTTPDRDFNRDSREVVHLATGMAETVASFSPSAPAAAVDRAAANMVPEHAGGFKEQYAKSSADLARRGVTAQAATLAAGVEAIGPSAASVAVILRVSQSIPGQPTSQAARALRVTLTKRGSGWLVLDVTPINAR>BL;Rv0201c, H37Rv2.tab 237895:238395 reverse MW:18484VTLAAEPHPAPPQQPTVAWSEPDVDRRVEFWPTVAIRSALESGDIATWQRIAAALKRDPYGRTARQVEEVLEGIPATGIA(SEQ ID NO:366)NAFWEVLDRARTHLDANERAEVARQVGLLLDRSGLQRQEFASRIGVTAQDLTAYLDGIVSPSASLMIRMRRLSDRFVRAKSVRAADS>BL;Rv0207c, H37Rv2.tab 247387:248112 reverse MW:26175MSLTEDVTSQTSESLARHSVLAEDLSQDGLTSLGAPGARvLLVWDAPNLDMGLGSILGRRPTALERPRFDALGRWLLART(SEQ ID NO:367)AEIVAGRPGISTEPEATVFTNIAPGSAEVVRPWVDALRNVGFAVFAKPKVDEDSDVDRDMLAHIDERYREGLAALVVASADGQAFRQPLEAVARSGTPVQVLGFREHASWALASDTLEFVDLEDIAGVFREPLPRIGLDSLPEQGAWLQPFRPLSSLLTSRV>BL;Rv0216, H37Rv2.tab 258913:259923 forward MW:35756VASGYGGIRvGGPYFDDLSKGQVFDWAPGVTLSLGLAAAHQSIVGNRLRLALDSDLCAAVTGMPGPLAHPGLVCDVAIGQ(SEQ ID NO:368)STLATQRVKANLFYRGLRFHRFPAVGDTLYTRTEVVGLRANSPKPGRAPTGLAGLRMTTIDRTDRLVLDFYRCANLPASPDWKPGAVPGDDLSRIGADAPAPAADPTAHWDGAVFRKRVPGPHFDAGIAGAVLHSTADLVSGAPELARLTLNIAATHHDWRVSGRRLVYGGHTIGLALAQATRLLPNLATVLDWESCDHTAPVHEGDTLYSELHIESAQAHADGGVLGLRSLVYAVSDSASEPDRQVLDWRFSALQF>BL;Rv0226c, H37Rv2.tab 269837:271564 reverse MW:59107VRWFRPGYALVLVLLLAAPLLRPGYLLLRDAVSTPRSYVSANALGLTSAPRATPQDFAVALASHLVDGGVVVKALLLLGL(SEQ ID NO:369)WLAGWGAARLVATALPAAGAAGQFVAITLAIWNPYVAERLLQGHWSLLVGYGCLPWVATAMLTMRTTVGAGWFGLFGLAFWVALAGLTPSGLLLAATVAVVCVANPGAGRPRWQCGVAALGSALVGALPWLTASALGSSLTSHTAANQLGVTAFAPRAEPGLGTLGSLASLGGIWNGEAVPSSRTTLFAVASAVVLLANVAIGLPTVARRPVAVPLLTLAAVSVMVPAVLATGPGLHALRVVVDAAPGLGVLRDGQKWVALAVPGYTLSGAGTVLTLRRWLRPATAAVVCCLALVLTLPDLAWGVWGKVAPVHYPSGWAAVAAAINADPRTVAVLPAGTMRRFSWSGSAPVLDPLPRWVRADVLTTGDLVISGVTVPGEDAHARAVQELLLTGPHPSTLAAAGVGWLVVESDSAGDMGAAARTLGRLAAAHRDDELALYRVGGQTSGASSARLKATMLAHWAWLSMLLVGGAGAAGYWVRRHLHHCEDTPASRAQD>BL;Rv0227c, H37Rv2.tab 271577:272839 reverse MW:45528MLRFAACGAIGLGAALLIAALLLSTYTTSRIAEIPLDIDATLISDGTGTALDSASLATEHIVVNQDVPLVSQQQVTVESP(SEQ ID NO:370)ANADVVTLQVGSSLRRTDKQKDSGLLLAIVDTVTLNRKTAMAVSDDTHTGGAVQKPRGLNDENPPTAIPLRHDGLSYRFPFHTEKKTYPYFDPIAQKAFDANYEGEEDVNGLTTYRFTQNVGYTPEGKLVAPLKYPSLYAGDEDGKVTTSAANWGLPGDPNEQITMTRYYAAQRTFWVDPVSGTIVKETERANNYFARDPLKPEVTFADYOVTSTEETVESQVNAARDERDRLALWSRVLPITFTAAGLVALVGGGLFASFSLRTEGALMAASGDRDDHDYRRGGFEEPVPGAEAETEKLPTQRPDFPREPSGSDPPRLGSAQPPPPPDAGHPDPGPPERR>BL;Rv0236A, H37Rv2.tab 286898:287071 reverse MW:5833MNRIVAPAAASVVVGLLLGAAAIFGVTLMVQQDKKPPLPGGDPSSSVLNRvEYGNRS(SEQ ID NO:372)>BL;Rv0236c, H37Rv2.tab 282652:286851 reverse MW:146250VAPLSRKWLPVVGAVALALTFAQSPGQVSPDTKLDLTANPLRFLARATNLWNSDLPFGQAQNQAYGYLFPHGTFFVIGHL(SEQ ID NO:373)LGVPGWVTQRLWWAVLLTVGFWGLLRvAEALGVGGPSSRvVGAVAFALSPRVLTTLGSISSETLPMMLAPWVLLPTILALRGTSGRSVRALAAQAGLAVALMGAVNAIATLAGCLPAVIWWACHRPNRLWWRYTAWWLLAMALATLWWVMALTQLNGVSPPFLDPIESSGVTTQWSSLVEVLRGTDSWTPFVAPNATAGAPLVTGSAAILGTCLVAAAGLAGLTSPAMPARGRLVTMLLVGVVLLAVGHRGGLASPVAHPVQAFLDAAGTPLRNVHKVGPVIRLPLVLGLAQLLSRVPLPGSAPRPAWLRAFAHPERDKRVAVAVVALTALMVSTSLAWTGRVAPPGTFGALPQYWQEAADWLRTHHAATPTPGRVLVVPGAPFATQVWGTSHDEPLQVLGDGPWGVRDSIPLTPPQTIRALDSVQRLFAAGRPSAGLADTLARQGISYVLVRNDLDPETSRSARPILLHRSIAGSPGLAKLAEFGAPVGPDPLAGFVNDSGLRPRYPAIEIYRVSAPANPGAPYFAATDQLARvDGGPEVLLRLDERRRLQGQPPLGPVLMTADARAAGLPVPQVAVTDTPVARETDYGRVDHHSSAIRAPGDARHTYNRVPDYPVPGAEPVVGGWTGGRITVSSSSADATAMPDVAPASAPAAAVDGDPATAWVSNALQAAVGQWLQVDFDRPVTNAVVTLTPSATAVGAQVRRILIETVNGSTTLRFDEAGKPLTAALPYGETPWVRFTAAATDDGSAGVQFGITDLAITQYDASGFAHPVQLRHTVLVPGPPPGSAIAGWDLGSELLGRPGCAPGPDGVRCAASMALAPEEPANLSRTLTVPRPVSVTPMVWVRPRQGPKLADLIAAPSTTRASGDSDLVDILGSAYAAADGDPATAWTAPQRvVQHKTPPTLTLTLPRPTVVTGLRLAASRSMLPAHPTVVAINLGDGPQVRQLQVGELTTLWLHPRVTDTVSVSLLDWDDVIDRNALGFDQLKPPGLAEVVVLSAGGAPIAPADAARNRARALTVDCDHGPVVAVAGRFVHTSIRTTVGALLDGEPVAALPCEREPIALPAGQQELLISPGAAFVVDGAQLSTPGAGLSSATVTSAETGAWGPTHREVRVPESATSRVLVVPESINSGWVARTSTGARLTPIAVNGWQQAWVVPAGNPGTITLTFAPNSLYRASLAIGLALLPLLALLAFWRTGRRQLADRPTPPWRPGAWAAAGVLAAGAVIASIAGVMVMGTALGVRYALRRRERLRDRvTVGLAAGGLILAGAALSRHPWRSVDGYAGNWASVQLLALISVSVVAASVVATSESRGQDRMQ>BL;2V0241c, H37Rv2.tab 289815:290654 reverse MW:30163VTQPSGLKNLLRAAAGALPVVPRTDQLPNRTVTVEELPIDPANVAAYAAVTGLRYGNQVPLTYPFALTFPSVMSLVTGFD(SEQ ID NO:374)FPFAAMGAIHTENHITQYRPIAVTDAVGVRVRAENLREHRRGLLVDLVTNVSVGNDVAWHQVTTFLHQQRTSLSGEPKPPPQKKPKLPPPAAVLRITPAKIRRYAAVCGDHNPIHTNPIAAKLFGFPTVIAHGMFTAAAVLANIEARFPDAVRYSVRFAKPVLLPATAGLYVAEGDGGWDLTLRNMAKGYPHLTATVRGL>BL;Rv02500, H37Rv2.tab 301738:302028 reverse MW:10878LSTTAELAELHDLVGGLRRCVTALKARFGDNPATRRIVIDADRILTDIELLDTDVSELDLERAAVPQPSEKIAIPDTEYD(SEQ ID NO:375)REFWRDVDDEGVGGHRY>BL;Rv0257, H37Rv2.tab 309699:310071 forward MW:13053MTRVSWLPDRCLPRLPACGRGLRGSLPGDSGGTAPDSHRLPASSSPDGKNIGMQSVDLHVERHLPSRGRSNRTVATVTCV(SEQ ID NO:376)TALGDIRSAQLSATGAWPAVLFPSWSWLCGIGGGVDLQKPSCRA>BL;Rv0283, H37Rv2.tab 344022:345635 forward MW:55943MTNQQHDHDFDHDRRSFASRTPVNNNPDKVVYRRGFVTRHQVTGWRFVMRRIAAGIALHDTRMLVDPLRTQSRAVLMGVL(SEQ ID NO:377)IVITGLIGSFVFSLIRPNGQAGSNAVLADRSTAALYVRVGEQLHPVLNLTSARLIVGRPVSPTTVKSTELDQFPRGNLIGIPGAPERMVQNTSTDANWTVCDGLNAPSRGGADGVGVTVIAGPLEDTGARAAALGPGQAVLVDSGAGTWLLWDGKRSPIDLADHAVTSGLGLGADVPAPRIIASGLFNAIPEAPPLTAPIIPDAGNPASFGVPAPIGAVVSSYALKDSGKTISDTVQYYAVLPDGLQQISPVLAAILRNNNSYGLQQPPRLGADEVAKLPVSRVLDTRRYPSEPVSLVDVTRDPVTCAYWSKPVGAATSSLTLLAGSALPVPDAVHTVELVGAGNGGVATRVALAAGTGYFTQTVGGGPDAPGAGSLFWVSDTGVRYGIDNEPQGVAGGGKAVEALGLNPPPVPIPWSVLSLFVPGPTLSRADALLAHDTLVPDSRPARPVSAEGGYR>BL;Rv0288, H37Rv2.tab 351848:352135 forward MW:10390MSQIMYNYPMLGHAGDMAGYAGTLQSLGAEIAVEQAALQSAWQGDTGITYQAWQAQWNQANEDLVRAYHAMSSTHEANT(SEQ ID NO:378)MAMMARDTAEAAKWGG>BL;Rv0289, H37Rv2.tab 352149:353033 forward MW:31559MDATPNAVELTVDNAWFIAETIGAGTFPWVLAITMPYSDAAQRGAFVDRQRDELTRMGLLSPQGVINPAVADWIKVVCFP(SEQ ID NO:379)DRWLDLRYVGPASADGACELLRGIVALRTGTGKTSNKTGNGVVALRNAQLVTFTAMDIDDPRALVPILGVGLAHRPPARFDEFSLPTRvGARADERLRSGVPLGEVVDYLGIPASARPVVESVFSCPRSYVEIVAGCNRDGRHTTTEVGLSIVDTSAGRVLVSPSRAFDGEWVSTFSPGTPFAIAVAIQTLTACLPDGQWFPGQRvSRDFSTQSS>BL;Rv0290, H37Rv2.tab 353083:354498 forward MW:47944MSGTVMQIVRVAILADSRLTEMALPAELPLREILPAVQRLVVPSAQNGDGGQADSGAAVQLSLAPVGGQPFSLDASLDTV(SEQ ID NO:380)GVVDGDLLVLQPVPAGPAAPGIVEDIADAANIFSTSRLKPWGIAHIQRGALAAVIAVALLATGLTVTYRvATGVLAGLLAVAGIAVASALAGLLITIRSPRSGIALSIAALVPIGAALALAVPGKFGPAQVLLGAAGVAAWSLIALMIPSAERERVVAFFTAAAVVGASVALAAGAQLLWQLPLLSIGCGLIVAALLVTIQAAQLSALWARFPLPVIPAPGDPTPSAPPLRLLEDLPRRVRVSDAHQSGFIAAAVLLSVLGSVAIAVRPEALSVVGWYLVAATAAAATLRARVWDSAACKAWLLAQPYLVAGVLLVFYTATGRYVAAFGAVLVLAVLMLAWVVVALNPGIASPESYSLPLRRLLGLVAAGLDVSLIPVMAYLVGLFAWVLNR>BL;Rv0292, H37Rv2.tab 355880:356872 forward MW:35932MNPIPSWPGRGRVTLVLLAVVPVALAYPWQSTRDYVLLGVAAAVVIGLFGFWRGLYFTTIARRGLAILRRRRRIAEPATC(SEQ ID NO:381)TRTTVLVWVGPPASDTNVLPLTLIARYLDRYGIRADTIRITSRVTASGDCRTWVGLTVVADDNLAALQARSARIPLQETAQVAARRLADHLREIGWEAGTAAPDEIPALVAADSRETWRGMRHTDSDYVAAYRVSANAELPDTLPAIRSRPAQETWIALEIAYAAGSSTRYTVAAACALRTDWRPGGTAPVAGLLPQHGNHVPALTALDPRSTRRLDGHTDAPADLLTRLHWPTPTAGAHRAPLTNAVSRT>BL;Rv0309, H37Rv2.tab 377931:378584 forward MW:22528MSRLLALLCAAVCTGCVAVVLAPVSLAVVNPWFANSVGNATQVVSVVGTGGSTAKMDVYQRTAAGWQPLKTGITTHIGSA(SEQ ID NO:382)GMAPEAKSGYPATPMGVYSLDSAFGTAPNPGGGLPYTQVGPNHWWSGDDNSPTFNSMQVCQKSQCPFSTADSENLQIPQYKHSVVMGVNKAKVPGKGSAFFFHTTDGGPTAGCVAIDDATLVQIIRWLRPGAVIAIAK>BL;Rv0313, H37Rv2.tab 382490:382873 forward MW:13916VGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAPETAGEAGDGVWAIYTVDADG(SEQ ID NO:383)GARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLDDPVDEAQ>BL;Rv0356c, H37Rv2.tab 434833:435474 reverse MW:22879VTDASVHPDELDPEYHHHGGFPEYGPASPGAGFGQFVATMRRLQDLAVAADPGDAVWDEAAERAAALVELLSPFEADEGK(SEQ ID NO:384)APAGRTPGLPGMGSLLLPPWTVTRYGTDGVEMRGSFSRFHVGGNSAVHGGVLPLLFDHMFGMISHAAGRPISRTAFLHVDYRRITPIDVPLIVRGRVTNTEGRKAFVCAELFDSDETLLAEGNGLMVRLLPGQP>BL;Rv0358, H37Rv2.tab 436860:437504 forward MW:23102MYTAENAPGVAVLLSGDADVPGPLTGLPTHQDNLDTVIGRYSRLIVVGADADLGAVLTRLLRTDRLDVEVGYVPRRRSPA(SEQ ID NO:385)TRAYRLPAGRRAARRARCGVARRVPLIRDETGSVIVGRAQWLPAEEQALIHGEAVVDDTVLFDGDVAGVCIEPTLTLPGLRAAVDGAGKWRRWIGGRAAQLGTTGAAVLRDGVAAPFPVRRSTFYRNVEGWLLVR>BL;Rv0360c, 1137Rv2.tab 438305:438739 reverse MW:15297VTKRTITPMTSMGDLLGPEPILLPGDSDAEAELLANESPSIVAAAHPSASVAWAVLAEGALADDKTVTAYAYARTGYHRG(SEQ ID NO:386)LDQLRRHGWKGRGPVPYSHQPNRGFLRCVAALARAAAAIGETDEYGRCLDLLDDCDPAARPALGL>BL;Rv0361, H37Rv2.tab 438822:439646 forward MW:29982MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQPEAQRERRFTAPGFDAKETQV(SEQ ID NO:387)IVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRvSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN>BL;Rv0383c, H37Rv2.tab 458464:459315 reverse MW:31801MVPLWFTLSALCFVGAVVLLYVDIDRRRGRSRRRKSWARSHGFDYERESTEILKRWTRGVMSTVGDVAAHNVVLGQIRGE(SEQ ID NO:388)AVYIFDLEEVATVIALHRKVGTNVVVDLRLKGLKEPRESDIWLLGAIGPRMVYSTNLDAARRACDRRMVTFAHTAPDCAEIMWNEQNWTLVSMPIASTRAQWDEGLRTVRQFNDLLRVLPPLPQEMPQQTGVGPRGAAPGRPVAPGGPAELPPRRAQPDPATTVLPDPARRAPEPIRRDEGRSEGVRRPPPAGRNGQQATNYQH>BL;Rv0401, H37Rv2.tab 479789:480157 forward MW:12641MRPRRALAGLAADVVAVLVFCAVGRRSHAEGLSVTGLAATAWPFLTGTGIGWVLARGWRRPTALAPTGVIVWLCTIWGM(SEQ ID NO:389)VLRKVSSAGVAASFVVVASAVTAVLLLGWRAAVALMAPHRADG>BL;Rv0416, H137Rv2.tab 502167:502370 forward MW:7367MIVVVNEQQVEVDEQTTIAALLDSLGFGDRGIAVALNFSVLPRSDWATKICELRKPVRLEVVTAVQGG(SEQ ID NO:390)>BL;Rv0430, H37Rv2.tab 518733:519038 forward MW:11723MDSAMARAIRSGDDAEVADGLTRREHDILAFERQWWKFAGVKEEAIKELFSMSATRYYQVLNALVDRPEALAADPMLVKR(SEQ ID NO:391)LRRLRASRQKARAARRLGFEVT>BL;Rv0431, H37Rv2.tab 519073:519564 forward MW:16905VLVTVGSMNERVPDSSGLPLRAMVMVLLFLGVVFLLLVWQALGSSPNSEDDSSAISTMTTTTAAPTSTSVKPAAPRAEVR(SEQ ID NO:392)VYNISGTEGAAARTADRLKAAGFTVTDVGNLSLPDVAATTVYYTEVEGERATADAVGRTLGAAVELRLPELSDQPPGVIVVVTG>BL;Rv0455c, H37Rv2.tab 545378:545821 reverse MW:16639MSRLSSILRAGAAFLVLGIAAATFPQSAAADSTEDFPIPRRMIATTCDAEQYLAAVRDTSPVYYQRYMIDFNNHANLQQA(SEQ ID NO:393)TINKANWFFSLSPAERRDYSEHFYNGDPLTFAWVNHMKIFFNNKGVVAKGTEVCNGYPAGDMSVWNWA>BL;Rv0463, H37Rv2.tab 554016:554306 forward MW:10111MTRRASTDTPQIIMGAIGGVTGYILWLAAISVGDGLTTVSQWSRVVLLLSVLVAVCGAAGGLRLRSRGKLAWSAFAFSL(SEQ ID NO:394)PIPPVVLTVAVLADIYL>BL;Rv0464C, H37Rv2.tab 554316:554885 reverse MW:21304MTGQNGQVARISPGKFRQLGPVNWLVAKLAARAVGAPQMHLFTTLGYRQYLFWTFAIYTGRLLHGRLPGVDTELVILRVA(SEQ ID NO:395)HLRSCEYELQHHRRMARRRGLDANTQATIFAWPDVPDGDGPRKVLSARQQALLQATDELIKDRTITAGTWERLATNLDPRLLIEFCLLATQYDAIAATITALAIPPDNPQ>BL;Rv0466, H37Rv2.tab 556458:557249 forward MW:30153VSLDKKLMPVPDGHPDVFDREWPLRVGDIDRAGRLRLDAACRHIQDIGQDQLREMGFEETHPLWIVRRTMVDLIRPIEFG(SEQ ID NO:396)DMLRCRRWCSGTSNRWCEMRVRVDGRKGGLIESEAFWIHVNRETEMPARIADDFLAGLHRTTSVDRLRWKGYLKPGSRDDASEIHEFPVRVTDIDLFDHMNNAVYWSVIEDYLASHAELLRGPLRVTIEHEAPVALGDKLEIISHVHPAGSTEIFGPGLVDRAVTTLTYVVGDEPKAVASLFNL>BL;Rv0476, H37Rv2.tab 566508:566768 forward MW:9166MLVLLVAVLVTAVYAFVHAALQRPDAYTAADKLTKPVWLVILGAAVALASILYPVLGVLGMAMSACASGVYLVDVRPKLL(SEQ ID NO:397)EIQGKSR>BL;Rv0477, H37Rv2.tab 566776:567219 forward MW:15658MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAA(SEQ ID NO:398)DAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF>BL;Rv0479C, H37Rv2.tab 567924:568967 reverse MW:37016VTNPQGPPNDPSPWARPGDQGPLARPPASSEASTGRLRPGEPAGHIQEPVSPPTQPEQQPQTEHLAASHAHTRRSGRQAA(SEQ ID NO:399)HQAWDPTGLLAAQEEEPAAVKTKRRARRDPLTVFLVLIIVFSLVLAGLIGGELYARHVANSKVAQAVACVVKDQATASFGVAPLLLWQVATRHFTNISVETAGNQIRDAKGMQIKLTIQNVRLKNTPNSRGTIGALDATITWSSEGIKESVQNAIPILGAFVTSSVVTHPADGTVELKGLLNNITAKPIVAGKGLELQIINFNTLGFSLPKETVQSTLNEFTSSLTKNYPLGIHADSVQVTSTGVVSRFSTRDAAIPTGIQNPCFSHI>BL;Rv0483, H372V2.tab 571710:573062 forward MW:47858VVIRVLFRPVSLIPVNNSSTPQSQGPISRRLALTALGFGVLAPNVLVACAGKVTKLAEKRPPPAPRLTFRPADSAADVVP(SEQ ID NO:400)IAPISVEVGDGWFQRVALTNSAGKVVAGAYSRDRTIYTITEPLGYDTTYTWSGSAVGHDGKAVPVAGKFTTVAPVKTINAGFQLADGQTVGIAAPVIIQFDSPISDKAAVERALTVTTDPPVEGGWAWLPDEAQGARVHWRPREYYPAGTTVDVDAKLYGLPFGDGAYGAQDMSLHFQIGRRQVVKAEVSSHRIQVVTDAGVIMDFPCSYGEADLARNVTRNGIHVVTEKYSDFYMSNPAAGYSHIHERWAVRISNNGEFIHANPMSAGAQGNSNVTNGCINLSTENAEQYYRSAVYGDPVEVTGSSIQLSYADGDIWDWAVDWDTWVSMSALPPPAAKPAATQIPVTAPVTPSDAPTPSGTPTTTNGPGG>BL;2V0487, H37Rv2.tab 576787:577335 forward MW:20684VTSSLPTVQRVIQNALEVSQLKYSQHPRPGGAPPALIVELPGERKLKINTILSVGEHSVRVEAFVCRKPDENREDVYRFL(SEQ ID NO:401)LRRNRRLYGVAYTLDNVGDIYLVGQMALSAVDADEVDRVLGQVLEVVDSDFNALLELGFRSSIQREWQWRLSRGESLQNLQAFAHLRPTTMQSAQRDEKELGG>BL;Rv0495c, H37Rv2.tab 585427:586314 reverse MW:32960VWRPAQGARWHVPAVLGYGGIPRRASWSNVESVANSRRRPVHPGQEVELDFAREWVEFYDPDNPEHLIAADLTWLLSRWA(SEQ ID NO:402)CVFGTPACQGTVAGRPNDGCCSHGAFLSDDDDRTRLADAVHKLTDDDWQFRAKGLRRKGYLELDEHDGQPQHRTRKHKGACIFLNRPGFAGGAGCALHSKALKLGVPPLTMKPDVCWQLPIRRSQEWVTRPDGTEILKTTLTEYDRRGWGSGGADLHWYCTGDPAAHVGTKQVWQSLADELTELLGEKAYGELAANCKRRSQLGLIAVHPATRAAQ>BL;Rv0497, H37Rv2.tab 587377:588306 forward MW:33092MTGPHPETESSGNRQISVAELLARQGVTGAPARRRRRRRGDSDAITVAELTGEIPIIRDDHHHAGPDAHASQSPAANGRV(SEQ ID NO:403)QVGEAAPQSPAEPVAEQVAEEPTRTVYWSQPEPRWPKSPPQDRRESGPELSEYPRPLRHTHSDRAPAGPPSGAEHMSPDPVEHYPDLWVDVLDTEVGEAEAETEVREAQPGRGERHAAAAAAGTDVEGDGAAEARVARRALDVVPTLWRGALVVLQSILAVAFGAGLFIAFDQLWRWNSIVALVLSVMVILGLVVSVRAVRKTEDIASTLIAVAVGALITLGPLALLQSG>BL;Rv0498, H37Rv2.tab 588325:589164 forward MW:30433VRPAIKVGLSTASVYPLRAEAAFEYADRLGYDGVELMVWGESVSQDIDAVRKLSRRYRVPVLSVHAPCLLISQRVWGANP(SEQ ID NO:404)ILKLDRSVRAAEQLGAQTVVVHPPFRWQRRYAEGFSDQVAALEAASTVMVAVENMFPFRADRFFGAGQSRERMRKRGGGPGPAISAFAPSYDPLDGNHAHYTLDLSHTATAGTDSLDMARRMGPGLVHLHLCDGSGLPADEHLVPGRGTQPTAEVCQMLAGSGFVGHVVLEVSTSSARSANERESMLAESLQFARTHLLR>BL;RvO500B, H37Rv2.tab 591475:591573 forward MW:4145MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK(SEQ ID NO:405)>BL;Rv0504c, H37Rv2.tab 594805:595302 reverse MW:18360MTVPEEAQTLIGKHYRAPDHFLVGREKIREFAVAVKDDHPTHYSEPDAAAAGYPALVAPLTFLAIAGRRVQLEIFTKFNI(SEQ ID NO:406)PINIARVFHRDQKFRFHRPILANDKLYFDTYLDSVIESHGTVLAEIRSEVTDAEGKPVVTSVVTMLGEAAHHEADADATVAAIASI>BL;Rv0528, H37Rv2.tab 618305:619891 forward MW:57131MWRSLTSMGTALVLLFLLALAAIPGALLPQRGLNAAKVDDYLAAHPLIGPWLDELQAFDVFSSFWFTAIYVLLFVSLVGC(SEQ ID NO:407)LAPRTIEHARSLRATPVAAPRNLARLPKHANARLAGEPAALAATITGRLRGWRSITRQQGDSVEVSAEKGYLREFGNLVFHFALLGLLVAVAVGKLFGYEGNVIVIADGGPGFCSASPAAFDSFRAGNTVDGTSLHPICVRVNNFQAHYLPSGQATSFAADIDYQADPATADLIANSWRPYRLQVNHPLRVGGDRVYLQGHGYAPTFTVTFPDGQTRTSTVQWRPDNPQTLLSAGVVRIDPPAGSYPNPDERRKHQIAIQCLLAPTEQLDGTLLSSRFPALNAPAVAIDIYRGDTGLDSGRPQSLFTLDHRLIEQGRLVKEKRVNLRAGQQVRIDQGPAAGTVVRFDGAVPFVNLQVSHDPGQSWVLVFAITMMAGLLVSLLVRRRRVWARITPTTAGTVNVELGGLTRTDNSGWGAEFERLTGRLLAGFEARSPDMAEAAAGTGRDVD>BL;Rv0531, H37Rv2.tab 622329:622643 forward MW:11436VSEAPNDKTTRGVVDILVYATARLLLVVAVSAAIFGVARLIGLTEFPVVVATLFGLIIAMPLGIWVFSPLRRRATAALAV(SEQ ID NO:408)AGERRRAERERLRARLRGESLPEEQ>BL;Rv0543c, H37Rv2,tab 635576:635875 reverse MW:11279VNRFLTSIVAWLRAGYPEGIPPTDSFAVLALLCRRLSHDEVKAVANELMRLGDFDQIDIGVVITHFTDELPSPEDVERVR(SEQ ID NO:409)ARLAAQGWPLDDVRDREEHA>BL;2V0544c, H37Rv2.tab 635938:636213 reverse MW:9747VSAWFNYTATLKILIFSLLAGALLPGLFAVGVRLQAAGDGADATARRRPLLVAVSWAIFALVLAVVIIGVLYIARDFIAH(SEQ ID NO:410)HTGWAFLGATPK>BL;RvOS46c, H37Rv2.tab 637586:637969 reverse MW:14346MEILASRMLLRPADYQRSLSFYRDQIGLAIAREYGAGTVFFAGQSLLELAGYGEPDNSRGPFPGALWLQVRDLEATQTEL(SEQ ID NO:411)VSRGVSIAREPRREPWGLHEMHVTDPDGITLIFVEVPEGHPLRTDTRA>BL;Rv0556, H37Rv2.tab 647959:648471 forward MW:18725VISPKPLLHILIHGLSDELPDTRGRIVLRWLRIAVLIVTGLVTLQSVLLVAGAWRNDIAIQRNNGVAQAEVLSAGPRRST(SEQ ID NO:412)IEFVTPDRITYRPQLGVLYPSELSTGMRIYVEYNKRDPNLVRvQHRNAGLAIIPAGSIAVVAWLIAAAALVVLAVLDKRLERRENSASATG>BL;Rv0559c, H37Rv2.tab 650410:650745 reverse MW:12116MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPR(SEQ ID NO:413)GTTQGQAFQFLGAAIDHYCPEHVGVLQRAGTR>BL;Rv0634A, H37Rv2.tab 731113:731364 forward MW:9408LGSDCGCGGYLWSMLKRVEIEVDDDLIQKVIRRYRVKGAREAVNLALRTLLGEADTAEHGHDDEYDEFSDPNAWVPRRSR(SEQ ID NO:414)DTG>BL;Rv0635, H37Rv2.tab 731930:732403 forward MW:17448VALSADIVGMHYRYPDHYEVEREKIREYAVAVQNDDAWYFEEDGAAELGYKGLLAPLTFICVFGYKAQAAFFKNANIATA(SEQ ID NO:415)EAQIVQVDQVLKFEKPIVAGDKLYCDVYVDSVREAHGTQIIVTKNIVTNEEGDLVQETYTTLAGRAGEDGEGFSDGAA>BL;Rv0636, H37Rv2.tab 732393:732818 forward MW:14934MALREFSSVKVGDQLPEKTYPLTRQDLVNYAGVSGDLNPIHWDDEIAKVVGLDTAIAHGMLTMGIGGGYVTSWVGDPGAV(SEQ ID NO:416)TEYNVRFTAVVPVPNDGKGAELVFNGRVKSVDPESKSVTIALTATTGGKKIFGRAIASAKLA>BL;Rv0637, H37Rv2.tab 732825:733322 forward MW:18929MALKTDIRGMIWRYPDYFIVGREQCREFARAVKCDHPAFFSEEAAADLGYDALVAPLTFVTILAKYVQLDFFRHVDVGME(SEQ ID NO:417)TMQIVQVDQRFVFHKPVLAGDKLWARMDIHSVDERFGADIVVTRNLCTNDDGELVMEAYTTLMGQQGDGSARLKWDKESGQVIRTA>BL;Rv0779c, H37Rv2.tab 872675:873292 reverse MW:21572MRSRFLPYATTPGRLLAQLISDITVAVWTTLWMLVGLAVHDAISIIGEAGRQIEIGSHGIAGNLAAAGQDAQRIPVVGDA(SEQ ID NO:418)LSNPITAASQAALDIAGAGHNLDTTAGWLAVVLALAVAATPILAVAMPWLFLRLRFCRRKWTVTTLAATPAGRQLLALRALANRPPGKLAAVSTDPVGAWRREDPATMRALAALELRAAGIPLRGD>BL;Rv0807, H37Rv2.tab 901635:902021 forward MW:13480MSARDRVDPAKTRQVVLALADWLRDETLPAPDTDVLAAAVRLTARTLAALAPGASVEVRIPPFAAVQCISGPRHTRGTPP(SEQ ID NO:419)NVVQTDPRTWLLVATGLSGVAQARGSGALQLSGSRAGEIEAWLPLVDLG>BL;Rv0810c, H37Rv2.tab 904908:905087 reverse MW:6900MGRGRAKAKQTKVARELKYSSPQTDFQRLQRELSGTGTDRLDGDGPSDDDSWNDEDDWRR(SEQ ID NO:420)>BL;Rv0813C, H37Rv2.tab 907341:908018 reverse MW:23868VSSGAGSDATGAGGVHAAGSGDRAVAAAVERAKATAARNIPAFDDLPVPADTANLREGADLNNALLALLPLVGVWRGEGE(SEQ ID NO:421)GRGPDGDYRFGQQIVVSHDGGDYLNWESRSWRLTATGDYQEPGLREAGFWRFVADPYDPSESQAIELLLAHSAGYVELFYGRPRTQSSWELVTDALARSRSGVLVGGAKRLYGIVEGGDLAYVEERvDADGGLVPHLSARLSRFVG>BL;Rv0817C, H37Rv2.tab 910033:910842 reverse MW:28567MPMRKVLVGVTGAAIVVAVLIVGAVGADFGASIYAEYRLSTTVRKAANLRSDPFVAILRFPFIPQAMREHYAELEIKAFA(SEQ ID NO:422)VEHAGSGTATLEATMHSIDLSYASWLIRPDAKLPVGELESRIIIDSMHLGRYLGISDLMVAAPRQESNDATGGTTESGISGSRGLVFSGTPISANFAHRVSVLVDLSVASDDRATLVITPTAVVTGPDTADQPVPDDKRDAVLHAFASKLPNQKLPFGVVPNTVGARGSDVIIEGITRGVTISLDEFKQS>BL;Rv0819, H37Rv2.tab 911736:912680 forward MW:33567VTALDWRSALTADEQRSVRALVTATTAVDGVAPVGEQVLRELGQQRTEHLLVAGSRPGGPIIGYLNLSPPRGAGGANAEL(SEQ ID NO:423)VVHPQSRRRGIGTAMARAALAKTAGRNQFWAHGTLDPARATASALGLVGVRELIQMRRPLRDIPEPTIPDGVVIRTYAGTSDDAELLRVNNAAFAGHPEQGGWTAVQLAERRGEAWFDPDGLILAFGDSPRERPGRLLGFHWTKVHPDHPGLGEVYVLGVDPAAQRRGLGQMLTSIGIVSLARRLGGRKTLDPAVEPAVLLYVESDNVAAVRTYQSLGFTTYSVDTAYALAGTDN>BL;Rv0862C, H37Rv2.tab 960345:962612 reverse MW:79667MTEHTPDIPLGSWLAALPDERLTQLLELRPDLAQPPPGSIAALAARAQARQSVKAATDELDFLRLAVFDALLVLQADTAP(SEQ ID NO:424)VPIVRLLAVIGDRAAQADVLGALADLKQRALAWGETAVRVATDAGTALPWHPGQVTLEGSSRSGDQLADLIAGLDPAQRDVLDKLLQGSPVGRTRDAAPGAPSDRPVPRLLAMGLLRRIDAETVILPRHVGQVLRGEQPGPMELTAPDPVVSTTTPDDADAAAAGAVIDLLREVDVLLENLGATPVAELRSGGLGVREFKRLAKATGIDEPRLGLILEIAAAAGLIASGMPDPEPPHSDGPFWAPTVAADRFATMSPAERWHLLASAWLDLPGRPALIGTRGPDAKPYGALSDSLFSTAAPLDRRLLLGMLAELPAGAGVDASRASATLIWRRPRWARRLQPAPIADLLTEGHALGLVGRGAISTPARALLDEALEPATAPAAAVGVMARALPKPIDHFLVQADLTVVVPGPLQRELADDLTTVATVESAGTAMVYRVSEQSIRHALDVGKSRDWLQEFFANRSKTPVPQGLTYLIDDVARRHGQLRIGMAASFVRCEDPTLLAQVVAAPEADGLALRALAPTVAVSPAPISEVLVTLRCAGFAPAAEDSTGAVVDVRTRGARVPTPQRRRPYRPPPRPNSEALKAVVAVLREVTAAPFANVRVDPAVTMSLLQRAAKDQATLVISYLDAAGVATQRVVAPITLRGGQLVAFDSSSGRLRDFAIHRITLVVSAHDR>BL;Rv0863, H37Rv2.tab 962599:962877 forward MW:10079VCSVIADQRRPDQPCGVGGCKTCQNGFVADIAEGKARKTRYVDHGWPTTDPDDHAVSELVTDRTGALSPFGELTFPVPSD(SEQ ID NO:425)DLPYIHPVTVINR>BL;Rv0875c, H37Rv2.tab 973809:974294 reverse MW:17800VKRGVATLPVILVILLSVAAGAGAWLLVRGHGPQQPEISAYSHGHLTRVGPYLYCNVVDLDDCQTPQAQGELPVSERYPV(SEQ ID NO:426)QLSVPEVISRAPWRLLQVYQDPANTTSTLFRPDTRLAVTIPTVDPQRGRLTGIVVQLLTLVVDHSGELRDVPHAEWSVRLIF>BL;Rv0876C, H37Rv2.tab 974294:975937 reverse MW:57938MAPTPGRRTRNGSVNGHPGMANYPPDDANYRRSRRPPPMPSANRYLPPLGEQPEPERSRVPPRTTRAGERITVTRAAAMR(SEQ ID NO:427)SREMGSRMYLLVHRAATADGADKSGLTALTWPVMANFAVDSAMAVALANTLFFAAASGESKSRVALYLLITIAPFAVIAPLIGPALDRLQHGRRvALALSFGLRTALAVVLIMNYDGATGSFPSWVLYPCALANMVFSKSFSVLRSAVTPRVMPPTIDLVRVNSRLTVFGLLGGTIAGGAIAAGVEFVCTHLFQLPGALFVVVAITIAGASLSMRIPRWVEVTSGEVPATLSYHRDRGRLRRRWPEEVKNLGGTLRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKAHEANGWVQLGMLGLIGAAAAVGNFAGNFTSARLQLGRPAVLVVRCTVLVTVLAIAAAVAGSLAATAIATLITAGSSAIAKASLDASLQHDLPEESRASGFGRSESTLQLAWVLGGAVGVLVYTELWVGFTAVSALLILGLAQTIVSFRGDSLIPGLGGNRPVMAEQETTRRGAAVAPQ>BL;Rv0877, H37Rv2.tab 976075:976860 forward MW:27437)VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVAS(SEQ ID NO:428)YSGADHATISEVVLVPGPTALLAPDWVPWEQRvRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKPAES>BL;Rv0879c, H37Rv2.tab 978484:978756 reverse MW:9512MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAOLATGLLGTTIFVWQLAAARRGARGA(SEQ ID NO:429)QAGLETYLDPK>BL;Rv0883C, H37Rv2.tab 980509:981267 reverse MW:27373MRELKVVGLDADGKNIICQGAIPSEQFKLPVDDRLRAALRDDSVQPEQAQLDIEVTNVLSPKEIQARIRAGASVEQVAAA(SEQ ID NO:430)SGSDIARIRRFAHPVLLERSRAAELATAAHPVLADGPAVLTMQETVAAALVARGLNPDSLTWDAWRNEDSRWTVQLAWKAGRSDNLAHFRFTPGAHGGTATAIDDTAHELINPTFNRPLRPLAPVAHLDFDEPEPAQPTLTVPSAQPVSNRRGKPAIPAWEDVLLGVRSGGRR>BL;Rv0885, H37Rv2.tab 982762:983781 forward MW:39798MDRTRIVRRWRRNMDVADDAEYVEMLATLSEGSVRRNFNPYTDIDWESPEFAVTDNDPRWILPATDPLGRHPWYQAQSRE(SEQ ID NO:431)RQIEIGMWRQANVAKVGLHFESILIRGLMNYTFWMPNGSPEYRYCLHESVEECNHTMMFQEMVNRvGADVPGLPRRLRWVSPLVPLVAGPLPVAFFIGVLAGEEPIDHTQKNVLREGKSLHPIMERvMSIHVAEEARHISFAHEYLRKRLPRLTRMQRFWISLYFPLTMRSLCNAIVVPPKAFWEEFDIPREVKKELFFGSPESRKWLCDMFADARMLAHDTGLMNPIARLVWRLCKIDGKPSRYRSEPQRQHLAAAPAA>BL;Rv0909, H37Rv2.tab 1014681:1014857 forward MW:6403MGILDKVKNLLSQNADKVETVINKAGEFVDEQTQGNYSDAIHKLHDAASNVVGMSDQQS(SEQ ID NO:432)>BL;Rv0910, H37Rv2.tab 1014866:1015297 forward MW:15754MAKLSGSIDVPLPPEEAWMhASDLTRYREWLTIHKVWRSKLPEVLEKGTVVESYVEVKGMPNRIKWTIVRYKPPEGMTLN(SEQ ID NO:433)GDGVGGVKVKLIAKVAPKEHGSVVSFDVHLGGPALLGPIGMIVAAALRADIRESLQNFVTVFAG>BL;Rv0912, H37Rv2.tab 1016236:1016682 forward MW:15438MTRRLRPGWLVALSAAVIAASTWMPWLTTTVGGGGWVNAIGGTHGSLELPHGFGPGQLIVLLSSTLLVVGAMAGRGLSVK(SEQ ID NO:434)LSSIAALVVSLLIVALTVWYYKLNVNPPVSAEYGLYFGAAGGVCAVGCSLWAAVSAASPGRRRHREVVR>BL;Rv0948C, H37Rv2.tab 1057649:1057963 reverse MW:11770MRPEPPHHENAELAAMNLEMLESQPVPEIDTLREEIDRLDAEILALVKRRAEVSKAIGKARMASGGTRLVHSREMKVIER(SEQ ID NO:435)YSELGPDGKDLAILLLRLGRGRLGH>BL;Rv0954, H37Rv2.tab 1065127:1066035 forward MW:30203MTYSPGNPGYPQAQPAGSYGGVTPSFAMADEGASKLPMYLNIAVAVLGLAAYFASFGPMFTLSTELGGGDGAVSGDTGLP(SEQ ID NO:436)VGVALLAALLAGVALVPKAKSHVTVVAVLGVLGVFLMVSATFNKPSAYSTGWALWVVLAFIVFQAVAAVLALLVETGAITAPAPRPKFDPYGQYGRYGQYGQYGVQPGGYYGQQGAQQAAGLQSPGPQQSPQPPGYGSQYGGYSSSPSQSGSGYTAQPPAQPPAQSGSQQSHQGPSTPPTGFPSFSPPPPVSAGTGSQAGSAPVNYSNPSGGEQSSSPGGAPV>BL;Rv0955, H37Rv2.tab 1066078:1067442 forward MW:46056VNRVSASADDRAAGARPARDLVRvAFGPGVVALGIIAAVTLLQLLIANSDMTGAWGAIASMWLGVHLVPISIGGRALGVM(SEQ ID NO:437)PLLPVLLMVWATARSTARATSPQSSGLVVRWVVASALGGPLLMAAIALAVIHDASSVVTELQTPSALRAFTSVLVVHSVGAATGVWSRvGRRALAATALPDWLHDSMRAAAAGVLALLGLSGVVTAGSLVVHWATMQELYGITDSIFGQFSLTVLSVLYAPNVIVGTSAIAVGSSAHIGFATFSSFAVLGGDIPALPILAAAPTPPLGPAWVALLIVGASSGVAVGQQCARRALPFVAANAKLLVAAVAGALVMAVLGYGGGGRLGNFGDVGVDEGALVLGVLFWFTFVGWVTVVIAGGISRRPKRLRPAPPVELDADESSPPVDMFDGAASEQPPASVAEDVPPSHDDIANGLKAPTADDEALPLSDEPPPRAD>BL;Rv0966c, H37Rv2.tab 1077236:1077835 reverse MW:22210MSNSAQRDARNSRDESARASDTDRIQIAQLLAYAAEQGRLQLTDYEDRLARAYAATTYQELDRLRADLPGAAIGPRRGGE(SEQ ID NO:438)CNPAPSTLLLALLGGFERRGRWNVPKKLTTFTLWGSGVLDLRYADFTSTEVDIRAYSIMGAQTILLPPEVNVEIHGHRVMGGFDRKVVGEGTRGVPTVRIRGFSLWGDVGIKRKPRKPRK>BL;Rv0970, H37Rv2.tab 1081052:1081681 forward MW:22887MIHDLMLRWVVTGLFVLTAAECGLAIIAKRRPWTLIVNNGLHFANAVAMAVMAWPWGARVPTTGPAVFFLLAAVWFGATA(SEQ ID NO:439)VVAVRGTATRGLYGYHGLMMLATAWMYAAMNPRLLPVRSCTEYATEPDGSMPANDMTAMNMPPNSGSPIWFSAVNWIGTVGFAVAAVFWACRFVMERRQEATQSRLPGSIGQANMAAGMAMLFFAMLFPV>BL;Rv0996, H37Rv2.tab 1112384:1113457 forward MW:39519MPSIPQSLLWISLVVLWLFVLVPMLISKRDAVRRTSDVALATRvLNGGAGARLLKRGGPAAGHRWGYLPPEGQGDDPDWK(SEQ ID NO:440)PEEDWRDDPVEDGFADVEHDIDEDQEADDARRRGAVVMKVAAPQTAGADEPDYLDVDVVEEDSEALPVGAGAAVGESADEADAEAADGVAGHADPEADPVEYEYEYEYVEDTCGLELEEDDQEAPPTVASGTSRRRRFDTKTAAAVSARKYTFRKRALIVMAVILVGSAAAAFELTPVAWWICGSATGVTVLYLAYLRRQTRIEEKVRRRRMQRIARARLGVENTRDREYDVVPSRLRRPGAVVLEIDDEDPIFTHLESAAPIRNYGWPRDLPRAVGQ>BL;Rv0998, H37Rv2.tab 1114748:1115746 forward MW:35608LDGIAELTGARVEDLAGMDVFQGCPAEGLVSLAASVQPLRAAAGQVLLRQGEPAVSFLLISSGSAEVSHVGDDGVAIIAR(SEQ ID NO:441)ALPGMIVGEIALLRDSPRSATVTTIEPLTGWTGGRGAFATMVHIPGVGERLLRTARQRLAAFVSPIPVRLADGTQLMLRPVLPGDRERTVHGHIQFSGETLYRRFMSARVPSPALMHYLSEVDYVDHFVWVVTDGSDPVADARFVRDETDPTVAEIAFTVADAYQGRGIGSFLIGALSVAARVDGVERFAARMLSDNVPMRTIMDRYGAVWQREDVGVITTMIDVPGPGELSLGREMVDQINRVARQVIEAVG>BL;Rv1000, H37Rv2.tab 1116531:1117148 reverse MW:22648MCDKLGGVAIAVQGALFEHNERRQLGDGAFIDIRSGWLTGGEELLDALLSTVPWRAERRQMYDRVVDVPRLVSFHDLTIE(SEQ ID NO:442)DPPHPQLARMRRRLNDIYGGELGEPFTTAGLCYYRDGSDSVAWHGDTIGRGSTEDTMVAIVSLGATRVFALRPRGRGPSLRLPLAHGDLLVMGGSCQRTFEHAVPKTSAPTGPRVSIQFRPRDVR>BL;Rv1024, H37Rv2.tab 1145858:1146541 forward MW:24570MPEAKRPESKRRSPASRPGKAGDSVRGGRATKPSAKPSTPAPHASRKTTRTPHEHIVEPIKRAITESVEKRSEQRLGFTA(SEQ ID NO:443)RRAAILAAVVCVLTLTIARPVRTYFAQRAEMEQLAATEANLRRQIADLEEQQVKLADPAYIAAQARERLGFVMPGDIPFQVQLPSTPLAPPQPGSDAATATNNEPWYTALWHTIADDPHLPPAAPPAPEPGRPGPLPPASPNPEQPGG>BL;Rv1025, H37Rv2.tab 1146561:1147025 forward MW:16593VVTRQLGRAPRGVLAIAYRCPNGEPGVVKTAPRLPDGTPFPTLYYLTHPVLTAAASRLETTGLMREMNRRLGQDAELAAA(SEQ ID NO:444)YRRAHESYLSERDALEPLGTTVSAGGMPDRvKCLHVLIAHSLAKGPGLNPFGDEALALLAAEPRTAATLVAGQWR>BL;Rv1081c, H37Rv2.tab 1205987:1206418 reverse MW:15384MTHTPIPRPDARYGRPRLSRRARRRvAIALGVLVAAAGIVIAVIGYQRISTSAVTGSLVGYRLVDDETASVTISVTRSDP(SEQ ID NO:445)SRPVACIVRVRATNGSETGRRELLVPPSEATTVQVTTTVKSSQPPVMADVYGCGTEVPSYLRLP>BL;Rv1083, H37Rv2.tab 1207383:1207646 forward MW:9263VNQILLSVIAEGGPGNTGPDFGKASPVGLLVIVLLVIATLFLVRSMNQQLKKVPKSFDRDHPELDQAADEGTDRDGPARP(SEQ ID NO:446)PGPPHESG>BL;Rv1100, H37Rv2.tab 1228683:1229381 forward MW:24562MVGDCPRSRTVRWSWDTGHVTAEPQPTPRPAKPRLLQDGRDMFWSLAPLVVGCILLAGLVGMCSFQLGGTKRGPIPSYDA(SEQ ID NO:447)AQALRADAKTLGFPIRLPQLPGGWTPNSGGRGGIENGRADPATGQRRNAATSIVGFISPTGRYLSLTQSNADEDKLVGSIHPSMYPTGTVDVGGTRWVVYEGSDENGAVEPVWTTRLTGPGGATQLAITGAGSIDQFRTLASATQSQPPLPAR>BL;Rv1109c, H37Rv2.tab 1235460:1236095 reverse MW:22957MATAPYGVRLLVGAATVAVEETMKLPRTILMYPMTLASQAAHVVMRFQQGLAELVIKGDNTLETLFPPKDEKPEWATFDE(SEQ ID NO:448)DLPDALEGTSIPLLGLSDASEAKNDDRRSDGRFALYSVSDTPETTTASRSADRSTNPKTAKHPKSAAKPTVPTPAVAAELDYPALTLAQLRARLHTLDVPELEALLAYEQATKARAPFQTLLANRITRATAK>BL;Rv1111c, H37Rv2.tab 1237212:1238192 reverse MW:36985VSAQRARSAVQASHRSIHPHIPGVPWWAAILIAVTATAIGYAIDAGSGHKALTLVFTGCYIAGCVGAVLAVRQSDLFTAL(SEQ ID NO:449)VQPPLILFCAVPGAYWLFHGGTIGKFKDLLINCGYSLIERFPLMLGTAAGVLLIGLVRWYLGTALFDSIARKLSSLMTGDSDDDGGRRSAQRPARTRSRHARPPSEDNREPIAERRSRRRPRPQNDPHPRRNAMERPAPRSSRFDSYRSYQPSEPSGPAEPVNRYERRGARYQPYARYEPTYEPQRRRARPSEPTNPTHHPISQVRYRCSATRDARRDNYREEQRFDRRDRSRAPRRPPAESWEYDV>BL;Rv1155, H37Rv2.tab 1281429:1281869 forward MW:16300MARQVFDDKLLAVISGNSIGVLATIKHDGRPQLSNVQYHFDPRKLLIQVSIAEPRAKTRNLRRDPRASILVDADDGWSYA(SEQ ID NO:450)VAEGTAQLTPPAAAPDDDTVEALIALYRNIAGEHSDWDDYRQAMVTDRRvLLTLPISHVYGLPPGMR>BL;Rv1157c, H37Rv2.tab 1283059:1284171 reverse MW:36448VRRLTNTEHRENTTVASTWSVCKGLAAVVITSAAAFALCPNAAADPATPQPNPTQQLPGLPALAQLSPIIQQAAMNPAQA(SEQ ID NO:451)TQLLMAAASAFAGNPAVPTESKNVASSVNQFVAEPTNPDSAALGVPAPHGVALPEAIPVPHVPPLGAEPGVQAHLPTGIDPSHAAGPAPAVAPTVTPPVAAPPASAPAPAPDAAQPVAVPGPPPAPPAPRAAAPAPASAAPAPAAAPAPASGFGADAPPTQDFMYPSIGPNCVADGSNSIATALSVAGPAKIPLPGPGPGQTAYVFTAVGTPGPADVQRLPLNVTWVNLTTGKSGSATLRPRSDINPDGPTTLTVIADTGSGSIMSTIFGQVTTKDRQCQFMPTIGSTVVP>SL;Rv1158c, H37Rv2.tab 1284182:1284862 reverse MW:21401MPTIWTFVRAAAVLVGSSAALLTGGIAHADPAPAPAPAPNIPQQLISSAANAPQILQNLATALGATPPLSAPKVAEPAPA(SEQ ID NO:452)APGITATFPGLTPAAPAAAAAPALTPSIPGVNAPIPGITPAAPALPVTAPAAAPTIPGVNAPIPGITAPAPAAAAVPASVPGVPSAKVDLPQLPYLPLQVPQQLSLPADLPALASGVIPAAPIAPTPPAPGAPALPPGPPSLLAALP>BL;Rv1159A, H37Rv2.tab 1286284:1286568 reverse MW:10379MAVLTDEQVDAALHDLNGWQRAGGVLRRSIKFPTFMAGIDAVRRVAERAEEVNHHPDIDIRWRTVTFALVTHAVGGITEN(SEQ ID NO:453)DIAMAHDIDANFGA>BL;Rv1171, H37Rv2.tab 1301307:1301744 forward MW:15185VGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTT(SEQ ID NO:454)SSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG>BL;Rv1184c, H37Rv2.tab 1324535:1325611 reverse MW:37818MKRvIAGAFAVWLVGWAGGFGTAIAASEPAYPWAPGPPPSPSPVGDASTAKVVYALGGARMPGIPWYEYTNQAGSQYFPN(SEQ ID NO:455)AKHDLIDYPAGAAFSWWPTMLLPPGSHQDNMTVGVAVKDGTNSLDNAIHHGTDPAAAVGLSQGSLVLDQEQARLANDPTAPAPDKLQFTTFGDPTGRHAFGASFLARIFPPGSHIPIPFIEYTMPQQVDSQYDTNEVVTAYDGFSDFPDRPDNLLAVANAAIGAAIAHTPIGFTGPGDVPPQNIRTTVNSRGATTTTYLVPVNHLPLTLPLRYLGMSDAEVDQIDSVLQPQIDAAYARNDNWFTRPVSVDPVRGLDPLTAPGSIVEGARGLLGSPAFCG>BL;Rv1209, H37Rv2.tab 1353157:1353522 forward MW:13089VALVLVYLVVLVLVAIVLFAAASLLFGRGEQLPPLPRATTATTLPAFGVTRADVDAVKFTQVLRGYKTSEVDWVLERLGR(SEQ ID NO:456)ELEALRSQLGAIHASSEDAEAESDASNPSRGETVVHYRSDPA>BL;Rv1211, H37Rv2.tab 1354243:1354467 forward MW:7810MLGADQARAGGPARIWREHSMAAMKPRTGDGPLEATKEGRGIVMRVPLEGGGRLVVELTPDEAAALGDELKGVTS(SEQ ID NO:457)>BL;Rv1222, H37Rv2.tab 1365344:1365805 forward MW:16250MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQ(SEQ ID NO:458)SRAPAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR>BL;Rv1249c, H37Rv2.tab 1393197:1393982 reverse MW:27571MSARRIRSWKRFDNRSANAAEPDPQLAGTGGRPKVSTRALAQVIERSSRIQGPAAQAYVARLRRAHPGASPAKIVAKLEK(SEQ ID NO:459)RFLSVVTASGAAVGAAATLPGIGTLAAWFAAAGEVVVFLEATALFVLALASVHAIPLDHRERRRALVLAVLVGDNTTAVADLLGPGRTSGGWVSETMASLPLPAISSLNSRMLKYVVKRFALKRGALMFGKLVPMGIGAIIGAIGNRLVGKKLVRNARSAFGTPPARWPVTLNVLPTVRDAS>BL;Rv1251c, H37Rv2.tab 1395824:1399240 reverse MW:123118VFVTGDSIVYSASDLAAAARCQYALLREFDAKLGRGPAVAVDDELMARAAVLGSAHEGRRLDQLRHEFGDAVAIIGRPAY(SEQ ID NO:460)TPAGLAAAADATRRAIANHAPVVYQAAMFDGRFVGFADFLIRDGHRYRVADTKLARSPTVTALLQLAAYADALVHSGVPVAADAELELGDGTIVRYRVGELIPVYRSQPALLQRLLDGHYTAGTAVRWDDERVQACFRCPQCTERLRASDDLLLVGGMRVRQRDKLLEAGITTIAELADHTAPVPGLTTNALGKLTAQAKLQIRQRDTGAPQFEIVDPRPLTLLPEPNPGDLFFDFEGDPLWTADGKQWGLEYLFGVLEAGRAGVFRPLWAHDRTAERQALTDFLAIVARRRRRHPNMNIYHYAPYEKTALLRLVGRYGIGEDDVDDLLRNGVLVDLYPLVRKSIRVGTDSFSLKALEPLYLGTQPRSGDVTTAADSINSYARYCELRAAGRIDEAATVLKEIEGYNHYDCRSTRALRDWLLMRAWEAGVTPIGAQPVPDADPIDDGDSLASVLSKFTGDAAAGERTPEQTAVALLAAARGYHRREDKPFWWAHFDRLNYPVDEWSDSTDVFLASEASVTVDWHMPPRARKPQRRVRLTGELARGDLNGNVFALYEPPAPPGMTDNPDRRAAGPAAVVETDDPTVPTEVVIVERTGSDGNTFQQLPFALAPGPPVPTTALRESIESTAAAVASGSPQLPSTALMDVLLRRPPRTRSGAALPRSSDPVTDIAAAALDLDSSYLAVHGPPGTGKTYTAARVIAELVTEHAWRIGVVAQSHATVENLLEGVISAGLDPGQVAKKPHDHTAGRWQSIDGSQYTEFIRDTAGCVIGGTAWDFANGNRvPKASLDLLVIDEAGQFCLANTIAVAPAATNLLLLGDPQQLPQVSQGTHPEPVDTSALSWLVDGQHTLPDERGYFLDRSYRMHPAVCAAVSALSYEGRLCSHTERTAVRRLDGYPPGVHTRGVHHKGNSIESPEEAEAILAELRQLLGSPWTDEHGTRPLAASDVLVLAPYNAQVALVRRRLASAGLGGADGVRVGTVDKFQGGQAPVVFISMTASSADDVPRGISFLLNRNRLNVAVSRAQYAAVIVRSELLTQYLPATPDGLVDLGAFLGLTSTS>BL;Rv1259, H37Rv2.tab 1407339:1408235 forward MW:31783MNIAAESSAKPVWGPPNFCAAAARMQDVRVLMHPKTGRAFRSPVEPGSGWPGDPATPQTPVAADAAQVSALAGGAGSICE(SEQ ID NO:461)LNALISVCRACPRLVSWREEVAVVKRRAFADQPYWGRPVPGWGSKRPRLLILGLAPAAHGANRTGRMFTGDRSGDQLYAALHRAGLVNSPVSVDAADGLRANRIRITAPVRCAPPGNSPTPAERLTCSPWLNAEWRLVSDHIRAIVALGGFAWQVALRLAGASGTPKPRFGHGVVTELGAGVRLLGCYHPSQQNMFTGRLTPTMLDDIFREAKKLAGIE>BL;Rv1276c, H37Rv2.tab 1425441:1425914 reverse MW:16461MRHAKSAYPDGIADHDRPLAPRGIREAGLAGGWLRANLPAVDAVLCSTATRARQTLAHTGIDAPARYAERLYGAAPGTVI(SEQ ID NO:462)EEINRVGDNVTTLLVVGHEPTTSALAIVLASISGTDAAVAERISEKFPTSGIAVLRvAGHWADVEPGCAALVGFHVFR>BL;Rv1277, H37Rv2.tab 1426164:1427414 forward MW:44758VSPRPGPAGRGPAPCRCADLHSLCVDSHALRRDGMRFLHTADWQLGMTRHFLAGDAQPRYSAARRDAVAGLKALAADVGA(SEQ ID NO:463)EFVVVAGDVFEHNQLAPQIVGQSLEANRVIGLPVYLLPGNHDPLDASSVYTSTLFRAERPDNVVVLDRAGVHEVRPGVQIVAAPWRSKAPTTDPVAEVLAGLPTDAAIRLLVAHGGVDALDPDHDKPSLIRLAALDDALTRQAIHYVALGDKHSLTQVGSSGRVWYSGAPEVTNFDDVEPDPGHVLVVDIDESDPRHPVTVDARRIGRWRFVTLHHQVDTSRDIADLDLNLDLMTDKDRTVVRLALTGSLTVTDRAALDTCLDKYARLFAWLGLWERNTDLAVIPVDAEFTDLGIGGFAAAAVDELVATARGGODESAVDAQAALALLLRLADRGAA>BL;Rv1278, H37Rv2.tab 1427414:1430038 forward MW:93319VKLHRLALTNYRGIAHRDVEFPDHGVVVVCGANEIGKSSMVEALDLLLEYKDRSTKKEVKQVKPTNADVGSEVIAEISSG(SEQ ID NO:464)PYRFVYRKRFHKRCETELTVLAPRREQLTGDEAHERVRTMLAETVDTELWHAQRVLQAASTAAVDLSGCDALSRALDLAAGDDAALSGTESLLIERIEAEYARYFTPTGRPTGEWSAAVSRLAAAEAAVADCAAAVAEVDDGVRRHTELTEQVAELSQQLLAHQLRLEAARVAAEKIAAITDDAREAKLIATAAAATSGASTAAHAGRLGLLTEIDTRTAAVVAAEAKARQAADEQATARAEAEACDAALTEATQVLTAVRLRAESARRTLDQLADCEEADRLAARLARIDDIEGDRDRVCAELSAVTLTEELLSRIERAAAAVDRGGAQLASISAAVEFTAAVDIELGVGDQRVSLSAGQSWSVTATGPTEVKVPGVLTARIVPGATALDFQAKYAAAQQELADALAAGEVADLAAARSADLCRRELLSRRDQLTATLAGLCGDEQVDQLRSRLEQLCAGQPAELDLVSTDTATARAELDAVEAARIAAEKDCETRRQIAAGAARRLAETSTRATVLQNAAAAESAELGAAMTRLACERASVGDDELAAKAEADLRVLQTAEQRVIDLADELAATAPDAVAAELAEAADAVELLRERHDEAIRALHEVGVELSVFGTQGRKGKLDAAETEREHAASHHARVGRRARAARLLRSVMARHRDTTRLRYVEPYRAELHRLGRPVFGPSFEVEVDTDLRIRSRTLDDRTVPYECLSGGAKEQLGILARLAGAALVAKEDAVPVLIDDALGFTDPERLAKMGEVFDTIGADGQVIVLTCSPTRYGGVKGAHRIDLDAIQ>BL;Rv1303, H37Rv2.tab 1459766:1460248 forward MW:16863VTTPAQDAPLVFPSVAFRPVRLFFINVGLAAVAMLVAGVFGHLTVGMFLGLGLLLGLLNALLVRRSAESITAKEHPLKRS(SEQ ID NO:465)NALNSASRLAIITILGLIIAYIFRPAGLGVVFGLAFFQVLLVATTALPVLKKLRTATEEPVATYSSNGQTGGSEGRSASDD>BL;Rv1312, H37Rv2.tab 1467688:1468128 forward MW:16594MSAPMIGMVVLVVVLGLAVLALSYRLWKLRQGGTAGIMRDIPAVGGHGWRHGVIRYRGGEAAFYRLSSLRLWPDRRLSRR(SEQ ID NO:466)GVEIISRRAPRGDEFDIMTDEIVVVELCDSTQDRRVGYEIALDRGALTAFLSWLESRPSPRARRRSM>BL;Rv1324, H37Rv2.tab 1487161:1488072 forward MW:32138VTRPRPPLGPAMAGAVDLSGIKQRAQQNAAASTDADRALSTPSGVTEITEANFEDEVIVRSDEVPVVVLLWSPRSEVCVD(SEQ ID NO:467)LLDTLSGLAAAAKGKWSLASVNVDVAPRVAQIFGVQAVPTVVALAAGQPISSFQGLQPADQLSRWVDSLLSATAGKLKGAASSEESTEVDPAVAQARQQLEDGDFVAARKSYQAILDANPGSVEAKAAIRQIEFLIRATAQRPDAVSVADSLSDDIDAAFAAADVQVLNQDVSAAFERLIALVRRTSGEERTRvRTRLIELFELFDPADPEVVAGRRNLANALY>BL;Rv1332, H37Rv2.tab 1500926:1501579 forward MW:24054MPPVCGRRCSRTGEIRGYSGSIVRRWKRVETRDGPRFRSSLAPHEAALLKNLAGAMIGLLDDRDSSSPSDELEEITGIKT(SEQ ID NO:468)GHAQRPGDPTLRRLLPDFYRPDDLDDDDPTAVDGSESFNAALRSLHEPEIIDAKRVAAQQLLDTVPDNGGRLELTESDANAWIAAVNDLRLALGVMLEIGPRGPERLPGNHPLAAHFNVYQWLTVLQEYLVLVLMGSR>BL;Rv1342c, H37Rv2.tab 1508187:1508546 reverse MW:13382MTAPETPAAQHAEPAIAVERIRTALLGYRIMAWTTGLWLIALCYEIVVRYVVKVDNPPTWIGVVNGWVYFTYLLLTLNLA(SEQ ID NO:469)VKVRWPLGKTAGVLLAGTIPLLGIVVEHFQTKEIKARFGL>BL;Rv1343c, H37Rv2.tab 1508546:1508923 reverse MW:14241VSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLGYALQWPLFAWFCVYAYRNFVRYEETPPQPPTGGAAA(SEQ ID NO:470)EIPAGLLPERPKPAQQPPDDPVLREYNAYLAELAKDDARKQNRTTA>BL;Rv1390, H37Rv2.tab 1565093:1565422 forward MW:11810VSISQSDASLAAVPAVDQFDPSSGASGGYDTPLGITNPPIDELLDRvSSKYALVIYAAKRARQINDYYNQLGEGILEYVG(SEQ ID NO:471)PLVEPGLQEKPLSIALREIHADLLEHTEGE>BL;Rv1417, H37Rv2.tab 1592150:1592611 forward MW:16351VTAAPNDWDVVLRPHWTPLFAYAAAFLIAVAHVAGGLLLKVGSSGVVFQTADQVAMGALGLVLAGAVLLFARPRLRVGSA(SEQ ID NO:472)GLSVRNLLGDRIVGWSEVIGVSFPGGSRWARIDLADDEYIPVMAIQAVDKDRAVAANDTVRSLLARYRPDLCAR>DL;Rv1476, H37Rv2.tab 1666204:1666761 forward MW:19601MTGPYFPQTIPFLPSYIPQDVDMTAVKAEVAALGVSAPPAATPGLLEVVQHARDEGIDLKIVLLDHNPPNDTPLRDIATV(SEQ ID NO:473)VGADYSDATVLVLSPNYVGSYSTQYPRvTLEAGEDHSKTGNPVQSAQNFVHELSTPEFPWSALTIVLLIGVLAAAVGARLMQLRGRRSATSTDAAPGAGDDLNQGV>BL;Rv1590, H37Rv2.tab 1791334:1791570 forward MW:8602MVEIVAGKQRAPVAAGVYNVYTGELADTATPTAARMGLEPPRFCAQCGRRMVVQVRPDGWWARCSRHGQVDSADLATQR(SEQ ID NO:474)>BL;Rv1591, H37Rv2.tab 1791570:1792232 forward MW:23151VTEPPGFGGPSEPSGAPRTSRTRAVLFVMLGLSATGVLVGGLWAWIAPPIHAVVAITRAGERVHEYLGSESQNFPIAPFM(SEQ ID NO:475)LLGLLSVLAVVASALMWQWREHRGPQMVAGLSIOLTTAAAIAAGVGALVVRLRYGALDFDTVPLSRGDHALTYVTQAPPVFFARRPLQIALTLMWPAGIASLVYALLAAGTARDDLGGYPAVDPSSNARTEALETPQAPVS>BL;Rv1610, H37Rv2.tab 1809443:1810147 forward MW:24586VAANAGSVRPNRRARPMIGIAQLLLVVAAGALWMAARLPWVVIGSFDELGPPKEVTLTGASWSTALLPLALLMLAAAVAA(SEQ ID NO:476)LAVRGWPLRALAVLLAAASFAVGYLGISLWVVPDVAARGADLAHVPVVTLVGSARHYWGAVAAVLAAVCALLAAVFLMSSAAIRGSAGEDMARYAAPRARRSIARRQHSNAAGRAAPQDDGPDMGPRMSERMIWEALDEGRDPTDREQESDTEGR>BL;Rv1635c, H37Rv2.tab 1840575:1842242 reverse MW:60035MNASRPGAPPHAGLPSRRTAGDQDHRADPKVTRIMSASTLEQPAAAHVDELVARMRGRLLDPLAIAVLAAVISGAWASRP(SEQ ID NO:477)SLWFDEGATISASASRTLPELWSLLGHIDAVHGLYYLLMHGWFAIFPPTELWSRLPSCLAIGAAAAGVVVFAKQFSGRTTAVCAGAVFAILPRVTWAGIEARSSALSVAAAVWLTVLLVAAVRCNTQRRWLLYALVLMLSILVSINLALLVPAYATMVPLLASGKSRKSPVIWWTVVTAAALGAMTPFILFAMGQVWQVGWIAGLNRNIILDVIHRQYFDHSVPFAILAGLIVAAGIAAHLAGARGPGGDTHRLVLVSAAWIVVPTAVVLIYSATVEPIYYPRYLILTAPAAAVILAVCVVTIARKPWLIAGVVFLLAAAAFPNYFFTQRGPYAKEGWDYSQVADVISAHAKPGDCLLVDNTAGWRPGPIRALLATRPAAFRSLIDVERGTYGPKVGTLWDGHVAVWLTTAKIDKCPTLWTIANRDKSLPDHQVGEMLSPGTGFGRTPVYRFPSYLGFRIVERWQFHYSQVVKSTR>BL;Rv1647, H37Rv2.tab 1856774:1857721 forward MW:33939LAGSARTTYPCHVEVGPQDSESGAPDETATAMASPVPRQRSALRWLRTVNRSPGLVSFIHRARRLLPGDPEFGDPLSTAG(SEQ ID NO:478)EGGPRAAARAADRLLRDRDAASREVGLSVLQVWQALTEAVSRRPANPEVTLVFTDLVGFSTWSLHAGDDATLTLLRQVARAVESPLLDAGGHIVKRLGDGIMAVFRNPTVALRAVLVAQDAVKSLEVQGYTPRMRIGIHTGRPQRLAADWLGVDVNIAARVMERATKGGIMISQPTLDLIPQSELDALGVVARRVRKPVFASKPTGIPPDLAIYRIKTVSESTAADNFDEMSPDAQ>BL;Rv1693, H37Rv2.tab 1917756:1917929 forward MW:6094MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG(SEQ ID NO:479)>BL;Rv1697, H37Rv2.tab 1921542:1922720 forward MW:42423MRMSALLSRNTSRPGLIGIARVDRNIDRLLRRvCPGDIVVLDVLDLDRITADALVEAEIAAVVNASSSVSGRYPNLGPEV(SEQ ID NO:480)LVTNGVTLIDETGPEIFKKVKDGAKVRLYEGGVYAGDRRLIRGTERTDHDIADLMREAKSGLVAHLEAFAGNTIEFIRSESPLLIDGIGIPDVDVDLRRRHVVIVADEPSGPDDLKSLKPFIKEYQPVLVGVGTGADVLRKAGYRPQLIVGDPDQISTEVLKCGAQVVLPADADGHAPGLERIQDLGVGANTFPAAGSATDLALLLADHHGAALLVTAGHAANIETFFDRTRvQSNPSTFLTRLRVGEKLVDAKAVATLYRNHISGGAIALLALTMLIAIIVALWVSRTDGVVLHWIIDYWNRFSLWVQHLVS>BL;Rv1698, H37Rv2.tab 1922745:1923686 forward MW:32391MISLRQHAVSLAAVFLALAMGVVLGSGFFSDTLLSSLRSEKRDLYTQIDRLTDQRDALREKLSAADNFDIQVGSRIVHDA(SEQ ID NO:481)LVGKSVVIFRTPDAHDDDIAAVSKIVGQAGGAVTATVSLTQEFVEANSAEKLRSVVNSSILPAGSQLSTKLVDQGSQAGDLLGIALLSNADPAAPTVEQAQRDTVLAALRETGFITYQPRDRIGTANATVVVTGGALSTDAGNQGVSVARFAAALAPRGSGTLLAGRDGSANRPAAVAVTRADADMAAEISTVDDIDAEPGRITVILALHDLINGGNVGHYGTGHGAMSVTVSQ>BL;Rv1754c, H37Rv2.tab 1984982:1986670 reverse MW:60608MYRYQVRVQQRRSEMNRWVATRSRRHTYQWITDHKSPRDHYRHISELRTSIATSSPGRCDMSPIPRIVSVSLAWAAAIGL(SEQ ID NO:482)MVPIGLAPPAMAAPCSGDAANAPPPPSAIVTDPGATALGPVRPGHGPIPTGRKPRGANDRAPLPKLGPLISALLNPGARNAAPLQQQALVPRANPGPNPAPNPPATGPQPPNATQLTPNFAPAPDPAPAAAPDPGATLAGATTSLAEWVTGPDSPNKTLERFGISGTDLGIPWDNGDPANRQVLMIFGDTFGYCAVDGHQWRYNTLFRSQDRDLGNGVHVTSGDASNRYSGSPVRQPGFSKQLINSIKWARDETGIIPTAGIAVGKTQYVNFMSIRNWGRDGEWTTNYSGIAVSKDNGQTWGVFPGTIRASGPDSGGKARFVPGNENFQMGAYLKSNDGYLYSFGTPPGRGGSAYLARvPQRFVPDLTKYQYWNGDSNSWVPNKPDAATPVIPGPVGEMSVQYNTYLKQYLALYTNGMNDVVARTAPAPQGPWSAEQMLVSSWQMPGGIYAPMMHPWSTGKDVYFNLSLWSAYNVNLMHTVLP>BL;Rv1782, H37Rv2.tab 2017740:2019257 forward MW:53689VAEESRGQRGSGYGLGLSTRTQVTGYQFLARRTANALTRWRVRMEIEPGRRQTLAVVASVSAALVICLGALLWSFISPSG(SEQ ID NO:483)QLNESPIIADRDSGALYVRVGDRLYPALNLASARLITGRPDNPHLVRSSQIATMPRGPLVGIPGAPSSFSPKSPPASSWLVCDTVATSSSIGSLQGVTVTVIDGTPDLTGHRQILSGSDAVVLRYGGDAWVIREGRRSRIEPTNRAVLLPLGLTPEQVSQARPMSRALFDALPVGPELLVPEVPNAGGPATFPGAPGPIGTVIVTPQISGPQQYSLVLGDGVQTLPPLVAQILQNAGSAGNTKPLTVEPSTLAKMPVVNRLDLSAYPDNPLEVVDIREHPSTCWWWERTAGENRARVRVVSGPTIPVAATEMNKVVSLVKADTSGRQADQVYFGPDHANFVAVTGNNPGAQTSESLWWVTDAGARFGVEDSKEARDALGLTLTPSLAPWVALRLLPQGPTLSRADALVEHDTLPMDMTPAELVVPK>BL;Rv1794, H37Rv2.tab 2031066:2031965 forward MW:32399MDQQSTRTDITVNVDGFWMLQALLDIRHVAPELRCRPYVSTDSNDWLNEHPGMAVMREQGIVVNDAVNEQVAARMKVLAA(SEQ ID NO:484)PDLEVVALLSRGKLLYGVIDDENQPPGSRDIPDNEFRVVLARRGQHWVSAVRvGNDITVDDVTVSDSASIAALVMDGLESIHHADPAAINAVNVPMEEMLEATKSWQESGFNVPSGGDLRRMGISAATVAALGQALSDPAAEVAVYARQYRDDAKGPSASVLSLKDGSGGRIALYQQARTAGSGEAWLAICPATPQLVQVGVKTVLDTLPYGEWKTHSRV>DL;Rv1797, H37Rv2.tab 2035483:2036700 forward MW:44178MKAQRSFGLALSWPRVTAVFLVDVLILAVASHCPDSWQADHHVAWWVGVGVAAVVTLLSVVSYHGITVISGLATWVRDWS(SEQ ID NO:485)ADPGTTLGAGCTPAIDHQRRFGRDTVGVREYNGRLVSVIEVTCGESGPSGRHWHRKSPVPMLPVVAVADGLRQFDIHLDGIDIVSVLVRGGVDAAKASASLQEWEPQGWKSEERAGDRTVADRRRTWLVLRMNPQRNVAAVACRDSLASTLVAATERLVQDLDGQSCAARPVTADELTEVDSAVLADLEPTWSRPGWRHLKHFNGYATSFWVTPSDITSETLDELCLPDSPEVGTTVVTVRLTTRVGSPALSAWVRYHSDTRLPKEVAAGLNRLTGRQLAAVRASLPAPTHRPLLVIPSRNLRDHDELVLPVGQELEHATSSFVGQ>BL;Rv1828, H37Rv2.tab 2073081:2073821 forward MW:26411VSAPDSPALAGMSIGAVLDLLRPDFPDVTISKIRFLEAEGLVTPRRASSGYRRFTAYDCARLRFILTAQRDHYLPLKVIR(SEQ ID NO:486)AQLDAQPDGELPPFGSPYVLPRLVPVAGDSAGGVGSDTASVSLTGIRLSREDLLERSEVADELLTALLKAGVITTGPGGFFDEHAVVILQCARALAEYGVEPRHLRAFRSAADRQSDLIAQIAGPLVKAGKAGARDRADDLAREVAALAITLHTSLIKSAVRDVLHR>BL;Rv1830, H37Rv2.tab 2074841:2075515 forward MW:23988VTQLVTRARSARGSTLGEQPRQDQLDFADHTGTAGDGNDGAAAASGPVQPGLFPDDSVPDELVGYRGPSACQIAGITYRQ(SEQ ID NO:487)LDYWARTSLVVPSIRSAAGSGSQRLYSFKDILVLKIVKRLLDTGISLHNIRVAVDHLRQRGVQDLANITLFSDGTTVYECTSAEEVVDLLQGGQGVFGIAVSGAMRELTGVIADFNGERADGGESIAAPEDELASRRKHRDRKIG>BL;Rv1836c, H37Rv2.tab 2082606:2084636 reverse MW:69677MGRHSKPDPEDSVDDLSDGNAAEQQNWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYSASGSEDYPDIPPRPDWEPTG(SEQ ID NO:488)AEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLANPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAANDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRvSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS>BL;Rv1845c, H37Rv2.tab 2095221:2096168 reverse MW:32704VSALAFTILAVLLAGPTPALLARATWPLRAPRAANVLWQAIALAAVLSSFSAGIAIASRLLMPGPDGRPTTSFVGAAGRL(SEQ ID NO:489)GWPLWAAYITVFALTVLVGARLAVAVVRVATATRRRRAHHRMVVDLVGVGHNGALAQPCARARDLRVLDVAQPLAYCLPGVRSRVVVSEGTLTALADAEVAAILTHERAHLRARHDLVLEAFTAVHAAFPRLVRSANALGAVQLLVELLADDAAVRAAGRTPLARALVACASGRAPSGALAVGGPSTVLRVRRLSGRGNSAVLSAAAYLAAAAVLVVPTVALAVPWLTQLQRLFIA>BL;Rv1846c, H37Rv2.tab 2096186:2096599 reverse MW:15211MAKLTRLGDLERAVMDHLWSRTEPQTVRQVHEALSARRDLAYTTVMTVLQRLAKKNLVLQIRDDRAHRYAPVHGRDELVA(SEQ ID NO:490)GLMVDALAQAEDSGSRQAALVHFVERVGADEADALRRALAELEAGHGNRPPAGAATET>BL;Rv1861, H37Rv2.tab 2109165:2109467 forward MW:10330MDITATTEFSAMNLDGKTGIGWLGYIVIGGIAGWLASKIVKGGGSGILMNVVIGVVGAFGAGLVLNALGVDVNHGGYWFT(SEQ ID NO:491)FFVALGGAVVLLWIVCMVRKT>BL;Rv1871c, H37Rv2.tab 2121498:2121884 reverse MW:14663LNAAMNLKREFVNRVQRFVVNPIGRQLPMTMLETIGRKTGQPRRTAVGGRVVDNQFWMVSEHGEHSDYVYNIKANPAVRV(SEQ ID NO:492)RIGGRWRSGTAYLLPDDDPRQRLRGLPRLNSAGVRANGTDLLTIRVDLD>BL;Rv1883c, H37Rv2.tab 2133234:2133692 reverse MW:17280MCLDQVMEGSATVHMAAPPDKIWTLIADVRNTGRFSPETFEAEWLDGATGPALGARFRGHVRRNGIGPVYWTVCEPGREF(SEQ ID NO:493)GFAVLLGDRPVNNWHYRLTPTADGTEVTESFRLPPSVLTTVYYRVFGGWLRQRRNIRDMTKTLQRIKDLVEAG>BL;Rv1891, H37Rv2.tab 2139741:2140145 forward MW:14108MIRELVTTAAITGAAIGGAPVAGADPQRYDGDVPGMNYDASLGAPCSSWERFIFGRGPSGQAEACHFPPPNQFPPAETGY(SEQ ID NO:494)WVISYPLYGVQQVGAPCPKPQAAAQSPDGLPMLCLGARGWQPGWFTGAGFFPPEP>BL;Rv1893, H37Rv2.tab 2140486:2140701 forward MW:7467MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSIDIATHAVDRTKEVFTGKTDDEG(SEQ ID NO:495)>BL;Rv1906c, H37Rv2.tab 2152428:2152895 reverse MW:15536MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNP(SEQ ID NO:496)DGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP>BL;Rv1919c, H37Rv2.tab 2171064:2171525 reverse MW:16803MSGRKFSFEVTKTSSAPAATLFRLVTDGGNWATWAKPIVAQSSWARRGDPAPGGIGAIRKLGMWPVFVQEETVEYEQDRR(SEQ ID NO:497)HVYKLVGARTPVQDYFGEVVLTPNASGGTDLRWSGSFTEKVRGTGPVMRAALGGAVRFFAGQLVKAAEREAVRR>BL;Rv1976c, H37Rv2.tab 2218847:2219251 reverse MW:14997VRWIVDGMNVIGSRPDGWWRDRHRANVMLVERLEGWAITKARGDDVTVVFERPPSTAIPSSVVEVAHAPKAAANSADDEI(SEQ ID NO:498)VRLVRSGAQPQEIRVVTSDKALTDRVRDLGAAVYPAERFRDLIDPRGSNAARRTQ>BL;Rv2050, H37Rv2.tab 2307821:2308153 forward MW:12971MADRvLRGSRLGAVSYETDRNHDLAPRQIARYRTDNGEEFEVPFADDAEIPGTWLCRNGMEGTLIEGDLPEPKKVKPPRT(SEQ ID NO:499)HWDMLLERRSIEELEELLKERLELIRSRRRG>BL;Rv2054, H37Rv2.tab 2313125:2313835 forward MW:25183MTTIEIDAPAGPIDALLGLPPGQGPWPGVVVVHDAVGYVPDNKLISERIARAGYVVLTPNMYARGGRARCITRVFRELLT(SEQ ID NO:500)KRGRALDDILAARDHLLANPECSGRVGIVGFCMGGQFALVLSPRGFGATAPFYGTPLPRHLSETLNGACPIVASFGTRDPLGIGAANRLRKVTAAKNIPADIKSYPGAGHSFANKLPGQPLVRIAGFGYNEAATEDAWRRVFEFFGQHLRAGSPGEP>BL;Rv2061c, H37Rv2.tab 2316684:2317085 reverse MW:14782VTPTFSDLAEAQYLLLTTFTKDGRPKPVPIWAALDTDRGDRLLVITEKKSWKVKRIRNTPRVTLATCTLRGRPTSEAVEA(SEQ ID NO:501)TAAILDESQTGAVYDAIVKRYGIQGKLFTFVSKLRGGMRNNIGLELKVAESETG>BL;Rv2091c, H37Rv2.tab 2348561:2349292 reverse MW:26019MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPAYPQQYPQPTPGYAQPEQFGA(SEQ ID NO:502)QPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ>BL;Rv2111c, H37Rv2.tab 2370601:2370792 reverse MW:6944MAQEQTKRGGGGGDDDDIAGSTAAGQERREKLTEETDDLLDEIDDVLEENAEDFVRAYVQKGGQ(SEQ ID NO:503)>BL;Rv2125, H372V2.tab 2386293:2387168 forward MW:31808VTPSEGNAPLPELHNTVVVAAFEGWNDAGDAAGDAVAHLAASWQALPIVEIDDEAYYDYQVNRPVIRQVDGVTRELQWPA(SEQ ID NO:504)MRISHCRPPGSDRDVVLMCGVEPNMRWRTFCDELLAVIDKLNVDTVVILGALLADTPHTRPVPVSGAAYSAASARQFGLQETRYEGPTGIAGVFQSACVGAGIPAVTFWAAVPHYVSHPPNPKATIALLRRvEDVLDVEVPLADLPAQAEAWEREITETIAEDHELAEYVQTLEQHGDAAVDMNEALGNIDGDALAAEFERYLRRRRPGFGR>BL;Rv2133c, H37Rv2.tab 2393854:2394639 reverse MW:28284VLADGELTVLGRIRSASNATFLCESTLGLRSLHCVYKPVSGERPLWDFPDGTLAGRELSAYLVSTQLGWNLVPHTIIRDG(SEQ ID NO:505)PAGIGMLQLWVQQPGDAVDSDPLPGPDLVDLFPAHRPRPGYLPVLRAYDYAGDEVVLMHADDIRLRRMAVFDVLINNADRKGGHILCGIDGQVYGVDHGLCLHVENKLRTVLWGWAGKPIDDQILQAVAGLADALGGPLAEALAGRIAAAEIGALRRRAQSLLDQPVMPGPNGHRPIPWPAF>BL;Rv2134c, H37Rv2.tab 2394653:2395237 reverse MW:21217MARAIHVFRTPDRFVAGTVGQPGNRTFYLQAVHDSRvVSVVLEKQQVAVLAERIGALLFEVNRRFGTPVPPEPTEIDDLS(SEQ ID NO:506)PLIMPVDAEFRVGTMGLGWDSEAQSVVVELLAVTDAEFDASVVLDDTEEGPDAVRVFLTPESARQFATRSYRVISAGRPPCPLCDEPLDPEGHICARTNGYRRDVLLGSGDDPAG>BL;Rv2137c, H37Rv2.tab 2396905:2397315 reverse MW:14965MRNMKSTSHESESGKLLSISSCRPREMVLQRYSLGMTVTADRHLADKREEFAVEDISTGIFASGYGQVGDGRSFSFHIEH(SEQ ID NO:507)RSLVVEIYRPRvAGPVPQAEDVVAMAVRGLVDIDLTDERSLAAAVRDSVASAAPVSR>BL;Rv2144c, H37Rv2.tab 2404168:2404521 reverse MW:12027MLIIALVLALIGLLALVFAVVTSNQLVAWVCIGASVLGVALLIVDALRERQQGGADEADGAGETGVAEEADVDYPEEAPE(SEQ ID NO:508)ESQAVDAGVIGSEEPSEEASEATEESAVSADRSDDSAK>BL;Rv2146c, H37Rv2.tab 2405669:2405956 reverse MW:10804LVVFFQILGFALFIFWLLLIARvVVEFIRSFSRDWRPTGVTVVILEIIMSITDPPVKVLRRLIPQLTIGAVRFDLSIMVL(SEQ ID NO:509)LLVAFIGMQLAFGAAA>BL;Rv2147c, H37Rv2.tab 2406121:2406843 reverse MW:27629VNSHCSHTFITDNRSPRARRGHAMSTLHKVKAYFGMAPMEDYDDEYYDDRAPSRGYARPRFDDDYGRYDGRDYDDARSDS(SEQ ID NO:510)RGDLRGEPADYPPPGYRGGYADEPRFRPREFDRAEMTRPRFGSWLRNSTRGALANDPRRMAMMFEDGHPLSKITTLRPKDYSEARTIGERFRDGSPVIMDLVSMDNADAKRLVDFAAGLAFALRGSFDKVATKVFLLSPADVDVSPEERRRIAETGFYAYQ>BL;Rv2164c, H37Rv2.tab 2427087:2428238 reverse MW:39647MRAKREAPKSRSSDRRRRADSPAAATRRTTTNSAPSRRIRSRAGKTSAPGRQARVSRPGPQTSPMLSPFDRPAPAKNTSQ(SEQ ID NO:511)AKARAKARKAKAPKLVRPTPMERLAARLTSIDLRPRTLANKVPFVVLVIGSLGVGLGLTLWLSTDAAERSYQLSNARERTRMLQQHKEALERDVREAASAPALAEAARRQGMIPTRDTAHLVQDPDGNWVVVGTPKPADGVPPPPLNTKLPEDPPPPPKPAAVPLEVPVRVTPGPDDPAPPARSGPEVLVRTPDGTATLGGATHLPTQAGPQLPGPVPIPGAPGPMPAPPLGAVPSPAPAENPVPLQVGAAPPACLPGPAPVAATPGLSGGSQPMVAPPAPVPANGEQFGPVTAPVPTAPGAPR>BL;Rv2169c, H37Rv2.tab 2431568:2431969 reverse MW:14571MPLSDHEQRMLDQIESALYAEDPKFASSVRGGGFRAPTARRRLQGAALFIIGLGMLVSGVAFKETMIGSFPILSVFGFVV(SEQ ID NO:512)MFGGVVYAITGPRLSGRMDRGGSAAGASRQRRTKGAGGSFTSRMEDRFRRRFDE>BL;Rv2170, H37Rv2.tab 2432235:2432852 forward MW:22926LAIFLIDLPPSDMERRLGDALTVYVDAMRYPRGTETLRAPMWLEHIRRRGWQAVAAVEVTAAEQAEAADTTALPSAAELS(SEQ ID NO:513)NAPMLGVAYGYPGAPGQWWQQQVVLGLQRSGFPRLAIARLMTSYFELTELHILPRAQGRGLGEALARRLLAGRDEDNVLLSTPETNGEDNRAWRLYRRLGFTDIIRGYHFAGDPRAFAILGRTLPL>BL;Rv2172c, H37Rv2.tab 2433634:2434536 reverse MW:33007VTLNTIALELVPPNLEGGKERAIEDARKVVQYSAASGLDGRIRHVMMPGMIAEDDDRPIPMQPKLDVLDFWSIIKPELAG(SEQ ID NO:514)VHGLCTQVTAFMDEPSLHRRLVDLSDAGMEGIVFVGVPRTMQDGEGSGVAPTDALSLYRQLVANRGVIVIPTRDGEQGRLNFKCSRGATYGMTQLLYSDAIVGFLREFARTTEHRPEILLSFGFVPKVETRIGLINWLIQDPGNAAVADEQAFVQKLAGSEPARRRRLMVDLYKRVLDGVADLGFPLSIHLEATYGVSAAAFETFAEMLAYWSPAEPGKPD>BL;Rv2175c, H37Rv2.tab 2437449:2437886 reverse MW:15743MPGRAPGSTLARVGSIPAGDDVLDPDEPTYDLPRvAELLGVPVSKVAQQLREGHLVAVRRAGGVVIPQVFFTNSGQVVKS(SEQ ID NO:515)LPGLLTILHDGGYRDTEIMRWLFTPDPSLTITRDGSRDAVSNARPVDALHAHQAREVVRRAQAMAY>BL;Rv2179c, H37Rv2.tab 2441814:2442317 reverse MW:19488VRYFYDTEFIEDGNTIELISIGVVAEDGREYYAVSTEFDPERAGSWVRTHVLPKLPPPASQLWRSRQQIRLDLEEFLRID(SEQ ID NO:516)GTDSIELWAWVGAYDHVALCQLWGPMTALPPTVPRFTRELRQLWEDRGCPRMPPRPRDVHDALVDARDQLRRFRLITSTDDAGRGAAR>BL;Rv2183c, H37Rv2.tab 2445418:2445810 reverse MW:13322VSGAHTDVRPELRKLAQAILDGIDPAVRvAAAMASGGGPGTGKCQQVWCPLCALAALVTGEQHPLLTVIADHSLALLEVI(SEQ ID NO:517)RAIVDDIDRSAKPPPEGPPGGGQTGASGGENTNGEGSMKSHYQAIPVTIEE>BL;Rv2185c, H37Rv2.tab 2447069:2447500 reverse MW:16292VADKTTQTIYIDADPGEVMKAIADIEAYPQWISEYKEVEILEADDEGYPKRARMLMDAAIFKDTLIMSYEWPEDRQSLSW(SEQ ID NO:518)TLESSSLLKSLEGTYRLAPKGSGTEVTYELAVDLAVPMIGMLKRKAERRLIDGALKDLKKRVEG>BL;Rv2186c, H37Rv2.tab 2447608:2447994 reverse MW:14573MNSIQIADETYVAADAARvSAAVADRCSWRRWWPDLRLQVTEDRADKGIRWTVTGALTGTMEIWLEPSMDGVLLHYFLHA(SEQ ID NO:519)EPTGVAAWQLARMNLARMTHHRRVAGKKMAFEVKTVLERSRPIGVSPVT>BL;Rv2197C, N37Rv2.tab 2461507:2462148 reverse MW:22480MVSRYSAYRRGPDVISPDVIDRILVGACAAVWLVFTGVSVAAAVALMDLGRGFHEMAGNPHTTWVLYAVIVVSALVIVGA(SEQ ID NO:520)IPVLLRARRMAEAEPATRPTGASVRGGRSIGSGHPAKRAVAESAPVQHADAFEVAAEWSSEAVDRIWLRGTVVLTSAIGIALIAVAAATYLMAVGHDGPSWISYGLAGVVTAGMPVIEWLYARQLRRVVAPQSS>BL;Rv2199c, H37Rv2.tab 2463236:2463652 reverse MW:14865MHIEARLFEFVAAFFVVTAVLYGVLTSMFATGGVEWAGTTALALTGGMALIVATFFRFVARRLDSRPEDYEGAEISDGAG(SEQ ID NO:521)ELGFFSPHSWWPIMVALSGSVAAVGIALWLPWLIAAGVAFILASAAGLVFEYYVGPEKH>BL;Rv2203, H37Rv2.tab 2468231:2468920 forward MW:24371MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPGVPPQVSYPKRRHKRLLIGIV(SEQ ID NO:522)VALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY>BL;Rv2206, H37Rv2.tab 2470958:2471329 forward MW:13988MMAGEEAYLLPRDRGPVRRYVRDVVDSRRNLLGLFMPSALTLLFVMFAVPQVQFYLSPAMLILLALMTIDAIILGRKVGR(SEQ ID NO:523)LVDTKFPSNTESRWRLGLYAAGRASQIRRLRAPRPQVERGGDVG>BL;Rv2219, H37Rv2.tab 2486235:2486984 forward MW:26864MAKPRNAAESKAAKAQANAARKAAARQRRAQLWQAFTLQRKEDKRLLPYMIGAFLLIVGASVGVGVWAGGFTMFTMIPLG(SEQ ID NO:524)VLLGALVAFVIFGRRAQRTVYRKAEGQTGAAAWALDNLRGKWRVTPGVAATGNLDAVHRVIGRPGVIFVGEGSAARVKPLLAQEKKRTARLVGDVPIYDIIVGNGDGEVPLAKLERHLTRLPANITVKQMDTVESRLAALGSRAGAGVMPKGPLPTTAKMRSVQRTVRRK>BL;Rv2229c, H37Rv2.tab 2502738:2503472 reverse MW:26851MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSRFESEIDAVRKRGD(SEQ ID NO:525)RDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREELQAQQTAESRALQALRADLAAAQQALDEALAEIDQARHQHSSQRDMLTATLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRLEGFEE>BL;Rv2235, H37Rv2.tab 2507637:2508449 forward MW:29762MPRLAFLLRPGWLALALVVVAFTYLCFTVLAPWQLGKNAKTSRENQQIRYSLDTPPVPLKTLLPQQDSSAPDAQWRRVTA(SEQ ID NO:526)TGQYLPDVQVLARLRvVEGDQAFEVLAPFVVDGGPTVLVDRGYVRPQVGSHVPPIPRLPVQTVTITARLRDSEPSVAGKDPFVRDGFQQVYSINTGQVAALTGVQLAGSYLQLIEDQPGGLGVLGVPHLDPGPFLSYGIQWISFGILAPIGLGYFAYAEIRARRREKAGSPPPDKPMTVEQKLADRYGRRR>BL;Rv2239c, H37Rv2.tab 2511179:2511652 reverse MW:16962MPIATVCTWPAETEGGSTVVAADHASNYARKLGIQRDQLIQEWGWDEDTDDDIRAAIEEACGGELLDEDTDEVIDVVLLW(SEQ ID NO:527)WRDGDGDLVDTLMDAIGPLAEDGVIWVVTPKTGQPGHVLPAEIAEAAPTAGLMPTSSVNLGNWSASRLVQPKSRAGKR>BL;Rv2242, H37Rv2.tab 2515304:2516545 forward MW:44606VNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSAMQERLPFFADLEASQRASVALVVQTAVVNFVEWMHD(SEQ ID NO:528)PHSDVGYTAQAFELVPQDLTRRIALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFTAATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAPATVLVGTPAPGPNGSNSDGDSERASQDVRDTAARHGRAALTDVHGTWLVAIVSGQLSPTEKFLKDLLAAFADAPVVIGPTAPMLTAAHRSASEAISGMNAVAGWRGAPRPVLARELLPERALMGDASAIVALHTDVMRPLADAGPTLIETLDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQPRDAYVLRVAATVGQLNYPTPH>BL;Rv2256c, H37Rv2.tab 2529344:2529874 reverse MW:18896VEPKEQQMRASNQFADVTSGVVYIHASPAAVCPHVEWALSSTLQAKANLVWTPQPALPPQLRAVTNWVGPVGTGARLANA(SEQ ID NO:529)LRSWSVLRFEVTEDPSPGVDGQRFSHTPQLGLWSGAMSANGDIMVGEMRLRAMMAQGADTLAAELDSVLGTAWDQALEVYRDGGDAGEVTWLSRGVG>BL;Rv2257c, H37Rv2.tab 2530007:2530822 reverse MW:28385MTALEVLGGWPVPAAAAAVIGPAGVLATHGDTARvFALASVTKPLVARAAQVAVEEGVVNLDTPAGPPGSTVRHLLAHTS(SEQ ID NO:530)GLAMHSDQALARPGTRRMYSNYGFTVLAESVQRESGIEFGRYLTEAVCEPLGMVTTRLDGGPAAAGFGATSTVADLAVFAGDLLRPSTVSAQMHADATTVQFPGLDGVLPGYGVQRPNDWGLGFEIENSKSPHWTGECNSTRTFGHFGQSGGFIWVDPKADLALVVLTARDFGDWALDLWPAISDAVLAEYT>BL;Rv2342, H37Rv2.tab 2620272:2620526 forward MW:9187LIGYVAVLGLGYVLGAKAGRRRYEQIASTYRALTGSPVARSMIEGGRRKIANRISPDAGFVTLAEIDNQTAVVQRGVERQ(SEQ ID NO:531)PKTAR>BL;Rv2347c, H37Rv2.tab 2626226:2626519 reverse MW:10977MATRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAEATSLDTMAQMNQAFRNIVNMLHGVRDGLVR(SEQ ID NO:532)DANNYEQQEQASQQILSS>BL;Rv2365c, H37Rv2.tab 2646750:2647088 reverse MW:11130MMRRPITLAEQLDAEDAKLVVLARAAMARAEAGAGAAVRDVDGRTYAAAPVALSALELTGLQAAVAAAVSSGATGLQAAV(SEQ ID NO:533)LVAGSVDDPGIAAVRELAPTAAIIVTDRAGNPL>BL;Rv2376c, H37Rv2.tab 2655612:2656115 reverse MW:16635MKMVKSIAAGLTAAAAIGAAAAGVTSIMAGGPVVYQMQPVVFGAPLPLDPASAPDVPTAAQLTSLLNSLADPNVSFANKG(SEQ ID NO:534)SLVEGGIGGTEARIADHKLKKAAEHGDLPLSFSVTNIQPAAAGSATADVSVSGPKLSSPVTQNVTFVNQGGWMLSRASANELLQAAGN>BL;Rv2413c, H37Rv2.tab 2710354:2711301 reverse MW:33113LHLVLGDEELLVERAVADVLRSARQRAGTADVPVSRMRAGDVGAYELAELLSPSLFAEERIVVLGAAAEAGKDAAAVIES(SEQ ID NO:535)AAADLPAGTVLVVVHSGGGRAKSLANQLRSMGAQVHPCARITKVSERADFIRSEFASLRvKVDDETVTALLDAVGSDVRELASACSQLVADTGGAVDAAAVRRYHSGKAEVRGFDIADKAVAGDVAGAAEALRWAMMRGEPLVVLADALAEAVHTIGRVGPQSGDPYRLAAQLGMPPWRvQKAQKQARRWSRDTVATAMRLVAELNANVKGAVADADYALESAVRQVAELVADRGR>BL;Rv2446c, H37Rv2.tab 2745770:2746138 reverse MW:13311MTDRSREPADPWKGFSAVMAATLILEAIVVLLAIPVVDAVGGGLRPASLGYLVGLAVLLILLTGLQRRPWAIWVNLGAQP(SEQ ID NO:536)VLVAGFAVYPGVCFIGVLFAALWVLIAYLRAEVRRRRDYRVSQ>BL;Rv2466c, H37Rv2.tab 2768264:2768884 reverse MW:23035MLEKAPQKSVADFWFDPLCPWCWITSRWILEVAKVRDIEVNFHVMSLAILNENRDDLPEQYREGMARAWGPVRVAIAAEQ(SEQ ID NO:537)AHGAKVLDPLYTAMGNRIHNQGNHELDEVITQSLADAGLPAELAKAATSDAYDNALRKSHHAGMDAVGEDVGTPTIHVNGVAFFGPVLSKIPRGEEAGKLWDASVTFASYPHFFELKRTRTEPPQFD>BL;Rv2468c, H37Rv2.tab 2771647:2772147 reverse MW:17288MTHRSSRLEVGPVARGDVATIEHAELPPGWVLTTSGRISGVTEPGELSVHYPFPIADLVALDDALTYSSRACQVRFAIYL(SEQ ID NO:538)GDLGRDTAARAREILGKVPTPDNAVLLAVSPNQCAIEVVYGSQVRGRGAESAAPLGVAAASSAFEQGELVDGLISAIRVLSAGIAPG>BL;Rv2476c, H37Rv2.tab 2777391:2782262 reverse MW:176902MTIDPGAKQDVEAWTTFTASADIPDWISKAYIDSYRGPRDDSSEATKAAEASWLPASLLTPAMLGAHYRLGRHRAAGESC(SEQ ID NO:539)VAVYRADDPAGFGPALQVVAEHGGMLMDSVTVLLHRLGIAYAAILTPVFDVHRSPTGELLRIEPKAEGTSPHLGEAWMHVALSPAVDHKGLAEVERLLPKVLADVQRVATDATALIATLSELAGEVESNAGGRFSAPDRQDVGELLRWLGDGNFLLLGYQRCRVADGMVYGEGSSGMGVLRGRTGSRPRLTDDDKLLVLAQARVGSYLRYGAYPYAIAVREYVDGSVVEHRFVGLFSVAAMNADVLEIPTISRRVREALAMAESDPSHPGQLLLDVIQTVPRPELFTLSAQRLLTMARAVVDLGSQRQALLFLRADRLQYFVSCLVYMPRDRYTTAVRMQFEDILVREFGGTRLEFTARVSESPWALMHFMVRLPEVGVAGEGAAAPPVDVSEANRIRIQGLLTEAARTWADRLIGAAAAAGSVGQADAMHYAAAFSEAYKQAVTPADAIGDIAVITELTDDSVKLVFSERDEQGVAQLTWFLGGRTASLSQLLPMLQSMGVVVLEERPFSVTRPDGLPVWIYQFKISPHPTIPLAPTVAERAATANRFAEAVTAIWHGRVEIDRFNELVMRAGLTWQQVVLLRAYAKYLRQAGFPYSQSYIESVLNEHPATVRSLVDLFEALFVPVPSGSASNRDAQAAAAAVAADIDALVSLDTDRILRAFASLVQATLRTNYFVTRQGSARCRDVLALKLNAQLIDELPLPRPRYEIFVYSPRVEGVHLRFGPVARGGLRWSDRRDDFRTEILGLVKAQAVKNAVIVPVGAKGGFVVKRPPLPTGDPAADRDATRAEGVACYQLFISGLLDVTDNVDHATASVNPPPEVVRRDGDDAYLVVAADKGTATFSDIANDVAKSYGFWLGDAFASGGSVGYDHKAMGITARGAWEAVKRHFREIGIDTQTQDFTVVGIGDMSGDVFGNGMLLSKHIRLIAAFDHRHIFLDPNPDAAVSWAERRRMFELPRSSWSDYDRSLISEGGGVYSREQKAIPLSAQVRAVLGIDGSVDGGAAEMAPPNLIRAILRAPVDLLFNGGIGTYIKAESESDADVGDRANDPVRVNANQVRAKVIGEGGNLGVTALGRVEFDLSGGRINTDALDNSAGVDCSDHEVNIKILIDSLVSAGTVKADERTQLLESMTDEVAQLVLADNEDQNDLMGTSRANAASLLPVHAMQIKYLVAERGVNRELEALPSEKEIARRSEAGIGLTSPELATLMAHVKLGLKEEVLATELPDQDVFASRLPRYFPTALRERFTPEIRSHQLRREIVTTMLINDLVDTAGITYAFRIAEDVGVTPIDAVRTYVATDAIFGVGHIWRRIRAANLPIALSDRLTLDTRRLIDRAGRWLLNYRPQPLAVGAEINRFAAMVKALTPRMSEWLRGDDKAIVEKTAAEFASQGVPEDLAYRVSTGLYRYSLLDIIDIADIADIDAAEVADTYFALMDRLGTDGLLTAVSQLPRHDRWHSLARLAIRDDIYGALRSLCFDVLAVGEPGESSEQKIAEWEHLSASRVARARRTLDDIRASGQKDLATLSVAARQIRRMTRTSGRGISG>BL;Rv2484c, H37Rv2.tab 2791022:2792494 reverse MW:52309MAESGESPRLSDELGPVDYLMHRGEANPRTRSGIMALELLDGTPDWDRFRTRFENASRRVLRLRQKVVVPTLPTAAPRWV(SEQ ID NO:540)VDPDFNLDFHVRRVRVSGPATLREVLDLAEVILQSPLDISRPLWTATLVEGMADGRAAMLLHVSHAVTDGVGGVEMFAQIYDLERDPPPRSTPPQPIPEDLSPNDLMRRGINHLPIAVVGGVLDALSGAVSMAGRAVLEPVSTVSGILGYARSGIRVLNRAAEPSPLLRRRSLTTRTEAIDIRLADLHKAAKAGGGSINDAYLAGLCGALRRYHEALGVPISTLPMAVPVNLRAEGDAAGGNQFTGVNLAAPVGTIDPVARMKKIRAQMTQRRDEPAMNIIGSIAPVLSVLPTAVLEGITGSVIGSDVQASNVPVYPGDTYLAGAKILRQYGIGPLPGVAMMVVLISRGGWCTVTVRYDRASVRNDELFAQCLQAGFDEILALAGGPAPRVLPASFDTQGAGSVPRSVSGS>BL;Rv2507, H37Rv2.tab 2822438:2823256 forward MW:28520MNDPRRPQRFGPPLSGYGPTGPQVPPNPPTADPAYADQSPYASTYGGYVSPPWSPGGPPPRPPQWPPGPHEASPTQQLPQ(SEQ ID NO:541)YWQYDQPPPGGFPPDGLTPPPPQGPRTPRWLWFAAGSAVLLVVALVIALVIANGSVKKQTAIEPLPPMPGPSPTRPTTTTPTPPSPSAAPAPTTTTGTPSETVAGAMQTVVYDVTGEGRAISITYMDSGNVIQTEFNVALPWRKEVSLSKSSLHPASVTIVNIGHNVTCSVTVAGVQVRQRTGAGLTICDAPS>BL;Rv2520c, H37Rv2.tab 2837391:2837615 reverse MW:8341VVDRDPNTIKQEIDQTRDQLAATIDSLAERANPRRLADDAKTRVIAFLRKPIVTVSLVGIGSVVVVVVIHKIRNR(SEQ ID NO:542)>BL;Rv2525c, H37Rv2.tab 2849855:2850574 reverse MW:25369MSVSRRDVLKFAAATPGVLGLGVVASSLRAAPASAGSLGTLLDYAAGVIPASQIRAAGAVGAIRYVSDRRPGGAWMLGKP(SEQ ID NO:543)IQLSEARDLSGNGLKIVSCYQYGKGSTADWLGGASAGVQHARRGSELHAAAGGPTSAPIYASIDDNPSYEQYKNQIVPYLRSWESVIGHQRTGVYANSKTIDWAVNDGLGSYFWQHNWGSPKGYTHPAAHLHQVEIDKRKVGGVGVDVNQILKPQFGQWA>BL;Rv2536, H37Rv2.tab 2860452:2861141 forward MW:24626MTNWMLRGLAFAAAMVVLRLFQGALINAWQMLSGLISLVLLLLFAIGGVVWGVMDGRADAKASPDPDRRQDLAMTWLLAG(SEQ ID NO:544)LVAGALSGAVAWLISLFYKAIYTGGPINELTTFAAFTALIVFLVGIVGVAVGRWLVDRQLAKAPVRHHGLAAEHERAADTDVFSAVRADDSPTGEMQVAQPEAQTAAVATVEREAPTEVIRTTESDTPTEVIRTDTEADQTKPGDEPKKD>BL;Rv2588C, H37Rv2.tab 2915849:2916193 reverse MW:12966MESFVLFLPFLLIMGGFMYFASRRQRRAMQATIDLHDSLQPGERVHTTSGLEATIVAIADDTIDLEIAPGVVTTWMKLAI(SEQ ID NO:545)RDRILPDDDIDEELNEDLDKDVDDVAGERRVTNDS>BL;Rv2609c, H37Rv2.tab 2936813:2937865 reverse MW:38096MTWLVLAGAVLLVVLVAFGAWGYQTANRLNRLNVRYDLSWQSLDSALARRAVVARAVAIDAYGGAPQGSRLAALADAAEG(SEQ ID NO:546)APRHARENAENELSAALANVNPASLPAALIAELADAEARVLLARRFHNDAVRDTLALGERRLVRLLRLGGTAVLPTYFEIVERPHALVNGDQGASGRRTSARVVLLDDSGAVLLLCGSDPANPAFRDGAAPKWWFTVGGQVRPGERLAQAAARELAEETGLRVAPADMIGPIWRRDEVFEFNGSLIDSEEFYLVHRTRRFEPAVQGRTELERRYIRDARWCDANDIAQLVAAGERWPLQLGELLPAANRLVDVALDNGAARDAGVPQPIR>BL;Rv2673, H37Rv2.tab 2989290:2990588 forward MW:48883VYGALVTAADSIRTGLGASLLAGFRPRTGAPSTATILRSALWPAAVLSVLHRSIVLTTNGNITDDFKPVYRAVLNFRRGW(SEQ ID NO:547)DIYNEHFDYVDPHYLYPPGGTLLMAPFGYLPFAPSRYLFISINTAAILVAAYLLLRMFNFTLTSVAAPALILAMFATETVTNTLVFTNINGCILLLEVLFLRWLLDGRASRQWCGGLAIGLTLVLKPLLGPLLLLPLLNRQWRALVAAVVVPVVVNVAALPLVSDPMSFFTRTLPYILGTRDYFNSSILGNGVYFGLPTWLILFLRILFTAITFGALWLLYRYYRTGDPLFWFTTSSGVLLLWSWLVMSLAQGYYSMMLFPFLMTVVLPNSVIRNWPAWLGVYGFMTLDRWLLFNWMRWGRALEYLKITYGWSLLLIVTFTVLYFRYLDAKADNRLDGGIDPAWLTPEREGQR>BL;Rv2680, H37Rv2.tab 2996104:2996733 forward MW:22555LTSAGDDAERSDEEERRLTSAEPALFREAVAANNAVTVRPEIELGPIRPPQRLAPYSYALGAEIKHPELDVIPERSEGDA(SEQ ID NO:548)FGRLIMLYDPDGSDAWDGTIRLVAYVQADLDSSEAVDPLLPEVAWSWLVDALTARTDQVRALGGTVTATTSVRYGDISGPPRAHQLELRASWTATTPDLGAHVQAFCDVLEHAAGLPPAGVTDLGSRSRA>BL;Rv2683, H37Rv2.tab 3000111:3000605 forward MW:17729MKVNIDPTAPTFATYRRDMRAEQMAEDYPVVSIDSDALDAARMLAEHRLPGLLVTAGAGKQYAVLPASQVVRFIVPRYVQ(SEQ ID NO:549)DDPLLAGVLNESTADRCAERLSGKKVRDVLPDHLVEVPPANADDTIIEVAAVMARLRSPLLAVVKDGSLLGVVTASRLLAAALKT>BL;Rv2695, H37Rv2.tab 3011915:3012619 forward MW:24154MAVDLDGVTTVLLPGTGSDNDYVRRAFSAPLRRAGAVLVTPVPHPGRLIDGYRAALDDAARDGPVVVGGVSLGAAVAAAW(SEQ ID NO:550)ALEHPDRAVAVLAALPAWTGEPELAPAAQAARYTAARLRCDGLAATTTRMRASSPVWLAEELTRSWRVQWPELPDAMEEAAAYVAPSRAELARLVAPLAVAAAVDDPIHPLQVAADWVSVAPHAALRTVTLDEIGADAAALGSACLAALAEVSGA>BL;Rv2696c, H37Rv2.tab 3012831:3013607 reverse MW:27216MAFGRRTGKDGGKRKAGHAPVQPADEHVRPEDTVVASAAAASGVEDQEELQGPFDIDDFDDPSVAVLARLDLGSVLIPMP(SEQ ID NO:551)AAGQVQVELTESGVPSAVWVITPNGRYSIAAYAAPKTGGLWREVAGELADSLRKDSAKVSIKDGPWGREVIGIAAGVVRFIGVDGYRWNIRCVVNGPQETVDALTEEAREALADTVVRRGDTPLPVRTPLPVHLPEPMAAQLREAAAAQADTQRQAAAGVARRGAQGSAMQQLRSTTGG>BL;Rv2698, H37Rv2.tab 3014172:3014654 forward MW:17530VSGTRLAPHSVRYRERLWVPWWWWPLAFALAALIAFEVNLGVAALPDWVPFATLFTVAAGTLLWLGRVEIRVTAGSADGA(SEQ ID NO:552)GVKLWAGPAHLPVAVIARSAEIPATAKSAALGRQLDPAAYVLHRAWVGPMVLVVLDDPNDPTPYWLVSCRHPERVLSALRS>BL;Rv2699c, H37Rv2.tab 3014665:3014964 reverse MW:10915MPTDYDAPRRTETDDVSEDSLEELKARRNEAASAVVDVDESESAESFELPGADLSGEELSVRVVPKQADEFTCSSCFLVQ(SEQ ID NO:553)HRSRLASEKNGVMICTDCAA>BL;Rv2700, H37Rv2.tab 3015202:3015849 forward MW:22627VVAQITEGTAFDKHGRPFRRRNPRPAIVVVAFLVVVTCVMWTLALTRPPDVREAAVCNPPPQPAGSAPTNLGEQVSRTDM(SEQ ID NO:554)TDVAPAKLSDTKVHVLNASGRGGQAADIAGALQDLGFAQPTAANDPIYAGTRLDCQGQIRFGTAGQATAAALWLVAPCTELYHDSRADDSVDLALGTDFTTLAHNDDIDAVLANLRPGATEPSDPALLAKIHANSC>BL;Rv2708c, H37Rv2.tab 3021550:3021795 reverse MW:8962MSGMQTQTIERTDADERVDDGTGSDTPKYFHYVKKDKIAESAVMGSHVVALCGEVFPVTRAPKPGSPVCPDCKRIYDTLK(SEQ ID NO:555)KG>BL;Rv2709, H37Rv2.tab 3021838:3022281 forward MW:16810MWDSRVNKHGLRLGFNGQFDDFDDFDDKGRPVLITAAAPSYEVEHRTRVRKYLTLMAFRVPALILAAIAYGAWHNGLISL(SEQ ID NO:556)LIVAASVPLPWMAVLIANDRPPRRADEPRRFDVARRRIPLFPTAERPALEPRRQPAERSAPRGFADHG>BL;Rv2714, H37Rv2.tab 3027064:3028035 forward MW:35520MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLD(SEQ ID NO:557)YRSRRPLMTFKTDHFTHSDDPELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKSDDDPT>BL;Rv2715c, H37Rv2.tab 3031042:3031536 reverse MW:17324MTPVRPPHTPDPLNLRGPLDGPRWRRAEPAQSRRPGRSRPGGAPLRYHRTGVGMSRTGHGSRPVPPATTVGLALLAAAIT(SEQ ID NO:558)LWLGLVAQFGQMITGGSADGSADSTGRvPDRLAVVRvETGESLYDVAVRvAPNAPTRQVADRIRELNGLQTPALAVGQTLIAPVG>BL;Rv2722, H37Rv2.tab 3034634:3034879 forward MW:9077MPCLARQPVDLPPWAGPRCGPYCPRARITLLQRTTIAKSNRKYYENGYPADVKLMPGHAAVVSNRAAARAGFALPCRKRQ(SEQ ID NO:559)PD>BL;Rv2728c, H37Rv2.tab 3040768:3041460 reverse MW:23455VLSAIGIVPSAPVLVPELAGAAAAELADLGAAVIAAASLLPKSWIAVGTGRADDVVRPTDVGTFAGFGADVRVGLAPQDG(SEQ ID NO:560)DGVAVPVELPLCALLTAWVRGQARPEARAQVHVYASDHGSDAAVARGRQLRADIDREPDPIGVLVVADGLNTLTPRAPGGYDPDGAGMQRALDDALASGDLAVLTRLPAQVLGRvAFQVLAGLAEPGPRSAKEFYRGAPHGVGYFAGVWQP>BL;Rv2732c, H37Rv2.tab 3044377:3044988 reverse MW:21989MMSHEHDAGDLDALRAEIEAAERRvAREIEPGARALVVAILVFVLLGSFILFHTGSVRGWDVLFSSHGAGRAAVALPSRV(SEQ ID NO:561)FAWLALVFGVGFSMLALLTRRWALAWVALAGSANASGTGLLAVWSRQTVAAGHPGPGIGLIVAWITAIVLTFXWAQVVWSRTIVQLAAEERRRRVVAQQQCKTLLDHVQTDSEAGTTPDRGTDR>BL;Rv2738c, H37Rv2.tab 3051808:3052011 reverse MW:7551MLAGVRLTEFHERVALHFGAAYGSSVLLDHVLTGFDGRSAAQAIEDGVEPRDVWRALCADFDVPHDRW(SEQ ID NO:562)>BL;Rv2740, H37Rv2.tab 3053232:3053678 forward MW:16593MAELTETSPETPETTEAIRAVEAFLNALQNEDFDTVDAALGDDLVYENVGFSRIRGGRRTATLLRRMQGRVGFEVKIHRI(SEQ ID NO:563)GADGAAVLTERTDALIIGPLRvQFWVCGVFEVDDGRITLWRDYFDVYDMFKGLLRGLVALVVPSLKATL>BL;Rv2771c, H37Rv2.tab 3080583:3081032 reverse MW:16000VRRLLIVHHTPSPHMQEMFEAVVSGATDPEIEGVEVVRRPALTVSPIEMLEADGYLLGTPANLGYISGALKHAFDVCYYL(SEQ ID NO:564)CLDTTRGRSFGAYIHGNEGTEGAERAVDAITTGLGWVQAAETVVVMGKPSKADIEACWNLGATVAAQLMG>BL;Rv2772c, H37Rv2.tab 3081121:3081591 reverse MW:17326MTRRTLYVQLIIAFMCVAMVAYLVMLGRvAVAMIGSGRAAAAGLGLALLILPVIGLWANIATLRAGFAYQRLARLIAEDG(SEQ ID NO:565)LDIDASALPRRASGRIQRDAADALFAAVRTELEDDADDWRRWYRLARAYDYAGDRRRAREANKTALQLEGRARPGAR>BL;Rv2795c, H37Rv2.tab 3103939:3104910 reverse MW:37568VTWKGSGQETVGAEPTLWAISDLHTGHLGNKPVAESLYPSSPDDWLIVAGDVAERTDEIRWSLDLLRRRFAKVIWVPGNH(SEQ ID NO:566)ELWTTNRDPMQIFGRARYDYLVNMCDEMGVVTPEHPFPVWTERGGPATIVPMFLLYDYSFLPEGANSKAEGVAIAKERNVVATDEFLLSPEPYPTRDAWCHERVAATRARLEQLDWMQPTVLVNHFPLLRQPCDALFYPEFSLWCGTTKTADWHTRYNAVCSVYGHLHIPRTTWYDGVRFEEVSVGYPREWRRRKPYSWLRQVLPDPQYAPGYLNDFGGHFVITPEMRTQAAQFRERLRQRQSR>BL;Rv2840c, H37Rv2.tab 3147961:3148257 reverse MW:10601VRTCVGCRKRGLAVELLRvVAVSTGNGNYAVIVDTATSLPGRGAWLHPLRQCAQQAIRRAFARALRIAGSPDTSAVVEY(SEQ ID NO:567)LESLGELEPPGNRTGSNRT>BL;2V2843, H37Rv2.tab 3150170:3150712 forward MW:17735VLPAAPVINRLTNRPISRRGVLAGGAALAALGVVSACGESAPKAPAVEELRSPLDQARHDGALAAAAATAIGIPPQVAAA(SEQ ID NO:568)LTVVATQRTSHARALATEIARAAGKLVSATSETSSSSPSPTDPAAPPPAVSDVIDSLRTSAGEASRLVATTSGYRAGLLASIAASCTASYTVALVPSGPSI>BL;Rv2844, H37Rv2.tab 3150712:3151197 forward MW:16940MTSSEPAHGATPKRSPSEGSADNAALCDALAVEHATIYGYGIVSALSPPGVNFLVADALKQHRHRRDDVIVMLSARGVTA(SEQ ID NO:569)PIAAAGYQLPMQVSSAANAARLAVRMENDGATAWRAVVEHAETADDRvFASTALTESAVMATRWNRvLGAWPITAAFPGGDE>BL;Rv2876, H37Rv2.tab 3187662:3187973 forward MW:11805MFGQWEFDVSPTGGIAVASTEVEHFAGSQHEVDTAEVPSAAWGWSRIDHRTWHIVGLCIFGFLLAMLRGNHVGHVEDWFL(SEQ ID NO:570)ITFAAVVLFVLARDLWGRRRGWIR>BL;Rv2898c, H37Rv2.tab 3207944:3208327 reverse MW:14223MTTLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWRCRYGELDVIACDAATRTVVFVEVKTRTGDGYGGLAHAVTERKV(SEQ ID NO:571)RRLRRLAGLWLADQEERWAAVRIDVIGVRVGPKNSGRTPELTHLQGIG>BL;Rv2901C, H37Rv2.tab 3211805:3212107 reverse MW:12225MSAEDLEKYETEMELSLYREYKDIVGQFSYVVETERRFYLANSVEMVPRNTDGEVYFELRLADAWVWDMYRPARFVKQVR(SEQ ID NO:572)VVTFKDVNIEEVEKPELRLPE>BL;Rv2917, H37Rv2.tab 3226362:3228239 forward MW:68334VRVTRLVDAESTRCDVGPAPKSVANLHFTAATSRFRLGRERANSVRSDGGWGVLQPVSATFNPPLRGWQRRALVQYLGTQ(SEQ ID NO:573)PRDFLAVATPGSGKTSFALRIAAELLRYHTVEQVTVVVPTEHLKVQWAHAAAAHGLSLDPKFANSNPQTSPEYHGVMVTYAQVASHPTLHRVRTEARKTLVVFDEIHNGGDAKTWGDAIREAFGDATRRLALTGTPFRSDDSPIPFVSYQPDADGVLRSQADHTYGYAEALADGVVRPVVFLAYSGQARWRDSAGEEYEARLGEPLSAEQTARAWRTALDPEGEWMPAVITAADRRLRQLRAHVPDAGGMIIASDRTTARAYARLLTTMTAEEPTVVLSDDPGSSARITEFAQGTSRWLVAVRMVSEGVDVPRLSVGVYATNASTPLFFAQAIGRFVRSRRPGETASIFVPSVPNLLQLASALEVQRNHVLGRPHRESAHDPLDGDPATRTQTERGGAERGFTALGADAELDQVIFDGSSFGTATPTGSDEEADYLGIPGLLDAEQMRALLHRRQDEQLRKRAQLQKGATQPATSGASASVHGQLRDLRRELHTLVSIAHHRTGKPHGWIHDERRRRCGGPPIAAATRAQIKARIDALRQLNSERS>BL;Rv2926c, H37Rv2.tab 3240550:3241170 reverse MW:22378VDLGGVRRRISLMARQHGPTAQRHVASPMTVDIARLGRRPGAMFELHDTVHSPARIGLELIAIDQGALLDLDLRVESVSE(SEQ ID NO:574)GVLVTGTVAAPTVGECARCLSPVRGRVQVALTELFAYPDSATDETTEEDEVGRVVDETIDLEQPIIDAVGLELPFSPVCRPDCPGLCPQCGVPLASEPGHRHEQIDPRWAKLVEMLGPESDTLRGER>BL;Rv2949c, H37Rv2.tab 3299973:3300569 reverse MW:22587MTECFLSDQEIRKLNRDLRILIAANGTLTRVLNIVADDEVIVQIVKQRIHOVSPKLSEFEQLGQVGVGRVLQRYIILKGR(SEQ ID NO:575)NSEHLFVAAESLIAIDRLPAAIITRLTQTNDPLGEVMAASHIETFKEEAKVWVGDLPGWLALHGYQNSRKRAVARRYRVISGGQPIMVVTEHFLRSVFRDAPHEEPDRWQFSNAITLAR>BL;Rv2968c, H37Rv2.tab 3323073:3323702 reverse MW:23100VVAARPAERSGDPAAVRVPVPSAWWVLIGGVIGLFASMTLTVEKVRILLDPIYVPSCNVNPIVSCGSVMTTPQASLLGFP(SEQ ID NO:576)NPLLGIAGFTVVVVTGVLAVAKVPLPRWYWIGLAVGILVGVAFVHWLIPQSLYRIGALCPYCMVVWAVIATLLVVVASIVFGPMRENRGSQERvGARLLYQWRWSLATLWFTTVFLLIMVRFWDYWSTLI>BL;Rv2980, H37Rv2.tab 3335959:3336501 forward MW:18752VTGESDGPPRAVLIAAAALAAAVIGVILVVAANRQPPERPVVIPAVPAPQATGPGCKALLAALPQRLGEYPRAPVAEPTT(SEQ ID NO:577)AGATAWRTGPNSTPVILRCGLDRPAEFVVGSAIQVVDRVQWFQVAAQNPDEPGRSTWYTVDRPVYVALTLPSGSGPTAIQELSDVIDHTIPAVPIOPAPAR>BL;Rv3005c, H37Rv2.tab 3363695:3364531 reverse MW:28827VTSSNDSHWQRPDDSPGPMPGRPVSASLVDPEDDLTPARYAGDFGSGTTTVIPPYDAASSGVGNSGYSLIEAAEPLPYVQ(SEQ ID NO:578)PQPGRQVPAGSAGIDMDDDERVRAAGRRGTQNLGLLILRVGLGAVLIAHGLQKLFGWWDGQGLAGFQNSLSDIGYQHAEILAYVSAGGEIVAGVLLVLGLFTPLAAAGALAFLINGLLAGISAQHSRPVAYFLQDGHEYQITLVVMAVAVILSGPGRYGLDAARGWAHRPFIGSFVALLGGIAAGIAVWVLLNGANPLA>BL;Rv3013, H37Rv2.tab 3371814:3372467 forward MW:22967VRSYLLRIELADRPGSLGSLAVALGSVGADILSLDVVERGNGYAIDDLVVELPPGAMPDTLITAAEALNGVRVDSVRPHT(SEQ ID NO:579)GLLEAHRELELLDHVAAAEGATARLQVLVNEAPRVLRVSWCTVLRSSGGELHRLAGSPGAPETRANSAPWLPIERAAALDGGADWVPQAWRDMDTTMVAAPLGDTHTAVVLGRPGPEFRPSEVARLGYLAGIVATMLR>BL;Rv3015c, H37Rv2.tab 3374653:3375663 reverse MW:34212VSVFATATGIGSWPGTAAREAAQVVVGELAGALAYLTELPARGVGADMLGRAGGLLVDVAIDTVPRGYRIAARPGAVTRR(SEQ ID NO:580)AASLLDEDMDALEEAWETAGLRGCGRAVKVQAPGPVTLVAGLELANGHRAITDPGAVRDLAASLAEGVAAHRAALARRLDTPVVVQFDEPSLPAALGGRLTGVTALSPVAPLDETVAEALLDTCIAAVDADVALHSCSPDLPWDLLQRSRISAVSVDASTLQAADLDAVAAFVESGRTVVLGLVPVTAPERAPSMEEVAAAAVAVTDRLGVPRSALRDRLGVSPACGLANATGQWARTAVGLARDVAEAFARDPEAI>BL;Rv3035, H37Rv2.tab 3395378:3396457 forward MW:37305LAAGPALSARGYLALNGQTPAGCSLMEWQNDNNGRQRWCVRLVQGGGFAGPLFDGFDNLYVGQPGAIISFPPTQWTRWRQ(SEQ ID NO:581)PVIGMPSTPRFLGHGRLLVSTHLGQLLVFDTRRGMVVGSPVDLVDGIDPTDATRGLADCAPARPGCPVAAAPAFSSVNGTVVVSVWQPGEPAAKLVGLKYHAEQLVREWTSDAVSAGVLASPVLSADGSTVYVNGRDHRLWALNAADGKAKWSAPLGFLAQTPPALTPHGLIVSGGGPDTALAAFRDAGDHAEGAWRRDDVTALSTASLAGTGVGYTVISGPNHDGTPGLSLLVFDPANGHTVNSYPLPGATGYPVGVSVGNDRRvVTATSDGQVYSFAP>BL;Rv3038c, H37Rv2.tab 3398427:3399407 reverse MW:36049MTRSSNIPADATPNPHATAEQVAAARHDSKLAQVLYHDWEAENYDEKWSISYDQRCVDYARGRFDAIVPDEVIAQLPYDR(SEQ ID NO:582)ALELGCGTGFFLLNLIQAGVARRGSVTDLSPGMVKVATRNGQALGLDIDGRvADAEGIPYDDDAFDLVVGHAVLHHIPDVELSLREVVRVLKPGGRFVFAGEPTTVGDGYARTLSTLTWRVVTNATKLPGLRGWRRPQGELDESSRAAALEALVDLHTFTPQDLQRIAHNAGAVEVQTATEEFTAANLGWPLRTFECTVPPGRLGWGWARFAFTSWKTLGWVDANVWRHVVPKGWFYNVMITGVKPS>BL;Rv3195, H37Rv2.tab 3564363:3565778 forward MW:49325VSTGEVMGDLPFGFSSGDDPPEDPSGRDKRGKDGADSGSGANPLGAFGIGGEFNMADLGQIFTRLGEMFGGVGTAMAAGK(SEQ ID NO:583)TSGPVNYDLARQVASSSIGFIAPIPAATNSAIADAVHLADTWLDGATSLPAGATKAVGWSPTDWVDNTLATWKRLCDPMAQQISTVWASSLPEEAKSMAGPLLSIMSQMGGIAFGSQLGQALGRLSREVLTSTDIGLPLGPKGVAAILPGAVESFAAGLEQPRSEILTFLATREAAHHRLFSHVPWLASQLLGAVEAYAMGMKIDMTGIEELARDINPTSLADPAAMEQLLSQGVFEPKATPAQTQALERLETLLALIEGWVQTVVTAALGERIPGEAALSETLRRRRASGGPAEQTFATLVGLELRPRKLREAGALWERLTRAVGMDARDAVWQHPDLLPATDDLDDPAAFIDRvIGGDTSGIDEAIAELERDQQARGADDSGHDGGPVDN>BL;Rv3205c, H37Rv2.tab 3581629:3582504 reverse MW:31352MGSTRLTGVNVEPPPEHVLVAFGLAGAQPILLGAGWEGGWRCGEVVLSMVADNARAAWSARvRETLFVDGVRLARPVRST(SEQ ID NO:584)DGRYVVSGWRADTFVAGAPEPRHDEVVSAAVRLHEATGKLERPRFLTQGPAAPWAEIDVFVAADRAGWEERPLQSVPPGVPTAPPAADPQRSIDLINQLAGLRKPTKSPNQLVUGDLYGTVLFAGTAPPGITDITPYWRPASWAAGVAVVDALSWGAADDGLIERWNALPEWPQMLLRALMFRLAVYALHPRSTAEAFPGLAHTAALVRLVL>BL;2V3207c, H37Rv2.tab 3583803:3584657 reverse MW:31034VSTYGWRAYALPVLMVLTTVVVYQTVTGTSTPRPAAAQTVRDSPAIGVVGTAILDAPPRGLAVFDANLPAGTLPDGGPFT(SEQ ID NO:585)EAGDKTWRvVPGTTPQVGQGTVKVFRYTVEIENGLDPTMYGGDNAFAQMVDQTLTNPKGWTHNPQFAFVRIDSGKPDFRISLVSPTTVRGGCGYEFRLETSCYNPSFGGMDRQSRVFINEARWVRGAVPFEGDVGSYRQYVINHEVGHAIGYLRHEPCDQQGGLAPVMMQQTFSTSNDDAAKFDPDFVKADGKTCRFNPWPYPIP>BL;Rv3208c, H37Rv2.tab 3585679:3585948 reverse MW:9400VEVKIGITDSPRELVFSSAQTPSEVEELVSNALRDDSGLLTLTDERGRRFLIHTARIAYVEIGVADARRvGFGVGVDAAA(SEQ ID NO:586)GSAGKVATSG>BL;Rv3209, H37Rv2.tab 3586273:3586830 forward MW:19118VALGAVATAVIINSGDSTSTKAIVGAPAPRTVISTSPRPTAPTSTSPHPSPSTLRPQLPPETVTTVAPPGTGPTTVPTRT(SEQ ID NO:587)PTAAPPQTAVPPPAPLNPRTVVYRvTGTKQLFDLVNVVYTDARGFPVTDFNVSLPWTKMVVLNPGVQTESVVATSLYSRLNCSIVNTGAQTVVASTNNAIIATCTR>BL;Rv3212, H37Rv2.tab 3589393:3590613 forward MW:42506MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGT(SEQ ID NO:588)VATGDGRQVDGRDPATGESLWSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLPSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRGDTLVALG>BL;Rv3217C, H37Rv2.tab 3593806:3594234 reverse MW:14260VPVRAPAAVRGAGLIVAVQGGAALVVAAALLVRGLAGADQHIVNGLGTAGWFVLVGGAVLAAGCRLAVGKLWGRGLAVFA(SEQ ID NO:589)QLLLLPVAWYLIVGSHQPAIGIPVGIIALGVLVLLFSPPSIRWAAGRDQRGAASAANRGPDSR>BL;Rv3242C, H37Rv2.tab 3621572:3622210 reverse MW:22481VLDLVLPLECGGCGAPATRWCAACAAELSVAAGEPHVVSPRVDPQVPVFALGRYAGVRRQAILAMKEHGRRDLVAPLACA(SEQ ID NO:590)LIVGVDHLLSWGMLENPLTMVPAPTRRWAARRRGGDPVSRMARIAGATLGRHHDVTVVPALRMRALARDSVGLGASARERNITGRVLLRGQRPRNEVVLVDDIITTGATARESVRVLQAAGVRVGAVLAVAAA>BL;Rv3256c, H37Rv2.tab 3636277:3637314 reverse MW:35277VNVARAIDLEDTEGLIAADRGALLRAASMAGAQVRAIAAAADEGELDLLRGSDRPRSVIWVTGRGTAETAGTILASTLGA(SEQ ID NO:591)GAAEPIVLASAAPPWVGPLDVLIVAGDDPGDPALVGAAAIGVRRGARVVVVAPYEGPLRDSTAGRVAVLEPRLRVPDEFGLSRYLAAGLAALQTVDPKLRIDLASLADELDAEALRNSAGREVFTNPAKALAARVSGCQLALAGDNAATLALARHGSSVMLRIANQVVAATRLSDAVVALRAGTPPDALFHDEEIDGPAPQRLRvLALALAGERTVVAARVAGLDDAYLVAAEDVPELLDAPVGSGGAVLAVRLEMAAVYLRLVRG>BL;2V3258c, H37Rv2.tab 3638813:3639301 reverse MW:16810MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVR(SEQ ID NO:592)HAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD>BL;Rv3259, H37Rv2.tab 3639424:3639840 forward MW:15649MRGPLLPPTVPGWRSRAERFDMAVLEAYEPIERRWQERVSQLDIAVDEIPRIAAKDPESVQWPPEVIADGPIALARLIPA(SEQ ID NO:593)GVDVRGNATRARIVLFRKPIERRAKDTEELGELLHEILVAQVAIYLDVDPSVIDPTIDD>BL;Rv3269, H37Rv2.tab 3650233:3650511 forward MW:9750MAIQVFLAKATTTVITGLAGVTAYEILKKAAAKAPLRQTAVSAAALGLRGTRKAEEAAESARLKVADVMAEARERIGEES(SEQ ID NO:594)PTPAISDLHDHDH>BL;Rv3277, H37Rv2.tab 3659877:3660692 forward MW:30079MNEVTAGVRELATAIMVSRHLTGVLAGHGSQTVTYHFASILCSSVHSLVVSFADATIARLPGVVQPYAQRHHELIKFAIV(SEQ ID NO:595)GGTTFIIDTAIFYTLKLTVLEPKPVTAKVIAGIVAVIASYVLNREWSFRDRGGRERHHEALLFFAFSGVGVLLSMAPLWFSSYILQLRVPTVSLTMENIADFISAYIIGNLLQMAFRFWAFRRWVFPDEFARNPDKALESALTAGGIAEVFEDVLEGGFEDGNVTLLRAWRNRANRFAQLGDSSEPRvSKTS>BL;Rv3278C, H37Rv2.tab 3660653:3661168 reverse MW:19820MSYPENVLAAGEQVVLHRHPHWNRLIWPVVVLVLLTGLAAFGSGFVNSTPWQQIAKNVIHAVIWGIWLVIVGWLTLWPFL(SEQ ID NO:596)SWLTTHPVVTNRRvMFRHGVLTRSGIDIPLARINSVEFRDRIFERIFRTGTLIIESASQDPLEFYNIPRLREVHALLYHEVFDTLGSDESPS>BL;Rv3281, H37Rv2.tab 3663688:3664218 forward MW:19013MGTCPCESSERNEPVSRVSGTNEVSDGNETNNPAEVSDGNETNNPAEVSDGNETNNPAPVSRVSGTNEVSDGNETNNPAP(SEQ ID NO:597)VSRvSGTNEVSDGNETNNPAPVTEKPLHPHEPHIEILRGQPTDQELAALIAVLGSISGSTPPAQPEPTRWGLPVDQLRYPVFSWQRITLQEMTHMRR>BL;Rv3311, H37Rv2tab 3698120:3699379 forward MW:45732MVADLVPIRLSLSAGDRYTLWAPRWRDAGDEWEAFLGKDDDLYGFESVSDLVAFVRTDTENDLVDHPAWQDLTGAHAHNL(SEQ ID NO:598)NPAEDNQFDLVVVEELLAEKPTAESVAALAASLAIVSAIGSVCELAAVSKFFNGNPILGTVSGGLEHFTGKAGNKRWNSIAEVIGRSWDDVLAAIDEIISTPEVDAELSEKVAEELAEEPEGAEEVAAEVEATQDTQEAAESDDEEADAPGDSVVLGGDRDFWLQVGIDPIQIMTGTATFYTLRCYLDDRPIFLGRNGRISVFGSERALARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGLVDDFADGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSVGKPTAPYAAAVREWEKLERFVESRLRRE>BL;Rv33S4, H37Rv2.tab 3769110:3769496 forward MW:12987MNLRRHQTLTLRLLAASAGILSAAAFAAPAQANPVDDAFIAALNNAGVNYGDPVDAKALGQSVCPILAEPGGSFNTAVAS(SEQ ID NO:599)VVARAQGMSQDMAQTFTSIAISMYCPSVMADVASGNLPALPDMPGLPGS>BL;Rv3368c, H37Rv2.tab 3780337:3780978 reverse MW:23733MTLNLSVDEVLTTTRSVRKRLDFDKPVPRDVLMECLELALQAPTGSNSQGWQWVFVEDAAKKKAIADVYLANARGYLSGP(SEQ ID NO:600)APEYPDGDTRGERMGRVRDSATYLAEHMHRAPVLLIPCLKGREDESAVGGVSFWASLFPAVWSFCLALRSRGLGSCWTTLIILLDNGEHKVADVLGIPYDEYSQGGLLPIAYTQGIDFRPAKRLPAESVTHWNGW>BL;Rv3412, H37Rv2.tab 3831725:3832132 forward MW:15269VRDHLPPGLPPDPFADDPCDPSAALEAVEPGQPLDQQERMAVEADLADLAVYEALLAHKGIRGLVVCCDECQQDHYHDWD(SEQ ID NO:601)MLRSNLLQLLIDGTVRPHEPAYDPEPDSYVTWDYCRGYADASLNEAAPDADRFRRR>BL;Rv3415c, H37Rv2.tab 3833696:3834520 reverse MW:28627VNETPHAPVVEQVLVAAAFGNQPGSWPLPTAITPHHLWLRAVAAGGQGRYAHAYGDLSVLRRLVPAGPLASLAHSTQGSL(SEQ ID NO:602)LRQLGWHTLARGWDGRALALAGADREAGADALIGLAADALGVGRFAAAGALLDRADPLVVSPLVADRLAVRRRWVAAELAMATGDGATAVRHAEEAVELTQAMAVASARHRvKSDVVLAAALCSAGAVARARAVGEEALDATARFGLLPLRWALACLLIDIGTVTFSAQQLRELTKIRNICAGQVRRAGGCWRTA>BL;Rv3438, H37Rv2.tab 3857396:3858235 forward MW:29209VPRIRKLVAALHRRGPHRVLRGDLAFAGLPGVVYTPEAGLHLPGVAFGHDWLTGTSRYSGLLEHLASWGIVAAAPDSERG(SEQ ID NO:603)LAPSVLNLAFDLGVALDIVAGVRLGPGKISVHPAKLGLVGHGFGGSAAVFAAAGLTGTHVKSVAAIFPTVTNPAAEQPAATLDVPGLILTAPGDPKTLTSNALGLSRAWDKATLRIVSKARAGGLVEGRRLTKVLGLPGPHRRTQRSVRALLTGYLLYTLGGDKTYRRFADPDLQLPKTDPIDPEAPPITPGEKIVTLLK>BL;Rv3587c, H37Rv2.tab 4028971:4029762 reverse MW:27067VLDLEPRGPLPTEIYWRRRGLALGIAVVVVGIAVAIVIAFVDSSAGAKPVSADKPASAQSHPGSPAPQAPQPAGQTEGNA(SEQ ID NO:604)AAAPPQGQNPETPTPTAAVQPPPVLKEGDDCPDSTLAVKGLTNAPQYYVGDQPKFTMVVTNIGLVSCKRDVGAAVLAAYVYSLDNKRLWSNLDCAPSNETLVKTFSPGEQVTTAVTWTGMGSAPRCPLPRPAIGPGTYNLVVQLGNLRSLPVPFILNQPPPPPGPVPAPGPAQAPPPESPAQGG>BL;Rv3603c, H37Rv2.tab 4045210:4046118 reverse MW:31104MERFDGLRPARLKVGIISAGRVGTALGVALQRADHVVVACSAISHASRRRAQRRLPDTPVLPPLDVAASAELLLLAVTDS(SEQ ID NO:605)ELAGLVSGLAATSAVRPQTIVAHTSGANGIGILAPLAQQGCIPLAIHPAMTFTGSDEDISRLPDTCFGITAADDVGYAIGQSLVLEMGGEPFCVREDARILYHAALAHASNHIVTVLADALEALRAALSGGELLGQQTVDDQPGGIVERIVGPLARAALENTLQRGQAALTGPVARGDAAAVADHLAALADVDAALAQAYRINALRTAQRAHAPADVVEVLTA>BL;Rv3604c, H37Rv2.tab 4046306:4047691 reverse MW:49862VPRAASAMAEPAMGVGRRRCWPGGRPGMRGCLRGEFGRTAYPAKPCGNRRTGATRGLTSPGYSQAMTVLSRGARVRRGGR(SEQ ID NO:606)RPGWVLLTALLVLAIGASSALVFTDRVELLKLAVLLALWAAVAGAFVSVLYRRQSDVDQARVRDLKLVYDLQLDREISARREYELTLESQLRRELASELRAPAADEVAALRAELAALRTSLEILFDADLEHRPALGTVEKEARAARALDGESPPADWVSSDRVMAVRGGDGASRTDEASIIDVPEVGVPPVSGGPRHYEAPPPPQPEPLFEPRHRPPPLPPQQERPVWQPVTSHGQWLPAETPGSQWASVEPETTPAAPPPGRRRRARHASPADQAYNPPAYVELAAQYGESGRRSRHSAEHRDHDIGGSGAGTGERPPSPPMAPPPPAEPTRRHRTADTPPDDSGGLHARDPLTGGQSVADLMARLQVESTGGGRRRRRGE>BL;Rv3605c, H37Rv2.tab 4047708:4048181 reverse MW:16789MGPTRKRDLTAAVVGAAAVGYLLVAVLYRWFPPITVWTGLSLLAVAVAEALWARYVRVKISDGEIGDGPGWLHPLVVARS(SEQ ID NO:607)LMVAKASAWVGALVTGWWIGVLAYFLPRRSWLRAAAEDTTGTVVAAGSALALVVAALWLQHCCKSPQDPTEHADGAES>BL;Rv3614c, H37Rv2.tab 4054145:4054696 reverse MW:19802VDLPGNDFDSNDFDAVDLWGADGAEGWTADPIIGVGSAATPDTGPDLDNAHGQAETDTEQEIALFTVTNPPRTVSVSTLM(SEQ ID NO:608)DGRIDHVELSARvAWMSESQLASEILVIADLARQKAQSAQYAFILDRMSQQVDADEHRvALLRKTVGETWGLPSPEEAAAAEAEVFATRYSDDCPAPDDESDPW>BL;Rv3615c, H37Rv2.tab 4054815:4055123 reverse MW:10795MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSL(SEQ ID NO:609)RIAAKIYSEADEAWRKAIDGLFT>BL;Rv3616c, H372V2.tab 4055200:4056375 reverse MW:39888MSRAFIIDPTISAIDGLYDLLGIGIPNQGGILYSSLEYFEKALEELAAAFPGDGWLGSAADKYAGKNRNHVNFFQELADL(SEQ ID NO:610)DRQLISLIHDQANAVQTTRDILEGAKKGLEFVRPVAVDLTYIPVVGHALSAAFQAPFCAGAMAVVGGALAYLVVKTLINATQLLKLLAKLAELVAAAIADIISDVADIIKGTLGEVWEFITNALNGLKELWDKLTGWVTGLFSRGWSNLESFFAGVPGLTGATSGLSQVTGLFGAAGLSASSGLAHADSLASSASLPALAGIGGGSGFGGLPSLAWVHAASTRQALRPRADGPVGAAAEQVGGQSQLVSAQGSQGMGGPVGMGGMHPSSGASKGTTTKKYSEGAAAGTEDAERAPVEADAGGGQKVLVRNVV>BL;Rv3619c, H37Rv2.tab 4059987:4060268 reverse MW:9832MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGGAGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGN(SEQ ID NO:611)NMAQTDSAVGSSWA>BL;Rv3632, H37Rv2.tab 4071236:4071577 forward MW:13068MNWIQVLLIASIIGLLFYLLRSRRSARSRAWVKVGYVLFVLAGIYAVLRPDDTTVVANWFGVRRGTDLMLYALVIVIAFSFT(SEQ ID NO:612)TLSTYMRFKDLELRYARIARALALEGAQAPEQCR>BL;Rv3647c, H37Rv2.tab 4087613:4088188 reverse MW:20314VSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASALAEMIQEAGLVPEVARTDENTPLVRTAVDP(SEQ ID NO:613)LLCGIAAEWTRGAVKTVPPRWLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHS PLASALMRvGIAPTLIGTRGTRPALRISGRRRLSRLVENVGEPPDGAEAWVQWPRT>BL;Rv3662c, H37Rv2.tab 4101268:4102035 reverse MW:26338VTVDPLAPLMELPGVAAASDRVRDALSRvHRHRANLRGWPVAAAEASLRAARASSVLDGGPARLHDAGAPTSGKPALSDP(SEQ ID NO:614)VFAGALRVGQALEGGAGPVVGVWRRAPLQALARLHMLAAADQVDDDRLGRPRSDADVGPRLELLALVVTHPTLASAPVVAAVAHGELLTLRPFGCADGVVARAVSRLVTIATGLDPHGLGVPEVIWMRQPAEYHDAARRFAGGTPDGVAGWLLLCCGAMLDGAREALSIAESLSPG>BL;Rv3668c, H37Rv2.tab 4109786:4110481 reverse MW:23102LQTAHRRFAAAFAAVLLAVVCLPANTAAADDKLPLGGGAGIVVNGDTMCTLTTIGHDKNGDLIGFTSAHCGGPGAQIAAE(SEQ ID NO:615)GAENAGPVGIMVAGNDGLDYAVIKFDPAKVTPVAVFNGFAINGIGPDPSFGQIACKQGRTTGNSCGVTWGPGESPGTLVMQVCGGPGDSGAPVTVDNLLVGMIHGAFSDNLPSCITKYIPLHTPAVVMSINADLADINAKNRPGAGFVPVPA>BL;Rv3669, H37Rv2.tab 4110827:4111342 forward MW:18887VSKIDRKNGVPSTLTTIPLADPHAGPAEPSIGDLIKDATTQMSTLVRAEVELARAEITRDVKKGLTGSVFFISSLVVGFY(SEQ ID NO:616)STFFFFFFVAELLDTWIWRWVAFLLVFAIMVVVTAVLALLGFLKVRRIRGPRQTIASVKETRTALTPGHDKTPVTPKPVTSDRATPVDPSGW>BL;Rv3680, H37Rv2.tab 4119795:4120952 forward MW:41405MSVTPKTLDMGAILADTSNRVVVCCGAGGVGKTTTAAALALRAAEYGRTVVVLTIDPAKRLAQALGINDLGNTPQRVPLA(SEQ ID NO:617)PEVPGELHAMMLDMRRTFDEMVMQYSGPERAQSILDNQFYQTVATSLAGTQEYMAMEKLGQLLSQDRWDLIVVDTPPSRNALDFLDAPKRLGSFMDSRLWRLLLAPGRGIGRLITGVMGLAMKALSTVLGSQMLADAAAFVQSLDATFGGFREKADRTYALLKRRGTQFVVVSAAEPDALREASFFVDRLSQESMPLAGLVFNRTHPMLCALPIERAIDAAETLDAETTDSDATSLAAAVLRIHAERGQTAKREIRLLSRFTGANPTVPVVGVPSLPFDVSDLEALRALADQLTTVGNDAGRAAGR>BL;Rv37OSc, H37Rv2.tab 4148321:4148962 reverse MW:22359MRIAAAVVSIGLAVIAGFAVPVADAHPSEPGVVSYAVLGKGSVGNIVGAPMGWEAVFTRPFQAFWVELPACNNWVDIGLP(SEQ ID NO:618)EVYDDPDLASFNGATTQTSATDQTHLVKQAVGVFASNDAADRAFHRvVDRTVGCSGQTTAIHLDDGTTQVWSFAGGPSTGTDEAWTKQEAGTDRRCFVQTRLRENVLLQAKVCQSGNAGPAVNVLAGAMQNTLG>BL;Rv3716c, H37Rv2.tab 4160515:4160913 reverse MW:13357MQPGGDMSALLAQAQQMQQKLLEAQQQLANSEVHGQAGGGLVKVVVKGSGEVIGVTIDPKVVDPDDIETLQDLIVGAMRD(SEQ ID NO:619)ASQQVTKMAQERLGALAGANRPPAPPAAPPGAPGMPGMPGMPGAPGAPPVPGI>BL;Rv3718c, H37Rv2.tab 4161818:4162258 reverse MW:15661MGQVSAASTILINAEPTATLDALADYETVRPKILSPHYSEYQVLEGGKGRGTVAKWRLQATQSRVRDVQVNVDVAGHTVI(SEQ ID NO:620)EKDMNSSMVTNWTVAPAGPGSSVTVKTTWTGAGGVKGFFEKTFAPLGLKKIQAEVLSNLKTELEGDA>BL;Rv3723, H37Rv2.tab 4168536:4169297 forward MW:27367MGRKVAVLWHASFSIGAGVLYFYFVLPRWPELMGDTGHSLGTGLRIATGALVGLAALPVVFTLLRTRKPELGTPQLALSM(SEQ ID NO:621)RIWSIMAHVLAGALIVGTAISEVWLSLDAAGQWLFGIYGAAAAIAVLGFFGFYLSFVAELPPPPPKPLKPKKPKQRRLRRKKTAKGDEAEPEAAEEAENTELAAQEDEEAVEAPPESIESPGGEPESATREAPAAETATAEEPRGGLRNRRPTGKTSHRRRRTRSGVQVAKVDE>BL;Rv3753c, H37Rv2.tab 4199724:4200221 reverse MW:17917MQRPAADTPDGFGVAVVREEGRWRCSPMGPKALTSLRAAETELRELRSAGAVFGLLDVDDEFFVIVRPAPSGTRLLLSDA(SEQ ID NO:622)TAALDYDIAAEVLDNLDAEIDPEDLEDADPFEEGDLGLLSDIGLPEAVLGVILDETDLYADEQLGRIAREMGFADQLSAVIDRLGR>BL;Rv3760, H37Rv2.tab 4205538:4205837 forward MW:10533VPGSVPGKAPEEPPVKFTRAAAVWSALIVGFLILILLLIFIAQNTASAQFAFFGWRWSLPLGVAILLAAVGGGLITVFAG(SEQ ID NO:623)TARILQLRRAAKKTHAAALR>BL;Rv3779, H37Rv2.tab 4224985:4226982 forward MW:71763VGLWFGTLIALILLIAPGAMVARIAQLRWPVAIAVGPALTYGVVALAIIPYGALGIPWNGWTALAALAVTCAVATGLQLL(SEQ ID NO:624)LARFRDLDAEALAVSRWPAVTVAAGVLLGALLIGWAAYRGIPHWQSIPSTWDAVWHANTVRFILDTGQASSTHMGELRNVETHAPLYYPSVFHGLVAVFCQLTGAAPTTGYTLSSLAASVWLFPVSAAVLTWRAVRSHPGALWSASCASAEWRAAGAAGTAAALSASFTAVPYVEFDTAAMPNLAAYGIAVPTMVLITSTLRHRDRIPVAVLALVGVFSLHITGGIVVALLVSAWWLFEALRHPVRSRLADLLTLAGVAAMAGLVMLPQFLSVRQQEDIIAGHAFPTYLSKKRGLFDAVFQHSRHLNDFPVQYALIVLAAIGGLILLVKKIWWPLAVWLLLIVMNVDAGTPLGGPIGGVAGALGEFFYHDPRRIAAATTLLLMLMAGVALFATVNLLVAAAKRLTDRFRPQPVSVWASATATLLIGATLVSAWHYFPRHRFLFGDKYDSVMIDQKDLDANAYLASLPGARDTLIGNANTDGTAWMYAVAGLHPLWTHYDYPLQQGPGYHRFIFWAYGRNGESDPRvLEAIQVLRIRYILTSTPTVRGFAVPDGLVSLETSRSWAKIYDNGEARIYEWRGTAAATHS>BL;Rv3780, H37Rv2.tab 4226989:4227522 forward MW:19484VRKRMVIGLSTGSDDDDVEVIGGVDPRLIAVQENDSDESSLTDLVEQPAKVMRIGTMIKQLLEEVRAAPLDEASRNRLRD(SEQ ID NO:625)IHATSIRELEDGLAPELREELDRLTLPFNEDAVPSDAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARAQLQQMRQGALPPGVGKSGQHGHGTGQYL>BL;Rv3792, H37Rv2.tab 4237932:4239860 forward MW:69516MPSRRKSPQFGHEMGAFTSARAREVLVALGQLAAAVVVAVGVAVVSLLAIARVEWPAFPSSNQLHALTTVGQVGCLAGLV(SEQ ID NO:626)GIGWLWRHGRFRRLARLGGLVLVSAFTVVTLGMPLGATKLYLFGISVDQQFRTEYLTRLTDTAALRDMTYIGLPPFYPPGWFWIGGRAAALTGTPAWEMFKPWAITSMAIAVAVALVLWWRMIRFEYALLVTVATAAVMLAYSSPEPYAAMITVLLPPMLVLTWSGLGARDRQGWAAVVGAGVFLGFAATWYTLLVAYGAFTVVLMALLLAGSRLQSGIKAAVDPLCRLAVVGAIAAAIGSTTWLPYLLRAARDPVSDTGSAQHYLPADGAALTFPMLQFSLLGAICLLGTLWLVMRARSSAPAGALAIGVLAVYLWSLLSMLATLARTTLLSFRLQPTLSVLLVAAGAFGFVEAVQALGKRGRGVIPMAAAIGLAGAIAFSQDIPDVLRPDLTIAYTDTDGYGQRGDRRPPGSEKYYPAIDAAIRRVTGKRRDRTVVLTADYSFLSYYPYWGFQGLTPHYANPLAQFDKRATQIDSWSGLSTADEFIAALDKLPWQPPTVFLMRHGAHNSYTLRLAQDVYPNQPNVRRYTVDLRTALFADPRFVVEDIGPFVLAIRKPQESA>BL;Rv38O2c, H37Rv2.tab 4263358:4264365 reverse MW:35448MAKNSRRKRHRILAWIAAGAMASVVALVIVAVVIMLRGAESPPSAVPPGVLPPGPTPAHPHKPRPAFQDASCPDVQMISV(SEQ ID NO:627)PGTWESSPQQNPLNPVQFPKALLLKVTGPIAQQFAPARVQTYTVAYTAQFHNPLTTDNQMSYNDSRAEGTRAMVAAMTDMNNRCPLTSYVLIGFSQGAVIAGDVASDIGNGRGPVDEDLVLGVTLIADGRRQQGVGNQVPPSPRGEGAEITLHEVPVLSGLGLTMTGPRPGGFGALDGRTNEICAQGDLICAAPAQAFSPANLPTTLNTLAGGAGQPVHAMYATPEFWNSDGEPATEWTLNWAHQLIENAPHPKHR>BL;Rv3805c, H37Rv2.tab 4266956:4268836 reverse MW:68710MVRVSLWLSVTAVAVLFGWGSWQRRWIADDGLIVLRTVRNLLAGNGPVFNQGERVEANTSTAWTYLLYVGGWVGGPMRLE(SEQ ID NO:628)YVALALAMVLSLLGMVLLMLGTGRLYAPSLRGRRAIMLPAGALVYIAVPPARDFATSGLESGLVLAYLGLLWWMMVCWSQPLRARPDSQMFLGALAFVAGCSVLVRPEFALIGGLALIMMLIAARTWRRRVLIVLAGGFLPVAYQIFRMGYYGLLVPSTALAKDAAGDKWSQGMIYVSNFNRPYALWVPLVLSVPLGLLLMTARRRPSFLRPVLAPDYGRvARAVQSPPAVVAFIVGSGVLQALYWIRQGGDFMHGRvLLAPLFCLLAPVGVIPILLPDGKDFSRETGRWLVGALSGLWLGIAGWSLWAANSPGMGDDATRVTYSGIVDERRFYAQATGHAHPLTAADYLDYPRMAAVLTALNNTPEGALLLPSGNYNQWDLVPMIRPSSGTAPGGKPAPKPQHAVFFTNMGMLGMNVGLDVRVIDQIGLVNPLAAHTERLKHARIGHDKNLFPDWVIADGPWVKWYPGIPGYIDQQWVTQAEAALQCPATRAVLNSVRAPITLHRFLSNVLHSYEFTRYRIDRVPRYELVRCGLDVPDGPGPPPRE>BL;Rv3807c, H37Rv2.tab 4269843:4270337 reverse MW:17218MVAVQSALVDRPGMLATARGLSHFGEHCIGWLILALLGAIALPRRRREWLVAGAGAFVAHAIAVLIKRLVRRQRPDHPAI(SEQ ID NO:629)AVNVDTPSQLSFPSAHATSTTAAALLMGRATGLPLPVVLVPPMALSRILLGVHYPSDVAVGVALGATVGAIVDSVGGGRQRARKR>BL;Rv3808c, H37Rv2.tab 4270369:4272279 reverse MW:71507MSELAASLLSRVILPRPGEPLDVRKLYLEESTTNARRAHAPTRTSLQIGAESEVSFATYFNAFPASYWRRWTTCKSVVLR(SEQ ID NO:630)VQVTGAGRVDVYRTKATGARIFVEGHDFTGTEDQPAAVETEVVLQPFEDGGWVWFDITTDTAVTLHSGGWYATSPAPGTANIAVOIPTFNRPADCVNALRELTADPLVDQVIGAVIVPDQGERKVRDHPDFPAAAARLGSRLSIHDQPNLGGSGGYSRVMYEALKNTDCQQILFMDDDIRLEPDSILRvLAMHRFAKAPMLVGGQMLNLQEPSHLNIMGEVVDRSIFMWTAAPHAEYDHDFAEYPLNDNNSRSKLLHRRIDVDYNGWWTCMIPRQVAEELGQPLPLFIKWDDADYGLRAAEHGYPTVTLPGAAIWHMAWSDKDDAIDWQAYFHLRNRLVVAAMHWDGPKAQVIGLVRSHLKATLKHLACLEYSTVAIQNKAIDDFLAGPEHIFSILESALPQVHRIRKSYPDAVVLPAASELPPPLHKNKAMKPPVNPLVIGYRLARGIMHNLTAANPQHHRRPEFNVPTQDARWFLLCTVDGATVTTADGCGVVYRQRDRAKNFALLWQSLRRQRQLLKRFEEMRRIYRDALPTLSSKQKWETALLPAANQEPEHG>BL;Rv3821, H37Rv2.tab 4285973:4286683 forward MW:24627MWSTVLVLALSVICEPVRIGLVVLMLNRRRPLLHLLTFLCGGYTMAGGVANVTLVVLGATPLAGHFSVAEVQIGTGLIAL(SEQ ID NO:631)LIAFALTTNVIGKHVRRATHARVGDDGGRVLRESVPPSGAHKLAVRARCFLQGDSLYVAGVSGLGAALPSANYMGAMAAILASGATPATQALAVVTFNVVAFTVAEVPLVSYLAAPRKTRAFMAALQSWLRSRSRRDAALLVAAGGCLMLTLGLSNL>BL;Rv3835, H37Rv2.tab 4309047:4310393 forward MW:47043MLDAPEQDPVDPGDPASPPHGEAEQPLPGPRWPRALRASATRRALLLTALGGLLIAGLVTAIPAVGRAPERLAGYIASNP(SEQ ID NO:632)VPSTGAKINASFNRVASGDCLMWPDGTPESAAIVSCADEHRFEVAESIDMRTFPGMEYGQNAAPPSPARIQQISEEQCEAAVRRYLGTKFDPNSKFTISMLWPGDRAWRQAGERRMLCGLQSPGPNNQQLAFKGKVADIDQSKVWPAGTCLGIDATTNQPIDVPVDCAAPHAMEVSGTVNLAERFPDALPSEPEQDGFIKDACTRMTDAYLAPLKLRTTTLTLIYPTLTLPSWSAGSRVVACSIGATLGNGGWATLVNSAKGALLINGQPPVPPPDIPEERLNLPPIPLQLPTPRPAPPAQQLPSTPPGTQHLPAQQPVVTPTRPPESHAPASAAPAETQPPPPDAGAPPATQSPEATPPGPAEPAPAG>BL;Rv3843c, H37Rv2.tab 4315571:4316596 reverse MW:37353VIQVCSQCGTGWNVRERQRvWCPRCRGMLLAPLADMPAEARWRTPARPQVPTASDTRRTPPRLPPGFRWIAVRPGAAPPP(SEQ ID NO:633)RHGPRLRGPTPRYAGIPRWGLTDHVDQAPVPASAKAGPSPAAVRTTLLVSLLVFSIAVVVFVVRYVLLVINRNTLLNSVVASASVWLGVLVSLAAIAAAGTTIVLLVRWLVARRAAAFMHQGLPERRSARELWAGCLLPMVNLLWAPLYVIELALVEDRYTRLRRPIVVWWIVWIVSNAISMFAFATSWVTDAQGIANNTTMMVLAYLCAAAAVAAAARVFEGFEQKPVERPAHRWVVVNTDGRSAPASSVAVELDGQEPAA>BL;Rv3847, H37Rv2.tab 4321538:4322068 forward MW:18278MGTGSGGPIGVSPFHSRGALKGFVISGRWPDSTKEWAQLLMVAVRvASLPGLLSTTTVFGAREELPDEPEPGTVGLVLAE(SEQ ID NO:634)GTVFGESAIQPGYFADHQPPALLMLHPPSETTPSLPECTGAASGCVLLPGLPYLGLEHRAAWVEAEADGTITSMVSRVGVDPISHPDTAILAMLLAA>BL;2V3849, H37Rv2.tab 4323499:4323894 forward MW:14708MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPYLSQLRSGNRTNPSGATMAALANFERIKAAYFTDDEY(SEQ ID NO:635)YEKLDKELQWLCTMRDDGVRRIAQRAHGLPSAAQQKVLDRIDELRRAEGIDA>BL;Rv3850, M37Rv2.tab 4324015:4324668 forward MW:23811MGLFGKRKSRATRRAEARAIKARAKLEAKLSAKNEARRIKAAQRAESKALKAQLKARRDSDRAALKVAEAELKVAREGKL(SEQ ID NO:636)LSPTRIRRLLTVSRLLAPILTPVIYRAANAARGLIDQRRADQLGVPLAQIGRFSGHGARLSARvGGAERSLRMVQEKKPKDVETKQFVSAVTNRLTDLSAIWAAAEHMPAKRRRTAHSAISSQLDGIEADLMARLGLT>BL;Rv3867, H37Rv2.tab 4342770:4343318 forward MW:19945MVDPPGNDDDHGDLDALDFSAAHTNEASPLDALDDYAPVQTDDAEGDLDALHALTERDEEPELELFTVTNPQGSVSVSTL(SEQ ID NO:637)MDGRIQHVELTDKATSMSEAQLADEIFVIADLARQKARASQYTFMVENIGELTDEDAEGSALLREFVGMTLNLPTPEEAAAAEAEVFATRYDVDYTSRYKADD>BL;Rv3869, H37Rv2.tab 4345039:4346478 forward MW:51093MGLRLTTKVQVSGWRFLLRRLEHAIVRRDTRMFDDPLQFYsRSIALGIVVAVLILAGAALLAYFKPQGKLGGTSLFTDRA(SEQ ID NO:638)TNQLYVLLSGQLHPVYNLTSARLVLGNPANPATVKSSELSKLPMGQTVGIPGAPYATPVSAGSTSIWTLCDTVARADSTSPVVQTAVIANPLEIDASIDPLQSHEAVLVSYQGETWIVTTKGRHAIDLTDRALTSSMGIPVTARPTPISEGMFNALPDMGPWQLPPIPAAGAPNSLGLPDDLVIGSVFQIHTDKGPQYYVVLPDGIAQVNATTAAALRATQAHGLVAPPAMVPSLVVRIAERvYPSPLPDEPLKIVSRPQDPALCWSWQRSAGDQSPQSTVLSGRHLPISPSAMNMGIKQIHGTATVYLDGGKFVALQSPDPRYTESMYYIDPQGVRYGVPNAETAKSLGLSSPQNAPWEIVRLLVDGPVLSKDAALLEHDTLPADPSPRKVPAGASGAP>DL;Rv3874, H37Rv2.tab 4352274:4352573 forward MW:10793MAEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTAAQAAVVRFQEAANKQKQELDEISTNIRQAG(SEQ ID NO:639)VQYSRADEEQQQALSSQMGF>BL;Rv3876, H37Rv2.tab 4353010:4355007 forward MW:70645MAADYDKLFRPHEGMEAPDDMAAQPFFDPSASFPPAPASANLPKPNGQTPPPTSDDLSERFVSAPPPPPPPPPPPPPTPM(SEQ ID NO:640)PIAAGEPPSPEPAASKPPTPPMPIAGPEPAPPKPPTPPMPIAGPEPAPPKPPTPPMPIAGPAPTPTESQLAPPRPPTPQTPTGAPQQPESPAPHVPSHGPHQPRRTAPAPPWAKMPIGEPPPAPSRPSASPAEPPTRPAPQHSRRARRGHRYRTDTERNVGKVATGPSIQARLRAEEASGAQLAPGTEPSPAPLGQPRSYLAPPTRPAPTEPPPSPSPQRNSGRRAERRVHPDLAAQNAAAQPDSITAATTGGRRRKRAAPDLDATQKSLRPAAKGPKVKKVKPQKPKATKPPKVVSQRGWRHWVHALTRINLGLSPDEKYELDLHARVRRNPRGSYQIAVVGLKGGAGKTTLTAALGSTLAQVRADRILALDADPGAGNLADRVGRQSGATIADVLAEKELSHYNDIRAHTSVNAVNLEVLPAPEYSSAQRALSDADWHFIADPASRFYNLVLADCGAGFFDPLTRGVLSTVSGVVVVASVSIDGAQQASVALDWLRNNGYQDLASRACVVINHIMPGEPNVAVKDLVRHFEQQVQPGRVVVMPWDRHIAAGTEISLDLLDPIYKRKVLELAAALSDDFERAGRR>BL;Rv3877, H37Rv2.tab 4355007:4356539 forward MW:53981LSAPAVAAGPTAAGATAARPATTRVTILTGRRMTDLVLPAAVPMETYIDDTVAVLSEVLEDTPADVLGGFDFTAQGVWAF(SEQ ID NO:641)ARPGSPPLKLDQSLDDAGVVDGSLLTLVSVSRTERYRPLVEDVIDAIAVLDESPEFDRTALNRFVGAAIPLLTAPVIGMAMRAWWETGRSLWWPLAIGILGIAVLVGSFVANRFYQSGHLAECLLVTTYLLIATAAALAVPLPRGVNSLGAPQVAGAATAVLFLTLMTRGGPRKRHELASFAVITAIAVIAAAAAFGYGYQDWVPAGGIAFGLFIVTNAAKLTVAVARIALPPIPVPGETVDNEELLDPVATPEATSEETPTWQAIIASVPASAVRLTERSKLAKQLLIGYVTSGTLILAAGAIAVVVRGHFFVHSLVVAGLITTVCGFRSRLYAERWCAWALLAATVAIPTGLTAKLIIWYPHYAWLLLSVYLTVALVALVVVGSMAHVRRvSPVVKRTLELIDGAMIAAIIPMLLWITGVYDTVRNIRF>BL;Rv3880c, H37Rv2.tab 4360202:4360546 reverse MW:12167VSMDELDPHVARALTLAARFQSALDGTLNQMNNGSFRATDEAETVEVTINGHQWLTGLRIEDGLLKKLGAEAVAQRVNEA(SEQ ID NO:642)LHNAQAASAYNDAAGEQLTAALSAMSRANNEGMA>BL;Rv3882c, H37Rv2.tab 4362035:4363420 reverse MW:50397MRNPLGLRFSTGHALLASALAPPCIIAFLETRYWWAGIALASLGVIVATVTFYGRRITGWVAAVYAWLRRRRRPPDSSSE(SEQ ID NO:643)PVVGATVKPGDHVAVRWQGEFLVAVIELIPRPFTPTVIVDGQAHTDDMLDTGLVEELLSVHCPDLEADIVSAGYRVGNTAAPDVVSLYQQVIGTDPAPANRRTWIVLRADPERTRKSAQRRDEGVAGLARYLVASATRIADRLASHGVDAVCGRSFDDYDHATDIGFVREKWSMIKGRDAYTAAYAAPGGPDVWWSARADHTITRVRVAPGMAPQSTVLLTTADKPKTPRGFARLFGGQRPALQGQHLVANRHCQLPIGSAGVLVGETVNRCPVYMPFDDVDIALNLGDAQTFTQFVVRAAAAGAMVTVGPQFEEFARLIGAHIGQEVKVAWPNATTYLGPHPGIDRVILRHNVIGTPRHRQLPIRRvSPPEESRYQMALPK>BL;Rv3909, H37Rv2.tab 4394192:4396597 forward MW:83878VTALQLGWAALARVTSAIGVVAGLGMALTVPSAAPHHALAGEPSPTPFVQVRIDQVTPDVVTTSSEPHVTVSGTVTNTGDR(SEQ ID NO:644)PVRDVMVRLEHAAAVTSSTALRTSLDGGTDQYQPAADFLTVAPELDRGQEAGFTLSAPLRSLTRPSLAVNQPGIYPVLLVNVNGTPDYGAPARLDNARFLLPVVGVPPDQATDFGSAVAPETTAPVWITMLWPLADRPRLAPGAPGGTVPVRLVDDDLANSVNGTPDYGAPARLDNARFLLPVVGVPPDQATDFGSAVAPETTAPVWITMLWPLADRPRLAPGAPGGTVPVRLVDDDLANSLANGGRLDILLSAAEFATNREVDPDGAVGRALCLAIDPDLLITVNAMTGGYVVSDSPDGAAQLPGTPTHPGTGQAAASSWLDRLRTLVHRTCVTPLPFAQADLDALQRVNDPRLSAIATISPADIVDRILDVSSTRGATVLPDGPLTGRAINLLSTHGNTVAVAAADFSPEEQQGSSQIGSALLPATAPRRLSPRVVAAPFDPAVGAALAAAGTNPTVPTYLDPSLFVRIAHESITARRQDALGAMLWRSLEPNAAPRTQILVPPASWSLASDDAQVILTALATAIRSGLAVPRPLPAVIADAAARTEPPEPPGAYSAARGRFNDDITTQIGGQVARLWKLTSALTIDDRTGLTGVQYTAPLREDMLRALSQSLPPDTRNGLAQQRLAVVGKTIDDLFGAVTIVNPGGSYTLATEHSPLPLALHNGLAVPIRVRLQVDAPPGMTVADVGQIELPPGYLPLRVPIEVNFTQRVAVDVSLRTPDGVALGEPVRLSVHSNAYGKVLFAITLSAAAVLVTLAGRRLWHRFRGQPDRADLDRPDLPTGKHAPQRRAVASRDDEKHRV


[0236]

































































































































































































Claims
  • 1. A method for identification and the selection of essential genes for the survival or the virulence of mycobacterium species which comprises: a. Aligning the genomic sequence of a first mycobacterium species on a genomic sequence of the genomic sequence of a second mycobacterium species, b. Selecting a polypolynucleotide sequence highly conserved in both genomes with no counterparts in other bacterial genomic sequences and which corresponds to an essential gene for the survival or the virulence of mycobacterium species, and c. Optionally, testing the polypolynucleotide selected in step b) for its capacity of virulence or involved in the survival of a mycobacterium species said testing being based on the activation or inactivation of said polypolynucleotide in a bacterial host or said testing being based on the activity of the product of expression of said polynucleotide in vivo or in vitro.
  • 2. A method according to claim 1, wherein the first genomic sequence of mycobacterium belongs to Mycobacterium tuberculosis.
  • 3. A method according to claim 1, wherein the second genomic sequence of mycobacterium belongs to Mycobacterium leprae.
  • 4. A method according to any one of claims 1 to 3, wherein the complete genomic sequence of said mycobacterium species is analysed.
  • 5. A method for the identification and the selection in silico of essential genes for the survival or the virulence of mycobacterium species according to any one of claims 1 to 4.
  • 6. Purified polynucleotide molecule obtained by the method according to any one of claims 1 to 5.
  • 7. A purified polynucleotide molecule of claim 6 which encodes essential proteins or fragments of proteins of Mycobacteium species.
  • 8. A purified polynucleotide molecule of a formula selected from the group consisting of polynucleotidic sequences, which encode for polypeptides and regulatory sequences essential for the virulence and/or the survival of mycobacterium which are, in one hand, specific to Mycobacterium tuberculosis and, in the other hand, specific to Mycobacterium leprae, that is to say, said polynucleotidic sequences are not found in publicly accessible banks of non-Mycobacterium tuberculosis and non-Mycobacterium leprae genome.
  • 9. A purified polynucleotide molecule according to claim 8 obtained by the method according to any one of claims 1 to 5.
  • 10. A purified polynucleotide molecule that hybridizes to either strand of a denratured, double-stranded DNA comprising the purified polynucleotide sequence according to claims 6 to 9 under conditions of moderate stringency in 50% formamide and 6× SSC at 42° C. with washing conditions of 60° C., 0.5×SSC, 0.1% of SDS.
  • 11. The purified polynucleotide molecule as claimed in claim 10, wherein said purified polynucleotide molecule is derived by mutagenesis.
  • 12. A purified polynucleotide molecule degenerate from the purified polynucleotide molecule according to any one of claims 6 to 9 as a result of the genetic code.
  • 13. A purified polynucleotide according to any one of claims 6 to 9 which encodes M. tuberculosis or M. leprae marker polypeptide.
  • 14. A purified polynucleotide molecule according to any one of claims 6 to 9 which encodes an allelic variant of M. tuberculosis or M. leprae marker polypeptide according to claim 13.
  • 15. A purified polynucleotide molecule according to any one of claims 6 to 9 which encodes a similar sequence of M. tuberculosis or M. leprae marker polypeptide DNA according to claim 13.
  • 16. A purified polypeptide encoded by a polynucleotide molecule according to claim 8.
  • 17. A purified polypeptide of a formula selected from the group consisting of SEQ ID NO: X to SEQ ID NO: Y.
  • 18. A purified polypeptide according to claim 17 encoded by a polynucleotide molecule according to claims 6 or 9.
  • 19. A purified polypeptide according to claim 16 in non-glycosylated form.
  • 20. A purified polypeptide according to claim 16 in glycosylated form.
  • 21. A purified polypeptide according to claim 17 in non-glycosylated form.
  • 22. A purified polypeptide according to claim 17 in glycosylated form.
  • 23. A purified polypeptide according to claim 18 in non-glycosylated form.
  • 24. A purified polypeptide according to claim 18 in non-glycosylated form.
  • 25. Process of screening of active molecules comprising: a. Preparation of at least one purified polynucleotidic molecule or fragment thereof according to claims 6 to 9 in an acceptable medium, b. Contacting the purified polynucleotide sequence or a fragment thereof corresponding to said essential gene of interest with an active molecule to be tested, and c. Selecting the active molecule by inhibition or activation of the activity of the purified polynucleotide compared to the standard activity of the essential gene in absence of said active molecule.
  • 26. Process of screening of an active molecule comprising: a. Preparation of at least one purified polypeptide according to claims 16 to 24 or fragment thereof to be used as a target in an acceptable medium, b. Contacting said purified polypeptide or fragment thereof obtained in step a) with an active molecule to be tested, and c. Selecting the active molecule by inhibition or activation of the activity of the purified polypeptide obtainable after expression of the essential gene selected according to claim 1 and compared to the standard activity of said polypeptide.
  • 27. A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001 at the C.N.C.M. under the accession number I-2625.
  • 28. A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001 at the C.N.C.M. under the accession number I-2626.
  • 29. A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001 at the C.N.C.M. under the accession number I-2627.
  • 30. A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001 at the C.N.C.M. under the accession number I-2628.
  • 31. A recombinant BAC containing a fragment of M. tuberculosis genome deposited on Feb. 20, 2001 at the C.N.C.M. under the accession number I-2629.
  • 32. A recombinant cosmid containing a fragment of M. leprae genome deposited on Feb. 21, 2001 at the C.N.C.M. under the accession number I-2632.
  • 33. A recombinant cosmid containing a fragment of M. leprae genome deposited on Feb. 21, 2001 at the C.N.C.M. under the accession number I-2633.
  • 34. A recombinant purified vector that directs the expression of a polynucleotide molecule selected from the group consisting of purified polynucleotide molecule selected according to claims 1 or 6, 7, 8, 9, 13, 14 and 15.
  • 35. A recombinant purified vector that directs the expression of a purified polynucleotide molecule according to claims 10 or 11.
  • 36. A recombinant purified vector that directs the expression of a purified polynucleotide molecule according to claim 12.
  • 37. A recombinant purified vector containing a part of the polynucleotide insert of claims 27 to 32 or claims 33 to 35 which is a plasmid.
  • 38. A host cell transfected or transduced with the vector of claim 34.
  • 39. A host cell transfected or transduced with the vector of claim 35.
  • 40. A host cell transfected or transduced with the vector of claim 36.
  • 41. A host cell transfected or transduced with the plasmid of claim 37.
  • 42. A method for the production of mycobacterium purified marker polypeptide comprising culturing a host cell of claim 38 under conditions promoting expression, and recovering the polypeptide from the culture medium.
  • 43. A method for the production of mycobacterium purified marker polypeptide comprising culturing a host cell of claim 39 under conditions promoting expression, and recovering the polypeptide from the culture medium.
  • 44. A method for the production of mycobacterium purified marker polypeptide comprising culturing a host cell of claim 40 under conditions promoting expression, and recovering the polypeptide from the culture medium.
  • 45. A method for the production of mycobacterium purified marker polypeptide comprising culturing a host cell of claim 41 under conditions promoting expression, and recovering the polypeptide from the culture medium.
  • 46. The method according to any one of claims 42 to 45, wherein the host cell is selected from the group consisting of bacterial cells, yeast cells, plant cells, and mammalian cells.
  • 47. An immunological complex comprising a Mycobacterium purified marker polypeptide produced by a method according to any one of claims 42 to 45 and an antibody that specifically recognizes said polypeptide.
  • 48. A composition comprising at least a mycobacterium purified polypeptide marker produced by a method according to any one of claims 42 to 45.
  • 49. A method for detecting infection by mycobacteria, said method comprises: a. Providing a composition according to claim 48 with a biological sample suspected to be infected with a mycobacterium, b. Assaying for the presence of said mycobacterium, and c. Optionally, detecting the presence of mycobacteria in said biological sample if infected.
  • 50. The method of claim 49, which in step b) the assay is performed by electrophoresis or by immunoassay with antibodies that are immunologically reactive with M. tuberculosis and/or M. leprae.
  • 51. An in vitro diagnostic method for the detection of the presence or the absence of antibodies which bind to an antigen or fragment of antigen comprising a mycobacterium purified polypeptide molecule according to any one of claims 16 to 24, wherein the method comprises contacting the antigen or fragment of antigen with a biological fluid for a time and under conditions sufficient for the antigen and antibodies in the biological fluid to form an immunological complex, detecting the formation of the complex, and optionally measuring the formation of the immunologicalcomplex.
  • 52. The method as claimed in claim 51, wherein the formation of the immunological complex is detected by immunoassay method based on western blot technique, ELISA, indirect immuno-fluorescense assay, or immunoprecipitation assay.
  • 53. An in vitro diagnostic method for the detection of the presence or the absence of antibodies which bind to an antigen or fragment of antigen comprising a mycobacterium purified polypeptide molecule obtained by a method according to any one of claims 42 to 45, wherein the method comprises contacting the antigen or fragment of antigen with a biological fluid for a time and under conditions sufficient for the antigen and antibodies in the biological fluid to form an immunological complex, detecting the formation of the complex, and optionally measuring the formation of the immunological complex.
  • 54. The method as claimed in claim 53, wherein the formation of the immunological complex is detected by immunoassay method based on western blot technique, ELISA, indirect immuno-fluorescence assay, or immunoprecipitation assay.
  • 55. A kit for the in vitro diagnostics of mycobacterium infections comprising: a. A mycobacterium purified polypeptide molecule according to any one of claims 16 to 24 or mixture thereof, b. Antibodies capable of forming an immunological complex with said polypeptides, and c. Acceptable medium to permit the detection of the formation of the complex thereof.
  • 56. A kit as claimed in claim 55 useful for the detection of M. tuberculosis infections.
  • 57. A kit as claimed in claim 55 useful for the detection of M. leprae infections.
  • 58. An immunogenic composition comprising at least a purified polypeptide according to any one of claims 16 to 24 in an amount sufficient to induce an immunogenic or protective response in vivo, and a pharmaceutically acceptable carrier therefor.
  • 59. A polynucleotidic probe comprising a purified polynucleotide molecule or fragment thereof according to any one of claims 6 to 15.
  • 60. A polynucleotidic probe which is complementary to the full length sequence of a purified nucleic acid that hybridizes under conditions of moderate stringency in 50% formamide and 6× SSC at 42° C. with washing conditions of 60° C., 0.5×SSC, 0.1% SDS with a nucleic acid encoding a purified polypeptide according to any one of claims 6 to 24.
  • 61. A method for the detection of the presence or the absence of mycobacteria in a sample comprising: a. contacting a sample suspected to contain genetic material of mycobacteria with at least one probe according to claims 59 or 60, b. Detecting the hybridization under conditions of moderate stringency in 50% formamide and 6× SSC at 42° C. with washing conditions of 60° C., 0.5×SSC, 0.1% SDS.
  • 62. A method for the detection of the presence or the absence of mycobacteria according to claim 61, wherein said method is specific for the detection of M. tuberculosis infections.
  • 63. A method for the detection of the presence or the absence of mycobacteria according to claim 61, wherein said method is specific for the detection of M. leprae infections.
  • 64. A method for the detection of the presence or the absence of mycobacteria according to any one of claims 61 to 63, wherein the sample contains nucleic acids of at least one microorganism other than the mycobacteria.
  • 65. A method of selection according to claim 1, wherein the comparison of the genetic informations of different types of organisms, wherein the method comprises: a. Providing a database including sequence libraries for a plurality of types of organism, said libraries having multiple genomic sequences, b. Providing one or more probe sequences according to claims 59 or 60, c. Determining homologous matches between one or more of said probe sequences and one or more sequences of said sequences in said genomic libraries; and d. Displaying the results of said determination.
  • 66. A method according to claims 1 to 4, wherein the genomic sequence of a first mycobacterium species is the recombinant BAC deposited at the C.N.C.M. according to claim 27.
  • 67. A method according to claims 1 to 4, wherein the genomic sequence of a first mycobacterium species is the recombinant BAC deposited at the C.N.C.M. according to claim 28.
  • 68. A method according to claims 1 to 4, wherein the genomic sequence of a first mycobaterium species is the recombinant BAC deposited at the C.N.C.M. according to claim 29.
  • 69. A method according to claims 1 to 4, wherein the genomic sequence of a first mycobacterium species is the recombinant BAC deposited at the C.N.C.M. according to claim 30.
  • 70. A method according to claims 1 to 4, wherein the genomic sequence of a first mycobacterium species is the recombinant BAC deposited at the C.N.C.M. according to claim 31.
  • 71. A method according to claims 1 to 4, wherein the genomic sequence of the second mycobacterium species is the recombinant BAC deposited at the C.N.C.M. according to claim 32.
  • 72. A method according to claims 1 to 4, wherein the genomic sequence of the second mycobacterium species is the recombinant BAC deposited at the C.N.C.M. according to claim 33.
  • 73. An in vitro diagnostic method for the detection of the presence or the absence of essential nucleotidic sequences for the survival or the virulence in mycobacterium by hybridization or amplification of said specific sequence comprising: a. Providing a composition comprising a probe according to claims 59 or 60 with a sequence library of interest to be tested in an acceptable medium and in sufficient time to obtain an hybridization and/or an amplification of said sequence, b. Purifying the sequence which hybridizes with said probe; and c. Optionally, quantifying said sequence.
  • 74. A method according to claim 1, wherein the first genomic sequence of mycobacterium belongs to Mycobacterium microti.
CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is based on and claims the benefit of U.S. Provisional application Ser. No. 60/270,123, filed Feb. 22, 2001 (attorney docket no. 03495.6061) The entire disclosure of this application is relied upon and incorporated by reference herein.

Provisional Applications (1)
Number Date Country
60270123 Feb 2001 US