POLYPEPTIDES AND IMMUNIZING COMPOSITIONS CONTAINING BURKHOLDERIA POLYPEPTIDES AND METHODS OF USE

Abstract
This disclosure provides isolated polypeptides isolatable from a Burkholderia spp., compositions that include one or more of the polypeptides, and methods for making and methods for using the polypeptides. This disclosure also provides antibody that specifically binds one or more of the polypeptides isolatable from a Burkholderia spp, compositions that include such antibody, and methods for using such antibody.
Description
BACKGROUND


Burkholderia is a genus of proteobacteria known for its pathogenic members: Burkholderia mallei, responsible for glanders, a disease that occurs mostly in horses and related animals; Burkholderia pseudomallei, causative agent of melioidosis; and the Burkholderia cepacia complex, which includes pathogens that are involved in pulmonary infections of people with cystic fibrosis (CF). The Burkholderia genus name refers to a group of common gram-negative, motile, obligately aerobic rod-shaped bacteria including animal, human, and plant pathogens. Due to their antibiotic resistance and the high mortality rate from their associated diseases, Burkholderia mallei and Burkholderia pseudomallei are considered to be potential biological warfare agents, with livestock and humans as potential targets.


Humans and animals are believed to acquire Burkholderia infection by inhalation of contaminated dust or water droplets, ingestion of contaminated water, and contact with contaminated soil, especially through skin abrasions.


Many non-human animal species can be susceptible to melioidosis caused by Burkholderia pseudomallei, including many livestock and/or companion animal species such as, for example, sheep, goats, horses, swine, cattle, dogs, and cats.


In the absence of treatment with appropriate antibiotics, the septicemic form of melioidosis has a mortality rate that exceeds 90%. With appropriate antibiotic treatment, the mortality rate is about 10% for uncomplicated cases but up to 80% for cases with bacteremia or severe sepsis. Because of its severe course of infection, aerosol infectivity, and worldwide availability, B. pseudomallei is identified as a potential agent of biological warfare or bioterrorism and is listed on the Centers for Disease Control list as a Category B bioterrorism agent. There is currently no vaccine and the organism is often refractory to antibiotic therapy, especially after it is established in a host.





BRIEF DESCRIPTION OF THE FIGURES


FIG. 1. SDS-PAGE (10%) analysis of polypeptides isolated from B. thailandensis grown in either iron-rich (Fe, Lane 2) or iron-restricted (DP, Lane 3) medium. Lane 1: molecular weight standard.



FIG. 2. Two-dimensional gel electrophoresis analysis of extracts from B. thailandensis grown in either (A) iron-rich or (B) iron-depleted medium.



FIG. 3. Data from a vaccine trial in A/J mice. Kaplan-Meier survival curve of A/J mice immunized twice with varying doses of vaccine (50 ng/dose, 100 ng/dose, or 300 ng/dose) or a placebo (adjuvant only) and challenged intratracheally with B. thailandensis.



FIG. 4. Data from a vaccine trial in BALB/c mice. Kaplan-Meier survival curve of BALB/c mice immunized twice with varying doses of vaccine (50 ng/dose, 100 ng/dose, or 300 ng/dose) or a placebo (adjuvant only) and challenged intratracheally with B. thailandensis.



FIGS. 5A-B. Protein conservation across Burkholderia strains and species. (A) Heat map showing the percent amino acid identity for 25 receptor proteins was determined in 12 strains of B. pseudomallei, seven strains of B. mallei, three strains of B. thailandensis, six strains of B. cenocepacia, and five strains of B. multivorans using B. pseudomallei K96243 as the query. (B) Heat map showing the percent amino acid identity for 25 receptor proteins was determined in 12 strains of B. pseudomallei, seven strains of B. mallei, three strains of B. thailandensis, six strains of B. cenocepacia, and five strains of B. multivorans using B. thailandensis E264 as the query.



FIG. 6A-C. Multiple cross-species sequence alignment for the Burkholderia spp. polypeptides across five Burkholderia species: B. thailandensis E264, B. pseudomallei K96243, B. mallei NCTC10229, B. cenocepacia AU1054, and B. multivorans ATCC17616. (A) SEQ ID NO:1, (B) SEQ ID NO:2, and (C) SEQ ID NO:9. (* indicates identity).



FIG. 7. Antibody titers to individual recombinant Burkholderia iron-regulated polypeptides in vaccinated mice.





DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

In one aspect, this disclosure provides polypeptides and compositions including polypeptides. As used herein, “polypeptide” refers to a polymer of amino acids linked by peptide bonds. Thus, for example, the terms peptide, oligopeptide, protein, and enzyme are included within the definition of polypeptide. This term also includes polypeptides that may include one or more post-expression modifications of the polypeptide such as, for example, a glycosylation, an acetylation, a phosphorylation, and the like. The term polypeptide does not connote a specific length of a polymer of amino acids. A polypeptide may be isolatable directly from a natural source or can be prepared with the aid of recombinant, enzymatic, or chemical techniques. In the case of a polypeptide that is naturally occurring, such a polypeptide is typically isolated.


An “isolated” polypeptide is one that has been removed from its natural environment. For instance, an isolated polypeptide is a polypeptide that has been removed from the cytoplasm or from the membrane of a cell, and many of the polypeptides, nucleic acids, and other cellular material of its natural environment are no longer present.


A polypeptide characterized as “isolatable” from a particular source is a polypeptide that, under appropriate conditions, is produced by the identified source, although the polypeptide may be obtained from alternate sources using, for example, conventional recombinant, chemical, or enzymatic techniques. Thus, characterizing a polypeptide as “isolatable” from a particular source does not imply any specific source from which the polypeptide must be obtained or any particular conditions or processes under which the polypeptide must be obtained.


A “purified” polypeptide is one that is at least 60% free, preferably at least 75% free, and most preferably at least 90% free from other components with which they are naturally associated. Polypeptides that are produced outside the organism in which they naturally occur, e.g., through chemical or recombinant means, are considered to be isolated and purified by definition, since they were never present in a natural environment.


As used herein, a “polypeptide fragment” refers to a portion of a polypeptide that results from digestion of a polypeptide with a protease.


Unless otherwise specified, “a,” “an,” “the,” and “at least one” are used interchangeably and mean one or more than one. The terms “comprises” and variations thereof do not have a limiting meaning where these terms appear in the description and claims. Also, the term “and/or” means one or all of the listed elements or a combination of any two or more of the listed elements.


Generally, a polypeptide may be characterized by molecular weight, mass fingerprint, amino acid sequence, nucleic acid that encodes the polypeptide, immunological activity, or any combination of two or more such characteristics. The molecular weight of a polypeptide, typically expressed in kilodaltons (kDa), can be determined using routine methods including, for instance, gel filtration, gel electrophoresis including sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis (PAGE), capillary electrophoresis, mass spectrometry, liquid chromatography (including HPLC), and calculating the molecular weight from an observed or predicted amino acid sequence. Unless indicated otherwise, molecular weight refers to molecular weight as determined by resolving a polypeptide using an SDS polyacrylamide gel having a stacking gel of about 4% and a resolving gel of about 10% under reducing and denaturing conditions.


As used herein, a “mass fingerprint” refers to a population of polypeptide fragments obtained from a polypeptide after digestion with a protease. Often, a mass fingerprint can be generated by digesting a polypeptide with trypsin. In principle, however, a mass fingerprint may be generated by digesting the polypeptide with any suitable protease. Typically, the polypeptide fragments resulting from a digestion are analyzed using a mass spectrometric method. Each polypeptide fragment is characterized by a mass, or by a mass (m) to charge (z) ratio, which is referred to as an “m/z ratio” or an “m/z value.” Methods for generating a mass fingerprint of a polypeptide are routine. An example of such a method is disclosed in Example 3.


The polypeptides described herein may be metal-regulated. As used herein, a “metal-regulated polypeptide” is a polypeptide that is expressed by a microbe at a greater level when the microbe is grown in low metal conditions compared to when the same microbe is grown in high metal conditions. Low metal and high metal conditions are described herein. For instance, certain metal-regulated polypeptides produced by Burkholderia spp. are not expressed at detectable levels during growth of the microbe in high metal conditions but are expressed at detectable levels during growth in low metal conditions.


Examples of metal-regulated polypeptides isolatable from B. thailandensis after growth in low iron conditions include metal-regulated polypeptides having molecular weights of 88 kDa, 84 kDa, 83 kDa, 81 kDa, 58 kDa, 56 kDa, 55 kDa, 44 kDa, 43 kDa, 42 kDa, 27 kDa, 24 kDa, or 19 kDa.


Additional examples of metal-regulated polypeptides include recombinantly-produced versions of polypeptides described herein. A recombinantly-produced polypeptide may include the entire amino acid sequence translatable from an mRNA transcript. Alternatively, a recombinantly-produced metal-regulated polypeptide can include a fragment or portion of the entire translatable amino acid sequence. For example, a recombinantly-produced metal-regulated polypeptide may lack a cleavable sequence at either terminal of the polypeptide—e.g., a cleavable signal sequence at the amino terminal of the polypeptide.


Thus, a metal-regulated polypeptide can be a polypeptide that includes the amino acid sequence depicted in, for example, SEQ ID NO:6, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:1, SEQ ID NO:7, SEQ ID NO:16, SEQ ID NO:22, SEQ ID NO:2, SEQ ID NO:13, SEQ ID NO:25, SEQ ID NO:21, SEQ ID NO:9, SEQ ID NO:11, or SEQ ID NO:10.


This disclosure also describes certain polypeptides that are not metal-regulated. Such polypeptides are expressed in the presence of a metal ion such as, for example, in the presence of ferric chloride, and also expressed when grown in low iron conditions. Examples of such polypeptides isolatable from B. thailandensis have molecular weights of 55 kDa, 40 kDa, 39 kDa, or 19 kDa.


Additional examples of polypeptides that are not metal-regulated include recombinantly-produced versions of polypeptides described herein. A recombinantly-produced polypeptide may include the entire amino acid sequence translatable from an mRNA transcript. Alternatively, a recombinantly-produced non-metal-regulated polypeptide can include a fragment or portion of the entire translatable amino acid sequence. For example, a recombinantly-produced non-metal-regulated polypeptide may lack a cleavable sequence at either terminal of the polypeptide—e.g., a cleavable signal sequence at the amino terminal of the polypeptide.


Thus, a polypeptide that is not metal-regulated can be a polypeptide that includes the amino acid sequence depicted in, for example, SEQ ID NO:15, SEQ ID NO:12, SEQ ID NO:14, and SEQ ID NO:18.


This disclosure also describes certain polypeptides whose metal-regulation has not been established with certainty. Such polypeptides may have been detected in the extract when grown in low iron or in the presence of iron. Alternatively, they may have been identified through bioinformatics analysis using structural similarity to known iron receptor proteins or functional roles in iron acquisition or storage. Examples of such polypeptides isolatable from B. thailandensis have molecular weights of 85 kDa, 81 kDa, 78 kDa, 55 kDa, 36 kDa, or 8.5 kDa.


Additional examples of polypeptides whose metal regulation is uncertain include recombinantly-produced versions of polypeptides described herein. A recombinantly-produced polypeptide may include the entire amino acid sequence translatable from an mRNA transcript. Alternatively, a recombinantly-produced polypeptide whose metal regulation is uncertain can include a fragment or portion of the entire translatable amino acid sequence. For example, a recombinantly-produced polypeptide whose metal regulation is uncertain may lack a cleavable sequence at either terminal of the polypeptide—e.g., a cleavable signal sequence at the amino terminal of the polypeptide.


Thus, a polypeptide whose metal regulation is uncertain can be a polypeptide that includes the amino acid sequence depicted in, for example, SEQ ID NO:23, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:17, SEQ ID NO:20, and SEQ ID NO:19.


Whether a polypeptide is a metal-regulated polypeptide or a non-metal-regulated polypeptide can be determined by methods useful for comparing the presence of polypeptides, including, for example, gel filtration, gel electrophoresis including sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), capillary electrophoresis, mass spectrometry, isobaric tags for relative and absolute quantification (iTRAQ), and liquid chromatography including HPLC. Separate cultures of a microbe can be grown under high metal conditions and under low metal conditions, polypeptides may be isolated as described herein, and the polypeptides present in each culture can be resolved and compared. Typically, an equal amount of polypeptides from each culture is used. Preferably, the polypeptides can be resolved using an SDS polyacrylamide gel having a stacking gel of about 4% and a resolving gel of about 10% under reducing and denaturing conditions. For instance, 30 micrograms (μg) of total polypeptide from each culture may be used and loaded into wells of a gel. After running the gel and staining the polypeptides with Coomassie Brilliant Blue, the two lanes can be compared. When determining whether a polypeptide is or is not expressed at a detectable level, 30 μg of total polypeptide from a culture is resolved on an SDS-PAGE gel and stained with Coomassie Brilliant Blue using methods known in the art. A polypeptide that can be visualized by eye is considered to be expressed at a detectable level, while a polypeptide that cannot be visualized by eye is considered to not be expressed at a detectable level.


Alternatively, whether a polypeptide is a metal-regulated polypeptide or a non-metal-regulated polypeptide can be determined using microarray-based gene expression analysis. Separate cultures of a microbe can be grown under high metal conditions and under low metal conditions, RNA can be extracted from cells of each culture, and differences in RNA expression in cells grown in high metal conditions versus RNA expression in cells grown in low metal conditions can be detected and compared. For example, labeled cDNA can be prepared from 8-10 μg of bacterial RNA using established protocols. The labeled cDNA can be applied to a microarray of the Burkholderia spp. genome. Such microarrays are commercially available and gene expression using such arrays is routine.


The polypeptides described herein may have immunological activity. “Immunological activity” refers to the ability of a polypeptide to elicit an immunological response in an animal. An immunological response to a polypeptide is the development in an animal of a cellular and/or antibody-mediated immune response to the polypeptide. Usually, an immunological response includes but is not limited to one or more of the following effects: the production of antibodies, B cells, helper T cells, suppressor T cells, and/or cytotoxic T cells, directed to an epitope or epitopes of the polypeptide. “Epitope” refers to the site on an antigen to which specific B cells and/or T cells respond so that antibody is produced. The immunological activity may be protective. “Protective immunological activity” refers to the ability of a polypeptide to elicit an immunological response in an animal that inhibits or limits infection by Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. Whether a polypeptide has protective immunological activity can be determined by methods known in the art such as, for example, methods described in Example 6. For example, a polypeptide, or combination of polypeptides, can protect a rodent such as a mouse against challenge with a Burkholderia spp. A polypeptide may have seroactive activity. As used herein, “seroactive activity” refers to the ability of a candidate polypeptide to react with antibody present in convalescent serum from an animal infected with a Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. In some aspects, the convalescent serum may be from an animal infected with B. thailandensis E264, B. pseudomallei K96243, B. mallei NCTC10229, B. cenocepacia AU1054, or B. multivorans ATCC17616.


A polypeptide may have immunoregulatory activity. As used herein, “immunoregulatory activity” refers to the ability of a polypeptide to act in a nonspecific manner to enhance an immune response to a particular antigen. Methods for determining whether a polypeptide has immunoregulatory activity are known in the art.


A polypeptide as described herein may have the characteristics of a polypeptide expressed by a reference microbe—i.e., a reference polypeptide. The characteristics can include, for example, molecular weight, mass fingerprint, amino acid sequence, or any combination thereof. The reference microbe can be a gram negative, preferably a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. Exemplary strains of Burkholderia spp. and representative strains are listed in FIG. 5.


When the reference microbe is B. thailandensis E264, a candidate polypeptide can be considered to be a polypeptide as described herein if it has a molecular weight of 88 kDa, 84 kDa, 83 kDa, 81 kDa, 58 kDa, 56 kDa, 55 kDa, 44 kDa, 43 kDa, 42 kDa, 27 kDa, 24 kDa, or 19 kDa and has a mass fingerprint that is similar to the mass fingerprint of a metal-regulated polypeptide expressed by a reference microbe and having a molecular weight of—88 kDa, 84 kDa, 83 kDa, 81 kDa, 58 kDa, 56 kDa, 55 kDa, 44 kDa, 43 kDa, 42 kDa, 27 kDa, 24 kDa, or 19 kDa, respectively. Preferably, such polypeptides are metal-regulated. For instance, a candidate polypeptide can be a polypeptide as described herein if it has a molecular weight of 88 kDa and has a mass fingerprint similar to the mass fingerprint of an 88 kDa metal-regulated polypeptide produced by the reference strain B. thailandensis E264.


Alternatively, when the reference microbe is B. thailandensis E264, a candidate polypeptide can be considered to be a polypeptide as described herein if it has an amino acid sequence that is structurally similar, as described in detail below, to the amino acid sequence of SEQ ID NO:6, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:1, SEQ ID NO:7, SEQ ID NO:16, SEQ ID NO:22, SEQ ID NO:2, SEQ ID NO:13, SEQ ID NO:25, SEQ ID NO:21, SEQ ID NO:9, SEQ ID NO:11, or SEQ ID NO:10.


When the reference microbe is B. thailandensis E264, a candidate polypeptide can be considered to be a polypeptide as described herein if it has a molecular weight of 55 kDa, 40 kDa, 39 kDa, or 19 kDa and has a mass fingerprint that is similar to the mass fingerprint of a non-metal-regulated polypeptide expressed by a reference microbe and having a molecular weight of 55 kDa, 40 kDa, 39 kDa, or 19 kDa, respectively. Preferably, such polypeptides are non-metal-regulated. For instance, a candidate polypeptide can be a polypeptide as described herein if it has a molecular weight of 55 kDa and has a mass fingerprint similar to the mass fingerprint of a 55 kDa non-metal-regulated polypeptide produced by the reference strain B. thailandensis E264.


Alternatively, when the reference microbe is B. thailandensis E264, a candidate polypeptide can be considered to be a polypeptide as described herein if it has an amino acid sequence that is structurally similar, as described in detail below, to the amino acid sequence of SEQ ID NO:15, SEQ ID NO:12, SEQ ID NO:14, or SEQ ID NO:18.


When the reference microbe is B. thailandensis E264, a candidate polypeptide can be considered to be a polypeptide as described herein if it has a molecular weight of 85 kDa, 81 kDa, 78 kDa, 55 kDa, 36 kDa, or 8.5 kDa, respectively. Preferably, such polypeptides may or may not be metal-regulated. For instance, a candidate polypeptide can be a polypeptide as described herein if it has a molecular weight of 85 kDa and has a mass fingerprint similar to the mass fingerprint of an 85 kDa polypeptide produced by the reference strain B. thailandensis E264.


Alternatively, when the reference microbe is B. thailandensis E264, a candidate polypeptide can be considered to be a polypeptide as described herein if it has an amino acid sequence that is structurally similar, as described in detail below, to the amino acid sequence of SEQ ID NO:23, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:17, SEQ ID NO:20, or SEQ ID NO:19.


As used herein, a polypeptide may be “structurally similar” to a reference polypeptide if the amino acid sequence of the polypeptide possesses a specified amount of sequence similarity and/or sequence identity compared to the reference polypeptide. A polypeptide also may be “structurally similar” to a reference polypeptide if the polypeptide exhibits a mass fingerprint possessing a specified amount of identity compared to a comparable mass fingerprint of the reference polypeptide. Thus, a polypeptide may be “structurally similar” to a reference polypeptide if, compared to the reference polypeptide, it possesses a sufficient level of amino acid sequence identity, amino acid sequence similarity, mass fingerprint similarity, or any combination thereof.


Polypeptide Sequence Similarity and Polypeptide Sequence Identity

Structural similarity of two polypeptides can be determined by aligning the residues of the two polypeptides (for example, a candidate polypeptide and any appropriate reference polypeptide described herein) to optimize the number of identical amino acids along the lengths of their sequences; gaps in either or both sequences are permitted in making the alignment in order to optimize the number of identical amino acids, although the amino acids in each sequence must nonetheless remain in their proper order. A reference polypeptide may be a polypeptide described herein or any known metal-regulated polypeptide, as appropriate. A candidate polypeptide is the polypeptide being compared to the reference polypeptide. A candidate polypeptide can be isolated, for example, from a microbe, or can be produced using recombinant techniques, or chemically or enzymatically synthesized.


Unless modified as otherwise described herein, a pair-wise comparison analysis of amino acid sequences can be carried out using the BESTFIT algorithm in the GCG package (version 10.2, Madison Wis.). Alternatively, polypeptides may be compared using the Blastp program of the BLAST 2 search algorithm, as described by Tatiana et al. (FEMS Microbiol Lett, 174:247-250 (1999)), and available on the National Center for Biotechnology Information (NCBI) website. The default values for all BLAST 2 search parameters may be used, including matrix=BLOSUM62; open gap penalty=11, extension gap penalty=1, gap x_dropoff=50, expect=10, wordsize=3, and filter on.


In the comparison of two amino acid sequences, structural similarity may be referred to by percent “identity” or may be referred to by percent “similarity.” “Identity” refers to the presence of identical amino acids. “Similarity” refers to the presence of not only identical amino acids but also the presence of conservative substitutions. A conservative substitution for an amino acid in a polypeptide may be selected from other members of the class to which the amino acid belongs. For example, it is well-known in the art of protein biochemistry that an amino acid belonging to a grouping of amino acids having a particular size or characteristic (such as charge, hydrophobicity, or hydrophilicity) can be substituted for another amino acid without altering the activity of a protein, particularly in regions of the protein that are not directly associated with biological activity. For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and tyrosine. Polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine and glutamine. The positively charged (basic) amino acids include arginine, lysine and histidine. The negatively charged (acidic) amino acids include aspartic acid and glutamic acid. Conservative substitutions include, for example, Lys for Arg and vice versa to maintain a positive charge; Glu for Asp and vice versa to maintain a negative charge; Ser for Thr so that a free —OH is maintained; and Gln for Asn to maintain a free —NH2. Likewise, biologically active analogs of a polypeptide containing deletions or additions of one or more contiguous or noncontiguous amino acids that do not eliminate a functional activity—such as, for example, immunological activity—of the polypeptide are also contemplated.


Thus, as used herein, reference to a polypeptide as described herein and/or reference to the amino acid sequence of one or more SEQ ID NOs can include a polypeptide with at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% amino acid sequence similarity to the reference amino acid sequence.


Alternatively, as used herein, reference to a polypeptide as described herein and/or reference to the amino acid sequence of one or more SEQ ID NOs can include a polypeptide with at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% amino acid sequence identity to the reference amino acid sequence.



FIG. 6 shows cross-species sequence alignment for polypeptides having the amino acid sequences shown in SEQ ID NO:1, SEQ ID NO:2, and SEQ ID NO:9. The alignment indicates amino acids that are conserved in the variants of each polypeptide across five Burkholderia species. The alignment also shows regions of variability in the variants of each polypeptide across five Burkholderia species. A person of ordinary skill in the art can deduce from such data regions of the polypeptide in which substitutions, particularly conservative substitutions, may be permitted without unduly affecting biological activity of the modified polypeptide.


Consequently, a polypeptide as described herein can include certain variants including, for example, homologous polypeptides that originate-biologically and/or recombinantly—from microbial species or strains other than the microbial species or strain from which the polypeptide was originally isolated and/or identified.


For example, a polypeptide as described herein can include a polypeptide commonly known as a TonB-dependent siderophore receptor. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:1. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:51, SEQ ID NO:101, SEQ ID NO:141, and SEQ ID NO:189.


For example, a polypeptide as described herein can include a polypeptide commonly known as resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:2. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:52, SEQ ID NO:102, SEQ ID NO:142, and SEQ ID NO:190.


For example, a polypeptide as described herein can include a polypeptide commonly known as outer membrane ferric siderophore receptor. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:3. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:53, SEQ ID NO:143, and SEQ ID NO:191.


For example, a polypeptide as described herein can include a polypeptide commonly known as TonB-dependent heme/hemoglobin receptor family protein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:4. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:54, SEQ ID NO:103, SEQ ID NO:144, and SEQ ID NO:192.


For example, a polypeptide as described herein can include a polypeptide commonly known as Fe(III) pyochelin receptor. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:5. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:55 and SEQ ID NO:145.


For example, a polypeptide as described herein can include a polypeptide commonly known as TonB-dependent siderophore receptor family protein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:6. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:56, SEQ ID NO:104, and SEQ ID NO:146.


For example, a polypeptide as described herein can include a polypeptide commonly known as TonB-dependent copper receptor. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:7. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:57, SEQ ID NO:105, SEQ ID NO:147, and SEQ ID NO:193.


For example, a polypeptide as described herein can include a polypeptide commonly known as a TonB-dependent siderophore receptor. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:8. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:58, SEQ ID NO:106, SEQ ID NO:148, and SEQ ID NO:194.


For example, a polypeptide as described herein can include a polypeptide commonly known as an OmpA family protein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:9. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:59, SEQ ID NO:107, SEQ ID NO:149, and SEQ ID NO:195.


For example, a polypeptide as described herein can include a polypeptide commonly known as OmpA family outer membrane protein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:10. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:60, SEQ ID NO:108, SEQ ID NO:150, and SEQ ID NO:196.


For example, a polypeptide as described herein can include a polypeptide commonly known as OmpA family protein that differs from the OmpA family protein described above. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:11. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:61, SEQ ID NO:109, SEQ ID NO:151, and SEQ ID NO:197.


For example, a polypeptide as described herein can include a polypeptide commonly known as an outer membrane porin. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:12. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:62, SEQ ID NO:152, and SEQ ID NO:198.


For example, a polypeptide as described herein can include a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porin described above. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:13. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:63, SEQ ID NO:110, SEQ ID NO:153, and SEQ ID NO:199.


For example, a polypeptide as described herein can include a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porins described above. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:14. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:64, SEQ ID NO:111, SEQ ID NO:154, and SEQ ID NO:200.


For example, a polypeptide as described herein can include a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porins described above. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:15. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:65, SEQ ID NO:112, SEQ ID NO:155, and SEQ ID NO:201.


For example, a polypeptide as described herein can include a polypeptide commonly known as a resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:16. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:66, SEQ ID NO:156, and SEQ ID NO:202.


For example, a polypeptide as described herein can include a polypeptide commonly known as a resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:17. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:67, SEQ ID NO:113, SEQ ID NO:157, and SEQ ID NO:203.


For example, a polypeptide as described herein can include a polypeptide commonly known as bacterioferritin (Bfr). One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:18. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:68, SEQ ID NO:114, SEQ ID NO:158, and SEQ ID NO:204.


For example, a polypeptide as described herein can include a polypeptide commonly known as bacterioferritin-associated ferredoxin (Bfd). One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:19. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:69, SEQ ID NO:115, SEQ ID NO:159, and SEQ ID NO:205.


For example, a polypeptide as described herein can include a polypeptide commonly known as dipeptide ABC transporter permease. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:20. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:70, SEQ ID NO:116, SEQ ID NO:160, and SEQ ID NO:206.


For example, a polypeptide as described herein can include a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porins described above. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:21. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:71, SEQ ID NO:161, and SEQ ID NO:207.


For example, a polypeptide as described herein can include a polypeptide commonly known as a resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:22. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:72, SEQ ID NO:117, and SEQ ID NO:208.


For example, a polypeptide as described herein can include a polypeptide commonly known as a TonB-dependent receptor. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:23. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:73, SEQ ID NO:118, SEQ ID NO:162, and SEQ ID NO:209.


For example, a polypeptide as described herein can include a polypeptide commonly known as a TonB-dependent receptor. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:24. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:74, SEQ ID NO:119, SEQ ID NO:163, and SEQ ID NO:210.


For example, a polypeptide as described herein can include a polypeptide commonly known as resistance-nodulation-cell division (RND) superfamily efflux transporter MFP subunit. One embodiment of this polypeptide is reflected in the amino acid sequence of SEQ ID NO:25. Variant embodiments are reflected in the amino acid sequences of SEQ ID NO:75, SEQ ID NO:120, SEQ ID NO:164, and SEQ ID NO:211.


Table 1 summarizes identifying characteristics of reference polypeptides natively expressed by reference microbe B. thailandensis E264.
















TABLE 1





SEQ

Molecular
Present by
Present by
Iron-
Immuno-
Animal


ID NO:
GI Number
Weight (kDa)b
2D gel/MSc
iTRAQ
regulationd
genicitye
protectionf






















1
83718630
81
+
+
IR
+
+


2
83717404
55
+
+
IR
+
+


3
83716122
84
+
+
IR
+
+


4
83717803
83
ND
+
IR
+
+


5
83716062
81
ND
+
U
+
+


6
83720594
88
ND
+
IR
+
+


7
83717103
81
+
+
IR
+
+


8
83717123
81
ND
ND
U
+
ND


9
83720098
27
+
+
IR
NT
+


10
83719210
19
ND
+
IR
NT
+


11
83720431
24
+
+
IR
NT
+


12
83719322
40
+
+
NIR
NT
+


13
83716488
44
+
+
IR
NT
+


14
83718334
39
+
+
NIR
NT
+


15
83716254
55
+
+
NIR
NT
+


16
83720933
58
+
+
IR
NT
+


17
83718989
55
+
+
U
NT
+


18
83717287
19
ND
+
NIR
NT
+


19
83718020
8.5
ND
ND
U
NT
ND


20
83718178
36
ND
+
U
NT
+


21
83718226
42
+
+
IR
NT
+


22
83719074
56
+
+
IR
NT
+


23
83718695
85
ND
ND
U
NT
ND


24
83717289
78
ND
ND
U
NT
ND


25
83720836
43
+
ND
IR
NT
+






aID#, unique number assigned to each polypeptide for tracking purposes.




bMolecular weight (predicted).




cA protein is considered to be present (+) under iron restriction if detected by 2D gel/mass spectroscopy or in iTRAQ. ND, not detected.




dIron regulation was determined using criteria described in Example 5. IR, iron-regulated; NIR, non-iron regulated; U, iron regulation uncertain.




eImmunogenicity of a single protein was determined by antibody and/or T cell recall responses to a purified recombinant protein in mice immunized with either the iron-restricted extract or a mix of recombinant proteins.



NT, not tested.



fprotection is considered positive (+) if the protein was determined to be present in the immunizing extract for experiments shown in FIGS. 3 and 4, either by mass spectroscopy, in trial 3 by iTRAQ, or through production of antibodies to recombinant proteins from mice immunized with the protective extract vaccine.







A polypeptide as described herein also can be designed to provide one or more additional sequences such as, for example, the addition of coding sequences for added C-terminal and/or N-terminal amino acids that may facilitate purification by trapping on columns or use of antibodies. Such tags include, for example, histidine-rich tags that allow purification of polypeptides on nickel columns. Such gene modification techniques and suitable additional sequences are well known in the molecular biology arts.


A polypeptide as described herein also may be designed so that certain amino acids at the C-terminal and/or N-terminal are deleted. For example, one difference between the amino acid sequences of SEQ ID NO:1 and SEQ ID NO:51 is that SEQ ID NO:51 possesses an N-terminal 17 amino acid addition that is not present in the reference polypeptide amino acid sequence of SEQ ID NO:1. Similar exemplary N-terminal additions, typically varying from about five amino acids to about 50 amino acids, are apparent when one compares, for example, the reference amino acid sequence of SEQ ID NO:101, SEQ ID NO:141, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:59, or SEQ ID NO:107 with certain variant embodiments of the respective amino acid sequence. Other amino acids additions and/or deletions, at either the N-terminal or the C-terminal, are possible.


A “modification” of a polypeptide as described herein includes a polypeptide (or an analog thereof such as, e.g., a fragment thereof) that is chemically or enzymatically derivatized at one or more constituent amino acids. Such a modification can include, for example, a side chain modification, a backbone modification, an N-terminal modification, and/or a C-terminal modification such as, for example, acetylation, hydroxylation, methylation, amidation, and the attachment of a carbohydrate and/or lipid moiety, a cofactor, and the like, and combinations thereof. Modified polypeptides as described herein may retain the biological activity—such as, for example, immunological activity—of the unmodified polypeptide or may exhibit a reduced or increased biological activity compared to the unmodified polypeptide.


A polypeptide as described herein (including a biologically active analog thereof and/or a modification thereof) can include a native (naturally occurring), a recombinant, a chemically synthesized, or an enzymatically synthesized polypeptide. For example, a polypeptide as described herein may be prepared by isolating the polypeptide from a natural source or may be prepared recombinantly by conventional methods including, for example, preparation as fusion proteins in bacteria or other host cells.


A polypeptide expressed by a reference microbe can be obtained by growing the reference microbe under low metal conditions as described herein and the subsequent isolation of a polypeptide by the processes disclosed herein. Alternatively, a polypeptide expressed by a reference microbe can be obtained by identifying coding regions expressed at higher levels when the microbe is grown in low metal conditions—i.e., metal-regulated. A metal-regulated coding region can be cloned and expressed, and the expressed metal-regulated polypeptide may be identified by the processes described herein. A candidate polypeptide can be isolatable from a microbe or identified from a microbe, preferably a gram negative microbe, more preferably, a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. A candidate polypeptide may also be produced using enzymatic or chemical techniques.


Mass Fingerprint Similarity

A candidate polypeptide may be evaluated by mass spectrometric analysis to determine whether the candidate polypeptide has a mass fingerprint similar to one of the polypeptides expressed by a reference microbe and referred to above by molecular weight. Typically, the candidate polypeptide can be isolated, for instance by resolving the candidate polypeptide by gel electrophoresis and excising the portion of the gel containing the candidate polypeptide. Any gel electrophoresis method that separates polypeptides based on differing characteristics can be used, including one-dimensional or two-dimensional gel electrophoresis, as well as liquid chromatographic separation based on, for instance, hydrophobicity, pI, or size. The candidate polypeptide can be fragmented, for instance by digestion with a protease. Preferably, the protease can cleave the peptide bond on the carboxy-terminal side of the amino acid lysine and the amino acid arginine, except when the amino acid following the lysine or the arginine is a proline. An example of such a protease is trypsin. Methods for digesting a polypeptide with trypsin are routine and known in the art. An example of such a method is disclosed in Example 3.


Methods for the mass spectrometric analysis of polypeptides are routine and known in the art and include, but are not limited to, nano high-pressure liquid chromatography electrospray tandem mass spectrometry (nanoLC-EDI-MS/MS). Often, when a candidate polypeptide is analyzed by mass spectroscopy, both the candidate polypeptide and the reference polypeptide—i.e., the polypeptide from the reference microbe—are prepared and analyzed together, thereby decreasing any potential artifacts resulting from differences in sample handling and running conditions. Preferably, all reagents used to prepare and analyze the two polypeptides are the same. For instance, the polypeptide from the reference microbe and the candidate polypeptide are isolated under substantially the same conditions, fragmented under substantially the same conditions, and analyzed by nanoLC-EDI-MS/MS on the same machine under substantially the same conditions. A candidate polypeptide may be considered to be “structurally similar” to a reference polypeptide if it exhibits a mass fingerprint possessing at least 80%, at least 90%, at least 95%, or substantially all of the m/z values present in the spectrum of the reference microbe polypeptide and above the background level of noise are also present in the spectrum of the candidate polypeptide. (See, e.g., United States Patent Application Publication No. 2006/0233824 A1).


In another aspect, a polypeptide can be considered to be a polypeptide as described herein if it has a molecular weight of a reference polypeptide described herein and has a mass fingerprint that includes a subpopulation including at least a specified percentage of the polypeptide fragments of the mass fingerprint of the reference polypeptide.


The mass fingerprint of a candidate polypeptide can be determined by a mass spectrometric method, for instance by nanoLC-EDI-MS/MS. The mass fingerprint of a candidate polypeptide will generally have additional polypeptide fragments and, therefore, can have additional m/z values other than those identified in any particular analysis. When the candidate polypeptide is being compared to a polypeptide as described herein, the candidate polypeptide can be isolatable from a microbe, preferably a gram negative microbe, more preferably, a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans.


Metal-regulated candidate polypeptides can be obtained by growing a microbe under low metal conditions and subsequently isolating a polypeptide by the processes described herein. Non-metal-regulated candidate polypeptides can be obtained by growing a microbe under low metal conditions or high metal conditions and subsequently isolating a polypeptide by the processes described herein. Alternatively, a candidate polypeptide can be obtained by recombinant expression of a polynucleotide that encodes the candidate polypeptide.


Polynucleotide Sequence Similarity and Polynucleotide Sequence Identity

Polypeptides as described herein also may be identified in terms of the polynucleotide that encodes the polypeptide. Thus, this disclosure provides polynucleotides that encode a polypeptide as described herein or hybridize, under standard hybridization conditions, to a polynucleotide that encodes a polypeptide as described herein, and the complements of such polynucleotide sequences.


As used herein, reference to a polynucleotide as described herein and/or reference to the nucleic acid sequence of one or more SEQ ID NOs can include polynucleotides having a sequence identity of at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to an identified reference polynucleotide sequence.


In this context, “sequence identity” refers to the identity between two polynucleotide sequences. Sequence identity is generally determined by aligning the bases of the two polynucleotides (for example, aligning the nucleotide sequence of the candidate sequence and a nucleotide sequence that includes, for example, the nucleotide sequence of SEQ ID NO:26 or SEQ ID NO:27) to optimize the number of identical nucleotides along the lengths of their sequences; gaps in either or both sequences are permitted in making the alignment in order to optimize the number of shared nucleotides, although the nucleotides in each sequence must nonetheless remain in their proper order. A candidate sequence is the sequence being compared to a known sequence—e.g., a nucleotide sequence that includes the nucleotide sequence of, for example, SEQ ID NO:76 or SEQ ID NO:77. For example, two polynucleotide sequences can be compared using the Blastn program of the BLAST 2 search algorithm, as described by Tatiana et al., (FEMS Microbiol Lett., 174:247-250 (1999)), and available on the world wide web at ncbi.nlm.nih.gov/BLAST/. The default values for all BLAST 2 search parameters may be used, including reward for match=1, penalty for mismatch=−2, open gap penalty=5, extension gap penalty=2, gap x_dropoff=50, expect=10, wordsize=11, and filter on.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as a TonB-dependent siderophore receptor. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:26. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:76, SEQ ID NO:121, SEQ ID NO:165, and SEQ ID NO:212.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:27. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:77, SEQ ID NO:122, SEQ ID NO:166, and SEQ ID NO:213.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as outer membrane ferric siderophore receptor. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:28. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:78, SEQ ID NO:167, and SEQ ID NO:214.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as TonB-dependent heme/hemoglobin receptor family protein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:29. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:79, SEQ ID NO:123, SEQ ID NO:168, and SEQ ID NO:215.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as Fe(III) pyochelin receptor. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:30. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:80 and SEQ ID NO:169.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as TonB-dependent siderophore receptor family protein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:31. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:81, SEQ ID NO:124, and SEQ ID NO:170.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as TonB-dependent copper receptor. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:32. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:82, SEQ ID NO:125, SEQ ID NO:171, and SEQ ID NO:216.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as a TonB-dependent siderophore receptor. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:33. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:83, SEQ ID NO:126, SEQ ID NO:172, and SEQ ID NO:217.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as an OmpA family protein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:34. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:84, SEQ ID NO:127, SEQ ID NO:173, and SEQ ID NO:218.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as OmpA family outer membrane protein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:35. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:85, SEQ ID NO:128, SEQ ID NO:174, and SEQ ID NO:219.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as OmpA family protein that differs from the OmpA family protein described above. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:36. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:86, SEQ ID NO:129, SEQ ID NO:175, and SEQ ID NO:220.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as an outer membrane porin. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:37. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:87, SEQ ID NO:176, and SEQ ID NO:221.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porin described above. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:38. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:88, SEQ ID NO:130, SEQ ID NO:177, and SEQ ID NO:222.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porins described above. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:39. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:89, SEQ ID NO:131, SEQ ID NO:178, and SEQ ID NO:223.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porins described above. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:40. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:90, SEQ ID NO:132, SEQ ID NO:179, and SEQ ID NO:224.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as a resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:41. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:91, SEQ ID NO:180, and SEQ ID NO:225.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as a resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:42. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:92, SEQ ID NO:133, SEQ ID NO:181, and SEQ ID NO:226.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as bacterioferritin (Bfr). One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:43. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:93, SEQ ID NO:134, SEQ ID NO:182, and SEQ ID NO:227.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as bacterioferritin-associated ferredoxin (Bfd). One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:44. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:94, SEQ ID NO:135, SEQ ID NO:183, and SEQ ID NO:228.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as dipeptide ABC transporter permease. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:45. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:95, SEQ ID NO:136, SEQ ID NO:184, and SEQ ID NO:229.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as an outer membrane porin that differs from the outer membrane porins described above. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:46. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:96, SEQ ID NO:185, and SEQ ID NO:230.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as a resistance-nodulation-cell division (RND) superfamily efflux system outer membrane lipoprotein. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:47. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:97, SEQ ID NO:137, and SEQ ID NO:231.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as a TonB-dependent receptor. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:48. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:98, SEQ ID NO:138, SEQ ID NO:186, and SEQ ID NO:232.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as a TonB-dependent receptor. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:49. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:99, SEQ ID NO:139, SEQ ID NO:187, and SEQ ID NO:233.


For example, a polynucleotide as described herein can include a polynucleotide that encodes a polypeptide commonly known as resistance-nodulation-cell division (RND) superfamily efflux transporter MFP subunit. One embodiment of this polynucleotide is reflected in the polynucleotide sequence of SEQ ID NO:50. Variant embodiments are reflected in the polynucleotide sequences of SEQ ID NO:100, SEQ ID NO:140, SEQ ID NO:188, and SEQ ID NO:234.


Finally, a polynucleotide as described herein can include any polynucleotide that encodes a polypeptide as described herein. Thus, the nucleotide sequence of the polynucleotide may be deduced from the amino acid sequence that is to be encoded by the polynucleotide.


This disclosure also provides whole cell preparations of a microbe, where the microbe expresses one or more of the polypeptides described herein. The cells present in a whole cell preparation may be inactivated such that the cells cannot replicate but the immunological activity of the polypeptides as described herein expressed by the microbe is maintained. Typically, the cells may be killed by exposure to agents such as glutaraldehyde, formalin, or formaldehyde.


Compositions

A composition as described herein may include at least one isolated polypeptide described herein, or a number of polypeptides that is an integer greater than one (e.g., at least two, at least three, at least four). Unless a specific level of sequence similarity and/or identity is expressly indicated herein (e.g., at least 80% sequence similarity, at least 90% sequence identity, etc.), reference to the amino acid sequence of an identified SEQ ID NO includes variants having the levels of sequence similarity and/or the levels of sequence identity described herein in the section headed “Polypeptide sequence similarity and polypeptide sequence identity.


A recombinantly-produced polypeptide may be expressed from a vector that permits expression of the polypeptide when the vector is introduced into an appropriate host cell. A host cell may be constructed to produce one or more recombinantly-produced polypeptides as described herein and, therefore, can include one more vectors that include at least one polynucleotide that encodes a polypeptide as described herein. Thus, each vector can include one or more polynucleotides as described herein—i.e., a polynucleotide that encodes a polypeptide as described herein.


Certain compositions such as, for example, those including recombinantly-produced polypeptides, can include a maximum number of polypeptides. In some embodiments, the maximum number of polypeptides can refer to the maximum total number of polypeptides. Certain compositions can include, for example, no more than 50 polypeptides such as, for example, no more than 40 polypeptides, no more than 30 polypeptides, no more than 25 polypeptides, no more than 20 polypeptides, no more than 15 polypeptides, no more than 10 polypeptides, no more than eight polypeptides, no more than seven polypeptides, no more than six polypeptides, no more than five polypeptides, no more than four polypeptides, no more than three polypeptides, no more than two polypeptides, or no more than one polypeptide. In other embodiments, a maximum number of recombinantly-produced polypeptides may be specified in a similar manner. In still other embodiments, a maximum number of nonrecombinantly-produced polypeptides may be specified in a similar manner.


A composition can include polypeptides isolatable from one microbe, or can be isolatable from a combination of two or more microbes. For instance, a composition can include polypeptides isolatable from two or more Burkholderia spp., or from a Burkholderia spp. and a different microbe that is not a member of the genus Burkholderia. In certain embodiments, a composition can include a whole cell preparation in which the whole cell expresses one or more of the polypeptides as described herein. In some of these embodiments, the whole cell can be a Burkholderia spp. In some embodiments, a composition can include whole cell preparations from two, three, four, five, or six strains.


Optionally, a polypeptide as described herein can be covalently bound or conjugated to a carrier polypeptide to improve the immunological properties of the polypeptide. Useful carrier polypeptides are known in the art. The chemical coupling of polypeptides as described herein can be carried out using known and routine methods. For instance, various homobifunctional and/or heterobifunctional cross-linker reagents such as bis(sulfosuccinimidyl) suberate, bis(diazobenzidine), dimethyl adipimidate, dimethyl pimelimidate, dimethyl superimidate, disuccinimidyl suberate, glutaraldehyde, m-maleimidobenzoyl-N-hydroxysuccinimide, sulfo-m-maleimidobenzoyl-N-hydroxysuccinimide, sulfosuccinimidyl 4-(N-maleimidomethyl) cycloheane-1-carboxylate, sulfosuccinimidyl 4-(p-maleimido-phenyl) butyrate and (1-ethyl-3-(dimethyl-aminopropyl) carbodiimide can be used (see, for instance, Harlow and Lane, Antibodies, A Laboratory Manual, generally and Chapter 5, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, N.Y. (1988)).


The compositions as described herein optionally further include a pharmaceutically acceptable carrier. “Pharmaceutically acceptable” refers to a diluent, carrier, excipient, salt, etc., that is compatible with the other ingredients of the composition, and not deleterious to the recipient thereof. Typically, the composition includes a pharmaceutically acceptable carrier when the composition is used as described herein. Exemplary pharmaceutically acceptable carriers include buffer solutions and generally exclude blood products such as, for example, whole blood and/or plasma. The compositions as described herein may be formulated in pharmaceutical preparations in a variety of forms adapted to the chosen route of administration, including routes suitable for stimulating an immune response to an antigen. Thus, a composition as described herein can be administered via known routes including, for example, oral; parenteral including intradermal, transcutaneous and subcutaneous; intramuscular, intravenous, intraperitoneal, etc. and topically, such as, intranasal, intrapulmonary, intramammary, intravaginal, intrauterine, intradermal, transcutaneous and rectally, etc. It is foreseen that a composition can be administered to a mucosal surface, such as by administration to the nasal or respiratory mucosa (e.g., via a spray or aerosol), in order to stimulate mucosal immunity, such as production of secretory IgA antibodies, throughout the animal's body.


A composition as described herein can also be administered via a sustained or delayed release implant. Implants suitable for use according to the invention are known and include, for example, those disclosed in International Publication No. WO 2001/037810 and/or International Publication No. WO 1996/001620. Implants can be produced at sizes small enough to be administered by aerosol or spray. Implants also can include nanospheres and microspheres.


A composition as described herein may be administered in an amount sufficient to treat certain conditions as described herein. The amount of polypeptides or whole cells present in a composition as described herein can vary. For instance, the dosage of polypeptides can be between 0.01 micrograms (μg) and 300 mg, typically between 0.1 mg and 10 mg. When the composition is a whole cell preparation, the cells can be present at a concentration of, for instance, 102 bacteria/ml, 103 bacteria/ml, 104 bacteria/ml, 105 bacteria/ml, 106 bacteria/ml, 107 bacteria/ml, 108 bacteria/ml, or 109 bacteria/ml. For an injectable composition (e.g. subcutaneous, intramuscular, etc.) the polypeptides may be present in the composition in an amount such that the total volume of the composition administered is 0.5 ml to 5.0 ml, typically 1.0 to 2.0 ml. When the composition is a whole cell preparation, the cells are preferably present in the composition in an amount that the total volume of the composition administered is 0.5 ml to 5.0 ml, typically 1.0 to 2.0 ml. The amount administered may vary depending on various factors including, but not limited to, the specific polypeptides chosen, the weight, physical condition and age of the animal, and the route of administration. Thus, the absolute weight of the polypeptide included in a given unit dosage form can vary widely, and depends upon factors such as the species, age, weight and physical condition of the animal, as well as the method of administration. Such factors can be determined by one of skill in the art. Other examples of dosages suitable for the invention are disclosed in U.S. Pat. No. 6,027,736.


The formulations may be conveniently presented in unit dosage form and may be prepared by methods well known in the art of pharmacy. Methods of preparing a composition with a pharmaceutically acceptable carrier include the step of bringing the active compound (e.g., a polypeptide or whole cell as described herein) into association with a carrier that constitutes one or more accessory ingredients. In general, the formulations are prepared by uniformly and intimately bringing the active compound into association with a liquid carrier, a finely divided solid carrier, or both, and then, if necessary, shaping the product into the desired formulations.


A composition including a pharmaceutically acceptable carrier also can include an adjuvant. An “adjuvant” refers to an agent that can act in a nonspecific manner to enhance an immune response to a particular antigen, thus potentially reducing the quantity of antigen necessary in any given immunizing composition, and/or the frequency of injection necessary in order to generate an adequate immune response to the antigen of interest. Adjuvants may include, for example, IL-1, IL-2, emulsifiers, muramyl dipeptides, dimethyl dioctadecyl ammonium bromide (DDA), avridine, aluminum hydroxide, oils, saponins, alpha-tocopherol, polysaccharides, emulsified paraffins (including, for instance, those available from under the tradename EMULSIGEN from MVP Laboratories, Ralston, Nebr.), ISA-70, RIBI and other substances known in the art. It is expected that polypeptides as described herein will have immunoregulatory activity and that such polypeptides may be used as adjuvants that directly act as T cell and/or B cell activators or act on specific cell types that enhance the synthesis of various cytokines or activate intracellular signaling pathways. Such polypeptides are expected to augment the immune response to increase the protective index of the existing composition.


In another embodiment, a composition as described herein including a pharmaceutically acceptable carrier can include a biological response modifier, such as, for example, IL-2, IL-4 and/or IL-6, TNF, IFN-α, IFN-γ, and other cytokines that effect immune cells. An immunizing composition can also include other components known in the art such as an antibiotic, a preservative, an anti-oxidant, or a chelating agent.


Methods of Making

This disclosure also provides methods for obtaining the polypeptides described herein. The polypeptides and whole cells as described herein may be isolatable from a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. Other gram negative microbes from which polypeptides can be isolated include, for example, Achromobacter spp., Ralstonia spp., Pseudomonas spp., Bordetella spp., and Acinetobacter spp. Microbes useful for obtaining polypeptides as described herein and making whole cell preparations are commercially available from a depository such as American Type Culture Collection (ATCC). In addition, such microbes are readily obtainable by techniques routine and known to the art. The microbes may be derived from an infected animal as a field isolate, and used to obtain the polypeptides and/or the whole cell preparations as described herein, or stored for future use, for example, in a frozen repository at from −20° C. to −95° C., or from −40° C. to −50° C., in bacteriological media containing 20% glycerol, and other like media.


When a polypeptide as described herein is to be obtained from a microbe, the microbe can be incubated under low metal conditions. As used herein, the phrase “low metal conditions” refers to an environment, typically bacteriological media, that contains an amount of a free metal that causes a microbe to express metal-regulated polypeptides at a detectable level. As used herein, the phrase “high metal conditions” refers to an environment that contains an amount of a free metal that causes a microbe to express a metal-regulated polypeptide at a decreased level compared to expression of the metal-regulated polypeptide under low metal conditions. In some cases, “high metal conditions” can refer to an environment that causes a cell to fail to express one or more of the metal-regulated polypeptides described herein at a detectable level.


In some cases, “high metal conditions” can include a metal-rich natural environment and/or culture in a metal-rich medium without a metal chelator. In contrast, in some cases, “low metal conditions” can include culture in a medium that includes a metal chelator, as described in more detail below. Metals are those present in the periodic table under Groups 1 through 17 (IUPAC notation; also referred to as Groups I-A, II-A, III-B, IV-B, V-B, VI-B, VII-B, VIII, I-B, II-B, III-A, IV-A, V-A, VI-A, and VII-A, respectively, under CAS notation). Preferably, metals are those in Groups 2 through 12, more preferably, Groups 3-12. Even more preferably, the metal is iron, zinc, copper, magnesium, nickel, cobalt, manganese, molybdenum, or selenium, most preferably, iron.


Low metal conditions are generally the result of the addition of a metal chelating compound to a bacteriological medium, the use of a bacteriological medium that contains low amounts of a metal, or the combination thereof. High metal conditions are generally present when a chelator is not present in the medium, when a metal is added to the medium, or the combination thereof. Examples of metal chelators include natural and synthetic compounds. Examples of natural compounds include plant phenolic compounds, such as flavenoids. Examples of flavenoids include the copper chelators catechin and naringenin, and the iron chelators myricetin and quercetin. Examples of synthetic copper chelators include, for instance, tetrathiomolybdate, and examples of synthetic zinc chelators include, for instance, N,N,N′,N′-Tetrakis (2-pyridylmethyl)-ethylene diamine Examples of synthetic iron chelators include 2,2′-dipyridyl (also referred to in the art as θ,θ′-bipyridyl), 8-hydroxyquinoline, ethylenediamine-di-O-hydroxyphenylacetic acid (EDDHA), desferrioxamine methanesulphonate (desferol), transferrin, lactoferrin, ovotransferrin, biological siderophores, such as, the catecholates and hydroxamates, and citrate. An example of a general divalent cation chelator is CHELEX resin. Preferably, 2,2′-dipyridyl is used for the chelation of iron. Typically, 2,2′-dipyridyl is added to the media at a concentration of at least 300 micrograms/milliliter (μg/ml), at least 600 μg/ml, or at least 900 μg/ml. High levels of 2,2′-dipyridyl can be, for example, 1200 μg/ml, 1500 μg/ml, or 1800 μg/ml.


The B. thailandensis genome encodes a ferric uptake regulator (Fur) homolog. The Fur protein controls the uptake of metal ions by regulating the expression of iron receptors such as the siderophore receptor proteins in response to iron limitation. It also plays a role in oxidative stress resistance and virulence. It is expected that a gram negative organism, preferably, a Burkholderia spp., with a mutation in the fur coding region will result in the constitutive expression of many, if not all, of the metal-regulated polypeptides as described herein. The production of a fur mutation in a gram negative, preferably, a Burkholderia spp., can be produced using routine methods including, for instance, transposon, chemical, or site-directed mutagenesis useful for generating gene missense or knock-out mutations in gram negative bacteria.


The medium used to incubate the microbe and the volume of media used to incubate the microbe can vary. When a microbe is being evaluated for the ability to produce one or more of the polypeptides described herein, the microbe can be grown in a suitable volume, for instance, 10 milliliters to 1 liter of medium. When a microbe is being grown to obtain polypeptides for use in, for instance, administration to animals, the microbe may be grown in a fermenter to allow the isolation of larger amounts of polypeptides. Methods for growing microbes in a fermenter are routine and known to the art. The conditions used for growing a microbe preferably include a metal chelator, more preferably an iron chelator, for instance 2,2′-dipyridyl, a pH of between 6.5 and 7.5, preferably between 6.9 and 7.1, and a temperature of 37° C.


In some aspects of the invention, a microbe may be harvested after growth. Harvesting includes concentrating the microbe into a smaller volume and suspending in a media different than the growth media. Methods for concentrating a microbe are routine and known in the art, and include, for example, filtration or centrifugation. Typically, the concentrated microbe is suspended in an appropriate buffer. An example of a buffer that can be used contains Tris-base (7.3 grams/liter), at a pH of 8.5. Optionally, the final buffer also minimizes proteolytic degradation. This can be accomplished by having the final buffer at a pH of greater than 8.0, preferably, at least 8.5, and/or including one or more proteinase inhibitors (e.g., phenylmethanesulfonyl fluoride). Optionally, the concentrated microbe is frozen at −20° C. or below until disrupted.


When the microbe is to be used as a whole cell preparation, the harvested cells may be processed using routine and known methods to inactivate the cells. Alternatively, when a microbe is to be used to prepare polypeptides as described herein, the microbe may be disrupted using chemical, physical, or mechanical methods routine and known to the art, including, for example, boiling, French press, sonication, digestion of peptidoglycan (for instance, by digestion with lysozyme), or homogenization. An example of a suitable device useful for homogenization is a model C500-B AVESTIN homogenizer, (Avestin Inc., Ottawa, Canada). As used herein, “disruption” refers to the breaking up of the cell. Disruption of a microbe can be measured by methods that are routine and known to the art, including, for instance, changes in optical density. Typically, a microbe is subjected to disruption until the percent transmittance is increased by 20% when a 1:100 dilution is measured. When physical or mechanical methods are used, the temperature during disruption is typically kept low, preferably at 4° C., to further minimize proteolytic degradation. When chemical methods are used the temperature may be increased to optimize for the cell disruption. A combination of chemical, physical, and mechanical methods may also be used to solubilize the cell wall of microbe. As used herein, the term “solubilize” refers to dissolving cellular materials (e.g., polypeptides, nucleic acids, carbohydrates) into the aqueous phase of the buffer in which the microbe was disrupted, and the formation of aggregates of insoluble cellular materials. Without intending to be limited by theory, the conditions for solubilization are believed to result in the aggregation of polypeptides as described herein into insoluble aggregates that are large enough to allow easy isolation by, for instance, centrifugation.


The insoluble aggregates that include one or more of the polypeptides as described herein may be isolated by methods that are routine and known to the art. Preferably, the insoluble aggregates are isolated by centrifugation. Typically, centrifugation of polypeptides, such as membrane polypeptides, can be accomplished by centrifugal forces of 100,000×g. The use of such centrifugal forces requires the use of ultracentrifuges, and scale-up to process large volumes of sample is often difficult and not economical with these types of centrifuges. The methods described herein provide for the production of insoluble aggregates large enough to allow the use of continuous flow centrifuges, for instance T-1 Sharples (Alfa Laval, Inc., Richmond, Va.), which can be used with a flow rate of 250 ml/minute at 17 psi at a centrifugal force of 46,000×g to 60,000×g. Other large scale centrifuges can be used, such as the tubular bowl, chamber, and disc configurations. Such centrifuges are routinely used and known in the art, and are commercially available from such manufactures as Pennwalt, Ltd., GEA Westfalia Separator Division (GEA Mechanical Equipment US, Inc.), or Alpha Laval, Inc.


The final harvested proteins can be washed and/or dialyzed against an appropriate buffer using conventional methods such as, for example, diafiltration, precipitation, hydrophobic chromatography, ion-exchange chromatography, affinity chromatography, or ultra-filtration, followed by washing the polypeptides, for instance, in alcohol, by diafiltration. After isolation, the polypeptides suspended in buffer and stored at low temperature, for instance, −20° C. or below.


In those aspects as described herein where a whole cell preparation is to be made, after growth a microbe can be killed with the addition of an agent such as glutaraldehyde, formalin, or formaldehyde, at a concentration sufficient to inactivate the cells in the culture. For instance, formalin can be added at a concentration of 0.3% (vol:vol). After a period of time sufficient to inactivate the cells, the cells can be harvested by, for instance, diafiltration and/or centrifugation, and washed.


In other aspects, an isolated polypeptide as described herein may be prepared recombinantly. When prepared recombinantly, a polynucleotide encoding the polypeptide may be identified and cloned into an appropriate expression host as described below in Example 6. The recombinant expression host may be grown in an appropriate medium, disrupted, and the polypeptides isolated as described above.


Methods of Use

In another aspect, this disclosure further provides methods of using the compositions as described herein. The methods include administering to an animal an effective amount of a composition as described herein. The animal can be, for instance, avian (including, for instance, chickens or turkeys), bovine (including, for instance, cattle), caprine (including, for instance, goats), ovine (including, for instance, sheep), porcine (including, for instance, swine), bison (including, for instance, buffalo), equine (including, for instance, horses), a companion animal (including, for instance, dogs or cats), members of the family Cervidae (including, for instance, deer, elk, moose, caribou and reindeer), or human.


In some aspects, the methods may further include additional administrations (e.g., one or more booster administrations) of the composition to the animal to enhance or stimulate a secondary immune response. A booster can be administered at a time after the first administration, for instance, one to eight weeks, preferably two to four weeks, after the first administration of the composition. Subsequent boosters can be administered one, two, three, four, or more times annually. Without intending to be limited by theory, it is expected that in some aspects as described herein annual boosters will not be necessary, as an animal will be challenged in the field by exposure to microbes expressing polypeptides present in the compositions having epitopes that are identical to or structurally related to epitopes present on polypeptides of the composition administered to the animal.


In one embodiment, the method can involve making antibody, for instance by inducing the production of antibody in an animal or by recombinant techniques. As used herein, the term “antibody”—when not preceded by a definite or indefinite article—can be used generically to refer to any preparation that includes at least one molecular species of immunoglobulin or a fragment (e.g., scFv, Fab, F(ab′)2 or Fv or other modified fragment) thereof. Therefore, “antibody” can generically include one or more monoclonal antibodies and/or a polyclonal antibody preparation. Antibody produced by the method can include antibody that specifically binds at least one polypeptide present in the composition. In this context, an “effective amount” is an amount effective to result in the production of antibody in the animal. Methods for determining whether an animal has produced antibody that specifically binds a polypeptide present in a composition can be determined as described herein. This disclosure therefore further provides antibody that specifically binds to a polypeptide as described herein, and compositions including such antibody.


The method may be used to produce antibody that specifically binds to a polypeptide expressed by a microbe other than the microbe from which the polypeptide of the composition were isolated. As used herein, antibody that can “specifically bind” a polypeptide is antibody that interacts with the epitope of the antigen that induced the synthesis of the antibody, or interacts with a structurally related epitope. At least some of the polypeptides present in the compositions as described herein typically include epitopes that are conserved in the polypeptides of different species and different genera of microbes. Accordingly, antibody produced using a composition derived from one microbe is expected to bind to polypeptides expressed by other microbes and provide broad spectrum protection against gram negative organisms. Examples of gram negative microbes to which the antibody may specifically bind are Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. Therefore, antibody produced using a composition of polypeptides as described herein may be used to identify and characterize polypeptides as described herein independent of the origin, source, and/or manner of obtaining the polypeptide.


In another aspect, this disclosure provides the use of such antibody to target a microbe expressing a polypeptide as described herein or a polypeptide having an epitope structurally related to an epitope present on a polypeptide as described herein. A compound can be covalently bound to an antibody, where the compound can be, for instance, a toxin. Likewise, such compounds can be covalently bound to a bacterial siderophore to target the microbe. The chemical coupling or conjugation of an antibody as described herein, or a portion thereof (such as a Fab fragment), can be carried out using known and routine methods.


In another aspect, this disclosure provides methods for treating an infection in an animal, including a human, caused by a gram negative microbe, preferably by a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. As used herein, the term “infection” refers to the presence of a gram negative microbe in an animal's body, which may or may not be clinically apparent. An animal with an infection by a member of the genus Burkholderia that is not clinically apparent is often referred to as an asymptomatic carrier.


Treating an infection can be prophylactic or, alternatively, therapeutic—in this context, treatment after a subject manifests one or more indication of infection by a microbe. Generally, treatment that is prophylactic—in this context, initiated before a subject is infected by a microbe or while an infection remains subclinical—is referred to herein as treatment of a subject that is “at risk” of infection. As used herein, the term “at risk” refers to an animal that may or may not actually possess the described risk—in this context, an animal that may or may not be infected by a particular microbe. Thus, typically, an animal “at risk” of infection by a microbe is an animal present in an area where animals have been identified as infected by the microbe and/or is likely to be exposed to the microbe even if the animal has not yet manifested any detectable indication of infection by the microbe and regardless of whether the animal may harbor a subclinical amount of the microbe. Accordingly, administration of a composition can be performed before, during, or after the animal has first contact with the microbe. Treatment initiated after the animal's first contact with the microbe may result in decreasing the severity of symptoms and/or clinical signs of infection by the microbe, completely removing the microbe, and/or decreasing the likelihood of experiencing a clinically evident infection compared to an animal to which the composition is not administered. The method includes administering an effective amount of the composition as described herein to an animal having, or at risk of having, an infection caused by a gram negative microbe, and determining whether the number of microbes causing the infection has decreased. In this context, an “effective amount” is an amount effective to reduce the number of the specified microbes in an animal or reduce the likelihood that the animal experiences a clinically-evident infection compared to an animal to which the composition is not administered. Methods for determining whether an infection is caused by a gram negative microbe are routine and known in the art, as are methods for determining whether the infection has decreased.


In another aspect, the present invention is directed to methods for treating one or more symptoms or clinical signs of certain conditions in an animal that may be caused by infection by a gram negative microbe, preferably by a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. The method includes administering an effective amount of a composition as described herein to an animal having or at risk of having a condition, or exhibiting symptoms and/or clinical signs of a condition, and determining whether at least one symptom and/or clinical sign of the condition is changed, preferably, reduced.


Melioidosis caused by B. pseudomallei tends to be more prevalent in people during the fourth and fifth decades of life, especially among those who have other comorbidities such as diabetes, alcoholism, immunosuppression, and renal failure. Melioidosis may be transmitted through direct skin contact with contaminated soil or water, ingestion of contaminated water, inhalation of dust contaminated with the bacterium, sexual contact, or intravenous drug use. Human-to-human transmission is rare but has been documented. The cutaneous form results in local infection with ulceration and swollen lymph glands. General symptoms include fever, rigors, night sweats, myalgia, anorexia, and headache. Additional symptoms are dependent on the route of exposure but may include chest pain, cough, photophobia, lacrimation, and diarrhea. Physical findings may include fever, cervical adenopathy, pustular skin lesions, hepatomegaly, or splenomegaly. During primary melioidosis, patients may experience severe urticaria. The chronic form may involve multiple abscesses that affect the liver, spleen, skin, or muscles, and may reactivate many years after the primary infection.


The pulmonary form of melioidosis may be manifest as pneumonia, pulmonary abscesses, and pleural effusions. Cutaneous abscesses may also develop and can take months to appear. Those who develop septicemia may develop respiratory distress, headaches, fever, diarrhea, pus-filled skin lesions, and abscesses throughout the body. Pustules often occur in association with regional lymphadenitis, cellulitis, or lymphangitis. Specifically, septicemia, high fever, and rigor are often present and may be accompanied by confusion, dyspnea, abdominal pain, muscle tenderness, pharyngitis, diarrhea, and jaundice. Although the foci of infection may be lungs or skin, once septicemia has developed, the disease spreads to the liver, spleen, kidney, brainstem, and parotid gland, leading to acidosis and shock with a high mortality rate exceeding 90% and death occurring within 24-48 hours.



B. mallei is the causative agent of glanders, which occurs primarily in horses and other solipeds. B. mallei is highly virulent and exhibits a pathophysiology in humans that is similar to glanders, where the clinical symptoms are similar to melioidosis as described above. Transmission is through direct skin or mucous membrane contact with infected animal tissues. Human-to-human transmission is possible and has been reported. Septicemia may include cutaneous, hepatic, and splenic involvement and is usually fatal within 7-10 days. The chronic form may involve multiple abscesses that affect the liver, spleen, skin, or muscles.


The Burkholderia cepacia complex is a group of at least 17 species responsible for opportunistic infections that are particularly problematic in diseases that cause impaired pulmonary function such as cystic fibrosis or chronic granulomatous disease. These organisms have also been a source of catheter-related infections in cancer patients and in those who are on hemodialysis. They have also been a source of skin and soft tissue infection, surgical wound infection, and genitourinary infection. The symptoms of pulmonary infection vary widely, ranging from asymptomatic infection to serious respiratory infections, especially in individuals that are immunocompromised, the young, the elderly, and people with lung disease. Symptoms are similar to other lung infections, with cough, wheezing shortness of breath, congestion, and fever. Thus, infection may be difficult to diagnose. Bacteria can persist in the lungs for years without symptoms.


Treatment of symptoms and/or clinical signs associated with these conditions can be prophylactic or, alternatively, therapeutic—in this context, treatment initiated after the subject exhibits one or more symptoms or clinical signs associated with a condition caused by infection by a gram negative microbe. As used herein, the term “symptom” refers to subjective evidence of disease or condition experienced by the patient and caused by infection by a microbe. As used herein, the term “clinical sign” or, simply, “sign” refers to objective evidence of disease or condition caused by infection by a microbe. Symptoms and/or clinical signs associated with conditions referred to herein and the evaluations of such symptoms are routine and known in the art. Treatment that is prophylactic—in this context, treatment that is initiated before a subject manifests symptoms or signs of a condition caused by a microbe—is referred to herein as treatment of a subject that is “at risk” of developing the condition. Thus, typically, an animal “at risk” of developing a condition is an animal present in an area where animals having the condition have been diagnosed and/or is likely to be exposed to a microbe causing the condition even if the animal has not yet manifested symptoms or signs of any condition caused by the microbe. Accordingly, administration of a composition can be performed before, during, or after the occurrence of the conditions described herein. Treatment initiated after the development of a condition may result in decreasing the severity of the symptoms of one of the conditions, or completely removing the symptoms. In this aspect of the invention, an “effective amount” is an amount effective to prevent the manifestation of symptoms of a disease, decrease the severity of the symptoms of a disease, and/or completely remove the symptoms. The successful treatment of a gram negative microbial infection in an animal is disclosed in Example 6, which demonstrates the protection against disease caused by B. thailandensis in a mouse model by administering a composition as described herein. These mouse models are a commonly accepted model for the study of human disease caused by these microbes.


This disclosure also provides methods for decreasing colonization by gram negative microbes, for instance blocking the attachment sites of gram negative microbe, including tissues of the skeletal system (for instance, bones, cartilage, tendons and ligaments), muscular system, (for instance, skeletal and smooth muscles), circulatory system (for instance, heart, blood vessels, capillaries and blood), nervous system (for instance, brain, spinal cord, and peripheral nerves), respiratory system (for instance, nose, trachea lungs, bronchi, bronchoceles, alveoli), digestive system (for instance, mouth, salivary glands esophagus, liver, stomach, large intestine, or small intestine), excretory system (for instance, kidneys, ureters, bladder and urethra), endocrine system (for instance, hypothalamus, pituitary, thyroid, pancreas and adrenal glands), reproductive system (for instance, ovaries, oviduct, uterus, vagina, mammary glands, testes, and seminal vesicles), lymphatic/immune systems (for instance, lymph, lymph nodes and vessels, mononuclear or white blood cells, such as macrophages, neutrophils, monocytes, eosinophils, basophils, and lymphocytes, including T cells and B cells), and specific cell lineages (for instance, precursor cells, epithelial cells, stem cells), and the like. Preferably, the gram negative microbe is a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. Protein conservation suggests that the Burkholderia vaccines may also decrease colonization by other gram negative microbes such as Achromobacter spp., Ralstonia spp., Pseudomonas spp., Bordetella spp., and Acinetobacter spp.


Decreasing colonization in an animal may be performed prophylactically or, alternatively, therapeutically—in this context, initiated after the animal is colonized by the microbe. Treatment that is prophylactic—in this context, initiated before a subject is colonized by a microbe or while any colonization remains undetected—is referred to herein as treatment of a subject that is “at risk” of colonization by the microbe. Thus, typically, an animal “at risk” of colonization by a microbe is an animal present in an area where animals have been identified as colonized by the microbe and/or is likely to be exposed to the microbe even if the animal has not yet manifested any detectable indication of colonization by the microbe and regardless of whether the animal may harbor a subcolonization number of the microbe. Accordingly, administration of a composition can be performed before, during, or after the animal has first contact with the microbe. Treatment initiated after the animal's first contact with the microbe may result in decreasing the extent of colonization by the microbe, completely removing the microbe, and/or decreasing the likelihood that the animal becomes colonized by the microbe compared to an animal to which the composition is not administered. Thus, the method includes administering an effective amount of a composition as described herein to an animal colonized by, or at risk of being colonized by, a gram negative microbe. In this context, an “effective amount” is an amount sufficient to decrease colonization of the animal by the microbe, where decreasing colonization refers to one or more of: decreasing the extent of colonization by the microbe, completely removing the microbe, and/or decreasing the likelihood that the animal becomes colonized by the microbe compared to an animal to which the composition is not administered. Methods for evaluating the colonization of an animal by a microbe are routine and known in the art. For instance, colonization of an animal's intestinal tract by a microbe can be determined by measuring the presence of the microbe in the animal's feces. It is expected that decreasing the colonization of an animal by a microbe will reduce transmission of the microbe to humans.


A composition as described herein can be used to provide for active or passive immunization against bacterial infection. Generally, the composition can be administered to an animal to provide active immunization. However, the composition also can be used to induce production of immune products, such as antibody—e.g., a polyclonal antibody preparation or a monoclonal antibody—that can be collected from the producing animal and administered to another animal to provide passive immunity Immune components such as, for example, antibody can be collected from serum, plasma, blood, colostrum, etc. to prepare compositions (preferably containing the collected antibody) for passive immunization therapies. Antibody compositions including monoclonal antibodies and/or anti-idiotypes can also be prepared using known methods. Chimeric antibodies include human-derived constant regions of both heavy and light chains and murine-derived variable regions that are antigen-specific (Morrison et al., 1984, Proc. Natl. Acad. Sci. USA 81(21):6851-6855; LoBuglio et al., 1989, Proc. Natl. Acad. Sci. USA 86(11):4220-4224; Boulianne et al., 1984, Nature 312(5995):643-646.). Humanized antibodies substitute the murine constant and framework (FR) (of the variable region) with the human counterparts (Jones et al., 1986, Nature 321(6069):522-525; Riechmann et al., 1988, Nature 332(6162):323-327; Verhoeyen et al., 1988, Science 239(4847):1534-1536; Queen et al., 1989, Proc. Natl. Acad. Sci. USA 86(24):10029-10033; Daugherty et al., 1991, Nucleic Acids Res. 19(9): 2471-2476.). Alternatively, certain mouse strains can be used that have been genetically engineered to produce antibodies that are almost completely of human origin; following immunization the B cells of these mice are harvested and immortalized for the production of human monoclonal antibodies (Bruggeman et al. 1997, Curr. Opin. Biotechnol. 8(4):455-458; Lonberg et al., 1995, Int. Rev. Immunol. 13(1):65-93; Lonberg et al., 1994, Nature 368:856-859; Taylor et al., 1992, Nucleic Acids Res. 20:6287-6295.).


Passive antibody compositions and fragments thereof, e.g., scFv, Fab, F(ab′)2 or Fv or other modified forms thereof, may be administered to a recipient in the form of serum, plasma, blood, colostrum, and the like. Antibody may, however, also be isolated from serum, plasma, blood, colostrum, and the like, using known methods for later use in a concentrated or reconstituted form such as, for instance, lavage solutions, impregnated dressings and/or topical agents and the like. Passive immunization preparations may be particularly advantageous for the treatment of acute systemic illness, or passive immunization of young animals that failed to receive adequate levels of passive immunity through maternal colostrum. Antibody useful for passive immunization also may be useful to conjugate to various drugs or antibiotics that could be directly targeted to bacteria expressing during a systemic or localized infection a polypeptide as described herein or a polypeptide having an epitope structurally related to an epitope present on a polypeptide as described herein.


Animal models, in particular mouse models, are available for experimentally evaluating the compositions as described herein. These mouse models are commonly accepted models for the study of human disease caused by members of the genus Burkholderia, and, in particular B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans. In those cases where a member of the genus Burkholderia causes disease in an animal, the natural host can be used to experimentally evaluate the compositions as described herein.


However, protection in a mouse model is not the only way to assess whether a composition can confer protection to an animal against infection by a Burkholderia spp. The adaptive immune response consists of two primary divisions: the humoral (antibody) response and the cellular (T cell) response. Following infection by a bacterial pathogen, dendritic cells at the infection site encounter microbial antigens and produce signaling molecules such as, for example, surface receptors and cytokines in response to conserved molecular patterns associated with the specific bacterium. These signals are shaped by the nature of the pathogen and ideally lead to the appropriate antibody and T cell responses that protect the host from disease. While some bacterial diseases are controlled primarily through antibody functions, others require T cell responses or both antibody and T cell responses for protection. The goal of vaccine biology is to identify the immune responses that provide protection and then design a vaccine to reproduce one or more of these responses in humans.


Antibodies can have many different functions in the conferring protection against infection such as, for example, complement fixation, opsonization, neutralization, and/or agglutination. Moreover, some subclasses of antibodies are better than others at specific functions; for example, for complement fixation the following hierarchy exists for human IgG subclasses: IgG3>IgG1>IgG2>IgG4.


Antibody immunological functions can be studied in a variety of ways. For instance, Western blots are used to identify antigen-specific binding based on size of separated proteins, while the standard enzyme-linked immunosorbent assay (ELISA) is used to produce quantitative information about antibody titers within serum. Antibody surface binding studies are used to determine whether antibody in serum are able to recognize antigens on the surface of intact bacteria, an important indicator of whether the antibodies have the potential to work in vivo. Thus, one skilled in the art recognizes that antibody binding assays such as a Western blot, ELISA (e.g., using human antisera), and/or surface binding correlate positively with the specifically-bound antigens providing immunological activity against infection by Burkholderia spp. However, one skilled in the art further recognizes that a lack of antibody binding in an assay such as, for example, a Western blot, ELISA, or surface binding assay does not mean that the assayed antigen fails to provide immunological activity against infection by Burkholderia spp.


For example, FIG. 7 and Table 8 provide data showing antibody titer (FIG. 7) and Western blot data for certain representative Burkholderia iron-regulated polypeptides.


Techniques such as opsonophagocytosis assays (OPA), in which antibody and complement-bound bacteria are combined with human or mouse phagocytes to determine levels of bacterial killing, are useful for studying antibody function. Positive OPA results correlate with vaccine-induced protection in a mouse model. (Stranger-Jones et al., 2006, Proc. Nati. Acad. Sci. 103(45):16942-16947). A similar oxidative burst assay can be used to assess the level of reactive oxygen species (ROS) by fresh human or mouse neutrophils following interaction with antibody and complement-bound bacteria.


In some cases, one can determine that a candidate polypeptide possesses cell-mediated immunological activity and, therefore, the candidate polypeptide may exhibit immunological activity in the absence of inducing the production of antibodies. (Spellberg et al., 2008, Infect. Immun. 76(10):4575-4580). Cytotoxic or CD8+ T cells primarily kill infected cells directly through various effector mechanisms, while helper CD4+ T cells function to provide important signaling in the way of cytokines. These T cell classes can be further subdivided based on the cytokines they produce, and different subclasses are effective against different bacterial pathogens. T cells are often studied by assessing their phenotypes with flow cytometry, where antibodies are used to visualize the levels of specific surface markers that enable classification of the T cells as, for example, a recently activated CD4+ T cell, a memory CD8+ T cell, etc. In addition, cytokines and other products of T cells can be studied by isolating the T cells from lymphoid tissue and re-stimulating them with cognate antigen. Following antigen stimulation the T cells produce cytokines that may be visualized by, for example, intracellular cytokine staining coupled with flow cytometry, or collecting the cell supernatants and using Luminex bead technology to measure 15-25 cytokines simultaneously.


For example, Table 9 provides cytokine recall response of spleen cells from mice immunized with representative Burkholderia iron-regulated polypeptides.


Thus, in addition to mouse models, those of ordinary skill in the art recognize that immunological activity commensurate with the methods described herein may correlate with any one or more of the following: Western blot data showing that serum from animals exposed to a Burkholderia spp. contains antibody that specifically binds to a candidate polypeptide, cell surface binding assays demonstrating that antibody that specifically binds to a candidate polypeptide specifically binds to a Burkholderia spp., opsonophagocytosis data, and cytokine induction.


Another aspect of the present invention provides methods for detecting antibody that specifically binds polypeptides as described herein. These methods are useful in, for instance, detecting whether an animal has antibody that specifically binds polypeptides as described herein, and diagnosing whether an animal may have a condition caused by a microbe expressing polypeptides described herein, or expressing polypeptides that share epitopes with the polypeptides described herein. Such diagnostic systems may be in kit form. The methods include contacting an antibody with a preparation that includes a polypeptide as described herein to result in a mixture. The antibody may be present in a biological sample, for instance, blood, milk, or colostrum. The method further includes incubating the mixture under conditions to allow the antibody to specifically bind the polypeptide to form a polypeptide:antibody complex. As used herein, the term “polypeptide:antibody complex” refers to the complex that results when an antibody specifically binds to a polypeptide. The preparation that includes the polypeptides as described herein may also include reagents, for instance a buffer, that provide conditions appropriate for the formation of the polypeptide:antibody complex. The polypeptide:antibody complex is then detected. The detection of antibodies is known in the art and can include, for instance, immunofluorescence or peroxidase. The methods for detecting the presence of antibodies that specifically bind to polypeptides as described herein can be used in various formats that have been used to detect antibody, including radioimmunoassay and enzyme-linked immunosorbent assay.


In another aspect, this disclosure provides a kit for detecting antibody that specifically binds polypeptides as described herein. The antibody detected may be obtained from an animal suspected to have an infection caused by a gram negative microbe, more preferably, a member of the family Burkholderiaceae, preferably, Burkholderia spp. such as, for example, B. thailandensis, B. mallei, B. pseudomallei, B. cenocepacia, or B. multivorans.


The kit can include at least one of the polypeptides as described herein (e.g., one, at least two, at least three, etc.), in a suitable packaging material in an amount sufficient for at least one assay. Optionally, other reagents such as buffers and solutions needed to practice the invention are also included. For instance, a kit may also include a reagent to permit detection of an antibody that specifically binds to a polypeptide as described herein, such as a detectably labeled secondary antibody designed to specifically bind to an antibody obtained from an animal. Instructions for use of the packaged polypeptides are also typically included. As used herein, the phrase “packaging material” refers to one or more physical structures used to house the contents of the kit. The packaging material is constructed by conventional methods, generally to provide a sterile, contaminant-free environment. The packaging material may have a label which indicates that the polypeptides can be used for detecting antibody that specifically binds polypeptides as described herein. In addition, the packaging material contains instructions indicating how the materials within the kit are employed to detect the antibody. As used herein, the term “package” refers to a container such as glass, plastic, paper, foil, and the like, capable of holding within fixed limits the polypeptides, and other reagents, for instance a secondary antibody. Thus, for example, a package can be a microtiter plate well to which microgram quantities of polypeptides have been affixed. A package can also contain a secondary antibody. “Instructions for use” typically include a tangible expression describing the reagent concentration or at least one assay method parameter, such as the relative amounts of reagent and sample to be admixed, maintenance time periods for reagent/sample admixtures, temperature, buffer conditions, and the like.


The present invention is illustrated by the following examples. It is to be understood that the particular examples, materials, amounts, and procedures are to be interpreted broadly in accordance with the scope and spirit of the invention as set forth herein.


EXAMPLES
Example 1
Preparation of Burkholderia Metal-Regulated Polypeptides

Compositions derived from B. thailandensis E264 included novel polypeptides expressed under iron restricted growth. Master seed stocks were prepared by inoculating a single colony of strain E264 into 100 mL Tryptic Soy Broth (TSB, Difco Laboratories, Detroit, Mich.) followed by incubation in a shaking incubator at 37° C. and 400 rpm overnight. The culture was expanded by a 1/100 dilution into fresh TSB and incubated as before until it reached the mid-log phase of growth. The bacteria were pelleted by centrifugation at 5000×g, 4° C., for 10 minutes. The supernatant was decanted, and an equal volume of PBS was added to resuspend the pellet. The bacteria were pelleted by centrifugation as before and the pellet was resuspended in TSB containing 15-50% glycerol at one-tenth of the original culture volume. Stocks were frozen in aliquots of 100-1000 μL and stored at −80° C. Working seed stocks were prepared using the same procedure but with the initial inoculum obtained from a frozen master stock vial.


The iron-regulated polypeptide (IRP) composition was prepared by inoculating B. thailandensis from a frozen stock into 10 mL tryptic soy broth (TSB) supplemented with 200 μM 2,2′-dipyridyl (DP) (Sigma-Aldrich, St. Louis, Mo.). Iron-replete cultures contained 300 μM FeCl3. Cultures were incubated at 37° C. on a shaker at 400 rpm. After 16-24 hours incubation, 10 mL of culture was transferred into 90 mL prewarmed TSB supplemented with either 200 μM DP or 300 μM FeCl3 and incubated at 37° C. on a shaker at 400 rpm. After 16-24 hours, 90 mL of culture was transferred into 900 mL prewarmed TSB supplemented with either 300 μM FeCl3 or 400 μM DP and incubated at 37° C. on a shaker at 400 rpm. After 16-20 hours the cells were harvested by centrifugation at 10,000×g for 20 minutes at 4° C., resuspended in PBS and centrifuged to obtain the final cell pellet. The cell pellets were weighed and stored frozen at −80° C.


The frozen cell pellet was thawed at room temperature and resuspended by the addition of 25 mL Tris-EDTA buffer (15 mM Tris-HCl, 3 mM EDTA, pH 8.5) per gram of pellet. The cell suspension was distributed into sterile 50 mL conical tubes at a volume of 35 mL/tube. Tubes were placed in a −80° C. freezer for a minimum of 30 minutes after which they were removed and thawed at 25° C.-37° C. The cells were disrupted by sonication (Branson, Danbury, Conn.) for 90 seconds on ice. The disrupted cell suspension was transferred to a sterile 40 mL round bottom centrifuge tube and centrifuged at 39,800×g for 20 minutes at 4° C. The soluble membrane fraction in the supernatant was transferred to a sterile 40 mL round bottom centrifuge tube, and 3 mL of 30% sarcosine (N-lauroylsarcosine sodium salt, Sigma-Aldrich, St. Louis, Mo.) was added to each tube. Tubes were incubated for 16-24 hours at 4° C. with rocking. The detergent-insoluble membrane fraction was pelleted by centrifugation at 39,800×g for two hours at 4° C. followed by removal of the supernatant. Pellets were dried by inverting the tubes for a minimum of five minutes. The pellets were resuspended in 75 μl PBS (pH 7.2).


A sample of the pellet was evaluated by denaturing SDS-PAGE using 10% gels stained with Coomassie Blue and imaged using a LI-COR Odyssey infrared imaging system (LI-COR Biosciences, Lincoln, Nebr.)(FIG. 1). The banding profile indicated that expression of polypeptides in the 64 kDa to 98 kDa region was increased when B. thailandensis was grown under iron restriction (lane 3, DP) relative to growth in the presence of iron (lane 2, Fe). This is similar to what has been observed for other Gram negative bacteria grown under iron restriction and corresponds to receptors that are involved in iron acquisition.


Example 2
Two-Dimensional (2D) Gel Analysis of Iron-Regulated Polypeptides

Two-dimensional PAGE separation of membrane extracts from B. thailandensis grown under either iron-rich or iron-depleted conditions was performed using an acidic polyacrylamide system with cationic detergent benzyldimethyl-n-hexadecylammonium chloride (16-BAC) for the first dimension and sodium dodecyl sulfate (SDS) for the second dimension (Hartinger et al., 1996, Anal Biochem 240:126-133). A 50 μg sample of membrane extract was solublized in 7.5 M urea, 10% 16-BAC (w/v), 75 mM DTT, and 0.05% pyronin Y, and electrophoresis was conducted using an 8.7% acrylamide gel with a 50 mM phosphoric acid running buffer. Electrophoretic separation in the 16-BAC phase was carried out at a current of 15 mA, from anode to cathode, at 4° C. overnight until the dye front migrated out of the gel. The gel was stained with 8250 Coomassie Blue. Each lane was excised and equilibrated through four changes of 0.1 M Tris, pH 6.8, with further equilibration in reducing buffer (75 mM Tris, 576 mM glycine, 0.3% SDS, 5% β-mercaptoethanol) for five minutes. Gel strips were overlaid onto the second dimension gel and fixed into place with 0.1% agarose. SDS-PAGE (5-16% gradient) separation was performed using a PROTEAN plus dodeca cell (Bio-Rad Laboratories, Inc., Hercules, Calif.) at 25 mA/gel until the dye front migrated out of the gel.


Separation in the first and second dimensions is dependent on molecular weight, hence, larger polypeptides appear in the upper right portion of the second dimension gel, and polypeptides of decreasing molecular weight appear on a diagonal toward the lower left portion of the gel (FIG. 2). The increased intensity of staining associated with iron restriction in FIG. 2B compared with extracts from cells grown in the presence of iron in FIG. 2A indicates that the expression of these polypeptides is upregulated under iron restriction.


Example 3
Identification of Iron-Regulated Polypeptides in the Extract

Extracts of B. thailandensis grown under iron depleted conditions were subjected to 2D gel electrophoresis as described in Example 2. Regions of the gel that stained positive for polypeptides were excised from the second dimension gel and analyzed by matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) according to the following procedure (Wilm et al., 1996, Nature 379:466-469). The excised gel spot was cut into smaller pieces and washed twice with water for 10 minutes each. All wash volumes were approximately equal to twice the volume of the gel pieces. The gel pieces were washed with a 1:1 mix of acetonitrile and 100 mM ammonium bicarbonate, pH 7.4, for 15-30 minutes. The wash was repeated once or twice as needed to remove the stain. The last wash was replaced with sufficient 100% acetonitrile (ACN) to cover the gel pieces until they turned opaque and sticky, whereupon they were removed and dried in a SAVANT SPEEDVAC (Thermo Fisher Scientific Inc., Waltham, Mass.) at 30° C. for 30 minutes. The dried gel pieces were placed into 25-30 μl of 50 mM ammonium bicarbonate containing trypsin at 110 ng/μl and digested for 16-18 hours at 37° C. Following digestion, the mixture was separated by centrifugation, the supernatant was removed, and a volume of 25-30 μl 0.1% trifluoroacetic acid (TFA) was added to extract the peptides. The samples were sonicated intermittently for 30 minutes, and supernatants containing the peptides were transferred into new tubes. The gel extraction was repeated using a solution of 0.1% TFA/30% ACN followed by 0.1% TFA/70% ACN. The pooled supernatants were concentrated in a SPEEDVAC to a final volume of 30-70 μl.


MALDI-MS analysis was performed using nano high-pressure liquid chromatography electrospray tandem mass spectrometry (nanoLC-ESI-MS/MS) methods with LTQ ORBITRAPS (Thermo Fisher Scientific Inc., Waltham, Mass.) coupled with NanoLC-2D pumps (Eksigent Technologies, LLC, AB SCIEX, Framingham, Mass.) for data acquisition and Scaffold analysis tool (Proteome Software, Portland, Oreg.) to compile the outputs from multiple search algorithms.


A polypeptide was considered to be present if at least two unique peptides for that polypeptide were identified in an excised gel spot. Fifteen polypeptides of interest were identified in the iron restricted extract (Table 2). Eight of the fifteen were detected only in the iron restricted extract and not in extracts from B. thailandensis grown in the presence of iron. These eight polypeptides are reflected in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:16, SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:25.









TABLE 2







Polypeptides identified in outer membrane preparations of



B. thailandensis grown under iron restricted conditions.














Molecular


SEQ

GI
Weight


ID #
Polypeptide
Number
(kDa)a













1
TonB-dependent siderophore receptor
83718630
81


2
RND efflux system outer membrane
83717404
55



lipoprotein


3
Outer membrane ferric siderophore
83716122
84



receptor


7
TonB-dependent copper receptor
83717103
81


9
OmpA family protein
83720098
27


11
OmpA family protein
83720431
24


12
Outer membrane porin
83719322
40


13
Outer membrane porin
83716488
44


14
Outer membrane porin
83718334
39


15
Outer membrane porin
83716254
55


16
RND efflux system outer membrane
83720933
58



lipoprotein


17
RND efflux system outer membrane
83718989
55



lipoprotein


21
Outer membrane porin
83718226
42


22
RND efflux system outer membrane
83719074
56



lipoprotein


25
RND efflux transporter MFP subunit
83720836
43






aMolecular weight (predicted).







Example 4
Differential Expression of Iron-Regulated Polypeptides in the Extract

The isobaric tags for relative and absolute quantification (iTRAQ) method was used to evaluate changes in expression of outer membrane proteins associated with growth of B. thailandensis in iron-depleted compared with iron-replete medium. Cultures were grown as described in Example 1, and 40 μg of outer membrane extract was evaluated. The primary amines of peptides and polypeptides were labeled with isobaric agents using ITRAQ-8plex reagents (Applied Biosystems, Life Technologies Corp., Carlsbad, Calif.) according to the manufacturer's instructions. Cation exchange chromatography was applied using an MCX column (Waters Corp., Milford, Mass.), and the peptides were separated using an UltiMate 3000 nano LC system (Dionex Corp., Sunnyvale, Calif.) coupled to ESI mode using a QSTAR XL mass spectrometer (Applied Biosystems, Life Technologies Corp., Carlsbad, Calif.).


The results are shown as the fold increase for a given protein in the iron restricted extract relative to the iron-replete extract (Table 3). A polypeptide was considered to be iron-regulated if it displayed a fold increase greater than 1 in at least two trials. Polypeptides identified as iron-regulated using iTRAQ analysis include those reflected in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:13, and SEQ ID NO:16.









TABLE 3







Detection and differential upregulation of B. thailandensis


outer membrane polypeptides during iron-restricted growth.










Molecular
Fold increase by iTRAQb



Weight
(Trial No.)













SEQ ID NO
GI Number
(kDa)a
(1)
(2)
(3)
(4)
















1
83718630
81
4.1
4.55
1.1
2.0


2
83717404
55
6.6
16.71
2.4
3.8


3
83716122
84
1.4
1.81
1.01
1.3


4
83717803
83
ND
ND
1.5
1.8


5
83716062
81
ND
ND
0.99
0.95


6
83720594
88
ND
ND
1.2
1.2


7
83717103
81
ND
0.64
0.37
0.52


9
83720098
27
1.6
1.74
0.39
1.6


10
83719210
19
1.7
1.72
0.42
1.1


11
83720431
24
1.4
1.55
0.47
0.68


12
83719322
40
0.99
0.99
0.24
0.43


13
83716488
44
0.93
1.16
1.0
1.3


14
83718334
39
0.81
1.35
0.31
0.67


15
83716254
55
0.58
0.8
0.53
0.39


16
83720933
58
1.2
1.29
0.52
0.89


17
83718989
55
0.73
0.72
ND
ND


18
83717287
19
0.18
0.17
0.17
0.14


20
83718178
36
2.7
ND
ND
ND


21
83718226
42
ND
ND
0.45
0.46


22
83719074
56
ND
ND
1.1
0.75






aMolecular weight (predicted).




bFold upregulation of polypeptides in extract vaccine from B. thailandensis E264 grown under iron-restricted compared with iron-replete conditions. A value of “ND” indicates that no peptides were detected.







Example 5
Identification of Additional Polypeptides Involved in Iron Uptake and Utilization, and Determination of Iron Regulation

A bioinformatics approach was undertaken to identify additional polypeptides that might be involved in iron uptake or utilization in B. thailandensis E264. Polypeptides of interest that had not already been identified as described in Examples 3 and 4 were downselected further to those shown in Table 4 based on structural similarity to known iron receptor proteins, localization in the bacterial outer membrane (computationally predicted by PSORTb V3.0), and additional information implicating them as iron receptors through comparison with B. pseudomallei orthologs. In addition, the polypeptide reflected in SEQ ID NO:19, which is involved in the acquisition and release of iron from intracellular storage by bacterioferritin, was also identified as a potential vaccine target.









TABLE 4







Additional B. thailandensis E264 polypeptides


involved in iron uptake and utilization










SEQ


Molecular


ID

GI
Weight


NO:
Polypeptide
Number
(kDa)a













8
TonB-dependent siderophore receptor
83717123
81


19
Bacterioferritin-associated ferredoxin- Bfd
83718020
8.5


23
TonB-dependent receptor
83718695
85


24
TonB-dependent receptor
83717289
78






aMolecular weight (predicted).







The final determination of metal regulation for candidate proteins was based on mass spectroscopy, iTRAQ analysis, and bioinformatics described in Examples 3, 4, and 5, respectively. Each of the 25 polypeptides was determined to be either iron-regulated, non-iron regulated, or had iron regulation that was uncertain (Table 5). This determination was made using the following, step-wise inclusion criteria. First, any protein that was detected in mass spectroscopy of iron-restricted but not iron replete extracts was considered to be iron-regulated. Second, any protein that demonstrated a fold increase in expression of greater than 1 in iTRAQ analysis was considered to be iron-regulated. This resulted in a total of 14 polypeptides being classified as iron-regulated. Of the remaining proteins, four were considered to be non-metal-regulated if they were detected in at least two iTRAQ trials but did not demonstrate a fold increase greater than 1. Seven polypeptides could not be definitively categorized as either iron-regulated or non-iron regulated, and these were classified as polypeptides whose iron regulation was uncertain.









TABLE 5







Summary of iron regulation for 25 candidate proteins











SEQ
Molecular
Iron



ID NO:
Weight (kDa)a
regulationb















1
81
IR



2
55
IR



3
84
IR



4
83
IR



5
81
U



6
88
IR



7
81
IR



8
81
U



9
27
IR



10
19
IR



11
24
IR



12
40
NIR



13
44
IR



14
39
NIR



15
55
NIR



16
58
IR



17
55
U



18
19
NIR



19
8.5
U



20
36
U



21
42
IR



22
56
IR



23
85
U



24
78
U



25
43
IR








aMolecular weight (predicted).





bIR, iron regulated; NIR, non-iron regulated; U, iron regulation is uncertain







Example 6
Animal Efficacy Studies

Vaccine efficacy studies were conducted in BALB/c and A/J mice using 15 mice per group. Vaccines were formulated to deliver doses of 50 ng, 100 ng, and 300 ng of extract vaccine containing 10 ng/dose of CpG and emulsified in 50% incomplete Freund's adjuvant (IFA). The placebo group consisted of IFA/CpG alone. Mice were immunized subcutaneously in a volume of 100 μl at Day 0 with a booster at Day 28.


On Day 42, mice were challenged intratracheally with a previously determined LD80 of B. thailandensis E264, and survival was monitored for 14 days. The challenge dose of B. thailandensis E264 was prepared as follows. A loop of bacteria was taken from a frozen glycerol stock, placed into 10 mL Luria Bertani (LB) broth, and incubated at 37° C. overnight. The 10 mL culture was inoculated into 100 ml of LB, incubated at 37° C. and monitored for growth by optical density at a wavelength of 600 nm (OD600). When the culture reached an OD600 of 1.9 (approximately 1×109 cfu/ml), the suspension was serially diluted with PBS to the predetermined challenge dose. All suspensions and dilutions were plated on LB agar to verify the actual concentration of bacteria administered.


Aerosolized challenge of mice was performed by intratracheal instillation according to an established protocol. In brief, mice were lightly anesthetized with a mixture of ketamine and xylazine (80 mg/kg ketamine and 20 mg/kg xylazine). The animal was manually restrained in an upright position and a padded forceps was used to gently open the mouth and hold the tongue down to the lower jaw to prevent swallowing. A second investigator then carefully administered 30 μl fluid to the back of the mouth using a sterile pipette tip and a p100 PIPETMAN (Gilson, Inc., Middleton, Wis.). This was followed by placing a gloved finger over the mouse nostrils to prevent obligate nasal breathing. The combination of holding the tongue to prevent swallowing and closing off the nostrils to prevent nasal breathing causes the mouse to inhale through the mouth and aspirate the instilled fluid. An immediate cough by the mouse, which can be detected both audibly and visibly, was used to verify that the procedure was performed correctly.


The results are shown in FIG. 3 and FIG. 4. In A/J mice, the 300 μg dose of vaccine resulted in 86% survival compared with 14% in the placebo. Lower doses of vaccine were not as protective. The increase in survival of vaccinated A/J mice, which are deficient in complement component C5, indicates that the vaccine is efficacious in animals that are predisposed to infection. Thus, the vaccine may also be effective in immunocompromised patients such as those with cystic fibrosis or diabetes. Protection in BALB/c mice was more robust, with increased survival demonstrated at all three doses of the vaccine. Compared with 13% survival in the placebo, vaccinates showed 67% and 60% survival at the 100 μg and the 50 μg doses, respectively, and 40% survival at the 300 μg dose.


Example 7
Conservation of Iron-Regulated Polypeptides

Because iron-regulated polypeptides tend to be evolutionarily conserved, the B. thailandensis extract vaccine would be expected to protect against infection by two of the most highly virulent species of Burkholderia, namely, B. pseudomallei and B. mallei. It may also protect against other Burkholderia such as those in the B. cepacia complex (BCC) that cause opportunistic infections in people with cystic fibrosis and chronic granulomatous disease. In this regard, B. cenocepacia (formerly BCC genomovar III) and B. multivorans (formerly BCC genomovar II) are two of the most common isolates from cystic fibrosis patients and are associated with increases in morbidity and mortality.


In the Example 6 lethal challenge studies, B. thailandensis was selected as a seed strain for the extract vaccine due to safety considerations associated with employing a BSL3 agent in manufacturing. However, recombinant polypeptide vaccines represent an alternative approach that is not subject to this limitation. Thus, a recombinant Burkholderia vaccine could employ polypeptides cloned directly from B. pseudomallei, which is highly lethal and difficult to treat, recognized as a potential bioweapon, endemic in certain parts of Asia and Australia, and considered to be an emerging infectious agent in other parts of the world. To address the possibility of whether a broad spectrum vaccine could be created using extract or recombinants, a bioinformatics approach was undertaken to compare the percent identity of the polypeptides identified in Examples 3, 4, and 5 across a variety of Burkholderia species and strains. Targeted vaccine polypeptides were compared by standard protein BLAST (blastp, NCBI) using a database of non-redundant polypeptide sequences and default parameters. To provide information on both the B. thailandensis extract vaccine and the B. pseudomallei recombinant vaccine, the analysis was performed using B. thailandensis E264 or B. pseudomallei K96243 polypeptides as the query against a subset of the sequenced Burkholderia strains available through GenBank. This panel of targeted strains was selected to achieve diversity based on geographic variability and differences in clinical disease and/or outcomes. Strains of particular interest to the Defense Threat Reduction Agency, as indicated in Broad Agency Announcements, were also included. Information on the selected strains and their sources is shown in Table 6.









TABLE 6







Names and sources of strains used to compare sequence


conservation of iron-regulated polypeptides









Species
Strain
Source






B.

K96243
Human, septicemia, fatal, Thailand, 1996



pseudomallei

Pakistan9
Human melioidosis, Pakistan, 1988



1106a
Human isolate from liver abscess aspirate,




survived but relapsed, Thailand, 1993



1710a
Human blood culture isolate (bacteriemia




with lung and soft tissue involvement),




Thailand, 1996



1710b
Human blood culture isolate (second




isolation from relapse of same patient




as 1710a), fatal, Thailand, 1999



576
Human melioidosis, atypical LPS and highly




virulent in mice, Thailand, 1989



Pasteur
Human melioidosis, Vietnam or Thailand



52237
(from the Finkelstein collection)



S13
Environmental strain, mucoid, Singapore



406e
Human, lethal disseminated disease




(bacteremia, lung, skin, and renal




involvement), Thailand, 1988



MSHR668
Human blood culture isolate, severe




melioidosis encephalomyelitis,




Australia, 1995



MSHR305
Human brain, encephalomyelitis, fatal,




Australia, 1994



1026b
Human blood culture isolate, disseminated




disease (bacteremia with skin, soft tissue,




joint, and spleen involvement),




Thailand, 1993



B.

NCTC10229
Hungary, 1961



mallei

ATCC23344
Human glanders/melioidosis, fatal,




Burma, 1944



NCTC10247
Unknown passage history, avirulent in




hamsters, Turkey, 1960



PRL-20
Horse blood isolate, Pakistan (Lahore




Polo Club outbreak), 2005



ATCC10399
Horse lung isolate, China, 1942



GB8 horse 4
Horse-passaged derivative of ATCC23344



SAVP1
Natural infection, overseas isolate,




severity likely mild



B.

E264
Environmental isolate, Thailand



thailandensis

MSMB43
Environmental isolate (water), Australia



MSMB121
Environmental isolate, Australia



B.

AU1054
Human blood isolate, cystic fibrosis



cenocepacia


patient, epidemic isolate



HI2424
Onion field soil isolate



J2315
Human sputum isolate, cystic fibrosis




patient, epidemic isolate, ET-12 lineage,




United Kingdom-most studied



MC0-3
Corn field soil isolate,




United States, 2004



PC184
Human, cystic fibrosis patient,




epidemic isolate, United States



H111
Human, cystic fibrosis patient,




Germany, 1993



B.

ATCC17616
Environmental soil isolate, United States



multivorans

ATCC
Human sputum, cystic fibrosis



BAA-247
patient, Belgium



CF2
Human sputum, United States



CGD1
Human sputum, chronic granulomatous




disease patient, United States



CGD2
Human blood, chronic granulomatous




disease patient, United States









Amino acid sequence comparisons for the polypeptides reflected in SEQ ID NO:1 through SEQ ID NO:25 are shown in FIG. 5A and FIG. 5B using B. pseudomallei K96243 or B. thailandensis E264 as the query, respectively. All strains of B. pseudomallei and B. mallei showed at least 98% identity with the B. pseudomallei K96243 vaccine strain. Burkholderia cenocepacia and B. multivorans were more divergent but still exhibited substantial homology that in some cases reached 90% or higher. A similar trend was noted using B. thailandensis E264 as the query. The high level of sequence identity with B. pseudomallei and B. mallei reflects their close phylogenetic relationship and suggests that B. thailandensis polypeptides would be capable of generating cross-reactive immune responses to these species. This may also be true for B. cenocepacia and B. multivorans, if there is sufficient conservation in regions of the polypeptides that are immunogenic.


To further evaluate the conservation of the iron-regulated polypeptides across multiple species, a cross-species alignment was performed for the polypeptides reflected in SEQ ID NO:1, SEQ ID NO:2, and SEQ ID NO:9 using Clustal Omega (v1.2.0, European Molecular Biology Laboratory-European Bioinformatics Institute) and one strain from each of the five species. These three polypeptides were selected because they are representative of three different types of iron-regulated polypeptides, namely, a siderophore receptor (SEQ ID NO:1), an RND efflux system polypeptide (SEQ ID NO:2), and an OmpA family outer membrane polypeptide (SEQ ID NO:9), and they showed varying levels of identity across species. The strains used for comparison were B. thailandensis E264, B. pseudomallei K96243, B. mallei NCTC10229, B. cenocepacia AU1054, and B. multivorans ATCC17616.


The alignments indicate that certain regions of these three polypeptides are highly conserved across strains selected from all five Burkholderia species of interest (FIG. 6). The polypeptide reflected in SEQ ID NO:1 shows greater variability at the amino terminus, which is where the signal sequence, generally subject to cleavage, is located. This was also apparent though less dramatic for the polypeptide reflected in SEQ ID NO:2. The polypeptide reflected in SEQ ID NO:9 was highly conserved, although B. thailandensis E264 was somewhat unique in that it had additional amino acids at the amino terminus that were not present in any of the other species. Further examination of B. thailandensis strains revealed that MSMB43 also had additional amino acids at the amino terminus, though fewer than E264, whereas MSMB121 did not (not shown). Nonetheless, the remainder of this polypeptide was highly conserved.


Example 6
Immunogenicity of IRPs
Vaccine Preparation

A comparative immunogenicity study was performed to evaluate the humoral and cellular immune responses elicited by the B. thailandensis IRP (iron-regulated polypeptide) extract vaccine, a purified subunit of the IRP extract vaccine, and a mixture of eight recombinant polypeptides cloned from B. pseudomallei K96243 (eight recombinant Burkholderia IRPs, rBIRP8, Table 7).









TABLE 7







Recombinant B. pseudomallei K96243 polypeptides










SEQ


Molecular


ID

GI
Weight


NO:
Polypeptide
Number
(kDa)a













51
Iron uptake receptor
53719389
83


52
Multidrug efflux system protein
53721329
55


53
Iron transport receptor protein
53722229
82


54
Exported heme receptor protein
53721278
82


55
Fe(III)-pyochelin receptor
53721627
82


56
Ton B-dependent outer membrane
53720334
81



receptor protein


57
Outer membrane copper receptor
53722763
80


58
TonB-dependent receptor family protein
53722050
88






aMolecular weight (predicted).







The extract vaccine used in these studies was prepared as described in Example 1, and the purified subunit vaccine was prepared with the following modifications. After breaking the cells, the soluble fraction was diluted 1:1 with 0.1 M sodium carbonate and mixed at 4° C. for one hour before continuing with centrifugation of the membrane fraction and sarcosine extraction. The insoluble extract product was subjected to a hot phenol extraction (Westphal et al., 1965, Methods in Carbohydrate Chemistry 5:83-91). In brief, the insoluble extract was resuspended in water and heated to 70° C. An equal volume of phenol was added, the temperature was re-equilibrated to 70° C., and the mixture was maintained at this temperature for 30 minutes. The phases were separated at room temperature using a glass separatory funnel. The phenol fraction and interface were combined and precipitated by the addition of methanol at a 3:1 v:v ratio. The polypeptides were recovered by centrifugation for one hour at 30,000×g and the pellet was washed with methanol. Methanol was removed through two sequential washes with a 50 mM Tris/1 mM EDTA buffer, and the final pellet was recovered by centrifugation for one hour at 45,000×g. The pellet was resuspended in a buffer containing 50 mM Tris, 1 mM EDTA, 7 M urea, 2 M thiourea, and 1% zwittergent by adding a volume of 1 ml per mg of pellet. The pellet was further solubilized by sonication at 60° C.-65° C. for 10 minute cycles interspersed with a 10 minute rest, followed by overnight incubation at room temperature on a rocking platform. The resulting product had an endotoxin level of <100 EU/mg as determined by a kinetic-turbidimetric Limulus amoebocyte lysate (LAL) assay (Charles River Laboratories, Wilmington, Mass.).


Eight IRP coding regions were cloned from B. pseudomallei K96243 DNA and expressed in E. coli (Table 7). The signal peptide sequence was predicted using SignalP (Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark) and, in some cases, PrediSi (Institute for Microbiology, Technical University of Braunschweig, Braunschweig, Germany) or Phobius (Stockholm Bioinformatics Centre, Stockholm University, Stockholm, Sweden). Gene-specific primers were designed to express the polypeptide without the signal sequence. PCR was performed using DNA Herculase II (Agilent Technologies, Santa Clara, Calif.), and the PCR products were ligated into the pQE30Xa expression vector (Qiagen, Valencia, Calif.), which adds a 6× Histidine tag to the N-terminus Ligation reactions were used to transform E. coli XL-1 blue. Clones were selected and verified by DNA sequencing (ACGT, Inc., Wheeling, Ill.). The recombinant B. pseudomallei iron-regulated polypeptides (rBIRPs) were expressed and purified using standard methods. In brief, frozen bacterial stocks (100 μl) were used to inoculate 20 ml of Luria-Bertani Broth containing 100 μg/ml of ampicillin, and the culture was grown at 37° C. in a shaking incubator at 250 rpm. After 16 hours, the culture was diluted added to 1 L of Luria-Bertani Broth containing 100 μg/ml of ampicillin, grown to an optical density (600 nm) of 0.6, and induced by the addition of 1 mM IPTG to a final concentration of 1 mM. Cultures were incubated for an additional 4-20 hours, depending on the optimum time for expression as previously determined for each clone. Bacterial cell pellets were harvested by centrifugation at 4,000 rcf for 20 minutes at 4° C., washed in PBS, and resuspended in 20 mM Tris buffer containing 100 μg/ml lysozyme. The cells were disrupted by sonication at 50% duty cycle and output (Branson Sonifier, Danbury, Conn.) for eight minutes on ice. The lysate was subjected to centrifugation for 10 minutes at 40,000×g at 4° C. to remove insoluble material. The soluble supernatants were processed by immobilized metal affinity chromatography (HisTrap FF 5 ml, GE Healthcare) to purify the histidine-tagged recombinant polypeptide, followed by anion exchange chromatography to increase the purity and remove endotoxin. Polypeptide concentration was determined using the BCA method (Thermo Scientific, Rockford, Ill.) and polypeptide purity was measured by SDS-PAGE and densitometry.


Immunization of Mice

BALB/c mice were vaccinated with 100 μg B. thailandensis E264 extract vaccine, 100 μg B. thailandensis E264 purified subunit vaccine (Example 6), or a mixture of eight rBIRPs (rBIRP8, polypeptides of SEQ ID NO:1-SEQ ID NO:8 at 10 μg each). Vaccines were formulated with 10 μg CpG per dose and emulsified in 50% IFA. The placebo group consisted of IFA/CpG alone. Each group contained a total of 5 mice. Vaccines were administered subcutaneously in a volume of 100 μl at Day 0 with a booster at Day 28. Mice were bled on Day −1 (one day before the initial vaccination) and Day 27 with a terminal bleed performed on Day 56. Blood was processed to obtain serum, which was stored at −80° C. Equal volumes of individual serum samples were pooled and evaluated for antibody production.


ELISA

IgG antibody titers to individual rBIRPs were determined by ELISA. In brief, 100 μl of polypeptide at 2 μg/ml, solubilized in 8 M urea, was added to each well of a 96-well EIA/RIA plate (Corning Inc., Tewksbuty, Mass.) and incubated overnight at 4° C. All remaining steps were performed at room temperature. The plate was washed three times with PBS wash buffer (PBS containing 0.05% Tween 20) followed by the addition of 200 μl/well sample buffer consisting of PBS containing 0.05% Tween 20 and 1% bovine serum albumin. After 90 minutes, the sample buffer was replaced with 100 μl/well PBS sample buffer. Serial ⅓ dilutions of the primary antisera were performed in the plate by the addition of 50 μl to the first row, mixing 10 times, and transfer of 50 μl to the next row. The plate was incubated for 90 minutes followed by three washes and addition of 100 μl/well of an HRP conjugated goat anti-mouse IgG, heavy chain specific antibody (Jackson ImmunoResearch Laboratories, Inc., West Grove, Pa.). After a 90 minute incubation, the plate was washed four times followed by the addition of 100 μl TMB/well (BioFX, SurModics, Inc., Eden Prairie, Minn.). Color was allowed to develop for 30 minutes, and the reaction was stopped by the addition of 100 μl stop reagent (BioFX, SurModics, Inc., Eden Prairie, Minn.). The absorbance was measured at a wavelength of 450 nm, and the titer was calculated as the inverse of the dilution corresponding to an absorbance of 1.0. Controls included a standardized primary serum included on each plate to monitor assay variability and wells that were uncoated to subtract background. The limit of detection for the assay was the inverse of the initial serum dilution (indicated as a dotted line on FIG. 7).


All eight rBIRPs were highly and equivalently immunogenic, eliciting antibody titers greater than 100,000 by ELISA (FIG. 7). This included the response to the polypeptide reflected n SEQ ID NO:8, which was identified solely through bioinformatics analysis (Example 5). Vaccination with the purified subunit vaccine also resulted in detectable antibodies to all eight rBIRPs, although the overall titers to the rBIRPS were lower than in mice immunized with rBIRP8. This indicates that all eight rBIRPs were present in the PSV which, having been derived from the extract, indicates they were also present in the extract vaccine. It also demonstrates that antibodies to the B. thailandensis E264 PS vaccine cross-react with rBIRPs from B. pseudomallei K96243, further highlighting the conservation of these polypeptides across species. Moreover, antibodies to the polypeptide reflected in SEQ ID NO:2 were detected at a titer of greater than 1×104 in mice vaccinated with insoluble extract. The polypeptide of SEQ ID NO:2 was one of the polypeptides in the extract based on the iTRAQ analysis in Example 4, and the immune response to this polypeptide suggests it may have been involved in the protection observed in animal efficacy studies described in Example 6. Serum from immunized mice was antigen-specific, since neither preimmune serum nor serum from the placebo reacted with any of the eight rBIRPs.


Western Blot

Antibody specificity was further evaluated by western blotting. Each rBIRP was electrophoresed on a 4-15% TGX gradient gel and the gel was blotted onto a nitrocellulose membrane. All membrane incubations were performed on a rocking platform at room temperature. Membranes were incubated for 1 hour in Odyssey blocking buffer (LI-COR Biosciences, Lincoln, Nebr.) diluted 1:1 in Tris buffered saline. The blocking buffer was decanted and replaced with the primary antiserum diluted in Odyssey blocking buffer as above with the addition of 0.05% Tween 20. After one hour, the primary antiserum was decanted and the membrane was washed three times for 10 minutes each with TBS containing 0.05% Tween. The final wash was decanted and goat anti-mouse IgG (IR dye conjugate, LI-COR), diluted in the same buffer as the primary antibody, was added at the dilution recommended by the manufacturer. After one hour the membrane was washed three times as before, rinsed with TBS, and dried overnight in the dark. Blots were imaged on a LI-COR Odyssey imaging system.


The results from western blots paralleled those for the ELISA (Table 8). Antibodies to all eight rBIRPs were detected in serum from mice vaccinated with either rBIRP8 or the PS vaccine. The polypeptide of SEQ ID NO:2 was detected by serum from mice immunized with the extract vaccine, and similar to what was seen in the ELISA, this polypeptide appears to be highly immunogenic for eliciting an antibody response.









TABLE 8







Reactivitya of antibodies to individual rBIRPs on Western blots.









Polypeptide on blot (SEQ ID NO:)
















Serum source
1/Dilution
1
2
3
4
5
6
7
8



















Preimmune
1000










Placebo
1000










Extract
1000

+








Purified
500
+
+
+
++
+
+
+
+


subunit
1000

+

+






rBIRP8
1000
+++
+++
+++
+++
++
++
+++
+++






aReactivity was assessed as: − (not detected), + (low), ++ (moderate), or +++ (strong).







Antigen-Specific Cytokine Production by Spleen Cells

At termination of the experiment on Day 56, spleens were harvested, processed into mononuclear cell preparations, and cultured individually with 10 μg/ml of individual rBIRPs or the purified subunit. After 48 hours, the cell supernatants were harvested and frozen at −80° C. until assessment for cytokine production using a cytometric bead array kit (BD Biosciences, San Jose, Calif.) and flow cytometer (FACSCanto2, BD Biosciences, San Jose, Calif.), performed according to the manufacturer's protocol. The net production of cytokine for each polypeptide stimulus was calculated by subtracting the corresponding value for the placebo group.









TABLE 9







Cytokine recall response of spleen cells from mice


immunized with iron-regulated polypeptide vaccines.









Stim-




ulus


(SEQ ID

Cytokine Production (pg/ml)















NO:)
Vaccinea
IL-2
IL-4
IL-6
IL-10
IL-17
TNF-α
IFN-γ





1
Extract
 2

b

 2

 1





PSc










rBIRP8
19
2
14

35
11
19


2
Extract
 4

 4

13

 2



PS
 2



15





rBIRP8
43
5
200 

491 
260 
480 


3
Extract


 2







PS










rBIRP8
 7
1
 5

10
 3
 4


4
Extract
 1



 2





PS




 2





rBIRP8
15
1
26

57
41
43


5
Extract


 1







PS

1

 4






rBIRP8
26
4
38

101 
30
94


6
Extract










PS






 2



rBIRP8
 7
2
220 

251 
405 
624 


7
Extract
 2

23

 6

 5



PS
 2



 1

 6



rBIRP8
10
4
229 

215 
211 
329 


8
Extract


 2







PS










rBIRP8
20

11

56
 6
20


Purified
Extract
 2

293 
15
 2

 1


subunit
PS
 4
2
503 

13
164 
10



rBIRP8


17


10







a5 mice were tested individually in each vaccine group




b—, below the level of the placebo group (background) or below the level of detection for the assay.




cPS, purified subunit







Recall responses to individual rBIRP polypeptides and the PS vaccine polypeptide mixture were observed in all three vaccine groups (extract, PS, and rBIRP8, Table 9). Negative control spleen cell cultures incubated with PBS did not exceed background levels in the placebo control and therefore are not shown. Mice immunized with the extract vaccine demonstrated low but measurable production of IL-2 and IL-17 in response to four of the eight rBIRPS (SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:7). In addition, the polypeptides reflected in SEQ ID NO:2 and SEQ ID NO:7 stimulated production of IFN-γ. The pattern of responses was similar though not identical for mice vaccinated with the extract or PS vaccines. Most notable was that both groups exhibited a strong IL-6 response to PS antigens. Mice immunized with rBIRP8 tended to exhibit stronger cytokine recall responses to individual rBIRPs than mice immunized with extract or PS vaccines, suggesting that responses to IRPs can be boosted by decreasing the complexity of the vaccine composition and increasing the amounts of targeted IRP antigens. Moreover, the Thl (IFN-γ, TNF-α)/Th17 (IL-17, TNF-α) type of response observed for rBIRP8, coupled with IL-2 production to promote proliferation and an undetectable IL-10 (regulatory) response, is intriguing since IFN-γ is generally associated with protection to intracellular pathogens, and IFN-γ,TNF-α, and IL-17 have been implicated in protective responses to Burkholderia infections.


Sequences














BtE264-400












ID*
MW
Protein GI#
Protein Locus#
SEQ ID NO:
Sequence
















401
81
83718630
YP_442936
26
1
ttaccagttg tatttcgccg tcgcgatcac ggtgcgctgg ttgccgtaca tgcacaccga 







61
atccgactgg cagccggcca cgtagcggcg gttgaacagg ttcgtcgcat tgagcgcgaa 







121
gcgccagttg cgcagctcgt agtgcagcgc cgcgtcgaac agcgtgtagc tcggcaccgt 







181
cagcgagttg tcggcggcgc ccgccgccgc gctcatgtag cgcacgcccg cgccgacgcc 







241
gaagcccgtg agcggcccgt tgcgccacgt ccagtcggcc cacagcgacg cgatctggcg 







301
cgggcgcggc acgtcgacgg gccacttgtt cagcgtgttg tcattcgcct gcacgttctt 







361
cacgtcctga tagacgtacg cggcgatcac cgacagctcg cgcgacaggt tgcccaccgc 







421
gctcagctcg acgccgcgcg agcgcacctc gcccacctgc acgaacgtgc tgctcgtcgg 







481
atcgttcgga ttgctcatcg cgacgttcgt ctggttgatc tggtagacgg ccgcgttcag 







541
catcaggttc ttgcccggcg gctgccagcg caggccagcc tcgatctggc ggcccttggt 







601
cggcgtcgcg agcccgccgc ccgcgagctt cacgccgatc tgcggattga acgacgtcga 







661
atagctgagg tacggcgcga ggccgtaatc gccgagatag gtcaggccga cgcgcccgct 







721
gaacgcgtgg tcgttctgct tgaagctcgc cgagttcgcg atgtcgtcct gcgacgtgcg 







781
cgtccagtcc tgccggccgc cgagcgtgag cacccagcgc tgccacttga tctggtcctg 







841
cacgtacagg ccgaaggcgt tgagcgtcgt cttcgtgtcg gtgcgcgggt acgcgttcgg 







901
cccgctgaag atgtcggccg gaatcggcgt gtagacgggc cggtacaggt tcaggccggg 







961
ccccttcgcg agccattcgc tgtcggtcgt cgtctgccgg ttgtagtcga agccgaacag 







1021
cagtgtgtgg ctcaacgggc ctgtcgtgaa tttcgcctgc gcctggttgt cgacgtcgaa 







1081
gcggctgtag ttgaactgga acaggcccgc gtagcgcgtc atcgtcgcca tcgtcgggtc 







1141
cgcgccgtcg aggccgccgc cgtagacgga cgcgtcgtcg agcgccaggt gcatccagcg 







1201
cacgttctgc cggaacgtcc acaccgggtt cagcttgtgc tcgaactgat agccgatcga 







1261
ccattgcttc ttgcggtagt ggtcgaagtt cgcatcggcc gtgtacaggt cgtccgagat 







1321
catgccgttc ggattcggca gcacggtgcc gcgcgacggc aggaagttgc tcgacgtgtc 







1381
gccccagtcc tgcaggtacg tcgcggcgag cgtgagcgac gtgtttgcgt tcggctgcca 







1441
cttgagcgac ggcgcgaacg acacgcgctg gtcggcgagc ggccccgtct gcgcgttgcc 







1501
gtcgcggccg acgccgacga tccggtacga cagcgtgcca tccttgtcga tcttgtcgcc 







1561
gatatcgaac atcagctgct tgcgcgcgta gttgcccacc tgcacgccga cctcgcgaat 







1621
gcgctcgccg ttcgcgagct tgctctgcac gtcgacgatc gcgcccgggt cgccctgccc 







1681
gtacagcacc gacgtcgggc cgcgcagcac ggcgatgctg tcgatcatgt acggatcgac 







1741
gcgccagctc gacaggttga tcgtgttcgg cacctgcagc ccgtcgacga acacggtcgg 







1801
cgtgaagccg cgcagcgccg cgtaccagtc cgagcggttg tccgatccgt aggtcgagaa 







1861
gcccgggatg tagcggaacg cctgattgat gtcggtcgcg cccgtcgcct cgatctgctg 







1921
cgcggtgacg atgttgatcg tctgcgggat ctcgacgatc ggcgtatcgg tcttcgtgcc 







1981
cgtggtgctg cgccgcgcga cgaggccgac gctcgcgtcg cgctccgccc ccgccgacac 







2041
cgagatcgcg ggcaacgtgc cgcccgccgc agtgtcccca ttcacctgac tgcccggctg 







2101
ccgcgcgtct gcccccggcc gcaccgcctg tgcctgcgca tgactcgccg ccgcgcagaa 







2161
cgccacgcct gccgcggccg cgatcgcacg cacgcgcgtg ctggttgccc actccat 









1
1
mewatstrvr aiaaaagvaf caaashaqaq avrpgadarq pgsqvngdta aggtlpaisv 







61
sagaerdasv glvarrsttg tktdtpivei pqtinivtaq qieatgatdi nqafryipgf 







121
stygsdnrsd wyaalrgftp tvfvdglqvp ntinlsswrv dpymidsiav lrgptsvlyg 







181
qgdpgaivdv qsklangeri revgvqvgny arkqlmfdig dkidkdgtls yrivgvgrdg 







241
naqtgpladq rvsfapslkw qpnants1tl aatylqdwgd tssnflpsrg tvlpnpngmi 







301
sddlytadan fdhyrkkqws igyqfehkln pvwtfrqnvr wmhlalddas vygggldgad 







361
ptmatmtrya glfqfnysrf dvdnqaqakf ttgplshtll fgfdynrqtt tdsewlakgp 







421
glnlyrpvyt pipadifsgp nayprtdtkt tlnafglyvq dqikwqrwvl tlggrqdwtr 







481
tsqddiansa sfkqndhafs grvgltylgd yglapylsys tsfnpqigvk laggglatpt 







541
kgrqieaglr wqppgknlml naavyqinqt nvamsnpndp tsstfvqvge vrsrgvelsa 







601
vgnlsrelsv iaayvyqdvk nvqandntln kwpvdvprpr qiaslwadwt wrngpltgfg 







661
vgagvrymsa aagaadnslt vpsytlfdaa lhyelrnwrf alnatnlfnr ryvagcqsds 







721
vcmygnqrtv iatakynw 





402
55
83717404
YP_440292
27
1
tcaacgtgcc gcgacgtcct gcttgttcgg ctcgcgcgaa ccgacggcca cgtcgtttcc 







61
ccaaccgccg ccgagcgcgc gaatcaggtt gaccgtcgat accgcctgcg tgcccgtcag 







121
ctgattcgac tgcaattgcg actcgagcac cgagcgctcg ctgtcgatca cctcgagata 







181
gctgaccgcg ccttcctgat actgcgtgcg cgacagcttc gccgcgcgac gcgacgcgtt 







241
gaccgcatcg ctctgcgcgc ggatctgatc gtcgagcaaa cgcagatcgg cgaggttgtc 







301
ctccacttcg cggaacgcga cgagcacctg ctgccgatag ttcgcgactt cctcgtcgta 







361
cttcgcgcgc gcctgcgcga cgcccgcgct gcggcgtccg ccgtcgaaga tcggcagcgt 







421
gagcgccgtg cccgcgaacg gtccgagcag gaacgtgcgg ctcgaccaca ggaacagatt 







481
gccgagcgtc gccgcttcat agccgaacga acccgtgatg tcgagcttcg ggaaatacgc 







541
ggacttcgcc aggccgatgc gcgcgttcgc ggccatcatc gcgcgctcgg ccgccgcgat 







601
gtccggccgc cgctcgagca gcgcggacgg caaccccgcc ggcacgcgca ccgcgaccgg 







661
cacgatcggc gtttccttga acgagaaatc agcgggcgcc ttgccgagca ggatcgcgag 







721
cgcgtgctcg gacgcggcgc gccgccgtgc gacgccgacc gagtccgcct gagcggtcgc 







781
gagctcgttc ttcgcgcgcg acacgtcgag ctcgctgatg tcgccttcgc tgaagcgccg 







841
ctgcacgagc ttcagcgcct cctcgcgcag ctcgaccgtg cggcggtaca gatcctgatc 







901
ggaatcgagc cggcgcagct cgaagtagtt ctgcgcgacg tccgcctgca acgcgagctg 







961
caccgagcgg aacagcgcct ggctctgcgc ttcgtccgca cgcgacgctt cgacgttgcg 







1021
gccgacgcgg ccgaacagat ccgcctcgta cgacaccgtg ccctgcgcgc gccagagcgt 







1081
cgcgttcgtc gggccgctgc cctgcggctg gaactgcgac gccggcgaca gcccctcgcg 







1141
cgtcggcccg aagcccacgc ccacttgcgg gaaccactgc gagcgcgcgg cgcgggtcgc 







1201
cgcgcgcgcc tgctcgacgc gcgccgccgc ggccttcagg ttctggttcg cggcgagcgc 







1261
ctgcgtttcg agcgcatcga gcgccggatc gccgaatacg cgccaccatt cgccgcgatg 







1321
cgcgtcgtcg gcgggctcgg ccgtcttcca cgtgccggcc tgctcgcccg gcgcgagcgc 







1381
cggcgcttcc ttgaacgcgg cgggcgtcgc gacatccggc cgcctgtagt cgggcccgac 







1441
cgcgcacgcg gcaagcagcg cgacgagcag cccgctcgcc gccgcgatct tcgcgacgcg 







1501
cgcgatgcgt tcgttgatgt tggtctt tt at 









2
1
mnktnineri arvakiaaas gllvallaac avgpdyrrpd vatpaafkea palapgeqag 







61
twktaepadd ahrgewwrvf gdpaldalet qalaanqnlk aaaarveqar aatraarsqw 







121
fpqvgvgfgp treglspasq fqpqgsgptn atlwraqgtv syeadlfgrv grnveasrad 







181
eaqsqalfrs vq1alqadva qnyfelrrld sdqdlyrrtv elreealklv qrrfsegdis 







241
eldvsrakne lataqadsvg varrraaseh alaillgkap adfsfketpi vpvavrvpag 







301
lpsallerrp diaaaeramm aanariglak sayfpkldit gsfgyeaatl gnlflwssrt 







361
fllgpfagta ltlpifdggr rsagvaqara kydeevanyr qqvlvafrev ednladlrll 







421
ddqiraqsda vnasrraakl srtqyqegav sylevidser svlesqlqsn qltgtqavst 







481
vnliralggg wgndvavgsr epnkqdvaar 





403
84
83716122
YP_439399
28
1
tcagtagcgc gcgttgagcg tgacgaacgc cgagcggccc ggcgcgatcg acgcgtagtg 







61
cgccggatag gcctgatcga agtacgtgcg gttgaacagg ttgttcacgt tgagccggac 







121
gtcgagcttc ttgttgatcc ggtattgcgc catcgcgtcg aagcgccagt acgacggcac 







181
cgcgcgcagg ttcgcggtgt cgccgaacac cttcgacatg tagaacgcgc cgccgccgac 







241
cgtgaacttc ggcgtcacgt cgtagttcgt ccatagcgtg aagctgtgct tcggcgtgtt 







301
cgggaactgg tggcccttgt ccgccgtgtt cttgccgttc tcgcgcagct cgctcttcag 







361
gtacgtgtag ccgccgaaca cctgccacgc gcgcgtgagc cggccggaga agccgagctc 







421
gagcccctgc acgcgcttgc tgccgaccat cgcgtactgg ttgttcggca gcgtgacgcg 







481
cgcgttcgtc gtgtcgatct gaaagagggc ggccgtgagc gcgagcttgt cgtcgagcac 







541
gttccacttg gtgccgagct cgatgctgcg gttcttctcc ggcgcgagct ggtcggcgtt 







601
cgagccgacg ccgcctcggc ccggcgtgag cgattgcgtc tccgcgcctt cgccgagcag 







661
cgcgccggcg ggcgtcgacg acgtcgcgta cgacgcatag atgctgccgt tgggcgccgg 







721
cttgaacacg aggccgagct gccagttcac gagcgtgtcg tcgcgcgtgt aggtcttgcc 







781
gccgttcgcg cgggtgtcgg tgaagcgggt cgagtagtcg tcgatgcgca cgccggcgtt 







841
cacctgccaa tgcttgctca gctcgaccgt gtcgaagccg tagagcgatt tcgtcacggt 







901
gcgcgcatgc gcgtagtcgt tgttgcgcct gatcgagccc gcccacggat cgttcgggtt 







961
cggcgaccac aggctcgtgc agttgtagcc ggacgccgcg ccgattcctt gctggcagat 







1021
cttgccggtg ccggtcgcga cgctgtacga atcgcgcttg ccccattcgc gcgacagctc 







1081
gatgccggtc gtgaagctgt gcttgaacgg gcccgtgcgg aattcgccga acagctcggt 







1141
ctggttcgcg atgctgttga tcgcgctgtt gcggttgttg ttgcgccgcc agacgcggcc 







1201
gttgatcacg ttgccctggc tgtcgtcggg ctgcgtccag atgtagtcct gcgtcgattc 







1261
cgtgtagcgc gtcgtgttgc gcaccgtcag cgacggcgtg atgtcgtgct cgatccgcag 







1321
cgtgccgatg tccgacgtcg tcttgcggaa gtcgcggtcg acgaggccgt agaagttgtg 







1381
ccgatcgacg ttcgccggat agatcgtgtc gacgttcgcc ggcttgttcg aggccgtgta 







1441
gaagtacggg atgccgccgt cgggcaggtc gtcggtcgac aggtggtagt agctcgcggt 







1501
cacgcgggtg ggcgtgccga gcccgaacgc gatcgacggc gcgacacccc agcgttc tt 







1561
gttcaccgcg tcgcggcccg cgacgtcgtt gttgtggctc atcaggttca ggcgaaacgc 







1621
cgcgtggtcg gcgaactgcc agttgccgtc cgcggtgaag cggcggtagc ggtcggtgcc 







1681
gagccccgcg ctcgcctcgg ctgtcgtgcc gagatgcggc gccttcgtga cgaggttgat 







1741
gctgccgccc gcgccgcctc ggccgccgta cgcgccgtcc gagcccttcg tgatctcgat 







1801
gcgctcggtg ttgaagatct cgcgcgtcgt cgcgcccgtg tcgcgcatcc cgtcgacgaa 







1861
caggctgccc tgcgtgtcgt agccgcggat gaacgggcgg tcgccgagcg gattgccgcc 







1921
ttcgcccgcg ccgaacgtga tgccgggcac cgtgcgcagc gcctcggtga gcgtcgccgc 







1981
gccgctgtcg tggatcagtt ccggcggaat cacggtgacg gatttgggcg tgtcgacgag 







2041
cggcgcggtg aatttcaccg acgacgagaa gtccgccttg tagctgcgct cggcggcgcc 







2101
cttgacctgg atcggcgcga gctccttgtc gtgatcggcg gttgccgcgg gcggcgcggc 







2161
gtcctgcgcg aacgccgggc tcgcggcgag cacgctgcag agggtggtga attttccgag 







2221
cttcaactcg tcggaacgcg acttcatcgg tggtatttcc tcgccaggtg tgactcgatc 







2281
gggcgaagca acggccgctg catgatgcgt tccgcccaa 









3
1
mggthhaaav aspdrvtpge eippmksrsd elklgkfttl csvlaaspaf aqdaappaat 







61
adhdkelapi qvkgaaersy kadfsssvkf taplvdtpks vtvippelih dsgaatltea 







121
lrtvpgitfg ageggnplgd rpfirgydtq gslfvdgmrd tgattreifn terieitkgs 







181
dgayggrgga ggsinlvtka phlgttaeas aglgtdryrr ftadgnwqfa dhaafrlnlm 







241
shnndvagrd avnnerwgva psiafglgtp trvtasyyhl stddlpdggi pyfytasnkp 







301
anvdtiypan vdrhnfyglv drdfrkttsd igtlriehdi tpsltvrntt rytestqdyi 







361
wtqpddsqgn vingrvwrrn nnrnsainsi anqtelfgef rtgpfkhsft tgielsrewg 







421
krdsysvatg tgkicqqgig aasgynctsl wspnpndpwa gsirrnndya hartvtksly 







481
gfdtvelskh wqvnagvrid dystrftdtr anggktytrd dtlvnwqlgl vfkpapngsi 







541
yasyatsstp agallgegae tqsltpgrgg vgsnadqlap eknrsielgt kwnvlddkla 







601
ltaalfqidt tnarvtlpnn qyamvgskrv qglelgfsgr ltrawqvfgg ytylkselre 







661
ngkntadkgh qfpntpkhsf tlwtnydvtp kftvgggafy mskvfgdtan lravpsywrf 







721
damaqyrink kldvrinvnn lfnrtyfdqa ypahyasiap grsafvtlna ry 





404
83
83717803
YP_440327
29
1
gtgcatcaac ctctgctggc gcggcggccg cttcgcgccg cgctgttcgg ggccttcggc 







61
ctctacgcgg cggccgcgcg cgccgcgggc gccgcctccg aacccgcggc cgccgcgccg 







121
ccgtccgccg cgtccgcacc gcaagtgcga cacgcggcga gcgcggccac acgcggcaac 







181
gcgcgcgccc tcgacccgat caccgtcacc gcgacgcgca ccgcgtcggc cgcgagccgc 







241
accgcggcga gcgtatcggt aatcaccgat tcggacctcg aagaacagca ggccgacaac 







301
atcaaggacg cgctgcgcta cgagccgggc gtcaccgtgc ggcgcaccgc gtaccggccg 







361
gcgaacgccg cgctcggcgg cggccgcgac ggcgattcga gcatcaacat ccgcggcctc 







421
gaaggcaacc gcgtgctgct gatggaagac ggcatccggc tgccgagcgc gttctcgttc 







481
ggtccgctcg aagccggccg cggcgattac gccgatctcg acacgctcaa gcgcatcgag 







541
atcctgcgcg gcccggcctc gtcgctgtac ggcagcgacg gcctgacggg cgccgtcaac 







601
ttcatcacga aagatccgtc cgatctgctg tcgatccacg ggaagaagac ctatttctcg 







661
ttgcggccga gctacgactc ggtcgaccgc agcgtcggcg cggccgtggc ggcggcgggc 







721
ggcaacgacc gcgtgcaggg gatgctgatc gcatccggcc gccgaggcca cgaggtcgac 







781
acgcgcggcg gcgacaattc cgcgagcacg aagcgcacga ccgcgaatcc ccaggatgtc 







841
tacacggaat cgctgctcgg caagctgacg atcacgccga cgccacgcga cacgttcaag 







901
ctcaccgccg agacggtgcg gcgccggatc gacacgaacg tactgtcggc gatcaatccg 







961
ccgacgacgc tcggcctcac cgcgaacgac aagctcgagc gcaaccgcct gagcatcgac 







1021
tacgatttcc gcgacgacgc gtcacgctgg ttccagaccg cgcacgtgca gctctactac 







1081
caggaatcga aacaggatca ggacgcgttc gagacgcgcg gcggccgcct gcaatcgcgc 







1141
tctcgctcga accattacag cgagcgcgcg ttcggcggct ccgcgttcgc cgagagcggc 







1201
ttcgcgaccg ggccgctcgc gcacaagctg ctgtacggcg tcgacggcag catcgaccgc 







1261
atcaagagcc tgcgtgacgg cgccgtcgcg agccccggcg agtcgttccc gaacaaggcg 







1321
tttccggaca ccgactacac gctgttcggc gcgttcgtgc aggaccagat cggcttcggc 







1381
aagctgctcg tcacgccggg cctacgcttc gacgcatatc ggctgagccc gagcgcgggc 







1441
gatccgctct tcaccggcaa gacggtcagc tcgagcgatc acgagctgtc gccgcgcctc 







1501
gcggtgctgt acgaggtgac gcccgcgctg atcccgtatg cgcagtacgc gcacggcttt 







1561
cgcacgccaa cgcccgacca ggtcaacaac agcttctcga atccgatcta tggctataca 







1621
tcgatcggca atccgaacct gaagcccgag acgagcgaca cgctcgaagc gggcctgcgc 







1681
ggcacgctcg gcacggggta cgggccgctg cgctacggcg tcgccgcgtt cgccggccgc 







1741
tatcggaact tcatcgcgca gcgcatgatc ggcggcagcg gccggccgaa cgatccgctc 







1801
gtgttccagt acgtgaactt cacgaacgcg cgcatccacg gcttcgaagg gcgcgccgag 







1861
tgggtgatgc ccaacggctt cacgctgaag accgcgatgg cgttcacgaa gggcacgacg 







1921
cgggacaacg gcgcggcaag cgagccgctc gacacggtca acccgttctc cgccgtgttc 







1981
ggcgtgcgct acgagccgag cgagcgctgg ttcgcgcagg ccgacctgct gttgcaggcg 







2041
ggcaagcgcg gccgcgacgt gtcgtcggcc gcgtgccggg agaagacgtg cttcacgccg 







2101
ccgtcgtcgt tcgtcgtcga tctgcgcggc ggctatcact tcaacaagca cgtgagcgcc 







2161
tacctcggca ttcacaacct gttcgaccgc aaatactgga actggtcgga cgtgcgcggc 







2221
atcgccgccg attcgaacgt gctcgacgca tacaccgctc cgggccgcag cgtcgcggtc 







2281
agcatgaagg tggatttctg a 









4
1
mhqpllarrp lraalfgafg lyaaaaraag aasepaaaap psaasapqvr haasaatrgn 







61
araldpitvt atrtasaasr taasvsvitd sdleeqqadn ikdalryepg vtvrrtayrp 







121
anaalgggrd gdssinirgl egnrvllmed girlpsafsf gpleagrgdy adldtlkrie 







181
ilrgpassly gsdgltgavn fitkdpsdll sihgkktyfs lrpsydsvdr svgaavaaag 







241
gndrvqgmli asgrrghevd trggdnsast krttanpqdv ytesllgklt itptprdtfk 







301
ltaetvrrri dtnvlsainp pttlgltand klernrlsid ydfrddasrw fqtahvqlyy 







361
qeskqdqdaf etrggrlqsr srsnhysera fggsafaesg fatgplahkl lygvdgsidr 







421
ikslrdgava spgesfpnka fpdtdytlfg afvqdqigfg kllvtpglrf dayrlspsag 







481
dplftgktvs ssdhelsprl avlyevtpal ipyaqyahgf rtptpdqvnn sfsnpiygyt 







541
signpnlkpe tsdtleaglr gtlgtgygpl rygvaafagr yrnfiaqrmi ggsgrpndpl 







601
vfqyvnftna rihgfegrae wvmpngftlk tamaftkgtt rdngaasepl dtvnpfsavf 







661
gvryepserw faqadlllqa gkrgrdvssa acrektcftp pssfvvdlrg gyhfnkhvsa 







721
ylgihnlfdr kywnwsdvrg iaadsnvlda ytapgrsvav smkvdf 





405
81
83716062
YP_440017
30
1
tcagaactgc ccgcgcacgg tcagcatcac gttgcgcggc tccccatagc gattgttcca 







61
gccgggctgg ctgagactct ggtagtaagt gcggtcgaac aggttgttga cgttgagcgc 







121
ggcgctccat tgcctgtcgt agcgatagcc gagccgcacg ctcgcgagcg cgtagccgcc 







181
ctgtcgcatc gtgacgccgt tcgacgccgc cgagaaatcg ctctgcacct gcacgccgcc 







241
gccgatgctc cagcggcgtt cctgccacgg caggtcgtag ttcgtccaca agcgcagcag 







301
gtggcgcggc gtgagctccg ccgcgaacgc gccgccgttg gcgacgtcct tcgcatagcg 







361
gatcgtgtcg aacgtgtagc tcgcccacac gctccaccac ggcgtgatgc gcccgttcgc 







421
ctcgaattcg aagccctggc tgcgcacgct gccgccgttc acgtagtagc agctccgccc 







481
ggcgcatgga tgtgcggcat cgacctgcgg attgttatcg agatcgatgc gaaacgccgc 







541
gagcgacacg ttgagcttgc cgcccgacag ctcgcccttg ataccggtct cgtaggtgcg 







601
gcccttcacc ggcgtcagga tgctgccgtc ccaggtcggc ttggtctgcg gctggaacac 







661
ttccgcatag ctcgcgtacc acgaccagtc gcgcgcgaca tcccagatca gcccgccata 







721
cggcgtgaac tggcggcccg cgttgtagtg cgcgccgagg ctgtcctggc tccaccagct 







781
cacgcgcccg cccagcacga gcgtgaccgg ctgcgcgagc ttgatgcggc cgaggccgta 







841
gacgcccttt tgcgagatgt cgttctgcat gtcctgccgg tacggcccga tacccggctc 







901
cggcacgctg tccggattcc agcggtacac gttcaccggc gtgccggcgc ccgccagcag 







961
cggcgcgctg agctgcccgt tgctgctgct cgcgtaggtt gcgccgaaca gcagttcgtg 







1021
cgcgagcccg aacgcgcgca ccgggccctg cacgttcgca tcgacgctgc ggctatagcc 







1081
gctgaactgg tatgcgcctc ccatcagcgt gccgcccgca cccgtggccc gatcgatcgc 







1141
gccgtacgag cccgcgtatt tcaggtcgga gcgcaccgcc tgatactcgc cgctcaccag 







1201
tgccttccag ccgccgccga gcttctgctc gaccgagccg aacgcgcgcg tcgtatccca 







1261
gtcgaagtgg ccccagccga cgtcgaggaa tgtcgatcgc gacaggccga ggctcgagcc 







1321
gtcgcgcgcc atcggcacgc cggacatgtc gggcaccgac gcgaccgact ggtattgcgc 







1381
gccgaccgtg acgagcgtgt cgcgcgtcac gtcgatctcg gcgatgccgt agatcgagcg 







1441
cgtatcctgc ttcgcgcgat cgtaaaagaa gccgcggtcc tcgtacgcgg cgacgagccg 







1501
cccgcgcacc gtgcccgccg ggttcagcgg cccgccgatg tcggcctgcg ctcggtagcg 







1561
cccccaactg ccgatgctcg cgctcgcgtt cgcggcaaac tgatattgcg gccgcttgcg 







1621
caccagattg acggtcgcgg ccggattgcc cgagccgtgc agcaagccgt tcgcgccgcg 







1681
caggatctcc acccgctcgt acaccgacat gtcctgcggc gagctcgcca tgtcgccgag 







1741
cacgacgggc acgccgtcga gctcgaacga atccaccttg aagccgcgca cgaagtacgc 







1801
ggtcgtcagc agcacgtatg gctgcacggt cacgccggcc gactgctgca tcacgtcgtc 







1861
gaggctgaac aggttctgct gctcgatccg ctcgcgcgtc gtcacactga ccgattgcgc 







1921
aatctcccgc agcgcgagcg gcatcttgcc gacagtcgcg atcgacgggt cgtccgccgc 







1981
atccttcgac gcgcttaccg aaatcgcctt caactcgccg ccggccggcc gtgcgccggc 







2041
ggctcgctgc cgcgtgcctt cctgcacgct ctcgttcttt tgcgtgtcgc tcttcgcggc 







2101
cgtgccggca agcgccgccg caccggggtg ggccgcgatc agcatgccca ggcaaagcga 







2161
cgaacgccac cgccttcccc cgaaacgccc aaccggcgcg gccacgcgcc gcgatctctc 







2221
aggtcccgcc acctcgtact gcat 









5
1
mqyevagper srrvaapvgr fggrrwrssl clgmliaahp gaaalagtaa ksdtqknesv 







61
qegtrqraag arpaggelka isvsaskdaa ddpsiatvgk mplalreiaq svsvttreri 







121
eqqnlfsldd vmqqsagvtv qpyvllttay fvrgfkvdsf eldgvpvvlg dmasspqdms 







181
vyerveilrg angllhgsgn paatvnlvrk rpqyqfaana sasigswgry raqadiggpl 







241
npagtvrgrl vaayedrgff ydrakqdtrs iygiaeidvt rdtlvtvgaq yqsvasvpdm 







301
sgvpmardgs slglsrstfl dvgwghfdwd ttrafgsveq klgggwkalv sgeyqavrsd 







361
lkyagsygai dratgaggtl mggayqfsgy srsvdanvqg pvrafglahe llfgatyass 







421
sngqlsapll agagtpvnvy rwnpdsvpep gigpyrqdmq ndisqkgvyg lgriklaqpv 







481
tlvlggrvsw wsqdslgahy nagrqftpyg gliwdvardw swyasyaevf qpqtkptwdg 







541
siltpvkgrt yetgikgels ggklnvslaa fridldnnpq vdaahpcagr scyyvnggsv 







601
rsqgfefean gritpwwsvw asytfdtiry akdvanggaf aaeltprhll rlwtnydlpw 







661
qerrwsiggg vqvqsdfsaa sngvtmrqgg yalasvrlgy rydrqwsaal nvnnlfdrty 







721
yqslsqpgwn nrygeprnvm ltvrgqf 





406
88
83720594
YP_441958
31
1
atgcaagcgg gacccgggtc ctcgcatgcc gatgatcggc gttcggccgc gcgcgcgatg 







61
ccatcgaggt tgcgggagtc gacgagcagt gaacagagcg aagcgggcgg gaaacgttcg 







121
gggccgcgtg gcgccgacga cagtggagcg tgcggcgtgc gctcgggcgg gggcggtgga 







181
cgaggcaatc cgtcgatgag ccgggcgcag catgcaccgt cgcgccgtcg cgcgttcgag 







241
cgcagcttcg cggcggcggc cgcccgcgcc gctcacggcg cgcctgccgc gcatggcccg 







301
cacgcgtgcc tcgccgccgc gtcgcgtcgc tcgacggccg tcgagcgctg cgtcaccgcg 







361
ctcgcgtgcg cggtgacggc gtccggcgcg ctcgccgccg acgcggaccc ggccgaatcc 







421
gcgcgcgacc cgcatcgcga actgccgacc gtgcgcgtga cgagcgacgc ggcgcacgca 







481
tcgccgctgt ccacgccgct cacggccggc tcgcgcctga agctcgcgag cctcgacatg 







541
cccgcgagcg tcgaggcgat cacgtccggg cagatggccg cgcgcggcga tcgcacgatc 







601
gtcgacgccg tcacgcgcgc gacgggcttc agcacggccg cggcgcccgg caacggcggc 







661
acggcgctca gcgtgcgcgg cttcgccggc caggaatcgg tgacgacgct cgtcgacggc 







721
gtgcgtctct accccggcgc gggcaccgtc acgttcccgt tctcgacgtg gtccgccgag 







781
cgcatcgagg tgctgcgcgg gccggcgtcg gtgctgcacg gcgaaggggc gatcggcggc 







841
gtcgtcgatg tcgtgacccg gcgcccgcgc cgcgaacgat cgacgacgct gcaggcgagc 







901
atcggcacgc aaggcgaaaa gcgcgtggcg ctcgacacga cgggcgcgct cggcccgcgt 







961
ctttcttatc gtttccatct gagcgacgag cgcacgcgcg gcttcgtcga gcgcggcgac 







1021
gcgcacgcga cggccgtcgg cggcgcgctg aagctcgatg tcgattcgcg tttgtcgatt 







1081
acgctcgatt acgactacgg ccgccagaag ccggcgacct atttcggcgt gccggcggcg 







1141
aacggcgtgc tcgaccgcgc gctgcgcgag cgcaactaca acgtcggcga cgcgacgatc 







1201
gcttaccacg atacctggac gcggcttgcc gcgacctatc gcgcgggcaa cggcgtgacg 







1261
ctcgacgccc agctctacta tctcgcgacg cgccgccatt ggcgcaacgc ggaatcgtac 







1321
gcgctcgacg cggcggcgcg caccgtcgcg cgcagcgact atctcgagat cttccatcgc 







1381
gagcggcagt tcggcgagcg cttcaccgca cgcatcgatt cgcgcgtgtt cggccgcgcg 







1441
aaccgcctcg tggtcggcgc cgagttcaac cagatcgcgt tcgacggcgc gaacaacgcg 







1501
ccgtatcgcg gcgaatcgac cgtcgccgcc gccggcttcg atcccggcgc gttcgcgagc 







1561
cccgatccga cgctgccgcg ctttcgcacg cgcacgcatc aggcggcggc gttcatcgag 







1621
aaccggctcg aggtcctgcc gcggctcgcg tgggtgagcg ggctgcgtta cgaccatctg 







1681
tcgtttcacc gcgacgatct ggtcgcgggc ggcgcgttcg acaagacgtt cgcgcacacc 







1741
ggctggcgca gcgggctcgt ctacgagatc gcgccgggcc tgaccgcgta cgcgcaatac 







1801
acgacgggcg cggagggcgt gggctcgctc gtcacgctgc cggcgtcgca ggcgaactac 







1861
acgctcgcga ccgggcgcca atgggaagcg ggcgtcaagc acgagatcga cgacgctcgc 







1921
gcgtactgga cgctcgccgt ctacgacatc gtcaagcgcg gcctcgtcag cgtcgacccg 







1981
ctgaatccgg cgcgcgcgca gcagatcggc cgccagtcgt cgcgcggcgt cgaactcgcg 







2041
ggcggcgtgc gtctgccggg cggcgtgacg atcgacgcaa acgcggcgtt gctgcgcgcg 







2101
cgttacgacg cgttcggcca gcgcgtcggc gacacggtcg tgcagcgcgc gggcaacgtg 







2161
ccgcacgaca tcgcgcggca gaccgcgaac ctgtggatcg gctgggcgtt cgcgccaggc 







2221
tggcgcgcga atgcgggctt gcgttacgtc ggccgtcgct acggcgacga cgcgaaccgc 







2281
gtgccggtgc cgtcgtacac ggtgttcgac gcgtcgctcg cgtggcaggc gacgcgcgac 







2341
gtcggcctcg cgctctacgc gcgcaacctt gcgaaccgta cgtacgcggc gtcgacgtcg 







2401
aacggcggcg cgcaatggct gctcggcccg tcgcgttcgg ccgagctcgt cgcgacgctg 







2461
cgcttctag 









6
1
mqagpgssha ddrrsaaram psrlrestss eqseaggkrs gprgaddsga cgvrsggggg 







61
rgnpsmsraq hapsrrrafe rsfaaaaara ahgapaahgp haclaaasrr stavercvta 







121
lacavtasga laadadpaes ardphrelpt vrvtsdaaha splstpltag srlklasldm 







181
pasveaitsg qmaargdrti vdavtratgf staaapgngg talsvrgfag qesvttlvdg 







241
vrlypgagtv tfpfstwsae rievlrgpas vlhgegaigg vvdvvtrrpr rersttlqas 







301
igtqgekrva ldttgalgpr lsyrfhlsde rtrgfvergd ahatavggal kldvdsrlsi 







361
tldydygrqk patyfgvpaa ngvldralre rnynvgdati ayhdtwtrla atyragngvt 







421
ldaqlyylat rrhwrnaesy aldaaartva rsdyleifhr erqfgerfta ridsrvfgra 







481
nrlvvgaefn qiafdganna pyrgestvaa agfdpgafas pdptlprfrt rthqaaafie 







541
nrlevlprla wvsglrydhl sfhrddlvag gafdktfaht gwrsglvyei apgltayaqy 







601
ttgaegvgsl vtlpasqany tlatgrqwea gvkheiddar aywtlavydi vkrglvsvdp 







661
lnparaqqig rqssrgvela ggvrlpggvt idanaallra rydafgqrvg dtvvqragnv 







721
phdiarqtan lwigwafapg wranaglryv grrygddanr vpvpsytvfd aslawqatrd 







781
vglalyarnl anrtyaasts nggaqwllgp srsaelvatl rf 





407
81
83717103
YP_438837
32
1
atgacaatca aattcctgcg tcacgcgcac cccgcgcgtg tcgaccgtgc gcgccgtcgg 







61
cggcgcgcga tcacgctcac cgttcccgcg ctcgcggcgg gcgcctttca tctcgcgccg 







121
gccgtcgcgc agacggacga tgccgtgcac cgtcacggcg cactcggcgc gacgagcggc 







181
gtcgcgaacg acgcggcacg cggcgcatcg agcgccaaat cgaacgctag ggcaaacgaa 







241
gcgaccgccg tgacgcgcag cgcgacgaac cgcgcgacag ccggcgccgc gccggaagca 







301
gcgatgctgc cgacgatcga gatcgtcgcg gcgcccgaat cgacgccgct cgtcgtcgtc 







361
accgatccga aaacgccgcg ccagccgttg cccgcgagcg acggcgccga ttatctgaag 







421
acgattcccg gcttcgcgtc gatccgcagc ggcggcacga acggcgaccc ggtgctgcgc 







481
gggatgttcg gctcgcggct gaacatcctc gcgaacggca tgccgacgct cggtgcgtgt 







541
cccggccgga tggacgcgcc gacgtcgtac atcgcgcccg agagctacga caaggtgacg 







601
gtcgtcaagg gcccgcagac cgtgctgtac gggccgggcg catcggcggg cacggtgctg 







661
ttcgaacgcg tgacgccgcg cttcaagacg ccgggcatgc gcttcgacgg cagtgtcgtc 







721
ggcggctcgt tcgggcgcaa cgatcagaac gtcgacgtga cggcaggcac gcccgacttc 







781
tacgggcgcg tgatcgcgaa ccatgcgcat tcgcaggatt acgaagacgg caacggtcgc 







841
acggtgccgt cgcaatggga caagtggaac gcggacgcgg cgctcggctg gacccccgac 







901
gacaacacgc ggctcgagtt gacggcgggc acgggcgacg gctacgcgcg ctacgcgggc 







961
cgcgggatgg acggtgcaca tttccggcgc gagacgttcg gcctgacgtt cgacaagaag 







1021
cacatcggcg acgtgctcga tcgcatcgag gcgcaggtct tctacaacga agccgatcac 







1081
gtgatggaca actacacgtt gcggatgccc gatccgacga gcagcatgcc gatgcgcatg 







1141
gcctccgaag tgcgccgccg cacgctcggc gcgcgcgtcg cggcgacgct gcgcttcacc 







1201
gacgcgttca agctcgtgac gggcgtcgac gcgcagtcga accgtctcga ttcgcgctcc 







1261
gcgatgggga tgcagaacta cggtgacaag ccgtggaatc cgcaggcgaa catgtggaac 







1321
gcgggcgcgt tcggcgagct gacctggtat gcgagcgatg cgtcgcgcgt gatcggcggc 







1381
gcgcggatcg actatgcggc cgcgcgcgac aagcgcgcga cgacgggcgg catgaagatg 







1441
agcatgcgca atccgacgtt cgacgatctc cgctcgcgcg tgctgccgag cggcttcgtg 







1501
cgctacgagc gcgatctcgc ttcgctgccc gtcacgtggt atgcgggcat cggacacgcg 







1561
caacgctttc ccgattattg ggaactgttc tcggcgaagc gcggcccgaa cggcgccgtc 







1621
aacgcgttct cggcgatcaa gcccgagaag acgacgcagc tcgacatcgg cgcgcagtac 







1681
aagagcgaca agctcgacgc gtgggtgtcc gcgtacgcgg gctacgtgca ggacttcatc 







1741
ctgttcgact acgcgacggg ctcgatggga cagacgacgc aggcgacgaa cgtcaatgcg 







1801
cagatcatgg gcggcgagct cggcgcgtcg tggcgtccgc tcgcgccgtg gcgtttcgat 







1861
gcgtcgctcg cgtatgcgtg ggggcgcaac gtgcaaagcg gcgcgccgct gccgcagatg 







1921
ccgccgctcg aagcgcgttt cggcgtcgag tacacacgcg ggccgtggtc ggcgggcggg 







1981
ctgtggcgaa tcgttgcgcc gcagcatcgc tatgcgctga acgagggcaa cgtcgtcggc 







2041
aaggacttcg gtccgagcgc cggtttcggc gtgctgtcgc tgcacgcgca gtacaacgtg 







2101
agcaaagcgg tgcagatctc ggtcggcatc gacaacgtgc tcgacaaggc ttatgcggag 







2161
cacctgaacc tcgccggcaa cgccggtttc ggctatccgg cgaacatgcc cgtcacggaa 







2221
cccggccgca cagcgtgggt tc tttgagc accaagctct ga 









7
1
mtikflrhah parvdrarrr rraitltvpa laagafhlap avaqtddavh rhgalgatsg 







61
vandaargas saksnarane atavtrsatn ratagaapea amlptieiva apestplvvv 







121
tdpktprqpl pasdgadylk tipgfasirs ggtngdpvlr gmfgsrlnil angmptlgac 







181
pgrmdaptsy iapesydkvt vvkgpqtvly gpgasagtvl fervtprfkt pgmrfdgsvv 







241
ggsfgrndqn vdvtagtpdf ygrvianhah sqdyedgngr tvpsqwdkwn adaalgwtpd 







301
dntrleltag tgdgyaryag rgmdgahfrr etfgltfdkk higdvldrie aqvfyneadh 







361
vmdnytlrmp dptssmpmrm asevrrrtlg arvaatlrft dafklvtgvd aqsnrldsrs 







421
amgmqnygdk pwnpqanmwn agafgeltwy asdasrvigg aridyaaard krattggmkm 







481
smrnptfddl rsrvlpsgfv ryerdlaslp vtwyagigha qrfpdywelf sakrgpngav 







541
nafsaikpek ttqldigaqy ksdkldawvs ayagyvqdfi lfdyatgsmg qttqatnvna 







601
qimggelgas wrplapwrfd aslayawgrn vqsgaplpqm pplearfgve ytrgpwsagg 







661
lwrivapqhr yalnegnvvg kdfgpsagfg vlslhaqynv skavqisvgi dnvldkayae 







721
hlnlagnagf gypanmpvte pgrtawvrls tkl 





408
81
83717123
YP_439577
33
1
tcagaactgc agcgtcgtca gcagcgacac ctgccgcgca tcgccgaccg acacgaagaa 







61
ccggttcacg ctcgacgggt aatacgtgcg gttgaacagg ttcttcacgt tcagctggaa 







121
ctgcagcttc tgcttgccga cgcgcgtgtc gtaggtcgcg aaagcatcgg ccgtcgcgta 







181
cgcgggcagc gtgaagctgt tcgccgaatc gcccggccgc gcgccgacgt agcgccccgc 







241
cgcgccgatc cgcaggtcgt cgccgccgaa cacggcgccc gcgtcgtaga cggccgcgag 







301
cgacgccgtg tgccgcgcga cgttccacag ccggttgccc gcgtacagcg gatcttcggt 







361
cgtcttcgca tcgatgtacg cgtagctcgc gatcacgttc cagcgctcgc cgatccgccc 







421
cgacacgtcg agctcgatcc cgcgcgagcg cgcgcgtccg gacgtgcgcc agtcggtctg 







481
gttcgtcgcg tcgttgtatt gcgacacgag cacgttcttc ttgtcgatct cgaagaacgc 







541
gagcgtgccg gcgaagccgc tcggcaggtc gagcttcgcg ccgagctccc acgacgcgcc 







601
ctgctcgggc gcggtcgagc cgtcgatcac gtagccgccc gacatcggcg cgatcttcga 







661
cgtcggcttc agcgactgcg tgtagctgcc gtacagcgac agcgcgtcgt tccacttgta 







721
gacgatgccc gcgcgcggca gccacttcgt gccgctcagg ttcgtgttga cccggaacgg 







781
ccggccgcgc ccggcgagct ggctgtagcg gacgaagcgc gcgccgccga cgagaatcca 







841
cttgtcggtc agatggatgc tgtcctggaa gaacagcgac gcggtgtgca gcgtgtcgct 







901
ctgatcgctg tcggacgccg atacgctcgt cgacggcggg atcagcccgt acgtcgggtt 







961
caggtagctg aacggcgtct tgatcgcctg gcgcagcatg tccgcgcggt agacctgccg 







1021
gtattcgctg tcgacgccga actgcacgtc gtgccgcatc ccggcgagcg tcacgcggcc 







1081
gtcgacgtat gcgattgcgt agctgtcggt gctgcgcgag ccgtgcgtcg cgtcgttgct 







1141
gcgcgtgagc gtgcccttga ccggatcgac cgccgtgatg cgtatctggt tcgcatcgta 







1201
ggtctcgcgg ttgtagctgt agccgaggtg cgccttccag ttgggcgcga gctgatggtc 







1261
gaccgttagt tgcaccagat tcgattcgcc gcgcatgtcg ttgaacggct cgtcgaggcg 







1321
gcggcgcgcc gggatcgcga gcggctcgtt cgtgcgcgga tcgagcgcgg tgccccgatc 







1381
gaacggcatc aggaagcgcc ggtactcgta cgacagcacg acctgcgtgt cgcggccgta 







1441
ccacgcgagc gacggcgcga cgagcgtctc gcgatgctcg ccgtagtcgc gccagtactg 







1501
ctcgttcgtc tgatcgacga tcagccggta cgcaagccgc gaatcgccga tcgcgcccgt 







1561
cgagtcgaac gtgagctcgc cgccattcct gccgtgcccg taggtcgagc cgagcaccga 







1621
gatcgcgttg tggcgcgtga gctgcggctg cttcgtgacg acgttgatca cgccgcccgg 







1681
gtccatgatc ccgtacagca gcgaagccgg ccccttcagc acttcgacgc tgtccgtcgt 







1741
cgcgttcagc gagcggccct gcacgatcgg catcccgttg cgcatcaccg agccgtcgcg 







1801
gttgtcgccg aagccgcgct tcatcaccgt gtcctgcgtg ctgccgagcg tgttgccctg 







1861
cgtgatgccg ctcacgttcg cgagcgcgtc gtcgaggttg cgcgggcgct ggtcgcgcag 







1921
cacctgcggc gcaacgatgt tgacggcctg cggcacctcg gcgagcggcg cgtcgctgcg 







1981
cagccccgcc gcctcgcgcg gcgcacggta gctgtccgcg tgcaggccgc tcgcgcgcac 







2041
cgagacggtc ggcagcgtcg tgtcggtcgc gagcgccgcc cccgcgctcg gcgcggccgg 







2101
cccggcgggc gacctgacga gcgtgtagcc gccgctcggc tgccgcagcg cgacgaggcc 







2161
cgtgcccgtc agcagccggt cgagcgccgc ggccgcatcc gcgcgaccgt gcaggccggg 







2221
gctttgcaa 









8
1
mqspglhgra daaaaldrll tgtglvalrq psggytlvrs pagpaapsag aalatdttlp 







61
tvsvrasglh adsyraprea aglrsdapla evpqavniva pqvlrdqrpr nlddalanvs 







121
gitqgntlgs tqdtvmkrgf gdnrdgsvmr ngmpivqgrs lnattdsvev lkgpasllyg 







181
imdpggvinv vtkqpqltrh naisvlgsty ghgrnggelt fdstgaigds rlayrlivdq 







241
tneqywrdyg ehretivaps lawygrdtqv vlsyeyrrfl mpfdrgtald prtneplaip 







301
arrrldepfn dmrgesnlvq ltvdhqlapn wkahlgysyn retydanqir itavdpvkgt 







361
ltrsndathg srstdsyaia yvdgrvtlag mrhdvqfgvd seyrqvyrad mlrqaiktpf 







421
sylnptygli ppstsvsasd sdqsdtlhta slffqdsihl tdkwilvgga rfvrysqlag 







481
rgrpfrvntn lsgtkwlpra givykwndal slygsytqsl kptskiapms ggyvidgsta 







541
peqgaswelg akldlpsgfa gtlaffeidk knvlvsqynd atnqtdwrts grarsrgiel 







601
dvsgrigerw nviasyayid akttedplya gnrlwnvarh taslaavyda gavfggddlr 







661
igaagryvga rpgdsansft lpayatadaf atydtrvgkq klqfqlnvkn lfnrtyypss 







721
vnrffvsvgd arqvsllttl qf 





409
27
83720098
YP_441413
34
1
ttactgcgcg gcttgcggtg cgcgcaggta gatctcgacg cggcggtttt gcgcgcggcc 







61
ggcttccgtc gcgttgtcgg cgatcgggtt cgacgcgccc atgccttgcg ccgacaggcg 







121
ggttgccgcg acgccgcgtt gcgcgagcgc gttcacgacg ctttgcgcgc gattctgcga 







181
cagcgtctgg ttgtgcgcgg ccgagcccgt gctgtcggtg tagccgacga ccgatgcggt 







241
gatctgcggg ttctggttca gcgtcgtcgc gaggtcgttc aggagcggcg tgaaggccgg 







301
cgtgaccgcg tactggtcgg tcgcgaacgt gaccgagctc ggcacgttca gcttgagcga 







361
gccgtccggc tgctcggtga cctgcgtgcc cgtctgctgc gccgacggcg cgagcttgtt 







421
cttgatcgct tgccagttgt aacccgtcac gccgccgacg agcgcgccga cgccggcgcc 







481
gatcgccgcg cccttgccgc cgccagccag cgcgccgatg cccgcgccga gtgcggcgcc 







541
cgtgccggtg ccgaccgccg tgttcgtgcc ttgctgggtt gcgcaacccg ccagcagggc 







601
gccggcgaga gcgaaaacgg acaagcgcgt cgcgattttg gtattcatct tggtttcctc 







661
tcttgagtga ctcgacaaga acaaaaaaga actgctcttt cggcgggggc gacccgcgcg 







721
ccgacgatca tgcaggcgcg cgagcgcctc gatccgtccg gcgtcaggtg ccggggccgt 







781
cttcat 









9
1
mktapapdag riealarlhd rrragrprrk sssflflssh sreetkmntk iatrlsvfal 







61
agallagcat qqgtntavgt gtgaalgagi galagggkga aigagvgalv ggvtgynwqa 







121
iknklapsaq qtgtqvteqp dgslklnvps svtfatdqya vtpaftplln dlattlnqnp 







181
qitasvvgyt dstgsaahnq tlsqnragsv vnalaqrgva atrlsaqgmg asnpiadnat 







241
eagraqnrrv eiylrapqaa q 





410
19
83719210
YP_441918
35
1
ttactgttga tagacgaggt ccgcgcggcg gttctgcgcc cacgacgctt cgtcgtggcc 







61
gagcgcgacc ggcttttcct tgccgaggct cacggcttcc atctgcgaat ccccgacgcc 







121
gagcagcgac agtgcgcgac gcaccgcttc cgcgcgcttc tggccgagcg cgaggttgta 







181
ctcgctcgtg ccgcgctcgt cggtgttgcc ctggatcagg atgtggcgct gcgggtggct 







241
cttcaggtat tgcgcgtgtt gctgcagcag cggctggtac tggtcctgca ccgagtagct 







301
gtcgaaatcg aagtagacgc tgcgcttcgc gagcgggctg ttcggatcgt tcagcggatc 







361
gacggtcact tgcgcgacgt tctcgggatt cggttgcgtg ctgacggcgc cgccctggtt 







421
cgcgtgttcg tcgagcttca cccccgactt gcacgcggcg agcgcgccga tcatcagcat 







481
cgcgaacgcg agacgaagtt ttttcgacat cat 









10
1
mmskklrlaf amlmigalaa cksgvkldeh anqggavstq pnpenvaqvt vdplndpnsp 







61
lakrsvyfdf dsysvgdqyq pllqqhagyl kshpqrhili qgntdergts eynlalgqkr 







121
aeavrralsl lgvgdsqmea vslgkekpva lghdeaswaq nrradlvyqq 





411
24
83720431
YP_441918
36
1
ttacttcgcc ggaacggtcg tcttctgcac ttcctgcgtg ccgaccactt cgacttccac 







61
gcggcggtcc ggtgcgaggc aagcgatgag ctgcttgcgg ttcttctgct tgcaggtgtt 







121
gcccgtgatc gggttgcgct tgcccttgcc ttccgtgtag accttgttcg ccgggacgcc 







181
cttgctgacg aggtacgact tgacggcttg cgcacggcgc agcgacagac ggtcgttgta 







241
cttgtccgag ccgatgcggt cggtgtagcc cgtggccacg accacttcga cgttcatgcc 







301
ttggatcttc gcggccagtt cgtcaagctt ctgcttgccg gccggcttca ggatggcctt 







361
gtcgaagtcg aacagcgtat cggcttggta cgtaatcttc tggctcgtga tggccggagc 







421
aaccggtgcg accggtgccg gggccggtgc ttgggcgacc agcgcgccgt cgcacttcgc 







481
gttggcggtg gccggcgtcc agaacgcatc gcgccagcag agctcgttcg tgccgttcat 







541
ccacacccat tcgcccgtgc cattcaccca gttgtcattc acggcttgtc gcgacgccgg 







601
caccgactgt gccgaagcgg atgcagccat aactgcggta gctgcaatga acgcgagctt 







661
tgaaagttta ttcat 









11
1
mnklsklafi aatavmaasa saqsvpasrq avndnwvngt gewvwmngtn elcwrdafwt 







61
patanakcdg alvaqapapa pvapvapait sqkityqadt lfdfdkailk pagkqkldel 







121
aakiqgmnve vvvatgytdr igsdkyndrl slrraqavks ylvskgvpan kvytegkgkr 







181
npitgntckq knrkqliacl apdrrvevev vgtqevqktt vpak 





412
40
83719322
YP_442131
37
1
ttagaagcga tgacgcaggc ctgccgtgac ggcgatctgc ttctcgctcg acgacatgcc 







61
gccgacgccg ttgatatagg cgccgaggcc cgtgccgtcg ctgttgacct tctgatacac 







121
gccttgcagg tacacgtccg tgcgcttcga cagcgcgtag gccgtttgca ggttgaactg 







181
attccagccc gggtggtggt tctcgatgaa gccggcggtg tacgtgtacg agcccgcgac 







241
cgtccatgcc ggcgtgatcg cgtagcgcgc gttcacttcg tagttgttga aacgcatgaa 







301
cgtgccgtcg agcgcgatgc cgctcgacac gcccgatgcg cccgagctga tcgccgtcgc 







361
gcgattgacg cgcgattgcg tgaacacgaa gccggccgtc gccgggccgt acgtgtaatt 







421
caggccgccg ccgaacacgc gctggcgctt gccgacgaac gtattgtcga gcgcgaccgc 







481
gcccgaagcg ttcgacacgg tcgggccgaa gtcgttgttc aactgcaggt agccggcgcc 







541
gacgttgaag ccggcgtagc tatacgacac gcccgcgcta tacgcacggt tgttcgcgaa 







601
ctcgttgctg ttcgagaagc cgtacaacgc gccgaacttc aggccggccc agttcacgct 







661
cgtgtacttg accgcgttgt tgatccggaa cgagttgttc aggttgtcgt tgtcgaacgg 







721
gtgggcaaac tgcacgccgc cgaattgcgt gccggtcagc gacagcggcc cgatgtagtc 







781
gacgacgcta tcgtactgac ggccgagcgt cagggagccg tattgatcgt gcgacagacc 







841
gacgaacgcc tggcggccga actcgcggcc gttctgcttg agcgtgccgt tattgatgcc 







901
gaagccgttt tccaacacaa aaatcgcctt caggccgccg ccgaggtctt cggcgccgcg 







961
caggccccag cggctgccgt tgaccgagcc gctggactgc gaccatgcac tgtggccgcc 







1021
ttggttgttc gtgtaggtga tgccggcgtc gatgaggccg tacagcgtca cgctgctttg 







1081
cgcgtgagcg gccgtcgcga agacgccaga gagagcagcg acgatgaggg tctttttcat 









12
1
mkktlivaal sgvfataaha qssvtlygli dagitytnnq gghsawsqss gsvngsrwgl 







61
rgaedlgggl kaifvlengf ginngtlkqn grefgrqafv glshdqygsl tlgrqydsvv 







121
dyigplsltg tqfggvqfah pfdndnlnns frinnavkyt svnwaglkfg alygfsnsne 







181
fannraysag vsysyagfnv gagylq1nnd fgptvsnasg avaldntfvg krqrvfgggl 







241
nytygpatag fvftqsrvnr ataissgasg vssgialdgt fmrfnnyevn aryaitpawt 







301
vagsytytag fienhhpgwn qfnlqtayal skrtdvylqg vyqkvnsdgt glgayingvg 







361
gmsssekqia vtaglrhrf 





413
44
83716488
YP_439650
38
1
tcagaacttg tgacggatcg ccgcgcggac cgcgaactgg ttcgccgatg ccgacgggcc 







61
gtccgtgccg acgacgtagc cgccatccgc gatcgtgccc gtcttgtcgc ccgcaacctt 







121
ctggtatgcg ccctgcagat agacgtcggt gcgcttcgac aggttgtagt ccgccatcag 







181
gccgatcgtg tggtacttcg gcttgaccgt gcccgcagcc gcgtcgaact tgccgtccgt 







241
gtacacatac tgcgcgccga tgaagagcgc cggcgtcagt tggtacttgc cgttgatttc 







301
gaagttctgg aacttcgtcg cggtcagacc gaggcccgcg aacgtcgccg tcggcagata 







361
gaccgtcgag accgggttct tcacgtcggt cttcgtgtag acgaagccga ccgtcgccgg 







421
gccgaacgtg tagttgacgc cgccgccgaa gatgcgcagg cggtcggccg tgaagttcgc 







481
gtcgttgtcg gcgatcgcgc cgcccgccgt cttgcccggg ttgttcgcct gcaggtacgc 







541
ggccgcgagt tgcaggccgc ccagcgtgta cgcggcgccg atgctgtatt gacggttgtt 







601
cgagaagccc gtcgcgttgc tgaagctgta cgtgccgccg acttgcagac cgttccagtc 







661
cgggctcgcg tacttgatcg tgttgttcac acggaacgag ttatccgtgt tgtcgttgtc 







721
gaacgggtgc gagaacagca ggccgcccca gttgccgttc gccgtcagcg gcgcgaggta 







781
gtcgacgacc gaatcgtact ggcggccgag cgtgagcgaa ccgtattgcg cgtgcgacag 







841
gccgacgaac gcctggcggc cgaacatccg gccgccctgg ccaagccggc cgttattcac 







901
gtcgaagcca ttttccaacg tgaagacggc cttcaggccg ccgcccaggt cttccgagcc 







961
gcgcaggccc cagcggctgc cctgcgcgaa gccgctcgcg agctggtagt tgctcttgcc 







1021
gacgccgttg acgttcacgt tgctcgtgta gttgaaaccc tcgtcgatca cgccgtacag 







1081
cgtgacgctg ctttgggcga aaaccggcgc ggcgaatgcg agcaatgcgg cagcggaaat 







1141
gatgtgcttt ttcattatcg atatctccag accttgattc ggcggcctcg cggatgtcgg 







1201
gccgccacga tgtgttcggc gtttctgcgc cccattccgg ggcat 









13
1
mprngaqkrr thrggptsar ppnqgleisi mkkhiisaaa llafaapvfa qssvtlygvi 







61
degfnytsnv nvngvgksny qlasgfaqgs rwglrgsedl ggglkavftl engfdvnngr 







121
lgqggrmfgr qafvglshaq ygsltlgrqy dsvvdylapl tangnwggll fshpfdndnt 







181
dnsfrvnnti kyaspdwngl qvggtysfsn atgfsnnrqy sigaaytlgg lqlaaaylqa 







241
nnpgktagga iadndanfta drlrifgggv nytfgpatvg fvytktdvkn pvstvylpta 







301
tfaglgltat kfqnfeingk yqltpalfig aqyvytdgkf daaagtvkpk yhtiglmady 







361
nlskrtdvyl qgayqkvagd ktgtiadggy vvgtdgpsas anqfavraai rhkf 





414
39
83718334
YP_439716
39
1
ttagaagcgg tgacgcagac caaccgtcgc tgcggtttgg ttgatcgacg tgctgaacgg 







61
cgtgctgagg tcgccgttgt agatcgacgc attcgcgttc ttcgacgaac gctggtacac 







121
agcttgcgcg tacacgtcgg tgcgcttcga cagcgcgtag tcggcttgga cgccaacttg 







181
gttccagtgc gcgctttcgc cgttcgcctt cgcgttcgtg tacgtgtaag caacgcccag 







241
acccagtgcc ggcgtcaggt tgtacttcac gttcgcttcg tagttgtcgg cacggatggt 







301
cggcgcgccg tttgccaggt tgtcgagacg cgattgcgtc cagagcaggc cgccttgcag 







361
cgggccgtaa gcgtagcttg cgccagcgcc gtacacgcgc gaacggcctt ggatgcttgc 







421
tgcgttcgtg ccgccgatgt tgaagccggt cagcggatcc gttgcgcccg tggtgttcgc 







481
gcctgcgttg ttagcttgcg agtacgcggc accgaccttc aggccttgga actggtacga 







541
agcgcccgcg ctgtacgcac ggttgttcgc gaattgcgag ttgttcgaga acgagtacgt 







601
gccgccgaat tgcaggccgg cgtagttcgc gctcgtgaac ttgaccgtgt tgttcaccgc 







661
gacgtcgccg ttcgtgttca ggcggtcgtt gttgagcggg tgcgcgaagt acgtgccacc 







721
ccaggtgccc gttgccgaca gcggcgacag gtagtcttgg gtcgcgtcgt actgacggcc 







781
cagcgtgacg gtgccgtagt tgctcgacag gccgacgaac gcttgacggt tgaacatgcc 







841
gccaccgttg ttgaagcggc cgttaccgat gttgaagccg ctttccaacg tgaagatcgc 







901
cttcaggccg ccgcccaggt cttccgaacc gcgcaaaccg aaacggcttt ggtcgacgcc 







961
ggcgccgacc gaccacagcg acttacccga gcccgacggc gtcgcgacgt tgctttggta 







1021
cgtgatgccc gcgtcgagca cgccgtacag cgtgacgctg ctttgcgcgt gtgcgacggt 







1081
agcgaacgat gcagcaactg ctgcaacaat cagagtcttg ttcat 









14
1
mnktlivaav aasfatvaha qssvtlygvl dagityqsnv atpsgsgksl wsvgagvdqs 







61
rfglrgsedl ggglkaiftl esgfnigngr fnngggmfnr qafvglssny gtvtlgrqyd 







121
atqdylspls atgtwggtyf ahplnndrin tngdvavnnt vkftsanyag lqfggtysfs 







181
nnsqfannra ysagasyqfq glkvgaaysq annaganttg atdpltgfni ggtnaasiqg 







241
rsrvygagas yaygplqggl lwtqsrldnl angaptirad nyeanykynl tpalglgvay 







301
tytnakange sahwnqvgvq adyalskrtd vyaqavyqrs sknanasiyn gdlstpfsts 







361
inqtaatvgl rhrf 





415
55
83716254
YP_438901
40
1
ttagaacttg tggcgcaggc ccagggcaac catctcttgc gacttcgtac cgccgatgcc 







61
gtacgagccg atcgacgcct gtgcgtcgcc gccgcccgct tgcgtgccgc ttgcgtgctg 







121
gtacgcgcca gccagataga cgtcagtccg cttcgacagc gagtagtctg cgcccagcga 







181
aacctggtgg tacttcgcgt tcgaatcgcc gctcgacttc gtgtagatat aaccgaggcc 







241
cagcagcagc gccgacgtcg cctggtacgt gacgaagccg cggcccgtgt tgtacttctc 







301
ggtcgagcca aatgccgatg cgctgtccga cttgtattgc gcgttgctgt agccgaggcc 







361
gaacgtgaac gggccaaaca catattgagc gccgacttga gcgatgccga tcgacttggc 







421
cgtcgcgtag ccgttgttca tgaacagcga gttgtccggg ccatcgaaga tcgcgtcgga 







481
ggagccgccc cagcccgtgc gaacgcctgc tgcgctaagc ggagccgagt tcgatgcgtg 







541
gaagtagccg gccccgatgc cgaccgggcc gttgttgtac gccgcggcgg ccgcccacgt 







601
ctgtccggta cccggcgagc ctgcgatgcc gctgaagccg tacaacgcct cgaactggaa 







661
gccggcaaac acgggcgacg tgtacttgac cgtattgttc acgcgcaggc tgttgtcgtt 







721
gttgtcgacg tcacccgggg tggcgaacag gctgccgaag tagttgtctg cagtgacggg 







781
ctgaaccagg tcgacgagcg gatcgtattg acggccgagc gtcagcgtgc cgtacgtatc 







841
gctttgcacg ccgacgaacg cctgacggcc gaacatgcgg ctgccttggc cgagcgcgcc 







901
gttagccgaa ttgaagccgt tctcgagctg gaaaatggct ttcaggccgc cgccgagatc 







961
ctcggtgccc ttcaggcccc agcgggcacc ttgcaggttg ccgctgccca tttgccatgc 







1021
gttgttggcc ttgccgtcgt taccatggac ataagtgatc gacgtatcaa tcacgccata 







1081
cagcgtgacg ctgctttgag cctgagctgc accggccgcg cccagcagcg cgagcgagag 







1141
ggtagacaga gcgagtttct tcattgagct atttctccac gcgagtgagt cgttatttgt 







1201
tgcggattgg agaatagcgc actgcatcaa atgcaagaaa tccaaaaaaa taaagtgtct 







1261
caaaaaggtg acagcgatga aaagtctttt tatatcaatc acataaacga ccatttcgat 







1321
tccgggaaca ctggccgatg gttgcgttgc gattgttgcg gaagcaagca tattgcttgt 







1381
gtgtggggcc gttttcgggc tgttttcgca gcaaagtcgg tgcatgcaaa cgtttgcggg 







1441
cgtggcggtg gctgcgatgg cgacaggcgc gcgctcgaac gccgcactct gctccgacac 







1501
gacgccggca tctttccgcg ccaggtgcgc ggtagatcga gcaggacgtt tgctcgggcg 







1561
caatacgcgc aa 









15
1
mrvlrpskrp arstahlark dagvvseqsa aferapvaia atatpanvcm hrlccenspk 







61
taphtsnmla satiatqpsa svpgiemvvy vidikrlfia vtflrhfifl dflhlmqcai 







121
lqsatnndsl awrnssmkkl alstlslall gaagaaqaqs svtlygvidt sityvhgndg 







181
kannawqmgs gnlqgarwgl kgtedlgggl kaifqlengf nsangalgqg srmfgrqafv 







241
gvqsdtygtl tlgrqydplv dlvqpvtadn yfgslfatpg dvdnndnslr vnntvkytsp 







301
vfagfqfeal ygfsgiagsp gtgqtwaaaa aynngpvgig agyfhasnsa plsaagvrtg 







361
wggssdaifd gpdnslfmnn gyataksigi aqvgaqyvfg pftfglgysn aqyksdsasa 







421
fgstekyntg rgfvtyqats alllglgyiy tkssgdsnak yhqvslgady slskrtdvyl 







481
agayqhasgt qagggdaqas igsygiggtk sqemvalglr hkf 





416
58
83720933
YP_442964
41
1
tcacgcacac gcgtcttcgc cgccgcacgc ggcgcgcgac cagccgccgc cgagcgcgcg 







61
atacagcgtg atggcgttcg tgagccgcaa ctgcttcagg cgaatcagtt cctgcccgga 







121
ctcgaacgtg ctgcgctgcg cgtcgagcaa ttcgagatag cttgcgacgc cgctgccgta 







181
gcgccgctcc gcgagccgca gccgctcggc gtccgcgccg tagaccgcct gctgcgcggc 







241
gagctgcgca tcgatctgat cgcgcgcggc gagcgcgtcg gccacttcgc ggaacgccgt 







301
ctggatcgtc ttttcgtatt cggcgaccgc gatatgcttg cgcgcgtcgg cgacatcgag 







361
attcgcgcga ttgcgtccgc ccgcgaagat cggcagcgtg agccgcggcg cgaacgtcca 







421
cacgctcgag cctgccgaga agagccccga gaacgcgtcg ctcaccgagc cgatgtcggt 







481
cgtgagcgcg atgcgcggga agaacgccgc gcgcgccgcg ccgatgttcg cgttcgccgc 







541
gacgagccgc tgctcggcct gccggatgtc aggccgctgc tcgagcaggt ccgacgacag 







601
ccccggcgac acccgcgcga ccgcgagttc gtcgagcgcg ggcgcatcgg cgggcagggc 







661
cgccgtgaag tcgcccgcga gcagcttcag cgcgctcgcc gcctgcgtgt gctcgcgctc 







721
gagcgccgcc ttcgatgcgc gcgcggacgc gacgagcatc tcggccgtgc gcagctcgat 







781
cgccgtgctc gtgcccgccg cgtagcggcg ctgcgtgagc gcgtaagtcg cgtcgcgcgc 







841
ggcgagcgtg cgttcggcga gcttcagttg gtcgacgagc gcgcgctcgc gcacataggc 







901
ggacgcgact tcggcgatca ggctgacgcg cgccgcgcgc tgcgcctcgg cggtcgcgaa 







961
gtactcggcg agcgccgcgt ccgacaggct cttcacgcgg ccgaacaggt cgatctcgaa 







1021
cgcgctcaag ccgacgctcg cgcgatacag cgagctcgtc gcgctctcgc gcagcaccgg 







1081
gtcgtagagc cgcgtgcgct cgtagccgag gctgccgtcg atcgacggca gccggtccgc 







1141
gcgcgcgacg ccgtaaaggc cgcgtgcctc ctcgatgcgc agcgtcgcga tccgcagatc 







1201
gcgattgttc gcgagcgccg cgtcgatcag cgcgcgcagc gccggatcgg tgaagtacgc 







1261
gcgccagtcg tcgaggcgca catcgtgcat ctcctgcaca ttgccgccgc cgcgcgcgtc 







1321
ggacgctggc tgcgcgtcgc gcgcgccgct gccggctgcc gccgggctcg cggcgcgcgg 







1381
cgcgcccgcc gccgcgggca gataattcgc cggcacgggc gcggccggac gctcgtagcg 







1441
cggcgcgagc gtgcagcccg tgacgagcgc ggccgcgagg gcggccggcc atgcccgcag 







1501
cccgcccggc cgccgcgaac cgtgcggccc gccctgacgg gccaggggtt tcatcatcgt 







1561
tttcacttga tctccagtgc cgcgcgcgcg ccgccgcggc gacgcggaac gatgtcgacc 







1621
acgcggccga cgcat 









16
1
mrrprgrhrs asprrrargt gdqvktmmkp larqggphgs rrpgglrawp aalaaalvtg 







61
ctlapryerp aapvpanylp aaagapraas paaagsgard aqpasdargg gnvqemhdvr 







121
lddwrayftd palralidaa lannrdlria tlrieeargl ygvaradrlp sidgslgyer 







181
trlydpvlre satsslyras vglsafeidl fgrvkslsda alaeyfatae aqraarvsli 







241
aevasayvre ralvdqlkla ertlaardat yaltqrryaa gtstaielrt aemlvasara 







301
skaalereht qaasalklla gdftaalpad apaldelava rvspglssdl leqrpdirqa 







361
eqrlvaanan igaaraaffp rialttdigs vsdafsglfs agssvwtfap rltlpifagg 







421
rnranldvad arkhiavaey ektiqtafre vadalaardq idaqlaaqqa vygadaerlr 







481
laerrygsgv asylelldaq rstfesgqel irlkqlrltn aitlyralgg gwsraacgge 







541
daca 





417
55
83718989
YP_441238
42
1
atgaagcgaa aacatgcttt gactgcactc gcagtcgcgc tgctcgccgc gggctgcacg 







61
ctcgcgccgc gctacgagcg tccggccgcg cccgtgtcgg gcgcgttccc gaccgacggc 







121
gtctacgccg cgcagccggg cgccgcggcc ggcgcgcgca gcgcgaacgg ccaggcggcc 







181
gtcgacatcg gctggcgcga gttcttcgtc gatccgcgcc tgcagcggct gatcgagatc 







241
gcgctgaaga acaaccgcga tctgcgcgtg tcggtgctca acgtcgaggc gtcgcgcgcg 







301
cagtatcgga tcacgcgcgc gggcctgttc ccgacgctcg acggcaccgg cacgggcacg 







361
atccagcgca cgccggccgg tgtgtcggta accggccagc cgctcatctc gcggacctac 







421
aatgtcggcg tgtccgcgtc gtgggagctc gacctgttcg gccgcgtgca gagcctgaag 







481
gatcaggcgc tcgcgcaata cttcgcgacc gcgcaggcgc gcaaggccgc ggagatctcg 







541
ctcgtcgcga gcgtcgccga tcagtatctg acgctgctgt cgaccgacga tctgctgcag 







601
atcacggaga acacgctgaa gacggcgcgc gcgtcgtacg acctgacgaa gctgcagttc 







661
gacaacggca ccggctcgga gctcgatctg cggcaggcgc agacggtcgt cgagaccgcg 







721
ctcgcgaacc agcaggcgca ggcgcgcgcc cgcgcgcagg cggtcaacgc gctcgtgctg 







781
ctgatcggcg agccgctgcc cgacgatctg ccggccggcc tgccgctcaa tgcgcagaat 







841
ctgctgaccg acatcccggc cgggctgccg tccgatctgc tgacgcgccg ccccgacatc 







901
atgcaggccg aggagacgct gcgcgcggcg aacgcgaaca tcggcgcggc gcgtgcggcg 







961
ttcttcccga aaatctcgct caccggcgcg ttcggcaccg cgagcccgac gctcggcggc 







1021
ctgttcaagg cgggcacggc ggcgtggtcg ttcgcgccga gcatcgcgct gccgatcttc 







1081
gagggcggac agaacatcgc caatctcgat ctcgcgcacg tgcagaagcg catcgagatc 







1141
gcgaactacg agaaggcgat ccagagcgcg ttccgcgagg tgtcggacgg gcttgccgcg 







1201
cgcggcacgt acgatcagca gatcgcggcg ctcgagcgca acgagcacgc gcagcagcgc 







1261
cgctacgatc tgtcggacct gcgctacaag aacggcgtcg acagttatct gtcggtgctg 







1321
accgcgcaga cggatctgta ttcggcgcag cagcaactga tcagcgcgcg gctcgcgcgc 







1381
tggacgaacc tcgtcgatct gtatcgcgcg ctgggcggcg gctggatcga acgcgcgggc 







1441
gagacgccgc gcccggcgga tgcgcctgtc gactacggca aggcggcagc gcctgcgccg 







1501
gcgtcggcgt cggcgtcggc ggcgcccgct gcggcggcca gcgcgccggc ggccggctga 









17
1
mkrkhaltal avallaagct lapryerpaa pvsgafptdg vyaaqpgaaa garsangqaa 







61
vdigwreffv dprlqrliei alknnrdlrv svlnveasra qyritraglf ptldgtgtgt 







121
iqrtpagvsv tgqplisrty nvgvsaswel dlfgrvqslk dqalaqyfat aqarkaaeis 







181
lvasvadqyl tllstddllq itentlktar asydltklqf dngtgseldl rqaqtvveta 







241
lanqqaqara raqavnalvl ligeplpddl paglpinaqn lltdipaglp sdlltrrpdi 







301
mqaeetlraa nanigaaraa ffpkisltga fgtasptlgg lfkagtaaws fapsialpif 







361
eggqnianld lahvqkriei anyekaiqsa frevsdglaa rgtydqqiaa lernehaqqr 







421
rydlsdlryk ngvdsylsvl taqtdlysaq qqlisarlar wtnlvdlyra lgggwierag 







481
etprpadapv dygkaaapap asasasaapa aaasapaag 





418
19
83717287
YP_440214
43
1
gtggcaagcg aagatcggca agattccggt cgcgagcgcg taaataggaa tgattttcag 







61
gccggatttc aatccggcct ttttgatgcg gcgaacccgt tccacctttc ggatatcgaa 







121
cggattcgcc cttggctttt cgcgaacgag cgtttatcgt tcaggaactc aagcccccac 







181
acgtcacaag gaacgaacat gcaaggcgac aagaaagtca tcgaatatct gaacgcgcag 







241
ttgaaaaacg agctgaccgc gatcaatcaa tatttcctgc atgcgcggat gtacaagcac 







301
tggggtctcg agaaactcgg caagcacgaa tacgacgaat cgatcggcga aatgaagcat 







361
gcggactggc tgatcgaacg cgtgttcatg ctcgacggcc tgccgaacct gcaggatctg 







421
cacaagctgc tcatcggcga ggaaaccgag gaaatcctga aatgcgatct gaagctcgaa 







481
caggtctcgc aggcgacgtg caaggaagcc atcgcgtatt gcgaatcggt gcgtgattac 







541
gtgtcgcgcg aaatcttcga gaaaatcctc gacgacaccg aagagcatat cgactggctc 







601
gaaacgcaaa tcgacctgat cggcaaggtc ggcattcaga actatcagca gtcgatgatg 







661
ggttcgccgg agtaa 









18
1
masedrqdsg rervnrndfq agfqsglfda anpfhlsdie rirpwlfane rlsfrnssph 







61
tsqgtnmqgd kkvieylnaq lkneltainq yflharmykh wgleklgkhe ydesigemkh 







121
adwliervfm ldglpnlqdl hklligeete eilkcdlkle qvsqatckea iaycesvrdy 







181
vsreifekil ddteehidwl etqidligkv giqnyqqsmm gspe 





419
8.5
83718020
YP_440216
44
1
atgatcgtct gcgtgtgcaa gtccgtttcc gatcggaaga ttcgcgcatc cctcgcggaa 







61
ggcgtgaaca ccttcgaaga actccagttc gaactcgggg tcgccacctg ctgcggcaag 







121
tgcgaggaaa ccgtgcgcga gatcatggcg gaacagggcg tttgtgcgag ccgctgcggt 







181
gtcgagcccc ccgcggccgt gccggtcccc gtcacgttct acgaacgcaa ggcggcctga 









19
1
mivcvcksvs drkiraslae gvntfeelqf elgvatccgk ceetvreima eqgvcasrcg 







61
veppaavpvp vtfyerkaa 





420
36
83718178
YP_439311
45
1
gtggggcgca tcgggtaccg gcggcgcgcg gctcgttcgc gcgccgcatc gagaggcaag 







61
cgcatgctga atttcctggt caaacgactg ttcgggctgc tgccgacgct cgcgtgcgtc 







121
gcggtgctcg tgttcctgtt cgtccacctg ctgccgggcg atcccgcgcg gctcgcggcc 







181
gggcccgaag ccgacgacgc gacggtcgcg ctcgtgcgcg ccgatctcgg cctcgacaag 







241
ccgctgcccg cgcagttcgc gagcttcttc gcgcggatcg cgcaaggcga cttcggcgtc 







301
tcgacgcgca gcaagcggcc cgtcgcgacc gagatcggcg agcgcttcat gccgacgctg 







361
tcgctgaccg tcgtcagcat ggcctgggcg acgctcttcg ggatggcgat cggcatcgcg 







421
tcggccgtgt ggcgcaaccg ctggccggac cgcgtcggca tgacgctcgc ggtgtcgggc 







481
atctcgtttc cggcgttcgc gctcggcatg ctgctgatgg aagtgttctc ggtgaagctc 







541
ggctggctgc ccgtcgtgcc ggacggctcg tggaagagct acgtgctgcc gtcgctgacg 







601
ctcggcgcgg ccgtggccgc cgtgatggcg cgcttcacgc gcgcttcgtt catcgaggtg 







661
ctgaacgagg atttcgtgcg caccgcgcgc gcgaagggcg tgcgcgagcc gatggtcgtg 







721
ctcaagcact gcctgcgcaa cgcgatgatt ccggtcgtca cgatgatggg gctgcagttc 







781
ggcttcctgc tcggcggctc gatcgtcgtc gaggcggtgt tcaactggcc ggggctcggc 







841
cgcctgctcg tcgatgcggt gacgatgcgc gactacccgg tgatccaggc gatcgtgctg 







901
ctgttctcgc tcgaattcat cctgatcaat ctgaccgtcg acgtgctgta cgcggtcatc 







961
aacccgacga tccgtttcaa gtga 









20
1
mgrigyrrra arsraasrgk rmlnflvkrl fgllptlacv avlvflfvhl lpgdparlaa 







61
gpeaddatva lvradlgldk plpaqfasff ariaqgdfgv strskrpvat eigerfmptl 







121
sltvvsmawa tlfgmaigia savwrnrwpd rvgmtlavsg isfpafalgm llmevfsvkl 







181
gwlpvvpdgs wksyvlpslt lgaavaavma rftrasfiev lnedfvrtar akgvrepmvv 







241
lkhclrnami pvvtmmglqf gfllggsivv eavfnwpglg rllvdavtmr dypviqaivl 







301
lfslefilin ltvdvlyavi nptirfk 





421
42
83718226
YP_439914
46
1
tcagaagcga tggatcagac cgacgccgac gcccacctgg ctgcgcgacg acgacggcga 







61
cgtgttgaag ccgtcgccga tcgacgcggt cgcgtcgatg atcttgccgc ccgcgagcgt 







121
cttgccgttc gcacgctggt acgcctcgac cgcgtaaagg cccgtgcgct tcgacaggct 







181
gtaatactgc gacagcgtga actggtggta ctgcgccgcg ctcgtgatgc cgttcgattg 







241
cgtcgcgcgc gtgtacgcgt agccgcccgc gaagtcccac tgagcggacg gcttgaagtg 







301
cagcaccgca cccgccgtgt tgaagatcgc ggtgttgcgg aacgccgagt tgacgcccgg 







361
aatgtactgc acgttcgagt acgacaccga gatgtcccac gcggacgaga actggtagcc 







421
ggccgtgacc gcgacgcgct gctgcgcctg cgcggtcttg tagccgttgt tgatcgccga 







481
cacggccgtc tgcgcgccgc cgttggacgt cgtcgaattc gcgccccagt cgccgccgcc 







541
cgacgtcgag ttgttcacgc gctggaacgc cgccgcgatg ccgagcggac cgttcatgta 







601
ctggatgccc gcgctccacg tggagccggc gttcacgctg cccggctggc cgccgaacgc 







661
atacgagccg ccgaacgtga agccgtagaa cttcggcgac atgtagacga gcgagttgtt 







721
cgtgcggtag ctggtgtcga gcgagtcgat gtcgcccggg tgcgcgccga agtaaccggt 







781
gagccaggtc gtcgggctgt acggcgacag cagcgtgtag tacgcggtgt attgacggcc 







841
ggccgtcagc gtaccgtacg tcggattggt caggccgacc cacgcctgcc gcgtgaagat 







901
gccgcccgcg aactgcgacg tgccgttggc cgtgctgaaa cccgattcga gctggaagat 







961
cgccttcgag ccgccgccga ggtcctcgct gcccttcagg ccgaagcggc tgcccgccca 







1021
cacgcccgtc gacatcttca ccgccgagcg gccgcccgtc gttgagccga gcgacgtgct 







1081
gctgctctga tacgcgatgc cgttatcgac gataccgtaa agcgtcacgc tgctctgagc 







1141
gtgcgcggtg gcgacagccg ccaatccggc ggccgtcatc gcgaccgctg tgcgcttttt 







1201
at 









21
1
mkkrtavamt aaglaavata haqssvtlyg ivdngiayqs sstslgsttg grsavkmstg 







61
vwagsrfglk gsedlgggsk aifqlesgfs tangtsqfag giftrqawvg ltnptygtlt 







121
agrqytayyt llspyspttw ltgyfgahpg didsldtsyr tnnslvymsp kfygftfggs 







181
yafggqpgsv nagstwsagi qymngplgia aafqrvnnst sgggdwgans ttsnggaqta 







241
vsainngykt aqaqqrvavt agyqfssawd isvsysnvqy ipgvnsafrn taifntagav 







301
lhfkpsaqwd faggyaytra tqsngitsaa qyhqftlsqy yslskrtgly aveayqrang 







361
ktlaggkiid atasigdgfn tspsssrsqv gvgvglihrf 





422
56
83719074
YP_443076
47
1
ttgccacaac aagtaattca gacagataaa tgtaaggaaa tccttgcagt gtccattatt 







61
cattccgagt caggaaacac agcgatgaaa accttcccgt tgtccgcttg ccgaaccgcc 







121
acggccgtcg cggtcgccgt gctcgcgctc gcgggatgcg cgaactattt cggcatcaag 







181
agcgacaagg cgatcgcgcc cgcgacgcag ttcgagagcg cgcagagcct gccggcgcaa 







241
ggcggccggt ggccgtcgct cgactgggcg aaccagttcg gcgatccgca actgcccaag 







301
ctgatcgacg aggcgctcga aggcaatccg acgatcgcgc aagcgcaggc gcgcatcgcg 







361
aaggcatcgt cgtacatcga atcgtcgcgc tcgacgctgc tgccgaaggc ggacgcgaag 







421
tactcgtgga cccgcgagct gtattcgagc aacgccctcg tcccgccgcc cttcggcggt 







481
cagtggtaca gcgagaacaa cgcgctcgcg agcgcgtcgt gggaactcga cctgtggggc 







541
aagaaccgcg cgcgcctcaa tgcggccgtg tcgcaggaaa aggccgccga agccgacatg 







601
cagcaggcgc gcgtgacgct cgcgacgtcg gtcgcgcgca cgtacaacca gctcgcgcag 







661
ttgtacgcgc tgcgcgacat cgccgagcgc gagatctcga accggcagac ggtcggcaag 







721
atcaccgacg gccgcgtcgg cgcgggcctc gacacgaacg tcgagcgcca gaccgcgctc 







781
ggcaacatcg cgacgagcca gtcgacgctg tccgatctcg acggccagat cacgaacgtc 







841
cgctatcagc tcgccgcgct gctcggcaag ggcccggacc gcgggctgca gattgccgcg 







901
ccggtgctga gcccgggcgg cggggtcgcg ctgccggaca acctgcccgc cgatctcgta 







961
tcgcgccgcc cggacctcgt cgccgcgcgc tggcaagtcg aggcggcgat gcacgacgtg 







1021
aaggaagcga aggccgagtt cttcccggac atcaatcttg cggcgggcct cggcttcgac 







1081
gcgttcggct ggggccgctt cctgaacttc gcgagccgcc aggcgcaatt cggcccggcg 







1141
atccatctgc cgatcttcga cggcggcgcg ctgcgcgcgc aactgaaggg ccgctacgcg 







1201
gacttcgatc tgtcggtcgc gaactacaac cagacgctcg tgagcgcgct gaacgacgtc 







1261
gcgacgcaag tcgcgtcgat ccgctcgatc gacacgcaga tgggcgacgc gcagcgcgcg 







1321
ctcgatgcgt cgacgcgcgc gtacgagctc gcggtgatcc gctacaaggc gggcctgtcg 







1381
ccgcagctgc aggtgctgaa cgcggacagc aaccggctcg ccgccgagca gacggtgacg 







1441
aacctgaaga tgcgccgccg cgatctgcag atcggtctcg tgaaggcgct cggcggcggc 







1501
ttcgacgcga ccggcacgcg gctcgccacg cccgcaccgg cagtggcggc ggccgccgca 







1561
cccgcgcggc acgcgtcgaa ctga 









22
1
mpqqviqtdk ckeilaysii hsesgntamk tfplsacrta tavavavlal agcanyfgik 







61
sdkaiapatq fesaqslpaq ggrwpsldwa nqfgdpqlpk lidealegnp tiaqaqaria 







121
kassyiessr stllpkadak yswtrelyss nalvpppfgg qwysennala sasweldlwg 







181
knrarlnaav sqekaaeadm qqarvtlats vartynqlaq lyalrdiaer eisnrqtvgk 







241
itdgrvgagl dtnverqtal gniatsqstl sdldgqitnv ryqlaallgk gpdrglqiaa 







301
pvlspgggva lpdnlpadlv srrpdlvaar wqveaamhdv keakaeffpd inlaaglgfd 







361
afgwgrflnf asrqaqfgpa ihlpifdgga lraqlkgrya dfdlsvanyn qtlvsalndv 







421
atqvasirsi dtqmgdaqra ldastrayel avirykagls pqlqvinads nrlaaeqtvt 







481
nlkmrrrdlq iglvkalggg fdatgtrlat papavaaaaa parhasn 





423
85
83718695
YP_442137
48
1
atgtcccatg aacttgccgc gcacgtcgcg cgcacgcggc tcgcggccgc ctgcgtcgcg 







61
gcgttcgcct ggcccgccgc gcacgccgtc acgacgggcg ccgccgtccc tgccgattca 







121
acgtccgccg ccgctgccga gacgaccgca tccgggaaaa ccctggatat cgtcagggtg 







181
accgcgcagc gccccgcatt cgcgtccgac acgcccggcg tcgtcgaggc gctcacgcgc 







241
gagcagatcg attcgcacgt caacgtgacg accgaagacg cgctcaagta cgcgccgaac 







301
ctgatggtgc gccggcgcta tatcggcgat cgcaactccg tgttcgccgg ccgcgatttc 







361
aacgagttgc agagcgcgcg cggactcgtt tacgcggacg gcatcctcct gtcgaatctg 







421
ctcggctcca gttactcgta tccgccgcgt tggtcgctga tccagcccga cgacatcgcg 







481
cgcgtcgacg tgctgtacgg cccgttctcc gcgctctacc cgggcaatgc gatcggctcg 







541
accgtgcaga tcacgacgcg caagccggat cggctcgagg cgtcggtgtc gacgcagttc 







601
ttcacgcagc gctatcgcga cggctacggc tttgccgaca gcttcggcgg caatcaccag 







661
accgcgcgcg tcgccgaccg cgtcgggcgc ttctggtatg cgctgtcgct cgaccggctc 







721
gagaacgaca gccagccgat gcaatacgcg agcccgaatg gcacgttcga tccgcggctc 







781
ggcgcgagcg tgccggtgac gggcgccgtt tccgacatcg gcccgaacgg ccggcctcgg 







841
acgatcgtcg gcgcgcagac gatcgagcgc accgagcagc tcaacgagac gctgcgcttc 







901
ggctatgcgt tcaccgacca cgtcgatgcg acggttacgc tcggccactg ggagaatcac 







961
taccggcagc acggcgacac gttcctgcgc gacgcggcgg gcaacccggt atacggcggc 







1021
aacgtgtcgt tcggcgggcg cagctacacg gtgtcgccga ccgcgttcgc gccgcagacc 







1081
ggcgaccagg agaactggct gtacgggctc gggctcgacg cgcgtctcgc atcaggctgg 







1141
aagctgtcgg cgaccgcgtc cgcgtacgag gtgtcgcgcg acgtgctgcg cagcgcgtcc 







1201
ggcgcgccga ccggcgcgtg ggacggcggc ccgggcacgg tattccatgg cgacggcacc 







1261
ggctggcgca ccgtcgattt gcgagcggag tcgcccgacg tgcgcgggca ccgcttctcg 







1321
ttcggctatc acttcgacac ctatttcctg cgcaacgcga cctacaacac ggcggactgg 







1381
caaaacgccg tgccgacgac gcttgcgaac cgttatcgcg gcaacacgcg cacgcaggcg 







1441
ctgtacgcgc aagacgcgtg gcgtttcgcg cccggctggc tcgcgacgct cggcctgcgc 







1501
tacgaacgat gggatgcata cggcggccag ctcggcaacg cgaacgcgac gctcggctac 







1561
gccggccgtg gcgcgaccgc gctgtcgccg aagctcgcgc tcgaatggca gccaacggac 







1621
gcatggcgcc tgcggctgtc gttcgcgacg ggcacgcgct ttccgaccgt ggccgaactg 







1681
ttccagggca cgatctcgaa caacgcgatc gtcaacaaca acccgaacct gcaaccggaa 







1741
aaggcgatcg actgggactt cacggccgag cgcgacgtcg gcttcggcgt cgtgcgcacg 







1801
agcgtgttcc agagcgatct gcgcaattcg atctacagcc agacgacggt cgcgggcgct 







1861
tcgacgtaca cgaacatctc gaacgtcgac cgcgtgcggg tgcgcggcgt cgaactcgcg 







1921
ttttcagggc aggacgtcgc gatcaagggg ctcgacgttg acgcgaacgt gtccgcgacg 







1981
aatgcgcaga cgctcgccga tgcggcgaat ccgaactacg tcggcgcgcg ttggccgcgg 







2041
attccacgga tgcgcgcgaa cttgctcgcg tcgtaccgct tcggcgagca ctggatgacg 







2101
agcgtcggcg tgcgctattc ggggcggcag tacaacgcgc tcgacaacag cgacgtgaac 







2161
ccgaacgtgt acggcggcac cagttcgttt gcggtcgtcg acctcaaggc gcgctaccgg 







2221
ttcgatcggc actggctcgc gtcgttcggc atcgacaacg tgaccgatcg ccgctactac 







2281
gtgtttcacc cttatccagg ccgcactttt tatggagagt tgaaatggtc gctgtga 









23
1
mshelaahva rtrlaaacva afawpaahav ttgaavpads tsaaaaetta sgktldivrv 







61
taqrpafasd tpgvvealtr eqidshvnvt tedalkyapn lmvrrryigd rnsvfagrdf 







121
nelqsarglv yadgillsnl lgssysyppr wsliqpddia rvdvlygpfs alypgnaigs 







181
tvqittrkpd rleasystqf ftqryrdgyg fadsfggnhq tarvadrvgr fwyalsldrl 







241
endsqpmqya spngtfdprl gasvpvtgav sdigpngrpr tivgaqtier teqlnetlrf 







301
gyaftdhvda tvtlghwenh yrqhgdtflr daagnpvygg nvsfggrsyt vsptafapqt 







361
gdqenwlygl gldarlasgw klsatasaye vsrdvlrsas gaptgawdgg pgtvfhgdgt 







421
gwrtvdlrae spdvrghrfs fgyhfdtyfl rnatyntadw qnavpttlan ryrgntrtqa 







481
lyaqdawrfa pgwlatlglr yerwdayggq lgnanatlgy agrgatalsp klalewqptd 







541
awrlrlsfat gtrfptvael fqgtisnnai vnnnpnlqpe kaidwdftae rdvgfgvvrt 







601
svfqsdlrns iysqttvaga stytnisnvd rvrvrgvela fsgqdvaikg ldvdanvsat 







661
naqtladaan pnyvgarwpr iprmranlla syrfgehwmt svgvrysgrq ynaldnsdvn 







721
pnvyggtssf avvdlkaryr fdrhwlasfg idnvtdrryy vfhpypgrtf ygelkwsl 





424
78
83717289
YP_438727
49
1
ttgatacgtt atatcgtttc gttgtcggcg cgcgattcgg cccgtgattc cgtccacccc 







61
gagaacaccg caccgattac cgcaatggat caccaccgac gtatcgcgcc gccttccgcc 







121
cgcaggctgc atcccttgtc gttgctgctc gccgcgagcc ttgcgcacgg cgagacgggc 







181
gagccagccg gccgcccgtc gagcgcgccg ccagcgacgg cgctcgcgcc gatcttcgtg 







241
accgcgaacc cgctcggcgc atcggcgctc acgtcgccga cggcgtcgct gtccggcgac 







301
gcgttgacgc tgcgccgcgc cgattcgctc ggcgatatgc tgaacggctt gcccggcgtg 







361
tcgacgacga cctacggtcc gctcgtcggc cgcccgatca tccgcgggat ggacggcgac 







421
cgcatccggc ttttgcagaa cggcgtcgcc gcatacgacg catcgtcgct gtcgtacgat 







481
cacgcggtgc cgcaggaccc gctgagcgtc gagcgcgtcg agatcgtgcg cgggccggcc 







541
gcgctgctgt acggcggcaa tgcggtcggc ggcgtcgtca acacgatcga caaccggatt 







601
ccgcgcgaag cggtcacggg cgtcacgggt gcgctcgacg cgagctacgg cggcgcgaac 







661
aacgcgcgcg cgggggcggc gctcgtggaa ggcggcaacg gccgcttcgc gttccatctc 







721
gacgcgttcg gccgcgagac cgacgagttg cgcatacccg gctatgcgca ctcggcgcgc 







781
cagcgcgcgc gcgacggcga ggacgcgagc gagccgtacg gcaagctgcc gaacagcgac 







841
ggccgccgtt acggcggcgc ggcgggcggc tcgtacacgt gggccgacgg ctacgtcggc 







901
gcgtcgtaca gcggctacga atcgaactac ggctcggtcg ccgaaaccga tgcgcggctg 







961
cggatgcggc aggagcgcgt cgcgatcgcg tcggaggcgc gcaatctgcg cgggccgttc 







1021
tcgcagctga aattcgactt cggctacacg aactatctgc accgggaaat cgaggacggc 







1081
gtgaccggca cgacgtttcg caaccacggc tacgaggcgc gcgtcgaggc gcgccaccgc 







1141
aagctcggcc cgttcgaagg cgcgctgggc gtgcaggtcg gccagaacac gttttccgcg 







1201
cttggcggcg aggcgctcgc gccgacgacg cgcacgacga gcgtcgcatt gttcggcctc 







1261
gaggaatggc aggcgaccga tgcactgaag ctgtccgccg gcgctcggat cgagcacgtg 







1321
cggctcgacc cgagcgcgaa cggggacgat aagtttggcg tcgcgcgctc acgcgatttc 







1381
aatgcgggca gcgtgtcggc gggcgcgctg taccagctcg cgcccgcgtg gtcggtcgcg 







1441
ggcaacgtgt cgtacacgga gcgcgcgccg acgttctacg aactgtacgc gaacggtccg 







1501
cacggcgcga cggggcaata cctgatcggc ctgccggatg cgcggaagga gaaggcggtg 







1561
tcgaccgatc tcgcgctgcg ctacgcgaac gggccgaacc gcggcagcgt cggcgtgttc 







1621
tacagccgct tgcggaacta cctcgccgaa tacgacaccg ggcggctcgt cgacgacgaa 







1681
ggcgcgcccg tggccccggg gggcgacgag acgctgcgcg aggccgtcta tcgcggcgtg 







1741
cgcgcggagt tctacggcgt cgagctcgag ggcaagtggc gcgcgttcga gaagcgcggg 







1801
catcgcatcg atctggagct gggcgccgac tacacgcatg cgcgcaacgc ggacacgggc 







1861
gagccgctgc cgcggatcgc gccgctgcgc gcgacgctcg cggccgatta cggctacggt 







1921
ccgttcggcg cgcgcgcgca gctcacgcat gcgtgggcgc agcatcgcgt gccggaaaac 







1981
gatctggcga ccgacggcta tacgtcgctc ggcgtcgtgc tgacgtacaa gttccgcgtc 







2041
ggcgcgacga actggctcgc gtacctgcgc ggcgacaacc tgaccaacca ggatatccgc 







2101
tatgcgagct cagtcgtgcg caacatcgcg ccgcagggcg ggcgcagcgt gacggtcggc 







2161
atgcgcacga cgttctga 









24
1
miryivslsa rdsardsvhp entapitamd hhrriappsa rrlhplslll aaslahgetg 







61
epagrpssap patalapifv tanplgasal tsptaslsgd altlrradsl gdmlnglpgv 







121
stttygplvg rpiirgmdgd rirllqngva aydasslsyd havpqdplsv erveivrgpa 







181
allyggnavg gvvntidnri preavtgvtg aldasyggan naragaalve ggngrfafhl 







241
dafgretdel ripgyahsar qrardgedas epygklpnsd grryggaagg sytwadgyvg 







301
asysgyesny gsvaetdarl rmrqervaia searnlrgpf sqlkfdfgyt nylhreiedg 







361
vtgttfrnhg yearvearhr klgpfegalg vqvgqntfsa lggealaptt rttsvalfgl 







421
eewqatdalk lsagariehv rldpsangdd kfgvarsrdf nagsysagal yqlapawsva 







481
gnvsyterap tfyelyangp hgatgqylig lpdarkekav stdlalryan gpnrgsvgvf 







541
ysrlrnylae ydtgrlvdde gapvapggde tlreavyrgv raefygvele gkwrafekrg 







601
hridlelgad ytharnadtg eplpriaplr atlaadygyg pfgaraqlth awaqhrvpen 







661
dlatdgytsl gvvltykfrv gatnwlaylr gdnltnqdir yassvvrnia pqggrsvtvg 







721
mrttf 





425
43
83720836
YP_441236
50
1
atgcgcgtcg aacgggttcc ataccgctta atcactgtcg cgacggctgc cgtttttctg 







61
gccgcgtgcg gaaaaaaaga atcggctccg ccccctcaaa cgcccgaagt cggcgtcgtc 







121
accgtccagc cgcaagccgt gccggtcgtc tccgaattgc cgggccgcac gagcgcgtat 







181
ctggtcgcgc aggtgcgcgc gcgggtggac ggcatcgtgt tgcggcgtga gttcacggaa 







241
ggcagcgacg tcaaggccgg ccagcgtctg tacaagatcg atcccgcccc gtacatcgcg 







301
cagttgaaca gcgcgaaggc gacgctcgcg aaggcgcagg cgaatctcgc gacgcagaac 







361
gcgctcgtcg cgcgctacaa ggtgctcgtc gccgcgaacg cggtcagcaa gcaggactac 







421
gacaacgcgg tggccacgca agggcaggcc gccgccgacg tggcggccgg caaggccgcg 







481
gtcgacaccg cgcagatcaa cctcggctat acggacgtgg tctcgccgat cacgggccgc 







541
gtcggcatct cgcaggtcac gccgggcgcg tacgtgcagg cgagccaggc gacgctgatg 







601
tcgaccgtcc agcagctcga tcccgtctac gtcgacctca cgcagtcgag cctcgacggc 







661
ctgaagctgc gccaggacat ccagagcggg cgcatcaaga cggagggtcc gggcgcggcg 







721
aaggtcacgc tgatcctcga ggacggcaag gcgtactcgg agccgggcaa gctgcagttc 







781
agcgacgtca cggtcgacca gacgacgggc tcggtcacga ttcgcgcgat cttccccaac 







841
aagcagcgcg tgctgctgcc gggcatgttc gtgcgcgcgc gcatcgaaga gggcgtcaac 







901
gacaacgcgt tcctcgtccc gcagatcggc gtcacgcgcg atccgaaggg ccaggcggtc 







961
gcgatgatcg tcgacggcaa gggcaaggtc gagccgcgcg tgctcacgac gagcggcacg 







1021
cagggccaga actgggtcgt cgaaggcggc ctgcaggcgg gcgaccgcgt gatcgtgcag 







1081
ggcatcgaca aggtgcgccc gggcatgacc gtgaagaccg ccgaagctca attgccggcc 







1141
gcggcgtccg gcgcgtccgg cgccgcgccc gcgggcggct cgcccgccca ggccgcggcg 







1201
gcgagcgcgg ccgcatcggg tgccgcgccg tcgagcgctg ccgccgcgtc gagcgcgaaa 







1261
taa 









25
1
mrvervpyrl itvataavfl aacgkkesap ppqtpevgvv tvqpqavpvv selpgrtsay 







61
lvaqvrarvd givlrrefte gsdvkagqrl ykidpapyia qlnsakatla kaqanlatqn 







121
alvarykvlv aanayskqdy dnavatqgqa aadvaagkaa vdtaqinlgy tdvvspitgr 







181
vgisqvtpga yvqasqatlm stvqqldpvy vdltqssldg lklrqdiqsg riktegpgaa 







241
kvtliledgk aysepgklqf sdvtvdqttg svtiraifpn kqrvllpgmf vrarieegvn 







301
dnaflvpqig vtrdpkgqav amivdgkgkv eprvlttsgt qgqnwvvegg lqagdrvivq 







361
gidkvrpgmt vktaeaqlpa aasgasgaap aggspaqaaa asaaasgaap ssaaaassak 










BpK96243-500












ID
MW
Protein GI#
Protein Locus#
SEQ ID NO:
Sequence
















501
81
53719389
YP_108375
76
1
ttaccagttg tatttcgcgg tcgcgatcac ggtgcgctgg ttgccgtaca tgcacaccgc 







61
atccgactgg cagccggcca cgtagcggcg gttgaacagg ttcgtcgcgt tgagcgcgaa 







121
acgccagttg cgcagctcgt agtgcagcgc cgcgtcgaac agcgtgtagc tcggcaccgt 







181
gagcgaattg tcggccgcgc ccgccgccgc gctcatgtag cgcacgcccg cgccgacgcc 







241
gaagcccgtg agcggcccgg tgcgccacgt ccaatcggcc cacagcgacg cgatctggcg 







301
cggccgcggc acgtcgacgg gccacttgtt cagcgtgttg tcgttcgccc gcacgttctt 







361
cacgtcctga tagacgtacg cggcgatcac cgacagctcg cgcgacaggt tgcccaccgc 







421
gctcagctcg acgccgcgcg agcgcacctc gcccacctgc acgaacgtgc tgctcgtcgg 







481
gtcgttcgga ttgctcatcg cgacgttcgt ctggttgatc tggtagacgg ccgcgttcag 







541
catcaggttc ttgccgggcg gctgccagcg caggcccgcc tcgatctggc ggcccttggt 







601
cggcgtcgcg agcccgccgc ccgcgagctt caggccgatc tgcggattga acgacgtcga 







661
atagctgaga tacggcgcga gcccgtagtc gccgagatac gtgaggccca cgcgcccgct 







721
gaacgcgtgg tcgttctgcc tgaagctcgc cgcgttcgcg atgtcgtcct gcgacgtgcg 







781
cgtccagtcc tgccggccgc cgagcgtgag cacccagcgc cgccacttga tctggtcctg 







841
cacgtacagg ccgaatgcgt tcagcgtcgt cttcgtgtcg gtgcgcgggt acgcgttcgg 







901
cccgctgaag atgtcggacg gaatcggcgt gtagacgggc cggtacaggt tcaggctcgg 







961
ccccttcgcg agccattcgc tatcggtcgt cgtctggcgg ttgtagtcga agccgaacag 







1021
cagcgtgtgc gaaagcgggc ccgtcgtgaa tttcgcctgc gcctggttgt cgacgtcgaa 







1081
gcggctgtag ttgaactgga acaggcccgc gtagcgcgtc atcgtcgcca tcgtcgggtc 







1141
tgcgtcgtcg aggccgccgc cgtagacgga cgcgtcgtcg agcgacaggt gcatccagcg 







1201
cacgttctgc cgcagcgtcc acaccgggtt cagcttgtgc tcgaactgat agccgagcga 







1261
ccactgcttc ttgcggtaat ggtcgaagtt cgcatcggcc gtgtacagat cgtccgagat 







1321
cgtgccgttc ggattcggca gcacggtgcc gcgcgacggc aggaagttgc tcgacgtgtc 







1381
gccccagtcc tgcaggtacg tcgcggcgag ggtgagcgac gtgtccgcgt tcggctgcca 







1441
tttgagcgac ggcgcgaacg acacgcgctg gtcggcgagc ggccccgtct gcgcgttgcc 







1501
gtcgcggccg acgccgacga tccggtacga cagcgtgccg tccttgccga tcgtgtcgcc 







1561
gatgtcgaac atcagctgct tgcgcgcgta gttgccgacc tgcacgccga gctcgcggat 







1621
gcgctcgccg ttcgcgagct tgctctgcac gtcgacgatc gcgccgggat cgccctgccc 







1681
gtacagcacc gacgtcgggc cgcgcagcac ggcgatgctg tcgatcatgt acggatcgac 







1741
gcgccagctc gacaggttga tcgtgttcgg cacctgcagc ccgtcgacga acacggtcgg 







1801
cgtgaagccg cgcagcgccg cgtaccagtc cgaacggttg tccgacccgt agctcgagaa 







1861
gcccggaatg tagcggaacg cctgattgat gtcggtcgcg cccgtcgcct cgatctgctg 







1921
cgcggtgacg acgttgatcg tctgtggaat ctcgatgatc ggcgtatccg ttttcgtgcc 







1981
cgtcatgctg cgccgcgcga cgaggccgac gctcgcgtcg cgctccgctg ctcccgacac 







2041
cgagatcgcg ggcaacgtgc cgcccgccgc ggtgtctgcc ttcgcctcgc cgcccggctg 







2101
ccgcgcgtcc gcccccggct gcgccgcctg tgcctgcgca tgacccgccg ccgccgcgta 







2161
gaacgccacg ccggccgcga tcgcacgcac gcgcgtgctg gttgcccact ccatcttccg 







2221
ttgctccaag ttgcaaagcg gccgcgaggg ccgcaccttc atcaa 









51
1
mmkvrpsrpl cnleqrkmew atstrvraia agvafyaaaa ghaqaqaaqp gadarqpgge 







61
akadtaaggt lpaisvsgaa erdasvglva rrsmtgtktd tpiieipqti nvvtaqqiea 







121
tgatdinqaf ryipgfssyg sdnrsdwyaa lrgftptvfv dglqvpntin lsswrvdpym 







181
idsiavlrgp tsvlygqgdp gaivdvqskl angerirelg vqvgnyarkq lmfdigdtig 







241
kdgtlsyriv gvgrdgnaqt gpladqrvsf apslkwqpna dtsltlaaty lqdwgdtssn 







301
flpsrgtvlp npngtisddl ytadanfdhy rkkqwslgyq fehklnpvwt lrqnvrwmhl 







361
slddasvygg glddadptma tmtryaglfq fnysrfdvdn qaqakfttgp lshtllfgfd 







421
ynrqtttdse wlakgpslnl yrpvytpips difsgpnayp rtdtkttlna fglyvqdqik 







481
wrrwvltlgg rqdwtrtsqd dianaasfrq ndhafsgrvg ltylgdygla pylsystsfn 







541
pqiglklagg glatptkgrq ieaglrwqpp gknlmlnaav yqinqtnvam snpndptsst 







601
fvqvgevrsr gvelsavgnl srelsviaay vyqdvknvra ndntlnkwpv dvprprqias 







661
lwadwtwrtg pltgfgvgag vrymsaaaga adnsltvpsy tlfdaalhye lrnwrfalna 







721
tnlfnrryva gcqsdavcmy gnqrtviata kynw 





502
55
53721329
YP_110314
77
1
atgaacaaga ccaacatcaa cgaacgcatc gcgcgcgtcg cgaagatcgc ggcggcgagc 







61
gggttgctcg tcgcgctgct cgccgcgtgc gcggtggggc ccgactaccg gcggcccgat 







121
gtcgcgacgc ccgcggcgtt caaggaagcg ccggcgctcg cgccgggcga gcaggccggc 







181
acgtggaagg cggccgagcc ggccgacgcc gcgcatcgcg gcgaatggtg gcgcgtattc 







241
ggcgatccgg tgctcgatgc gctcgaaacg caggcgcttg ccgcgaacca gaacctgaag 







301
gccgcggcgg cgcgcgtcga gcaggcgcgc gcggcgaccc gcgccgcgcg ctcgcagtgg 







361
ttcccgcaag tgggcgtggg cttcgggccg acgcgcgagg ggctgtcgtc ggcgtcgcag 







421
ttccagccgc agggaagcgg cccgacgaac gcgacgctct ggcgcgcgca gggcacggtg 







481
tcgtacgaag cggacctgtt cggccgcgtc ggccgcaacg tcgaagcgtc gcgcgcggac 







541
gaagcgcaga gccaggcgct gttccgctcg gtgcagctcg cgttgcaggc ggacgtcgcg 







601
cagaactact tcgagctgcg ccggctcgat tccgaccagg atctgtaccg gcgcacggtc 







661
gggctgcgcg aggaggcgct caagctcgtg cagcggcgct ttgccgaagg cgacatcagc 







721
gagctcgacg tgtcgcgcgc gaagaacgaa ctggcgaccg cgcaggcgga cgcggtgggc 







781
gtcgcgcggc ggcgcgccgc atcggagcat gcgctcgcga tcctgctcgg caaggcgccc 







841
gcggatttct cgttcagcga gacgccgatc gcgccggtgg tggtgcgcgt gccggcgggg 







901
ctgccgtcgg cattgctcga gaggcggccc gacatcgcgg cggccgagcg cgcgatgatg 







961
gccgcgaacg cgcgcatcgg cctcgcgaag tccgcgtact tcccgaagct cgacatcacc 







1021
ggcgcgttcg gctatgaggc ggcgacgctc ggcaacctgt tcctgtggtc gagccgcacg 







1081
ttcctgctcg ggccgttcgc gggcacggcg ctcacgctgc cgatcttcga cggcgggcgc 







1141
cgcagcgcgg gcgtcgcgca ggcgcgcgcg aagtacgacg aggaagtcgc gaactaccgg 







1201
cagcaggtgc tcgtcgcgtt ccgcgaagtg gaggacaacc ttgccgatct gcgtctgctc 







1261
gacgatcaga tccgcgcgca aagcgacgcc gtcaacgcgt cgcggcgcgc ggcgaagctg 







1321
tcgcgcacgc agtatcagga aggcgcggtc agctatctcg acgtgatcga cagcgagcgc 







1381
tcggtgctcg aatcgcaact gcaatcgaac cagttgacgg gtacgcaggc ggtatcgacg 







1441
gtcaacctga ttcgcgcgct cggcggcggc tgggggagcg atgctgcgct cggctcgcgc 







1501
gagccgggca agcaggacgt cgcgacgcgt tga 









52
1
mnktnineri arvakiaaas gllvallaac avgpdyrrpd vatpaafkea palapgeqag 







61
twkaaepada ahrgewwrvf gdpvldalet qalaanqnlk aaaarveqar aatraarsqw 







121
fpqvgvgfgp treglssasq fqpqgsgptn atlwraqgtv syeadlfgrv grnveasrad 







181
eaqsqalfrs vqlalqadva qnyfelrrld sdqdlyrrtv glreealklv qrrfaegdis 







241
eldvsrakne lataqadavg varrraaseh alaillgkap adfsfsetpi apvvvrvpag 







301
lpsallerrp diaaaeramm aanariglak sayfpkldit gafgyeaatl gnlflwssrt 







361
fllgpfagta ltlpifdggr rsagvaqara kydeevanyr qqvlvafrev ednladlrll 







421
ddqiraqsda vnasrraakl srtqyqegav syldvidser svlesqlqsn qltgtqavst 







481
vnliralggg wgsdaalgsr epgkqdvatr 





503
84
53722229
YP_111214
78
1
atgaagtcgc gttccgacga gttgaagctc ggaaaattca ccaccctctg cagcgtgctg 







61
gccgcgagcc cggcattcgc gcaggacgcc gcgccgcccg cggcgagcgc cgatcacgac 







121
aaggagctcg cgccgatcca gatcaagggc gccgccgaac acagctacaa agccgacttc 







181
tcgtcgtcgg tgaaattcac cgcgccgctc gtcgacacgc cgaaatccgt caccgtgatt 







241
ccgcaggaac tgatccacaa cagcggcgcg gccacgctca ccgaagcgtt gcgcaccgtg 







301
cccggcatca cgttcggcgc gggcgaaggc ggcaatccgc tcggcgaccg cccgttcatc 







361
cgcggctacg acacgcaggg cagcatgttc gtcgacggga tgcgcgacac gggcgcgacg 







421
acgcgcgaga tcttcaatac cgagcgcatc gagatcacga aaggctcgga cggcgcgtac 







481
ggcggccgcg gcggtgcggg cggcagcatc aacctcgtca cgaaggcgcc gcacctgggc 







541
acgacggccg aggcgagcgc ggggctcggc accgaccgct atcgccgctt caccgccgac 







601
ggcaactggc agttcgccga tcacgcggcg ttccgcctga acctgatgag ccacaacaac 







661
gacgtcgcgg gccgcgacgc cgtgaacaac gagcgctggg gcgtcgcgcc gtcgatcgcg 







721
ttcgggctcg gcacgccgac gcgcgtgagc gcgagctatt accacctgtc gaccgacgat 







781
ctgcccgacg gcggcattcc gtatttctac acgacggcga acaagccggc gaacgtcgat 







841
acgatctatc cggcgaacgt cgaccgccac aacttctacg gcctcgtcaa ccgcgacttc 







901
cgcaagacga cgacggacat cggcacgctg cgcatcgagc acgacatcac gccgtcgctg 







961
acggtgcgca acaccacgcg ctacacggaa tcgacgcagg actacatttg gacgcagccc 







1021
gacgacagcc agggcaacgt ggtcaacggc cgcgtctggc ggcgcaacaa caaccgcaat 







1081
agcgcgatca acagcatcgc gaaccagacc gagctgttcg gcgaattccg caccggcccg 







1141
ttcaagcaca gcttcacgac cggcatcgag ctgtcgcgcg aatggggcaa gcgcgattcg 







1201
tacagcgtcg cgaccggcac cggcaagatc tgccaacagg gcatcggcgc cgcgtcgggc 







1261
tacaactgca cgagcctggg gtcgccgaac ccgaacgatc cgtgggcggg ctcgatcacg 







1321
cgcaacaacg actacgcgca tgcgcgcacc gtcacgaaat cgctctacgg cttcgacacg 







1381
gtcgagctga gcaagcattg gcaggtgaac gccggcgtgc gcgtcgacga ctactcgacc 







1441
cgcttcaccg acacccgcgc gaacggcggc aagacctaca cgcgcgacga cacgctcgtg 







1501
aactggcagc tcggcctcgt gttcaagccc gcgcgcaacg gcagcgtgta cgcttcgtac 







1561
gcgacgtcgt cgacgcccgc gggcgcgctg ctcggcgaag gctcggagac gcagtcgctc 







1621
acgccgggcc gcggcggcgt cggcgcgaac gcggatcagc tcgcgccgga gaagaaccgc 







1681
agcatcgagc tcggcaccaa atggaacgtg ctcgacgaca agctctcgct cacggccgcc 







1741
ctgttccaga tcgacacgac gaacgcgcgc gtcacgctgc cgaacaacca gtacgcgatg 







1801
gtcggcaaca agcgcgtgca ggggctcgag ctcggcttcg cggggcagct cacgcgcgcg 







1861
tggcaggtgt tcggcggcta cacctacatg aagagcgaac tgcgcgacaa cggcaggaac 







1921
accgcggaca acggccatca gtttccgaac acgccgaagc acagcttcac gctgtggacg 







1981
aactacgacg tgacgccgaa attcaccgtc ggcggcggcg cgttctacat gtcgaaggta 







2041
ttcggcgata ccgcgaacct gcgcgccgtg ccgtcatact ggcgcttcga cgcgatggcg 







2101
cagtaccgga tcaacaagaa gctcgacctg cagctgaacg tgagcaacct gttcaaccgc 







2161
acgtacttcg atcaggcgta tccggcgcac tacgcgtcga tcgcgccggg ccgctcggcg 







2221
ttcgtcacgc tcaacgcgcg ctattga 









53
1
mksrsdelkl gkfttlcsvl aaspafaqda appaasadhd kelapiqikg aaehsykadf 







61
sssvkftapl vdtpksvtvi pqelihnsga atltealrtv pgitfgageg gnplgdrpfi 







121
rgydtqgsmf vdgmrdtgat treifnteri eitkgsdgay ggrggaggsi nlvtkaphlg 







181
ttaeasaglg tdryrrftad gnwqfadhaa frlnlmshnn dvagrdavnn erwgvapsia 







241
fglgtptrvs asyyhlstdd lpdggipyfy ttankpanvd tiypanvdrh nfyglvnrdf 







301
rktttdigtl riehditpsl tvrnttryte stqdyiwtqp ddsqgnvvng rvwrrnnnrn 







361
sainsianqt elfgefrtgp fkhsfttgie lsrewgkrds ysvatgtgki cqqgigaasg 







421
ynctslgspn pndpwagsit rnndyahart vtkslygfdt velskhwqvn agvrvddyst 







481
rftdtrangg ktytrddtlv nwqlglvfkp arngsvyasy atsstpagal lgegsetqsl 







541
tpgrggvgan adqlapeknr sielgtkwnv lddklsltaa lfqidttnar vtlpnnqyam 







601
vgnkrvqgle lgfagqltra wqvfggytym kselrdngrn tadnghqfpn tpkhsftlwt 







661
nydvtpkftv gggafymskv fgdtanlrav psywrfdama qyrinkkldl qlnvsnlfnr 







721
tyfdqaypah yasiapgrsa fvtlnary 





504
83
53721278
YP_110263
79
1
tcagaaatcc accttcatgc tgaccgcgac gctgaggccc ggggcggtgt atgcgtcgag 







61
cacgttcgaa tcggcggcga tgccgcgcac gtccgaccag ttccagtact tgaggtcgaa 







121
caggttgtga atgccgaggt aggcgctcac gtgcttgttg aagaggtagc cgccgcgcag 







181
atcgacgacg aacgacgacg gcggcgtgaa gcaggtcttt ttctggcacg cggccgacga 







241
cacgtcgcgg ccgcgcttgc ccgcctgcca cagcaggtcc gcctgcgcga accagcgctc 







301
gctcggctcg tagcgcacgc cgaacacggc ggagaacggg ttgaccgtat cgagcggctc 







361
gctcgccgcg ccgttgtcct gcgtcgtgcc cttcgtgaac gccatcgccg tcttcagcgt 







421
gaagccattc ggcatcaccc attcggcgcg tccctcgaag ccgtgaatgc gcgcgttcgc 







481
gaagttcacg tactggaaca cgagcggatc gttcggccgg ccactgccgc ccaccgtgag 







541
ctgagagatg aagttgagat agaggccggc gaaagaggcg acgctgtagc gcagaggccc 







601
gtagccggtg ccgagcgtgc cgcgcaggcc cgcttcgagc gtgtcgctcg tctcgggctt 







661
caggttcgga ttgccgatcg acgtatagcc atagatcgga ttcgagaagc tgttgttgat 







721
ctgatcgggc gtcggcgtgc gaaagccgtg cgcatactgc gcgtagggaa tcagcgcggg 







781
cgacacttca tagagcatcg cgacgcgcgg cgacagctcg tgatcgctcg agctgaccgt 







841
cttgccggtg aacagcggat cgcccgagct cgggctgagc cgatacgcgt cgaagcgcag 







901
gcccggcgtg acgagcagct tgccgaagcc gatctgatcc tgcacgaacg cgccgaacag 







961
cgagtagtcg gtgtccggaa acgccttgtt cgggaacgat tcgccggggc tcgcgacggt 







1021
gccctcgcgc aggctcttga tgaggtcgat gctgccgtcg acgccgtaca gcagcttgtg 







1081
cgcgagcggc ccggtcgcga agccgctctc ggcgaacgcg gagccgccga gcgcgcgctc 







1141
gctgtagtgg ttcgagcgcg aacgcgattg gagccgcccg ccgcgcgtct cgaacgcgtc 







1201
ctgatcctgc gtcgactcct gatagtagaa ctgcacgtgc gcggtctgga acccgcgcgc 







1261
ggcggcgtcg cgcaaatcgt agtcgatact gaagaggttg cgctcgagcc tgtcgttcgc 







1321
ggtgaggccg agcgttgtcg gcggattgat cgccgacagc acgttcgtgt cgatccgccg 







1381
ccgcaccgtt tcggcggcga gcttgatcgt gtcgcgcgcg gtcggcgtga tcgtcagctt 







1441
gccgagcagc gattccgtgt agacatcctg aggattcgcg cgcgtgcgcc gcgtgctcgc 







1501
ggaattgtcg tcgccgtgag tgtcgagttc gtggccgcgg cggccggacg cgatcagcat 







1561
cgcctgcacg cgatcgttgc cgcccgccgc cgtcacggtc gcgccgatgc tgaggtcgac 







1621
cgagtcgtag ctcggccgga acgagaaata ggtctttttt cgatggatcg acagcagatc 







1681
ggacggatct ttcgtgatga agttgacggc gcccgtcagg ccgtcgctgc catacagcgc 







1741
ggacgccgga ccgcgcagga tctcgatgag cgcgagcgtg tcgagatcgg cgtaatcgcc 







1801
gcggccggct tcgagagggc cgaacgagaa cgcgctcggc agccggatgc cgtcttccat 







1861
cagcagcacg cggttgcctt cgaggccgcg gatgttgatg ctcgaatcgc cgtcgcggcc 







1921
gccgccgagc gcggcgttcg ccgggcggta agaggtgcgt cgcacggtga cgcccggctc 







1981
gtagcgcagc gcgtccttga tgttgtcggc ctgctgctcc tcgaggtctg aatcggtgat 







2041
caccgatacg ctcgcagagg tgaggctcgc ggccgacgcg gtgcgcgtcg cggtgacggt 







2101
gatcggatcg agtgccggcg cgtccttgag tgaggcggcg atcgccgcgt gccgcacctg 







2161
cgacgtggat gcggcggatg cggcggccgc gggttcggaa gcggggccgg cggcgcgcgc 







2221
ggccgccgca tagaggccga aggccccgaa cagcgcggcg cgaagcggcc gccgcgccaa 









54
1
marrplraal fgafglyaaa araagpasep aaaasaasts qvrhaaiaaa rkdapaldpi 







61
tvtatrtasa asrtaasysv itdsdleeqq adnikdalry epgvtvrrta yrpanaalgg 







121
grdgdssini rglegnrvll medgirlpsa fsfgpleagr gdyadldtla rieilrgpas 







181
alygsdgltg avnfitkdps dllsihrkkt yfsfrpsyds vdrsigatvt aaggndrvqa 







241
mliasgrrgh eldthgddns astrrtranp qdvytesllg kltitptard tiklaaetvr 







301
rridtnvlsa inppttlglt andrlernrf sidydlrdaa argfqtahvq fyyqestqdq 







361
dafetrggrl qsrsrsnhys eralggsafa esgfatgpla hkllygvdgs idrikslreg 







421
tvaspgesfp nkafpdtdys lfgafvqdqi gfgkllvtpg lrfdayrlsp ssgdplftgk 







481
tvsssdhels prvamlyevs palipyaqya hgfrtptpdq innsfsnpiy gytsignpnl 







541
kpetsdtlea glrgtlgtgy gplrysvaaf agryrnfisq rtvggsgrpn dplvfqyvnf 







601
anarihgfeg raewvmpngf tlktamaftk gttqdngaas epldtvnpfs avfgvryeps 







661
erwfaqadll wqagkrgrdv ssaacqkktc ftppssfvvd lrggyrfnkh vsaylgihnl 







721
fdrkywnwsd vrgiaadsnv ldaytapgrs vaysmkvdf 





505
81
53721627
YP_110612
80
1
atgcggtacg gggtggcggg acctgagaga tcgcggcgcg gcgccgcgcc ggtcgggcgt 







61
ttcgggggaa ggcggtggcg ctcgtcgctc tgcctgggcg tgctgatcgc ggcccacccg 







121
ggcgcggcgg cgttggccgg cacggccgcg aagagcgacg cgcaaaagaa cgacatgcaa 







181
aggaacgaaa acgttcggga agacacccgg cggctcgccg ccggcgcgcc gcctgccggc 







241
ggcgagctga aggcgatttc ggtgagcgcg ccgaaggacg cggcggacga tccgtcggtc 







301
gcgaccgtcg gcaagatgcc gctcgcgctg cgggagatcg cgcaatcggt cagcgtgacg 







361
acgcgcgagc ggatcgagca gcagaacctg ttcagcctcg acgacgtgat gcagcaatcg 







421
gccggcgtga ccgtgcagcc gtacgtgctg ctgacgaccg cgtacttcgt gcgcggcttc 







481
aaggtcgatt cgttcgagtt cgacggcgtg cccgtcgtgc tcggcgacat ggcgagcgcg 







541
ccgcaggaca tgtcggctta cgagcgcgcg gagatcctgc gcggcgcgaa cggcttgctg 







601
cacgggtcgg gcaatccggc cgcgaccgtc aatctggtgc gcaagcgccc gcaacaccgg 







661
tttgccgcga gcgcgagcgt cggcagttgg ggccgctatc gcgcgcaggc cgacatcggc 







721
gggccgctca acccggcggg cacggtgcgc gggcggctcg tcgccgcgta cgaggatcgc 







781
ggctttttct acgaccgcgc gaagcaggat acgcgctcga tctacggcat cgccgagatc 







841
gacgtggcgc gcgacacgct cgtcaccgtc ggcgcgcaat accagtcggt cgcgtcggtg 







901
cccgacatgt ccggcgtgcc gatggcgcgc gacggcacga gcctcgggct gccgcgatcg 







961
acgttcctcg acgtcggctg gggccatttc gactgggata cgacgcgcgt gttcggctcg 







1021
gtcgagcgca agctcggcgg gggctggaag gcgaaggtga gcggcgagta tcaggcggtg 







1081
cgctccgatc tgaaatacgc gggcgcgtac ggcgcgatcg atcgggccac gggcgcgggc 







1141
ggcacgctga tgggcagcgc ataccagttc agcagctaca gccgcagtgt cgatgcgaac 







1201
gtgcagggcc cggtgcgcgc gttcggcctc acgcatgagc tgctgttcgg cgcgacctac 







1261
gcgagcagca gcagcgggca gctcaccgcg ccgctgctgg cgggggccgg cacgccggtg 







1321
aacgtgtacc gctggaaccc ggacggcgtg ccggagccgg gcgtcgggcc gtaccggcag 







1381
gacatgcaga acgacatctc gcaaaagggc gtctacggcc tcggccgcat caagctcgcg 







1441
cagccggtca cgctcgtgct gggcgggcgc ctgagctggt ggaaccagga cagcctcggc 







1501
gcgcactaca acgcgggccg ccagttcacg ccgtatggcg ggctgatctg ggatgtcgcg 







1561
cgcgactggt cgtggtacgc gagctacgcg gaggtgttcc agccgcagac caagccgacc 







1621
tgggacggcg gcatcctgac gccggtgaag ggccgcactt acgagacggg catcaagggc 







1681
gagctggcgg gcggcaagct caacgtatcg ctcgcggcgt ttcggatcga tctcgacaac 







1741
aatccgcgga tcgacgccac gcgtccgtgc gccgggcgca gctgctacta cgtgaacggc 







1801
ggcagcgtgc gcagccaggg gttcgaattc gaggcgaacg gacgcatcac gccgtggtgg 







1861
agcgtgtggg cgagctacac gttcgacacg atccgctatg cgaaggatgt cgccaacggc 







1921
ggcgcgttcg cggcggagct cacgccgcgc cacctgttgc gcgtgtggac gaactacgac 







1981
ctgccgtggc aggagcgccg ctggagcgtc ggcggcggcg tgcaggtgca gagcgatttc 







2041
tcggcggcgt cgagcggcgt cacgatgcgc cagggcggct acgcgctcgc gagcgtgcgg 







2101
ctcggctatc gctacgacag gcactggcgc gccgcgctga acatcaacaa cctgttcgac 







2161
cgcacgtact accagagcct cagccagccc ggctggaaca atcgttatgg ggagccgcgc 







2221
aacgtgatgc tgaccgtgcg cgggcagttc tga 









55
1
mrygvagper srrgaapvgr fggrrwrssl clgvliaahp gaaalagtaa ksdaqkndmq 







61
rnenvredtr rlaagappag gelkaisvsa pkdaaddpsv atvgkmplal reiaqsvsvt 







121
trerieqqnl fslddvmqqs agvtvqpyvl lttayfvrgf kvdsfefdgv pvvlgdmasa 







181
pqdmsayera eilrgangll hgsgnpaatv nlvrkrpqhr faasasvgsw gryraqadig 







241
gpinpagtvr grlvaayedr gffydrakqd trsiygiaei dvardtlvtv gaqyqsvasv 







301
pdmsgvpmar dgtslglprs tfldvgwghf dwdttrvfgs verklgggwk akvsgeyqav 







361
rsdlkyagay gaidratgag gtlmgsayqf ssysrsvdan vqgpvrafgl thellfgaty 







421
assssgqlta pllagagtpv nvyrwnpdgv pepgvgpyrq dmqndisqkg vyglgrikla 







481
qpvtlvlggr lswwnqdslg ahynagrqft pyggliwdva rdwswyasya evfqpqtkpt 







541
wdggiltpvk grtyetgikg elaggklnvs laafridldn npridatrpc agrscyyvng 







601
gsvrsqgfef eangritpww svwasytfdt iryakdvang gafaaeltpr hllrvwtnyd 







661
lpwqerrwsv gggvqvqsdf saassgvtmr qggyalasvr lgyrydrhwr aalninnlfd 







721
rtyyqslsqp gwnnrygepr nvmltvrgqf 





506
88
53720334
YP_109320
81
1
ctagaagcgc agcgtcgcga cgagctcgac cgagcgcgac gggccgagca gccattgcgc 







61
gccgccgttc gacgtcgacg ccgcgtacgt gcggttcgcg aggttgcgcg catagaacgc 







121
gaggtcgacg tcgcgcgtcg cccgccacgc gagcgacgcg tcgaacaccg tgtaagaagg 







181
caccggcacg cggttcgcgt cgtcgccgta gcggggaccg acgtagcgca ggcccgcgtt 







241
cgcgcgccag ccgggcgcga acgcccagcc gatccacagg ttcgcgctct gctgcgcgat 







301
gtcgtgcggc acgttgccgg cgcgctgcac gaccgtgtcg ccgacgcgct ggtcgaacgc 







361
gtcgtagcgc gcgcgcagca acgccgcgtt tgcgtcgacc gtcacaccgc ccggcagccg 







421
cacgccgccc gcgagctcga cgccgcgcga cgactggcgg ccgatctgct gcgcgcgcgc 







481
cggattcagc ggatcgacgc tgacgaggcc gcgcttgacg atgtcgtaga cggcgagcgt 







541
ccagtacgcg cgagcgccgt cgatctcgtg cttgatgccc gcttcccatt gacgcccggt 







601
cgcgagcgtg tagttcgcct gcgacgccga cagcgtgacg agcgagccca cgccctgcgc 







661
acccgtcgtg tactgcgcgt acgcggcgag gcccggcgcg atctcgtaga cgagcccgct 







721
gcgccagccg gtgtgcgcga acgtcttgtc gaacgcgccg cccgcgatca gatcgtcgcg 







781
atgaaacgac agatggtcgt aacgcagccc gctcacccac gcgagccgcg gcaggacctc 







841
gagccggttc tcgatgaacg ccgcggcctg atgcgtgcgc gtgcgaaagc gcggcagcgt 







901
cggatcgggg ctcgcgaacg cgccgggatc gaagccggcg gcggcgacgg tcgattcgcc 







961
gcgatacggc gcgttgttcg cgccgtcgaa cgcgatctgg ttgaactcgg cgccgagcac 







1021
gagccggttc gcgcggccga acacgcgcga gtcgatgcgt gcggtgaagc gctcgccgaa 







1081
ctgccgctcg cgatggaaga tctcgagata gtcgctgcgc gcgacggtgc gcgccgccgg 







1141
gtcgagcgca tacgattccg cgttgcgcca atggcggcgc gtcgcgagat agtagagctg 







1201
ggcatcgagc gtcacgccgt tgcccgcgcg ataggtcgcg gcaagccgcg tccaggtgtc 







1261
gtggtatgcg atcgtcgcgt cgccgacgtt gtagttgcgc tcgcgcagcg cgcgctcgag 







1321
cacgccgttc gtcgccggca cgccgaaata ggtcgccggc ttctggcggc cgtagtcgta 







1381
atcgagcgtg atcgacagac gcgaatcgac atcgagcttc agcgcgccgc cgacggccgt 







1441
cacgtgcgcg tcgccgcgct cgacgaagcc gcgcgtgcgc tcgtcgctca gatggaaacg 







1501
ataggaaaga cgcgcgccga gcgcgcccgt cgtatcgagc gccacgcgct tttcgccttg 







1561
cgtgccgatg cccgcctgca gcgtcgtcga gcgttcgcgg cgcgggcgcc gggtcacgac 







1621
gttggcgacg ccgccgatcg ccccttcgcc gtacagcacc gacgccggcc cgcgcagcac 







1681
ctcgatgcgc tcggcggacc acgtcgagaa cgggaacgtg acggtgcccg cgccggggta 







1741
gaggcgcacg ccgtcgacaa gcgtcgtcac cgattcctgg ccggcgaagc cgcgcacgct 







1801
gagcgccgtg ccgccgttgc cgggcgccgc ggccgtgctg aagcccgtcg cgcgcgtgac 







1861
ggcgtcgacg atcgtgcgat cgccgcgcgc ggcgatctgc gcggaggtga tcgcctcgac 







1921
gctcgcgggc gtgtcgaggc tcgcgagctt caggcgcgag ccggccgcga gcggcgtaga 







1981
cagcggcgac gcgtgcgccc cgtcgctcgt cacgcgcacg gtcggcagtt cgcgatgcgg 







2041
gccgcgcgcg gcgtcggccg gttccgcctc ggcggcgagc gcgccggacg ccgtcaccgc 







2101
gcacgcgagc gcggtgacgc agcactcgac ggccgtcgag cgacgcgacg cggcggcgaa 







2161
gcacgcgtgc acgccatgcg cgacaggcgc gccgtgagcg gcgcgagcag ccgccgcgaa 







2221
cccgcgcccg aacgcgcgac ggcgcgacgg tgcatgcggc gcccggctca t 









56
1
msraphapsr rrafgrgfaa aaraahgapv ahgvhacfaa asrrstavec cvtalacavt 







61
asgalaaeae padaargphr elptvrvtsd gahasplstp laagsrlkla sldtpasvea 







121
itsaqiaarg drtivdavtr atgfstaaap gnggtalsvr gfagqesvtt lvdgvrlypg 







181
agtvtfpfst wsaerievlr gpasvlygeg aiggvanvvt rrprrerstt lqagigtqge 







241
krvaldttga lgarlsyrfh lsdertrgfv ergdahvtav ggalkldvds rlsitldydy 







301
grqkpatyfg vpatngvler alrernynvg datiayhdtw trlaatyrag ngvtldaqly 







361
ylatrrhwrn aesyaldpaa rtvarsdyle ifhrerqfge rftaridsrv fgranrlvlg 







421
aefnqiafdg annapyrges tvaaagfdpg afaspdptlp rfrtrthqaa afienrlevl 







481
prlawvsglr ydhlsfhrdd liaggafdkt fahtgwrsgl vyeiapglaa yaqyttgaqg 







541
vgslvtlsas qanytlatgr qweagikhei dgaraywtla vydivkrglv svdpinpara 







601
qqigrqssrg velaggvrlp ggvtvdanaa llrarydafd qrvgdtvvqr agnvphdiaq 







661
qsanlwigwa fapgwranag lryvgprygd danrvpvpsy tvfdaslawr atrdvdlafy 







721
arnlanrtya astsnggaqw llgpsrsvel vatlrf 





507
81
53722763
YP_111748
82
1
tcagagcttg gtgctcaaac gaacccacgc ggtacggccg ggttcggtga caggcagatt 







61
cgccggatag ccgaaaccgg cgttgcccgc gaggttcagg tgctccgcat aagccttgtc 







121
gagcacgttg tcgacgccga ccgagatctg caccgtcttg ctcacgttgt actgcgcgtg 







181
cagcgacagc acgccgaaac cggcgctcgg accaaagtcc ttgccgacga cgttgccctc 







241
gttcagcgcg tagcgatgct gcggcgcaac gacccgccac agcccgcccg ccgaccacgg 







301
cccgcgagtg tactcgacgc cgaagcgtgc ctcgagcggc ggcatctgcg gcagcggcgc 







361
accgctttgc acgttgcgcc cccacgcata cgcgagcgac ccttcgaagc gccacggcgc 







421
gagcggacgc cacgacgcgc ccacctcacc gcccatgatc tgcgcgttga cgttcgtcgc 







481
ctgcgtgatc tgtcccatcg ggcccgtcgc atagtcgaac aggatgaagt cctgcacgta 







541
gcccgcatag gcggacaccc aggcgtcgag cttgtcgctc ttgtactgcg cgccgatgtc 







601
gagctgcgtc gtcttctcgg gcttgatcgc ggagaatgcg ttgatcgaac cgttcgggcc 







661
gcgcttggcg gagaacagct cccagtaatc agggaagcgc tgcgcatggc cgatgcccgc 







721
gtaccacgtg acgggcagcg acgcgagatc acgctcgtag cgcacgaagc cgctcggcag 







781
cacgcgcgag cgaagatcgt cgaacgtcgg attgcgcatg ctcatcttca tgccgcccgt 







841
cgtcgcgcgc ttgtcgcgcg cggccgcata gtcgatccgc gcgccgccga tcacgcgcga 







901
cgcatcgctc gcataccagg tcagctcgcc gaacgcgccc gcgttccaca tgttcgcctg 







961
cggattccac ggcttgtcgc cgtagttctg catccccatc gccgagcgcg agtcgaggcg 







1021
gttcgactgc gcatcgacgc ccgtcacgag cttgaacgcg tcggtcaagc gcagcgtcgc 







1081
cgcgacgcgc gcgccgagcg tgcggcggcg cacttcggag gccatgcgca tcggcatgct 







1141
gctcgtcgga tcgggcatcc gcaacgtgta gttgtccatc acgtgatcgg cttcgttgta 







1201
gaagacctgc gcatcgatgc gatcgagcac gtcgccgatg tgcttcttgt cgaacttcag 







1261
accgaacgtc tcgcgccgga aatgcgcgcc gtccattccg cggcccgcat agcgcgcgta 







1321
gccgtcgccc gtgcccgccg tcagctcgag ccgcgtgttg tcgtcgggcg tccagccgag 







1381
cgccgcgtcc gcgttccact tgtcccattg cgacggcacc gtgcggccgt tgccgtcctc 







1441
gtagtcctgc gagtgcgcat ggttcgcgct cacgcgcccg tagaagtcgg gcgtgccggc 







1501
cgtcacgtcg acgttctgat cgttgcgccc gaacgagccg ccgacgacgc tgccgtcaaa 







1561
gcgcatgccc ggcgtcttga agcgcggcgt cacgcgctcg aacagcaccg tgcccgccga 







1621
tgcgcccggc ccgtacagca cggtctgcgg ccccttgacg agcgtcacct tgtcgtagct 







1681
ctcgggcgcg atgtacgacg tcggcgcgtc catccggccg ggacacgcac cgagcgtcgg 







1741
catgccgttc gcgagaatgt tcagccgcga gccgaacatc ccgcgcagca ccgggtcgcc 







1801
gttcgtgccg ccgctgcgga tcgacgcgaa gccgggaatc gtcttcagat aatcggcgcc 







1861
gtcgctcgcg ggcagcggct ggcgcggcgt cttcggatcg gtgacgacga cgagcggcgt 







1921
cgattcgggc gccgcgacga tttcgatcgt cggcagcgtc gtggcgtccg gcgcggcgcc 







1981
ggtcgtcgtg cggctcgccg cggcgctcgc cgcttcgcga accgcgccgt ccgatttggc 







2041
gctcgccgcg tcggtgtcgg cgcggctcgc tacgccggac gcgccgagcg tgccgtggcc 







2101
gtgcacggcc tcgctcgtct gcgcgacggc cggcgcgagg tgaaaggcgc ccgcggcgag 







2161
cgcgggaacg gtaagcgtga tcgcgcgccg ccggcggcgc gcgtggtctc cacgcgcggt 







2221
gggcgcgtga cgcaggaatg tggatgtcat 









57
1
mtstflrhap targdharrr rraitltvpa laagafhlap avaqtseavh ghgtlgasgv 







61
asradtdaas aksdgavrea asaaasrttt gaapdattlp tieivaapes tplvvvtdpk 







121
tprqplpasd gadylktipg fasirsggtn gdpvlrgmfg srlnilangm ptlgacpgrm 







181
daptsyiape sydkvtlvkg pqtvlygpga sagtvlferv tprfktpgmr fdgsvvggsf 







241
grndqnvdvt agtpdfygrv sanhahsqdy edgngrtvps qwdkwnadaa lgwtpddntr 







301
leltagtgdg yaryagrgmd gahfrretfg lkfdkkhigd vldridaqvf yneadhvmdn 







361
ytlrmpdpts smpmrmasev rrrtlgarva atlrltdafk lvtgvdaqsn rldsrsamgm 







421
qnygdkpwnp qanmwnagaf geltwyasda srviggarid yaaardkrat tggmkmsmrn 







481
ptfddlrsrv lpsgfvryer dlaslpvtwy agighaqrfp dywelfsakr gpngsinafs 







541
aikpekttql digaqyksdk ldawvsayag yvqdfilfdy atgpmgqitq atnvnaqimg 







601
gevgaswrpl apwrfegsla yawgrnvqsg aplpqmpple arfgveytrg pwsagglwrv 







661
vapqhryaln egnvvgkdfg psagfgvlsl haqynvsktv qisvgvdnvl dkayaehlnl 







721
agnagfgypa nlpvtepgrt awvrlstkl 





508
81
53722050
YP_111035
83
1
ttgcacgcgc cgtgcatgcc gcaattcgcc ccgcaattcg ccccgcgccg cgtcgtcgcg 







61
cgcgtggccg ccgccgcgct gttctccgcg tgcgccgcgc cgggcgtatc gggcgcggcc 







121
gagccggccg ccgcgacgcg cgcgtacgac attccggcgg gcccgctcga cgcggcgctg 







181
acccgcttcg gccgcgaggc cggcatcctg ctgtcgttcc cgggcgagct gaccacgggc 







241
ctgcgaagcc ccgggctgca cggccgcgcg gatccggccg cggcgctcga ccggctgctg 







301
acgggcacgg ggctcgtcgc gctccggcag ccgagcggcg gctacacgct cgccaggctg 







361
cccgggccgg ccgccgccgg cgccgatgcc gcgctcgcgg ccgacacgac gctgcccacc 







421
gtcgccgtgc gcgcgagcgg cccgcacgcg gacagctacc ggccgccgcg cgaggcggcg 







481
gggctgcgca gcgacgcgcc gctcgccgag gtgccgcaag cggtcgccat cgtcgcgcag 







541
caggtgctgc gcgaccagcg cccgcgcaat ctcgacgacg cgctcgcgaa cgtgagcggc 







601
atcacgcagg gcaacacgct cggcagcacg caggacacgg tgatgaagcg cggcttcggc 







661
gacaaccgcg acggctcggt gatgcgcaac ggaatgccga tcgtgcaggg ccgctcgctg 







721
aacgcgacga cggacagcgt cgaagtgctg aagggcccgg cctcgctgct gtacggcatc 







781
atggacccgg gcggcgtgat caacgtcgtc acgaagcagc cgcagctcgc gcgccgccac 







841
gcgatctcgg cgctcggctc gacgtacggc ggcggcagga acggcggcga gctcacgttc 







901
gactcgacgg gcgcgatcgg cgagtcgcgc gttgcatacc ggctgatcgt cgatcagacg 







961
aacgaacagt attggcgcaa ctacggcgag caccgcgaga cgctcgtcgc gccttcgctc 







1021
gcgtggtacg ggcgcgatac gcaggtggtg ctgtcgtacg agtaccggcg cttcctgatg 







1081
ccgttcgatc gcggcaccgc gctcgatccg cgcacgaacg cgccgctcgc gatcccggcg 







1141
cgccgccggc tcgacgagcc gttcaacgac atgcgcggcg aatcgaatct cgcgcagctc 







1201
gcgatcgatc atcagctcgc gccggactgg aaggtgcacg tcggctacag ctacaaccgg 







1261
gagacctacg atgcgaacca gattcgcatc acggcggtcg atccgctcaa gggcacgctc 







1321
acgcgcagca acgacgcgac gcacggctcg cgcagcaccg acagctacgg catcgcgtac 







1381
gtcgacggcc gcgtgacgct cgccgggatg cggcacgacg tgcagttcgg cgtcgacggc 







1441
gaataccggc aggtctaccg cgcggacatg ctgcgccagc cgatcaagac gccgttcagc 







1501
tatctgaacc cgacgtacgg gctcgtcccg ccgtcgacga gcgtatcggc ctccgacagc 







1561
gaccagagcg acacgctgca caccgcctcg ctgttcttcc aggacagcat ccatctgagc 







1621
gagcggtgga tgctcgtcgg cggcgcgcgc tgggtccgct acagccagct tgccgggcgc 







1681
ggccggccgt tccaggtcaa cacgaacctg agcggcacga agtggctgcc gcgcgcgggc 







1741
atcgtctaca agtggaacga cgcgctgtcg ctgtacggca gctacacgca gtcgctgaag 







1801
ccgacctcga agatcgcgcc gatggcgggc ggctacgtga tcgacggctc gaccgcgccc 







1861
gaggaaggcg cgtcgtggga gctcggcgcg aagctcgaca tgccgggcgg cctcgccggc 







1921
acgctcgcgt tcttcgacat cgacaagaag cacgtgctcg tctcgcaata cgacgacgcg 







1981
acgaaccaga ccgcgtggcg cacgtccggc cgcgcgcgct cgcgcgggat cgagctcgac 







2041
gtgtcggggc ggatcggcgc gcgctggaac gtgatcgcga gctacgcgta catcgacgcg 







2101
aagacgaccg aagatccgct gtacgcgggc aaccggctgt ggaacgtcgc gcggcacacg 







2161
gcctcgctcg cggcggtcta cgacgtgggc acggtgctcg gcggcgacga cctgcgcatc 







2221
ggcgcggcgg ggcgctacgt cggcgcgcgc ccgggcgatt cggccaacag cttcacgctg 







2281
ccggcgtacg cgaccgccga cgcgttcgcg acctacgaca cgcggcttgg caagcagaag 







2341
ctgcagttcc agctcaacgt gaagaatctg ttcaaccgca cgtattaccc gtcgagcgtg 







2401
aaccggttct tcgtgtcggt gggcgacgcg cggcaggtgt cgctgctgac gacgctgcag 







2461
ttctga 









58
1
mhapcmpqfa pqfaprrvva rvaaaalfsa caapgvsgaa epaaatrayd ipagpldaal 







61
trfgreagil lsfpgelttg lrspglhgra dpaaaldrll tgtglvalrq psggytlarl 







121
pgpaaagada alaadttlpt vavrasgpha dsyrppreaa glrsdaplae vpqavaivaq 







181
qvlrdqrprn lddalanvsg itqgntlgst qdtvmkrgfg dnrdgsvmrn gmpivqgrsl 







241
nattdsvevl kgpasllygi mdpggvinvv tkqpqlarrh aisalgstyg ggrnggeltf 







301
dstgaigesr vayrlivdqt neqywrnyge hretlvapsl awygrdtqvv lsyeyrrflm 







361
pfdrgtaldp rtnaplaipa rrrldepfnd mrgesnlaql aidhqlapdw kvhvgysynr 







421
etydanqiri tavdplkgtl trsndathgs rstdsygiay vdgrvtlagm rhdvqfgvdg







481
eyrqvyradm lrqpiktpfs ylnptyglvp pstsysasds dqsdtlhtas lffqdsihls 







541
erwmlvggar wvrysqlagr grpfqvntnl sgtkwlprag ivykwndals lygsytqslk 







601
ptskiapmag gyvidgstap eegaswelga kldmpgglag tlaffdidkk hvlvsqydda 







661
tnqtawrtsg rarsrgield vsgrigarwn viasyayida kttedplyag nrlwnvarht 







721
aslaavydvg tvlggddlri gaagryvgar pgdsansftl payatadafa tydtrlgkqk 







781
lqfqlnvknl fnrtyypssv nrffvsvgda rqvsllttlq f 





509
27
53718641
YP_107627
84
1
ttactgcgcc gcttgcggcg cgcgcaggta gatctcgacg cggcggtttt gcgcgcggcc 







61
ggcttccgtt gcgttgtcgg cgatcgggtt cgacgcgccc atgccttgcg cggacaggcg 







121
gttcgccgcg acgccgcgct gcgcgagcgc gttcacgacg ctttgcgcgc gattttgcga 







181
cagcgtttga ttgtgcgcgg ccgagcccgt gctgtcggtg tagccgacga ccgacgcggt 







241
gatctgcggg ttctggttca gcgtcgtcgc gaggtcgttc agcagcggcg tgaacgccgg 







301
cgtgatcgcg tactggtcgg tcgcgaacgt gaccgagctg ggcacgttca gcttgagcga 







361
gccgtccggc tgttcggtga cctgcgtgcc cgtttgctgc gccgacggcg cgagcttgtt 







421
cttgatcgct tgccagttgt aacccgtcac gccgccgacg agcgcgccga cgccggcgcc 







481
gatcgccgcg cccttgccgc cgccagcgag cgcgccgatg cccgcgccga gcgcggcgcc 







541
cgtgccggtg ccgacagccg tgttggtgcc ttgctgggtt gcgcaacccg ccagcagggc 







601
gccggcgaga gcgaaaacgg acaagcgcgt cgcgattttg gtattcat 









59
1
mntkiatrls vfalagalla gcatqqgtnt avgtgtgaal gagigalagg gkgaaigagv 







61
galvggvtgy nwqaiknkla psaqqtgtqv teqpdgslkl nvpssvtfat dqyaitpaft 







121
pllndlattl nqnpqitasv vgytdstgsa ahnqtlsqnr aqsvvnalaq rgvaanrlsa 







181
qgmgasnpia dnateagraq nrrveiylra pqaaq 





510
19
53720375
YP_109361
85
1
atgatgtcga agaaacttcg tctggcgttc gcgatgctga tgatcggcgc gctcgcggca 







61
tgcaagtcgg gcgtgaagct cgacgagcac gcgaaccagg gcgacgcggt cagcacgcaa 







121
ccgaatcccg aaaacgtcgc gcaggtgacc gtcgatccgc tgaacgatcc gaacagcccg 







181
ctcgcgaagc gcagcgtgta cttcgatttc gacagctact cggtgcagga ccaataccag 







241
gcgctgctgc agcaacacgc gcaatacctg aagagccatc cgcaacgcca catcctgatc 







301
cagggcaaca ccgacgagcg cggcacgagc gagtacaacc tcgcgctcgg ccaaaagcgt 







361
gcggaagccg tgcgccgcgc gctgtcgctg ctcggcgtcg gggatgcgca gatggaagcc 







421
gtgagtctcg gcaaggaaaa gccggttgcg ctcggccacg acgaagcgtc gtgggcgcag 







481
aaccgccgtg cggacctcgt ctatcaacag taa 









60
1
mmskklrlaf amlmigalaa cksgvkldeh anqgdaystq pnpenvaqvt vdplndpnsp 







61
lakrsvyfdf dsysvqdqyq allqqhaqyl kshpqrhili qgntdergts eynlalgqkr 







121
aeavrralsl lgvgdaqmea vslgkekpva lghdeaswaq nrradlvyqq 





511
24
53720132
YP_109118
86
1
atgaataaac tttcaaagct cgcgttcatt gcagctaccg cagttatggc tgcatccgct 







61
tcggcacagt cggtgccggc gtcgcgacaa gccgtgaatg acaactgggt gaatggcacg 







121
ggcgaatggg tgtggatgaa cggcacgaac gagctctgct ggcgcgatgc gttctggacg 







181
ccggccaccg ccaacgccaa gtgcgatggc gcactggtcg cccaggcacc ggcaccggcg 







241
ccggtcgcac cggttgctcc ggccatcacg agccagaaga ttacgtacca agccgatacg 







301
ctgttcgact tcgacaaggc cgtcctgaag ccggccggca agcagaagct tgacgaactg 







361
gccgcgaaga tccagggcat gaacgtcgaa gtggtcgtgg ccacgggcta cacggaccgc 







421
atcggttcgg acaagtacaa cgaccgtctg tcgctgcgcc gcgcgcaagc cgtcaagtcg 







481
tacctcgtca gcaagggtgt cccggcgaac aaggtctaca cggaaggcaa gggcaagcgc 







541
aacccggtca cgggcaacac ctgcaagcag aagaaccgca agcagctcat cgcctgcctc 







601
gcaccggacc gccgcgtgga agtcgaagtg gtcggcacgc aggaagtgca gaagacgacc 







661
gttccggcgc agtaa 









61
1
mnklsklafi aatavmaasa saqsvpasrq avndnwvngt gewvwmngtn elcwrdafwt 







61
patanakcdg alvaqapapa pvapvapait sqkityqadt lfdfdkavlk pagkqkldel 







121
aakiqgmnve vvvatgytdr igsdkyndrl slrraqavks ylvskgvpan kvytegkgkr 







181
npvtgntckq knrkqliacl apdrrvevev vgtqevqktt vpaq 





512
40
53720170
YP_109156
87
1
atgaaaaaga ccctcatcgt tgccgctctt tccggcgttt tcgcaacggc cgctcacgcg 







61
caaagcagcg tgacgctgta cggcctgatc gacgccggca tcacctacac gaacaaccaa 







121
ggcggccaca gcgcatggtc gcaatccacc ggctcggtca acggcagccg ctggggcctg 







181
cgcggcgccg aggatctcgg cggcggcctg aaggcgattt tcgtgttgga aaacggcttc 







241
ggcatcaata acggcacgct gaagcagaac ggccgcgagt tcggccgtca ggcgttcgtc 







301
ggcctgtcgc acgagcaata cggcgcgctg acgctcggcc gtcaatacga cagcgtcgtc 







361
gactacctcg ggccgctgtc gctgacgggc acgcaattcg gcggcacgca gttcgcccac 







421
ccgttcgaca acgacaacct gaacaattcg ttccggatca acaacgcggt caagtacacg 







481
agcgtgaact gggcgggcct gaaattcggc gcgttgtacg gcttctcgaa caacaatcag 







541
ttcgcgaaca accgcgccta tagcgcgggc gtatcgtaca gctacgccgg cttcaacatc 







601
ggcgccggct acctgcagtt gaacaacaac ttcggcccga cggtctccaa cgcatccggc 







661
gcggtcgcgc tcgacaacac gttcgtcggc aagcgccagc gcgtgttcgg cggcggcctg 







721
aactacacgt tcggcccggc aacggccggc ttcgtgttca cgcaatcgcg cgtcaaccgc 







781
gcgacggcaa tcggcgcggg cgcatcgggc gtgtcgagcg gcattgcgct cgacggcacg 







841
ttcatgcgct tcaacaacta cgaagtgaac gcgcgctacg cgatcacgcc ggcatggacg 







901
gtggccggtt cgtacacgta caccgccggc ttcatcgaga accaccaccc gggctggaac 







961
caattcaacc tgcaaacggc ctacgcgctg tccaagcgca cggacgtgta cctgcaaggc 







1021
gtgtatcaga aggtcaacaa cgacggcacg ggcctcggcg cgtacatcaa cggtatcggc 







1081
ggcatgtcgt cgacggaaaa acagatcgcc gtcacggccg gcctgcgtca ccgcttctaa 









62
1
mkktlivaal sgvfataaha qssvtlygli dagitytnnq gghsawsgst gsvngsrwgl 







61
rgaedlgggl kaifvlengf ginngtlkqn grefgrqafv glsheqygal tlgrqydsvv 







121
dylgplsltg tqfggtqfah pfdndnlnns frinnavkyt svnwaglkfg alygfsnnnq 







181
fannraysag vsysyagfni gagylqlnnn fgptvsnasg avaldntfvg krqrvfgggl 







241
nytfgpatag fvftqsrvnr ataigagasg vssgialdgt fmrfnnyevn aryaitpawt 







301
vagsytytag fienhhpgwn qfnlqtayal skrtdvylqg vyqkvnndgt glgayingig 







361
gmsstekqia vtaglrhrf 





513
44
53721967
YP_110952
88
1
atgaaaaagc acgtcatttc cgcagccgca ttgctggcct tcgccgcgcc ggttttcgcc 







61
caaagcagcg tcacgctgta cggcgtgatc gacgaaggct tcaactacac gagcaacgtg 







121
aacgtcaacg gcgtcggcaa gagcaactac cagctcgcga gcggtttcgt gcagggcagc 







181
cgctggggcc tgcgcggctc ggaagacctg ggcggcggcc tgaaagccat cttcacgctc 







241
gaaaacggtt tcgacgtgaa caacggccgg ctcggccagg gcggccggat gttcggccgc 







301
caggcgttcg tcggcctgtc gcacgcgcaa tacggctcgc tcacgctcgg ccgccagtac 







361
gattcgctcg tcgactacct cgcgccgttg accgcgaacg gcaactgggg cggcacgctg 







421
ttctcgcatc cgttcgacaa cgacaacacg gacaactcgt tccgcgtgaa caacacggtc 







481
aagtacgcga gcccggactg gaacggtctg caagtcggcg gcacgtacag cttcagcaac 







541
gcgacgggct tctcgaacaa ccgccagtac agcatcggcg ccgcgtacac gctgggcggc 







601
ctgcaacttg cggccgcgta cctgcaggcg aacaacccgg gcaagacggc gggcggcgcg 







661
atcgccgaca acgacgcgaa cttcacggcg gaccgcctgc gcatcttcgg cggcggcgtc 







721
aactacacgt tcggcccggc gacggtcggt ttcgtctaca cgaagaccga cgtgaagaac 







781
ccggtctcga cggtctatct gccgacggcg acgttcgcgg gcctcggcct gagcgcgacg 







841
aagttccaga acttcgaaat caacggcaaa taccagctga cgccggcgct cttcatcggc 







901
gcgcagtacg tgtacacgga cggcaagttc gatgcggccg cgggcagcgt caagccgaag 







961
taccacacgg tcggcctgat ggcggactac aacctgtcga agcgcaccga cgtctatctg 







1021
cagggcgcgt accagaaggt cgcgggcgac aagacgggca cgatcgcgga tggcggctac 







1081
gtcgtcggga cggacggccc ctcggcgtcg gcgaaccagt tcgcggttcg cgcggcgatc 







1141
cgtcacaagt tctga 









63
1
mkkhvisaaa llafaapvfa qssvtlygvi degfnytsnv nvngvgksny qlasgfvqgs 







61
rwglrgsedl ggglkaiftl engfdvnngr lgqggrmfgr qafvglshaq ygsltlgrqy 







121
dslvdylapl tangnwggtl fshpfdndnt dnsfrvnntv kyaspdwngl qvggtysfsn 







181
atgfsnnrqy sigaaytlgg lqlaaaylqa nnpgktagga iadndanfta drlrifgggv 







241
nytfgpatvg fvytktdvkn pvstvylpta tfaglglsat kfqnfeingk yqltpalfig 







301
aqyvytdgkf daaagsvkpk yhtvglmady nlskrtdvyl qgayqkvagd ktgtiadggy 







361
vvgtdgpsas anqfavraai rhkf 





514
39
53721903
YP_110888
89
1
atgaacaaga ctctgattgt tgcagcagtt gctgcatcgt tcgcaaccgt cgctcacgcg 







61
caaagcagcg tcacgctgta cggtgtgctc gacgcgggca tcacgtacca aagcaacgtc 







121
gcgacgccgt cgggctcggg caagtcgctg tggtcggtcg gcgccggcgt cgaccaaagc 







181
cgtttcggtc tgcgcggctc ggaagacctg ggtggcggcc tgaaggcgat cttcacgttg 







241
gaaagcggct tcaacatcgg taacggccgc ttcaacaacg gtggcggcat gttcaaccgt 







301
caagcgttcg tcggtctgtc gagcaactac ggcaccgtca cgctgggccg tcagtacgac 







361
gcaacccaag actacctgtc gccgctgtcg gcaacgggca cctggggcgg cacgtacttc 







421
gcgcaccggc tcaacaacga ccgcctgaac acgaacggcg acgtcgcggt gaacaacacg 







481
gtcaagttca cgagcgcgaa ctacgccggc ctgcaattcg gcggcacgta ctcgttctcg 







541
aacaactcgc aattcgcgaa caaccgtgca tacagcgcgg gcgcttcgta ccagttccaa 







601
ggcctgaagg tcggtgcggc gtactcgcaa gctaacaacg ctggcgcgaa caccacgggc 







661
gcaacggatc cgctgaccgg cttcaacatc ggcggcacga acgcagcaag catccaaggc 







721
cgttcgcgcg tgtacggcgc cggcgcaagc tacgcttacg gcccgctgca aggcggcctg 







781
ctctggacgc aatcgcgtct cgacaacctg gcaaacggcg cgccgaccac ccgtgccgac 







841
aactacgaag caaacgtgaa gtacaacctg acgccggctc tgggtctggg tgttgcttac 







901
acgtacacga acgcgaaggc gaacggcgaa agcactcact ggaaccaagt tggcgtccag 







961
gccgactacg cgctgtcgaa gcgcaccgac gtgtacgcac aagccgtgta ccagcgttcg 







1021
tcgaagaacg cgaatgcgtc gatctacaac ggcgacctca gcacgccgtt cagcacgtcg 







1081
atcaaccaaa ccgcagcgac ggttggtctg cgtcaccgct tctaa 









64
1
mnktlivaav aasfatvaha qssvtlygvl dagityqsnv atpsgsgksl wsvgagvdqs 







61
rfglrgsedl ggglkaiftl esgfnigngr fnngggmfnr qafvglssny gtvtlgrqyd 







121
atqdylspls atgtwggtyf ahrlnndrln tngdvavnnt vkftsanyag lqfggtysfs 







181
nnsqfannra ysagasyqfq glkvgaaysq annaganttg atdpltgfni ggtnaasiqg 







241
rsrvygagas yaygplqggl lwtqsrldnl angapttrad nyeanykynl tpalglgvay







301
tytnakange sthwnqvgvq adyalskrtd vyaqavyqrs sknanasiyn gdlstpfsts 







361
inqtaatvgl rhrf 





515
55
53722699
YP_111684
90
1
atgaagaaac tcgctctgtc taccctctcg ctcgcgctgc tgggcgcagc cggtgcagct 







61
caggctcaaa gcagcgtcac gctgtacggc gtgattgata cgtcgatcac ttatgtccac 







121
ggcaacgacg gcaaggccaa caacgcctgg ctgatgggca gcggcaacct gcaaggcagc 







181
cgctggggcc tgaagggcac cgaggatctc ggcgccggcc tgaaggcgat cttccagttg 







241
gaaaatggct tcaactcgaa cgacggcacg ctcggccaag gcaagcgcat gttcggccgc 







301
caggcgttcg tcggtctgca gcatgaccag tacggcacgc tgacgctcgg ccgtcagtac 







361
gatccgctcg tcgacctcgt gcaaccggtg acggccgaca actacttcgg cagcctgttc 







421
gccaccccgg gtgacgtcga caacaacgac aacagcctgc gcgtgaacaa cacggtcaag 







481
tacacgtcgc ccgtgttcgc cggcttccag ttcgaagccc tgtacggctt cagcggcatc 







541
gcgggctcgc cgggccaagg ccagacgtgg tcggccgccg ccgcatacaa caacggcccg 







601
atcggcgtcg cggccggcta cttctacacg tcgaacccgt cgccgacggc cggctcgcgc 







661
tcgacgtgga gcggctcgtc cgacgcgatc ttcgacggcg cgatcaacag cggctacacg 







721
agcgcgaagt cgatcggcat cgcacaggtg gccggccagt acgtgttcgg cccggtcacg 







781
gtcggcctcg gctacagcaa cgcgcaatac aagccggacg gcttctcggg cttctcgtcg 







841
accgagaagt acaacacggg ccgcggcttc gtgacgtacc aggccacgcc ggcgctgctg 







901
ctcggcctcg gctacgcata cacgaaggcg agcggcgaca cggacgcgaa gtaccaccaa 







961
gtgtcgatcg gcgcggacta cgcgctgtcc aagcgcacgg acgtctacct cgccggtgcg 







1021
taccagcacg cgagcggcac gcagcgcgtc gatgcgacga cgacgcaaac ggcgcaggca 







1081
tcgatcggct cgtacggcgt gaacggtacg aagtcgcaag agatggtggc tctgggcctg 







1141
cgccacaagt tctaa 









65
1
mkklalstls lallgaagaa qaqssvtlyg vidtsityvh gndgkannaw lmgsgnlqgs 







61
rwglkgtedl gaglkaifql engfnsndgt lgqgkrmfgr qafvglqhdq ygtltlgrqy 







121
dplvdlvqpv tadnyfgslf atpgdvdnnd nslrynntvk ytspvfagfq fealygfsgi 







181
agspgqgqtw saaaaynngp igvaagyfyt snpsptagsr stwsgssdai fdgainsgyt 







241
saksigiaqv agqyvfgpvt vglgysnaqy kpdgfsgfss tekyntgrgf vtyqatpall 







301
lglgyaytka sgdtdakyhq vsigadyals krtdvylaga yqhasgtqrv datttqtaqa 







361
sigsygvngt ksqemvalgl rhkf 





516
58
53719415
YP_108401
91
1
tcaagcacac ccatcgccgc cgcacccggc gcgcgaccag cccccgccca gcgcgcgata 







61
cagcgtgatc gcgttcgtga gccgcagctg cttcaggcga atcagctcct gcccggactc 







121
gaacgtgctg cgctgcgcgt cgagcagctc gagatagctc gccacgccgc tgtcgtagcg 







181
ccgctgcgcg agccgcagcc gctcggcgtc cgcaccgtag accgcctgct gcgcggcaag 







241
ctgcgcgtcg atctgatcgc gcgccgcgag cgcatcggcg acttcgcgaa acgccgtctg 







301
gatcgtcttc tcgtattcgg cgaccgcgat atgcttgcgc gcgtccgcca cgtcgagatt 







361
cgcacgattg cgcccgcccg cgaaaatcgg cagcgtgagc cgcggcgcga acgtccacac 







421
gctcgagccg gccgagaaga gtcccgagaa cgcgtcgctc accgagccga cgtcggtcgt 







481
gagcgcgatg cgcgggaaga acgccgcgcg cgccgcgccg atgttcgcgt tcgccgcgac 







541
gaggcgctgc tcggcctgcc ggatgtccgg gcgctgctcg agcagatccg acgacagccc 







601
cggcgacacc cgcgccaccg cgagcgcgtc gagcgcgggc gcatcggcgg gcagcgccgt 







661
catgaagtcg cccgcgagca gcttcagcgc gctcgccgcc tgcgtgtgct cgcgctcgag 







721
cgccgccttc gacgcgcgcg ccgacgcgac gagcatctcg gccgtgcgca gctcgatcgc 







781
cgtgctcgtg ccggccgcgt aacggcgctg cgtgagcgcg taggccgcgt cgcgcgcggc 







841
gagcgtgcgt tcggcgagcc ccagttgatc gacgagcgcg cgctcggtca cgtacgcgga 







901
cgcgacttcg gcgatcaggc tgatgcgcgc cgcgcgctgc gcctcggcgg tcgcgaagta 







961
ctcggcgagc gccgcgtccg agaggctctt cacgcggccg aacagatcga tctcgaacgc 







1021
gctcacgccg acactcgcgc gatacagcga gctcgtcgcg ctctcgcgca gcaccgggtc 







1081
gtaaagccgc gtgcgctcgt agccgaggct gccgtcgatc gacggcagcc ggtccgcgcg 







1141
cgcgacgccg tagaggccgc gcgcttcctg gatacgcagc gtcgcgatcc gcaggtcgcg 







1201
attgttcgcg agcgccgcgt cgatcagcgc gcgcagcgcc ggatcggtga aatacgcgcg 







1261
ccagtcgtcg agccgcgcgt cgcgggcgtc gcgcccggcg tcgccgcggc gcgcgtccgg 







1321
catcgcgcgc gggccgccgg cggccgccgg gcccgcctcg cgcggcgcgc ccgtcgccgt 







1381
cgaccaataa tgcgccggca cgggcgcggc cggccgctcg tagcgcggcg cgagcgtgca 







1441
gcccgcgacg agcgcggccg cgaccgcgag cggccaggcc cgcggcccgt tcggccgccg 







1501
cgctgcgcgc gcgccgcgcc gccgggcccg gggtttcatc atcgctttca c 









66
1
mkammkprar rrgaraarrp ngprawplav aaalvagctl apryerpaap vpahywstat 







61
gapreagpaa aggprampda rrgdagrdar darlddwray ftdpalrali daalannrdl 







121
riatlriqea rglygvarad rlpsidgslg yertrlydpv lresatssly rasvgvsafe 







181
idlfgrvksl sdaalaeyfa taeaqraari sliaevasay vteralvdql glaertlaar 







241
daayaltqrr yaagtstaie lrtaemlvas araskaaler ehtqaasalk llagdfmtal 







301
padapaldal avarvspgls sdlleqrpdi rgaegrlvaa nanigaaraa ffprialttd 







361
vgsysdafsg lfsagssvwt faprltlpif aggrnranld vadarkhiav aeyektiqta 







421
frevadalaa rdgidaqlaa qqavygadae rlrlaqrryd sgvasylell daqrstfesg 







481
qelirlkqlr ltnaitlyra lgggwsragc ggdgca 





517
55
53718454
YP_107440
92
1
atgaagcgaa aacatgcttt gactgcactc gcagtcgcgc tgctcgccgc gggctgcacg 







61
ctcgcgccgc gctacgagcg tccggccgcg ccggtgtcgg gcgcgttccc cgccgacggc 







121
gtctatgccg cgcagccggg cgccgcgccc ggcgcgcgca gcgcgaacgg ccaggcggcc 







181
gtcgatatcg gctggcgcga gttcttcgtc gatccgtgcc tgcagcggct gatcgagatc 







241
gcgctgaaga acaaccgcga cctgcgcgtg tcggtgctca acgtcgaggc gtcgcgcgcg 







301
cagtatcaga tcacgcgcgc ggggctcttc ccgacgttga gcggcaccgg cacgggcacg 







361
atccagcgca cgccggccgg cgtgtcgatc accggccagc cgctcatctc gcggacctac 







421
aacgtcggcg tctccgcgtc gtgggagctc gacctgttcg gccgcgtgca gagcctgaag 







481
gatcaggcgc tcgcgcaata cttcgcgacc gcgcaggcgc gcaaggccgc ggagatctcg 







541
ctcgtcgcga gcgtcgccga tcagtacctg acgctgctgt cgaccgacga tctgctgcag 







601
gtcacgcaga acacgctgaa gtcggcgcgc gcgtcctacg atctgacgaa gctgcagttc 







661
gacaacggca ccggctcgga gctcgacctg cgccaggcgc agacggtggt cgagaccgcg 







721
ctcgcgagcc agcaggcgca ggcgcgcgcc cgcgcgcagg cgctcaacgc gctcgtgctg 







781
ctgatcggcg agccgctgcc cgacgatctg ccggccggcc tgccgctcaa tgcgcagaac 







841
ctgctcaccg acattccggc cgggctgccg tccgatctgc tcacgcggcg gcccgacatc 







901
atgcaggccg aggagacgct gcgcgcggcg aacgcgaaca tcggcgcggc gcgcgcggcg 







961
ttcttcccga agatctcgct caccagcgcg ttcggcaccg cgagcccgac gctcggcggc 







1021
ctgttcaagg cgggcacggc ggcgtggtcg ttcgcgccga acatcgcgct gccgatcttc 







1081
gagggtgggc agaacatcgc gaacctcgat ctcgcgcacg tgcagaagcg catcgagatc 







1141
gcgaactacg agaaggcgat ccagagcgcg tttcgcgagg tgtcggacgg gcttgccgcg 







1201
cgcggcacgt acgatcagca gatcgcggcg ctcgagcgca acgagcacgc gcagcagcgc 







1261
cgctacgatc tgtcggacct gcgctacagg aacggcgtcg acagctatct gtcggtgctg 







1321
accgcgcaga cggacctgta ttcggcgcag cagcagttga tcagcgcgcg gctcgcgcgc 







1381
tggacgaacc tcgtggacct gtatcgcgcg ctgggcggcg ggtggctcga gcgcgcgggc 







1441
gagacgccgc gcccggcgga cgcgcccgtc gattacggca aggcggcgcc cgcgccggcg 







1501
gcggccgtgc cggctgcgtc ggccagcgcg ccggcggcgg gctga 









67
1
mkrkhaltal avallaagct lapryerpaa pvsgafpadg vyaaqpgaap garsangqaa 







61
vdigwreffv dpclqrliei alknnrdlrv svlnveasra qyqitraglf ptlsgtgtgt 







121
iqrtpagvsi tgqplisrty nvgvsaswel dlfgrvqslk dqalaqyfat aqarkaaeis 







181
lvasvadqyl tllstddllq vtqntlksar asydltklqf dngtgseldl rqaqtvveta 







241
lasqqaqara raqalnalvl ligeplpddl paglplnaqn lltdipaglp sdlltrrpdi 







301
mqaeetlraa nanigaaraa ffpkisltsa fgtasptlgg lfkagtaaws fapnialpif 







361
eggqnianld lahvqkriei anyekaiqsa frevsdglaa rgtydqqiaa lernehaqqr 







421
rydlsdlryr ngvdsylsvl taqtdlysaq qqlisarlar wtnlvdlyra lgggwlerag 







481
etprpadapv dygkaapapa aavpaasasa paag 





518
19
53721406
YP_110391
93
1
ttattccggc gagcccatca tcgactgctg atagttctga atgccgacct tgccgatcag 







61
gtcgatttgc gtctcgagcc agtcgatgtg ctcctcggta tcgtcgagga ttttctcgaa 







121
gatttcgcgc gacacgtaat cacgcaccga ttcgcaataa gcgatggctt ccttgcacgt 







181
ggcctgcgat acctgttcga gcttcaggtc gcatttcagg atttcctcgg tttcctcgcc 







241
gacgagcagc ttgtgcagat cctgcaggtt cggcaggccg tcgagcatga acacgcgctc 







301
gatcagccag tccgcgtgct tcatttcgcc gatcgattcg tcgtattcgt gcttgccgag 







361
tttttcgagg ccccagtgtt tgtacatccg cgcatgcagg aaatattgat tgatcgcggt 







421
cagttcgttt ttcaactgcg cgttcagata ttcgatgact ttcttgtcgc cttgcat 









68
1
mqgdkkviey lnaqlknelt ainqyflhar mykhwglekl gkheydesig emkhadwlie 







61
rvfmldglpn lqdlhkllvg eeteeilkcd lkleqvsqat ckeaiayces vrdyvsreif 







121
ekilddteeh idwletqidl igkvgiqnyq qsmmgspe 





519
8.5
53721404
YP_110389
94
1
tcaggccgcc ttgcgttcgt agaacgtgac ggcgaccggt acggccgcag ggcgctcggc 







61
accgcaacgg ctcgcacaaa cgccctgttc cgccatgatc tcgcgcacgg tttcctcgca 







121
cttgccgcag caggtggcga ccccgagttc gaactggagt tcttcgaacg tgttcacgcc 







181
ttccgcgagg gatgcgcgaa tcttccgatc ggaaacggac ttgcacacgc agacgatcat 









69
1
mivcvcksvs drkiraslae gvntfeelqf elgvatccgk ceetvreima eqgvcasrcg 







61
aerpaavpva vtfyerkaa 





520
36
53722329
YP_111314
95
1
tcacttgaaa cggatcgtcg ggttgatgac cgcgtacagc acgtcgacgc tcaggttgat 







61
caggatgaac tccagcgaga acagcagcac gatcgcctgg atcaccgggt agtcgcgcat 







121
cgtcaccgca tcgacgagca ggcgcccgag ccccggccag ttgaacaccg cctcgacgac 







181
gatcgagccg ccgagcagga agccgaactg cagccccatc atcgtgacga ccggaatcat 







241
cgcgttgcgc aggcagtgct tgagcacgac catcggctcg cgcacgccct tcgcgcgcgc 







301
ggtgcgcacg aaatcctcgt tcagcacctc gacgaacgac gcgcgcgtga agcgcgccat 







361
cacggcggcc acggccgcgc cgagcgtcac cgacggcggc acgtagctct tccacgagcc 







421
gtccggcacg acgggcagcc agccgagctt caccgagaac acctccatca gcagcatgcc 







481
gagcgcgaac gccggaaacg agatgcccga caccgcgagc gtcatgccga tgcggtccgg 







541
ccagcggttg cgccacacgg ccgacgcgat gccgatcgcc atgccgaaga gcgtcgccca 







601
ggccatgctg acgacggtca gcgtcagcgt cggcatgaag cgctcgccga tctcggtcgc 







661
tacgggccgc ttgctgcgcg tcgagacgcc gaagtcgccg tgcgcgatcc gcgcgaagaa 







721
gctcgcgaac tgcgccggca gcggcttgtc gaggccgaga tcggcgcgca cgagcgcgac 







781
ggtcgcgtcg tcggcctcgg ggccggccgc gagccgcgcg ggatcgcccg gcagcaggtg 







841
gacgaacagg aacaccagca ccgcgacgca cgccagcgtc ggcagcagcc cgaacagtcg 







901
tttgaccaga aaattcagca t 









70
1
mlnflvkrlf gllptlacva vlvflfvhll pgdparlaag peaddatval vradlgldkp 







61
lpagfasffa riahgdfgvs trskrpvate igerfmptlt ltvvsmawat lfgmaigias 







121
avwrnrwpdr igmtlavsgi sfpafalgml lmevfsvklg wlpvvpdgsw ksyvppsvtl 







181
gaavaavmar ftrasfvevl nedfvrtara kgvrepmvvl khclrnamip vvtmmglqfg 







241
fllggsivve avfnwpglgr llvdavtmrd ypviqaivll fslefilinl svdvlyavin







301
ptirfk 





521
42
53721737
YP_110722
96
1
atgaaaaagc gcactgcggt cgcgatgacg gccgccggac tggcggctgt cgccaccgcg 







61
cacgctcaga gcagcgtgac gctttacggc atcgtcgata acggcatcgc gtatcaaagc 







121
agcagcacgt cgctcggctc gacgacgggc ggccgctcgg cggtgaagat gtcgacgggc 







181
gtgtgggcgg gcagccgctt cggcctgaag ggcagcgagg atctcggtgg cggctcgaag 







241
gcgatcttcc agctcgaatc gggcttcagc acggccaacg gcacgtcgca gttcgcgggc 







301
ggcatcttca cgcgccaggc gtgggtcggc ctgaccaacc cgacgtacgg cacgctgacg 







361
gcgggccgcc agtacaccgc gtactacacg ctgctgtcgc cgtatagccc gacgacctgg 







421
ctcaccggct atttcggcgc gcatccgggt gacatcgatt cgctcgatac cagctaccgc 







481
acgaacaatt cgctcgtcta cctgtcgccg aagttctacg gcttcacgtt cggcggctcg 







541
tatgcgttcg gcggacagcc gggcagcgtg aacgccggct ccacgtggag cgcgggcatc 







601
cagtacatga acggtccgct cggcatcgcg gcggcgttcc agcgcgtgaa caactcgacg 







661
tcgggcggcg gcgactgggg cgcgaattcg acgacgtcga acggcggcgc gcagacggcc 







721
gtgtcggcga tcaacaacgg ctacaagacc gcgcaggcgc agcagcgcgt cgcggtgacg 







781
gccggctatc agttctcgtc cgcatgggac atctcggtgt cgtactcgaa cgtgcagtac 







841
atcccgggcg tgaactcgac gttccgcaac acggcgatct tcaacacggc gggcgcggtg 







901
ctgcacttca agccgtcggc tcagtgggac ttcgcgggcg gctacgcgta cacgcgcgcg 







961
acccagtcga acggcatcac gagcgccgcg cagtaccatc agttcacgct gtcgcagtac 







1021
tacagcctgt cgaagcgcac gggcctgtac gcggtcgaag cgtatcagcg cgcgaacggc 







1081
aagacgctcg cgggcggcaa gatcatcgac gcgaccgcgt cgatcggcga cggcttcaac 







1141
acgtcgccgt cgtcgtcgcg cagccaggtg ggcgtcggcg tcggcctgat ccaccgtttc 







1201
tga 









71
1
mkkrtavamt aaglaavata haqssvtlyg ivdngiayqs sstslgsttg grsavkmstg 







61
vwagsrfglk gsedlgggsk aifqlesgfs tangtsqfag giftrqawvg ltnptygtlt 







121
agrqytayyt llspyspttw ltgyfgahpg didsldtsyr tnnslvylsp kfygftfggs 







181
yafggqpgsv nagstwsagi qymngplgia aafqrvnnst sgggdwgans ttsnggaqta 







241
vsainngykt aqaqqrvavt agyqfssawd isvsysnvqy ipgvnstfrn taifntagav 







301
lhfkpsaqwd faggyaytra tqsngitsaa qyhqftlsqy yslskrtgly aveayqrang 







361
ktlaggkiid atasigdgfn tspsssrsqv gvgvglihrf 





522
56
53719527
YP_108513
97
1
atgaaaacct tcccgttgtc cgcttgccgg accgccgtgg ccgtcgcggt cgccgcgctc 







61
gcgctcgcgg gatgcgcgaa ctatttcggc ctcaagagcg acaaggcgat cgctcccgcg 







121
gcgcagttcg aaagcgcgcg gagcctgccc gcgcagggcg gccaatggcc gtcgctcgac 







181
tgggcgaacc agttcggcga tccgcaattg ccgaagctga tcgacgaggc gctcgaaggc 







241
aatccgacga tcgcgcaggc gcaggcgcgc atcgcgaagg cgtcgtcgta catcgagtcg 







301
tcgcgctcga cgctgctgcc gaaggccgac gcgaagtact cgtggacccg cgagctgtac 







361
tcgggcaacg cccttttccc gcccccgtac ggcggccagt ggtacagcga gaacaacgcg 







421
ctcgcgagcg cgtcgtggga gctcgatctg tggggcaaga accgcgcgcg cctgaatgcg 







481
gccgtgtcgc aggaaaaggc cgccgaagcc gacctgcagc aggcgcgcgt gacgctcgcg 







541
acgtcggtcg cgcgcacgta caaccagctc gcgcaactgt acgcgctgcg cgacatcgcc 







601
gggcgcgaga tctcgaaccg gcagacggtc ggcaagatca ccgacggccg cgtcggcgcg 







661
ggcctcgaca cgaacgtcga gcgccagacc gcgcgcggca acatcgcgac gagccagtcg 







721
acgctgtccg atctcgacgg ccagatcacg aacgtccgct atcagctcgc cgcgctgctc 







781
ggcaaaggtc cggaccgcgg gctgcagatc gccgcgcccg tgctgagcgc gggcggcgcg 







841
gtcgcgctgc cggagaacct gcccgccgat ctcgtgtcgc gccgcccgga tctcgtcgcc 







901
gcgcgctggc aggtcgaagc ggcgatgcac gacgtgaagg aagcgaaagc cgagttcttt 







961
ccggacatca acctcgcggc gggcttcggc ttcgacgcgt tcggctgggg ccgattcctg 







1021
aacttcgcga gccgccaggc gcagttcggc cccgcgatcc acctgccgat cttcgacggc 







1081
ggcgcgctgc gcgcgcaatt gaagggccgc tacgcggact tcgatctgtc ggtcgcgaac 







1141
tacaaccaga cgctcatcaa cgcgctgaac gacgtcgcga cgcaagtcgc gtcgattcgc 







1201
tcggtcgaca cgcaaatggg cgacgcgcag cgcgcgctcg atgcgtcgac gcgcgcctac 







1261
gagctcgcgg tgatccgcta caaggcgggc ctgtcgccgc agttgcaggt gctcaacgcg 







1321
gacagcaacc ggctcgccgc cgagcagacg gtgacgaacc tgaagatgcg ccgccgcgac 







1381
ctgcagatcg gcctcgtgaa ggcgctcggc ggcggcttcg acgcggccgg cacgcggctc 







1441
gccgcgcccg cccccgcctc ggcgccggcc gcgcccgcgc agcacgcatc gaactga 









72
1
mktfplsacr tavavavaal alagcanyfg lksdkaiapa aqfesarslp aqggqwpsld 







61
wanqfgdpql pklidealeg nptiaqaqar iakassyies srstllpkad akyswtrely 







121
sgnalfpppy ggqwysenna lasasweldl wgknrarlna aysqekaaea dlqqarvtla 







181
tsvartynql aqlyalrdia greisnrqtv gkitdgrvga gldtnverqt argniatsqs 







241
tlsdldgqit nvryqlaall gkgpdrglqi aapvlsagga valpenlpad lvsrrpdlva 







301
arwqveaamh dvkeakaeff pdinlaagfg fdafgwgrfl nfasrqaqfg paihlpifdg 







361
galraqlkgr yadfdlsvan ynqtlinaln dvatqvasir svdtqmgdaq raldastray 







421
elavirykag lspqlqvlna dsnrlaaeqt vtnlkmrrrd lqiglvkalg ggfdaagtrl 







481
aapapasapa apaqhasn 





523
85
53720164
YP_109150
98
1
tcacagcgac catttcaact ctccataaaa agtgcggccc ggataaggat ggaacacgta 







61
gtagcggcga tcggtcacgt tgtcgatgcc gaacgacgca agccagtgcc gatcgaaccg 







121
atagcgcgcc ttcaggtcga cgaccatgaa cgagctggtg ccgccgtata cgcccggatt 







181
cacgtcgctg ttgtcgagcg cgttgtactg ccgccccgaa tagcgaacgc cgacgctcgt 







241
catccaatgc tcgtcgaagc ggtacgacgc gagcaggttc gcgcgcatcc gcggaatccg 







301
cggccagcgc gcgccgacgt agttcggatt ggccgcatcg gcaagcgtct gcgcattcgt 







361
cgcggacacg ttcgcgtcga cgtcgagccc cttgagcgcg acgtcctgcc ccgagaacgc 







421
gagctcgacg ccgcgcaccc gcacgcggtc gacgttcgag acgttcgtgt acgtcgacgc 







481
gccggcaagc gtggtctggc tgtagatcga attgcgcaga tcgctctgga acacgctcgc 







541
gcgcacgacg ccgaagccga cgtcgcgctc ggcggtgaaa tcccagtcga tcgccttttc 







601
cggctgcaga ttcgggttgt tgttgacgat cgcgttgttc gagatcgtgc cctggaacag 







661
ctcggcgacc gtcggaaagc gcgtgcccgt cgcgaacgac aaccgcaggc gccacgcgct 







721
ggccggctgc cattcgagcg agagcttcgg cgagaacgcg gtcgcgccgc gctccgcgta 







781
gccgagcgtc gcggtcgcgc cgccgagctc gccgccgtat gcatcccagc gctcgtagcg 







841
caggccgagc gtcgcgagcc agccgggcgc gacgcgccac gcgtcctgcg catacagcgc 







901
ctgcgtgcgc gtgttgccgc gatagcggtt cacgagcgtc gtcggcaccg cgttttgcca 







961
gtcggccgtg ttgtaggtcg cgttgcgcag gaaatacgta tcgaagtggt agccgaacga 







1021
aaaacggtgc ccgcgcacgt cgggcgactc cgcgcgcaag tcgacggtgc gccagccggt 







1081
gccgtcgcca tggaacaccg tgcccggccc gccgtcccag gcgccgggcg gtgcgccgga 







1141
cgcgctgcgc agcacgtcgc gcgacacttc gtaggcggac gcggtggccg acagcttcca 







1201
gccggacgcg agccgcgcgt cgagcccgag cccgtacagc cagttctcct ggtcgcccgt 







1261
ctgcggcgcg aacgcgcccg gcgacaccgt gtagttgcgc ccgccgaacg acacgttgcc 







1321
gccgtacacc gggttgcccg ccgcgtcgcg caggaacgtg tcgccgtgct gccggtagtg 







1381
attctcccag tggccgagcg tgaccgtcgc gtcgacgtga tcggtgaacg cgtagccgaa 







1441
gcgcaacgtc tcgttgagct gctcggtgcg ctcgatcgtc tgcgcgccga cgatcgtccg 







1501
cggccggccg ttcggcccga tgtcggagac ggcccccgtc accggcacgg gtgcgccgag 







1561
cctcggatcg aacgcgccgt tcgggctcgc gtattgcatc ggctggctgt cgttctcgag 







1621
ccggtcgagc gacagcgcat accagaaacg tccgacgcga tcggcgatgc gcgcgctctg 







1681
gtgattgccg ccgaagctgt cggcgaagcc gtagccgtcg cgatagcgct gcgtgaagaa 







1741
ctgcgtcgac accgacgcct cgagccgctg cggcttgtgc gtggtgatct gcacggtcga 







1801
gccgatcgcg ttgcccgggt agagcgcgga aaacgggccg tacagcacgt ccacgcgtgc 







1861
gatgtcgtcg ggctggatca gcgaccagcg cggcggatac gaatagctgg aaccgagcag 







1921
attggacagc aggatgccgt ccgcatagac gagtccgcgc gcgctctgca gctcattgaa 







1981
gtctcggccg gcgaacactg aattgcggtc gccgatgtag cgccggcgca ccatcaggtt 







2041
cggcgcgtac ttgagcgcgt cttcggtcgt cacgttgacg tgcgaatcga tctgctcgcg 







2101
cgtgagcgcc tcgacgacgc cgggcgcgtc gggcgcgaac gcggcgcgct gcgcggtgac 







2161
cttgacggta tccaaggttt tcccggatgc ggtcatctcc gcggcggcgg ccggcgttga 







2221
atcggcaggc acggcggcgg ccgtcgaagc cgcgtgggcg gcgggccagg cgagcgccgc 







2281
ggcgcaggcg gccgcgagcc gcctgcgcgc gccgggcgcg gcaagttgat gaaacat 









73
1
mfhqlaapga rrrlaaacaa alawpaahaa staaavpads tpaaaaemta sgktldtvkv 







61
taqraafapd apgvvealtr eqidshvnvt tedalkyapn lmvrrryigd rnsvfagrdf 







121
nelqsarglv yadgillsnl lgssysyppr wsliqpddia rvdvlygpfs alypgnaigs 







181
tvqitthkpq rleasystqf ftqryrdgyg fadsfggnhq sariadrvgr fwyalsldrl 







241
endsqpmqya spngafdprl gapvpvtgav sdigpngrpr tivgaqtier teqlnetlrf 







301
gyaftdhvda tvtlghwenh yrqhgdtflr daagnpvygg nvsfggrnyt vspgafapqt 







361
gdqenwlygl gldarlasgw klsatasaye vsrdvlrsas gappgawdgg pgtvfhgdgt 







421
gwrtvdlrae spdvrghrfs fgyhfdtyfl rnatyntadw qnavpttlvn ryrgntrtqa 







481
lyaqdawrva pgwlatlglr yerwdaygge lggatatlgy aergatafsp klslewqpas 







541
awrlrlsfat gtrfptvael fqgtisnnai vnnnpnlqpe kaidwdftae rdvgfgvvra 







601
svfqsdlrns iysqttlaga stytnvsnvd rvrvrgvela fsgqdvalkg ldvdanvsat 







661
naqtladaan pnyvgarwpr iprmranlla syrfdehwmt svgvrysgrq ynaldnsdvn 







721
pgvyggtssf mvvdlkaryr fdrhwlasfg idnvtdrryy vfhpypgrtf ygelkwsl 





524
78
53722869
YP_111854
99
1
tcagaatgtc gtgcgcatcc cgatgctcac gctgcgcccg ccctgcggcg cgatgttgcg 







61
caccaccgag ctcgcgtagc ggatgtcctg gttcgtcagg ttgtcgccgc gcagataggc 







121
gagccagttc gtcgcgccga cgcgcaactt gtaggtgagc acgacgccga gcgacgtgta 







181
gccgtcggtc gcgagatcgt gctcgggcac gcgatgctgc gcccacgcat gcgtgagctg 







241
cgcgcgcgca ccgaacgggc cgtagccgta atcggccgcg agcgtcgcgc gcagcggcgc 







301
gatccgcggc agcggctcgc ccgtgtccgc gttgcgcgcg tgcgtgtagt cggcgctcag 







361
ctcgagatcg acgcgatgcc cgcgccgctc gaacgcgcgc cacctgcctt cgagctcgac 







421
gccgtagaac tccgcgcgca cgccgcgata gacggcctcg cgcagcgcgt cgtcggcgcc 







481
cggggccacg ggcacgccgt cgtcgtcgac gagccgtccg gtgtcgtatt cggcgaggta 







541
gttccgcagg cggctgtaga acacgccgat gctgccgcgg ttcggaccgc tcgcatagcg 







601
cagcgcgaga tcggtcgaca ccgccttctc cttctgcgca tcgggccggc cgatcaggta 







661
ctgccccgtc gcgccgtgcg ggccgttcgc gtacagctcg tagaaggtcg gcgcgcgctc 







721
cgtgtacgac acgttgcccg cgagcgacca cgcgggcgcg agctgataca gcgcgcccgc 







781
cgacacgctg cccgcattga aatcgcgcga gcgcgcgaag ccgaacttgt cgtcgccgtt 







841
cgcgctcgga tcgagccgca cgtgctcgat gcgcgcgccg gcggacagct tcagcgcgtc 







901
ggtcgcctgc cactgctcga ggccgaacag cgcgacgctc gtcgtgcgcg tggtaggcgc 







961
gagcgcctcg ccgccgagcg cggagaacgt gttctgtccg acctgcacgc cgagcgcgcc 







1021
ttcgaacggg ccgagcttgc ggtgccgcgc ctcgacgcgc gcctcgtagc cgtgattgcg 







1081
aaacgtcgtg cccgtcacgc cgtcctcgat ttccttatgc tggtaattcg tgtagccgaa 







1141
gtcgaatttc agctgcgaga acggcccgcg cagattgcgc acctccgacg cgagcgcgac 







1201
gcgctcctgc cgcatctgaa gccgcgcgtc ggtttcggcg acggagccgt agttcgattc 







1261
gtagccgctg tacgatgcgc cgacgtagcc gtccgcccac gtgtacgagc cgcccgccgc 







1321
gccgccgtaa cggcggccgt cgctgttcgg cagcttgccg tacggctcgc tggcgtcctc 







1381
gccgtcgagc gcgcgctggc gcgccgaatg cgcatggccc ggaatccgca gcgcgtcggt 







1441
ctcgcggccg aacgcgtcga gatgaaacgc gaagcggccg ttgccgcctt cgacgagcgc 







1501
cgcgcccgca cgcgcattgt tcgcgccgcc gtagctcgcg tcgagcgcgc ccgagacgcc 







1561
cgtgatcgct tcgcgcggaa tccggttgtc gatcgtgttg acgacgccgc cgaccgcgtt 







1621
gccgccgtac agcaacgccg ccggcccgcg cacgatctcg atgcgctcga cgctcagcgg 







1681
atcctgcggc accgcgtgat cgtaggacag cgacgacgcg tcgtacgcgg cgacgccgtt 







1741
ctgcaggagg cggatgcgat cgccgtccat cccgcggatg atcggacggc cgacgagcgg 







1801
gccataggtc gtcgtggaca cgccgggcag gccgttcagc gtgtcgccga gcgaatcggt 







1861
gcggcgcagc gtcagcgcgt cgccggacag cgacgcggtg ggcgagctca gcgccgacgc 







1921
gccgagcggg ttcgcggtca cgaagatcgg cgcgagcgcc gtcgcgggcg gcgcgtccga 







1981
tcggcgctcg gcgggcggcg cgcccgtctc gccatgcgcg aggctcgcgg cgagcagcag 







2041
cgacaaggga tgcagcctgc gagcgaaagg cggcgcgata cgtcggtggt catccat 









74
1
mddhrriapp farrlhplsl llaaslahge tgappaerrs dappatalap ifvtanplga







61
salssptasl sgdaltlrrt dslgdtlngl pgvstttygp lvgrpiirgm dgdrirllqn 







121
gvaaydassl sydhavpqdp lsverieivr gpaallyggn avggvvntid nripreaitg 







181
vsgaldasyg gannaragaa lveggngrfa fhldafgret dalripghah sarqraldge 







241
dasepygklp nsdgrrygga aggsytwadg yvgasysgye snygsvaetd arlqmrqerv 







301
alasevrnlr gpfsqlkfdf gytnyqhkei edgvtgttfr nhgyearvea rhrklgpfeg 







361
algvqvgqnt fsalggeala pttrttsval fgleqwqatd alklsagari ehvrldpsan 







421
gddkfgfars rdfnagsvsa galyqlapaw slagnvsyte raptfyelya ngphgatgqy 







481
ligrpdaqke kaystdlalr yasgpnrgsi gvfysrlrny laeydtgrlv dddgvpvapg 







541
addalreavy rgvraefygv elegrwrafe rrghrvdlel sadytharna dtgeplpria 







601
plratlaady gygpfgaraq lthawaqhrv pehdlatdgy tslgvvltyk lrvgatnwla 







661
ylrgdnltnq diryassvvr niapqggrsv sigmrttf 





525
43
53718452
YP_107438
100
1
atgcgcgtcg aacgggttcc ataccgctta atcactgtcg cgacggctgc cgttttcctg 







61
gccgcgtgcg gaaaaaaaga atcggctccg ccccctcaaa cgcccgaagt cggcgtcgtc 







121
accgtccagc cgcagcccgt gccggtcgtc tccgaactgc cgggccgtac gagcgcctat 







181
ctggtcgcgc aggtgcgcgc acgggtcgac ggcatcgtgt tgcggcgcga gttcacggaa 







241
ggcagcgacg tcaaggccgg ccagcgcctg tacaagatcg atccggcacc ctatatcgcg 







301
caattgaaca gcgcgaaggc gacgctcgcg aaggcgcagg cgaacctcgc gacgcagaac 







361
gcgctcgtcg cgcgctacaa ggtgctcgtc gccgcgaacg cggtcagcaa gcagcagtac 







421
gacgatgcgg tggccgcgca agggcaggcg gccgccgacg tgggcgcggg caaggccgcc 







481
gtcgagaccg cgcagatcaa cctcggctat acggatgtcg tctcgccgat cacgggccgc 







541
gtcggcatct cgcaggtcac gccgggcgcg tacgtgcagg cgagccaggc gacgctgatg 







601
tcgaccgtcc agcagctcga tccggtctac gtcgatctca cgcagtcgag cctcgacggc 







661
ctgaagctgc gccaggacat ccagagcggg cgcatcaaga cggaaggccc gggcgcggcg 







721
aaggtcacgc tgattctcga ggacggcaag ccgtacccgg agcggggcaa gctgcagttc 







781
agcgatgtca cggtcgacca gacgacgggc tcggtcacga tccgcgcgat cttcccgaac 







841
aagcagcgcg tgctgctgcc gggcatgttc gtgcgcgcgc gcatcgaaga gggcgtcaac 







901
gagaacgcgt tcctcgttcc gcagatcggc gtcacgcacg atccgaaggg ccaggcgatc 







961
gcgatgatcg tcgacggcaa gggcaaggtc gagccgcgcg tgctggtcac gggcggcacg 







1021
cagggccaga actgggtggt cgagagcggc ctgcaggcgg gcgaccgcgt gatcgtgcag 







1081
ggcatcgaca aggtgcgccc gggcatgacc gtgaaggccg ccgaggctca actgccggcc 







1141
gcggcggccg gcgcgtcggg tgccgcgccc gcgggcggct cgcccgccca ggccgcggcg 







1201
gcgagcgcgg ccgcgtcggg tgccgcgccg tcgagcgctg ccgccgcgtc gagcgcgcaa 







1261
taa 









75
1
mrvervpyrl itvataavfl aacgkkesap ppqtpevgvv tvqpqpvpvv selpgrtsay 







61
lvaqvrarvd givlrrefte gsdvkagqrl ykidpapyia qlnsakatla kaqanlatqn 







121
alvarykvlv aanavskqqy ddavaaqgqa aadvgagkaa vetaqinlgy tdvvspitgr 







181
vgisqvtpga yvqasqatlm stvqqldpvy vdltqssldg lklrqdiqsg riktegpgaa 







241
kvtliledgk pypergklqf sdvtvdqttg svtiraifpn kqrvllpgmf vrarieegvn 







301
enaflvpqig vthdpkgqai amivdgkgkv eprvlvtggt qgqnwvvesg lqagdrvivq 







361
gidkvrpgmt vkaaeaqlpa aaagasgaap aggspaqaaa asaaasgaap ssaaaassaq 










Bm10229-600












ID
MW
Protein GI#
Protein Locus#
SEQ ID NO:
Sequence 
















601
81
124386268
YP_001026287
121
1
ttaccagttg tatttcgcgg tcgcgatcac ggtgcgctgg ttgccgtaca tgcacaccgc 







61
atccgactgg cagccggcca cgtagaggcg gttgaacagg ttcgtcgcgt tgagcgcgaa 







121
acgccagttg cgcagctcgt agtgcagcgc cgcgtcgaac agcgtgtagc tcggcaccgt 







181
gagcgaattg tcgaccgcgc ccgccgccgc gctcatgtag cgcacgcccg cgccgacgcc 







241
gaagcccgtg agaggcccgg tgcgccacgt ccagtcggcc cacagcgacg cgatctggcg 







301
cggcagaggc acgtcgacgg gccacttgtt cagcgtgttg tcgttcgccc gcacgttctt 







361
cacgtcctga tagacgtacg cggcgatcac cgacagctcg cgcgacaggt tgcccaccgc 







421
gctcagctcg acgccgcgcg agcgcacctc gcccacctgc acgaacgtgc tgctcgtcgg 







481
gtcgttcgga ttgctcatcg cgacgttcgt ctggttgatc tggtagacgg ccgcgttcag 







541
catcaggttc ttgccgggcg gctgccagcg caggcccgcc tcgatctggc ggcccttggt 







601
cggcgtcgcg agcccgccgc ccgcgagctt caggccgatc tgcggattga acgacgtcga 







661
atagctgagg tacggcgcga gcccgtagtc gccgagatac gtgaggccca cgcgcccgct 







721
gaacgcgtgg tcgttctgcc tgaagctcgc cgcgttcgcg atgtcgtcct gcgacgtgcg 







781
cgtccagtcc tgccggccgc cgagcgtgag cacccagcgc cgccacttga tctggtcctg 







841
cacgtacagg ccgaatgcgt tcagcgtcgt cttcgtgtcg gtgcgcgacg cgttaggccc 







901
gctgaagatg tcggacggaa tcggcgtgta gacgggccgg tacaggttca ggctaggccc 







961
cttcgcgagc cattcgctat cggtcgtcgt ctggcggttg tagtcgaagc cgaacagcag 







1021
cgtgtgcgaa agagggcccg tcgtgaattt cgcctgcgcc tggttgtcga cgtcgaagcg 







1081
gctgtagttg aactggaaca ggcccgcgta gcgcgtcatc gtcgccatcg tcgggtctgc 







1141
gtcgtcgagg ccgccgccgt agacggacgc gtcgtcgagc gacaggtgca tccagcgcac 







1201
gttctgccgc agcgtccaca ccgggttcag cttgtgctcg aactgatagc cgagcgacca 







1261
ctgcttcttg cggtaatggt cgaagttcgc atcggccgtg tacaggtcgt ccgagatcgt 







1321
gccgttcgga ttcggcagca cggtgccgcg cgacggcagg aagttgctcg acgtgtcccc 







1381
ccagtcctgc aggtacgtcg cggcgagggt gagcgacgtg tccgcgttcg gctgccattt 







1441
gagcgacggc gcgaacgaca cgcgctggtc ggcgagcggc cccgtctgcg cgttgccgtc 







1501
gcggccgacg ccgacgatcc ggtacgacag cgtgccgtcc ttgccgatcg tgtcgccgat 







1561
gtcgaacatc agttgcttgc gcgcgtagtt gccgacctgc acgccgagct cgcggatgcg 







1621
ctcgccgttc gcgagcttgc tctgcacgtc gacgatcgcg ccgggatcgc cctgcccgta 







1681
cagcaccgac gtcgggccgc gcagcacggc gatgctgtcg atcatgtacg gatcgacgcg 







1741
ccagctcgac aggttgatcg tgttcggcac ctgcagcccg tcgacgaaca cggtcggcgt 







1801
gaagccgcgc agcgccgcgt accagtccga acggttgtcc gacccgtagc tcgagaagcc 







1861
cggaatgtag cggaacgcct gattgatgtc ggtcgcgccc gtcgcctcga tctgctgcgc 







1921
ggtgacgacg ttgatcgtct gtggaatctc gatgatcggc gtatccgttt tcgtgcccgt 







1981
catgctgcgc cgcgcgacga ggccgacgct cgcgtcgcgc tccgctgctc ccgacaccga 







2041
gatcgcgggc aacgtgccgc ccgccgcggt gtctgccttc gcctcgccgc ccggctgccg 







2101
cgcgtccgcc cccggctgcg ccgcctgtgc ctgcgcatga cccgccgccg ccgcgtagaa 







2161
cgccacgccg gccgcgatcg cacgcacgcg cgtgctggtt gcccactcca tcttccgttg 







2221
ctccaagttg caaagcggcc gcgagggccg caccttcat 









101
1
mkvrpsrplc nleqrkmewa tstrvraiaa gvafyaaaag haqaqaaqpg adarqpggea 







61
kadtaaggtl paisysgaae rdasvglvar rsmtgtktdt piieipqtin vvtaqqieat 







121
gatdinqafr yipgfssygs dnrsdwyaal rgftptvfvd glqvpntinl sswrvdpymi 







181
dsiavlrgpt svlygqgdpg aivdvqskla ngerirelgv qvgnyarkql mfdigdtigk 







241
dgtlsyrivg vgrdgnaqtg pladqrvsfa pslkwqpnad tsltlaatyl qdwgdtssnf 







301
lpsrgtvlpn pngtisddly tadanfdhyr kkqwslgyqf ehklnpvwtl rqnvrwmhls







361
lddasvyggg lddadptmat mtryaglfqf nysrfdvdnq aqakfttgpl shtllfgfdy 







421
nrqtttdsew lakgpslnly rpvytpipsd ifsgpnasrt dtkttlnafg lyvqdqikwr 







481
rwvltlggrq dwtrtsqddi anaasfrqnd hafsgrvglt ylgdyglapy lsystsfnpq 







541
iglklagggl atptkgrqie aglrwqppgk nlmlnaavyq inqtnvamsn pndptsstfv 







601
qvgevrsrgv elsavgnlsr elsviaayvy qdvknvrand ntlnkwpvdv prprqiaslw 







661
adwtwrtgpl tgfgvgagvr ymsaaagavd nsltvpsytl fdaalhyelr nwrfalnatn 







721
lfnrryvagc qsdavcmygn qrtviataky nw 





602
55
124383029
YP_001025939
122
1
atgaacaaga ccaacatcaa cgaacgcatc gcgcgcgtcg cgaagatcgc ggcggcgagc 







61
gggttgctcg tcgcgctgct cgccgcgtgc gcggtggggc ccgactaccg gcggcccgat 







121
gtcgcgacgc ccgcggcgtt caaggaagcg ccggcgctcg cgccgggcga gcaggccggc 







181
acgtggaagg cggccgagcc ggccgacgcc gcgcatcgcg gcgaatggtg gcgcgtattc 







241
ggcgatccgg tgctcgatgc gctcgaaacg caggcgcttg ccgcgaacca gaacctgaag 







301
gccgcggcgg cgcgcgtcga gcaggcgcgc gcggcgaccc gcgccgcgcg ctcgcagtgg 







361
ttcccgcaag tgggcgtggg cttcgggccg acgcgcgagg ggctgtcgtc ggcgtcgcag 







421
ttccagccgc agggcagcgg cccgacgaac gcgacgctct ggcgcgcgca gggcacggtg 







481
tcgtacgaag cggacctgtt cggccgcgtc ggccgcaacg tcgaagcgtc gcgcgcggac 







541
gaagcgcaga gccaggcgct gttccgctcg gtgcagctcg cgttgcaggc ggacgtcgcg 







601
cagaactact tcgagctgcg ccggctcgat tccgaccagg atctgtaccg gcgcacggtc 







661
gggctgcgcg aggaggcgct caagctcgtg cagcggcgct ttgccgaagg cgacatcagc 







721
gagctcgacg tgtcgcgcgc gaagaacgaa ctggcgaccg cgcaggcgga cgcggtgggc 







781
gtcgcgcggc ggcgcgccgc atcggagcat gcgctcgcga tcctgctcgg caaggcgccc 







841
gcggatttct cgttcagcga gacgccgatc gcgccggtgg cggtgcgcgt gccggcgggg 







901
ctgccgtcgg cattgctcga gaggcggccc gacatcgcgg cggccgagcg cgcgatgatg 







961
gccgcgaacg cgcgcatcgg cctcgcgaag tccgcgtact tcccgaagct cgacatcacc 







1021
ggcgcgttcg gctatgaggc ggcgacgctc ggcaacctgt tcctgtggtc gagccgcacg 







1081
ttcctgctcg ggccgttcgc gggcacggcg ctcacgctgc cgatcttcga cggcgggcgc 







1141
cgcagcgcgg gcgtcgcgca ggcgcgcgcg aagtatgacg aggaagtcgc gaactaccgg 







1201
cagcaggtgc tcgtcgcgtt ccgcgaagtg gaggacaacc ttgccgatct gcgtctgctc 







1261
gacgatcaga tccgcgcgca aagcgacgcc gtcaacgcgt cgcggcgcgc ggcgaagctg 







1321
tcgcgcacgc agtatcagga aggcgcggtc agctatctcg acgtgatcga cagcgagcgc 







1381
tcggtgctcg aatcgcaact gcaatcgaac cagttgacgg gcacgcaggc ggtatcgacg 







1441
gtcaacctga ttcgcgcgct cggcggcggc tgggggagcg atgcggcgct cggctcgcgc 







1501
gagccgggca agcaggacgt cgcggcgcgt tga 









102
1
mnktnineri arvakiaaas gllvallaac avgpdyrrpd vatpaafkea palapgeqag 







61
twkaaepada ahrgewwrvf gdpvldalet qalaanqnlk aaaarveqar aatraarsqw 







121
fpqvgvgfgp treglssasq fqpqgsgptn atlwraqgtv syeadlfgrv grnveasrad 







181
eaqsqalfrs vqlalqadva qnyfelrrld sdqdlyrrtv glreealklv qrrfaegdis 







241
eldvsrakne lataqadavg varrraaseh alaillgkap adfsfsetpi apvavrvpag 







301
lpsallerrp diaaaeramm aanariglak sayfpkldit gafgyeaatl gnlflwssrt 







361
fllgpfagta ltlpifdggr rsagvaqara kydeevanyr qqvlvafrev ednladlrll 







421
ddqiraqsda vnasrraakl srtqyqegav syldvidser svlesqlqsn qltgtqavst 







481
vnliralggg wgsdaalgsr epgkqdvaar 





604
83
124382070
YP_001024918
123
1
ttggcgcggc ggccgcttcg cgccgcgctg ttcggggcct tcggcctcta tgcggcggcc 







61
gcgcgcgccg ccggccccgc ttccgaaccc gcggccgccg cgccgccggc cgccgcatcc 







121
gccgcatccg ccgcatccac gtcgcaggtg cggcacgcgg cgatcgccgc cgcgcgcaag 







181
gacgcgccgg cactcgatcc gatcaccgtc accgcgacgc gcaccgcgtc ggccgcgagc 







241
cgcaccgcgg cgagcgtatc ggtgatcacc gattcagacc tcgaggagca gcaggccgac 







301
aacatcaagg acgcgctgcg ctacgagccg ggcgtcaccg tgcgacgcac cgcgtaccgc 







361
ccggcgaacg ccgcgctcgg cggcggccgc gacggcgatt cgagcatcaa catccgcggc 







421
ctcgaaggca accgcgtgct gctgatggaa gacggcatcc ggctgccgag cgcgttctcg 







481
ttcggcccgc tcgaagccgg ccgcggcgat tacgccgatc tcgacacgct cgcgcgcatc 







541
gagatcctgc gcggtccggc gtccgcgctg tatggcagcg acggcctgac gggcgccgtc 







601
aacttcatca cgaaagatcc gtccgatctg ctgtcgatcc atcgaaaaaa gacctatttc 







661
tcgttccggc cgagctacga ctcggtcgac cgcagcatcg gcgcgaccgt gacggcggcg 







721
ggcggcaacg atcgcgtgca ggcgatgctg atcgcgtccg gccgccgcgg ccacgaactc 







781
gacacgcacg gcgacgacaa ttccgcgagc acgcggcgca cgcgcgcgaa tcctcaggat 







841
gtctacacgg aatcgctgct cggcaagctg acgatcacgc cgaccgcgcg cgacacgatc 







901
aagctcgccg ccgaaacggt gcggcggcgg atcgacacga acgtgctgtc ggcgatcaat 







961
ccgccgacaa cgctcggcct caccgcgaac gacaggctcg agcgcaaccg cttcagtatc 







1021
gactacgatt tgcgcgacgc cgccgcgcgc gggttccaga ccgcgcacgt gcagttctac 







1081
tatcaggagt cgacgcagga tcaggacgcg ttcgagacgc gcggcgggcg gctccaatcg 







1141
cgttcgcgct cgaaccacta cagcgagcgc gcgctcggcg gctccgcgtt cgccgagagc 







1201
ggcttcgcga ccgggccgct cgcgcacaag ctgctgtacg gcgtcgacgg cagcatcgac 







1261
cgcatcaaga gcctgcgcga gggcaccgtc gcgagccccg gcgaatcgtt cccgaacaag 







1321
gcgtttccgg acaccgacta ctcgctgttc ggcgcgttcg tgcaggatca gatcggcttc 







1381
ggcaagctgc tcgtcacgcc gggcctgcgc ttcgacgcgt atcggctcag cccgagctcg 







1441
ggcgatccgc tgttcaccgg caagacggtc agctcgagcg atcacgagct gtcgccgcgc 







1501
gtcgcgatgc tctatgaagc gtcgcccgcg ctgattccct acgcgcagta tgcgcacggc 







1561
tttcgcacgc cgacgcccga tcaggtcaac aacagcttct cgaatccgat ctatggctat 







1621
acatcgatcg gcaatccgaa cctgaagccc gagacgagcg acacgctcga agcgggcctg 







1681
cgcggcacgc tcggcaccgg ctacgggccg ctgcgctaca gcgtcgccgc gttcgccggc 







1741
cgctatcgca acttcatctc gcagcgcacg gtaggcggca gtggccggcc gaacgatccg 







1801
ctcgtgttcc agtacgtaaa cttcgcgaac gcgcgcattc acggcttcga gggacgcgcc 







1861
gaatgggtga tgccgaatgg cttcacgctg aagacggcga tggcgttcac gaagggcacg 







1921
acgcaggaca acggcgcggc gagcgagccg ctcgatacgg tcaacccgtt ctccgccgtg 







1981
ttcggcgtgc gctacgagcc gagcgagcgc tggttcgcgc aggcggacct gctgtggcag 







2041
gcgggcaagc gcggccgcga cgtgtcgtcg gccgcgtgcc agaaaaagac ctgcttcacg 







2101
ccgccgtcgt cgttcgtcgt cgatctgcgc ggcggctacc gcttcaacaa gcacgtgagc 







2161
gcctacctcg gcattcacaa cctgttcgac cgcaagtact ggaactggtc ggacgtgcgc 







2221
ggcatcgccg ccgattcgaa cgtgctcgac gcatacaccg ccccgggccg cagcgtcgcg 







2281
gtcagcatga aggtggattt ctga 









103
1
marrplraal fgafglyaaa araagpasep aaaappaaas aasaastsqv rhaaiaaark 







61
dapaldpitv tatrtasaas rtaasvsvit dsdleeqqad nikdalryep gvtvrrtayr 







121
panaalgggr dgdssinirg legnrvllme dgirlpsafs fgpleagrgd yadldtlari 







181
eilrgpasal ygsdgltgav nfitkdpsdl lsihrkktyf sfrpsydsvd rsigatvtaa 







241
ggndrvqaml iasgrrghel dthgddnsas trrtranpqd vytesllgkl titptardti 







301
klaaetvrrr idtnvlsain ppttlgltan drlernrfsi dydlrdaaar gfqtahvqfy 







361
yqestqdqda fetrggrlqs rsrsnhyser alggsafaes gfatgplahk llygvdgsid 







421
rikslregtv aspgesfpnk afpdtdyslf gafvqdqigf gkllvtpglr fdayrlspss 







481
gdplftgktv sssdhelspr vamlyeaspa lipyaqyahg frtptpdqvn nsfsnpiygy 







541
tsignpnlkp etsdtleagl rgtlgtgygp lrysvaafag ryrnfisqrt vggsgrpndp 







601
lvfqyvnfan arihgfegra ewvmpngftl ktamaftkgt tqdngaasep ldtvnpfsav 







661
fgvryepser wfaqadllwq agkrgrdvss aacqkktcft ppssfvvdlr ggyrfnkhvs 







721
aylgihnlfd rkywnwsdvr giaadsnvld aytapgrsva vsmkvdf 





606
88
124384342
YP_001028663
124
1
atgagccggg cgccgcatgc accgtcgcgc cgtcgcgcgt tcgggcgcgg gttcgcggcg 







61
gctgctcgcg ccgctcacgg cgcgcctgtc gcgcatggcg cgcacgcgtg cttcgccgcc 







121
gcgtcgcgtc gcgcgacggc cgtcgagcgc tgcgtcaccg cgctcgcgtg cgcggtgacg 







181
gcgtccggcg cgctcgccgc cgaggcggaa ccggccgacg ccgcgcgcgg cccgcatcgc 







241
gaactgccga ccgtgcgcgt gacgagcgac ggggtgcacg cgtcgccgct gtctacgccg 







301
ctcgcggccg gctcgcgcct gaagctcgcg agcctcgaca cgcccgcgag cgtcgaggtg 







361
atcacctccg cgcagatcgc cgcgcgcggc gatcgcacga tcgtcgacgc cgtcacgcgc 







421
gcgacgggct tcagcacggc cgcggcgccc ggcaacggcg gcacggcgct cagcgtgcgc 







481
ggcttcgccg gccaggaatc ggtgacgacg cttgtcgacg gcgtgcgcct ctaccccggc 







541
gcgggcaccg tcacgttccc gttctcgacg tggtccgccg agcgcatcga ggtgctgcgc 







601
gggccggcgt cggtgctgta cggcgaaggg gcgatcggcg gcgtcgccaa cgtcgtgacc 







661
cggcgcccgc gccgcgaacg ctcgacgacg ctgcaggcgg gcatcggcac gcaaggcgaa 







721
aagcgcgtgg cgctcgatac gacgggcgcg ctcggcgcgc gtctttccta tcgtttccat 







781
ctgagcgacg agcgcacgcg cggcttcgtc gagcgcggcg acgcgcacgt gacggccgtc 







841
ggcggcgcgc tgaagctcga tgtcgattcg cgtctgtcga tcacgctcga ttacgactac 







901
ggccgccaga agccggcgac ctatttcggc gtgccggcga cgaacggcgt gctcgagcgc 







961
gcgctgcgcg agcgcaacta caacgtcggc gacgcgacga tcgcatacca cgacacctgg 







1021
acgcggcttg ccgcgaccta tcgcgcgggc aacggcgtga cgctcgatgc ccagctctac 







1081
tatctcgcga cgcgccgcca ttggcgcaac gcggaatcgt atgcgctcga cccggcggcg 







1141
cgcaccgtcg cgcgcagcga ctatctcgag atcttccatc gcgagcggca gttcggcgag 







1201
cgcttcaccg cacgcatcga ctcgcgcgtg ttcggccgcg cgaaccggct cgtgctcggc 







1261
gccgagttca accagatcgc gttcgacggc gcgaacaacg cgccgtatcg cggcgaatcg 







1321
accgtcgccg ccgccggctt cgatcccggc gcgttcgcga gccccgatcc gacgctgccg 







1381
cgctttcgca cgcgcacgca tcaggccgcg gcgttcatcg agaaccggct cgaggtcctg 







1441
ccgcggctcg cgtgggtgag cgggctgcgt tacgaccatc tgtcgtttca tcgcgacgat 







1501
ctgatcgcgg gcggcgcgtt cgacaagacg ttcgcgcaca ccggctggcg cagcgggctc 







1561
gtctacgaga tcgcgccggg cctcgccgcg tacgcgcagt acacgacggg tgcgcagggc 







1621
gtgggctcgc tcgtcacgct gtcggcgtcg caggcgaact acacgctcgc gaccgggcgt 







1681
caatgggaag cgggcatcaa gcacgagatc gacggcgctc gcgcgtactg gacgctcgcc 







1741
gtctacgaca tcgtcaagcg cggcctcgtc agcgtcgatc cgctgaatcc ggcgcgcgcg 







1801
cagcagatcg gccgccagtc gtcgcgcggc gtcgagctcg cgggcggcgt gcggctgccg 







1861
ggcggtgtga cggtcgacgc aaacgcggcg ttgctgcgcg cgcgctacga cgcgttcgac 







1921
cagcgcgtcg gcgacacggt cgtgcagcgc gccggcaacg tgccgcacga catcgcgcag 







1981
cagagcgcga acctgtggat cggctgggcg ttcgcgcccg gctggcgcgc gaacgcgggc 







2041
ctgcgctacg tcggtccccg ctacggcgac gacgcgaacc gcgtgccggt gccttcttac 







2101
acggtgttcg acgcgtcgct cgcgtggcgg gcgacgcgcg acgccgacct cgtgctctat 







2161
gcgcgcaacc tcgcgaaccg cacgtacgcg gcgtcgacgt tgaacggcgg cgcgcaatgg 







2221
ctgctcggcc cgtcgcgctc ggccgagctc gtcgcgacgc tgcgcttcta g 









104
1
msraphapsr rrafgrgfaa aaraahgapv ahgahacfaa asrrataver cvtalacavt 







61
asgalaaeae padaargphr elptvrvtsd gvhasplstp laagsrlkla sldtpasvev 







121
itsaqiaarg drtivdavtr atgfstaaap gnggtalsvr gfagqesvtt lvdgvrlypg 







181
agtvtfpfst wsaerievlr gpasvlygeg aiggvanvvt rrprrerstt lqagigtqge 







241
krvaldttga lgarlsyrfh lsdertrgfv ergdahvtav ggalkldvds rlsitldydy 







301
grqkpatyfg vpatngvler alrernynvg datiayhdtw trlaatyrag ngvtldaqly 







361
ylatrrhwrn aesyaldpaa rtvarsdyle ifhrerqfge rftaridsrv fgranrlvlg 







421
aefnqiafdg annapyrges tvaaagfdpg afaspdptlp rfrtrthqaa afienrlevl 







481
prlawvsglr ydhlsfhrdd liaggafdkt fahtgwrsgl vyeiapglaa yaqyttgaqg 







541
vgslvtlsas qanytlatgr qweagikhei dgaraywtla vydivkrglv svdpinpara 







601
qqigrqssrg velaggvrlp ggvtvdanaa llrarydafd qrvgdtvvqr agnvphdiaq 







661
qsanlwigwa fapgwranag lryvgprygd danrvpvpsy tvfdaslawr atrdadlvly 







721
arnlanrtya astlnggaqw llgpsrsael vatlrf 





607
81
124383128
YP_001025604
125
1
atgacatcca cattcctgcg tcacgcgccc gccgcgcgtg aaggccacgc gcgccgccgg 







61
cggcgcgcga tcacgcttac cgttcccgcg ctcgccgcgg gcgcctttca cctcgcgccg 







121
gccgtcgcgc agacgagcga ggccgtgcac ggccacggca cgctcggcgc gtccggcgta 







181
gcgagccgcg ccgagaccga cgcggcgagc gccaaatcgg acggcgcggt tcgcgaagcg 







241
gcgagccgca cggcaaccgg cgccgcgccg gacgccacga cgctgccgac gatcgaaatc 







301
gtcgcggcgc ccgaatcgac gccgctcgtc gtcgtcaccg atccgaagac gccgcgccag 







361
ccgctgcccg cgagcgacgg cgccgattat ctgaagacga ttcccggctt cgcgtcgatc 







421
cgcagcggcg gcacgaacgg cgacccggtg ctgcgcggga tgttcggctc gcggctgaac 







481
attctcgcga acggcatgcc gacgctcggt gcgtgtcccg gccggatgga cgcgccgacg 







541
tcgtacatcg cgtccgagag ctacgacaag gtgacgctcg tcaaggggcc gcagaccgtg 







601
ctgtacgggc cgagcgcatc ggcgggcacg gtgctgttcg agcgcgtgac gccgcgcttc 







661
aagacgccgg gcatgcgctt tgacggcagc gtcgtcggcg gctcgttcgg gcgcaacgat 







721
cagaacgtcg acgtgacggc cggcacgccc gacttctacg ggcgcgtgag cgcgaaccat 







781
gcgcactcgc aggactacga ggacggcaac ggccgcacgg tgccgtcgca atgggacaag 







841
tggaacgcgg atgcggcgct cggctggacg cccgacgaca acacgcggct cgagctgacg 







901
gcaggcacgg gcgacggcta cgcgcgctat gcgggccgcg gaatggacgg cgcgcatttc 







961
cggcgcgaga cgttcggtct gaagttcgac aagaagcaca tcggcgacgt gctcgatcgc 







1021
atcgaggcgc aggtcttcta caacgaagcc gatcacgtga tggacaacta cacgttgcgg 







1081
atgcccgatc cgacgagcag catgccgatg cgcatggcct ccgaagtgcg ccgccgcacg 







1141
ctcggcgcgc gcgtcgcggc gacgctgcgc ttgaccgacg cgttcaagct cgtgacgggc 







1201
gtcgatgcgc agtcgaaccg cctcgactcg cgctcggcga tggggatgca gaactacggc 







1261
gacaagccgt ggaatccgca ggcgaacatg tggaacgcgg gcgcgttcgg cgagctgacc 







1321
tggtatgcga gcgatgcgtc gcgcgtgatc ggcggcgcgc ggatcgacta tgcggccgcg 







1381
cgcgacaagc gcgcgacgac gggcggcatg aagatgagca tgcgcaatcc gacgttcgac 







1441
gatctccgct cgcgcgtgct gccgagcggc ttcgtgcgct acgagcgtga tctcgcgtcg 







1501
ctgcccgtca cgtggtacgc gggcatcggc catgcgcagc gcttccctga ttactgggag 







1561
ctgttctccg ccaagcgcgg cccgaacggt tcgatcaacg cgttctccgc gatcaagccc 







1621
gagaagacga cgcagctcga catcggcgcg cagtacaaga gcgacaagct cgacgcctgg 







1681
gtgtccgcct atgcgggcta cgtgcaggac ttcatcctgt tcgactatgc gacgggcccg 







1741
atgggacaga tcacgcgggc gacgaacgtc aacgcgcaga tcatgggcgg tgaggtgggc 







1801
gcgtcgtggc gtccgctcgc gccgtggcgc ttcgaagggt cgctcgcgta tgcgtggggg 







1861
cgcaacgtgc aaagcggtgc gccgctgccg cagatgccgc cgctcgaggc acgcttcggc 







1921
gtcgagtaca ctcgcgggcc gtggtcggcg ggcgggctgt ggcgggtcgt tgcgccgcag 







1981
catcgctacg cgctgaacga gggcaacgtc gtcggcaagg acttcggtcc gagcgccggt 







2041
ttcggcgtgc tgtcgctgca cgcgcagtac aacgtgagca agacggtgca gatctcggtc 







2101
ggcgtcgaca acgtgctcga caagacttat gcggagcacc tgaacctcgc gggcaacgcc 







2161
ggtttcggct atccggcgaa tctgcctgtc accgaacccg gccgcaccgc gtgggttcgt 







2221
ttgagcacca agctctga 









105
1
mtstflrhap aaregharrr rraitltvpa laagafhlap avaqtseavh ghgtlgasgv 







61
asraetdaas aksdgavrea asrtatgaap dattlptiei vaapestplv vvtdpktprq 







121
plpasdgady lktipgfasi rsggtngdpv lrgmfgsrln ilangmptlg acpgrmdapt 







181
syiasesydk vtlvkgpqtv lygpsasagt vlfervtprf ktpgmrfdgs vvggsfgrnd 







241
qnvdvtagtp dfygrvsanh ahsqdyedgn grtvpsqwdk wnadaalgwt pddntrlelt 







301
agtgdgyary agrgmdgahf rretfglkfd kkhigdvldr ieaqvfynea dhvmdnytlr 







361
mpdptssmpm rmasevrrrt lgarvaatlr ltdafklvtg vdaqsnrlds rsamgmqnyg 







421
dkpwnpqanm wnagafgelt wyasdasrvi ggaridyaaa rdkrattggm kmsmrnptfd 







481
dlrsrvlpsg fvryerdlas lpvtwyagig haqrfpdywe lfsakrgpng sinafsaikp 







541
ekttqldiga qyksdkldaw vsayagyvqd filfdyatgp mgqitratnv naqimggevg 







601
aswrplapwr fegslayawg rnvqsgaplp qmpplearfg veytrgpwsa gglwrvvapq 







661
hryalnegnv vgkdfgpsag fgvlslhaqy nvsktvqisv gvdnvldkty aehlnlagna 







721
gfgypanlpv tepgrtawvr lstkl 





608
81
124381644
YP_001024240
126
1
tcagaactgc agcgtcgtca gcagcgacac ctgccgcgcg tcgcccaccg acacgaagaa 







61
ccggttcacg ctcgacgggt aatacgtgcg gttgaacaga ttcttcacgt tgagctggaa 







121
ctgcagcttc tgcttgccga gccgcgtgtc gtaggtcgcg aacgcgtcgg cggtcgcgta 







181
cgccggcagc gtgaagctgt tggccgaatc gcccgggcgc gcgccgacgt agcgccccgc 







241
cgcgccgatg cgcaggtcgt cgccgccgag caccgtgccc acgtcgtaga ccgccgcgag 







301
cgaggccgtg tgccgcgcga cgttccacag ccggttgtcc gcgtacagcg gatcttcggt 







361
cgtcttcgcg tcgatgtacg cgtagctcgc gatcacgttc caacgcgcac cgatccgccc 







421
cgacacgtcg agctcgatcc cgcgcgagcg cgcgcggccg gacgtgcgcc acgcggtctg 







481
gttcgtcgcg tcgtcgtatt gcgagacgag cacgtgcttc ttgtcgatgt cgaagaacgc 







541
gagcgtaccg gcgaggccgc ccggcatgtc gagcttcgcg ccgagctccc acgacgcgcc 







601
ttcctcgggc gcggtcgagc cgtcgatcac gtagccgccc gccatcggcg cgatcttcga 







661
ggtcggcttc agcgactgcg tgtagctgcc gtacagcgac agcgcgtcgt tccacttgta 







721
gacgatgccc gcgcgcggca gccacttcgt gccgctcagg ttcgtgttga cctggaacgg 







781
ccggccgcgc ccggcaagct ggctgtagcg gacccagcgc gcgccgccga cgagcatcca 







841
ccgctcgctc agatggatgc tgtcctggaa gaacagcgag gcggtgtgca gcgtgtcgct 







901
ctggtcgctg tcggaggccg atacgctcgt cgacggcggg acgagcccgt acgtcgggtt 







961
cagatagctg aacggcgtct tgatcggctg gcgcagcatg tccgcgcggt agacctgccg 







1021
gtattcgccg tcgacgccga actgcacgtc gtgccgcatc ccggcgagcg tcacgcggcc 







1081
gtcgacgtac gcgatgccgt agctgtcggt gctgcgcgaa ccgtgcgtcg cgtcgttgct 







1141
gcgcgtgagc gtgcccttga gcggatcgac cgccgtgatg cgaatctggt tcgcatcgta 







1201
ggtctcccgg ttgtagctgt agccgacgtg caccttccag tccggcgcga gctgatgatc 







1261
gatcgcgagc tgcgcgagat tcgattcgcc gcgcatgtcg ttgaacggct cgtcgagccg 







1321
gcggcgcgcc gggatcgcga gcggcgcgtt cgtgcgcgga tcgagcgcgg tgccgcgatc 







1381
gaacggcatc aggaagcgcc ggtactcgta cgacagcacc acctgcgtat cgcgcccgta 







1441
ccacgcgagc gaaggcgcga cgagcgtctc gcggtgctcg ccgtagttgc gccaatactg 







1501
ttcgttcgtc tgatcgacga tcagccggta tgcaacgcgc gactcgccga tcgcgcccgt 







1561
cgagtcgaac gtgagctcgc cgccgttcct gccgccgccg tacgtcgagc cgagcgccga 







1621
gatcgcgtgg cggcgcgcga gctgcggctg cttcgtgacg acgttgatca cgccgcccgg 







1681
gtccatgatt ccgtacagca gcgaggccgg gcccttcagc acttcgacgc tgtccgtcgt 







1741
cgcgttcagc gagcggccct gcacgatcgg cattccgttg cgcatcaccg agccgtcgcg 







1801
gttgtcgccg aagccgcgct tcatcaccgt gtcctgcgtg ctgccgagcg tgttgccctg 







1861
cgtgatgccg ctcacgttcg caagcgcgtc gtcgagattg cgcgggcgct ggtcgcgcag 







1921
cacctgctgc gcgacgatgg cgaccgcttg cggcacctcg gcgagcggcg cgtcgctgcg 







1981
cagccccgcc gcctcgcgcg gcggccggta gctgtccgcg tgcgggccgc tcgcgcgcac 







2041
ggcgacggtg ggcagcgtcg tgtcggccgc gagcgcggca tcggcgccgg cggcggccgg 







2101
cccgggcagc ctggcgagcg tgtagccgcc gctcggctgc cggagcgcga cgagccccgt 







2161
gcccgtcagc agccggtcga gcgccgcggc cggatccgcg cggccgtgca gcccggggct 







2221
tcgcaggccc gtggtcagct cgcccgggaa cgacagcagg atgccggcct cgcggccgaa 







2281
gcgggtcagc gccgcgtcga gcgggcccgc cggaatgtcg tacccgcgcg tcgcggcggc 







2341
cggctcggcc gcgcccgata cgcccggcgc ggcgcacgcg gagaacagcg cggcggcggc 







2401
cacgcgcgcg acgacgcggc gcggggcgaa ttgcggcatg cacggcgcgt gcaacgtgtg 







2461
cgacgcgtgc gatgtgtgcg atgtggcttg ccgcgcgcat cgcggcgcgg ccgcgtcgcg 







2521
cggccggcct gcgccgagcc gcgcggcggg ctgagccgcc tcgggcaaac cggccggcgc 







2581
gtgcgccgta tccgcgcaca cgccggcgtg cgcgcggcgg gggcggcgat tgcagattga 







2641
atccatgatc gatgactcgt tgacgatggg tggaatgcag ccttcatttc ccatgtcacg 







2701
cgagcatcga aaacagctca ggcccggcga aaaaatttcg atgaggcatg cggcgcgcgt 







2761
gcgcaa 









106
1
mrtraaclie ifspglscfr csrdmgnegc ippivnessi mdsicnrrpr rahagvcadt 







61
ahapaglpea aqpaarlgag rprdaaaprc arqatshtsh ashtlhapcm pqfaprrvva 







121
rvaaaalfsa caapgvsgaa epaaatrgyd ipagpldaal trfgreagil lsfpgelttg 







181
lrspglhgra dpaaaldrll tgtglvalrq psggytlarl pgpaaagada alaadttlpt 







241
vavrasgpha dsyrppreaa glrsdaplae vpqavaivaq qvlrdqrprn lddalanvsg 







301
itqgntlgst qdtvmkrgfg dnrdgsvmrn gmpivqgrsl nattdsvevl kgpasllygi 







361
mdpggvinvv tkqpqlarrh aisalgstyg ggrnggeltf dstgaigesr vayrlivdqt 







421
neqywrnyge hretlvapsl awygrdtqvv lsyeyrrflm pfdrgtaldp rtnaplaipa 







481
rrrldepfnd mrgesnlaql aidhqlapdw kvhvgysynr etydanqiri tavdplkgtl







541
trsndathgs rstdsygiay vdgrvtlagm rhdvqfgvdg eyrqvyradm lrqpiktpfs 







601
ylnptyglvp pstsvsasds dqsdtlhtas lffqdsihls erwmlvggar wvrysqlagr 







661
grpfqvntnl sgtkwlprag ivykwndals lygsytqslk ptskiapmag gyvidgstap 







721
eegaswelga kldmpgglag tlaffdidkk hvlvsqydda tnqtawrtsg rarsrgield 







781
vsgrigarwn viasyayida kttedplyad nrlwnvarht aslaavydvg tvlggddlri 







841
gaagryvgar pgdsansftl payatadafa tydtrlgkqk lqfqlnvknl fnrtyypssv 







901
nrffvsvgda rqvsllttlq f 





609
27
124385832
YP_001028928
127
1
ttactgcgcc gcttgcggcg cgcgcaggta gatctcgacg cggcggtttt gcgcgcggcc 







61
ggcttccgtt gcgttgtcgg cgatcgggtt cgacgcgccc atgccttgcg cggacaggcg 







121
gttcgccgcg acgccgcgct gcgcgagcgc gttcacgacg ctttgcgcgc gattttgcga 







181
cagcgtttga ttgtgcgcgg ccgagcccgt gctgtcggtg tagccgacga ccgacgcggt 







241
gatctgcggg ttctggttca gcgtcgtcgc gaggtcgttc agcagcggcg tgaacgccgg 







301
cgtgatcgcg tactggtcgg tcgcgaacgt gaccgagctg ggcacgttca gcttgagcga 







361
gccgtccggc tgttcggtga cctgcgtgcc cgtttgctgc gccgacggcg cgagcttgtt 







421
cttgatcgct tgccagttgt aacccgtcac gccgccgacg agcgcgccga cgccggcgcc 







481
gatcgccgcg cccttgccgc cgccagcgag cgcgccgatg cccgcgccga gcgcggcgcc 







541
cgtgccggtg ccgacagccg tgttggtgcc ttgctgggtt gcgcaacccg ccagcagggc 







601
gccggcgaga gcgaaaacgg acaagcgcgt cgcgattttg gtattcat 









107
1
mntkiatrls vfalagalla gcatqqgtnt avgtgtgaal gagigalagg gkgaaigagv 







61
galvggvtgy nwqaiknkla psaqqtgtqv teqpdgslkl nvpssvtfat dqyaitpaft 







121
pllndlattl nqnpqitasv vgytdstgsa ahnqtlsqnr aqsvvnalaq rgvaanrlsa 







181
qgmgasnpia dnateagraq nrrveiylra pqaaq 





610
19
124383503
YP_001028612
128
1
ttactgttga tagacgaggt ccgcacggcg gttctgcgcc cacgacgctt cgtcgtggcc 







61
gagcgcaacc ggcttttcct tgccgagact cacggcttcc atctgcgcat ccccgacgcc 







121
gagcagcgac agcgcgcggc gcacggcttc cgcacgcttt tggccgagcg cgaggttgta 







181
ctcgctcgtg ccgcgctcgt cggtgttgcc ctggatcagg atgtggcgtt gcggatggct 







241
cttcaggtat tgcgcgtgtt gctgcagcag cgcctggtat tggtcctgca ccgagtagct 







301
gtcgaaatcg aagtacacgc tgcgcttcgc gagcgggctg ttcggatcgt tcagcggatc 







361
gacggtcacc tgcgcgacgt tttcgggatt cggttgcgtg ctgaccgcgt cgccctggtt 







421
cgcgtgctcg tcgagcttca cgcccgactt gcatgccgcg agcgcgccga tcatcagcat 







481
cgcgaacgcc agacgaagtt tcttcgacat at 









108
1
mmskklrlaf amlmigalaa cksgvkldeh anqgdavstq pnpenvaqvt vdplndpnsp 







61
lakrsvyfdf dsysvqdqyq allqqhaqyl kshpqrhili qgntdergts eynlalgqkr 







121
aeavrralsl lgvgdaqmea vslgkekpva lghdeaswaq nrradlvyqq 





611
24
124386037
YP_001026945
129
1
atgaacggca cgaacgagct ctgctggcgc gatgcgttct ggacgccggc caccgccaac 







61
gccaagtgcg atggcgcact ggtcgcccag gcaccggcac cggcgccggt cgcaccggtt 







121
gctccggcca tcacgagcca gaagattacg taccaagccg atacgctgtt cgacttcgac 







181
aaggccgtcc tgaagccggc cggcaagcag aagcttgacg aactggccgc gaagatccag 







241
ggcatgaacg tcgaagtggt cgtggccacg ggctacacgg accgcatcgg ttcggacaag 







301
tacaacgacc gtctgtcgct gcgccgcgcg caagccgtca agtcgtacct cgtcagcaag 







361
ggtgtcccgg cgaacaaggt ctacacggaa ggcaagggca agcgcaaccc ggtcacgggc 







421
aacacctgca agcagaagaa ccgcaagcag ctcatcgcct gcctcgcacc ggaccgccgc 







481
gtggaagtcg aagtggtcgg cacgcaggaa gtgcagaaga cgaccgttcc ggcgcagtaa 









109
1
mngtnelcwr dafwtpatan akcdgalvaq apapapvapv apaitsqkit yqadtlfdfd 







61
kavlkpagkq kldelaakiq gmnvevvvat gytdrigsdk yndrlslrra qavksylvsk 







121
gvpankvyte gkgkrnpvtg ntckqknrkq liaclapdrr vevevvgtqe vqkttvpaq 





613
44
124383016
YP_001024349
130
1
tcagaacttg tgacggatcg ccgcgcgaac cgcgaactgg ttcgccgacg ccgaggggcc 







61
gtccgtcccg acgacgtagc cgccatccgc gatcgtgccc gtcttgtcgc ccgcgacctt 







121
ctggtacgcg ccctgcagat agacgtcggt gcgcttcgac aggttgtagt ccgccatcag 







181
gccgaccgtg tggtacttcg gcttgacgct gcccgcggcc gcatcgaact tgccgtccgt 







241
gtacacgtac tgcgcgccga tgaagagcgc cggcgtcagc tggtatttgc cgttgatttc 







301
gaagttctgg aacttcgtcg cgctcaggcc gaggcccgcg aacgtcgccg tcggcagata 







361
gaccgtcgag accgggttct tcacgtcggt cttcgtgtag acgaaaccga ccgtcgccgg 







421
gccgaacgtg tagttgacgc cgccgccgaa gatgcgcagg cggtccgccg tgaagttcgc 







481
gtcgttgtcg gcgatcgcgc cgcccgccgt cttgcccggg ttgttcgcct gcaggtacgc 







541
ggccgcgagt tgcaggccgc ccagcgtgta cgcggcgccg atgctgtact ggcggttgtt 







601
cgagaagccc gtcgcgttgc tgaagctgta cgtgccgccg acttgcagac cgttccagtc 







661
cgggctcgcg tacttgaccg tgttgttcac gcggaacgag ttgtccgtgt tgtcgttgtc 







721
gaacggatgc gagaacagcg tgccgcccca gttgccgttc gcggtcaacg gcgcgaggta 







781
gtcgacgagc gaatcgtact ggcggccgag cgtgagcgag ccgtattgcg cgtgcgacag 







841
gccgacgaac gcctggcggc cgaacatccg gccgccctgg ccgagccggc cgttgttcac 







901
gtcgaaaccg ttttcgagcg tgaagatggc tttcaggccg ccgcccaggt cttccgagcc 







961
gcgcaggccc cagcggctgc cctgcacgaa accgctcgcg agctggtagt tgctcttgcc 







1021
gacgccgttg acgttcacgt tgctcgtgta gttgaagcct tcgtcgatca cgccgtacag 







1081
cgtgacgctg ctttgggcga aaaccggcgc ggcgaaggcc agcaatgcgg ctgcggaaat 







1141
gacgtgcttt ttcat 









110
1
mkkhvisaaa llafaapvfa qssvtlygvi degfnytsnv nvngvgksny qlasgfvqgs 







61
rwglrgsedl ggglkaiftl engfdvnngr lgqggrmfgr qafvglshaq ygsltlgrqy 







121
dslvdylapl tangnwggtl fshpfdndnt dnsfrvnntv kyaspdwngl qvggtysfsn 







181
atgfsnnrqy sigaaytlgg lqlaaaylqa nnpgktagga iadndanfta drlrifgggv 







241
nytfgpatvg fvytktdvkn pvstvylpta tfaglglsat kfqnfeingk yqltpalfig 







301
aqyvytdgkf daaagsvkpk yhtvglmady nlskrtdvyl qgayqkvagd ktgtiadggy 







361
vvgtdgpsas anqfavraai rhkf 





614
39
124383230
YP_001024431
131
1
ttagaagcgg tgacgcagac caaccgtcgc tgcggtttgg ttgatcgacg tgctgaacgg 







61
cgtgctgagg tcgccgttgt agatcgacgc attcgcgttc ttcgacgaac gctggtacac 







121
ggcttgtgcg tacacgtcgg tgcgcttcga cagcgcgtag tcggcctgga cgccaacttg 







181
gttccagtga gtgctttcgc cgttcgcctt cgcgttcgtg tacgtgtacg tgtaagcaac 







241
acccagaccc agagccggcg tcaggttgta cttcacgttt gcttcgtagt tgtcggcacg 







301
gatggtcggc gcgccgtttg ccaggttgtc gagacgcgat tgcgtccaga gcaggccgcc 







361
ttgcagcggg ccgtaagcgt agcttgcacc ggcgccgtac acgcgcgagc ggccttggat 







421
gcttgctgcg ttcgtgccgc cgatgttgaa gccggtcagc ggatccgttg cgcccgtggt 







481
gttcgcgcca gcgttgttag cttgcgagta cgccgcaccg accttcaggc cttggaactg 







541
gtacgaagcg cccgcgctgt atgcacggtt gttcgcgaat tgcgagttgt tcgagaacga 







601
gtacgtgccg ccgaattgca ggccggcgta gttcgcgctc gtgaacttga ccgtgttgtt 







661
caccgcgacg tcgccgttcg tgttcaggcg gtcgttgttg agcgggtgcg cgaagtacgt 







721
gccgccccag gtgcccgttg ccgacagcgg cgacaggtag tcttgggttg cgtcgtactg 







781
acggcccagc gtgacggtgc cgtagttgct cgacagaccg acgaacgctt gacggttgaa 







841
catgccgcca ccgttgttga agcggccgtt accgatgttg aagccgcttt ccaacgtgaa 







901
gatcgccttc aggccgccac ccaggtcttc cgagccgcgc agaccgaaac ggctttggtc 







961
gacgccggcg ccgaccgacc acagcgactt gcccgagccc gacggcgtcg cgacgttgct 







1021
ttggtacgtg atgcccgcgt cgagcacacc gtacagcgtg acgctgcttt gcgcgtgcgc 







1081
gacggttgcg aacgatgcag caactgctgc aacaatcaga gtcttgttca t 









111
1
mnktlivaav aasfatvaha qssvtlygvl dagityqsnv atpsgsgksl wsvgagvdqs 







61
rfglrgsedl ggglkaiftl esgfnigngr fnngggmfnr qafvglssny gtvtlgrqyd 







121
atqdylspls atgtwggtyf ahplnndrln tngdvavnnt vkftsanyag lqfggtysfs 







181
nnsqfannra ysagasyqfq glkvgaaysq annaganttg atdpltgfni ggtnaasiqg 







241
rsrvygagas yaygplqggl lwtqsrldnl angaptirad nyeanykynl tpalglgvay 







301
tytytnakan gesthwnqvg vqadyalskr tdvyaqavyq rssknanasi yngdlstpfs 







361
tsinqtaatv glrhrf 





615
55
124381981
YP_001025679
132
1
ttagaacttg tggcgcaggc ccagagccac catctcttgc gacttcgtac cgttcacgcc 







61
gtacgagccg atcgatgcct gcgccgtttg cgtcgtcgtc gcgtcgacgc gctgcgtgcc 







121
gctcgcgtgc tggtacgcac cggcgaggta gacgtccgtg cgcttggaca gcgcgtagtc 







181
cgcgccgatc gacacttggt ggtacttcgc gtccgtgtcg ccgctcgcct tcgtgtatgc 







241
gtagccgagg ccgagcagca gcgccggcgt ggcctggtac ctcacgaagc cgcggcccgt 







301
gttgtacttc tcggtcgacg agaagcccga gaagccgtcc ggcttgtatt gcgcgttgct 







361
gtagccgagg ccgaccgtga ccgggccgaa cacgtactgg ccggccacct gtgcgatgcc 







421
gatcgacttc gcgctcgtgt agccgctgtt gatcgcgccg tcgaagatcg cgtcggacga 







481
gccgctccac gtcgagcgcg agccggccgt cggcgacggg ttcgacgtgt agaagtagcc 







541
ggccgcgacg ccgatcgggc cgttgttgta tgcggcggcg gccgaccacg tctggccttg 







601
gcccggcgag cccgcgatgc cgctgaagcc gtacagggct tcgaactgga agccggcgaa 







661
cacgggcgac gtgtacttga ccgtgttgtt cacgcgcagg ctgttgtcgt tgttgtcgac 







721
gtcacccggg gtggcgaaca ggctgccgaa gtagttgtcg gccgtcaccg gttgcacgag 







781
gtcgacgagc ggatcgtact gacggccgag cgtcagcgtg ccgtactggt catgctgcag 







841
accgacgaac gcctggcggc cgaacatgcg cttgccttgg ccgagcgtgc cgtcgttcga 







901
gttgaagcca ttttccaact ggaagatcgc cttcaggccg gcgccgagat cctcggtgcc 







961
cttcaggccc cagcggctgc cttgcaggtt gccgctgccc atcagccagg cgttgttggc 







1021
cttgccgtcg ttgccgtgga cataagtgat cgacgtatca atcacgccgt acagcgtgac 







1081
gctgctttga gcctgagctg caccggctgc gcccagcagc gcgagcgaga gggtagacag 







1141
agcgagtttc ttcat 









112
1
mkklalstls lallgaagaa qaqssvtlyg vidtsityvh gndgkannaw lmgsgnlqgs 







61
rwglkgtedl gaglkaifql engfnsndgt lgqgkrmfgr qafvglqhdq ygtltlgrqy 







121
dplvdlvqpv tadnyfgslf atpgdvdnnd nslrvnntvk ytspvfagfq fealygfsgi 







181
agspgqgqtw saaaaynngp igvaagyfyt snpsptagsr stwsgssdai fdgainsgyt 







241
saksigiaqv agqyvfgpvt vglgysnaqy kpdgfsgfss tekyntgrgf vryqatpall 







301
lglgyaytka sgdtdakyhq vsigadyals krtdvylaga yqhasgtqrv datttqtaqa 







361
sigsygvngt ksqemvalgl rhkf 





617
55
124385469
YP_001028406
133
1
atgaagcgaa aacatgcttt gactgcactc gcagtcgcgc tgctcgccgc gggctgcacg 







61
ctcgcgccgc gctacgagcg tccggccgcg ccggtgtcgg gcgcgttccc cgccgacggc 







121
gtctatgccg cgcagccggg cgccgcgccc ggcgcgcgca gcgcgaacgg ccaggcggcc 







181
gtcgatatcg gctggcgcga gttcttcgtc gatccgcgcc tgcagcggct gatcgagatc 







241
gcgctgaaga acaaccgcga cctgcgcgtg tcggtgctca acgtcgaagc gtcgcgcgcg 







301
cagtatcaga tcacgcgcgc ggggctcttc ccgacgttga gcggcaccgg cacgggcacg 







361
atccagcgca cgccggccgg cgtgtcgatc accggccagc cgctcatctc gcggacctac 







421
aacgtcggcg tctccgcgtc gtgggagctc gacctgttcg gccgcgtgca gagcctgaag 







481
gaccaggcgc tcgcgcaata cttcgccacc gcgcaggcgc gcaaggccgc ggagatctcg 







541
ctcgtcgcga gcgtcgccga tcagtacctg acgctgctgt cgaccgacga tctgctgcag 







601
gtcacgcaga acacgctgaa gtcggcgcgc gcgtcctacg atctgacgaa gctgcagttc 







661
gacaacggca ccggctcgga gctcgacctg tgccaggcgc agacggtggt cgagaccgcg 







721
ctcgcgagcc agcaggcgca ggcgcgcgcc cgcgcgcagg cgctcaacgc gctcgtgctg 







781
ctgatcggcg agccgctgcc cgacgatctg ccggccgacc tgccactcaa tgcgcagaac 







841
ctgctcaccg acattccggc cgggctgccg tccgatctgc tcacgcggcg gcccgacatc 







901
atgcaagccg aggagacgct gcgcgcggcg aacgcgaaca tcggcgcggc gcgcgcggcg 







961
ttcttcccga agatctcgct caccagcgcg ttcggcaccg cgagcacgac gctcggcggc 







1021
ctgttcaagg cgggcacggc ggcgtggtcg ttcgcgccga acatcgcgct gccgatcttc 







1081
gagggcgggc agaacatcgc gaacctcgat ctcgcgcacg tgcagaagcg catcgagatc 







1141
gcgaactacg agaaggcgat ccagagcgcg tttcgcgagg tgtcggacgg gcttgccgcg 







1201
cgcggcacgt acgatcagca gatcgcggcg ctcgagcgca acgagcacgc gcagcagcgc 







1261
cgctacgatc tgtcggacct gcgctacagg aacggcgtcg acagctatct gtcggtgctg 







1321
accgcgcaga cggacctgta ttcggcgcag cagcagttga tcagcgcgcg gctcgcgcgc 







1381
tggacgaacc tcgtggacct gtatcgcgcg ctgggcggcg ggtggctcga gcgcgcgggc 







1441
gagacgccgc gcccggcgga cgcgcccgtc gattacggca aggcggcgcc cgcgccggcg 







1501
gcggccgtgc cggctgcgtc ggccagcgcg ccggcggcgg gctga 









113
1
mkrkhaltal avallaagct lapryerpaa pvsgafpadg vyaaqpgaap garsangqaa 







61
vdigwreffv dprlqrliei alknnrdlrv svlnveasra qyqitraglf ptlsgtgtgt 







121
iqrtpagvsi tgqplisrty nvgvsaswel dlfgrvqslk dqalaqyfat aqarkaaeis 







181
lvasvadqyl tllstddllq vtqntlksar asydltklqf dngtgseldl cqaqtvveta 







241
lasqqaqara raqalnalvl ligeplpddl padlpinaqn lltdipaglp sdlltrrpdi 







301
mqaeetlraa nanigaaraa ffpkisltsa fgtasttlgg lfkagtaaws fapnialpif 







361
eggqnianld lahvqkriei anyekaiqsa frevsdglaa rgtydqqiaa lernehaqqr 







421
rydlsdlryr ngvdsylsvl taqtdlysaq qqlisarlar wtnlvdlyra lgggwlerag 







481
etprpadapv dygkaapapa aavpaasasa paag 





618
19
124381380
YP_001024890
134
1
atgcaaggcg acaagaaagt catcgaatat ctgaacgcgc agttgaaaaa cgaactgacc 







61
gcgatcaatc aatatttcct gcatgcgcgg atgtacaaac actggggcct cgaaaaactc 







121
ggcaagcacg aatacgacga atcgatcggc gaaatgaagc acgcggactg gctgatcgag 







181
cgcgtgttca tgctcgacgg cctgccgaac ctgcaggatc tgcacaagct gctcgtcggc 







241
gaggaaaccg aggaaatcct gaaatgcgac ctgaagctcg aacaggtatc gcaggccacg 







301
tgcaaggaag ccatcgctta ttgcgaatcg gtgcgtgatt acgtgtcgcg cgaaatcttc 







361
gagaaaatcc tcgacgatac cgaggagcac atcgactggc tcgagacgca aatcgacctg 







421
atcggcaagg tcggcattca gaactatcag cagtcgatga tgggctcgcc ggaataa 









114
1
mqgdkkviey lnaqlknelt ainqyflhar mykhwglekl gkheydesig emkhadwlie 







61
rvfmldglpn lqdlhkllvg eeteeilkcd lkleqvsqat ckeaiayces vrdyvsreif 







121
ekilddteeh idwletqidl igkvgiqnyq qsmmgspe 





619
8.5
124382608
YP_001024892
135
1
atgatcgtct gcgtgtgcaa gtccgtttcc gatcggaaga ttcgcgcatc cctcgcggaa 







61
ggcgtgaaca cgttcgaaga actccagttc gaactcgggg tcgccacctg ctgcggcaag 







121
tgcgaggaaa ccgtgcgcga gatcatggcg gaacagggcg tttgtgcgag ccgctgcggt 







181
gccgagcgcc ccgcggccgt accggtcgcc gtcacgttct acgaacgcaa ggcggcctga 









115
1
mivcvcksvs drkiraslae gvntfeelqf elgvatccgk ceetvreima eqgvcasrcg 







61
aerpaavpva vtfyerkaa 





620
36
124382074
YP_001024065
136
1
atgctgaatt ttctggtcaa acgactgttc gggctgctgc cgacgctcgc gtgcgtcgcg 







61
gtgctggtgt tcctgttcgt ccacctgctg ccgggcgatc ccgcgcggct cgcggccggc 







121
cccgaggccg acgacgcgac cgtcgcgctc gtgcgcgccg atctcggcct cgacaagccg 







181
ctgccggcgc agttcgcgag cttcttcgcg cggatcgcgc acggcgactt cggcgtctcg 







241
acgcgcagca agcggcccgt agcgaccgag atcggcgagc gcttcatgcc gacgctgacg 







301
ctgaccgtcg tcagcatggc ctgggcgacg ctcttcggca tggcgatcgg catcgcgtcg 







361
gccgtgtggc gcaaccgctg gccggaccgc atcggcatga cgctcgcggt gtcgggcatc 







421
tcgtttccgg cgttcgcgct cggcatgctg ctgatggagg tgttctcggt gaagctcggc 







481
tggctgcccg tcgtgccgga cggctcgtgg aagagctacg tgctgccgtc ggtgacgctc 







541
ggcgcggccg tggccgccgt gatggcgcgc ttcacgcgcg cgtcgttcgt cgaggtgctg 







601
aacaaggatt tcgtgcgcac cgcgcgcgcg aagggcgtgc gcgagccgat ggtcgtgctc 







661
aagcactgcc tgcgcaacgc gatgattccg gtcgtcacga tgatggggct gcagttcggc 







721
ttcctgctcg gcggctcgat cgtcgtcgag gcggtgttca actggccggg gctcgggcgc 







781
ctgctcgtcg atgcggtgac gatgcgcgac tacccggtga tccaggcgat cgtgctgctg 







841
ttctcgctgg agttcatcct gatcaacctg agcgtcgacg tgctgtacgc ggtcatcaac 







901
ccgacgatcc gtttcaagtg a 









116
1
mlnflvkrlf gllptlacva vlvflfvhll pgdparlaag peaddatval vradlgldkp 







61
lpaqfasffa riahgdfgvs trskrpvate igerfmptlt ltvvsmawat lfgmaigias 







121
avwrnrwpdr igmtlavsgi sfpafalgml lmevfsvklg wlpvvpdgsw ksyvlpsvtl 







181
gaavaavmar ftrasfvevl nkdfvrtara kgvrepmvvl khclrnamip vvtmmglqfg 







241
fllggsivve avfnwpglgr llvdavtmrd ypviqaivll fslefilinl svdvlyavin 







301
ptirfk





622
56
124383712
YP_001026175
137
1
atgaaaacct tcccgttgtc cgcttgccgg accgccgtgg ccgtcgcggt cgccgcgctc 







61
gcgctcgcgg gatgcgcgaa ctatttcggc ctcaagaacg acaaggcgat cgctcccgcg 







121
gcgcagttcg aaagcgcgcg gagcctgccc gcgcagggcg gccaatggcc gtcgctcgac 







181
tgggcgaacc agttcggcga tccgcaattg ccgaagctga tcgacgaggc gctcgaaggc 







241
aatccgacga tcgcgcaggc gcaggcgcgc atcgcgaagg cgtcgtcgta catcgagtcg 







301
tcgcgctcga cgctgctgcc gaaggccgac gcgaagtact cgtggacccg cgagctgtac 







361
tcgggcaacg ctcttttccc gcccccgtac ggcggccagt ggtacagcga gaacaacgcg 







421
ctcgcgagcg cgtcgtggga gctcgatctg tggggcaaga accgcgcgcg cctgaatgcg 







481
gccgtgtcgc aggaaaaggc cgccgaagcc gacttgcagc aggcgcgcgt gacgctcgcg 







541
acgtcggtcg cgcgcacgta caaccagctc gcgcaactgt acgcgctgcg cgacatcgcc 







601
gggcgcgaga tctcgaaccg gcagacggtc gacaagatca ccgacggccg cgtcggcgcg 







661
ggcctcgaca cgaacgtcga gcgccagacc gcgcgcggca acatcgcgac gagccagtcg 







721
acgctgtccg atctcgacgg ccagatcacg aacgtccgct atcagctcgc cgcgctgctc 







781
ggcaaaggtc cggaccgcgg gctgcagatc gccgcgcccg tgctgagcgc gggcggcgcg 







841
gtcgcgctgc cggagaacct gcccgccgat ctcgtgtcgc gccgcccgga tctcgtcgcc 







901
gcgcgctggc aggtcgaagc ggcgatgcac gacgtgaagg aagcgaaagc cgagttcttt 







961
ccggacatca acctcgcggc gggcttcggc ttcgacgcgt tcggctgggg ccgattcctg 







1021
aacttcgcga gccgccaggc gcagttcggc cccgcgatcc acctgccgat cttcgacggc 







1081
ggcgcgctgc gcgcgcaatt gaagggccgc tacgcggact tcgatctgtc ggtcgcgaac 







1141
tacaaccaga cgctcatcaa cgcgctgaac gacgtcgcga cgcaagtcgc gtcgattcgc 







1201
tcggtcgaca cgcaaatggg cgacgcgcag cgcgcgctcg atgcgtcgac gcgcgcctac 







1261
gagctcgcgg tgatccgcta caaggcgggc ctgtcgccgc agttgcaggt gctcaacgcg 







1321
gacagcaacc ggctcgccgc cgagcagacg gtgacgaacc tgaagatgcg ccgccgcgac 







1381
ctgcagatcg gcctcgtgaa ggcgctcggc ggcggcttcg acgcggccgg cacgcggctc 







1441
gccgcgcccg cccccgcctc ggcgccggcc gcgcccgcgc agcacgcatc gaactga 









117
1
mktfplsacr tavavavaal alagcanyfg lkndkaiapa aqfesarslp aqggqwpsld 







61
wanqfgdpql pklidealeg nptiaqaqar iakassyies srstllpkad akyswtrely 







121
sgnalfpppy ggqwysenna lasasweldl wgknrarlna aysqekaaea dlqqarvtla 







181
tsvartynql aqlyalrdia greisnrqtv dkitdgrvga gldtnverqt argniatsqs 







241
tlsdldgqit nvryqlaall gkgpdrglqi aapvlsagga valpenlpad lvsrrpdlva 







301
arwqveaamh dvkeakaeff pdinlaagfg fdafgwgrfl nfasrqaqfg paihlpifdg 







361
galraqlkgr yadfdlsvan ynqtlinaln dvatqvasir svdtqmgdaq raldastray 







421
elavirykag lspqlqvlna dsnrlaaeqt vtnlkmrrrd lqiglvkalg ggfdaagtrl 







481
aapapasapa apaqhasn 





623
85
124383487
YP_001026988
138
1
tcacagcgac catttcaact ctccataaaa agtgcggccc ggataaggat ggaacacgta 







61
gtagcggcga tcggtcacgt tgtcgatgcc gaacgacgcg agccagtgcc gatcgaaccg 







121
atagcgcgcc ttcaggtcga cgaccatgaa cgagctggtg ccgccgtata cgcccggatt 







181
cacgtcgctg ttgtcgagcg cgttgtactg ccgccccgaa tagcgaacgc cgacgctcgt 







241
catccaatgc tcgtcgaagc ggtacgacgc gagcaggttc gcgcgcatcc gcggaatccg 







301
cggccagcgc gcgccgacgt agttcggatt ggccgcatcg gcaagcgtct gcgcattcgt 







361
cgcggacacg ttcgcgtcga cgtcgagccc cttgagcgcg acgtcctgcc ccgagaacgc 







421
gagctcgacg ccgcgcaccc gcacgcggtc gacgttcgag acgttcgtgt acgtcgacgc 







481
gccggcaagc gtggtctggc tgtagatcga attgcgcaga tcgctctgga acacgctcgc 







541
gcgcacgacg ccgaagccga cgtcgcgctc ggcggtgaaa tcccagtcga tcgccttttc 







601
cggctgcaga ttcgggttgt tgttgacgat cgcgttgttc gagatcgtgc cctggaacag 







661
ctcggcgacc gtcggaaagc gcgtgcccgt cgcgaacgac aaccgcaggc gccacgcgct 







721
ggccggctgc cattcgagcg agagcttcgg cgagaacgcg gtcgcgccgc gctccgcgta 







781
gccgagcgtc gcggtcgcgc cgccgagctc gccgccgtat gcatcccagc gctcgtagcg 







841
caggccgagc gtcgcgagcc agtcgggcgc gacgcgccac gcgtcctgcg catacagcgc 







901
ctgcgtgcgc gtgttgccgc gatagcggtt cacgagcgtc gtcggcaccg cgttttgcca 







961
gtcggccgtg ttgtaggtcg cgttgcgcag gaaatacgta tcgaagtggt agccgaacga 







1021
gaaacggtgc ccgcgcacgt cgggcgactc cgcgcgcaag tcgacggtgc gccagccggt 







1081
gccgtcgcca tggaacaccg tgcccggccc gccgtcccag gcgccgggcg gtgcgccgga 







1141
cgcgctgcgc agcacgtcgc gcgacacctc gtaggcggac gcgatggccg acagcttcca 







1201
gccggacgcg agccgcgcgt cgagcccgag cccgtacagc cagttctcct ggtcgcccgt 







1261
ctgcggcgcg aacgcgcccg gcgacaccgt gtagttgcgc ccgccgaacg acacgttgcc 







1321
gccgtacacc gggttgcccg ccgcgtcgcg caggaacgtg tcgccgtgct gccggtagtg 







1381
attctcccag tggccgagcg tgaccgtcgc gtcgacgtga tcggtgaacg cgtagccgaa 







1441
gcgcaacgtc tcgttgagct gctcggtgcg ctcgatcgtc tgcgcgccga cgatcgtccg 







1501
cggccggccg ttcggcccga tgtcggagac ggcccccgtc accggcacgg gtgcgccgag 







1561
cctcggatcg aacgcgccgt tcgggctcgc gtattgcatc ggctggctgt cgttctcgag 







1621
ccggtcgagc gacagcgcat accagaaacg tccgacgcga tcggcgatgc gcgcgctctg 







1681
gtgattgccg ccgaagctgt cggcgaagcc gtagccgtcg cgatagcgct gcgtgaagaa 







1741
ctgcgtcgac accgacgcct cgagccgctg cggcttgtgc gtggtgatct gcacggtcga 







1801
gccgatcgcg ttgcccgggt agagcgcgga aaacgggccg tacagcacgt ccacgcgtgc 







1861
gatgtcgtcg ggctggatca gcgaccagcg cggcggatac gaatagctgg aaccgagcag 







1921
attggacagc aggatgccgt ccgcatagac gagtccgcgc gcgctctgca gctcattgaa 







1981
gtctcggccg gcgaacactg aattgcggtc gccgatgtag cgccggcgca ccatcaggtt 







2041
cggcgcgtac ttgagcgcgt cttcggtcgt cacgttgacg tgcgaatcga tctgctcgcg 







2101
cgtgagcgcc tcgacgacgc cgggcgtgtc gggcgcgaac gcggcgcgct gcgcggtgac 







2161
cttgacggta tccaaggttt tcccggatgc ggtcatctcc gcggcggcgg ccggcgttga 







2221
atcggcaggc acggcggcgg ccgtcgaagc cgcgtgggcg gcgggccagg cgagcgccgc 







2281
ggcgcaggcg gccgcgagcc gcctgcgcgc gccgggcgcg gcaagttgat ggaacat 









118
1
mfhqlaapga rrrlaaacaa alawpaahaa staaavpads tpaaaaemta sgktldtvkv 







61
taqraafapd tpgvvealtr eqidshvnvt tedalkyapn lmvrrryigd rnsvfagrdf 







121
nelqsarglv yadgillsnl lgssysyppr wsliqpddia rvdvlygpfs alypgnaigs 







181
tvqitthkpq rleasystqf ftqryrdgyg fadsfggnhq sariadrvgr fwyalsldrl 







241
endsqpmqya spngafdprl gapvpvtgav sdigpngrpr tivgaqtier teqlnetlrf 







301
gyaftdhvda tvtlghwenh yrqhgdtflr daagnpvygg nvsfggrnyt vspgafapqt 







361
gdqenwlygl gldarlasgw klsaiasaye vsrdvlrsas gappgawdgg pgtvfhgdgt 







421
gwrtvdlrae spdvrghrfs fgyhfdtyfl rnatyntadw qnavpttlvn ryrgntrtqa 







481
lyaqdawrva pdwlatlglr yerwdaygge lggatatlgy aergatafsp klslewqpas 







541
awrlrlsfat gtrfptvael fqgtisnnai vnnnpnlqpe kaidwdftae rdvgfgvvra 







601
svfqsdlrns iysqttlaga stytnvsnvd rvrvrgvela fsgqdvalkg ldvdanvsat 







661
naqtladaan pnyvgarwpr iprmranlla syrfdehwmt svgvrysgrq ynaldnsdvn 







721
pgvyggtssf mvvdlkaryr fdrhwlasfg idnvtdrryy vfhpypgrtf ygelkwsl 





624
78
124382059
YP_001025431
139
1
atggcgcccg cggcgacgcg ggcgccgagc cgtcgtatcg atgcgctgcc tcacgggcct 







61
tcctgtgcgc tgatattgat acgttatatc gtttcctgtt catcgcgcga ttcggcccgt 







121
gattccgtcc accccgacca caccacaccg attaccgcaa tggatgacca ccgacgtatc 







181
gcgccgcctt tcgctcgcag gctgcatccc ttgtcgctgc tgctcgccgc gagcctcgcg 







241
catggcgaga cgggcgcgcc gcccgccgag cgccgatcgg acgcgccgcc cgcgacggcg 







301
ctcgcgccga tcttcgtgac cgcgaacccg ctcggcgcgt cggcgctgag ctcgcccacc 







361
gcgtcgctgt ccggcgacgc gctgacgctg cgccgcaccg attcgctcgg cgacacgctg 







421
aacggcctgc ccggcgtgtc cacgacgacc tatggcccgc tcgtcggccg tccgatcatc 







481
cgcgggatgg acggcgatcg catccgcctc ctgcagaacg gcgtcgccgc gtacgacgcg 







541
tcgtcgctgt cctacgatca cgcggtgccg caggatccgc tgagcgtcga gcgcatcgag 







601
atcgtgcgcg ggccggcggc gttgctgtac ggcggcaacg cggtcggcgg cgtcgtcaac 







661
acgatcgaca accggattcc gcgcgaagcg atcacgggcg tctcgggcgc gctcgacgcg 







721
agctacggcg gcgcgaacaa cgcgcgtgcg ggcgcggcgc tcgtcgaagg cggcaacggc 







781
cgcttcgcgt ttcatctcga cgcgttcggc cgcgagaccg acgcgctgcg gattccgggc 







841
catgcgcatt cggcgcgcca gcgcgcgctc gacggcgagg acgccagcga gccgtacggc 







901
aggctgccga acagcgacgg ccgccgttac ggcggcgcgg cgggcggctc gtacacgtgg 







961
gcggacggct acgtcggcgc atcgtacagc ggctacgaat cgaactacgg ctccgtcgcc 







1021
gaaaccgacg cgcggcttca gatgcggcag gagcgcgtcg cgctcgcgtc ggaggtgcgc 







1081
aatctgcgcg ggccgttctc gcagctgaaa ttcgacttcg gctacacgaa ttaccagcat 







1141
aaggaaatcg aggacggcgt gacgggcacg acgtttcgca atcacggcta cgaggcgcgc 







1201
gtcgaggcgc ggcaccgcaa gctcggcccg ttcgaaggcg cgctcggcgt gcaggtcgga 







1261
cagaacacgt tctccgcgct cggcggcgag gcgctcgcgc ctaccacgcg cacgacgagc 







1321
gtcgcgctgt tcggcctcga gcagtggcag gcgaccgacg cgctgaagct gtccgccggc 







1381
gcgcgcatcg agcacgtgcg gctcgatccg agcgcgaacg gcgacgacaa gttcggcttc 







1441
gcgcgctcgc gcgatttcaa tgcgggcagc gtgtcggcgg gcgcgctgta ccagctcgcg 







1501
cccgcgtggt cgctcgcggg caacgtgtcg tacacggagc gcgcgccgac cttctacgag 







1561
ctgtacgcga acggcccgca cggcgcgacg ggacagtacc tgatcggccg gcccgatgcg 







1621
cagaaggaga aggcggtgtc gaccgatctc gcgctgcgct atgcgagcgg tccgaaccgt 







1681
ggcagcatcg gcgtgttcta cagccgcctg cggaactacc tcgccgaata cgacaccgga 







1741
cggctcgtcg acgacgacgg cgtgcccgtg gccccgggcg ccgacgacgc gctgcgcgag 







1801
gccgtctatc gcggcgtgcg cgcggagttc tacggcgtcg agctcgaagg caggtggcgc 







1861
gcgttcgaac ggcgcgggca tcgcgtcgat ctcgagctga gcgccgacta cacgcacgcg 







1921
cgcaacgcgg acacgggcga gccgctgccg cggatcgcgc cgctgcgcgc gacgctcgcg 







1981
gccgattacg gctacggccc gttcggtgcg cgcgcgcagc tcacgcatgc atgggcgcag 







2041
catcgcgtgc ccgagcacga tctcgcgacc gacggctaca cgtcgctcgg cgtcgtgctc 







2101
acctacaagt tgcgcgtcgg cgcgacgaac tggctcgcct atctgcgcgg cgacaacctg 







2161
acgaaccagg acatccgcta cgcgagctcg gtggtgcgca acatcgcgcc gcagggcggg 







2221
cgcagcgtga gcatcgggat gcgcacgaca ttctga 









119
1
mapaatraps rridalphgp scaliliryi vscssrdsar dsvhpdhttp itamddhrri 







61
appfarrlhp lslllaasla hgetgappae rrsdappata lapifvtanp lgasalsspt 







121
aslsgdaltl rrtdslgdtl nglpgvsttt ygplvgrpii rgmdgdrirl lqngvaayda 







181
sslsydhavp qdplsverie ivrgpaally ggnavggvvn tidnriprea itgvsgalda 







241
syggannara gaalveggng rfafhldafg retdalripg hahsarqral dgedasepyg 







301
rlpnsdgrry ggaaggsytw adgyvgasys gyesnygsva etdarlqmrq ervalasevr 







361
nlrgpfsqlk fdfgytnyqh keiedgvtgt tfrnhgyear vearhrklgp fegalgvqvg 







421
qntfsalgge alapttrtts valfgleqwq atdalklsag ariehvrldp sangddkfgf 







481
arsrdfnags vsagalyqla pawslagnvs yteraptfye lyangphgat gqyligrpda 







541
qkekaystdl alryasgpnr gsigvfysrl rnylaeydtg rlvdddgvpv apgaddalre 







601
avyrgvraef ygvelegrwr aferrghrvd lelsadytha rnadtgeplp riaplratla 







661
adygygpfga raqlthawaq hrvpehdlat dgytslgvvl tyklrvgatn wlaylrgdnl 







721
tnqdiryass vvrniapqgg rsysigmrtt f 





625
43
124383587
YP_001028404
140
1
atgcgcgtcg aacgggttcc ataccgctta atcactgtcg cgacggctgc cgttttcctg 







61
gccgcgtgcg gaaaaaaaga atcggctccg ccccctcaaa cgcccgaagt cggcgtcgtc 







121
accgtccagc cgcagcccgt gccggtcgtc tccgaactgc cgggccgtac gagcgcctat 







181
ctggtcgcgc aggtgcgcgc acgggtcgac ggcatcgtgt tgcggcgcga gttcacggaa 







241
ggcagcgacg tcaaggccgg ccagcgcctg tacaagatcg atccggcacc ctatatcgcg 







301
caattgaaca gcgcgaaggc gacgctcgcg aaggcgcagg cgaacctcgc gacgcagaac 







361
gcgctcgtcg cgcgctacaa ggtgctcgtc gccgcgaacg cggtcagcaa gcagcagtac 







421
gacgatgcgg tggccgcgca agggcaggcg gccgccgacg tgggcgcggg caaggccgcc 







481
gtcgagaccg cgcagatcaa cctcggctat acggatgtcg tctcgccgat cacgggccgc 







541
gtcggcatct cgcaggtcac gccgggcgcg tacgtgcagg cgagccaggc gacgctgatg 







601
tcgaccgtcc agcagctcga tccggtctac gtcgatctca cgcagtcgag cctcgacggc 







661
ctgaagctgc gccaggacat ccagagcggg cgcatcaaga cggaaggccc gggcgcggcg 







721
aaggtcacgc tgattctcga ggacggcaag ccgtacccgg agcggggcaa gctgcagttc 







781
agcgatgtca cggtcgacca gacgacgggc tcggtcacga tccgcgcgat cttcccgaac 







841
aagcagcgcg tgctgctgcc gggcatgttc gtgcgcgcgc gcatcgaaga gggcgtcaac 







901
gagaacgcgt tcctcgttcc gcagatcggc gtcacgcacg atccgaaggg ccaggcgatc 







961
gcgatgatcg tcgacggcaa gggcaaggtc gagccgcgcg tgctggtcac gggcggcacg 







1021
cagggccaga actgggtggt cgagagcggc ctgcaggcgg gcgaccgcgt gatcgtgcag 







1081
ggcatcgaca aggtgcgccc gggcatgacc gtgaaggccg ccgaggctca attgccggcc 







1141
gcggcggccg gcgcgtcggg tgccgcgccc gcgggcggct cgcccgccca ggccgcggcg 







1201
gcgagcgcgg ccgcgtcggg tgccgcgccg tcgagcgctg ccgccgcgtc gagcgcgcaa 







1261
taa 









120
1
mrvervpyrl itvataavfl aacgkkesap ppqtpevgvv tvqpqpvpvv selpgrtsay 







61
lvaqvrarvd givlrrefte gsdvkagqrl ykidpapyia qlnsakatla kaqanlatqn 







121
alvarykvlv aanayskqqy ddavaaqgqa aadvgagkaa vetaqinlgy tdvvspitgr 







181
vgisqvtpga yvqasqatlm stvqqldpvy vdltqssldg lklrqdiqsg riktegpgaa 







241
kvtliledgk pypergklqf sdvtvdqttg svtiraifpn kqrvllpgmf vrarieegvn 







301
enaflvpqig vthdpkgqai amivdgkgkv eprvlvtggt qgqnwvvesg lqagdrvivq 







361
gidkvrpgmt vkaaeaqlpa aaagasgaap aggspaqaaa asaaasgaap ssaaaassaq 










BcAU1054-700












ID
MW
Protein GI#
Protein Locus#
SEQ ID NO:
Sequence
















701
81
107022717
YP_621044
165
1
atgaaaaaag tggagcagaa gaagatggag tgggcaacag gcacgcgttt gcgagagatc 







61
gcagccgcgg caagcgtggc gttcggtacg gcggcggcag ggcacgcata cgcccagacg 







121
gcgccggccg tgaacgcggg cgccacggcg tcggccagca gtgcgcaaac cggcgcgacg 







181
gcgacgacgt cgaccagcgc gcaaaacggc acgctgccgg cgatcaccgt caacgcggcc 







241
tcggcaggtg acggcacggt cgggctcgtc gcgaagcgca gcacgaccgg caccaagacc 







301
gacacgccgc tcaacgagat cccgcagacg atcaacgtcg tcaccgcgca gcagatcgag 







361
atgaccggcg ccaccgacgt gaaagaggcg ctgcgctacg tgccgggctt ctcgtcgtac 







421
gggtcggaca accgctccga ctggtacgcg gcgctgcgcg gcttcacgcc gaccgcctac 







481
gtgaacgggc tgcaggtgcc gaacacgatc aacctcgcga gctggcgcgt cgatccgtac 







541
atgatcgaca gcatcagcgt gctgcgcggg ccgacctcgg tgctgtacgg agagggcgat 







601
ccgggcgcga tcatcgacgt gcacaccaag ctcgccgacg gcgagcgcgt gcgcgaagca 







661
ggcgtgcaga tcggcaacta cgcgcgcaag cagttcatga tcgacgtcgg cgacaagctc 







721
gacccggacg gcaagtatgc gtaccggttc gtcggcgtcg cgcgcgacgg caacgcgctg 







781
acgggcccga acaacgacca gcgcgtcgcg ctcgcgccgt cgttccgctg gcgcccggac 







841
gcggatacgt cgctgacgct gtccgcgacc tacctgcagg actggggcga catctcgtcg 







901
aacttcctgc ccgcgcaagg cacggtgctg ccgaacccga acggccagat caacaaggac 







961
atctacgaag gcgacggcaa cttcaactac taccgcaaga agcagtggtc ggtcggttac 







1021
cagttcgagc gcaacctgac gccggcctgg acgttccgcc agaacacgcg gctgatgcac 







1081
ctgtcgctcg acaacggctc ggtgttcggc aacggcttcg tcgaaggcag cacgaccgac 







1141
gtgtcgcgct gggccggcgt gttccagatg aactacagcc gcttcgacat cgacaacaac 







1201
ctggagggcc gcttcgcgac gggcccgttg cagcacacgc tgctgctcgg cttccagtac 







1261
aaccgccaga ccgcgaccga cagcgaatgg ctcgccgccg cgccgccgct gaacatctac 







1321
aacccggtgt accagcccgt cacgacggcg gtgttcacgc ccgacgcgac gatccgcacc 







1381
aacacgtaca cgacgatcaa cacgttcggc ctgtacgcgc aggaccagat caagtggaac 







1441
cgctggacgc tgacgctcgg cggccgcgag gactgggtca acatgcggat ggacgaccgc 







1501
gcggccggca cgtcgacgaa ggcggacgtc acggcgttca ccggccgcgt cggcctcacg 







1561
taccagggcg actacgggct gtcgccgtac gtcagctacg cgacgtcgtt caatccgctg 







1621
atcggcgtga acctggtcgg cggcgggctg ccgcagccga cgcgcggcaa gcagatcgaa 







1681
gccggcctgc gctggcagcc gcccggcaag aacctgatgc tgaacgcggc gatctaccag 







1741
atcaaccaga ccaacgtgct cacgtcggcg ctgccgagcc aggacccgac cggcacgaag 







1801
tcggtgcaga cgggcgaagt gcgctcgcgc gggatcgagc tgagcgcgac cggcaaggtc 







1861
acgcgaaacc tgtcggtgat cgcgtcgtac gtctatcagg acgtgaagaa cgtgcaggcc 







1921
aacgacgtct cgctgaacaa ctggccggtc gacattccgc gtccgcgcca gatggcgtcg 







1981
ctgtggaccg actggacctg gtacacgggg ccgctcgccg gcttcggcct cggcggcggc 







2041
attcgctacc agagcgcgtc ggccggtgcg gccgacaact cgctgacggt atcgagcgtc 







2101
acgctgttcg acgcgggcct gcattacgac gtgcgtaact ggcgcttcgc cgtgaacggg 







2161
acgaacctgt tcaaccgcca ttacatcagc ggttgccagt cgaacaacgt gtgcattttc 







2221
gggaccgacc gcaccgtgat cgcgaccgcg aaatacaact ggtga 









141
1
mkkveqkkme watgtrlrai aaaasvafgt aaaghayaqt apavnagata sassaqtgat 







61
attstsaqng tlpaitvnaa sagdgtvglv akrsttgtkt dtplneipqt invvtaqqie 







121
mtgatdvnaa lryvpgfssy gsdnrsdwya alrgftptay vnglqvpnti nlaswrvdpy 







181
midsisvlrg ptsvlygagd pgaiidvhtk ladgervrea gvqignyark qfmidvgdkl 







241
dpdgkyayrf vgvardgnal tgpnndqrva lapsfrwrpd adtsltlsat ylqdwgdiss 







301
nflpaqgtvl pnpngqinkd iyegdgnfny yrkkqwsvgy qfernltpaw tfrqntrlmh 







361
lsldngsvfg ngfvegsttd vsrwagvfqm nysrfdidnn legrfatgpl qhtlllgfqy 







421
nrqtatdsew laaapplniy npvyqpvtta vftpdatirt ntyttintfg lyaqdqikwn 







481
rwtltlggre dwvnmrmddr aagtstkadv taftgrvglt yqgdyglspy vsyatsfnpl 







541
igvnlvgggl pqptrgkqie aglrwqppgk nlmlnaaiyq inqtnvltsa lpsqdptgtk 







601
svqtgevrsr gielsatgkv trnlsviasy vyqdvknvqa ndvslnnwpv diprprqmas 







661
lwtdwtwytg plagfglggg iryqsasaga adnsltvssv tlfdaglhyd vrnwrfavng 







721
tnlfnrhyis gcqsnnvcif gtdrtviata kynw 





702
55
107025374
YP_622885
166
1
atgaataact tgcacaacac gaacggcctg atgcgcttcg cgaaggtggc ggccgcgagc 







61
accctgctcg cgacgctgct cgccgcgtgc gcggtcggcc ccgattacaa gcgtccggac 







121
gcggcggcgc ccgccgcgtt caaggaagcg ccgaccctcg ccgccggcga gcaggccggc 







181
acgtggaaga cggccgagcc ggcggacggc gaacatcgcg gcgaatggtg gaaggtgttc 







241
ggcgacccgg tgctcgattc gctcgagacg caggcgctcg ccgcgaacca gaacctgaag 







301
gccgcggccg cgcgggtcga ggaagcgcgt gcggcgaccc ggtcggcacg ctcgcaatgg 







361
ttcccgcagg tcggcgctgg tttcgggccg acgcgcgagg ggctgtcgtc ggcgtcgcag 







421
ttccagccgc agggcaccgg cccgaccaac gcgacgctgt ggcgcgcgca gggcacggtg 







481
tcgtacgaag ccgacctgtt cggccgcgtc ggccgcaacg tcgaggcgtc gcgcgccgac 







541
caggcgcaga gcgaagcgct gttccgctcg gtgcagctcg cgttgcaggc cgacgtcgcg 







601
cagaactact tcgaattgcg tcagctcgat tcggatcagg acctgtaccg ccgcacggtg 







661
gagttgcgcg agcaggcgct gaagctcgtg cagcgccgct tcaacgaagg cgacatcagc 







721
gagctcgacg tgtcgcgcgc gaagaacgaa ctggcgagcg cgcaggccga tgcggtcggc 







781
gtcgcgcgcc ggcgcgcggc gtccgagcat gcgctggcca ttctgctcgg caaggcgcct 







841
gcggatttcg cgttcaagga aacgccgatc gtgccggtcg cggtgaagat tccgccgggc 







901
ctgccgtccg cgctgctcga acgccgcccg gacgtgtcgg cggccgagcg cgcgatggcg 







961
gccgcgaacg cgcggatcgg gctcgcgaag tcggcgtact tcccgaagct cgatatcacc 







1021
gggtcgttcg ggtatgaagc gtcgacgctc ggcaacctgt tcctgtggtc gagccgcacg 







1081
ttcctgctcg ggccgttcgc cggcaccgcg ctgacgctgc cgctgttcga cggcgggcgg 







1141
cgggctgctg gtgtgcagca ggcgcgcgcg cagtacgacg agcaggtcgc gaactaccgg 







1201
cagcaggtgc tcgtcgcgtt ccgcgaggtc gaggacaatc ttgctgatct gcgtttgctc 







1261
gatgatcaga tccgcgcgca ggatgcggcc gtcaacgcgt cgcgtcgggc ggcgacgttg 







1321
tcgcgcacgc aatatcagga aggtgaggtc gcgtatctcg acgtgatcga cagtgagcgg 







1381
tcggtgttgc agtcgcagtt gcaggcgaac cagttgacgg gggcgcaggc ggtgtcgacc 







1441
gtcaacctga tccgcgcgct gggcggcggg tgggggaatg cgccggcgcc gacggcggtg 







1501
ggtgatgcgg cgtctggcaa ggctgacgta gctgcgcggt aa 









142
1
mnnlhntngl mrfakvaaas tllatllaac avgpdykrpd aaapaafkea ptlaageqag 







61
twktaepadg ehrgewwkvf gdpvldslet qalaanqnlk aaaarveear aatrsarsqw 







121
fpqvgagfgp treglssasq fqpqgtgptn atlwraqgtv syeadlfgry grnveasrad 







181
qaqsealfrs vqlalqadva qnyfelrqld sdqdlyrrtv elreqalklv qrrfnegdis 







241
eldvsrakne lasaqadavg varrraaseh alaillgkap adfafketpi vpvavkippg 







301
lpsallerrp dvsaaerama aanariglak sayfpkldit gsfgyeastl gnlflwssrt 







361
fllgpfagta ltlplfdggr raagvqqara qydeqvanyr qqvlvafrev ednladlrll 







421
ddqiraqdaa vnasrraatl srtqyqegev ayldvidser svlqsqlqan qltgaqavst 







481
vnliralggg wgnapaptav gdaasgkadv aar 





703
84
107025915
YP_623426
167
1
tcagtagcgc gcgttcagcg tgacgaacgc cgaacggccg ggcgcgatcg acgcatagtg 







61
cgccggatac gcctgatcga agtaggtgcg gttgaacagg ttgttgacgt tcagttgcag 







121
gtcgagcttc ttgttgatcc ggtactgcgc catcgcgtcg aagcgccagt acgacggcac 







181
ggcgcgcagg ttcgcgggat cgccgaatac ttccgacata tagaacgcgc cgccgccgac 







241
cgtgaacttc ggcgtgacgt cgtagttcga ccacatcgtc aggctgtgct tcggcgtgtt 







301
cgggaagcgg ttgccgttgt tcgcggcgtc cttgccgttg tcgcgcagtt ggctcttcat 







361
gtacgtgtag ccgccgaaca cctgccattg cttcgtgatc tggccggcga ggccgagctc 







421
gagaccctgc acgcgcttgt tgctgaccat cgcgtactgg ttgttcggca gcgtcacgcg 







481
tgcgttggtc gtgtcgatct ggaacagcgc ggccgtcagc gacagcttgt cgttcagcac 







541
gttccacttc gtgccgagct cgatgctgcg gtttttttcc ggcgaaagct gatcggcgtt 







601
cgggccgacg ccgccgcggc ccggcgtgag cgactgcgtt tcgctgcctt cgccgagcat 







661
catgccggcg ggcgtcgacg aggtcgcata cgacgcgtag atgctgccgt tctgcgcggg 







721
cttgaacacg aggccggcct gccagttgaa gagcgtgtcg tcgcgcgtat aggtcttgcc 







781
gccattcgcc ttggtatcgg tgaagcgggt cgagtagtcg tcgacgcgca cgccggcgtt 







841
gacttgccag cgcggcgtga tctcgatcgt gtcgaagccg tagatcgact tcgtcgtggt 







901
gcgcgcatgc gcgtagtcgt tgttgcgcgt gatcgaaccg gcccacggat cgttcgggtt 







961
cggcgaccac aggctcgtgc agttgtagcc cgacgcggcg ccgatgcctt tctggcagat 







1021
cgtgcccttg tcggtcgcga ccgtgtacga atcgcgcttg ccccattcgc gcgacagctc 







1081
gataccggtc gtgaagctgt gcttgaacgg gcccgtgcgg aattcgccga acagctcggt 







1141
caggttcgcg aggctgttga tcgagctgtt gcggttgttg ttgcggcgcc agaccttgcc 







1201
gttcaccacg ttgccctggc tgtcgtccgg ctgcgtccag atgtagtcct gtgtcgattc 







1261
cgtgtagcgc gtggtgttgc gcaccgtgag gttcggcgtg atgtcgtgct cgatcttgat 







1321
cgtgctgatg tccgacgtgg tcttgcggaa gtcgcggtcg atcaggccgt agaagttgtg 







1381
gcgatcgacg ggcgccggat agatcgtgcc gacgttcgcg ggcttgttcg acgtcgtgta 







1441
gaagtacggg atgccgccgt cgggcatgtc gtccgtctgc aggtggtagt agctcgcggt 







1501
cacgcgcgtc ggcgtgccga ggccgaacgc gatcgacggc gcgacgcccc agcgctcgtt 







1561
gttgacggca tcgcgaccgg cgacgtcgtt gttgtggctc atcaggttca ggcggaacgc 







1621
ggcgtgctcg gcgaactgcc agttgccgtc ggcggtgaag cggcgatagc ggtcggtgcc 







1681
gaggcccgcg ctcgcggcgg ccgtcgtgcc gaggtgcggg gccttcgtga tcaggttgat 







1741
gctgccgccg gcgccgccgc ggccgccgta cgcgccgtcg gaacccttcg tgatctcgac 







1801
gcgttcggta ttgaagatct cgcgcgtggt ggcgcccgtg tcgcgcatgc cgtcgacgaa 







1861
catgctgccc tgcgtgtcgt agccgcgaat gaacggacgg tcgccgagcg ggttgccgcc 







1921
ttcgccggcg ccgaacgtga tgccgggcac ggtgcgcagc gcttcggtca gcgtcgccgc 







1981
gccgctgctc tggatcagtt cctgcggaat cacggtgacg gatttcggcg tgtcgacgag 







2041
cggcgcggtg aatttcgcgg aagcggaaaa gtcggccttg tagctgtgct cggtcttgcc 







2101
ctggatctcg atcggcgcga ggtggccttc ggtgctggcg ggcgccgcgg gcggcgtacc 







2161
gtcggcgaac gcggggctcg cggcgagcac gctgcacaag gtggtgaact taccgagctt 







2221
cagctcgtcc ggacgggatt tcat 









143
1
mksrpdelkl gkfttlcsvl aaspafadgt ppaapasteg hlapieiqgk tehsykadfs 







61
asakftaplv dtpksvtvip qeliqssgaa tltealrtvp gitfgagegg nplgdrpfir 







121
gydtqgsmfv dgmrdtgatt reifnterve itkgsdgayg grggaggsin litkaphlgt 







181
taaasaglgt dryrrftadg nwqfaehaaf rlnlmshnnd vagrdavnne rwgvapsiaf 







241
glgtptrvta syyhlqtddm pdggipyfyt tsnkpanvgt iypapvdrhn fyglidrdfr 







301
kttsdistik iehditpnlt vrnttrytes tqdyiwtqpd dsqgnvvngk vwrrnnnrns 







361
sinslanite lfgefrtgpf khsfttgiel srewgkrdsy tvatdkgtic qkgigaasgy 







421
nctslwspnp ndpwagsitr nndyahartt tksiygfdti eitprwqvna gvrvddystr 







481
ftdtkanggk tytrddtlfn wqaglvfkpa qngsiyasya tsstpagmml gegsetqslt 







541
pgrggvgpna dqlspeknrs ielgtkwnvl ndklsltaal fqidttnarv tlpnnqyamv 







601
snkrvgglel glaggitkqw qvfggytymk sqlrdngkda anngnrfpnt pkhsltmwsn 







661
ydvtpkftvg ggafymsevf gdpanlravp sywrfdamaq yrinkkldlq lnvnnlfnrt 







721
yfdqaypahy asiapgrsaf vtlnary 





704
83
107027766
YP_625277
168
1
tcagaaatcc actttcatgc tgacggcgac cgtgcggccg ggcgacgaat atgcgtcgag 







61
cacctgcgaa tcggccgcga tgccgcgcac gtccgaccag ttccagtact tgcggtcgaa 







121
caggttgcga atgccgatcg tcgcgctcac gtgcttgttg aagcgatagc cgccgcgcag 







181
gtcgacgacg aacgacgacg gcggcgtgaa gcacgccttg ttcgaacagt cggacttgtc 







241
gacgtccttg tcgcgtttgg cggcctggaa cagcaggtcg gtctgcacga accagcgttc 







301
ggtcggctcg taccgcacgc cgaacacggc cgagaacggg ttcacggtat tgagcggctg 







361
gctcgccgcg ccgtcgttct gcgtcgagcc cttcgtgaac gccatcgccg tcttcagcgt 







421
gatgccgttc ggcatcaccc attcggcgcg gccttcgagg ccgtggatgc gcgcgtcggc 







481
gaagttcacg tactggaaca cgaacgggtc gaccggccgg ccgctgccgg cgagcgtcgt 







541
acgcgagatg aagttgcggt agcggccggt gaaggcggcc gcgctgtagc gcacgacgcc 







601
gtagccggtg ccggccttgc cgcgcaggcc ggcttcgaac gtgtcgctgg tttcgggctt 







661
caggttcgga ttgccgatcg acgtgtagcc gtacaccggg ttcgagaagc tgctgttcac 







721
ctggtcgggc gtcggcgcgc ggaagccgtg cgcgtactgc acgtacggaa tgaccgctgg 







781
agtgatttcg tagagcacgg cgacgcgcgg cgacagctcg ttggcgctcg tcgacacggc 







841
cttgccggtg aacagcggat cgttttcggt cgggctcagc cggtacgtgt cgaagcgcag 







901
gcccggcgtg acgagcaggc ggccgtagcc gatctggtcc tgcacgaacg cgccgaacag 







961
cgtgtagtcg gtgtcgggaa acgccttgtt cgggaacgct tcgccgacgc ccggcaccgt 







1021
gccgtcgcgc aggttcgtca cgcgcgacag gctgccgtcg acgccgtaca acagcttgtg 







1081
cgcgagcggg ccggtcgaaa agccgctttc ggcgaacgcg gcgccgccga acgtgcgttc 







1141
cttgtactgg ttgtcgcgcg aacgcgaggg tagcctgccg cgcgtctcga acgcgtactg 







1201
gtcctgcttc gcgtcctggt agtagaactg cacgtgcgcg gtctggaacc agcggaacgc 







1261
gtcgtcgcga aagtcgtagt cgacgctgaa gcggttgcgc tcgagccggt cgtgggtcgt 







1321
gaggccgagc gtggccggcg cattgatcgc cgacagcacg tcggtgctca cgcgccgctg 







1381
caccgtttcg gcggtgaact tgatcgtgtc gcgtgtggtg ggcgtcagca cgagcttgcc 







1441
gagcagcgat tccgaataca cgtcctgcgg attcgacgtc gtgcgcagcg tgctcgccga 







1501
attgttgtcg ccgcgcgtgt cgacctcgtg gccgcgccgg ccgtcggcga tgatcatccc 







1561
ctggatgcga tcgttgccgc ccgcggccga tacggtcgcg ccgatgctgc ggtcggtcga 







1621
gtcgtagctc ggccggaacg agaagtagta aggcttgtgg tagatcgaca gcaggtcgcg 







1681
cgggtccttc gtgatgaagt tcaccgcgcc ggtcaggccg tcgctgccgt acagcgccga 







1741
cgccggcccg cgcaggatct cgatgcgctt gagcgtgtcg agatcggcgt agtcgccgcg 







1801
gcccgcttcg agcggaccga acgagaacgc gttcggcagg cggatgccgt cttccatcag 







1861
cagcacgcga ttgccttcga ggccgcggat gttgatgctc gaatcgccgt cgcggccgcc 







1921
gccgagcgcg gcgctgccgg gccggtacgc ggtgcggcgc accgtgatgc ccggctcgta 







1981
gcgcagcgcg tccttgatat tggtggcctg ttgctcttcg agatcctcgt cggtgatcac 







2041
cgacacggac gccgccgtgc ggctcgccgc ggtggcggtg cgggtggccg tgacggtgac 







2101
gggatcgagc agcgccgcct cgccgcgcgc ggatgcggcg gtcaggacgg cggaaggcgg 







2161
gggggcgcct tgcggcgacg aatcggcgtg agccggggcg gcggacaggc cgaacgcgcc 







2221
gaacagcgcg gcacagatcg gccgccgcgc cagcgtatag caatgcac 









144
1
mhcytlarrp icaalfgafg lsaapahads spqgapppsa vltaasarge aalldpvtvt 







61
atrtataasr taasysvitd edleeqqatn ikdalryepg itvrrtayrp gsaalgggrd 







121
gdssinirgl egnrvllmed girlpnafsf gpleagrgdy adldtlkrie ilrgpasaly 







181
gsdgltgavn fitkdprdll siyhkpyyfs frpsydstdr sigatvsaag gndriqgmii 







241
adgrrghevd trgdnnsast lrttsnpqdv ysesllgklv ltpttrdtik ftaetvqrrv 







301
stdvlsaina patlgltthd rlernrfsvd ydfrddafrw fqtahvqfyy qdakqdqyaf 







361
etrgrlpsrs rdnqykertf ggaafaesgf stgplahkll ygvdgslsrv tnlrdgtvpg 







421
vgeafpnkaf pdtdytlfga fvqdqigygr llvtpglrfd tyrlsptend plftgkavst 







481
sanelsprva vlyeitpavi pyvqyahgfr aptpdqvnss fsnpvygyts ignpnlkpet 







541
sdtfeaglrg kagtgygvvr ysaaaftgry rnfisrttla gsgrpvdpfv fqyvnfadar 







601
ihglegraew vmpngitlkt amaftkgstq ndgaasqpln tvnpfsavfg vryepterwf 







661
vqtdllfqaa krdkdvdksd csnkacftpp ssfvvdlrgg yrfnkhvsat igirnlfdrk 







721
ywnwsdvrgi aadsqvlday sspgrtvavs mkvdf 





705
81
107025731
YP_623242
169
1
atgcggtacg aggtacggga acctgagaga tcgatgatgc agggcgcacg cgtcgcccgt 







61
ttcgggcgct ggcgtccgtc gctgtacctc ggcctgctga tcgccgcgca tccggcggcg 







121
gcgttcgccg atgcggcgcc ggcggccggc gcggaagtcc gccgggacaa agcggccgcc 







181
gcgcagcctg ccgggagcga gttgaaggcg attgcggtga atgcgtcgcg cggcgtcgcc 







241
gacgatccgt ccgtggcgac ggtcggcaag atgccgctcg cgttgcgcga gattccgcag 







301
tcggtcagcg tgacgacgcg cgagcggatc gaccagcaga acctgttctc gctcgacgag 







361
gtgatgcagc agtcggccgg cgtgaccgtc cagccgtacg tgctgctcac cactgcgtac 







421
ttcgtgcgcg gcttcaaggt cgattcgttc gaattcgacg gcgtgccggt cgtgatcggc 







481
gacatggcga gcgcgccgca ggacatctcg gtgtacgaac gcgtcgaaat cctgcgcggc 







541
gcgaacgggc tgctgcacgg ctcgggcaat ccggccgcga ccgtcaacct cgtgcgcaag 







601
cgcccgcaat accagttctc cgcgcatgcg accgccagcg tcggcagctg ggaccgctat 







661
cgcgcggaag ccgacatcgg cggcccgctc aatgcggccg gcaccgtgcg cagccggctc 







721
gtcgccgcgt acgaggatcg ccacttcttc tacgaccacg cgaagcagga cacgcgttcg 







781
atctacagcg tcacggaagt cgacgtgacg cgcgacacgc tgctcacgtt cggcgcgcag 







841
taccagacca cgacatcggt gcccgacatg tccggcgtgc cgatggcgcg cgacgggtcc 







901
agcctcggcc tgtcgcgctc gacgttcctc gacaccgcgt ggggccgttt caactgggac 







961
acgacgcgcg cgttcgcgtc gatcgagcag aagctcggcg ccggctggaa ggcgaaggtc 







1021
agcggcgaat accagagcgt gcgctcggac ctgaagtacg cgggttcgtt tggcgcgatc 







1081
gatccggcga cgggcgccgg cggccggctg accggcgccg cgtaccagtt cagcagctac 







1141
agccgcagca tcgatgcgaa cgtgcagggc cccgtgcatg cgttcgggct cacgcacgac 







1201
ctgctgttcg gcgtcaccta cgcgaacagc agcagcgggc agatgaccgc gccgctgctc 







1261
ggcgacgtgg ccggcacgcc ggtgaacgtg taccgctgga acccgagcag cgtgcccgag 







1321
cccggcatcg ggccgtacca gcagtcgcag cagaacgacg tttcgcagaa gggcgtctac 







1381
gggctgggcc gcatcaagct cgcggagccg ctgacgctcg tgctcggcgg ccggatgagc 







1441
tggtggaacc aggacagcct cggcgcgcac tacaacaccg gccaccagtt cacgccgtac 







1501
ggcgggctga tctgggattt cgcgcgcgac tggtcgtggt acgcgagcta tgcggaggtg 







1561
ttccagccgc agaccaagtc gatgtggggc ggcggcatcc tgacgccggt gaaggggcgc 







1621
acctacgaaa cgggcgtgaa gggcgaactg gcgggcggca agctcgacgt gtcgctcgcg 







1681
gcgttccgca tcgatctcga caacaacccg caggtcgacc tcgcgcatcc gtgcgcgggg 







1741
ccgagctgct actacgtgaa cggcggcagc gtgcgcagcc agggcttcga gttcgaggcg 







1801
aacgggcgca tcacgccgtg gtggagcgtg tgggcgagct acacgtacga cacgatgcgt 







1861
tatgcggaca acctcgcgaa cgcgggttcg ttcgcgccgc tgctgaatcc gcgccacctg 







1921
ttccggctgt ggaccaacta cgacctgccg tggcaggagc ggcgctggag catcggcggc 







1981
ggcgtgcagg tgcagagcag ctactcggcg caggcgaacg gcgtcacgat gagccagggc 







2041
ggctacgcgc tcgcgagcgt gcggctcggc taccgctacg acaagcactg gagcgccgcg 







2101
ctcaacgtca acaacctgtt cgaccgcacc tactacctga gcctgagcca gccgggctgg 







2161
aacaaccgtt acggagaacc gcgcaacgtg atgctgaccg tgcgcgggca attctga 









145
1
mryevreper smmqgarvar fgrwrpslyl glliaahpaa afadaapaag aevrrdkaaa 







61
aqpagselka iavnasrgva ddpsvatvgk mplalreipq sysvttreri dqqnlfslde 







121
vmqqsagvtv qpyvllttay fvrgfkvdsf efdgvpvvig dmasapqdis vyerveilrg 







181
angllhgsgn paatvnlvrk rpqyqfsaha tasvgswdry raeadiggpl naagtvrsrl 







241
vaayedrhff ydhakqdtrs iysvtevdvt rdtlltfgaq yqtttsvpdm sgvpmardgs 







301
slglsrstfl dtawgrfnwd ttrafasieq klgagwkakv sgeyqsvrsd lkyagsfgai 







361
dpatgaggrl tgaayqfssy srsidanvqg pvhafglthd llfgvtyans ssgqmtapll 







421
gdvagtpvnv yrwnpssvpe pgigpyqqsq qndvsqkgvy glgriklaep ltlvlggrms 







481
wwnqdslgah yntghqftpy ggliwdfard wswyasyaev fqpqtksmwg ggiltpvkgr 







541
tyetgvkgel aggkldvsla afridldnnp qvdlahpcag pscyyvnggs vrsqgfefea 







601
ngritpwwsv wasytydtmr yadnlanags fapllnprhl frlwtnydlp wqerrwsigg 







661
gvqvqssysa qangvtmsqg gyalasvrlg yrydkhwsaa lnvnnlfdrt yylslsqpgw 







721
nnrygeprnv mltvrgqf 





706
88
107022421
YP_620748
170
1
atggccacca ccggcgcgct ggccgccgaa gctgcggcgt cggcgccgga tgaccggcac 







61
gcgctgccgg cgatcaacgt caccgcgtcg tccgcgccgg ccgatccgct gacgcaaccg 







121
ctcgaaacgg gctcgcggct cgggcttgcc agtctcgaca cgcccgcgag cgtcgaaacc 







181
gtcacggccg acacgatcga cgcacgcggc gatcgcacgg tgctcgacgc ggtcacgcgc 







241
acggccggct ttgcgagcgc gatcgcaccg ggcaccggcg gcaccgcact gagcgtgcgc 







301
ggcttcagcg ggcaggaatc ggtgatgacg ctgcttgacg gcgtgcggct gatgccggcg 







361
gccggcacga tcacgtttcc gttcgacacg tggtcggtcg cgcgcatcga ggtgctgcgc 







421
ggcccggcat cggtgctgta cggcgaaggg gcgatcggcg gagtcgtgaa cgtggtgccg 







481
aagcggccgc agcgcacgcg cgagacgacg ctgcaggcgg gcgtcggccc cgacggcgcg 







541
aagcgcttcg cgttcgatac gacgggcgcg ctcggcccgc ggctgtcgta ccgcttctac 







601
gcaagcgatg cgcgtgcgaa cggtctcgcc gagcgggccg acacccatac gacggccatc 







661
ggcggcgcgc tgacattcga cgtgagcccg cagctcacgc tgacgctcga ctacgactac 







721
gggcgccaga tgccggcgac ctactatggc gtgccggcgc cgcacggcgt gctcgatccg 







781
tcgctgcgca agctgaacta cacggtcggc gacgcgacga tttcgtacta cgaccagtgg 







841
acgcgcctgt cggccagcta ccggcccgcg cccggcgtga cgatcagcaa ccagctctac 







901
tacctgacat cgaaccgtca ctggcgtaac gccgagtcat atgtgctcga cctcgcgacg 







961
ggacgcgtca cgcgcggcga ctatctcgac atcggccatc atcagcggca gatcggcgat 







1021
cggctgagcg cgcgtttcga cgggatgctg ttcggtcgcg cgaaccgttt cgtcgtcggc 







1081
acggaattca gccagatcac gttcagcggt accaacaatt cgccgtatgg tggcgaaacg 







1141
acggtgcccg cgcacggctt cgatcctggt gtgttcacga gcccggaccc gaccgtgccg 







1201
caattcagca cgcgcgcacg gcaggcggcg gtattcgcgg agaaccggct cgaggtgctg 







1261
ccgcggctcg cgtgggtgag cgggctgcgc tacgaccaca tcgcgttcag tcgcgagcag 







1321
gcggcgaccg gcgcgggctt cgacaagcgg ttcgcgaacg tcggctggcg caccggtgtc 







1381
gtgttcgata tcgcgccggc gttcaccgcc tatgcgcagt acacgacggg cgcggagggc 







1441
gtcggctcgc tcgtgacgct gtccgcgtcg cagatgaacg tccggctcgc gaccggcgaa 







1501
cagtgggaag ccgggctcaa gcagacgctg ctcgacgggc gcgcgtactg gacggtcgcg 







1561
gtgtacgaca tcacgaagcg caacctgctc agcaccgatc cgttcaatcc ggcgctgcgc 







1621
cagcaggtcg gccggcagtc gtcgcgcggc gtcgatctga ccggcggcgc gcggctgtcg 







1681
cacggctgga cgatcgatgc gaacgtcgcg ttgctgcgtg cgcgctacga cgcattcaac 







1741
cagacggtcg gcggcgcgac ggtgtcgcgg gccggcaacg tgccgtcggg cgtgccgcag 







1801
cagaccgcga acctgtggct cggctgggca ttcgccgagc gctggcacgt gaatgcgggc 







1861
gtgcgttacg tcggcgcgac ctacggcgac gatgcgaatc gcgtgcgggt gccgtcgtat 







1921
acggtgttcg atgcgtcgct gcgctggcag ccgacgtcgc gcaccgagct cgcgctgtat 







1981
ctgcgcaatc tcgcgaaccg cacgtatgcg gtaacgacgt cgaacggcgg cgagcaatgg 







2041
ctgctcggcc cgtcgcgctc ggcggagctc gtcgcgacga tgcgcttcta g 









146
1
mattgalaae aaasapddrh alpainvtas sapadpltqp letgsrlgla sldtpasvet 







61
vtadtidarg drtvldavtr tagfasaiap gtggtalsvr gfsgqesvmt lldgvrlmpa 







121
agtitfpfdt wsvarievlr gpasvlygeg aiggvvnvvp krpqrtrett lqagvgpdga 







181
krfafdttga lgprlsyrfy asdarangla eradthttai ggaltfdvsp qltltldydy 







241
grqmpatyyg vpaphgvldp slrklnytvg datisyydqw trlsasyrpa pgvtisnqly 







301
yltsnrhwrn aesyvldlat grvtrgdyld ighhqrqigd rlsarfdgml fgranrfvvg 







361
tefsqitfsg tnnspygget tvpahgfdpg vftspdptvp qfstrarqaa vfaenrlevl 







421
prlawvsglr ydhiafsreq aatgagfdkr fanvgwrtgv vfdiapafta yaqyttgaeg 







481
vgslvtlsas qmnvrlatge qweaglkqtl ldgraywtva vyditkrnll stdpfnpalr 







541
qqvgrqssrg vdltggarls hgwtidanva llrarydafn qtvggatvsr agnvpsgvpq 







601
qtanlwlgwa faerwhvnag vryvgatygd danrvrvpsy tvfdaslrwq ptsrtelaly 







661
lrnlanrtya vttsnggeqw llgpsrsael vatmrf 





707
81
107026796
YP_624307
171
1
ctacagcttc gcgctcacgc gcacccacgc ggtacgcccc ggctccatca ccggcgcgtt 







61
cgccggatag ccgaaacccg cgttgccggc gaggttcaga tgctcggtgt aggccttgtt 







121
cagtacgttg tcgacgccga ccgagatttg cacggtcttg ctgacgttgt attgcgtatg 







181
cagcgacagc acgccgaatc cggcgctcgg gccgaagtcc ttgccgacca cgttgccctc 







241
gttcagcgcg tagcgatgct gcggcgcaac gatgcgccac aggccgcccg ccgaccacgc 







301
gccgcgcgtg tattcgaggc cgatgcgcgc ctcgagcggc ggcatctgcg gcagcggatc 







361
gccgctcgcc acgttgcgcc cccacgcata cgcgagcgac gtctcgacgc gcagcggcgc 







421
gaccggccgc cacgacacgc cggcttcgcc gcccatgatc tgcgcgttga cgttggttgc 







481
ctgcgtgatc gggcccatca tgccggccgc atagttgaac aggatgaagt cctggacgta 







541
gcccgcgtac gccgacaccc atgcatcgaa ccggtcgctc ttgtattgcg cgccgatatc 







601
gagctgcgtg gtcttctccg gctgcaccgc cgagaacgcg ttgatcgaac cggccggccc 







661
gcgcgtcgcg gagaacagtt cccagtagtc cggataacgc tccgcgtggc cgatccccgc 







721
gtaccacgtg acgggtagcg acgcgagatc gcgctcgtag cgcacgaagc cgctcggcag 







781
caccttcgtc cgatcgtcgt cgaaagtcgg gttcggcttg ctcatcatca tgccgctctt 







841
catcgcgcgc ttgtcgcgtg cgctcgcata gtcgacgcgc gcgccgccga tcacacgcga 







901
gacgtcgctc gcataccacg tcagctcgct gaacacgccc gcgctccaca tggtcgcctg 







961
ggcatcccac ggctggtcgc ggtagttctg ctgccccatc gacgcacgcg aatcgagccg 







1021
gttcgattgc gcatcgacgc ccgtgacgag cttgaaatcg tcgccgaagc ggaacgtcgc 







1081
ggcggcacgc gcgccgaccg tgcggcgccg cacgtccgcg gccatgcgca tcggcatgct 







1141
gctcgtcggg tcgggctgcc gcaacgtgta gttgtccatc acgtgatcgg cttcgttgta 







1201
gtacacgcgc gcctcgatcc ggtcgagcac gtcgccgaga tgccgcttgt cgaacgacag 







1261
gccgaacgtc tcgcggcgga aatgcgcgcc atccatcccg cggcccgcgt agcgcgcata 







1321
gccgtcgccg gtgccggcgg tcagctcgag tcgcgtatgg tcgtcgggcg tccagccgag 







1381
cgctgcatcg gcgttccatt tgtcccactg cgacggcacg gtgttgccgt tgccgtcctt 







1441
gtagtcctgc gaatgcgcgt ggttcgcggt cacgcggcca tagatgtccg gcgtgcctgc 







1501
ggtcaggtcg atgttctggt cgttgcggcc gaacgacccg ccgacgagac tgccgtcgaa 







1561
gcgcatgccg gggcgctcga aacgcggcgt cacgcgctcg aacagcacgg tgccggccga 







1621
cgcacccgga ccgtacagca cggtctgcgg ccccttcacg acggtgacct tgtcatagct 







1681
ttccggcgcg atgtacgagg tcggcgcatc catccggttc gggcacgcgc cgagcgtcgg 







1741
cattccgttc gccaggatgt tcagccgcga accgaacatc ccgcgcagca cggggtcgcc 







1801
gttcgtaccg ccgctacgga tcgacgtgaa gccgggaatc gtcttcaggt aatcggcgcc 







1861
gtcgctggcg ggcagcggct ggcgcggcgc cttcggatcg gtgacgacga cgagcggcgt 







1921
cgacagcggc gacgcgacga cttcgacggg cggcaacagc atcgccgcat cgcccgacgg 







1981
cgcaccggcc gccggcgtgg tttcggctgc ggcagcgctc gcggccagcg cgccgacagc 







2041
cagcgcagga acggtgagtt tcagtacgcg cggcagaggc cgcgcaccgg cattgcgtgg 







2101
cgcccgcggc gcacgcaaca ggaaattggt at 









147
1
mtnfllrapr aprnagarpl prvlkltvpa lavgalaasa aaaettpaag apsgdaamll 







61
ppvevvaspl stplvvvtdp kaprqplpas dgadylktip gftsirsggt ngdpvlrgmf 







121
gsrinilang mptlgacpnr mdaptsyiap esydkvtvvk gpqtvlygpg asagtvlfer 







181
vtprferpgm rfdgslvggs fgrndqnidl tagtpdiygr vtanhahsqd ykdgngntvp 







241
sqwdkwnada algwtpddht rleltagtgd gyaryagrgm dgahfrretf glsfdkrhlg 







301
dvldriearv yyneadhvmd nytlrqpdpt ssmpmrmaad vrrrtvgara aatfrfgddf 







361
klvtgvdaqs nrldsrasmg qqnyrdqpwd aqatmwsagv fseltwyasd vsrviggarv 







421
dyasardkra mksgmmmskp nptfdddrtk vlpsgfvrye rdlaslpvtw yagighaery 







481
pdywelfsat rgpagsinaf savqpekttq ldigaqyksd rfdawvsaya gyvqdfilfn 







541
yaagmmgpit qatnvnaqim ggeagvswrp vaplrvetsl ayawgrnvas gdplpqmppl 







601
earigleytr gawsagglwr ivapqhryal negnvvgkdf gpsagfgvls lhtqynvskt 







661
vqisvgvdnv lnkaytehln lagnagfgyp anapvmepgr tawvrvsakl 





708
81
107022434
YP_620761
172
1
atgggtaatg aaagcagccc tagcccatcg tccaccgact ctccgatcat ggcttccatc 







61
cgtctcacgt atcagcgccg tccggcgccc gcccgtcatc tgccgcgcgc gtcgcgcccg 







121
gccgcgtcgg gccgtcttgc ccgccggctg gcgggtgccg tcctgctgtc ggcactgctg 







181
ccgctgcccg cgctggccga caccgacgcc gatgcggcgc ccgccacgca gcgcgcgtcg 







241
cgtcgggcgt tcgacattcc ggccggcccg ctcgaagcgg cgctgaaccg gttcggccgc 







301
gacgcgggca tcctgctcgc atttccgccc gagttgacgg ccggtctcgc gagcggcggc 







361
gtgcacggcc ggttcgacgt cgacggcgcg ttcgaccgcc tgctggccgg cacggggctc 







421
gtcgcgctgc gccagccggg cggtggctac acgctgatgc gcgccgatgg ctcggcggcc 







481
ggccctgtgg cggccggcgt cgcgccagcc gccgagttgc cgacgatcga cgtgcgctcc 







541
agcgccttgc gcgccgaaag ctatcgcgcg ccgaaggaag cgggcgtgct gcgctccgac 







601
attccgctgc tcgacaccgc gcaggccgtc aacatcgtgc ccgcgcaggt gctgcgcgac 







661
cagcggccgc gcaatctcga cgatgcgctc ggcaacgtga gcggcatcac gcagggcaac 







721
acgctcgcgg gcacgcagga cacgatcatg aagcgcggct tcggcggcaa ccgcgacggc 







781
tcgatcatgc agaacggcat gccgctcgtg cagggccgcg cgttcaacgc ggcgaccgac 







841
agcgtcgagg tgctgaaggg gccgacgtcg ttgctgtacg ggctgatgga cccgggcggc 







901
gtcgtcaacg tcgtcaccaa gcagccgcag ctcgtgcgct acaacgtgat ctcgctcggc 







961
gcgtcgacgt tcgggcacgg caagaacggc ggcagcgcga cgttcgactc gaccgggccg 







1021
gtcggcgatt cgcggctcgc gtaccggctg atcgtcgacc agtcgaacga acagtactgg 







1081
cgcaacttcg gcgagtaccg gcagaccttc gtcgcgccgt cgctcgcgtg gtacggccgc 







1141
gatacgcagg tcgcggtgtc gtaccagtac cgcaagttcc attcgccgtt cgatcgcggc 







1201
accgcgctcg acccgcgcac caatgcgccg ctcgacatcc ccgcacggcg gcgcatcgac 







1261
gagccgttca acaacatgga cggcgaatcg cagctcgcgc aactgagcgt cgatcaccag 







1321
ttcaacgcgg actggagcgc gcatttcggc tacagctaca accgcgagac ctacgacgcg 







1381
aaccagttgc gcacgaccgg cgtcgaccct gtgaagggca cgatgacgcg cagcaacgat 







1441
gcgacgcacg gctcgctcag caccgacagc tacgggatcg gctacgtgac cggcaagctg 







1501
acgctcgcgg ggatgcggca cgacgtgcag gtcggcttcg ataccgaata ccgccgcatc 







1561
tatcgcaagg acctgctgcg tcaggcggtg aagacgccgt tcagttatat cgatccggtg 







1621
tacgggctgc tgccgccgtc gagcacggtg tccgcgagcg acagcgacca gaccgacacg 







1681
ctgcacgatg catccgcgtt cttccaggat acggtccacc tgaccgacaa gtggatcgtg 







1741
tcgggcggcc tccgctacat cacgtacaac caggtggcgg ggcgcggccg gccgttcgtc 







1801
gcgaacaccg atctcagcgg ctcgaagtgg ctgccgcgcg cgggcgtcgt ctacaagtgg 







1861
accgatgcgt tctcgctgta cggcagctat tcgcagtcgc tgaagccgtc gtcgtcgatc 







1921
gcgccgatga cgggctacat catcgacggc gcgacgccgc ccgaggaagc gaccgcatgg 







1981
gaagtgggcg gcaagctcgg cttggccggc gggatgaccg gcacgctcgc gctgttcaac 







2041
atcgacaaga agaacgtgct cgtgtcgcag tacaacgacg cgacgaagct gaccgactgg 







2101
cgcacgtcgg gcaaggcgcg ctcgcgcggc gtcgagctcg acgtctccgg caagctcggc 







2161
gagcgcgtga acgtgatcgc gagctacgcg tacatcgacg cgaagacgac cgaggacccg 







2221
ctgtacgcgg gcaaccagct gtggaacgtc gcgcgtcaca ccgcgtcgct cgcggccgtc 







2281
tacgacttcg gcacggtggc gggcggcgac gatctgcgta tcggcgcgga cgtgcgttac 







2341
gtcggcgcgc ggcccggcga ttcggcgaac agcttcacgc tgccgtcgta cgtgctcgcc 







2401
gacgcgttcg cgacctacga cacgcggatc ggcaagcaga agctgtcgtt ccagctcaac 







2461
gtgaagaacc tgttcaaccg cacgtactac ccgtcgagcg cgaaccgcta cttcgtcgcg 







2521
gtcggcgacg cgcgccaggt gtcgctgctg accacgctgc agttctga 









148
1
mgnesspsps stdspimasi rltyqrrpap arhlprasrp aasgrlarrl agavllsall 







61
plpaladtda daapatqras rrafdipagp leaalnrfgr dagillafpp eltaglasgg 







121
vhgrfdvdga fdrllagtgl valrqpgggy tlmradgsaa gpvaagvapa aelptidvrs 







181
salraesyra pkeagvirsd iplldtaqav nivpaqvlrd qrprnlddal gnvsgitqgn 







241
tlagtqdtim krgfggnrdg simqngmplv qgrafnaatd svevlkgpts llyglmdpgg 







301
vvnvvtkqpq lvrynvislg astfghgkng gsatfdstgp vgdsrlayrl ivdqsneqyw 







361
rnfgeyrqtf vapslawygr dtqvavsyqy rkfhspfdrg taldprtnap ldiparrrid 







421
epfnnmdges qlaqlsvdhq fnadwsahfg ysynretyda nqlrttgvdp vkgtmtrsnd 







481
athgslstds ygigyvtgkl tlagmrhdvq vgfdteyrri yrkdllrqav ktpfsyidpv 







541
ygllppsstv sasdsdqtdt lhdasaffqd tvhltdkwiv sgglryityn qvagrgrpfv 







601
antdlsgskw lpragvvykw tdafslygsy sqslkpsssi apmtgyiidg atppeeataw 







661
evggklglag gmtgtlalfn idkknvlvsq yndatkltdw rtsgkarsrg veldvsgklg 







721
ervnviasya yidakttedp lyagnqlwnv arhtaslaav ydfgtvaggd dlrigadvry 







781
vgarpgdsan sftlpsyvla dafatydtri gkqklsfqln vknlfnrtyy pssanryfva 







841
vgdarqvsll ttlqf 





709
27
107023374
YP_621701
173
1
atgaacatga aaatcgcgac tcgcctgtcc gtattcgcac tcgccggcgc actgctggca 







61
ggctgcgcca cgcagcaagg caacaatacg gccgtcggta ccggcacggg cgcggcgctg 







121
ggcgcgggga tcggggcact ggccggtggc ggcaagggcg cggcgatcgg cgcgggcgtc 







181
ggcgcactgg tcggtggcgt gacgggttac aactggcagg cgatcaagaa caagctcgcg 







241
ccgtcggcag cgcagacggg cacgcaggtc accgagcagc cggacggctc gctgaagctg 







301
aacgtgccga gctcggtcac gttcgcgacg aaccagtacg cgatcacgcc ggcgttcacg 







361
ccgctgctga acgacctcgc gacgacgctg aaccagaacc cgcaagtcac ggcgtcgatc 







421
gtcggctaca cggacagcac gggctcgcag cagctgaacc agacgctgtc gcagaaccgc 







481
gcgcaaagcg tcgtgaacgc gctcgtgcag cgcggcgtga atggcggccg tctgtcggcg 







541
caaggcatgg gcccgtcgaa cccgatcgcg gacaacgcga ccgaagccgg ccgcgcacag 







601
aaccgccgcg tcgagatcta cctgcgcgcg ccgcaacagc atcagtaa 









149
1
mnmkiatrls vfalagalla gcatqqgnnt avgtgtgaal gagigalagg gkgaaigagv 







61
galvggvtgy nwqaiknkla psaaqtgtqv teqpdgslkl nvpssvtfat nqyaitpaft 







121
pllndlattl nqnpqvtasi vgytdstgsq qlnqtlsqnr aqsvvnalvq rgvnggrlsa 







181
qgmgpsnpia dnateagraq nrrveiylra pqqhq 





710
19
107021874
YP_620201
174
1
ttactgttgg tagacgaggt cggcgcgacg gttctgcgcc cacgatgctt cgtcgtgacc 







61
cgttgcctgc ggcttttcct tgccgaggct cacggcttcc atttgcgaat cgttcacgcc 







121
gagcagtgcc atcgcacggc ggacggcttc cgcacgcttc tggcccagcg cgaggttgta 







181
ctcgctcgtg ccgcgttcgt cggtgttgcc ctggatcagc acgtggcgct gcgggtggct 







241
cttcaggtac tgcgcgtgct gctgcatcag cggctggtac tcgtccttca ccgaatagct 







301
gtcgaagtcg aagtagatgc tgcgcttcgc gagcgggctg ttcggatcgt tcagcggatc 







361
gacgttcact tgcgcgacgt tgtcggcgct cggctgcgtg ctgactgcgc ccgcgttgtt 







421
tgccttgtcg tcgagcttca cgcccgactt gcacgctgcg agcgcgctga tcatcatcac 







481
ggccagggcc agacgagctt tattcgacat cat 









150
1
mmsnkarlal avmmisalaa cksgvklddk annagavstq psadnvaqvn vdplndpnsp 







61
lakrsiyfdf dsysvkdeyq plmqqhaqyl kshpqrhvli qgntdergts eynlalgqkr 







121
aeavrramal lgvndsqmea vslgkekpqa tghdeaswaq nrradlvyqq 





711
24
107022117
YP_620444
175
1
ttagtttgcc ggaacggtcg tcttctgcac ttgctgcgta ccaaccactt cgacttccac 







61
gcggcggtcc ggtgcgaggc aggcgatgag ttgcttgcgg ttcttctggt tgcagccagt 







121
cgtaaccggg ttgcgcttgc ccttgccttc cgtgtagatc ttgttggccg gcacaccctt 







181
gctgaccagg tacgacttca cggcttgcgc acggcgcagc gacagacggt cgttgtactt 







241
gtccgaaccg atacggtccg tgtagcccgt tgcgacgacc acttccgtgt tcatgccttc 







301
gatcttcgaa gccagttcgt ccagcttttg cttgcccagc ggcttgagcg ttgccttgtc 







361
gaagtcgaac agtgcgtcag cttgatacgt gatcttctgg ctcgtgatgg ccggagcgac 







421
cggagcgacc ggcggctgcg gtgcctgggc gaccagtgcg ccatcgcact tcgcgttggc 







481
ggtggccggc gtccagaacg catcgcgcca gcagagctcg ttcgtgccgt tcatccacac 







541
ccattcgccc gtgccgttca cccagttgtc attgacggct tgacgcgacg ccggcaccga 







601
ctgtgccgaa gcggatgcag ccataactgc ggtagctgca atgaacgcga gctttgaaag 







661
tttattcat 









151
1
mnklsklafi aatavmaasa saqsvpasrq avndnwvngt gewvwmngtn elcwrdafwt 







61
patanakcdg alvaqapqpp vapvapaits qkityqadal fdfdkatlkp lgkqkldela 







121
skiegmntev vvatgytdri gsdkyndrls lrraqavksy lvskgvpank iytegkgkrn 







181
pvttgcnqkn rkqliaclap drrvevevvg tqqvqkttvp an 





712
40
107022082
YP_620409
176
1
tcagaagcga tggcgcaggc cggcggtcac cgcgatctgc ttgttggtcg acgaagcgcc 







61
accgaggcca ttgacatagg cgccgatgcc caggccgtcc gtgctgactc gctgatacac 







121
gccttgcaca tacacgtcgg tccgcttgga cagcgcatag tcggtctgca ggttgaactg 







181
attccagccc ggatgacggc cgtccaggaa gccggcggtg tacgtgtacg acccggccag 







241
cgacaaggcc ggcgtgagcg cgtagcgcgc gttcacttcg tagttgttga agcgggcgaa 







301
cgtgccgtcg agacccaggc cgctcgacac gcccgaggcg cccgcggaga tgccgagcgc 







361
ccggttcaca cgcgtttgcg tgaatacgaa gccggccgtg gcggggccga acgtgtagtt 







421
caggccgccg ccgaacacgc gctgacgctt gccgacgaac gtgttgtcgc ctgccaccgc 







481
gccggcgctg ttggtgaccg cagctgccgt gagccccgag gtgtcattgt tcaactgcag 







541
gtagccggcg ccgatgttga agccggcgta ggtgtacgac acccccgcgc tgtatgcacg 







601
gttgttcgag aactggttgc tgttcgagaa gccgtacaac gcgccgaact tcaggccgcc 







661
gaaattgacg ctcgtatatt tgaccgagtt attgatccgg aacgaattgt tcaggttgtc 







721
gttgtcgaac gggtgggcaa attgagtgcc gccgaattgc gtgccggtca gcgacagcgg 







781
cccgatgtaa tcgacgacgc tatcgtattg gcgaccgagg gtgagggagc caaactggtc 







841
ttgcgacagg ccgacgaacg cctggcggcc gaactcgcgg ccattttggc ccagcgtgcc 







901
gttggcgatg ttgaagccgt tctccaacac gaagatggct ttcagtccgc cgccgagatc 







961
ttccgtgccg cgcaggcccc agcggctgcc gttgatctgg ccgctgcgct ccgaccatga 







1021
gctatgcccg ccctgattgt tcgtgtaggt gatgcctgca tcgatcaggc cgtacagcgt 







1081
aacgctgctt tgagcatggg ctgcggtaac gaatacaccc gacaa 









152
1
msgvfvtaah aqssvtlygl idagitytnn qgghsswser sgqingsrwg lrgtedlggg 







61
lkaifvleng fniangtlgq ngrefgrqaf vglsqdqfgs ltlgrqydsv vdyigplslt 







121
gtqfggtqfa hpfdndnlnn sfrinnsvky tsvnfgglkf galygfsnsn qfsnnraysa 







181
gvsytyagfn igagylqlnn dtsgltaaav tnsagavagd ntfvgkrqrv fggglnytfg 







241
patagfvftq trvnralgis agasgvssgl gldgtfarfn nyevnaryal tpalslagsy 







301
tytagfldgr hpgwnqfnlq tdyalskrtd vyvqgvyqrv stdglgigay vnglggasst 







361
nkqiavtagl rhrf 





713
44
107026115
YP_623626
177
1
atgaagcacc ccgtcatttt tgccgccgcg ctcgctgcct tcgccgcgcc ggcctttgcc 







61
gagaccagcg tgacgctgta cggcgtgatc gacgaaggct tcaattacac gaacaacgtc 







121
aacgtcaatg gggtcggaaa gaccaattac cagctcgcga gcggctatgc gcagggcagc 







181
cgctggggcc tgaagggcac cgaggatctc ggcggcgggc tgaaggcggt cttcacgctg 







241
gaaaacggtt tcgacgtgaa taacggccgg ctcggccagg gcggccgcat gttcggccgt 







301
caggcgttcg tcgggctgtc cgagtcgcgc ttcggcacgc tgaccttcgg ccgccagtac 







361
gatgcggtcg tcgactatct cgcgccgctg accgcgaacg gcaactgggg cggcacgctg 







421
ttctcgcacc cgttcgacaa cgacaacacg gacaactcgt tccgcgtcaa caacacggtc 







481
aagtacgcga gcgccgactg gaacggcctg acgttcggcg gcacgtacag cttcagcaac 







541
agcacgggct tctcgaacaa ccgccagtac agcatcggcg cgcagtattc gctggccggg 







601
ctgcaggtcg cggccgcgta cctgcaggcg aacaacccgg gcatcggcag cgcgggcgcg 







661
atcgcggccg acgacgcgaa cttcgtcgcc gaccggctgc gcatcttcgg cggcggcgtc 







721
aattacacgt tcggcccggc gacggtcggc gtcgtctaca cgaagaccga cgtgaagaac 







781
ccggtatcga cggtctacct gccggcgtcg acgttcgccg gtctcgggct gaccgcgacg 







841
aagttccaga acttcgaaat caacggcaag taccagctca cgcccgattt ctatctcggt 







901
gcgcaatacg tatacacgga cggcaagtac gatgcggcgg ccggttcgtt caagccgaag 







961
taccacacgg tcggcctgat ggccgactac agcctgtcga agcgcaccga cgtgtacctg 







1021
cagggcgcgt ggcagaaggt gggcggcgac aagacgggca cggcggccga cggcggctat 







1081
gtcgtcggga cggacggccc gtcggcgtcg tcgaaccagt tctcggtgcg cgccgcgatt 







1141
cgccacaagt tctga 









153
1
mkhpvifaaa laafaapafa etsvtlygvi degfnytnnv nvngvgktny qlasgyaqgs 







61
rwg1kgtedl ggglkavftl engfdvnngr lgqggrmfgr qafvglsesr fgtltfgrqy 







121
davvdylapl tangnwggtl fshpfdndnt dnsfrvnntv kyasadwngl tfggtysfsn 







181
stgfsnnrqy sigaqyslag lqvaaaylqa nnpgigsaga iaaddanfva drlrifgggv 







241
nytfgpatvg vvytktdvkn pvstvylpas tfaglgltat kfqnfeingk yqltpdfylg 







301
aqyvytdgky daaagsfkpk yhtvglmady slskrtdvyl qgawqkvggd ktgtaadggy 







361
vvgtdgpsas snqfsvraai rhkf 





714
39
107025986
YP_623497
178
1
atgaacaaga ctctgatcgt tgcagcagct gcagcatcgt ttgctaccgt cgctcacgcg 







61
caaagcagcg tcacgctgta cggcgtgctg gacgcaggca tcacctacca aagcaacgtc 







121
ggcggcaagt cgctgtggtc gatgggctcg ggcatcgacc aaagccgttt cggtctgcgt 







181
ggttcggaag acctcggtgg cggcctgaag gcaatcttca cgttggaaag cggcttcaac 







241
atcggtaacg gccgcttcgc aaacggcaac ggcggcatgt tcaaccgcca agctttcgtc 







301
ggcctgtcga gccagtacgg cacggtcacg ctgggtaagc agtatgacgc aacgcaagac 







361
tacctggcgc cgctgacggc aacgggctcg tggggcggca cgtacttcgc gcacccgctg 







421
aacaacgacc gtctgagcac gaacggcgac gtcgcgctga acaactcgat caagtacacg 







481
agcgctaact acgctggcct gcaattcggc ggtacgtact cgttctcgaa caacacgaac 







541
ttcggcaaca accgcgcata cagcggcggt ctgtcgtacc agttccaagg tctgaagctg 







601
ggtgcagcat actcgcaagc aaacctgggt gacggtacca acacgaacgg cgcatcgacg 







661
ctgggtggcc aaggccgtgt ccgtacgtac ggtgctgctg ctggttacgc attcggcccg 







721
gcacaagtcg gcgctgcatg gacgcaatcg cgtatcgaca accaagctgc tggcgttccg 







781
acgctgcgcg ctgacaacta cgaagtcaac gcaaagtaca acctgacgcc ggctctcggc 







841
ctgggtgctg cttacacgta cacgaacgcg aaggtcaaca acggcagctc gcactggaac 







901
cagttcggcg tgcaagctga ctacgcgctg tcgaagcgca ccgacgtgta cgcacaagct 







961
gtgtaccagc gtggcgcgaa gggcaacaac atcgttggca cgggcatcta caacggcgac 







1021
aacacgacgg catcgagctc gtcggtcaac caaaccgcag caacggttgg tctgcgtcac 







1081
cgcttctaa 









154
1
mnktlivaaa aasfatvaha qssvtlygvl dagityqsnv ggkslwsmgs gidqsrfglr 







61
gsedlggglk aiftlesgfn igngrfangn ggmfnrqafv glssqygtvt lgkqydatqd 







121
ylapltatgs wggtyfahpl nndrlstngd valnnsikyt sanyaglqfg gtysfsnntn 







181
fgnnraysgg lsyqfqglkl gaaysqanlg dgtntngast lggqgrvrty gaaagyafgp 







241
aqvgaawtqs ridnqaagvp tlradnyevn akynltpalg lgaaytytna kvnngsshwn 







301
qfgvqadyal skrtdvyaqa vyqrgakgnn ivgtgiyngd nttassssvn qtaatvglrh 







361
rf 





715
55
107026730
YP_624241
179
1
atgaagaaac ttgctctctc gaccctctcg ctcgccctgc tgggcgcggc cggcgctgct 







61
caagctcagt cgagcgtcac gctgtacggc gtgatcgaca cgtcgatcac ctatgttcac 







121
ggtaacgatg gtcaaggtaa caattcgtgg tcgatgggta gcggcaacct gcaaggcagc 







181
cgcttcggcc tgaagggttc ggaagacctc ggtggcggcc tgaaggcgat cttccagttg 







241
gaaaacggct tcaactcggc atcgggcgca ctgggccagg gtggccgcat gttcggtcgt 







301
caagccttcg tcggcctgca aagcgaccag tacggtacgc tgacgctggg tcgccagtac 







361
gacccgctgg tcgacctggt ccaggcagtc acggctgaca actacttcgg cagcgtgttc 







421
gccacgccgg gtgacgtcga caacaacgac aacagcctgc gcgtcagcaa cgcgatcaag 







481
tacacgtcgc cggtctgggc aggcctccag gtcgaagcgc tgtacgcact cggcggcgtc 







541
gccggctcga ccggcaaggg tcagacgtgg gcggcagcgg cagcgtacaa caacggcccg 







601
ctcggcctgg ctgcaggtta cttccacgcg aacaactcgg ctccgctgag cgcagttggc 







661
cagcgcaccg gctgggccgg cacgtcggac gcgatcttcg acggctcggg taacttcatc 







721
aacaacgcgt acacgtcggc atcgtcgatc ggcatcgcgc aagttgctgg ccagtacgca 







781
ttcggcccgg tgacgttcgg cctcggctac agcaacgcgc aatacaaggc ggatgcgaac 







841
tcgggcttcg gcacgaacca gaagtacaac acgggtcgcg cattcgtcac gtaccaggca 







901
tcggcaccgc tgctgctggg cgtcggccac atctacacga agggcagcgg ccagacggac 







961
gccaagtacc accaggtttc gctcggcgcg gactactcgc tgtcgaagcg tacggacatc 







1021
tacctggttg gcgcgtacca gcacgcaagc ggcaagaacg ccgacggtac cgatgcgcaa 







1081
gcgtcggtcg gctcgtacgg catcgccggc aagagctcgc aggaaatcgt cgcactcggc 







1141
ctgcgtcaca agttctaa 









155
1
mkklalstls lallgaagaa qaqssvtlyg vidtsityvh gndgqgnnsw smgsgnlqgs 







61
rfglkgsedl ggglkaifql engfnsasga lgqggrmfgr qafvglqsdq ygtltlgrqy 







121
dplvdlvqav tadnyfgsvf atpgdvdnnd nslrvsnaik ytspvwaglq vealyalggv 







181
agstgkgqtw aaaaaynngp lglaagyfha nnsaplsavg qrtgwagtsd aifdgsgnfi 







241
nnaytsassi giaqvagqya fgpvtfglgy snaqykadan sgfgtnqkyn tgrafvtyqa 







301
saplllgvgh iytkgsgqtd akyhqvslga dyslskrtdi ylvgayqhas gknadgtdaq 







361
asvgsygiag kssqeivalg lrhkf 





716
58
107022697
YP_621024
180
1
atgtttgcgc tgaatgcacg tggcgcgctg cgggcaccgc tcgcgctggc cgccgcgctc 







61
gcgctcgccg ggtgttcgct cgcgccgcgt tacgagcgtc cggccgcgcc ggtgccggcg 







121
acctatacgc cggtcgacgg cggcacgacg ccggccgccg aaccggccac cgcgcaggat 







181
gccgcgctgc tcgacgactg gcatgcgtat ttcaccgatc cggcgctgca ggcgtggatc 







241
gacgcggcgc tcgcgaacaa ccgcgaccta cggatcgccg ccggccggct cgaggaggca 







301
cgtgcgctgt acggcgtgca gcgcgcggac cggatgccgt cggtcgatgc gaatctcggc 







361
tatgaacgcg cgcgccagta cgacccggtc gtgcgcgaaa gcgcgatcag cgggctgtat 







421
cgcgcgggtg tcggcgtcag cgcgtacgag ctcgacctgt tcggccgcgt gcgcagcctg 







481
tccgaggcgg cgctcgccga ctatttcgcg acggccgatg cgcagcgcac ggtccgcatc 







541
ggcgtgatcg ccgaagtggc gggcgcatac gtttccgaac gatcgttgca cgagcaactg 







601
gcgctcgccc agcgcacgct cgatgcgcgc gagcggatgg ccgcgctcac gcaacgccgc 







661
tatgccgccg gcacgagcga cgcgatcgag ctgcgttcgg ccgagatgct ggtggcgtcc 







721
gcgcgcgcgt cgcaagccgc gctgcagcgc gagcatgcgc aggccgtccg cgcgctgcaa 







781
ctgctggcgg gcgatttcgc gcgcaacgtg cccgacgacg cgaccgcgct tgacacgctg 







841
tcgatcgcgc ccgtggcgcc gggtgcgccg agccagttgc tcgaacggcg gccggacatc 







901
cggcaggccg aggggcggct gaaggccgcg aacgcgaaca tcggcgcggc gcgcgcggcg 







961
ttcttcccgc gcatcgcgtt gacgaccgac tacggctcgg tgagcgacgc gttctcgaac 







1021
ctgttcgcgg ccggcacgag cgtatggacg ttcgcaccgc gcatcacgct gccgatcttc 







1081
gcgggcggac gcaatcgcgc gaacctcgac gttgcgcacg cgcgcaagga cattgcggtc 







1141
gccgaatacg agaaggcagt gcagaccgcg tttcgcgaag tggccgatgc gttcgccgcg 







1201
cgcgactgga tcgaccggca gctgaccgcg cagcaggacg tgtatgcggc ggacggcgcg 







1261
cggctgaagc tcgcggagcg tcgttatgcg ggtggtgtcg cgacctatct cgaactgctc 







1321
gacgcgcagc gcagcacgta cgagtcgggg caggagctga tccggctcag gcagctcagg 







1381
ctggcgaatg cgatcgcgct gtatcgcgcg ctcggcggcg gctgggcgcc ggccacggcg 







1441
gaggccgcgg cttccgcgtg a 









156
1
mfalnargal raplalaaal alagcslapr yerpaapvpa tytpvdggtt paaepataqd 







61
aallddwhay ftdpalqawi daalannrdl riaagrleea ralygvqrad rmpsvdanlg 







121
yerarqydpv vresaisgly ragvgvsaye ldlfgrvrsl seaaladyfa tadaqrtvri 







181
gviaevagay vserslheql alaqrtldar ermaaltqrr yaagtsdaie lrsaemlvas 







241
arasqaalqr ehaqavralq llagdfarnv pddataldtl siapvapgap sqllerrpdi 







301
rqaegrlkaa nanigaaraa ffprialttd ygsvsdafsn lfaagtsvwt fapritlpif 







361
aggrnranld vaharkdiav aeyekavqta frevadafaa rdwidrqlta qqdvyaadga 







421
rlklaerrya ggvatylell daqrstyesg qelirlrqlr lanaialyra lgggwapata 







481
eaaasa 





717
55
107023544
YP_621871
181
1
ttacccgttc gtcgccgttg ccgacgcggg cgcaggagcc gccgccttgt cgtagtcgac 







61
cggcgcatcc ggcgcgcgcg gcgcttcgcc ggcatgctgg atccagccgc cgccgagcgc 







121
gcggtacagg tcgaccaggt tcgtccagcg cgccaggcgt gcgctgatca gcgactgctg 







181
cgccgagtac agatccgtct gcgcagtcag caccgacagg tagctgtcga caccgttctt 







241
gtaacgcagg tccgacaggt cgaagcgacg ttgctgcgcg tgctcgttgc gctcgagcgc 







301
cgcgatctgc tggtcgtacg tgccgcgtgc ggcgaggcca tccgacactt cgcggaatgc 







361
cgactggatc gccttctcgt agttcgcgat ctcgatgcgc ttctgcacgt gcgcgagatc 







421
gaggttcgcg atgttctgcc cgccctcgaa gatcggcagt gcgatgctcg gcgcgaacga 







481
ccatgccgcc gtgccggcct tgaacaggcc gcccagcgtc gggctcgcgg tgccgaacgc 







541
gcccgtcagc gaaatcttcg ggaagaacgc cgcgcgcgcc gcgccgatgt tcgcgttcgc 







601
ggccagcagc gtctgctcgg cctgcatcac gtcaggacga cgcgtcagca gatccgacgg 







661
caggccagcg ggcacgtccg tcagcaggtt ctgcgcatcg agcggcatgc ccgccggcag 







721
gtcgtccggc agcggctcgc cgatcagcag caccagcgcg ttcagcgcct gcgcgcgggc 







781
acgggcctgc gcctgctggt tcgcgagcgc cgtctcgacg accgtctgcg cctgacgcag 







841
ctcgagctcc gagcccgtac cgttgtcgaa ctgcagcttg gtcaggtcgt acgatgcctg 







901
cgcggtcttc agcgtgtcct ccgtgacctt cagcaggtcg tcggtcgaca gcagcgtcag 







961
gtattgatcc gccacctgcg acaccagcga gatctccgac gcctgccgcg catacgcggt 







1021
cgacaggtat tgcgcgagcg cctgatcctt caggctctgc acgcggccga acaggtcgag 







1081
ctcccacgac gcggacaatc caacgttata ggtgcgcgaa atcagcggcg ccctcgtcgt 







1141
cgacacaccg gccggcagac gctggatgtt gcccgtgccc gtaccgtcga gcgtcgggaa 







1201
cagccccgca cgcgtgatct ggtactgagc acgcgccgcc tcgatattga gcaccgacac 







1261
gcgcaggtcg cggttgtttt tcagcgcgat ctcgatcagc cgctgcaggc gcggatcgac 







1321
gaagaactcg cgccagccga tggcagtcgc cgcctggccg ttcgcgctgc gcgcgccggc 







1381
tgcgcccggc tgcgtcgcgt agacgccgcc ggccgggtac gcctgcgcga cgggtgcgtc 







1441
gggccgcttg tagtgcggcg ccatcgtgca gcccgtggca aagagcgcga ctgcgattgc 







1501
agtcaaagcg tgtttttgca tcat  









157
1
mmqkhaltai avalfatgct maphykrpda pvaqaypagg vyatqpgaag arsangqaat 







61
aigwreffvd prlqrlieia lknnrdlrvs vlnieaaraq yqitraglfp tldgtgtgni 







121
qrlpagvstt raplisrtyn vglsasweld lfgrvqslkd qalaqylsta yarqaseisl 







181
vsqvadqylt llstddllkv tedtlktaqa sydltklqfd ngtgselelr qaqtvvetal 







241
anqqaqarar aqalnalvll igeplpddlp agmpldaqnl ltdvpaglps dlltrrpdvm 







301
qaeqtllaan anigaaraaf fpkisltgaf gtasptlggl fkagtaawsf apsialpife 







361
ggqnianldl ahvqkrieia nyekaiqsaf revsdglaar gtydqqiaal ernehaqqrr 







421
fdlsdlrykn gvdsylsvlt aqtdlysaqq slisarlarw tnlvdlyral gggwiqhage 







481
aprapdapvd ydkaaapapa satatng 





718
19
107028625
YP_625720
182
1
ttactcggcc gaacccatca tc tttgctg atagttctgc aggccgacct tgccgatcag 







61
gtcgatctgc gtttcgagcc agtcgatgtg ctcttcggtg tcgtcgagga tcttctcgaa 







121
gatctcgcgc gacacgtagt cgcgtaccga ttcgcagtag gcgatcgctt ccttgcaggt 







181
ggactgcgag atctgctcga gcttcaggtc gcacttcagg atctcttcgg tttcctcgcc 







241
gacgagcagc ttgtgcaggt cctgcaggtt cggcaggccg tcgagcatga acacgcgctc 







301
gatcagccag tcggcgtgct tcatttcgcc gatcgactcg tcgtattcgt gcttgccgag 







361
cttgtcgagg ccccagtgct tgtacatgcg cgcatgcagg aagtactggt tgatcgccgt 







421
gagttcattc ttcagttggg cgttcagata ttcgatgact ttcttgtcgc cttgcat 









158
1
mqgdkkviey lnaqlknelt ainqyflhar mykhwgldkl gkheydesig emkhadwlie 







61
rvfmldglpn lqdlhkllvg eeteeilkcd lkleqisqst ckeaiayces vrdyvsreif 







121
ekilddteeh idwletqidl igkvglqnyq qtmmgsae 





719
8.5
107028623
YP_625718
183
1
tcaggccgcc ttgcggtcgt agaacgtgac cgggatcgga tgagcatgat gctcgacacc 







61
gcaccggctc gcgcagacgc cttgctcggc catgaggtcg cgtacggact cctcgcactt 







121
gccgcagcac gtggccacgc cgagctcgaa ctggagttca tcgaaagagt tcacgccctc 







181
ggcgagggac gcgcgaatct tgcgatcgga aacagacttg cacacgcaga cgatcat 









159
1
mivcvcksvs drkiraslae gvnsfdelqf elgvatccgk ceesvrdlma eqgvcasrcg 







61
vehhahpipv tfydrkaa 





720
36
107026550
YP_624061
184
1
tcatgctggc ctcacttgaa ccggatggtc gggttgatga ccgcgtacag cacgtcgacg 







61
gtcaggttga tcaggatgaa ttccagcgag aacagcagca cgatcgcctg gatcaccggg 







121
tagtcgcgca tcgtcaccgc atcgaccagc aggcggccga gccccggcca gttgaacacg 







181
gcctcgacga cgatcgagcc gccgagcagg aagccgaact gcagccccca tcatcgtgac 







241
gaccggaatc atcgcgttgc gcaggcaatg cttgagcacg accatcggct cgtgcacgcc 







301
cttcgcgcgc gcggtgcgca cgaagtcctc gttcagcacc tcgacgaacg acgcgcgcgt 







361
gaagcgcgcc atcacggcgg cgaccgcggc gccgagcgtc agcgacggca gcacgtagct 







421
cttccacgtg ccgtccggca cgaccggcag ccagccgagc ttcaccgaga agatctccat 







481
cagcagcatg ccgagcgcga acgcgggaaa cgaaataccc gacaccgcga tcgtcatccc 







541
gagtcggtcg ggccagcgat tgcgccacac ggccgacgcg atcccgatcg ccatcccgat 







601
cgccgtcgcc cacaccatgc tgacgagcgt gagcagcagc gtcggcatga agcgctcccc 







661
gatctcggtc gacaccggcc gcttgctgcg cgtcgagatg ccgaaatcgc cgtgcgcgat 







721
cttcacgaag aagttcgcga actgggtcgg catcggcttg tcgagcccaa gatcggcacg 







781
cacgagcgcg accgtcgcgt cgtcggcttc ggggccggcc gcgagccgcg ccgggtcgcc 







841
gggcagcagg tgcacgaaca ggaacaccag caccgcgacg atcgcgagcg tgggcagcag 







901
gccgaacagg cgtttgacga ggaaattcag cat 









160
1
mlnflykrlf gllptlaiva vlvflfvhll pgdparlaag peaddatval yradlgldkp 







61
mptqfanffv kiahgdfgis trskrpvste igerfmptll ltlvsmvwat aigmaigias 







121
avwrnrwpdr lgmtiavsgi sfpafalgml lmeifsvklg wlpvvpdgtw ksyylpsltl







181
gaavaavmar ftrasfvevl nedfvrtara kgvhepmvvl khclrnamip vvtmmgaavr 







241
lparrldrrr grvqlagarp pagrcgddar lpgdpgdraa vlagihpdqp drrravrghq 







301
pdhpyqvrpa 





721
42
107026454
YP_623965
185
1
ttagaagcgg tggatcatgc cgacgccgac gccgatctgg ttctgcttcg aacctgcgcc 







61
cacgccgtca ccgatcgacg tggtcgccga gatgattgcg ttgccgcggc cgttcagcgt 







121
gttgccgctt gcgtgctggt aagcctcaac cgcgtacagg cccgtgcgct tcgacaggct 







181
gtagtactgc gacagcgtga cctggtggta cttggccgag ctcgtgatgc cgttcgactg 







241
cgttgcggcc gtgtacgagt agcctgccgc gaagtcccac tgagcagctg ccttccagtg 







301
cagcacggcg ccagccgtgt tgaagattgc cttgttccgg aagaacgaac ccgtgcccgg 







361
cgtgtactgg acgttcgagt acgacgccga aatatcccat gccggcgtga actggtagcc 







421
cgccgtcacg gcgatacgct gctgcgattg cgcggtcgcg tagccgttgt tgatcgacga 







481
gactgccggc tggttgccgt cgcccgctgc cgtcaggccg ttctggaccg tcgagttcgc 







541
gccccacacg ccgccgccga gcgtcgagtt gttgaccttc tgatagccga ccgcgatgcc 







601
tgccgggccg ttcaggtact ggatcgccgc gctccacgtc gagccgcggt tcgtcgcgcc 







661
tgcgacgccg ccgaacgagt acgaaccgcc gaccgtgaag ccgtagaact tcggcgacat 







721
gtagacgagc gagttgttcg cgcggtagct ggtatccagc gaatcgatat cgcccgggtg 







781
cgcgccgtag tagccggtca gccaggtcgt cgggctgtac ggcgacagca gcgtgtagta 







841
cgcggtgtac tggcggccgg ccgtcagcgt accgtacgtg gcgttggtca ggccaaccca 







901
tgcctgacgc gtgaagatgc cgttggtcca ctgcgacgag ccgttagccg tgttgacgcc 







961
ggcttccaac tggaaaatcg ccttcgtgcc gccgccgaga tcttcgctgc ctttcaggcc 







1021
gaagcggctg cctgcccaca cgccggtgga catctgcacc ttcgagtgac cacccgaggt 







1081
tgcgccgact gccgacgagt t ttctgcca agcgataccg ttgtcgacga taccgtagag 







1141
ggtcacgctg ctctgggcgt gggcggcggt agcggctgcg agaccgactg ccgtcatggc 







1201
gaaagcgacg cgctttttca t 









161
1
mkkrvafamt avglaaataa haqssvtlyg ivdngiawqn nssavgatsg ghskvqmstg 







61
vwagsrfglk gsedlgggtk aifqleagvn tangssqwtn giftrqawvg ltnatygtlt 







121
agrqytayyt llspyspttw ltgyygahpg didsldtsyr annslvymsp kfygftvggs 







181
ysfggvagat nrgstwsaai qylngpagia vgyqkvnnst lgggvwgans tvqngltaag 







241
dgnqpavssi nngyataqsq qriavtagyq ftpawdisas ysnvqytpgt gsffrnkaif 







301
ntagavlhwk aaaqwdfaag ysytaatqsn gitssakyhq vtlsqyysls krtglyavea 







361
yqhasgntln grgnaiisat tsigdgvgag skqnqigvgv gmihrf 





723
85
107022089
YP_620416
186
1
atgtcgttca cctttgccgc gcggccctcg cgcggccggc tcgcgctcgc ctgcgcggcc 







61
gctttcgcat ggccggccgc ttacgcggct gccaccgacg atgtccgcgc cgggaccgcc 







121
cgcggcggtg gcgtcgatcg tcccccgacc actgcgaccg ccgcgccggc tgccgccggc 







181
gtcgcggcag cgtccgcacc ggccggcgat acgctgagcg ccgtcagcgt gaccgcgcag 







241
cggcaaccgg tcgacccgga tacgccggcc gtcgtcacgt caattacgcg cgagcagatc 







301
gatgcgcata ccaacgtcac caccgaggac gcgctgaagt acgcgccgaa cctgatggtc 







361
cgcaagcgct acatcggtga ccgcaacagc atattcgcgg gtcgcgactt caacgaactg 







421
cagagcgcgc gcgggctcgt ctatgcggac ggcgtgttgc tgtcgaacct gctcgggtcg 







481
agctatgcat acccgccgcg ctggtcgctg attccgcccg acgacatcgc gcgcgtcgac 







541
gtgctctatg gccctttttc cgcgctctat ccgggcaacg cgatcggctc cactgtgctg 







601
ctcaccacgc gccgcccgga acaactcgag gcgtcgctgt cgacgcagtt ctttacgcag 







661
cgctaccacg acggctacgg ctttgccgac agcttcggcg gcaatcacca gacggcgcgg 







721
atcgcgaacc gcgtcggccg gttctggttc gcgttgtcgc tcgaccggct ggagaacaac 







781
ggccagccga tgcagtacgc gggccccaat tcgacctaca acccgaaact gggcgccgcc 







841
gtgccggtga cgggcgccgc gaccgacatc ggccccaacg gcaagccgcg gaccatcgtc 







901
ggcgcgcaga cgatcgagcg gaccgaacag ctcaacgaga cggtccggat gggctatgcg 







961
ttcaccgatc acgtcgatgc gacgctgacg ctggggcatt gggagaacca ttaccggcag 







1021
cacggcgaga ccttcctgcg agatgcggcg ggcaatcccg tctacggcgg caacgtgtcg 







1081
atcggcggcc agaacatgac ggtcgcgccg aacgcgttcg cgccgcagcg cggcgaccag 







1141
gagaactggc tgtacgcgct cgggttgaac ggccggctcg attccggctg gcggctgtca 







1201
ggcgtcgtgt cggcgtacga cgtgtcgcgc gacgtgctgc gcgcggcctc gaccgtgcag 







1261
ggtggggcgg gcacgctgtt ccagggcgac ggcaccggct ggcgcacgct cgatctgaag 







1321
gccgaggcgc cggtagtgaa aggtcatacg ttcacgttcg gctatcacta cgacaactac 







1381
ttcctgcgca acgtcacgta caacacggcc gactggctgg ccggcccgac cacgtcgctc 







1441
gcgagcgtct atcgcggcga cacgcgcacg caggcgctgt tcgggcagga cgcgtggcgt 







1501
ttcgcgccgc gctggctcgc gacgctcggg ctgcgctacg agcgctggga tgcgtatggc 







1561
ggcgcgctcg gcaatgcgcg cggcacgctc ggctatgcgg accgcggcgc gaacgcgctg 







1621
tcgccgaaag tcgcgctgca gtgggacgcg acggaggtct ggcgcttccg gctgtcgttc 







1681
gcgaccggca cgcggttccc gacggtcggc gaattgttcc agggcacgat ctcgaacaac 







1741
gcgatcgtca acaacaaccc gaacctgcgg ccggagaagg cgatcgactg ggacttcacg 







1801
gccgaacgcg acgtgggcgt cggcgtcgtg cgcgccagcg tattccagag cgatctgcgc 







1861
gactcgatct acagccagac gacggtgtcg ggcgcgacca ccgtcaccaa catctcgaac 







1921
gtcgaccgcg tgcgcgtgcg cggcgtcgaa ctcgcgttca gcggcgagaa cgtcggggtg 







1981
cgtgggctcg atctcgacgc gaacgtatcg gcgagcaacg cgcagatcct cgccgatgcc 







2041
gcgaacccgg cctacgtcgg ttcgcgtttc ccgcgcattc ctcgcatgcg cgcgaacctg 







2101
ctcgcgtcgt atcgcttcga cgagcactgg ctcacgagcg tcggcgtgcg ctactcgggc 







2161
cgccagttca acacgctcga caacagcgac gtgaatccgg acgtctacgg cggcacgagt 







2221
tcgttcacgg tcgtcgatct gaaggcgcgc tatcgcttcg atcgtcactg gaccgcgtcg 







2281
gccggcatcg acaacgtgac ggaccgccgc tactacacgt tccacccgta tccgggccgc 







2341
acgttctatg gagaactgaa atggtcgctg tga 









162
1
msftfaarps rgrlalacaa afawpaayaa atddvragta rgggvdrppt tataapaaag 







61
vaaasapagd tlsavsvtaq rqpvdpdtpa vvtsitreqi dahtnvtted alkyapnlmv 







121
rkryigdrns ifagrdfnel qsarglvyad gvllsnllgs syaypprwsl ippddiarvd 







181
vlygpfsaly pgnaigstvl lttrrpeqle aslstqfftq ryhdgygfad sfggnhqtar 







241
ianrvgrfwf alsldrlenn gqpmqyagpn stynpklgaa vpvtgaatdi gpngkprtiv 







301
gaqtierteq lnetvrmgya ftdhvdatlt lghwenhyrq hgetflrdaa gnpvyggnvs 







361
iggqnmtvap nafapqrgdq enwlyalgln grldsgwrls gvvsaydvsr dvlraastvq 







421
ggagtlfqgd gtgwrtldlk aeapvvkght ftfgyhydny flrnvtynta dwlagpttsl 







481
asvyrgdtrt qalfgqdawr faprwlatlg lryerwdayg galgnargtl gyadrganal 







541
spkvalqwda tevwrfrlsf atgtrfptvg elfqgtisnn aivnnnpnlr pekaidwdft 







601
aerdvgvgvv rasvfqsdlr dsiysqttvs gattvtnisn vdrvrvrgve lafsgenvgv 







661
rgldldanvs asnaqilada anpayvgsrf priprmranl lasyrfdehw ltsvgvrysg 







721
rqfntldnsd vnpdvyggts sftvvdlkar yrfdrhwtas agidnvtdrr yytfhpypgr 







781
tfygelkwsl 





724
78
107026278
YP_623789
187
1
tcagaacgtg gtgcgcaatc cggccatcag gcttcggccg ccttgcgggg cgaagccgcg 







61
cacgaccgac gtcgagtagc gaatctcctg gttcgtcagg ttgtcgccgc gcaggtgggc 







121
gagccagtgc gtcgggccga cgcggaattt gtacgtcagc atcacgccga gcgacgtata 







181
gccgtccgtc gagaagtcgt cgtcgggcac gcgatgctgc gaccacgcgt gcgtgacctg 







241
tgcgcgcgca ccgaacgggc cgtaaccgta gtcggccgcg agcgtcgcgc gcagcggcgc 







301
gatgcgcggc agcggctgcc ccgtgtcgac gttgcgcgcg tgcgtgtagt cggccgtcag 







361
ttcgagatcg accgtgtggc cgcgccgcga gaacgcacgc cacttgccgt ccagctcgac 







421
gccgtagaac tcggcgcgca cgccgcggta gatcgcctcg ttgagcgagc cgtcagtgcc 







481
gggcgcgacg ggctcgccgt cgtcgtccac cacgcggccg gtgttgtact cggtcagata 







541
gttcgagaag cggttgtaga acacgccgac gctgccgcgg ttcggcccgc tcgcatagcg 







601
cagcgacagg tcggtcgaca cggccttttc cttcgacgcg ttcgggttgc cgatcaggaa 







661
ctggccggtc gcatcgtgcg ggccgttcga atacagctcg tagaaggtcg gcgcgcgttc 







721
ggtgtaggcg acgttcgctg cgatcgacca taccggcgtc agcgagaaca gcgcaccggc 







781
cgacaggctg cccgcattga agtcgcgcgg ctgcgcgccc gcgaatttct cgacgcccgc 







841
cggatcggga tcgaccttca cgtgctcgaa gcggccgccg aggctcagct tcagcgccgg 







901
gacgacctgc cattcctcga ggccgaacag cgcgacgctg ttggtgcggg tggacggcac 







961
gagcatttcg tcgccgagcg ccgagaacgt gttctggccg aactgcacgc cgatcgcgcc 







1021
ttcgaacggc ccgatcttgc gatggcgtgc ctcgatgcgc gcctcgtaac cgcgattgcg 







1081
gaaggtggtc gccgtctcgc cgttgtccac ttccttgtgg cggtagtcgg tgtacgcgaa 







1141
atcgaatttc agcttcgtga acggcccgct caggttgcgc acctcggacg cgaacgcgag 







1201
gcgttcctgg cgcatccgca gccgcacgtc gctttccgcg acggagccgt agttcgattc 







1261
gtagccgctg tacgacaggc ccgcgaaacc gtccgcccac gtgtacgacg cgccgaccgc 







1321
accgccgtgc acgcgcccgt cgctgttcgg cacgttgcct tccggttgcg gcgcgtcggg 







1381
gccgtcgatc gcgcgctgct ggctgctgcg cgcatagccg ggaatccgca gcttgctggt 







1441
ttcgcgatcg aacgcgtcga cgtggaacgc gaagcggccg ttgccgcctt cgacctgcgc 







1501
ggcgccggcg cgcacggaat tcgcgccgcc gtagcgggcg tcgagcgcgc ccgtcatgcc 







1561
ttcgatcgct tcgcgcggaa tccggttgtc gatcgtgttg accacgccgc ccaccgcgtt 







1621
gccgccgtac agcagcgcgg ccgggccgcg cacgatctcg acgcgctcga tcgacagcgg 







1681
gtcctgcggc accgcgtgat cgtacgacag cgacgacgcg tcatacgctg cgacgccgtt 







1741
ctgcagcagc cggatccggt cgccgtccat cccgcggatg atcgggcggc cgaccatcgg 







1801
cccgtaggtg gtggtcgaca cgccgggcag cccgttgagc gtttcgccga gcgaatcggc 







1861
ctggcggcgc gtcagcgcgt cgccggaaag ctggacggtc ggtgcgatca gttcggtgtc 







1921
gccgagcggg ttcgcggtca cgaagatcgg cgcgagcggt gcgggtgcgg cggcaggcgt 







1981
gccggacgcg gcgggcgcat cggtctgcgc atgggcgacg gcggcgcaca gcatcagcga 







2041
cagggacgag agcgggcgaa gcggtaagtg agggaggtcg cgcatggtcg gtgaatcggt 







2101
tggaatcgga tgatcggata acgagcgatg cgggtcactg ttagatacga tatattgttg 







2161
cgtaactgac gcgcagtgac ccgttggaac atggctcggc aaggcgcggc cacctggttg 







2221
gaacggcat 









163
1
mpfqpggral pshvptghca svtqqyivsn sdphrslsdh piptdsptmr dlphlplrpl 







61
sslslmlcaa vahaqtdapa asgtpaaapa plapifvtan plgdteliap tvqlsgdalt 







121
rrqadslget lnglpgvstt tygpmvgrpi irgmdgdrir llqngvaayd asslsydhav 







181
pqdplsierv eivrgpaall yggnavggvv ntidnripre aiegmtgald aryggansvr 







241
agaaqveggn grfafhvdaf dretsklrip gyarssqqra idgpdapqpe gnvpnsdgrv 







301
hggavgasyt wadgfaglsy sgyesnygsv aesdvrlrmr qerlafasev rnlsgpftkl 







361
kfdfaytdyr hkevdngeta ttfrnrgyea riearhrkig pfegaigvqf gqntfsalgd 







421
emlvpstrtn svalfgleew qvvpalklsl ggrfehvkvd pdpagvekfa gaqprdfnag







481
slsagalfsl tpvwsiaanv ayteraptfy elysngphda tgqflignpn askekavstd 







541
lslryasgpn rgsvgvfynr fsnylteynt grvvdddgep vapgtdgsln eaiyrgvrae 







601
fygveldgkw rafsrrghtv dleltadyth arnvdtgqpl priaplratl aadygygpfg 







661
araqvthaws qhrvpdddfs tdgytslgvm ltykfrvgpt hwlahlrgdn ltnqeiryst 







721
svvrgfapqg grslmaglrt tf 





725
43
107023546
YP_621873
188
1
ttattgcgcg ctcgacgcgg cggcagcact cgccggcgcg gcgcccgatg cggcggcacc 







61
ggaagcggca gcggcgccgg acccggcagc ggccggcgcg gcgggggcag cggcacccga 







121
ggcggcatcg gccgccgggg cgagctgcgc ggcgacggtc ttcacggtcg cgcccgggcg 







181
caccttgtcg acgccttgca cgatcacgtg atcgcccgcc tgcagaccgc cttcgacgat 







241
ccagttctgg ccttgcatgc cggtggtctt cagcggacgc ggctcgacct tgttgctggc 







301
gttcaccacc atcgcgatcg cctggccctt ctggtcatgc gtgacgccga tctgcggcac 







361
caggaaggcg ttctcgttca cgccttcctc gatccgtgcg cgcacgaaca tccccggcag 







421
cagcacgcgg cccgggttcg ggaacaccgc gcggatcgtc accgagccgg tggtctggtc 







481
gaccgtcacg tccgagaact gcagcttgcc cggctccgag taggtcttgc cgtcttccag 







541
gatcagcgac accttcgccg cgcccgggcc gctcgtcttc agacggccgc tctgcacgtc 







601
ctgacgcagc ttcagccctt cgaggctcga ctgcgtgagg tcgacgtaca ccggatcgag 







661
ctgctgcacg gtcgacatca gcgtcgcctg gctcgcctgc acgtatgcgc ccggcgtcac 







721
ttgcgagatg ccgacgcggc cgctgatcgg cgacacgacg tccgtatagc cgaggttgat 







781
ctgcgcggta tcgaccgccg ccttgccggc cgcgacgtcc gccgcggcct ggccttgcgt 







841
ggcgaccgcg ttgtcgtagt cctgcttgct gaccgcgttc gcggccacca gcaccttgta 







901
gcgcgcgacc agcgcgttct gcgtgacgag gttcgcctgc gccttcgcga gcgtcgcctt 







961
cgcgctgttc agcgccgcga tgtacggcgc cggatcgatc ttgtagaggc gctgaccggc 







1021
cttgacgtcg gtgccttcgg tgaattcacg gcgcagcacg atgccgtcga cccgcgcacg 







1081
cacttgcgcg acgaggaatg cactggtgcg gcccggcagg tcggtgaaga ccggtacggc 







1141
ttgcggctgg acggtgacga cgccgacttc cggcgtttgc ggcggcggtg ccgattcttt 







1201
tttcccgcac gcggccagga aaacggcggc cgtcgcgaca gtgattaagc ggtatggaac 







1261
ccgttcgacg cgcat 









164
1
mrvervpyrl itvataavfl aacgkkesap ppqtpevgvv tvqpqavpvf tdlpgrtsaf 







61
lvaqvrarvd givlrrefte gtdvkagqrl ykidpapyia alnsakatla kaqanlvtqn 







121
alvarykvlv aanavskqdy dnavatqgqa aadvaagkaa vdtaqinlgy tdvvspisgr 







181
vgisqvtpga yvqasqatlm stvqqldpvy vdltqssleg lklrqdvqsg rlktsgpgaa 







241
kvsliledgk tysepgklqf sdvtvdqttg svtiravfpn pgrvllpgmf vrarieegvn 







301
enaflvpqig vthdqkgqai amvvnasnkv eprplkttgm qgqnwivegg lqagdhvivq 







361
gvdkvrpgat vktvaaqlap aadaasgaaa paapaaagsg aaaasgaaas gaapasaaaa 







421
ssaq 










Bmu17616-800












ID
MW
Protein GI#
Protein Locus#
SEQ ID NO:
Sequence
















801
81
161524767
YP_001579779
212
1
tcaccagtta tatttcgcgg tggcgatcac cgtgcggtcg ttgccgaaca tgcagaccga 







61
ggtcgactgg caaccgctga tgtaatgacg gttgaacagg ttcgtcgcgt tcaccgcgaa 







121
gcgccagttg cgcacgtcgt aatgcacggc cgcgtcgatc agcgtgtagc tcgacaccgt 







181
cagcgagttg tcggccgcgc cggcggacgc gctctggtag cgcacgccgc cgccgagccc 







241
gaagcccgcg agcggccccg tgtgccacgt ccagtcggcc cacagcgacg ccatctggcg 







301
cggccgcgga atgtcgaccg gccagttgtt cagcgtcgcg tcgttggcct gcacgacctt 







361
cacgtcctga tacgcgtacg acgcgatcag cgacaggttc ggcgtgacct tgccggttgc 







421
gctcagctcg atcccgcgcg cgcgcacttc accggtctgc acggatttcg tgccgcccgg 







481
atcctgcgtc ggcagcgccg gcgtaacgcc gttggtctgg ttgatctgat agatcgcggc 







541
attcagcatc aggttcttgc cgggcggctg ccagcgcagg cccgcttcga tctgcctgcc 







601
gcgcgtcggc ttcggcaccc cgccgtcgag caggctcacg ccgatgatcg gattgaacga 







661
cgtcgaatag ctgatgtacg gcgacagccc gtagccggcc tgatacgtga ggccgacgcg 







721
gcccgtaaac gccgacacgt cggccttcga ctgcgtcccg gccgcgcggt cgtccatccg 







781
catgttgacc cagtcctcgc ggccgccgag cgtcagcgtc cagcggttcc acttgatctg 







841
gtcctgcgcg tacaggccga acgtgttcat cgtcgtgtac gtgttggtgc ggtacgtggt 







901
gtccggatcg gaaaacacgg ccgtcgtgac cggcgtgtac accgggttgt agaggttcag 







961
cgtcggcgcg gcggcgagcc attcgctatc ggtcgccgtc tggcggttgt actggaagcc 







1021
gagcagcagt gtgtgctgga gcggacccgt cgcaaaacgg ccctcgaggt tgttgtcgat 







1081
gtcgaagcgg ctgtagttca tctggaacac gccggcccag cggttgattt cggtcagcgt 







1141
ttcgtcggcg aagcccgctc cccacaccga gccgttgtcg accgacagat gcatccatcg 







1201
cgtgttctgc cggaacgtcc acatcgagtt caggttgtgc tcgaactgat agccgagcga 







1261
ccactgcttc ttgcggtagt agttgaagtt gccgtcgcct tcgtagatgt ccttcgtgat 







1321
ctgcccgttc ggattcggca gcaccgtgcc ggccgcgggc aggaagttcg acgagatgtc 







1381
gccccagtcc tgcagatacg tggccgacag cgtcagcgac gtgtcggcgt tcgggcgcca 







1441
gcggaacgac ggtgcgagcg cgacgcgctg atcgttgttc gggccggtca ccgcgttgcc 







1501
gtcgcgcgcg acgccgacga accggtacgc gtacttgccg tccggatcga gcttgtcgcc 







1561
gacgtcgatc atgaactgct tgcgtgcgta gttgccgatt tcgacgcccg cttcgcgcac 







1621
gcgctcgccg tcggcgagct tcgtcttcac gtcgacgatc gcacccggat cgcccgcgcc 







1681
gtacagcacc gacgtcgggc cgcgcagcac ggtgatgctg tcgatcatgt acggatcgac 







1741
gcgccagctc gacaggttca gcgtgttggg cacctgcagg ccgttcacgt acgcggtcgg 







1801
cgtgaagccg cgcagcgccg cgtaccagtc ggagcggttg tccgagccgt acgacgaaaa 







1861
gcccggcacg tagcgcagcg cggtgttcac gtcggttgcg cccgtcatct cgatctgctg 







1921
cgcggtcacg acgttgatcg tctgcgggat ctcctcgatc ggcgtgtcgg tcttcgtgcc 







1981
ggtccggctg cgcttcgcga cgagccccac cgtgccgtcg ccttccgacg ccgcgttcac 







2041
ggtgatcgcc ggcagcgtgc cggccgacgc agcgttcggt gcagcggcct gcgcattcgc 







2101
cgccggcgtg gtctgcgcga acgcctgtcc ggcagccgcc gcaccgaatg ccacgctcgc 







2161
cgcggccgcg atcgcacgca accgcgtgcc tgttgcccaa tccat 









189
1
mdwatgtrlr aiaaaasvaf gaaaagqafa qttpaanaqa aapnaasagt lpaitvnaas 







61
egdgtvglva krsrtgtktd tpieeipqti nvvtaqqiem tgatdvntal ryvpgfssyg 







121
sdnrsdwyaa lrgftptayv nglqvpntln lsswrvdpym idsitvlrgp tsvlygagdp 







181
gaivdvktkl adgervreag veignyarkq fmidvgdkld pdgkyayrfv gvardgnavt 







241
gpnndqrval apsfrwrpna dtsltlsaty lqdwgdissn flpaagtvlp npngqitkdi 







301
yegdgnfnyy rkkqwslgyq fehnlnsmwt frqntrwmhl svdngsvwga gfadetltei 







361
nrwagvfqmn ysrfdidnnl egrfatgplq htlllgfqyn rqtatdsewl aaaptlnlyn 







421
pvytpvttav fsdpdttyrt ntyttmntfg lyaqdqikwn rwtltlggre dwvnmrmddr 







481
aagtqskadv saftgrvglt yqagyglspy isystsfnpi igvslldggv pkptrgrqie 







541
aglrwqppgk nlmlnaaiyq inqtngvtpa lptqdpggtk svqtgevrar gielsatgkv 







601
tpnlsliasy ayqdvkvvqa ndatlnnwpv diprprqmas lwadwtwhtg plagfglggg 







661
vryqsasaga adnsltvssy tlidaavhyd vrnwrfavna tnlfnrhyis gcqstsvcmf 







721
gndrtviata kynw 





802
55
161519924
YP_001583351
213
1
atgcttgagg acaacaagat ggacaacatg cacaacacaa acggcctgat gcgcatcgcg 







61
aaggtggcgg ccgcgagcac gctgctcgcg acgctgctcg ccgcgtgcgc ggtgggcccc 







121
gactacaagc gcccggacgt cacgacgccc gcggcgttca aggaagcgcc gacgctcgca 







181
cccggcgagc aggccggtac ttggaagccg gccgagccgg ccgacggcgc gcatcgcggc 







241
gaatggtgga aggtgttcgg ggatcccgtg ctcgacgcgc tcgaggaaca ggcgctcgcc 







301
gcgaaccaga acctgaaggc cgcggccgca cgcgtcgagg aggcgcgcgc cgcgacgcgc 







361
actgcgcgct cgcagtggtt cccgcaggtc ggcgtcggct tcgggccgac gcgcgaaggg 







421
ctctcgtcgg cgtcgcagtt ccagccgcag ggcacagggc cgaccaatgc gacgctgtgg 







481
cgcgcgcagg gcaccgtgtc atacgaggcc gacctgttcg gccgcgtgag ccgcaacgtc 







541
gaagcgtcgc gcgccgatca ggcgcagagc gaagcgctgt tccgttcggt gcagctcgcg 







601
ctgcaggccg acgtcgcgca gaactacttc gaactgcgcc agctcgattc ggatcaggac 







661
ctgtaccgcc gcacggtgga gctgcgcgag gaagcgctga agctcgtgca gcgccggttc 







721
aacgaaggcg acatcagcga gctcgacgtg tcgcgtgcga agaacgagct tgcgtccgcg 







781
caggccgatg cggtcggcgt cgcgcgccgg cgcgcggcgt ccgaacatgc gctcgcgatt 







841
ctgctcggta aggcgccggc cgatttcgcg ttcaaggaaa cgccgctcgt gccggtcgcg 







901
gtgaagattc cgccgggatt gccgtccgcg ctgctcgaac gccgtccgga cgtcgcggcg 







961
gccgagcgtg cgatggcagc tgcgaacgca cggatcgggc tcgcgaagtc cgcgtatttc 







1021
ccgaagctcg acatcaccgg atcgttcggc tatgaagcgt cgacgctcgg caacctgttc 







1081
ctgtggtcga gccgcacgtt cctgctcgga ccgttcgcgg gcaccgcgtt gacgctgccg 







1141
ctgttcgacg gcggacgccg cgcagccggc gtgcagcagg cgcgcgcgca gtacgacgaa 







1201
caggtcgcga actatcggca gcaggtgctc gtcgcgttcc gcgaggtcga ggacaatctc 







1261
gccgatctgc ggctgctcga cgatcagatt cgcgcgcaga atgcggccgt caatgcgtcg 







1321
cgccgtgcgg cgacgctgtc gcggacgcag tatcaggaag gcgaagtcgc gtatctcgac 







1381
gtgatcgata gtgagcggtc cgtgttgcag tcgcagttgc aggcgaatca gctcacgggt 







1441
gcgcaggccg tgtcgaccgt caacctcatc cgtgcacttg gtggcgggtg gggtgaggcg 







1501
ccgacggcgg tcgacggggc ggcggcggcc aaggcggaga tcgcggggcg ttga 









190
1
mlednkmdnm hntnglmria kvaaastlla tllaacavgp dykrpdvttp aafkeaptla 







61
pgeqagtwkp aepadgahrg ewwkvfgdpv ldaleeqala anqnlkaaaa rveearaatr 







121
tarsqwfpqv gvgfgptreg lssasqfqpq gtgptnatlw raqgtvsyea dlfgrvsrnv 







181
easradqaqs ealfrsvqla lqadvaqnyf elrqldsdqd lyrrtvelre ealklvqrrf 







241
negdiseldv sraknelasa qadavgvarr raasehalai llgkapadfa fketplvpva 







301
vkippglpsa llerrpdvaa aeramaaana riglaksayf pklditgsfg yeastlgnlf 







361
lwssrtfllg pfagtaltlp lfdggrraag vqqaraqyde qvanyrqqvl vafrevednl 







421
adlrllddqi raqnaavnas rraatlsrtq yqegevayld vidsersvlq sqlqanqltg 







481
aqaystvnli ralgggwgea ptavdgaaaa kaeiagr 





803
84
161520344
YP_001583771
214
1
tcagtagcgc gcgttcagcg tgacgaacgc ggagcggccc ggcgcgatcg acgcgtagtg 







61
cgccggatac gcctgatcga agtacgtgcg gttgaacagg ttgttcacgt tcagctgcag 







121
gtcgagcttc ttgttgatgc ggtactgcgc catcgcatcg aagcgccagt acgacggcac 







181
cgcgcgcagg ttcgccggat cgccgaacac ttccgacatg tagaacgcgc cgccgccgac 







241
cgtgaacttc ggcgtgacgt cgtagttcgt ccacatcgtg aagctgtgct tcggcgtatt 







301
cgggaagcga tggccgttgt tcgcggtgtc cttgccgttg tcgcgcagtt cgctcttcat 







361
gtacgtgtag ccgccgaaca cctgccactg tttcgtgagc tggcccgcga cgccgagctc 







421
gaggccctgc acgcgcttgt tgccgaccat cgcgtactga ttgttcggca gcgtcacgcg 







481
cgcgttcgtc gtatcgatct ggaacagcgc ggccgtcagc gacagcttgt cgttcagcac 







541
gttccatttc gtgccgacct cgatgctgcg gttcttctcg ggcgacagct ggtcggcgtt 







601
cgagccgacg ccgccgcggc ccggcgtcag cgactgcgtc tcgctgcctt cgccgagcag 







661
catcccggcc ggcgtcgacg acgtcgcgta cgacgcatag atgctgccgt tctgcgcggg 







721
cttgaacacc gcgccgagct gccagttgaa cagcgtgtcg tcgcgcgtgt aggtcttgcc 







781
gccgttcgcc ttggtatcgg tgaagcgcgt cgagtaatcg tcgacgcgca cgccggcatt 







841
gagctgccag cgcggcgtca gctcgatcgt gtcgaagccg tagatcgact tcgtcgtcgt 







901
gcgcgcatgc gcatagtcgt tgttgcgcga gatcgaaccg gcccacggat cggccgggtt 







961
cggcgaccac aggctcgtgc agttgtagcc ggacggcgcg ccgatgccgt tgttgcagat 







1021
cgcgccggtg ccggtcgcga ccgtgtacgt gtcgcgcttg ccccattcgc gcgacagctc 







1081
gatgccggtc gtgaagctgt gcttgaacgg accggtgcgg aattcgccga acagctcggt 







1141
ctggttcgcg atgctgttga tcgaactgtt gcggttgttg ttgcggcgcc agaccttgcc 







1201
gttgatcacg ttgccctgac tgtcgtcggg ctgcgtccag atgtagtcct gcgtcgattc 







1261
ggtgtagcgc gtcgtgttgc gcacggtcag cgccggcgtg atgtcgtgct cgatgcgcag 







1321
cgtgctgatg tccgacgtcg tcttgcggaa gtcgcggtcg atcaggccgt agaagttgtg 







1381
gcggtcgacg ggcgccggat agatcgtgtc gacgttcgcg ggcttgttcg tcgtcgtgta 







1441
gaagtacgga atgccgccgt cgggcaggtc gtccgtcgac agatggtagt agctcgcggt 







1501
cacgcgtgtc ggcgtgccga ggccgaacgc gatcgacggc gcgacgcccc agcgttcgtt 







1561
gttcacggcg tcgcggcccg ccacgtcgtt gttgtggctc atcaggttca gccggaacgc 







1621
cgcatgatcg gcgaactgcc agttgccgtc ggcggtgaag cgacggtagc gatcggtgcc 







1681
gagcccggcg ctcgcggcgg ccgtcgtgcc gaggtgcggc gccttcgtga tcaggttgat 







1741
gctgccgccc gcgccgccgc ggccgccgta cgcgccgtcc gagcctttcg tgatctcgac 







1801
gcgttcggtg ttgaagattt cgcgcgtggt cgcgccggtg tcgcgcatgc cgtcgacgaa 







1861
catgctgccc tgcgtgtcgt agccgcggat gaacgggcgg tcgccgagcg ggttgccgcc 







1921
ttcgccggcg ccgaacgtga tgccgggcac ggtgcgcagc gcttcggtca gcgtcgaggc 







1981
gccgctcgtc tggatcagct cctgcggaat cacggtgacc gatttcggcg tgtcgacgag 







2041
cggcgcggtg aatttggccg acgcggaaaa atcggccttg tagctgtgct cagcgcggcc 







2101
cttgatctcg atcggcgcga gatggccttc cgcgtctgcc ggcgcgggcg gcggggcgcc 







2161
gtcggcggcg aatgcgggac cggcggcgag cacgctgcac agggtggtga attttccgag 







2221
cttcgactcg tcggaacggg acttcat 









191
1
mksrsdeskl gkfttlcsvl aagpafaadg apppapadae ghlapieikg raehsykadf 







61
sasakftapl vdtpksvtvi pqeliqtsga stltealrtv pgitfgageg gnplgdrpfi 







121
rgydtqgsmf vdgmrdtgat treifnterv eitkgsdgay ggrggaggsi nlitkaphlg 







181
ttaaasaglg tdryrrftad gnwqfadhaa frlnlmshnn dvagrdavnn erwgvapsia 







241
fglgtptrvt asyyhlstdd lpdggipyfy tttnkpanvd tiypapvdrh nfyglidrdf 







301
rkttsdistl riehditpal tvrnttryte stqdyiwtqp ddsqgnving kvwrrnnnrn 







361
ssinsianqt elfgefrtgp fkhsfttgie lsrewgkrdt ytvatgtgai cnngigapsg 







421
ynctslwspn padpwagsis rnndyahart ttksiygfdt ieltprwqln agvrvddyst 







481
rftdtkangg ktytrddtlf nwqlgavfkp aqngsiyasy atsstpagml lgegsetqsl 







541
tpgrggvgsn adqlspeknr sievgtkwnv lndklsltaa lfqidttnar vtlpnnqyam 







601
vgnkrvqgle lgvagqltkq wqvfggytym kselrdngkd tannghrfpn tpkhsftmwt 







661
nydvtpkftv gggafymsev fgdpanlrav psywrfdama qyrinkkldl qlnvnnlfnr 







721
tyfdqaypah yasiapgrsa fvtlnary 





804
83
161519887
YP_001583314
215
1
tcagaaatcg accttcatgc tgactgcgac ggtacggccg ggcgacgtat aggcgtcgag 







61
caccggcgag tcggcggcga tgccgcgcac gtcggaccag ttccagtact tgcggtcgaa 







121
caggttgcgg atgccgaccg tcgcgctcac gtgcttgttg aagcgatagc cgccgcgcag 







181
atcgacgaca aacgacgaag gcggcgcgaa gcaggctcgc ttcgaacagt cggaccggtc 







241
gatgtccttg tcgcgcttcg cggcctggaa cagcagatcg gtctgcacga accagcgttc 







301
ggtcggctcg tagcgcacgc cgaacacggc agaaaacgga ttcacggtgt tgagcggctg 







361
gctcgcggcg ccgtcgtttt gcgtcgagcc gcgcgtcagc gcgagcgcgg tgcgcagcgt 







421
gatgccgttc ggcatcgccc agtcggcgcg gccttcgatg ccgtgaacgc gcgcatcggc 







481
gaagttcacg tactggaaca cgaacggatc ggtcggccgg ccgctgccgg cgatcgtcgt 







541
acgcgcgatg aagttgcggt agcggccggt gaacgcggcc gcgctgtagc gaatcgtgcc 







601
gtagccggtg ccggcccgcc cgcgcagccc ggcttcgaac gtgtcgctgg tctcgggctt 







661
caggttcgga ttgccgatcg aggtatagcc ctgcacggga ttcgagaagc tgctgttgac 







721
ctgatcgggc gtcggcgcgc ggaatccatg tgcgtactgc agatacggaa tcagcgcagg 







781
cgcgatctcg tacagcaccg cgacgcgcgg cgacagttcg ttcgcgtgcg tcgacaccgc 







841
gcggccggtg aacagcggat cgttgcccga cggcttcagc cggtacgtgt cgaagcgcag 







901
ccccggcgtg acgagcaggc ggccgtagcc gatctgatcc tgcacgaacg cgccgaacag 







961
cgtgtagtcg gtgtcgggga aggccttgtt cgggaacgtc tcgccgacgc cgggcaccgt 







1021
gccgtcgcgc aggttcgtga tccgcgacag gctgccgtcg acgccgccca gcagcttgtg 







1081
cgcaagcggg ccggtcgaga atccgctttc ggcgaacgcg gagccgccga acgtgcgttc 







1141
gcggtagcgg ttgtcgcgcg agcgtgacgg cagcgtgccg cgcgtttcga acgcgtactg 







1201
gtcctgcgtc gcttcctggt aatagaactg cacgtgcgcg ttctgcagcc agcgcgaggc 







1261
gtcgtcgtga aaggcgtaat cggcgctgaa gcggttgcgt tcgagccggt cgctcgtcgt 







1321
gaggccgagc gtcgcgggcg cgacgatcgc agatagcacg tcggtgccga cgcgccgctg 







1381
cacggtttcg gcggtcagcc tgatcgtgtc gcgcacggtg ggcgtcagca cgagcttgcc 







1441
gagcagcgat tcgctgtaca cgtgctgcgg gttcgacgtc gtgcgcagct tgcctgcgga 







1501
gttgttgctg ccgcgcgtgt cgatctcgtg accgcggcgg ccgtcggcga tgatcatccc 







1561
ctggatgcga tcgttgccgc cggcggccga cacggtcgcg ccgatgctgc ggtcggtcga 







1621
atcgtagctc ggccggaacg agaagtaggt cggcttgtgg tagatcgaca gcaggtcctg 







1681
cggatccttc gtgatgaagt tgacggcacc ggtcagtccg tcgctgccgt acagcgccga 







1741
cgccggcccg cgcacgatct cgatgcgctt cagcgtatcg agatcggtgt agtcgccgcg 







1801
gcccgcctcg agcgggccga acgagaatgc gttcggcagg cggatgccgt cttccatcag 







1861
catcacgcga ttgccttcga ggccgcggat gttgatgctc gaatcgccgt cgcggccgcc 







1921
gccgagcgcc gcgttgccgg gacggtacgc ggtgcgccgc accgtgatgc cgggctcgta 







1981
gcgcaaggcg tccttgatgt tggtggcctg ctgttcgtcg aggtcgtccg cgtcgatcac 







2041
cgacacggaa gcggcggtgc ggctcgcggc cgtcgcggtg cgcgtcgccg tgacggtgac 







2101
cgggtcgagc agcgcgctgc tggcgcgtgc ggctgccgcg tcggccgtga cggacgaggc 







2161
ggcggtgtgg gacggcgaag aatcggcgtg agccgtggcg gcggacaatc cgaccgcgcc 







2221
gatgagcgcg gcaaagatcg gctggcgcgc cagcgtagag cgatgcac 









192
1
mhrstlarqp ifaaligavg lsaatahads spshtaassv tadaaaaras salldpvtvt 







61
atrtataasr taasvsvida ddldeqqatn ikdalryepg itvrrtayrp gnaalgggrd 







121
gdssinirgl egnrvmlmed girlpnafsf gpleagrgdy tdldtlkrie ivrgpasaly 







181
gsdgltgavn fitkdpqdll siyhkptyfs frpsydstdr sigatvsaag gndriqgmii 







241
adgrrgheid trgsnnsagk lrttsnpqhv ysesllgklv ltptvrdtir ltaetvqrrv 







301
gtdvlsaiva patlglttsd rlernrfsad yafhddasrw lgnahvgfyy qeatqdqyaf 







361
etrgtlpsrs rdnryrertf ggsafaesgf stgplahkll ggvdgslsri tnlrdgtvpg 







421
vgetfpnkaf pdtdytlfga fvgdqigygr llvtpglrfd tyrlkpsgnd plftgravst 







481
hanelsprva vlyeiapali pylqyahgfr aptpdqvnss fsnpvqgyts ignpnlkpet 







541
sdtfeaglrg ragtgygtir ysaaaftgry rnfiarttia gsgrptdpfv fqyvnfadar 







601
vhgiegradw ampngitlrt alaltrgstq ndgaasqpln tvnpfsavfg vryepterwf 







661
vqtdllfqaa krdkdidrsd cskracfapp ssfvvdlrgg yrfnkhvsat vgirnlfdrk 







721
ywnwsdvrgi aadspvlday tspgrtvavs mkvdf 





807
81
161521211
YP_001584638
216
1
ctacagcttc gcgctgacgc gcacccaggc ggtgcggccc ggctcgttga ccggcgcgtt 







61
cgccggatag ccgaagcccg cattgccggc gaggttcaga tgctcggcat aggccttgtt 







121
cagcaggttg tcgacgccga ccgagatctg cacggtcttg ctcacgttgt actgcgcatg 







181
cagcgacagc acgccgaagc ccgcgctcgg gccgaagtcc ttgccgacca cgttgccctc 







241
gttcaccgca tagcgatgct gcggcgcaac gacccgccac aaaccgccgg ccgaccacgc 







301
gccgcgcgtg tactcgagcc cgacgcgcgc ctcgagcggc ggcatctgcg gcaacggatc 







361
gccgctcgtc gcattgcgcc cccacgcgta cgcaaccgag gtctcgatac gcagcggtgc 







421
gaccggccgc cagccgacgc cggcttcgcc gcccatgatc tgcgcgttca cgttgctcgc 







481
ccgcgtggtc gggcccatca tgccgctcgc gtagtcgaac aggatgaagt cctgcacata 







541
gcctgcatac gcggagaccc acgcgtcgag ccgatcgctc ttgtactgcg cgccgatgtc 







601
gagctgcgtc gtcttctcgg gctgcacggc cgagaaagcg ttgaccgaac cggccggccc 







661
gcgtttcgcg gaaaacagct cccagtagtc cggatagcgc tccgcatggc cgatgcccgc 







721
ataccaggtc acgggcagcg acgccaggtc gcgctcgtag cgcaggaagc cgctcggcag 







781
cacgcgcgcg cgatcgtcgt cgaacgtcgg gttcggccgg ctcgtcatca tgccgcccgt 







841
cgtcgcgcgc ttgtcacgcg cgctcgcata gtcgacgcgc gcgccgccga tcacgcgcgc 







901
cgcctccgcc gcgtaccacg tgagctcgcc gaacgcgccg gcgttccaca tcgtcgcctg 







961
cgcgtcccac ggctggtcgc catagttctg ccgccccatc gcggcgcgcg aatcgagccg 







1021
gttcgattgc gcgtcgacgc ccgtcacgag cttgaacgcg tcgccgaatc gccacgtcgc 







1081
ggccgcacgc gcgccgagcg ttcgacgccg cacctcggac gccatccgca tcggcatgct 







1141
gctcgccgga tcgggctgcc gcagcgtgta gttgtccatc acgtgatcgg cttcgttgta 







1201
gtagacgcgc gcttcgatcc ggtcgagcac gtcgccgaga tgccgcttgt cgaacgacag 







1261
gccgaacgtg tcgcggcgga agtgcgcgcc gtccatcccg cgccccgcat agcgcgcgta 







1321
gccatcgccg gtgccggccg tcagctcgac gcgcgtgtgc acgtcgggcg tccagccgag 







1381
cgcggcatcg gcgttccatt tgtcccattg cgacggcacc gtgcggccgt tgccgtcgcg 







1441
gtagtcctgc gcatgcgcgt gattcgcgat gacgcggccg tagaaatcgg gcgtgccggc 







1501
cgtcacgtcg acgttctggt cgttgcggcc gaacgacccg ccgacgacgc tgccgtcgaa 







1561
gcgcatgccg ggccgctcga aacgcggcgt cgtgcgctcg aacagcacgg tgccggccga 







1621
tgcgccgggg ccgtacagca ccgtctgcgg ccctttcacg accgtcacct tgtcgtagct 







1681
ttccggtgcg atataggaag tcggcgcgtc catccggttc ggacacgcgc cgagcgtcgg 







1741
catcccgttc gcgagcacgt tgagccgcga gccgaacatg ccgcgcagta cggggtcgcc 







1801
gttcgttccg ccgctgcgga tcgacgtgaa gccggggatc gtcttcagat agtcggcgcc 







1861
gtcgctcgcg ggcagcggct ggcgcggcgc cttcggatcg gtgacgacga cgagcggcgt 







1921
cgacaacggc gacgcgacga cttcgacggg cagcaacagc atgcccggcg tgtcgggcgg 







1981
tacatcggcg cgatgtgcgg gggcgtcggc gtcggccgcg gcggccgtcg cgctcagcac 







2041
gccggcggtc agcgcgggaa cggcaagttt cagcatgcgc ggcgcgggcc gcgcgcaggc 







2101
actgggcgac gccggcggcg tgcgcaacga aaaaatggtc at 









193
1
mtifslrtpp aspsacarpa prmlklavpa ltagvlsata aaadadapah radvppdtpg 







61
mlllpvevva splstplvvv tdpkaprqpl pasdgadylk tipgftsirs ggtngdpvlr 







121
gmfgsrlnvl angmptlgac pnrmdaptsy iapesydkvt vvkgpqtvly gpgasagtvl 







181
ferttprfer pgmrfdgsvv ggsfgrndqn vdvtagtpdf ygrvianhah aqdyrdgngr 







241
tvpsqwdkwn adaalgwtpd vhtrveltag tgdgyaryag rgmdgahfrr dtfglsfdkr 







301
hlgdvldrie arvyyneadh vmdnytlrqp dpassmpmrm asevrrrtlg araaatwrfg 







361
dafklvtgvd aqsnrldsra amgrqnygdq pwdaqatmwn agafgeltwy aaeaarvigg 







421
arvdyasard krattggmmt srpnptfddd rarvlpsgfl ryerdlaslp vtwyagigha 







481
erypdywelf sakrgpagsv nafsavqpek ttqldigaqy ksdrldawvs ayagyvqdfi 







541
lfdyasgmmg pttrasnvna qimggeagvg wrpvaplrie tsvayawgrn atsgdplpqm 







601
pplearvgle ytrgawsagg lwrvvapqhr yavnegnvvg kdfgpsagfg vlslhaqynv 







661
sktvqisvgv dnllnkayae hlnlagnagf gypanapvne pgrtawvrvs akl 





808
81
161525105
YP_001580117
217
1
tcagaactgc agcgtggtca gcagcgacac ctgccgcgcg tcgccgatcg cgacgaagta 







61
gcggttcgcg ctcgacgggt agtacgtgcg gttgaacagg ttcttcacgt tcagctggaa 







121
cgacagcttc tgcttgccga gccgcgtgtc gtaggttgcg aacgcgtcgg cgagcacgta 







181
cgacggcagc gtgaagctgt tcgccgaatc gccgggccgt gcgccgacat agcgtgcagc 







241
cgcaccgatg cgcagatcgt cgccgcccgc gagcgtgccc acgtcgtaca cggccgcgag 







301
cgacgcggtg tgacgcgcga cgttccacag ccggttgccg gcgtacagcg gatcctcgat 







361
cgtcttcgcg tcgatgtacg cgtagctcgc gatcacgttc acgcgttcgc cgatgcggcc 







421
cgacacgtcg agctcgacgc cgcgcgagcg cgccttgccc gacgtgcgcc agtcggtcag 







481
cttcgtcgcg tcgttgaact gcgagacgag cacgttcttc ttgtcgatgt cgaacagcgc 







541
gagcgtgccc gtgatgccgc ccggcaggtc gagcttgccg ccgacctccc acgcggtcgc 







601
ctgctccggc ggcgtcgcgg cgtcgatcac gtagccggac gccatcggcg cgatcgacga 







661
cgacggcttc agcgactgcg aatagctgcc gtacagcgag aagctatcgg tccacttgta 







721
gacgatgccc gcgcgcggca gccagcgcga gccgctcaga tcggtgttcg catgaaacgg 







781
ccggccgcgc cccgcgacct ggttgtacgt gatatagcgc acgccgcccg agacgatcca 







841
cttgtcggtc agatgaatcg tgtcctgcag aaacgcggac gcgtcgtgca gcgcgtcggt 







901
ctggtcgctg tcgctcgccg acaccgtgct cgacggcggc agcagcccgt acacgggatc 







961
gacgtagctg aacggcgtct tcaccgcctg ccgcagcatg tccttgcgat agatgcggcg 







1021
gtactcggtg tcgaagccga cctgcacgtc gtgccgcatc cccgcgagcg tcagcttgcc 







1081
gttcacgtag ccgatcccgt agctgtcggt gctcagcgag ccgtgcgtcg cgtcgttgct 







1141
gcgcgacagc gtacccgtca ccggatcgac gcccgtcgtg cgcagctggt tcgcatcgta 







1201
ggtttcgcgg ttgtagctgt agccgaaatg cgcgctccag tccgcgttga actggtgatc 







1261
gacgctcagc tgggcgagat gcgattcgcc gtccatgttg ttgaacggct cgtcgatgcg 







1321
ccgccgcgcg ggaatgtcga gcggcgcgtt ggtgcgcgga tcgagcgccg tgccgcgatc 







1381
gaacggcgag tggaacttgc ggtactcgta cgacaccacg acctgcgtgt cgcggccgta 







1441
ccacgcgagc gacggcgcga cgaaggtctg ccggtattcg ccgaaattgc gccagtactg 







1501
ctcgttcgac tggtcgacga tcagccggta cgcgagccgc gaatcgccga tcggcccggt 







1561
cgaatcgaac gtcgcgctgc cgccgttctt gccgtgcccg aacgtcgatg cgccgagcga 







1621
aaccgcgttg tagcgcgtga gctgcggctg cttgctgacg acgttgacga cgccgcccgg 







1681
gtccatcagc ccgtacagca gcgaggtcgg ccccttcagc acctcgacgc tgtcgacggc 







1741
ggcgttgaac gagcgcccct gcacgatcgg catgccgttg tgcatgaccg agccgtcgcg 







1801
gttgtcgccg aagccgcgct tcatgatcgt gtcctgcgtg cccgcgagcg tgttgccctg 







1861
cgtgatgccg ctgacgttgc cgagcgcgtc gtcgaggttg cgcggacgct ggtcgcgcag 







1921
cacctgcgcc ggcacgacgt tgacggcctg cgcggtatcg agcaccggga tatcggcgcg 







1981
cagcacgccc gcctccttcg gcggccggta gctttgcgcg cgcagcgcgc tgctgcgcac 







2041
cgcgatcgtc ggcagttcgg ccgccgtatc ggccgacggc gccggcacgg cggcaccgct 







2101
catgccgttc gcgccgttcg cgccgttcgc gcgcatcaac gtatagccgc cgcccgactg 







2161
acgcagcgcg acgaggccag tgcccgcgag cacgcgcgcg agcgcatcgt cgacgtcgaa 







2221
gcggccgtgc acgccgccgc tcgacaggcc ggcggtcagc tcaggcggaa acgcgatcag 







2281
caggccggcg tcgcggccga accggttcag cgtcgcttcg agcggcccgg ccggcacgtc 







2341
gaatgcgcgg cgcgacgtgc gcggcgcggc ggcggccgcg tcggtgtcgg catcggcgag 







2401
cgcgggcaac ggcagcagcg cggacagcag gacggcgccg gcaagccggt gcgcgacgcg 







2461
gttcggcgcg gcgggcacga tcgtgcgccg cgtgtggcgg gcggacgccg gacggcgccg 







2521
cggcatgaga cggatggaaa ccatgatcga gaagtcggta aacgatgggc cggtgcggct 







2581
ttcatttccc at 









194
1
mgnesrtgps ftdfsimvsi rlmprrrpas arhtrrtivp aapnrvahrl agavllsall 







61
plpaladadt daaaaaprts rrafdvpagp leatlnrfgr daglliafpp eltaglssgg 







121
vhgrfdvdda larvlagtgl valrqsgggy tlmrangang angmsgaavp apsadtaael 







181
ptiavrssal raqsyrppke agvlradipv ldtaqavnvv paqvlrdqrp rnlddalgnv 







241
sgitqgntla gtqdtimkrg fgdnrdgsvm hngmpivqgr sfnaavdsve vlkgptslly 







301
glmdpggvvn vvskqpqltr ynayslgast fghgknggsa tfdstgpigd srlayrlivd 







361
qsneqywrnf geyrqtfvap slawygrdtq vvvsyeyrkf hspfdrgtal dprtnapldi 







421
parrridepf nnmdgeshla qlsvdhqfna dwsahfgysy nretydanql rttgvdpvtg 







481
tlsrsndath gslstdsygi gyvngkltla gmrhdvqvgf dteyrriyrk dmlrqavktp 







541
fsyvdpvygl lppsstvsas dsdqtdalhd asaflqdtih ltdkwivsgg vryitynqva 







601
grgrpfhant dlsgsrwlpr agivykwtds fslygsysqs lkpsssiapm asgyvidaat 







661
ppeqatawev ggkldlpggi tgtlalfdid kknvlvsqfn datkltdwrt sgkarsrgve 







721
ldvsgriger vnviasyayi daktiedply agnrlwnvar htaslaavyd vgtlaggddl 







781
rigaaaryvg arpgdsansf tlpsyvlada fatydtrlgk qklsfqlnvk nlfnrtyyps 







841
sanryfvaig darqvslltt lqf 





809
27
161524037
YP_001579049
218
1
ttactggtgc tgcggcgcgc gcagatagat ctcgacgcgg cggttctgtg cgcggccggc 







61
ttcggtcgag ttgtccgcga ccgggttcga cgggcccatg ccctgcgccg acaggcggtt 







121
gcccgcgacg ccgcgctgca ccagcgcgtt gacgacgctt tgcgcgcggt tctgcgacag 







181
cgtctggttc agctgcgcgg agcccgtgct gtcggtgtag ccgatcaccg acgccgtcac 







241
ttgcgggttc tggttcagcg tcgttgccag gtcgttcagc agcggcgtga aggccggcgt 







301
gatcgcgtac tggttcgtcg cgaacgtgac cgagctcggc acgttcagct tcagcgagcc 







361
gtccggctgc tcggtgacct gcgtgcccgt ctgcgcagcc gacggcgcga gcttgttctt 







421
gatcgcttgc cagttgtagc ccgtcacgcc gccgaccagc gcgccgacgc ccgcgccgat 







481
cgccgcgccc ttgccgccgc ctgccagtgc gccgatgcct gcgcccagcg ccgcacccgt 







541
gccggtgccg acggccgtgt tgttgccttg ctgcgtcgcg cagcctgcca gcagggcgcc 







601
ggccagtgcg aagacggaca agcgggtcgc gattttcatg ttcat 









195
1
mnmkiatrls vfalagalla gcatqqgnnt avgtgtgaal gagigalagg gkgaaigagv 







61
galvggvtgy nwqaiknkla psaaqtgtqv teqpdgslkl nvpssvtfat nqyaitpaft 







121
pllndlattl nqnpqvtasv igytdstgsa qlnqtlsqnr aqsvvnalvq rgvagnrlsa 







181
qgmgpsnpva dnsteagraq nrrveiylra pqhq 





810
19
161525757
YP_001580769
219
1
atgaccacaa ggagagtaac catgatgtcg aaaaaagttc gtctggccct ggccgtgatg 







61
atgatcggcg cgctcgcagc gtgtaagtcg ggtgtgaagc tcgacgacaa ggcaaacgcc 







121
ggtgcgatga gcacgcaacc gagcgccgac aacgtcgcgc aagtgaacgt cgatccgctg 







181
aacgatccga acagcccgct cgcgaagcgc agcatctact tcgacttcga cagctattcg 







241
gtgaaggacg agtatcagcc gctgctgcaa cagcacgctc agtatctgaa gagccacccg 







301
cagcgccacg tgctgatcca gggcaatacc gacgaacgcg gcacgagcga gtacaacctc 







361
gcgctgggcc agaagcgtgc ggaagccgtg cgccgcgcgc tcgcgctgct cggcgttgcc 







421
gattcgcaga tggaagccgt gagcctcggc aaggaaaagc cgcaagcaac gggtcacgac 







481
gaagcgtcgt gggcgcagaa ccgccgcgcc gatctcgtct accaacagta a 









196
1
mttrrvtmms kkvrlalavm migalaacks gvklddkana gamstqpsad nvaqvnvdpl 







61
ndpnsplakr siyfdfdsys vkdeyqpilq qbagylkshp qrhvliqgnt dergtseynl 







121
algqkraeav rralallgva dsqmeayslg kekpqatgbd easwaqnrra dlvyqq 





811
24
161525435
YP_001580447
220
1
atgaataaac tttcaaagct cgcgttcatt gcagctaccg cagttatggc tgcatccgct 







61
tcggcacagt cggtgccggc gtcgcgtcaa gccgtcaatg acaactgggt gaacggcacg 







121
ggcgaatggg tgtggatgaa cggcacgaac gagctctgct ggcgtgacgc gttctggacg 







181
ccggccaccg ccaacgcgaa gtgcgacggc gcactggtcg cccaggcacc gcagccgccg 







241
gtcgctccgg tcgctccggc catcacgagc cagaagatca cgtatcaagc cgacgcactg 







301
ttcgacttcg acaaggcgac gctcaagccg ctgggcaagc agaagctcga cgaactggct 







361
tcgaagatcc agggcatgaa cacggaagtg gtcgtcgcaa cgggctacac cgaccgcatc 







421
ggttcggaca agtacaacga ccgtctgtcg ctgcgccgtg cgcaagccgt caagtcgtac 







481
ctggtcagca agggtgtccc ggccaacaag atctacacgg aaggcaaggg caagcgcaac 







541
ccggtcacga cgggctgcaa ccagaagaac cgcaagcaac tgatcgcctg cctcgcaccg 







601
gatcgccgcg tggaagtgga agtggtcggt acgcaagaag tgcagaagac gaccgttccg 







661
gcgcagtaa 









197
1
mnklsklafi aatavmaasa saqsvpasrq avndnwvngt gewvwmngtn elcwrdafwt 







61
patanakcdg alvaqapqpp vapvapaits qkityqadal fdfdkatlkp lgkqkldela 







121
skiqgmntev vvatgytdri gsdkyndrls lrraqavksy lvskgvpank iytegkgkrn 







181
pvttgcnqkn rkqliaclap drrvevevvg tqevqkttvp aq 





812
40
161525565
YP_001580577
221
1
atgaaaaaga ccctgatctt cgccgccctg tccggcgcgt gcgcggccgc gcacgcgcaa 







61
agcagcgtca cgctctacgg gctgatcgac gccggcatca cgtatacgaa caaccagggc 







121
ggccacagcg cgtggcagga gacgagcggc tcgatcaacg gcagccgctg ggggctgcgc 







181
ggtaccgagg acctcggcgg cggactgaag gcgatcttca cgctcgagaa cggcttcggg 







241
atcaacaacg gcacgctgaa gcagaacggc cgcgagttcg gccgccaggc attcgtcggc 







301
ctcgcgcaca gcgggttcgg ctcgctgacg ctcggtcgcc agtacgacag cgtcgtcgac 







361
ttcctcggcc cgctgtcgct gacggggacg caatacggcg gcacgcagtt cgcgcatccg 







421
ttcgacaacg acaacctgaa caactcgttc cggatcagca atgcggtcaa gtatcaaagc 







481
gcagacttcg gtggactgaa gttcggcgca ctctacggct tctcgaattc gaccgatttc 







541
tcgaacaacc gcgcgtacag cgtcggcgcg tcgtacagct tcatgggctt caacgtcgcg 







601
gccgcctaca tgcaactgaa caacaacatc aattcgctcg cgctcgcggc cagcgatccg 







661
ggcgcagtgg ccggcgactg gacgttcgcg gcaagccggc agcgcacttg gggcgccggc 







721
ctcaactacg cgttcggccc ggcaaccgcg ggcttcgtgt tcacgcagac gcgcctcacc 







781
gattcggccg gcatcagcgc cggccaatcg ggcgtgtcgg gcggcatccc gctgacgggc 







841
gacacgcgct tcaacaacta cgaattgaac gcgcgctatg cactgacgcc ggcgttctcg 







901
ctcgcgggct cgtacacgta caccgacggc cgcatggaag gacagaagcc gagttggcac 







961
cagttcaacc tgcaggccga ctacgcgctg tcgaagcgca ccgatctgta tctgcagggc 







1021
gaataccagc gcgtgaacgg ggacggtctc gcggtcggcg cgaacatcaa cggcctcggc 







1081
gtcgcgtcgt cgaccaacaa gcagatcgcg gtgacggccg gcatgcgtca ccgcttctga 









198
1
mkktlifaal sgacaaahaq ssvtlyglid agitytnnqg ghsawqetsg singsrwglr 







61
gtedlggglk aiftlengfg inngtlkqng refgrqafvg lahsgfgslt lgrqydsvvd 







121
flgplsltgt qyggtqfahp fdndnlnnsf risnavkyqs adfgglkfga lygfsnstdf 







181
snnraysvga sysfmgfnva aaymqlnnni nslalaasdp gavagdwtfa asrqrtwgag 







241
lnyafgpata gfvftqtrlt dsagisagqs gvsggipltg dtrfnnyeln aryaltpafs 







301
lagsytytdg rmegqkpswh qfnlqadyal skrtdlylqg eyqryngdgl avganinglg 







361
vasstnkqia vtagmrhrf 





813
44
161520585
YP_001584012
222
1
atgaggaagc acgtcatttt tgccgccgcg ctcgccgcgt ttgccgcgcc ggccttcgcg 







61
caggacagcg tgacgctgta cggtgtgatc gacgagggct tcaactacac gaacaacgtg 







121
agcgtcaacg gcgcgggcaa ggccgattac cagctcgcga gcggctatgc gcagggcagc 







181
cgctgggggc tcaaggggag cgaggatctc ggcggcgggc tgaaggcgat cttcacgctc 







241
gaaagcggtt tcgacgtgaa caacggccgg ctcgggcagg gcagccggat gttcggccgg 







301
caggcgttcg tcggcctcgg cacgtcgcgt ttcggcacgc tgacgttcgg ccgccagtac 







361
gattcggtcg tcgactatct cgcgccgctg accgcgaacg gcaactgggg cggcacgctg 







421
ttctcgcatc cgttcgacaa cgacaacacc gacaactcgt ttcgcgtgaa caacacggtg 







481
aagtacgcga gcccggactg gaacgggctg tcggtcggcg gcacgtacag cttcagcaac 







541
agcaccggct tctcgaacaa ccgtcagtac agcctcggcg cgcagtattc gctcgccggc 







601
ctgcaggtcg cggccgcgta cctgcaggcc aaccatccgg gcgccggcgg cgcgggtgcg 







661
atcgccgcgg acgacgcgaa cttcgtcgcc gagcggctgc gggttttcgg cggcgggctg 







721
aactacacgt tcgggcccgc gacggtcggc ttcgtctaca cgaagaccga cttgaagaac 







781
ccggtgtcga ccgtctatct gccggcgtcg acgttcgcgg gcctcgggct cggcgcgacg 







841
aaattcgaca acttcgaggt caacggcaag tatcagctga cgtccgcgtt ctacgtcggc 







901
gcgcagtacg tgtatacgga cggcaagttc gatgcggcga ccggatcgat caagccgaag 







961
taccacacgg tcgggctgat ggccgactac agcctgtcga agcgcaccga cgtctacctg 







1021
cagggcgcgt ggcaaaaggt ggccggcgat cgcacgggta ccgcggccga cggcggctac 







1081
gtggtcggga ccgacggccc gtcgtcgtcg tcgaaccagt tcgcggtgcg cgccgcgatt 







1141
cgtcacaagt tctga 









199
1
mrkhvifaaa laafaapafa qdsvtlygvi degfnytnnv svngagkady qlasgyaqgs 







61
rwglkgsedl ggglkaiftl esgfdvnngr lgqgsrmfgr qafvglgtsr fgtltfgrqy 







121
dsvvdylapl tangnwggtl fshpfdndnt dnsfrvnntv kyaspdwngl svggtysfsn 







181
stgfsnnrqy slgaqyslag lqvaaaylqa nhpgaggaga iaaddanfva erlrvfgggl 







241
nytfgpatvg fvytktdlkn pvstvylpas tfaglglgat kfdnfevngk yqltsafyvg 







301
aqyvytdgkf daatgsikpk yhtvglmady slskrtdvyl qgawqkvagd rtgtaadggy 







361
vvgtdgpsss snqfavraai rhkf 





814
39
161520486
YP_001583913
223
1
atgaacaaga ctctgatcgt tgcagcagct gcagcatcgt tcgcaaccgt cgctcacgcg 







61
caaagcagcg tcacgctgta cggcgtgctg gacgcaggca tcacgtacca aagcaacgtt 







121
cagccggccc cgggccaggc tggcaagtcg ctgtggtcga tgggctcggg cattgaccag 







181
agccgcttcg gcctgcgtgg ttcggaagac ctcggtggtg gcctgaaggc gatcttcacg 







241
ttggagagcg gcttcgacat cggcaacggc cgtttccgta acaacggcgg catgttcaac 







301
cgtcaggctt tcgtcggtct gtcgagccag tacggcaccg tcacgctggg caagcagtat 







361
gactccgttc aagactacct ggctccgctg accgcaacgg gcagctgggg tggtacgtac 







421
ttcgcgcacg tcggcaactt cgacaacctg agcacgaacg gtggctactc gccgaacaac 







481
agcatcaagt tcacgagcgc taactacgct ggcctgcaat tcggcggcac gtactcgttc 







541
tcgaacaaca cgaacttcgg caacaaccgt gcttacagcg gcggcgtgtc gtaccagttc 







601
caaggcctga agatcgcagg tgcgtactcg caactgaaca acccgggcca aacgaccggc 







661
ggcgcagttg acacgctgca aacgcaaggc cgcgtccgta cgtacggcgc agctgctggc 







721
tacgcattcg gcccggcaca agtcggcgca gcatggacgc aagctcgtct ggacaacacg 







781
ggcgcgaccg gtacgtcggt tcgcatcgac aactacgaag tcaacggcaa gtacaacctg 







841
acgccggctc tgggtctggg cgttgcttac acgtactcga acgcgcgtct gggtcaggac 







901
agcgctcact ggcaccaagt tggcctgcag gctgactacg cactgtcgaa gcgtaccgac 







961
gtgtacgcac aggctgtgta ccagcgtgct agcggcgcga acgcgtcgat ctacaacggc 







1021
aacatcgaca cgctgccgag ctcgtcgatc aaccaaaccg cagcaacggt tggtctgcgt 







1081
caccgcttct aa 









200
1
mnktlivaaa aasfatvaha qssvtlygvl dagityqsnv qpapgqagks lwsmgsgidq 







61
srfglrgsed lggglkaift lesgfdigng rfrnnggmfn rqafvglssq ygtvtlgkqy 







121
dsvqdylapl tatgswggty fahvgnfdnl stnggyspnn sikftsanya glqfggtysf 







181
snntnfgnnr aysggvsyqf qglkiagays qlnnpgqttg gavdtlqtqg rvrtygaaag 







241
yafgpaqvga awtqarldnt gatgtsvrid nyevngkynl tpalglgvay tysnarlgqd 







301
sahwhqvglq adyalskrtd vyaqavyqra sganasiyng nidtlpsssi nqtaatvglr 







361
hrf 





815
55
161521143
YP_001584570
224
1
atgaagaaac ttgctctctc ggccctctcg ctcgccctgc tgggcgccgc cggcgctgca 







61
caggctcaat ccagcgtgac gctgtacggc gtgatcgata cgtcgattgc ctatgttcat 







121
ggcaatgacg gccaagccaa caacatgtgg caaatgctgt cgggcaacct gcaaggcagc 







181
cgctggggcc tgaagggtgc tgaagacctg ggcggcggcc tgaaggcaat cttccagatc 







241
gaaaacggct tcaacccggg cacgggcaag ctgagcgcag caaacacgat cttcaaccgc 







301
caggcattcg tcggtctgca aagcaaccag tacggtacgc tgacgctcgg ccgccagtac 







361
gacccggtcg tcgaccttgt ccaggcagtc acggctgaca actacttcgg cagcttcttc 







421
gcgaccccgg gtgacgtcga caacaacgac aacagcctgc gcgtcagcaa cgcgatcaag 







481
tacacgtcgc ccgtctacgc cggcttccag ttcgaaggca tgtacggcct gagcggcatc 







541
gcaggcaagc cgggccaggg tcagacgtgg tcggctgctg cggcctacaa caacggcccg 







601
atcggcatcg cagccggcta cttctacgcg aacaacccgt cgccgacgac caccgcggtc 







661
cgtggcggct ggggctcgac gacgtcggac aacatcgtcg acggcccgat caacgccggc 







721
tatgtgacgg cgaagtcgat cggcatcgcg caagtcgcag ctcagtacgc gatcggcccg 







781
gtgacgttcg gcctcggcta cagcaacgcg caatacaagc cggacgctta ctcgacgttc 







841
tcgtcgaccg agaagtacaa cacgggccgc ggcttcgtga cgtaccaggt cacggcgccg 







901
ctgctgctgg gcctcggcta ctcgtacacg aaggcaagcg gcaacacgga cgcgaagtac 







961
caccaggttt cgctgggcgc agactactcg ctgtcgaagc gcacggacgt ctacctggtc 







1021
ggcgcgtacc aacacgccag cggcacgcag ctcaacacgg acggcacgac gtcggcagcg 







1081
caagcgtcga tcggctcgta tggctatgcc ggcacgaagt cgcaggagat ggttgccctc 







1141
ggcctgcgcc acaagttcta a 









201
1
mkklalsals lallgaagaa qaqssvtlyg vidtsiayvh gndgqannmw qmlsgnlqgs 







61
rwglkgaedl ggglkaifqi engfnpgtgk lsaantifnr qafvglqsnq ygtltlgrqy 







121
dpvvdlvqav tadnyfgsff atpgdvdnnd nslrvsnaik ytspvyagfq fegmyglsgi 







181
agkpgqgqtw saaaaynngp igiaagyfya nnpsptttav rggwgsttsd nivdgpinag 







241
yvtaksigia qvaaqyaigp vtfglgysna qykpdaystf sstekyntgr gfvtyqvtap 







301
lllglgysyt kasgntdaky hqvslgadys lskrtdvylv gayqhasgtq lntdgttsaa 







361
qasigsygya gtksqemval glrhkf 





816
58
161524787
YP_001579799
225
1
tcacgcgtcg tcggcggccg tgccgggcgt ccagccaccg ccgagcgcgc gatacagcgc 







61
gatcgcgttc gcgagccgca gctgcctgag ccggatcagc tcctgtcccg attcgtacgt 







121
gctgcgctgc gcgtcgagca gttcgagata cgtcgcgacg ccgcccgcgt aacgacgttc 







181
cgcgagcttc agccgcgcgt cgtcggccgc gtgcacgtcg cgctgcgcgg ccagctggcg 







241
ctcgatccag tcgcgtgccg taaaggcgtc cgcgacctcg cggaacgcga cctgcacggt 







301
cttctcatac tcggcgaccg caatgtgccg ccgcgcgttc gcgacgtcga gattcgcgcg 







361
attgcgcccg cccgcaaaaa tcggcagcgt gatgcgcggc gcgaaggtcc agacgctcgt 







421
gccggccgcg aacaggctcg aaaacgcgtc gctgaccgag ccgtaatcgg tcgtcagcgc 







481
gatgcgcggg aaaaacgcgg cgcgcgcggc gccgatctgc gcgtgcgcgg ccttgagccg 







541
cgactcggcc tgccggatgt ccggacgtcg ctcgagcaga tcgctcggcg caccgggcgc 







601
gaccggcgcg atcgtcagcg cgtcgagcgc cgtatcgtcg tcgcgcacgt cgcgcgcgaa 







661
atcgccggca agcagttgca gcgcgcgcac ggcctgcgcg tgctcgcgct gcagcgccgc 







721
gtgcgatgca cgcgcggccg cgaccagcat ctcggccgaa cgcagttcga tcgcatcgct 







781
cgtgccggcc gcatagcggc gttgcgtcag cgcggcgatc cgttcgcgcg catcgagcgt 







841
gcgctcggcg agcgcgcgct gttcgttcag cgcgcgttcc gcgacgtagg cattcgcgac 







901
ttcggcgatc acgccgatac ggaccgtgcg ctgcgcgtcg gccgtcgcga aatactcggc 







961
cagcgctgca tcggacagat tgcgcacgcg cccgaacaga tcgagttcgt atgcgctgat 







1021
gccgacgccc gcccgataca gcccgctcac cgcgctctcg cgcaggaccg gatcgtattg 







1081
ccgcgtgcgt tcgtaaccgg tgctcgcgtc gatcgacggc atccgctccg cacgctgcac 







1141
gccgtacatc gcgcgcgcct cgtcgagccg gccggccgcg atccgcagat cgcgattgtt 







1201
ggcgagcgcc gcgtcgatcc atgcatgcag cgcgggcgcg gtgaaatagc tgcgccagtc 







1261
gtcgagcagc gccgcatcgg cgcgcggcgc gaaagccgac ggcgcgtcgt gcgtggcgtc 







1321
gacgccgttc gtatacgcgg acggcaccgg cgcagccgga cgctcgtagc gcggcgcgag 







1381
cgaacagccg gccagcacga gcgcggccgc cagcgcgaac ggcgcgcgcc gcgcggtgcg 







1441
gcgcgaatcg gaacgatgcc tcat 









202
1
mrhrsdsrrt arrapfalaa alvlagcsla pryerpaapv psaytngvda thdapsafap 







61
radaallddw rsyftapalh awidaalann rdlriaagrl dearamygvq raermpsida 







121
stgyertrqy dpvlresavs glyragvgis ayeldlfgrv rnlsdaalae yfatadaqrt 







181
vrigviaeva nayvaeraln eqralaertl dareriaalt qrryaagtsd aielrsaeml 







241
vaaarashaa lqrehaqavr alqllagdfa rdvrdddtal daltiapvap gapsdllerr 







301
pdirqaesrl kaahaqigaa raaffprial ttdygsysda fsslfaagts vwtfapritl 







361
pifaggrnra nldvanarrh iavaeyektv qvafrevada ftardwierq laaqrdvhaa 







421
ddarlklaer ryaggvatyl elldaqrsty esgqelirlr qlrlanaial yralgggwtp 







481
gtaadda 





817
55
161523870
YP_001578882
226
1
atgatgcaaa aacacgcttt gactgcaatc gcggtcgcgc tccttgccgc aggctgcacg 







61
ttggcgccgc actacacgcg gcccgacgcg cccgtcgcgc aggcgtttcc gtccggcggc 







121
gtctatgcga cgcagccggg cgctgccggc gcgcgcagcg cgaacggcca ggcggcgacc 







181
gccatcggct ggcgcgaatt cttcgtcgat ccgcgcctgc agcggctgat cgagatcgcg 







241
ctgaagaaca accgcgacct gcgcgtgtcg gtgctgaaca tcgaggcggc gcgcgcgcag 







301
tatcagatca cgcgcgcggg cctgttcccg acgctcgacg gcacgggcac cggcaatatc 







361
cagcgcgtgc cgcagggcgt gtcgcagacg ggtgcgccgt atatctcgcg cgtctacaac 







421
gtcggcttgt cggcgtcgtg ggagctcgat ctgttcggcc gcgtgcagag cctgaaggac 







481
caggcgctcg cgcagtatct gtcgaccgcg tatgcgcggc aggctgccga gatctcgctc 







541
gtgtcgcagg tcgcggatca gtatctgacg ctgctgtcga ccgacgatct gctgaaggtc 







601
acggaagaca cgctgaagac ggctcaggcc tcttacgacc tgacgaagct gcagttcgac 







661
aacggcacgg gctccgagct cgagctgcgt caggcgcaga cggtcgtcga gcaggcgctc 







721
gcgaaccagc aggcgcaggc gcgcgctcgt gcgcaggcgc tcaacgcgct ggtgctgctg 







781
atcggcgagc cgctgccgga cgatctgccg ccggggctgc cgctcgatgc gcaaaatctg 







841
ctcgccgacg tgccggccgg gctgccgtcg gatctgctga cgcgccgtcc cgacatcatg 







901
caggccgagc agacgctgct cgccgcgaac gcgaacatcg gcgcggcgcg cgcggcattc 







961
ttcccgcgca tctcgctgac gggcgcgttc ggcaccgcga gcccgacgct cggcgggctg 







1021
ttcaaggccg gcaccgcggc gtggtcgttc gcgccgcaga tcacgatgcc gatcttcgaa 







1081
ggcgggcaga acatcgcgaa cctgaacctc gcgaacgtgc agaagcgcat cgagatcgcg 







1141
aactacgaga aggcgatcca gagcgcgttc cgcgaagtgg ccgacggtct ggccgcgcgc 







1201
ggcacgtacg atcagcagat cgccgcgctc gagcgcaaca cgcatgcgca gcagcgccgc 







1261
ttcgatctgt cggacctgcg ctacaagaac ggtgtcgaca gctatctgtc cgtgctgacc 







1321
gcgcagaccg atctgtacac ggcgcagcag gcgctgatca acgcgcggct cgcgcgctgg 







1381
acgaacctcg tcgatctgta ccgtgcgctc ggcggcggct ggatcgagcg ggccggcgag 







1441
acgccgcggg cgcccgatca gacggtcgat tacgacaagg cctcggcgcc tgcgtcggcg 







1501
tcggctgcgg cgacgaacgg gtaa 









203
1
mmqkhaltai avallaagct laphytrpda pvaqafpsgg vyatqpgaag arsangqaat 







61
aigwreffvd prlqrlieia lknnrdlrvs vinieaaraq yqitraglfp tldgtgtgni 







121
qrvpqgvsqt gapyisrvyn vglsasweld lfgrvqslkd qalaqylsta yarqaaeisl 







181
vsqvadqylt llstddllkv tedtlktaqa sydltklqfd ngtgselelr qaqtvveqal 







241
anqqaqarar aqalnalvll igeplpddlp pglpldaqnl ladvpaglps dlltrrpdim







301
qaeqtllaan anigaaraaf fprisltgaf gtasptlggl fkagtaawsf apqitmpife 







361
ggqnianlnl anvqkrieia nyekaiqsaf revadglaar gtydqqiaal ernthaqqrr 







421
fdlsdlrykn gvdsylsvlt aqtdlytaqq alinarlarw tnlvdlyral gggwierage 







481
tprapdqtvd ydkasapasa saaatng 





818
19
161524257
YP_001579269
227
1
ttactccggc gaacccatca tcgactgctg atagttctgc aggccgacct tgccgatcag 







61
gtcgatctgc gtttccagcc agtcgatatg ctcttcggta tcgtcgagga tcttttcgaa 







121
gatctcgcgc gacacgtagt cgcgcaccga ttcgcaatag gcgatcgctt ccttgcaggt 







181
ggcctgcgag atctgctcga gcttcaggtc gcacttcagg atttcttcgg tctcctcgcc 







241
gacgagcagc ttgtgcagat cctgcaggtt cggcaggccg tcgagcatga acacgcgttc 







301
gatcagccag tccgcgtgtt tcatttcgcc gatcgactcg tcgtattcgt gcttgccgag 







361
cttgtcgagg ccccagtgct tgtacatgcg ggcatgcagg aagtactggt tgatcgccgt 







421
gagttcgttc ttcagctggg cgttcaggta ttcgatgact ttcttgtcgc cttgcat 









204
1
mqgdkkviey lnaqlknelt ainqyflhar mykhwgldkl gkheydesig emkhadwlie 







61
rvfmldglpn lqdlhkllvg eeteeilkcd lkleqisqat ckeaiayces vrdyvsreif 







121
ekilddteeh idwletqidl igkvglqnyq qsmmgspe 





819
8.5
161524255
YP_001579267
228
1
tcaggccgcc ttgcgttcgt agaacgtcac cggaatcgga tgagcgtgat gctcgacgcc 







61
gcaccggctc gcgcacacgc cgtgttcggc catcaggtcg cgtaccgact cctcgcattt 







121
gccgcagcag gtcgccacgc caagctcgaa ctgaagctcg tcgaacgagt ccacgccctc 







181
cgcgagggac gcgcgaatct tccgatcgga aacagacttg catacgcaga cgatcat 









205
1
mivcvcksvs drkiraslae gvdsfdelqf elgvatccgk ceesvrdlma ehgvcasrcg 







61
vehhahpipv tfyerkaa 





820
36
161520970
YP_001584397
229
1
tcacttgaac cggatggtcg gattgatgac cgcgtagagc acgtcgacgg tcaggttgat 







61
caggatgaat tcgagcgaga acagcagcac gatcgcctgg atcaccgggt aatcgcgcat 







121
cgtcaccgaa tcgacgagca gccggccgag ccccggccag ttgaacacgg cctcgacgac 







181
gatcgacccg ccgagcagga agccgaactg cagccccatc atcgtgacga ccggaatcat 







241
cgcgttgcgc aggcagtgtt tgagcacgac cttcggctcg tgcacaccct tcgcgcgcgc 







301
ggtgcgcacg aagtcctcgt tcatcacctc gacgaacgac gcgcgcgtga agcgcgccat 







361
cacggccgcg accgccgcgc cgagcgtgag cgacggcagc acgtagctct tccacgtgtc 







421
gtccggcacg acgggcagcc agccgagctt caccgagaag atctccatca agagcatgcc 







481
gagcgcgaac gccggaaacg agatgccgga caccgcgagc gtcatgccga gccggtccgg 







541
ccagcggttg cgccatacgg ccgacgcgat gccgatcgcc atcccgaaca gcgtcgccca 







601
cgccatgctg acgatcgtca gcatcagcgt cggcatgaag cgctcgccga tttccgtcga 







661
taccggccgc ttgctgcgcg tcgatacgcc gaaatcgccg tgtgcgatct tcacgaagaa 







721
attcgcgaac tgcgtgggca gcggccgatc gaggccgaga tcggcgcgca cgagcgcgac 







781
cgtcgcttcg tcggcctcgg ggccggccgc gagccgcgcc gggtcgcccg gcagcagatg 







841
cacgaacaga aacacgagca ccgcgacgat cgcgagcgtg ggcagcaggc cgaacagccg 







901
tttgatgatg aaagtcagca t 









206
1
mltfiikrlf gllptlaiva vlvflfvhll pgdparlaag peadeatval vradlgldrp 







61
lptqfanffv kiahgdfgvs trskrpvste igerfmptlm ltivsmawat lfgmaigias 







121
avwrnrwpdr lgmtlavsgi sfpafalgml lmeifsvklg wlpvvpddtw ksyvlpsltl







181
gaavaavmar ftrasfvevm nedfvrtara kgvhepkvvl khclrnamip vvtmmglqfg 







241
fllggsivve avfnwpglgr llydsvtmrd ypviqaivll fslefilinl tvdvlyavin 







301
ptirfk 





821
42
161520872
YP_001584299
230
1
tcagaagcgg tggatcatgc cgacgccgac ggcgatctgg ttctggccgg cacctgctgc 







61
cacgccgtcg ccgatttgcg tcgttgccga ctggatcgcg ccagccttgc tcagcgtatt 







121
gccgctcgcg tgctggtacg cctcgagtgc gtacaggccc gtgcgcttcg acaggctgta 







181
gtactgcgac agcgtgacct ggtggtactt cgccgagctc gagatgccgt tcgacttcgt 







241
tgccgccgtg tacgagtagc ccgccgcgaa gtcccactga gccgctgcct tccagtgcag 







301
cacggcaccg gccgtgttga agatcgcggt gttgcggaac gacgagccga cgcccgggat 







361
gtactggacg ttcgtgtagg acacggacac gtcccacgcc ggcgtgaact ggtagccggc 







421
cgtcacgccg aggcgctgct gcgattgcgc cgttgcgtag ccgttgttga tcgccgacac 







481
ggccggctgg ccgccgctcg tgaccgtcga gttgtcgccc cacgcgccgc cgccgagcgt 







541
cgagttgttg acgcgctgat agccgaccgc gatgcctgcc gggccgttca ggtactggat 







601
cgccgcgctc cacgtcgagc cgcggttcac gctgcccggc acgccgccga acgagtacga 







661
gccgccgacc gtgaagccgt agaacttcgg cgacatgtag acgagcgaat tgttcgcgcg 







721
gtagctcgta tcgagcgagt cgatatcacc cgggtgcgcg ccgtatgcgc ccgtcagcca 







781
cgtcgtcggg ctgtacggcg acagcagcgt gtagtacgcc gtgtactggc gaccagccgt 







841
cagcgtaccg tacgccgcgt tcgtcaggcc gacccacgcc tgacgcgtga agatgccgcc 







901
cgaccactgc gatgcgcccg ttgccgtgtt cacgccggct tcgagctgga agatcgcctt 







961
cgtgccgccg ccgaggtctt cgttgccctt caggccgaag cggctgcctg cccacacgcc 







1021
ggtggacatc gacaccttcg aatgaccgcc gctcgtcgcg ccggtcgacg gtgccgcgtt 







1081
attctggtac gccagaccgt tatcgacgat accgtacagg gtcacgctgc tctgagcgtg 







1141
ggcggcggta gcggccgcaa ggccggctgc cgtcatggcg acgacgacgc gctttttcat 









207
1
mkkrvvvamt aaglaaataa haqssvtlyg ivdnglayqn naapstgats gghskvsmst 







61
gvwagsrfgl kgnedlgggt kaifqleagv ntatgasqws ggiftrqawv gltnaaygtl 







121
tagrqytayy tllspysptt wltgaygahp gdidsldtsy rannslvyms pkfygftvgg 







181
sysfggvpgs vnrgstwsaa iqylngpagi avgyqrvnns tlgggawgdn stvtsggqpa 







241
vsainngyat aqsqqrlgvt agyqftpawd vsvsytnvqy ipgvgssfrn taifntagav 







301
lhwkaaaqwd faagysytaa tksngisssa kyhqvtlsqy yslskrtgly aleayqhasg 







361
ntlskagaiq sattqigdgv aagagqnqia vgvgmihrf 





822
56
161524923
YP_001579935
231
1
atgaaatcct ccccgttgtc cgtgcgtgcc gggtcgcgcc gcaccgccgt tgccgttgcg 







61
gtcgccgcac tggcactggc gggctgcgcg aactacatcg gcatcaagag cgacaagcag 







121
atcgcccccg cgtcgcagtt cgaaaccgcg cagagccttc cggcccaggg cggccactgg 







181
ccgtcgctcg actgggcgag ccagttcggc gatccgcagc tgccgaagct gatcgacgag 







241
gcgctcgccg gcaatccgtc gatcgcgcag gcgcaggcgc gcatcgcgaa ggcgtcgtcg 







301
tatatcgaat cgtcgcgctc gaacctgctg ccgaaggccg aagcgagcta ctcgtggacg 







361
cgcgagctct attcggcgaa cggcctcttc ccgccgccgt acggcggcca gtggtacagc 







421
gagaacaacg tgctcgccag cgcatcgtgg gagctcgacc tgtggggcaa gaaccgcgag 







481
cgcctgcgca cggccgtgtc gcaggaaaag gccgcggaag ccgacatgca gcaggcgcgc 







541
atcacgctcg cctcgtcggt cgcgcgcacc tacaactcgc tcgcgcagct gtacgcgctg 







601
cgcgacatcg cgcagcgcga gatcgccaac cgcgagtcgg tcggcaagat caccgacggc 







661
cgcgtgtcgg ccggcctcga cacgaacgtc gaacgccaga ccgcacgcgg caacatcgcg 







721
acgacgcagg cgtcgctgtc cgatctcgac ggtcagatca cgaccgtgcg ctaccagctc 







781
gccgcactgc tcggcaaagg gccggaccgc gggctgcaga tcgccgcgcc ggtgctgaac 







841
ccggccggcg acgtcgcgct acccgacaac ctgcccgccg acctcgtatc gcgccgcccc 







901
gacatcgtcg ccgcgcgctg gcaggtcgaa gcggcaatgc acgacgtgaa ggaagcgaag 







961
gccgagttct tcccggacgt gaacctcgcg gccggtttcg gtttcgatgc gttcggctgg 







1021
ggcaaattcc tgaacttcac gagccgccag gcgcagttcg gcccggcgat ccatctgccg 







1081
atcttcgacg ccggcgcgct gcgcgcgcag ctcaagggcc gctacgcgga cttcgatctg 







1141
tcggtcgcga actacaacca gacgctgatc agcgcgctga acgacgtggc gacgcaagtt 







1201
gccgcgatcc gtgcgatcga tcgccagatg ggcgacgcgc agcgcgcgct cgacgcgtcg 







1261
acgcgcgcat acgatctcgc ggtgattcgc tacaaggccg gcctgtcgcc gcagctgcag 







1321
gtgctgaccg cggacagcaa ccggctcgcc tcggagcaga ccgtgacgaa tctgaagatg 







1381
cgccgtcgcg acatgcagct cgcgctgatc aaggcgctcg gcggcggctt cgatgcgacc 







1441
ggcaccgcgc tcgcggcgcc cgagaccgcg aaacagaccc gacaggccgc caactga 









208
1
mkssplsvra gsrrtavava vaalalagca nyigiksdkq iapasqfeta qslpaqgghw 







61
psldwasqfg dpqlpklide alagnpsiaq aqariakass yiessrsnll pkaeasyswt 







121
relysanglf pppyggqwys ennvlasasw eldlwgknre rlrtavsqek aaeadmqqar 







181
itlassvart ynslaqlyal rdiaqreian resvgkitdg rvsagldtnv erqtargnia 







241
ttqaslsdld gqittvryql aallgkgpdr glqiaapvln pagdvalpdn lpadlvsrrp 







301
divaarwqve aamhdvkeak aeffpdvnla agfgfdafgw gkflnftsrq aqfgpaihlp 







361
ifdagalraq lkgryadfdl svanynqtli salndvatqv aairaidrqm gdaqraldas 







421
traydlavir ykaglspqlq vltadsnrla seqtvtnlkm rrrdmqlali kalgggfdat 







481
gtalaapeta kqtrqaan 





823
85
161525558
YP_001580570
232
1
tcacagcgac catttcagtt ctccatagaa agtgcgaccc ggatacgggt ggaacacgta 







61
gtagcggcgg tccgtcaggt tgtcgatgcc aaccgatgcg gtccagtgac gatcgaagcg 







121
atagcgcgcc ttcacgtcga cgaccgtaaa cgcgctcgtg ccgccataga cgtcgggatt 







181
cacgtcgctg ttgtcgagcg tgttgaactg ccggcccgaa tagcgcacgc cgacgctcgc 







241
gagccagtgc tcgtcgaagc gatacgacgc gagcagattc gcgcgcatcc gcggaatgcg 







301
cggaaagcgc gagccgacgt acgccgggtt ggcggcatcg gcgagaatct gcgcattgct 







361
cgccgatacg ttcgcgtcga tcgcgagccc cttgagcccg acgttttcgc cgctgaacgc 







421
gagttcgacg ccgcgcacgc gcacgcggtc gacgttcgag atgttggtga ccgtcgaggc 







481
gccggcgacc gtcgtctggc tgtagatgga atcgcgcaag tcgctctgga acacgctcgc 







541
acgcacgacg ccgacgccga cgtcgcgctc cgcggtgaag tcccagtcga tcgccttttc 







601
ggggcgcagg ttcggattgt tgttgacgat cgcgttgttc gagatcgtcc cctggaacag 







661
ctcgccgacc gtcggaaaac gcgtgccggt cgcgaacgac agccggaagc gccacacgtc 







721
ggtcgcatcc cattgcagcg caactttcgg cgacagcgcg ttggcgctgc gatccgcata 







781
gccgaacgtg ccgctcgcgt tgccgagcgc gccgccgtag gcgtcccagc gctcgtagcg 







841
cagcccgagc gtcgcgagcc agcccggcgc gaagcgccat gcgtcctgcg cgtagagcgc 







901
ctgcgtgcgc gtatcgccgc gatagacgct ggcgagcgac gtgaccggac cgttcagcca 







961
gtcggccgtg ttgtatgtga cgttgcgcag gaaatagttg tcgtagtgat agccgaacgt 







1021
gaacgcatga ccgtgcacct cgggcgcttc ggccttcagg tcgagcgtgc gccagccggt 







1081
gccgtcaccc tggaacagcg tgcccgcccc gccctgcgcg gtagacgcgg cgcgcagcac 







1141
gtcgcgcgac acgtcgtacg ccgacacgat gcccgacagc cgccagcccg acgcgagccg 







1201
gccgttcagc ccgaacgcgt acagccagtt ctcctggtcg ccgcgctgcg gcgcgaatgc 







1261
gccgggcgcg atcgtcaggt tctgcccgcc gatcgccacg ttgccgccat agaccggctg 







1321
gccggccgcg tcgcgcagaa acgtgtcgcc gtgctggcgg tagtggttct cccaatgccc 







1381
gagcgtcagc gtcgcgtcga cgcggtcggt gaacgcatag cccatgcgca ccgtttcgtt 







1441
gatctgctcg gtgcgctcga gcgattgcgc gccgacgatc gtgcgcgcct tgccgttcgg 







1501
cccgatgtcg gtggcggcgc ccgtgacggg caccgcgcgg ccgagtttcg cgttgtagct 







1561
cgcgttcggg ctcgcgtact gcatcggctg accgtcgttc tcgagccggt cgagcgacag 







1621
cgagaaccag aagcggccga tgcggttcgc gatccgcgca gtctgatgat tgccgccgaa 







1681
gctgtcggcg aatccgtagc cgtcgcgata gcgctgcgtg aagaactgcg tcgacagcga 







1741
cgcctcgagc ttgtccggcc gccgcgtggt cagtagcacg gtcgagccga tcgcattgcc 







1801
cggatacagc gccgaaaacg ggccgtacag cacgtcgacg cgcgcgatgt cgtcgggcgc 







1861
gatcagcgac cagcgcggcg gatacgcata gctcgaaccg agcaggttcg acaacagcag 







1921
gccgtccgca tagacgaggc cccgcgcgct ctgcagctcg ttgaagtcgc gccccgcgaa 







1981
cacgctgttg cggtcgccga tatagcgctt gcgcaccatc aggttcggcg cgtacttcag 







2041
cgtgtcctcg gccgtgacgt tggtatgcgt gtcgatgcgc tcgcgcgtca tcgattcgac 







2101
gacggccggc gtatcgggat cgagcggctg gcgctgggcc gtcacgctga cggcgctcag 







2161
cgtgtcgccg gccggttcgg cggaagcagc cgaagccgcg gaagccgcgg gcacgacggc 







2221
ccgcgcggag cggttcgcgg gatcgcggac cggatcggtc ggcgcgctgt cggacgatgc 







2281
cgcatatgcg ccgggcaagg caaaagcggc cgcgcacgcc agcgcgagcc ggccgcgcgt 







2341
cggtcgcgcg gcaagggtaa gcaacat 









209
1
mlltlaarpt rgrlalacaa afalpgayaa ssdsaptdpv rdpanrsara vvpaasaasa 







61
asaepagdtl savsvtaqrq pldpdtpavv esmtreridt htnvtaedtl kyapnlmvrk 







121
ryigdrnsvf agrdfnelqs arglvyadgl llsnllgssy aypprwslia pddiarvdvl 







181
ygpfsalypg naigstvllt trrpdkleas lstqfftqry rdgygfadsf ggnhqtaria 







241
nrigrfwfsl sldrlendgq pmqyaspnas ynaklgravp vtgaatdigp ngkartivga 







301
qslerteqin etvrmgyaft drvdatltlg hwenhyrqhg dtflrdaagq pvyggnvaig 







361
gqnitiapga fapqrgdqen wlyafglngr lasgwrlsgi vsaydvsrdv lraastaqgg 







421
agtlfqgdgt gwrtldlkae apevhghaft fgyhydnyfl rnvtyntadw lngpvtslas 







481
vyrgdtrtqa lyaqdawrfa pgwlatlglr yerwdaygga lgnasgtfgy adrsanalsp 







541
kvalqwdatd vwrfrlsfat gtrfptvgel fqgtisnnai vnnnpnlrpe kaidwdftae 







601
rdvgvgvvra svfqsdlrds iysqttvaga stvtnisnvd rvrvrgvela fsgenvglkg 







661
laidanvsas naqiladaan payvgsrfpr iprmranlla syrfdehwla svgvrysgrq 







721
fntldnsdvn pdvyggtsaf tvvdvkaryr fdrhwtasvg idnltdrryy vfhpypgrtf 







781
ygelkwsl 





824
78
161520731
YP_001584158
233
1
tcagaacgtc gtgcgcaatc cggccatcac gctgcggccg ccttgcggcg cgaagccgcg 







61
cacgacggag gtcgcgtagc ggatgtcctg gttcgtcagg ttgtcgccgc gcagatacgc 







121
gagccagtgc gtcgcgccga cgcggaactt gtacgtcagc accacgccga gcgaggtata 







181
gccggcggtc ggttcatcgt cggcgggcac gcgatgctgc gaccacgcgt gcgtgagctg 







241
cgcgcgtgcg ccgaacggac cgtagccgta atcggccgcg agcgtcgcgc gcagcggtgc 







301
gatgcgcggc agcggctcgc cggtgtcgac gttgcgcgca tgcgtatagt cggccgtcaa 







361
ctcgaggtcg accgtgtggc cgctgtgtgc atagacgcga tacttgccgt cgagctcgat 







421
gccgtagaac tccgcgcgca cgccgcgata gacggcctcg gcgagcgcgt cgtcggtgcc 







481
gcgcgggacc ggctcgccgt cgctgtcgac gatgcgcccg gtcgcgtatt cggccagata 







541
gttcgaaaag cggttgtaga acatgccgac gctgccgcga ttcggtccgc tcgcatagcg 







601
cagcgacaga tcggtcgaca cggccttttc cttcgacgca ttcggattgc cgatcaggta 







661
ctgcccggtc gcatcgtgcg ggccgttcga atagagctcg tagaacgtgg gcgcgcgttc 







721
ggtgtacgcg acgttcgcgg cgatcgacca gacgggcgtc agcgaaaaca gcgcgccggc 







781
cgacacgctg cctgcgttga acgtgcgcga ttgcgcgccc gcgaatttct cgacgccggc 







841
cggatcggga tcgacgttca catgctcgat gcgtccgccg aggctcagct tcagcgccgg 







901
atcgacctgc cattcctcga ggccgaatag cgcgacgcta ttggtgcgcg tcgacggaac 







961
cagcgtttcg tcgccgagcg cggagaacgt attgccgccg aactgcacgc cgatcgcgcc 







1021
ttcgagcggc ccaagcttgc ggtgccgcgc ctcgatgcgc gcctcgtagc cgcgattgcg 







1081
gaacgtcgtc gcggtttcgc cgttgtcgac ttccttgtgg cggtaatcgg tgtaggcgaa 







1141
gtcgaatttc agcgtcgaga acggccccgg caggttgcgc gcctccgacg cgaaggcgag 







1201
gcgctcctga cgcatgcgca gccgcacgtc gtcttcggcg accgacccgt agttcgattc 







1261
gtagccgctg tacgacaggc ccgcgaaacc gtcggcccac gtgtacgacg cgccgaccgc 







1321
gccgccgtgc acgcggccgt cgctgttcgg cacgctgccg accggctgcg gcgtgtcggg 







1381
gccgtcgagc gcgcgttgcg tatcgctgcg cgcatagccg ggaatccgca gcttgccggt 







1441
ttcgcgatcg aatgcgtcga cgtggaacgc gaagcggcca ttgccgcctt cgacctgcgc 







1501
ggcgcccgcg cgcgtggcgt tcgcgccgcc gtagcggacg tcgagcgtac cggtcggacc 







1561
gtcgatcgcc tcgcgcggga tccggttgtc gatcgtgttc acgacgccgc cgatcgcgtt 







1621
gccgccgtac agcaacgccg ccgggccgcg cacgatctcg acgcgctcga tcgacagcgg 







1681
gtcttccggc acggcatggt cgtacgaaag cgacgacgca tcgtaggcgg ccacgccgtt 







1741
ctgcagcagc cggatccggt cgccgtccat gccgcgcacg atcgggcggc cgaccatcgg 







1801
cccgtacgtc gtggtcgaca cgccgggcag gccgttcagc gtttcgccga gcgaatcggc 







1861
ctggcggcgc agcagcgcat cgcccgacag ctgcgcggtg ggcgcgatca gttcggcgtc 







1921
gccgagcgga ttcgcggtga cgaaaatcgg cgcgagcggc gcgccgctcg aggacggcga 







1981
cgtcgcggtt gcgggcgtgt cggtttgcgc atgcgcaagg gcggcgagcg ccatcagcga 







2041
aacgggcgag agcgggcgga gcggaagacg agggagacgt ggaagacgtg ggaggtcgcg 







2101
catggtcgag gagtcggttg aagtctgtcg gaatcggatc gggggcgagc ccggcgcgcc 







2161
gcgaatcggt ttcat 









210
1
mkpirgapgs ppirfrqtst dsstmrdlpr lprlprlplr plspvslmal aalahaqtdt 







61
patatspsss gaplapifvt anplgdaeli aptaqlsgda llrrqadslg etlnglpgvs 







121
tttygpmvgr pivrgmdgdr irllqngvaa ydasslsydh avpedplsie rveivrgpaa 







181
llyggnaigg vvntidnrip reaidgptgt ldvryggana tragaaqveg gngrfafhvd 







241
afdretgklr ipgyarsdtq raldgpdtpq pvgsvpnsdg rvhggavgas ytwadgfagl 







301
sysgyesnyg svaeddvrlr mrqerlafas earnlpgpfs tlkfdfaytd yrhkevdnge 







361
tattfrnrgy eariearhrk lgplegaigv qfggntfsal gdetlvpstr tnsvalfgle 







421
ewqvdpalkl slggriehvn vdpdpagvek fagaqsrtfn agsvsagalf sltpvwsiaa 







481
nvayterapt fyelysngph datgqylign pnaskekavs tdlslryasg pnrgsvgmfy 







541
nrfsnylaey atgrivdsdg epvprgtdda laeavyrgvr aefygieldg kyrvyahsgh 







601
tvdleltady tharnvdtge plpriaplra tlaadygygp fgaraqltha wsqhrvpadd 







661
eptagytslg vvltykfrvg athwlaylrg dnltnqdiry atsvvrgfap qggrsvmagl 







721
rttf 





825
43
161523868
YP_001578880
234
1
atgcgcgtcg aacgggttcc ataccgctta ctcactgtcg cgacggccgc cgttttcctg 







61
gccgcgtgcg ggaaaaaaga atcggcaccg ccgccgcaaa cgccggaagt cggcgtcgtc 







121
acggtccagc cgcaggccgt accggtcttc accgaactgc ctggccgcac cagcgcgttc 







181
ctcgtcgcgc aggtccgcgc gcgggtcgac ggcatcgtgc tgcgccgtga attcaccgaa 







241
ggcagcgacg tcaaggccgg tcagcgcctc tacaagatcg acccggcacc gtatatcgcc 







301
gcgctgaaca gcgcgaaggc gacgctcgcg aaggcgcagg cgaacctcgc cacgcagaac 







361
gcgctcgtcg cgcgctacaa ggtgctggtg gccgcgaacg cggtcagcaa gcaggactac 







421
gacaacgcgg tggccgcgca agggcaggcc gccgccgacg tcgcggccgg caaggcgtcg 







481
gtcgagaccg cgcagatcaa cctcggctac acggacgtcg tatcgccgat caccggccgc 







541
gtcggcatct cgcaggtgac gccgggcgcg tacgtgcagg cgagccaggc gacgctgatg 







601
tcgaccgtgc agcagctcga cccggtctac gtcgacctca cgcaatcgag cctcgagggg 







661
ctgaagctgc gtcaggacgt gcagagcggc cgtctgaaga cgacgggccc cggtgccgcg 







721
aaggtgtcgc tgatcctgga ggacggccgc acctattcgc agcccggcaa gctgcagttc 







781
tccgacgtga cggtcgacca gacgaccggc tcggtcacga tccgtgcggt gttcccgaac 







841
ccgggcaagg tcctgctgcc ggggatgttc gtgcgcgcgc gcatcgagga aggcgtgaac 







901
gacaacgcgt atctggtgcc gcagatcggc gtcacgcacg accagaaggg ccaggcggtc 







961
gcgatggtcg tcaacgcgaa caacaaggtc gagccgcgca cgctgaccgc aacgcgcatc 







1021
gaaggtcaga actgggtcgt cgaaagcggg ctgcagcccg gcgatcgcgt gatcgtgcag 







1081
ggtgtcgaca aggtgcgtcc gggcgcgacc gtgaaggccg ttccggcgca gctcgcgtcg 







1141
gcgccgaacg acgcgtcggg tgccgccgcg ccggccgccg ccgcaccggc ttcggccgcc 







1201
gcggcatcgg ccgccgcagc atcgagtgcc ggcgcgtcgg gcgcgccggc tagcgccgcc 







1261
gcggggtcgt ccgccgccgc ttcgggcgcg caataa 









211
1
mrvervpyrl ltvataavfl aacgkkesap ppqtpevgvv tvqpqavpvf telpgrtsaf 







61
lvaqvrarvd givlrrefte gsdvkagqrl ykidpapyia alnsakatla kaqanlatqn 







121
alvarykvlv aanayskqdy dnavaaqgqa aadvaagkas vetaqinlgy tdvvspitgr 







181
vgisqvtpga yvqasqatlm stvqqldpvy vdltqssleg lklrqdvqsg rlkttgpgaa 







241
kvsliledgr tysqpgklqf sdvtvdqttg svtiravfpn pgkvllpgmf vrarieegvn 







301
dnaylvpqig vthdqkgqav amvvnannkv eprtltatri egqnwvvesg lqpgdrvivq 







361
gvdkvrpgat vkavpaqlas apndasgaaa paaaapasaa aasaaaassa gasgapasaa 







421
agssaaasga q 





*This ID number is an internal laboratory designation that was used to identify the polypeptides in U.S. Provisional Patent Application Ser. No. 61/922,504, filed Dec. 31, 2013.






The complete disclosure of all patents, patent applications, and publications, and electronically available material (including, for instance, nucleotide sequence submissions in, e.g., GenBank and RefSeq, and amino acid sequence submissions in, e.g., SwissProt, PIR, PRF, PDB, and translations from annotated coding regions in GenBank and RefSeq) cited herein are incorporated by reference. In the event that any inconsistency exists between the disclosure of the present application and the disclosure(s) of any document incorporated herein by reference, the disclosure of the present application shall govern. The foregoing detailed description and examples have been given for clarity of understanding only. No unnecessary limitations are to be understood therefrom. The invention is not limited to the exact details shown and described, for variations obvious to one skilled in the art will be included within the invention defined by the claims.


Unless otherwise indicated, all numbers expressing quantities of components, molecular weights, and so forth used in the specification and claims are to be understood as being modified in all instances by the term “about.” Accordingly, unless otherwise indicated to the contrary, the numerical parameters set forth in the specification and claims are approximations that may vary depending upon the desired properties sought to be obtained by the present invention. At the very least, and not as an attempt to limit the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.


Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the invention are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. All numerical values, however, inherently contain a range necessarily resulting from the standard deviation found in their respective testing measurements.


All headings are for the convenience of the reader and should not be used to limit the meaning of the text that follows the heading, unless so specified.

Claims
  • 1. A composition comprising: at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 2. The composition of claim 1 wherein at least one of the isolated polypeptides comprises the amino acid sequence of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or an immunogenic fragment or structurally similar variant thereof.
  • 3. The composition of claim 1 wherein at least one of the isolated polypeptides comprises the amino acid sequence of SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, or an immunogenic fragment or structurally similar variant thereof.
  • 4. The composition of claim 1 wherein at least one of the isolated polypeptides comprises the amino acid sequence of SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, or an immunogenic fragment or structurally similar variant thereof.
  • 5. The composition of claim 1 wherein at least one of the isolated polypeptides comprises the amino acid sequence of SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, or an immunogenic fragment or structurally similar variant thereof.
  • 6. The composition of claim 1 wherein at least one of the isolated polypeptides comprises the amino acid sequence of SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, or an immunogenic fragment or structurally similar variant thereof.
  • 7. A method comprising: administering to a subject an amount of a composition effective to induce the subject to produce antibody that specifically binds to at least one component of the composition, wherein the composition comprises:at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 8. A method for treating an infection in a subject, the method comprising: administering an effective amount of a composition to a subject having or at risk of having an infection caused by a Burkholderia spp., wherein the composition comprises:at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 9. A method for treating a symptom in a subject, the method comprising: administering an effective amount of a composition to a subject having or at risk of having an infection caused by a Burkholderia spp., wherein the composition comprises:at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 10. A method for decreasing colonization in a subject, the method comprising: administering an effective amount of a composition to a subject colonized by a Burkholderia spp., wherein the composition comprises:at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 11. A method for treating an infection in a subject, the method comprising: administering an effective amount of a composition to a subject having or at risk of having an infection caused by a Burkholderia spp., wherein the composition comprises:antibody that specifically binds at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 12. A method for treating a symptom in a subject comprising: administering an effective amount of a composition to a subject having or at risk of having an infection caused by a Burkholderia spp., wherein the composition comprises:antibody that specifically binds at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 13. A method for decreasing colonization in a subject, the method comprising: administering an effective amount of a composition to a subject colonized by a Burkholderia spp., wherein the composition comprises:antibody that specifically binds at least two isolated polypeptides that are involved in iron acquisition by a Burkholderia spp., wherein each polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 14. A kit for detecting antibody that specifically binds a polypeptide, comprising in separate containers: an isolated polypeptide involved in iron acquisition by a Burkholderia spp., wherein the polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide; anda reagent that detects an antibody that specifically binds the polypeptide.
  • 15. A kit for detecting a polypeptide, comprising in separate containers: an antibody that specifically binds an isolated polypeptide involved in iron acquisition by a Burkholderia spp., wherein the polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide; anda second reagent that specifically binds the polypeptide.
  • 16. A composition comprising: isolated antibody that specifically binds to a polypeptide involved in iron acquisition by a Burkholderia spp., wherein the polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 17. A composition comprising: an isolated whole cell that comprises a polypeptide involved in iron acquisition by a Burkholderia spp., wherein the polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 18. A composition comprising: isolated antibody that specifically binds to a whole cell that comprises a polypeptide involved in iron acquisition by a Burkholderia spp., wherein the polypeptide is isolatable from a Burkholderia spp. when the Burkholderia spp. is grown under conditions effective for the Burkholderia spp. to express the polypeptide.
  • 19. A recombinant cell comprising increased expression of a Burkholderia spp. metal-regulated polypeptide compared to a wild-type control.
CROSS-REFERENCE TO RELATED APPLICATION

This application claims priority to U.S. Provisional Patent Application Ser. No. 61/922,504, filed Dec. 31, 2013, which is incorporated herein by reference.

Provisional Applications (1)
Number Date Country
61922504 Dec 2013 US