The invention relates to the genomic sequence and to nucleotide sequences encoding polypeptides of Listeria monocytogenes, such as cell envelope, secreted or specific polypeptides, or polypeptides involved in metabolism, in the replication process or in virulence, and also to vectors which include said sequences and to cells or animals transformed with these vectors. The invention also relates to methods for detecting these nucleic acids or polypeptides and to kits for diagnosing Listeria monocytogenes infection. The invention is also directed toward a method for selecting compounds capable of modulating bacterial infection and a method of biosynthesis or of biodegradation of molecules of interest using said nucleotide sequences or said polypeptides. Finally, the invention comprises pharmaceutical compositions, in particular vaccinal compositions, for preventing and/or treating bacterial infections, in particular Listeria monocytogenes infections.
Listeria monocytogenes is a facultative intracellular pathogen. It is the etiological agent of listeriosis, a food-related infection which poses increasingly great public health problems, with a considerable economic impact for the European food industry. Listeriosis is the most lethal food-related infection (approximately 30% mortality). Listeria monocytogenes has the unusual property of being able to cross three barriers: the intestinal barrier, the blood-brain barrier and the placental barrier. Clinical manifestations of listeriosis include meningitis, meningo-encephalitis, abortions and septicemias. This infection is opportunistic and mainly affects pregnant women, babies, elderly individuals and immunodepressed individuals, in particular individuals suffering from AIDS. This disease apparently also affects healthy individuals and is responsible for a considerable number of epidemics due to contaminated food products. Listeria monocytogenes is also important in veterinary terms, with the main risk being for members of the ovine family (sheep) and of the bovine family. Listeria monocytogenes is particularly resistant to stress or to extreme conditions, and it is important to search for its presence with care not only for problems with food safety, but also for problems of environmental safety.
The physical map and a preliminary genetic map of the Listeria monocytogenes genome have been established for the LO28 strain. However, no fine genetic map is available for the moment. The genome of this bacterium is circular and comprises approximately 3 000 kilobases. Its GC content is approximately 38%. Studies of virulence factors have enabled the identification of a 15 kb locus, which may be considered to be a pathogenicity island insofar as it contains most of the genes whose function in virulence has been clearly identified. In addition to this locus, some other genes have been identified, in particular the invasion and motility genes and genes which encode a murein hydrolase, a superoxide dismutase, sigma factors, etc.
An important family of Listeria monocytogenes proteins is the surface protein family. Evolutionary processes have allowed the development of a number of unique mechanisms on Gram+ bacteria, by which they can immobilize proteins at their surface. The functions of these various cell wall proteins are extremely diverse. However, many proteins covalently attached to the surface of Gram+ pathogens are felt to be important for survival of the pathogen inside the infected host. For Listeria monocytogenes, the ability to penetrate into eukaryotic cells has been linked to a family of surface and/or secreted proteins, the internalins. So far, nine members of the internalin family have been identified (InlA, InlB, InlC, InlC2, INlD, InlE, InlF, InlG and InlH). It is thought that they are anchored in the cell wall by proteolytic cleavage of the T-G bond of the LP×TG motif and that the covalent bonding of the carboxylic group of the threonine to a free amino group in the peptidoglycan [lacuna]. Recent studies have shown that there are two classes of internalins; a group of proteins associated with the cell wall, of high molecular weight (>50 kDa), and a group of smaller proteins (<30 kDa) which are secreted. These two classes have similar, very homologous LRR motifs, and also the LRR flanking regions, and the N-terminal signal peptide sequence. The small internalins (s-inl) do not have the B repeat region or the sequence for anchoring in the cell wall, and are thus secreted.
The study of Listeria monocytogenes requires new approaches, in particular genetic approaches, in order to improve understanding of the various metabolic pathways of this organism.
Thus, an object of the present invention is to disclose the complete sequence of the genome of Listeria monocytogenes EGD-e deposited with the CNCM [National Collection of Microorganism Cultures] on Apr. 11, 2000, under the number I-2440, and also of all the genes contained in said genome.
In fact, knowledge of the genome of this organism makes it possible to define more clearly the interactions between the various genes, the various proteins and, by the same token, the various metabolic pathways. In fact, and unlike the disclosure of isolated sequences, the complete genomic sequence of an organism makes up a whole entity, which immediately makes it possible to obtain all the information required by this organism in order to grow and function.
The present invention therefore relates to a nucleotide sequence of Listeria monocytogenes, characterized in that it corresponds to the sequence SEQ ID NO. 1.
The present invention also relates to a nucleotide sequence of Listeria monocytogenes, characterized in that it is chosen from:
- a) a nucleotide sequence comprising at least 80%, 85%, 90%, 95% or 98% identity with SEQ ID NO. 1;
- b) a nucleotide sequence which hybridizes, under high stringency conditions, with SEQ ID NO. 1;
- c) a nucleotide sequence complementary to SEQ ID NO. 1 or complementary to a nucleotide sequence as defined in a) or b), or a nucleotide sequence of the corresponding RNA;
- d) a nucleotide sequence of a representative fragment of SEQ ID NO. 1, or of a representative fragment of a nucleotide sequence as defined in a), b) or c);
- e) a nucleotide sequence comprising a sequence as defined in a), b), c) or d); and
- f) a modified nucleotide sequence of a nucleotide sequence as defined in a), b), c), d) or e).
More particularly, a subject of the present invention is also the nucleotide sequences characterized in that they are derived from SEQ ID NO. 1 and in that they encode a polypeptide chosen from the polypeptides of sequence SEQ ID NO. 2 to SEQ ID NO. 2854, preferably encoding a cell envelope polypeptide or a polypeptide present at the surface of Listeria monocytogenes of sequence SEQ ID NO. 2 to SEQ ID NO. 41, or encoding a polypeptide involved in vitamin B12 bio synthesis of sequence SEQ ID NO. 42 to SEQ ID NO. 64.
More generally, the present invention also relates to the nucleotide sequences derived from SEQ ID NO.1, and encoding a polypeptide of L. monocytogenes, as may be isolated from SEQ ID NO. 1.
In addition, the nucleotide sequences characterized in that they comprise a nucleotide sequence chosen from:
- a) a nucleotide sequence encoding a polypeptide chosen from the sequences SEQ ID NO. 2 to SEQ ID NO. 2854, preferably chosen from the polypeptides of sequence SEQ ID NO. 2 to SEQ ID NO. 41 or SEQ ID NO. 42 to SEQ ID NO. 64;
- b) a nucleotide sequence comprising at least 80%, 85%, 90%, 95% or 98% identity with a nucleotide sequence encoding a polypeptide chosen from the sequences SEQ ID NO. 2 to SEQ ID NO. 2854, preferably chosen from the polypeptides of sequence SEQ ID NO. 2 to SEQ ID NO. 41 or SEQ ID NO. 42 to SEQID NO. 64;
- c) a nucleotide sequence which hybridizes, under high stringency conditions, with a nucleotide sequence encoding a polypeptide chosen from the sequences SEQ ID NO. 2 to SEQ ID NO. 2854, preferably chosen from the polypeptides of sequence SEQ ID NO. 2 to SEQ ID NO. 41 or SEQ ID NO. 42 to SEQ ID NO. 64;
- d) a complementary nucleotide sequence or an RNA sequence corresponding to a sequence as defined in a), b) or c);
- e) a nucleotide sequence of a representative fragment of a sequence as defined in a), b), c) or d); and
- f) a modified nucleotide sequence of a sequence as defined in a), b), c), d) or e), are also subjects of the invention.
The terms “nucleic acid”, “nucleic acid sequence”, “polynucleotide”, “oligonucleotide”, “polynucleotide sequence” and “nucleotide sequence”, which will be used indifferently in the present description, are intended to denote a precise series of nucleotides, which may or may not be modified, making it possible to define a fragment or a region of a nucleic acid, which may or may not comprise unnatural nucleotides, and which may correspond equally to a double-stranded DNA, a single-stranded DNA and products of transcription of said DNAs. Thus, the nucleic acid sequences according to the invention also encompass PNAs (peptide nucleic acids), or the like.
It should be understood that the present invention does not relate to the nucleotide sequences in their natural chromosomal environment, i.e. in the natural state. They are sequences which have been isolated and/or purified, i.e. they have been taken directly or indirectly, for example by copying, their environment having been at least partially modified. Nucleic acids obtained by chemical synthesis are also intended to be denoted.
For the purpose of the present invention, the term “percentage identity” between two nucleic acid or amino acid sequences is intended to denote a percentage of nucleotides or of amino acid residues which are identical between the two sequences to be compared, obtained after the best alignment, this percentage being purely statistical and the differences between the two sequences being distributed randomly and over their entire length. The term “best alignment” or “optimal alignment” is intended to denote the alignment for which the percentage identity determined as below is the highest. Sequence comparisons between two nucleic acid or amino acid sequences are conventionally carried out by comparing these sequences after having aligned them optimally, said comparison being made by segment or by “window of comparison” so as to identify and compare local regions of sequence similarity. The optimal alignment of the sequences for the comparison may be produced, besides manually, by means of the local homology algorithm of Smith and Waterman (1981, Ad. App. Math. 2: 482), by means of the local homology algorithm of Neddleman and Wunsch (1970, J. Mol. Biol. 48 : 443), by means of the similarity search method of Pearson and Lipman (1988, Proc. Natl. Acad. Sci. USA 85 : 2444), by means of computer programs using these algorithms (GAP, BESTFIT, BLAST P, BLAST N, FASTA and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.). In order to obtain the optimal alignment, the BLAST program is preferably used, with the BLOSUM 62 matrix. The PAM or PAM250 matrices may also be used.
The percentage identity between two nucleic acid or amino acid sequences is determined by comparing these two sequences aligned in an optimal manner in which the nucleic acid or amino acid sequence to be compared may comprise additions or deletions compared to the reference sequence for an optimal alignment between these two sequences. The percentage identity is calculated by determining the number of identical positions for which the nucleotide or the amino acid residue is identical between the two sequences, dividing this number of identical positions by the total number of positions compared and multiplying the result obtained by 100 so as to obtain the percentage identity between these two sequences.
The expression “nucleic acid sequences having a percentage identity of at least 80%, preferably 85% to 90%, more preferably 95% or even 98%, after optimal alignment with a reference sequence” is intended to denote the nucleic acid sequences having, compared to the reference nucleic acid sequence, certain modifications, such as in particular a deletion, a truncation, an extension, a chimeric fusion and/or a substitution, in particular of the point type, and in which the nucleic acid sequence exhibits at least 80%, preferably 85%, 90%, 95% or 98%, identity after optimal alignment with the reference nucleic acid sequence. They are preferably sequences whose complementary sequences are capable of hybridizing specifically with the reference sequences. Preferably, the specific or high stringency hybridization conditions will be such that they ensure at least 80%, preferably 85%, 90%, 95% or 98%, identity after optimal alignment between one of the two sequences and the sequence complementary to the other.
Hybridization after high stringency conditions means that the conditions of temperature and of ionic strength are chosen such that they allow hybridization between two complementary DNA fragments to be maintained. By way of illustration, high stringency conditions in the hybridization step for the purposes of defining the polynucleotide fragments described above are advantageously as follows.
The DNA-DNA or DNA-RNA hybridization is carried out in two steps: (1) prehybridization at 42° C. for 3 hours in phosphate buffer (20 mM, pH 7.5) containing 5×SSC (1×SSC corresponds to a solution of 0.15 M NaCl+0.015 M sodium citrate), 50% of formamide, 7% of sodium dodecyl sulfate (SDS), 10×Denhardt's, 5% of dextran sulfate and 1% of salmon sperm DNA; (2) hybridization per se for 20 hours at a temperature which depends on the length of the probe (i.e. 42° C. for a probe >100 nucleotides in length), followed by 2 washes of 20 minutes at 20° C. in 2×SSC +2% SDS, and I wash of 20 minutes at 20° C. in 0.1×SSC+0.1% SDS. The final wash is carried out in 0.1 ×SSC+0.1% SDS for 30 minutes at 60° C. for a probe >100 nucleotides in length. The high stringency hybridization conditions described above for a polynucleotide of defined length may be adjusted by those skilled in the art for oligonucleotides which are longer or shorter, according to the teaching of Sambrook et al., (1989, Molecular cloning: a laboratory manual. 2nd Ed. Cold Spring Harbor).
In addition, the expression “representative fragment of sequences according to the invention” is intended to denote any nucleotide fragment having at least 15 consecutive nucleotides, preferably at least 20, 25, 30, 50, 75, 100, 150, 300 and 450 consecutive nucleotides, of the sequence from which it is derived.
The term “representative fragment” is in particular intended to mean a nucleic acid sequence encoding a biologically active fragment of a polypeptide, as defined below.
The term “representative fragment” is also intended to mean the intergenic sequences, and in particular the nucleotide sequences carrying the regulatory signals (promoters, terminators, or even enhancers, etc).
Among said representative fragments, preference is given to those having nucleotide sequences corresponding to open reading frames, named ORF sequences (ORF for open reading frame), generally included between an initiation codon and a stop codon, or between two stop codons, and encoding polypeptides, preferably of at least 100 amino acids, such as, for example, without being limited thereto, the ORF sequences which will subsequently be described.
The numbering of the ORF nucleotide sequences which will subsequently be used in the present description corresponds to the numbering of the amino acid sequences of the proteins encoded by said ORFs.
The representative fragments according to the invention may be obtained, for example, by specific amplification, such as PCR, or after digestion, with suitable restriction enzymes, of nucleotide sequences according to the invention, this method being described in particular in the work by Sambrook et al. Said representative fragments may also be obtained by chemical synthesis when they are not too long, according to methods well known to those skilled in the art.
The sequences containing sequences according to the invention, or representative fragments, are also intended to include the sequences which are naturally framed by sequences which exhibit at least 80%, 85%, 90%, 95% or 98% identity with the sequences according to the invention.
The term “modified nucleotide sequence” is intended to mean any nucleotide sequence obtained by mutagenesis according to techniques well known to those skilled in the art, and comprising modifications, preferably a maximum of 10% of modified nucleotides, compared to the normal sequences, for example mutations in the regulatory and/or promoter sequences for expression of the polypeptide, in particular leading to a modification of the level of expression or of the activity of said polypeptide.
The term “modified nucleotide sequence” is also intended to mean any nucleotide sequence encoding a modified polypeptide as defined below.
The representative fragments according to the invention may also be probes or primers, which may be used in methods for detecting, identifying, assaying or amplifying nucleic acid sequences.
For the purpose of the invention, a probe or primer is defined as being a single-stranded nucleic acid fragment or a denatured double-stranded fragment, comprising, for example, from 12 bases to a few kb, in particular from 15 bases to a few hundreds of bases, preferably from 15 to 50 or 100 bases, and possessing a specificity of hybridization under given conditions so as to form a hybridization complex with a target nucleic acid.
The probes and primers according to the invention may be labeled directly or indirectly with a radioactive or nonradioactive compound using methods well known to those skilled in the art, in order to obtain a detectable and/or quantifiable signal.
The unlabeled polynucleotide sequences according to the invention may be used directly as a probe or primer.
The sequences are generally labeled so as to obtain sequences which can be used for many applications. The primers or the probes according to the invention are labeled with radioactive elements or with nonradioactive molecules.
Among the radioactive isotopes used, mention may be made of 32P, 33P, 35S, 3H or 125I. The nonradioactive entities are selected from ligands, such as biotin, avidin, streptavidin or dioxigenin, haptens, dyes and luminescent agents, such as radioluminescent, chemiluminescent, bioluminescent, fluorescent or phosphorescent agents.
The polynucleotides according to the invention may thus be used as a primer and/or probe in methods using in particular the PCR (polymerase chain reaction) technique) (Rolfs et al., 1991, Berlin: Springer-Verlag). This technique requires choosing pairs of oligonucleotide primers framing the fragment which must be amplified. Reference may, for example, be made to the technique described in U.S. Pat. No. 4,683,202. The amplified fragments can be identified, for example after agarose or polyacrylamide gel electrophoresis, or after a chromatographic technique such as gel filtration or ion exchange chromatography, and then sequenced. The specificity of the amplification can be controlled using, as a primer, the nucleotide sequences of polynucleotides of the invention and, as a matrix, plasmids containing these sequences or else the derived amplification products. The amplified nucleotide fragments may be used as reagents in hybridization reactions in order to demonstrate the presence, in a biological sample, of a target nucleic acid with the sequence complementary to that of said amplified nucleotide fragments.
The invention is also directed toward the nucleic acids which can be obtained by amplification using primers according to the invention.
Other techniques for amplifying the target nucleic acid may advantageously be used as an alternative to PCR (PCR-like), using a pair of primers of nucleotide sequences according to the invention. The term “PCR-like” is intended to denote all the methods carrying out direct or indirect reproductions of nucleic acid sequences, or in which the labeling systems have been amplified; these techniques are of course well known; in general, they involve amplification of the DNA by a polymerase; when the sample of origin is an RNA a prior reverse transcription should be carried out. There are currently a large number of methods for this amplification, such as, for example, the SDA (strand displacement amplification) technique (Walker et al., 1992, Nucleic Acids Res. 20: 1691), the TAS (transcription-based amplification system) technique described by Kwoh et al. (1989, Proc. Natl. Acad. Sci. USA, 86, 1173), the 3SR (self-sustained sequence replication) technique described by Guatelli et al. (1990, Proc. Natl. Acad. Sci. USA 87: 1874), the NASBA (nucleic acid sequence based amplification) technique described by Kievitis et al. (1991, J. Virol. Methods, 35, 273), the TMA (transcription mediated amplification) technique, the LCR (ligase chain reaction) technique described by Landegren et al. (1988, Science 241, 1077), the RCR (repair chain reaction) technique described by Segev (1992, Kessler C. Springer Verlag, Berlin, New-York, 197-205), the CPR (cycling probe reaction) technique described by Duck et al. (1990, Biotechniques, 9, 142) or the Q-beta-replicase amplification technique described by Miele et al. (1983, J. Mol. Biol., 171, 281). Some of these techniques have since been improved.
When the target polynucleotide to be detected is an mRNA, an enzyme of the reverse transcriptase type is advantageously used, prior to carrying out an amplification reaction using the primers according to the invention or to carrying out a detection method using the probes of the invention, in order to obtain a cDNA from the mRNA contained in the biological sample. The cDNA obtained will then serve as a target for the primers or the probes used in the amplification or detection method according to the invention.
The probe hybridization technique may be performed in many ways (Matthews et al., 1988, Anal. Biochem., 169, 1-25). The most general method consists in immobilizing the nucleic acid extracted from the cells of various tissues, or from cells in culture, on a support (such as nitrocellulose, nylon or polystyrene) and in incubating, under well-defined conditions, the immobilized target nucleic acid with the probe. After hybridization, the excess probe is removed and the hybrid molecules formed are detected using a suitable method (measurement of the radioactivity, of the fluorescence or of the enzymatic activity linked to the probe).
According to another embodiment of the nucleic acid probes according to the invention, the latter may be used as capture probes. In this case, a probe, termed “capture probe”, is immobilized on a support and is used to capture, by specific hybridization, the target nucleic acid obtained from the biological sample to be tested, and the target nucleic acid is then detected using a second probe, termed “detection probe”, labeled with a readily detectable element.
Among the advantageous nucleic acid fragments, mention should thus in particular be made of antisense oligonucleotides, i.e. oligonucleotides with a structure which ensures, by hybridization with the target sequence, inhibition of expression of the corresponding product. Mention should also be made of the sense oligonucleotides which, by interacting with proteins involved in the regulation of expression of the corresponding product, will induce either inhibition or activation of this expression.
Preferably, the probes or primers according to the invention are covalently or noncovalently immobilized on a support. In particular, the support may be a DNA chip or a high density filter, which are also subjects of the present invention.
The term “DNA chip” or “high density filter” is intended to denote a support to which DNA sequences are attached, it being possible to pinpoint each one of these sequences via its geographical location. These chips or filters differ mainly in their size, the material of the support and, possibly, the number of DNA sequences which are attached thereto.
The probes or primers according to the first invention can be attached to solid supports, in particular DNA chips, using various production methods. In particular, in situ synthesis may be performed by photochemical addressing or by inkjet. Other techniques consist in performing ex situ synthesis and attaching the probes to the DNA chip support by mechanical or electronic addressing or by inkjet. These various methods are well known to those skilled in the art.
A nucleotide sequence (probe or primer) according to the invention therefore makes it possible to detect or amplify specific nucleic acid sequences. In particular, the detection of these said sequences is facilitated when the probe is attached to a DNA chip, or to a high density filter.
The use of DNA chips or of high density filters in fact makes it possible to determine gene expression in an organism having a genomic sequences close to L. monocytogenes EGD-e.
The genomic sequence of L. monocytogenes EGD-e, supplemented by the identification of all the genes in this organism, as presented in the present invention, serves as a basis for constructing these DNA chips or filters.
The preparation of these filters or chips consists in synthesizing oligonucleotides corresponding to the 5′ and 3′ ends of the genes. These oligonucleotides are chosen using the genomic sequence and its annotations disclosed by the present invention. The temperature for pairing of these oligonucleotides at the corresponding sites on the DNA should be approximately the same for each oligonucleotide. This makes it possible to prepare DNA fragments corresponding to each gene using suitable PCR conditions in a highly automated environment. The amplified fragments are then immobilized on filters or supports made of glass, silicon or synthetic polymers and these media are used for the hybridization.
The availability of such filters and/or chips and of the corresponding annotated genomic sequence makes it possible to study the expression of large groups, or even all, of the genes in the microorganisms associated with Listeria monocytogenes, by preparing these complementary DNAs and hybridizing them to the DNA or to the oligonucleotides immobilized on the filters or the chips. In addition, the filters and/or the chips make it possible to study the variability of strains or of species, by preparing the DNA of these organisms and hybridizing it to the DNA or to the oligonucleotides immobilized on the filters or the chips.
The differences between the genomic sequences of the various strains or species can greatly affect the strength of hybridization and, consequently, affect the interpretation of the results. It may therefore be necessary to have the precise sequence of the genes of the strains intended to be studied. The method for detecting genes described below in detail, involving determining the sequence of random fragments of a genome and organizing them according to the sequence of the complete genome of Listeria monocytogenes EGD-e disclosed in the present invention, may be very useful.
The nucleotide sequences according to the invention may be used in DNA chips to carry out mutation analysis. This analysis is based on constituting chips capable of analyzing each base of a nucleotide sequence according to the invention. For this purpose, techniques for microsequencing on a DNA chip may in particular be used. The mutations are detected by extension of immobilized primers which hybridize with the matrix of the sequences analyzed, in a position just adjacent to that of the mutated nucleotide sought. A single-stranded RNA or DNA matrix of the sequences to be analyzed will advantageously be prepared according to conventional methods, using products amplified according to techniques such as PCR. The single-stranded DNA or RNA matrices thus obtained are then deposited onto the DNA chip, under conditions which allow them to hybridize specifically to the immobilized primers. A heat-stable polymerase, for example Tth or Taq DNA polymerase, specifically extends the 3′ end of the immobilized primer with a labeled nucleotide analog complementary to the nucleotide in the position of the variable site; for example, a thermal cycle is performed in the presence of fluorescent dideoxyribonucleotides. The experimental conditions will be adjusted in particular to the chips used, to the primers immobilized, to the polymerases used and to the labeling system chosen. An advantage of microsequencing, compared to techniques based on probe hybridization, is that it makes it possible to identify all the variable nucleotides with optimum discrimination under homogeneous reaction conditions; used on DNA chips, it allows optimum resolution and specificity for the routine and industrial multiplex detection of mutations.
The use of the high density filters and/or of the chips thus makes it possible to obtain new knowledge regarding gene regulation in organisms of industrial importance, and in particular listeria propagated under various conditions. It also allows rapid identification of the differences between the genomes of the strains used in many industrial applications.
In addition, a DNA chip or a filter may be an extremely advantageous tool for determining, detecting and/or identifying a microorganism. Thus, preference is also given to the DNA chips according to the invention which also contain at least one nucleotide sequence of a microorganism other than Listeria monocytogenes, immobilized on the support of said chip. Preferably, the microorganism chosen is selected from the bacteria of the Listeria genus (hereinafter designated as L. monocytogenes-associated bacteria), or the variants of Listeria monocytogenes EGD-e.
A DNA chip or a filter according to the invention is a very useful element of certain kits or packs for detecting and/or identifying microorganisms, in particular bacteria belonging to the species Listeria monocytogenes or associated microorganisms, which are also subjects of the invention.
Moreover, the DNA chips or the filters according to the invention, containing probes or primers specific for Listeria monocytogenes, are very advantageous elements of kits or packs for detecting and/or quantifying the expression of genes of Listeria monocytogenes (or of associated microorganisms).
In fact, the control of gene expression is a critical point for optimizing the growth and yield of a strain, either by allowing the expression of one or more new genes, or by modifying the expression of genes already present in the cell. The present invention provides all the sequences naturally active in L. monocytogenes allowing gene expression. It thus makes it possible to determine all the sequences expressed in L. monocytogenes. It also provides a tool for locating the genes the expression of which follows a given scheme. To do this, the DNA of some or all of the genes of L. monocytogenes may be amplified using primers according to the invention, and then attached to a support, such as, for example, glass or nylon or a DNA chip, in order to construct a tool which makes it possible to follow the expression profile of these genes. This tool, consisting of this support containing the coding sequences, is used as a matrix for hybridization to a mixture of labeled molecules which reflect the messenger RNAs expressed in the cell (in particular the labeled probes according to the invention). By repeating this experiment at various times and combining all of these data by a suitable processing, the expression profiles of all of these genes are then obtained. It is also possible to take advantage of knowledge of the sequences which follow a given regulatory scheme in order to search, in a directed manner, for example by homology, for other sequences which follow the same regulatory scheme overall, but in a slightly different manner. In addition, it is possible to isolate each control sequence present upstream of the segments being used as probes and to follow the activity thereof using suitable means, such as a reporter gene (luciferase, β-galactosidase, GFP (for green fluorescent protein)). These isolated sequences can then be modified and assembled by metabolic engineering with sequences of interest with a view to optimal expression thereof.
Using the genomic sequence presented in the present invention, those skilled in the art will be able to identify the genes encoding proteins which regulate gene transcription in L. monocytogenes. Moreover, table I provides the list of open reading frames (ORF) identified on the Listeria monocytogenes genome (SEQ ID NO. 1), with their position on said genome, and the putative functions which may be attributed to them. However, such a list should not be considered to be limiting, a protein possibly being made to have several roles in the cell.
Modifying the structure or the integrity of these genes may make it possible to modify the expression of the target genes controlled by promoters which are targets for these regulators. Thus, those skilled in the art may choose the regulator(s) relevant for the desired application and also their target, which makes it possible to optimize the expression of genes of interest. The use of the tools described above, such as the DNA chips, also makes it possible to pinpoint all the genes the regulation of which is modified by inactivation of certain genes. It is thus possible to select a set of control sequences corresponding, to within a few slight differences, to the same type of regulation. These sequences may then be used to control the expression of genes of interest.
The invention also relates to the polypeptides encoded by a nucleotide sequence according to the invention, preferably by a representative fragment of the sequence SEQ ID NO. 1, and corresponding to an ORF sequence, as described in table I. In particular, the polypeptides of Listeria monocytogenes characterized in that they are chosen from the polypeptides of sequence SEQ ID NO. 2 to SEQ ID NO. 2854, preferably of sequence SEQ ID NO. 2 to SEQ ID NO. 41 and SEQ ID NO. 42 to SEQ ID NO. 64, are a subject of the invention.
The invention also comprises the polypeptides characterized in that they comprise a polypeptide chosen from:
- a) a polypeptide according to the invention;
- b) a polypeptide exhibiting at least 80%, preferably 85%, 90%, 95% and 98%, identity with a polypeptide according to the invention;
- c) a fragment of at least 5 amino acids of a polypeptide according to the invention, or as defined in b);
- d) a biologically active fragment of a polypeptide according to the invention, or as defined in b) or c); and
- e) a modified polypeptide of a polypeptide according to the invention, or as defined in b), c) or d).
The nucleotide sequences encoding the polypeptides described above are also a subject of the invention.
In the present description, the terms “polypeptides”, “polypeptide sequences”, “peptides” and “proteins” are interchangeable.
It should be understood that the invention does not relate to the polypeptides in natural form, i.e. they are not taken in their natural environment, but it has been possible to isolate or obtain them by purification from natural sources, or else obtain them by genetic recombination or by chemical synthesis, and they may then comprise unnatural amino acids, as will be described below.
The expression “polypeptide having a certain percentage identity with another”, which will also be denoted by the term “homologous polypeptide”, is intended to denote polypeptides which, compared to natural polypeptides, have certain modifications, in particular a deletion, addition or substitution of at least one amino acid, a truncation, an extension, a chimeric solution and/or a mutation, or the polypeptides which have post-translational modifications. Among the homologous polypeptides, preference is given to those the amino acid sequence of which exhibits at least 80%, preferably 85%, 90%, 95% and 98%, homology with the amino acid sequences of the polypeptides according to the invention. In the case of a substitution, one or more consecutive or nonconsecutive amino acid(s) is (are) replaced with “equivalent” amino acids. The expression “equivalent amino acids” is herein intended to denote any amino acid which can be substituted for one of the amino acids of the basic structure without, however, essentially modifying the biological activities of the corresponding peptides, and as will be defined subsequently.
These equivalent amino acids may be determined either based on their structural homology with the amino acids for which they substitute, or based on the results of comparative assays of biological activity between the various polypeptides liable to be produced.
By way of example, mention is made of the possibilities of substitution which can be performed without resulting in a profound modification of the biological activity of the corresponding modified polypeptide. It is thus possible to replace leucine with valine or isoleucine, aspartic acid with glutamic acid, glutamine with asparagine, arginine with lysine, etc, it naturally being possible to envisage the reverse substitution under the same conditions.
The homologous polypeptides also correspond to the polypeptides encoded by the homologous or identical nucleotide sequences as previously defined, and thus comprise, in the present definition, mutated polypeptides or polypeptides corresponding to inter- or intraspecies variations, possibly existing in Listeria, and which in particular correspond to truncations, substitutions, deletions and/or additions of at least one amino acid residue.
It is understood that the percentage identity between two polypeptides is calculated in the same way as between two nucleic acid sequences. Thus, the percentage identity between two polypeptides is calculated after optimal alignment of these two sequences, on a window of maximum homology. To define said window of maximum homology, it is possible to use the same algorithms as for the nucleic acid sequences.
The expression “biologically active fragment of a polypeptide according to the invention” is intended in particular to denote a polypeptide fragment, as defined below, having at least one of the biological characteristics of the polypeptides according to the invention, in particular in that it is capable of generally exercising an activity, even a partial activity, such as, for example:
- an enzymatic (metabolic) activity or an activity which may be involved in the biosynthesis or the biodegradation of organic or inorganic compounds;
- a structural activity (cell envelope, chaperone molecule, ribosome);
- a transport activity (transporting energy, transporting an ion); or an activity in protein secretion;
- an activity in the process of replication, amplification, preparation, transcription, translation or maturation, in particular of DNA, of RNA or of proteins.
The expression “polypeptide fragment according to the invention” is intended to denote a polypeptide comprising a minimum of 5 amino acids, preferably 10, 15, 25, 50, 100 and 150 amino acids.
The polypeptide fragments may correspond to isolated or purified fragments naturally present in strains of Listeria, or to fragments which may be obtained by cleaving said polypeptide with a proteolytic enzyme such as trypsin or chymotrypsin or collagenase, or with a chemical reagent (cyanogen bromide, CNBr), or by placing said polypeptide in a very acidic environment (for example at pH=2.5). Polypeptide fragments may also be prepared by chemical synthesis, and using hosts transformed with an expression vector according to the invention, which contain a nucleic acid allowing expression of said fragment and placed under the control of the appropriate regulatory and/or expression elements.
The “modified polypeptide” of a polypeptide according to the invention is intended to denote a polypeptide obtained by genetic recombination or by chemical synthesis, as described below, which has at least one modification compared to the normal sequence, preferably at most 10% of modified amino acids compared to the normal sequence. These modifications may in particular be carried on amino acids required for the specificity or the effectiveness of the activity, or responsible for the structural conformation, for the charge or for the hydrophobicity of the polypeptide according to the invention. It is thus possible to create polypeptides with equivalent, increased or decreased activity, or with equivalent, narrower or broader specificity. Among the modified polypeptides, mention should be made of the polypeptides in which up to five amino acids may be modified, truncated at the N— or C-terminal end, or else deleted, or added.
As is indicated, the aim of the modifications of a polypeptide are in particular:
- to allow the use thereof in methods of biosynthesis or biodegradation of organic or inorganic compounds,
- to allow the use thereof in methods of replication, amplification, repair and regulation of transcription, translation or maturation, in particular of DNA, RNA or proteins,
- to allow the improved secretion thereof,
- to modify the solubility thereof, or the effectiveness or specificity of the activity thereof, or alternatively to facilitate the purification thereof.
Chemical synthesis also has the advantage of being able to use unnatural amino acids or nonpeptide bonds. Thus, it may be advantageous to use unnatural amino acids, for example in D form, or analogs of amino acids, in particular sulfur-containing forms.
The present invention provides the nucleotide sequence of the genome of Listeria monocytogenes EGD-e, and also certain polypeptide sequences. Those skilled in the art may determine the other ORFs using known methods and suitable software.
Among the genes identified in the genomic sequence of L. monocytogenes, mention may in particular be made of the genes involved in vitamin B12 biosynthesis (SEQ ID NO. 42 to SEQ ID NO. 64). This bacterium is thus capable of naturally synthesizing this vitamin, and knowledge of the genes leading to their synthesis allows those skilled in the art to optimize expression of these genes or to modify them for the purpose of increasing production of this vitamin. Thus, a subject of the present invention is also a method for producing vitamin B12, characterized in that a host cell containing the genes corresponding to SEQ ID NO. 42 to SEQ ID NO. 64 is provided with the starting substrate, in that it is cultured under conditions suitable for the production of vitamin B12, and in that said vitamin is recovered. The host cell is preferably a bacterial cell, more preferably a bacterium of the Bacillus or Listeria genus. A method for producing vitamin B12 using a nucleic acid or polypeptide sequence according to the invention, a host cell according to the invention, or an animal or plant according to the invention is also a subject of the present invention.
In general, the list of SEQ ID sequences, or their corresponding coding nucleic acid sequence, may be determined by those skilled in the art using the most probable putative functions determined for each of the SEQ ID sequences in table I hereinafter for each of the classes of activity listed hereinafter.
Thus and preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in vitamin B12 biosynthesis. It is preferably a polypeptide sequence of SEQ ID NO. 42 to SEQ ID NO. 64.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a cell envelope polypeptide or polypeptide at the surface of Listeria monocytogenes, or a fragment thereof. It is preferably a polypeptide of sequence SEQ ID NO. 2 to SEQ ID NO. 41.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in amino acid biosynthesis.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the biosynthesis of cofactors, prosthetic groups and transporters.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the cellular machinery.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in central intermediate metabolism.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in energetic metabolism.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the metabolism of fatty acids and phospholipids.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the metabolism of nucleotides, purines, pyrimidines or nucleosides.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in regulatory functions.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the replication process.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the transcription process.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the translation process.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the process of transport and binding of proteins.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in adaptation to atypical conditions.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in sensitivity to medicinal products and analogs.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in functions relating to transposons.
Preferably, the invention relates to a nucleotide sequence according to the invention, characterized in that it encodes a polypeptide specific for Listeria monocytogenes, or a fragment thereof.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in vitamin B12 biosynthesis. It is preferably a polypeptide of sequence SEQ ID NO. 42 to SEQ ID NO. 64.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a cell envelope polypeptide or a surface polypeptide of Listeria monocytogenes, or a fragment thereof. It is preferably a polypeptide of sequence SEQ ID NO. 2 to SEQ ID NO. 41.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in amino acid biosynthesis.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the biosynthesis of cofactors, prosthetic groups and transporters.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the cellular machinery.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in central intermediate metabolism.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in energetic metabolism.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the metabolism of fatty acids and phospholipids.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the metabolism of nucleotides, purines, pyrimidines or nucleosides.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in regulatory functions.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the replication process.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the transcription process.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the translation process.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in the process of transport and binding of proteins.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in adaptation to atypical conditions.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in sensitivity to medicinal products and analogs.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide of Listeria monocytogenes, or a fragment thereof, involved in functions relating to transposons.
In another aspect, a subject of the invention is preferably a polypeptide according to the invention, characterized in that it is a polypeptide specific for Listeria monocytogenes, or a fragment thereof.
It is important to note, however, that a living organism is a whole entity and should be taken as such. Thus, in order to be able to develop and to exhibit its properties, any organism needs interactions between the various metabolic pathways. Thus, the classification stated above should not be considered to be limiting, a gene possibly being involved in two different metabolic pathways.
A subject of the present invention is also the nucleotide and/or polypeptide sequences according to the invention, characterized in that said sequences are recorded on a recording medium, the form and nature of which facilitate the reading, analysis and/or exploitation of said sequence(s). These media may also contain other information extracted from the present invention, in particular the analogies with already known sequences, and/or information concerning the nucleotide and/or polypeptide sequences of other microorganisms, in order to facilitate the comparative analysis and exploitation of the results obtained.
Among these said recording media, preference is given in particular to computer-readable media, such as magnetic, optical, electrical or hybrid media, in particular computer disks, CD-ROMs and computer servers. Such recording media are also a subject of the invention.
The recording media according to the invention, with the information provided, are very useful for choosing nucleotide primers or probes for determining genes in Listeria monocytogenes or strains close to this organism. Similarly, the use of these media for studying the genetic polymorphism of a strain close to Listeria monocytogenes, in particular by determining the regions of colinearity, is very useful insofar as these media provide not only the nucleotide sequence of the genome of Listeria monocytogenes egb, but also the genomic organization in said sequence. Thus, the uses of recording media according to the invention are also subjects of the invention.
The analysis of homology between various sequences is in fact advantageously performed using sequence comparison programs, such as the Blast program or the programs of the GCG package, described above.
The invention is also directed toward the cloning and/or expression vectors which contain a nucleotide sequence according to the invention. Preference is particularly given to the nucleotide sequences encoding cell envelope or surface polypeptides, or polypeptides involved in the cellular machinery, in particular secretion, central intermediate metabolism, in particular sugar production, energetic metabolism, and the processes of vitamin B12 synthesis, of transcription and translation and of polypeptide synthesis.
The vectors according to the invention preferably comprise elements which allow expression and/or secretion of the nucleotide sequences in a given host cell.
The vector should then comprise a promoter, translation initiation and determination signals, and also regions suitable for regulating transcription. It must be possible for it to be maintained stably in the host cell and it may optionally contain particular signals which specify secretion of the translated protein. These various elements are chosen and optimized by those skilled in the art depending on the cellular host used. To this effect, the nucleotide sequences according to the invention may be inserted into vectors which replicate autonomously in the host chosen, or may be vectors which integrate in the host chosen.
Such vectors are prepared by methods commonly used by those skilled in the art, and the resulting clones may be introduced into a suitable host using standard methods, such as lipofection, electroporation, heat shock or chemical methods.
The vectors according to the invention are, for example, vectors of plasmid or viral origin. They are of use in transforming host cells in order to clone or express the nucleotide sequences according to the invention.
The invention also comprises the host cells transformed with a vector according to the invention.
The cellular host may be chosen from prokaryotic or eukaryotic systems, for example bacterial cells but also yeast cells or animal cells, in particular mammalian cells. Insect cells or plant cells may also be used. The preferred host cells according to the invention are in particular prokaryotic cells, preferably bacteria belonging to the Listeria genus, to the species Listeria monocytogenes, or microorganisms associated with the species Listeria monocytogenes. The invention also relates to the animals, except humans, which comprise a transformed cell according to the invention. The transformed cells according to the invention can be used in methods for preparing recombinant polypeptides according to the invention. The methods for preparing a polypeptide according to the invention in recombinant form. characterized in that they use a vector and/or a cell transformed with a vector according to the invention, are themselves included in the present invention. Preferably, a cell transformed with a vector according to the invention is cultured under conditions which allow expression of said polypeptide, and said recombinant peptide is recovered. The host cells according to the invention may also be used for preparing food compositions, which are themselves a subject of the present invention.
As has been mentioned, the cellular host may be chosen from prokaryotic or eukaryotic systems. In particular, it is possible to identify nucleotide sequences according to the invention which facilitate secretion in such a prokaryotic or eukaryotic system. A vector according to the invention carrying such a sequence may therefore be advantageously used for producing recombinant proteins intended to be secreted. As a result, the purification of these recombinant proteins of interest will be facilitated by the fact that they are present in the cell culture supernatant rather than inside the host cells.
The polypeptides according to the invention may also be prepared by chemical synthesis. Such a method of preparation is also a subject of the invention. Those skilled in the art are aware of the methods of chemical synthesis, for example techniques using solid phases (see in particular Steward et al., 1984, Solid phase peptides synthesis, Pierce Chem. Company, Rockford, 111, 2nd ed. (1984)) or techniques using partial solid phases, by fragment condensation or by conventional synthesis in solution. The polypeptides obtained by chemical synthesis, and possibly comprising corresponding unnatural amino acids, are also included in the invention.
The invention also relates to hybrid polypeptides having at least one polypeptide, or a fragment thereof, according to the invention, and a sequence of a polypeptide capable of inducing an immune response in humans or animals.
Advantageously, the antigenic determinant is such that it is capable of inducing a humoral and/or cellular response.
Such a determinant may comprise a polypeptide, or a fragment thereof, according to the invention in a glycosylated form used for the purpose of obtaining immunogenic compositions capable of inducing the synthesis of antibodies directed against multiple epitopes. Said polypeptides, or the glycosylated fragments thereof, are also part of the invention.
These molecules may consist partly of a molecule bearing polypeptides, or fragments thereof, according to the invention, combined with an optionally immunogenic component, in particular an epitope of diphtheria toxin, tetanus toxin, a surface antigen of the hepatitis B virus (patent FR 79 21811), the VP1 antigen of the poliomyelitis virus or any other viral or bacterial antigen or toxin.
The methods for synthesizing the hybrid molecules encompass the methods used in genetic engineering for constructing hybrid nucleotide sequences encoding the desired polypeptide sequences. Reference may, for example, advantageously be made to the technique for obtaining genes encoding fusion proteins, described by Minton in 1984.
Said hybrid nucleotide sequences encoding a hybrid polypeptide and also the hybrid polypeptides according to the invention, characterized in that they are recombinant polypeptides obtained by expressing said hybrid nucleotide sequences, are also part of the invention.
The invention also comprises the vectors characterized in that they contain one of said hybrid nucleotide sequences. The host cells transformed with said vectors, the transgenic animals comprising one of said transformed cells and also the methods for preparing recombinant polypeptides using said vectors, said transformed cells and/or said transgenic animals are, of course, also part of the invention.
The coupling between a polypeptide according to the invention and an immunogenic polypeptide may be carried out chemically or biologically. Thus, according to the invention, it is possible to introduce one or more attachment elements, in particular amino acids, so as to facilitate the reactions for coupling between the polypeptide according to the invention and the immunostimulatory polypeptide, the covalent coupling of the immunostimulatory antigen possibly taking place at the N— or C-terminal end of the polypeptide according to the invention. Bifunctional reagents for this coupling are determined as a function of the end chosen for carrying out this coupling, and the coupling techniques are well known to those skilled in the art.
The conjugates derived from a peptide coupling may be prepared by genetic recombination. The hybrid peptide (conjugate) may in effect be produced by recombinant DNA techniques, by insertion or addition of a sequence encoding the antigenic, immunogenic or hapten peptide(s) into or to the DNA sequence encoding the polypeptide according to the invention. These techniques for preparing hybrid peptides by genetic recombination are well known to those skilled in the art (see, for example, Makrides, 1996, Microbiological Reviews 60, 512-538).
Preferably, said immune polypeptide is chosen from the group of peptides containing toxoids, in particular diphtheria toxoid or tetanus toxoid, streptococcus-derived proteins (such as the human serum albumin-binding protein), OMPA membrane proteins and outer membrane protein complexes, outer membrane vesicles or heat shock proteins.
The hybrid polypeptides according to the invention are very useful for obtaining monoclonal or polyclonal antibodies capable of specifically recognizing the polypeptides according to the invention. In fact, a hybrid polypeptide according to the invention allows potentiation of the immune response, against the polypeptide according to the invention coupled to the immunogenic molecule. Such monoclonal or polyclonal antibodies, fragments thereof or the chimeric antibodies, which recognize the polypeptides according to the invention, are also subjects of the invention.
The specific monoclonal antibodies may be obtained according to the conventional method of hybridoma culturing described by Köhler and Milstein (1975, Nature 256, 495).
The antibodies according to the invention are, for example, chimeric antibodies, humanized antibodies, or Fab or F(ab′)2 fragments. It may also be in the form of an immunoconjugate or of an antibody which is labeled in order to obtain a detectable and/or quantifiable signal.
Thus, the antibodies according to the invention may be used in a method for depicting and/or identifying bacteria belonging to the species Listeria monocytogenes, or to an associated microorganism, in a biological sample, characterized in that it comprises the following steps:
- a) bringing the biological sample into contact with an antibody according to the invention;
- b) demonstrating the antigen-antibody complex possibly formed.
The antibodies according to the present invention can also be used in order to detect expression of a gene of Listeria monocytogenes or of associated microorganisms. Specifically, the presence of the expression product of a gene recognized by an antibody specific for said expression product can be detected via the presence of an antigen-antibody complex formed after the Listeria monocytogenes strain or the associated microorganism has been brought into contact with an antibody according to the invention. The bacterial strain used may have been “prepared”, i.e. centrifuged, lyzed, and placed in an appropriate reagent for constituting the medium suitable for the immunoreaction. In particular, preference is given to a method for detecting the expression in the gene corresponding to a Western blot, which may be performed after polyacrylamide gel electrophoresis of a lysate of the bacterial strain, in the presence or absence of reducing conditions (SDS-PAGE). After migration and separation of the proteins on the polyacrylamide gel, said proteins are transferred onto a suitable membrane (for example made of nylon) and the presence of the protein or of the polypeptide of interest is detected by bringing said membrane into contact with an antibody according to the invention.
Thus, the present invention also comprises the kits or packs for carrying out a method as described (for detecting the expression of a gene of Listeria monocytogenes, or an associated microorganism, or for detecting and/or identifying bacteria belonging to the species Listeria monocytogenes, or an associated microorganism), comprising the following elements:
- a) a polyclonal or monoclonal antibody according to the invention;
- b) optionally, the reagents for constituting the medium suitable for the immunoreaction;
- c) optionally, the reagents for demonstrating the antigen-antibody complexes produced by the immunoreaction.
The polypeptides and the antibodies according to the invention may advantageously be immobilized on a support, in particular a protein chip. Such a protein chip is a subject of the invention and may also contain at least one polypeptide of a microorganism other than Listeria monocytogenes, or an antibody directed against a compound of a microorganism other than Listeria monocytogenes.
The protein chips or high density filters containing proteins according to the invention may be constructed in the same way as the DNA chips according to the invention. In practice, it is possible to carry out the synthesis of the polypeptides attached directly to the protein chip, or to carry out an ex situ synthesis followed by a step of attaching the synthesized polypeptide to said chip. The latter method is preferable when the intention is to attach proteins of considerable size to the support, which are advantageously prepared by genetic engineering. However, if the intention is to attach only peptides to the support of said chip, it may be more advantageous to synthesize said peptides directly in situ.
The protein chips according to the invention may advantageously be used in kits or packs for detecting and/or identifying bacteria associated with the species Listeria monocytogenes, or with a microorganism, or more generally in kits or packs for detecting and/or identifying microorganisms. When the polypeptides according to the invention are attached to DNA chips, the presence of antibodies in the samples tested is sought, the attachment of an antibody according to the invention to the support of the protein chip allowing identification of the protein for which said antibody is specific.
Preferably, an antibody according to the invention is attached to the support of the protein chip and the presence of the corresponding antigen, specific for Listeria monocytogenes, or for an associated microorganism, is detected.
A protein chip described above may be used for detecting gene products, in order to establish an expression profile for said genes, in addition to a DNA chip according to the invention.
The protein chips according to the invention are also extremely useful for proteomic experiments, which study interactions between the various proteins of a given microorganism. In a simplified manner, peptides representative of the various proteins of an organism are attached to a support. Said support is brought into contact with labeled proteins and, after an optional rinsing step, interactions between said labeled proteins and the peptides attached to the protein chip are detected.
Thus, the protein chips comprising a polypeptide sequence according to the invention are an antibody according to the invention are a subject of the invention, as are the kits or packs containing them.
The present invention also covers a method for detecting and/or identifying bacteria belonging to the species Listeria monocytogenes, or to an associated microorganism, in a biological sample, which uses a nucleotide sequence according to the invention.
It should be clearly understood that, in the present invention, the term “biological sample” concerns samples taken from a living organism (in particular blood, tissues, organs or other samples taken from a mammal) or a sample containing biological material, i.e. DNA. Such a biological sample therefore encompasses food compositions containing bacteria (for example cheeses, dairy products), but also food compositions containing yeast (beers, breads) or others.
The method for detection and/or identification using the nucleotide sequences according to the invention may be diverse in nature.
A method comprising the following steps is preferred:
- a) optionally isolating the DNA from the biological sample to be analyzed, or obtaining a cDNA from the RNA of the biological sample;
- b) specifically amplifying the DNA of bacteria belonging to the species Listeria monocytogenes, or to an associated microorganism, using at least one primer according to the invention;
- c) demonstrating the amplification products.
This method is based on the specific amplification of the DNA, in particular via a polymerase chain reaction.
A method comprising the following steps is also preferred:
- a) bringing a nucleotide probe according to the invention into contact with a biological sample, the nucleic acid contained in the biological sample having, where appropriate, been previously made accessible to hybridization, under conditions which allow hybridization of the probe to the nucleic acid of a bacterium belonging to the species Listeria monocytogenes, or to an associated microorganism;
- b) demonstrating the hybrid possibly formed between the nucleotide probe and the DNA of the biological sample.
Such a method should not be limited to detecting the presence of the DNA contained in the biological sample in question, it may also be used to detect the RNA contained into said sample. This method in particular encompasses Southern and Northern blotting.
Another preferred method according to the invention comprises the following steps:
- a) bringing a nucleotide probe immobilized on a support according to the invention into contact with a biological sample, the nucleic acid of the sample having, where appropriate, previously been made accessible to hybridization, under conditions which allow hybridization of the probe to the nucleic acid of a bacterium belonging to the species Listeria monocytogenes, or to an associated microorganism;
- b) bringing the hybrid formed between the nucleotide probe immobilized on a support and the nucleic acid contained in the biological sample, where appropriate after removing the DNA of the biological sample which has not hybridized with the probe, into contact with a labeled nucleotide probe according to the invention;
- c) demonstrating the new hybrid formed in step b).
This method is advantageously used with a DNA chip according to the invention, the nucleic acid being sought hybridizing with a probe present at the surface of said chip, and being detected using a labeled probe. This method is advantageously carried out by combining a prior step of amplifying the DNA or the complementary DNA optionally obtained by reverse transcription, using primers according to the invention.
Thus, the present invention also encompasses the kits or packs for detecting and/or identifying bacteria belonging to the species Listeria monocytogenes, or to an associated microorganism, characterized in that it comprises the following elements:
- a) a nucleotide probe according to the invention;
- b) optionally, the reagents required for carrying out a hybridization reaction;
- c) optionally, at least one primer according to the invention and also the reagents required for a DNA amplification reaction.
Similarly, the present invention also encompasses the kits or packs for detecting and/or identifying bacteria belonging to the species Listeria monocytogenes, or to an associated microorganism, characterized in that it comprises the following elements:
- a) a nucleotide probe, termed capture probe, according to the invention;
- b) an oligonucleotide probe, termed detection probe, according to the invention;
- c) optionally, at least one primer according to the invention and also the reagents required for a DNA amplification reaction.
Finally, the kits or packs for detecting and/or identifying bacteria belonging to the species Listeria monocytogenes, or to an associated microorganism, characterized in that it comprises the following elements:
- a) at least one primer according to the invention;
- b) optionally, the reagents required for carrying out a DNA amplification reaction;
- c) optionally, a component for verifying the sequence of the amplified fragment, more particularly an oligonucleotide probe according to the invention,
- are also subjects of the present invention.
Preferably, said primers and/or probes and/or polypeptides and/or antibodies according to the present invention, used in the methods and/or kits or packs according to the present invention, are chosen from the primers and/or probes and/or polypeptides and/or antibodies specific for the species Listeria monocytogenes. Preferably, these elements are chosen from the nucleotide sequences encoding a secreted protein, from the secreted polypeptides, or from the antibodies directed against secreted polypeptides of Listeria monocytogenes.
A subject of the present invention is also the strains of Listeria monocytogenes, and/or of associated microorganisms, containing one or more mutation(s) in a nucleotide sequence according to the invention, in particular an ORF sequence, or regulatory elements thereof (in particular promoters).
According to the invention, preference is given to the strains of Listeria monocytogenes having one or more mutation(s) in the nucleotide sequences encoding polypeptides involved in the cellular machinery, in particular secretion, central intermediate metabolism, energetic metabolism, and processes of amino acid synthesis, of transcription and translation, and of polypeptide synthesis.
Said mutations may lead to inactivation of this gene or, in particular when they are located in the regulatory elements of said gene, to overexpression of this gene.
The invention also relates to the use of a nucleotide sequence according to the invention, of a polypeptide according to the invention, of an antibody according to the invention, of a cell according to the invention and/or of a transformed animal according to the invention, for selecting an organic or inorganic compound capable of modulating, regulating, inducing or inhibiting gene expression, and/or of modifying cell replication in eukaryotic or prokaryotic cells, or capable of inducing, inhibiting or worsening pathological conditions linked to an infection with Listeria monocytogenes, or a microorganism associated therewith.
The invention also comprises a method for selecting compounds capable of binding to a polypeptide, or a fragment thereof, according to the invention, capable of binding to a nucleotide sequence according to the invention, or capable of recognizing an antibody according to the invention, and/or capable of modulating, regulating, inducing or inhibiting gene expression, and/or modifying the cell growth or replication in eukaryotic or prokaryotic cells, or capable of inducing, inhibiting or worsening, in an animal or human organism, pathological conditions linked to an infection with Listeria monocytogenes, or a microorganism associated therewith, characterized in that it comprises the following steps:
- a) bringing said compound into contact with said polypeptide or said nucleotide sequence or with a transformed cell according to the invention and/or administering said compound to a transformed animal according to the invention;
- b) determining the ability of said compound to bind to said polypeptide or said nucleotide sequence, or to modulate, regulate, induce or inhibit gene expression, or to modulate cell growth or replication, or to induce, inhibit or worsen, in said transformed animal, pathological conditions linked to an infection with Listeria monocytogenes or a microorganism associated therewith.
The transformed cells and/or animals according to the invention may advantageously serve as a model and be used in methods for studying, identifying and/or selecting compounds which may be responsible for pathological conditions induced or worsened by Listeria monocytogenes, or which may prevent and/or treat these pathological conditions, such as, for example, genital, eye or systemic diseases, in particular diseases of the lymphatic system. In particular, the transformed host cells, especially the bacteria of the Listeriae family, the transformation of which with a vector according to the invention may, for example, increase or inhibit its infectious capacity, or modulate the pathological conditions usually induced or worsened by the infection, may be used to infect animals in which the appearance of the pathological conditions will be monitored. These nontransformed animals, infected, for example, with transformed Listeriae bacteria, may serve as a study model. In the same way, the transformed animals according to the invention may be used in methods for selecting compounds capable of preventing and/or treating diseases due to Listeria. Said methods using said transformed cells and/or transformed animals are part of the invention.
The compounds liable to be selected may be organic compounds, such as polypeptides or carbohydrates, or any other already known organic or inorganic compounds, or new organic compounds developed using molecular modeling techniques and obtained by chemical or biochemical synthesis, these techniques being known to those skilled in the art.
Said selected compounds may be used for modulating cell growth and/or replication in Listeria monocytogenes, or any other associated microorganism, and also for controlling infection with these microorganisms. Said compounds according to the invention may also be used for modulating cell growth and/or replication in any eukaryotic or prokaryotic cells, in particular tumor cells and infectious microorganisms, for which said compounds will prove to be active, the methods for determining said modulations being well known to those skilled in the art.
The expression “compound capable of modulating the growth of a microorganism” is intended to denote any compound making it possible to intervene in, modify, limit and/or reduce the development, growth, rate of proliferation and/or viability of said microorganism.
This modulation may be carried out, for example, using an agent capable of binding to a protein and thus of inhibiting or potentiating its biological activity, or capable of binding to an outer surface membrane protein of a microorganism and blocking the penetration of said microorganism into the host cell or promoting the action of the immune system of the infected organism, directed against said microorganism. This modulation may also be carried out using an agent capable of binding to a nucleotide sequence of a DNA or RNA of a microorganism and blocking, for example, the expression of a polypeptide the biological or structural activity of which is necessary for the growth or for the reproduction of said microorganism.
In the present invention, the term “associated microorganism” is intended to denote any microorganism in which the gene expression may be modulated, regulated, induced or inhibited, or the cell growth or replication of which may also be modulated, by a compound of the invention. In the present invention, the term “associated microorganism” is also intended to denote any microorganism comprising nucleotide sequences or polypeptides according to the invention. These microorganisms may, in certain cases, comprise polypeptides or nucleotide sequences identical or homologous to those of the invention [lacuna] may also be detected and/or identified using the methods or kit for detection and/or identification according to the invention and may also serve as a target for the compounds of the invention.
The invention relates to the compounds which may be selected using a election method according to the invention.
- The invention also relates to a pharmaceutical composition comprising a compound chosen from the following compounds:
- a) a nucleotide sequence according to the invention;
- b) a polypeptide according to the invention;
- c) a vector according to the invention;
- d) an antibody according to the invention; and
- e) a compound which may be selected using a selection method according to the invention, optionally in combination with a pharmaceutically acceptable vehicle.
The term “effective amount” is intended to denote a sufficient amount of said compound or antibody, or of polypeptide of the invention, for modulating the growth of Listeria monocytogenes or of an associated microorganism.
The invention also relates to a pharmaceutical composition according to the invention, for preventing or treating an infection with a bacterium belonging to the species Listeria monocytogenes , or with an associated microorganism.
The invention is also directed toward an immunogenic and/or vaccinal composition, characterized in that it comprises one or more polypeptides according to the invention and/or one or more hybrid polypeptides according to the invention.
The invention also comprises the use of a transformed cell according to the invention, for preparing a vaccinal composition.
The invention is also directed toward a vaccinal composition, characterized in that it contains a nucleotide sequence according to the invention, a vector according to the invention and/or a transformed cell according to the invention.
The invention also relates to the vaccinal compositions according to the invention, for preventing or treating an infection with a bacterium belonging to the species Listeria monocytogenes, or with an associated microorganism.
Preferably, the immunogenic and/or vaccinal compositions according to the invention intended for the prevention and/or treatment of infection with Listeria monocytogenes, or with an associated microorganism will be chosen from the immunogenic and/or vaccinal compositions comprising a polypeptide, or a fragment thereof, corresponding to a protein, or a fragment thereof, of the cell envelope of Listeria monocytogenes. The vaccinal compositions comprising nucleotide sequences will preferably also comprise nucleotide sequences encoding a polypeptide, or a fragment thereof, corresponding to a protein, or a fragment thereof, of the cell envelope of Listeria monocytogenes.
Among these preferred immunogenic and/or vaccinal compositions, the most preferred are those comprising a polypeptide, or a fragment thereof, or a nucleotide sequence, or a fragment thereof, the sequences of which are chosen from the nucleotide or amino acid sequences identified in this functional group and listed previously.
The polypeptides of the invention, or the fragments thereof, which are part of the immunogenic compositions according to the invention may be selected using techniques known to those skilled in the art, such as, for example, the ability of said polypeptides to stimulate T cells, which, for example, causes the proliferation thereof or the secretion of interleukins, and which results in the production of antibodies directed against said polypeptides.
In mice, to which a weight dose of the vaccinal composition comparable to the dose used in humans is administered, the antibody reaction is tested by taking a serum sample and then studying the formation of a complex between the antibodies present in the serum and the antigen of the vaccinal composition, according to usual techniques.
According to the invention, said vaccinal compositions will preferably be in combination with a pharmaceutically acceptable vehicle and, where appropriate, with one or more suitable adjuvants of immunity.
Today, various types of vaccine are available for protecting humans against infectious diseases: attenuated live microorganisms (M. bovis—BCG for tuberculosis), inactive microorganisms (flu virus), acellular extracts (Bordetella pertussis for whooping cough), recombined proteins (hepatitis B virus surface antigens) and polysaccharides (pneumococci). Vaccines prepared from synthetic peptides or from genetically modified microorganisms expressing heterologous antigens are undergoing experimentation. Even more recently, recombined plasmid DNAs carrying genes encoding protective antigens have been proposed as an alternative vaccinal strategy. This type of vaccination is performed with a particular plasmid derived from an E. coli plasmid which does not replicate in vivo and which encodes only the immunizing protein. Animals have been immunized by simply injecting the naked plasmid DNA into muscle. This technique leads to the expression of the immunizing protein in situ and to an immune response of the cellular type (CTL) and of the humoral type (antibodies). This double induction of the immune response is one of the main advantages of the technique of vaccination with naked DNA.
The vaccinal compositions comprising nucleotide sequences or vectors into which said sequences are inserted are in particular described in international application No. WO 90/11092 and also in international application No. WO 95/11307.
The nucleotide sequence constituting the vaccinal composition according to the invention may be injected into the host after having been coupled to compounds which promote penetration of this polynucleotide into the cell or its transport as far as the cell nucleus. The resulting conjugates may be encapsulated in polymeric microparticles, as described in international application No. WO 94/27238 (Medisorb Technologies International).
According to another embodiment of the vaccinal composition according to the invention, the nucleotide sequence, preferably a DNA, is complexed with DEAE-dextran, with nuclear proteins or with lipids, or encapsulated in liposomes, or alternatively introduced in the form of a gel which facilitates its transfection into cells. The polynucleotide or the vector according to the invention may also be in suspension in a buffer solution or may be associated with liposomes.
Advantageously, such a vaccine will be prepared in accordance with the technique described by Tacson et al. or Huygen et al. in 1996, or else in accordance with the technique described by Davis et al. in international application No. WO 95/11307.
Such a vaccine may also be prepared in the form of a composition containing a vector according to the invention, placed under the control of regulatory elements for its expression in humans or animals. As a vector for in vivo expression of the polypeptide antigen of interest, use may, for example, be made of the plasmid pcDNA3 or the plasmid pcDNA1/neo, both marketed by Invitrogen (R & D Systems, Abi{overscore (ng)}don, United Kingdom). Such a vaccine will advantageously comprise, besides the recombinant vector, a saline solution, for example a sodium chloride solution.
The expression “pharmaceutically acceptable vehicle” is intended to denote a compound, or a combination of compounds, included in a pharmaceutical or vaccinal composition, which does not cause any side reactions and which makes it possible, for example, to facilitate administration of the active compound, to increase the lifetime thereof and/or the effectiveness thereof in the organism, to increase the solubility thereof in solution or else to improve the conservation thereof. These pharmaceutically acceptable vehicles are well known and will be adjusted by those skilled in the art depending on the nature and on the method of administration of the active compound chosen.
With regard to the vacinnal formulations, they may comprise suitable adjuvants of immunity which are known to those skilled in the art, such as, for example, aluminum hydroxide, a representative of the muramyl peptide family, such as one of the peptide derivatives of N-acetylmuramyl, a bacterial lyzate, or incomplete Freund's adjuvant.
Preferably, these compounds will be administered systemically, in particular intravenously, intramuscularly, intradermally or subcutaneously, or orally. More preferably, the vaccinal composition comprising polypeptides according to the invention will be administered several times, spread out over time, intradermally or subcutaneously.
The optimal methods of administration, doses and pharmaceutical forms of these compounds can be determined according to the criteria generally taken into account in establishing a suitable treatment for a patient, such as, for example, the age or body weight of the patient, the seriousness of his or her general condition, the tolerance to the treatment and the side effects noted.
The invention comprises the use of a composition according to the invention, for treating or preventing genital diseases induced or worsened by Listeria monocytogenes.
Finally, the invention comprises the use of a composition according to the invention, for treating or preventing diseases induced or worsened by the presence of Listeria monocytogenes.
Finally, the invention comprises the use of a composition according to the invention, for treating or preventing systemic diseases, in particular diseases of the lymphatic system, induced or worsened by the presence of Listeria monocytogenes.
Moreover, a subject of the present invention is also a genomic DNA library of a bacterium of the Listeria genus, preferably Listeria monocytogenes, preferably the EGD-e strain, said DNA library being cloned in bacterial artificial chromosomes (BACs). Such a genomic DNA library contains very large inserts of the Listeria genome, in particular inserts of between 50 and 200 kb in length.
One of the advantages of using the BAC system compared to a cosmid system is that there is only one or a maximum of two copies of the plasmids used per transformed cell, which decreases the potential for recombination between DNA fragments and, more importantly, which eliminates the risk of lethal overexpression of bacterial cloned genes. However, the presence of the BAC as a single copy means that the plasmid DNA must be extracted from a large volume of culture in order to obtain sufficient DNA for the sequence. In addition, the stability and fidelity with which the clones are maintained in a BAC library allows the identification of genomic differences between various Listeria strains, and the identification of these genetic differences which may be responsible for the phenotypic variations observed between the various strains.
The genomic DNA library described in the present invention, in particular the LM_baclim library deposited with the CNCM [National Collection of Microorganism Cultures] on Apr. 11, 2000, under the number I-2439, in fact covers the Listeria monocytogenes genome. However, although certain regions could not be cloned into said library, due to problems of lethality in Escherichia coli, these regions can easily be amplified and identified by those skilled in the art, using oligonucleotides specific for the sequences of the ends of the various clones which form the contigs.
The present invention also relates to the methods for isolating a polynucleotide of interest present in a strain of Listeria and absent from another strain, which use as at least one DNA library based on a BAC, containing the Listeria genome. The method according to the invention for isolating a polynucleotide of interest may comprise the following steps:
- a) isolating at least one polynucleotide contained in a clone from the DNA library based on a BAC, of listeria origin;
- b) isolating:
- at least one genomic polynucleotide or cDNA of a listeria, said listeria belonging to a strain which is different from the strain used to construct the BAC DNA library of step a) or, alternatively,
- at least one polynucleotide contained in a clone from a DNA library based on a BAC prepared from the genome of a listeria which is different from the listeria used to construct the DNA library based on the BAC of step a);
- c) hybridizing the polynucleotide of step a) to the polynucleotide of step b);
- d) selecting the polynucleotides of step a) which have not formed a hybridization complex with the polynucleotides of step b);
- e) characterizing the polynucleotide selected.
The polynucleotide of step a) may be prepared by digesting at least one recombinant BAC clone with a suitable restriction enzyme and, optionally, amplifying the polynucleotide insert which results therefrom.
Thus, the method of the invention allows those skilled in the art to perform comparative genomic studies between the various strains or species of the listeria genus, for example between the pathogenic strains and their nonpathogenic equivalents.
In particular, it is possible to study and determine the regions of polymorphism between said strains.
EXAMPLES
Example 1
Production of the BAC Library of Listeria monocytogenes DNA
Blocks containing the chromosomal DNA of Listeria monocytogenes, with an average weight of 80 mg, were prepared in agarose using methods known to those skilled in the art. They were kept in a solution of 500 mM EDTA, pH 8.0.
8 blocks are used to construct the library. These agarose blocks are washed twice for 30 min on ice in TE/PMSF (10 mM Tris-HCl, pH 8.0; 1 mM EDTA; 100 μM PMSF). The agarose blocks are then washed twice for 30 min on ice in TE (10 mM Tris-HCl, pH 8.0; 1 mM EDTA). Each agarose block is cut into eight thin slices, and sixteen slices are placed together in a single tube. The blocks are then incubated in a preincubation solution (20 μM spermidine, 5 mM DTT, 1×restriction buffer) for 40 min on ice. The preincubation buffer is changed once and incubated for a further 40 min.
The partial digestions are carried out in a digestion buffer (2 mM spermidine, 0.5 mM DTT, 0.02 μg BSA, 0.5×restriction buffer) for 30 min at 4° C., with no restriction enzyme, and then 0.1, 0.25, 0.4 or 0.5 units of EcoRI (Life Tech) per tube are added for 2 h at 37° C. The partial digestion is stopped by replacing the digestion buffer with 200 μl of 0.5M EDTA, the tubes being placed on ice.
The agarose blocks are then placed on a 1% SeaKem GTG agarose (FMC) gel containing 0.5×TBE buffer. Pulsed field gel electrophoresis is then carried out with the following conditions:
- initial pulse time: 90 seconds,
- final pulse time: 90 seconds,
- 6 V/cm,
- included angle: 120°,
- gel time: 16 h, 12° C.
The region between 50 and 200 kb is cut from the gel and separated into three pieces (50-100 kb, 100-150 kb, 150-200 kb). The gel for cloning should not be stained with ethidium bromide.
These agarose blocks are placed in a new 1% SeaKem GTG agarose (FMC) gel containing 0.5×TBE buffer. A further pulsed field gel electrophoresis is performed with the following conditions:
- initial pulse time: 5 seconds,
- final pulse time: 5 seconds,
- 4 V/cm,
- included angle: 120°,
- gel time: 15 h, 12° C.
The region between 50 and 200 kb is again cut out, without having stained the DNA with ethidium bromide, and separated into three pieces (50-100 kb, 100-150 kb, 150-200 kb).
The pieces of agarose are cut into small pieces, each of approximately 100 mg.
The agarose is incubated at 67° C. for 10 min and then cooled to 42° C., and 1 μl of beta-agarose (FMC, 1 U/μl) is added. The mixture is incubated for 30 min at 42° C., and the beta-agarose is denatured, after complete digestion of the agarose, by incubation for 10 min at 67° C. and then incubation on ice.
150 ng of DNA vectors digested with EcoRI and phosphorylated (CIP, Roche) are used to construct the BAC library. The vector pBelaBAC-Kan (Mozo et al., Mol. Gen. Genet., 1998, 258, 562-70) is used to construct said library. 150 ng of said vector are therefore incubated with 150 ng of DNA inserts, in a 1×ligation buffer with 5 units of ligase (USB 1 unit/μl) for 16 h at 12° C.
The ligation buffer is the buffer recommended by the manufacturer.
The transformation is carried out by adding 5 μl of the ligation reaction to 40 μl of DH10B electrocompetent cells, and the electroporation is performed in a Life Tech electroporator with the following conditions:
- 330 μF, 4 kΩ, DC volts: lowΩ, charge rate: fast, 1.5 mm electroporation cuvette.
1 mm of SOC culture medium is added to the cells, which are incubated for 45 min at 37° C. The cells are then plated out on a dish containing LB agar and which also contains 30 g/ml of kanamycin, 100 μg/ml of IPTG and 40 μg/ml of X-Gal. The recombinant cells which are white compared to the blue cells are selected, and the BAC library is characterized by preparing restriction maps for each of the clones thus obtained.
Example 2
Annotations and Analysis of the Genomic Sequence of Listeria monocytogenes
Bioinformatics has a key role in the three phases of a genome project: shotgun follow-up of the inserts produced using the random sequencing method, genome sequence closure phase, and annotations. The inventors have developed a complete software package which makes it possible to satisfy these three requirements: GMP-Tool-box (GMPTB).
Shotgun follow-up: During the random sequencing procedure, GMPTB extracts from the results files (Phrap format) or the characteristics required for the assembly (number of contigs, number of sequences, etc) and displays them in a table. This table may be used to create graphics which show the progression of this method and which allow rapid identification of the assembly problems. Importantly, GMPTB allows comparison between the assembly results and creates an HTML page to explain the relationship between new and old contigs (fusion, creation, etc).
Sequence closure phase: Various strategies are used by GMPTB to predict links between contigs. GMPTB in particular searches for all the clones which allow links, on the basis of the location and of the orientation of the terminal sequences. It can also indicate misassemblies. GMPTB can also predict links, on the basis of genome comparisons, by searching for similarities between the ends of the contigs and other genomic sequences (at nucleotide and amino acid level).
Annotations: GMPTB makes it possible to begin the annotation during the terminal phase. In fact, GMPTB creates an individual protein file (IPF) for each open reading frame (ORF) at the time of assembly. These are text files in a specific format which contain three categories of fields:
- the minimum fields contain an identification number, a version number, location and sequences. The nucleotide sequence exported corresponds to the sequence of the open reading frame with 500 additional bases before the first stop codon and 200 additional bases after the second stop.
- the automatic field contains results added to the IPF by different programs. It concerns the DNA sequence (search for ribosome binding sites, promoters or terminators, coding capacity, etc) and the predicted protein sequence (homology, domain, etc).
- the manual field contains results and comments added by the users. After a new assembly, GMPTB extracts all the ORFs and creates new IPFs according to the IPF sequences derived from the previous assemblies. GMPTB recognizes the modified IPFs which are the only ones used for a new automatic analysis after each assembly.
The specificity of this strategy is that the annotation of the IPFs is independent of the assembly step, unless its sequence is modified. The IPFs are connected to a Sybases genomic databank (of the SubtiList model) and are accessible via a web server. They can be modified and annotated by different inventors during the genome project phase.
Depositing of Biological Material
The following organisms were deposited, on Apr. 11, 2000, at the Collection Nationale de Cultures de Microorganismes (CNCM) [National Collection of Microorganism Cultures], 25 rue du Docteur Roux, 75724 Paris Cedex 15, France, according to the provisions of the Treaty of Budapest:
- Listeria monocytogenes strain EGD-e, number I-2440;
BAC library of Listeria DNA (145 clones), LM-baclib, number I-2439. Said BAC library (1-2439) was produced in the E. coli strain DH10B (Grant et al., PNAS, 87, 4645, 1990), constructed after partial digestion of the Listeria monocytogenes DNA with the EcoRI enzyme in the vector pBelaBAC-Kan (Mozo et al., Mol. Gen. Genet., 1998, 258, 562-70).
TABLE 1
|
|
Location of nucleic acid
sequence of the ORF on
SEQ IDNameSEQ ID NO. 1Function
|
|
SEQ ID NO. 2LM-1000.1From 589066to 589362Unknown, peptdidoglycan
bound protein (LPXTG
motif)
SEQ ID NO. 3LM-1002.1From 587012to 589033Unknown, similar to
internalin protein
SEQ ID NO. 4LM-1050.1From 2907153to 2909708Unknown, LPXTG protein
with LRR repeats
SEQ ID NO. 5LM-1179.1From 761552to 763468Unknown, putative
peptidoglycan bound protein
(LPXTG motif)
SEQ ID NO. 6LM-118.2From 2787416to 2788363Unknown, peptidoglycan
anchored protein (LPXTG
motif)
SEQ ID NO. 7LM-1235.1From 875721to 881855Unknown, surface protein
(LPXTG motif)
SEQ ID NO. 8LM-1248.1From 865530Fromunknown, surface protein
865530(LPXTG motif)
SEQ ID NO. 9LM-1248.1From 865530Fromunknown, surface protein
865530(LPXTG motif)
SEQ ID NO. 10LM-1305.1From 919020to 920408Unknown, similar to wall
associated protein
precursor (LPXTG motif)
SEQ ID NO. 11LM-1490.1From 649864to 651633Unknown, similar to
internalin proteins
SEQ ID NO. 12LM-1514.1From 664242to 668990Unknown, peptidoglycan
bound protein (LPXTG
motif) similar to adhesion
SEQ ID NO. 13LM-1660.3From 501312to 501617Unknown
SEQ ID NO. 14LM-1738.1From 547520to 549337Unknown, similar to
internalin proteins
SEQ ID NO. 15LM-1756.2From 2679599to 2681125Unknown, surface protein
(GW repeat) similar to
N-acetylmuramidase
SEQ ID NO. 16LM-1778.1From 1442368to 1443687Unknown, putative
peptidoglycan bound protein
(LPXTG motif)
SEQ ID NO. 17LM-1972.3From 1313654to 1315435Unknown, similar to
internalin proteins
SEQ ID NO. 18LM-1974.3From 1315767to 1317563Unknown, similar to
internalin proteins
SEQ ID NO. 19LM-2137.2From 360936to 366272Unknown, similar to
internalin proteins
SEQ ID NO. 20LM-229.1From 1170002to 1171621Unknown, putative
interanlin, similar to InlA
SEQ ID NO. 21LM-2323.1From 344850to 346049Unknown, similar to surface
proteins
SEQ ID NO. 22LM-2435.1From 159663to 161378Unknown, surface anchored
protein (LPXTG motif)
SEQ ID NO. 23LM-2438.1From 157089to 159470Unknown, surface anchored
protein (LPXTG motif)
SEQ ID NO. 24LM-2503.1From 2108500to 2109603Unknown, putative cell
surface protein, similar to
internalin proteins
SEQ ID NO. 25LM-2504.1From 2106329to 2108209Unknown, putative
peptidoglycan bound protein
(LPXTG motif)
SEQ ID NO. 26LM-3009.3From 1149887to 1152475Unknown, similar to
fibrinogen-binding protein
(LPXTG motif)
SEQ ID NO. 27LM-311.2From 131419to 133773Unknown, similar to 5-
nucleotidase (LPXTG motif)
SEQ ID NO. 28LM-3144.1From 351459to 355505Unknown, similar to cell
surface proteins (LPXTG
motif)
SEQ ID NO. 29LM-3369.1From 1869532to 1872243Unknown, putative
peptidoglycan bound protein
(LPXTG motif)
SEQ ID NO. 30LM-3418.2From 1717193to 1722328Unknown, peptidoglycan
linked protein (LPxTG)
SEQ ID NO. 31LM-3477.1From 828168to 830108Unknown, similar to
internalin
SEQ ID NO. 32LM-3609.1From 1106041to 1107759Unknown, similar to
AUTOLYSIN (EC 3.5.1.28)
(N-ACETYLMURAMOYL-L-
ALANINE AMIDASE)
SEQ ID NO. 33LM-3691.2From 2162323to 2164011Unknown, putative
peptidoglycan bound protein
(LPXTG motif)
SEQ ID NO. 34LM-3700.2From 2653211to 2657803Unknown, peptidoglycan
anchored protein (LPXTG
motif)
SEQ ID NO. 35LM-3752.3From 357775to 359676Unknown, similar to
internalin
SEQ ID NO. 36LM-757.1From 2544267to 2545433Unknown, similar to
internalin proteins
SEQ ID NO. 37LM-814.2From 2264772to 2268230Unknown, putative
peptidoglycan bound protein
(LPXTG motif)
SEQ ID NO. 38LM-816.2From 2259753to 2264591Unknown, putative
peptidoglycan bound protein
(LPXTG motif)
SEQ ID NO. 39LM-894.1From 2469093to 2471915Unknown, similar to
internalin proteins
SEQ ID NO. 40LM-966.1From 173309to 174556Unknown, cell wall
anchored protein
SEQ ID NO. 41LM-973.1From 169510to 172008Unknown, similar to
internalin proteins
SEQ ID NO. 42LM-133.1From 1230773to 1232308Unknown, similar to cobyric
acid synthase CbiP
SEQ ID NO. 43LM-134.1From 1229967to 1230773Unknown, similar to cobalt
transport ATP-binding
protein CbiO
SEQ ID NO. 44LM-135.1From 1229277to 1229954Unknown, similar to cobalt
transport protein Q
SEQ ID NO. 45LM-136.1From 1228994to 1229290Unknown, similar to putative
cobalt transport protein
CbiN
SEQ ID NO. 46LM-137.1From 1228263to 1228997Unknown, similar to
cobalamin biosynthesis
protein M
SEQ ID NO. 47LM-138.1From 1227556to 1228266Unknown, similar to
S-adenosyl-methionine:
precorrin-2
methyltransferase
SEQ ID NO. 48LM-139.1From 1226778to 1227563unknown, similar to
anaerobic Cobalt Chelatase
In Cobalamin Biosynthesis
SEQ ID NO. 49LM-141.1From 1225300to 1226781Unknown, similar
uroporphyrinogen-III
methyltransferase/uroporphyrinogen-
III synthase
SEQ ID NO. 50LM-142.1From 1224548to 1225300Unknown, similar to
cobalamin biosynthesis J
protein CbiJ
SEQ ID NO. 51LM-143.1From 1223826to 1224551Unknown, similar to
precorrin methylase
SEQ ID NO. 52LM-144.1From 1222798to 1223829Unknown, similar to
cobalamin biosynthesis
protein G CbiG
SEQ ID NO. 53LM-145.1From 1222062to 1222811Unknown, similar to
precorrin-3 methylase
SEQ ID NO. 54LM-146.1From 1221487to 1222056Unknown, similar to
precorrin decarbocylase
SEQ ID NO. 55LM-147.1From 1220901to 1221497Unknown, similar to
precorrin methylase
SEQ ID NO. 56LM-149.1From 1219783to 1220904Unknown, similar to
cobalamin biosynthesis
protein CbiD
SEQ ID NO. 57LM-150.1From 1219135to 1219767Unknown, similar to
precorrin isomerase
SEQ ID NO. 58LM-151.1From 1218175to 1219122Unknown, similar to
cobalamine synthesis
protein CbiB
SEQ ID NO. 59LM-152.1From 1216963to 1217322FALSE ORF
SEQ ID NO. 60LM-181.1From 1196097to 1197182unknown, similar to
Salmonella typhimurium
CobD protein and to
histidinol-phosphate
aminotransferase
SEQ ID NO. 61LM-207.1From 1180152to 1181036Regulatory protein similar to
Salmonella typhimurium
PocR protein
SEQ ID NO. 62LM-209.1From 1179168to 1179743unknown, similar to alpha-
ribazole-5-phosphatase
SEQ ID NO. 63LM-210.1From 1178421to 1179167Unknown, highly similar to
cobalamin (5-phosphatase)
synthetase
SEQ ID NO. 64LM-212.1From 1177862to 1178419unknown, similar to
bifunctional cobalamin
biosynthesis protein CopB,
(cobinamide kinase;
cobinamide phosphatase
guanylyltransferase)
SEQ ID NO. 65LM-1.3From 2710869to 2712755Unknown, similar to NADH
dehydrogenase
SEQ ID NO. 66LM-10.1From 2718312to 2719055Unknown
SEQ ID NO. 67LM-100.1From 2775644to 2776486Unknown, similar to
aldo/keto reductase
SEQ ID NO. 68LM-1003.1From 586329to 586994Unknown
SEQ ID NO. 69LM-1004.1From 585015to 585962Unknown, similar to B. subtilis DeoR
transcriptional
regulator
SEQ ID NO. 70LM-1005.1From 583507to 584757Unknown, similar to putative
NAD(P)-dependent
oxidoreductase
SEQ ID NO. 71LM-1007.1From 583072to 583452Unknown
SEQ ID NO. 72LM-1008.1From 582526to 583047Unknown, similar to PTS
system, glucitol/sorbitol-
specific enzyme II CII
component
SEQ ID NO. 73LM-1009.1From 581519to 582505Unknown, similar to PTS
system, glucitol/sorbitol-
specific enzyme IIBC
component
SEQ ID NO. 74LM-101.1From 2776532to 2776777Unknown, similar to B. subtilis
YaaL protein
SEQ ID NO. 75LM-1010.1From 581150to 581500Unknown, similar to PTS
system, glucitol/sorbitol-
specific enzyme IIA
component
SEQ ID NO. 76LM-1011.1From 580119to 581039Unknown, similar to ABC
transporter (binding protein)
SEQ ID NO. 77LM-1012.1From 578886to 580079Unknown, similar to
penicillin-binding protein
SEQ ID NO. 78LM-1013.1From 577632to 578648Unknown, similar to
tagatose-1,6-diphosphate
aldolase
SEQ ID NO. 79LM-1015.1From 576403to 577584Unknown, similar to N-acyl-
L-amino acid
amidohydrolase
SEQ ID NO. 80LM-1017.1From 575166to 576437Unknown, similar to N-
carbamyl-L-amino acid
amidohydrolase
SEQ ID NO. 81LM-1018.1From 573740to 575062Unknown, similar to 6-
phospho-beta-glucosidase
SEQ ID NO. 82LM-102.1From 2776792to 2777388Unknown, highly similar to
recombination protein recR
SEQ ID NO. 83LM-1020.1From 572600to 573568Unknown, similar to
transcription regulator (Lacl
family)
SEQ ID NO. 84LM-1022.1From 571204to 572559Unknown, similar to
unknown proteins
SEQ ID NO. 85LM-1023.1From 570916to 571185Unknown, similar to
unknown proteins
SEQ ID NO. 86LM-1024.1From 570008to 570838Unknown
SEQ ID NO. 87LM-1025.1From 569094to 569948Unknown
SEQ ID NO. 88LM-1029.1From 567030to 569081Unknown
SEQ ID NO. 89LM-103.1From 2777497to 2777814Unknown, highly similar to
B. subtilis YaaK protein
SEQ ID NO. 90LM-1031.1From 565764to 567014Unknown, conserved
hypothetical protein similar
to putative
glucosaminyltransferase
SEQ ID NO. 91LM-1032.1From 564257to 565753Unknown, hypothetical
secreted protein
SEQ ID NO. 92LM-1033.1From 562789to 564264Unknown, transmembrane
protein
SEQ ID NO. 93LM-1036.1From 561819to 562559Unknown, similar to
transcription regulator (TipA
from Streptomyces
coelicolor)
SEQ ID NO. 94LM-1037.1From 560443to 561774Unknown
SEQ ID NO. 95LM-1040.2From 2915744to 2916097Unknown
SEQ ID NO. 96LM-1041.2From 2915219to 2915641Unknown similar to
transcriptional regulator
(MarR family)
SEQ ID NO. 97LM-1043.1From 2914057to 2915205Unknown, similar to efflux
proteins
SEQ ID NO. 98LM-1045.1From 2912595to 2913686Unknown, highly similar to
phosphoserine
aminotransferase
SEQ ID NO. 99LM-1047.1From 2911415to 2912602Unknown, similar to D-3-
phosphoglycerate
dehydrogenase
SEQ ID NO. 100LM-1048.1From 2910085to 2911326Unknown, similar to an
hypothetical protein from
Thermotoga maritima
SEQ ID NO. 101LM-1049.1From 2909753to 2910076Unknown
SEQ ID NO. 102LM-1052.1From 2904598to 2906940Unknown, amino-terminal
domain similar to
transcription regulators
SEQ ID NO. 103LM-1054.1From 2903200to 2904402Unknown, similar to
carboxypeptidase
SEQ ID NO. 104LM-1055.1From 2901800to 2903200Unknown, similar to
transmembrane efflux
protein
SEQ ID NO. 105LM-1056.1From 2900594to 2901784Unknown, similar to
peptidases
SEQ ID NO. 106LM-1057.1From 2899313to 2900506Unknown, similar to
transport protein
SEQ ID NO. 107LM-1058.1From 2898535to 2899266Unknown, similar to
reductases
SEQ ID NO. 108LM-1059.1From 2897847to 2898374Unknown, similar to
transcriptional regulator
SEQ ID NO. 109LM-1060.1From 2897116to 2897823Unknown
SEQ ID NO. 110LM-1061.1From 2896241to 2897059Unknown, similar to D-
alanyl-D-alanine
carboxypeptidase
SEQ ID NO. 111LM-1064.1From 2894510to 2895883Unknown, similar to
GTPase
SEQ ID NO. 112LM-1066.1From 2892596to 2894485Unknown, highly similar to
GidA protein
SEQ ID NO. 113LM-1067.2From 2892057to 2892437Unknown, hypothetical
secreted protein
SEQ ID NO. 114LM-1069.3From 434945to 435817Unknown
SEQ ID NO. 115LM-107.1From 2777844to 2779583Unknown, highly similar to
DNA polymerase III
(gamma and tau subunits)
SEQ ID NO. 116LM-1070.1From 433153to 434745Unknown, similar to
phosphoenolpyruvate
synthase (N-terminal part)
SEQ ID NO. 117LM-1074.1From 429630to 432095Internalin
SEQ ID NO. 118LM-1075.1From 428981to 429403Unknown
SEQ ID NO. 119LM-1076.1From 428576to 428968Unknown
SEQ ID NO. 120LM-1077.1From 428127to 428507Unknown, similar to B. subtilis
YyaH protein
SEQ ID NO. 121LM-1079.1From 427104to 428111Unknown, similar to
phosphate transport protein
SEQ ID NO. 122LM-108.1From 2779910to 2780374Unknown
SEQ ID NO. 123LM-1080.1From 426597to 426995Unknown
SEQ ID NO. 124LM-1081.1From 426190to 426606Unknown
SEQ ID NO. 125LM-1082.1From 424130to 426064Unknown, similar to
transcriptional
antiterminator (BglG family)
SEQ ID NO. 126LM-1083.1From 421469to 424096Unknown, highly similar to
E. col YbgG protein, a
putative sugar hydrolase
SEQ ID NO. 127LM-1084.1From 420335to 421447Unknown, similar to
fructose-specific
phosphotransferase
enzyme IIC
SEQ ID NO. 128LM-1085.1From 419999to 420331Unknown, similar to
fructose-specific
phosphotransferase
enzyme IIB
SEQ ID NO. 129LM-1088.1From 419544to 420002Unknown, similar to
phosphotransferase system
enzyme IIA
SEQ ID NO. 130LM-1089.1From 418891to 419373Unknown, similar to
unknown proteins
SEQ ID NO. 131LM-109.1From 2780933to 2782297Unknown, similar to PTS
system, cellobiose-specific
enzyme IIC
SEQ ID NO. 132LM-1091.1From 417963to 418763Unknown, similar to 1-
pyrroline-5-carboxylate
reductase (ProC)
SEQ ID NO. 133LM-1092.1From 417535to 417960Unknown, weakly similar to
blasticidin S-
acetyltransferase
SEQ ID NO. 134LM-1093.1From 416796to 417479Unknown, similar to L. monocytogenes
extracellular P60 protein
SEQ ID NO. 135LM-1094.1From 416203to 416652Unknown
SEQ ID NO. 136LM-1097.1From 415224to 416168Unknown, highly similar to
B. subtilis YqfA protein
SEQ ID NO. 137LM-1098.1From 414928to 415227Unknown
SEQ ID NO. 138LM-1099.1From 414095to 414760Unknown, similar to uracil-
DNA glycosylase
SEQ ID NO. 139LM-11.1From 2719056to 2719775Unknown
SEQ ID NO. 140LM-1100.1From 412849to 413964Unknown, low temperature
requirement protein A
SEQ ID NO. 141LM-1101.1From 412457to 412825Unknown
SEQ ID NO. 142LM-1102.1From 411992to 412363Unknown, similar to B. subtilis
YhdG protein
SEQ ID NO. 143LM-1105.1From 409944to 411860Unknown, similar to B. subtilis
IolD protein, to
acetolactate synthase
SEQ ID NO. 144LM-1107.1From 408954to 409931Unknown, similar to B. subtilis
IolC protein and to
fructokinase
SEQ ID NO. 145LM-1108.1From 408116to 408937Unknown, similar to B. subtilis
IolB protein
SEQ ID NO. 146LM-1110.2From 406637to 408103Unknown, highly similar to
B. subtilis methylmalonate-
semialdehyde
dehydrogenase IolA
SEQ ID NO. 147LM-1112.2From 405680to 406441Unknown, similar to B. subtilis
transcription
repressor of myo-inositol
catabolism operon IolR
SEQ ID NO. 148LM-1114.1From 405221to 405607Unknown
SEQ ID NO. 149LM-1115.3From 404365to 404994Unknown
SEQ ID NO. 150LM-1116.4From 807976to 810792Unknown, similar to
transcriptional regulator
(NifA/NtrC family)
SEQ ID NO. 151LM-1117.1From 807255to 807689Unknown, similar to
mannose-specific
phosphotransferase system
(PTS) component IIA
SEQ ID NO. 152LM-1118.1From 806770to 807255Unknown, similar to
mannose-specific
phosphotransferase system
(PTS) component IIB
SEQ ID NO. 153LM-112.1From 2783466to 2784107Unknown
SEQ ID NO. 154LM-1121.1From 805784to 806596Unknown, similar to
mannose-specific
phosphotransferase system
(PTS) component IIC
SEQ ID NO. 155LM-1122.1From 804887to 805765Unknown, similar to
mannose-specific
phosphotransferase system
(PTS) component IID
SEQ ID NO. 156LM-1123.1From 804382to 804729Unknown
SEQ ID NO. 157LM-1125.1From 803745to 804212Unknown
SEQ ID NO. 158LM-1126.1From 803201to 803548Unknown
SEQ ID NO. 159LM-1127.1From 802356to 802739Unknown
SEQ ID NO. 160LM-1128.1From 801336to 802202Unknown, similar to
transcription regulator
(repressor)
SEQ ID NO. 161LM-1129.1From 800965to 801297Unknown
SEQ ID NO. 162LM-113.1From 2784354to 2784674Unknown, similar to
hypothetical proteins
SEQ ID NO. 163LM-1130.1From 799996to 800925Unknown, conserved
hypothetical protein
SEQ ID NO. 164LM-1131.1From 798828to 799817Unknown, similar to alcohol
dehydrogenase
SEQ ID NO. 165LM-1132.1From 798141to 798788Unknown, similar to
transcription regulator
SEQ ID NO. 166LM-1133.1From 797353to 798105Unknown
SEQ ID NO. 167LM-1134.1From 796075to 797193Unknown, similar to
transcriptional regulator
(LacI family)
SEQ ID NO. 168LM-1135.1From 795034to 796062Unknown, similar to alpha-
1,6-mannanase
SEQ ID NO. 169LM-1136.1From 793777to 795030Unknown, similar to sugar
ABC transporter,
periplasmic sugar-binding
protein
SEQ ID NO. 170LM-1137.1From 792879to 793754Unknown, similar to ABC
transporter, permease
protein
SEQ ID NO. 171LM-1138.1From 791977to 792867Unknown, similar to putative
sugar ABC transporter,
permease protein
SEQ ID NO. 172LM-1139.1From 790662to 791960Unknown
SEQ ID NO. 173LM-114.1From 2784805to 2786319Unknown, highly similar to
gluconate kinase
SEQ ID NO. 174LM-1140.1From 789520to 790509Unknown, similar to lipoate-
protein ligase
SEQ ID NO. 175LM-1141.1From 788642to 789514Unknown, similar to
unknown proteins
SEQ ID NO. 176LM-1142.1From 787320to 788576Unknown, similar to
ATP/GTP-binding protein
SEQ ID NO. 177LM-1143.1From 786372to 786992Unknown, similar to
unknown proteins
SEQ ID NO. 178LM-1144.1From 785754to 786362Unknown
SEQ ID NO. 179LM-1146.1From 784788to 785747Unknown
SEQ ID NO. 180LM-1147.1From 783901to 784788Unknown
SEQ ID NO. 181LM-1149.1From 782794to 783774Unknown, similar to
hypothetical proteins
SEQ ID NO. 182LM-1151.1From 781896to 782801Unknown; Similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 183LM-1152.1From 781103to 781924Unknown, similar to
unknown proteins
SEQ ID NO. 184LM-1153.2From 780377to 780988Unknown, weakly similar to
a bile acid 7-alpha
dehydratase
SEQ ID NO. 185LM-1154.1From 779616to 780296Unknown, similar to
transcription regulator
Crp/Fnr family
SEQ ID NO. 186LM-1155.1From 778654to 779490Unknown, weakly similar to
a putative haloacetate
dehalogenase
SEQ ID NO. 187LM-1156.1From 778262to 778558Unknown
SEQ ID NO. 188LM-1157.1From 777692to 778207Unknown
SEQ ID NO. 189LM-1158.1From 777209to 777496Unknown
SEQ JD NO. 190LM-1159.1From 776761to 777102Unknown
SEQ ID NO. 191LM-1160.1From 776398to 776697Unknown, hypothetical
SEQ ID NO. 192LM-1161.1From 775744to 776247Unknown
SEQ ID NO. 193LM-1164.1From 773715to 775715Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 194LM-1165.1From 772996to 773700Unknown
SEQ ID NO. 195LM-1166.1From 772310to 772999Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 196LM-1167.1From 771949to 772326Unknown, similar to
transcriptional regulator
(GntR family)
SEQ ID NO. 197LM-1168.1From 771026to 771685Unknown, similar to putative
transcription regulator
SEQ ID NO. 198LM-1169.1From 769639to 771012Unknown, similar to 6-
phospho-beta-glucosidase
SEQ ID NO. 199LM-117.1From 2786435to 2787373Unknown, secreted protein
with 1 GW repeat
SEQ ID NO. 200LM-1171.1From 767766to 769619Unknown, similar to
phosphotransferase system
(PTS) beta-glucoside-
specific enzyme IIABC
component
SEQ ID NO. 201LM-1172.1From 766810to 767742Unknown
SEQ ID NO. 202LM-1173.1From 766351to 766797Unknown, similar to ribose
5-phosphate isomerase
SEQ ID NO. 203LM-1174.1From 765683to 766354Unknown, similar to
Ribulose-5-Phosphate 3-
Epimerase
SEQ ID NO. 204LM-1175.1From 764465to 765469Unknown, similar to
transcriptional regulator
(LacI family)
SEQ ID NO. 205LM-1176.1From 763798to 764307Unknown, similar to
transcription regulator
SEQ ID NO. 206LM-1181.1From 760455to 760859Unknown
SEQ ID NO. 207LM-1182.2From 759972to 760310Unknown
SEQ ID NO. 208LM-1183.2From 759479to 759958Unknown
SEQ ID NO. 209LM-1184.3From 2916291to 2916890Unknown, similar to yeast
protein Frm2p involved in
fatty acid signaling
SEQ ID NO. 210LM-1186.1From 2916946to 2917266Unknown, similar to
thioredoxin
SEQ ID NO. 211LM-1189.1From 2917389to 2918051Unknown, similar to
phosphoglucomutase
SEQ ID NO. 212LM-119.2From 1239842to 1240201Unknown
SEQ ID NO. 213LM-1190.1From 2918048to 2919169Unknown; similar to
unknown proteins
SEQ ID NO. 214LM-1195.1From 2919166to 2921343Unknown, similar to a
maltose phosphorylase
SEQ ID NO. 215LM-1196.1From 2921340to 2922371Unknown, similar to
oxidoreductases
SEQ ID NO. 216LM-1198.1From 2922429to 2923223Unknown, highly similar to
an E. coli protein
SEQ ID NO. 217LM-1199.1From 2923236to 2924288Unknown, similar to alcohol
dehydrogenase
SEQ ID NO. 218LM-1201.1From 2924307to 2925158Unknown, similar to sugar
ABC transporter permease
protein
SEQ ID NO. 219LM-1202.1From 2925145to 2926026Unknown, similar to sugar
ABC transporter permease
protein
SEQ ID NO. 220LM-1203.1From 2926101to 2927393Unknown, similar to sugar
binding protein (ABC
transporter)
SEQ ID NO. 221LM-1205.1From 2927390to 2928631Unknown, similar to
Sucrose phosphorylase
SEQ ID NO. 222LM-1206.1From 2928668to 2929090Unknown, weakly similar to
sucrose phosphorylase
SEQ ID NO. 223LM-1208.1From 2929315to 2930340Unknown, similar to
transcriptional regulator
SEQ ID NO. 224LM-1209.1From 2930479to 2932473Unknown
SEQ ID NO. 225LM-121.1From 1239039to 1239797Unknown, similar to rRNA
methylase
SEQ ID NO. 226LM-1210.1From 2932609to 2933148Unknown, similar to
unknown proteins
SEQ ID NO. 227LM-1212.1From 2933303to 2934715Unknown, similar to
transmembrane efflux
proteins
SEQ ID NO. 228LM-1213.1From 2934756to 2935070Unknown, highly similar to
B. subtilis YuID protein
SEQ ID NO. 229LM-1214.1From 2935083to 2935904Unknown, highly similar to
rhamnulose-1-phosphate
aldolase
SEQ ID NO. 230LM-1216.1From 2935917to 2937179Unknown, highly similar to
L-rhamnose isomerase
SEQ ID NO. 231LM-1218.1From 2937192to 2938643Unknown; similar to
rhamnulokinase
SEQ ID NO. 232LM-122.1From 1237934to 1239013Unknown, similar to endo-
1,4-beta-glucanase and to
aminopeptidase
SEQ ID NO. 233LM-1220.1From 2938662to 2939930Unknown, similar to sugar
transport proteins
SEQ ID NO. 234LM-1221.1From 2940053to 2941033Unknown, similar to AraC-
type regulatory protein
SEQ ID NO. 235LM-1222.1From 2941083to 2941544Unknown
SEQ ID NO. 236LM-1223.2From 2941599to 2942219Unknown, highly similar to
B. subtilis Jag protein
SEQ ID NO. 237LM-1224.3From 2942216to 2943079Unknown, highly similar to
B. subtilis SpoIIIJ protein
SEQ ID NO. 238LM-1225.2From 884170to 885270Unknown, similar to
excinuclease ABC, chain C
(UvrC)
SEQ ID NO. 239LM-1228.1From 882962to 884065Unknown, similar to B. subtilis
YxjH and YxjG
proteins
SEQ ID NO. 240LM-123.1From 1236836to 1237822Unknown, similar to N-
acetylmuramoyl-L-alanine
amidase (autolysin)
SEQ ID NO. 241LM-1230.1From 882328to 882705Unknown, conserved
hypothetical protein
SEQ ID NO. 242LM-1231.1From 881992to 882249Unknown, similar to B. subtilis
protein YsdA
SEQ ID NO. 243LM-1239.1From 872616to 875258unknown, similar to cation
(clacium) transporting
ATPase
SEQ ID NO. 244LM-1241.1From 871940to 872410Unknown
SEQ ID NO. 245LM-1242.1From 870587to 871807Unknown, similar to
Tetracycline resistance
protein
SEQ ID NO. 246LM-1243.1From 869095to 870480unknown, highly similar to
hexose phosphate transport
protein
SEQ ID NO. 247LM-1245.1From 867163to 868800Unknown, ABC transporter
(ATP binding protein)
SEQ ID NO. 248LM-1246.1From 866577to 866990Unknown, similar to B. subtilis
YrkR protein
SEQ ID NO. 249LM-1249.1From 864791to 865537Unknown
SEQ ID NO. 250LM-125.1From 1235670to 1236539Unknown, similar to N-
acetylmuramoyl-L-alanine
amidase (autolysin)
SEQ ID NO. 251LM-1250.1From 863798to 864688Unknown; similar to
transcriptional regulator
SEQ ID NO. 252LM-1251.1From 863322to 863606Unknown, similar to
transposase
SEQ ID NO. 253LM-1254.1From 861731to 862642Unknown
SEQ ID NO. 254LM-1255.1From 859656to 861617unknown, highly similar to
fructose-1,6-
bisphosphatase
SEQ ID NO. 255LM-1257.1From 855759to 859406unknown, highly similar to
pyruvate-flavodoxin
oxidoreductase
SEQ ID NO. 256LM-1258.1From 854745to 855581Unknown, similar to
transposases
SEQ ID NO. 257LM-1259.1From 854443to 854748Unknown, similar to
transposases
SEQ ID NO. 258LM-1261.1From 852704to 854338Unknown, similar to
transport protein
SEQ ID NO. 259LM-1262.1From 851225to 852505Unknown, similar to 3-
hydroxy-3-methylglutaryl-
coenzyme a reductase
SEQ ID NO. 260LM-1263.1From 850216to 851184Unknown
SEQ ID NO. 261LM-1265.1From 849314to 850138Unknown, similar to
oxydoreductases
SEQ ID NO. 262LM-1266.1From 848804to 849193Unknown, similar to
transcriptional regulators
SEQ ID NO. 263LM-1267.1From 848126to 848788Unknown
SEQ ID NO. 264LM-1268.1From 847552to 848109unknown, some similarity to
acetyltransferases
SEQ ID NO. 265LM-1269.1From 846594to 847442Unknown
SEQ ID NO. 266LM-1271.2From 843949to 846579unknown, similar to cation
transporting ATPase
SEQ ID NO. 267LM-1273.1From 939106to 939819Unknown, similar to
transcription regulator
(GntR family)
SEQ ID NO. 268LM-1275.2From 937739to 939025Unknown, similar to PTS
system, cellobiose-specific
IIC component
SEQ ID NO. 269LM-1277.2From 937202to 937594Unknown
SEQ ID NO. 270LM-1278.1From 936136to 936603Unknown, similar to B. subtilis
YdcK protein
SEQ ID NO. 271LM-128.1From 1233904to 1234434Unknown, similar to
unknown proteins
SEQ ID NO. 272LM-1283.1From 933959to 936136Unknown, conserved
hypothetical protein
SEQ ID NO. 273LM-1285.1From 932257to 933882Unknown, similar to
transport proteins
SEQ ID NO. 274LM-1286.1From 931451to 932050Indirect negative regulation
of sigma B dependant gene
expression (serine
phosphatase)
SEQ ID NO. 275LM-1287.1From 930671to 931450RNA polymerase sigma-37
factor (sigma-B)
SEQ ID NO. 276LM-1288.1From 930220to 930693sigma-B activity negative
regulator RsbW
SEQ ID NO. 277LM-1289.1From 929892to 930236anti-anti-sigma factor
(antagonist of RsbW)
SEQ ID NO. 278LM-129.1From 1233366to 1233815Unknown, similar to
unknown proteins
SEQ ID NO. 279LM-1290.1From 928738to 929742Unknown, highly similar to
serine phosphatase RsbU
SEQ ID NO. 280LM-1291.1From 928311to 928721Unknown, highly similar to
positive regulation of sigma-
B activity
SEQ ID NO. 281LM-1293.1From 927952to 928308Unknown, highly similar to
negative regulation of
sigma-B activity
SEQ ID NO. 282LM-1294.1From 927110to 927946Unknown, highly similar to
positive regulator of sigma-
B activity
SEQ ID NO. 283LM-1295.1From 926511to 926858Unknown, similar to B. subtilis
YdcE protein
SEQ ID NO. 284LM-1296.1From 926229to 926507Unknown, similar to B. subtilis
YdcD protein
SEQ ID NO. 285LM-1297.1From 924917to 926023Unknown, similar to alanine
racemase
SEQ ID NO. 286LM-1298.1From 924542to 924898Unknown, similar to holo-
acyl-carrier protein synthase
SEQ ID NO. 287LM-13.1From 2719735to 2720484Unknown, similar to
creatinine amidohydrolase
SEQ ID NO. 288LM-130.1From 1232884to 1233351Unknown, similar to
unknown proteins
SEQ ID NO. 289LM-1300.1From 923161to 924540Unknown, similar to
protoporphyrinogen IX and
coproporphyrinogen III
oxidase (HemY)
SEQ ID NO. 290LM-1302.1From 921537to 923012Unknown, similar to B. subtilis
YbtB protein
SEQ ID NO. 291LM-1303.1From 921062to 921544Unknown, similar to B. subtilis
YdbS protein
SEQ ID NO. 292LM-1304.1From 920452to 920928unknown
SEQ ID NO. 293LM-1306.1From 918135to 918902unknown
SEQ ID NO. 294LM-1307.1From 917164to 918117Unknown, similar to
oxidoreductases
SEQ ID NO. 295LM-1309.1From 916436to 917164Unknown, similar to B. subtilis
NagB protein
(glucosamine-6-phosphate
isomerase)
SEQ ID NO. 296LM-131.1From 1232272to 1232838Unknown, similar to
unknown protein
SEQ ID NO. 297LM-1312.1From 915090to 916415Unknown, similar to PTS
system, Lichenan-specific
enzyme IIC component
SEQ ID NO. 298LM-1313.1From 914735to 915070Unknown, similar to PTS
system, beta-glucoside
enzyme IIB component
SEQ ID NO. 299LM-1314.1From 914405to 914734Unknown, similar to PTS
system enzyme IIA
component
SEQ ID NO. 300LM-1316.1From 912372to 914390Unknown; Similar to
transcriptional regulator
(antiterminator)
SEQ ID NO. 301LM-1317.1From 910998to 912164Unknown; similar to
antibiotic resistance protein
SEQ ID NO. 302LM-1318.1From 910517to 910870Unknown, similar to B. subtilis
YtcD protein
SEQ ID NO. 303LM-1319.1From 910173to 910484unknown
SEQ ID NO. 304LM-1320.1From 909159to 910157unknown
SEQ ID NO. 305LM-1321.1From 908499to 909056unknown
SEQ ID NO. 306LM-1322.1From 907979to 908467unknown
SEQ ID NO. 307LM-1325.1From 905962to 907524Unknown, similar to ATP-
dependent RNA helicase
SEQ ID NO. 308LM-1326.1From 903837to 905510Unknown, similar to
phosphomannomutase
SEQ ID NO. 309LM-1327.1From 902773to 903840unknown
SEQ ID NO. 310LM-1328.1From 901837to 902751unknown
SEQ ID NO. 311LM-1329.1From 900303to 901835Unknown, similar to oligo-
1,6-glucosidase
SEQ ID NO. 312LM-1331.1From 899454to 900287Unknown, similar to sugar
ABC transporter, permease
protein-
SEQ ID NO. 313LM-1332.1From 898589to 899467Unknown, similar to sugar
ABC transporter, permease
protein
SEQ ID NO. 314LM-1333.3From 897273to 898592Unknown, similar to putative
sugar ABC transporter,
periplasmic sugar-binding
protein,
SEQ ID NO. 315LM-1335.3From 1104048to 1104851Unknown, highly similar to
teichoic acid translocation
permease protein TagG
SEQ ID NO. 316LM-1337.2From 1102858to 1103757Unknown, similar to metal
binding protein
SEQ ID NO. 317LM-1338.1From 1099266to 1102706Unknown, highly similar to
pyruvate carboxylase
SEQ ID NO. 318LM-1339.1From 1097786to 1098988Unknown, similar to cell-
division protein RodA and
FtsW
SEQ ID NO. 319LM-1341.1From 1097322to 1097603Unknown, similar to B. subtilis
YlaN protein
SEQ ID NO. 320LM-1342.1From 1096874to 1097116Unknown, similar to B. subtilis
YlaI protein
SEQ ID NO. 321LM-1343.2From 1095975to 1096835unknown
SEQ ID NO. 322LM-1349.1From 1093965to 1095803Unknown, similar to GTP-
binding ptotein TypA/BipA
(tyrosine phosphorylated
protein A) from E. coli and
B. subtilis (YlaG)
SEQ ID NO. 323LM-1351.1From 1093000to 1093773Unknown, similar to
extragenic suppressor
protein SuhB and to myoinositol-
1(or 4)-
monophosphatase
SEQ ID NO. 324LM-1352.1From 1092255to 1092869Unknown, similar to B. subtilis
YktB protein
SEQ ID NO. 325LM-1353.1From 1091110to 1092054Unknown, similar to
membrane and transport
proteins
SEQ ID NO. 326LM-1354.1From 1090375to 1091043Unknown, similar to ABC
transporter (permease)
SEQ ID NO. 327LM-1357.1From 1088941to 1090362Unknown, conserved
hypothetical protein
SEQ ID NO. 328LM-1358.1From 1087409to 1088854Unknown, similar to sensor
protein histidine kinases (2
components regulatory
systems)
SEQ ID NO. 329LM-1359.1From 1086772to 1087434Unknown, similar to
transcription response
regulator
SEQ ID NO. 330LM-1360.1From 1086100to 1086630unknown
SEQ ID NO. 331LM-1361.1From 1085806to 1086078Unknown, similar to B. subtilis
YktA protein
SEQ ID NO. 332LM-1363.1From 1084853to 1085806Unknown, similar to L-
lactate dehydrogenase
SEQ ID NO. 333LM-1364.1From 1084413to 1084724unknown
SEQ ID NO. 334LM-1367.1From 1082822to 1084225Unknown, highly similar to
dihydrolipoamide
dehydrogenase, E3 subunit
of pyruvate dehydrogenase
complex
SEQ ID NO. 335LM-1368.1From 1081183to 1082817Unknown, highly similar to
pyruvate dehydrogenase
(dihydrolipoamide
acetyltransferase E2
subunit)
SEQ ID NO. 336LM-1369.1From 1080095to 1081072Unknown, highly similar to
pyruvate dehydrogenase
(E1 beta subunit)
SEQ ID NO. 337LM-1370.1From 1078977to 1080092Unknown, highly similar to
pyruvate dehydrogenase
(E1 alpha subunit)
SEQ ID NO. 338LM-1371.1From 1077592to 1078143Unknown, similar to
formylmethionine
deformylase and to B. subtilis
YkrB protein
SEQ ID NO. 339LM-1372.1From 1076989to 1077543Unknown, similar to B. subtilis
YdfE protein
SEQ ID NO. 340LM-1373.1From 1075860to 1076858Unknown, similar to
molybdopterin biosynthesis
protein MoeB
SEQ ID NO. 341LM-1374.1From 1075360to 1075848Unknown, similar to
molybdenum cofactor
biosynthesis protein B
SEQ ID NO. 342LM-1376.1From 1074324to 1075325Unknown, similar to
molybdenum cofactor
biosynthesis protein A
SEQ ID NO. 343LM-1377.1From 1073813to 1074295Unknown, similar to
molybdenum cofactor
biosynthesis protein C
SEQ ID NO. 344LM-1379.1From 1073552to 1073800Unknown, similar to
molybdopterin converting
factor (subunit 1).
SEQ ID NO. 345LM-1380.1From 1073146to 1073568Unknown, similar to
molybdopterin converting
factor, subunit 2
SEQ ID NO. 346LM-1381.1From 1072664to 1073149Unknown, similar to
molybdopterin-guanine
dinucleotide biosynthesis
MobB
SEQ ID NO. 347LM-1383.1From 1071462to 1072685Unknown, similar to
molybdopterin biosynthesis
protein moeA
SEQ ID NO. 348LM-1385.1From 1070597to 1071367Unknown, similar to
molybdate binding protein
SEQ ID NO. 349LM-1386.1From 1069821to 1070492Unknown, similar to
molybdenum transport
protein
SEQ ID NO. 350LM-1387.1From 1069156to 1069818unknown, similar to ABC
transporter
SEQ ID NO. 351LM-1388.1From 1068594to 1069175unknown, weakly similar to
molybdopterin-guanine
dinucleotide biosynthesis
protein A
SEQ ID NO. 352LM-1389.1From 1067723to 1068526unknown, highly similar to
B. subtilis YoaT protein
SEQ ID NO. 353LM-1390.1From 1066397to 1067662unknown
SEQ ID NO. 354LM-1391.1From 1064521to 1066377unknown, similar to
phosphotransferase system
(PTS) beta-glucoside-
specific enzyme IIABC
SEQ ID NO. 355LM-1392.1From 1063061to 1064524unknown, similar to glycerol
kinase
SEQ ID NO. 356LM-1394.1From 1062108to 1063064unknown, similar to
transketolase
SEQ ID NO. 357LM-1396.1From 1061291to 1062115unknown, similar to
transketolase
SEQ ID NO. 358LM-1398.3From 1059872to 1061275unknown, similar to
hypothetical proteins
SEQ ID NO. 359LM-1401.1From 235524to 237020lysyl-tRNA synthetase
SEQ ID NO. 360LM-1402.1From 234414to 235409Unknown, conserved
hypothetical protein
SEQ ID NO. 361LM-1404.1From 233858to 234337unknown, similar to 7,8-
dihydro-6-
hydroxymethylpterin
pyrophosphokinase
SEQ ID NO. 362LM-1405.1From 233491to 233865unknown, highly similar to
dihydroneopterin aldolase
SEQ ID NO. 363LM-1407.1From 232662to 233477unknown, highly similar to
dihydropteroate synthases
SEQ ID NO. 364LM-1408.1From 230840to 231766Unknown, highly similar to
cysteine synthase
SEQ ID NO. 365LM-1410.1From 229840to 230724Unknown, conserved
hypothetical protein
SEQ ID NO. 366LM-1411.1From 229045to 229824Unknown, conserved
hypothetical protein
SEQ ID NO. 367LM-1413.1From 226854to 228929Unknown, highly similar to
cell division protein ftsH
SEQ ID NO. 368LM-1415.1From 224562to 226508Unknown, fusion protein, N-
terminal part similar to B. subtilis
YacA protein, C-
terminal part similar to
hypoxanthine-guanine
phosphoribosyltransferase
SEQ ID NO. 369LM-1418.1From 224027to 224455Unknown,
polyribonucleotide
nucleotidyltransferase
domain present
SEQ ID NO. 370LM-1420.1From 223490to 223876Unknown, similar to B. subtilis
DivlC protein
SEQ ID NO. 371LM-1426.1From 221370to 222959Unknown, conserved
membrane-spanning protein
SEQ ID NO. 372LM-1429.1From 217800to 221339Transcription-repair
coupling factor
SEQ ID NO. 373LM-1430.1From 217125to 217685Unknown, similar to
peptidyl-tRNA hydrolase
SEQ ID NO. 374LM-1432.1From 216434to 217018Unknown
SEQ ID NO. 375LM-1433.2From 215721to 216344Unknown, similar to B. subtilis
general stress
protein
SEQ ID NO. 376LM-1434.2From 214486to 215427Unknown, similar to L-
lactate dehydrogenase
SEQ ID NO. 377LM-1435.1From 213735to 214409Unknown
SEQ ID NO. 378LM-1436.1From 213336to 213668Unknown, conserved
hypothetical protein
SEQ ID NO. 379LM-1437.1From 212824to 213285Unknown, hypothetical
lipoprotein
SEQ ID NO. 380LM-1438.1From 212366to 212689Unknown
SEQ ID NO. 381LM-1439.1From 211425to 212294Phospholipase C
SEQ ID NO. 382LM-1442.1From 209470to 211389Actin-assembly inducing
protein precursor
SEQ ID NO. 383LM-1444.1From 207739to 209271Zinc metalloproteinase
precursor
SEQ ID NO. 384LM-1445.1From 205819to 207408Listeriolysin O precursor
SEQ ID NO. 385LM-1446.1From 204624to 205577Phosphatidylinositol-specific
phospholipase c
SEQ ID NO. 386LM-1447.1From 203640to 204353Listeriolysin positive
regulatory protein
SEQ ID NO. 387LM-1448.1From 202641to 203597phosphoribosyl
pyrophosphate synthetase
SEQ ID NO. 388LM-1449.3From 201217to 202590Unknown, highly similar to
UDP-N-acetylglucosamine
pyrophosphorylase
SEQ ID NO. 389LM-1451.1From 1891236to 1891880Unknown, weakly similar to
thiamin pyrophosphokinase
SEQ ID NO. 390LM-1453.1From 1891944to 1892600Unknown, similar to
ribulose-5-phosphate 3-
epimerase
SEQ ID NO. 391LM-1454.1From 1892603to 1893478Unknown, similar to
unknown proteins
SEQ ID NO. 392LM-1457.1From 1893497to 1895464Unknown, similar to putative
serine/threonine-specific
protein kinase
SEQ ID NO. 393LM-1458.1From 1895461to 1896219Unknown, similar to putative
phosphoprotein
phosphatase
SEQ ID NO. 394LM-1459.1From 1896242to 1897576Unknown, similar to RNA-
binding Sun protein
SEQ ID NO. 395LM-1461.1From 1897577to 1898515Unknown, similar to
methionyl-tRNA
formyltransferase
SEQ ID NO. 396LM-1463.1From 1898529to 1900922Unknown, similar to
primosomal replication
factor Y
SEQ ID NO. 397LM-1465.1From 1900927to 1902126Unknown, similar to
pantothenate metabolism
flavoprotein homolog
SEQ ID NO. 398LM-1466.2From 1902484to 1903101Unknown, similar to
guanylate kinases
SEQ ID NO. 399LM-1467.2From 1903120to 1903995Unknown, simlar to
conserved hypothetical
protein
SEQ ID NO. 400LM-1468.1From 1904152to 1905864Unknown, similar to
fibronectin binding proteins
SEQ ID NO. 401LM-1469.1From 1905952to 1906551Unknown, similar to
conserved hypotheticl
proteins
SEQ ID NO. 402LM-1470.1From 1906592to 1907221Unknown, highly similar to
orotate
phosphoribosyltransferases
SEQ ID NO. 403LM-1471.1From 1907218to 1907919Unknown, highly similar to
orotidine 5-phosphate
decarboxylases
SEQ ID NO. 404LM-1472.1From 1907916to 1908830Unknown, highly similar to
dihydroorotase
dehydrogenase
SEQ ID NO. 405LM-1473.1From 1908827to 1909591Unknown, highly similar to
dihydroorotate
dehydrogenase (electron
transfer subunit)
SEQ ID NO. 406LM-1474.1From 1909614to 1912826Unknown, highly similar to
carbamoyl-phosphate
synthetase (catalytic
subunit)
SEQ ID NO. 407LM-1476.1From 1912819to 1913910Unknown, highly similar to
carbamoyl-phosphate
synthetase (glutaminase
subunit)
SEQ ID NO. 408LM-1477.1From 1913907to 1915187Unknown, highly similar to
dihydroorotase
SEQ ID NO. 409LM-1478.1From 1915175to 1916086Unknown, highly similar to
aspartate
carbamoyltransferase
SEQ ID NO. 410LM-1480.1From 1916166to 1917452Unknown, highly similar to
uracil permease
SEQ ID NO. 411LM-1481.2From 1917581to 1918132Unknown, highly similar to
pyrimidine operon
regulatory protein
SEQ ID NO. 412LM-1482.2From 1918363to 1918593unknown
SEQ ID NO. 413LM-1483.2From 645397to 645858Unknown, similar to
transcription regulator MarR
family
SEQ ID NO. 414LM-1484.1From 645878to 647590Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 415LM-1486.1From 647587to 649404Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 416LM-1487.1From 649520to 649819Unknown, similar to E. coli
phage shock protein E
SEQ ID NO. 417LM-1491.1From 651794to 652420Unknown, similar to acyl-
carrier protein
phosphodiesterase and
NAD(P)H dehydrogenase
SEQ ID NO. 418LM-1492.1From 652589to 653044Unknown, similar to
transcription regulator MarR
family
SEQ ID NO. 419LM-1493.1From 653041to 653982Unknown, similar to
oxidoreductase
SEQ ID NO. 420LM-1494.1From 654078to 654611Unknown, conserved
hypothetical protein
SEQ ID NO. 421LM-1495.1From 654608to 654850Unknown
SEQ ID NO. 422LM-1498.1From 654957to 656708Unknown, C-terminal
domain similar to
glycerophosphoryl diester
phosphodiesterase
SEQ ID NO. 423LM-1499.1From 656749to 657243Unknown
SEQ ID NO. 424LM-15.1From 2720497to 2721489Unknown, similar to
Phosphotriesterase
SEQ ID NO. 425LM-1500.1From 657323to 658465Unknown, similar to protein
kinase
SEQ ID NO. 426LM-1501.1From 658572to 659027Unknown
SEQ ID NO. 427LM-1502.1From 659083to 659475Unknown
SEQ ID NO. 428LM-1504.1From 659572to 660312Unknown, conserved
hypothetical protein
SEQ ID NO. 429LM-1506.1From 660398to 660676Unknown, hypothetical
SEQ ID NO. 430LM-1507.1From 660692to 660958Unknown
SEQ ID NO. 431LM-1508.1From 660996to 661439Unknown, similar to
unknown proteins
SEQ ID NO. 432LM-1510.3From 661470to 662171Unknown
SEQ ID NO. 433LM-1511.3From 662257to 663936Unknown, similar to
unknown protein
SEQ ID NO. 434LM-1516.1From 669423to 669950Unknown
SEQ ID NO. 435LM-1518.1From 670309to 672339Unknown, similar to
transcription antiterminator
BglG family
SEQ ID NO. 436LM-1519.1From 672341to 672793Unknown, similar to PTS
system, fructose-specific IIA
component
SEQ ID NO. 437LM-1520.1From 672794to 673855Unknown, similar to PTS
system, fructose-specific IIC
component
SEQ ID NO. 438LM-1521.1From 673870to 674178Unknown, similar to PTS
system, fructose-specific IIB
component
SEQ ID NO. 439LM-1523.1From 674205to 675473Unknown, similar to an E. coli
putative tagatose 6-
phosphate kinase
SEQ ID NO. 440LM-1524.1From 675563to 676267Unknown
SEQ ID NO. 441LM-1525.1From 676372to 676788Unknown, similar to
unknown proteins
SEQ ID NO. 442LM-1527.1From 676802to 677395Unknown, weakly similar to
methyltransferase
SEQ ID NO. 443LM-1528.1From 678494to 679123Unknown
SEQ ID NO. 444LM-1529.1From 679763to 680296Unknown, similar to a
transcription regulator
(surface protein PAg
negative regulator par)
SEQ ID NO. 445LM-153.1From 1216820to 1218178Unknown, similar to
cobyrinic acid a,c-diamide
synthase
SEQ ID NO. 446LM-1530.1From 680360to 681277Unknown, similar to
oxidoreductase
SEQ ID NO. 447LM-1532.1From 681543to 683423Unknown, similar to heavy
metal-transporting ATPase
SEQ ID NO. 448LM-1533.1From 683519to 684247Unknown
SEQ ID NO. 449LM-1535.1From 684390to 685040Unknown, similar to putative
transaldolase
SEQ ID NO. 450LM-1536.3From 685083to 686903Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 451LM-1537.2From 468169to 469437Unknown, weakly similar to
a module of peptide
synthetase
SEQ ID NO. 452LM-1538.1From 467519to 468136unknown
SEQ ID NO. 453LM-154.1From 1215993to 1216487unknown
SEQ ID NO. 454LM-1540.1From 466517to 467362Unknown, conserved
hypothetical protein
SEQ ID NO. 455LM-1541.1From 465934to 466419Unknown, similar to
unknown proteins
SEQ ID NO. 456LM-1545.1From 459681to 465722Unknown, putative
peptidoglycan linked protein
(LPXTG motif)
SEQ ID NO. 457LM-1547.1From 457021to 458913Internalin B
SEQ ID NO. 458LM-1549.1From 454534to 456936Internalin A
SEQ ID NO. 459LM-155.1From 1215128to 1215679Unknown, similar to
transcriptional regulator
SEQ ID NO. 460LM-1551.1From 453107to 453853Unknown, similar to
oxidoreductase
SEQ ID NO. 461LM-1553.1From 452524to 453093Unknown, similar to
acetyltransferase
SEQ ID NO. 462LM-1554.1From 451531to 452406Unknown, similar to
transcriptional regulator
(LysR family)
SEQ ID NO. 463LM-1557.1From 448926to 451508Unknown, similar to sugar
hydrolase
SEQ ID NO. 464LM-1558.1From 447804to 448910Unknown, similar to PTS
fructose-specific enzyme IIC
component
SEQ ID NO. 465LM-156.1From 1213636to 1215087unknown
SEQ ID NO. 466LM-1560.1From 447471to 447791Unknown, similar to PTS
fructose-specific enzyme IIB
component
SEQ ID NO. 467LM-1561.1From 447010to 447474Unknown, similar to PTS
fructose-specific enzyme IIA
component
SEQ ID NO. 468LM-1564.1From 445049to 447010Unknown, similar to
transcription antiterminator
BglG family
SEQ ID NO. 469LM-1566.1From 444031to 444888Unknown, similar to
Staphylococcus xylosus
glucose uptake protein
SEQ ID NO. 470LM-1568.1From 443303to 443851Unknown, similar to RNA
polymerase ECF-type
sigma factor
SEQ ID NO. 471LM-1569.1From 442869to 443300Unknown, similar to
unknown protein
SEQ ID NO. 472LM-157.1From 1212983to 1213426Similar to ethanolamine
utilization protein EutQ
SEQ ID NO. 473LM-1570.1From 441622to 442872Unknown, similar to rod
shape-determining protein
RodA
SEQ ID NO. 474LM-1571.1From 440758to 441570Unknown, conserved
hypothetical protein
SEQ ID NO. 475LM-1572.1From 440056to 440610Unknown, similar to
unknown protein
SEQ ID NO. 476LM-1573.1From 439677to 439976Unknown
SEQ ID NO. 477LM-1574.1From 439214to 439624Unknown
SEQ ID NO. 478LM-1575.2From 437482to 438882Unknown, similar to endo-
1,4-beta-xylanase
SEQ ID NO. 479LM-1577.4From 436379to 437188Unknown, conserved
membrane protein
SEQ ID NO. 480LM-1578.2From 977051to 9978856Unknown, similar to heat
shock protein HtpG
SEQ ID NO. 481LM-1579.1From 976065to 977039Unknown
SEQ ID NO. 482LM-158.1From 1211869to 1212990Unknown, similar to
ethanolamine utilization
protein EutH - Escherichia
coli
SEQ ID NO. 483LM-1580.1From 975071to 976033Unknown
SEQ ID NO. 484LM-1581.1From 974229to 974975Unknown
SEQ ID NO. 485LM-1582.1From 973758to 974216Unknown, similar to protein-
tyrosine-phosphatase
SEQ ID NO. 486LM-1583.1From 972719to 973459Unknown, similar to
Nitroflavin-reductase
SEQ ID NO. 487LM-1584.1From 972197to 972706Unknown, similar to B. subtilis
CspR protein, rRNA
methylase homolog
SEQ ID NO. 488LM-1585.1From 971037to 972176Unknown, similar to B. subtilis
YhbA protein
SEQ ID NO. 489LM-1586.1From 969549to 970496Unknown, similar to B. subtilis
YkcC protein
SEQ ID NO. 490LM-1587.1From 968819to 969424Unknown, conserved
hypothetical protein
SEQ ID NO. 491LM-1589.1From 967784to 968779Unknown, similar to lipoate
protein ligase A
SEQ ID NO. 492LM-159.1From 1211308to 1211853Unknown, similar to
Salmonella enterica PduT
protein
SEQ ID NO. 493LM-1590.1From 967029to 967760Unknown, conserved
hypothetical protein, similar
to B. subtilis Yhfl protein
SEQ ID NO. 494LM-1592.1From 966245to 966913Unknown, similar to sortase
SEQ ID NO. 495LM-1593.1From 965513to 966136Unknown, similar to 3-
methyladenine DNA
glycosylase
SEQ ID NO. 496LM-1594.1From 963294to 965255Unknown, hypothetical
transmembrane protein
SEQ ID NO. 497LM-1596.1From 962607to 963263Unknown, similar to
transcription regulator
(TetR/AcrR family)
SEQ ID NO. 498LM-1597.1From 961469to 962584Unknown, putative
membrane protein
SEQ ID NO. 499LM-16.1From 2721559to 2722857Unknown, similar to
membrane proteins
SEQ ID NO. 500LM-160.1From 1211046to 1211315Unknown, similar to carbondioxide
concentrating
mechanism protein
SEQ ID NO. 501LM-1600.1From 960748to 961221Unknown, similar to ABC
transporter, ATP-binding
protein (N-terminal part)
SEQ ID NO. 502LM-1602.1From 959711to 960631Unknown, similar to
pantothenate kinase
SEQ ID NO. 503LM-1603.1From 958783to 959622Unknown, similar to B. subtilis
YcgQ protein
SEQ ID NO. 504LM-1605.1From 957724to 958764Unknown, similar to B. subtilis
YcgR protein
SEQ ID NO. 505LM-1606.1From 956031to 957602Unknown, similar to ABC
transporter ATP-binding
protein (antibiotic
resistance)
SEQ ID NO. 506LM-1607.1From 953842to 955740Unknown, similar to
transcription antiterminator
BglG family
SEQ ID NO. 507LM-1608.1From 952318to 953769Unknown, similar to beta-
glucosidase
SEQ ID NO. 508LM-1609.1From 951956to 952306Unknown, similar to
phosphotransferase system
enzyme IIA
SEQ ID NO. 509LM-161.1From 1210459to 1211031unknown
SEQ ID NO. 510LM-1610.1From 950673to 951941Unknown, similar to
phosphotransferase system
enzyme IIC
SEQ ID NO. 511LM-1611.1From 950363to 950668Unknown, similar to PTS
system, IIB component
SEQ ID NO. 512LM-1613.1From 948754to 950220Unknown, similar to
succinate semialdehyde
dehydrogenase
SEQ ID NO. 513LM-1614.1From 947718to 948530Unknown, similar to
transporters (formate)
SEQ ID NO. 514LM-1615.1From 947101to 947604Unknown
SEQ ID NO. 515LM-1617.1From 945915to 947018Unknown
SEQ ID NO. 516LM-1618.1From 945520to 945912Unknown, similar to
transcription regulator, GntR
family
SEQ ID NO. 517LM-1619.1From 944243to 945379Unknown, similar to
membrane proteins
SEQ ID NO. 518LM-162.1From 1209821to 1210462Unknown, similar to
Salmonella enterica PduL
protein
SEQ ID NO. 519LM-1620.1From 943526to 944200Unknown, similar to
phosphoglycerate mutase
SEQ ID NO. 520LM-1622.1From 942148to 943497Unknown, similar to
glutathione Reductase
SEQ ID NO. 521LM-1623.1From 941615to 942112unknown
SEQ ID NO. 522LM-1624.2From 940810to 941484Unknown
SEQ ID NO. 523LM-1625.2From 469575to 470078Unknown
SEQ ID NO. 524LM-1626.1From 470122to 472158Unknown, similar to
penicillin-binding protein (D-
alanyl-D-alanine
carboxypeptidase)
SEQ ID NO. 525LM-1627.1From 472330to 472644Unknown
SEQ ID NO. 526LM-1629.1From 472781to 473710Unknown, similar to B. subtilis
transcription
regulator LytR
SEQ ID NO. 527LM-163.1From 1209053to 1209808Unknown, similar to
cobalamin adenosyl
transferase
SEQ ID NO. 528LM-1631.1From 473936to 476716Unknown, conserved
hypothetical protein
SEQ ID NO. 529LM-1632.1From 476960to 478447Unknown, similar to
transcription regulator
SEQ ID NO. 530LM-1634.1From 478721to 479710Unknown, similar to
penicillin acylase and to
conjugated bile acid
hydrolase
SEQ ID NO. 531LM-1635.1From 479765to 481153Unknown, similar to
glutamate decarboxylase
SEQ ID NO. 532LM-1636.1From 481250to 482701Unknown, similar to amino
acid antiporter
SEQ ID NO. 533LM-1637.1From 483166to 483891Unknown
SEQ ID NO. 534LM-1638.1From 483934to 484599Unknown, similar to
unknown proteins
SEQ ID NO. 535LM-1639.1From 484620to 485375Unknown
SEQ ID NO. 536LM-164.1From 1208603to 1208887Unknown, similar to putative
carboxysome structural
protein
SEQ ID NO. 537LM-1644.1From 485493to 487652Unknown, similar to
unknown proteins
SEQ ID NO. 538LM-1645.1From 487649to 488797Unknown, conserved
hypothetical proteins
SEQ ID NO. 539LM-1646.1From 488802to 489749Unknown, conserved
hypothetical protein similar
to B. subtilis YeaC
SEQ ID NO. 540LM-1647.1From 489905to 491500Unknown, similar to
unknown proteins
SEQ ID NO. 541LM-1650.1From 491634to 492917Unknown, similar to
permeases
SEQ ID NO. 542LM-1652.1From 492918to 494018Unknown, similar to
unknown proteins
SEQ ID NO. 543LM-1653.1From 494011to 495561Unknown, similar to
hydantoinase
SEQ ID NO. 544LM-1655.1From 496381to 497919Unknown, similar to
transcription regulator (VirR
From Streptococcus
pyogenes)
SEQ ID NO. 545LM-1656.1From 498171to 500240Unknown, putative
membrane associated
lipoprotein
SEQ ID NO. 546LM-1658.1From 500306to 500779Unknown
SEQ ID NO. 547LM-1659.3From 500799to 501284Unknown
SEQ ID NO. 548LM-166.1From 1207111to 1208571Unknown, similar to
acetaldehyde
dehydrogenase/alcohol
dehydrogenase
SEQ ID NO. 549LM-1661.4From 2632907to 2633761Unknown, similar to
fructose-1,6-bisphosphate
aldolase
SEQ ID NO. 550LM-1662.2From 2631303to 2632586Unknown, weakly similar to
human N-
acetylglucosaminyl-
phosphatidylinositol
biosynthetic protein
SEQ ID NO. 551LM-1663.1From 2630285to 2631310Unknown, similar to
galactosyltransferase
SEQ ID NO. 552LM-1664.1From 2629208to 2630278Unknown, conserved
hypothetical protein
SEQ ID NO. 553LM-1665.1From 2627816to 2629087Unknown, highly similar to
UDP-N-acetylglucosamine
1-carboxyvinyltransferase
SEQ ID NO. 554LM-1666.2From 2626442to 2627713Unknown, highly similar to
transcription terminator
factor rho
SEQ ID NO. 555LM-1668.2From 2625414to 2626361Unknown, similar to glycosyl
transferases
SEQ ID NO. 556LM-1669.1From 2624980to 2625417wall teichoic acid
glycosylation protein GtcA
SEQ ID NO. 557LM-167.1From 1206590to 1207111Unknown, similar to putative
carboxysome structural
protein
SEQ ID NO. 558LM-1671.1From 2623125to 2624411Unknown, highly similar to
homoserine dehydrogenase
SEQ ID NO. 559LM-1674.1From 2622067to 2623122Unknown, highly similar to
threonine synthase
SEQ ID NO. 560LM-1676.1From 2621201to 2622067Unknown, highly similar to
homoserine kinase
SEQ ID NO. 561LM-1678.1From 2620514to 2621089Unknown, similar to
thymidine kinase
SEQ ID NO. 562LM-1679.1From 2619415to 2620491Unknown, highly similar to
peptide chain release
factor 1
SEQ ID NO. 563LM-168.1From 1205922to 1206575Unknown, similar to putative
carboxysome structural
protein (eutL)
SEQ ID NO. 564LM-1681.1From 2618577to 2619428Unknown, similar to
protoporphyrinogen oxidase
SEQ ID NO. 565LM-1682.1From 2617239to 2618276Unknown, similar to yeast
translation initiation protein
SEQ ID NO. 566LM-1683.1From 2616835to 2617242Unknown, similar to
phosphatases
SEQ ID NO. 567LM-1684.1From 2615458to 2616699Unknown, highly similar to
glycine
hydroxymethyltransferase
SEQ ID NO. 568LM-1685.1From 2614692to 2615321Unknown, highly similar to
uracil
phosphoribosyltransferase
SEQ ID NO. 569LM-1686.1From 2613435to 2614574Unknown, similar to UDP-N-
acetylglucosamine 2-
epimerase
SEQ ID NO. 570LM-1691.1From 2612205to 2612603unknown, highly similar to
ATP synthase subunit i
SEQ ID NO. 571LM-1693.1From 2611482to 2612198unknown, highly similar to
H+-transporting ATP
synthase chain a
SEQ ID NO. 572LM-1694.1From 2610592to 2611104unknown, highly similar to
H+-transporting ATP
synthase chain b
SEQ ID NO. 573LM-1695.1From 2610056to 2610595unknown, highly similar to
H+-transporting ATP
synthase chain delta
SEQ ID NO. 574LM-1697.1From 2608515to 2610029unknown, highly similar to
H+-transporting ATP
synthase chain alpha
SEQ ID NO. 575LM-1698.1From 2607605to 2608477unknown, highly similar to
H+-transporting ATP
synthase chain gamma
SEQ ID NO. 576LM-170.1From 1205018to 1205899Unknown, similar to
ethanolamine ammonia-
lyase, light chain
SEQ ID NO. 577LM-1700.1From 2606123to 2607544unknown, highly similar to
H+-transporting ATP
synthase chain beta
SEQ ID NO. 578LM-1701.1From 2605697to 2606101unknown, highly similar to
H+-transporting ATP
synthase chain epsilon
SEQ ID NO. 579LM-1702.1From 2605337to 2605573Unknown, similar to B. subtilis
YwzB protein
SEQ ID NO. 580LM-1704.1From 2603863to 2605155unknown, UDP-N-
acetylglucosamine 1-
carboxyvinyltransferase
SEQ ID NO. 581LM-1705.2From 2602705to 2603700unknown, similar to MreB-
like protein
SEQ ID NO. 582LM-1706.2From 521863to 522591Unknown
SEQ ID NO. 583LM-1707.1From 522628to 523521Unknown, similar to
transcriptional regulator
(LysR family)
SEQ ID NO. 584LM-1708.1From 523719to 525713Unknown, similar to
NADH: flavin oxidoreductase
SEQ ID NO. 585LM-171.1From 1203634to 1204998Unknown, similar to
ethanolamine ammonia-
lyase, heavy chain
SEQ ID NO. 586LM-1710.1From 525813to 526688Unknown, similar to
shikimate 5-dehydrogenase
SEQ ID NO. 587LM-1712.1From 526754to 527512Unknown, similar to 3-
dehydroquinate
dehydratase
SEQ ID NO. 588LM-1713.1From 527580to 528488Unknown, similar to
transcriptional regulator
(LysR family)
SEQ ID NO. 589LM-1714.1From 528563to 530323Unknown, similar to acylase
SEQ ID NO. 590LM-1715.1From 530515to 531108Unknown, weakly similar to
esterase
SEQ ID NO. 591LM-1716.1From 531294to 532253Unknown, similar to
transmembrane protein
SEQ ID NO. 592LM-1717.1From 532328to 532552Unknown, similar to B. subtilis
YnzC protein
SEQ ID NO. 593LM-1718.2From 532603to 534111Unknown, similar to sugar
transferase
SEQ ID NO. 594LM-1719.1From 534307to 534756Unknown, similar to ribose
5-phosphate isomerase
SEQ ID NO. 595LM-1721.1From 534753to 535421Unknown, similar to
ribulose-5-phosphate 3
epimerase
SEQ ID NO. 596LM-1722.1From 535428to 536078Unknown, similar to
transaldolase
SEQ ID NO. 597LM-1723.1From 536187to 538247Unknown, similar to
transcription antiterminator
BglG family
SEQ ID NO. 598LM-1724.1From 538251to 538853Unknown, similar to putative
sugar-phosphate isomerase
SEQ ID NO. 599LM-1725.1From 538881to 539348Unknown, similar to PTS
fructose-specific enzyme IIA
component
SEQ ID NO. 600LM-1726.1From 539365to 539763Unknown
SEQ ID NO. 601LM-1727.1From 539774to 540424Unknown, similar to
ribulose-5-phosphate 3-
epimerase
SEQ ID NO. 602LM-1728.1From 540421to 541467Unknown, similar to polyol
(sorbitol) dehydrogenase
SEQ ID NO. 603LM-1729.1From 541483to 541776Unknown, similar to PTS
system, Galactitol-specific
IIB component
SEQ ID NO. 604LM-173.1From 1202171to 1203592Unknown, similar to
ethanolamine utilization
protein EutA (putative
chaperonin)
SEQ ID NO. 605LM-1731.1From 541791to 543062Unknown, similar to PTS
system, Galactitol-specific
IIC component
SEQ ID NO. 606LM-1733.1From 543202to 544137Unknown, similar to
phosphoribosyl
pyrophosphate synthetase
SEQ ID NO. 607LM-1734.1From 544923to 545501Unknown
SEQ ID NO. 608LM-1735.1From 545609to 546289Unknown, conserved
hypothetical protein
SEQ ID NO. 609LM-1736.1From 546314to 546673Unknown
SEQ ID NO. 610LM-1737.1From 546680to 547138Unknown, weakly similar to
transcription regulator
SEQ ID NO. 611LM-174.1From 1200625to 1202082unknown, similar to sensory
transduction histidine kinase
SEQ ID NO. 612LM-1740.1From 549438to 549869Unknown, conserved
hypothetical protein
SEQ ID NO. 613LM-1741.1From 549916to 551346Unknown, similar to Bacillus
anthracis encapsulation
protein CapA
SEQ ID NO. 614LM-1742.2From 551467to 552282Unknown, similar to
phosphoglycerate mutase
SEQ ID NO. 615LM-175.1From 1200051to 1200632unknown, similar to two-
component response
regulator
SEQ ID NO. 616LM-1755.2From 2678302to 2679330Unknown, similar to ATP
binding proteins
SEQ ID NO. 617LM-1757.1From 2681167to 2682018Unknown, similar to
oxidoreductase, aldo/keto
reductase family
SEQ ID NO. 618LM-1758.1From 2682040to 2682462Unknown, similar to
transcription regulators
(MerR family)
SEQ ID NO. 619LM-1759.1From 2682584to 2682943Unknown
SEQ ID NO. 620LM-176.1From 1198637to 1199776unknown, similar to
NADPH-dependent butanol
dehydrogenase
SEQ ID NO. 621LM-1760.1From 2683187to 2684056Unknown, similar to
unknown proteins
SEQ ID NO. 622LM-1761.1From 2684251to 2684643ribosomal protein S9
SEQ ID NO. 623LM-1762.1From 2684665to 2685102ribosomal protein L13
SEQ ID NO. 624LM-1763.1From 2685297to 2686043unknown, highly similar to
pseudouridylate synthase I
SEQ ID NO. 625LM-1766.1From 2686049to 2686846Unknown, highly similar to
B. subtilis YbaF protein
SEQ ID NO. 626LM-1769.1From 2686849to 2687715Uknown similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 627LM-1771.1From 2687691to 2688530Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 628LM-1772.1From 2688661to 2689323Unknown, conserved
hypothetical protein
SEQ ID NO. 629LM-1773.1From 2689442to 2690332Unknown
SEQ ID NO. 630LM-1774.1From 2690371to 2691465Unknown
SEQ ID NO. 631LM-1775.2From 2691673to 2692080ribosomal protein L17
SEQ ID NO. 632LM-1776.3From 1441313to 1441540Unknown
SEQ ID NO. 633LM-1777.3From 1441581to 1442114Unknown, modulates DNA
topology
SEQ ID NO. 634LM-1779.1From 1443873to 1445042Unknown, similar to Acetyl-
CoA: acetyltransferase
SEQ ID NO. 635LM-1780.2From 1445192to 1446358Unknown, similar to
hydroxy-3-methylglutaryl
coenzyme A synthase
SEQ ID NO. 636LM-1781.2From 1446383to 1446982Unknown
SEQ ID NO. 637LM-1782.1From 1447052to 1448296Unknown, highly similar to
B. subtilis YxiO protein
SEQ ID NO. 638LM-1783.1From 1448315to 1449583Unknown, weakly similar to
pyrophosphatase
SEQ ID NO. 639LM-1784.1From 1449647to 1450684Unknown, conserved
hypothetical protein
SEQ ID NO. 640LM-1785.1From 1450701to 1451597Unknown, weakly similar to
UDP-N-acetylglucosaminyl-
3-enolpyruvate reductase
SEQ ID NO. 641LM-1787.1From 1451813to 1452799Unknown, similar to glycine
betaine/carnitine/choline
ABC transporter
SEQ ID NO. 642LM-1789.1From 1452796to 1454310Unknown, similar to glycine
betaine/carnitine/choline
ABC transporter (membrane
protein)
SEQ ID NO. 643LM-179.1From 1197172to 1198047unknown, similar to
Salmonella enterica PduX
protein
SEQ ID NO. 644LM-1790.1From 1454347to 1455177Unknown
SEQ ID NO. 645LM-1791.2From 1455316to 1456662Unknown, similar to metal
ion transport proteins
SEQ ID NO. 646LM-1794.2From 1456775to 1457446Unknown, similar to
betaine/carnitine/choline
ABC transporter
(membrane p)
SEQ ID NO. 647LM-1795.1From 1457461to 1458387Unknown, similar to glycine
betaine/carnitine/choline
ABC transporter
(osmoprotectant-binding
protein)
SEQ ID NO. 648LM-1796.1From 1458389to 1459045Unknown, similar to glycine
betaine/carnitine/choline
ABC transporter (membrane
protein)
SEQ ID NO. 649LM-1798.1From 1459049to 1460242Unknown, similar to glycine
betaine/carnitine/choline
ABC transporter (ATP-
binding protein)
SEQ ID NO. 650LM-1799.1From 1460534to 1461094Unknown, similar to
unknown proteins
SEQ ID NO. 651LM-18.1From 2722884to 2723156Unknown, similar to
hypothetical PTS enzyme
IIB component
SEQ ID NO. 652LM-1800.2From 1461343to 1461726Unknown, similar to
unknown proteins
SEQ ID NO. 653LM-1801.2From 1461866to 1463467Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 654LM-1802.1From 1463506to 1464156Unknown
SEQ ID NO. 655LM-1803.1From 1464209to 1465549Unknown, similar to
glutathione reductase
SEQ ID NO. 656LM-1805.1From 1465638to 1467305Unknown, similar to
unknown proteins
SEQ ID NO. 657LM-1806.1From 1467325to 1468206Unknown, similar to
dihydrodipicolinate synthase
SEQ ID NO. 658LM-1807.1From 1468221to 1469432Unknown, similar to
aspartokinase I (alpha and
beta subunits)
SEQ ID NO. 659LM-1809.1From 1469444to 1470487Unknown, similar to
aspartate-semialdehyde
dehydrogenase
SEQ ID NO. 660LM-1810.1From 1470685to 1472850unknown, similar to
penicillin-binding protein
SEQ ID NO. 661LM-1811.1From 1472980to 1473588superoxide dismutase
SEQ ID NO. 662LM-1813.1From 1473889to 1474689unknown, similar to
unknown proteins
SEQ ID NO. 663LM-1814.1From 1474802to 1475908Unknown, similar to putative
peptidoglycan acetylation
protein
SEQ ID NO. 664LM-1815.1From 1475944to 1476651Unknown, similar to
transport proteins
SEQ ID NO. 665LM-1816.1From 1476642to 1476968Unknown
SEQ ID NO. 666LM-1817.1From 1477050to 1477934Unknown, similar to protein
secretion PrsA (post-
translocation molecular
chaperone)
SEQ ID NO. 667LM-1818.1From 1478071to 1478496transcriptional regulator
ZurR (ferric uptake
regulation)
SEQ ID NO. 668LM-182.1From 1194890to 1196083Unknown, similar to acetate
kinase
SEQ ID NO. 669LM-1820.1From 1478477to 1479355metal transport protein
SEQ ID NO. 670LM-1821.1From 1479330to 1480103ABC transporter
SEQ ID NO. 671LM-1823.1From 1480246to 1481172Unknown, conserved
hypothetical protein
SEQ ID NO. 672LM-1825.1From 1481188to 1482081unknown, similar to
endonuclease IV
SEQ ID NO. 673LM-1826.1From 1482096to 1483403Unknown, similar to ATP-
dependent RNA helicase,
DEAD-box family (deaD)
SEQ ID NO. 674LM-1827.1From 1483549to 1484544Unknown, similar to E. coli
LytB protein
SEQ ID NO. 675LM-1829.1From 1484590to 1485711Unknown, conserved
hypothetical protein
SEQ ID NO. 676LM-183.1From 1194120to 1194824Unknown, similar to glycerol
uptake facilitator protein
SEQ ID NO. 677LM-1830.1From 1485708to 1486412Unknown, conserved
hypothetical protein
SEQ ID NO. 678LM-1833.2From 1486480to 1487604RNA polymerase sigma
factor RpoD
SEQ ID NO. 679LM-1834.2From 289562to 290521Unknown, similar to other
proteins
SEQ ID NO. 680LM-1835.1From 289139to 289555Unknown, similar to
transcriptional regulators
SEQ ID NO. 681LM-1836.1From 287853to 288992Unknown, similar to
succinyldiaminopimelate
desuccinylase
SEQ ID NO. 682LM-1838.1From 286219to 287718internalin E
SEQ ID NO. 683LM-184.1From 1192974to 1194092unknown, similar to
NADPH-dependent butanol
dehydrogenase
SEQ ID NO. 684LM-1840.1From 284365to 286011internalin H
SEQ ID NO. 685LM-1842.1From 282755to 284227internalin G
SEQ ID NO. 686LM-1843.1From 281021to 282481Unknown, similar to
phospho-beta-glucosidase
SEQ ID NO. 687LM-1844.1From 280410to 280955Unknown, similar to
unknown proteins
SEQ ID NO. 688LM-185.1From 1191549to 1192958Unknown, similar to
ethanolamine utilization
protein EutE
SEQ ID NO. 689LM-1852.1From 276728to 280333RNA polymerase (beta
subunit)
SEQ ID NO. 690LM-1854.1From 273003to 276557RNA polymerase (beta
subunit)
SEQ ID NO. 691LM-1856.2From 271324to 272502Unknown, similar to
unknown protein
SEQ ID NO. 692LM-1858.1From 269754to 270257Unknown, similar to
unknown protein
SEQ ID NO. 693LM-1859.2From 269067to 269729Unknown
SEQ ID NO. 694LM-1860.2From 1993372to 1994637Unknown, similar to
unknown proteins
SEQ ID NO. 695LM-1861.1From 1991047to 1993326Unknown, similar to
pyruvate formate-lyase
SEQ ID NO. 696LM-1862.1From 1989805to 1990812Unknown, similar to
peptidase
SEQ ID NO. 697LM-1863.1From 1988159to 1989802Unknown, similar to
malolactic enzyme (malate
dehydrogenase)
SEQ ID NO. 698LM-1864.1From 1987365to 1988048Unknown, similar to
unknown proteins
SEQ ID NO. 699LM-1865.1From 1986342to 1987346Unknown, similar to
unknown proteins
SEQ ID NO. 700LM-1866.1From 1985218to 1986345Unknown, similar to
unknown proteins
(hypothetical sensory
transduction histidine
kinase)
SEQ ID NO. 701LM-1867.1From 1984056to 1985192Unknown, similar to
unknown proteins
(hypothetical sensory
transduction histidine
kinase)
SEQ ID NO. 702LM-1869.1From 1982630to 1983736Unknown, similar to
oxidoreductases
SEQ ID NO. 703LM-1871.1From 1981774to 1982640Unknown, similar to
unknown proteins
SEQ ID NO. 704LM-1872.1From 1981299to 1981634Unknown, similar to
unknown proteins
SEQ ID NO. 705LM-1874.1From 1980511to 1981302Unknown, similar to
dihydrodipicolinate
reductase
SEQ ID NO. 706LM-1875.1From 1980091to 1980495Unknown, similar to
methylglyoxal synthase
SEQ ID NO. 707LM-1876.1From 1978895to 1980076Unknown, similar to tRNA
CCA-adding enzyme
SEQ ID NO. 708LM-1877.1From 1977928to 1978905Unknown, similar to
transcriptional regulator and
biotin acetyl-CoA-
carboxylase synthetase
SEQ ID NO. 709LM-1879.1From 1977307to 1977780unknown; similar to
thioredoxin
SEQ ID NO. 710LM-188.1From 1190547to 1191542Unknown, hyghly similar to
Salmonella enterica PduO
protein
SEQ ID NO. 711LM-1880.1From 1976307to 1977140unknown, similar to
ketopantoate
hydroxymethyltransferases
SEQ ID NO. 712LM-1881.1From 1975446to 1976303unknown, similar to
panthotenate synthetases
SEQ ID NO. 713LM-1882.1From 1975059to 1975442unknown, similar to
aspartate 1-decarboxylases
SEQ ID NO. 714LM-1884.1From 1972176to 1974962unknown, similar to ATP-
dependent helicases
SEQ ID NO. 715LM-1885.1From 1971547to 1972131unknown, similar to
hypothetical proteins
SEQ ID NO. 716LM-1886.1From 1970343to 1971524unknown, similar to
aspartate
aminotransferases
SEQ ID NO. 717LM-1888.1From 1969037to 1970329unknown, similar to
sparaginyl-tRNA
synthetases
SEQ ID NO. 718LM-189.1From 1190269to 1190532Unknown, similar to carbondioxide
concentrating
mechanism protein
SEQ ID NO. 719LM-1891.1From 1968169to 1968888unknown, similar to
chromosome replication
initiation protein
SEQ ID NO. 720LM-1893.1From 1967499to 1968158unknown, probable
endonuclease III (DNA
repair)
SEQ ID NO. 721LM-1894.1From 1967015to 1967506Unknown
SEQ ID NO. 722LM-1896.1From 1964489to 1966972unknown, similar to
penicillin-binding protein 2A
SEQ ID NO. 723LM-1897.2From 1963852to 1964457unknown, similar to DNA
repair and homologous
recombination protein
SEQ ID NO. 724LM-1898.2From 2208576to 2210351Unknown, similar to
maltogenic amylase
SEQ ID NO. 725LM-19.1From 2723159to 2723593Unknown, similar to
mannitol-specific PTS
enzyme IIA component
SEQ ID NO. 726LM-190.1From 1189785to 1190264Unknown
SEQ ID NO. 727LM-1900.1From 2207103to 2208362Unknown, similar to
maltose/maltodextrin-
binding protein
SEQ ID NO. 728LM-1904.1From 2205709to 2207016Unknown, similar to
maltodextrin transport
system permease
SEQ ID NO. 729LM-1905.1From 2204857to 2205708Unknown, similar to
maltodextrin transport
system permease
SEQ ID NO. 730LM-1906.1From 2203996to 2204829Unknown, similar to
maltodextrose utilization
protein MalA
SEQ ID NO. 731LM-1907.1From 2201719to 2203980Unknown, similar to
maltosephosphorylase
SEQ ID NO. 732LM-1908.1From 2200723to 2201544Unknown, similar to
unknown proteins
SEQ ID NO. 733LM-1910.1From 2199365to 2200723Unknown, similar to
unknown proteins
SEQ ID NO. 734LM-1911.1From 2197763to 2199115Unknown, similar to
phosphoglucomutase
SEQ ID NO. 735LM-1912.1From 2197267to 2197728Unknown, similar to
unknown proteins
SEQ ID NO. 736LM-1913.1From 2196382to 2197206Unknown
SEQ ID NO. 737LM-1915.1From 2194399to 2196339Unknown, similar to ABC
transporter (permease)
SEQ ID NO. 738LM-1916.1From 2193645to 2194412Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 739LM-1918.2From 2192693to 2193448Unknown, similar to
unknown proteins
SEQ ID NO. 740LM-1919.2From 2191525to 2192256Unknown, similar to FMN-
containing NADPH-linked
nitro/flavin reductase
SEQ ID NO. 741LM-192.1From 1188949to 1189788Unknown, similar to
ethanolamine utilization
protein EutJ
SEQ ID NO. 742LM-1920.1From 2190489to 2191445Unknown, similar to
mannnose-6 phospate
isomelase
SEQ ID NO. 743LM-1921.1From 2189605to 2190402Unknown, similar to
hydrolase
SEQ ID NO. 744LM-1922.1From 2188430to 2189563Unknown, similar to N-
acetylglucosamine-6-
phosphate deacetylase
SEQ ID NO. 745LM-1923.1From 2187663to 2188424Unknown, similar to
transcriptional regulator
(DeoR family)
SEQ ID NO. 746LM-1924.1From 2186557to 2187411Unknown, similar to
unknown proteins
SEQ ID NO. 747LM-1927.1From 2184344to 2186338Unknown, similar to ferrous
iron transport protein B
SEQ ID NO. 748LM-1930.1From 2182823to 2183800Unknown, similar to
phosphotransacetylase
SEQ ID NO. 749LM-1932.1From 2182218to 2182784Unknown
SEQ ID NO. 750LM-1933.2From 2181329to 2182216Unknown, similar to a
protein required for
pyridoxine synthesis
SEQ ID NO. 751LM-1935.1From 1610206to 1611165unknown, highly similar to
6-phosphofructokinase
SEQ ID NO. 752LM-1936.1From 1611449to 1612405unknown, highly similar to
acetyl CoA carboxylase
(alpha subunit)
SEQ ID NO. 753LM-1938.1From 1612395to 1613279unknown, highly similar to
acetyl-CoA carboxylase
beta subunit
SEQ ID NO. 754LM-1942.1From 1613497to 1616823unknown, highly similar to
DNA polymerase III (alpha
subunit) DnaE
SEQ ID NO. 755LM-1944.1From 1616945to 1617880Unknown, similar to
unknown proteins
SEQ ID NO. 756LM-1945.1From 1617901to 1619214Unknown, similar to
unknown proteins
SEQ ID NO. 757LM-1946.1From 1619240to 1619926Unknown, similar to
unknown proteins
SEQ ID NO. 758LM-1947.1From 1620091to 1621188Unknown, similar to X-Pro
dipeptidase
SEQ ID NO. 759LM-1948.1From 1621230to 1622342Unknown, similar to alanine
dehydrogenase
SEQ ID NO. 760LM-1949.1From 1622583to 1623047Unknown, similar to
unknown protein
SEQ ID NO. 761LM-195.1From 1188293to 1188928Unknown, similar to
Salmonella enterica PduL
protein
SEQ ID NO. 762LM-1950.1From 1623099to 1624292unknown, highly similar to
acetate kinase
SEQ ID NO. 763LM-1951.1From 1624316to 1625314Unknown, weakly similar to
site specific DNA-
methyltransferase
SEQ ID NO. 764LM-1952.1From 1625505to 1626002Unknown, similar to thiol
peroxidases
SEQ ID NO. 765LM-1953.1From 1626064to 1626600Unknown, similar to
unknown proteins
SEQ ID NO. 766LM-1954.1From 1626620to 1627633Unknown, similar to
proteases
SEQ ID NO. 767LM-1955.1From 1627812to 1628615Unknown, similar to
unknown proteins
SEQ ID NO. 768LM-1957.1From 1628647to 1629591unknown, highly similar to
ornithine
carbamoyltransferase
SEQ ID NO. 769LM-1958.1From 1629594to 1630754unknown, highly similar to
N-acetylornithine
aminotransferase
SEQ ID NO. 770LM-1959.1From 1630751to 1631503unknown, highly similar to
N-acetylglutamate 5-
phosphotransferase
SEQ ID NO. 771LM-1960.1From 1631516to 1632712unknown, highly similar to
ornithine acetyltransferase
and amino-acid
acetyltransferases
SEQ ID NO. 772LM-1961.2From 1632728to 1633759unknown, similar to N-
acetylglutamate gamma-
semialdehyde
dehydrogenases
SEQ ID NO. 773LM-1962.2From 1633933to 1635144Unknown, similar to thiamin
biosynthesis protein Thil
SEQ ID NO. 774LM-1964.1From 1635146to 1636285Unknown, similar to iron-
sulfur cofactor synthesis
protein nifS
SEQ ID NO. 775LM-1966.2From 1636412to 1638127Unknown, similar to B. subtilis
negative regulator of
FtsZ ring formation (EzrA)
SEQ ID NO. 776LM-1968.2From 1308351to 1310318unknown, highly similar to
DNA gyrase-like protein
(subunit B)
SEQ ID NO. 777LM-197.1From 1187552to 1187989Unknown, similar to
Salmonella enterica PduK
protein
SEQ ID NO. 778LM-1970.1From 1310315to 1312774unknown, highly similar to
DNA gyrase-like protein
(subunit A)
SEQ ID NO. 779LM-1971.1From 1312857to 1313324Unknown, conserved
hypothetical protein
SEQ ID NO. 780LM-1976.1From 1317596to 1319464Unknown; similar to
acyltransferase (to B. subtilis
YrhL protein)
SEQ ID NO. 781LM-1978.1From 1319677to 1320375Unknown similar to
glycerophosphodiester
phosphodiesterase
SEQ ID NO. 782LM-1979.1From 1320608to 1322284Unknown, similar to glycerol
3 phosphate
dehydrogenase
SEQ ID NO. 783LM-198.1From 1187192to 1187539unknown, similar to diol
dehydratase-reactivating
factor small chain
SEQ ID NO. 784LM-1980.1From 1322410to 1323327Unknown, similar to tRNA
isopentenylpyrophosphate
transferase
SEQ ID NO. 785LM-1981.1From 1323450to 1323683Unknown, similar to host
factor-1 protein
SEQ ID NO. 786LM-1983.1From 1323794to 1325017Unknown, conserved
hypothetical protein similar
to B. subtilis YnbA protein
SEQ ID NO. 787LM-1984.1From 1325010to 1326236Unknown, similar to
aluminum resistance protein
and to B. subtilis YnbB
protein
SEQ ID NO. 788LM-1985.1From 1326440to 1326808Unknown, similar to
glutamine synthetase
repressor
SEQ ID NO. 789LM-1986.1From 1326879to 1328213unknown, highly similar to
glutamine synthetases
SEQ ID NO. 790LM-1988.1From 1328357to 1329652Unknown, similar to arsenic
efflux pump protein
SEQ ID NO. 791LM-1989.1From 1329696to 1330217Unknown, conserved
hypothetical protein
SEQ ID NO. 792LM-1991.2From 1330247to 1330861unknown, highly similar to
SOS response regulator
lexA, transcription repressor
protein
SEQ ID NO. 793LM-1992.2From 1331018to 1331347unknown, similar to B. subtilis
YneA protein
SEQ ID NO. 794LM-1995.1From 1331813to 1333807unknown, highly similar to
transketolase
SEQ ID NO. 795LM-1996.1From 1334028to 1334267unknown, highly similar to
B. subtilis YneF protein
SEQ ID NO. 796LM-1997.1From 1334318to 1335160Unknown
SEQ ID NO. 797LM-1998.1From 1335179to 1335910unknown, weakly similar to
arginine N-
methyltransferases
SEQ ID NO. 798LM-1999.1From 1335932to 1336438unknown, similar to E. coli
YbdM protein
SEQ ID NO. 799LM-200.1From 1185375to 1187195diol dehydratase-
reactivating factor large
subunit
SEQ ID NO. 800LM-2000.1From 1336448to 1337752unknown, similar to E. coli
YbdN protein
SEQ ID NO. 801LM-2001.1From 1337742to 1338947Unknown
SEQ ID NO. 802LM-2002.1From 1338925to 1339299Unknown
SEQ ID NO. 803LM-2003.1From 1339592to 1340320unknown, highly similar to
uridylate kinases
SEQ ID NO. 804LM-2004.1From 1340320to 1340877unknown, highly similar to
ribosome recycling factors
SEQ ID NO. 805LM-2005.2From 1341107to 1341865Unknown, similar to
undecaprenyl diphosphate
synthase
SEQ ID NO. 806LM-2006.2From 290593to 291228Unknown, similar to
phosphoglycerate mutase
SEQ ID NO. 807LM-2008.1From 291312to 293198Unknown, similar to
transporter
SEQ ID NO. 808LM-2009.1From 293212to 293841Unknown
SEQ ID NO. 809LM-201.1From 1184818to 1185330Unknown, similar to diol
dehydrase (diol
dehydratase) gamma
subunit (pddC)
SEQ ID NO. 810LM-2010.1From 293845to 295281unknown, highly similar to
phospho-beta-glucosidase
SEQ ID NO. 811LM-2011.1From 295401to 296213Unknown, conserved
hypothetical protein similar
to B. subtilis YxeH protein
SEQ ID NO. 812LM-2012.1From 296349to 296852Unknown
SEQ ID NO. 813LM-2013.1From 296889to 297551Unknown
SEQ ID NO. 814LM-2014.1From 297810to 298652Unknown, C-terminal part
similar to B. subtilis ComEC
protein
SEQ ID NO. 815LM-2015.1From 298669to 299490Unknown, conserved
hypothetical protein
SEQ ID NO. 816LM-2017.1From 299607to 300578Unknown, similar to
oxidoreductase
SEQ ID NO. 817LM-2018.1From 300617to 301717Unknown, similar to sugar
ABC transporter, ATP-
binding protein
SEQ ID NO. 818LM-2019.1From 302008to 304140Unknown, highly similar to
anaerobic ribonucleoside-
triphosphate reductase
SEQ ID NO. 819LM-202.1From 1184142to 1184801Unknown, similar to diol
dehydrase (diol
dehydratase) gamma
subunit
SEQ ID NO. 820LM-2020.1From 304133to 304684Unknown, highly similar to
anaerobic ribonucleotide
reductase activator protein
SEQ ID NO. 821LM-2022.1From 304942to 305613Unknown
SEQ ID NO. 822LM-2023.1From 305775to 306554Unknown, conserved
hypothetical protein
SEQ ID NO. 823LM-2024.1From 306644to 307306Unknown, similar to ABC
transporter permease
protein
SEQ ID NO. 824LM-2026.1From 307303to 308319Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 825LM-2027.2From 308334to 309155Unknown, putative
lipoprotein
SEQ ID NO. 826LM-203.1From 1182440to 1184104unknown, highly similar to
propanediol dehydratase,
alpha subunit
SEQ ID NO. 827LM-2035.1From 2671624to 2672511Unknown, similar to
transcription regulator
TetR/AcrR family
SEQ ID NO. 828LM-2037.1From 2670042to 2671523Unknown, similar to drug-
export proteins
SEQ ID NO. 829LM-2038.1From 2669152to 2669994Unknown, conserved
hypothetical proteins
SEQ ID NO. 830LM-2039.1From 2665666to 2668653Unknown, similar to formate
dehydrogenase alpha chain
SEQ ID NO. 831LM-204.1From 1181720to 1182421Unknown, similar to
Salmonella typhimurium
PduB protein
SEQ ID NO. 832LM-2040.1From 2665190to 2665666Unknown, similar to B. subtilis
YrhD protein
SEQ ID NO. 833LM-2042.1From 2664343to 2665128Unknown, similar to formate
dehydrogenase associated
protein
SEQ ID NO. 834LM-2043.1From 2663619to 2664296unknown, similar to two-
component response
regulator
SEQ ID NO. 835LM-2044.1From 2662243to 2663622Unknown, similar to
response regulator histidine
kinase
SEQ ID NO. 836LM-2045.1From 2661076to 2662164Unknown, conserved
hypothetical protein
SEQ ID NO. 837LM-2046.1From 2660408to 2661076Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 838LM-2047.1From 2659852to 2660145Unknown, conserved
hypothetical protein
SEQ ID NO. 839LM-2048.2From 2658966to 2659841Unknown
SEQ ID NO. 840LM-2049.3From 403743to 404198Unknown
SEQ ID NO. 841LM-205.1From 1181338to 1181625Unknown, similar to
Salmonella typhimurium
PduA protein
SEQ ID NO. 842LM-2050.1From 403279to 403725Unknown
SEQ ID NO. 843LM-2051.1From 402641to 403060Unknown
SEQ ID NO. 844LM-2053.1From 401619to 402539Unknown, similar to putative
transcription regulator
SEQ ID NO. 845LM-2054.1From 400997to 401299Unknown, similar to PTS
betaglucoside-specific
enzyme IIB component
SEQ ID NO. 846LM-2055.1From 399645to 400979Unknown, similar to PTS
betaglucoside-specific
enzyme IIC component
SEQ ID NO. 847LM-2056.1From 398210to 399652Unknown, similar to beta-
glucosidase
SEQ ID NO. 848LM-2058.1From 397340to 398053Unknown, similar to
transcription regulator
(GntR family)
SEQ ID NO. 849LM-2059.1From 396964to 397299Unknown, conserved
hypothetical protein
SEQ ID NO. 850LM-2060.1From 396208to 396927Unknown, conserved
hypothetical protein, highly
similar to B. subtilis Yeel
protein
SEQ ID NO. 851LM-2061.1From 395602to 396111Unknown, similar to
different proteins
SEQ ID NO. 852LM-2063.1From 394264to 395529Unknown, conserved
hypothetical protein similar
to B. subtilis YwbN protein
SEQ ID NO. 853LM-2064.1From 393086to 394246Unknown, conserved
hypothetical protein,
putative lippoprotein
SEQ ID NO. 854LM-2065.1From 391604to 393052Unknown, similar to
conserved hypothetical
protein
SEQ ID NO. 855LM-2066.1From 390529to 391473Unknown, similar to
transcription regulator
SEQ ID NO. 856LM-2067.1From 389813to 390430Unknown, similar to
Salmonella typhimurium
peptidase E
SEQ ID NO. 857LM-2068.1From 388807to 389541Unknown, similar to
conserved hypothetical
integral membrane protein
SEQ ID NO. 858LM-2069.1From 387697to 388467Unknown, similar to
transcriptional regulator
(DeoR family)
SEQ ID NO. 859LM-2070.1From 386780to 387640Unknown, similar to D-
fructose-1,6-biphosphate
aldolase
SEQ ID NO. 860LM-2072.1From 385386to 386780Unknown, similar to PTS
system, fructose-specific
enzyme IIBC component
SEQ ID NO. 861LM-2073.1From 384926to 385372Unknown, similar to PTS
system, enzyme IIA
component
SEQ ID NO. 862LM-2074.1From 383710to 384726Unknown, similar to
oxidoreductase
SEQ ID NO. 863LM-2076.1From 382016to 383536Unknown, similar to
Flavocytochrome C
Fumarate Reductase
chain A
SEQ ID NO. 864LM2077.1From 380253to 381779Unknown, similar to fatty-
acid-CoA ligase
SEQ ID NO. 865LM-2078.1From 379794to 380207Unknown, similar to
unknown proteins
SEQ ID NO. 866LM-2080.2From 378964to 379728unknown, highly similar to
regulatory proteins (DeoR
family)
SEQ ID NO. 867LM-2082.3From 2052233to 2052664Unknown, similar to
unknown proteins
SEQ ID NO. 868LM-2083.1From 2052816to 2053301Unknown, similar to
unknown proteins
SEQ ID NO. 869LM-2084.1From 2053274to 2053801Unknown, similar to
unknown proteins
SEQ ID NO. 870LM-2088.1From 2054493to 2056187Unknown, similar to
dihydroxy-acid dehydratase
SEQ ID NO. 871LM-2089.1From 2056205to 2057926Unknown, similar to
acetolactate synthase
(acetohydroxy-acid
synthase) (large subunit)
SEQ ID NO. 872LM-2091.1From 2057927to 2058418Unknown, similar to
acetolactate synthase
(acetohydroxy-acid
synthase) (small subunit)
SEQ ID NO. 873LM-2092.1From 2058516to 2059511Unknown, similar to ketol-
acid reductoisomerase
(acetohydroxy-acid
isomeroreductase)
SEQ ID NO. 874LM-2097.1From 2059669to 2061207Unknown, similar to 2-
isopropylmalate synthase
SEQ ID NO. 875LM-2098.1From 2061209to 2062261Unknown, similar to 3-
isopropylmalate
dehydrogenase
SEQ ID NO. 876LM-21.1From 2723786to 2725852Unknown, similar to
transcriptional
antiterminator
SEQ ID NO. 877LM-2100.1From 2062263to 2063651Unknown, similar to 3-
isopropylmalate
dehydratase (large subunit)
SEQ ID NO. 878LM-2101.1From 2063638to 2064219Unknown, similar to 3-
isopropylmalate
dehydratase (small subunit)
SEQ ID NO. 879LM-2102.1From 2064238to 2065506Unknown, similar to
threonine dehydratase
SEQ ID NO. 880LM-2104.1From 2065698to 2066417Unknown, similar to alpha-
acetolactate decarboxylase
SEQ ID NO. 881LM-2106.1From 2066455to 2067756Unknown, similar to
pyrimidine-nucleoside
phosphorylase
SEQ ID NO. 882LM-2107.3From 2067887to 2068897Unknown, similar to
transcription regulators
(Lacl family)
SEQ ID NO. 883LM-2109.2From 1677409to 1680009Unknown, similar to
Alcohol-acetaldehyde
dehydrogenase
SEQ ID NO. 884LM-2110.1From 1680132to 1680560Unknown, similar to
unknown proteins
SEQ ID NO. 885LM-2111.1From 1680644to 1681564Unknown, similar to similar
to ABC transporter (ATP-
binding protein)
SEQ ID NO. 886LM-2113.1From 1681561to 1682628Unknown, similar to
membrane proteins
SEQ ID NO. 887LM-2114.1From 1682664to 1683638Unknown, similar to
unknown proteins
SEQ ID NO. 888LM-2115.1From 1683635to 1684216Unknown, similar to dna-3-
methyladenine glycosidase
SEQ ID NO. 889LM-2116.1From 1684229to 1684462Unknown
SEQ ID NO. 890LM-2119.1From 1684563to 1687265unknown, highly similar to
aconitate hydratases
SEQ ID NO. 891LM-2120.1From 1687420to 1688223Unknown, similar to putative
sigma factor regulator
SEQ ID NO. 892LM-2121.1From 1688248to 1688670Unknown
SEQ ID NO. 893LM-2123.1From 1688670to 1691888Unknown, similar to SNF2-
type helicase
SEQ ID NO. 894LM-2127.1From 1691972to 1695043Unknown, similar to ATP-
dependent dsDNA
exonuclease SbcC
SEQ ID NO. 895LM-2128.1From 1695040to 1696164unknown, similar to putative
exonucleases SbcD
SEQ ID NO. 896LM-2129.1From 1696280to 1696888unknown, similar to 1-
acylglycerol-3-phosphate O-
acyltransferases
SEQ ID NO. 897LM-2130.1From 1696940to 1697302Unknown
SEQ ID NO. 898LM-2131.1From 1697476to 1697991Unknown
SEQ ID NO. 899LM-2133.1From 1698165to 1698653unknown, similar to
hypothetical proteins
SEQ ID NO. 900LM-2134.1From 1698734to 1700539unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 901LM-2135.2From 1700523to 1702292unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 902LM-2138.1From 366508to 367032Unknown
SEQ ID NO. 903LM-214.1From 1177116to 1177865Unknown
SEQ ID NO. 904LM-2140.1From 367533to 367901Unknown
SEQ ID NO. 905LM-2141.1From 367898to 369373Unknown
SEQ ID NO. 906LM-2142.1From 369377to 369757Unknown
SEQ ID NO. 907LM-2143.1From 370032to 370403Unknown, weakly similar to
inorganic pyrophosphatase
SEQ ID NO. 908LM-2145.1From 370732to 371064Unknown
SEQ ID NO. 909LM-2147.1From 371130to 373127Unknown, similar to
transketolase
SEQ ID NO. 910LM-2148.1From 373129to 373785Unknown, similar to
transaldolase
SEQ ID NO. 911LM-215.1From 1176660to 1177091unknown, similar to
Salmonella enterica PduV
protein
SEQ ID NO. 912LM-2150.1From 373832to 374596Unknown, similar to
dehydrogenase/reductase
SEQ ID NO. 913LM.-2151.1From 374621to 375067Unknown, similar to sugar-
phosphate isomerase
SEQ ID NO. 914LM-2152.1From 375074to 375838Unknown, similar to
triosephosphate isomerase
SEQ ID NO. 915LM-2154.1From 375842to 376492Unknown, similar to
dihydroxyacetone kinase
SEQ ID NO. 916LM-2155.1From 376514to 377509Unknown, similar to
dihydroxyacetone kinase
SEQ ID NO. 917LM-2156.1From 377531to 377872Unknown
SEQ ID NO. 918LM-2157.2From 377888to 378319Unknown
SEQ ID NO. 919LM-2158.2From 378404to 378781Unknown, similar to
unknown proteins
SEQ ID NO. 920LM-216.1From 1176304to 1176654Unknown, similar to
Salmonelle enterica PduU
protein
SEQ ID NO. 921LM-2161.2From 72408to 74222Unknown, similar to toxin
components
SEQ ID NO. 922LM-2162.1From 72062to 72394Unknown
SEQ ID NO. 923LM-2163.1From 71360to 72061Unknown
SEQ ID NO. 924LM-2164.1From 71064to 71363Unknown
SEQ ID NO. 925LM-2166.1From 70676to 71071Unknown
SEQ ID NO. 926LM-217.1From 1175602to 1176156Unknown, similar to
Salmonella enterica PduT
protein
SEQ ID NO. 927LM-2170.1From 66160to 70656Unknown, highly similar to
B. subtilis YukA protein
SEQ ID NO. 928LM-2171.1From 64950to 66146Unknown, similar to B. subtilis
YukC protein
SEQ ID NO. 929LM-2172.1From 64677to 64928Unknown, similar to B. subtilis
YukD protein
SEQ ID NO. 930LM-2174.1From 64144to 64659Unknown
SEQ ID NO. 931LM-2178.1From 60948to 64154Unknown, similar to B. subtilis
YueB protein
SEQ ID NO. 932LM-2179.1From 60506to 60799Unknown, similar to a small
heat shock protein of
Clostridium acetobutylicum
SEQ ID NO. 933LM-218.1From 1174241to 1175605unknown, similar to
Salmonella enterica PduS
protein
SEQ ID NO. 934LM-2180.2From 58897to 60189unknown, highly similar to
adenylosuccinate
synthetase
SEQ ID NO. 935LM-2182.2From 1814403to 1815080Unknown, similar to two-
component response
regulator
SEQ ID NO. 936LM-2184.1From 1813457to 1814332Unknown, similar to
unknown proteins
SEQ ID NO. 937LM-2185.1From 1812978to 1813436Unknown
SEQ ID NO. 938LM-2186.1From 1811219to 1812961unknown, highly similar to
adenine deaminases
SEQ ID NO. 939LM-2188.1From 1810157to 1811197Unknown, similar to two-
component sensor histidine
kinase
SEQ ID NO. 940LM-2189.1From 1809118to 1809759Unknown, similar to amino
acid (glutamine) ABC
transporter, permease
protein
SEQ ID NO. 941LM-2191.1From 1808460to 1809107Unknown, similar to amino
acid (glutamine) ABC
transporter (ATP-binding
protein)
SEQ ID NO. 942LM-2193.1From 1807643to 1808458Unknown, similar to amino
acid ABC transporter
(binding protein)
SEQ ID NO. 943LM-2194.1From 1806446to 1807552Unknown, similar to glycerol
dehydrogenase
SEQ ID NO. 944LM-2195.1From 1805925to 1806443Unknown, similar to
unknown proteins
SEQ ID NO. 945LM-2196.1From 1805041to 1805928transcription activator of
glutamate synthase operon
GltC
SEQ ID NO. 946LM-2199.1From 1800256to 1804848Unknown, similar to
glutamate synthase (large
subunit)
SEQ ID NO. 947LM-22.1From 2726008to 2727195unknown, highly similar to
translation elongation factor
EF-Tu
SEQ ID NO. 948LM-220.1From 1173643to 1174122unknown, similar to
uroporphyrin-III C-
methyltransferase
SEQ ID NO. 949LM-2201.1From 1798772to 1800241Unknown, similar to
glutamate synthase (small
subunit)
SEQ ID NO. 950LM-2203.1From 1797897to 1798727Unknown, similar to sugar
transport protein
SEQ ID NO. 951LM-2204.1From 1796987to 1797880Unknown, similar to sugar
transport protein
SEQ ID NO. 952LM-2205.2From 1795681to 1796925Unknown, similar to sugar
binding protein
SEQ ID NO. 953LM-2207.2From 2505425to 2505835Unknown
SEQ ID NO. 954LM-2209.1From 2505858to 2506370Unknown
SEQ ID NO. 955LM-221.1From 1173144to 1173551Unknown
SEQ ID NO. 956LM-2210.1From 2506494to 2506835Unknown
SEQ ID NO. 957LM-2212.1From 2506867to 2507250Unknown
SEQ ID NO. 958LM-2213.1From 2507306to 2508202Unknown, similar to
transcription regulator
SEQ ID NO. 959LM-2214.1From 2508247to 2508816Unknown
SEQ ID NO. 960LM-2215.1From 2508992to 2509411Unknown
SEQ ID NO. 961LM-2216.1From 2510361to 2514293Unknown, similar to
glycosidase
SEQ ID NO. 962LM-2217.1From 2514308to 2515210Unknown, similar to
internalin
SEQ ID NO. 963LM-2219.1From 2515312to 2518587Unknown, similar to
glycosidase
SEQ ID NO. 964LM-2220.1From 2518971to 2519879Unknown, similar to
transcription regulator
SEQ ID NO. 965LM-2221.1From 2519935to 2520399Unknown, conserved
hypothetical protein
SEQ ID NO. 966LM-2222.1From 2520415to 2522796Unknown, similar to
exoribonuclease RNase-R
SEQ ID NO. 967LM-2224.2From 2522830to 2523576Unknown, similar to
carboxylesterase
SEQ ID NO. 968LM-2225.2From 2343056to 2343685Unknown, similar to
phosphoglucomutase
SEQ ID NO. 969LM-2226.1From 2341865to 2343013Unknown, similar to
aspartate aminotransferase
SEQ ID NO. 970LM-2227.1From 2341064to 2341789Unknown, similar to amino
acid ABC transporter (ATP-
binding protein)
SEQ ID NO. 971LM-2228.1From 2339611to 2341071Unknown, similar to amino
acid ABC transporter,
permease protein
SEQ ID NO. 972LM-2229.1From 2338420to 2339418Unknown, similar to low-
affinity inorganic phosphate
transporter
SEQ ID NO. 973LM-223.1From 1172797to 1173036Unknown
SEQ ID NO. 974LM-2230.1From 2337786to 2338406Unknown, similar to
unknown proteins
SEQ ID NO. 975LM-2232.1From 2336515to 2337399Unknown, similar to
oxidoreductase
SEQ ID NO. 976LM-2233.1From 2335862to 2336518Unknown, similar to
unknown proteins
SEQ ID NO. 977LM-2234.1From 2335468to 2335860Unknown, similar to
unknown proteins
SEQ ID NO. 978LM-2235.1From 2334586to 2335455Unknown, similar to putative
ribosomal large subunit
pseudouridine synthase
SEQ ID NO. 979LM-2236.1From 2334003to 2334569Unknown, similar to
methylphosphotriester-DNA
alkyltransferase and
transcriptional regulator
SEQ ID NO. 980LM-2237.1From 2333375to 2333857Unknown, similar to O6-
methylguanine-DNA
methyltransferase
SEQ ID NO. 981LM-2238.1From 2332888to 2333283Unknown, similar to
transcriptional regulators
(GntR family)
SEQ ID NO. 982LM-2239.1From 2332020to 2332898Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 983LM-224.1From 1172076to 1172648Unknown, similar to ATP-
dependent Clp protease
proteolytic component
SEQ ID NO. 984LM-2241.1From 2330959to 2332023Unknown
SEQ ID NO. 985LM-2242.1From 2329731to 2330921Unknown, similar to
transport system permease
protein
SEQ ID NO. 986LM-2244.1From 2328363to 2329571Unknown, similar to
transport system permease
protein
SEQ ID NO. 987LM-2246.1From 2327461to 2328330Unknown, similar to
oxidoreductase
SEQ ID NO. 988LM-2248.1From 2325477to 2327402Unknown, similar to NADH
oxidase
SEQ ID NO. 989LM-225.1From 1171662to 1172009Unknown
SEQ ID NO. 990LM-2250.1From 2324554to 2325408Unknown, similar to
unknown proteins
SEQ ID NO. 991LM-2251.1From 2323375to 2324253Unknown, similar to
transcriptional regulators
(LysR family)
SEQ ID NO. 992LM-2253.1From 2322031to 2323335Unknown, similar to
unknown proteins
SEQ ID NO. 993LM-2254.2From 2320906to 2321775Unknown, similar to
unknown proteins
SEQ ID NO. 994LM-2255.2From 2320416to 2320841Unknown, similar to
arsenate reductase
SEQ ID NO. 995LM-2256.3From 1405578to 1405859Unknown
SEQ ID NO. 996LM-2257.3From 1404714to 1405541Unknown, similar to B. subtilis
SpoIIIJ protein
SEQ ID NO. 997LM-2258.3From 1403227to 1404678two-component sensor
histidine kinase
SEQ ID NO. 998LM-2260.1From 1402550to 1403230two-component response
regulator
SEQ ID NO. 999LM-2261.2From 1400941to 1402359Unknown, similar to 6-
phosphogluconate
dehydrogenase
SEQ ID NO. 1000LM-2262.2From 1399699to 1400796Unknown, similar to
aminotripeptidase
SEQ ID NO. 1001LM-2263.1From 1398077to 1399327Unknown, similar to
branched-chain alpha-keto
acid dehydrogenase E2
subunit (lipoamide
acyltransferase)
SEQ ID NO. 1002LM-2266.1From 1397063to 1398046Unknown, similar to
branched-chain alpha-keto
acid dehydrogenase E1
subunit (2-oxoisovalerate
dehydrogenase beta
subunit)
SEQ ID NO. 1003LM-2267.1From 1396051to 1397046Unknown, similar to
branched-chain alpha-keto
acid dehydrogenase E1
subunit (2-oxoisovalerate
dehydrogenase alpha
subunit)
SEQ ID NO. 1004LM-2269.1From 1394599to 1396026Unknown, similar to
branched-chain alpha-keto
acid dehydrogenase E3
subunit
SEQ ID NO. 1005LM-2270.1From 1393517to 1394584Unknown, similar to
branched-chain fatty-acid
kinase
SEQ ID NO. 1006LM-2271.1From 1392514to 1393380Unknown, similar to
phosphotransbutyrylase
SEQ ID NO. 1007LM-2273.1From 1390694to 1392385DNA repair and genetic
recombination
SEQ ID NO. 1008LM-2274.1From 1390222to 1390671Unknown, similar to arginine
repressor
SEQ ID NO. 1009LM-2275.1From 1389207to 1390031Unknown, conserved
hypothetical protein
SEQ ID NO. 1010LM-2276.1From 1387396to 1389210Unknown, similar to D-1-
deoxyxylulose 5-phosphate
synthase
SEQ ID NO. 1011LM-2278.1From 1385934to 1386815Unknown, similar to
geranyltranstransferase
SEQ ID NO. 1012LM-2280.1From 1384362to 1385714Unknown, similar to
exodeoxyribonuclease VII
(large subunit)
SEQ ID NO. 1013LM-2283.1From 1383489to 1384343unknown, highly similar to
methylenetetrahydrofolate
dehydrogenase and
methenyltetrahydrofolate
cyclohydrolase
SEQ ID NO. 1014LM-2284.1From 1383007to 1383393Unknown, similar to
transcription termination
protein (NusB)
SEQ ID NO. 1015LM-2285.1From 1382569to 1382976unknown, similar to B. subtilis
YqhY protein
SEQ ID NO. 1016LM-2287.3From 1381201to 1382565acetyl-CoA carboxylase
subunit (biotin carboxylase
subunit)
SEQ ID NO. 1017LM-2289.3From 1380720to 1381187Unknown, similar to acetyl-
CoA carboxylase subunit
(biotin carboxyl carrier
subunit)
SEQ ID NO. 1018LM-2290.3From 1380004to 1380561unknown, highly similar to
elongation factor P (EF-P)
SEQ ID NO. 1019LM-2291.2From 1275845to 1276693unknown, similar to B. subtilis
YxkD protein
SEQ ID NO. 1020LM-2292.1From 1274883to 1275542unknown, similar to
regulator of the Fnr CRP
family (including PrfA)
SEQ ID NO. 1021LM-2293.1From 1273473to 1274678unknown; similar to
antibiotic resistance protein
SEQ ID NO. 1022LM-2294.1From 1273231to 1273476Unknown
SEQ ID NO. 1023LM-2295.1From 1272717to 1273193unknown, weakly similar to
8-oxo-dGTPase (mutT)
SEQ ID NO. 1024LM-2296.1From 1272286to 1272549Unknown
SEQ ID NO. 1025LM-2298.1From 1270849to 1272261Unknown, similar to ATP-
dependent RNA helicase
(DEAD motif)
SEQ ID NO. 1026LM-2299.1From 1269834to 1270433unknown weakly similar to
phosphoglycerate mutase 1
SEQ ID NO. 1027LM-230.1From 1168390to 1169541Unknown
SEQ ID NO. 1028LM-2300.1From 1269262to 1269669Unknown
SEQ ID NO. 1029LM-2301.1From 1268515to 1269108unknown, similar to B. subtilis
Ydel protein
SEQ ID NO. 1030LM-2302.1From 1267115to 1268473Unknown
SEQ ID NO. 1031LM-2303.1From 1266040to 1266564unknown, conserved
hypothetical protein, similar
to B. subtilis YsnB protein
SEQ ID NO. 1032LM-2304.1From 1265392to 1266003unknown, conserved
hypothetical protein, similar
to B. subtilis YsnA protein
SEQ ID NO. 1033LM-2305.1From 1264642to 1265388Unknown, similar to
ribonuclease PH
SEQ ID NO. 1034LM-2306.1From 1263829to 1264629Unknown, similar to
glutamate racemase
SEQ ID NO. 1035LM-2307.1From 1263187to 1263681unknown, similar to B. subtilis
YslB protein
SEQ ID NO. 1036LM-2308.1From 1261917to 1263131Unknown, similar to
aspartokinase II alpha
subunit
SEQ ID NO. 1037LM-231.1From 1168021to 1168368unknown, similar to
regulatory proteins
SEQ ID NO. 1038LM-2310.1From 1259917to 1261728unknown, highly similar to
excinuclease ABC
subunit C
SEQ ID NO. 1039LM-2312.1From 1259530to 1259841Thioredoxin
SEQ ID NO. 1040LM-2315.1From 1257092to 1259449unknown, similar to MutS
protein (MutS2)
SEQ ID NO. 1041LM-2317.1From 1255357to 1257069unknown, similar to DNA
polymerase beta, to B. subtilis
YshC protein
SEQ ID NO. 1042LM-2318.3From 1254722to 1255264unknown, similar to B. subtilis
YshB protein
SEQ ID NO. 1043LM-2319.2From 347658to 348428Unknown, similar to
unknown proteins
SEQ ID NO. 1044LM-232.1From 1167645to 1167953unknown, similar to B. subtilis
YjcS protein
SEQ ID NO. 1045LM-2320.1From 347219to 347611Unknown, similar to
unknown proteins
SEQ ID NO. 1046LM-2322.1From 346377to 347036Unknown, similar to
unknown proteins
SEQ ID NO. 1047LM-2324.1From 343221to 344636Unknown, similar to
phospho-beta-glucosidase
SEQ ID NO. 1048LM-2325.1From 342551to 343195Unknown, similar to thiamin-
phosphate
pyrophosphorylase (ThiE)
SEQ ID NO. 1049LM-2327.1From 341751to 342554Unknown, similar to
phosphomethylpyrimidine
kinase (ThiD)
SEQ ID NO. 1050LM-2329.1From 340945to 341754Unknown, similar to
hydroxyethylthiazole kinase
(ThiM)
SEQ ID NO. 1051LM-2332.1From 340278to 340952Unknown, similar to thiamin
biosynthesis protein
SEQ ID NO. 1052LM-2333.1From 339270to 340055Unknown, similar to
unknown protein
SEQ ID NO. 1053LM-2334.1From 338341to 339087Unknown, conserved
hypothetical protein
SEQ ID NO. 1054LM-2335.2From 337161to 338363Unknown, similar to
unknown proteins
SEQ ID NO. 1055LM-2336.3From 336541to 337161Unknown
SEQ ID NO. 1056LM-2338.2From 1026871to 1029045ATP-dependent protease
SEQ ID NO. 1057LM-2339.1From 1026376to 1026855unknown, similar to
methylated-DNA-protein-
cystein methyltransferase
SEQ ID NO. 1058LM-234.1From 1165962to 1167608unknown, similar to ABC
transporters, ATP-binding
proteins (HI0664)
SEQ ID NO. 1059LM-2340.1From 1025159to 1026190unknown, similar to B. subtilis
YkrP protein
SEQ ID NO. 1060LM-2341.1From 1024761to 1025129Unknown
SEQ ID NO. 1061LM-2342.1From 1023346to 1024680unknown, similar to Na+-
transporting ATP synthase
subunit J
SEQ ID NO. 1062LM-2344.1From 1022535to 1023302unknown, conserved
hypothetical protein
SEQ ID NO. 1063LM-2345.1From 1021665to 1022432unknown, conserved
hypothetical protein
SEQ ID NO. 1064LM-2346.1From 1020022to 1021383unknown, conserved
hypothetical protein
SEQ ID NO. 1065LM-2347.1From 1019545to 1019970unknown, similar to
regulatory proteins (MarR
family)
SEQ ID NO. 1066LM-2348.1From 1017870to 1019438unknown, similar to peptide
chain release factor 3
(RF-3)
SEQ ID NO. 1067LM-2349.1From 1016934to 1017788unknown, similar to
Streptococcus agalactiae
CylB protein
SEQ ID NO. 1068LM-235.2From 1164245to 1165960ATP-binding transport
protein
SEQ ID NO. 1069LM-2350.1From 1016030to 1016941unknown, similar to
antibiotic ABC transporter,
ATP-binding protein,
SEQ ID NO. 1070LM-2351.1From 1015605to 1016033Unknown
SEQ ID NO. 1071LM-2352.1From 1015153to 1015608unknown, weakly similar to
two-component response
regulator
SEQ ID NO. 1072LM-2353.1From 1014540to 1015019unknown, similar to
glutathione peroxidase
SEQ ID NO. 1073LM-2355.1From 1013475to 1014527unknown, similar to
glucanase and peptidase
SEQ ID NO. 1074LM-2358.1From 1011860to 1013335unknown, similar to efflux
transporter
SEQ ID NO. 1075LM-2359.1From 1011074to 1011826unknown, similar to ABC
transporter transmembrane
component
SEQ ID NO. 1076LM-236.2From 1163277to 1163996unknown, similar to
transcription regulators
SEQ ID NO. 1077LM-2361.1From 1010313to 1011077unknown, similar to
daunorubicin resistance
ATP-binding proteins
SEQ ID NO. 1078LM-2362.1From 1009098to 1010117unknown; similar to
branched-chain amino acid
aminotransferase
SEQ ID NO. 1079LM-2363.1From 1008130to 1008879unknown, similar to B. subtilis
YjcH protein
SEQ ID NO. 1080LM-2364.1From 1007669to 1008112unknown, similar to B. subtilis
YjcF protein
SEQ ID NO. 1081LM-2365.2From 1006959to 1007633unknown, similar to ribose
5-phosphate isomerase
SEQ ID NO. 1082LM-2366.2From 1937783to 1938415unknown, similar to
hemolysinIII proteins,
putative integral membrane
protein
SEQ ID NO. 1083LM-2368.1From 1938538to 1939155unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1084LM-2369.1From 1939168to 1939980unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1085LM-237.1From 1162666to 1163280Unknown, similar to
unknown proteins
SEQ ID NO. 1086LM-2371.1From 1939999to 1942638unknown, similar to
pyruvate phosphate
dikinase
SEQ ID NO. 1087LM-2372.1From 1942687to 1943064unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1088LM-2373.1From 1943068to 1944051unknown, similar to
conserved hypothetical
proteins, putative integral
membrane protein
SEQ ID NO. 1089LM-2375.1From 1944145to 1944768unknown; similar to alkaline
phosphatase
SEQ ID NO. 1090LM-2378.1From 1944908to 1946419unknown, similar to
phosphoglucomutases
SEQ ID NO. 1091LM-2379.1From 1946488to 1947363unknown, similar to
methyltransferases
SEQ ID NO. 1092LM-238.1From 1161689to 1162696Unknown
SEQ ID NO. 1093LM-2380.1From 1947416to 1947898unknown, similar to
dihydrofolate reductases
SEQ ID NO. 1094LM-2381.1From 1947914to 1948858unknown, similar to
thymidylate synthase
SEQ ID NO. 1095LM-2385.1From 1948871to 1950763unknown, similar to putative
ABC transporters (ATP-
binding protein)
SEQ ID NO. 1096LM-2386.1From 1950883to 1951305Unknown, similar to formyltetrahydrofolate
synthetase
C-terminal part
SEQ ID NO. 1097LM-2387.2From 1951284to 1952564Unknown, similar to formyltetrahydrofolate
synthetase
N-terminal part
SEQ ID NO. 1098LM-2388.2From 1952721to 1953149unknown, similar to
transcriptional regulators
SEQ ID NO. 1099LM-239.1From 1161220to 1161675Unknown
SEQ ID NO. 1100LM-2390.2From 148354to 150009Unknown, similar to ABC
transporter oligopeptide-
binding protein
SEQ ID NO. 1101LM-2391.1From 147884to 148291Unknown
SEQ ID NO. 1102LM-2392.1From 147399to 147764Unknown
SEQ ID NO. 1103LM-2393.1From 146764to 147381
SEQ ID NO. 1104LM-2394.1From 146306to 146656Unknown
SEQ ID NO. 1105LM-2395.1From 145869to 146306Unknown
SEQ ID NO. 1106LM-2397.1From 144397to 144840Unknown
SEQ ID NO. 1107LM-2398.1From 144097to 144273Unknown
SEQ ID NO. 1108LM-24.1From 2727304to 2729391unknown, highly similar to
translation elongation
factor G
SEQ ID NO. 1109LM-240.1From 1160778to 1161230unknown, similar to E. coli
YjaB protein
SEQ ID NO. 1110LM-2400.1From 143366to 143722Unknown
SEQ ID NO. 1111LM-2401.1From 141618to 143006Unknown
SEQ ID NO. 1112LM-2402.1From 141337to 141705Unknown
SEQ ID NO. 1113LM-2404.1From 141052to 141336Unknown
SEQ ID NO. 1114LM-2406.1From 139960to 140853Unknown, similar to
oligopeptide ABC
transporter, permease
protein
SEQ ID NO. 1115LM-2409.1From 138999to 139949Unknown, similar to
oligopeptide ABC
transporter, permease
protein
SEQ ID NO. 1116LM-241.2From 1160220to 1160753Unknown
SEQ ID NO. 1117LM-2411.2From 137323to 138897Unknown, similar to
oligopeptide ABC transport
system substrate-binding
proteins
SEQ ID NO. 1118LM-2412.2From 2164106to 2165377Unknown, weakly similar to
transcription regulators
SEQ ID NO. 1119LM-2414.1From 2166012to 2167343Unknown, similar to
unknown proteins
SEQ ID NO. 1120LM-2415.1From 2167353to 2167943Unknown, similar to
transcription regulators
SEQ ID NO. 1121LM-2416.1From 2168054to 2169097Unknown, similar to lipases
SEQ ID NO. 1122LM-2417.1From 2169312to 2170526Unknown, similar to
argininosuccinate synthase
SEQ ID NO. 1123LM-2418.1From 2170530to 2171900Unknown, similar to
argininosuccinate lyase
SEQ ID NO. 1124LM-242.2From 1159393to 1159830Unknown
SEQ ID NO. 1125LM-2421.1From 2172068to 2173591glycine betaine transporter
BetL
SEQ ID NO. 1126LM-2423.1From 2173836to 2174486Unknown, similar to L-
fuculose-phosphate
aldolase
SEQ ID NO. 1127LM-2424.1From 2174488to 2175420Unknown, similar to 1-
phosphofructokinase
SEQ ID NO. 1128LM-2426.1From 2175438to 2176787Unknown, similar to PTS
system galactitol-specific
enzyme IIC component
SEQ ID NO. 1129LM-2428.1From 2176815to 2177090Unknown, similar to PTS
system galactitol-specific
enzyme IIB component
SEQ ID NO. 1130LM-2429.1From 2177096to 2177563Unknown, similar to PTS
system galactitol-specific
enzyme IIA component
SEQ ID NO. 1131LM-243.1From 1158930to 1159403Unknown
SEQ ID NO. 1132LM-2431.1From 2177571to 2179577Unknown, similar to
transcription antiterminator
SEQ ID NO. 1133LM-2432.2From 2179758to 2181203Unknown, similar to
transcriptional regulator
(GntR family) and to
aminotransferase
(MocR-like)
SEQ ID NO. 1134LM-2433.2From 161420to 162250Unknown
SEQ ID NO. 1135LM-2439.1From 155993to 156805Unknwown, conserved
hypothetical protein
SEQ ID NO. 1136LM-244.1From 1158425to 1158844Unknown
SEQ ID NO. 1137LM-2441.1From 153608to 155947Unknown, similar to ATP
dependent helicase
SEQ ID NO. 1138LM-2442.1From 152648to 153328Unknown
SEQ ID NO. 1139LM-2444.1From 151839to 152645Unknown, similar to high-
affinity zinc ABC transporter
(membrane protein)
SEQ ID NO. 1140LM-2446.1From 151186to 151890Unknown, similar to high-
affinity zinc ABC transporter
(ATP-binding protein)
SEQ ID NO. 1141LM-2447.2From 150232to 151173Unknown, similar to a
probable high-affinity zinc
ABC transporter (Zn(II)-
binding lipoprotein)
SEQ ID NO. 1142LM-2448.2From 2635167to 2637920autolysin, amidase
SEQ ID NO. 1143LM-245.1From 1158020to 1158388Unknown
SEQ ID NO. 1144LM-2450.1From 2637964to 2639562unknown, highly similar to
CTP synthases
SEQ ID NO. 1145LM-2452.1From 2639930to 2640466Unknown, similar to B. subtilis
RNA polymerase
delta subunit
SEQ ID NO. 1146LM-2453.1From 2640832to 2642502arginyl tRNA synthetase
SEQ ID NO. 1147LM-2455.1From 2642499to 2642948Unknown
SEQ ID NO. 1148LM-2456.1From 2643026to 2643679Unknown, conserved
hypothetical protein
SEQ ID NO. 1149LM-2457.1From 2643973to 2645295Unknown, conserved
hypothetical protein
SEQ ID NO. 1150LM-2458.1From 2645310to 2646146Unknown
SEQ ID NO. 1151LM-2459.1From 2646686to 2647009Unknown
SEQ ID NO. 1152LM-246.1From 1157592to 1158008Unknown
SEQ ID NO. 1153LM-2460.1From 2647164to 2648825Unknown, similar to
dipeptide ABC transporter
(dipeptide-binding protein)
SEQ ID NO. 1154LM-2461.1From 2648960to 2649574Unknown
SEQ ID NO. 1155LM-2462.1From 2649598to 2650230Unknown, similar to
nicotinamidase
SEQ ID NO. 1156LM-2463.1From 2650231to 2650755Unknown, similar to Chain
A, Dihydrofolate Reductase
SEQ ID NO. 1157LM-2464.1From 2650758to 2651756Unknown, similar to zinc-
binding dehydrogenase
SEQ ID NO. 1158LM-2465.1From 2651853to 2652125Unknown
SEQ ID NO. 1159LM-2467.2From 2652174to 2653085Unknown, similar to cation
transport protein
SEQ ID NO. 1160LM-2469.1From 331543to 332688Unknown
SEQ ID NO. 1161LM-247.3From 1156384to 1157241unknown, similar to
methylases
SEQ ID NO. 1162LM-2470.1From 331063to 331446Unknown
SEQ ID NO. 1163LM-2473.1From 329923to 330999Unknown, similar to low
specificity L-allo-threonine
aldolase
SEQ ID NO. 1164LM-2474.1From 328701to 329966Unknown
SEQ ID NO. 1165LM-2475.1From 327941to 328495Unknown, putaive secreted,
lysin rich protein
SEQ ID NO. 1166LM-2476.1From 327621to 327905Unknown
SEQ ID NO. 1167LM-2477.1From 327113to 327454Unknown, similar to PTS
beta-glucoside-specific
enzyme IIA component
SEQ ID NO. 1168LM-2478.1From 325729to 327120Unknown, similar to
phospho-beta-glucosidase
and phospho-beta-
galactosidase
SEQ ID NO. 1169LM-2480.1From 325422to 325712Unknown, similar to PTS
beta-glucoside-specific
enzyme IIB component
SEQ ID NO. 1170LM-2482.1From 324104to 325417Unknown, similar to PTS
beta-glucoside-specific
enzyme IIC component
SEQ ID NO. 1171LM-2483.1From 322134to 324005Unknown, similar to
transcriptional
antiterminator (BglG family)
SEQ ID NO. 1172LM-2485.1From 321490to 322128Unknown
SEQ ID NO. 1173LM-2486.1From 320542to 321279Unknown, similar to FMN-
containing NADPH-linked
nitro/flavin reductase
SEQ ID NO. 1174LM-2487.1From 319560to 320423Unknown, similar to
transcription regulator LysR-
gltR family
SEQ ID NO. 1175LM-2488.2From 319049to 319528Unknown, conserved
hypothetical protein, highly
similar to B. subtilis YydA
proteinYyd
SEQ ID NO. 1176LM-2489.2From 2118812to 2119786Unknown, similar to
phospho-N-acetylmuramoyl-
pentapeptide transferase
SEQ ID NO. 1177LM-249.2From 2788504to 2790243unknown, highly similar to
ABC transporter (ATP-
binding protein) required for
expression of
cytochrome BD
SEQ ID NO. 1178LM-2490.1From 2117304to 2118671Unknown, similar to UDP-N-
acetylmuramoylalanine
D-glutamate ligase
SEQ ID NO. 1179LM-2491.1From 2116216to 2117307Unknown, similar to
peptidoglycan synthesis
enzymes, putative
phospho-N-acetylmuramoyl-
pentapeptide-transferase
SEQ ID NO. 1180LM-2493.1From 2115381to 2116193Unknown, similar to cell-
division initiation protein
divIB
SEQ ID NO. 1181LM-2495.1From 2113734to 2115014Unknown, highly similar to
cell-division protein FtsA
SEQ ID NO. 1182LM-2496.1From 2112492to 2113667Unknown, highly similar to
cell-division initiation protein
FtsZ
SEQ ID NO. 1183LM-2497.1From 2111684to 2112373Unknown, similar to
unknown proteins
SEQ ID NO. 1184LM-2499.1From 2111222to 2111680Unknown, similar to
unknown proteins
SEQ ID NO. 1185LM-25.1From 2729457to 2729927Ribosomal protein S7
SEQ ID NO. 1186LM-2500.1From 2110909to 2111199Unknown, similar to
unknown proteins
SEQ ID NO. 1187LM-2501.1From 2110009to 2110785Unknown, similar to
unknown proteins
SEQ ID NO. 1188LM-2506.1From 2104901to 2106001Unknown, similar to
quinolinate synthetase
SEQ ID NO. 1189LM-2507.1From 2104059to 2104904Unknown, similar to
nicotinate-nucleotide
pyrophosphorylase
SEQ ID NO. 1190LM-2508.2From 2102608to 2104062Unknown, similar to L-
aspartate oxidase
SEQ ID NO. 1191LM-2509.2From 2101371to 2102477Unknown, similar to a NifS-
like protein required for
NAD biosynthesis
SEQ ID NO. 1192LM-251.1From 2790243to 2791967Unknown, highly similar to
ABC transporter required for
expression of
cytochrome BD
SEQ ID NO. 1193LM-2510.2From 2034693to 2035517Unknown, similar to
ferrichrome ABC transporter
(ATP-binding protein)
SEQ ID NO. 1194LM-2511.1From 2035507to 2036505Unknown, similar to
oxidoreductases
SEQ ID NO. 1195LM-2512.1From 2036521to 2037141Unknown, similar to
transcription regulators
(TetR family)
SEQ ID NO. 1196LM-2514.2From 2037141to 2037956Unknown, similar to
unknown proteins
SEQ ID NO. 1197LM-2515.2From 2037953to 2038840Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 1198LM-2517.1From 2039441to 2039998Unknown, similar to
unknown proteins
SEQ ID NO. 1199LM-2518.1From 2040274to 2040933Unknown, similar to
unknown proteins
SEQ ID NO. 1200LM-252.1From 2791967to 2792980unknown, highly similar to
cytochrome D ubiquinol
oxidase subunit II
SEQ ID NO. 1201LM-2520.1From 2040911to 2042110Unknown, similar to toxic
ion resistance proteins
SEQ ID NO. 1202LM-2521.1From 2042157to 2042900unknown, similar to
creatinine amidohydrolases
SEQ ID NO. 1203LM-2522.1From 2042913to 2043521Unknown, similar to 2-keto-
3-deoxygluconate-6-
phosphate aldolase
SEQ ID NO. 1204LM-2523.1From 2043540to 2044457Unknown, similar to putative
phosphotriesterase related
proteins
SEQ ID NO. 1205LM-2524.1From 2044481to 2045749Unknown, similar to
unknown proteins
SEQ ID NO. 1206LM-2527.1From 2046046to 2046489Unknown, similar to PTS
system enzyme II A
component
SEQ ID NO. 1207LM-2528.1From 2046507to 2047256Unknown, similar to
transcription regulators,
(GntR family)
SEQ ID NO. 1208LM-2529.1From 2047444to 2048514Unknown, similar to E. coli
DNA-damage-inducibile
protein dinP
SEQ ID NO. 1209LM-2530.1From 2048623to 2049414Unknown, similar to
oxidoreductase
SEQ ID NO. 1210LM-2531.1From 2049419to 2050339Unknown, similar to
unknown proteins
SEQ ID NO. 1211LM-2532.4From 2050639to 2052114Unknown, similar to
glucose-6-phosphate
1-dehydrogenase
SEQ ID NO. 1212LM-2533.2From 1731955to 1732893Unknown, similar to
menaquinone biosynthesis
proteins
SEQ ID NO. 1213LM-2536.1From 1732926to 1734779unknown, similar to
5-methyltetrahydrofolate
homocysteine
methyltransferase (metH)
SEQ ID NO. 1214LM-2538.1From 1734776to 1735948unknown; similar to
cystathionine beta-lyase
SEQ ID NO. 1215LM-2539.1From 1735941to 1737065unknown, similar to
cystathionine gamma-
synthase
SEQ ID NO. 1216LM-254.1From 2792967to 2794373Unknown, highly similar to
cytochrome D ubiquinol
oxidase subunit I
SEQ ID NO. 1217LM-2540.2From 1737087to 1739384unknown, similar to
cobalamin-independent
methionine synthase
SEQ ID NO. 1218LM-2542.2From 2088797to 2091454Unknown, similar to putative
sugar hydrolases
SEQ ID NO. 1219LM-2543.1From 2087489to 2088793Unknown, similar to
unknown proteins
SEQ ID NO. 1220LM-2544.1From 2086817to 2087428unknown, similar to
unknown proteins
SEQ ID NO. 1221LM-2545.1From 2085021to 2086760Unknown, similar to two-
component sensor histidine
kinase
SEQ ID NO. 1222LM-2546.1From 2083537to 2085021Unknown, similar to two-
component response
regulator
SEQ ID NO. 1223LM-2547.1From 2082495to 2083424Unknown, similar to putative
transport system integral
membrane protein
SEQ ID NO. 1224LM-2548.1From 2081505to 2082476Unknown, similar to ABC
transporter, permease
protein
SEQ ID NO. 1225LM-2549.1From 2080027to 2081484Unknown, similar to putative
sugar-binding lipoproteins
SEQ ID NO. 1226LM-255.1From 2794755to 2795225Unknown, conserved
hypothetical proteins
SEQ ID NO. 1227LM-2553.1From 2078132to 2079829Unknown, similar to alpha-
acetolactate synthase
protein, AIsS
SEQ ID NO. 1228LM-2554.3From 2077019to 2078014Unknown, similar to
oxidoreductase
SEQ ID NO. 1229LM-2557.2From 634479to 635279Unknown, similar to
transport proteins
(formate?)
SEQ ID NO. 1230LM-2559.2From 633709to 634251Unknown
SEQ ID NO. 1231LM-2560.1From 632811to 633581Unknown, similar to
unknown membrane
proteins
SEQ ID NO. 1232LM-2561.1From 631045to 632811Unknown, similar to a fusion
of two types of conserved
hypothetical
proteinconserved
hypothetical
SEQ ID NO. 1233LM-2562.1From 630629to 631042Unknown
SEQ ID NO. 1234LM-2563.1From 629088to 630491Unknown, similar to DNA
photolyase
SEQ ID NO. 1235LM-2565.1From 626506to 628971Unknown, putative secreted
protein
SEQ ID NO. 1236LM-2566.1From 625463to 626488Unknown
SEQ ID NO. 1237LM-2567.2From 624682to 625395Unknown, putative secreted
protein
SEQ ID NO. 1238LM-2568.3From 702088to 702954Unknown, conserved
hypothetical proteins
SEQ ID NO. 1239LM-2570.1From 701247to 702062Unknown, highly similar to
phosphomethylpyrimidine
kinase thiD
SEQ ID NO. 1240LM-2571.1From 700876to 701184Unknown, similar to
unknown proteins
SEQ ID NO. 1241LM-2573.1From 700537to 700821unknown, similar to
transposases
SEQ ID NO. 1242LM-2574.1From 699410to 700306Unknown, similar to
transcription regulator
(Rgg type)
SEQ ID NO. 1243LM-2575.1From 698785to 699420Unknown, conserved
hypothetical protein
SEQ ID NO. 1244LM-2577.1From 698317to 698733Unknown
SEQ ID NO. 1245LM-2578.1From 697634to 698197Unknown, conserved
hypothetical protein
SEQ ID NO. 1246LM-2579.1From 696883to 697590Unknown, similar to
phosphoprotein
phosphatases
SEQ ID NO. 1247LM-258.1From 2795420to 2796997Unknown, similar to
acetate-CoA ligase
SEQ ID NO. 1248LM-2581.1From 695496to 696416Unknown
SEQ ID NO. 1249LM-2583.1From 694973to 695509Unknown, similar to
unknown proteins
SEQ ID NO. 1250LM-2584.1From 694317to 694964Unknown, similar to
transcription regulator
SEQ ID NO. 1251LM-2587.1From 691581to 694271Unknown, conserved
membrane protein
SEQ ID NO. 1252LM-2589.1From 690897to 691538Unknown, similar to
transcription regulators
SEQ ID NO. 1253LM-259.1From 2797039to 2797758Unknown, similar to
glucosamine-6-phosphate
isomerase
SEQ ID NO. 1254LM-2590.1From 689923to 690873Unknown, similar to
membrane proteins
SEQ ID NO. 1255LM-2592.1From 689541to 689825Unknown
SEQ ID NO. 1256LM-2593.1From 688619to 689458Unknown, similar to
unknown proteins
SEQ ID NO. 1257LM-2594.2From 687138to 688529Unknown, similar to amino
acid transporter
SEQ ID NO. 1258LM-2597.3From 1113759to 1115630unknown, similar to B. subtilis
minor teichoic acids
biosynthesis protein GgaB
SEQ ID NO. 1259LM-2598.3From 1115647to 1116513Unknown, similar to
glucose-1-phosphate
thymidyl transferase
SEQ ID NO. 1260LM-2599.1From 1116532to 1117092Unknown, similar to dTDP-
sugar epimerase
SEQ ID NO. 1261LM-26.1From 2729958to 2730371ribosomal protein S12
SEQ ID NO. 1262LM-260.1From 2797838to 2798572Unknown, similar to merR-
family transcriptional
regulator
SEQ ID NO. 1263LM-2600.1From 1117093to 1118079Unknown, similar to dTDP-
D-glucose 4,6-dehydratase
SEQ ID NO. 1264LM-2601.1From 1118082to 1118912unknown, similar to DTDP-
L-rhamnose synthetase
SEQ ID NO. 1265LM-2602.1From 1118992to 1121022unknown, similar to teichoic
acid biosynthesis protein B
SEQ ID NO. 1266LM-2603.1From 1121098to 1121808unknown, similar to CDP-
ribitol pyrophosphorylase
SEQ ID NO. 1267LM-2604.1From 1121805to 1122830unknown, similar to glucitol
dehydrogenase
SEQ ID NO. 1268LM-2606.1From 1122918to 1124078unknown, similar to teichoic
acid biosynthesis protein B
precursor
SEQ ID NO. 1269LM-2607.1From 1124079to 1124462unknown, highly similar to
glycerol-3-phosphate
cytidylyltransferase (gct),
CDP-glycerol
pyrophosphorylase (teichoic
acid biosynthesis protein D)
SEQ ID NO. 1270LM-2608.1From 1124484to 1125467unknown, similar to
glycosyltransferases
SEQ ID NO. 1271LM-2609.1From 1125482to 1126495unknown, siumilar to
glysosyltransferases
SEQ ID NO. 1272LM-261.1From 2798626to 2799087Unknown, similar to
unknown proteins
SEQ ID NO. 1273LM-2611.1From 1126704to 1128194unknown, conserved
hypothetical protein, similar
to B. subtilis YueK protein
SEQ ID NO. 1274LM-2613.1From 1128206to 1129030unknown, similar to NH(3)-
dependent NAD(+)
synthetases, nitrogen
regulatory protein
SEQ ID NO. 1275LM-2614.1From 1129043to 1129351Unknown
SEQ ID NO. 1276LM-2615.1From 1129412to 1129816unknown, similar to PTS
system, cellobiose-specific
IIB component (cel A)
SEQ ID NO. 1277LM-2616.1From 1129945to 1131501unknown, highly similar to
GMP synthetase
SEQ ID NO. 1278LM-2618.1From 1131558to 1132760unknown, similar to
integrases
SEQ ID NO. 1279LM-2619.1From 1133698to 1134117unknown, similar to a
protein encoded by Tn916
SEQ ID NO. 1280LM-262.1From 2799108to 2799551Unknown, similar to
unknown proteins
SEQ ID NO. 1281LM-2621.4From 1134472to 1136595cadmium resistance protein
SEQ ID NO. 1282LM-2623.2From 2119833to 2121308Unknown, similar to UDP-N-
acetylmuramoylalanyl-D-
glutamate-2,6-
diaminopimelate ligase
SEQ ID NO. 1283LM-2625.2From 2121483to 2123738Unknown, similar to
penicillin-binding protein 2B
SEQ ID NO. 1284LM-2626.2From 2123735to 2124097Unknown, similar to cell-
division protein FtsL
SEQ ID NO. 1285LM-2628.1From 2124114to 2125052Unknown, similar to
unknown proteins
SEQ ID NO. 1286LM-2629.1From 2125065to 2125496Unknown, similar to
unknown proteins
SEQ ID NO. 1287LM-263.1From 2799609to 2800961Unknown, conserved
hypothetical proteins
SEQ ID NO. 1288LM-2631.2From 2125699to 2126946Unknown, similar to integral
membrane proteins
SEQ ID NO. 1289LM-2632.1From 2127062to 2128876Unknown, similar to
transporter binding proteins
SEQ ID NO. 1290LM-2633.1From 2128806to 2129192Unknown
SEQ ID NO. 1291LM-2635.1From 2129185to 2130078Unknown, weakly similar to
ketopantoate reductase
involved in thiamin
biosynthesis
SEQ ID NO. 1292LM-2637.1From 2130492to 2131028Unknown, similar to
unknown proteins
SEQ ID NO. 1293LM-2638.1From 2131158to 2132330Unknown, similar to
unknown proteins
SEQ ID NO. 1294LM-2639.1From 2132412to 2134652Unknown, similar to
excinuclease ABC
(subunit A)
SEQ ID NO. 1295LM-264.1From 2800958to 2801407Unknown, similar to
transcription regulators
SEQ ID NO. 1296LM-2641.1From 2134695to 2135735Unknown, weakly similar to
proteases
SEQ ID NO. 1297LM-2642.1From 2135754to 2136236Unknown, similar to
phosphopantetheine
adenylyltransferase
SEQ ID NO. 1298LM-2643.1From 2136239to 2136796Unknown, similar to
unknown proteins
SEQ ID NO. 1299LM-2645.1From 2136893to 2137174Unknown, similar to
unknown proteins
SEQ ID NO. 1300LM-2647.1From 2137205to 2137654Unknown, similar to
unknown proteins
SEQ ID NO. 1301LM-2648.1From 2137702to 2138757Unknown, similar to
unknown proteins
SEQ ID NO. 1302LM-265.1From 2801577to 2802458Unknown, similar to
unknown proteins
SEQ ID NO. 1303LM-2650.2From 2138904to 2139809unknown, highly similar to
heme A farnesyltransferase
SEQ ID NO. 1304LM-2651.2From 1994751to 1995434Unknown, similar to
unknown proteins
SEQ ID NO. 1305LM-2652.1From 1995520to 1996116Unknown, similar to
unknown proteins
SEQ ID NO. 1306LM-2653.1From 1996204to 1996749Unknown, similar to
unknown proteins
SEQ ID NO. 1307LM-2654.1From 1996784to 1998037Unknown, similar to
unknown proteins
SEQ ID NO. 1308LM-2657.1From 1998168to 1999454Unknown, similar to 5-
enolpyruvylshikimate-3-
phosphate synthase
SEQ ID NO. 1309LM-2659.1From 1999468to 2000571Unknown, similar to
prephenate dehydrogenase
SEQ ID NO. 1310LM-266.1From 2802543to 2802998Unknown, similar to
transcription regulator MerR
family
SEQ ID NO. 1311LM-2660.1From 2000592to 2001674Unknown, similar to
histidinol-phosphate
aminotransferase and
tyrosine/phenylalanine
aminotransferase
SEQ ID NO. 1312LM-2661.1From 2001674to 2002048Unknown, similar to
chorismate mutase
SEQ ID NO. 1313LM-2662.1From 2002045to 2003142Unknown, similar to
3-dehydroquinate synthase
SEQ ID NO. 1314LM-2664.2From 2003145to 2004311Unknown, similar to
chorismate synthase
SEQ ID NO. 1315LM-267.1From 2802998to 2803402Unknown, similar to
unknown proteins
SEQ ID NO. 1316LM-2675.4From 1748263to 1748709unknown, similar to
transcription regulators (Fur
family), PerR in B. subtilis
SEQ ID NO. 1317LM-2676.2From 1356769to 1357683Unknown, highly similar to
tRNA pseudouridine 55
synthase
SEQ ID NO. 1318LM-2677.1From 1356327to 1356671Unknown, highly similar to
ribosome-binding factor A
SEQ ID NO. 1319LM-2679.1From 1356032to 1356310unknown, conserved
hypothetical protein similar
to B. subtilis YlxP protein
SEQ ID NO. 1320LM-268.1From 2803445to 2804056unknown, similar to
phosphatase
SEQ ID NO. 1321LM-2683.1From 1353696to 1356035Unknown, highly similar to
translation initiation factor
IF-2
SEQ ID NO. 1322LM-2684.1From 1353374to 1353673unknown, conserved
hypothetical protein, similar
to B. subtilis YlxQ protein
SEQ ID NO. 1323LM-2685.1From 1353097to 1353381unknown, similar to B. subtilis
YlxR protein
SEQ ID NO. 1324LM-2687.1From 1351964to 1353082unknown, highly similar to N
utilization substance protein
A (NusA protein)
SEQ ID NO. 1325LM-2689.1From 1351467to 1351934unknown, conserved
hypothetical protein, similar
to B. subtilis YlxS protein
SEQ ID NO. 1326LM-269.1From 2804070to 2804438unknown, similar to
transcription regulator (RpiR
family)
SEQ ID NO. 1327LM-2692.1From 1346951to 1351285unknown, highly similar to
DNA polymerase III (alpha
subunit)
SEQ ID NO. 1328LM-2693.1From 1345140to 1346846prolyl-tRNA synthetase
SEQ ID NO. 1329LM-2696.2From 1343838to 1345100unknown, conserved
hypothetical protein similar
to B. subtilis YluC protein
SEQ ID NO. 1330LM-2698.2From 1342682to 1343824Unknown, similar to
deoxyxylulose 5-phosphate
reductoisomerase
SEQ ID NO. 1331LM-2699.2From 1341879to 1342667unknown, similar to
phosphatidate
cytidylyltransferase (CDP-
diglyceride synthase)
SEQ ID NO. 1332LM-27.1From 2730564to 2731961Unknown, similar to dGTP
triphosphohydrolase
SEQ ID NO. 1333LM-270.1From 2804522to 2805274unknown
SEQ ID NO. 1334LM-2701.3From 1779714to 1780802unknown, similar to putative
outer surface protein
SEQ ID NO. 1335LM-2702.1From 1780824to 1781126unknown, similar to
phosphotransferase system
(PTS) lichenan-specific
enzyme IIA component
SEQ ID NO. 1336LM-2703.1From 1781196to 1781504unknown, similar to
phosphotransferase system
(PTS) lichenan-specific
enzyme IIB component
SEQ ID NO. 1337LM-2705.2From 1781661to 1784339unknown, similar to
transcriptional regulator
(NifA/NtrC family)
SEQ ID NO. 1338LM-2707.2From 1784473to 1785798unknown, similar to ATP-
dependent RNA helicases
SEQ ID NO. 1339LM-2708.1From 1785841to 1786614unknown
SEQ ID NO. 1340LM-2710.1From 1786607to 1787287unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 1341LM-2711.1From 1787280to 1787648unknown, similar to
transcriptional regulator
(GntR family)
SEQ ID NO. 1342LM-2712.1From 1787769to 1788752unknown, similar to
hypothetical proteins
SEQ ID NO. 1343LM-2713.1From 1788812to 1789777unknown, similar to
transcription regulators
(Lacl family)
SEQ ID NO. 1344LM-2714.1From 1789825to 1793085unknown, some similarities
to cellobiose-phosphorylase
SEQ ID NO. 1345LM-2717.2From 1793082to 1795253unknown, similar to beta-
glucosidases
SEQ ID NO. 1346LM-2718.2From 1561978to 1564242Unknown, similar to protein-
export membrane protein
SecDF
SEQ ID NO. 1347LM-272.1From 2805440to 2807398Unknown, similar to PTS
system, fructose-specific
IIABC component
SEQ ID NO. 1348LM-2720.1From 1564343to 1564633Unknown, similar to
unknown proteins
SEQ ID NO. 1349LM-2721.1From 1564776to 1565105Unknown, similar to
unknown proteins
SEQ ID NO. 1350LM-2722.1From 1565139to 1566278Unknown, similar to tRNA-
guanine transglycosylase
Tgt
SEQ ID NO. 1351LM-2723.1From 1566365to 1567393Unknown, similar to S-
adenosylmethionine: tRNA
ribosyltransferase-
isomerase
SEQ ID NO. 1352LM-2725.1From 1567397to 1568404unknown, highly similar to
Holliday junction DNA
helicase RuvB
SEQ ID NO. 1353LM-2726.1From 1568420to 1569025unknown, highly similar to
Holliday junction DNA
helicase (ruvA)
SEQ ID NO. 1354LM-2728.1From 1569149to 1570084Unknown, similar to L-
lactate dehydrogenase
SEQ ID NO. 1355LM-2729.1From 1570154to 1570879Unknown, similar to
unknown proteins
SEQ ID NO. 1356LM-273.1From 2807459to 2810107Unknown, weakly similar to
sugar hydrolase
SEQ ID NO. 1357LM-2730.1From 1570990to 1571838Unknown, similar to
prephenate dehydratase
PheA
SEQ ID NO. 1358LM-2732.1From 1571905to 1573194Unknown, conserved GTP
binding protein
SEQ ID NO. 1359LM-2735.1From 1573354to 1574847Unknown, similar to glycerol
kinase
SEQ ID NO. 1360LM-2736.2From 1574922to 1575740Unknown, similar to glycerol
uptake facilitator
SEQ ID NO. 1361LM-2737.2From 623251to 624423Unknown, conserved
hypothetical membrane
protein
SEQ ID NO. 1362LM-274.1From 2810109to 2811791Unknown, similar to
Sucrose phosphorylase
SEQ ID NO. 1363LM-2741.1From 620805to 623135Unknown, similar to
preprotein translocase SecA
subunit
SEQ ID NO. 1364LM-2743.3From 618932to 620380P60 extracellular protein,
invasion associated protein
lap
SEQ ID NO. 1365LM-2746.1From 617660to 618844Unknown, conserved
hypothetical protein
SEQ ID NO. 1366LM-2747.1From 616973to 617629Unknown, weakly similar to
carboxylesterase
SEQ ID NO. 1367LM-275.1From 2811788to 2812921Unknown, conserved
hypothetical protein
SEQ ID NO. 1368LM-2752.1From 615825to 616577Unknown, putative
conserved membrane
protein
SEQ ID NO. 1369LM-2753.1From 615353to 615811Unknown
SEQ ID NO. 1370LM-2754.1From 613836to 615299Unknown
SEQ ID NO. 1371LM-2755.2From 612678to 613406Unknown, similar to
transcription regulator GntR
family
SEQ ID NO. 1372LM-2758.2From 2343858to 2345150Unknown, similar to
unknown proteins
SEQ ID NO. 1373LM-2759.1From 2345260to 2345526Unknown
SEQ ID NO. 1374LM-276.1From 2812925to 2813881Unknown, similar to
transcriptional regulator
(Lacl family)
SEQ ID NO. 1375LM-2760.1From 2345567to 2346088Unknown, similar to
unknown proteins
SEQ ID NO. 1376LM-2761.1From 2346252to 2346596Unknown, hypothetical CDS
SEQ ID NO. 1377LM-2762.1From 2346935to 2347285unknown, similar to
phosphotransferase system
(PTS) beta-glucoside-
specific enzyme IIA
SEQ ID NO. 1378LM-2763.1From 2347276to 2348151Unknown, similar to
unknown proteins
SEQ ID NO. 1379LM-2764.1From 2348231to 2348539Unknown, similar to
unknown proteins
SEQ ID NO. 1380LM-2765.1From 2348604to 2349356Unknown, similar to
unknown proteins
SEQ ID NO. 1381LM-2766.1From 2349475to 2350320unknown, similar to unknown
proteins
SEQ ID NO. 1382LM-2768.1From 2350398to 2351300Unknown, similar to
unknown proteins
SEQ ID NO. 1383LM-2769.1From 2351384to 2351746Unknown, similar to
unknown proteins
SEQ ID NO. 1384LM-2770.1From 2351767to 2352615Unknown, similar to
unknown proteins
SEQ ID NO. 1385LM-2771.1From 2352616to 2356323Unknown, similar to ATP-
dependent
deoxyribonuclease
(subunit A)
SEQ ID NO. 1386LM-2772.3From 2356325to 2359798Unknown, similar to ATP-
dependent
deoxyribonuclease
(subunit B)
SEQ ID NO. 1387LM-2774.2From 2097176to 2099941isoleucyl-tRNA synthetase
SEQ ID NO. 1388LM-2776.1From 2096075to 2097064Unknown, similar to
diaminopimelate epimerase
SEQ ID NO. 1389LM-2778.1From 2095366to 2096061Unknown, similar to
unknown proteins
SEQ ID NO. 1390LM-278.1From 2813949to 2815283Unknown, conserved
hypothetical protein similar
to hypothetical hemolysin
SEQ ID NO. 1391LM-2780.2From 2091698to 2094808Unknown, similar to alpha-
mannosidase
SEQ ID NO. 1392LM-2781.3From 2504300to 2505157Unknown, similar to
transcription antiterminator
SEQ ID NO. 1393LM-2782.1From 2503931to 2504236Unknown, similar to B. subtilis
YfhL protein
SEQ ID NO. 1394LM-2784.1From 2502401to 2503804unknown, highly similar to
glutamate decarboxylases
SEQ ID NO. 1395LM-2785.1From 2501603to 2502361Unknown, similar to
acetylesterase
SEQ ID NO. 1396LM-2788.1From 2500179to 2501150Unknown, similar to B. subtilis
ferrichrome ABC
transporter fhuD precursor
(ferrichrome-binding
protein)
SEQ ID NO. 1397LM-2790.1From 2499169to 2500179Unknown, similarto B. subtilis
ferrichrome ABC
transporter (permease)
FhuG
SEQ ID NO. 1398LM-2792.1From 2498381to 2499172Unknown, similar to B. subtilis
ferrichrome ABC
transporter (ATP-binding
protein) FhuC
SEQ ID NO. 1399LM-2793.1From 2497069to 2498238Unknown, similar to cell
division proteins RodA,
FtsW
SEQ ID NO. 1400LM-2794.1From 2495818to 2496993Unknown, similar to cell
division proteins RodA,
FtsW
SEQ ID NO. 1401LM-2795.2From 2495309to 2495662Unknown, conserved
hypothetical proteins
SEQ ID NO. 1402LM-2796.1From 1674961to 1676325unknown, highly similar to
anthranilate synthase alpha
subunit
SEQ ID NO. 1403LM-2797.1From 1674359to 1674964unknown, highly similar to
anthranilate synthase beta
subunit
SEQ ID NO. 1404LM-2798.1From 1673368to 1674387unknown, highly similar to
anthranilate
phosphoribosyltransferase
SEQ ID NO. 1405LM-2799.1From 1672613to 1673371unknown, highly similar to
indol-3-glycerol phosphate
synthases
SEQ ID NO. 1406LM-28.1From 2731976to 2732455Unknown, similar to
spermidine/spermine N1-
acetyl transferase
SEQ ID NO. 1407LM-280.1From 2815401to 2816090unknown, similar to
regulatory proteins of the
SIR2 family
SEQ ID NO. 1408LM-2801.1From 1672008to 1672616phosphoribosyl anthranilate
isomerase
SEQ ID NO. 1409LM-2804.1From 1670803to 1672005unknown, highly similar to
tryptophan synthase (beta
subunit)
SEQ ID NO. 1410LM-2805.1From 1670037to 1670810unknown, highly similar to
tryptophan synthase (alpha
subunit)
SEQ ID NO. 1411LM-2806.1From 1669529to 1669966Unknown
SEQ ID NO. 1412LM-2808.1From 1667831to 1669444Unknown, similar to putative
transporters
SEQ ID NO. 1413LM-2809.1From 1666107to 1667720Unknown, similar to putative
transporters
SEQ ID NO. 1414LM-.2810.1From 1665444to 1666097Unknown, similar to
unknown proteins
SEQ ID NO. 1415LM-2811.2From 1664575to 1665405Unknown, similar to
unknown proteins
SEQ ID NO. 1416LM-2812.2From 49585to 4987830S ribosomal protein S6
SEQ ID NO. 1417LM-2813.2From 49934to 50470unknown, highly similar to
single-strand binding protein
(SSB)
SEQ ID NO. 1418LM-2816.1From 50907to 51518Unknown
SEQ ID NO. 1419LM-2817.1From 51775to 52389Unknown, similar to
Staphylococcus AgrB
protein
SEQ ID NO. 1420LM-2818.1From 52630to 53925Unknown, similar to sensor
histidine kinase (AgrC from
Staphylococcus)
SEQ ID NO. 1421LM-2819.1From 53944to 54672Unknown, similar to 2-
components response
regulator protein (AgrA from
Staphylococcus)
SEQ ID NO. 1422LM-282.1From 2816083to 2816373Unknown
SEQ ID NO. 1423LM-2820.1From 54839to 56812Unknown, highly similar to
B. subtilis YybT protein
SEQ ID NO. 1424LM-2822.1From 56815to 5726150S ribosomal protein L9
SEQ ID NO. 1425LM-2823.2From 57286to 58638unknown, highly similar to
replicative DNA helicases
SEQ ID NO. 1426LM-2826.1From 2580470to 2581780Unknown, similar to cell wall
binding proteins
SEQ ID NO. 1427LM-2827.1From 2581900to 2583105peptidoglycan lytic protein
P45
SEQ ID NO. 1428LM-2829.1From 2583181to 2584065unknown, highly similar to
cell-division protein FtsX
SEQ ID NO. 1429LM-283.1From 2816363to 2817571Unknown, similar to drug-
efflux transporters
SEQ ID NO. 1430LM-2831.1From 2584055to 2584741unknown, highly similar to
the cell-division ATP-
binding protein FtsE
SEQ ID NO. 1431LM-2832.1From 2585245to 2586108Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1432LM-2833.1From 2586114to 2587097unknown, highly similar to
peptide chain release
factor 2
SEQ ID NO. 1433LM-2834.2From 2587300to 2589813translocase binding subunit
(ATPase)
SEQ ID NO. 1434LM-2835.2From 1052509to 1053429Unknown
SEQ ID NO. 1435LM-2836.1From 1051695to 1052354unknown, similar to a
bacterial K(+)-uptake
system
SEQ ID NO. 1436LM-2838.1From 1051038to 1051676unknown, similar to two-
component response
regulator, in particular B. subtilis
YvqC protein
SEQ ID NO. 1437LM-2839.1From 1049983to 1051041unknown, similar to two-
component sensor histidine
kinase in particular B. subtilis
YvqE protein
SEQ ID NO. 1438LM-284.1From 2817688to 2818035Unknown
SEQ ID NO. 1439LM-2840.1From 1049273to 1049986unknown, similar to B. subtilis
YvqF protein
SEQ ID NO. 1440LM-2842.1From 1048266to 1049138unknown, similar to B. subtilis
YitL protein
SEQ ID NO. 1441LM-2843.1From 1047490to 1048185unknown, similar to E. coli
copper homeostasis protein
CutC
SEQ ID NO. 1442LM-2845.1From 1046886to 1047377unknown, similar to
phosphotransferase system
glucose-specific enzyme IIA
SEQ ID NO. 1443LM-2846.1From 1045869to 1046771unknown, highly similar to
glycine betaine ABC
transporters (glycine
betaine-binding protein)
SEQ ID NO. 1444LM-2847.1From 1045007to 1045855unknown, highly similar to
glycine betaine ABC
transporters (permease)
SEQ ID NO. 1445LM-285.1From 2818054to 2818698Unknown, similar to
transaldolase
SEQ ID NO. 1446LM-2851.1From 1043821to 1045014unknown, highly similar to
glycine betaine ABC
transporter (ATP-binding
protein)
SEQ ID NO. 1447LM-2853.1From 1042605to 1043450unknown, similar to
conserved hypothetical
proteins like to B. subtilis
YkuT protein
SEQ ID NO. 1448LM-2854.1From 1041446to 1042561unknown, similar to N-acyl
L-amino acid
amidohydrolases
SEQ ID NO. 1449LM-2856.1From 1040672to 1041382unknown, similar to
tetrahydrodipicolinate
succinylase
SEQ ID NO. 1450LM-2857.1From 1039746to 1040624unknown, similar to
transcription regulator (LysR
family).
SEQ ID NO. 1451LM-2858.1From 1039297to 1039749unknown, similar to B. subtilis
YkuL protein
SEQ ID NO. 1452LM-2859.2From 1038863to 1039096unknown, similar to B. subtilis
YkuJ protein
SEQ ID NO. 1453LM-286.1From 2818841to 2819530Unknown, weakly similar to
transcription regulators
CRP/FNR family
SEQ ID NO. 1454LM-2860.1From 600837to 601148Unknown, similar to
phosphorybosil-AMP-
cyclohydrolase (HisI2
protein)
SEQ ID NO. 1455LM-2861.1From 601149to 601466Unknown, similar to
phosphoribosyl-AMP
cyclohydrolase (HisI1
protein)
SEQ ID NO. 1456LM-2863.1From 601463to 602218unknown, highly similar to
cyclase HisF
SEQ ID NO. 1457LM-2864.1From 602208to 602930unknown, highly similar to
phosphoribosylformimino-5-
aminoimidazole
carboxamide ribotide
isomerase
SEQ ID NO. 1458LM-2866.1From 602909to 603535unknown, similar to
amidotransferases
SEQ ID NO. 1459LM-2867.1From 603536to 604120Imidazoleglycerol-
phosphate dehydratase
SEQ ID NO. 1460LM-2868.1From 604121to 605404unknown, highly similar to
histidinol dehydrogenases
SEQ ID NO. 1461LM-287.1From 2819860to 2821587Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 1462LM-2870.1From 605401to 606042Unknown, similar to ATP
phosphoribosyltransferase
SEQ ID NO. 1463LM-2871.2From 606039to 607220histidyl-tRNA synthetase
SEQ ID NO. 1464LM-2872.2From 1645958to 1646413Unknown, similar to
unknown proteins
SEQ ID NO. 1465LM-2873.3From 1646669to 1647781Unknown, similar to
aminopeptidase
SEQ ID NO. 1466LM-2875.1From 1647821to 1648366Unknown, similar to 2-cys
peroxiredoxin
SEQ ID NO. 1467LM-2877.1From 1648511to 1649854unknown, similar to UDP-N-
acetyl muramate-alanine
ligases
SEQ ID NO. 1468LM-2878.2From 1650149to 1652500Unknown, similar to DNA
translocase
SEQ ID NO. 1469LM-288.1From 2821641to 2821892Unknown
SEQ ID NO. 1470LM-2880.1From 1652810to 1653427Unknown, similar
phenylalanyl-tRNA
synthetase (beta subunit)
SEQ ID NO. 1471LM-2881.1From 1653433to 1654233Unknown, similar to
unknown proteins
SEQ ID NO. 1472LM-2882.1From 1654268to 1654579Unknown, similar to
thioredoxin
SEQ ID NO. 1473LM-2883.1From 1654902to 1655975Unknown, similar to
aminopeptidase
SEQ ID NO. 1474LM-2884.1From 1656173to 1656484Unknown, similar to
unknown proteins
SEQ ID NO. 1475LM-2885.1From 1656521to 1656769Unknown, similar to
unknown proteins
SEQ ID NO. 1476LM-2886.1From 1656903to 1657754Unknown, similar to
unknown proteins
SEQ ID NO. 1477LM-2887.2From 1657815to 1658459Unknown, similar to
unknown proteins
SEQ ID NO. 1478LM-2889.2From 1658466to 1659248Unknown, similar to
unknown proteins
SEQ ID NO. 1479LM-289.1From 2821912to 2823195seryl-trna synthetase
SEQ ID NO. 1480LM-2890.2From 1307009to 1307605unknown, conserved
hypothetical protein, similar
to B. subtilis YneS protein
SEQ ID NO. 1481LM-2892.1From 1306095to 1306967Unknown, similar to
Lactococcus lactis LacX
protein
SEQ ID NO. 1482LM-2894.1From 1305781to 1306092Unknown, similar to B. subtilis
YneQ protein
SEQ ID NO. 1483LM-2895.1From 1305390to 1305758Unknown, similar to B. subtilis
YneP protein
SEQ ID NO. 1484LM-2898.1From 1304457to 1305236unknown, highly similar to
B. subtilis CodY protein
SEQ ID NO. 1485LM-29.1From 2732589to 2733233Unknown, similar to
ribulose-phosphate 3-
epimerase
SEQ ID NO. 1486LM-290.1From 2823530to 2823949Unknown, similar to B. subtilis
stress protein YdaG
SEQ ID NO. 1487LM-2900.1From 1303027to 1304436unknown, highly similar to
ATP-dependent Clp
protease-like proteins
SEQ ID NO. 1488LM-2901.1From 1302474to 1303013unknown, highly similar to
beta-type subunit of the 20S
proteasome
SEQ ID NO. 1489LM-2902.2From 1301551to 1302453unknown, similar to
integrase/recombinase
SEQ ID NO. 1490LM-2903.2From 1299964to 1301268unknown, similar glucose
inhibited division protein A
SEQ ID NO. 1491LM-2907.2From 1297823to 1299901unknown, highly similar to
DNA topoisomerase I TopA
SEQ ID NO. 1492LM-2908.2From 1296691to 1297551Unknown, similar to
polypeptide deformylase,
similar to B. subtilis Smf
protein
SEQ ID NO. 1493LM-2909.1From 1295772to 1296557unknown, similar to
ribonuclease H rnh
SEQ ID NO. 1494LM-291.1From 2824079to 2824651Unknown, similar to
glutamine amidotransferase
SEQ ID NO. 1495LM-2910.1From 1294912to 1295775unknown, conserved
hypothetical protein similar
to B. subtilis YlqF protein
SEQ ID NO. 1496LM-2911.1From 1294360to 1294902Unknown, similar to signal
peptidase I
SEQ ID NO. 1497LM-2912.2From 1293689to 1294258Unknown, similar to signal
peptidase I
SEQ ID NO. 1498LM-2914.2From 1498096to 1500252Unknown, similar to
unknown proteins
SEQ ID NO. 1499LM-2915.1From 1500268to 1501227Unknown, similar to
phosphate starvation
induced protein PhoH
SEQ ID NO. 1500LM-2916.1From 1501415to 1501861Unknown, similar to
unknown proteins
SEQ ID NO. 1501LM-2917.1From 1502252to 1503019Unknown, similar to
unknown proteins
SEQ ID NO. 1502LM-2918.1From 1503020to 1503964Unknown, similar to
ribosomal protein L11
methyltransferase
SEQ ID NO. 1503LM-292.1From 2824648to 2826354Unknown, similar to para-
aminobenzoate synthase
component I
SEQ ID NO. 1504LM-2921.1From 1504037to 1505170heat shock protein DnaJ
SEQ ID NO. 1505LM-2922.1From 1505312to 1507153class I heat-shock protein
(molecular chaperone)
DnaK
SEQ ID NO. 1506LM-2924.1From 1507187to 1507762heat shock protein GrpE
SEQ ID NO. 1507LM-2925.1From 1507804to 1508841transcription repressor of
class I heat-shock gene
HrcA
SEQ ID NO. 1508LM-2926.1From 1508988to 1510145unknown, highly similar to
coproporphyrinogen III
oxidase
SEQ ID NO. 1509LM-2928.1From 1510234to 1511262Unknown, similar to
oxidoreductase
SEQ ID NO. 1510LM-2929.1From 1511375to 1511812Unknown, similar to
transcriptional regulator
(MerR family)
SEQ ID NO. 1511LM-2931.3From 1511858to 1513684unknown, highly similar to
GTP-binding protein LepA
SEQ ID NO. 1512LM-2932.2From 2535342to 2536268Unknown, similar to
dipeptidases
SEQ ID NO. 1513LM-2933.1From 2533840to 2535183RNA polymerase sigma-54
factor (sigma-L)
SEQ ID NO. 1514LM-2935.1From 2532352to 2533398Unknown, similar to B. subtilis
CggR hypothetical
transcriptional regulator
SEQ ID NO. 1515LM-2936.1From 2531310to 2532320unknown, highly similar to
glyceraldehyde 3-phosphate
dehydrogenase
SEQ ID NO. 1516LM-2937.1From 2529985to 2531175unknown, highly similar to
phosphoglycerate kinase
SEQ ID NO. 1517LM-2939.1From 2529184to 2529939unknown, highly similar to
triose phosphate isomerase
SEQ ID NO. 1518LM-294.1From 2826515to 2828236Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 1519LM-2942.1From 2527650to 2529182unknown, highly similar to
phosphoglycerate mutase
SEQ ID NO. 1520LM-2944.2From 2526222to 2527514unknown, highly similar to
enolase
SEQ ID NO. 1521LM-2945.1From 2525946to 2526119Unknown
SEQ ID NO. 1522LM-2946.2From 2525094to 2525813Unknown, similar to lipolytic
enzyme
SEQ ID NO. 1523LM-2950.2From 316873to 318375Unknown, similar to heat-
shock protein htrA serine
protease
SEQ ID NO. 1524LM-2952.1From 315945to 316775Unknown, conserved
hypothetical protein similar
to B. subtilis YycJ protein
SEQ ID NO. 1525LM-2953.1From 314984to 315823Unknown, similar to B. subtilis
Yycl protein
SEQ ID NO. 1526LM-2954.1From 313659to 314981Unknown, similar to B. subtilis
YycH protein
SEQ ID NO. 1527LM-2957.1From 311830to 313662Unknown, similar to two-
component sensor histidine
kinase
SEQ ID NO. 1528LM-2958.1From 310932to 311645Unknown, similar to two-
component response
regulator
SEQ ID NO. 1529LM-2959.2From 309538to 310719Unknown, similar to
aminotransferase
SEQ ID NO. 1530LM-296.1From 2828236to 2830008Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 1531LM-2960.3From 2692097to 2693041unknown, highly similar to
RNA polymerase (alpha
subunit)
SEQ ID NO. 1532LM-2962.2From 2693151to 2693540ribosomal protein S11
SEQ ID NO. 1533LM-2964.2From 2693563to 2693928ribosomal protein S13
SEQ ID NO. 1534LM-2965.1From 2694164to 2694382unknown, highly similar to
initiation factor IF-I
SEQ ID NO. 1535LM-2967.1From 2694769to 2695416unknown, highly similar to
adenylate kinases
SEQ ID NO. 1536LM-2968.1From 2695476to 2696771unknown, highly similar to
preprotein translocase
subunit
SEQ ID NO. 1537LM-297.1From 2830284to 2830523Unknown
SEQ ID NO. 1538LM-2970.1From 2696771to 2697211ribosomal protein L15
SEQ ID NO. 1539LM-2972.1From 2697463to 2697966ribosomal protein S5
SEQ ID NO. 1540LM-2973.1From 2697988to 2698347ribosomal protein L18
SEQ ID NO. 1541LM-2974.1From 2698387to 2698923ribosomal protein L6
SEQ ID NO. 1542LM-2975.1From 2698954to 2699352ribosomal protein S8
SEQ ID NO. 1543LM-2977.1From 2699600to 2700139ribosomal protein L5
SEQ ID NO. 1544LM-2979.1From 2700166to 2700477ribosomal protein L24
SEQ ID NO. 1545LM-298.1From 2830542to 2831879Unknown, similar to D-
alanyl-D-alanine
carboxypeptidase
(penicillin-binding protein 5)
SEQ ID NO. 1546LM-2981.1From 2700515to 2700883ribosomal protein L14
SEQ ID NO. 1547LM-2983.1From 2701435to 2701869ribosomal protein L16
SEQ ID NO. 1548LM-2984.1From 2701872to 2702528ribosomal protein S3
SEQ ID NO. 1549LM-2985.2From 2702532to 2702888ribosomal protein L22
SEQ ID NO. 1550LM-2986.2From 2702909to 2703187ribosomal protein S19
SEQ ID NO. 1551LM-2990.2From 1593100to 1594407unknown, highly similar to
glutamyl-tRNA reductase
SEQ ID NO. 1552LM-2991.1From 1592167to 1593096unknown, highly similar to
porphobilinogen
deaminases
(hydroxymethylbilane
synthase)
SEQ ID NO. 1553LM-2992.1From 1591448to 1592170Unknown, similar to
uroporphyrinogen III
cosynthase (HemD)
SEQ ID NO. 1554LM-2993.1From 1590477to 1591451unknown, highly similar to
delta-aminolevulinic acid
dehydratases
(porphobilinogen synthase)
SEQ ID NO. 1555LM-2995.1From 1589175to 1590464unknown, highly similar to
glutamate-1-semialdehyde
2,1-aminotransferases
SEQ ID NO. 1556LM-2997.1From 1586201to 1588852valyl-tRNA synthetase
SEQ ID NO. 1557LM-2999.2From 1584850to 1586139unknown, similar to Folyl-
polyglutamate synthetase
SEQ ID NO. 1558LM-3.1From 2712931to 2713365Unknown
SEQ ID NO. 1559LM-3000.1From 1583928to 1584638Unknown, similar to B. subtilis
late competence
protein ComC (type IV
prepilin peptidase)
SEQ ID NO. 1560LM-3001.1From 1583242to 1583916Unknown, similar to DNA
repair protein RadC
SEQ ID NO. 1561LM-3004.1From 1581777to 1582790unknown, similar to cell-
shape determining protein
MreB
SEQ ID NO. 1562LM-3005.1From 1580804to 1581691unknown, similar to cell-
shape determining protein
MreC
SEQ ID NO. 1563LM-3006.1From 1580283to 1580801unknown, similar to cell-
shape determining protein
MreD
SEQ ID NO. 1564LM-3007.2From 1579428to 1580105unknown, similar to cell-
division inhibition (septum
placement) protein MinC
SEQ ID NO. 1565LM-301.1From 2832058to 2833725Unknown, similar to acylase
and diesterase
SEQ ID NO. 1566LM-3010.1From 1149406to 1149720unknown, highly similar to
TN916 ORF23
SEQ ID NO. 1567LM-3011.1From 1149011to 1149385unknown, highly similar to
TN916 ORF22
SEQ ID NO. 1568LM-3012.1From 1147603to 1149003unknown, highly similar to
TN916 ORF21
SEQ ID NO. 1569LM-3013.1From 1146228to 1147412unknown, highly similar to
TN916 ORF20
SEQ ID NO. 1570LM-3014.1From 1145941to 1146231unknown, similar to
unknown proteins
SEQ ID NO. 1571LM-3016.1From 1145211to 1145711unknown, highly similar to
TN916 ORF18
SEQ ID NO. 1572LM-3017.1From 1144533to 1144928unknown, highly similar to
TN916 ORF17
SEQ ID NO. 1573LM-3018.1From 1142099to 1144549unknown, highly similar to
TN916 ORF16
SEQ ID NO. 1574LM-3020.1From 1139940to 1142099unknown, highly similar to
TN916 ORF15
SEQ ID NO. 1575LM-3022.1From 1138930to 1139940unknown, highly similar to
TN916 ORF14 and to L. monocytogenes
P60 protein
SEQ ID NO. 1576LM-3023.1From 1137997to 1138914unknown, highly similar to
TN916 ORF13
SEQ ID NO. 1577LM-3024.1From 1137533to 1137868unknown, similar to
cadmium efflux system
accessory proteins
SEQ ID NO. 1578LM-3025.3From 1254459to 1254722unknown, similar to B. subtilis
YshA protein
SEQ ID NO. 1579LM-3027.3From 1253379to 1254311unknown, similar to B. subtilis
ribonuclease HIII
SEQ ID NO. 1580LM-3028.1From 1252667to 1253341Unknown, similar to uracil-
DNA glycosylase
SEQ ID NO. 1581LM-3029.1From 1249360to 1252560unknown; similar to
transporter, (to B. subtilis
YdgH protein)
SEQ ID NO. 1582LM-3030.1From 1248892to 1249344unknown; similar to
transcriptional regulator
(MarR family).
SEQ ID NO. 1583LM-3031.2From 1245384to 1248794unknown, similar to different
proteins
SEQ ID NO. 1584LM-3032.2From 1244669to 1245370unknown, similar to ABC
transporter, ATP-binding
proteins
SEQ ID NO. 1585LM-3035.2From 1608166to 1609923unknown, highly similar to
pyruvate kinases
SEQ ID NO. 1586LM-3036.1From 1607671to 1608048Unknown, similar to
unknown proteins
SEQ ID NO. 1587LM-3037.1From 1607053to 1607514Unknown, similar to
unknown proteins
SEQ ID NO. 1588LM-3038.1From 1605895to 1607016unknown, highly similar to
citrate synthase subunit II
SEQ ID NO. 1589LM-3039.1From 1604613to 1605875unknown, highly similar to
isocitrate dehyrogenases
SEQ ID NO. 1590LM-304.1From 2833834to 2835987Unknown, similar to DNA
topoisomerase III
SEQ ID NO. 1591LM-3041.1From 1601820to 1604447DNA polymerase I
SEQ ID NO. 1592LM-3042.1From 1600975to 1601796unknown, highly similar to
formamidopyrimidine-DNA
glycosylases
SEQ ID NO. 1593LM-3043.2From 1600356to 1600958Unknown, similar to
unknown proteins
SEQ ID NO. 1594LM-3044.2From 1875574to 1876560Unknown, similar to FtsY of
E. coli and SRP receptor
alpha-subunit
SEQ ID NO. 1595LM-3049.1From 1876575to 1880135Unknown, similar to Smc
protein essential for
chromosome condensation
and partition
SEQ ID NO. 1596LM-3050.1From 1880158to 1880847Unknown, similar to
ribonuclease III
SEQ ID NO. 1597LM-3053.1From 1881381to 1882124Unknown, similar to 3-
ketoacyl-acyl carrier protein
reductase
SEQ ID NO. 1598LM-3054.1From 1882128to 1883069Unknown, similar to malonyl
CoA-acyl carrier protein
transacylase
SEQ ID NO. 1599LM-3056.1From 1883062to 1884075Unknown, similar to plsX
protein involved in fatty
acid/phospholipid synthesis
SEQ ID NO. 1600LM-3058.2From 1884096to 1884665Unknown, similar to
unknown proteins
SEQ ID NO. 1601LM-3059.2From 817914to 818762Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1602LM-3062.1From 813451to 817878Unknown
SEQ ID NO. 1603LM-3065.3From 811813to 813189Unknown, similar to amino
acid transporter
SEQ ID NO. 1604LM-3066.2From 2159224to 2159604Unknown
SEQ ID NO. 1605LM-3067.1From 2157949to 2159094Unknown
SEQ ID NO. 1606LM-3068.1From 2157383to 2157844Unknown, similar to
unknown proteins
SEQ ID NO. 1607LM-3069.1From 2156694to 2157386Unknown, similar to
glycoprotease
SEQ ID NO. 1608LM-307.1From 2836011to 2837783Unknown, similar to ATP-
dependent DNA helicases
SEQ ID NO. 1609LM-3070.1From 2156242to 2156697Unknown, similar to
ribosomal protein alanine
acetyltransferase
SEQ ID NO. 1610LM-3072.1From 2155223to 2156245Unknown, similar to
glycoprotein endopeptidase
SEQ ID NO. 1611LM-3073.1From 2153720to 2154679Unknown, similar to
unknown proteins
SEQ ID NO. 1612LM-3074.1From 2151744to 2153696Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 1613LM-3075.1From 2150786to 2151433Unknown, similar to a
putative DNA binding
proteins
SEQ ID NO. 1614LM-3076.2From 2149868to 2150554Unknown, similar to
unknown proteins
SEQ ID NO. 1615LM-3077.2From 992978to 993886unknown, similar to
proteases
SEQ ID NO. 1616LM-3080.1From 993900to 995126unknown, similar to
proteases
SEQ ID NO. 1617LM-3081.1From 995210to 995767unknown, Listeria epitope
LemA
SEQ ID NO. 1618LM-3083.1From 995790to 996704unknown, similar to putative
heat shock protein HtpX,
Listeria epitope LemB
SEQ ID NO. 1619LM-3084.1From 996739to 997557unknown, similar to B. subtilis
YjbH protein
SEQ ID NO. 1620LM-3085.1From 997795to 998379unknown, similar to B. subtilis
YjbK protein
SEQ ID NO. 1621LM-3086.2From 998396to 998863Unknown
SEQ ID NO. 1622LM-3087.2From 999022to 999690unknown, similar to B. subtilis
YjbM protein
SEQ ID NO. 1623LM-3088.1From 999722to 1000516unknown, similar to
conserved hypothetical
proteins like to B. subtilis
YjbN protein
SEQ ID NO. 1624LM-3089.1From 1000535to 1001425unknown, similar to
ribosomal large subunit
pseudouridine synthetase
SEQ ID NO. 1625LM-309.2From 2837944to 2839410Unknown, similar to inosine-
monophosphate
dehydrogenase
SEQ ID NO. 1626LM-3090.1From 1001503to 1002291unknown, similar to enoyl-
acyl-carrier protein
reductase
SEQ ID NO. 1627LM-3091.1From 1002319to 1003593DltD protein for D-alanine
esterification of lipoteichoic
acid and wall teichoic acid
SEQ ID NO. 1628LM-3092.1From 1003855to 1005039DltB protein for D-alanine
esterification of lipoteichoic
acid and wall teichoic acid
SEQ ID NO. 1629LM-3093.2From 1005036to 1006568D-alanine-activating
enzyme (dae), D-alanine-D-
alanyl carrier protein ligase
(dcl)
SEQ ID NO. 1630LM-3095.2From 1364031to 1364570unknown, similar to 5-
formyltetrahydrofolate cycloligase
SEQ ID NO. 1631LM-3097.1From 1364669to 1366207unknown similar to B. subtilis
yqgP
SEQ ID NO. 1632LM-3098.1From 1366457to 1367425Unknown, similar to glucose
kinase
SEQ ID NO. 1633LM-3099.1From 1367559to 1368650unknown, similar to B. subtilis
YqgU protein
SEQ ID NO. 1634LM-3100.1From 1368668to 1368985Unknown, weakly similar to
B. subtilis comG operon
protein 7 (comGG)
SEQ ID NO. 1635LM-3101.1From 1368982to 1369449Unknown, similar to B. subtilis
comG operon
protein 6
SEQ ID NO. 1636LM-3102.1From 1369412to 1369696Unknown, similar to comG
operon protein 5 (comGE)
SEQ ID NO. 1637LM-3103.1From 1369683to 1370111Unknown, similar to comG
operon protein 4 (comGD)
SEQ ID NO. 1638LM-3104.1From 1370108to 1370431Unknown, similar to B. subtilis
comG operon
protein 3
SEQ ID NO. 1639LM-3105.1From 1370445to 1371476Unknown, similar to B. subtilis
comG operon
protein 2
SEQ ID NO. 1640LM-3106.1From 1371454to 1372476Unknown, similar to B. subtilis
comG operon
protein 1
SEQ ID NO. 1641LM-3107.1From 1373015to 1374103Unknown, similar to
aminomethyltransferase
SEQ ID NO. 1642LM-3110.3From 1374119to 1375465Unknown, similar to glycine
dehydrogenase
(decarboxylating) subunit 1
SEQ ID NO. 1643LM-3111.1From 1375462to 1376928Unknown, similar to glycine
dehydrogenase
(decarboxylating) subunit 2
SEQ ID NO. 1644LM-3112.1From 1376968to 1377348Unknown
SEQ ID NO. 1645LM-3113.1From 1377410to 1377682Unknown
SEQ ID NO. 1646LM-3114.2From 1377695to 1378660Unknown, similar to B. subtilis
YqhQ protein
SEQ ID NO. 1647LM-3115.3From 1962698to 1963243unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1648LM-3116.2From 1962225to 1962566unknown, similar to
hypothetical proteins
SEQ ID NO. 1649LM-3119.1From 1960496to 1961644unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1650LM-312.1From 130652to 131380Unknown, similar to
autolysin: N-
acetylmuramoyl-L-alanine
amidase
SEQ ID NO. 1651LM-3120.1From 1958968to 1960476unknown, similar to
probable thermostable
carboxypeptidases
SEQ ID NO. 1652LM-3121.1From 1958162to 1958740unknown, similar to
xanthine
phosphoribosyltransferase
SEQ ID NO. 1653LM-3124.1From 1956848to 1958155unknown, similar to
probable permeases
SEQ ID NO. 1654LM-3126.1From 1955598to 1956656unknown, similar to
chitinases
SEQ ID NO. 1655LM-3127.2From 1955205to 1955474unknown, similar to
ribosomal protein S14
SEQ ID NO. 1656LM-3128.3From 1954255to 1955127unknown, similar to 5′-3′-
exonuclease
SEQ ID NO. 1657LM-3129.1From 248555to 249013Unknown, highly similar to
transcription repressor of
class III stress genes (CtsR)
SEQ ID NO. 1658LM-313.1From 130249to 130671Unknown, similar to a
protein from Bacteriophage
phi-105 (ORF 45)
SEQ ID NO. 1659LM-3130.1From 249026to 249544Unknown, similar to B. subtilis
YacH protein
SEQ ID NO. 1660LM-3131.1From 249541to 250563Unknown, similar to arginine
kinase
SEQ ID NO. 1661LM-3135.1From 250592to 253054endopeptidase Clp ATP-
binding chain C
SEQ ID NO. 1662LM-3136.2From 253200to 254573unknown, similar to DNA
repair protein Sms
SEQ ID NO. 1663LM-3139.2From 254707to 255780Unknown, highly similar to
B. subtilis YacL protein
SEQ ID NO. 1664LM-314.1From 129694to 130230Unknown, weakly similar to
protein gp20 from
Bacteriophage A118
SEQ ID NO. 1665LM-3140.2From 255800to 256498Unknown, similar to
nucleotidylyl transferase;
pyrophosphorylase
SEQ ID NO. 1666LM-3141.2From 348586to 349065Unknown
SEQ ID NO. 1667LM-3142.2From 349326to 350228Unknown, similar to
transcriptional regulators
SEQ ID NO. 1668LM-3143.1From 350392to 351264Unknown, similar to
transcriptional regulators
SEQ ID NO. 1669LM-3147.1From 355643to 356323Unknown
SEQ ID NO. 1670LM-3149.2From 2602249to 2602683Unknown, similar to
hydroxymyristoyl-(acyl
carrier protein) dehydratase
SEQ ID NO. 1671LM-315.1From 129218to 129697Unknown
SEQ ID NO. 1672LM-3150.1From 2601632to 2601997Unknown, similar to single-
strand DNA-binding protein
SEQ ID NO. 1673LM-3151.1From 2600408to 2601241Unknown, similar to
hypothetical cell wall
binding protein from B. subtilis
SEQ ID NO. 1674LM-3152.1From 2599548to 2600282Unknown, similar to B. subtilis
TagA protein
involved in polyglycerol
phosphate biosynthesis
SEQ ID NO. 1675LM-3153.1From 2598423to 2599547Unknown, similar to B. subtilis
O-succinylbenzoate-
CoA synthase (MenC)
SEQ ID NO. 1676LM-3154.1From 2597363to 2598415Unknown, similar to B. subtilis
TagO teichoic acid
linkage unit synthesis
protein
SEQ ID NO. 1677LM-3155.1From 2596266to 2597324Unknown, similar to B. subtilis
putative
transcriptional regulator
LytR
SEQ ID NO. 1678LM-3156.1From 2595485to 2596090Unknown
SEQ ID NO. 1679LM-3157.1From 2594908to 2595543Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1680LM-3158.1From 2593837to 2594523Unknown, similar to B. subtilis
two-component
response regulator DegU
SEQ ID NO. 1681LM-3159.1From 2592965to 2593816Unknown, similar to B. subtilis
YviA (DegV) protein
SEQ ID NO. 1682LM-316.1From 128628to 129203Unknown
SEQ ID NO. 1683LM-3162.1From 2591451to 2592770unknown, similar to late
competence protein comFA
SEQ ID NO. 1684LM-3163.2From 2590802to 2591458unknown, similar to late
competence protein comFC
SEQ ID NO. 1685LM-3164.2From 2590044to 2590607Unknown, similar to
conserved hypothetical
proteins like to B. subtilis
YvyD protein
SEQ ID NO. 1686LM-3167.1From 2020738to 2021256Unknown, similar to similar
to acyl-CoA hydrolase
SEQ ID NO. 1687LM-3168.1From 2021272to 2023062Unknown, similar to two-
component sensor histidine
kinase (ResE)
SEQ ID NO. 1688LM-3169.1From 2023163to 2023879Unknown, similar to two-
component response
regulator (ResD)
SEQ ID NO. 1689LM-317.1From 128314to 128613Unknown
SEQ ID NO. 1690LM-3171.1From 2024062to 2024796Unknown, similar to
unknown proteins
SEQ ID NO. 1691LM-3172.1From 2024799to 2025395Unknown, similar to
unknown proteins
SEQ ID NO. 1692LM-3174.1From 2025392to 2026141Unknown, similar to
unknown proteins
SEQ ID NO. 1693LM-3177.1From 2026157to 2027467Unknown, similar to
diaminopimelate
decarboxylase
SEQ ID NO. 1694LM-3178.1From 2027648to 2028466Unknown, similar to purine-
nucleoside phosphorylase
SEQ ID NO. 1695LM-3179.2From 2028485to 2029669Unknown, similar to
phosphopentomutase
SEQ ID NO. 1696LM-3181.2From 2029698to 2030591Unknown, similar to
integrase/recombinase
SEQ ID NO. 1697LM-3184.2From 635317to 636423Unknown, similar to
homoserine O-
acetyltransferase
SEQ ID NO. 1698LM-3186.1From 636440to 637717Unknown, similar to O-
acetylhomoserine
sulfhydrylase
SEQ ID NO. 1699LM-3188.1From 638165to 638692Unknown, similar to
unknown proteins
SEQ ID NO. 1700LM-3189.1From 638781to 639482Unknown, similar to
transcription regulator
CRP/FNR family
SEQ ID NO. 1701LM-319.1From 127188to 128324Unknown, similar to protein
gp18 from Bacteriophage
A118
SEQ ID NO. 1702LM-3190.2From 639558to 640106Unknown, similar to proteins
involved in biotin
metabolism (BioY)
SEQ ID NO. 1703LM-3191.2From 1537815to 1538501Unknown, similar to two-
component response
regulators
SEQ ID NO. 1704LM-3192.1From 1538498to 1539937Unknown, similar to two-
component sensor histidine
kinase
SEQ ID NO. 1705LM-3194.1From 1539979to 1542375Unknown, similar to
exodeoxyribonuclease V
SEQ ID NO. 1706LM-3195.1From 1542403to 1543041Unknown, similar to
unknown proteins
SEQ ID NO. 1707LM-3196.1From 1543082to 1543822Unknown, similar to
unknown proteins
SEQ ID NO. 1708LM-3198.1From 1543940to 1545055Unknown, similar to putative
tRNA (5-
methylaminomethyl-2-
thiouridylate)-
methyltransferase
SEQ ID NO. 1709LM-32.1From 2733230to 2735245Unknown, similar to
transketolase
SEQ ID NO. 1710LM-3200.2From 1545074to 1546222Unknown, similar to iron-
sulfur cofactor synthesis
protein
SEQ ID NO. 1711LM-3201.2From 1527422to 1527904Unknown, similar to
transcription elongation
factor GreA
SEQ ID NO. 1712LM-3203.2From 1526748to 1527371Unknown, similar to
unknown proteins
SEQ ID NO. 1713LM-3204.1From 1525997to 1526698Unknown, similar to 5-
methylthioadenosine/S-
adenosylhomocysteine
nucleosidase
SEQ ID NO. 1714LM-3205.1From 1524154to 1525962Unknown, similar to
oligopeptidase
SEQ ID NO. 1715LM-3206.2From 1523471to 1523992Unknown, similar to
unknown proteins
SEQ ID NO. 1716LM-3208.2From 1522374to 1523474Unknown, similar to
unknown proteins
SEQ ID NO. 1717LM-3209.1From 1521533to 1522354Unknown, similar to
shikimate 5-dehydrogenase
(AroD)
SEQ ID NO. 1718LM-321.1From 126360to 127178Unknown, similar to phage
proteins
SEQ ID NO. 1719LM-3210.1From 1521243to 1521533Unknown, similar to
unknown proteins
SEQ ID NO. 1720LM-3211.1From 1520659to 1521225Unknown, similar to
unknown proteins
SEQ ID NO. 1721LM-3212.1From 1520097to 1520672Unknown, similar to
unknown proteins
SEQ ID NO. 1722LM-3213.1From 1519736to 1520092Unknown
SEQ ID NO. 1723LM-3214.1From 1518950to 1519681Unknown, similar to
unknown proteins
SEQ ID NO. 1724LM-3215.1From 1518245to 1518847Unknown, similar to integral
membrane protein ComEA
SEQ ID NO. 1725LM-3216.1From 1517613to 1518173Unknown, similar to B. subtilis
ComEB protein
SEQ ID NO. 1726LM-3217.2From 1515356to 1517578Unknown, similar to putative
integral membrane protein
ComEC specifically required
for DNA uptake but not for
binding
SEQ ID NO. 1727LM-3218.2From 1418803to 1420095Unknown, similar to putative
proteases
SEQ ID NO. 1728LM-3220.1From 1417735to 1418685Unknown, similar to sugar
ABC transporter, permease
protein
SEQ ID NO. 1729LM-3221.1From 1416686to 1417738Unknown, similar to
permease proteins
SEQ ID NO. 1730LM-3223.1From 1415152to 1416693Unknown, similar to sugar
ABC transporter, ATP-
binding protein
SEQ ID NO. 1731LM-3226.3From 1413646to 1414719Unknown, CD4+ T cell-
stimulating antigen,
lipoprotein
SEQ ID NO. 1732LM-3228.3From 1412455to 1413294Unknown, similar to
pyrroline-5-carboxylate
reductase
SEQ ID NO. 1733LM-3231.1From 1410148to 1412421Unknown, similar to DNA
translocase
SEQ ID NO. 1734LM-3234.2From 843311to 843724Unknown, similar to E. coli
PhnB protein
SEQ ID NO. 1735LM-3235.1From 842781to 843287Unknown, similar to B. subtilis
regulatory protein
PaiA
SEQ ID NO. 1736LM-3237.1From 842313to 842765unknown, similar to
transcription regulators
SEQ ID NO. 1737LM-3239.1From 841157to 842086unknown, similar to
oxidoreductases
SEQ ID NO. 1738LM-324.1From 124495to 126363unknown, similar to
bacteriophage minor tail
proteins
SEQ ID NO. 1739LM-3241.1From 840200to 841072unknown, similar to
fructokinases
SEQ ID NO. 1740LM-3243.3From 839520to 840062unknown
SEQ ID NO. 1741LM-3244.3From 838751to 839452unknown, similar to
carbonic anhydrase
SEQ ID NO. 1742LM-3246.1From 837548to 838621Unknown, similar to
spermidine/putrescine-
binding protein
SEQ ID NO. 1743LM-3248.1From 836745to 837551unknown, similar to
spermidine/putrescine ABC
transporter, permease
protein
SEQ ID NO. 1744LM-3249.1From 835939to 836748unknown, similar to
spermidine/putrescine ABC
transporter, permease
protein
SEQ ID NO. 1745LM-3250.1From 834845to 835939unknown, similar to
spermidine/putrescine ABC
transporter, ATP-binding
protein
SEQ ID NO. 1746LM-3251.2From 834287to 834829unknown, similar to
transcription regulator
SEQ ID NO. 1747LM-3252.2From 1724612to 1725565unknown, similar to ABC
transporter and adhesion
proteins
SEQ ID NO. 1748LM-3254.1From 1725584to 1726993similar to O-succinylbenzoic
acid-CoA ligase
SEQ ID NO. 1749LM-3255.1From 1727009to 1727827unknown, similar to
dihydroxynapthoic acid
synthetase
SEQ ID NO. 1750LM-3256.1From 1727824to 1728651unknown, similar to prolyl
aminopetidases
SEQ ID NO. 1751LM-3258.1From 1728653to 1730395unknown, similar to 2-
succinyl-6-hydroxy-2,4-
cyclohexadiene-1-
carboxylate synthase/2-
oxoglutarate decarboxylase
SEQ ID NO. 1752LM-326.1From 124104to 124508unknown
SEQ ID NO. 1753LM-3260.2From 1730392to 1731780unknown, similar to
menaquinone-specific
isochorismate synthase
SEQ ID NO. 1754LM-3261.2From 2308741to 2311464Unknown, similar to
unknown proteins
SEQ ID NO. 1755LM-3262.1From 2311461to 2312696Unknown, similar to
unknown proteins
SEQ ID NO. 1756LM-3263.1From 2312839to 2313192Unknown, similar to
unknown proteins
SEQ ID NO. 1757LM-3265.1From 2313274to 2314407Unknown, similar to
unknown proteins
SEQ ID NO. 1758LM-3266.1From 2314511to 2315878Unknown, similar to
fumarate hydratase
SEQ ID NO. 1759LM-3267.1From 2315948to 2316742Unknown, similar to
unknown proteins
SEQ ID NO. 1760LM-3268.1From 2316739to 2317644Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 1761LM-3269.2From 2317950to 2320094Unknown, similar to
penicillin-binding protein
SEQ ID NO. 1762LM-3274.2From 558526to 560187Unknown, similar to putative
sulfate transporter
SEQ ID NO. 1763LM-3275.1From 558037to 558480Unknown, similar to B. subtilis
YybC protein
SEQ ID NO. 1764LM-3276.1From 557178to 557930Unknown, similar to
transcription regulator
SEQ ID NO. 1765LM-3278.1From 555660to 556976Unknown, similar to 6-
phospho-beta-glucosidase
SEQ ID NO. 1766LM-328.1From 123760to 124062Unknown
SEQ ID NO. 1767LM-3281.1From 554624to 555628Unknown, similar to
transcription regulator
SEQ ID NO. 1768LM-3283.2From 553078to 554493Unknown, similar to
multidrug resistance protein
SEQ ID NO. 1769LM-3284.1From 514004to 514381Unknown, putative secreted
protein
SEQ ID NO. 1770LM-3285.1From 514681to 515058Unknown, putative secreted
protein
SEQ ID NO. 1771LM-3286.1From 515391to 515750Unknown, putative secreted
protein
SEQ ID NO. 1772LM-3288.1From 516076to 516636Unknown
SEQ ID NO. 1773LM-329.1From 123200to 123712antigen A
SEQ ID NO. 1774LM-3290.1From 516746to 518446Unknown, similar to
unknown proteins
SEQ ID NO. 1775LM-3292.1From 518597to 519700Unknown, similar to
conserved hypothetical
proteins, highly similar to B. subtilis
YloN protein
SEQ ID NO. 1776LM-3293.1From 519789to 520268Unknown
SEQ ID NO. 1777LM-3294.2From 520426to 520791Unknown
SEQ ID NO. 1778LM-3296.2From 1561538to 1561882Unknown, similar to
unknown proteins
SEQ ID NO. 1779LM-3298.1From 1559081to 1561432Unknown, similar to single-
stranded-DNA-specific
exonuclease (RecJ)
SEQ ID NO. 1780LM-3299.1From 1558570to 1559091unknown, similar to adenine
phosphoribosyltransferase
SEQ ID NO. 1781LM-330.1From 122798to 123187antigen B
SEQ ID NO. 1782LM-3301.1From 1556148to 1558364unknown, similar to
(p)ppGpp synthetase
SEQ ID NO. 1783LM-3302.1From 1555680to 1556132Unknown, similar to
unknown proteins
SEQ ID NO. 1784LM-3303.2From 1554360to 1555643Unknown, similar to N-
acetylmuramoyl-L-alanine
amidase
SEQ ID NO. 1785LM-3305.2From 2279524to 2279919Unknown, similar to
unknown proteins
SEQ ID NO. 1786LM-3306.1From 2280238to 2281206Unknown, similar to
oligopeptide ABC
transporter (ATP-binding
protein)
SEQ ID NO. 1787LM-3307.1From 2281203to 2282279Unknown, similar to
oligopeptide ABC
transporter (ATP-binding
protein)
SEQ ID NO. 1788LM-3309.1From 2282297to 2283331Unknown, similar to
oligopeptide ABC
transporter (permease)
SEQ ID NO. 1789LM-331.1From 122104to 122520unknown, similar to
Antigen C
SEQ ID NO. 1790LM-3310.1From 2283331to 2284260Unknown, similar to
oligopeptide ABC
transporter (permease)
SEQ ID NO. 1791LM-3312.2From 2284539to 2286215Unknown, similar to
pheromone binding protein
SEQ ID NO. 1792LM-3315.1From 1528098to 1528727unknown, similar to Uridine
kinase
SEQ ID NO. 1793LM-3316.1From 1528724to 1529377Unknown, similar to O-
methyltransferase
SEQ ID NO. 1794LM-3319.1From 1529454to 1530524Unknown, similar to
unknown proteins
SEQ ID NO. 1795LM-332.1From 121664to 122092unknown, similar to
Antigen D
SEQ ID NO. 1796LM-3320.1From 1530727to 1531353Unknown, similar to
unknown proteins
SEQ ID NO. 1797LM-3321.1From 1531398to 1531700Unknown, similar to
unknown proteins
SEQ ID NO. 1798LM-3322.1From 1531716to 1532132Unknown, similar to
unknown proteins
SEQ ID NO. 1799LM-3323.1From 1532129to 1532401Unknown
SEQ ID NO. 1800LM-3326.1From 1532493to 1535132alanyl-tRNA synthetase
SEQ ID NO. 1801LM-3327.1From 1535439to 1536134Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 1802LM-3329.2From 1536143to 1537648Unknown, similar to
transporter
SEQ ID NO. 1803LM-333.1From 121112to 121447Unknown, similar to putative
repressor C1 From
lactococcal bacteriophage
Tuc2009
SEQ ID NO. 1804LM-3330.2From 512291to 513268Unknown, similar to
oxetanocin A resistance
protein oxrB
SEQ ID NO. 1805LM-3331.1From 511966to 512208Unknown
SEQ ID NO. 1806LM-3332.1From 511215to 511562Unknown
SEQ ID NO. 1807LM-3333.1From 510163to 511212Unknown
SEQ ID NO. 1808LM-3334.1From 509400to 510287Unknown
SEQ ID NO. 1809LM-3335.2From 507723to 508733Unknown
SEQ ID NO. 1810LM-3336.3From 506368to 506997Unknown, weakly similar to
site-specific DNA-
methyltransferase
SEQ ID NO. 1811LM-3337.3From 505351to 506223Unknown
SEQ ID NO. 1812LM-3338.2From 1487699to 1489579DNA primase
SEQ ID NO. 1813LM-3339.1From 1489599to 1490039Unknown, similar to
unknown proteins
SEQ ID NO. 1814LM-334.1From 120654to 121106Unknown, similar to protein
gp35 From Bacteriophage
A118
SEQ ID NO. 1815LM-3340.1From 1490229to 1491053Unknown, similar to
unknown protein
SEQ ID NO. 1816LM-3341.1From 1491087to 1493153Unknown, similar to glycyl-
tRNA synthetase beta chain
SEQ ID NO. 1817LM-3342.1From 1493146to 1494036Unknown, similar to glycyl-
tRNA synthetase alpha
chain
SEQ ID NO. 1818LM-3343.1From 1494317to 1495084Unknown, similar to B. subtilis
RecO protein
involved in DNA repair and
homologous recombination
SEQ ID NO. 1819LM-3344.1From 1495221to 1495850Unknown
SEQ ID NO. 1820LM-3345.1From 1495899to 1496804Unknown, similar to GTP
binding proteins
SEQ ID NO. 1821LM-3347.1From 1496801to 1497196Unknown, similar to cytidine
deaminase
SEQ ID NO. 1822LM-3348.1From 1497221to 1497616Unknown, similar to
diacylglycerol kinase
SEQ ID NO. 1823LM-3349.1From 1497594to 1498079Unknown, similar to
unknown proteins
SEQ ID NO. 1824LM-335.1From 119740to 120435Unknown, weakly similar to
transcription regulators,
Fnr/Crp family
SEQ ID NO. 1825LM-3355.1From 2012582to 2013727Unknown, similar to similar
to ribosomal protein S1 like
protein
SEQ ID NO. 1826LM-3356.1From 2014086to 2014760Unknown, similar to
cytidylate kinase
SEQ ID NO. 1827LM-3357.1From 2014776to 2015738Unknown, similar to
asparaginase
SEQ ID NO. 1828LM-3359.1From 2015819to 2016538Unknown, similar to
unknown proteins
SEQ ID NO. 1829LM-336.1From 119063to 119770Unknown
SEQ ID NO. 1830LM-3360.1From 2016858to 2018261Unknown, similar to ATP-
dependent DNA helicase
SEQ ID NO. 1831LM-3361.2From 2018258to 2019265Unknown, similar to
unknown proteins
SEQ ID NO. 1832LM-3362.2From 2019420to 2019644Unknown, similar to
ferredoxin
SEQ ID NO. 1833LM-3363.3From 2019686to 2020297Unknown, similar to
unknown protein
SEQ ID NO. 1834LM-3364.2From 1867802to 1869460Unknown, similar to
unknown protein
SEQ ID NO. 1835LM-3371.2From 1872724to 1873620Unknown, similar to protein-
tyrosine phosphatase
SEQ ID NO. 1836LM-3373.2From 1873764to 1875116Unknown, similar to signal
recognition particle protein
Ffh
SEQ ID NO. 1837LM-3374.2From 1875129to 1875461Unknown, similar to
unknown proteins
SEQ ID NO. 1838LM-3375.2From 1358834to 1359103ribosomal protein S15
SEQ ID NO. 1839LM-3376.1From 1359352to 1361523Polynucleotide
phosphorylase (PNPase)
SEQ ID NO. 1840LM-3377.1From 1361564to 1362604unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1841LM-3378.2From 1362765to 1363244unknown, similar to B. subtilis
YqzC protein
SEQ ID NO. 1842LM-3379.2From 1363272to 1363628unknown, similar to B. subtilis
YqzD protein
SEQ ID NO. 1843LM-338.1From 117841to 118956Unknown, similar to lipase
SEQ ID NO. 1844LM-3380.2From 1918681to 1919505unknown, similar to
unknown proteins
SEQ ID NO. 1845LM-3381.1From 1919525to 1920436unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1846LM-3382.1From 1920436to 1920900unknown, highly similar to
signal peptidase II
SEQ ID NO. 1847LM-3384.1From 1920982to 1922265unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1848LM-3385.1From 1922439to 1923809unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1849LM-3387.1From 1923825to 1924757unknown, similar to
adhesion binding proteins
and lipoproteins with
multiple specificity for metal
cations
SEQ ID NO. 1850LM-3389.1From 1924754to 1925596unknown, similar to integral
membrane proteins, ABC
transporter
SEQ ID NO. 1851LM-3390.3From 1925600to 1926322unknown, similar to
probable ABC transporter,
ATP-binding proteins
SEQ ID NO. 1852LM-3391.3From 262657to 263262RNA polymerase sigma-30
factor (sigma-H)
SEQ ID NO. 1853LM-3392.1From 262064to 262576Unknown, similar to B. subtilis
Yacp protein
SEQ ID NO. 1854LM-3393.1From 261306to 262061Unknown, similar to
conserved hypothetical
proteins like to B. subtilis
YacO protein
SEQ ID NO. 1855LM-3394.1From 260896to 261306Unknown, highly similar to
B. subtilis YazC protein
SEQ ID NO. 1856LM-3398.1From 259477to 260892cysteinyl-tRNA synthetase
SEQ ID NO. 1857LM-340.1From 116972to 117805Unknown, similar to
transcriptional regulatory
proteins, AraC family
SEQ ID NO. 1858LM-3400.1From 258856to 259470unknown, similar to serine
O-acetyltransferase
SEQ ID NO. 1859LM-3403.3From 256983to 258458unknown, highly similar to
glutamyl-tRNA synthetase
SEQ ID NO. 1860LM-3404.3From 256491to 256964Unknown, similar to B. subtilis
YacN protein
SEQ ID NO. 1861LM-3405.2From 1862459to 1862893Unknown, similar to
transcription regulator
SEQ ID NO. 1862LM-3406.2From 1862893to 1863480Unknown, weakly similar to
Nad(P)h Oxidoreductase
chain B
SEQ ID NO. 1863LM-3407.1From 1863501to 1864217Unknown, similar to
unknown proteins
SEQ ID NO. 1864LM-3408.1From 1864248to 1864637Unknown
SEQ ID NO. 1865LM-3409.1From 1864657to 1865394Unknown, similar to E. coli
tRNA (guanine-N1)
methyltransferase
SEQ ID NO. 1866LM-3410.1From 1865394to 1865912Unknown, similar to putative
16S rRNA processing
protein RimM
SEQ ID NO. 1867LM-3411.1From 1865922to 1866308Unknown, similar to
unknown proteins
SEQ ID NO. 1868LM-3412.1From 1866336to 1867070Unknown, similar to
unknown proteins
SEQ ID NO. 1869LM-3414.2From 1867427to 1867699ribosomal protein S16
SEQ ID NO. 1870LM-3415.4From 1723649to 1724122unknown, some similarity to
hypothetical proteins
SEQ ID NO. 1871LM-3416.3From 1723356to 1723598unknown, some similarity to
hypothetical proteins
SEQ ID NO. 1872LM-3417.3From 1722431to 1723339unknown, similar to L-
lactate dehydrogenases
SEQ ID NO. 1873LM-342.1From 115087to 116829Unknown, ABC transporter,
ATP-binding protein
SEQ ID NO. 1874LM-3420.4From 1716858to 1717160Unknown
SEQ ID NO. 1875LM-3421.2From 1748792to 1749733unknown; similar to
glycerate dehydrogenases
SEQ ID NO. 1876LM-3422.1From 1749878to 1751176glutamate-1-semialdehyde
aminotransferase
SEQ ID NO. 1877LM-3424.1From 1751283to 1752371unknown, similar to
hypothetical proteins
SEQ ID NO. 1878LM-3426.1From 1752413to 1752940unknown, similar to
hypothetical proteins
SEQ ID NO. 1879LM-3427.1From 1753095to 1753841unknown, similar to glucose
1-dehydrogenase
SEQ ID NO. 1880LM-3428.1From 1753845to 1754942unknown, similar to A/G-
specific adenine
glycosylase
SEQ ID NO. 1881LM-3429.1From 1755143to 1756123unknown, similar to
hypothetical proteins
SEQ ID NO. 1882LM-343.1From 113313to 115094Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 1883LM-3430.2From 1756156to 1756617unknown, similar to
deoxyuridine triphosphate
nucleotidohydrolases
SEQ ID NO. 1884LM-3431.2From 2891269to 2891652Unknown, hypothetical
secreted protein
SEQ ID NO. 1885LM-3432.2From 2890779to 2891171Unknown, hypothetical
secreted protein
SEQ ID NO. 1886LM-3433.1From 2890274to 2890666Unknown, hypothetical
secreted protein
SEQ ID NO. 1887LM-3434.1From 2888994to 2890277Unknown
SEQ ID NO. 1888LM-3436.1From 2888639to 2889001Unknown
SEQ ID NO. 1889LM-3439.2From 2887499to 2888215GidB protein
SEQ ID NO. 1890LM-3442.2From 885440to 886882unknown, similar to
Glutamine binding and
transport protein
SEQ ID NO. 1891LM-3444.1From 886875to 887603unknown, similar to amino
acid ABC transporter, ATP-
binding protein
SEQ ID NO. 1892LM-3445.2From 887652to 889502Unknown, similar to
amidases
SEQ ID NO. 1893LM-3447.3From 2068977to 2069648Unknown, similar to
deoxyribose-phosphate
aldolase
SEQ ID NO. 1894LM-3448.1From 2069710to 2070657Unknown, similar to
transcription repressor of
dra/nupC/pdp operon DeoR
SEQ ID NO. 1895LM-3449.1From 2070754to 2071170Unknown, similar to PTS
mannose-specific enzyme
IIA component
SEQ ID NO. 1896LM-345.1From 112386to 113288Unknown, similar to
transcription regulator
SEQ ID NO. 1897LM-3450.1From 2071187to 2072182Unknown, similar to opine
catabolism protein
SEQ ID NO. 1898LM-3451.1From 2072205to 2073269Unknown, weakly similar to
glucosamine-fructose-6-
phosphate
aminotransferase
SEQ ID NO. 1899LM-3452.1From 2073282to 2074103Unknown, similar to
mannose-specific enzyme
IID component
SEQ ID NO. 1900LM-3453.1From 2074084to 2074902Unknown, similar to PTS
mannose-specific enzyme
IIC component
SEQ ID NO. 1901LM-3455.1From 2074925to 2075392Unknown, similar to PTS
mannose-specific enzyme
IIB component
SEQ ID NO. 1902LM-3456.2From 2075412to 2076113Unknown, similar to
transcription regulator GntR
family
SEQ ID NO. 1903LM-3458.2From 2076115to 2076846Unknown, similar to
transcription regulator GntR
family
SEQ ID NO. 1904LM-3459.2From 607372to 608199Unknown, similar histidinol
phosphate phosphatase
SEQ ID NO. 1905LM-3460.1From 608184to 608480Unknown, similar to
methyltransferase
SEQ ID NO. 1906LM-3461.1From 608557to 609588Unknown
SEQ ID NO. 1907LM-3462.2From 609629to 610924Unknown, conserved
hypothetical protein
SEQ ID NO. 1908LM-3463.2From 1776257to 1776829Unknown
SEQ ID NO. 1909LM-3465.1From 1774991to 1775983unknown, similar to cell-
shape determining proteins
SEQ ID NO. 1910LM-3466.1From 1773514to 1774773unknown, similar to
multidrug resistance protein,
integral membrane protein
SEQ ID NO. 1911LM-3468.3From 1772178to 1773410unknown, highly similar to
aminopeptidases
SEQ ID NO. 1912LM-3469.2From 833117to 833587Unknown
SEQ ID NO. 1913LM-3473.1From 831025to 833073Unknown, similar to putative
Na+/H+ antiporter
SEQ ID NO. 1914LM-3475.1From 830259to 830897Unknown, weakly similar to
GTP-pyrophosphokinase
SEQ ID NO. 1915LM-3479.2From 827666to 828016Unknown, similar to B. subtilis
YqkB protein
SEQ ID NO. 1916LM-3480.3From 826752to 827513Unknown
SEQ ID NO. 1917LM-3481.2From 2011128to 2012438Unknown, similar to
unknown protein
SEQ ID NO. 1918LM-3482.1From 2010089to 2011105Unknown, similar to
NAD(P)H-dependent
glycerol-3-phosphate
dehydrogenase
SEQ ID NO. 1919LM-3483.2From 2009006to 2009986Unknown, similar to protein-
tyrosine/serine phosphatase
SEQ ID NO. 1920LM-3484.1From 2008319to 2008594Unknown, similar to non-
specific DNA-binding
protein HU
SEQ ID NO. 1921LM-3485.1From 2007515to 2008084Unknown, similar to GTP
cyclohydrolase I
SEQ ID NO. 1922LM-3486.1From 2006689to 2007456Unknown, similar to
heptaprenyl diphosphate
synthase component I
SEQ ID NO. 1923LM-3487.1From 2005953to 2006666Unknown, similar to 2-
heptaprenyl-1,4-
naphthoquinone
methyltransferase
SEQ ID NO. 1924LM-3488.1From 2004977to 2005942Unknown, similar to
heptaprenyl diphosphate
synthase component II
(menaquinone biosynthesis)
SEQ ID NO. 1925LM-3489.2From 2004515to 2004958Unknown, similar to
nucleoside diphosphate
kinase
SEQ ID NO. 1926LM-349.1From 110013to 112283Unknown, highly similar to
chitinase B
SEQ ID NO. 1927LM-3490.2From 1029208to 1029819unknown, similar to
hypothetical protein
SEQ ID NO. 1928LM-3491.1From 1029965to 1030756Unknown
SEQ ID NO. 1929LM-3493.3From 1030757to 1032229unknown, similar to
PHYTOENE
DEHYDROGENASE
(EC 1.3.—.—) (PHYTOENE
DESATURASE)
SEQ ID NO. 1930LM-3494.3From 1032333to 1032617Unknown, similar to B. subtilis
protein YkvS
SEQ ID NO. 1931LM-3495.3From 1032924to 1033190PHOSPHOCARRIER
PROTEIN HPR
(HISTIDINE-CONTAINING
PROTEIN).
SEQ ID NO. 1932LM-3497.4From 1033190to 1034908Phosphotransferase system
enzyme I
SEQ ID NO. 1933LM-3498.1From 1035021to 1036055Unknown, conserved
hypothetical protein
SEQ ID NO. 1934LM-35.1From 2735279to 2735953Unknown, similar to
ribulose-5-phosphate 3-
epimerase
SEQ ID NO. 1935LM-3500.1From 1036068to 1036928Unknown, similar to 3-
hydroxyisobutyrate
dehydrogenase (B. subtilis
YkwC protein)
SEQ ID NO. 1936LM-3501.2From 1036971to 1038116Unknown, similar to
aminotransferases (to B. subtilis
PatA protein)
SEQ ID NO. 1937LM-3504.2From 641208to 642308Unknown, similar to cell
surface protein
SEQ ID NO. 1938LM-3505.1From 642414to 642914Unknown, weakly similar to
transcription regulator
SEQ ID NO. 1939LM-3506.1From 642962to 643348Unknown
SEQ ID NO. 1940LM-3507.1From 643400to 643744Unknown, similar to B. subtilis
YvlA protein
SEQ ID NO. 1941LM-3508.2From 643891to 645231Unknown, conserved
hypothetical membrane
protein
SEQ ID NO. 1942LM-351.2From 109349to 109720Unknown
SEQ ID NO. 1943LM-3510.2From 2145652to 2146044Unknown
SEQ ID NO. 1944LM-3511.1From 2146089to 2146298Unknown
SEQ ID NO. 1945LM-3512.1From 2146449to 2147426Unknown, similar to
conjugated bile acid
hydrolase
SEQ ID NO. 1946LM-3514.2From 2147684to 2149312Class I heat-shock protein
(chaperonin) GroEL
SEQ ID NO. 1947LM-3517.2From 1888615to 1890273Unknown, similar to
unknown proteins
SEQ ID NO. 1948LM-3518.1From 1887913to 1888575Unknown, similar to
phosphoglycerate
dehydrogenase
SEQ ID NO. 1949LM-3519.1From 1886887to 1887777Unknown, similar to L-
serine dehydratase
SEQ ID NO. 1950LM-352.2From 108624to 109256Unknown, similar to NADH
oxidase
SEQ ID NO. 1951LM-3520.2From 1884846to 1886894Unknown, similar to ATP-
dependent DNA helicase
recG
SEQ ID NO. 1952LM-3523.1From 2143109to 2144734Unknown, similar to copper
export proteins
SEQ ID NO. 1953LM-3524.1From 2142480to 2143097Unknown, similar to
unknown protein
SEQ ID NO. 1954LM-3525.1From 2141820to 2142461Unknown, similar to
unknown protein
SEQ ID NO. 1955LM-3527.1From 2141071to 2141814Unknown, similar to
potassium channel subunit
SEQ ID NO. 1956LM-3528.2From 2140046to 2140963Unknown, similar to heme O
oxygenase
SEQ ID NO. 1957LM-353.1From 108320to 108610Unknown
SEQ ID NO. 1958LM-3533.1From 1053426to 1054277unknown
SEQ ID NO. 1959LM-3534.1From 1054319to 1055284Unknown, similar to B. subtilis
LytR protein
SEQ ID NO. 1960LM-3535.1From 1055393to 1057060Unknown, similar to
conserved hypothetical
proteins (in particular B. subtilis
YkqC)
SEQ ID NO. 1961LM-3537.1From 1057761to 1058531Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1962LM-3538.3From 1058580to 1059608Unknown, similar to
transcriptional regulator,
LacI family
SEQ ID NO. 1963LM-3539.1From 2286839to 2287183Unknown
SEQ ID NO. 1964LM-354.1From 107915to 108208Unknown, similar to
transcription regulator
SEQ ID NO. 1965LM-3541.1From 2287518to 2288513Tryptophanyl-tRNA
synthetase
SEQ ID NO. 1966LM-3543.1From 2288571to 2288987Unknown, similar to
unknown protein
SEQ ID NO. 1967LM-3544.1From 2288971to 2289423Unknown, similar to
transcription regulator
SEQ ID NO. 1968LM-3547.1From 2289545to 2290786Unknown, similar to 3-
oxoacyl-acyl-carrier protein
synthase
SEQ ID NO. 1969LM-3549.2From 2290989to 2291927Unknown, similar to 3-
oxoacyl-acyl-carrier protein
synthase
SEQ ID NO. 1970LM-355.1From 107499to 107849Unknown
SEQ ID NO. 1971LM-3551.1From 1242100to 1244508Phenylalanyl-tRNA
synthetase beta subunit
SEQ ID NO. 1972LM-3553.2From 1241048to 1242100Phenylalanyl-tRNA
synthetase beta subunit
SEQ ID NO. 1973LM-3555.2From 2305632to 2305994Unknown, similar to
unknown protein
SEQ ID NO. 1974LM-3556.1From 2305180to 2305602Unknown, similar to
histidine triad (HIT) protein
SEQ ID NO. 1975LM-3557.1From 2304267to 2305019Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 1976LM-3558.1From 2303060to 2304283Unknown, similar to ABC
transporter (membrane
protein)
SEQ ID NO. 1977LM-3559.1From 2302518to 2303021Unknown, similar to
unknown protein
SEQ ID NO. 1978LM-356.1From 106988to 107377unknown
SEQ ID NO. 1979LM-3560.1From 2301260to 2302321Unknown, similar to
uroporphyrinogen III
decarboxylase
SEQ ID NO. 1980LM-3561.1From 2300334to 2301263Unknown, similar to
ferrochelatase
SEQ ID NO. 1981LM-3562.2From 2299833to 2300177Unknown
SEQ ID NO. 1982LM-3563.2From 1548034to 1548456Unknown, similar to
unknown protein
SEQ ID NO. 1983LM-3565.2From 1548701to 1549906Unknown, similar to
ammonium transporter
NrgA
SEQ ID NO. 1984LM-3566.1From 1549920to 1550285Unknown, similar to
nitrogen regulatory PII
protein
SEQ ID NO. 1985LM-3567.1From 1550435to 1550815Unknown
SEQ ID NO. 1986LM-3569.1From 1550854to 1552629Aspartyl-tRNA synthetase
SEQ ID NO. 1987LM-357.1From 105951to 106862Unknown, similar to PTS
system mannose-specific
factor IID
SEQ ID NO. 1988LM-3570.2From 1552632to 1553909Histidyl-tRNA synthetase
SEQ ID NO. 1989LM-3571.4From 1713234to 1715099Unknown, similar to
asparagine synthetase
SEQ ID NO. 1990LM-3572.1From 1712634to 1713209Unknown, similar to
conserved hypothetical
protein
SEQ ID NO. 1991LM-3573.2From 1711672to 1712637Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 1992LM-3574.3From 896003to 897028Unknown, similar to
transcription regulator lacl
family
SEQ ID NO. 1993LM-3576.1From 895278to 895991Unknown, similar to
carboxylesterase
SEQ ID NO. 1994LM-3577.1From 893838to 895211UDP-N-
acetylmuramoylalanyl-D-
glutamyl-2,6-
diaminopimelate-D-alanyl-
D-alanyl ligase
SEQ ID NO. 1995LM-3578.1From 892662to 893774D-alanine-D-alanine ligase
SEQ ID NO. 1996LM-3579.1From 892192to 892512Unknown, similar to E. coli
SugE protein
(transmembrane
chaperone)
SEQ ID NO. 1997LM-358.1From 105123to 105929Unknown, similar to PTS
system mannose-specific,
factor IIC
SEQ ID NO. 1998LM-3581.1From 891848to 892189Unknown, similar to E. coli
SugE protein
(transmembrane
chaperone)
SEQ ID NO. 1999LM-3582.1From 891277to 891831Unknown, similar to
transcription regulator
TetR/AcrR family
SEQ ID NO. 2000LM-3583.3From 890439to 891197Unknown
SEQ ID NO. 2001LM-3585.2From 1926935to 1928425Unknown, similar to
carboxy-terminal processing
proteinase
SEQ ID NO. 2002LM-3588.1From 1928798to 1931011Unknown, similar to heavy
metal-transporting ATPases
SEQ ID NO. 2003LM-3589.1From 1931026to 1931319Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2004LM-359.1From 104134to 105099Unknown, similar to PTS
system mannose-specific,
factor IIAB
SEQ ID NO. 2005LM-3590.2From 1931442to 1932266Unknown, similar to similar
to D-alanyl-D-alanine
carboxypeptidases
SEQ ID NO. 2006LM-3591.3From 1932324to 1933025Purine nucleoside
phosphorylase
SEQ ID NO. 2007LM-3592.2From 2487261to 2488337Unknown
SEQ ID NO. 2008LM-3593.1From 2488379to 2489209Unknown, conserved
lipoprotein
SEQ ID NO. 2009LM-3596.1From 2489274to 2489948Unknown, similar to ABC
transporter, permease
protein
SEQ ID NO. 2010LM-3597.1From 2489945to 2490967Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 2011LM-3599.1From 2491552to 2492694Unknown, similar to two-
component sensor histidine
kinase
SEQ ID NO. 2012LM-36.1From 2735962to 2736402Unknown, similar to ribose
5-phosphate epimerase
SEQ ID NO. 2013LM-360.1From 103204to 103836unknown
SEQ ID NO. 2014LM-3600.1From 2492684to 2493379Unknown, similar to two-
component response
regulator
SEQ ID NO. 2015LM-3601.3From 2493454to 2494329Unknown, conserved
hypothetical protein
SEQ ID NO. 2016LM-3602.2From 1435635to 1437866Pyruvate-formate lyase
SEQ ID NO. 2017LM-3603.1From 1437943to 1438689Pyruvate-formate lyase
activating enzyme
SEQ ID NO. 2018LM-3604.1From 1438724to 1439254Unknown, similar to
unknown proteins
SEQ ID NO. 2019LM-3605.2From 1439421to 1440614Unknown, similar to
multidrug-efflux transporter
SEQ ID NO. 2020LM-3608.2From 1104863to 1105864Unknown, similar to
TEICHOIC ACID
TRANSLOCATION ATP-
BINDING PROTEIN TAGH
(ABC transporter)
SEQ ID NO. 2021LM-361.1From 102468to 103028unknown
SEQ ID NO. 2022LM-3612.1From 1107975to 1109663Unknown, similar to
TEICHOIC ACID
BIOSYNTHESIS PROTEIN
B PRECURSOR
SEQ ID NO. 2023LM-3613.1From 1109705to 1110577Unknown, similar to putative
UDP-glucose
pyrophosphorylases
SEQ ID NO. 2024LM-3614.3From 1110769to 1113627Unknown, similar to B. subtilis
YfhO protein
SEQ ID NO. 2025LM-3615.2From 1277312to 1278025Unknown, similar to
transcription regulator GntR
family
SEQ ID NO. 2026LM-3616.1From 1278056to 1279702Unknown, similar to
alpha, alpha-
phosphotrehalase
SEQ ID NO. 2027LM-3617.3From 1279721to 1281205Unknown, similar to PTS
system trehalose specific
enzyme IIBC
SEQ ID NO. 2028LM-3618.3From 1434661to 1435209Unknown, similar to putative
anti-terminator regulatory
protein
SEQ ID NO. 2029LM-362.1From 102155to 102478Unknown, similar to ATP
synthase epsilon chain
SEQ ID NO. 2030LM-3620.2From 1432839to 1434644DNA mismatch repair
protein
SEQ ID NO. 2031LM-3623.1From 1430237to 1432819DNA mismatch repair
(recognition)
SEQ ID NO. 2032LM-3624.2From 1429765to 1430127Unknown, similar to B. subtilis
YmcA protein
SEQ ID NO. 2033LM-3625.2From 758656to 759396Unknown, similar to
riboflavin kinase/FAD
synthase
SEQ ID NO. 2034LM-3626.2From 756738to 758543Unknown, similar to L-
glutamine-D-fructose-6-
phosphate
amidotransferase
SEQ ID NO. 2035LM-3627.2From 598020to 598925Unknown, putative
membrane protein
SEQ ID NO. 2036LM-3628.2From 598968to 600344Unknown, similar to NADP-
specific glutamate
dehydrogenase
SEQ ID NO. 2037LM-363.1From 100772to 102142Unknown, similar to ATP
synthase beta chain
SEQ ID NO. 2038LM-3630.4From 134782to 136290Unknown, similar to inosine
monophosphate
dehydrogenase
SEQ ID NO. 2039LM-3631.2From 133961to 134710Unknown, conserved
hypothetical protein
SEQ ID NO. 2040LM-3632.2From 2292047to 2293174Unknown, similar to N-
acetylmuramoyl-L-alanine
amidase and to internalin B
SEQ ID NO. 2041LM-3633.1From 2293775to 2294464Unknown, similar to
phosphoglyceromutase 1
SEQ ID NO. 2042LM-3635.2From 2294555to 2297155Unknown, similar to
endopeptidase Clp ATP-
binding chain B (ClpB)
SEQ ID NO. 2043LM-3636.2From 1281321to 1281773unknown
SEQ ID NO. 2044LM-3637.1From 1281820to 1282284unknown
SEQ ID NO. 2045LM-3638.1From 1282473to 1283393unknown
SEQ ID NO. 2046LM-3639.1From 1283413to 1284660Gamma-glutamyl phosphate
reductase
SEQ ID NO. 2047LM-364.1From 99902to 100771Unknown, similar to ATP
synthase gamma chain
SEQ ID NO. 2048LM-3640.1From 1284644to 1285474Gamma-glutamyl kinase
SEQ ID NO. 2049LM-3643.3From 1285610to 1286749unknown
SEQ ID NO. 2050LM-3644.4From 1286830to 1287225Unknown, similar to
transcriptional regulator
(phage-related)
SEQ ID NO. 2051LM-3645.2From 1765915to 1767294unknown, similar to similar
to RNA methyltransferases
SEQ ID NO. 2052LM-3647.1From 1765499to 1765900unknown, similar to
glutathione transferase -
fosfomycin resistance
protein
SEQ ID NO. 2053LM-3648.1From 1765117to 1765470unknown
SEQ ID NO. 2054LM-3649.1From 1763941to 1764843unknown, some similarities
to methyl-accepting
chemotaxis proteins
SEQ ID NO. 2055LM-3650.1From 1763209to 1763751unknown, similar to
ribosomal-protein-alanine
N-acetyltransferase
SEQ ID NO. 2056LM-3652.2From 1762200to 1763165unknown, similar to putative
transmembrane proteins
SEQ ID NO. 2057LM-3656.2From 2840286to 2841896Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 2058LM-3657.2From 2839698to 2840228Unknown, similar to
unknown protein
SEQ ID NO. 2059LM-3658.1From 1595227to 1597149threonyl-tRNA synthetase
SEQ ID NO. 2060LM-3659.1From 1597499to 1598422primosome component
(helicase loader) Dnal
SEQ ID NO. 2061LM-366.1From 98409to 99905Unknown, similar to ATP
synthase alpha chain
SEQ ID NO. 2062LM-3660.2From 1598432to 1599808chromosome replication
initiation/membrane
attachment protein DnaB
SEQ ID NO. 2063LM-3662.3From 2708276to 2709358Unknown, conserved
hypothetical lipoprotein
SEQ ID NO. 2064LM-3663.2From 2707235to 2708194Unknown, weakly similar to
E. coli MenA protein
SEQ ID NO. 2065LM-3664.1From 2706415to 2707215Unknown, similar to B. subtilis
YbaF protein
SEQ ID NO. 2066LM-3666.1From 2705746to 2706054ribosomal protein S10
SEQ ID NO. 2067LM-3667.1From 2705082to 2705711ribosomal protein L3
SEQ ID NO. 2068LM-3668.1From 2704433to 2705056ribosomal protein L4
SEQ ID NO. 2069LM-3669.1From 2704149to 2704433ribosomal protein L23
SEQ ID NO. 2070LM-367.1From 97381to 98412Unknown, weakly similar to
ATP synthase delta chain
SEQ ID NO. 2071LM-3671.2From 2703275to 2704108ribosomal protein L2
SEQ ID NO. 2072LM-3673.2From 2033587to 2034528Unknown, similar to
ferrichrome binding protein
SEQ ID NO. 2073LM-3674.1From 2032460to 2033485Unknown, similar to
ferrichrome ABC transporter
(permease)
SEQ ID NO. 2074LM-3676.2From 2031438to 2032460Unknown, similar to
ferrichrome ABC transporter
(permease)
SEQ ID NO. 2075LM-3677.2From 1756864to 1757673unknown, similar to
hypothetical proteins
SEQ ID NO. 2076LM-3678.1From 1757771to 1758673unknown, similar to CDP-
abequose synthase
SEQ ID NO. 2077LM-3679.2From 1758694to 1761291unknown, similar to putative
membrane proteins
SEQ ID NO. 2078LM-368.1From 97127to 97369Unknown, similar to ATP
synthase C chain
SEQ ID NO. 2079LM-3680.2From 1761318to 1761992unknown, similar to
unknown proteins
SEQ ID NO. 2080LM-3681.2From 1708312to 1708518unknown
SEQ ID NO. 2081LM-3684.3From 1708857to 1711268leucyl-tRNA synthetase
SEQ ID NO. 2082LM-3692.1From 2161161to 2162054Unknown
SEQ ID NO. 2083LM-3693.1From 2160538to 2161164unknown
SEQ ID NO. 2084LM-3694.1From 2160043to 2160432Unknown, similar to
unknown protein
SEQ ID NO. 2085LM-3695.2From 980177to 981307Unknown, similar to B. subtilis
ComEC protein
SEQ ID NO. 2086LM-3696.1From 981743to 982960unknown, hypothetical
transport protein
SEQ ID NO. 2087LM-3697.2From 982957to 983646unknown, similar to
transcription regulator
SEQ ID NO. 2088LM-3698.2From 983768to 984814unknown, conserved
hypothetical membrane
protein
SEQ ID NO. 2089LM-3699.2From 2658024to 2658851Unknown, conserved
hypothetical protein
SEQ ID NO. 2090LM-370.1From 94764to 97070unknown
SEQ ID NO. 2091LM-3701.3From 1936764to 1937603unknown, similar to
hypothetical proteins
SEQ ID NO. 2092LM-3703.3From 1935964to 1936749unknown, similar to
hypothetical proteins
SEQ ID NO. 2093LM-3704.3From 1935334to 1935948unknown, similar to
hypothetical proteins
SEQ ID NO. 2094LM-3705.1From 1934765to 1935298unknown, similar to peptidyl
methionine sulfoxide
reductases
SEQ ID NO. 2095LM-3706.1From 1934321to 1934758unknown, similar to
transcriptional regulator
(PilB family)
SEQ ID NO. 2096LM-3707.3From 1933333to 1934238unknown, similar to
dehydogenases and
hypothetical proteins
SEQ ID NO. 2097LM-3708.2From 822611to 823531Unknown, conserved
hypothetical protein
SEQ ID NO. 2098LM-3709.1From 821886to 822527Unknown, similar to B. subtilis
YwnB protein
SEQ ID NO. 2099LM-3710.1From 821070to 821771Unknown, similar to
conserved hypothetical
protein
SEQ ID NO. 2100LM-3711.3From 820145to 821035Unknown, similar to
conserved hypothetical
protein
SEQ ID NO. 2101LM-3712.2From 1767359to 1767739unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2102LM-3713.1From 1767846to 1768490unknown, similar to
deoxyguanosine
kinase/deoxyadenosine
kinase(I) subunit
SEQ ID NO. 2103LM-3714.1From 1768520to 1769389unknown, similar to
transport proteins
SEQ ID NO. 2104LM-3715.1From 1769720to 1770526unknown, similar to
aminoglycoside N3-
acetyltransferases
SEQ ID NO. 2105LM-3716.2From 1770648to 1771406unknown, similar to
methionine
aminopeptidases
SEQ ID NO. 2106LM-3717.2From 1818007to 1818516Unknown
SEQ ID NO. 2107LM-3718.1From 1817112to 1817879Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 2108LM-3719.3From 1815143to 1817122Unknown, similar to ABC
transporter (permease)
SEQ ID NO. 2109LM-3720.2From 1293088to 1293654unknown, similar to type-I
signal peptidase
SEQ ID NO. 2110LM-3722.1From 1291710to 1292969ATP-dependent Clp
protease ATP-binding
subunit ClpX
SEQ ID NO. 2111LM-3724.1From 1290241to 1291524trigger factor (prolyl
isomerase)
SEQ ID NO. 2112LM-3725.2From 1289188to 1290126unknown
SEQ ID NO. 2113LM-3726.3From 1703330to 1703824unknown, putative cell
surface protein
SEQ ID NO. 2114LM-3727.1From 1704618to 1705181unknown, similar to
unknown proteins
SEQ ID NO. 2115LM-3728.1From 1705199to 1705630unknown
SEQ ID NO. 2116LM-373.1From 88888to 94767Unknown
SEQ ID NO. 2117LM-3732.2From 1706204to 1707088similar to translation
elongation factor
SEQ ID NO. 2118LM-3733.3From 1707168to 170791730S ribosomal protein S2
SEQ ID NO. 2119LM-3735.3From 1645412to 1645936Unknown, similar to general
stress protein
SEQ ID NO. 2120LM-3736.3From 1644106to 16451913-deoxy-D-arabino-
heptulosonate 7-phosphate
synthase
SEQ ID NO. 2121LM-3737.1From 1642865to 1643872catabolite control protein A
SEQ ID NO. 2122LM-3739.2From 1641284to 1642543tyrosyl-tRNA synthetase
SEQ ID NO. 2123LM-374.1From 88121to 88810unknown
SEQ ID NO. 2124LM-3746.2From 1428442to 1428939Unknown, similar to N-
acetyltransferase
SEQ ID NO. 2125LM-3747.1From 1426766to 1428328Unknown, similar to
unknown protein
SEQ ID NO. 2126LM-3749.2From 1425419to 1426465Recombination protein recA
SEQ ID NO. 2127LM-375.1From 86747to 87730Unknown, similar to
oxidoreductases
SEQ ID NO. 2128LM-3750.2From 356890to 357705Unknown, similar to
transposase
SEQ ID NO. 2129LM-3754.2From 360172to 360507Unknown
SEQ ID NO. 2130LM-3756.2From 1407859to 1408818Unknown, similar to
unknown protein
SEQ ID NO. 2131LM-3757.2From 1406742to 1407818Unknown, similar to
unknown protein
SEQ ID NO. 2132LM-3758.2From 824936to 826396Unknown, similar to lysine-
specific permease
SEQ ID NO. 2133LM-3759.2From 824423to 824884Unknown
SEQ ID NO. 2134LM-376.1From 86323to 86691Unknown, similar to
transcription regulator
(merR family)
SEQ ID NO. 2135LM-3763.1From 1659457to 1660926Unknown, similar to
multidrug-efflux transporter
SEQ ID NO. 2136LM-3764.1From 1660923to 1661363Unknown, similar to
transcription regulator MarR
family
SEQ ID NO. 2137LM-3766.3From 1661588to 1662457D-Amino Acid
Aminotransferase
SEQ ID NO. 2138LM-3767.2From 2212384to 2212620unknown
SEQ ID NO. 2139LM-3768.2From 2213218to 2215041Unknown, similar to
unknown protein
SEQ ID NO. 2140LM-3769.2From 2306177to 2306716Unknown
SEQ ID NO. 2141LM-377.1From 85902to 86228Unknown
SEQ ID NO. 2142LM-3770.2From 2306833to 2307714Unknown, similar to post-
translocation molecular
chaperone
SEQ ID NO. 2143LM-3772.2From 2307755to 2308696Unknown, similar to S. aureus
Cbf1 protein
SEQ ID NO. 2144LM-3774.3From 991810to 992865unknown, similar to
UNDECAPRENYL-
PHOSPHATE N-
ACETYLGLUCOSAMINYL
TRANSFERASE
SEQ ID NO. 2145LM-3775.1From 990968to 991690unknown, similar to
transcription regulator
(GntR family)
SEQ ID NO. 2146LM-3776.2From 990248to 990952GLUCOSAMINE-6-
PHOSPHATE ISOMERASE
(EC 5.3.1.10)
(GLUCOSAMINE-6-
PHOSPHATE
DEAMINASE) (GNPDA)
(GLCN6P DEAMINASE).
SEQ ID NO. 2147LM-3778.2From 1861907to 1862251ribosomal protein L19
SEQ ID NO. 2148LM-3779.3From 1860200to 1861090internalin C
SEQ ID NO. 2149LM-378.1From 85025to 85627Unknown
SEQ ID NO. 2150LM-3783.2From 989099to 990232N-
ACETYLGLUCOSAMINE-6-
PHOSPHATE
DEACETYLASE (EC
3.5.1.25) (GLCNAC 6-P
DEACETYLASE).
SEQ ID NO. 2151LM-3785.2From 988065to 988910unknown
SEQ ID NO. 2152LM-379.1From 84451to 84849Unknown
SEQ ID NO. 2153LM-3790.3From 1420076to 1421362Unknown, similar to putative
protease
SEQ ID NO. 2154LM-3791.2From 1778415to 1779482unknown, similar to
hypothetical proteins
SEQ ID NO. 2155LM-3792.1From 1777763to 1778338unknown, similar to putative
transcription regulators
SEQ ID NO. 2156LM-3793.2From 1776910to 1777578unknown, similar to
hypothetical proteins
SEQ ID NO. 2157LM-3795.3From 1421454to 1422185Unknown, similar to 3-
ketoacyl-acyl carrier protein
reductase
SEQ ID NO. 2158LM-3797.2From 1422236to 1423165Unknown, similar to
unknown protein
SEQ ID NO. 2159LM-3798.1From 1423255to 1423833Unknown, similar to
phosphatidylglycerophosphate
synthase
SEQ ID NO. 2160LM-380.1From 82959to 84437Unknown
SEQ ID NO. 2161LM-3800.3From 1423902to 1425146Unknown, similar to
competence-damage
inducible protein CinA
SEQ ID NO. 2162LM-3801.3From 984823to 985722unknown
SEQ ID NO. 2163LM-3802.4From 985743to 986564unknown
SEQ ID NO. 2164LM-3806.3From 1578625to 1579425unknown, highly similar to
cell division inhibitor
(septum placement) protein
MinD
SEQ ID NO. 2165LM-3807.3From 1577212to 1578573Unknown, similar to
ribonuclease G
SEQ ID NO. 2166LM-3808.2From 2298927to 2299712Unknown
SEQ ID NO. 2167LM-3809.2From 2298093to 2298863Unknown, similar to
unknown protein
SEQ ID NO. 2168LM-3810.3From 333534to 335195Unknown, similar to
unknown protein
SEQ ID NO. 2169LM-3811.3From 335350to 336426Unknown
SEQ ID NO. 2170LM-3812.3From 1662570to 1663982Unknown, similar to Xaa-His
dipeptidase
SEQ ID NO. 2171LM-3818.1From 2211280to 2212245Unknown, similar to
transcription regulator, Lacl
family
SEQ ID NO. 2172LM-3819.1From 2210609to 2211244Unknown
SEQ ID NO. 2173LM-382.1From 81661to 82617Unknown, similar to
phosphoglycerate
dehydrogenase
SEQ ID NO. 2174LM-3821.1From 2159697to 2160053Unknown, similar to
unknown protein
SEQ ID NO. 2175LM-3822.1From 2149348to 2149632class I heat-shock protein
(chaperonin) GroES
SEQ ID NO. 2176LM-3823.3From 2943120to 2943479ribonuclease P protein
component
SEQ ID NO. 2177LM-3825.4From 2943871to 2944341hypothetical protein
SEQ ID NO. 2178LM-3826.1From 1240358to 1240681unknown, similar to
unknown protein
SEQ ID NO. 2179LM-383.1From 80892to 81530Unknown, conserved
hypothetical protein
SEQ ID NO. 2180LM-3845.2From 979762to 980064unknown, similar to B. subtilis
YneR protein
SEQ ID NO. 2181LM-3846.1From 979059to 979529non-heme iron-binding
ferritin
SEQ ID NO. 2182LM-3847.1From 940316to 940696unknown, conserved
hypothetical protein
SEQ ID NO. 2183LM-3848.1From 1357753to 1358697unknown, highly similar to
riboflavin kinase and FAD
synthase
SEQ ID NO. 2184LM-385.1From 79817to 80869Unknown, similar to E. coli
Ada protein (O6-
methylguanine-DNA
methyltransferase)
SEQ ID NO. 2185LM-3851.1From 1378840to 1379901Unknown, similar to
aminopeptidase P
SEQ ID NO. 2186LM-3853.2From 501693to 501959Unknown, weakly similar to
transposase
SEQ ID NO. 2187LM-3856.2From 1953836to 1954237unknown, similar to similar
to RNase HI
SEQ ID NO. 2188LM-3857.1From 1963240to 1963605unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2189LM-3858.2From 2030744to 2031196Unknown, similar to
transcriptional regulator (Fur
family)
SEQ ID NO. 2190LM-386.1From 79047to 79820Unknown, similar to
carboxyphosphonoenol-
pyruvate phosphonomutase
SEQ ID NO. 2191LM-3863.2From 2494891to 2495268Unknown, similar to glycine
cleavage system protein H
SEQ ID NO. 2192LM-3865.2From 200094to 200402Unknown, similar to B. subtilis
SpoVG protein
SEQ ID NO. 2193LM-3867.1From 1890288to 1890653Unknown, similar to
unknown protein
SEQ ID NO. 2194LM-3868.2From 1428945to 1429748Unknown, conserved
hypothetical protein
SEQ ID NO. 2195LM-3869.2From 1440748to 1441296Unknown
SEQ ID NO. 2196LM-387.1From 78362to 78811Unknown
SEQ ID NO. 2197LM-3871.1From 2100845to 2101366Unknown, similar to
unknown protein
SEQ ID NO. 2198LM-3872.1From 2100224to 2100751Unknown, similar to cell-
division initiation protein
(septum placement)
SEQ ID NO. 2199LM-3873.1From 2891649to 2892032Unknown, hypothetical
secreted protein
SEQ ID NO. 2200LM-3876.1From 520961to 521536Unknown
SEQ ID NO. 2201LM-3878.2From 640625to 641215Unknown
SEQ ID NO. 2202LM-3879.2From 640300to 640632Unknown, conserved
hypothetical protein
SEQ ID NO. 2203LM-388.1From 78090to 78374unknown
SEQ ID NO. 2204LM-3883.1From 1664057to 1664470Unknown, weakly similar to
E. coli MutT protein (dGTP
pyrophosphohydrolase)
SEQ ID NO. 2205LM-3886.1From 1702695to 1703315unknown, putative
cellsurface protein
SEQ ID NO. 2206LM-3887.1From 1038336to 1038557unknown
SEQ ID NO. 2207LM-3889.1From 2524168to 2524911Unknown, similar to
carboxylesterase
SEQ ID NO. 2208LM-389.1From 77149to 77406unknown
SEQ ID NO. 2209LM-3890.1From 2523835to 2524068Unknown
SEQ ID NO. 2210LM-3891.1From 1724196to 1724462unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2211LM-3893.1From 611240to 612634Unknown, similar to beta-
glucosidase
SEQ ID NO. 2212LM-3896.1From 1546608to 1547885Unknown, similar to
unknown protein
SEQ ID NO. 2213LM-3897.3From 1514257to 1515288Unknown, similar to
unknown protein
SEQ ID NO. 2214LM-3898.1From 1594535to 1595119Unknown, similar to
hypothetical GTP binding
protein
SEQ ID NO. 2215LM-39.1From 2736597to 2737628Unknown, similar to polyol
dehydrogenase
SEQ ID NO. 2216LM-390.3From 76262to 76984unknown
SEQ ID NO. 2217LM-3902.4From 1715236to 1716435Unknown, similar to S-
methionine
adenosyltransferase
SEQ ID NO. 2218LM-3904.3From 819389to 820039Unknown
SEQ ID NO. 2219LM-3905.2From 502899to 504629Unknown
SEQ ID NO. 2220LM-3907.1From 1234779to 1235330unknown
SEQ ID NO. 2221LM-3908.1From 1234465to 1234776unknown, similar to
unknown protein
SEQ ID NO. 2222LM-3910.2From 1409070to 1409996Unknown, similar to
unknown protein
SEQ ID NO. 2223LM-3912.1From 2709522to 2710421Unknown, conserved
lipoprotein
SEQ ID NO. 2224LM-3914.1From 2297304to 2297981Unknown, similar to
unknown protein
SEQ ID NO. 2225LM-3915.1From 2145127to 2145513Unknown, similar to large
conductance
mechanosensitive channel
protein
SEQ ID NO. 2226LM-392.3From 75832to 76125unknown
SEQ ID NO. 2227LM-3922.2From 987735to 988049Unknown
SEQ ID NO. 2228LM-3924.2From 986722to 987180Unknown
SEQ ID NO. 2229LM-3929.1From 833615to 834076unknown
SEQ ID NO. 2230LM-393.3From 75342to 75665Unknown
SEQ ID NO. 2231LM-3934.1From 356567to 356872Unknown, similar to
transposase
SEQ ID NO. 2232LM-3942.3From 2633930to 2634850Unknown, conserved
hypothetical protein
SEQ ID NO. 2233LM-3943.4From 1576107to 1576397ribosomal protein L27
SEQ ID NO. 2234LM-3944.4From 1576411to 1576728Unknown, similar to
unknown protein
SEQ ID NO. 2235LM-3947.1From 1599814to 1600278Unknown, similar to
unknown protein
SEQ ID NO. 2236LM-3949.2From 1638420to 1638908Unknown, similar to
unknown protein
SEQ ID NO. 2237LM-395.2From 74239to 75228Unknown, similar to
dinitrogenase reductase
ADP-ribosylation system
SEQ ID NO. 2238LM-3950.2From 1639120to 1639722ribosomal protein S4
SEQ ID NO. 2239LM-3951.1From 1640225to 1641004Unknown
SEQ ID NO. 2240LM-3953.1From 502424to 502660Hypothetical orf
SEQ ID NO. 2241LM-3954.2From 504674to 504997Unknown
SEQ ID NO. 2242LM-3955.1From 1307763to 1308170unknown, conserved
hypothetical protein, similar
to B. subtilis YneT protein
SEQ ID NO. 2243LM-3957.1From 2360908to 2361141Unknown
SEQ ID NO. 2244LM-3958.1From 2360435to 2360713Unknown, similar to
competence transcription
factor ComK, N terminal
part
SEQ ID NO. 2245LM-3959.2From 2359948to 2360184Unknown
SEQ ID NO. 2246LM-396.2From 1859272to 1859787translation initiation factor
IF-3
SEQ ID NO. 2247LM-3961.1From 2494522to 2494806Unknown, similar to
thioredoxin
SEQ ID NO. 2248LM-3966.4From 1739674to 1740849unknown, similar to
transmembrane transport
proteins
SEQ ID NO. 2249LM-397.3From 1858651to 1859010ribosomal protein L20
SEQ ID NO. 2250LM-3970.1From 1933037to 1933270unknown, similar to
hypothetical protein
SEQ ID NO. 2251LM-3972.2From 818789to 819265Unknown, similar to
transcription regulator
(EbsC from Enterococcus
faecalis)
SEQ ID NO. 2252LM-3973.2From 1136900to 1137400Unknown, similar to
lipoprotein signal peptidase
SEQ ID NO. 2253LM-3976.4From 267530to 267916Unknown, similar to
repressor (penicilinase
repressor)
SEQ ID NO. 2254LM-3978.3From 267010to 267372ribosomal protein L12
SEQ ID NO. 2255LM-3979.3From 266431to 266931ribosomal protein L10
SEQ ID NO. 2256LM-398.1From 1857856to 1858611Unknown, similar to 3-exodeoxyribonuclease
exoA
SEQ ID NO. 2257LM-3981.1From 2361897to 2362274protein gp30 [Bacteriophage
A118]
SEQ ID NO. 2258LM-3982.1From 2362306to 2362656protein gp29 [Bacteriophage
A118]
SEQ ID NO. 2259LM-3984.1From 270372to 270977Unknown, conserved
hypothetical protein
SEQ ID NO. 2260LM-399.1From 1857242to 1857772Unknown
SEQ ID NO. 2261LM-3990.1From 222983to 223261Unknown, highly similar to
B. subtilis YabO protein
SEQ ID NO. 2262LM-3993.1From 200522to 200830Unknown, similar to B. subtilis
SpoVG protein
SEQ ID NO. 2263LM-3995.1From 143016to 143258unknown
SEQ ID NO. 2264LM-3998.1From 823638to 824168Unknown, conserved
hypothetical protein
SEQ ID NO. 2265LM-3999.3From 810985to 811620Unknown, similar to acyl-
carrier protein
phosphodiesterase and to
NAD(P)H dehydrogenase
SEQ ID NO. 2266LM-4.1From 2713398to 2713940Unknown
SEQ ID NO. 2267LM-40.1From 2737630to 2738682Unknown, similar to sorbitol
dehydrogenase
SEQ ID NO. 2268LM-4003.2From 1513917to 1514171ribosomal protein S20
SEQ ID NO. 2269LM-401.1From 1855952to 1857184Unknown, similar to
aminotripeptidase
(peptidase T)
SEQ ID NO. 2270LM-4013.2From 1926482to 1926898unknown, similar to
transcriptional regulator
(MarR family)
SEQ ID NO. 2271LM-4015.1From 1771594to 1772040unknown, similar to putative
flavodoxin
SEQ ID NO. 2272LM-402.1From 1855571to 1855933Unknown
SEQ ID NO. 2273LM-4027.2From 263535to 263714unknown, highly similar to
preprotein translocase
subunit
SEQ ID NO. 2274LM-4029.2From 263844to 264377transcription antitermination
factor
SEQ ID NO. 2275LM-403.1From 1854748to 1855521Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 2276LM-4031.1From 264432to 264902Unknown
SEQ ID NO. 2277LM-4032.1From 265029to 265454ribosomal protein L11
SEQ ID NO. 2278LM-4033.1From 265494to 266183ribosomal protein L1
SEQ ID NO. 2279LM-404.1From 1854077to 1854682Unknown, similar to
unknown protein
SEQ ID NO. 2280LM-4040.1From 504972to 505277Unknown
SEQ ID NO. 2281LM-4041.1From 552563to 552931Unknown, similar to
unknown protein
SEQ ID NO. 2282LM-4053.1From 1270465to 1270725unknown
SEQ ID NO. 2283LM-4055.2From 1288261to 1288926unknown, weakly similar to
oligopeptide ABC
transporter AppA
SEQ ID NO. 2284LM-4058.1From 1405945to 1406226Unknown
SEQ ID NO. 2285LM-4059.1From 1406366to 1406713Unknown
SEQ ID NO. 2286LM-4065.1From 2045765to 2046046Unknown, similar to pentitol
PTS system enzyme II B
component
SEQ ID NO. 2287LM-4082.1From 2782739to 2782894Unknown
SEQ ID NO. 2288LM-4083.1From 2780577to 2780798Unknown
SEQ ID NO. 2289LM-4084.1From 2780392to 2780574Unknown
SEQ ID NO. 2290LM-4088.1From 2701254to 2701445ribosomal protein L29
SEQ ID NO. 2291LM-4089.1From 2700963to 2701226ribosomal protein S17
SEQ ID NO. 2292LM-4090.1From 2699383to 2699568ribosomal protein S14
SEQ ID NO. 2293LM-4091.1From 2697267to 2697446ribosomal protein L30
SEQ ID NO. 2294LM-4096.1From 367248to 367532Unknown
SEQ ID NO. 2295LM-4097.1From 370416to 370634Unknown
SEQ ID NO. 2296LM-4098.1From 389538to 389717Unknown, similar to
conserved hypothetical
protein
SEQ ID NO. 2297LM-4099.1From 401386to 401580unknown
SEQ ID NO. 2298LM-41.1From 2738719to 2739990Unknown, similar to PTS
system galactitol-specific
enzyme IIC component
SEQ ID NO. 2299LM-4106.1From 439009to 439212Unknown, similar to putative
transcription regulator
SEQ ID NO. 2300LM-4113.1From 521601to 521771ribosomal protein L32
SEQ ID NO. 2301LM-4115.1From 616742to 616951Unknown, similar to
unknown protein
SEQ ID NO. 2302LM-4118.1From 669086to 669361Unknown
SEQ ID NO. 2303LM-4119.1From 696606to 696815unknown
SEQ ID NO. 2304LM-4120.1From 707310to 707486unknown
SEQ ID NO. 2305LM-4121.1From 709601to 709810unknown
SEQ ID NO. 2306LM-4122.1From 731765to 732001Unknown, weakly similar to
flagellar switch protein
SEQ ID NO. 2307LM-4123.1From 756079to 756291unknown, LPXTG motif
protein
SEQ ID NO. 2308LM-4124.1From 756494to 756637Hypothetical CDS
SEQ ID NO. 2309LM-4130.1From 890069to 890215Unknown, hypothetical
protein
SEQ ID NO. 2310LM-4134.1From 961203to 961469Unknown, similar to ABC
transporter, ATP-binding
protein (C-terminal part)
SEQ ID NO. 2311LM-4135.1From 973509to 973670unknown
SEQ ID NO. 2312LM-4137.1From 981317to 981544unknown
SEQ ID NO. 2313LM-4139.1From 987274to 987501Unknown
SEQ ID NO. 2314LM-4142.1From 1003593to 1003829D-alanyl carrier protein
SEQ ID NO. 2315LM-4144.1From 1057067to 1057276unknown, similar to B. subtilis
YkzG protein
SEQ ID NO. 2316LM-4147.1From 1133466to 1133696unknown, highly similar to
TN916 ORF8
SEQ ID NO. 2317LM-4148.1From 1145723to 1145944unknown, highly similar to
TN916 ORF19
SEQ ID NO. 2318LM-4149.2From 1154864to 1156381unknown
SEQ ID NO. 2319LM-415.1From 1848166to 1848690Unknown, similar to
unknown protein
SEQ ID NO. 2320LM-4150.1From 1188014to 1188289Unknown, similar to
carboxysome structural
protein
SEQ ID NO. 2321LM-4152.2From 1287376to 1287591unknown, similar to
transcriptional regulator
SEQ ID NO. 2322LM-4153.1From 1331439to 1331666unknown, similar to B. subtilis
YnzC protein
SEQ ID NO. 2323LM-4154.1From 1363826to 1363975ribosomal protein L33
SEQ ID NO. 2324LM-4155.1From 1366220to 1366432unknown similar to B. subtilis
yqgQ
SEQ ID NO. 2325LM-4157.1From 1385704to 1385931Unknown, similar to
exodeoxyribonuclease small
subunit
SEQ ID NO. 2326LM-4158.1From 1387014to 1387214similar to cold shock protein
SEQ ID NO. 2327LM-4161.1From 1501881to 150205430S ribosomal protein S21
SEQ ID NO. 2328LM-4162.2From 1576747to 1577055ribosomal protein L21
SEQ ID NO. 2329LM-4166.1From 1654691to 1654909Unknown, hypothetical gene
SEQ ID NO. 2330LM-417.1From 1847472to 1847960hosphoribosylaminoimidazole
carboxylase I
SEQ ID NO. 2331LM-4172.1From 1756629to 1756862unknown
SEQ ID NO. 2332LM-4174.1From 1764871to 1765077unknown
SEQ ID NO. 2333LM-4175.1From 1769479to 1769700unknown
SEQ ID NO. 2334LM-4179.1From 1859050to 1859250ribosomal protein L35
SEQ ID NO. 2335LM-418.1From 1846355to 1847479Phosphoribosylaminoimidazole
carboxylase II
SEQ ID NO. 2336LM-4180.1From 1867180to 1867410Unknown, similar to
unknown protein
SEQ ID NO. 2337LM-4181.1From 1881041to 1881274Unknown, highly similar to
acyl carrier proteins
SEQ ID NO. 2338LM-4182.1From 1890951to 1891139ribosomal protein L28
SEQ ID NO. 2339LM-4183.1From 1902281to 1902484unknown
SEQ ID NO. 2340LM-4184.1From 1928579to 1928785unknown, similar to putative
mercuric ion binding
proteins
SEQ ID NO. 2341LM-4186.1From 1953513to 1953713similar to cold shock protein
SEQ ID NO. 2342LM-4189.1From 2094877to 2095077similar to major cold-shock
protein
SEQ ID NO. 2343LM-419.1From 1845044to 1846336adenylosuccinate lyase
SEQ ID NO. 2344LM-4192.1From 2144874to 2145065unknown
SEQ ID NO. 2345LM-4193.1From 2150556to 2150759unknown
SEQ ID NO. 2346LM-4195.1From 2184120to 2184347Unknown
SEQ ID NO. 2347LM-4197.1From 2192379to 2192609Unknown
SEQ ID NO. 2348LM-420.1From 1844250to 1844963Phosphoribosylaminoimidazole
succinocarboxamide
synthetase
SEQ ID NO. 2349LM-4200.1From 2234357to 2234533unknown
SEQ ID NO. 2350LM-4201.1From 2242097to 2242282unknown, similar to B. subtilis
YwmG protein
SEQ ID NO. 2351LM-4203.1From 2293497to 2293685Unknown, similar to
unknown protein
SEQ ID NO. 2352LM-4206.1From 2346593to 2346784unknown,
SEQ ID NO. 2353LM-4207.1From 2361582to 2361791unknown
SEQ ID NO. 2354LM-4208.1From 2366288to 2366446protein gp22 [Bacteriophage
A118]
SEQ ID NO. 2355LM-4209.1From 2387499to 2387663Bacteriophage A118 gp65
protein
SEQ ID NO. 2356LM-421.1From 1843993 to1844238Unknown, similar to
unknown protein
SEQ ID NO. 2357LM-4210.1From 2388528to 2388710Hypothetical protein
SEQ ID NO. 2358LM-4211.1From 2389208to 2389387Unknown
SEQ ID NO. 2359LM-4212.1From 2389491to 2389658unknown
SEQ ID NO. 2360LM-4213.1From 2391020to 2391217Unknown
SEQ ID NO. 2361LM-4214.1From 2394790to 2394984Unknown
SEQ ID NO. 2362LM-4215.1From 2395586to 2395801gp44 [Bacteriophage A118]
SEQ ID NO. 2363LM-4216.1From 2398287to 2398529Unknown, similar to
transcription regulator
SEQ ID NO. 2364LM-422.1From 1843306to 1843989Unknown, similar to
phosphoribosylformylglycinamidine
synthetase II
SEQ ID NO. 2365LM-4225.1From 2467388to 2467621Unknown, similar to B. subtilis
YuzB protein
SEQ ID NO. 2366LM-4226.1From 2468545to 2468703Unknown
SEQ ID NO. 2367LM-4227.1From 2480325to 2480528Unknown, similar to
repressor protein
SEQ ID NO. 2368LM-4228.1From 2501306to 2501470Unknown
SEQ ID NO. 2369LM-423.1From 1841094to 1843313Phosphoribosylformylglycinamidine
synthetase I
SEQ ID NO. 2370LM-4230.1From 2559617to 2559817Unknown, similar to B. subtilis
yvlC protein
SEQ ID NO. 2371LM-4231.1From 2567696to 2567935Unknown, similar to B. subtilis
CsbA protein
SEQ ID NO. 2372LM-4233.1From 2611192to 2611410unknown, highly similar to
H+-transporting ATP
synthase chain c
SEQ ID NO. 2373LM-4234.1From 2624633to 2624878ribosomal protein L31
SEQ ID NO. 2374LM-4235.1From 2643753to 2643938Unknown, similar to 4-
oxalocrotonate isomerase
SEQ ID NO. 2375LM-4236.1From 2646464to 2646664unknown
SEQ ID NO. 2376LM-4246.1From 145354to 145560Unknown, hypothetical
protein
SEQ ID NO. 2377LM-4247.1From 145085to 145171Unknown, hypothetical
protein
SEQ ID NO. 2378LM-4248.1From 136714to 136992Unknown, similar to E. coli
YjdJ protein
SEQ ID NO. 2379LM-4249.1From 136469to 136702Unknown, similar to E. coli
YjdI protein
SEQ ID NO. 2380LM-425.1From 1839682to 1841109glutamine
phosphoribosylpyrophosphate
amidotransferase
SEQ ID NO. 2381LM-4251.1From 77863to 78066Unknown, Hypothetical
SEQ ID NO. 2382LM-4253.1From 52373to 52534Unknown
SEQ ID NO. 2383LM-4254.1From 50514to 5075330S ribosomal protein S18
SEQ ID NO. 2384LM-426.1From 1838614to 1839663Phosphoribosylaminoimidazole
synthetase
SEQ ID NO. 2385LM-4262.2From 267909to 268949Unknown, similar to
penicillinase antirepressor
SEQ ID NO. 2386LM-4267.1From 1153010to 1153783unknown, similar to
regulatory proteins
SEQ ID NO. 2387LM-4268.1From 1153848to 1154204unknown
SEQ ID NO. 2388LM-427.1From 1838051to 1838617unknown, highly similar to
phosphoribosylglycinamide
formyltransferases
SEQ ID NO. 2389LM-4276.2From 2943569to 2943703ribosomal protein L34
SEQ ID NO. 2390LM-4277.1From 2693947to 2694060ribosomal protein L36
SEQ ID NO. 2391LM-428.1From 1836516to 1838045Bifunctional
phosphoribosylaminoimidazole
carboxyformyl
formyltransferase and
inosine-monophosphate
cyclohydrolase
SEQ ID NO. 2392LM-429.1From 1835229to 1836491phosphoribosylglycinamide
synthetase
SEQ ID NO. 2393LM-4293.1From 2130228to 2130401ribosomal protein L32
SEQ ID NO. 2394LM-4295.1From 2052696to 2052833Unknown
SEQ ID NO. 2395LM-43.1From 2740052to 2740333Unknown, similar to PTS
system galactitol-specific
enzyme IIB component
SEQ ID NO. 2396LM-430.1From 1834795to 1835091Unknown, similar to
unknown protein
SEQ ID NO. 2397LM-432.1From 1834523to 1834777Unknown
SEQ ID NO. 2398LM-433.1From 1833125to 1834480Unknown, similar to putative
sodium-dependent
transporter
SEQ ID NO. 2399LM-4342.1From 2317644to 2317841Unknown, similar to
unknown protein
SEQ ID NO. 2400LM-4349.1From 1287710to 1288243Unknown
SEQ ID NO. 2401LM-435.1From 1832242to 1832919Unknown, similar to
unknown protein
SEQ ID NO. 2402LM-4352.1From 436161to 436352Unknown
SEQ ID NO. 2403LM-4353.1From 2250846to 2251004Unknown
SEQ ID NO. 2404LM-4358.1From 2491285to 2491473unknown
SEQ ID NO. 2405LM-4359.1From 2173635to 2173832
SEQ ID NO. 2406LM-436.2From 1829982to 1832177ATP-dependent DNA
helicase
SEQ ID NO. 2407LM-4360.1From 263366to 263515
SEQ ID NO. 2408LM-438.1From 1827941to 1829956Unknown, similar to DNA
ligase
SEQ ID NO. 2409LM-439.1From 1826829to 1827944Unknown, similar to
unknown protein
SEQ ID NO. 2410LM-44.1From 2740390to 2740854Unknown, similar to PTS
system galactitol-specific
enzyme IIA component
SEQ ID NO. 2411LM-440.1From 1826411to 1826704glutamyl-tRNA(Gln)
amidotransferase (subunit
C)
SEQ ID NO. 2412LM-441.1From 1824936to 1826387glutamyl-tRNA(Gln)
amidotransferase (subunit
A)
SEQ ID NO. 2413LM-442.1From 1823494to 1824924glutamyl-tRNA(Gln)
amidotransferase (subunit
B)
SEQ ID NO. 2414LM-443.2From 1822421to 1823353Unknown, similar to
unknown protein
SEQ ID NO. 2415LM-444.2From 1821518to 1822273Unknown
SEQ ID NO. 2416LM-445.1From 1820133to 1821494Unknown, similar to
hypothetical RNA
methyltransferase
SEQ ID NO. 2417LM-446.1From 1819244to 1819798Unknown, similar to
unknown protein
SEQ ID NO. 2418LM-447.2From 1818740to 1819213Unknown, similar to
shikimate kinase
SEQ ID NO. 2419LM-45.2From 2740888to 2742957unknown, similar to
transcriptional
antiterminator (BglG family)
SEQ ID NO. 2420LM-451.1From 2435023to 2436141Unknown, similar to S. pyogenes
RofA regulatory
protein
SEQ ID NO. 2421LM-452.1From 2434627to 2434866Hypothetical protein
SEQ ID NO. 2422LM-453.1From 2433224to 2434618Unknown, similar to
glutamate decarboxylase
SEQ ID NO. 2423LM-455.1From 2431688to 2433211Unknown, similar to amino
acid antiporter (acid
resistance)
SEQ ID NO. 2424LM-456.1From 2430826to 2431296Unknown, conserved
hypothetical protein
SEQ ID NO. 2425LM-458.1From 2428010to 2430793Unknown, transmembrane
protein
SEQ ID NO. 2426LM-460.1From 2427045to 2427890Unknown, conserved
hypothetical protein
SEQ ID NO. 2427LM-463.1From 2426270to 2427001Unknown, similar to N-
acetylglucosamine-6-
phosphate isomerase
SEQ ID NO. 2428LM-464.1From 2425723to 2426253Unknown, similar to
unknown protein
SEQ ID NO. 2429LM-466.1From 2424922to 2425533Unknown
SEQ ID NO. 2430LM-468.1From 2423480to 2424721Unknown; Similar to
multidrug resistance protein
SEQ ID NO. 2431LM-469.1From 2421878to 2423476Unknown, conserved
hypothetical protein
SEQ ID NO. 2432LM-47.2From 2743326to 2744138Unknown
SEQ ID NO. 2433LM-471.1From 2419797to 2421749Unknown, similar to putative
Na+/H+ antiporter
SEQ ID NO. 2434LM-472.1From 2418871to 2419770Unknown, similar to LysR
family transcription regulator
SEQ ID NO. 2435LM-473.1From 2418083to 2418628Unknown, similar to NADH-
dependent FMN reductase
SEQ ID NO. 2436LM-475.1From 2417531to 2418067Unknown, similar to B. subtilis
YtmI protein
SEQ ID NO. 2437LM-476.1From 2416705to 2417514Unknown, similar to amino
acid ABC transporter
(binding protein)
SEQ ID NO. 2438LM-477.1From 2415967to 2416683Unknown, similar to amino
acid ABC-transporter
(permease)
SEQ ID NO. 2439LM-478.1From 2415246to 2415953Unknown, similar to amino
acid ABC transporter
(permease)
SEQ ID NO. 2440LM-48.1From 2744212to 2744574Unknown, conserved
hypothetical protein
SEQ ID NO. 2441LM-480.1From 2414470to 2415249Unknown, similar to amino
acid ABC-transporter, ATP-
binding protein
SEQ ID NO. 2442LM-481.1From 2413478to 2414473Unknown, conserved
hypothetical protein
SEQ ID NO. 2443LM-482.1From 2413219to 2413485Unknown, similar to B. subtilis
YtnI protein
SEQ ID NO. 2444LM-483.1From 2411900to 2413222Unknown, similar to
nitrilotriacetate
monooxygenase
SEQ ID NO. 2445LM-484.1From 2411126to 2411827Unknown, similar to 16S
pseudouridylate synthase
SEQ ID NO. 2446LM-485.1From 2409875to 2410993similar to kinases
SEQ ID NO. 2447LM-486.1From 2408967to 2409878Unknown, similar to Erwinia
chrysanthemi IndA protein
SEQ ID NO. 2448LM-487.1From 2408450to 2408893Unknown, conserved
hypothetical protein
SEQ ID NO. 2449LM-488.1From 2407128to 2408453aminopeptidase C
SEQ ID NO. 2450LM-49.1From 2744571to 2744939Unknown
SEQ ID NO. 2451LM-490.1From 2406161to 2406913Unknown, similar to
regulatory protein DeoR
family
SEQ ID NO. 2452LM-491.1From 2405241to 2406164fructose-1-phosphate
kinase
SEQ ID NO. 2453LM-492.1From 2403341to 2405239unknown, highly similar to
phosphotransferase system
(PTS) fructose-specific
enzyme IIABC component
SEQ ID NO. 2454LM-494.1From 2402888to 2403235Unknown, similar to
transcriptional regulator
SEQ ID NO. 2455LM-495.1From 2402357to 2402833competence transcription
factor
SEQ ID NO. 2456LM-496.1From 2401008to 2402366putative integrase
[Bacteriophage A118]
SEQ ID NO. 2457LM-497.1From 2400264to 2400944Unknown, weakly similar to
gp32_Bacteriophage A118
protein
SEQ ID NO. 2458LM-498.1From 2399542to 2400243Unknown, similar to protein
gp33 [Bacteriophage A118]
SEQ ID NO. 2459LM-5.1From 2713962to 2714942Unknown, similar to
heptaprenyl diphosphate
synthase component II
SEQ ID NO. 2460LM-500.1From 2398685to 2399161Unknown, similar to a
putative repressor protein
[Bacteriophage A118]
SEQ ID NO. 2461LM-502.1From 2397800to 2398084Unknown
SEQ ID NO. 2462LM-503.1From 2397493to 2397774Unknown, similar to protein
gp41 [Bacteriophage A118]
SEQ ID NO. 2463LM-506.1From 2396454to 2397230Unknown, similar to antirepressor
[Bacteriophage
A118]
SEQ ID NO. 2464LM-507.1From 2395798to 2396331gp43 [Bacteriophage A118]
SEQ ID NO. 2465LM-509.1From 2394317to 2394793Unknown, similar to
bacteriophage proteins
SEQ ID NO. 2466LM-51.1From 2744985to 2745791Unknown, weakly similar to
AraC-like transcription
regulator
SEQ ID NO. 2467LM-510.1From 2393613to 2394311Unknown
SEQ ID NO. 2468LM-512.1From 2392622to 2393596Unknown, similar to protein
gp49 [Bacteriophage A118]
SEQ ID NO. 2469LM-513.1From 2391813to 2392625Unknown, similar to site-
specific DNA-
methyltransferase
SEQ ID NO. 2470LM-514.1From 2391220to 2391816Unknown, similar to protein
gp51 [Bacteriophage A118]
SEQ ID NO. 2471LM-517.1From 2390580to 2391023Unknown, similar to a
bacteriophage protein
SEQ ID NO. 2472LM-518.1From 2390113to 2390583Unknown
SEQ ID NO. 2473LM-52.1From 2745907to 2746377Unknown, conserved
hypothetical protein
SEQ ID NO. 2474LM-520.1From 2389655to 2390116Unknown
SEQ ID NO. 2475LM-522.1From 2388729to 2389211unknown, similar to single-
stranded DNA-binding
protein
SEQ ID NO. 2476LM-523.1From 2388179to 2388583Unknown
SEQ ID NO. 2477LM-524.1From 2387792to 2388175unknown
SEQ ID NO. 2478LM-525.1From 2387046to 2387480Protein gp66
[Bacteriophage A118]
SEQ ID NO. 2479LM-527.1From 2386141to 2386680Unknown
SEQ ID NO. 2480LM-529.1From 2385301to 2386095Unknown, similar to putative
terminase small subunit
from Bacteriophage A118
SEQ ID NO. 2481LM-53.1From 2746423to 2746878Unknown, similar to ribose
5-phosphate epimerase
SEQ ID NO. 2482LM-530.1From 2384001to 2385332putative terminase large
subunit from Bacteriophage
A118
SEQ ID NO. 2483LM-531.1From 2382228to 2383988putative portal protein
[Bacteriophage A118]
SEQ ID NO. 2484LM-532.1From 2381088to 2382227Protein gp4 [Bacteriophage
A118]
SEQ ID NO. 2485LM-533.1From 2380419to 2381009Unknown, putative
scaffolding protein
[Bacteriophage A118]
SEQ ID NO. 2486LM-535.1From 2379418to 2380419Unknown, similar to coat
protein [Bacteriophage
SPP1]
SEQ ID NO. 2487LM-536.1From 2379004to 2379399Protein gp8 [Bacteriophage
A118]
SEQ ID NO. 2488LM-537.1From 2378642to 2379004Protein gp9 [Bacteriophage
A118]
SEQ ID NO. 2489LM-538.1From 2378304to 2378642Protein gp10
[Bacteriophage A118]
SEQ ID NO. 2490LM-539.1From 2377897to 2378304Portein gp11
[Bacteriophage A118]
SEQ ID NO. 2491LM-54.1From 2747073to 2747414Unknown
SEQ ID NO. 2492LM-540.1From 2377460to 2377894major tail shaft protein
[Bacteriophage A118]
SEQ ID NO. 2493LM-541.1From 2377198to 2377530Protein gp13
[Bacteriophage A118]
SEQ ID NO. 2494LM-542.1From 2376721to 2377143Protein gp14
[Bacteriophage A118]
SEQ ID NO. 2495LM-543.1From 2376110to 2376715Protein gp15
[Bacteriophage A118]
SEQ ID NO. 2496LM-548.1From 2370736to 2376099Unknown, putative tape-
measure [Bacteriophage
A118]
SEQ ID NO. 2497LM-549.1From 2369916to 2370734Protein gp17
[Bacteriophage A118]
SEQ ID NO. 2498LM-55.1From 2747427to 2748683Unknown, similar to UV-
damage repair protein
SEQ ID NO. 2499LM-550.1From 2368882to 2369907Protein gp18
[Bacteriophage A118]
SEQ ID NO. 2500LM-552.1From 2367853to 2368881Protein gp19
[Bacteriophage A118]
SEQ ID NO. 2501LM-553.1From 2366780to 2367853protein gp20 [Bacteriophage
A118]
SEQ ID NO. 2502LM-554.1From 2366451to 2366768protein gp21 [Bacteriophage
A118]
SEQ ID NO. 2503LM-555.1From 2365893to 2366258protein gp23 [Bacteriophage
A118]
SEQ ID NO. 2504LM-557.1From 2365599to 2365880holin [Bacteriophage A118]
SEQ ID NO. 2505LM-558.1From 2364754to 2365599L-alanoyl-D-glutamate
peptidase
SEQ ID NO. 2506LM-559.1From 2363709to 2364260Unknown
SEQ ID NO. 2507LM-56.1From 2748685to 2749497Unknown, similar to
hydrolase (esterase)
SEQ ID NO. 2508LM-560.1From 2363135to 2363632Unknown, similar to an
unknown bacteriophage
protein
SEQ ID NO. 2509LM-562.3From 2362661to 2363110Protein gp28
[Bacteriophage A118]
SEQ ID NO. 2510LM-563.3From 318to 1673Chromosomal replication
initiation protein DnaA
SEQ ID NO. 2511LM-564.1From 1867to 3012DNA polymerase III, beta
chain
SEQ ID NO. 2512LM-565.1From 3121to 4464Unknown, conserved
hypothetical protein
SEQ ID NO. 2513LM-566.1From 4644to 4865Unknown, similar to B. subtilis
YaaA protein
SEQ ID NO. 2514LM-567.1From 4869to 5981RecF protein
SEQ ID NO. 2515LM-57.1From 2749538to 2750233Unknown, similar to two
components response
regulator
SEQ ID NO. 2516LM-570.1From 6030to 7970DNA gyrase subunit B
SEQ ID NO. 2517LM-573.1From 8065to 10593DNA gyrase subunit A
SEQ ID NO. 2518LM-575.1From 10728to 12242Unknown, similar to
cardiolipin synthase
SEQ ID NO. 2519LM-576.1From 12258to 12776diamine N-acetyltransferase
SEQ ID NO. 2520LM-578.1From 12918to 13886Unknown, similar to
mevalonate kinase
SEQ ID NO. 2521LM-580.1From 13843to 14814unknown, similar to
mevalonate diphosphate
decarboxylase
SEQ ID NO. 2522LM-581.1From 14795to 15874Unknown, similar to
mevalonate kinases
SEQ ID NO. 2523LM-582.1From 16219to 17325AA3-600 quinol oxidase
subunit II
SEQ ID NO. 2524LM-584.1From 17344to 19323AA3-600 quinol oxidase
subunit I
SEQ ID NO. 2525LM-585.1From 19311to 19922AA3-600 quinol oxidase
subunit III
SEQ ID NO. 2526LM-586.1From 19924to 20256unknown, highly similar to
quinol oxidase aa3-600
chain IV
SEQ ID NO. 2527LM-587.1From 20308to 21426Unknown, similar to Bacillus
anthracis CapA protein
(polyglutamate capsule
biosynthesis)
SEQ ID NO. 2528LM-589.1From 21652to 23085beta-glucosidase
SEQ ID NO. 2529LM-591.1From 23132to 23953Unknown
SEQ ID NO. 2530LM-592.1From 24194to 24934Unknown, similar to
transcriptional regulator
(GntR family)
SEQ ID NO. 2531LM-593.1From 24950to 25351Unknown, similar to PTS
system, fructose-specific IIA
component
SEQ ID NO. 2532LM-594.1From 25351to 25839Unknown, similar to PTS
system, fructose-specific IIB
component
SEQ ID NO. 2533LM-596.1From 25862to 26665Unknown, similar to PTS
system, fructose-specific IIC
component
SEQ ID NO. 2534LM-597.1From 26640to 27467Unknown, similar to PTS
system, mannose-specific
IID component
SEQ ID NO. 2535LM-598.1From 27495to 28199Unknown, similar to
phosphoheptose isomerase
SEQ ID NO. 2536LM-599.1From 28254to 28895Unknown, similar to E. coli
copper homeostasis protein
CutC
SEQ ID NO. 2537LM-6.1From 2715023to 2716363Unknown
SEQ ID NO. 2538LM-600.1From 29100to 31004Unknown, similar to PTS
system, beta-glucosides
specific IIABC component
SEQ ID NO. 2539LM-601.1From 31088to 31963Unknown, similar to E. coli
microcin C7 self-immunity
protein (MccF)
SEQ ID NO. 2540LM-602.1From 32197to 32535Unknown
SEQ ID NO. 2541LM-604.1From 32571to 33380Unknown, conserved
hypothetical protein
SEQ ID NO. 2542LM-605.1From 33397to 34452Unknown, transcriptional
regulator Lacl family
SEQ ID NO. 2543LM-606.1From 34654to 35619Unknown, similar to xylose
repressor
SEQ ID NO. 2544LM-608.1From 35616to 38018Unknown, similar to
endoglucanase
SEQ ID NO. 2545LM-609.1From 38031to 39383Unknown, similar to PTS
system, cellobiose-specific
IIC component
SEQ ID NO. 2546LM-610.1From 39385to 40470Unknown, similar to
Glucosamine-fructose-6-
phosphate
aminotransferase (C-
terminal domain)
SEQ ID NO. 2547LM-611.1From 40705to 41730Unknown, similar to
ornithine
carbamoyltransferase
SEQ ID NO. 2548LM-613.1From 41803to 43188Unknown, similar to amino
acid transporter
SEQ ID NO. 2549LM-614.1From 43175to 44266Unknown, conserved
hypothetical protein
SEQ ID NO. 2550LM-615.1From 44282to 45223carbamate kinase
SEQ ID NO. 2551LM-616.1From 45325to 46434Unknown, conserved
hypothetical protein
SEQ ID NO. 2552LM-617.1From 46451to 47230Unknown, conserved
hypothetical protein,
hypothetical regulator
SEQ ID NO. 2553LM-618.1From 47335to 47994Unknown, similar to E. coli
DedA protein
SEQ ID NO. 2554LM-619.2From 48074to 49306Unknown, similar to arginine
deiminase
SEQ ID NO. 2555LM-62.1From 2750230to 2752920Unknown, similar to the two
components sensor protein
kdpD
SEQ ID NO. 2556LM-620.2From 703688to 703951Unknown
SEQ ID NO. 2557LM-621.3From 703128to 703691Unknown, similar to acetyl
transferase
SEQ ID NO. 2558LM-622.1From 703961to 704338Unknown, similar to
unknown protein
SEQ ID NO. 2559LM-623.1From 704452to 705387Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 2560LM-624.1From 705380to 706150Unknown, conserved
membrane protein
SEQ ID NO. 2561LM-625.1From 706408to 707292Unknown, similar to
oxidoreductase
SEQ ID NO. 2562LM-626.1From 707612to 708220Unknown
SEQ ID NO. 2563LM-627.1From 709134to 709562Unknown, similar to
unknown protein
SEQ ID NO. 2564LM-628.1From 709829to 710749Unknown
SEQ ID NO. 2565LM-629.1From 711121to 711435Unknown
SEQ ID NO. 2566LM-631.1From 711428to 712195Unknown, similar to flagellar
biosynthesis protein FliP
SEQ ID NO. 2567LM-632.1From 712208to 712480Unknown, similar to flagellar
biosynthesis protein FliQ
SEQ ID NO. 2568LM-633.1From 712483to 713244Unknown, similar to flagellar
biosynthetic protein FliR
SEQ ID NO. 2569LM-635.1From 713260to 714306Unknown, similar to flagellar
biosynthetic protein flhB
SEQ ID NO. 2570LM-636.1From 714353to 716428Unknown, similar to flagellar-
associated protein flhA
SEQ ID NO. 2571LM-637.1From 716450to 717673Unknown, similar to
flagellar biosynthesis
protein FlhF
SEQ ID NO. 2572LM-639.1From 717670to 718449Unknown, similar to flagellar
hook-basal body protein
FlgG
SEQ ID NO. 2573LM-641.1From 718478to 719266Unknown, similar to
chemotactic
methyltransferase CheR
SEQ ID NO. 2574LM-642.1From 719291to 719626Unknown
SEQ ID NO. 2575LM-643.1From 719653to 720504Unknown, similar to motility
protein (flagellar motor
rotation) MotA
SEQ ID NO. 2576LM-645.1From 720464to 721291Unknown, similar to motility
protein (flagellar motor
rotation) MotB
SEQ ID NO. 2577LM-647.1From 721301to 721801Unknown
SEQ ID NO. 2578LM-648.1From 721824to 723737Unknown, similar to
unknown protein
SEQ ID NO. 2579LM-649.1From 723750to 724658Unknown, similar to CheA
activity-modulating
chemotaxis protein CheV
SEQ ID NO. 2580LM-65.1From 2752936to 2753508potassium-transporting
ATPase c chain
SEQ ID NO. 2581LM-650.3From 724896to 725759flagellin protein
SEQ ID NO. 2582LM-651.1From 726034to 726393Chemotaxis response
regulator CheY
SEQ ID NO. 2583LM-652.1From 726413to 728269two-component sensor
histidine kinase CheA
SEQ ID NO. 2584LM-653.1From 728282to 728581Unknown, similar to flagellar
motor switch protein fliY C-
terminal part
SEQ ID NO. 2585LM-654.1From 728600to 729010Unknown
SEQ ID NO. 2586LM-657.1From 729026to 730072Unknown
SEQ ID NO. 2587LM-658.1From 730074to 730496unknown, similar to flagellar
hook assembly protein
SEQ ID NO. 2588LM-659.1From 730516to 731751Unknown, similar to flagellar
hook protein FlgE
SEQ ID NO. 2589LM-660.1From 732022to 733014Unknown, similar to flagellar
switch protein FliM
SEQ ID NO. 2590LM-664.1From 733017to 734564Unknown, similar to flagellar
motor switch protein fliY
SEQ ID NO. 2591LM-665.1From 734570to 735973Unknown
SEQ ID NO. 2592LM-668.1From 735996to 737162Unknown
SEQ ID NO. 2593LM-669.1From 737269to 737733Unknown
SEQ ID NO. 2594LM-670.1From 737743to 738171Unknown
SEQ ID NO. 2595LM-671.1From 738191to 739711Unknown, similar to flagellar
hook-associated protein
FlgK
SEQ ID NO. 2596LM-672.1From 739723to 740598Unknown, similar to flagellar
hook-associated protein 3
FlgL
SEQ ID NO. 2597LM-674.1From 740610to 741899Unknown, similar to flagellar
hook-associated protein 2
FliD
SEQ ID NO. 2598LM-675.1From 741918to 742304Unknown, similar to
hypothetical flagellar protein
SEQ ID NO. 2599LM-676.1From 742276to 742557Unknown
SEQ ID NO. 2600LM-677.1From 742578to 742979Unknown, similar to flagellar
basal-body rod protein flgB
SEQ ID NO. 2601LM-679.1From 742991to 743401Unknown, similar to flagellar
basal-body rod protein flgC
SEQ ID NO. 2602LM-680.1From 743418to 743714Unknown, similar to flagellar
hook-basal body complex
protein FliE
SEQ ID NO. 2603LM-682.1From 743782to 745434Unknown, similar to flagellar
basal-body M-ring protein
fliF
SEQ ID NO. 2604LM-685.1From 745437to 746543Unknown, similar to flagellar
motor switch protein fliG
SEQ ID NO. 2605LM-686.1From 746530to 747222Unknown
SEQ ID NO. 2606LM-688.1From 747219to 748520Unknown, similar to H+-
transporting ATP synthase
alpha chain FliI, flagellar-
specific,
SEQ ID NO. 2607LM-689.1From 748537to 749205Unknown, similar to
transglycosylase
SEQ ID NO. 2608LM-69.1From 2753523to 2755568potassium-transporting
atpase b chain
SEQ ID NO. 2609LM-690.1From 749219to 749863Unknown
SEQ ID NO. 2610LM-691.1From 749985to 750311Unknown, similar to
unknown protein
SEQ ID NO. 2611LM-692.1From 750311to 750634Unknown
SEQ ID NO. 2612LM-693.1From 750680to 751327putative fibronectin-binding
protein
SEQ ID NO. 2613LM-695.1From 751598to 753328Unknown, similar to
pyruvate oxidase
SEQ ID NO. 2614LM-697.1From 753490to 755295Unknown, similar to methyl-
accepting chemotaxis
protein
SEQ ID NO. 2615LM-698.1From 755308to 756036Unknown, similar to B. subtilis
YvpB protein
SEQ ID NO. 2616LM-699.2From 2842413to 2843867Unknown, similar to beta-
glucosidase
SEQ ID NO. 2617LM-7.1From 2716408to 2717394Unknown
SEQ ID NO. 2618LM-700.1From 2843914to 2844216Unknown, similar to PTS
cellobiose-specific enzyme
IIB
SEQ ID NO. 2619LM-702.1From 2844249to 2845601Unknown, similar to PTS
cellobiose-specific enzyme
IIC component
SEQ ID NO. 2620LM-703.1From 2845613to 2846497Unknown, similar to xylose
operon regulatory protein
and to glucose kinase
SEQ ID NO. 2621LM-704.1From 2846490to 2846798Unknown, similar to PTS
cellobiose-specific enzyme
IIA
SEQ ID NO. 2622LM-705.1From 2846839to 2847573Unknown, similar to
hypothetical transcriptional
regulator
SEQ ID NO. 2623LM-706.1From 2847673to 2848335Unknown
SEQ ID NO. 2624LM-707.2From 2848406to 2849536Unknown
SEQ ID NO. 2625LM-708.3From 2849563to 2850450Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 2626LM-709.2From 2850620to 2852950Unknown, similar to
gamma-glutamylcysteine
synthetase (for the
N_terminal part) and to
cyanophycin synthetase (C-
terminal part)
SEQ ID NO. 2627LM-710.1From 2852988to 2854436Unknown, similar to beta-
glucosidase
SEQ ID NO. 2628LM-711.2From 2854429to 2856282Unknown, similar to beta-
glucoside-specific enzyme
IIABC
SEQ ID NO. 2629LM-712.2From 2856379to 2857218Unknown, similar to
transcription antiterminator
SEQ ID NO. 2630LM-714.1From 2857580to 2858203Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 2631LM-715.1From 2858196to 2860364Unknown
SEQ ID NO. 2632LM-716.1From 2860426to 2860821Unknown
SEQ ID NO. 2633LM-717.1From 2861424to 2862614Unknown, similar to efflux
protein
SEQ ID NO. 2634LM-718.1From 2862774to 2863283Unknown
SEQ ID NO. 2635LM-720.1From 2863561to 2864661Unknown, similar to
probable GTP-binding
protein
SEQ ID NO. 2636LM-721.1From 2864813to 2865121Unknown, similar to
cellobiose PTS enzyme IIA
SEQ ID NO. 2637LM-723.1From 2865127to 2867397Unknown, similar to beta-
glucosidase
SEQ ID NO. 2638LM-724.1From 2867432to 2867731Unknown, similar to PTS,
cellobiose-specific IIB
component
SEQ ID NO. 2639LM-725.1From 2867746to 2869041Unknown, similar to
cellobiose
phosphotransferase system
enzyme IIC
SEQ ID NO. 2640LM-726.1From 2869191to 2871107Unknown, simliar to
lichenan operon
transcription antiterminator
IicR
SEQ ID NO. 2641LM-727.1From 2871318to 2872784catalase
SEQ ID NO. 2642LM-728.1From 2872932to 2873915Unknown
SEQ ID NO. 2643LM-73.1From 2755580to 2757265unknown, highly similar to
potassium-transporting
atpase a chain
SEQ ID NO. 2644LM-730.1From 2873915to 2875837beta-glucoside-specific
phosphotransferase
enzyme II
SEQ ID NO. 2645LM-731.2From 2875945to 2876775transcription antiterminator
SEQ ID NO. 2646LM-733.2From 2877160to 2878011Partition protein ParB
homolg
SEQ ID NO. 2647LM-734.1From 2878004to 2878765Partition protein, ParA
homolog
SEQ ID NO. 2648LM-735.1From 2878989to 2879747Unknown
SEQ ID NO. 2649LM-736.1From 2879906to 2880205Unknown
SEQ ID NO. 2650LM-737.1From 2880217to 2881071Unknown, highly similar to
B. subtilis DNA-binding
protein Spo0J-like homolog
YyaA
SEQ ID NO. 2651LM-739.1From 2881243to 2882049Unknown, similar to E. coli
RpiR transcription regulator
SEQ ID NO. 2652LM-74.1From 2757606to 2757911Unknown, similar to
cellobiose
phosphotransferase
enzyme IIB component
SEQ ID NO. 2653LM-740.1From 2882033to 2882938Unknown, similar to
transcription regulator
SEQ ID NO. 2654LM-741.1From 2882963to 2883409Unknown, similar to
phosphotransferase system
mannitol-specific enzyme
IIA
SEQ ID NO. 2655LM-742.1From 2883422to 2884078Unknown, similar to
phosphatase
SEQ ID NO. 2656LM-744.1From 2884141to 2885547Unknown, similar to
phosphotransferase system
mannitol-specific enzyme
IIBC
SEQ ID NO. 2657LM-747.1From 2885581to 2886582Unknown, similar to
dehydrogenase
SEQ ID NO. 2658LM-748.2From 2886584to 2887297Unknown, similar to a
putative N-
acetylmannosamine-6-
phosphate epimerase
SEQ ID NO. 2659LM-749.2From 2536347to 2538581unknown, similar to
transport protein
SEQ ID NO. 2660LM-750.1From 2538588to 2539196Unknown, similar to
transcription regulator
SEQ ID NO. 2661LM-751.1From 2539213to 2539611Unknown
SEQ ID NO. 2662LM-752.1From 2539838to 2540173Unknown
SEQ ID NO. 2663LM-753.1From 2540421to 2541857Unknown, similar to
chitinase and chitin binding
protein
SEQ ID NO. 2664LM-754.1From 2542014to 2542610ATP-dependent Clp
protease proteolytic subunit
SEQ ID NO. 2665LM-755.1From 2542658to 2544049Unknown, similar to amino
acid transporter
SEQ ID NO. 2666LM-758.1From 2545487to 2546503Unknown, similar to NADH
oxidase
SEQ ID NO. 2667LM-759.1From 2546530to 2547501Unknown, conserved
hypothetical protein
SEQ ID NO. 2668LM-760.1From 2547509to 2548477unknown, conserved
hypothetical protein
SEQ ID NO. 2669LM-761.1From 2548479to 2549354unknown, conserved
hypothetical protein
SEQ ID NO. 2670LM-762.1From 2549462to 2551192Unknown, similar to
phosphomannomutase and
phosphoglucomutase
SEQ ID NO. 2671LM-763.1From 2551234to 2552295Unknown, similar to aldose
1-epimerase (mutarotase)
SEQ ID NO. 2672LM-764.1From 2552312to 2553295UDP-glucose 4-epimerase
SEQ ID NO. 2673LM-765.1From 2553414to 2554373thioredoxin reductase
SEQ ID NO. 2674LM-766.1From 2554452to 2555927Unknown
SEQ ID NO. 2675LM-767.1From 2556013to 2556510Unknown, similar to
acetyltransferase
SEQ ID NO. 2676LM-768.1From 2556514to 2557167Unknown, similar to B. subtilis
P-Ser-HPr
phosphatase
SEQ ID NO. 2677LM-769.1From 2557211to 2558044unknown, highly similar to
prolipoprotein diacylglyceryl
transferase
SEQ ID NO. 2678LM-77.1From 2758046to 2759353Unknown, similar to
cellobiose
phosphotransferase
enzyme IIC component
SEQ ID NO. 2679LM-771.1From 2558130to 2559068HPr-P(Ser)
kinase/phosphatase
SEQ ID NO. 2680LM-772.1From 2559255to 2559608Unknown, similar to B. subtilis
YvlD protein
SEQ ID NO. 2681LM-774.1From 2559841to 2561052unknown
SEQ ID NO. 2682LM-776.1From 2561120to 2562382Unknown, similar to B. subtilis
YvlB protein
SEQ ID NO. 2683LM-778.1From 2562591to 2565461excinuclease ABC (subunit
A)
SEQ ID NO. 2684LM-78.1From 2759390to 2759692Unknown, similar to
cellobiose
phosphotransferase
enzyme IIA component
SEQ ID NO. 2685LM-780.1From 2565469to 2567445excinuclease ABC (subunit
B)
SEQ ID NO. 2686LM-781.1From 2567956to 2568603Unknown
SEQ ID NO. 2687LM-782.1From 2568625to 2569011unknown
SEQ ID NO. 2688LM-783.1From 2569117to 2569422Unknown, similar to
transcription regulator ArsR
family
SEQ ID NO. 2689LM-784.1From 2569690to 2570349Unknown, similar to
negative regulator of
phosphate regulon
SEQ ID NO. 2690LM-786.1From 2570362to 2571141unknown, similar to
phosphate ABC transporter
(ATP-binding protein)
SEQ ID NO. 2691LM-788.1From 2571156to 2571971unknown, similar to
phosphate ABC transporter
(ATP-binding protein)
SEQ ID NO. 2692LM-789.1From 2571999to 2572883Unknown, similar to
phosphate ABC transporter
(permease protein)
SEQ ID NO. 2693LM-791.1From 2572880to 2573803Unknown, similar to
phosphate ABC transporter
(permease protein)
SEQ ID NO. 2694LM-793.1From 2573897to 2574805Unknown, similar to
phosphate ABC transporter
(binding protein)
SEQ ID NO. 2695LM-794.1From 2575061to 2576836two-component sensor
histidine kinase
SEQ ID NO. 2696LM-795.1From 2576836to 2577546two-component response
regulator
SEQ ID NO. 2697LM-796.2From 2577696to 2578871Unknown
SEQ ID NO. 2698LM-797.2From 2578899to 2580347Unknown, similar to
cardiolipin synthase
SEQ ID NO. 2699LM-799.2From 2278639to 2279292competence negative
regulator mecA
SEQ ID NO. 2700LM-8.1From 2717422to 2718234Unknown
SEQ ID NO. 2701LM-80.1From 2760147to 2760680unknown
SEQ ID NO. 2702LM-801.1From 2277398to 2278513Unknown, similar to a
putative competence protein
from streptococcus
pneumoniae
SEQ ID NO. 2703LM-803.1From 2275520to 2277325Unknown, similar to
oligoendopeptidase
SEQ ID NO. 2704LM-804.1From 2274961to 2275221Unknown
SEQ ID NO. 2705LM-805.1From 2274127to 2274750Unknown
SEQ ID NO. 2706LM-806.1From 2272403to 2274112Unknown
SEQ ID NO. 2707LM-807.1From 2271444to 2272316Unknown, similar to
ferrichrome ABC transporter
(binding protein)
SEQ ID NO. 2708LM-809.1From 2270465to 2271454Unknown, similar to
ferrichrome ABC transporter
(permease)
SEQ ID NO. 2709LM-81.1From 2760845to 2761954Unknown, simlilar to cell
division protein FtsW
SEQ ID NO. 2710LM-810.1From 2269705to 2270484Unknown, similar to
ferrichrome ABC transporter
(ATP-binding protein)
SEQ ID NO. 2711LM-812.1From 2268961to 2269701Unknown, similar to
unknown protein
SEQ ID NO. 2712LM-813.2From 2268424to 2268900Unknown, similar to
unknown protein
SEQ ID NO. 2713LM-817.1From 2259366to 2259713Unknown, similar to
unknown protein
SEQ ID NO. 2714LM-818.1From 2258697to 2259260Unknown, similar to
transcriptional regulator
(tetR family)
SEQ ID NO. 2715LM-820.1From 2257754to 2258515Unknown, similar to
dehydrogenase
SEQ ID NO. 2716LM-821.1From 2256507to 2257580Unknown, similar to
unknown proteins
SEQ ID NO. 2717LM-823.1From 2255034to 2256401Unknown, similar to sigma-
54-dependent
transcriptional activator
SEQ ID NO. 2718LM-826.1From 2253221to 2254804Unknown, similar to
propionate CoA-transferase
SEQ ID NO. 2719LM-827.1From 2252001to 2253224Unknown, similar to
antiporter proteins
SEQ ID NO. 2720LM-83.1From 2761951to 2763081Unknown, simlilar to cell
division protein FtsW
SEQ ID NO. 2721LM-830.1From 2251050to 2251979Unknown, similar to
unknown proteins
SEQ ID NO. 2722LM-831.1From 2250431to 2250820Unknown, similar to
glyoxalase I
SEQ ID NO. 2723LM-833.1From 2249683to 2250300Unknown, similar to
unknown proteins
SEQ ID NO. 2724LM-834.1From 2248946to 2249617Unknown
SEQ ID NO. 2725LM-835.1From 2247998to 2248693Unknown, similar to
transcription regulator
CRP/FNR family
SEQ ID NO. 2726LM-836.1From 2247050to 2247928Unknown, similar to
transcriptional regulator
(AraC/XylS family)
SEQ ID NO. 2727LM-837.1From 2245899to 2246975Unknown, similar to
oxidoreductase
SEQ ID NO. 2728LM-838.1From 2245143to 2245883Unknown, similar to
unknown proteins
SEQ ID NO. 2729LM-839.1From 2244419to 2245141Unknown
SEQ ID NO. 2730LM-840.1From 2243448to 2244416Unknown, similar to
unknown proteins
SEQ ID NO. 2731LM-841.1From 2242376to 2243425Unknown, similar to
oxidoreductase
SEQ ID NO. 2732LM-842.1From 2240068to 2241969Unknown
SEQ ID NO. 2733LM-843.1From 2239672to 2240013Unknown
SEQ ID NO. 2734LM-845.1From 2236844to 2239135Unknown, similar to
ribonucleoside-diphosphate
reductase, subunit alpha
SEQ ID NO. 2735LM-846.1From 2235739to 2236788Unknown, similar to
ribonucleoside-diphosphate
reductase, subunit beta
SEQ ID NO. 2736LM-847.1From 2235305to 2235742Unknown, similar to
flavodoxin
SEQ ID NO. 2737LM-848.1From 2234982to 2235308Unknown, similar to
thioredoxin
SEQ ID NO. 2738LM-849.1From 2234575to 2234874Unknown, similar to
unknown proteins
SEQ ID NO. 2739LM-85.1From 2763082to 2765652Unknown, highly similar to
Mg2+ transport ATPase
SEQ ID NO. 2740LM-850.1From 2233841to 2234158Unknown, similar to
unknown proteins
SEQ ID NO. 2741LM-851.1From 2233150to 2233791Unknown, similar to
unknown proteins
SEQ ID NO. 2742LM-853.1From 2232087to 2233094Unknown, similar to
unknown proteins
SEQ ID NO. 2743LM-854.1From 2231127to 2231981Unknown, similar to
transcription regulator LysR
family
SEQ ID NO. 2744LM-856.1From 2230042to 2231058Unknown, similar to
unknown protein
SEQ ID NO. 2745LM-857.1From 2229193to 2229927Unknown, similar to
transcription regulator GntR
family
SEQ ID NO. 2746LM-858.1From 2227412to 2229154Unknown, weakly similar to
mannose-6-phosphate
isomerase
SEQ ID NO. 2747LM-859.1From 2226533to 2227198Unknown
SEQ ID NO. 2748LM-860.1From 2225471to 2225944Unknown, similar to
unknown protein
SEQ ID NO. 2749LM-861.1From 2224219to 2225454Unknown, similar to ABC
transporter (membrane
protein)
SEQ ID NO. 2750LM-862.1From 2223324to 2224226Unknown, similar to ABC
transporter (ATP-binding
protein)
SEQ ID NO. 2751LM-863.1From 2221671to 2223128Unknown, similar to
transcription regulator
SEQ ID NO. 2752LM-865.1From 2221186to 2221659Unknown, similar to PTS
system, fructose-specific
enzyme IIA component
SEQ ID NO. 2753LM-866.1From 2220862to 2221173Unknown, similar to PTS
system, fructose-specific
enzyme IIB component
SEQ ID NO. 2754LM-867.1From 2219751to 2220845Unknown, similar to PTS
system, fructose-specific
enzyme IIC component
SEQ ID NO. 2755LM-869.1From 2218879to 2219733Unknown, similar to
fructose-1,6-biphosphate
aldolase type II
SEQ ID NO. 2756LM-87.1From 2766207to 2766773Unknown, similar to
transcription regulator, TetR
family
SEQ ID NO. 2757LM-870.1From 2217963to 2218862Unknown, similar to
fructose-1,6-biphosphate
aldolase type II
SEQ ID NO. 2758LM-871.1From 2217222to 2217953Unknown
SEQ ID NO. 2759LM-872.1From 2216054to 2216743Unknown
SEQ ID NO. 2760LM-874.2From 2486136to 2486921Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 2761LM-875.1From 2484816to 2486117Unknown, similar to
aminotransferase
SEQ ID NO. 2762LM-876.1From 2483589to 2484815Unknown, similar to
aminotransferase
SEQ ID NO. 2763LM-877.1From 2483149to 2483592Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2764LM-878.1From 2481732to 2483126Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2765LM-879.1From 2481023to 2481574Unknown
SEQ ID NO. 2766LM-880.1From 2480531to 2480944Unknown
SEQ ID NO. 2767LM-881.1From 2479607to 2479921unknown
SEQ ID NO. 2768LM-883.1From 2478623to 2479465unknown, similar to B. subtilis
YunF protein
SEQ ID NO. 2769LM-884.1From 2478225to 2478590Unknown
SEQ ID NO. 2770LM-885.1From 2477385to 2478224Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2771LM-886.1From 2475860to 2477251Unknown, similar to B. subtilis
YunD protein
SEQ ID NO. 2772LM-887.1From 2475588to 2475863Unknown, similar to B. subtilis
YutD protein
SEQ ID NO. 2773LM-888.1From 2474801to 2475568Unknown, similar to
conserved hypothetical
protein and to B. subtilis
YutF protein
SEQ ID NO. 2774LM-889.1From 2474309to 2474752Unknown, similar to
acetyltransferase
SEQ ID NO. 2775LM-89.1From 2766935to 2768707Unknown, similar to
autolysin, N-
acetylmuramidase
SEQ ID NO. 2776LM-890.1From 2472963to 2474273Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2777LM-892.1From 2472438to 2472938Unknown, low temperature
requirement C protein, also
similar to B. subtilis YutG
protein
SEQ ID NO. 2778LM-893.1From 2472089to 2472325Unknown, similar to NifU
protein
SEQ ID NO. 2779LM-896.1From 2468210to 2468392Unknown, hypothetical CDS
SEQ ID NO. 2780LM-897.1From 2467643to 2467969Unknown, similar to B. subtilis
YuzD protein
SEQ ID NO. 2781LM-899.1From 2466713to 2467342Unknown, conserved
hypothetical protein similar
to B. subtilis YhfK protein
SEQ ID NO. 2782LM-90.1From 2768944to 2769273Unknown
SEQ ID NO. 2783LM-901.1From 2465630to 2466625Unknown, similar to
hypothetical thioredoxine
reductase
SEQ ID NO. 2784LM-903.1From 2463969to 2465180Unknown, similar to NADH
dehydrogenase
SEQ ID NO. 2785LM-904.1From 2463066to 2463884Unknown, similar to B. subtilis
YwqG protein
SEQ ID NO. 2786LM-906.1From 2461803to 2463029Unknown, conserved
hypothetical protein
SEQ ID NO. 2787LM-907.1From 2461219to 2461692Unknown, similar to B. subtilis
YuiD protein
SEQ ID NO. 2788LM-908.1From 2460720to 2461091Unknown, similar to B. subtilis
YuxO protein
SEQ ID NO. 2789LM-91.1From 2769381to 2770007Unknown, similar to
thymidylate kinase
SEQ ID NO. 2790LM-910.1From 2460090to 2460683unknown, similar to proteins
involved in resistance to
cholate and to NA(+) and in
pH homeostasis
SEQ ID NO. 2791LM-911.1From 2459819to 2460106unknown, similar to proteins
involved in resistance to
cholate and to NA(+) and in
pH homeostasis
SEQ ID NO. 2792LM-912.1From 2459343to 2459822unknown, similar to proteins
involved in resistance to
cholate and to NA(+) and in
pH homeostasis
SEQ ID NO. 2793LM-913.1From 2457852to 2459336unknown, similar to proteins
involved in resistance to
cholate and to NA(+) and in
pH homeostasis
SEQ ID NO. 2794LM-915.1From 2457512to 2457859unknown, similar to proteins
involved in resistance to
cholate and to NA(+) and in
pH homeostasis
SEQ ID NO. 2795LM-916.1From 2457087to 2457512unknown, similar to proteins
involved in resistance to
cholate and to NA(+) and in
pH homeostasis
SEQ ID NO. 2796LM-917.1From 2454695to 2457103unknown, similar to proteins
involved in resistance to
cholate and to NA(+) and in
pH homeostasis
SEQ ID NO. 2797LM-918.1From 2453138to 2454349Unknown, similar to multidrug
resistance efflux pump
SEQ ID NO. 2798LM-919.1From 2452322to 2452906Unknown, similar to
peptidyl-prolyl cis-trans
isomerase
SEQ ID NO. 2799LM-92.1From 2770060to 2771439Unknown, similar to lysine
decarboxylase
SEQ ID NO. 2800LM-920.1From 2451840to 2452235Unknown
SEQ ID NO. 2801LM-922.1From 2450437to 2451798Unknown, similar to
aspartate kinase
SEQ ID NO. 2802LM-923.1From 2449878to 2450192Unknown, similar to
phosphotransferase system
(PTS) beta-glucoside-
specific enzyme IIB
component
SEQ ID NO. 2803LM-924.1From 2449155to 2449835unknown, similar to ABC-
transporter ATP binding
proteins
SEQ ID NO. 2804LM-925.1From 2448052to 2449155Unknown, similar to putative
ABC-transporter
transmembrane subunit
SEQ ID NO. 2805LM-926.1From 2446747to 2447913Unknown, similar to
aminotransferase
SEQ ID NO. 2806LM-927.1From 2446240to 2446593Unknown, similar to B. subtilis
general stress
protein 13 containing a
ribosomal S1 protein
domain
SEQ ID NO. 2807LM-929.1From 2445685to 2446092Unknown
SEQ ID NO. 2808LM-930.2From 2444231to 2445583glucose-6-phosphate
isomerase
SEQ ID NO. 2809LM-931.2From 2443368to 2444126Unknown, similar to
transcription regulator DeoR
family
SEQ ID NO. 2810LM-935.1From 198201to 199409Unknown, similar to ABC
transporter, ATP-binding
protein
SEQ ID NO. 2811LM-936.1From 197512to 198204ABC transporter, ATP-
binding protein
SEQ ID NO. 2812LM-937.1From 196786to 197463unknown
SEQ ID NO. 2813LM-938.1From 195793to 196611Unknown, similar to PurR,
transcription repressor of
purine operon of B. subtilis
SEQ ID NO. 2814LM-939.1From 194892to 195629Unknown, similar to a
putative phospho-beta-
glucosidase
SEQ ID NO. 2815LM-94.1From 2771623to 2772612Unknown, similar to
dihydroxyacetone kinase
SEQ ID NO. 2816LM-941.1From 193989to 194870Unknown, similar to B. subtilis
YabH protein
SEQ ID NO. 2817LM-942.1From 193593to 193850Unknown, highly similar to
B. subtilis Veg protein
SEQ ID NO. 2818LM-943.1From 192586to 193473dimethyladenosine
transferase (16S rRNA
dimethylase)
SEQ ID NO. 2819LM-945.1From 192018to 192593Unknown, similar to B. subtilis
YabF protein
SEQ ID NO. 2820LM-946.1From 190690to 191916Unknown, similar to B. subtilis
YabE protein
SEQ ID NO. 2821LM-947.1From 189625to 190398Unknown, similar to
conserved hypothetical
proteins
SEQ ID NO. 2822LM-949.1From 187867to 189528Unknown, similar to oligo-
1,6-glucosidase
SEQ ID NO. 2823LM-95.1From 2772634to 2773230Unknown, similar to
hypothetical
dihydroxyacetone kinase
SEQ ID NO. 2824LM-950.1From 185572to 187863Unknown, similar to alpha-
glucosidase
SEQ ID NO. 2825LM-955.1From 182267to 185569Unknown, similar to alpha-
xylosidase and alpha-
glucosidase
SEQ ID NO. 2826LM-958.1From 180928to 182184Unknown, similar to sugar
ABC transporter,
periplasmic sugar-binding
protein
SEQ ID NO. 2827LM-96.1From 2773234to 2773608unknown
SEQ ID NO. 2828LM-960.1From 180052to 180900Unknown, similar to sugar
ABC transporter, permease
protein
SEQ ID NO. 2829LM-961.1From 179174to 180052Unknown, similar to sugar
ABC transporters,
permease proteins
SEQ ID NO. 2830LM-962.1From 177924to 179138Unknown, similar to xylose
repressor
SEQ ID NO. 2831LM-964.1From 175766to 177760methionyl-tRNA synthetase
SEQ ID NO. 2832LM-965.1From 174834to 175694Unknown, similar to glucose
uptake protein
SEQ ID NO. 2833LM-968.1From 172903to 173208Unknown, similar to
transposase
SEQ ID NO. 2834LM-969.1From 172410to 172802Unknown, similar to
transposase (N-terminal
part)
SEQ ID NO. 2835LM-97.1From 2773714to 2774565Unknown, similar to putative
transcription regulator
SEQ ID NO. 2836LM-970.1From 172072to 172410Unknown, similar to
transposase C-terminal part
SEQ ID NO. 2837LM-975.1From 168005to 169270Unknown
SEQ ID NO. 2838LM-977.1From 167086to 167943Unknown, similar to a
glucose uptake protein
SEQ ID NO. 2839LM-978.1From 166687to 166971Unknown, similar to B. subtilis
transcription
regulatory protein AbrB
SEQ ID NO. 2840LM-98.1From 2774684to 2775523Unknown, similar to
conserved hypothetical
protein
SEQ ID NO. 2841LM-981.1From 165761to 166642Unknown, conserved
hypothetical protein
SEQ ID NO. 2842LM-982.1From 165492to 165764unknown, similar to B. subtilis
YazA protein
SEQ ID NO. 2843LM-983.1From 164756to 165508unknown, conserved
hypothetical protein
SEQ ID NO. 2844LM-984.1From 164309to 164698Unknown, similar to B. subtilis
YabA protein
SEQ ID NO. 2845LM-987.1From 163465to 164298Unknown
SEQ ID NO. 2846LM-988.2From 162467to 163459Unknown, similar to B. subtilis
DNA polymerase III
(delta subunit)
SEQ ID NO. 2847LM-990.2From 596580to 597620Unknown, conserved
hypothetical protein
SEQ ID NO. 2848LM-991.1From 595843to 596538Unknown, similar to
phosphoglycerate mutase
SEQ ID NO. 2849LM-992.1From 595135to 595842Unknown, similar to
phosphoglycerate mutase
SEQ ID NO. 2850LM-994.1From 593528to 595006Unknown, similar to di-
tripeptide ABC transporter
(membrane protein)
SEQ ID NO. 2851LM-995.1From 592257to 593438Unknown, similar to NADH-
dependent butanol
dehydrogenase
SEQ ID NO. 2852LM-996.1From 591412to 592047Unknown
SEQ ID NO. 2853LM-997.1From 590324to 591355Unknown, similar to
unknown protein
SEQ ID NO. 2854LM-998.1From 589406to 590248Unknown
|