The present invention relates to the field of anti-microbial therapy, and more precisely to methods for the screening of antimicrobial substances active against bacteria possessing a cell wall comprising peptidoglycan.
Bacterial infections remain among the most common and deadly causes of human disease. Unfortunately, the overuse of antibiotics has led to antibiotic resistant pathogenic strains of bacteria. Indeed, bacterial resistance to the new chemical analogs of these drugs appears to be out-pacing the development of such analogs. For example, life-threatening strains of three species of bacteria (Enterococcus faecalis, Mycobacterium tuberculosis, and Pseudomonas aeruginosa) have evolved to be resistant against all known antibiotics. [Stuart B. Levy, “The Challenge of Antibiotic Resistance”, in Scientific American, pgs. 46-53 (March 1998)].
Antibacterial substances that have already been identified include low-molecular weight substances that are produced as secondary metabolites by certain groups of micro-organisms, especially Streptomyces, Bacillus, and a few molds (Penicillium and Cephalosporium) that are inhabitants of soils. These antibacterial substances may have a bactericidal effect or a static effect on a range of micro-organisms.
Antibacterial substances that have already been identified also include chemotherapeutic agents which are chemically synthesized, as well as semi-synthetic antibiotics, wherein an antibacterial substance that is naturally produced by a micro-organism is subsequently modified by chemical methods to achieve desired properties.
Antibiotics effective against prokaryotes which kill or inhibit a wide range of Gram-positive and Gram-negative bacteria are said to be of broad spectrum. If effective against Gram-positive or Gram-negative bacteria, they are of narrow spectrum. If effective against a single organism or disease, they are referred to as limited spectrum.
Antibacterial substances achieve their bactericidal or static effects by altering various metabolic pathways of the target micro-organisms.
Several antibacterial substances act as cell membrane inhibitors that disorganise the structure or inhibit the function of bacterial membranes, like polymixin B, which binds to membrane phospholipids and thereby interferes with membrane function, mainly against Gram-negative bacteria.
Several other antibacterial substances act as protein synthesis inhibitors, like tetracyclines, chloramphenicol, macrolides and aminoglycosides.
Still other antibacterial substances affect the synthesis of DNA or DNA, or can bind to DNA or RNA, like quinolones and rifamycins.
Yet other antibacterial substances act as competitive inhibitors of essential metabolites or growth factors, like sulfonamides.
Important antibacterial substances act as inhibitors of the cell wall synthesis, and more specifically as inhibitors of the synthesis of the bacterial peptidoglycan. The peptidoglycan is a macromelular structure found on the outer face of the cytoplasmic membrane of almost all bacteria. This structure is of importance for the maintenance of the integrity of the bacteria and for the cell division process. The basic unit of the peptidoglycan is a disaccharide peptide assembled by a series of cytoplasmic and membrane reactions. The resulting unit is composed of N-acetylglucosamine (GlcNAc) linked to N-acetylmuramic acid (MurNAc) substituted by a stem peptide. In the majority of pathogenic Gram positive bacteria such as Staphylococcus, Streptococcus and Enterococcus, the stem peptide consists in a conserved L-alanyl-γ-D-glutamyl-L-lysyl-D-alanyl-D-alanine pentapeptide and variable side chains linked to the ε-amino group of the third residue (L-Lys3). The structure of the side chain conserved in the members of the same species consists of glycines or various L-amino acids added by the transferases which used the corresponding specific aminoacyl-tRNAs as substrates. Once this basic unit have been transferred through the cytoplasmic membrane, the final steps of peptidoglycan synthesis involve its polymerization to glycan strands by glycosyltransferases and the cross-linking of the stem peptides by multiple D,D-transpeptidases. In Enterococcus faecium peptidoglycan, the side chain consists of one D-Asp or one D-Asn which is linked by its β-carboxyl group to the ε-amino group of L-Lys3. The resulting unit is composed of GlcNAc-MurNAc substituted by an L-alanyl-γ-D-glutamyl-L-(Nε-D-isoaspartyl)lysyl-D-alanyl-D-alanine or an L-alanyl-γ-D-glutamyl-L-(Nε-D-isoasparaginyl)lysyl-D-alanyl-D-alanine stem hexapeptide (D-Asx-pentapeptide)(4-6). At the late stage of the polymerisation, the interpeptide bridge synthesized by the D,D-transpeptidases consist in a peptide bond between the carboxyl group of D-Ala at position 4 of a donor stem peptide and the amino group of the D-Asn or D-Asp (D-Asx) linked to the L-Lys3 of an acceptor peptide stem.
Peptidoglycan synthesis inhibitors exert their selective toxicity against eubacteria, since mammal cells lack peptidoglycan. All beta lactams have a common mechanism of action and act as suicide substrates of the D,D-transpeptidase catalytic domain of the penicillin binding proteins (PBPs) responsible for the last cross-linking step of the cell wall assembly.
The main inhibitors of the cell wall synthesis are those of the beta lactam family, which include penicillins and cephalosporins. The beta lactam antibiotics are stereochemically related to D-alanyl-D-alanine which is a substrate for the last step in peptidoglycan synthesis, i.e. the final cross-linking between peptide side chains. Beta lactam compounds include natural and semi-synthetic penicillins, clavulanic acid, cephalosposrins, carbapenems and monobactams. Other inhibitors also encompass glycopeptides such as vancomycin.
Over the past three decades, there has been an increasing use of beta lactams, which have entered clinical use since 1965. Unfortunately, the widespread use of these antibacterial substances has resulted in an alarming increase in the number of resistant strains, especially among clinically important bacteria such as the genera Salmonella, Enterobacteriacae, Pseudomonas and Staphylococcus.
Generally, bacterial resistance to beta lactams occurs primarily through three mechanisms: (i) destruction of the antibiotic by beta-lactamases, (ii) decreased penetration due to changes in bacterial outer membrane composition and (iii) alteration in penicillin-binding proteins (PBPs) resulting in interference with beta lactam binding. The latter pathway is especially important, as the binding of beta lactams to PBPs is essential for inhibiting peptidoglycan biosynthesis. For glycopeptides, increasing numbers of Vancomycin-resistant strains of enterococci have been found since 1988. Vancomycin-resistant enterococci exhibit changes in the cell wall production.
Overuse of antibiotics, non-compliance with a full course of antibiotic treatment, routine prophylactic use and sub-therapeutic drug levels all contribute to the development of resistant strains of bacteria.
There is thus a need in the art for identifying novel antibacterial substances exerting an inhibiting effect on the peptidoglycan biosynthesis, as well as for novel methods for their screening.
Notably, there is a need in the art for identifying inhibitors of peptidoglycan biosynthesis that are active against antibiotic-resistant bacteria, including beta lactams-resistant bacteria.
This need in the art includes identifying novel bacterial target proteins that are involved in peptidoglycan biosynthesis that will allow performing screening methods of active antibacterial substances. Such screening methods encompass in vitro screening methods wherein inhibitory activity of candidate substances against newly identified bacterial target protein(s) is assayed. Such screening methods also encompass in silico screening methods wherein blocking biological activity of newly identified bacterial target protein(s) can be assayed, once said target protein(s) is (are) identified and its (their) tridimensional structure deciphered.
The present invention relates primarily to a method for the screening of antibacterial substances comprising a step of determining the ability of a candidate substance to inhibit the activity of a purified enzyme selected from the group consisting of:
This invention also pertains to a method for the screening of antibacterial substances, wherein said method comprises the steps of:
This invention also concerns a crystallized L,D-transpeptidase having the amino acid sequence starting at the amino acid located in position 119 and ending at the amino acid located in position 466 of the amino acid sequence of SEQ ID No 13 defined herein.
This invention also pertains to a method for selecting a compound that interacts with the catalytic site of the L,D-transpeptidase defined herein, wherein said method comprises the steps of:
The present invention also relates to various other methods for the screening of an antibiotic candidate substance that take benefit from the availability of the three-dimensional structure of the L,D-transpeptidase that is defined in detail in the present specification.
This invention also concerns computer systems and methods that are useful for performing methods for the screening of antibiotic candidate substances acting on the target L,D-transpeptidase that is defined in detail in the present specification.
According to the invention, it has been characterised two proteins that have been found to be both involved in the bacterial cell wall peptidoglycan biosynthesis.
The findings of the invention according to which these two proteins are involved in the peptidoglycan biosynthesis has allowed the inventors to design various methods for the screening of candidate antibacterial substances, the effect of which is targeted against these two proteins.
More precisely, the two proteins that have been identified according to the invention consist of enzymes, thus target proteins for which alterations in their biological activity by candidate antibacterial substances may be easily detected.
Further, one of these two enzymes has been crystallized and its spatial conformation deciphered, including the spatial conformation of its catalytic site, thus allowing the design of in silico screening methods for substances that can enter the catalytic site and prevent availability of said catalytic site for natural substrate(s). In silico methods for screening antibiotics have already proved their efficiency, for instance in the case of screening for aminoglycoside complexing with RNA, using bacterial ribosomal RNA crystal structure as the antibiotics target.
The first enzyme, the involvement of which in the peptidoglycan biosynthesis has been found according to the present invention, consists of a D-aspartate ligase. It is to be noticed that a D-aspartic activating enzyme activity was previously described as being present in enzyme preparations from Streptococcus faecalis (in fact probably from Enterococcus faecium), but without any structural characterization of the corresponding protein(s) (See Staudenbauer and Strominger, 1972, The Journal of Biological Chemistry, Vol. 247(17): 5289-5296).
Said D-aspartate ligase catalyses incorporation of D-aspartate on UDP-MurNac pentapeptide to form the side chain of peptidoglycan precursor. It has been found according to the invention that recombinant expression of the gene encoding said D-aspartate ligase, in a host organism wherein this gene is not naturally present, induces the recombinant host organism to synthesise a cell wall peptidoglycan wherein D-aspartate residues are linked to the ε-amino group of L-Lys3 of the main monomers of the peptidoglycan, which shows that said D-aspartate ligase characterised according to the invention is functional in various bacteria that do not naturally express said enzyme.
Further, it has been found structurally similar D-aspartate ligases in various bacteria for which existing data show that they produce D-aspartate-containing branched cell wall precursors of the peptidoglycan, including bacteria from the Lactobacilli species, Lactococci species and Pediococci species.
The second enzyme, the involvement of which in the peptidoglycan biosynthesis has been found according to the present invention, consists of a L,D-transpeptidase. It is to be noticed that a beta lactam-insensitive L,D-transpeptidase activity was previously described as being present in membrane preparations from Enterococcus faecium bacteria, but without any structural characterization of the corresponding protein(s) (See Mainardi et al., 2002, The Journal of Biological Chemistry, Vol. 277(39): 35801-35807).
Said L,D-transpeptidase catalyses the L,D transpeptidation of peptidoglycan subunits containing a tetrapeptide stem.
The L,D-transpeptidase characterised according to the present invention has a high value as a target protein for the screening of novel antibacterial substances.
Further, the catalytic site of the L,D-transpeptidase characterised according to the present invention has been identified, both (i) biologically, through directed mutagenesis experiments, and (ii) structurally, through the characterisation of the three-dimensional structure of this enzyme, including the characterisation of the three dimensional structure of its active site, after crystallisation of this enzyme.
Thus, according to the invention, the biological effectors for the previously known bacterial D-aspartate ligase activity and L,D-transpeptidase activity in certain bacteria have been characterized, isolated and recombinantly produced for the first time.
These findings have allowed the inventors to design methods for the screening of antibacterial substances having the ability to cause disorders in the bacterial peptidoglycan normal biosynthesis.
Thus, a first object of the invention consists of a method for the screening of antibacterial substances comprising a step of determining the ability of a candidate substance to inhibit the activity of a purified enzyme selected from the group consisting of:
According to the invention, it has been found that the D-aspartate ligase of SEQ ID No 1 originating from Enterococcus faecium bacteria, that has been newly characterised and isolated, possesses structural and functional similarities with proteins characterized herein as consisting of D-aspartate ligases originating from various other bacteria, including those originating from, respectively, Lactococcus lactis (SEQ ID No 2), Lactococcus cremoris SK11 (SEQ ID No 3), Lactobacillus gasseri (SEQ ID No 4), Lactobacillus johnosonii NCC 533 (SEQ ID No 5), Lactobacillus delbruckei Subsp. bulgaricus (SEQ ID No 6), Lactobacillus casei (SEQ ID No 7), Lactobacillus acidophilus (SEQ ID No 8), Lactobacillus brevis (SEQ ID No 9) and Pediococcus pentosaceus (SEQ ID No 10).
More specifically, beyond their amino acid sequence similarity with the D-aspartate ligase of SEQ ID No 1, the D-aspartate ligases of SEQ ID No 2 to SEQ ID No 10 all originate from bacteria which produce D-Asp-containing branched cell wall peptidoglycan precursors. Conversely, no nucleic acid sequences encoding proteins having similarities with the D-aspartate ligase of SEQ ID No 1 are found in the genome of bacteria having cell wall peptidoglycan with either (i) direct crosslinks or (ii) crosslinks containing glycine or L-amino acids.
The amino acid sequence of SEQ ID No 11 consists of the C-terminal end located from the amino acid residue in position 340 and ending at the amino acid residue in position 466 of the L,D-transpeptidase originating from Enterococcus faecium bacteria of SEQ ID No 13, that catalyses the L,D transpeptidation of peptidoglycan subunits containing a tetrapeptide stem. More precisely, it has been found according to the invention that the C-terminal portion of SEQ ID No 11 of said L,D-transpeptidase comprises the catalytic site of said enzyme, both by directed mutagenesis experiments and by crystallisation of this protein. Notably, it has been found that important amino acid residues comprised in the catalytic site of said L,D-transpeptidase include the Serine residue located at position 439 of SEQ ID No 13 and the Cysteine residue located at position 442 of SEQ ID No 13. From crystallisation data, it has further been found that the Histidine residue located at position 421 of SEQ ID No 13 and the Histidine residue located at position 440 of SEQ ID No 13 both form part of the catalytic site of said L,D-transpeptidase. More generally, the catalytic site of said L,D-transpeptidase of SEQ ID No 13 is comprised in the amino acid sequence beginning at the Isoleucine residue located at position 368 of SEQ ID No 13 and ending at the Methionine residue located at position 450 of SEQ ID No 13.
The L,D-transpeptidase notably comprises a C-terminal portion of SEQ ID No 12, which includes the amino acid sequence of SEQ ID No 11 at its C-terminal end. The L,D-transpeptidase C-terminal portion of SEQ ID No 12 forms a protein domain that is also found in proteins originating from various other bacteria, notably Gram-positive bacteria. Proteins having strong amino acid sequence identity with the L,D-transpeptidase comprising SEQ ID No 11 or 12 are found in proteins originating from Lactobacillus plantarum, Clostridium acetobutylicum, Enterococcus faecalis and Bacillus anthracis.
The complete amino acid sequence of the L,D-transpeptidase that has been characterised according to the invention consists of the amino acid sequence of SEQ ID No 13.
As intended herein, a D-aspartate ligase or a L,D-transpeptidase characterized according to the invention, or any biologically active peptide thereof, “comprises” a polypeptide as defined above because, in certain embodiments, said D-aspartate ligase or said L,D-transpeptidase may not simply consist of said polypeptide defined above. Illustratively, a D-aspartate ligase or a L,D-transpeptidase characterized according to the invention, or any biologically active peptide thereof, may comprise, in addition to a polypeptide as defined above, additional amino acid residues that are located (i) at the N-terminal end, (ii) at the C-terminal end or (iii) both at the N-terminal end and at the C-terminal end of said polypeptide above. Generally, at the N-terminal end or at the C-terminal end of a polypeptide defined above, there is no more than 30 additional amino acid residues and often no more than 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5 or 4 additional amino acid residues. Illustratively, a polypeptide as defined above that possesses a D-aspartate ligase or a L,D-transpeptidase activity possesses, at its C-terminal end, six additional Histidine amino acid residues.
As intended herein, a polypeptide or a protein having at least 50% amino acid identity with a reference amino acid sequence possesses at least 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 99.5% amino acid identity with said reference amino acid sequence.
For the purpose of determining the percent of identity of two amino acid sequences according to the present invention, the sequences are aligned for optimal comparison purposes. For example, gaps can be introduced in one or both of a first and a second amino acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes.
For optimal comparison purposes, the percent of identity of two amino acid sequences can be achieved with CLUSTAL W (version 1.82) with the following parameters: (1) CPU MODE=ClustalW mp; (2) ALIGNMENT=<<full>>; (3) OUTPUT FORMAT=<<aln w/numbers>>; (4) OUTPUT ORDER=<<aligned>>; (5) COLOR ALIGNMENT=<<no>>; (6) KTUP (word size)=<<default>>; (7) WINDOW LENGTH=<<default>>; (8) SCORE TYPE=<<percent>>; (9) TOPDIAG=<<default>>; (10) PAIRGAP=<<default>>; (11) PHYLOGENETIC TREE/TREE TYPE=<<none>>; (12) MATRIX=<<default>>; (13) GAP OPEN=<<default>>; (14) END GAPS=<<default>>; (15) GAP EXTENSION=<<default>>; (16) GAP DISTANCES=<<default>>; (17) TREE TYPE=<<cladogram>> et (18) TREE GRAP DISTANCES=<<hide>>.
By a “biologically active fragment” of a D-aspartate ligase or of a L,D-transpeptidase that are defined above, it is intended herein a polypeptide having an amino acid length that is shorter than the amino acid length of the enzyme polypeptide of reference, while preserving the same D-aspartate ligase or of a L,D-transpeptidase activity, that is the same specificity of catalytic activity and an activity of at least the same order of magnitude than the activity of the parent enzyme polypeptide.
A biologically active fragment of a D-aspartate ligase characterized according to the invention possesses a D-aspartate ligase activity that is assessed, using, as substrates, D-aspartate and a compound selected from the group consisting of UDP-MurNac pentapeptide and UDP-MurNac tetrapeptide, and then quantifying the UDP-MurNac pentapeptide-Asp or the UDP-MurNac tetrapeptide-Asp that is produced. Said fragment consists of a biologically active fragment of a D-aspartate ligase according to the invention if the rate of production of UDP-MurNac tetrapeptide-Asp is at least 0.1 the rate of the D-aspartate ligase of SEQ ID No 1.
A biologically active fragment of a L,D-transpeptidase characterised according to the invention possesses a L,D-transpeptidase activity that is assessed using, as substrates, (i) a donor compound consisting of a tetrapeptide preferably selected from the group consisting of L-Ala-D-Glu-L-Lys-D-Ala, Ac2-L-Lys-D-Ala and disaccaharide-tetrapeptide(iAsn) and (ii) an acceptor compound selected from the group consisting of a D-amino acid or a D-hydroxy acid. Said fragment consists of a biologically active fragment of a L,D-transpeptidase according to the invention if the rate of production of the final dimer product is at least 0.1 the rate of the L,D-transpeptidase of SEQ ID No 12, or of the L,D-transpeptidase of SEQ ID No 13.
Generally, a biologically active fragment of a D-aspartate ligase or of a L,D-transpeptidase according to the invention has an amino acid length of at least 100 amino acid residues. Usually, a biologically active fragment of a D-aspartate ligase or of a L,D-transpeptidase according to the invention comprises at least 100 consecutive amino acid residues of a D-aspartate ligase or of a L,D-transpeptidase as defined above.
Advantageously, a biologically active fragment of a D-aspartate ligase as defined above comprises, or consists of, a polypeptide consisting of 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439 or 440 consecutive amino acid residues of a D-aspartate ligase as defined above, it being understood that the amino acid length of said biologically active peptide fragment is necessary limited by the amino acid length of the D-aspartate ligase from which said biologically active peptide fragment derives.
Advantageously, a biologically active fragment of a L,D-transpeptidase as defined above comprises, or consists of, a polypeptide consisting of 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459 or 461 consecutive amino acid residues of a L,D-transpeptidase as defined above, it being understood that the amino acid length of said biologically active peptide fragment is necessary limited by the amino acid length of the L,D-transpeptidase from which said biologically active peptide fragment derives.
In a preferred embodiment of the method for the screening of antibacterial substances that is defined above, said method comprises the steps of:
As intended herein, a candidate substance to be tested inhibits the catalytic activity of said D-aspartate ligase or of said L,D-transpeptidase if the activity of said enzyme, when the candidate substance is present, is lower than when said enzyme is used without the candidate substance under testing.
Preferably, the candidate substances that are positively selected at step d) of the method above are those that cause a decrease of the production rate of the final product by said D-aspartate ligase or by said L,D-transpeptidase that leads to less than 0.5 times the production rate of the same enzyme in the absence of the candidate substance, more preferably a decrease that leads to less 0.3, 0.2, 0.1, 0.05 or 0.025 times the production rate of the same enzyme in the absence of the candidate substance. The most active candidate substances that may be positively selected at step d) of the method above may completely block the catalytic activity of said enzyme, which leads to a production rate of the final product by said D-aspartate ligase or by said L,D-transpeptidase which is undetectable, i.e. zero, or very close to zero.
In a preferred embodiment of the screening method above, said enzyme consists of a D-aspartate ligase comprising a polypeptide having an amino acid sequence possessing at least 60% amino acid identity with an amino acid sequence selected from the group consisting of SEQ ID No 1 to SEQ ID No 10, or a biologically active fragment thereof.
In another preferred embodiment of the screening method above, said enzyme consists of a D-aspartate ligase comprising a polypeptide having an amino acid sequence possessing at least 90% amino acid identity with the amino acid sequence of SEQ ID No 1 to SEQ ID No 10, or a biologically active fragment thereof.
In a further preferred embodiment of the screening method above, said enzyme consists of a D-aspartate ligase comprising a polypeptide having an amino acid sequence possessing at least 90% amino acid identity with the amino acid sequence of SEQ ID No 1, or a biologically active fragment thereof.
In still a further embodiment, said enzyme consists of the D-aspartate ligase comprising a polypeptide having the amino acid sequence of SEQ ID No 1, or a biologically active fragment thereof.
In yet a further embodiment, said enzyme consists of the D-aspartate ligase of SEQ ID No 1, or a biologically active fragment thereof.
In one preferred embodiment of the screening method above, the D-aspartate ligase activity is assessed using, as substrates, D-aspartate and a compound selected from the group consisting of UDP-MurNac pentapeptide and UDP-MurNac tetrapeptide.
Preferably, radioactively labeled D-aspartate is used, such as D-[14C] aspartate or D-[3H] aspartate.
Usually, the reaction mixture comprising (i) labeled D-aspartate, (ii) UDP-MurNac pentapeptide or UDP-MurNac tetrapeptide and (iii) optionally the candidate inhibitor compound is incubated in the suitable reaction medium during a time period of from 1 h to 3 h, advantageously from 1.5 h to 2.5 h, at a temperature ranging from 35° C. to 39° C., advantageously from 36° C. to 38° C. and most preferably at 37° C., before the reaction is stopped. Usually, the reaction is stopped by boiling the resulting reaction mixture during the appropriate time period, which may be 3 min.
Then, the remaining labeled D-aspartate is separated from the reaction product consisting of labeled UDP-MurNac hexapeptide or UDP-MurNac pentapeptide, e.g. [14C]UDP-MurNac hexapeptide or [14C]UDP-MurNac pentapeptide, depending of the substrate which is used, preferably by performing a chromatography separation step. Usually, non-reacted labeled D-aspartate is separated from the other reaction products by descending paper chromatography, such as disclosed in the examples.
Then, the reaction products are further separated, preferably by performing a subsequent chromatography step, such as a step of reverse phase high-pressure liquid chromatography (rpHPLC), such as disclosed in the examples.
In order to confirm the structure of the final product, the reaction step described above may be performed with non-radioactive D-aspartate and samples of UDP-MurNac-peptide products may be isolated by rpHPLC and then lyophilized. Said lyophilized product may then be resuspended, for example in water, and analyzed by Mass spectrometry (MS) and MS/MS, as disclosed in the examples herein, for instance by performing the technique previously described by Bouhss et al. (2002).
Detection of the labeled reaction product resulting from the D-aspartate ligase catalytic activity may be performed simultaneously with said chromatographic step. For example, if the initial substrate, and thus also the reaction product, are radioactively labeled, then the detection of the reaction product, or the detection and the quantification, of the reaction product, may be performed with a suitable radioactivity detector that is coupled to the chromatography device, such as disclosed in the examples.
Thus, in one preferred embodiment of the screening method above, the D-aspartate ligase activity is assessed by quantifying the UDP-MurNac pentapeptide-Asp or the UDP-MurNac tetrapeptide-Asp that is produced, as it is detailed above and is fully described in the examples.
In another preferred embodiment of the screening method above, said enzyme consists of a L,D-transpeptidase comprising a polypeptide having an amino acid sequence possessing at least 60% amino acid identity with the amino acid sequence of SEQ ID No 11, or a biologically active fragment thereof.
In a further preferred embodiment of the screening method above, said enzyme consists of a L,D-transpeptidase comprising a polypeptide having an amino acid sequence possessing at least 90% amino acid identity with the amino acid sequence of SEQ ID No 11, or a biologically active fragment thereof.
In a still further preferred embodiment of the screening method above, said enzyme consists of a L,D-transpeptidase comprising a polypeptide having the amino acid sequence of SEQ ID No 11, or a biologically active fragment thereof.
It is reminded here that the amino acid sequence of SEQ ID No 11 comprises the C-terminal part of the L,D-transpeptidase of SEQ ID No 13, said amino acid sequence of SEQ ID No 11 comprising the important amino acid residues that form part of the active site of said enzyme, including HIS421, S439, HIS440 and CYS442.
Thus, in another preferred embodiment of the screening method above, said enzyme consists of a L,D-transpeptidase comprising a polypeptide having at least 90% aminoacid identity with the amino acid sequence possessing at least 90% amino acid identity with the amino acid sequence of SEQ ID No 12, or a biologically active fragment thereof. The amino acid sequence of SEQ ID No 12 consists of a C-terminal portion of the L,D-transpeptidase of SEQ ID No 13. The amino acid sequence of SEQ ID No 12 is longer than, and comprises SEQ ID No 11. The amino acid sequence of SEQ ID No 12 also comprises the important amino acid residues that form part of the active site of said enzyme, including HIS421, S439, HIS440 and CYS442. It has been shown according to the invention that the L,D-transpeptidase consisting of SEQ ID No 12 has the same catalytic activity than the L,D-transpeptidase consisting of SEQ ID No 13, despite it lacks the N-terminal end of the L,D-transpeptidase of SEQ ID No 13.
Thus, in one preferred embodiment of the screening method above, said enzyme consists of a L,D-transpeptidase comprising a polypeptide having at least 90% aminoacid identity with the amino acid sequence of SEQ ID No 12, or a biologically active fragment thereof
According to further preferred embodiment of the screening method above, said enzyme consists of a L,D-transpeptidase having at least 90% amino acid identity with the amino acid sequence of SEQ ID No 13, or a biologically active peptide fragment thereof.
In yet a further embodiment of said screening method, said enzyme consists of the L,D-transpeptidase comprising a polypeptide consisting of the amino acid sequence of SEQ ID No 13, or a biologically active peptide fragment thereof.
In still a further embodiment of said screening method, said enzyme consists of the L,D-transpeptidase consisting of the amino acid sequence of SEQ ID No 13, or a biologically active peptide fragment thereof.
Preferably, any one of the biologically active peptide fragments of the polypeptide of SEQ ID No 13 comprises at least 100 consecutive amino acids of SEQ ID No 13 and comprises the amino acid residues SER439 and CYS442.
Preferably, any one of the biologically active peptide fragments of the polypeptide of SEQ ID No 13 comprises at least 100 consecutive amino acids of SEQ ID No 13 and comprises the amino acid residues HIS421, SER439, HIS440 and CYS442.
Preferably, any one of the biologically active peptide fragments of the polypeptide of SEQ ID No 13 comprises the amino acid sequence beginning at the Isoleucine amino acid residue located at position 368 and ending at the Methionine amino acid residue located at position 450 of the L,D-transpeptidase of SEQ ID No 13.
A specific embodiment of a biologically active peptide fragment of the polypeptide of SEQ ID No 13 consists of a polypeptide comprising, or consisting of, the amino acid sequence beginning at the amino acid residue located at position 119 and ending at the amino acid residue located at position 466 of SEQ ID No 13.
In a preferred embodiment of the screening method above, the L,D-transpeptidase activity is assessed using, as substrates, (i) a donor compound consisting of a tetrapeptide preferably selected from the group consisting of L-Ala-D-Glu-L-Lys-D-Ala, Ac2-L-Lys-D-Ala and disaccharide-tetrapeptide(iAsn) and (ii) an acceptor compound selected from the group consisting of a D-amino acid or a D-hydroxy acid.
In certain embodiments of the method above, said D-amino acid is selected from the group consisting of D-methionine, D-asparagine and D-serine.
In certain other embodiments of the method above, said D-hydroxy acid is selected from the group consisting of D-2-hydroxyhexanoic acid and D-lactic acid.
Preferably, the L,D-transpeptidase activity is assessed by performing a standard exchange assay that is based on incubation of non-radioactive Ac2-L-Lys-D-Ala and D-[14C]Ala and determination of Ac2-L-Lys-D[14C]Ala formed by the L,D-transpeptidase catalytic activity, such as disclosed by Mainardi et al. (J. L. Mainardi et al., J. Biol. Chem. 277, 35801 (2002)) as well as in the examples herein.
In an illustrative embodiment of said standard exchange assay, a reaction mixture is provided, which reaction mixture contains (i) purified L,D-transpeptidase, (ii) Ac2-L-Lys-D-Ala, (iii) D-[14C]Ala and (iv) optionally the candidate inhibitor compound. Then the enzyme reaction is performed until completion, generally during a time period of rom 1.5 h to 2.5 h, most preferably 1 h, at a temperature range comprised between 36° C. and 38° C., advantageously between 36.5° C. and 37.5° C., most preferably at 37° C. Then, the enzyme reaction is stopped, for example by boiling the resulting reaction product mixture for a time period sufficient to inactivate the enzyme, such as for a period of time ranging from 3 min to 20 min, most preferably a period of time of about 15 min.
Then, the resulting reaction product mixture is centrifuged and a sample collected from the supernatant of centrifugation is analysed by chromatography, preferably by carrying out a reverse phase high-pressure liquid chromatography (rpHPLC), most preferably with isocratic elution.
Detection of the labeled reaction product resulting from the L,D-transpeptidase catalytic activity may be performed simultaneously with said chromatographic step. For example, if the initial substrate, and thus also the reaction product, are radioactively labeled, then the detection, or the detection and the quantification, of the reaction product may be performed with a suitable radioactivity detector that is coupled to the chromatography device, such as disclosed in the examples.
To assay for in vitro transpeptidation, the one skilled in the art may prepare a reaction mixture comprising (i) purified L,D-transpeptidase, (ii) GlcNAc-MurNAc-L-Ala-D-iGln-L-(M-D-iAsn)Lys-D-Ala, GlcNAc-MurNAc-L-Ala-D-iGln-L-(M-D-iAsn)Lys and GlcNAc-MurNAc-L-Ala-D-iGln-L-Lys-D-Ala and (iii) optionally the inhibitor candidate compound, in a suitable reaction buffer. Then, the transpeptidation reaction is allowed to proceed during a time period preferably ranging from 1.5 h to 2.5 h, most preferably of about 2 h, at a preferred temperature range between 36.5° C. and 37.5° C., most preferably of about 37° C. Then, when brought to completion, the transpeptidation reaction is stopped, for example by boiling for a time period sufficient to inactivate the L,D-transpeptidase, e.g. for a a period of time ranging from 3 min to 20 min, most preferably a period of time of about 15 min.
Then, the resulting reaction product mixture is centrifuged and an aliquot sample is collected from the supernatant of centrifugation.
Said supernatant sample is then used to determine, and usually also quantify, the formation of dimers.
Preferably, the formation of dimers is determined, and usually quantified, by mass-spectrometry. A tandem-mass spectrometry is usually also performed after having cleaved the ether link internal to MurNac by treatment of a sample from the supernantant resulting product reaction mixture with ammonium hydroxide, such as disclosed by Arbeloa et al. (A. Arbeloa et al., J. Biol. Chem. 279, 41546 (2004)). Then, the resulting lactoyl-peptides are fragmented using N2 as the collision gas, such as disclosed by Arbeloa et al. (A. Arbeloa et al., J. Biol. Chem. 279, 41546 (2004)). Any of the D-aspartate ligases or of the L,D-transpeptidases that are defined throughout the present specification can be produced by performing various techniques of protein synthesis that are well known by the one skilled in the art, including chemical synthesis and genetic engineering methods for producing recombinant proteins.
Preferably, any one of the D-aspartate ligases and any one of the L,D-transpeptidases that are defined throughout the present specification are produced as recombinant proteins.
The description below relates primarily to production of the D-aspartate ligases or of the L,D-transpeptidases according to the invention by culturing cells transformed or transfected with a vector containing nucleic acid encoding corresponding polypeptides. It is, of course, contemplated that alternative methods that are well known in the art may be employed to prepare the polypeptides of interest according to the invention. For instance, the polypeptide sequence of interest, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques. See, e.g., Stewart et al., Solid-Phase Peptide Synthesis (W.H. Freeman Co.: San Francisco, Calif., 1969); Merrifield, J. Am. Chem. Soc., 85: 2149-2154 (1963). In vitro protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be accomplished, for instance, with an Applied Biosystems Peptide Synthesizer (Foster City, Calif.) using manufacturer's instructions. Various portions of the polypeptide of interest may be chemically synthesized separately and combined using chemical or enzymatic methods to produce the full-length polypeptide of interest
DNA encoding the polypeptide of interest may be obtained from a cDNA library prepared from tissue believed to possess the mRNA encoding it and to express it at a detectable level. Accordingly, DNAs encoding the D-aspartate ligases or the L,D-transpeptidases can be conveniently obtained from cDNA libraries prepared from bacteria.
Generally, a DNA encoding a D-aspartate ligase or a L,D-transpeptidase as defined herein may be obtained by amplification of bacterial genomic DNA or bacterial cDNA by a specific pair of primers.
A specific pair of primers can be easily designed by the one skilled in the art who has the knowledge of the nucleic acid sequence that encodes the enzyme of interest.
The nucleic acid sequences that encode the D-aspartate ligases of SEQ ID No 1 to 10 consist of the polynucleotides of SEQ ID No 22 to 31, respectively. The nucleic acid sequences that encode the L,D-transpeptidase of SEQ ID No 13 consists of the polynucleotide of SEQ ID No 32.
Illustratively, a DNA encoding a D-aspartate ligase of SEQ ID No 1 may be easily obtained by amplifying bacterial DNA with the pair of primers of SEQ ID No 14 and 15, as shown in the examples.
Illustratively, a DNA encoding a L,D-transpeptidase of SEQ ID No 13 may be easily obtained by amplifying bacterial DNA with the pair of primers of SEQ ID No 18 and 19, as shown in the examples.
Host cells are transfected or transformed with expression or cloning vectors described herein for polypeptide of interest production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, temperature, pH, and the like, can be selected by the skilled artisan without undue experimentation. In general, principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in Mammalian Cell Biotechnology: A Practical Approach, M. Butler, ed. (IRL Press, 1991).
Methods of transfection are known to the ordinarily skilled artisan, for example, CaPO4 treatment and electroporation. Depending on the host cell used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for prokaryotes or other cells that contain substantial cell-wall barriers. For mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology, 52:456-457 (1978) can be employed. General aspects of mammalian cell host system transformations have been described in U.S. Pat. No. 4,399,216. Transformations into yeast are typically carried out according to the method of Van Solingen et al., J. Bact., 130: 946 (1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA), 76: 3829 (1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g., polybrene or polyornithine, may also be used. For various techniques for transforming mammalian cells, see, Keown et al., Methods in Enzymology, 185: 527-537 (1990) and Mansour et al., Nature, 336: 348-352 (1988).
Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, or higher eukaryote cells. Suitable prokaryotes include, but are not limited to, eubacteria, such as Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli strain W3110 (ATCC 27,325); and K5772 (ATCC 53,635). Other suitable prokaryotic host cells include Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. subtilis and B. licheniformis (e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 Apr. 1989), Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are illustrative rather than limiting. Strain W3110 is one particularly preferred host or parent host because it is a common host strain for recombinant DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For example, strain W3110 may be modified to effect a genetic mutation in the genes encoding proteins endogenous to the host, with examples of such hosts including E. coli W3110 strain 1A2, which has the complete genotype tonA; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli W3110 strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompT kan.sup.r; E. coli W3110 strain 37D6, which has the complete genotype tona ptr3 phoA E15 (argF-lac)169 degP ompT rbs7 ilvG kan.sup.r; E. coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion mutation; and an E. coli strain having mutant periplasmic protease disclosed in U.S. Pat. No. 4,946,783 issued 7 Aug. 1990. Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid polymerase reactions, are suitable.
In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning or expression hosts for vectors encoding the polypeptide of interest. Saccharomyces cerevisiae is a commonly used lower eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature, 290: 140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Pat. No. 4,943,529; Fleer et al., Bio/Technology, 9: 968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et al., J. Bacteriol., 737 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K. wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al., Bio/Technology, 8:135 (1990)), K. thermotolerans, and K. marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol., 28: 265-278 [1988]); Candida; Trichoderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA, 76: 5259-5263 [1979]); Schwanniomyces such as Schwanniomyces occidentalis (EP 394,538 published 31 Oct. 1990); and filamentous fungi such as, e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 Jan. 1991), and Aspergillus hosts such as A. nidulans (Ballance et al., Biochem. Biophys. Res. Commun., 112: 284-289 [1983]; Tilburn et al., Gene, 26: 205-221 [1983]; Yelton et al., Proc. Natl. Acad. Sci. USA, 81: 1470-1474 [1984]) and A. niger (Kelly and Hynes, EMBO J., 4: 475-479 [1985]). Methylotropic yeasts are suitable herein and include, but are not limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistry of Methylotrophs, 269 (1982).
Suitable host cells for the expression of nucleic acid encoding glycosylated polypeptides of interest are derived from multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells. More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., J. Gen. Virol., 36: 59 (1977)); Chinese hamster ovary cells/−DHFR(CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. USA, 77:4216 (1980)); mouse sertoli cells (TM4, Mather, Biol. Reprod., 23:243-251 (1980)); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor (MMT 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in the art.
The nucleic acid (e.g., cDNA or genomic DNA) encoding the polypeptide of interest may be inserted into a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic acid sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an appropriate restriction endonuclease site(s) using techniques known in the art. Vector components generally include, but are not limited to, one or more of a signal sequence if the sequence is to be secreted, an origin of replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable vectors containing one or more of these components employs standard ligation techniques that are known to the skilled artisan.
The polypeptide of interest may be produced recombinantly not only directly, but also as a fusion polypeptide with a heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of the vector, or it may be a part of the DNA encoding the polypeptide of interest that is inserted into the vector. The signal sequence may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces.alpha.-factor leaders, the latter described in U.S. Pat. No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 362,179 published 4 Apr. 1990), or the signal described in WO 90/13646 published 15 Nov. 1990. In mammalian cell expression, mammalian signal sequences may be used to direct secretion of the protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral secretory leaders.
Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2.mu. plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV, or BPV) are useful for cloning vectors in mammalian cells.
Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli.
An example of suitable selectable markers for mammalian cells are those that enable the identification of cells competent to take up the nucleic acid encoding the polypeptide of interest such as DHFR or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared and propagated as described by Urlaub et al., Proc. Natl. Acad. Sci. USA, 77: 4216 (1980). A suitable selection gene for use in yeast is the trp 1 gene present in the yeast plasmid YRp7. Stinchcomb et al., Nature, 282: 39 (1979); Kingsman et al., Gene, 7: 141 (1979); Tschemper et al, Gene, 10: 157 (1980). The trp1 gene provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 44076 or PEP4-1. Jones, Genetics, 85: 12 (1977).
Expression and cloning vectors usually contain a promoter operably linked to the nucleic acid sequence encoding the polypeptide of interest to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well known. Promoters suitable for use with prokaryotic hosts include the .beta.-lactamase and lactose promoter systems (Chang et al., Nature, 275: 615 (1978); Goeddel et al., Nature, 281: 544 (1979)), alkaline phosphatase, a tryptophan (trp) promoter system (Goeddel, Nucleic Acids Res., 8:4057 (1980); EP 36,776), and hybrid promoters such as the tac promoter (deBoer et al., Proc. Natl. Acad. Sci. USA, 80: 21-25 (1983)). promoters for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked to the DNA encoding the polypeptide of interest.
Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem., 255: 2073 (1980)) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg., 7: 149 (1968); Holland, Biochemistry, 17: 4900 (1978)), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase.
Other yeast promoters that are inducible promoters having the additional advantage of transcription controlled by growth conditions are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for use in yeast expression are further described in EP 73,657.
Nucleic acid of interest transcription from vectors in mammalian host cells is controlled, for example, by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 Jul. 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus, and Simian Virus 40 (SV40); by heterologous mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter; and by heat-shock promoters, provided such promoters are compatible with the host cell systems.
Transcription of a DNA encoding the polypeptide of interest by higher eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from mammalian genes (globin, elastase, albumin, .alpha.-fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5′ or 3′ to the sequence coding for polypeptides of interest, but is preferably located at a site 5′ from the promoter.
Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5′ and, occasionally 3′, untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding polypeptide of interest.
Still other methods, vectors, and host cells suitable for adaptation to the synthesis of the of interest in recombinant vertebrate cell culture are described in Gething et al., Nature, 293: 620-625 (1981); Mantei et al., Nature, 281: 40-46 (1979); EP 117,060; and EP 117,058.
Forms of polypeptides of interest may be recovered from culture medium or from host cell lysates. If membrane-bound, it can be released from the membrane using a suitable detergent solution (e.g., TRITON-X™ 100) or by enzymatic cleavage. Cells employed in expression of nucleic acid encoding the polypeptide of interest can be disrupted by various physical or chemical means, such as freeze-thaw cycling, sonication, mechanical disruption, or cell-lysing agents. It may be desired to purify the polypeptide of interest from recombinant cell proteins or polypeptides. The following procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, Sephadex G-75; Protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns to bind epitope-tagged forms of the polypeptide of interest. Various methods of protein purification may be employed and such methods are known in the art and described, for example, in Deutscher, Methods in Enzymology, 182 (1990); Scopes, Protein Purification: Principles and Practice (Springer-Verlag: New York, 1982). The purification step(s) selected will depend, for example, on the nature of the production process used and the particular polypeptide produced.
Finally, specific embodiments for obtaining a nucleic acid encoding a D-aspartate ligase or a L,D-transpeptidase as defined throughout the present specification, inserting said nucleic acid in a suitable expression vector, and transfecting host cells with said vector in order to produce the corresponding protein are disclosed in the examples herein.
As detailed previously in the specification, this invention encompasses methods for the screening of candidate antibacterial substances that inhibit the activity of a D-aspartate ligase or a L,D-transpeptidase as defined herein.
However, this invention also encompasses methods for the screening of candidate antibacterial substances, that are based on the ability of said candidate substances to bind to a D-aspartate ligase or to a L,D-transpeptidase as defined herein, thus methods for the screening of potentially antibacterial substances
The binding assays can be performed in a variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art.
All binding assays for the screening of candidate antibacterial substances are common in that they comprise a step of contacting the candidate substance with a D-aspartate ligase or with a L,D-transpeptidase as defined herein, under conditions and for a time sufficient to allow these two components to interact.
These screening methods also comprise a step of detecting the formation of complexes between said D-aspartate ligase or said L,D-transpeptidase and said candidate antibacterial substances.
Thus, screening for antibacterial substances include the use of two partners, through measuring the binding between two partners, respectively (i) a D-aspartate ligase or a L,D-transpeptidase as defined herein and (ii) the candidate compound.
In binding assays, the interaction is binding and the complex formed between a D-aspartate ligase or a L,D-transpeptidase as defined above and the candidate substance that is tested can be isolated or detected in the reaction mixture. In a particular embodiment, (i) the D-aspartate ligase or the L,D-transpeptidase as defined above or (ii) the antibacterial candidate substance is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of the D-aspartate ligase or the L,D-transpeptidase as defined above and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific for the D-aspartate ligase or for the L,D-transpeptidase as defined above to be immobilized can be used to anchor it to a solid surface. The assay is performed by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized component, e.g., the coated surface containing the anchored component. When the reaction is complete, the non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are detected. When the originally non-immobilized component carries a detectable label, the detection of label immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized component does not carry a label, complexing can be detected, for example, by using a labeled antibody specifically binding the immobilized complex.
The binding of the antibacterial candidate substance to a D-aspartate ligase or to a L,D-transpeptidase as defined above may be performed through various assays, including traditional approaches, such as, e.g., cross-linking, co-immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers (Fields and Song, Nature (London), 340: 245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA, 88: 9578-9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA, 89: 5789-5793 (1991). Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast expression system described in the foregoing publications (generally referred to as the “two-hybrid system”) takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to the DNA-binding domain of GAL4, and another, in which candidate activating proteins are fused to the activation domain. The expression of a GAL1-lacZ reporter gene under control of a GAL4-activated promoter depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies containing interacting polypeptides are detected with a chromogenic substrate for .beta.-galactosidase. A complete kit (MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two-hybrid technique is commercially available from Clontech. This system can also be extended to map protein domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for these interactions.
Thus, another object of the invention consists of a method for the screening of antibacterial substances, wherein said method comprises the steps of:
The same method may also be defined as a method for the screening of antibacterial substances, wherein said method comprises the steps of:
The candidate substances, which may be screened according to the screening method above, may be of any kind, including, without being limited to, natural or synthetic compounds or molecules of biological origin such as polypeptides.
According to one embodiment of the screening method above, step b) comprises a step of proteolysis of said D-aspartate ligase or of said L,D-transpeptidase prior to the detection of a binding between the candidate inhibitor substance and said enzyme.
More precisely, according to this specific embodiment of step b) of the screening method described above, said D-aspartate ligase or said L,D-transpeptidase is incubated with a protease during a time period sufficient to generate a plurality of peptide fragments. Then, a step of detection of formation of eventual complexes between at least one of these peptide fragments and the candidate inhibitor compound is performed.
According to this specific embodiment of step b) of the screening method above, said step b) of assaying for the binding of said candidate substance to a D-aspartate ligase or to a L,D-transpeptidase as defined above comprises the following steps:
At step b1), any one of the proteases known in the art may be used. However, the most preferred protease consists of trypsin.
Trypsin digestion of said D-aspartate ligase or said L,D-transpeptidase is performed according to methods well known in the art.
Typically, said purified D-aspartate ligase or said purified L,D-transpeptidase in a suitable liquid buffer is subjected to trypsin digestion at 37° C. for a time period ranging from 1 h to 24 h, depending on the respective concentrations of said purified enzyme and of trypsin, respectively. Illustratively, said purified D-aspartate ligase or said purified L,D-transpeptidase is present in a suitable buffer selected from the group consisting of (i) a 1% (w/v) ammonium bicarbonate buffer, a 25 mM potassium buffer and (iii) a 50 mM Tris-HCl buffer at pH 8.0. Then, the proteolysis reaction is stopped, for example by adding (i) 1% trifluoroacetic acid solution or (ii) phenylmethyl sulfonyl fluoride (PMSF) solution to the resulting proteolysis mixture.
Then, at step b2), the various peptide fragments that are generated by trypsin proteolysis are subjected to a separation step.
In certain embodiments, said separation step may consist of an electrophoresis gel separation of the peptide fragments, using conventional electropheresis conditions that are well known when performing classical Western blotting peptide separation.
In certain other embodiments, said separation step consists of a step of High Pressure Liquid Chromatography (HPLC), for example using a LC-Packing® system, which is sold by Dionex, as used in the examples herein.
Then, at step b3), detection of the complexes eventually formed between one or more of the peptide fragments separated at step b2) and the inhibitor candidate substance is performed.
In most embodiments of step b3), detection of the complexes eventually formed between one or more of the peptide fragments separated at step b2) and the inhibitor candidate substance is performed by:
When step b2) consists of a conventional gel electrophoresis separation step, the differences between the two peptide separation patterns (i) and (ii) that are detected at step b3) consist of differences in the migration location on the gel of one or more peptide fragments onto which said inhibitor candidate compound is bound. Illustratively, the one or more peptides that are bound to the candidate substance generally migrate faster in the gel than the same unbound peptide(s).
When step b2) consists of an HPLC step, the differences between the two peptide separation patterns (i) and (ii) that are detected at step c3) consist of differences in the elution time of the one or more peptide fragments onto which said inhibitor candidate compound is bound.
In certain embodiments, said screening method may also comprises an additional step b4) of identification of the peptide fragment(s) onto which is bound said inhibitor candidate substance.
Usually, step b4) is performed by subjecting the peptide fragment(s) onto which is bound said inhibitor candidate substance to identification by mass spectrometry, for example by using an ion trap mass spectrometer as it is disclosed in the examples. Performing step b4) allows to identify precisely the binding location of said inhibitor candidate substance onto said D-aspartate ligase or onto said L,D-transpeptidase, so as to determine, notably, if said inhibitor candidate compound binds to the active site or close to the active site of the enzyme, or conversely binds at a protein location which is distant of the active site of said enzyme. This will allow to discriminate, notably, between competitive and non-competitive candidate inhibitor substances.
Two-hybrid screening methods are performed for the screening of candidate substances that consist of candidate polypeptides.
In a preferred embodiment, of the screening method, the candidate polypeptide is fused to the LexA binding domain, the D-aspartate ligase or the L,D-transpeptidase as defined above is fused to Gal 4 activator domain and step (b) is carried out by measuring the expression of a detectable marker gene placed under the control of a LexA regulation sequence that is responsive to the binding of a complete protein containing both the LexA binding domain and the Gal 4 activator domain. For example, the detectable marker gene placed under the control of a LexA regulation sequence can be the β-galactosidase gene or the HIS3 gene, as disclosed in the art.
In a particular embodiment of the screening method, the candidate compound consists of the expression product of a DNA insert contained in a phage vector, such as described by Parmley and Smith (1988). Specifically, random peptide libraries are used. The random DNA inserts encode for peptides of 8 to 20 amino acids in length (Oldenburg et al., 1992, Proc. Natl. Acad. Sci. USA, 85(8): 2444-2448; Valadon et al., 1996, J Mol Biol, 261: 11-22; Lucas, 1994, In: Development and Clinical Uses of Haemophilus b Conjugate; Westerink, 1995, Proc. Natl. Acad. Sci. USA, 92: 4021-4025; Felici et al., 1991, J Mol Biol, 222: 301-310). According to this particular embodiment, the recombinant phages expressing a polypeptide that specifically binds to a D-aspartate ligase or to a L,D-transpeptidase as defined above, are retained as expressing a candidate substance for use in the screening method above.
More precisely, In a first preferred embodiment of the screening method above, the screening system used in step (b) includes the use of a Two-hybrid screening assay. The yeast two-hybrid system is designed to study protein-protein interactions in vivo and relies upon the fusion of a bait protein to the DNA binding domain of the yeast Gal4 protein. This technique is described in the U.S. Pat. No. 5,667,973.
The general procedure of the two-hybrid assay is described hereafter. In an illustrative embodiment, the polynucleotide encoding the D-aspartate ligase or to the L,D-transpeptidase as defined above is fused to a polynucleotide encoding the DNA binding domain of the Gal4 protein, the fused protein being inserted in a suitable expression vector, for example pAS2 or pM3.
Then, the polynucleotide encoding the candidate polypeptide is fused to a nucleotide sequence in a second expression vector that encodes the activation domain of the Gal4 protein.
The two expression plasmids are transformed into yeast cells and the transformed yeast cells are plated on a selection culture medium which selects for expression of selectable markers on each of the expression vectors as well as GAL4 dependent expression of the HIS3 gene. Transformants capable of growing on medium lacking histidine are screened for gal4 dependent LacZ expression. Those cells which are positive in the histidine selection and the Lac Z assay denote the occurrence of an interaction between the D-aspartate ligase or the L,D-transpeptidase as defined above and the candidate polypeptide and allow to quantify the binding of the two protein partners.
Since its original description, the yeast two-hybrid system has been used extensively to identify protein-protein interactions from many different organisms. Simultaneously, a number of variations on a theme based on the original concept have been described. The original configuration of the two-hybrid fusion proteins was modified to expand the range of possible protein-protein interactions that could be analyzed. For example, systems were developed to detect trimeric interactions. Finally, the original concept was turned upside down and ‘reverse n-hybrid systems’ were developed to identify peptides or small molecules that dissociate macromolecular interactions (Vidal et al., 1999, Yeast forward and reverse ‘n’-hybrid systems. Nucleic Acids Res. 1999 Feb. 15; 27(4):919-29). These variations in the two-hybrid system can be applied to the disruption of the interaction between candidates antibacterial polypeptides and a D-aspartate ligase a L,D-transpeptidase as defined above and enters in the scope of the present invention.
In another preferred embodiment, of the screening method according to the invention, step (b) consists of subjecting to a gel migration assay the mixture obtained at the end of step (a) and then measuring the binding of the candidate polypeptide with the D-aspartate ligase or with the L,D-transpeptidase as defined above by performing a detection of the complexes formed between the candidate polypeptide and said D-aspartate ligase or said L,D-transpeptidase as defined above.
The gel migration assay can be carried out by conventional widely used western blot techniques that are well known from the one skilled in the art.
The detection of the complexes formed between the candidate polypeptide and the D-aspartate ligase or the L,D-transpeptidase as defined above can be easily observed by determining the stain position (protein bands) corresponding to the proteins analysed since the apparent molecular weight of a protein changes if it is in a complex.
On one hand, the stains (protein bands) corresponding to the proteins submitted to the gel migration assay can be detected by specific antibodies for example antibodies specifically directed against the D-aspartate ligase or the L,D-transpeptidase as defined above or against the candidate polypeptide, if the latter are available. Alternatively, the candidate polypeptide or the D-aspartate ligase or the L,D-transpeptidase as defined above can be tagged for an easier revelation of the gel, for example by fusion to GST, HA, poly Histidine chain, or other probes in order to facilitate the identification of the different protein on the gel, according to widely known techniques.
In another preferred embodiment of the screening method above, the screening system used in step (b) includes the use of an optical biosensor such as described by Edwards and Leatherbarrow (1997, Analytical Biochemistry, 246: 1-6) or also by Szabo et al. (1995, Curr. Opinion Struct. Biol., 5(5): 699-705). This technique permits the detection of interactions between molecule in real time, without the need of labelled molecules. This technique is based on the surface plasmon resonance (SPR) phenomenon. Briefly, a first protein partner molecule, for example the candidate polypeptide, is attached to a surface (such as a carboxymethyl dextran matrix). Then, the second protein partner molecule, in this case the D-aspartate ligase or the L,D-transpeptidase as defined above, is incubated with the first partner, in the presence or in the absence of the candidate compound to be tested and the binding, including the binding level, or the absence of binding between the first and second protein partner molecules is detected. For this purpose, a light beam is directed towards the side of the surface area of the substrate that does not contain the sample to be tested and is reflected by said surface. The SPR phenomenon causes a decrease in the intensity of the reflected light with a specific combination of angle and wavelength. The binding of the first and second protein partner molecules causes a change in the refraction index on the substrate surface, which change is detected as a change in the SPR signal.
According to the preferred embodiment of the screening method cited above, the “first partner” of the screening system consists of the substrate onto which the first protein partner molecule is immobilised, and the “second partner” of the screening system consists of the second partner protein molecule itself.
Candidate compounds for use in the screening method above can also be selected by any immunoaffinity chromatography technique using any chromatographic substrate onto which (i) the candidate polypeptide or (ii) the D-aspartate ligase or the L,D-transpeptidase as defined above, have previously been immobilised, according to techniques well known from the one skilled in the art.
In a preferred embodiment of the invention, the screening method includes the use of affinity chromatography.
The a D-aspartate ligase or the L,D-transpeptidase as defined above may be attached to a column using conventional techniques including chemical coupling to a suitable column matrix such as agarose, activated affinity media (for example, Affi Gel® sold by Bio-Rad), or other matrices familiar to those of skill in the art. In some embodiment of this method, the affinity column contains chimeric proteins in which the D-aspartate ligase or the L,D-transpeptidase as defined above, is fused to glutathion-s-transferase (GST). Then a candidate compound is applied to the affinity column. The amount of the candidate compound retained by the immobilized D-aspartate ligase or L,D-transpeptidase as defined above allows measuring the binding ability of said candidate compound on the enzyme and thus allows to assess the potential antibacterial activity of said candidate compound.
In another preferred embodiment of the screening method according to the invention, at step (b), the candidate substance and the D-aspartate ligase or the L,D-transpeptidase as defined above are labelled by a fluorophore. The measurement of the binding of the candidate compound to the D-aspartate ligase or to the L,D-transpeptidase as defined above, at step (b) consists of measuring a fluorescence energy transfer (FRET). Disruption of the interaction by a candidate compound is then followed by decrease or absence of fluorescence transfer. As an example, the one skilled in the art can make use of the TRACE technology of fluorescence transfer for Time Resolved Amplified Cryptate Emission developed by Leblanc V, et al. for measuring the FRET. This technique is based on the transfer of fluorescence from a donor (cryptate) to an acceptor of energy (XL665), when the two molecules are in close proximity in cell extracts.
Generally, the method for the screening of antibacterial substance that binds to a D-aspartate ligase or to a L,D-transpeptidase as defined above comprises further steps wherein the candidate substances that bind to the enzyme and which are thus positively selected at the end of step (b) of the screening method, are then assayed for their ability to actually inhibit said enzyme activity, by performing, as step (c) of said method, the corresponding screening method comprising a step of determining the ability of said candidate substances to inhibit the activity of a purified enzyme selected from the group consisting of:
As shown in the examples herein, a L,D-transpeptidase as defined above has been crystallized.
More precisely, it has been obtained a high quality crystal of the L,D-transpeptidase consisting of the amino acid sequence beginning at the amino acid residue located at position 119 and ending at the amino acid residue located at position 466 of the L,D-transpeptidase of SEQ ID No 13.
Said amino acid sequence 119-466 portion of SEQ ID No 13 may also be termed SEQ ID No 33 throughout the present specification. Usually, for the amino acid residue numbering of SEQ ID No 33 herein, it is referred to the numbering of the same amino acid residue found in the complete amino acid sequence of said L,D-transpeptidase of SEQ ID No 13, without any indication to the contrary.
A method for preparing said crystallized L,D-transpeptidase is fully disclosed in the examples herein.
Most preferably, for crystallization, said L,D-transpeptidase is equilibrated against a reservoir containing 12.5% PEG 2000, 100 mM ammonium sulfate, 300 mM NaCl and 100 mM sodium acetate trihydrate at pH 4.6.
Thus, another object of the invention consists of crystallized L,D-transpeptidase having the amino acid sequence of SEQ ID No 33.
It has been found according to the invention that the crystallized L,D-transpeptidase having the amino acid sequence of SEQ ID No 33 belong to the space group P3121 (a=b=115.976 and c=68.275) with one molecule per asymmetric unit and a solvent content of 64%.
Using a grown crystal of the L,D-transpeptidase according to the present invention, X-ray diffraction data can be collected by a variety of means in order to obtain the atomic coordinates of the molecules in the crystallized L,D-transpeptidase. In the examples herein, X-ray diffraction data were collected at the European Synchrotron Radiation Facility (ESRF) with the ESRF FIP-BM30A beamline. Then, the X-ray diffraction data were processed with the CCP4 program suite (containing the softwares named MOSFLM and SCALA).
With the aid of specifically designed computer software, such crystallographic data can be used to generate a three dimensional structure of the L,D-transpeptidase molecule. Various methods used to generate and refine a three dimensional structure of a molecular structure are well known to those skilled in the art, and include, without limitation, multiwavelength anomalous dispersion (MAD), single wavelength anomalous dispersion (SAD), multiple isomorphous replacement, reciprocal space solvent flattening, molecular replacement, and single isomorphous replacement with anomalous scattering (SIRAS).
The method for determining the structure of the L,D-transpeptidase disclosed in the examples herein consists of the single wavelength anomalous dispersion (SAD). The position of the three ordered Se atoms (out of a possible 5) were found using the CNS (Crystallography & NMR Software) software.
After density modification using the CNS SAD phase, the three-dimensional model of the L,D-transpeptidase was manually built using the program 0 described by Jones et al. (Jones, T. A., Zou, J. Y., Cowan, S. W. and Kjeldgaard, M. (1991); Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Cryst. A47, 110-119).
Most preferably, the structure is refined at 2.4 Å resolution using CNS, such as described by Brünger et al. (Brunger, A., Adams, P., Clore, G., DeLano, W., Gros, P., Grosse-Kunstleve, R., Jiang, J.-S., Kuszewski, J., Nilges, N., Pannu, N., et al. 1998. Crystallography and NMR system (CNS): A new software system for macromolecular structure determination, Acta Crystallogr. D 54: 905-921), with 20838 unique reflections (99.2% completeness).
The final three-dimensional model (Rcryst=22.0% and Rfree=25.7%; test set: 5% of the reflections) consists of residues 217-398 and 400 466 of SEQ ID No 13, one sulfate ion, one zinc ion and 295 water molecules. The 97 amino acid residues beginning at the amino acid residue located at position 119 and ending at the amino acid residue located at position 216 of SEQ ID No 13 could not be located in the map.
Most preferably, the final model of the three-dimensional structure of said L,D-transpeptidase, or of the 217-466 amino acid sequence thereof, is validated using the PROCHECK® software described by Laskowski et al. (Laskowski, R. A., McArthur, M. W., Moss, D. S., & Thornton, J. M. (1993). PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Cryst. 26, 283-291).
Ramachandran analysis has indicated that, for the three-dimensional model of the L,D-transpeptidase that is disclosed herein, 83.3% of the amino acid residues are in the most favored region, 15.3% of the amino acid residues are additionally allowed, and 1.4% of the amino acid residues are generously allowed.
The pertinency of the three-dimensional structure of the LD-transpeptidase (217-466 of SEQ ID No 13) has been performed by comparing the covalent bond distances and angles found from the X-ray diffraction data with standard values of covalent bond distances and angles for proteins, such as those standard values found in the book of Engh and Huber (Engh R. A. and Huber R., <<accurate Bond and Angle Parameters for X-ray Protein structure refinement>>, Acta Crytsallogr, A47 (1991): 392-400).
It was found that the three-dimensional model of the crystallized L,D-transpeptidase of the invention (i) has a root mean square deviation of bonds of 0.008 Å in respect to standard values and (ii) has a root mean square deviation of angles of 1.2° in respect to standard values, which are the total average deviation values that are found in standard dictionnaries, including that of Engh and Huber that is referred to above.
As it can be noticed, the structural coordinates of the crystallized L,D-transpeptidase (217-466 of SEQ ID No 13) begin, in Table 3, with the amino acid residue LYS217, because of too much poor structural data concerning the amino acid residues 119-216 of said crystallized enzyme.
The X-ray diffraction data generated from the crystallized L,D-transpeptidase of SEQ ID No 33 has allowed to determine the spatial location of every atom of the polypeptide having the amino acid sequence beginning at the amino acid residue located at position 217 and ending at the amino acid residue located at position 466 of the L,D-transpeptidase of SEQ ID No 13
The cartesian coordinates which define one and every structural conformation feature of the L,D-transpeptidase (217-466 of SEQ ID No 13) of the invention are listed in Table 3.
In Table 3:
As used herein, “structural coordinates” are the cartesian coordinates corresponding to an atom's spatial relationship to other atoms in a molecule or molecular complex. Various software programs allow for the graphical representation of a set of structural coordinates of the present invention may be modified from the original sets provided in Table 3 by mathematical manipulation, such as by inversion or integer additions or subtractions. As such, it is recognised that the structural coordinates of the present invention are relative, and are in no way specifically limited by the actual x, y, z coordinates in Table 3.
As used herein, “Root mean square deviation” is the square root of the arithmetic mean of the squares of the deviations from the mean, and is a way of expressing deviation or variation from the structural coordinates described herein. The present invention includes all embodiments comprising conservative substitutions of the noted amino acid residues resulting in the same structural coordinates within the stated root mean square deviation.
It will be obvious to the one skilled in the art that the numbering of the amino acid residues of the crystallized L,D-transpeptidase defined herein may be different than set forth herein, and may contain certain conservative amino acid substitutions that yield similar three-dimensional structures as those defined in Table 3 herein. Corresponding amino acids and conservative substitutions are easily identified by visual inspection of the relevant amino acid sequences or by using commercially available homology modelling software programs, such as MODELLER (MSI, San Diego, Calif., USA).
As used herein, “conservative substitutions” are those amino acid substitutions which are functionally equivalent to the substituted amino acid residue, either by way of having similar polarity, steric arrangement, or by belonging to the same class as the substituted residue (e.g. hydrophobic, acidic or basic), and includes substitutions having an inconsequential effect on the three dimensional structure of the crystallized protein complex of the invention with respect to the use of said structures for the identification of ligand compounds which interact with the catalytic site of the L,D-transpeptidase of SEQ iD No 33 or of SEQ ID No 13, more particularly, inhibitor compounds, for molecular replacement analyses and/or for homology modelling.
As shown in the examples, the various amino acid residues from the catalytic site of the L,D-transpeptidase of SEQ ID No 33 or of SEQ ID No 13 that delineate the inner space area of said catalytic site have been determined, using the structural coordinates of the crystallized protein complex which are set forth in Table 3.
The crystallized L,D-transpeptidase of SEQ ID No 33, and more specifically the inner space area of its catalytic site, can also be defined exclusively as respect to the various amino acid residues which are involved in delineating it.
From the three-dimensional structure of the crystallized L,D-transpeptidase of SEQ ID No 33 that can be determined from the structure coordinates of (217-466 of SEQ ID No 13) found in Table 3, the structure of the catalytic site of said enzyme has been deciphered. The structural data strictly corroborate the biological data found by directed mutagenesis.
It has been found that the most important amino acid residues contained in the active site are SER439 and CYS442, respectively, which are phylogenetically conserved residues on the basis of which a specific protein family can be defined.
It has also been found that two additional amino acid residues are important in the active site, respectively HIS421 and HIS440.
Another object of the invention consists of a crystallized L,D-transpeptidase of SEQ ID No 33, a three-dimensional atomic structure of the catalytic sites is defined by a set of structure coordinates having a root mean square deviation of not more than 1.5 Å from the set of structure coordinates corresponding to amino acid residues HIS421, SER439, HIS440 and CYS442 according to Table 3.
It has also been found according to the invention that the whole amino acid residues that delineate the catalytic site of the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33 are encompassed within the polypeptide beginning at the amino acid residue ILE368 and ending at the amino acid residue MET450 of SEQ ID No 13.
Thus, the three-dimensional structure of the catalytic site of the crystallized L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33 comprises, in addition to the set of data corresponding to the relative structural coordinates of amino acid residues HIS421, SER439, HIS440 and CYS442 according to Table 3, equally a set of data corresponding to the relative structural coordinates of one or more of the following amino acid residues: ILE368, VAL369, SER370, GLY371, LYS372, PRO373, THR374, THR375, PRO376, THR377, PRO378; ALA379, GLY380, VAL381, PHE382, TYR383, VAL384, TRP385, ASN386, LYS387, GLU388, GLU389, ASP390, ALA391, THR392, LEU393, LYS394, GLY395, THR396, ASN397, ASP398, ASP399, GLY400, THR401, PRO402, TYR403, GLU404, SER405, PRO406, VAL407, ASN408, TYR409, TRP410, MET411, PRO412, ILE413, ASP414, TRP415, THR416, GLY417, VAL418, GLY419, ILE420, ASP422, SER423, ASP424, TRP425, GLN426, PRO427, GLU428, TYR429, GLY430, GLY431, ASP432, LEU433, TRP434, LYS435, THR436, ARG437, GLY438, GLY441, ILE443, ASN444, THR445, PRO446, PRO447, SER448, VAL449, MET450, LYS451, GLU452, LEU453, PHE454, GLY455, MET456, VAL457, GLU458, LYS459, GLY460, THR461, PRO462, VAL463, LEU464, VAL465 and PHE466.
The availability, according to the present invention, of the whole structural coordinates of the 217-466 portion of the L,D-transpeptidase of SEQ ID No 13 described above, and specifically of the structural coordinates of the various amino acid residues which are involved for forming the catalytic site, allows the one skilled in the art to generate models of docking compounds of a known chemical structure within said catalytic site and select those compounds that are potential or actual antibacterial compounds, that is compounds that potentially inhibit said L,D-transpeptidase.
More particularly, according to the invention, a compound which will behave as an inhibitor of the L,D-transpeptidase of SEQ ID No 13 or of SEQ id No 33 consists of a compound that, when docked in its catalytic site, either:
In another aspect, the present invention is directed to a method for identifying a ligand compound, more particularly an inhibitor of the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33, said method comprising a step of docking or fitting the three-dimensional structure of a candidate compound with the three-dimensional structure of the catalytic site of the L,D-transpeptidase of Seq ID No 13 or of SEQ ID No 33.
Thus, another object of the invention consists of a method for selecting a compound that fits in the catalytic site of the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33, wherein said method comprises the steps of:
A further object of the invention consists of a method for selecting an inhibitor compound for the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33, wherein said method comprises the steps of:
In order to further precise the class of compounds to which the selected ligand belongs, step b) may further comprise specific sub-steps wherein it is determined whether the compound, which has been primarily selected for its ability to interact with the catalytic site of the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33, further induces stabilisation or, in contrast, steric constraints onto chemical groups belonging to the amino acid residues involved in said catalytic site so as to stabilise the spatial conformation of the catalytic site or, in contrast, cause a change in the spatial conformation of the catalytic site that reduces or even blocks the catalytic activity of the L,D-transpeptidase.
According to a first aspect of the screening method above, the candidate ligand compound, more particularly the candidate inhibitor compound, is selected from a library of compounds previously synthesised.
According to a second aspect of the screening method above, the candidate ligand compound, more particularly the candidate inhibitor compound, is selected from compounds, the chemical structure of which is defined in a database, for example an electronic database.
According to a third embodiment of the screening method above, the candidate ligand compound, more particularly the candidate inhibitor compound, is conceived de novo, by taking into account the spatial conformation stabilisation or, in contrast, the spatial conformation changes, that chemical group(s) of said compound may cause, when docked within the catalytic site of the L,D-transpeptidase of SEQ ID No 33. Indeed, after its de novo conception, and if positively selected, said candidate ligand compound, more particularly said candidate inhibitor compound, can be actually chemically synthesised.
Generally, computational methods for designing an inhibitor of the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33 determine which amino acid or which amino acids of the catalytic site interact with a chemical moiety (at least one) of the ligand compound using a three dimensional model of the crystallized protein complex of the invention, the structural coordinates of which are set forth in Table 3.
These computational methods are particularly useful in designing an inhibitor of the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33, wherein said inhibitor compound has a chemical moiety, or chemical group(s) that allow the formation of hydrogen bonds with the side chains of the amino acid residues that are involved in the catalytic site, and more particularly the side chains of HIS421 (NH group), SER439 (OH group), HIS440 (NH group) and CYS442 (SH group).
Methods for Docking or Fitting Candidate Compounds with the Catalytic Site of Said L,D-Transpeptidase.
The three-dimensional structure of the L,D-transpeptidase of SEQ ID No 33 will greatly aid in the development of inhibitors of L,D transpeptidases that can be used as antibacterial substances. In addition, said L,D-transpeptidase is overall well suited to modern methods including three dimensional structure elucidation and combinatorial chemistry such as those disclosed in the European patent No EP 335 628 and the U.S. Pat. No. 5,463,564, which are incorporated herein by reference. Computer programs that use crystallographic data when practising the present invention will enable the rational design of ligand to, particularly inhibitor of, the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33.
Generally, the computational method of designing a synthetic ligand to, particularly a synthetic inhibitor of, the L,D-transpeptidase of SEQ ID No 13 or of SEQ ID No 33 comprises two steps:
As shown herein, interacting amino acids form contacts with the ligand and the center of the atoms of the interacting amino acids are usually 2 to 4 angstroms away from the center of the atoms of the ligand. Generally these distances are determined by computer as discussed herein and as it is described by Mc Ree (1993), however distances can be determined manually once the three dimensional model is made. Also, it has been described how performing stereochemical figures of three dimensional models using for instance the program Bobscript on the Wold Wide Web at strubi.ox.ac.uk/bobscript/doc24.html#StereoPS.
More commonly, the atoms of the ligand and the atoms of interacting amino acids are 3 to 4 angstroms apart. The invention can be practiced by repeating step 1 and 2 above to refine the fit of the ligand to the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13) and to determine a better ligand, specifically an inhibitor compound. The three dimensional model of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13) can be represented in two dimensions to determine which amino acids contact the ligand and to select a position on the ligand for chemical modification and changing the interaction with a particular amino acid compared to that before chemical modification. The chemical modification may be made using a computer, manually using a two dimensional representation of the three dimensional model or by chemically synthesizing the ligand. The ligand can also interact with distant amino acids after chemical modification of the ligand to create a new ligand. Distant amino acids are generally not in contact with the ligand before chemical modification. A chemical modification can change the structure of the ligand to make a new ligand that interacts with a distant amino acid usually at least 4.5 angstroms away from the ligand, preferably wherein said first chemical moiety is 6 to 12 angstroms away from a distant amino acid. Often distant amino acids will not line the surface of the binding activity for the ligand, they are too far away from the ligand to be part of a pocket or binding cavity. The interaction between a catalytic site amino acid and an atom of a ligand can be made by any force or attraction described in nature. Usually the interaction between the atom of the amino acid and the ligand will be the result of a hydrogen bonding interaction, charge interaction, hydrophobic effect, van der Waals interaction or dipole interaction. In the case of the hydrophobic effect it is recognized that is not a per se interaction between the amino acid and ligand, but rather the usual result, in part, of the repulsion of water or other hydrophilic group from a hydrophobic surface. Reducing or enhancing the interaction of the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13) and a ligand can be measured by calculating or testing binding energies, computationally or using thermodynamic or kinetic methods as known in the art.
Chemical modifications will often enhance or reduce interactions of an atom of a catalytic site amino acid and an atom of the ligand. Steric hindrance will be a common means of changing the interaction of the catalytic cavity with the ligand.
However, as will be understood by those of skill in the art upon this disclosure, other structure based design methods can be used. Various computational structure based design methods have been disclosed in the art.
For example, a number computer modeling systems are available in which the sequence of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13) structure, particularly of the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13) structure (i.e., atomic coordinates of the catalytic site, the bond and dihedral angles, and distances between atoms in the active site such as provided in Table 3) can be input. This computer system then generates the structural details of the site in which a potential ligand compound binds so that complementary structural details of the potential modulators can be determined. Design in these modelling systems is generally based upon the compound being capable of physically and structurally associating with the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13). In addition, the compound must be able to assume a conformation that allows it to associate with said catalytic site.
Methods for screening chemical entities or fragments for their ability to associate with the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13), and more particularly the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13), are also well known. Often these methods begin by visual inspection of the active site on the computer screen. Selected fragments or chemical entities are then positioned with the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13). Docking is accomplished using software such as QUANTA and SYBYL, followed by energy minimization and molecular dynamics with standard molecular mechanic forcefields such as CHARMM and AMBER. Examples of computer programs which assist in the selection of chemical fragment or chemical entities useful in the present invention include, but are not limited to, GRID (Goodford, P. J. J. Med. Chem. 1985 28: 849-857), AUTODOCK (Goodsell, D. S. and Olsen, A. J. Proteins, Structure, Functions, and Genetics 1990 8: 195-202), and DOCK (Kunts et al. J. Mol. Biol. 1982 161:269-288).
Upon selection of preferred chemical entities or fragments, their relationship to each other and the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13) can be visualized and then assembled into a single potential modulator. Programs useful in assembling the individual chemical entities include, but are not limited to, CAVEAT (Bartlett et al. Molecular Recognition in Chemical and Biological Problems Special Publication, Royal Chem. Soc. 78, 00. 182-196 (1989)) and 3D Database systems (Martin, Y. C. J. Med. Chem. 1992 35:2145-2154).
Alternatively, compounds may be designed de novo using either an empty active site or optionally including some portion of a known inhibitor. Methods of this type of design include, but are not limited to LUDI (Bohm H-J, J. Comp. Aid. Molec. Design 1992 6:61-78) and LeapFrog (Tripos Associates, St. Louis Mo.).
For “fitting” or “docking” a ligand compound to the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13), starting from the structural coordinates of the protein complex of the invention which are set forth in Table 3, the one skilled in the art may use known techniques such as those reviewed by Sheridan et al. (1987), Goodford (1984), Beddell (1985), Hol (1986), Verlinde et al. (1994) and Blundell et al. (1987).
Fitting or docking a ligand compound to the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13), starting from the structural coordinates of the protein complex of the invention which are set forth in Table 3, can also be performed using software such as QUANTA and SYBYL, followed by energy minimisation and molecular dynamics with standard molecular mechanic force fields such as CHARMM and AMBER. Examples of computer programs which assist in the selection of chemical fragment or chemical entities useful in the present invention include, but are not limited to, GRID (Goodford, P. J. J. Med. Chem. 1985 28: 849-857), AUTODOCK (Goodsell, D. S. and Olsen, A. J. Proteins, Structure, Functions, and Genetics 1990 8: 195-202), and DOCK (Kunts et al. J. Mol. Biol. 1982 161:269-288).
Most preferably, according to the invention, the structure determination of a crystallized protein complex, whether free of a ligand compound or under the form of a complex with a ligand compound, is performed by molecular replacement using AMoRe, as described by Navaza et al. (1994) with the crystallized L,D-transpeptidase that is described herein as the search model.
Use of a computer program has two main goals: complex prediction and virtual screening.
In the first approach (complex prediction), one starts from a small molecule selected on the basis of a visual examination of the ligand-binding pocket as revealed by X-ray crystallography or predicted from homology modelling. Indeed, the knowledge of the ligand-binding pocket gives indications about the size, the shape, and putative anchoring groups of the ligand. Once a suitable candidate is selected, its molecular model can be built thanks to modules of programs such as the QUANTA Molecular Modeling Package (Accelrys, San Diego, Calif., USA). Then the putative ligand is docked manually in the ligand-binding pocket by the one skilled in the art to evaluate its suitability as a candidate ligand, based on:
This procedure can be referred to as “manual” design.
In an improved procedure, the position of the manually docked ligand in the catalytic site is optimised through the use of an energy minimization algorithm such as the one provided in CNS (Brunger, A. T. et al. (1998) “Crystallography and NMR system (CNS): A new software system for macromolecular structure determination” Acta Cryst. D54: 905-921). In an even further improved procedure, docking programs are used to predict the geometry of the protein-ligand complex and estimates the binding affinity. Programs that perform flexible protein-ligand docking include GOLD (Jones et al. (1995) J. Mol. Biol. 245:43-53), FlexX (Rarey, M. et al. (1995) “Time-efficient docking of flexible ligands into active sites of proteins” Proc. Int. Conf. Intell. Syst. Mol. Biol. 3:300-308, AAAI Press, Menlo Park, Calif., USA), and Dock (Ewing, T. J. A. and Kuntz, I.D. (1997) “Critical evaluation of search algorithms for automated molecular docking and database screening” J. Comput. Chem. 18:1175-1189). The SuperStar program (Verdonk, M. L. et al. (1999) “A knowledge-based approach for identifying interaction sites in proteins” J. Mol. Biol. 289; 1093-1108) is used for the prediction of favourable interaction sites in proteins.
In the second approach (virtual screening), a more advanced procedure, the computer program is used to search a whole small-molecule database (see for instance: Makino, S. and Kuntz, I.D. (1997) “Automated flexible ligand docking method and its application for database search” J. Comp. Chem. 18:1812-1825).
Further Characterization as L,D-Transpeptidase Inhibitors of the Compounds that are Positiviely Selected at the End of Step b) of the Method.
Once a ligand has been selected on the basis of its predicted binding to the receptor through docking studies as described above, it can be validated according to any of the methods below:
(i) Detecting of the direct binding of the ligand to the catalytic site of the L,D-transpeptidase of SEQ ID No 33, that can be demonstrated by electrospray ionisation mass spectrometry (ESI MS) under non-denaturing conditions, a technique allowing the detection of non-covalent compexes (Loo, J. A., (1997) “Studying noncovalent protein complexes by electrospray ionisation mass spectrometry” Mass Spectrom, Rev. 16: 1-23);
(ii) Measuring the L,D-transpeptidase activity in the presence of the candidate ligand.
In order to further characterise the biological activity of the compound which has been positively selected by performing steps (a) and (b) of the screening method above, it may be required to assay for the actual biological activity of said positively selected compound, in respect to the catalytic activity of the L,D-transpeptidase of SEQ ID No 13, or of the SEQ ID No 33 polypeptide portion thereof.
According to a first aspect, a further biological assay using said positively selected compound will confirm that said candidate compound that is positively selected at the end of step (b) of the method effectively reduces or blocks the catalytic activity of the L,D-transpeptidase.
Thus, in a further embodiment, the screening method above, said method further comprises the steps of:
In a most preferred embodiment, step d) of the screening method above consists of performing the screening method which has been previously described in detail in the present specification, which screening method makes use of a L,D-transpeptidase comprising a polypeptide having an amino acid sequence possessing at least 50% amino acid identity with the amino acid sequence of SEQ ID No 13, or a biologically active fragment thereof.
In a preferred embodiment of said screening method, in step d), the compound which has been selected in step b) is used as the candidate inhibitor compound in step a) of the biological screening method which is used in step d).
Thus, from above, assays are known and available for determining whether a ligand identified or designed according to the present invention actually inhibits L,D-transpeptidase activity. High-affinity, high-specificity ligands found in this way can then be used for in vitro and in vivo assays aiming at determining the antibacterial properties of said ligand, including its spectrum of activity against various bacteria strains, species or genus.
Finally, from above, assays are available for determining whether these ligands may be useful therapeutically.
The present invention further relates to a method for selecting a compound that interacts with the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13), wherein said method consists in:
The selection or the design of said candidate inhibitor compound is carried out by one of the methods which are extensively described above.
Thus, in a further embodiment, the screening method above, said method further comprises the steps of:
In a preferred embodiment of said screening method, in step c), the compound which has been selected in step a) is used as the candidate inhibitor compound in step b) of the biological screening method which is described in the present specification and in the examples.
As already described previously in the present specification, an object of the present invention consists of a method for selecting an inhibitor compound for the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13), wherein said method comprises the steps of:
In a specific embodiment, the screening method above, said method further comprises the steps of:
In a preferred embodiment of said screening method, in step d), the compound which has been selected in step c) is used as the candidate inhibitor compound in step b) of the biological screening method which is disclosed in the present specification.
According to a first aspect of the screening method above, the candidate ligand compound, more particularly the candidate inhibitor compound, is selected from a library of compounds previously synthesised.
According to a second aspect of the screening method above, the candidate ligand compound, more particularly the candidate agonist or antagonist compound, is selected from compounds, the chemical structure of which is defined in a database, for example an electronic database.
According to a third embodiment of the screening method above, the candidate ligand compound, more particularly the candidate inhibitor compound, is conceived de novo, by taking into account the spatial conformation stabilisation or, in contrast, the spatial conformation changes, that chemical group(s) of said compound may cause, when docked within the catalytic site of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13). Indeed, after its de novo conception, and if positively selected, said candidate ligand compound, more particularly said candidate inhibitor compound, can be actually chemically synthesised.
The present invention is also directed to a molecular model comprising:
The present invention is also directed to a machine-readable data storage medium, comprising a data storage material encoded with machine-readable data, wherein said machine-readable data consist of the X-ray structural coordinate data of the L,D-transpeptidase (119-466 or 217-466 of SEQ ID No 13) according to Table 3.
A used herein, a “machine-readable data storage medium” refers to any media which can be read and accessed directly by a computer. Such media include, but are not limited to, magnetic storage media such as floppy discs, hard disc storage medium and magnetic tape; optical storage media such as optical discs or CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/optical storage media.
This invention is also directed to a machine-readable data storage medium, comprising a data storage material encoded with machine-readable data which, when using a machine programmed with instructions for using said data, displays a graphical three-dimensional representation of a crystal of the catalytic site of the L,D-transpeptidase of SEQ ID No 33.
This invention is also directed to a machine-readable data storage medium, comprising a data storage material encoded with machine-readable data which, when using a machine programmed with instructions for using said data, displays a graphical three-dimensional representation of a crystal of the L,D-transpeptidase of SEQ ID No 33 that is complexed with one candidate inhibitor of the L,D-transpeptidase of SEQ ID No 33.
This invention is also directed to a system for generating a three-dimensional model of at least a portion of the L,D-transpeptidase of SEQ ID No 13, said system comprising:
In preferred embodiments of the system above, said system further comprises a display device for displaying the three-dimensional model generated by said processing unit. ASSESSMENT OF THE EX VIVO ACTIVITY OF THE INHIBITOR COMPOUNDS POSITIVELY SELECTED BY THE IN VITRO OR IN SILICO SCREENING METHODS DISCLOSED ABOVE
Inhibitor substances that have been positively selected at the end of any one of the screening methods that are previously described in the present specification may then be assayed for their ex vivo antibacterial activity, in a further stage of their selection as a useful antibacterial active ingredient of a pharmaceutical composition.
By “ex vivo” antibacterial activity, it is intended herein the antibacterial activity of a positively selected candidate compound against bacteria cells that are cultured in vitro.
Thus, any substance that has been shown to behave like an inhibitor of a D-aspartate ligase or of a L,D-transpeptidase, after positive selection at the end of any one of the screening methods that are disclosed previously in the present specification, may be further assayed for his ex vivo antibacterial activity.
Consequently, any one of the screening methods that are described above may comprise a further step of assaying the positively selected inhibitor substance for its ex vivo antibacterial activity.
Usually, said further step consists of preparing in vitro bacterial cultures and then adding to said bacterial cultures the candidate compound to be tested, before determining the ability of said candidate compound to block bacterial growth or even most preferably kill the cultured bacterial cells.
For assaying the ex vivo antibacterial activity of a candidate compound that has previously been shown to affect the catalytic activity of a D-aspartate ligase encompassed ny the present invention, bacteria cells that are cultured in vitro are preferably selected from the group consisting of Enterococcus faecium, Lactococcus lactis, Lactococcus cremoris SK11, Lactobacillus gasseri, Lactobacillus johnosonii NCC 533, Lactobacillus delbruckei Subsp. bulgaricus, Lactobacillus casei, Lactobacillus acidophilus, Lactobacillus brevis and Pediococcus pentosaceus.
Typically, bacterial cells are plated in Petri dishes containing the appropriate culture medium, generally in agar gel, at a cell number ranging from 10 to 103 bacterial cells, including from 10 to 102 bacterial cells. In certain embodiments, serials of bacterial cultures are prepared with increasing numbers of seeded bacterial cells.
Typically, the candidate compound to be tested is then added to the bacterial cultures, preferably with a serial of amounts of said candidate compounds for each series of a given plated cell number of bacterial cultures.
Then, the bacterial cultures are incubated in the appropriate culture conditions, for instance in a cell incubator at the appropriate temperature, and for an appropriate time period, for instance a culture time period ranging from 1 day to 4 days, before counting the resulting CFUs (Colony Forming Units), either manually under a light microscope or binocular lenses, or automatically using an appropriate apparatus.
Generally, appropriate control cultures are simultaneously performed, i.e; negative control cultures without the candidate substance and positive control cultures with an antibiotic that is known to be toxic against the cultured bacterial cells.
Finally, said candidate compound is positively selected at the end of the method if it reduces the number of CFUs, as compared with the number of CFUs found in the corresponding negative control cultures.
Thus, another object of the present invention consists of a method for the ex vivo screening of a candidate antibacterial substance which comprises the steps of:
Inhibitor substances that have been positively selected at the end of any one of the screening methods that are previously described in the present specification may then be assayed for their in vivo antibacterial activity, in a further stage of their selection as a useful antibacterial active ingredient of a pharmaceutical composition.
Thus, any substance that has been shown to behave like an inhibitor of a D-aspartate ligase or a L,D-transpeptidase, after positive selection at the end of any one of the screening methods that are disclosed previously in the present specification, may be further assayed for his in vivo antibacterial activity.
Consequently, any one of the screening methods that are described above may comprise a further step of assaying the positively selected inhibitor substance for its in vivo antibacterial activity.
Usually, said further step consists of administering the inhibitor substance to a mammal and then determining the antibacterial activity of said substance.
Mammals are preferably non human mammals, at least at the early stages of the assessment of the in vivo antibacterial effect of the inhibitor compound tested. However, at further stages, human volunteers may be administered with said inhibitor compound to confirm safety and pharmaceutical activity data previously obtained from non human mammals.
Non human mammals encompass rodents like mice, rats, rabbits, hamsters, guinea pigs. Non human mammals and also cats, dogs, pigs, veals, cows, sheep, goats. Non human mammals also encompass primates like macaques and baboons.
Thus, another object of the present invention consists of a method for the in vivo screening of a candidate antibacterial substance which comprises the steps of:
Preferably, serial of doses containing increasing amounts of the inhibitor substance are prepared in view of determining the antibacterial effective dose of said inhibitor substance in a mammal subjected to a bacterial infection. Generally, the ED50 dose is determined, which is the amount of the inhibitor substance that is effective against bacteria in 50% of the animals tested. In some embodiments, the ED50 value is determined for various distinct bacteria species, in order to assess the spectrum of the antibacterial activity.
In certain embodiments, it is made use of serial of doses of the inhibitor substance tested ranging from 1 ng to 10 mg per kilogram of body weight of the mammal that is administered therewith.
Several doses may comprise high amounts of said inhibitor substance, so as to assay for eventual toxic or lethal effects of said inhibitor substance and then determine the LD50 value, which is the amount of said inhibitor substance that is lethal for 50% of the mammal that has been administered therewith.
The inhibitor substance to be assayed may be used alone under the form of a solid or a liquid composition.
When the inhibitor substance is used alone, the solid composition is usually a particulate composition of said inhibitor substance, under the form of a powder.
When the inhibitor substance is used alone, the liquid composition is usually a physiologically compatible saline buffer, like Ringer's solution or Hank's solution, in which said inhibitor substance is dissolved or suspended.
In other embodiments, said inhibitor substance is combined with one or more pharmaceutically acceptable excipients for preparing a pre-pharmaceutical composition that is further administered to a mammal for carrying out the in vivo assay.
Before in vivo administration to a mammal, the inhibitor substances selected through any one of the in vitro screening methods above may be formulated under the form of pre-pharmaceutical compositions. The pre-pharmaceutical compositions can include, depending on the formulation desired, pharmaceutically acceptable, usually sterile, non-toxic carriers or diluents, which are defined as vehicles commonly used to formulate pharmaceutical compositions for animal or human administration. The diluent is selected so as not to affect the biological activity of the combination. Examples of such diluents are distilled water, physiological saline, Ringer's solutions, dextrose solution, and Hank's solution. In addition, the test composition or formulation may also include other carriers, adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like.
Compositions comprising such carriers can be formulated by well known conventional methods. These test compositions can be administered to the mammal at a suitable dose. Administration of the suitable compositions may be effected by different ways, e.g., by intravenous, intraperitoneal, subcutaneous, intramuscular, topical, intradermal, intranasal or intrabronchial administration. The dosage regimen will be determined by taking into account, notably, clinical factors. As is well known in the medical arts, dosages for any one mammal depends upon many factors, including the mammal's size, body surface area, age, the particular compound to be administered, sex, time and route of administration and general health. Administration of the suitable pre-pharmaceutical compositions may be effected by different ways, e.g., by intravenous, intraperitoneal, subcutaneous, intramuscular, topical or intradermal administration. If the regimen is a continuous infusion, it should also be in the range of 1 ng to 10 mg units per kilogram of body weight per minute, respectively. Progress can be monitored by periodic assessment. The pre-pharmaceutical compositions of the invention may be administered locally or systemically. Administration will generally be parenterally, e.g., intravenously. Preparations for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Parenteral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, for example, anti-oxidants, chelating agents, and inert gases and the like.
The inhibitor substances may be employed in powder or crystalline form, in liquid solution, or in suspension.
The injectable pre-pharmaceutical compositions may take such forms as suspensions, solutions, or emulsions in oily or aqueous vehicles, and may contain various formulating agents. Alternatively, the active ingredient may be in powder (lyophilized or non-lyophilized) form for reconstitution at the time of delivery with a suitable vehicle, such as sterile water. In injectable compositions, the carrier is typically comprised of sterile water, saline, or another injectable liquid, e.g., peanut oil for intramuscular injections. Also, various buffering agents, preservatives and the like can be included.
Topical applications may be formulated in carriers such as hydrophobic or hydrophilic base formulations to provide ointments, creams, lotions, in aqueous, oleaginous, or alcoholic liquids to form paints or in dry diluents to form powders.
Oral pre-pharmaceutical compositions may take such forms as tablets, capsules, oral suspensions and oral solutions. The oral compositions may utilize carriers such as conventional formulating agents and may include sustained release properties as well as rapid delivery forms.
In certain embodiments of the in vivo screening assay, the inhibitor substance is administered to a mammal which is the subject of a bacterial infection. For non human mammals, these animals have been injected with a composition containing bacteria prior to any administration of the inhibitor compound.
In certain other embodiments of the in vivo screening assay, non human animals are administered with the inhibitor compound to be tested prior to being injected with a composition containing bacteria.
For the in vivo assay, bacteria may be of various species, including Gram-positive and Gram-negative bacteria possessing a peptidoglycan cell wall. Bacteria of interest encompass streptococci, bacilli, micrococci, lactobacili, lactococci, enterococci and pediococci.
Generally, non human mammals are injected with a number of bacteria cells ranging from 1×102 to 1×1012 cells, including from 1×106 to 1×109 cells. Generally, bacteria cells that are injected to non human mammals are contained in a physiologically acceptable liquid solution, usually a saline solution like Ringer's solution or Hank's solution.
Generally, in the embodiment wherein the inhibitor compound to be tested is administered subsequently to bacterial inoculation, said inhibitor compound is administered form 1 hour to 96 hours after bacterial injection, including from 6 hours to 48 hours after bacterial injection.
Generally, in the embodiment wherein the inhibitor compound to be tested is administered prior to bacterial injection, said inhibitor compound is administered from 1 min to 3 hours prior to bacterial injection.
Generally, all animals are sacrificed at the end of the in vivo assay.
For determining the in vivo antibacterial activity of the inhibitor compound that is tested, blood or tissue samples of the tested animals are collected at determined time periods after administration of said inhibitor compound and bacteria counts are performed, using standard techniques, such as staining fixed slices of the collected tissue samples or plating the collected blood samples and counting the bacterial colonies formed.
Then, the values of the bacteria counts found for animals having been administered with increasing amounts of the inhibitor compound tested are compared with the value(s) of bacteria count(s) obtained from animals that hey been injected with the same number of bacteria cells but which have not been administered with said inhibitor compound.
Another object of to invention consists of any one of the D-aspartate ligases that are disclosed in the present specification, including the D-aspartate ligases of SEQ ID No 1 to 10, as well as any one of the biologically active fragments thereof.
A further object of the invention consists of any one of the L,D-transpeptidases that are disclosed in the present specification, including the L,D-transpeptidase of SEQ ID No 13, as well as any one of the biologically active fragments thereof, including those fragments of SEQ ID No 11 and SEQ ID No 12.
A still further object of the present invention consists of a nucleic acid that encodes a D-aspartate ligase or any one of the biologically active fragments thereof, including the nucleic acids of SEQ ID No 22 to 31 that encode the D-aspartate ligases of SEQ ID No 1 to 10, respectively.
A yet further object of the present invention consists of a nucleic acid that encodes a L,D-transpeptidase or any one of the biologically active fragments thereof, including the nucleic acid of SEQ ID No 32 that encodes the L,D-transpeptidase of SEQ ID No 13.
Both polypeptides or nucleic acids of the invention are preferably under a purified form.
Nucleic acids of the invention may be inserted into suitable vectors, particularly expression vectors, such as those that are described elsewhere in the present specification. Recombinant vectors comprising a nucleic acid as defined above that is inserted therein are also part of the invention.
Host cells, particularly prokaryotic cells including yeast cells and cells from E. coli that have been transfected or transformed by a nucleic acid above or a recombinant vector above form also part of the present invention. Such recombinant host cells are for example those that are described elsewhere in the present specification.
Polypeptides of the invention are preferably recombinantly produced, illustratively according to any one of the techniques of production of recombinant proteins that are disclosed elsewhere in the present specification.
A yet further object of the present invention consists of an antibody directed against a D-aspartate ligase or a L,D-transpeptidase that is disclosed in the present specification, or to a biologically active peptide fragment thereof. Any one of these antibodies may be useful for purifying or detecting the corresponding D-aspartate ligase or the corresponding L,D-transpeptidase.
There is no particular limitation on the antibodies encompassed by the present invention, as long as they can bind specifically to the desired D-aspartate ligase or the desired biologically active fragment thereof, or to the desired L,D-transpeptidase or the desired biologically active fragment thereof. It is possible to use mouse antibodies, rat antibodies, rabbit antibodies, sheep antibodies, chimeric antibodies, humanized antibodies, human antibodies and the like, as appropriate. Such antibodies may be polyclonal or monoclonal, but are preferably monoclonal because uniform antibody molecules can be produced stably. Polyclonal and monoclonal antibodies can be prepared in a manner well known to those skilled in the art.
In principle, monoclonal antibody-producing hybridomas can be prepared using known techniques, as follows. Namely, the desired antigen or the desired antigen-expressing cell is used as a sensitizing antigen and immunized in accordance with conventional procedures for immunization. The resulting immunocytes are then fused with known parent cells using conventional procedures for cell fusion, followed by selection of monoclonal antibody-producing cells (hybridomas) through conventional screening procedures. Preparation of hybridomas may be accomplished according to, for example, the method of Milstein et al. (Kohler, G. and Milstein, C., Methods Enzymol. (1981) 73:3-46). If an antigen used is less immunogenic, such an antigen may be conjugated with an immunogenic macromolecule (e.g., albumin) before use in immunization.
In addition, antibody genes are cloned from hybridomas, integrated into appropriate vectors, and then transformed into hosts to produce antibody molecules using gene recombination technology. The genetically recombinant antibodies thus produced may also be used in the present invention (see, e.g., Carl, A. K. Borrebaeck, James, W. Larrick, <<Therapeutic monoclonal antibodies>>, Published in the United Kingdom by MacMillan Publishers Ltd, 1990). More specifically, cDNA of antibody variable domains (V domains) is synthesized from hybridoma mRNA using reverse transcriptase. Upon obtaining DNA encoding the target antibody V domains, the DNA is ligated to DNA encoding desired antibody constant domains (C domains) and integrated into an expression vector. Alternatively, the DNA encoding the antibody V domains may be integrated into an expression vector carrying the DNA of the antibody C domains. The DNA construct is integrated into an expression vector such that it is expressed under control of an expression regulatory region, e.g., an enhancer or a promoter. Host cells are then transformed with this expression vector for antibody expression.
In a case where antibody genes are isolated and then transformed into appropriate hosts to produce antibodies, any suitable combination of host and expression vector can be used for this purpose. When eukaryotic cells are used as hosts, animal cells, plant cells and fungal cells may be used. Animal cells known for this purpose include (1) mammalian cells such as CHO, COS, myeloma, BHK (baby hamster kidney), HeLa and Vero, (2) amphibian cells such as Xenopus oocytes, and (3) insect cells such as sf9, sf21 and Tn5. Plant cells include those derived from Nicotiana plants (e.g., Nicotiana tabacum), which may be subjected to callus culture. Fungal cells include yeasts such as Saccharomyces (e.g., Saccharomyces serevisiae) and filamentous fungi such as Aspergillus (e.g., Aspergillus niger). When prokaryotic cells are used, there are production systems employing bacterial cells. Bacterial cells known for this purpose are E. coli and Bacillus subtilis. Antibodies can be obtained by introducing target antibody genes into these cells via transformation and then culturing the transformed cells in vitro.
The present invention also relates to compositions or kits for the screening of antibacterial substances.
In certain embodiments, said compositions or kits comprise a purified D-aspartate ligase or a purified L,D-transpeptidase, preferably under the form of a recombinant protein.
In said compositions or said kits, said D-aspartate ligase or said L,D-transpeptidase may be under a solid form or in a liquid form.
Solid forms encompass powder of said D-aspartate ligase or said L,D-transpeptidase under a lyophilized form.
Liquid forms encompass standard liquid solutions known in the art to be suitable for protein long time storage.
Preferably, said D-aspartate ligase or said L,D-transpeptidase is contained in a container such as a bottle, e.g. a plastic or a glass container.
In certain embodiments, each container comprises an amount of said D-aspartate ligase or said L,D-transpeptidase ranging from 1 ng to 10 mg, either in a solid or in a liquid form.
Further, said kits may comprise also one or more reagents, typically one or more substrate(s), necessary for assessing the enzyme activity of said D-aspartate ligase or of said L, D-transpeptidase.
Illustratively, if said kit comprises a container of D-aspartate ligase, then said kit may also comprise (i) a container comprising labeled aspartate such as [14C]aspartate or [3H] aspartate and/or (ii) a container comprising UDP-MurNac pentapeptide and UDP-MurNac tetrapeptide.
Illustratively, if said kit comprises a container of L,D-transpeptidase, then said kit may also comprise (i) a container comprising a donor compound consisting of a tetrapeptide preferably selected from the group consisting of L-Ala-D-Glu-L-Lys-D-Ala, Ac2-L-Lys-D-Ala and disaccharide-tetrapeptide(iAsn) and (ii) a container comprising an acceptor compound selected from the group consisting of a D-amino acid or a D-hydroxy acid.
In certain embodiments, a kit according to the invention comprises one or more of each of the containers described above.
The present invention is further illustrated by, without in any way being limited to, the examples hereunder.
Enterococcus faecium D359V8 was grown to an A650 nm of 0.7 in 20 litters of BHI broth (Difco, Elancourt, France), harvested by centrifugation (6 000×g for 20 min at 4° C.), and washed twice in 50 mM sodium phosphate buffer (pH 7.0). Bacteria were disrupted with glass beads in a refrigerated cell disintegrator (B. Braun, Sartorius, Palaiseau, France) for 3×30 s. The extract was centrifuged (7 000×g for 10 min at 4° C.) to remove cell debris and the supernatant was ultracentrifuged at 100 000×g for 1 h at 4° C. The supernatant was saved (cytoplasmic fraction) and the pellet was washed twice in 50 mM sodium phosphate buffer (pH 7.0) (membrane fraction). The protein contents were determined with the Bio-Rad protein assay (Bio-Rad, Ivry-Sur-seine, France).
A.2. In Vitro Addition of D-Aspartate onto UDP-MurNac-Pentapeptide-
The assay was performed in a total volume of 25 μl containing Tris-Hcl (100 mM, pH 8.5), MgCl2 (50 mM), ATP (20 mM), D-[14C]aspartic acid (0.11 mM, 55 mCi/mmol, Isobio, Fleurus, Belgium), UDP-MurNac-pentapeptide (0.15 mM) purified from S. aureus as previously described (Billot-Klein et al., 1997), and membrane or cytoplasmic extracts (60 μg). The reaction mixture was incubated 2 h at 37° C. and the reaction was stopped by boiling the samples for 3 min. D-[14C]aspartic acid was separated from [14C]UDP-MurNac-hexapeptide by descending paper chromatography (Whatman no. 4 filter paper) with a mobile phase composed of isobutyric acid and 1 M ammonia (5:3, vol/vol). The products of the reaction were also separated by reverse phase high-pressure liquid chromatography (rpHPLC) on a Hypersil C18 column (3 m, 4.6×250 nm, Interchrom, Montlugon, France) at a flow rate of 0.5 ml/min using isocratic elution (10 mM ammonium acetate, pH 5.0) and detected by the absorbance at 262 nm and liquid scintillation with a Radioflow Detector (LB508; Perkin Elmer, Courtaboeuf, France) coupled to the HPLC apparatus (L-62000A; Merck, Nogent-Sur-Marne, France).
A.3 Purification of the E. faecium D-Aspartate Ligase.
The D-aspartate ligase was partially purified from extracts of E. faecium D359V8 using three chromatographic steps and the D-aspartate ligase activity was detected in the fractions by the formation of [14C]UDP-MurNac-hexapeptide as described above. Briefly, soluble proteins from supernatant (1.3 g) were dialyzed against 50 mM phosphate buffer (pH 6.0) containing 200 mM NaCl (buffer A) and loaded onto a cation exchange HiLoad™ 26/10 SP Sepharose™ HP column (Amersham Pharmacia Boitech, Saclay, France) equilibrated in buffer A and elution was performed with a 0.2 to 2 M NaCl gradient in buffer A. Actives fractions, eluted between 0.8 and 0.9 M NaCl, were pooled (12 mg of proteins), concentrated with Polyethylene glycol (PEG), and loaded onto a gel filtration column (Superdex 75 HR26/60, Amersham Pharmacia Boitech) equilibrated with buffer A. Active fractions (1.8 mg of proteins) were loaded onto cation exchange HiTrap SP Sepharose Fastflow 1 ml column (Amersham Pharmacia Boitech) equilibrated in buffer A and elution was performed with a 0.2 to 2M NaCl gradient in buffer A. Proteins (200 μg), eluting between 0.8 and 0.95 M NaCl, were dialyzed against buffer A, concentrated by lyophilisation and deposited on a 12% SDS PAGE.
Candidate proteins were excised from the 12% SDS page, reduced with DTT (dithiothreitol, Sigma), alkylated with iodoacetamide and digested with trypsin (modified trypsin, sequencing grade, Roche) overnight at 37° C., using the automatic DIGESTPRO digester from ABIMED. Tryptic digests were dried under vacuum in a Speed-Vac. Samples were resuspended in 4 μl of 0.1% formic acid. They were then separated by HPLC in the LC-Packing® system, sold by Dionex at a flow rate of 200 nl/min using a gradient starting at 2% acetonitril (AcCN) in 0.1% formic acid for 1 min, increasing to 50% AcCN over 40 min, and finally increasing to 90% AcCN over 10 minutes. The LC system is connected to an ion trap mass spectrometer (LCQ Deca, Finnigan Corp, San Jose, Calif.), running Excalibur. The spray voltage was set at 2.1 kV, the temperature of the ion transfer tube was set at 180° C. and the normalized collision energies were set at 35% for MS/MS. The sequences of the uninterpreted spectra were identified by correlation with the peptide sequences present in the NCBI non redundant protein database, using the SpectrumMill program (Millenium Pharmaceuticals).
A.5. Cloning and Purification of the Aspartate Ligase in E. coli.
The ORF coding for the putative aspartate ligase gene of E. faecium, designated hereafter AsIfm, was amplified with primers AsI1 and AsI2. Primer AsI1 (GAGAGACCATGGTGAACAGTATTGAAAATGAAG—SEQ ID No 14) contained NcoI restriction site (bolded) and 21-bp of asI-5′ extremity. Primer AsI2 (CTCCATGGCTAGGATCCTTCTTTCACATGAAAATACTTTTTG—SEQ ID No 15) contained BamHI restriction site (bolded) and 25-bp of the asI-3′ end without stop codon. The asIfm sequence was amplified using Pfu Turbo DNA polymerase (Stratagene, La Jolla, Calif., USA) and E. faecium chromosomal DNA as template (Williamson et al., 1985). The PCR product was cloned into NcoI-BamHI-restricted pET2818 μlasmid, a derivative of pET2816 (Chastanet et al., 2005) generating pSJL1. This plasmid was introduced by electroporation into E. coli BL21 (DE3) harboring pREP4 μlasmid (Amrein et al., 1995). E. coli BL21(DE3) harboring pSJL1 was grown to an optical density at 600 nm of 0.7 under gentle shaking in 2 liters of BHI broth containing of kanamycin (50 μg/ml) and ampicillin (100 μg/ml). Isopropyl-β-D-thiogalactopyranoside (IPTG) was added (0.5 mM) and incubation was continued for 3.5 h. Bacteria were harvested by centrifugation (7 000×g for 20 min at 4° C.), washed in Tris-HCl 50 mM, pH 8.0 containing 150 mM of NaCl (buffer B) and resuspended in the same buffer. Bacteria were disrupted as previously described and the extract was centrifuged at 100 000×g for 1 h at 4° C. The supernatant was mixed with 1 ml of Ni2+-nitrilotriacetate-agarose resin (Qiagen, Courtabeuf, France) previously equilibrated with buffer B. After incubation overnight at 4° C., solution was loaded onto a poly-prep column (Bio-rad, Marnes-la-Coquette, France), resin was washed with 12 column volumes of buffer B and proteins were eluted with buffer B containing 250 mM of imidazole. Proteins eluted were dialyzed overnight at 4° C. against buffer A and loaded onto a HiTrap SP-sepharose fast flow (Pharmacia, Orsay, France) equilibrated with buffer A. Proteins were eluted with a gradient of NaCl (0.2-2M), concentrated against buffer B containing glycerol 50% and stored at −20° C. The purified protein was tested for the D-aspartate ligase activity as described above but using 2 μg of purified protein and 0.8 mM of UDP-MurNac-pentapeptide. To confirm its structure the synthesis of the hexapeptide was done in presence of non radio-active D-aspartate (3 mM) and samples of UDP-MurNAc-peptide products were isolated by rpHPLC, lyophilized, resuspended in water and analyzed by MS and MS/MS as previously described (Bouhss et al., 2002).
Antiserum against AsIfm was obtained by injection subcutaneously of 200 μg of purified protein in a rabbit and used in Western blotting experiments carried out as previously described (Towbin et al., 1979).
A.6. Heterospecific Expression of the AsIfm Gene in E. faecalis.
The shuttle vector (pJEH11) was constructed by amplification of the chloramphenicol acetyl transferase (CAT) gene from pNJ2 μlasmid with primers pJE1 and pJE2 (Arbeloa et al., 2004).
contained SacI restriction site (italicized), VanY Shine-Dagarno sequence (from E. faecium BM4107) underlined and NcoI restriction site bolded.
contained a XbaI restriction site (italicized), a stop codon, a BglII restriction site (bolded), a stop codon, 6 histidine codons (underlined) and a BamHI restriction site (bolded and italicized). The PCR product was digested with SacI and XbaI enzymes and cloned into SacI-XbaI digested pAT392 vector generating pJEH11 μlasmid. The NcoI-BamHI fragment of pSJL1 containing asIfm open reading frame was cloned under the control of the p2 promoter into NcoI-BamHI restricted pJEH11 generating pSJL2 μlasmid. This vector was introduced into E. faecalis JH2.2 by electroporation and clones were selected on BHI-agar plates containing 256 μg/ml of gentamicin.
E. faecalis JH2-2/pSJL2asIfm and of the parental strain JH2-2/pJEH11 were grown at 37° C. to an optical density of 0.7 in 250 ml of BHI broth, containing or not D-aspartate (50 mM) (Sigma-Aldrich). Peptidoglycan was extracted with 4% SDS and muropeptides obtained as previously described (Arbeloa et al., 2004; Mainardi et al., 1998). Lactoyl peptide peptidoglycan fragments were produced and separated by rp-HPLC as previously described (Arbeloa et al., 2004). The relative abundance of peptidoglycan fragments was estimated as the percentage of the total integrated area of the identified peaks. The peaks were individually collected, lyophilized and dissolved in 100 μl of water. The mass of the peptidoglycan fragments were determined using an electrospray time-of-flight mass spectrometer operating in positive mode (Qstar Pulsar I, Applied Biosystem, Courtaboeuf, France) (Arbeloa et al., 2004b). The determination of the structures of the muropeptides was performed by fragmentation. The ions were selected based on the m/z value ([M+H]1+) in the high resolution mode, and fragmentation was performed with nitrogen as collision gas with an energy of 36-40 eV.
We first tested if the aspartate ligase activity was found in the membrane or the cytoplasmic extracts obtained from 20 liters culture of E. faecium D359V8. The assay was performed with 60 μg of membranes or cytoplasmic extracts in presence of D-[14C]aspartic acid, UDP-MurNac-pentapeptide, MgCl2 and ATP. After 2 hours at 37° C., the percentage of conversion was about 5% in the different extracts and both paper (
Since the D-aspartate ligase activity present in the cytoplasmic extracts represented almost 50% of the total activity it was used for further purification of the enzyme. To overcome the precipitation of the protein, all the purification steps were performed at an ionic strength above 200 mM NaCl. A partially purified preparation enriched in D-aspartate ligase activity was obtained from 1.3 gram of soluble proteins by 3 chromatography steps. LC-MS-MS was performed on different candidate proteins excised from a 12% SDS page. Among them, a 50 kDa protein with a ATP grasp motif (Galperin et al., 1997) was identified as the most likely candidate for the D-aspartate ligase from the protein bank deduced from the incomplete genome of E. faecium (Enterococcus faecium at NCBI: Efae 03003049).
The gene (asIfm) encoding the putative D-aspartate ligase was amplified, cloned and introduced into E. coli BL21 The presence of C-terminal six-His tag allowed the purification of the D-aspartate ligase in two steps after successive chromatography on a nickel column and a cation exchange column. SDS-page revealed the presence of the expected ca.49 kDa protein band estimated to be >95% pure (data not shown). Addition of 2 μg of purified protein in the D-aspartate ligase assay resulted in the formation of a radioactive product corresponding to the labeled hexapeptide (peak B in
To ensure that the labeled product in peak B was the expected hexapeptide (UDP-MurNAc-(D-Asp)pentapeptide), the D-aspartate ligase assay was scaled up for mass spectrometry and MS/MS analysis (
To assess the in vivo activity of the D-aspartate ligase, pSJL2(asIfm) was introduced in the heterologous host E. faecalis JH2-2. The expression of D-aspartate ligase and its activity were detected in the cytoplasmic extracts by a Western blot assay using an anti-AsIfm antiserum and the standard D-aspartate ligase assay respectively (data not shown).
The peptidoglycan structure of E. faecalis JH2-2/pSJL2(asIfm) and that of the parental strain JH2-2 containing the native plasmid pJEH11, were analyzed by liquid chromatography coupled to mass spectrometry. Since the structures of the muropeptides present in the peaks of JH2-2/pJEH11 peptidoglycan were identical to those found in JH2-2 (17), the same numbering was used (peak 1 to 10,
The L,D-transpeptidase was purified from E. faecium M512 (Mainardi et al., 2000)) in four chromatographic steps using the radioactive exchange assay (see below) to detect active fractions. Briefly, E. faecium M512 was grown to an OD650 of 0.7 in 24 liters of brain heart infusion (BHI) broth (Difco, Elancourt, France), harvested by centrifugation, and washed twice in 10 mM sodium phosphate (pH 7.0). Bacteria were disrupted with glass beads in a cell disintegrator (The Mickle Laboratory Engineering Co, Gromshall, United Kingdom) for 2 h at 4° C. The extract was centrifuged (5000×g for 10 min at 4° C.) to remove cell debris and the supernatant was ultracentrifuged at 100,000×g for 30 min at 4° C. Soluble proteins (1 g) were loaded onto an anion exchange column (Hi-Load™ 26/10 Q Sepharose™ HP, Amersham Pharmacia Biotech, Saclay, France) equilibrated with 25 mM sodium cacodylate buffer (pH 7.86) (buffer A). Elution was performed with a linear 0 to 2M NaCl gradient in buffer A. Active fractions were pooled (30 mg of proteins), concentrated by ultrafiltration (Centricon YM10, Millipore, Saint-Quentin-en-Yvelines, France), and loaded onto a gel filtration column (Superdex 75 HR26/60, Amersham Pharmacia Biotech) equilibrated with buffer A containing 0.3M NaCl. Active fractions (1 mg of proteins) were loaded onto a weak anion exchange column (HiTrap™ DEAE fast flow M 1 ml, Amersham Pharmacia Biotech) equilibrated with buffer A. Proteins (300 μg) eluting between 0.2 and 0.3 M NaCl were concentrated by ultrafiltration (Amicon ultra-4, Millipore) and loaded onto a gel filtration column (Superdex 200 PC 3.2/30, Amersham Pharmacia Biotech) equilibrated with buffer A containing 0.3M NaCl. Active fractions (70 μg of proteins) were concentrated (Amicon ultra-4) and analyzed by SDS-PAGE revealing a major 48-kDa protein band which was transferred onto polyvinylidene difluoride membrane (Problott, Applied Biosystems, Framingham, Mass.) by passive adsorption (Messer et al., 1997). N-terminal Edman sequencing was performed on an Applied Biosystems Procise 494HT instrument with reagents and methods recommended by the manufacturer. The open reading frame for the L,D-transpeptidase was identified by similarity searches between the N-terminal sequence of the 48-kDa protein (AEKQEIDPVSQNHQKLDTTV [SEQ ID No 20]) and the partial genome sequence of E. faecium using the software tBLAST at the National Center for Biotechnology Information Web site (available on the World Wide Web at www.ncbi.nlm.nih.gov).
A portion of the ldtfm open reading frame of E. faecium M512 was amplified with primers 5′-TTCCATGGCAGAAAAACAAGAAATAGATC C-3′ (SEQ ID No 18) and 5′-TTGGATCCGAAGACCAATACAGGCG-3′ (SEQ ID No 19). The PCR product digested with NcoI and BamHI (underlined) was cloned into pET2818, a derivative of pET2816 (Chastanet et al., 2003) lacking the sequence specifying the thrombin cleavage site (our laboratory collection). The resulting plasmid, pET2818Ω/ldtfm, encoded a fusion protein consisting of a methionine specified by the ATG initiation codon of pET2818, the sequence of the protein purified from E. faecium (residues 119 to 466), and a C-terminal polyhistidine tag GSH6. E. coli BL21(DE3) pREP4GroESL (Amrein et al., 1995) harboring pET2818Ωldtfm was grown at 37° C. to an OD650 of 0.8 in three liters of BHI broth containing ampicillin (100 μg/ml). Isopropyl-D-thiogalactopyranoside was added to a final concentration of 0.5 mM and incubation was continued for 17 h at 16° C. Ldtfm was purified from a clarified lysate by affinity chromatography on Ni2+-nitrilotriacetate-agarose resin (Qiagen GmbH, Hilden, Germany) followed by anion exchange chromatography (MonoQ HR5/5, Amersham Pharmacia Biotech, Uppsala Sweden) with a NaCl gradient in TrisHCl pH 7.5. An additional gel filtration was performed on a Superdex HR10/30 column (Amersham Pharmacia Biotech) equilibrated with 50 mM Tris-HCl (pH 7.5) containing 300 mM NaCl at a flow rate of 0.5 ml/min. Site-directed mutagenesis was performed according to the QuickChange procedure of Stratagene (La Jolla, Calif.).
The dipeptide Nα,Nε-diacetyl-L-lysyl-D-alanine (Ac2-L-Lys-D-Ala) was prepared by coupling Boc2-L-Lys p-nitrophenylester with D-Ala-Obenzyl p-toluenesulfonate (Novabiochem, Laüfelfingen, Switzerland) in the presence of triethylamine followed by acetylation with acetic anhydride in the presence of pyridine as previously described (Mainardi et al., 2002). Nα,Nε-diacetyl-L-lysine-D-alanyl-D-alanine (Ac2-L-Lys-D-Ala-D-Ala), L-Ala-D-iGlu-L-Lys-D-Ala-D-Ala (pentapeptide), and amino acids were purchased from Sigma-Aldrich (Saint-Quentin Fallavier, France). D-2-hydroxy acids were obtained from Acros Organics (Noisy-le-Grand, France). UDP-N-acetylmuramyl-L-Ala-D-iGlu-L-Lys-D-Ala-D-Ala (UDP-MurNAc-pentapeptide) was prepared from Staphylococcus aureus (Billot-Klein et al., 1997). The R39 D,D-carboxypeptidase was used to generate UDP-MurNAc-tetrapeptide and tetrapeptide from UDP-MurNAc-pentapeptide and pentapeptide, respectively (Billot-Klein et al., 1992). Disacccharide-peptide fragments of the peptidoglycan (muropeptides) were prepared by scaling up a previously published procedure (Arbeloa et al., 2004). Briefly, E. gallinarum strain SC1 (Grohs et al., 2000) was grown in 3 liters of BHI broth at 37° C. to an OD650 of 0.7. Peptidoglycan was extracted with 4% sodium dodecyl sulfate at 100° C., treated overnight with pronase and trypsin, and digested with mutanolysin and lysozyme. Soluble disaccharide-peptides were purified by reversed-phase high pressure liquid chromatography (rp-HPLC) on a C18 column, individually collected, lyophilized, and dissolved in water. The concentration of muropeptides was estimated after acidic hydrolysis with a Biotronik model LC2000 amino acid analyzer (Mengin-Lecreulx et al., 1999). The structure of the different substrates was confirmed by mass spectrometry and tandem mass spectrometry with an electrospray quadrupole time-of-flight mass spectrometer operated in the positive mode (Qstar Pulsar I, Applied Biosystems, Courtabœuf, France), as previously described (Arbeloa et al., 2004).
The standard exchange assay was based on incubation of non-radioactive Ac2-L-Lys-D-Ala and D-[14C]Ala and determination of Ac2-L-Lys-D-[14C]Ala formed by the L,D-transpeptidase (Mainardi et al., 2002; Coyette et al., 1974). Briefly, the assay (50 μl) contained Ac2-L-Lys-D-Ala (5 mM), D-[14C]Ala (0.15 mM; 2.0 GBq/mmol, ICN Pharmaceuticals, Orsay, France), 10 mM sodium cacodylate buffer (pH 6.0), and 0.1% triton X-100 (v/v). The reaction was allowed to proceed at 37° C. and stopped by boiling the samples for 3 min. After centrifugation (10,000×g, 2 min), 45 μl of the supernatant was analyzed by rpHPLC at 25° C. on a μ-Bondapak C18 column (3.9 by 300 mm, Waters, Saint Quentin en Yvelines, France) with isocratic elution (0.05% TFA in water/methanol 9:1 per volume) at a flow rate of 0.5 ml/min. Products were detected by scintillation with a Radioflow Detector (LB508, Perkin Elmer) coupled to the HPLC device. To test different donors, 3 μg of Ldtfm were incubated for 60 min in the same conditions, except that Ac2-L-Lys-D-Ala was replaced by UDP-MurNAc-tetrapeptide (2.5 mM), UDP-MurNAc-pentapeptide (2.5 mM), tetrapeptide (2.5 mM), pentapeptide (2.5 mM), GlcNAc-MurNAc-tetrapeptide-iAsn (1 mM), and GlcNAc-MurNAc-pentapeptide-iAsn (1 mM).
To assay for in vitro transpeptidation, the L,D-transpeptidase (3 μg) was incubated with the momomeric muropeptides GlcNAc-MurNAc-L-Ala-D-iGln-L-(M-D-iAsn)Lys-D-Ala (25 nmoles), GlcNAc-MurNAc-L-Ala-D-iGln-L-(M-D-iAsn)Lys (5 nmoles) and GlcNAc-MurNAc-L-Ala-D-iGln-L-Lys-D-Ala (5 nmoles) for 2 h at 37° C. in 25 μl of 5 mM sodium phosphate buffer (pH 6.0). The reaction was stopped by boiling the sample for 3 min and the mixture was centrifuged (10,000×g, 2 min). The formation of dimers was determined by mass spectrometry on a 10-μl aliquot. For tandem mass spectrometry analysis, the remaining of the reaction mixture was treated with ammonium hydroxyde to cleave the ether link internal to MurNAc (Arbeloa et al., 2004). The conditions for fragmentation of the resulting lactoyl-peptides with N2 as the collision gas were as previously described (Arbeloa et al., 2004).
We identified the gene encoding the L,D-transpeptidase responsible for the formation of the L-Lys3⋄D-iAsn-L-Lys3 cross-links in E. faecium M512 by partial purification of the enzyme (
To gain insight in the activity of Ldtfm, various 2-amino and 2-hydroxy acids were tested as potential acceptor substrates (Table 2) in an exchange reaction using the model dipeptide substrate Ac2-L-Lys-D-Ala as the donor (
The Ldtfm specificity for peptide donors was explored with the exchange assay using D-[14C]Ala as the acceptor. Formation of radioactive peptides was observed not only with Ac2-L-Lys-D-Ala (
We have previously detected similar Ldtfm activity in crude extracts from the ampicillin-resistant E. faecium mutant M512 and from the susceptible parental strain D344S (Mainardi et al., 2002). The identification of the corresponding gene, ldtfm, allowed us to confirm that its sequence was identical in both strains and in the E. faecium genome data base. These observations indicate that activation of the L,D-transpeptidation pathway (
Since Ldtfm had all the characteristics expected for a peptidoglycan cross-linking enzyme, we investigated the formation of L-Lys3→D-iAsn-L-Lys3 cross-links with substrates closely mimicking the natural peptidoglycan precursors. Such substrates were prepared from the peptidoglycan of Enterococcus gallinarum, as it contains large amounts of uncross-linked monomers containing a tetrapeptide-iAsn stem (Grohs et al., 2000). L,D-transpeptidation was assayed with a reconstituted pool of three muropeptides to simultaneously test six combinations of donors and acceptors (
Sequence comparisons indicated that Ldtfm is the first representative of a novel family of proteins which is sporadically distributed among taxonomically distant bacteria. Close homologs (
EfLDT (119-466 of SEQ ID No 13) was crystallized using the sitting-drop vapour-diffusion method at 295 K. Rock-shaped crystals of SeMet-derivatised protein with approximate dimensions 200 μ×200 μ×200μ were obtained at a concentration of 10 mg/ml using 12.5% PEG 2000, 100 mM ammonium sulfate, 300 mM NaCl and 100 mM sodium acetate trihydrate pH 4.6. X-ray diffraction data (2.4 Å) were collected at the ESRF FIP-BM30A beamline, processed with the CCP4 programm suite (MOSFLM and SCALA).
The structure of EfLDT was determined by single anomalous diffraction and the position of three ordered Se atoms (out of a possible 5) were found using the program CNS. After density modification using the CNS SAD phase, the model was manually built with one molecule per asymmetric unit. The final model consists of residues 217-398 and 400-466, one sulfate and one zinc ions and 295 water molecules. The 97 residues 119-216 could not be located in the map. Ramachandran analysis indicates that 83.3% of residues are in the most favored region, 15.3% are additionally allowed, and 1.4% are generously allowed.
The results from the X-ray diffraction experiment of the crystallized L,D-transpeptidase consisting of the amino acid sequence 119-466 of SEQ ID No 13 are shown in Table 3 hereunder.
The three-dimensional structure of the crystallized L,D-transpeptidase consisting of the amino acid sequence 119-466 of SEQ ID No 13 is shown in
The protein is constituted by 2 domains: the domain 1 is constituted by residues 217 to 338 (shown in light grey on top of
†The structure was determined from the observed monoisotopic mass of lactoyl peptides and for monomers and dimers (indicated by star) directly determined by tandem mass spectrometry. Tri, tripeptide L-Ala1-D-iGLN2-L-Lys3; Tetra, tetrapeptide; L-Alai-D-iGLN2-L-Lys3-D-Ala4; penta, pentapeptide Ala1-D-iGLN2-L-Lys3-D-Ala4-D-Ala5;
‡amino acid(s) present in the cross-bridge between two stem peptides
§amino acid(s) present in the free N-terminal side chain
†Ionic current intensity (product/product + substrate)
‡ ND, not detected
Bulgaricus
Number | Date | Country | Kind |
---|---|---|---|
05300644.1 | Aug 2005 | EP | regional |
This application is a divisional of U.S. application Ser. No. 11/997,705 filed Mar. 5, 2009, which is a national phase application under 35 U.S.C. §371 of International Application No. PCT/EP2006/064903 filed Aug. 1, 2006, which claims priority to European Application No. 05300644.1 filed Aug. 2, 2005. The entire text of each of the above-referenced disclosures is specifically incorporated herein by reference without disclaimer.
Number | Date | Country | |
---|---|---|---|
Parent | 11997705 | Mar 2009 | US |
Child | 14508084 | US |