The present invention relates to novel enzymes involved in the biosynthesis of mycolic acids and to the use thereof for screening antibiotics, medicaments that can be used to treat infections in humans or in animals, caused by Corynebacterineae, and more particularly infections caused by pathogenic mycobacteria (Mycobacterium tuberculosis, M. africanum, M. leprae, M. ulcerans, M. microti, M. bovis, M. abscissus, M. avium, M. fortuitum, M. kansasii, etc.) and also infections caused by other genera such as Nocardia, Rhodococcus, Gordona, etc.
Over the past fifteen or so years, a new upsurge in tuberculosis (the agent of which is M. tuberculosis) has been observed, in particular in industrialized countries, due both to the multiplication of cases of tuberculosis/HIV coinfection and to the appearance of tubercular bacillus strains multiresistant to antibiotics. Thus, the design of new antitubercular medicaments has become a priority. The identification of pharmacological targets is therefore necessary for the development of new medicaments.
Mycobacterial mycolic acids, also known as eumycolates, are α-alkylated and β-hydroxylated very long-chain (C60-C90) fatty acids present in the form of esters in the wall of bacteria of a particular phylogenetic line of actinomycetes, the suborder Corynebacterineae, also called “mycolata”, comprising, inter alia, the bacterial genera: Mycobacterium, Corynebacterium, Rhodococcus, Nocardia, Gordona and Tsukamurella. The mycolic acids are thus specific for the envelope of mycobacteria and of related bacteria (Brennan & Nikaido, 1995); a distinction can be made between mycobacterial mycolic acids and the mycolic acids of related genera, which have shorter chain lengths (Barry et al. 1998). These compounds are important to the architecture and the permeability of the bacterial envelope for which they represent a hydrophobic diffusion barrier (Brennan & Nikaido, 1995). In addition, they have an essential role in the survival of mycobacteria, and they are involved in the virulence and the persistence of the tubercular bacillus in the infected host (Dubnau et al., 2000; Glickman et al., 2000).
Among the mycolata are major pathogens, in particular the mycobacteria Mycobacterium tuberculosis, which is the tuberculosis agent, and Mycobacterium leprae, which is the leprosy agent. More specifically, the Mycobacterium genus comprises several species classified in 3 groups as a function of their risk of infection with respect to humans. Several species are found in the strict pathogen group, including the leprosy bacillus M. leprae and also M. tuberculosis which belongs to the subgroup known as “M. tuberculosis complex”. This subgroup is made up of 4 conventional members: M. tuberculosis, M. bovis, M. microti and M. africanum, and of the 3 recently defined members M. canetti, M. pinnipedii and M. caprae (Aranaz et al., 2003; Cousins et al., 2003; van Soolingen et al., 1997). The second group corresponds to the human opportunistic pathogens (such as M. avium, M. fortuitum, M. kansasii, etc.) which have been isolated from certain animals and in the environment. These mycobacteria are responsible for pathologies in humans (mycobacterioses), in particular in individuals with a weak immune system (for example, individuals suffering from AIDS). Finally, the third group corresponds to the nonpathogenic saprophytic mycobacteria (M. smegmatis, M. phlei, M. gastri, etc.) present in the environment. Among the antitubercular medicaments are those which interfere with mycobacterial envelope biosynthesis, such as isoniazid, ethionamide and ethambutol (WEBB et al., Molecular Biology and Virulence I: 287-307 (eds. Ratledge, C. & Dale, J.) (Blackwell Science Ltd, Oxford), 1999). The principle target of isoniazid, specifically antitubercular antibiotic, is any enzyme of mycolic acid metabolism: it inhibits the activity of the InhA protein, which is part of the FAS-II enzymatic complex or system, the function of which is to produce, by successive cycles of elongation, long-chain fatty acids (acyl-ACP up to C32 in vitro, or up to C60 in vivo), which are precursors of mycolic acids, using malonyl-ACP as elongation unit. InhA, a 2-trans-enoyl-ACP reductase, catalyzes the 4th step of an elongation cycle which comprises four steps (
More specifically, the initiation of the elongation system of the mycobacterial FAS-II system is carried out by a β-ketoacyl-ACP synthase III (mtFabH or KasIII), which probably makes the connection between the two systems FAS-I and FAS-II by catalyzing the condensation of the acyl-CoA derived from FAS-I with the elongation unit, malonyl-ACP (Choi et al., 2000) (
However, there exists a real need to find inhibitors of other enzymes of this cycle, in order to broaden the panopy of available medicaments, given the frequency of appearance of antibiotic resistances. Furthermore, multiplying the targets increases the chances of designing pharmacologically active molecules which may actually be used clinically.
Among the enzymes of the mycobacterial FAS-II system, the (3R)-hydroxyacyl-ACP dehydratase enzyme, which is involved in the third step of the cycle and catalyzes the conversion of (3R)-hydroxyacyl-ACP into 2-trans-enoyl-ACP, has not yet been identified. The hydroxyacyl dehydratase enzymes or domains, as a whole, are difficult to identify owing to the lack of similarity between the sequences. This was, for example, the case of the dehydratase domain of FAS-I of M. bovis BCG and of the (3R)-hydroxyacyl-ACP dehydratase of the mitochondrial FAS-II system of Saccharomyces cerevisiae (Fernandes & Kolattukudy, 1996; Kastaniotis et al., 2004). A (3R)-hydroxyacyl-ACP dehydratase activity has been observed in bacteria (including Streptococcus, Staphylococcus, E. coli and Haemophilus influenzae), and corresponds to the FabZ (dehydratase) and FabA (dehydratase-isomerase) enzymes (patent U.S. Pat. No. 6,951,729).
Research for proteins carrying the FabA/FabZ consensus motif (‘F-x(1,2)-G-H-[FI]-P-x(5)-P-G-V-x(3)-E-[AGS]-[LM]-A-Q’) (SEQ ID No. 17) in the Mycobacterium genus using the ScanProsite software (www.expasy.ch/tools/scanProsite) has not made it possible to identify candidate (3R)-hydroxyacyl dehydratase proteins in mycobacteria. These results have made it possible to conclude that no protein of FabZ or FabA type is present in the mycobacterial species available through the ScanProsite software.
This would suggest that the (R)-specific dehydratase/hydratase proteins of mycobacteria must have a catalytic motif different than the common FabA/FabZ motif.
The inventors have now determined the enzymes involved in the third step of the fatty acid elongation cycle of the mycobacterial FAS-II system and the advantage thereof as a target for screening medicaments and in particular antibiotics, active on microorganisms (bacteria, parasites, for example) containing a FAS-II system, in which the third step of the elongation cycle is catalyzed by a dehydratase containing a hydratase 2 motif (
The term “similar basic motif” is intended to mean the basic motif D-x(4)-H, in which the aspartic acid (D) is replaced with an amino acid chosen from the group constituted of asparagine (N), glutamic acid (E) and glutamine (Q), and/or the histidine (H) is replaced with a basic amino acid such as arginine (R) or lysine (L).
Molecules having an inhibitory capacity on the expression or the activity of (3R)-hydroxyacyl-ACP dehydratases of the mycobacterial FAS-II system may also potentially inhibit other essential metabolisms involving proteins of hydratase 2 type, in a given microorganism.
The inventors have found that the enzymes responsible for the third step (dehydration) in the FAS-II system of M. tuberculosis H37Rv correspond to proteins comprising protein subunits including Rv0635 (SEQ ID No. 4), Rv0636 (SEQ ID No. 2) or Rv0637 (SEQ ID No. 6) of M. tuberculosis H37Rv. Various enzymes containing Rv0636 are ACP-dependent and have a specificity for long-chain substrates.
Consequently, a subject of the present invention is a purified and isolated enzyme involved in the FAS-II system, preferably the mycobacterial FAS-II system, and having the following characteristics:
in which X represents ACP and n≧0, preferably n≧4, or CoA and n≧0, preferably n≧8.
The protein not carrying a hydratase 2 motif (i.e. without a catalytic site) is, for example, a long-acyl-chain-binding protein.
The term “multimer” signifies at least three proteins.
The methods for identifying a protein, the three-dimensional structure of which exhibits a hotdog fold, are well known. By way of example, proteins, the three-dimensional structure of which exhibits a hotdog fold, have been described by Leesong et al., 1996 and Hisano et al., 2003.
According to one advantageous embodiment of the invention, the protein as defined in (i) is selected from the group constituted of the Rv0636 protein (SEQ ID No. 2) comprising a hydratase 2 motif Y-A-G-V-S-G-D-L-N—P—I—H—W-D-D-E-I-A (SEQ ID No. 16) and a protein which has at least 51% identity or at least 72% similarity, preferably at least 85% identity or at least 90% similarity, with the Rv0636 protein SEQ ID No. 2 of M. tuberculosis H37Rv.
According to another advantageous embodiment of the invention, the protein as defined in (ii), which does not comprise a hydratase 2 motif, is selected from the group constituted of:
According to another advantageous embodiment of the invention, the dimer or multimer is selected from the group constituted of:
The enzymes in accordance with the invention are thus involved in the third step of dehydration of the elongation cycle of the FAS-II system, and in particular the mycobacterial FAS-II system, as (3R)-hydroxyacyl dehydratase (
The BLAST searches have made it possible to show that the three proteins are very conserved among mycobacteria (
In addition, the sequence of other Corynebacterinae, namely Corynebacterium glutamicum, Rhodococcus sp. RHA1 and Nocardia farcinica has been analyzed. Although Corynebacterium produces short mycolic acids (C32-C36), Rhodococcus and Nocardia have mycolic acids of intermediate size (C34-C48 and C44-C60, respectively), exhibiting a meromycolic chain of medium length (C22-C30 and C32-C42), i.e. shorter than that observed in the Mycobacterium genus. As has already been observed for other enzymes involved in the FAS-II system, it has not been possible to detect polypeptides similar to Rv0635, Rv0636 and Rv0637, in Corynebacterium. This is consistent with the fact that its mycolic acid biosynthesis pathway does not include a fatty acid elongation step. Proteins orthologous to Rv0635 and Rv0636 are present in Rhodococcus and Nocardia (see results above), as observed for the InhA, MabA and KasA proteins. On the other hand, no orthologue of the Rv0637 protein is present, just as for KasB. It should be noted that the genes equivalent to Rv0635 and Rv0636 are fused into a single long gene in Nocardia (
The Rv0635, Rv0636 and Rv0637 proteins, and also the proteins having the identity or similarity percentages specified above, can advantageously serve as a target for screening antibodies of use as anti-infective medicaments.
The expression “x % identity between a polypeptide P of length at least equal to that of the reference sequence, and a reference sequence” is intended to mean that, when the two sequences are aligned, x % of the amino acids of P are identical to the corresponding amino acid of said reference sequence.
The expression “x % similarity between a polypeptide P of length at least equal to that of the reference sequence, and a reference sequence” is intended to mean that, when the two polypeptides are aligned, x % of the amino acids of P are identical to the corresponding amino acid of said reference sequence or are replaced with an amino acid of the same group. When the polypeptide P is shorter in length than the reference sequence, the alignment is carried out over the total length of the polypeptide P.
The term “amino acid of the same group” is intended to mean an amino acid having substantially identical chemical properties. In particular, this term is intended to mean amino acids having substantially the same charge and/or the same size and/or the same hydrophilicity or hydrophobicity and/or the same aromaticity.
Such groups of amino acids include, in particular:
Other substitutions can be envisioned, in which one amino acid is replaced with another amino acid which is comparable but not natural (hydroxyproline, norleucine, ornithine, citrulline, cyclohexylalanine, dextrorotary amino acids, etc.).
The identity percentages and similarity percentages defined can be obtained using the BLAST program (or blast2seq, default parameters) (Tatustore et al., FEMS Microbiol. Lett. 1999, 174, 247-250) or PSI-BLAST program (Altshul et al., 1997) with a comparison window corresponding to the total length of the sequences SEQ ID Nos. 2, 4 and 6, when the polypeptide P compared is of length at least equal to that of the sequences SEQ ID Nos. 2, 4 and 6. When the polypeptide P to be compared is shorter in length than the sequences SEQ ID Nos. 2, 4 and 6, the comparison window corresponds to the total length of the polypeptide P.
According to one advantageous embodiment of the invention, said enzymes are selected from the group constituted of: the homomultimer of Rv0636, the heterodimer or the heteromultimer Rv0635-Rv0636 and the heterodimer or the heteromultimer Rv0636-Rv0637.
Surprisingly, the enzymes comprising Rv0636 combined as a heterodimer or as a heteromultimer with Rv0635 or Rv0637 have an enoyl-CoA hydratase activity in vitro.
In particular, these two heterodimers effectively catalyze the hydration of 2-trans-enoyl-CoA species with chain lengths of C12-C20, whereas no activity is detected in the presence of short-chain substrates (C4-C8). They therefore exhibit a specificity for substrates of size greater than or equal to 12 carbon atoms, a property comparable with the InhA, KasA and KasB proteins of the mycobacterial FAS-II system, and with the FAS-II system itself (
Of course, the specificity of an enzyme according to the invention for a given size of the acyl chain of the substrates may depend on the origin (plant, parasite, bacterium, mitochondrion) of said enzyme.
According to another advantageous embodiment of the invention, the (3R)-hydroxyacyl and/or 2-trans-enoyl substrate of said enzymes is preferably a substrate having an acyl chain of length greater than or equal to C8, preferably C12-C20, derived from ACP (
Rv0636 can combine with a partner which does not comprise a hydratase 2 catalytic motif. Thus, the partner protein (Rv0635 or Rv0637) is probably present in order to stabilize the long acyl chain of the substrate.
The combining of Rv0636 with Rv0635 (so as to form a heterodimer or a heteromultimer) induces a different specificity, but also a much greater specific activity in comparison with that obtained with the Rv0636-Rv0637 heterodimer (
In addition, the fact that the specificity of substrates for Rv0636-Rv0637 is shifted toward the long chains shows that it could be involved, like KasB, in the final steps of elongation of the meromycolic chain, and take over from Rv0635-Rv0636 (
Surprisingly, the results obtained show that the two heterodimers or multimers Rv0635-Rv0636 and Rv0636-Rv0637 are involved in the dehydration step of the mycobacterial FAS-II system, and more specifically that Rv0635-Rv0636 appears to be involved in the first elongation cycles, whereas Rv0636-Rv0637 appears to be involved in the later cycles.
The inventors have produced and purified the various dimers or multimers in recombinant form and have determined their catalytic activity.
The results obtained by the inventors show that these various dimers actually correspond to the (3R)-hydroxy-acyl-ACP dehydratase involved in the third, dehydration step of the mycobacterial FAS-II system and that they are key enzymes in the assembly of the envelope of mycolata. Experiments for predicting essentialness in Mycobacterium tuberculosis suggests that the Rv0635 and Rv0636 genes are essential for the viability of this species (Sassetti et al,. 2003).
Enzymes belonging to the family of hydratases 2 in the form of heteromultimers (for example, of heterodimers), composed of units with a single hotdog fold corresponding to a catalytic subunit and to a subunit without a catalytic site, have never previously been described.
In fact, in the prior art, there has only been a structure of “asymmetrical” type described for the double hotdog hydratase domain of the eukaryotic MFE-2 proteins and predicted (by modeling) for the Rv3398c hydratase/dehydratase enzyme of M. tuberculosis, but, in these two cases, the structure is composed of a single polypeptide having two structural domains, one catalytic, the other noncatalytic, which are covalently bonded. The original structure of the enzymes according to the invention may favor the discovery of specific ligands that are potentially inhibitors of these enzymes.
A subject of the present invention is also the poly-nucleotides encoding the various proteins as defined above, especially the Rv0635, Rv0636 and Rv0637 proteins; in particular, the polynucleotide of SEQ ID No. 1 encodes the Rv0636 protein; the polynucleotide of SEQ ID No. 3 encodes the Rv0635 protein and the poly-nucleotide SEQ ID No. 5 encodes the Rv0637 protein; the polynucleotide comprising the sequence which encodes an RV0636 protein and the sequence which encodes an Rv0635 protein and/or an Rv0637 protein.
The subject of the present invention is also a poly-nucleotide represented in the sequence SEQ ID No. 7 of the sequence listing attached in the annex; it comprises the ORFs encoding the Rv0635, Rv0636 and Rv0637 proteins located between positions 73190 and 73325 of the genome of M. tuberculosis H37Rv. The chromosomal coordinates of the ORFS in said poly-nucleotide are the following: Rv0635: 731930-732406 (positions 201-677 of SEQ ID No. 7), Rv0636: 732393-732821 (positions 664-1092 of SEQ ID No. 7); Rv0637: 732825-733325 (positions 1096-1596 of SEQ ID No. 7) (
A subject of the present invention is also an expression vector comprising a polynucleotide as defined above, and in particular a polynucleotide encoding an Rv0636 protein in accordance with the invention, alone or combined with a polynucleotide encoding the Rv0635 protein and/or the Rv0637 protein, and also a prokaryotic or eukaryotic host cell transformed with said expression vector.
The vector used is any vector of the prior art.
Said vector may also comprise regulatory sequences for the expression of the protein encoded by the poly-nucleotide (promoter, stop site, etc.).
The introduction of the polynucleotide or of the expression vector into the modified host cell can be carried out by any known method, such as, for example, transfection, infection, fusion, electroporation, microinjection or biolistics.
A subject of the present invention is also a method for producing an enzyme in accordance with the invention, characterized in that it comprises culturing a host cell in accordance with the invention in a suitable medium and purifying said enzyme from said culture.
Advantageously, said purification is carried out by affinity chromatography, for example IMAC, and/or exclusion chromatography. By way of nonlimiting example, mentioned may be made of the IMAC Ni-Sepharose FF column and the HiLoad 16/60 Superdex 75 prep grade column.
A subject of the present invention is also a method for inhibiting the biosynthesis of compounds of the envelope of mycolata, characterized in that it comprises inhibiting the expression or the activity of at least one of the enzymes as defined above, selected from the group constituted of the Rv0636 protein, the Rv0635 protein, the Rv0637 protein and/or the protein complexes as defined above (dimers and multimers comprising the Rv0636 protein). For example, such inhibitors may be antibodies directed against these proteins, antisense RNAs or interfering RNAs targeting the genes encoding these proteins, molecules which bind to the active site or to the substrate-binding site of these proteins, or, in particular, compounds having a phenylbenzopyrone structure, such as flavonoids (Brown et al., 2007).
According to one advantageous embodiment of said method, it comprises:
The subject of the present invention is also a method for screening for ligands which inhibit an enzyme according to any one of claims 1 to 7 or a protein selected from the group constituted of the Rv0635, Rv0636 and Rv0637 proteins, characterized in that it comprises a step of identifying the inhibition of the expression or of the activity of said enzyme or of said protein.
The inhibition of the hydratase/dehydratase activity may be carried out by any method known to those skilled in the art, and in particular by measuring a variation in absorbance linked to the disappearance of the substrate or to the appearance of a product, at an appropriate wavelength or by means of a tandem reaction (secondary enzyme or antibody that will detect the residual amount of substrate or the amount of product formed).
For example, the detection of the inhibitory capacity of said test molecule is carried out by:
Thus, the test for inhibition of the proteins constituted of at least one of the proteins Rv0635, Rv0636 and Rv0637 or of the enzymes as defined above, by potential inhibitors, can be easily and rapidly monitored by spectrophotometry, by following the hydration of the trans double bond in the 2-position of the trans-2-enoyl substrates or the dehydration of the (3R)-hydroxyacyl substrates, at 263 nm. The inhibition constants (Ki) and the mechanism of inhibition (competitive inhibition, noncompetitive inhibition, incompetitive inhibition, irreversible inhibition, slow binding, etc.) for each molecule can be deduced therefrom. In addition, tests for ligand binding to the proteins can also be carried out by spectrofluorimetry, by virtue of the presence of Trp residues, in particular in the substrate binding site. After excitation at 303 nm, the variation in intensity of fluorescence emission at the maximum emission makes it possible to detect the binding of a ligand and to deduce therefrom the dissociation constant (Kd). The simplicity of these methods of measurement, and the relatively small volumes that they require, should enable miniaturization of the inhibition or ligand-binding tests, for the automatic high-throughput screening of combinatorial libraries, by virtue of an automated device that has a spectrophotometer or a spectrofluorimeter.
Owing to its nature predicted to be essential for viability, and to the specificity of its function, the (3R)-hydroxyacyl-ACP dehydratase of the mycobacterial FAS-II system constitutes an excellent potential target for the desired new antimycobacterial medicaments, which in particular act on the growth and/or the viability of pathogenic mycobacteria (Mycobacterium tuberculosis, M. africanum, M. leprae, M. ulcerans, M. microti, M. bovis, M. abscissus, M. avium, M. fortuitum, M. kansasii, etc.) or of new medicaments which act on the growth and/or the viability of genera related to mycobacteria belonging to the Corynebacterineae (for example, Nocardia, Rhodococcus, Gordona, etc.).
The subject of the present invention is also the use of an enzyme as defined above, for screening for antibiotics that are active on microorganisms containing a FAS-II system, in which the dehydration step is catalyzed by a dehydratase containing a hydratase 2 motif [YF]-x(1,2)-[LIVG]-[STGC]-G-D-x-N—P-[LIV]-H-x(5)-[AS] (SEQ ID No. 15) (in which x(n) represents n amino acids, said amino acids being any amino acid, and the amino acids between square brackets representing alternatives) or a catalytic motif, derived from the hydratase 2 motif and constituted at least of the basic motif D-x(4)-H, in which x(4) represents 4 amino acids, said amino acids being any amino acid, or of a motif similar to the basic motif.
According to one advantageous embodiment of this use, the hydratase 2 motif is the motif Y-A-G-V-S-G-D-L-N—P—I—H—W-D-D-E-I-A (SEQ ID No. 16).
According to another advantageous embodiment of this use, said microorganisms are Corynebacterinae, preferably bacteria of the Rhodococcus, Nocardia and Mycobacterium genera, preferably the Mycobacterium genus.
The subject of the present invention is also a method for screening for ligands of a protein (preferably RV0635, Rv0636 and Rv0637, in monomeric form, and more preferably Rv0636) or of an enzyme in accordance with the invention, characterized in that it comprises:
The subject of the present invention is also a method for screening for ligands of a protein (preferably Rv0635, Rv0636 and Rv0637, and more preferably Rv0636) or of an enzyme in accordance with the invention, characterized in that it comprises the following steps:
In addition to the above arrangements, the invention also comprises other arrangements which will emerge from the description which follows, which refers to exemplary embodiments of the invention and also to the attached drawings, in which:
1.A. Sequence Analyses
The genome sequence analyses were carried out using the Internet servers of the Sanger Institute and TubercuList for Mycobacterium tuberculosis H37Rv (Cole et al., 1998) and the “National Center for Biotechnology Information” (NCBI) Internet site for the other genomes (www.ncbi.nlm.nih.gov/genomes/lproks.cgi). The sequence alignments were carried out using the BLAST or PSI-BLAST, Clustal W version 1.8 or MultAlin software with the default parameters (Altschul et al., 1997; Corpet, 1998; Thompson et al., 1994).
1.B. Definition of the FabA/FabZ Specific Common Motif
The PRATT version 2.1 program of the PROSITE database (Jonassen et al., 1995) was used with the default parameters to define the two distinct specific motifs of the FabZ or FabA annotated proteins, based on two independent lists of 36 protein sequences of FabZ type, and 24 sequences of FabA type [including the FabZ1 protein of Enterococcus faecalis, the sequence of which is more related to the FabZ proteins but which has an FabA activity (dehydratase/isomerase) (Wang & Cronan, 2004)]. Each motif was then used as a probe against the databanks proposed (Swiss-Prot, EMBL and TrEMBL) by the ScanProsite software (default parameters) in order to evaluate the specificity of the motif for the two protein families (Gattiker et al., 2002). A common consensus motif specific for FabA/FabZ was then determined manually on the basis of the two previously defined motifs. This common consensus motif was used as a probe against all the protein databanks or against the banks of proteins predicted for various species of the Mycobacterium genus that are available, by this software. A control experiment was carried out using the same strategy as that described above, but with the proteins of the FabG family. This analysis against the predicted proteins of the Mycobacterium genus made it possible to pick out the annotated proteins such as FabG of the mycobacterial species and it showed that mycobacterial proteins could be detected by this strategy.
2.1 PCR Amplification
Rv0636
The gene encoding the Rv0636 protein was amplified by PCR (Polymerase Chain Reaction) in two steps, using the total DNA of the M. tuberculosis strain H37Rv.
In a first PCR reaction, the pair of primers 5′-GAT TTT CTG ATG GCG CTG CGT GAG TTC-3′ (sense primer) (SEQ ID No. 8) and 5′-CGG TCT TGA GCG CCA TAA ACT A-3′ (anti-sense primer) (SEQ ID No. 9) was used. In a second PCR reaction, an N-terminal tag of 6 histidines was introduced using the sense primer 5′-ATG GCT CAT CAT CAT CAT CAT CAT GGT GCG CTG CGT GAG TTC AGC TCG G-3′ (SEQ ID No. 10). The PCR reactions were carried out with the PfuUltra polymerase (Stratagene).
Rv0635-Rv0636
In order to coexpress the Rv0635 and Rv0636 proteins, the nucleotides 731 930 to 733 325 of the genome of the M. tuberculosis strain H37Rv, containing the three genes Rv0635, Rv0636 and Rv0637, were amplified by PCR using the cosmid MTCY20H10 (http://genolist.pasteur.fr/Tuberculist). An N-terminal tag of 6 histidines were simultaneously introduced in the PCR reaction by using the pair of primers 5′-ATG GCT CAT CAT CAT CAT CAT CAT GGT GCG TTG AGC GCA GAC ATC G-3′ (sense primer) (SEQ ID No. 11) and 5′-CAG TTG CTA ATT ACG CGG TC-3′ (antisense primer) (SEQ ID No. 12), and the PfuUltra polymerase (Stratagene).
Rv0636-Rv0637
In order to coexpress the Rv0636 and Rv0637 proteins, a DNA fragment containing the two genes Rv0636 and Rv0637 was amplified using the cosmid MTCY20H10 (http://genolist.pasteur.fr/Tuberculist). An N-terminal tag of 6 histidines were simultaneously introduced in the PCR reaction by using the pair of primers 5′-ATG GCT CAT CAT CAT CAT CAT CAT GGT GCG CTG CGT GAG TTC AGC TCG G-3′ (sense primer) (SEQ ID No. 13) and 5′-CAG TTG CTA ATT ACG CGG TC-3′ (antisense primer) (SEQ ID No. 14), and the PfuUltra polymerase (Stratagene).
2.2 DNA Ligation
After the addition of a 3′ protruding poly(A) end by incubation with Taq polymerase (New England Biolabs), the genes amplified with the pCR T7 TOPO or pEXP5-CT/TOPO vector (Invitrogen) were ligated under the conditions specified by the supplier.
2.3 Cloning
The cloning was carried out in E. coli TOP10 (Invitrogen), and the exactness of the genes isolated was controlled by DNA sequence analysis.
The expression of the genes was carried out with E. coli BL21-AI (Invitrogen) transformed with Rv0636, and with E. coli BL21 Star (DE3) (Invitrogen) transformed with Rv0636-Rv0637 or with Rv0635-Rv0636-Rv0637, the genes being integrated into the constructs described below:
BL21 Star (DE3)/pEXP5-CT/TOPO::H-Rv0635-Rv0636-Rv0637;
The bacteria were cultured in LB medium (Luria Broth Base, DIFCO-BRL) supplemented with 50 μg/ml of ampicillin, at 37° C.
Rv0636
At an OD600=0.7-0.9, the expression of the Rv0636 target gene was induced with 0.02% of arabinose for 3-4 hours. After centrifugation of the bacterial cultures, the cell pellet was resuspended in a lysis buffer (50 mM NaH2PO4, 300 mM NaCl, 10 mM imidazole, pH 8.0), and lyzed using freeze-thawing cycles. The major part of the Rv0636 protein is in insoluble form.
After centrifugation at 19 000 rpm (43 000×g) for 20 minutes, the cell-free extract was loaded onto a preequilibrated Ni Sepharose FF column (1 ml, GE Healthcare). After washing with 20 column volumes of buffer (20 mM imidazole, 50 mM NaH2PO4, 300 mM NaCl, pH 8.0), the protein was eluted with 5 column volumes of 250 mM imidazole in the same buffer.
The fractions containing the protein are combined, and their buffer is changed by chromatography on a PD-10 column (GE Healthcare). The enzyme is further purified on a HiLoad 16/60 Superdex 75 prep grade column (GE Healthcare) equilibrated with 50 mM HEPES buffer containing 150 mM NaCl, pH 7.5. The solution of purified protein is analyzed by Coomassie blue staining on SDS-PAGE (
Rv0635-Rv0636 and Rv0636-Rv0637
At an OD600=0.5-0.7, the expression of the Rv0635-RV0636 and Rv0636-Rv0637 genes was induced with 0.2 mM of IPTG for 3-4 hours. After centrifugation of the bacterial cultures, the cell pellet was resuspended in a lysis buffer (50 mM NaH2PO4, 300 mM NaCl, 10 mM imidazole, 10% (v/v) glycerol, pH 8.0) with 1 mg/ml lysozyme, 0.01 mg/ml RNAse A, 0.02 mg/ml DNAse I and 1 mM PMSF, and lyzed using the One Shot Cell disruptor (Constant Systems Ltd.).
The soluble fraction was loaded onto a preequilibrated Ni Sepharose FF column (1 ml, GE Healthcare). After washing with 20 column volumes of buffer (20 mM imidazole, 50 mM NaH2PO4 or HEPES, 300 mM NaCl, 10% (v/v) glycerol, pH 8.0), the protein was eluted with 5 column volumes of the same buffer comprising 250 mM imidazole.
The fractions containing the enzyme are combined, and their buffer is changed by chromatography on a PD-10 column (GE Healthcare). The enzyme is further purified on a HiLoad 16/60 Superdex 75 pre grade column (GE Healthcare) equilibrated with 50 mM HEPES, 150 mM NaCl, pH 7.5. Glycerol is added to the fractions (up to 50% v/v) and the protein solution is stored at −20° C.
The solution of purified protein is evaluated by Coomassie blue staining on SDS-PAGE (
Rv0636 Heterodimers
During the chromatography steps described above, coelution of the Rv0635 (fused with the N-terminal polyhistidine tag) and Rv0636 proteins, and elution of the Rv0636 (fused with the N-terminal polyhistidine tag) and Rv0637 proteins was demonstrated (
The Dynamic Light Scattering measurements were carried out with a DynaPro-801 TC (Protein Solutions, Inc), at a protein concentration of 20 mg/ml in 50 mM NaHEPES, 150 mM NaCl, pH 7.5, at temperatures of 8 to 20° C. The data were analyzed with Dynamics. The sample showed a monomodal size distribution.
These measurements confirm that this enzyme (33.1 kDa) forms a tetramer at low temperature (8° C.). However, at higher temperature (20° C.) it behaves like a heterodimer. The results are shown in table 1 below.
Synthesis of Fatty Acid Derivatives
The 2-trans-enoyl-CoA was synthesized and purified according to the protocol described in Quemard at al., 1995. The synthesis of 3-cis-decenoyl-CoA was carried out as follows.
The 3-cis-decenoyl-CoA was prepared and purified from 3-cis-decenoic acid and CoA, according to the same mixed anhydride method as for 2-trans-enoyl-CoA (Goldman & Vagelos, 1961). The 3-cis-decenoyl-CoA was purified by reverse-phase HPLC using a Nucleosil C18 8×250 nm column (Bischoff Chromatography). The column was preequilibrated with a methanol/water mixture containing 20 mM of NaH2PO4, and the elution was carried out with a gradient of 10 to 60% of methanol in water at 1.5 ml/min. The detection was carried out by measuring the absorbance at 260 nm.
The 3-cis-decanoic acid was synthesized by oxidation of 3-cis-decen-1-ol with pyridinium dichromate in dichloromethane and dry dimethylformamide, according to the protocol described by Besra (Besra et al., 1993). It was purified by silica gel chromatography using an 80/20 (v/v) mixture of CHCl3/CH3OH as eluent, and characterized by 1H-NMR spectroscopy. The 3-cis-decen-1-ol was obtained by catalytic semihydrogenation (1 atm) of the solution of 3-decyn-1-ol (158 mg) in dry diethyl ether on the Lindlar catalyst (40 mg) with vigorous stirring until the hydrogen gas (1.6 mmol) was absorbed. It was then purified by adsorption chromatography on silica gel with silver nitrate, using increasing concentrations of methanol in dichloromethane as eluent.
The 2-trans-octenoyl-ACP was synthesized from E. coli holo-ACP (Sigma®) and 2-trans-octenoic acid as previously described (Cronan & Klages, 1981).
Calibration of the Reagent Solutions
The concentrations of the solutions or substrates 2-trans-enoyl-CoA, 3-cis-decenoyl-CoA, and 2-trans-octenoyl-ACP were determined by spectrophotometry at 260 nm with the following molar extinction coefficients: for the 2-trans-enoyl-CoAs, ε260=22 600 M−1 cm−1, for the 3-cis-decenoyl-CoA, ε260=16 800 M−1 cm−1, or at 280 nm for the 2-trans-octenoyl-ACP (ε280=1800 M−1 cm−1). a more precise calibration of the concentration of 2-trans-octenoyl-ACP was then carried out in the presence of the purified M. tuberculosis InhA enzyme (25 nM) and of NADH (at saturating concentration: 100 μM). The reaction was monitored by spectrophotometry at 340 nm, which is the wavelength at which the oxidation of NADH is monitored, and the total concentration of converted substrate was deduced.
Principle of the Enzymatic Test
The (R)-specific hydratases/dehydratases are more active in vitro in the direction of hydration than in the direction of dehydration when they are isolated from their complex. Thus, their in vitro activity is often studied in the presence of enoyl derivatives rather than (3R)-hydroxyacyl derivatives.
The reaction monitored is the conversion of a 2-trans-enoyl-CoA (or ACP) into (3R)-hydroxyacyl-CoA (or ACP) corresponding to the hydration reaction, or the conversion of a (3R)-hydroxyacyl-CoA into 2-trans-enoyl-CoA corresponding to the dehydration reaction.
Standard Conditions
These reactions are monitored by spectrophotometry at 263 nm by means of a thermostated Uvikon 923 spectrophotometer (Kontron Instruments). They are carried out in a quartz cuvette (optical path of 1 cm) in a total volume of 700 μl at 25° C. The reaction mixture contains 100 mM sodium phosphate buffer, pH 7.0, and varying concentrations of substrates. After equilibration of the base line on the reaction mixture in the absence of the enzyme, the reactions are triggered by the addition of enzyme, and then monitored for 1.5 to 5 min. The reaction rate corresponds to the initial rate measured by plotting the tangent of the curve OD=f(time) at time zero.
The conversions corresponding to the hydration reaction and to the dehydration reaction are associated, respectively, with a decrease and with an increase in the absorbance at 263 nm, linked to the double bond conjugated to the carbonyl. A variation in OD263 of 0.67 corresponds to a variation in concentration of 100 μM.
Determination of the Specific Hydratase/Dehydratase Activity of the Rv0635-Rv0636 and Rv0636-Rv0637 Enzymes
In order to carry out the various enzymatic tests, several concentrations of enzyme were tested, thereby making it possible to define, for each enzyme, the concentration most suitable for the measurements.
In order to determine whether the enzymes are active, the first hydration tests were carried out in the presence of 25 μM of crotonoyl-CoA or of 2-trans-octenoyl-CoA and of the following ranges of enzyme concentrations: 0.8 to 280 nM of the Rv0635-Rv0636 heterodimer, 0.8 to 543 nM of the Rv0636-Rv0637 heterodimer. As for the dehydratase activity, it was studied in the presence of 25 or 75 μM of hydroxybutyryl-CoA substrate and the same enzyme concentration as for the hydratase activity tests, i.e. 0.8 to 280 nM of the Rv0635-Rv0636 heterodimer, 0.8 to 543 nM of the Rv0636-Rv0637 heterodimer.
Determination of the Substrate Specificity of the Enzymes
The substrate specificity was studied in the presence of 2-trans-enoyl-CoA of various chain lengths (C4-C20). The initial rates were measured and compared at a very low substrate concentration in order to reduce as much as possible the solubility problems encountered with these amphiphilic compounds. The reactions were carried out in the presence of 2.5 or 25 μM of substrate; the respective concentrations of the Rv0635-Rv0636 and Rv0636-Rv0637 enzymes were the following: 80 nM and 543 nM.
Activity on C4-C8 Substrates
The activity of the purified Rv0635-Rv0636 and Rv0636-Rv0637 heterodimers was tested in the presence of a 2-trans-enoyl-CoA since the (R)-specific dehydratases/hydratases preferentially function in the direction of hydration, in vitro, when they are isolated from their complex.
The tests were carried out in 100 mM sodium phosphate buffer, pH 7.0, in the presence of 3-hydroxybutyryl-CoA (25 or 75 μM) or of 2-trans-octenoyl-CoA (25 μM) or of crotonoyl-CoA (25 μM), and of enzyme (Rv0635-Rv0636: 80 nM; Rv0636-Rv0637: 543 nM). Several concentrations of enzyme were tested. When no activity was detected, the maximum concentration tested is indicated. After addition of the enzyme, the reactions were monitored by spectrophotometry at 263 nM for 1.5 min.
The results show these enzymes are not active under these conditions on C4 or C8 substrates, since no decrease in absorbance was observed.
Activity on C12-C20 Substrates
The activity of the various enzymes was tested on long-chain derivatives, C12-C20 trans-2-enoyl-CoAs. The tests were carried out in the presence of a low substrate concentration (2.5 μM) for the Rv0635-Rv0636 heterodimer in order to reduce as much as possible the solubility problems associated with these amphiphilic molecules. For the Rv0636-Rv0637 heterodimer, the experiments were carried out in the presence of a higher substrate concentration (25 μM) since, at 2.5 μM, no activity could be detected. In order to be able to make a comparison, the study of the specificity of Rv0635-Rv0636 was also carried out at 25 μM (
These results demonstrate important differences in the level of substrate specificity of the enzymes in accordance with the invention. The apparent preference of the Rv0635-Rv0636 heterodimer for hexadecenoyl-CoA (C16) is reminiscent of that of the InhA protein of the mycobacterial FAS-II system (Quemard et al., 1995) (
These results confirm that the Rv0635, Rv0636 and Rv0637 proteins are involved in the mycobacterial FAS-II elongation system since the enzymes that they form exhibit a medium-chain and long-chain (C12-C20) substrate specificity, like the FAS-II system and some of its already characterized proteins.
Determination of the Isomerase Activity of the Enzymes
As in certain FAS-II systems described, the dehydration step can be catalyzed either by a (3R)-hydroxyacyl-ACP dehydratase, or by a (3R)-hydroxyacyl-ACP dehydratase/isomerase of FabA type which, in vivo, converts the 2-trans-enoyls, formed by the dehydration step, into 3-cis-enoyls. The isomerase activity can be studied in vitro by studying the reverse reaction, i.e. the conversion of a 3-cis-enoyl compound into a 2-trans-enoyl compound.
The ability of the proteins to catalyze the isomerization reaction was studied in the presence of 3-cis-decenoyl-CoA (10 μM) and of Rv0635-Rv0636 enzyme (80 nM) or Rv0636-Rv0637 enzyme (543 nM). The control experiments were carried out in the presence of 2-trans-decenoyl-CoA (10 μM) and of the same concentrations of enzyme. The reactions were monitored at 263 nm, where the increase in absorbance results from the conversion of the 3-cis-decenoyl-CoA into 2-trans-decenoyl-CoA.
No variation in absorbance was observed in the presence of 3-cis-decenoyl-CoA, whereas for the control experiments, the Rv0635-Rv0636 and Rv0636-Rv0637 enzymes exhibited a respective specific activity of 0.20 μmol/min/mg of protein and 0.02 μmol/min/mg of protein. Under the conditions tested, the enzymes studied do not exhibit any isomerase activity.
Study of the ACP-Dependency
The particularity of the FAS-II systems is that they have an ACP-dependent activity. FAS-II is the only fatty acid biosynthesis system which is ACP-dependent in mycobacteria.
The activities of the enzymes Rv0635-Rv0636 (80 nM), Rv0636-Rv0637 (311 nM) or Rv0636 alone (311 nM) were compared in the presence of 2-trans-octenoyl-ACP (2 μM) or of 2-trans-octenoyl-CoA (2 μM).
The Rv0635-Rv0636 heterodimer exhibited a specific activity in the presence of 2-trans-ocetenoyl-ACP of 0.28 μmol/min/mg, whereas no significant activity was detected in the presence of the CoA derivative under these conditions (
Tandem Reaction with MabA and InhA
The inventors tested the ability of the two heterodimers to function in a coupled reaction in the presence of MabA and InhA, which are the two reductases of FAS-II that catalyze the reactions upstream and downstream of the dehydration step in the cycle (
The reaction media were analyzed by MALDI-TOF mass spectrometry (MALDI-TOF MS). The reaction media were first of all diluted 10 times in water. The samples (1 μl) were deposited onto the target plates, mixed with 1 μl of matrix [10 mg/ml of 2,5-dihydroxybenzoic acid in water:acetonitrile, 8:2 (v/v)] and left to crystallize at ambient temperature. The analyses of the coupling reactions were carried out in reflectron mode on a 4700 Analyser mass spectrometer (Applied Biosystems) equipped with an Nd:YAG laser (wavelength of 355 nm, pulse duration<500 ps and 200 Hz repetition rate). 2500 shots were accumulated in positive or negative ion mode and the mass spectrometry data were acquired using the instrument default calibration.
The MALDI-TOF mass spectrometry analysis of the reaction media demonstrates that dodecanoyl-CoA, the saturated product of InhA, is formed very rapidly (in 3 min) in the presence of Rv0635-Rv0636 (
These results as a whole strongly suggest that the Rv0635, Rv0636 and Rv0637 proteins are involved in the FAS-II elongation system. The two heterodimers, Rv0635-Rv0636 and Rv0636-Rv0637, are active only in the presence of ACP derivatives at the low substrate concentration used. These data are reminiscent of what has been observed for the enzymes of the FAS-II systems that have been described, in particular for the mycobacterial enzymes InhA, KasA and KasB, and also for the (3R)-hydroxyacyl-ACP dehydratases of other organisms which, although they show a preference for ACP derivatives, remain active in the presence of CoA derivatives. Furthermore, the two heterodimers exhibit a (3R)-hydroxyacyl dehydratase activity in the presence of the two reductases of the FAS-II complex.
The Rv0636 protein has the following properties:
(i) the ubiquity of the protein among the mycobacterial species and also the presence in related genera, insofar as mycolic acids are compounds characteristic of the Corynebacterineae; (ii) the absence of the protein in Corynebacterium; (iii) the presence in the region of ORF on the chromosome of genes described as being involved in mycolic acid metabolism, and (iv) the presence of a catalytic motif.
More specifically, the sequence encoding the Rv636 protein is located close to the group of genes mmaA1-4 involved in the biosynthesis of oxygenated mycolic acids in M. tuberculosis. In addition, this protein has a conserved characteristic catalytic sequence, known as hydratase 2 motif (
Summary of the Properties of the Rv0636 and Rv0637 Proteins
The Rv0635 and Rv0637 proteins have the following properties:
(i) they are ubiquitous among the mycobacteria and only Rv0635 is found in related genera (Corynebacterinae); (ii) they are absent from Corynebacterium; (iii) their predicted structure corresponds to an SHD fold; (iv) but they do not comprise a catalytic motif; (v) they have the property of associating, independently with Rv0636, forming heterodimers or multimers.
Summary Concerning Enzymes Comprising the Rv0635, Rv0636 and Rv0637 Proteins
Surprisingly, the enzymes comprising the Rv0635, Rv0636 and Rv0637 proteins have the following properties:
(i) they form quaternary structures corresponding to dimers or multimers of Rv0636 alone, of the association Rv0635-Rv0636 and of the association Rv0636-Rv0637;
(ii) the quaternary structures comprising a catalytic subunit and a noncatalytic subunit (Rv0635-Rv0636 and Rv0636-Rv0637) correlate with their specificity for long-chain substrates C12), it being possible for the noncatalytic subunit to play the role of acyl long chain acceptor; (iii) these enzymes appear to exhibit a marked specificity for ACP derivatives compared with CoA derivatives; (iv) their chain-length specificity and their specificity for the acyl-chain carrier group of the substrate correspond to the properties described for the mycobacterial FAS-II system and for the enzymes of which it is composed; (v) the preference of Rv0636-Rv0637 for longer-chain substrates compared with Rv0635-Rv0636, and the absence of a protein orthologous to Rv0637 in the Mycobacterium-related genera comprising short mycolic acids, suggest that Rv0635-Rv0636 is involved in the early elongation cycles catalyzed by FAS-II, whereas Rv0636-Rv0637 is involved in the late elongation cycles.
W. & Liprnan, D. J. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25, 3389-3402.
Number | Date | Country | Kind |
---|---|---|---|
0700930 | Feb 2007 | FR | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FR2008/000162 | 2/8/2008 | WO | 00 | 2/5/2010 |