Colistin Synthetases and Corresponding Gene Cluster

Information

  • Patent Application
  • 20160348088
  • Publication Number
    20160348088
  • Date Filed
    January 26, 2015
    9 years ago
  • Date Published
    December 01, 2016
    8 years ago
Abstract
PmxA synthetase involved in polymyxin E synthesis, comprising four adenylation sites, characterized in that the second adenylation site has at least 90% identity with the peptide sequence SEQ ID NO 1:
Description

The present invention relates to bacterial genes and enzymes involved in the synthesis of polymyxin E, an antiobiotic molecule used in therapy against infections with gram-negative bacteria.


PRIOR ART

Polymyxins are antibiotic molecules isolated from Bacillus species or from Paenibacillus species. These molecules are lipopeptides, consisting:

    • of a peptide chain comprising ten amino acids, organized in a cyclic heptapeptide and a side chain of three amino acids, and
    • of a fatty acid attached at the N-terminal end of the peptide.


At least 16 types of polymyxins have been identified since the discovery of polymyxin B in 1947. Among them, polymyxins B and E have antimicrobial activity on most of the strains of Escherichia coli and of Pseudomonas aeruginosa, and also on all strains of Salmonellae and Shigellae, and of numerous gram-negative bacteria. These antiobiotics have therefore been used as therapeutic agents in many cases. Unfortunately, their toxic effects are such that they have been gradually replaced by antiobiotics that are better tolerated.



FIG. 3 shows the general structure of polymyxins, and the two specific amino acids are indicated X and Y. As indicated, polymyxins B and E differ by a single amino acid, the “X” residue being a phenylalanine for polymyxin B and a leucine polymyxin E, it being possible for the “Y” residue to be a leucine, isoleucine or valine.


Because of the increasing demand for new antibiotics for which there is not yet any resistance in the population, new antiobiotic agents are sought, and polymyxins are again used clinically in combating multiresistant gram-negative bacteria.


Colistin or polymyxin E, identified under CAS number 1264-72-8, and as described in patent application WO 1998/020836, is an antibiotic of the polymyxin family, making it possible to treat infections due to multiresistant gram-negative bacteria. Its peptide structure is indicated in FIG. 1. Administered by injection, colistin is used in the treatment of neuromeningeal infections, urinary infections, urogenital infections, septicemia and superinfections of wounds, burns and ulcers. Administered by inhalation, colistin is used in the treatment of pulmonary infections, in particular those associated with cystic fibrosis. Finally, in combination with hydrocortisone and bacitracin, colistin is used for treating bacterial and inflammatory infections of the eye, and also infections in ophthalmic surgery.


Colistin is responsible for numerous unwanted effects, in particular nephrotoxic and neurotoxic effects. These toxic effects are attributable to the cationic nature of this molecule, which contains five positive charges, but also to the low degree of purity of the molecule, produced in a fermenter by culturing the original bacterial strain which synthesizes it, and purified from the fermentation medium.


Indeed, polymyxins are currently obtained by isolation and purification from culture media of bacterial strains which produce these molecules, in particular strains of Paenibacillus polymyxa. The purification of the molecule from fermentation media is unsatisfactory, and this is why new production techniques are envisioned. In particular, studies are carried out to identify and clone the genes encoding polymyxin synthetases.


Polymyxin synthetases are part of the “Non-Ribosomal Peptide Synthetase” (NRPS) family; the genes encoding these synthetases are organized in a cluster comprising several modules, the order and specificity of which determine the structure of the peptide product. NRPSs allow the synthesis of peptides which exhibit a broader structural variety than that which could be obtained if these peptides were translated from messenger RNA by ribosomes. Furthermore, the peptides thus produced can undergo modifications, in particular through the creation of bonds with hydroxylated acids, and oxidations in the peptide chain making it possible to obtain cyclic structures and also acylations, glycosylations, and an N-methylation of the residues.


Such NRPS complexes have been described in many microorganisms and are involved in the production of tyrocidine, gramicidin, vancomycin, penicillin, and fusaricidins.


NRPS synthetases consist of various modules. Each module is capable of activating an amino acid, optionally of modifying it and of subsequently transferring it onto the activated amino acid in the adjacent module so as to constitute a peptide. Each module is composed of several domains and allow the incorporation of a particular amino acid into the peptide undergoing elongation. Each module contains a minimum of 3 domains:

    • an adenylation domain (A): a domain central in the action of peptide synthetases. It allows the binding of a specific amino acid and its activation through a reaction of adenylation of the latter (conversion of the amino acid to adenylated aminoacyl);
    • a thiolation domain (T or PCP for Peptidyl Carrier Protein): allows the peptide being synthesized to remain bound to the synthetase throughout the elongation process via a thioester bond;
    • a condensation domain (C): allows the formation of the peptide bond between two amino acids of adjacent modules.


The release of the peptide is induced by a specific domain, the Te (thioesterase) domain. This domain will cleave the thioester bond which links the synthesized peptide to the final T domain. This domain also allows the cyclization of certain peptides.


There are also optional secondary domains which make it possible to carry out modifications on the amino acids, such as epimerization (conversion of an amino acid of the L series to an isomer of the D series), methylation (addition of a methyl group) or else formylation (addition of a formyl group).


In the case of polymyxins, and as indicated in FIG. 2, three synthetases are essential for the synthesis of the peptide chain:

    • the PmxA module has four adenylation domains, organized three-dimensionally as “binding pockets” which each integrate a specific amino acid into the polymyxin molecule;
    • the PmxB module has one adenylation domain, and is responsible for the integration of a single amino acid;
    • the PmxE module has five adenylation domains which integrate the other five amino acids which make up the peptide chain of polymyxin.


The article by Choi et al. (Journal of Bacteriology, 2009) and also patent U.S. Pat. No. 8,329,430 describe the isolation of a cluster (group) of genes of the gram-positive strain Paenibacillus polymyxa E681 secreting polymyxin A. Five genes constitute this cluster: pmxA, pmxB, pmxC, pmxD and pmxE. The pmxC and pmxD genes encode transport proteins, while the other three genes, pmxA, pmxB and pmxE, encode synthetases. This is in particular demonstrated by a mutant strain which has undergone a mutation by insertion into the pmxE gene: this strain no longer produces polymyxin. Conversely, a strain of Bacillus subtilis transformed by the introduction of this gene cluster becomes capable of producing polymyxin A, but only in the presence of L-2,4-diaminobutyric acid (Dab, major residue of the polymyxin peptide chain).


In the polymyxins described in the literature, it is very rare to observe the presence of D-Dab residues, except for polymyxins A, C and P.


Shaheen et al. (Chemistry and Biology, 2011) have described the identification and sequencing of a gene cluster derived from the Paenibacillus polymyxa PKB1 strain. The involvement of this genes in polymyxin B biosynthesis is demonstrated by insertional mutation in the pmxE gene: the mutated strain no longer produces polymyxin. During the characterization of the cluster, Shaheen et al. identified an epimerization domain in the third module of the pmx cluster. This suggests the integration of a D-Dab in position 3. Thus, these genes are involved in the biosynthesis of polymyxin B variants, comprising, in position 3, not a D-2,4-diaminobutyrate residue, but its enantiomer, L-2,4-diaminobutyrate.


Other gene clusters derived from Paenibacillus polymyxa strains have been isolated, but their functions and specificities are only predicted and in no way functionally demonstrated.


Moreover, “variant” molecules exhibiting slight structural differences with the known polymyxins, and lesser toxic effects, have been reported. Thus, patent application WO 2009/098357 and also the article by Vaara and Vaara (Peptides, 2010) describe polymyxin B variants comprising only three positive charges and having reduced toxic effects. Polymyxin B variants have also been described in the article by Shaheen et al. (2011).


The technique currently used to produce colistin, based on the fermentation of bacterial producer strains in rich media, is not satisfactory, in particular in terms of purity of the molecule, which must be purified from the fermentation medium.


The identification of the genes involved in colistin synthesis would make it possible, by means of genetic engineering processes, to produce a purer polymyxin E molecule, and also to create derived synthetic molecules, having fewer harmful side effects and/or better activity.


SUMMARY OF THE INVENTION

The present invention describes the isolation of a gene cluster derived from a strain of Paenibacillus alvei isolated from the environment, producing polymyxin E, and uses thereof.


The present invention relates to a PmxA synthetase involved in polymyxin E synthesis, comprising four adenylation domains, characterized in that the second adenylation domain comprises or has a particular peptide sequence represented in SEQ ID NO 1, this sequence forming a binding pocket specific for a leucine, isoleucine or valine residue.


The present invention also relates to a PmxE synthetase involved in polymyxin E synthesis, comprising five adenylation domains and one epimerization domain.


The present application also relates to a group of genes encoding enzymes involved in polymyxin E synthesis, comprising a gene encoding the PmxA synthetase, genes encoding the PmxB and PmxE synthetases, and genes encoding the PmxC and PmxD transport proteins.


The present invention also relates to transformed microorganisms expressing at least the modified or nonmodified pmxA, pmxB and pmxE genes, said transformed microorganisms producing polymyxin E or variant molecules of this polymyxin.





FIGURES


FIG. 1. Peptide structure of polymyxin E.



FIG. 2. Structure of the NRPS cluster involved in polymyxin E synthesis.



FIG. 3. Chemical structures of the various polymyxins B and E according to Govaerts and collaborators (Govaerts et al., 2002a; Govaerts et al., 2002b).



FIG. 4. Antimicrobial effect of 20 μl of the culture supernatant of the B-LR bacterium and of mutant 5 on a bacterial layer of P. aeruginosa.



FIG. 5. PmxA peptide sequence; in bold: adenylation domain sequences; in bold and underlined: specific motifs of the binding pockets.



FIG. 6. (A) MS/MS spectrum corresponding to the fragmentation of the doubly charged precursor ion (m/z 585.39). (B) Structure proposed for the corresponding peptide: colistin E1. FA: fatty acid (C9H17O); L-Dab: L-2,4-diaminobutyric acid; Thr: threonine; Leu: leucine. The numbers in boxes correspond to the ions produced in the first series, the encircled numbers correspond to the ions produced in the second series.



FIG. 7. (A) MS/MS spectrum corresponding to the fragmentation of the doubly-charged precursor ion (m/z 578.38). (B) Structure proposed for the corresponding peptide: colistin E2. FA: fatty acid (C8H15O); L-Dab: L-2,4-diaminobutyric acid; Thr: threonine; leu: leucine. The numbers in boxes correspond to the ions produced in the first series.



FIG. 8. (A) MS/MS spectrum corresponding to the fragmentation of the doubly charged precursor ion (m/z 571.38). (B) Structure proposed for the corresponding peptide: Val-E2. FA: fatty acid (C8H15O); L-Dab: L-2,4-diaminobutyric acid; Thr: threonine; Ile: isoleucine; Leu: leucine; Val: valine. The numbers in boxes correspond to the ions produced in the first series, the encircled numbers correspond to the ions produced in the second series.





DETAILED DESCRIPTION

Polypeptides and Polynucleotides


The invention relates to the isolation and identification of new genes encoding polymyxin E synthetases, in particular of a gene encoding the “PmxA” synthetase having a particular adenylation site, forming a specific binding pocket allowing the integration into the polymyxin molecule of a leucine or isoleucine residue.


All the technical terms used in the present application are well known to the person skilled in the art and are defined in the reference book by Sambrook et al. “Molecular Cloning: a Laboratory Manual”.


The term “polynucleotide” denotes a chain of covalently bonded nucleotides. For the purposes of the invention, this term denotes nucleic acids such as ribonucleic acid (RNA) and deoxyribonucleic acid (DNA).


For the purposes of the present invention, the “percentage identity” between two nucleic acid sequences is determined by comparing the two optimally aligned sequences through a comparison window.


The part of the nucleotide sequence in the comparison window may comprise additions or deletions (for example gaps) compared with the reference sequence (which does not comprise these additions or these deletions) so as to obtain optimal alignment between the two sequences.


The percentage identity is calculated by determining the number of positions at which an identical nucleic base is observed for the two sequences compared, then by dividing the number of positions at which there is identity between the two nucleic bases by the total number of positions in the comparison window, then by multiplying the result by one hundred in order to obtain the percentage nucleotide identity of the two sequences with respect to one another.


The optical alignment of the sequences for the comparison may be carried out by a computer using known algorithms.


Entirely preferably, the percentage sequence identity is determined using the CLUSTAL W software (version 1.82), the parameters being fixed as follows: (1) CPU MODE=ClustalW mp; (2) ALIGNMENT=“full”; (3) OUTPUT FORMAT=“aln w/numbers”; (4) OUTPUT ORDER=“aligned”; (5) COLOR ALIGNMENT=“no”; (6) KTUP (word size)=“default”; (7) WINDOW LENGTH=“default”; (8) SCORE TYPE=“percent”; (9) TOPDIAG=“default”; (10) PAIRGAP=“default”; (11) PHYLOGENETIC TREE/TREE TYPE=“none”; (12) MATRIX=“default”; (13) GAP OPEN=“default”; (14) END GAPS=“default”; (15) GAP EXTENSION=“default”; (16) GAP DISTANCES=“default”; (17) TREE TYPE=“cladogram” and (18) TREE GAP DISTANCES=“hide”.


The term “polypeptide” denotes a chain of covalently bonded amino acids.


The term “polymyxin E synthetases” denotes the enzymes capable of integrating a specific amino acid into the chain of amino acids forming polymyxin E, and where appropriate of converting the amino acid so as to form a cyclic structure.


The term “adenylation site” or “adenylation domain” denotes, in the polymyxin synthetase molecule, the domain which plays a role in the choice and activation of the amino acid, while the “condensation domain” catalyzes the formation of a peptide bond and the “thiolation domain” is responsible for the transport of the compounds undergoing formation between the modules.


The term “epimerization site” denotes, in a synthetase, a domain which makes it possible to convert a residue which has a levorotary chirality “L” into its enantiomer of dextrorotary chirality “D”, in particular an L-α,γ-diaminobutyric acid (L-Dab) into a D-α,γ-diaminobutyric acid (D-Dab).


The term “binding pocket” denotes a zone of the protein which has a three-dimensional structure in the form of a “pocket”, in which high-specificity interactions with a substrate or, in the present case, an amino acid, will make it possible to select and activate this amino acid.


The expression “the underlined amino acids being conserved” means that, even if slight sequence variations may be observed between two proteins having the same catalytic activity, the amino acids indicated are essential to the specificity and to the activity of this protein and may not therefore be modified, without risk of losing or reducing the specificity and/or the activity of this protein.


The present invention relates to a PmxA synthetase involved in polymyxin E synthesis, comprising four adenylation sites, characterized in that the second adenylation site has at least 90% identity with the following peptide sequence:









(SEQ ID NO 1)


VTEAEKADLLGRFNDTTTEFPRGKTLIQLFEEQVERIPDAAAITLNEQE





LTYRELNERVNRLARTLRSHGISKGRLVAILAERSIEMVVGMLAAHKAG





AAYVPIDPEYPEERIRFLIEDSGGQVMLTQSRLRERLAGSDPVILLDDE





SFYHEDGTNLNTGIEATDLACVIYTSGTTGKPKGNPVSHRNIVRVVQNT





NYIDITERDHVLQLSSYSFDGATFDIFGALTNGARLVLVPYETLLEIGR





LADLIQRERISVMFITTAFFNILVDVNVDCLRDVRAILFGGERVSVGHV





RKALAHIGPGRLNHVYGPTESTVYTTYLPVDFVDELAVTVPIGRPISNT





TVYIVDSRNKLLPIGVAGELCVGGEGLVRGYNNRPELTAEKFVDNPFVP





GERMYRTGDLAKWLPDGTIEYVGRTDDQVKIRGFRIELGEIEAQLQKVE





GIRKTTVFARENASGEKQLCAYYEADCELPAAELKSVLSKELPAYMIPA





YLIQLERLPLTTNGKVDRRSLPAPEESLQPGGG,






the underlined amino acids DGFFLGVVYK being conserved and forming a binding pocket specific for a leucine, isoleucine or valine residue.


According to one particular aspect of the invention, the second adenylation site of PmxA has 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identity with the peptide sequence SEQ ID NO 1, the underlined amino acids DGFFLGVVYK being conserved and forming a binding pocket specific for a leucine, isoleucine or valine residue.


According to one particular aspect of the invention, the second adenylation site of PmxA has a peptide sequence comprising or consisting of the peptide sequence SEQ ID NO 27, the underlined amino acids DGFFLGVVYK being conserved and forming a binding pocket specific for a leucine, isoleucine or valine residue.


According to one particular aspect of the invention, the PmxA synthetase comprises four specific binding pockets comprising the following amino acids:

    • DAWIVGAIVK (SEQ ID NO 2), specific for a leucine residue;
    • DGFFLGVVYK (SEQ ID NO 3), specific for a leucine, isoleucine or valine residue;
    • DVGEISAIDK (SEQ ID NO 4), specific for a diaminobutyric acid residue;
    • DVGEISAIDK (SEQ ID NO 5), specific for a diaminobutyric acid residue.


It is clearly understood that these sequences SEQ ID NO 2, 3, 4 and 5 group together the essential amino acids forming a functional binding pocket in a three-dimensional structure, but are not consecutive amino acids in a peptide sequence.


According to one particular aspect of the invention, the PmxA synthetase comprises the polypeptide sequence presented in SEQ ID NO 6, and shown in FIG. 5, the adenylation domains being indicated in bold, the amino acids forming the binding pocket being underlined. In particular, this sequence may comprise, in addition to the sequence presented in SEQ ID NO 6, additional amino acids at the N- and C-terminal ends.


According to one particular aspect, the polypeptide sequence of the PmxA synthetase consists of the polypeptide sequence presented in SEQ ID NO 6.


According to another aspect of the invention, the PmxA synthetase has the sequence which comprises, or consists of, the peptide sequence presented in SEQ ID NO 25.


The PmxA synthetase has four adenylation sites at the following positions:

    • from residue 234L to residue 751Y;
    • from residue 1741V to residue 2263G;
    • from residue 2815P to residue 3345T;
    • from residue 3900E to residue 4428G.


It is understood that these positions are identified according to a particular peptide sequence, in this case according to SEQ ID NO 6 comprising 4967 amino acids, and may be redefined for sequences of larger size.


In particular, when the PmxA synthetase has a sequence consisting of the sequence SEQ ID NO 25, the four adenylation sites are in the following positions:

    • from residue 265L to residue 782Y;
    • from residue 1772V to residue 2294G;
    • from residue 2846P to residue 3376T;
    • from residue 3931E to residue 4459G.


The present invention also relates to a PmxE synthetase involved in polymyxin E synthesis, comprising five adenylation sites and one epimerization site, characterized in that the epimerization site has at least 90% identity with the peptide sequence as presented in SEQ ID NO 30.


According to one particular aspect of the invention, the peptide sequence of the PmxE synthetase comprises or consists of the sequence represented in sequence SEQ ID NO 14.


This synthetase is responsible for the integration of residues 1 to 5 (see FIG. 1 for the numbering) into the peptide chain of the polymyxin E molecule. This synthetase has the functional characteristic of being able to convert the L-Dab residue into its stereoisomer D-Dab, and of incorporating said stereoisomer in position 3 in a polymyxin E peptide chain. Such a variant polymyxin molecule might have improved antimicrobial properties, as has been proposed in the literature (Hong S Y et al., 1999; Lee D L et al., 2004).


The present invention also relates to a nucleic acid comprising an open reading frame, encoding a synthetase involved in polymyxin E synthesis, having at least 90% identity with the nucleotide sequence SEQ ID NO 7, with the proviso that the nucleotides encoding the underlined amino acids DGFFLGVVYK of the sequence SEQ ID NO 1 are conserved.


According to one particular aspect of the invention, the nucleic acid encoding a synthetase involved in polymyxin E synthesis has 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identity with the nucleotide sequence SEQ ID NO 7, with the proviso that the nucleotides encoding the underlined amino acids DGFFLGVVYK of the sequence SEQ ID NO 1 are conserved.


The expression “the nucleotides [ . . . ] are conserved” indicate that these nucleotides encoding the amino acids DGFFLGVVYK of the sequence SEQ ID NO 1 must be either identical to those observed in the same position in the sequence SEQ ID NO 7, or different but with the proviso that the codons formed by these nucleotides encode the amino acids DGFFLGVVYK underlined in the sequence SEQ ID NO 1.


This is because, and as indicated above, these amino acids are essential to the specificity and to the activity of this synthetase and may not therefore be modified, without the risk of losing or reducing the specificity of this binding pocket.


According to another particular aspect of the invention, the nucleic acid encoding a PmxA synthetase involved in polymyxin E synthesis comprises a nucleotide sequence encoding the second adenylation site of this PmxA synthetase, consisting of the nucleotide sequence presented in sequence SEQ ID NO 28. This sequence encodes the peptide sequence represented in SEQ ID NO 27 and also encodes the peptide sequence represented in SEQ ID NO 1 which has two amino acids fewer than SEQ ID NO 27 at the C-terminal end of the domain.


The present invention also relates to a group of genes encoding enzymes involved in polymyxin E synthesis, comprising in particular:

    • a gene encoding the PmxA synthetase as defined above, and
    • genes encoding respectively the PmxB and PmxE synthetases.


According to one preferred aspect of the invention, this group of genes comprises the pmxE gene encoding a PmxE synthetase comprising an epimerization domain, the peptide sequence of which comprises or consists of the sequence represented in SEQ ID NO 30.


According to one preferred aspect of the invention, this group of genes also comprises genes encoding respectively the PmxC and PmxD transport proteins. The sequences of the PmxC and PmxD proteins isolated from the Paenibacillus alvei strain are shown in sequences SEQ ID NO 12 and NO 13, respectively.


According to another particular aspect of the invention, this group of genes involved in polymyxin E synthesis comprises:

    • a pmxA gene having at least 90% identity with the sequence SEQ ID NO 7, with the proviso that the nucleotides encoding the underlined amino acids DGFFLGVVYK of the sequence SEQ ID NO 1 are conserved,
    • a pmxB gene having at least 90% identity with the sequence SEQ ID NO 8, and
    • a pmxE gene having at least 90% identity with the sequence SEQ ID NO 9.


According to another aspect of the invention, this group of genes involved in polymyxin E synthesis comprises:

    • a pmxA gene having at least 95% identity with the sequence SEQ ID NO 7, with the proviso that the portion of nucleic acid encoding the underlined amino acids DGFFLGVVYK is conserved,
    • a pmxB gene having at least 95% identity with the sequence SEQ ID NO 8, and
    • a pmxE gene having at least 95% identity with the sequence SEQ ID NO 9.


According to another aspect of the invention, this group of genes involved in polymyxin E synthesis comprises:

    • a pmxA gene comprising a nucleotide sequence having 100% identity with the sequence SEQ ID NO 7, and in particular comprising a nucleotide sequence as presented in SEQ ID NO 26,
    • a pmxB gene comprising a nucleotide sequence having 100% identity with the sequence SEQ ID NO 8, and
    • a pmxE gene comprising a nucleotide sequence having 100% identity with the sequence SEQ ID NO 9.


The present invention also relates to a nucleic acid comprising or consisting of a sequence SEQ ID NO 10, or a sequence SEQ ID NO 29, representing the complete sequence of the cluster derived from the Paenibacillus alvei strain, comprising the pmxA, pmxB, pmxC, pmxD and pmxE genes involved in polymyxin E synthesis.


Table 1 below presents the various nucleic acid and protein sequences referenced in the present application:















Sequence





number
Type
Size
Definition







SEQ ID
Protein
523
Peptide sequence of the second


NO 1

amino
adenylation site of a PmxA




acids
synthetase derived from a






Paenibacillus alvei strain



SEQ ID
Protein
10
Binding pocket of the 1st adenylation


NO 2

amino
domain of PmxA, these amino acids




acids
forming a binding pocket specific for





a leucine residue


SEQ ID
Protein
10
Binding pocket of the 2nd adenylation


NO 3

amino
domain of PmxA, these amino acids




acids
forming a binding pocket specific





for a leucine, isoleucine or valine





residue


SEQ ID
Protein
10
Binding pocket of the 3rd adenylation


NO 4

amino
domain of PmxA, these amino acids




acids
forming a binding pocket specific for





a diaminobutyric acid residue


SEQ ID
Protein
10
Binding pocket of the 4th adenylation


NO 5

amino
domain of PmxA, these amino acids




acids
forming a binding pocket specific for





a diaminobutyric acid residue


SEQ ID
Protein
4967
Complete peptide sequence of a PmxA


NO 6

amino
synthetase derived from a Paenibacillus




acids

alvei strain; see also FIG. 5



SEQ ID
DNA
14901
Coding sequence for the PmxA synthetase


NO 7

base
having the sequence SEQ ID NO 6




pairs


SEQ ID
DNA
3306
Coding sequence for the PmxB synthetase


NO 8

base
derived from Paenibacillus alvei




pairs


SEQ ID
DNA
18876
Coding sequence for the PmxE synthetase


NO 9

base
derived from Paenibacillus alvei, having




pairs
the sequence SEQ ID NO 31


SEQ ID
DNA
41169
Sequence encoding a complete cluster


NO 10

base
comprising the pmxA, pmxB, pmxC,




pairs
pmxD and pmxE genes derived from






Paenibacillus alvei



SEQ ID
Protein
1102
Peptide sequence of the PmxB synthetase


NO 11

amino
derived from a Paenibacillus alvei strain




acids


SEQ ID
Protein
608
Peptide sequence of the PmxC protein


NO 12

amino
derived from a Paenibacillus alvei strain




acids


SEQ ID
Protein
577
Peptide sequence of the PmxD protein


NO 13

amino
derived from a Paenibacillus alvei strain




acids


SEQ ID
Protein
6292
Peptide sequence of the PmxE synthetase


NO 14

amino
derived from a Paenibacillus alvei strain




acids


SEQ ID
DNA
20 base
Artificial sequences - oligonucleotides


NO 15-24

pairs
(see Table 2)


SEQ ID
Protein
4999
Complete peptide sequence of a PmxA


NO 25

amino
synthetase derived from a Paenibacillus




acids

alvei strain



SEQ ID
DNA
14997
Coding sequence for the PmxA synthetase


NO 26

base
having the sequence SEQ ID NO 25




pairs


SEQ ID
Protein
525
Peptide sequence of the second


NO 27

amino
adenylation site of a PmxA




acids
synthetase derived from a






Paenibacillus alvei strain



SEQ ID
DNA
1575
Coding sequence for the 2nd adenylation


NO 28

base
domain of PmxA, having the sequence




pairs
SEQ ID NO 27


SEQ ID
DNA
41172
Sequence encoding a complete cluster


NO 29

base
comprising the pmxA, pmxB, pmxC,




pairs
pmxD and pmxE genes derived from






Paenibacillus alvei



SEQ ID
Protein
461
Sequence of the epimerization domain of


NO 30


the PmxE synthetase derived from a






Paenibacillus alvei strain



SEQ ID
Protein
431
Complete peptide sequence of the enzyme


NO 31


allowing Dab biosynthesis, derived from






a Paenibacillus alvei strain



SEQ ID
DNA
1293
Sequence encoding the enzyme having the


NO 32

base
sequence SEQ ID NO 31, derived from




pairs

Paenibacillus alvei










Expression Vectors


The present invention also relates to an expression vector comprising a gene encoding a PmxA synthetase as defined above, or a nucleic acid encoding a PmxA synthetase as defined above, and/or a gene encoding a PmxE synthetase as defined above.


In the context of the invention, the terms “vector”, “expression vector” and “plasmid” are equivalent and are used in accordance with the usual acceptance in the molecular biology, genetic engineering and microbiology field. Very briefly, it is a non-viral DNA molecule housed by a host cell, distinct from the natural chromosomal DNA of said host cell and capable of autonomous replication. A “vector” is obtained by conventional molecular biology and genetic engineering techniques, and is a molecule into which one or more exogenous nucleotide sequences have been inserted (or cloned). The present invention relates to a vector for the expression, by a host cell, of exogenous nucleotide sequences, comprising:

    • an origin of replication,
    • elements allowing the expression of the genes introduced, such as an appropriate promoter, enhancers, etc.,
    • at least one nucleotide sequence of which the expression is desired.


The choice of the vector and, more particularly, of the origin of replication that it carries depends on the host cell that will house it. Depending on the type of host cells, several copies of a vector and/or several different vectors may be introduced into the same host cell, simultaneously or sequentially.


The choice of the vector will also depend on the size of the nucleic acid sequence to be expressed. In particular, for sequences greater than 20 kilobase pairs, fosmids, which are vectors capable of containing large nucleic acid sequences (up to 40 kilobase pairs), will be preferred. It is also possible to use several vectors, each comprising a portion of a set of genes, and to transfer several vectors into the same host cell, this making it possible to express an entire set of genes in the same strain.


Other vectors that may contain sequences of large sizes are cosmids and bacterial artificial chromosomes (BACs).


According to one preferred aspect of the invention, the expression vector totally or partly comprises a group of genes as defined above.


According to another preferred aspect of the invention, the expression vector totally or partly comprises a nucleic acid as defined above, in particular comprising the sequence presented in SEQ ID NO 10 or in SEQ ID NO 29. Preferably, the expression vector is a fosmid comprising all of the DNA sequence encoding the complete cluster comprising the pmxA, pmxB, pmxC, pmxD and pmxE genes derived from Paenibacillus alvei, having the sequence SEQ ID NO 10.


Other Enzymes Involved in Polymyxin Synthesis


Other enzymes are involved in polymyxin synthesis, in particular polymyxin E synthesis. These are enzymes which allow the biosynthesis of the Dab residue, this residue being the residue predominantly integrated into the peptide chain of polymyxin E, and enzymes which allow the bonding of the fatty acid on the peptide part.


An enzyme for biosynthesis of the α,γ-diaminobutyric acid (Dab) residue has been identified and isolated from the bacterial strain of Paenibacillus alvei described in the examples. In particular, the peptide sequence of this enzyme comprises or consists of the peptide sequence presented in sequence SEQ ID NO 31. The nucleic acid encoding this enzyme comprises or consists of the nucleotide sequence presented in sequence SEQ ID NO 32.


Preferably, this enzyme with transaminase activity may be expressed in a microorganism intended to produce polymyxin E, in combination with the PmxA, PmxB and PmxE synthetases, and preferably also with the PmxC and PmxD transport molecules.


This transaminase comprising 431 residues has a sequence homology with enzymes exhibiting similar functions, derived from other microorganisms, as is shown in Table 2 below:













TABLE 2







% Identity






with sequence




derived from


Accession
Length
B-LR (SEQ ID

Microorganism


number
(AA)
NO 31)
Function
of origin







CP000154.1
420
73
Diaminobutyrate--

P. polymyxa






2-oxoglutarate
E681





transaminase


EJW20319.1
389
89
Diaminobutyrate--

P. alvei






2-oxoglutarate
DSM 29





transaminase





EctB


YP_005959913.1
420
74
diaminobutyrate-

P. polymyxa






2-oxoglutarate
M1





transaminase


EGG38373.1
430
50
diaminobutyrate--
P. sp.





2-oxoglutarate
HGF5





transaminase


EGL19185.1
434
65
diaminobutyrate--
P. sp.





2-oxoglutarate
HGF7





transaminase





[Paenibacillus sp.





HGF7]


ABX39524.1
429
50
diaminobutyrate-

Halobacillus






2-oxoglutarate

halophilus






transaminase
DSM 2266









Enzymes which allow the bonding of the fatty acid on the peptide chain of polymyxin E, said enzymes therefore having acyl transferase activity, have also been identified and isolated from the bacterial strain of Paenibacillus alvei described above.


Preferably, at least one acyl transferase will be expressed in a microorganism intended for producing polymyxin E, in combination with the PmxA, PmxB and PmxE synthetases, and preferably also with the PmxC and PmxD transport molecules, and more preferably also with an enzyme for the biosynthesis of the α,γ-diaminobutyric acid (Dab) residue.


Alternatively, the bonding of the fatty acid on the peptide chain of polymyxin E may also be carried out by chemical coupling, according to one of the techniques well known to those skilled in the art.


Host Cells


The term “host cell” or “host microorganism” is used in the context of the present invention to denote a cell which has been transformed, i.e. into which the exogenous DNA has been introduced, this exogenous DNA being in particular in the form of an expression vector comprising a sequence of interest, which will be expressed by means of the cellular machinery of the host cell, capable of synthesizing, from the exogenous DNA, the messenger RNAs and the proteins corresponding to this DNA.


The present application relates in particular to a microorganism transformed by introducing a gene or group of genes as defined above, or one or more expression vectors as presented above.


The invention relates in particular to a microorganism transferred by introducing:

    • a gene encoding a PmxA synthetase as defined above, or
    • a gene encoding a PmxE synthetase as defined above, or
    • an expression vector comprising a DNA sequence encoding a PmxA synthetase as defined above, in particular the sequence as presented in SEQ ID NO 7 or in SEQ ID NO 26.


The invention also relates to a microorganism transformed by introducing a group of genes as defined above, or a nucleic acid encoding all or part of the pmxA, pmxB and pmxE genes, as defined above, or at least one expression vector comprising a gene encoding all or part of the pmxA, pmxB and pmxE genes, as defined above.


According to one particular aspect of the invention, the microorganism is in addition transformed so as to comprise at least one nucleic acid encoding an enzyme involved in the biosynthesis of the Dab residue, and in particular a nucleic acid comprising a sequence comprising or consisting of the sequence SEQ ID NO 32.


According to another aspect of the invention, the microorganism is in addition transformed so as to comprise at least one nucleic acid encoding an enzyme with acyl transferase activity, catalyzing the bonding of a fatty acid on the peptide chain of polymyxin E.


The transformation of microorganisms is a technique commonly used in molecular biology laboratories, which makes it possible to introduce exogenous DNA into the microorganism. Various techniques enable the transformation of a microorganism, and in particular of a competent bacterium, and are all well known to those skilled in the art. Mention may in particular be made of electroporation, and the use of calcium chloride followed by a heat shock.


In one preferred aspect of the invention, in the transformed microorganism, the gene or the group of genes introduced is overexpressed.


The overexpression of a gene may be defined by an increased expression of this gene, i.e. a greater production of messenger RNA and the protein encoded by this gene, in the cell. The expression of the protein will in particular be increased by 50%, by 100%, by 150%, by 200%, or even by 300% compared with the level of endogenous expression observed in a bacterium expressing this gene naturally, before any transformation.


The overexpression of a gene may be obtained in several ways, all well known to those skilled in the art. Mention will in particular be made of: using strong promoters for controlling the level of gene expression and increasing the number of copies of the gene in the cell. Preferably, the vector used will comprise a strong promoter under the control of which the gene will be inserted, and said vector will be present in a high number of copies, in particular 10, 20, 50 or 100 copies in the host cell.


Those skilled in the art will know how to choose the host microorganism most suitable for the expression of a vector comprising the gene(s) according to the invention. In particular, gram-positive bacteria will be preferred. A microorganism which is particularly suitable is a bacterium belonging to the Bacillus or Paenibacillus genus. The species B. subtilis and Paenibacillus polymyxa are particularly suitable for the expression of polymyxin synthetases and the production of polymyxin E. The strains of the Paenibacillus genus, and in particular of Paenibacillus alvei, and more particularly the Paenibacillus alvei strain isolated from the environment by the inventors, may also be used as host microorganism to be transformed with the vector or the nucleic acid as defined above. Finally, synthetic bacteria may also be used in the context of the present invention, as host cells.


Method for Producing Polymyxin E


The present application also relates to a method for producing polymyxin E, comprising:

    • culturing a transformed microorganism as defined above, in an appropriate mineral medium, and
    • purifying, from the culture medium, the polymyxin E produced.


The term “appropriate mineral medium” denotes a culture medium which allows the growth of microorganisms, and which comprises in particular mineral salts and nutrients. A preferred medium has the following specific composition: 0.45% (w/v) anhydrous KH2PO4, 1.13% (w/v) K2HPO4. 3H2O, 0.6% (w/v) (NH4)2SO4, 0.6% (w/v) glucose, 0.001%0 (w/v) thiamine, 0.02% (w/v) MgSO4.7H2O. This medium may also comprise, where appropriate, a precursor required for polymyxin synthesis, in particular diaminobutyric acid.


According to one preferred aspect of the invention, the culturing is carried out at a temperature of 30° C., and lasts at least 25 hours.


Preferably, the culturing takes place in 200 ml of mineral medium in 1 L flasks, with shaking, for 30 h at 30° C.


The present invention also relates to a method for producing polymyxin E variants, comprising culturing a microorganism as defined above, in an appropriate mineral medium, and purifying, from the culture medium, the polymyxin E variant(s).


The present invention also relates to polymyxin E variants obtained according to the production method as described above.


The “polymyxin E variants” denote molecules derived from the structure of polymyxin E (see FIG. 1) and which may for example have one or more different amino acids, without however being classified as belonging to another type of polymyxin.


In particular, the types of molecules detected in the culture medium for the Paenibacillus alvei strain BL-R are polymyxin E variants which have the following structural difference: the residue in position 3 may be a D-Dab residue in place of an L-Dab residue observed in the conventional structure of polymyxin E.


The variant forms may be natural or synthetic. The term “natural variants” is intended to mean variants synthesized by non-transformed microorganisms; the term “synthetic variants” is intended to mean variant forms synthesized by transformed microorganisms, which have not been identified in the natural state.


In particular, the polymyxin synthetases encoded by the pmxA, pmxB and pmxE genes may be modified so as to become specific for the binding of certain amino acids which are part of the composition of polymyxin E, in order to synthesize polymyxin E variants which have fewer cationic charges and which are therefore less toxic to the human organism.


These polymyxin E variants have advantages and in particular a lower toxicity than polymyxin E.


Moreover, the clinical use of various polymyxin E variants makes it possible to reduce the appearance of antiobiotic resistance.


It is probable that, for the variants comprising D-Dab residues in place of L-Dab residues, the antimicrobial activity of these molecules is increased compared with the antimicrobial properties observed with the conventional polymyxins.


The present invention relates to these variants, and also to the use thereof in the treatment of bacterial infections with gram-negative bacteria.


The examples below illustrate the invention claimed, but are not limiting.


EXAMPLES
Example 1
Isolation and Sequencing of the Genetic System for Producing Colistin

A microorganism (B-LR) producing colistin (polymyxins E1 and E2), belonging to the Paenibacillus alvei species, was isolated from the environment and cultured. A genomic DNA library was constructed in Escherichia coli using fosmid vectors (900 clones). The search for the genes encoding the enzymes which synthesize colistin was carried out by degenerate PCRs. The primers used were defined with the aim of amplifying the domains specializing in the integration of diaminobutyric acid (Dab), which is the amino acid most representing the structure of colistin (6 amino acids/12). Three clones of interest were selected from the sequence (Roche GS FLX). The sequences obtained could be assembled over 50 kb. The open reading frames were sought. Five of them called A, B, C, D and E describe a cluster of approximately 41 kb with A (14.9 kb), B (3.3 kb), C (1.8 kb), D (1.7 kb) and E (18.9 kb), which is represented in FIG. 2.


The A, B and E genes were identified, by sequence homology, as encoding synthetases. An in silico study made it possible to predict the involvement of these synthetases in the assembly of colistin (http://nrps.informatik.uni-tuebingen.de/ and http://nrps.igs.umaryland.edu/nrps/). The C and D genes could, for their part, be involved in the export and resistance of the producer microorganism with respect to colistin.









TABLE 3





Primers for amplifying the genes of interest of


the colistin cluster of B-LR
















A 
(F)ATGGCTTTTGAAAAAGAAAC



(R)GAACGCAAATTCGATCGTAT





B
(F)ATGAAATCTTTATTTGAAAA



(R)GCTTCCATGCAGTACCCCGG





E
(F)ATGGAAATTATGAATCCGGG



(R)GAAAATAATATCAATGGCCT





C
(F)ATGGAAGCTGACCGACAGCC



(R)GTGTACGCCACCTCCCTGCG





D
(F)ATGAAAAAGGGCGGATGGCT



(R)GCCGTACAGCCGGGCGTAAT





These oligonucleotides are represented in SEQ ID NOS 15 to 24.






Example 2
Comparison of the Adenylation Domains of Various Paenibacillus Strains Producing Molecules Belonging to the Polymyxin Family

The enzymes involved in polymyxin production belong to the non-ribosome synthetase (NRPS) family. These synthetases are capable of producing a particular peptide without relying on the translation of an mRNA template. The specificity of the peptide chain produced depends on a precise sequence of amino acids of the synthetase constituting an adenylation domain. Four adenylation domains were identified in silico for the synthetase A of B-LR, one for B and five for E. Each adenylation domain contains a signature of ten amino acids which confers it on the specificity of integrating a precise amino acid into the non-ribosomal peptide during elongation. These signatures were identified in silico and compared to those described in the literature (Table 4) using the program available at the following web address: http://nrps.informatik.uni-tuebingen.de/.


The E681 and ATCC21830 strains are described in patent U.S. Pat. No. 8,329,430.


The PKB1 strain is described in the article by Shaheen et al., 2011.


The M-1 strain is described in the article by Niu et al., 2013.









TABLE 4





Amino acids conferring the specificity of the adenylation domains of


the polymyxin synthetases of the E681, PKB1, ATCC21830, M-1 and B-LR strains







Pmx A, 4 domains A
















1

2

3

4






E681
DAWIVGAIVK
Leu
DFWNIGMVHK
Thr
DVGEISAIDK
Dab
DVGEISAIDK
Dab


PKB1
DAWTIAAIAK
Phe
DGFLLGLVYK
Leu
DVGEISAIDK
Dab
DVGEISAIDK
Dab


21830
DAWIVGAIVK
Leu
DGFLLGLVYK
Leu
DVGEISAIDK
Dab
DVGEISAIDK
Dab


M-1
DAWTIAAIAK
Phe
DFWNIGMVHK
Thr
DVGEISAIDK
Dab
DVGEISAIDK
Dab


B-LR
DAWIVGAIVK
Leu
DGFFLGVVYK
Leu/ 
DVGEISAIDK
Dab
DVGEISAIDK
Dab






Ileu/










Val










Pmx B, 1 domain A
















1












E681
DFWNIGMVHK
Thr








PKB1
DFWNIGMVHK
Thr








21830
DFWNIGMVHK
Thr








M-1
DFWNIGMVHK
Thr








B-LR
DFWNIGMVHK
Thr










Pmx E, 5 domains A


















1

2

3

4

5






E681
DVGEISS
Dab
DFWNIGM
Thr
DVGEISS
Dab
DVGEIS
Dab
DVGEIS
Dab



IDK

VHK

IDK

AIDK

AIDK



PKB1
DVGEISS
Dab
DFWNIGM
Thr
DVGEISS
Dab
DVGEIS
Dab
DVGEIS
Dab



IDK

VHK

IDK

AIDK

AIDK



21830
DVGEISS
Dab
DFWNIGM
Thr
DVGEISS
Dab
DVGEIS
Dab
DVGEIS
Dab



IDK

VHK

IDK

AIDK

AIDK



M-1
DVGEISS
Dab
DFWNIGM
Thr
DVGEISS
Dab
DVGEIS
Dab
DVGEIS
Dab



IVK

VHK

IDK

AIDK

AIDK



B-LR
DVGEISS
Dab
DFWNIGM
Thr
DVGELS
Dab
DVGEIS
Dab
DVGEIS
Dab



IDK

VHK

SIDK

AIDK

AIDK









The synthetases of B-LR differ at the level of the second adenylation domain of the PmxA synthetase and of the third domain of the PmxE synthetase.


The comparisons with the protein sequences derived from the BL-R strain with the protein sequences already known are presented in Table 5 below:











TABLE 5









% protein sequence ID











sizes

ATCC














Sequences
pb
aa
PKB1
E681
21830


















A
14901
4967
94
90
96



A
14997
4999
94
90
95



B
3306
1102
96
95
96



E
18876
6292
95
96
84



C
1824
608



D
1731
577










Example 3
Construction of a pmxE-Mutant and Analysis of the Colistin Production Thereof

A sequence of 1650 bp beginning at the end of pmxD and finishing at the start of pmxE was selected and amplified. It has the particularity of possessing a unique SacII enzyme restriction site. The amplified sequence was cloned into the pGEM® 7Z plasmid. The gene encoding apramycin resistance was integrated at the SacII restriction site. This construct was introduced into B-LR by electroporation. The selection of the mutants having undergone a double recombination event was carried out by culturing on agar medium supplemented with apramycin. The antiomicrobial activity of the supernatant of the mutants was compared with that of B-LR. The culture supernatant of mutant 5 is less active than that of the wild-type B-LR bacterium on P. aeruginosa (FIG. 4).


Example 4
Characteristics of the PmxE Synthetase Derived from Paenibacillus

The peptide sequence of the PmxE synthetase is presented in sequence SEQ ID NO 14. The sequence of the epimerization domain is presented in SEQ ID NO 30; it comprises at least 461 residues comprised between and including the residues Val3205 to Leu3665 of the sequence SEQ ID NO 14. One of the characteristics of non-ribosomal peptides is the presence of amino acids with (L) stereoisomerism. Usually, an (L) amino acid is activated by the A domain, which then incorporates it directly. However, sometimes, this amino acid of the (L) series may be converted into its (D) form by any epimerization domain, before the formation of the peptide bond. According to the in silico predictions, two epimerization domains were identified in modules 3 and 6 of the pmx cluster. This suggests that a stereochemical change in the amino acids present at these sites is possible. Thus, the amino acids present in position 3 (Dab-3) and in position 6 (Leu-6) would be incorporated into the peptide chain in their (D) form. This was known for Leu-6, incorporated in its (D) form into all polymyxins E, but is unusual for the Dab-3 residue.


The PmxE synthetase according to the invention therefore has the property of converting the L-Dab residue into its stereoisomer (D), and of integrating it in position 3 (see FIG. 1) in the polymyxin E molecule synthesized.


Said polymyxin E molecules comprising a D-Dab residue in position 3 are polymyxin E variants.


Example 5
Production of Polymyxin E by a Microorganism Expressing the Combination of a PmxA, a PmxB and a PmxE Characterized in the Preceding Examples

The microorganism (B-LR) produces several types of polymyxin E molecules comprising a leucine, isoleucine or valine residue in position 7.


These various types of molecules could be detected in the culture medium by mass spectrometry, according to the protocol detailed below:


A control solution of colistin sulfate at 0.0002% was prepared in the microorganism culture medium (M63T medium). Four fractions were recovered and analyzed by UPLC-MS (Ultra Performance Liquid Chromatography—tandem mass spectrometry). The two peaks corresponding to the polymyxins E1 and E2 present in the positive control were found in the fourth extraction fraction.


The B-LR strain exhibits a maximum level of antimicrobial activity at 30 hours of culture. At this time, the culture supernatant was recovered and filtered (0.22 μm) before being subjected to a solid-phase extraction step. The peptides of the fourth fraction were analyzed by UPLC-MS. Seventeen elution peaks were observed in this fourth fraction. Among them, the main peaks exhibiting retention times of 1.61 and 1.52 minutes have [M+H]+ molecular masses respectively of 1169.7727 and 1155.7555. They correspond respectively to the known polymyxins E1 and E2, which have the same peptide structure and differ by their fatty acids, respectively of formulae C9H17O and C8H15O.


MS/MS analyses were carried out in order to confirm these structures, as have previously been presented in the articles by Govaerts et al., 2002; and by DeCrescenzo et al., 2007. Tandem mass spectrometry (MS/MS) is used to determine the series of amino acids through fragmentation of the peptide bond. This technique consists in combining two analyzers separated by a collision cell. A first step consists in selecting an ion derived from the source of ionization in the quadrupole: this is the parent or precursor ion. This ion undergoes fragmentation in the collision cell under the effect of a bombardment by argon atoms, and gives “daughter” or “product” ions that will be analyzed and protected by the time-of-flight (TOF) analyzer. The TOF analyzer is based on the measurement of the time taken by the ions to cover a given distance, thereby making it possible to detect the m/z ratios and thus to determine the mass of the corresponding ions.


During these experiments, the leucine and isoleucine residues cannot be differentiated, because of their identical molecular masses.


Three decapeptides with a ring containing seven amino acids were purified from the B-LR supernatant. Their molecular weights range from 1140.74 Da to 1168.77 Da. The comparison of these molecules shows homologies at the level of residues 1 to 6 and 8 to 10 and differences at the level of residue 7 and also at the level of the fatty acid. These three molecules correspond to polymyxins (colistins) E1, E2 and to Val-colistin E2. The deduced chemical formulae thereof are presented in Table 6 below.













TABLE 6






Fatty acid

Mass
Molecular


Name
(FA)
Y
(Da)
formula







Colistin E1
C9H17O
Leu/Ile
1168.77
C53H101N16O13


Colistin E2
C8H15O
Leu/Ile
1154.76
C52H99N16O13


Val-Colistin E2
C8H15O
Val
1140.74
C51H97N16O13









The mass spectra obtained for colistin E1, colistin E2 and val-colistin E2 have respective [M+2H]2+ values of 585.39, 578.38 and 571.38. The fragmentation of the parent ions of colistins E1 and E2 give daughter ions of m/z 829, 728, 628, 427, 327 and 227 (FIGS. 6 and 7) which are formed by loss of amino acid fragments. The fragmentation of the parent ion of val-colistin E2 gives daughter ions of m/z 815, 714, 614, 514, 413, 313 and 213 (FIG. 8) also formed by loss of amino acid fragments.


These daughter ions obtained are identical to those described in the literature during the fragmentation of colistins E1, E2 and Val-E2 (Govaerts et al., 2002; DeCrescenzo et al., 2007). Colistins E1 and E2 have the same amino acid sequences with a molecular mass difference of 14 Da which corresponds to the loss of a CH2 at the level of the fatty acid.


In conclusion, these results show that the B-LR strain produces colistin E1, colistin E2 and valine-colistin E2 (Val-E2).


Example 6
Heterologous Expression of the pmxA, B and E Genes Derived from Paenibacillus in Bacillus subtilis and Detection of Colistin


Bacillus is a heterologous expression host that is phylogenetically close to B-LR. The fosmid manipulated to house all of the colistin production cluster is transferred into the recipient strain by electroporation. The transformed strains are selected on agar medium supplemented with an antibiotic. The colistin production is carried out in liquid medium supplemented with diaminobutyric acid (synthesis precursor). The colistin is detected after separation of the biomass from the culture supernatant. Extractions make it possible to isolate the colistin before characterization thereof by mass spectrometry.


REFERENCES
Patents



  • U.S. Pat. No. 8,329,430

  • WO 1998/020836

  • WO 2009/098357



Non-Patents



  • Sambrook et al. ‘Molecular Cloning: a Laboratory Manual’. Second Edition, 1989, Cold Spring Harbor Lab., Cold Spring Harbor, N.Y.

  • Niu B, Vater J, Rueckert C, Blom J, Lehmann M, Ru J J, Chen X H, Wang Q, Borriss R. “Polymyxin P is the active principle in suppressing phytopathogenic Erwinia spp. by the biocontrol rhizobacterium Paenibacillus polymyxa M-1”. BMC Microbiol. 2013 Jun. 18; 13(1):137.

  • Choi S K, Park S Y, Kim R, Kim S B, Lee C H, Kim J F, Park S H—“Identification of a polymyxin synthetase gene cluster of Paenibacillus polymyxa and heterologous expression of the gene in Bacillus subtilis” J. Bacteriol. May 2009 vol. 191 no. 10-3350-3358

  • M Shaheen, J Li, A C Ross, J C Vederas, S E Jensen—“Paenibacillus polymyxa PKB1 Produces Variants of Polymyxin B-Type Antibiotics”—Chemistry & biology, Volume 18, Issue 12, 23 Dec. 2011, Pages 1640-1648

  • Martti Vaara, Timo Vaara—“Structure-activity studies on novel polymyxin derivatives that carry only three positive charges”—Peptides, Volume 31, Issue 12, December 2010, Pages 2318-2321

  • Govaerts, C., Orwa, J., Van Schepdael, A., Roets, E. & Hoogmartens, J. Characterization of polypeptide antibiotics of the polymyxin series by liquid chromatography electrospray ionization ion trap tandem mass spectrometry.—J Pept Sci 8, 45-55. (2002a).

  • Govaerts, C., Orwa, J., Van Schepdael, A., Roets, E. & Hoogmartens, J. Liquid chromatography-ion trap tandem mass spectrometry for the characterization of polypeptide antibiotics of the colistin series in commercial samples. J Chromatogr A 976, 65-78. (2002b).

  • Hong S Y, Oh J E, Lee K H. Effect of D-amino acid substitution on the stability, the secondary structure, and the activity of membrane-active peptide. Biochem Pharmacol. 1999 Dec. 1; 58(11):1775-80.

  • Lee D L, Powers J P, Pflegerl K, Vasil M L, Hancock R E, Hodges R S. Effects of single D-amino acid substitutions on disruption of beta-sheet structure and hydrophobicity in cyclic 14-residue antimicrobial peptide analogs related to gramicidin S. J Pept Res. 2004 February; 63(2):69-84.

  • DeCrescenzo Henriksen, D. R. Phillips and J. B. Doran Peterson. Polymyxin E production by P. amylolyticus E. Letters in Applied Microbiology, Volume 45, Issue 5, pages 491-496, November 2007.


Claims
  • 1.-20. (canceled)
  • 21. PmxA synthetase involved in polymyxin E synthesis, comprising four adenylation sites, characterized in that the second adenylation site has at least 90% identity with the peptide sequence SEQ ID NO 1:
  • 22. PmxA synthetase according to claim 21, characterized in that it comprises four specific binding pockets comprising the following amino acids: DAWIVGAIVK (SEQ ID NO 2), specific for a leucine residue;DGFFLGVVYK (SEQ ID NO 3), specific for a leucine, isoleucine or valine residue;DVGEISAIDK (SEQ ID NO 4), specific for a diaminobutyric acid residue;DVGEISAIDK (SEQ ID NO 5), specific for a diaminobutyric acid residue.
  • 23. PmxA synthetase according to claim 21, characterized in that it comprises the peptide sequence SEQ ID NO 6, and preferably comprises the peptide sequence SEQ ID NO 25.
  • 24. PmxE synthetase involved in polymyxin E synthesis, comprising five adenylation sites and one epimerization site, characterized in that the epimerization site has at least 90% identity with the peptide sequence SEQ ID NO 30.
  • 25. Nucleic acid comprising an open reading frame, encoding a synthetase involved in polymyxin E synthesis according to claim 21, having at least 90% identity with the nucleotide sequence SEQ ID NO 7, with the proviso that the nucleotides encoding the underlined amino acids DGFFLGVVYK of the sequence SEQ ID NO 1 are conserved.
  • 26. Group of genes encoding enzymes involved in polymyxin E synthesis, comprising a gene encoding a PmxA synthetase according to claim 21, and genes encoding respectively the PmxB and PmxE synthetases.
  • 27. Group of genes according to claim 26, characterized in that it also comprises genes encoding respectively the PmxC and PmxD transport proteins.
  • 28. Group of genes according to claim 26, comprising: a pmxA gene having at least 90% identity with the sequence SEQ ID NO 7, with the proviso that the nucleotides encoding the underlined amino acids DGFFLGVVYK of the sequence SEQ ID NO 1 are conserved,a pmxB gene having at least 90% identity with the sequence SEQ ID NO 8, anda pmxE gene having at least 90% identity with the sequence SEQ ID NO 9.
  • 29. Nucleic acid of sequence SEQ ID NO 10 or of sequence SEQ ID NO 29, comprising a group of genes involved in polymyxin E synthesis.
  • 30. Expression vector comprising a gene encoding a PmxA synthetase according to claim 21, or a nucleic acid according to claim 25, and/or a gene encoding a PmxE synthetase according to claim 24.
  • 31. Expression vector comprising a gene encoding a PmxA synthetase according to claim 21, and a gene encoding a PmxE synthetase according to claim 24.
  • 32. Expression vector comprising a nucleic acid according to claim 25.
  • 33. Expression vector comprising a nucleic acid according to claim 25 and a gene encoding a PmxE synthetase according to claim 24.
  • 34. Expression vector comprising a gene encoding a PmxE synthetase according to claim 24.
  • 35. Expression vector totally or partly comprising the group of genes according to claim 26.
  • 36. Expression vector totally or partly comprising the nucleic acid according to claim 29.
  • 37. Microorganism transformed by introducing a gene encoding a PmxA synthetase according to claim 21.
  • 38. Microorganism transformed by introducing a nucleic acid according to claim 25.
  • 39. Microorganism transformed by introducing a vector according to claim 30.
  • 40. Microorganism transformed by introducing a group of genes according to claim 26.
  • 41. Microorganism transformed by introducing a nucleic acid according to claim 29.
  • 42. Microorganism transformed by introducing at least one vector according to claim 36.
  • 43. Transformed microorganism according to claim 37, characterized in that the gene is overexpressed.
  • 44. Transformed microorganism according to claim 37, characterized in that it belongs to the Bacillus or Paenibacillus genus.
  • 45. Method for producing polymyxin E, comprising culturing a microorganism according to claim 37 in an appropriate mineral medium, and purifying, from the culture medium, the polymyxin E produced.
  • 46. Method for producing polymyxin E, comprising culturing a microorganism according to claim 44 in an appropriate mineral medium, and purifying, from the culture medium, the polymyxin E produced
  • 47. Method for producing polymyxin E according to claim 45, characterized in that the culturing is carried out at a temperature of 30° C., and lasts at least 25 hours.
  • 48. Method for producing polymyxin E variants, comprising culturing a microorganism according to claim 37 in an appropriate mineral medium, and purifying, from the culture medium, the polymyxin E variant(s).
  • 49. Polymyxin E variants obtained according to the production method of claim 48.
  • 50. Polymyxin E variants according to claim 49, for use thereof in the treatment of bacterial infections with gram-negative bacteria.
Priority Claims (1)
Number Date Country Kind
1450641 Jan 2014 FR national
PCT Information
Filing Document Filing Date Country Kind
PCT/IB2015/050571 1/26/2015 WO 00