The present invention generally relates to biotechnology engineering, and specifically to a genetically engineered bacterium capable of producing cytokinins with isoprenoid side chains (isoprenoid cytokinins), and the preparation and application thereof.
Cytokinins are essential plant hormones that control numerous plant growth and developmental processes from seed germination to plant and leaf senescence, characterized by their ability for induction of cell division (Mok, Martin, and Mok 2000). Naturally occurring cytokinins are adenine derivatives with either an aromatic side chain (aromatic cytokinins) or an isoprene-derived side chain at the N6-terminus of adenine (isoprenoid cytokinins). The N6-side-chain structure and configuration determine both the cytokinin type and its activity. The isoprenoid cytokinins are widespread and are produced naturally in plants and algae, and also in many species of bacteria, fungi, nematodes, and parasitic insects (Stirk and van Staden 2010). Natural isoprenoid cytokinins are: N6-(D2-isopentenyl)adenine (iP), trans-zeatin (tZ), cis-zeatin (cZ) and dihydrozeatin (DZ) (
Isoprenoid cytokinins are derivatives of adenine or adenosine in which the exocyclic amino group at 6 has been modified by attachment of a dimethylallyl side chain, resulting in the parent compound N6-(D2-isopentenyl)adenine (iP) and its riboside, N6-(D2-isopentenyl)adenosine (iPR). The dimethylallyl side chain can be hydroxylated at 9′ to give trans-zeatin (tZ) or its riboside, trans-ribosylzeatin (tZR). Also, the exocyclic double bond can be reduced to give dihydrozeatin (DZ) or its riboside, ribosyl dihydrozeatin (DZR) (
The majority of naturally occurring cytokinins exist in distinctive structural derivatives or forms, such as free bases, ribosides, and nucleotides, or conjugates with glucose, xylose, or amino acid residues. In plants, the cytokinin free bases are considered the most biologically active forms while glucose conjugates are considered to be either permanently inactive or reversible storage forms, depending on the location of glycosylation.
Zeatin is the predominant form of cytokinin in plants. Zeatin is an adenine derivative with a hydroxylated isoprene-derived side chain at the N6 position of adenine, which can exist in cis- or trans-configuration (
Despite the structural similarity, cis-zeatin is synthesized through the tRNA pathway, whereas in contrast, trans-zeatin, and biosynthetically related compounds iP and DZ are synthesized in plant cells through the de novo (or AMP) biosynthesis pathway.
In the tRNA pathway, cis-zeatin is a recycled product of degradation of isopentenylated tRNAs. Cis-zeatin is synthesized in almost all organisms except Archaea (Schafer et al. 2015) at extremely low rates by tRNA-isopentenyltransferases (tRNA-IPTs) that catalyze the prenylation of adenine 37 on specific (UNN-)tRNAs leading to the formation of isopentenyl adenine (IP)-containing tRNA. In the de novo cytokinin biosynthesis pathway, the first step is N-prenylation of adenosine 5′-phosphates (AMP, ADP, or ATP) with dimethylallyl diphosphate (DMAPP) catalyzed by adenylate isopentenyltransferase (IPT, EC 2.5.1.27), which produces isopentenyladenine nucleotide (iP). iP, produced by IPT in plants, then undergoes hydroxylation at the prenyl side chain to result in tZ-nucleotides. In Arabidopsis, two cytochrome P450 monooxygenases, CYP735A1, and CYP735A2, catalyze the hydroxylation reaction. The CYP735As preferentially utilize iP-nucleotides rather than the iP-nucleoside and iP. Since this reaction is stereo-specific, the CYP735As produce tZ-nucleotides (Takei, Yamaya, and Sakakibara 2004). In the final step of both pathways, the de novo and the tRNA pathway, the cytokinin-activating enzyme cytokinin riboside 5-monophosphate phosphoribohydrolase ‘Lonely guy’ (LOG, EC 3.2.2.n1) removes the ribosyl moiety and converts cytokinin nucleotides to their active nucleobases (Kurakawa et al. 2007). All four cytokinin nucleoside monophosphates, iPRMP, tZRMP, DZRMP, and cZRMP are utilized by LOG.
In the case of plant infection by phytopathogenic bacteria, such as Agrobacterium tumefaciens, biosynthesis of tZ is initiated to facilitate the infection. During the infection process, tZ or iP biosynthesis can occur inside the bacterial cells or bacterial IPT gene homologs can be integrated into the host's nuclear genome and expressed in the infected plant cells.
Importantly, substrate specificities differ between IPTs of bacteria, and higher plants (Kakimoto 2001; Sakakibara 2005). In the first aspect, bacterial IPTs, such as Tmr and Tzs from A. tumefaciens, use only AMP as an acceptor (whereas plant IPT enzymes preferentially use ADP and ATP) to form trans-zeatin riboside 5′-monophosphate (tZRMP) (Sakakibara 2006; Kamada-Nobusada and Sakakibara 2009). Besides, Agrobacterium IPTs Tzs and Tmr are capable of using either DMAPP or 1-hydroxy-2-methyl-2-butenyl 4-diphosphate (HMBDP), the hydroxylated precursor of DMAPP in the methylerythritol phosphate (MEP) pathway, as the side chain donor. When DMAPP is used as a substrate, the primary product is iPRMP, whereas tZRMP is formed when IPT utilizes HMBDP. Therefore, the expression of Agrobacterium IPT Tmr in the chloroplasts of the plant cells during the infection creates a metabolic bypass for direct synthesis of tZRMP, sequestering the efficient substrate supply of isoprenoid precursors HMBDP in plastids of the plant cells. As in the native pathway, LOG phosphoribohydrolase finally releases the ribose monophosphate moiety from either iP- or tZ-nucleoside monophosphate (iPRMP, tZRMP), creating the biologically active molecule iP and tZ.
The tumor-inducing plant pathogenic bacterium A. tumefaciens has 2 IPTs: Tmr and Tzs. Tmr and Tzs are homologous proteins, both are DMAPP:AMP isopentenyltransferases, but their amino acids crucial for substrate recognition differ from those of plant adenylate IPTs (Chu et al., 2010). The tmr (ipt) gene is located in the T-region of the Ti-plasmid, which mediates infection in host plants. The tmr gene is transferred from the bacterium to the plant genome to force the host plants to produce cytokinins, which results in tumor formation. In vitro experiments demonstrated that Tmr transferred both DMAPP and HMBDP to AMP with similar Km values (Sakakibara et al. 2005). Nopaline-producing strains of A. tumefaciens possess another gene for DMAPP:AMP isopentenyltransferase, tzs, which is present in the vir region of Ti-plasmid and is not translocated to the plant cells. Tzs enables the high level of cytokinin production and secretion by these A. tumefaciens strains (Morris et al. 1993), and can also use HMBDP or DMAPP to produce iPRMP and tZRMP. Tzs has 51.3% protein sequence identity to Tmr. Genes coding for DMAPP:AMP isopentenyltransferases homologous to tmr/tzs are present in other bacteria of the genus Agrobacterium, A. vitis and A. rhizogenes, as well as in other plant-pathogenic bacteria, such as Pseudomonas syringae pv. savastanoi, Pseudomonas solanacearum, Pantoea agglomerans, and Rhodococcus fascians (Kakimoto, 2003).
Isopentenyl donor, dimethylallyl diphosphate (DMAPP), and its precursor 1-hydroxy-2-methyl-2-butenyl 4-diphosphate (HMBDP) are produced by the methylerythritol phosphate (MEP) pathway in chloroplasts of plants and bacteria, also in Bacillus subtilis. In the MEP pathway, pyruvate and glyceraldehyde-3-phosphate are condensed by the enzyme 1-deoxy-D-xylulose-5-phosphate synthase (Dxs, EC 2.2.1.7) into a metabolic cascade ultimately producing HMBDP, DMAPP, and isoprene or larger terpenoid compounds. The first step in the MEP pathway, mediated by DXS, is the rate-limiting step for isoprenoid production in plants and bacteria (Julsing et al. 2007).
The natural biosynthesis rate of cytokinins in plant-pathogenic bacteria through the de novo pathway during infection is very low. It depends on the expression of the IPT enzyme of bacterial origin, the LOG enzyme, and the building blocks, supplied by the metabolism of the host cells. Such an infection-based system is not readily transferable to industrial-scale production. Due to the high activity and potential of using cytokinin hormones in agricultural applications, there is thus a need for efficient and sustainable production of cytokinins such as iP, tZ, and ribosides tZR and iPR in genetically and biotechnologically amenable host strains.
The object of the present invention is to provide means allowing more efficient production of cytokinins with isoprenoid side chains (isoprenoid cytokinins), such as tZ and iP, and their ribosides tZR and iPR. More particularly, it is an object of the present invention to provide means allowing the production of cytokinins with isoprenoid side chains (isoprenoid cytokinins), such as tZ and iP, and their ribosides tZR and iPR, at higher nominal yield.
This is achieved by the present inventors who have engineered bacterial strains, which a) express a heterologous polypeptide having adenylate isopentenyltransferase activity and optionally b) have been modified to have an increased protein expression of a polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity. As shown in the Examples, such engineered bacterial strains surprisingly show unusually high titers of isoprenoid cytokinins of over 10 mg/L in the supernatant. In consequence, this means that the context of plant cell infection is no longer required and the biosynthetic substrates and cofactors for efficient biosynthesis of isoprenoid cytokinins such as tZ, and iP, and their ribosides tZR and iPR are effectively supplied by the engineered bacterial cell.
The present invention thus provides in a first aspect a bacterium expressing a heterologous polypeptide having adenylate isopentenyltransferase activity. More particularly, the present invention provides a bacterium, which a) expresses a heterologous polypeptide having adenylate isopentenyltransferase activity and optionally b) has been modified to have an increased protein expression of a polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity compared to an otherwise identical bacterium that does not carry said modification.
The present invention further provides in a second aspect a method for producing a cytokinin or riboside derivative thereof, particularly an isoprenoid cytokinin or riboside derivative thereof, comprising cultivating a bacterium according to the present invention under suitable culture conditions in a suitable culture medium.
The present invention may be summarized by the following items:
The present invention is now described in more detail below.
Unless specifically defined herein, all technical and scientific terms used have the same meaning as commonly understood by a skilled artisan in the fields of biochemistry, genetics, and microbiology.
All methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, with suitable methods and materials being described herein. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will prevail. Further, the materials, methods, and examples are illustrative only and are not intended to be limiting, unless otherwise specified.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, and recombinant DNA, which are within the skill of the art. Such techniques are explained fully in the literature. See, for example, Current Protocols in Molecular Biology (Frederick M. AUSUBEL, 2000, Wiley and son Inc, Library of Congress, USA); Molecular Cloning: A Laboratory Manual, Third Edition, (Sambrook et al, 2001, Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press); Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Pat. No. 4,683,195; Nucleic Acid Hybridization (B. D. Harries & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); the series, Methods In ENZYMOLOGY (J. Abelson and M. Simon, eds.-in-chief, Academic Press, Inc., New York), specifically, Vols. 154 and 155 (Wu et al. eds.) and Vol. 185, “Gene Expression Technology” (D. Goeddel, ed.).
Bacterium of the Invention
As indicated above, the present inventors have engineered bacterial strains, which a) express a heterologous polypeptide having adenylate isopentenyltransferase activity and optionally b) have been modified to have an increased protein expression of a polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity. As shown in the Examples, such engineered bacterial strains surprisingly show unusually high titers of isoprenoid cytokinins of over 10 mg/L in the supernatant. In consequence, this means that the context of plant cell infection is no longer required and the biosynthetic substrates and cofactors for efficient biosynthesis of isoprenoid cytokinins such as tZ, and iP, and their ribosides tZR and iPR are effectively supplied by the engineered bacterial cell.
The present invention thus provides in a first aspect a bacterium expressing a heterologous polypeptide having adenylate isopentenyltransferase activity. More particularly, the present invention provides a bacterium, which a) expresses a heterologous polypeptide having adenylate isopentenyltransferase activity and optionally b) has been modified to have an increased protein expression of a polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity compared to an otherwise identical bacterium that does not carry said modification.
Adenylate isopentenyltransferases (IPTs) are a well-defined class of enzymes catalyzing the first step in the de novo cytokinin biosynthesis pathway, the N-prenylation of adenosine 5′-phosphates (AMP, ADP, or ATP) with either dimethylallyl diphosphate (DMAPP) or 1-hydroxy-2-methyl-2-butenyl 4-diphosphate (HMBDP). There are two types of IPTs (EC 2.5.1.27 and EC 2.5.1.112). The IPT enzymes are present in bacteria and plants. IPTs from bacteria, such as from Agrobacterium which contains two IPT homologs, Tzs and Tmr, use AMP as a prenyl acceptor, and HMBDP or DMAPP as the donor (EC 2.5.1.27). The product of the reaction with HMBDP is tZ-nucleotide. Moreover, IPTs from bacteria belong to the Pfam family IPT (PF01745) in the Pfam database of protein families and domains (https://pfam.xfam.org/family/ipt). The IPT Pfam domain genes are phylogenetically scattered and found only in some members of Actinobacteria, Cyanobacteria, α-Proteobacteria, β-Proteobacteria, and γ-Proteobacteria and in the eukaryote Dictyostelium discoideum (Nishii et al., 2018). IPTs from higher plants use predominantly ATP or ADP as a prenyl acceptor and DMAPP as the donor (EC 2.5.1.112) and belong to the Pfam family IPPT (PF01715) (https://pfam.xfam.org/family/ippt). The product of the reaction is iP, which is then hydroxylated at the prenyl side chain to result in tZ-nucleotides.
Tzs of Agrobacterium tumefaciens (IPT (PF01745); EC 2.5.1.27; SEQ ID NO: 1) consists of 2 domains: the N-terminal domain with a five-stranded parallel β-sheet, which is surrounded by 3 α-helices (α1-α3), and the C-terminal domain with 5 α helices (α4-α8). The N-terminal domain contains a nucleotide-binding p loop motif Gly-8-Pro-Thr-Cys-Ser-Gly-Lys-Thr-15, and is structurally related to the p loop-containing nucleoside triphosphate hydrolase (pNTPase) superfamily. The C-terminal side of α8 extends to the N-terminal and is attached to it. The interface between the domains forms a solvent-accessible channel, which binds AMP. The prenylation site of AMP binds to Asp-33 and Ser-45. DMAPP is bound to Asp-173, Tyr-211, and His-214. Thr-10, Asp-33, and Arg-138 are fully conserved among IPTs. In Tzs, the hydrophilic region formed by the side chains contains His-214 and Asp-173, which are two critical amino acid residues within the substrate-binding pocket that distinguish the presence or absence of the hydroxyl group of the prenyl-donor substrate, which allows the use of HMBDP (Sugawara et al. 2008).
Generally, the polypeptide having adenylate isopentenyltransferase activity employed according to the invention will be heterologous to the bacterium, which means that said polypeptide is normally not found in or made (i.e. expressed) by the bacterium, but is derived from a different species. Moreover, the polypeptide having adenylate isopentenyltransferase activity is derived from or corresponds to a member of the Pfam family IPT (PF01745), and preferably is a bacterial polypeptide having adenylate isopentenyltransferase activity. With “bacterial polypeptide having adenylate isopentenyltransferase activity” it is meant that the polypeptide having adenylate isopentenyltransferase activity is derived from a bacterium, such as Agrobacterium tumefaciens.
Polypeptides having adenylate isopentenyltransferase activity most suitable for the biosynthesis of isoprenoid cytokinins including tZ, iP, tZR and iPR, are enzymes capable of using both HMBDP and DMAPP as prenyl donors, such as Tzs or Tmr of Agrobacterium tumefaciens. Moreover, they belong to the Pfam family IPT (PF01745) and contain an Asp residue in a position equivalent to position 173 of SEQ ID NO:1, a Tyr residue in a position equivalent to position 211 of SEQ ID NO:1 and/or a His residue in a position equivalent to position 214 of SEQ ID NO:1.
A polypeptide having adenylate isopentenyltransferase activity for use according to the invention may for instance be a polypeptide having adenylate isopentenyltransferase activity selected from the group consisting of: i) a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 33; and ii) a polypeptide comprising an amino acid sequence, which has at least about 50%, such as at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 1 to 33.
A polypeptide having adenylate isopentenyltransferase activity for use according to the invention may for instance be a polypeptide having adenylate isopentenyltransferase activity selected from the group consisting of: i) a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 33; and ii) a polypeptide comprising an amino acid sequence, which has at least about 70%, such as at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 1 to 33.
A polypeptide having adenylate isopentenyltransferase activity for use according to the invention may for instance be a polypeptide having adenylate isopentenyltransferase activity selected from the group consisting of: i) a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 10; and ii) a polypeptide comprising an amino acid sequence, which has at least about 85%, such as at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 1 to 10.
A polypeptide having adenylate isopentenyltransferase activity for use according to the invention may for instance be a polypeptide having adenylate isopentenyltransferase activity selected from the group consisting of: i) a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 10; and ii) a polypeptide comprising an amino acid sequence, which has at least about 70%, such as at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 1 to 10.
A polypeptide having adenylate isopentenyltransferase activity for use according to the invention may for instance be a polypeptide having adenylate isopentenyltransferase activity selected from the group consisting of: i) a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 10; and ii) a polypeptide comprising an amino acid sequence, which has at least about 85%, such as at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 1 to 10.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 1. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 1. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 1. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 1. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 1. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 1. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 1.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 2. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 2. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 2. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 2. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 2. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 2. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 2.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 3. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 3. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 3. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 3. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 3. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 3. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 3.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 4. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 4. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 4. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 4. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 4. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 4. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 4.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 5. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 5. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 5. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 5. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 5. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 5. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 5.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 6. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 6. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 6. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 6. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 6. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 6. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 6.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 7. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 7. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 7. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 7. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 7. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 7. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 7.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 8. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 8. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 8. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 8. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 8. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 8. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 8.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 9. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 9. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 9. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 9. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 9. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 9. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 9.
According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 50%, such as at least 55%, sequence identity with the amino acid sequence SEQ ID NO: 10. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 60%, such as at least 65%, sequence identity with the amino acid sequence SEQ ID NO: 10. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 70%, such as at least 75%, sequence identity with the amino acid sequence SEQ ID NO: 10. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 80%, such as at least 85%, sequence identity with the amino acid sequence SEQ ID NO: 10. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 90%, such as at least 95%, sequence identity with the amino acid sequence SEQ ID NO: 10. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises an amino acid sequence having at least 95%, such as at least 97%, sequence identity with the amino acid sequence SEQ ID NO: 10. According to some embodiments, the polypeptide having adenylate isopentenyltransferase activity comprises the amino acid sequence of SEQ ID NO: 10.
Techniques for determining adenylate isopentenyltransferase activity are well known to the skilled person. Exemplary methods have been described, e.g. by Takei, Sakakibara, and Sugiyama (2001) and by Frébortová, Greplová et al, (2015). The adenylate isopentenyltransferase activity may for instance be determined in accordance with any of the following assays:
Besides expressing a heterologous polypeptide having adenylate isopentenyltransferase activity, the bacterium of the present invention may optionally be (further) modified to have an increased protein expression of a polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity compared to an otherwise identical bacterium that does not carry said modification.
Thus, according to some embodiments, a bacterium of the present invention expresses a heterologous polypeptide having adenylate isopentenyltransferase activity and has been modified to have an increased protein expression of a polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity compared to an otherwise identical bacterium that does not carry said modification.
By “increased protein expression” it is meant that the amount of the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity produced by the thus modified bacterium is increased compared to an otherwise identical bacterium that does not carry said modification. More particularly, by “increased expression” it is meant that the amount of the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity produced by the thus modified bacterium is increased by at least 10%, such as at least 20%, at least 30%, at least 40%, at least 50% at least 60%, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200%, at least 300%, at least 400%, at least 500%, at least 600%, at least 700% at least 800%, at least about 900%, at least about 1000%, at least about 2000%, at least about 3000%, at least about 4000%, at least about 5000%, at least about 6000%, at least about 7000%, at least about 8000% at least about 9000% or at least about 10000%, compared to an otherwise identical bacterium that does not carry said modification. The amount of protein in a given cell can be determined by any suitable quantification technique known in the art, such as ELISA, Immunohistochemistry, or Western Blotting.
An increase in protein expression may be achieved by any suitable means well-known to those skilled in the art. For example, an increase in protein expression may be achieved by increasing the number of copies of the gene encoding the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity in the bacterium, such as by introducing into the bacterium an exogenous nucleic acid, such as a vector, comprising the gene encoding the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity operably linked to a promoter that is functional in the bacterium to cause the production of an mRNA molecule.
An increase in protein expression may also be achieved by the integration of at least a second copy of the gene encoding the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity into the genome of the bacterium.
An increase in protein expression may also be achieved by increasing the strength of the promoter operably linked to the gene encoding the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity, e.g. by replacing the native promoter with a promoter that enables higher expression and overproduction of polypeptide compared to the native promoter. The promoters that can be used include natural promoters from Bacillus subtilis, Bacillus amyloliquefaciens or similar, such as P43, P15, Pveg, Pylb, PgroES, PsigX, PtrnQ, Ppst, PsodA, PrpsF, PlepA, PliaG, PrpsF, Ppst, PfusA, PsodA, Phag as well as artificial promoters active in Bacillus subtilis or inducible Bacillus subtilis promoters, such as PmtlA, Pspac, PxylA, PsacB, or similar. Further examples include natural promoters from Corynebacterium, such as P CP_2454, Ptuf and Psod, natural promoters from E. coli, such as T7, ParaBAD, Plac, Ptac and Ptrc, and the promoter P F1 derived from the corynephage BFK20.
An increase in protein expression may also be achieved by modifying the ribosome binding site on the mRNA molecule encoding the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity. By modifying the sequence of the ribosome binding site, the translation initiation rate may be increased, thus increasing translation efficiency.
According to some embodiments, the increase in the number of copies of the gene is achieved by introducing into the bacterium one or more (such as two or three) exogenous nucleic acid molecules (such as one or more vectors) comprising the gene operably linked to a promoter that is functional in the host cell to cause the production of an mRNA molecule.
According to some embodiments, a bacterium of the invention comprises an exogenous nucleic acid molecule (such as a vector) comprising one or more (such as two, three or four) nucleotide sequences encoding the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity. Suitably, the exogenous nucleic acid molecule further comprises a promoter that is functional in the bacterium to cause the production of an mRNA molecule and that is operably linked to the nucleotide sequence encoding said polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity. According to some embodiments, the exogenous nucleic acid molecule is stably integrated into the genome of the bacterium.
A polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity is a cytokinin-activating enzyme with the enzymatic function as cytokinin riboside 5′-monophosphate phosphoribohydrolase. Such polypeptides, also referred to as Lonely Guy (LOG) proteins, are encoded in the genomes of a wide range of organisms, and a majority of LOG proteins are prokaryotic. Enzymes from several organisms, such as Oryza sativa, Arabidopsis thaliana, Claviceps purpurea, Mycobacterium tuberculosis and Corynebacterium glutamicum have been characterized as LOGs by biochemical and functional studies. LOG protein was originally characterized as a phytohormone-activating plant enzyme, while bacterial LOG homologs were mistakenly designated as putative lysine decarboxylases (LDCs) without experimental evidence. Their true enzymatic activity was recently confirmed by functional analysis of the Corynebacterium glutamicum homolog (Seo et al. 2016). Two types of LOG proteins were identified in bacteria, dimeric type I LOGs and hexameric type II LOGs. Type II LOG proteins have different oligomeric state and residues at the prenyl-binding site. Type I LOGs can be further divided into 2 subgroups, type Ia and type Ib. Type Ia includes dimeric LOGs from most organisms, whereas type Ib includes dimeric LOGs from Actinomycetales. Type II LOG are divided into type IIa, which include hexameric LOGs from most organisms, and type IIb with LOGs from higher plants (Seo and Kim 2017).
LOG proteins produce active cytokinins via dephosphoribosylation, hydrolysis of the bond between N6-substituted bases and ribose 5′-monophosphates in cytokinin precursors such as iPRMP or trans-zeatin riboside 5′-monophosphate (tZRMP). C. glutamicum LOG is composed of two identical monomers with a central β-sheet, formed by seven parallel β-strands, which is surrounded by eight α-helices. LOG proteins contain “PGGXGTXXE” motif that contributes to the formation of an active site. The active site is formed in the pocket of the dimer, and the conserved “PGGXGTXXE” motif is on the surface of the pocket. The “PGGXGTXXE” motif is a nucleotide-binding site, and the conserved residues stabilize bound AMP (Seo et al. 2016). The motif is highly conserved in all LOG enzymes (Seo and Kim 2017).
The polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity may be derived from the same species as the bacterium in which it is expressed or may be derived from a species different to the one in which it is expressed (i.e. it is heterologous). According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity is derived from the same species as the bacterium in which it is expressed. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity is derived from a species different from the one in which it is expressed (i.e. it is heterologous).
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity is a bacterial polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity. With “bacterial polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity” it is meant that the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity is naturally derived from a bacterium, such as Corynebacterium glutamicum.
A polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity for use according to the invention may for instance be a polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity selected from the group consisting of: i) a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 34 to 62; and ii) a polypeptide comprising an amino acid sequence, which has at least about 70%, such as at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 34 to 62.
A polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity for use according to the invention may for instance be a polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity selected from the group consisting of: i) a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 34 to 44; and ii) a polypeptide comprising an amino acid sequence, which has at least about 70%, such as at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 34 to 44.
According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 34 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 34. According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 34 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 34. According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 34 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 34. According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 34.
According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 35 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 35.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 35 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 35. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 35 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 35. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 35.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 36 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 36. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 36 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 36. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 36 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 36. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 36.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 37 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 37. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 37 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 37. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 37 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 37. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 37.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 38 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 38. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 38 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 38. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 38 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 38. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 38.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 39 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 39. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 39 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 39. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 39 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 39. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 39.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 40 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 40. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 40 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 40. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 40 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 40.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 40.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 41 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 41. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 41 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 41. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 41 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 41. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 41.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 42 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 42. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 42 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 42. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 42 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 42. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 42.
According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 43 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 43. According to some embodiments, the polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 43 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 43.
According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 43 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 43. According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 43.
According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 44 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 44. According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 44 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 44. According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 44 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 44. According to some embodiments, the polypeptide having cytokinin riboside 5′-monophosphate phosphoribohydrolase activity comprises the amino acid sequence of SEQ ID NO: 44.
Techniques for determining cytokinin riboside 5′-monophosphate phosphoribohydrolase activity are well known to the skilled person. Exemplary methods have been described, e.g. by Seo et al. (2016). The cytokinin riboside 5′-monophosphate phosphoribohydrolase activity may for instance be determined in accordance with the following method:
The phosphoribohydrolase activity is determined by detecting adenine ring compounds separated by thin-layer chromatography (TLC) method. The enzyme reaction is carried out in a mixture of 20 mM AMP, 36 mM Tris-HCl, pH 8.0, and 23 μM purified enzyme at 30° C., and then the reaction is stopped by heating the mixture at 95° C. for 1.5 min. The reaction mixture is then dotted on a PEI-cellulose-F plastic TLC sheet (Merck Millipore). The mobile phase is 1 M sodium chloride. After development in the TLC chamber, the sheet is dried completely. Adenine ring-including compounds are detected by UV lamp (290 nm) (Seo et al. 2016).
In order to increase the supply of the isopentenyl side chain precursor of isoprenoid cytokinins, metabolic flux through the methylerythritol 4-phosphate (MEP) pathway may be increased. This is achieved primarily through overexpression of 1-Deoxy-D-xylulose 5-phosphate (DXP) synthase (DXS).
Hence, according to some embodiments, the bacterium of the present invention is characterized in that it has been (further) modified to have an increased protein expression of a polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity compared to an otherwise identical bacterium that does not carry said modification.
By “increased protein expression” it is meant that the amount of the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity produced by the thus modified bacterium is increased compared to an otherwise identical bacterium that does not carry said modification. More particularly, by “increased expression” it is meant that the amount of the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity produced by the thus modified bacterium is increased by at least 10%, such as at least 20%, at least 30%, at least 40%, at least 50% at least 60%, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200%, at least 300%, at least 400%, at least 500%, at least 600%, at least 700% at least 800%, at least about 900%, at least about 1000%, at least about 2000%, at least about 3000%, at least about 4000%, at least about 5000%, at least about 6000%, at least about 7000%, at least about 8000% at least about 9000% or at least about 10000%, compared an otherwise identical bacterium that does not carry said modification. The amount of protein in a given cell can be determined by any suitable quantification technique known in the art, such as ELISA, Immunohistochemistry, or Western Blotting.
An increase in protein expression may be achieved by any suitable means well-known to those skilled in the art. For example, an increase in protein expression may be achieved by increasing the number of copies of the gene or genes encoding the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity in the bacterium, such as by introducing into the bacterium an exogenous nucleic acid, such as a vector, comprising the gene or genes encoding the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity operably linked to a promoter that is functional in the bacterium to cause the production of an mRNA molecule.
An increase in protein expression may also be achieved by the integration of at least a second copy of the gene encoding the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity into the genome of the bacterium.
An increase in protein expression may also be achieved by increasing the strength of the promoter operably linked to the gene encoding the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity, e.g. by replacing the native promoter with a promoter that enables higher expression and overproduction of polypeptide compared to the native promoter. The promoters that can be used include natural promoters from Bacillus subtilis, Bacillus amyloliquefaciens or similar, such as P43, P15, Pveg, Pylb, PgroES, PsigX, PtrnQ, Ppst, PsodA, PrpsF, PlepA, PliaG, PrpsF, Ppst, PfusA, PsodA, Phag as well as artificial promoters active in Bacillus subtilis or inducible Bacillus subtilis promoters, such as PmtlA, Pspac, PxylA, PsacB, or similar. Further examples include natural promoters from Corynebacterium, such as P CP_2454, Ptuf and Psod, natural promoters from E. coli, such as T7, ParaBAD, Plac, Ptac and Ptrc, and the promoter P F1 derived from the corynephage BFK20.
An increase in protein expression may also be achieved by modifying the ribosome binding site on the mRNA molecule encoding the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity. By modifying the sequence of the ribosome binding site, the translation initiation rate may be increased, thus increasing translation efficiency.
According to some embodiments, the increase in the number of copies of the gene is achieved by introducing into the bacterium one or more (such as two or three) exogenous nucleic acid molecules (such as one or more vectors) comprising the gene operably linked to a promoter that is functional in the bacterium to cause the production of an mRNA molecule.
According to some embodiments, a bacterium of the invention comprises an exogenous nucleic acid molecule (such as a vector) comprising one or more (such as two, three, or four) nucleotide sequences encoding the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity. Suitably, the exogenous nucleic acid molecule further comprises a promoter that is functional in the bacterium to cause the production of an mRNA molecule and that is operably linked to the nucleotide sequence encoding said polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity. According to some embodiments, the exogenous nucleic acid molecule is stably integrated into the genome of the bacterium.
A polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity is an enzyme that catalyzes the condensation between D-glyceraldehyde 3-phosphate and pyruvate to produce 1-deoxy-D-xylulose 5-phosphate (DXP) (DXS; E.C. 2.2.1.7). DXS catalyzes the first enzymatic step of the isoprenoid biosynthesis of HMBDP and DMAPP in the methylerythritol phosphate (MEP) pathway, whose kcat/Km value is substantially lower than for other enzymes in this pathway (Kuzuyama et al. 2000). DXS is present in a single copy in eubacteria, whereas green algae and higher plants have two or more genes encoding DXS, which form three distinguishable groups. In higher plants, the expression of the distinct DXS isoenzymes depends on the type of tissue and developmental stage.
DXS is highly conserved in bacteria and plants. Its protein sequence has about 20% identity with transketolase and pyruvate dehydrogenase E1 subunit. All three enzymes catalyze similar reactions and require coenzyme thiamine pyrophosphate (TPP). DXS from E. coli contains three domains (1, 11, and 111), which are homologous to the equivalent domains in transketolase and the E1 subunit of pyruvate dehydrogenase. Two DXS monomers are arranged side-by-side in a dimer. Domain I (residues 1-319) is above domains II (residues 320-495) and III (residues 496-629) of the same monomer. All three domains have the α/β fold with a central, mostly parallel β-sheet between a helices. Domain I contains a five-stranded parallel β-sheet, domain II contains a six-stranded parallel β-sheet, and domain II contains a five-stranded β-sheet with the first strand anti-parallel to the other four strands. Domain I has several extended surface segments, at the N-terminus (residues 1-49), after the first strand (residues 81-122), and in the connection between the fourth and fifth strands (residues 184-250).
The active site of DXS is located at the interface of domains I and II in the same monomer. The C-terminal ends of the central parallel β-sheets of the two domains are pointed towards each other, and the TPP coenzyme is located at the bottom of a pocket in this interface. The amino-pyrimidine ring of TPP interacts with domain II, while the pyrophosphate group interacts with domain I. The C2 atom of the thiazolium ring is exposed to the substrate-binding site. The C2 atom participates in the reaction. The pyrophosphate group of TPP has numerous polar interactions with the enzyme. The active site is composed of a magnesium ion bound between the two phosphate groups, and the side chains of Asp154, Asn183 and Met185. The Gly153-Asp-Gly155-Asn183 sequence in DXS is consistent with the TPP binding motif «GDG-X(25-30)-N». The C2 constitutes pyruvate-binding site. GAP is located in the pocket (Xiang et al., 2007).
Increasing DXS activity is recognized as the most effective strategy for increased terpenoid biosynthesis in many species, also in B. subtilis (Yang et al. 2019), as several studies indicate that the formation of DXP is the limiting step of the MEP pathway. A single amino acid mutation in Dxs of E. coli and Deinococcus radiodurans increased their catalytic activities. The mutation Y392F of E. coli Dxs increased the relative catalytic activity by more than 2.5-fold compared to the wild type (Xiang et al. 2012). DXS is regulated by a negative feedback mechanism by the end products of the MEP pathway, IPP, and DMAPP (Banerjee et al. 2013). DXS has been a subject of site-directed mutagenesis for alleviating the negative feedback inhibition of IPP and DMAPP. Mutation at A147G/A352G of Populus trichocarpa DXS reduced its IPP binding affinity slightly, but it increased Km for TPP and pyruvate, and it decreased the catalytic efficiency of the enzyme about 15-fold (Banerjee et al., 2016). Overexpression of several other enzymes of the pathway besides DXS has also been tested with mixed results in different bacteria. Also, the overexpression of the entire MEP pathway as artificial operons (operon dxs-ispD-ispF-ispH and ispC/dxr-ispE-ispG-ispA) has been tested in B. subtilis (Xue et al. 2015).
The polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity may be derived from the same species as the bacterium in which it is expressed or may be derived from a species different from the one in which it is expressed (i.e. it is heterologous). According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity is derived from the same species as the bacterium in which it is expressed. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity is derived from a species different from the one in which it is expressed (i.e. it is heterologous).
Preferably, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity is a bacterial polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity. With “bacterial polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity” it is meant that the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity is naturally derived from a bacterium, such as Bacillus subtilis.
A polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity for use according to the invention may for instance be a polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity selected from the group consisting of: i) a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 63 to 70; and ii) a polypeptide comprising an amino acid sequence, which has at least about 70%, such as at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 63 to 70.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 63 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 63. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 63 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 63. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 63 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 63. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 63.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 64 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 64. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 64 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 64. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 64 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 64. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 64.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 65 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 65. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 65 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 65. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 65 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 65. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 65.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 66 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 66. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 66 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 66. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 66 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 66. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 66.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 67 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 67. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 67 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 67. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 67 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 67. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 67.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 68 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 68. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 68 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 68. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 68 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 68. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 68.
As noted above, modification of the wild type DXS sequence can increase the catalytic activities. The present invention thus particularly contemplates the use of mutant DXS having increased activity or an inactivated negative feedback mechanism compared to the wild type DXS enzyme from which it is derived. Non-limiting examples of such mutant DXS enzymes are those set out in SEQ ID NOs. 69 and 70.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 69 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 69, with the proviso that the amino acid at position 392 is not Y, preferably with the proviso that the amino acid at position 392 is F. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 69 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 69, with the proviso that the amino acid at position 392 is not Y, preferably with the proviso that the amino acid at position 392 is F. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 69 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 69, with the proviso that the amino acid at position 392 is not Y, preferably with the proviso that the amino acid at position 392 is F. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 69.
According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 70 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 70, with the proviso that the amino acid at position 389 is not Y, preferably with the proviso that the amino acid at position 389 is F. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 70 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 70, with the proviso that the amino acid at position 389 is not Y, preferably with the proviso that the amino acid at position 389 is F. According to some embodiments, the polypeptide 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 70 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 70, with the proviso that the amino acid at position 389 is not Y, preferably with the proviso that the amino acid at position 389 is F. According to some embodiments, the polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity comprises the amino acid sequence of SEQ ID NO: 70.
Techniques for determining 1-deoxy-D-xylulose-5-phosphate synthase activity are well known to the skilled person. Exemplary methods have been described, e.g. by Kuzuyama et al (2000) and by Kudoh et al (2017). The 1-deoxy-D-xylulose-5-phosphate synthase activity may for instance be determined in accordance with the following methods:
Purine nucleotides are structural components of DNA and RNA, energy carriers (i.e. ATP and GTP), enzyme cofactors (i.e. NAD+ and NADP+), and as such are essential metabolites for cellular physiology. The synthesis of purine nucleotides starts with the synthesis of 5′-phosphoribosylpyrophosphate (PRPP) from D-ribose 5-phosphate and ATP. The enzyme PRPP synthase (Ribose-phosphate diphosphokinase; EC 2.7.6.1) catalyzes the transfer of the diphosphoryl group of ATP to the D-ribose 5-phosphate, with the simultaneous formation of AMP. PRPP synthase is ubiquitous among free-living organisms. Most bacteria have one gene encoding PRPP synthase, whereas more than one gene is present in eukaryotes. The next step is the synthesis of 5-phospho-β-D-ribosylamine from PRPP and glutamine catalyzed by glutamine 5-phosphoribosyl-1-pyrophosphate (PRPP) amidotransferase (amidophosphoribosyltransferase; EC 2.4.2.14), which is encoded by purF in B. subtilis. This is the rate-limiting reaction of purine de novo synthesis. The further 10 steps leading from 5-phospho-β-D-ribosylamine to the synthesis of inosine-5-phosphate (IMP) are catalyzed by the enzymes encoded in the pur operon. The pur operon is negatively regulated by PurR repressor at transcriptional level encoded by purR, and by PurBoxes, specific DNA sequences in the upstream control regions of affected genes. IMP is the branching point for the synthesis of AMP and GMP. AMP is synthesized from IMP in two enzymatic steps, catalyzed by PurA and PurB, whereas GMP is synthesized from IMP by GuaB and GuaA. The expression of genes or operons encoding enzymes of the purine synthesis is regulated by purine bases and nucleosides in the growth medium. The enzymes PRPP synthetase, PRPP amidotransferase, adenylosuccinate synthetase and IMP dehydrogenase are regulated by the feedback inhibition of the end products of the pathway. PurR repressor inhibits the initiation of transcription. The salvage pathways also participate to generate the corresponding mononucleotides AMP and GMP by utilization of hypoxanthine, guanine, and adenine.
The purine nucleotide biosynthesis pathway is well-studied because of the role of purine nucleotides in the primary metabolism. It includes both the de novo synthesis pathway and the salvage pathway. Deregulation of the purine nucleotide biosynthesis pathway at transcription and metabolic levels enhanced the metabolic flow through the purine nucleotide biosynthesis pathway and consequently increased the yield of products directly stemming from the purine nucleotide biosynthesis pathway: inosine, guanosine, adenosine, and riboflavin. Various modifications including gene overexpression, gene deletions, and enzyme deregulation by mutations had been used successfully for increasing the yield of purine nucleotide biosynthesis pathway products.
Exemplary modifications in B. subtilis enzymes with a positive effect on the increase in purine flux are listed in Table 1 below.
Hence, according to some embodiments, the bacterium of the present invention is characterized in that it has been (further) modified to have an increased expression and/or activity of at least one enzyme involved in the purine nucleotide biosynthesis pathway compared to an otherwise identical bacterium that does not carry said modification.
According to some embodiments, the bacterium of the present invention is characterized in that it has been (further) modified to have an increased expression and/or activity of at least one enzyme involved in the adenosine monophosphate biosynthesis pathway compared to an otherwise identical bacterium that does not carry said modification.
The at least one enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the at least enzyme involved in the adenosine monophosphate biosynthesis pathway) may be derived from the same species as the bacterium in which it is expressed or may be derived from a species different to the one in which it is expressed (i.e. it is heterologous). According to some embodiments, at least one enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the at least enzyme involved in the adenosine monophosphate biosynthesis pathway) is derived from the same species as the bacterium in which it is expressed. According to some embodiments, the at least one enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the at least enzyme involved in the adenosine monophosphate biosynthesis pathway) is derived from a species different from the one in which it is expressed (i.e. it is heterologous).
By “increased protein expression” it is meant that the amount of the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the enzyme involved in the adenosine monophosphate biosynthesis pathway) produced by the thus modified bacterium is increased compared to an otherwise identical bacterium that does not carry said modification. More particularly, by “increased expression” it is meant that the amount of the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the enzyme involved in the adenosine monophosphate biosynthesis pathway) produced by the thus modified bacterium is increased by at least 10%, such as at least 20%, at least 30%, at least 40%, at least 50% at least 60%, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200%, at least 300%, at least 400%, at least 500%, at least 600%, at least 700% at least 800%, at least about 900%, at least about 1000%, at least about 2000%, at least about 3000%, at least about 4000%, at least about 5000%, at least about 6000%, at least about 7000%, at least about 8000% at least about 9000% or at least about 10000%, compared to an otherwise identical bacterium that does not carry said modification. The amount of protein in a given cell can be determined by any suitable quantification technique known in the art, such as ELISA, Immunohistochemistry, or Western Blotting.
An increase in protein expression may be achieved by any suitable means well-known to those skilled in the art. For example, an increase in protein expression may be achieved by increasing the number of copies of the gene encoding the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the enzyme involved in the adenosine monophosphate biosynthesis pathway) in the bacterium, such as by introducing into the bacterium an exogenous nucleic acid, such as a vector, comprising the gene encoding the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the enzyme involved in the adenosine biosynthesis pathway) operably linked to a promoter that is functional in the bacterium to cause the production of an mRNA molecule.
An increase in protein expression may also be achieved by the integration of at least a second copy of the gene encoding the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the enzyme involved in the adenosine monophosphate biosynthesis pathway) into the genome of the bacterium.
An increase in protein expression may also be achieved by increasing the strength of the promoter operably linked to the gene encoding the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the enzyme involved in the adenosine monophosphate biosynthesis pathway), e.g. by replacing the native promoter with a promoter that enables higher expression and overproduction of the enzyme compared to the native promoter. The promoters that can be used include natural promoters from Bacillus subtilis, Bacillus amyloliquefaciens or similar, such as P43, P15, Pveg, Pylb, PgroES, PsigX, PtrnQ, Ppst, PsodA, PrpsF, PlepA, PliaG, PrpsF, Ppst, PfusA, PsodA, Phag as well as artificial promoters active in Bacillus subtilis or inducible Bacillus subtilis promoters, such as PmtlA, Pspac, PxylA, PsacB, or similar. Further examples include natural promoters from Corynebacterium, such as P CP_2454, Ptuf and Psod, natural promoters from E. coli, such as T7, ParaBAD, Plac, Ptac and Ptrc, and the promoter P F1 derived from the corynephage BFK20.
An increase in protein expression may also be achieved by modifying the ribosome binding site on the mRNA molecule encoding the enzyme involved in the purine nucleotide biosynthesis pathway. By modifying the sequence of the ribosome binding site, the translation initiation rate may be increased, thus increasing translation efficiency.
According to some embodiments, the increase in the number of copies of the gene is achieved by introducing into the bacterium one or more (such as two or three) exogenous nucleic acid molecules (such as one or more vectors) comprising the gene operably linked to a promoter that is functional in the host cell to cause the production of an mRNA molecule.
According to some embodiments, a bacterium of the invention comprises an exogenous nucleic acid molecule (such as a vector) comprising one or more (such as two, three, or four) nucleotide sequences encoding the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., the enzyme involved in the adenosine monophosphate biosynthesis pathway). Suitably, the exogenous nucleic acid molecule further comprises a promoter that is functional in the bacterium to cause the production of an mRNA molecule and that is operably linked to the nucleotide sequence encoding said the enzyme involved in the purine nucleotide biosynthesis pathway (e.g., said enzyme involved in the adenosine biosynthesis monophosphate pathway). According to some embodiments, the exogenous nucleic acid molecule is stably integrated into the genome of the bacterium.
The at least one enzyme involved in the purine nucleotide biosynthesis pathway may be an enzyme selected from the group consisting of: an enzyme having ribose-phosphate diphosphokinase activity, an enzyme having amidophosphoribosyltransferase activity, an enzyme having formyltetrahydrofolate deformylase activity, an enzyme having phosphoribosylamine-glycine ligase activity, an enzyme having phosphoribosylglycineamide formyltransferase activity, an enzyme having phosphoribosylformylglycinamidine synthase activity, an enzyme having phosphoribosylformylglycineamidine cyclo-ligase activity, an enzyme having N5-carboxyaminoimidazole ribonucleotide synthetase activity, an enzyme having N5-carboxyaminoimidazole ribonucleotide mutase activity, an enzyme having phosphoribosylaminoimidazolesuccinocarboxamide synthase activity, an enzyme having adenylosuccinate lyase activity, an enzyme having phosphoribosylaminoimidazole-carboxamide formyltransferase activity, an enzyme having IMP cyclohydolase activity, an enzyme having adenylosuccinate synthase activity, an enzyme having adenylate kinase activity, an enzyme having ATP synthase activity, an enzyme having adenosine kinase activity, an enzyme having IMP dehydrogenase activity and an enzyme having GMP synthase activity.
The at least one enzyme involved in the adenosine monophosphate biosynthesis pathway may be an enzyme selected from the group consisting of: an enzyme having ribose-phosphate diphosphokinase activity, an enzyme having amidophosphoribosyltransferase activity, an enzyme having formyltetrahydrofolate deformylase activity, an enzyme having phosphoribosylamine-glycine ligase activity, an enzyme having phosphoribosylglycineamide formyltransferase activity, an enzyme having phosphoribosylformylglycinamidine synthase activity, an enzyme having phosphoribosylformylglycineamidine cyclo-ligase activity, an enzyme having N5-carboxyaminoimidazole ribonucleotide synthetase activity, an enzyme having N5-carboxyaminoimidazole ribonucleotide mutase activity, an enzyme having phosphoribosylaminoimidazolesuccinocarboxamide synthase activity, an enzyme having adenylosuccinate lyase activity, an enzyme having phosphoribosylaminoimidazole-carboxamide formyltransferase activity, an enzyme having IMP cyclohydolase activity, an enzyme having adenylosuccinate synthase activity, an enzyme having adenylate kinase activity, an enzyme having ATP synthase activity and an enzyme having adenosine kinase activity.
According to some embodiments, the at least one enzyme involved in the purine nucleotide biosynthesis pathway (such as in the adenosine monophosphate biosynthesis pathway) is selected from the group consisting of: an enzyme having ribose-phosphate diphosphokinase activity, an enzyme having amidophosphoribosyltransferase activity, an enzyme having formyltetrahydrofolate deformylase activity, an enzyme having adenylosuccinate lyase activity, an enzyme having phosphoribosylaminoimidazole-carboxamide formyltransferase activity, an enzyme having adenylosuccinate synthase activity and an enzyme having adenosine kinase activity.
According to some embodiments, the bacterium of the present invention has been (further) modified to have any one of the modifications as disclosed in Table 1 in relation to one or more of its endogenous enzymes involved in the purine nucleotide biosynthesis pathway. Particularly, the bacterium may be the result of random mutagenesis making it resistant to an inhibitor of the enzyme in question.
According to some embodiments, the bacterium of the present invention is characterized in that it has been (further) modified to have a decreased expression and/or activity of at least one endogenous enzyme involved in the purine nucleotide degradation pathway compared to an otherwise identical bacterium that does not carry said modification.
According to some embodiments, the bacterium of the invention may be modified to have a decreased expression of at least one endogenous enzyme involved in the purine nucleotide degradation pathway compared to an otherwise identical bacterium that does not carry said modification.
According to some embodiments, the bacterium of the invention may be modified to have a decreased expression level of the endogenous gene encoding said at least one endogenous enzyme involved in the purine nucleotide degradation pathway compared to an otherwise identical bacterium that does not carry said modification. The expression level of the endogenous gene may, for example, be decreased by at least 50%, such as by at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% or at least 100% compared to the otherwise identical bacterium.
According to some embodiments, the endogenous gene encoding said enzyme has been inactivated, such as by deletion of part of or the entire gene sequence.
According to some embodiments, the endogenous gene encoding said enzyme has been inactivated by introducing or expressing in the microorganism a rare-cutting endonuclease able to selectively inactivate by DNA cleavage, preferably by a double-strand break, the endogenous gene encoding said enzyme. A rare-cutting endonuclease to be used in accordance with the present invention to inactivate the endogenous gene may, for instance, be a transcription activator-like effector (TALE) nuclease, meganuclease, zinc-finger nuclease (ZFN), or RNA-guided endonuclease.
One way to inactivate the endogenous gene encoding said enzyme is to use the CRISPRi system. The CRISPRi system was developed as a tool for targeted repression of gene expression or for blocking targeted locations on the genome. The CRISPRi system consists of the catalytically inactive, “dead” Cas9 protein (dCas9) and a guide RNA that defines the binding site for the dCas9 to DNA.
Thus, according to some embodiments, the endogenous gene encoding said enzyme is inactivated by introducing or expressing in the bacterium an RNA-guided endonuclease, such as a catalytically inactive Cas9 protein, and a single guide RNA (sgRNA) specifically hybridizing (e.g. binding) under cellular conditions with the genomic DNA encoding a said enzyme.
According to some embodiments, the expression of said at least one endogenous enzyme involved in the purine nucleotide degradation pathway is decreased by way of inhibition.
Inhibition of the expression of the said endogenous enzyme may be achieved by any suitable means known in the art. For example, the expression may be inhibited by gene silencing techniques involving the use of inhibitory nucleic acid molecules, such as antisense oligonucleotides, ribozymes, or interfering RNA (RNAi) molecules, such as microRNA (miRNA), small interfering RNA (siRNA), or short hairpin RNA (shRNA).
According to some embodiments, the expression of said at least one endogenous enzyme involved in the purine nucleotide degradation pathway is decreased (e.g., inhibited) by transcriptional and/or translational repression of the endogenous gene encoding said polypeptide.
According to some embodiments, the expression of said at least one endogenous enzyme involved in the purine nucleotide degradation pathway is inhibited by introducing or expressing in the bacterium an inhibitory nucleic acid molecule. For example, the inhibitory nucleic acid molecule may be introduced by way of an exogenous nucleic acid molecule comprising a nucleotide sequence encoding said inhibitory nucleic acid molecule operably linked to a promoter, such as an inducible promoter, that is functional in the bacterium to cause the production of said inhibitory nucleic acid molecule. Suitably, the inhibitory nucleic acid molecule is one that specifically hybridizes (e.g. binds) under cellular conditions with cellular mRNA and/or genomic DNA encoding the endogenous enzyme. Depending on the target, transcription of the encoding genomic DNA and/or translation of the encoding mRNA is/are inhibited.
According to some embodiments, the inhibitory nucleic acid molecule is an antisense oligonucleotide, ribozyme, or interfering RNA (RNAi) molecule. Preferably, such nucleic acid molecule comprises at least 10 consecutive nucleotides of the complement of the cellular mRNA and/or genomic DNA encoding the polypeptide or enzyme of interest (e.g., the cellular mRNA and/or genomic DNA encoding the polypeptide.
According to some embodiments, the inhibitory nucleic acid is an antisense oligonucleotide. Such antisense oligonucleotide is a nucleic acid molecule (either DNA or RNA), which specifically hybridizes (e.g. binds) under cellular conditions with the cellular mRNA and/or genomic DNA encoding the polypeptide.
According to some embodiments, the inhibitory nucleic acid molecule is a ribozyme, such as a hammerhead ribozyme. A ribozyme molecule is designed to catalytically cleave the mRNA transcript to prevent translation of the polypeptide.
According to some embodiments, the inhibitory nucleic acid molecule is an interfering RNA (RNAi) molecule. RNA interference is a biological process in which RNA molecules inhibit expression, typically destroying specific mRNA. Exemplary types of RNAi molecules include microRNA (miRNA), small interfering RNA (siRNA), and short hairpin RNA (shRNA). According to some embodiments, the RNAi molecule is a miRNA. According to some embodiments, the RNAi molecule is a siRNA. According to some embodiments, the RNAi molecule is an shRNA.
According to some embodiments, the bacterium of the invention has been modified to have a decreased activity of at least one endogenous enzyme involved in the purine nucleotide degradation pathway compared to an otherwise identical bacterium that does not carry said modification.
A decrease of the activity of the endogenous enzyme involved in the purine nucleotide degradation pathway may be achieved by any suitable means known in the art. For example, the activity may be decreased by introducing one or more mutations in the active site of the enzyme resulting in the reduction or loss of activity. Thus, according to some embodiments, the activity of the endogenous enzyme involved in the purine nucleotide degradation pathway is decreased by at least one active-site mutation resulting in the reduction or loss of activity. At least one active-site mutation may, for example, be at least one non-conservative amino acid substitution.
According to some embodiments, the at least one enzyme involved in the purine nucleotide degradation pathway is selected from the group consisting of: a purine nucleoside phosphorylase and adenosine-phosphoribosyltransferase. According to some embodiments, the at least one endogenous enzyme involved in the purine nucleotide degradation pathway is a purine nucleoside phosphorylase. According to some embodiments, the at least one endogenous enzyme involved in the purine nucleotide degradation pathway is adenosine-phosphoribosyltransferase.
According to some embodiments, the bacterium of the invention has been (further) modified to have a decreased expression and/or activity of at least one endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway compared to an otherwise identical bacterium that does not carry said modification.
According to some embodiments, the bacterium of the invention may be modified to have a decreased expression level of the endogenous gene encoding said at least one endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway compared to an otherwise identical bacterium that does not carry said modification. The expression level of the endogenous gene may, for example, be decreased by at least 50%, such as by at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% or at least 100% compared to the otherwise identical bacterium.
According to some embodiments, the endogenous gene encoding said enzyme has been inactivated, such as by deletion of part of or the entire gene sequence.
According to some embodiments, the endogenous gene encoding said enzyme has been inactivated by introducing or expressing in the microorganism a rare-cutting endonuclease able to selectively inactivate by DNA cleavage, preferably by a double-strand break, the endogenous gene encoding said enzyme. A rare-cutting endonuclease to be used in accordance with the present invention to inactivate the endogenous gene may, for instance, be a transcription activator-like effector (TALE) nuclease, meganuclease, zinc-finger nuclease (ZFN), or RNA-guided endonuclease.
One way to inactivate the endogenous gene encoding said enzyme is to use the CRISPRi system. The CRISPRi system was developed as a tool for targeted repression of gene expression or for blocking targeted locations on the genome. The CRISPRi system consists of the catalytically inactive, “dead” Cas9 protein (dCas9) and a guide RNA that defines the binding site for the dCas9 to DNA.
Thus, according to some embodiments, the endogenous gene encoding said enzyme is inactivated by introducing or expressing in the bacterium an RNA-guided endonuclease, such as a catalytically inactive Cas9 protein, and a single guide RNA (sgRNA) specifically hybridizing (e.g. binding) under cellular conditions with the genomic DNA encoding a said enzyme.
According to some embodiments, the expression of said at least one endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway is decreased by way of inhibition.
Inhibition of the expression of the said endogenous enzyme may be achieved by any suitable means known in the art. For example, the expression may be inhibited by gene silencing techniques involving the use of inhibitory nucleic acid molecules, such as antisense oligonucleotides, ribozymes, or interfering RNA (RNAi) molecules, such as microRNA (miRNA), small interfering RNA (siRNA), or short hairpin RNA (shRNA).
According to some embodiments, the expression of said at least one endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway is decreased (e.g., inhibited) by transcriptional and/or translational repression of the endogenous gene encoding said polypeptide.
According to some embodiments, the expression of said at least one endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway is inhibited by introducing or expressing in the bacterium an inhibitory nucleic acid molecule. For example, the inhibitory nucleic acid molecule may be introduced by way of an exogenous nucleic acid molecule comprising a nucleotide sequence encoding said inhibitory nucleic acid molecule operably linked to a promoter, such as an inducible promoter, that is functional in the bacterium to cause the production of said inhibitory nucleic acid molecule. Suitably, the inhibitory nucleic acid molecule is one that specifically hybridizes (e.g. binds) under cellular conditions with cellular mRNA and/or genomic DNA encoding the endogenous enzyme. Depending on the target, transcription of the encoding genomic DNA and/or translation of the encoding mRNA is/are inhibited.
According to some embodiments, the inhibitory nucleic acid molecule is an antisense oligonucleotide, ribozyme, or interfering RNA (RNAi) molecule. Preferably, such nucleic acid molecule comprises at least 10 consecutive nucleotides of the complement of the cellular mRNA and/or genomic DNA encoding the polypeptide or enzyme of interest (e.g., the cellular mRNA and/or genomic DNA encoding the polypeptide.
According to some embodiments, the inhibitory nucleic acid is an antisense oligonucleotide. Such antisense oligonucleotide is a nucleic acid molecule (either DNA or RNA), which specifically hybridizes (e.g. binds) under cellular conditions with the cellular mRNA and/or genomic DNA encoding the polypeptide.
According to some embodiments, the inhibitory nucleic acid molecule is a ribozyme, such as a hammerhead ribozyme. A ribozyme molecule is designed to catalytically cleave the mRNA transcript to prevent translation of the polypeptide.
According to some embodiments, the inhibitory nucleic acid molecule is an interfering RNA (RNAi) molecule. RNA interference is a biological process in which RNA molecules inhibit expression, typically destroying specific mRNA. Exemplary types of RNAi molecules include microRNA (miRNA), small interfering RNA (siRNA), and short hairpin RNA (shRNA). According to some embodiments, the RNAi molecule is a miRNA. According to some embodiments, the RNAi molecule is a siRNA. According to some embodiments, the RNAi molecule is an shRNA.
According to some embodiments, the bacterium of the invention has been modified to have a decreased activity of at least one endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway compared to an otherwise identical bacterium that does not carry said modification.
A decrease of the activity of the endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway may be achieved by any suitable means known in the art. For example, the activity may be decreased by introducing one or more mutations in the active site of the enzyme resulting in the reduction or loss of activity. Thus, according to some embodiments, the activity of the endogenous enzyme involved in the guanosine monophosphate biosynthesis pathway is decreased by at least one active-site mutation resulting in the reduction or loss of activity. At least one active-site mutation may, for example, be at least one non-conservative amino acid substitution.
According to some embodiments, the at least one enzyme involved in the guanosine monophosphate biosynthesis pathway is selected from the group consisting of: IMP dehydrogenase and GMP synthetase.
According to some embodiments, the at least one enzyme involved in the guanosine monophosphate biosynthesis pathway is IMP dehydrogenase.
According to some embodiments, the at least one enzyme involved in the guanosine monophosphate biosynthesis pathway is GMP synthetase.
CYP450s are a diverse group of heme-containing enzymes, which catalyze a wide range of oxidative reactions. Cytochrome P450 monooxygenases (CYP450) also catalyze the hydroxylation of isopentenyladenine-type cytokinins. In Arabidopsis, 2 cytochrome P450 monooxygenases for hydroxylation of isopentenyladenine-type cytokinins are present, CYP735A1 and CYP735A2. The CYP735As catalyze a stereo-specific reaction of hydroxylation of iP-nucleotides, and with lower affinity the hydroxylation of the iP-nucleoside or iP to synthesize tZ (Takei et al., 2004b). Most CYP450s of plant origin are membrane-anchored to the endoplasmic reticulum (ER). Cytochrome P450 monooxygenases are also present in bacteria, such as Rhodococcus fascians. The cytochromes P450 generally fall into two broad classes, depending on the nature of the auxiliary protein(s). Class I P450s are found on the membranes of mitochondria and in bacteria, and class II cytochromes P450 are typified by the liver microsomal enzymes in mammalian cells. Class I P450s are three-component systems comprising a flavin adenine dinucleotide (FAD)-containing reductase, an iron-sulfur protein (ferredoxin), and the P450. Class 11 cytochromes P450 are composed of an FAD-containing, flavin mononucleotide (FMN)-containing NADPH-dependent cytochrome P450 reductase and a P450. Class III and class IV CYP450s have also been described in bacteria, but class I are the most common CYP450s in bacteria. The best-characterized bacterial cytochrome P450 monooxygenase system from Pseudomonas putida P450cam consists of three soluble proteins: putidaredoxin reductase; putidaredoxin, an intermediary iron-sulfur protein; and the cytochrome P450cam. The CYP450 Fas1 is encoded in the fas region of the linear plasmid and is thought to hydroxylate cytokinins produced by R. fascians (Frebort et al., 2011).
Cytochrome P450 monooxygenase (CYP450) can hydroxylate iP-nucleotides to produce trans-zeatin in three enzymatic steps from AMP and DMAPP. The enzyme can stereo-specifically hydroxylate the prenyl side chain of iPRMP and can thus make a bypass to tZRMP, which can be further activated to produce trans-zeatin.
Hence, according to some embodiments, the bacterium of the present invention is characterized in that it has been (further) modified to have increased protein expression of a polypeptide having cytochrome P450 monooxygenase (CYP450) activity compared to an otherwise identical bacterium that does not carry said modification.
By “increased protein expression” it is meant that the amount of the polypeptide having cytochrome P450 monooxygenase (CYP450) activity produced by the thus modified bacterium is increased compared to an otherwise identical bacterium that does not carry said modification. More particularly, by “increased expression” it is meant that the amount of the polypeptide having cytochrome P450 monooxygenase (CYP450) activity produced by the thus modified bacterium is increased by at least 10%, such as at least 20%, at least 30%, at least 40%, at least 50% at least 60%, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200%, at least 300%, at least 400%, at least 500%, at least 600%, at least 700% at least 800%, at least about 900%, at least about 1000%, at least about 2000%, at least about 3000%, at least about 4000%, at least about 5000%, at least about 6000%, at least about 7000%, at least about 8000% at least about 9000% or at least about 10000%, compared to an otherwise identical bacterium that does not carry said modification. The amount of protein in a given cell can be determined by any suitable quantification technique known in the art, such as ELISA, Immunohistochemistry, or Western Blotting.
An increase in protein expression may be achieved by any suitable means well-known to those skilled in the art. For example, an increase in protein expression may be achieved by increasing the number of copies of the gene encoding the polypeptide having cytochrome P450 monooxygenase (CYP450) activity in the bacterium, such as by introducing into the bacterium an exogenous nucleic acid, such as a vector, comprising the gene encoding the polypeptide having cytochrome P450 monooxygenase (CYP450) activity operably linked to a promoter that is functional in the bacterium to cause the production of an mRNA molecule.
An increase in protein expression may also be achieved by the integration of at least a second copy of the gene encoding the polypeptide having cytochrome P450 monooxygenase (CYP450) activity into the genome of the bacterium.
An increase in protein expression may also be achieved by increasing the strength of the promoter operably linked to the gene encoding the polypeptide having cytochrome P450 monooxygenase (CYP450) activity. The promoters that can be used include natural promoters from Bacillus subtilis, Bacillus amyloliquefaciens or similar, such as P43, P15, Pveg, Pylb, PgroES, PsigX, PtrnQ, Ppst, PsodA, PrpsF, PlepA, PliaG, PrpsF, Ppst, PfusA, PsodA, Phag as well as artificial promoters active in Bacillus subtilis or inducible Bacillus subtilis promoters, such as PmtlA, Pspac, PxylA, PsacB, or similar. Further examples include natural promoters from Corynebacterium, such as P CP_2454, Ptuf and Psod, natural promoters from E. coli, such as T7, ParaBAD, Plac, Ptac and Ptrc, and the promoter P F1 derived from the corynephage BFK20.
An increase in protein expression may also be achieved by modifying the ribosome binding site on the mRNA molecule encoding the polypeptide having cytochrome P450 monooxygenase (CYP450) activity. By modifying the sequence of the ribosome binding site the translation initiation rate may be increased, thus increasing translation efficiency.
According to some embodiments, the increase in the number of copies of the gene is achieved by introducing into the bacterium one or more (such as two or three) exogenous nucleic acid molecules (such as one or more vectors) comprising the gene operably linked to a promoter that is functional in the host cell to cause the production of an mRNA molecule.
According to some embodiments, a bacterium of the invention comprises an exogenous nucleic acid molecule (such as a vector) comprising one or more (such as two, three, or four) nucleotide sequences encoding the polypeptide having cytochrome P450 monooxygenase (CYP450) activity. Suitably, the exogenous nucleic acid molecule further comprises a promoter that is functional in the bacterium to cause the production of an mRNA molecule and that is operably linked to the nucleotide sequence encoding said polypeptide having cytochrome P450 monooxygenase (CYP450) activity. According to some embodiments, the exogenous nucleic acid molecule is stably integrated into the genome of the bacterium.
The polypeptide having cytochrome P450 monooxygenase (CYP450) activity may be derived from the same species as the bacterium in which it is expressed or may be derived from a species different to the one in which it is expressed (i.e. it is heterologous). According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity is derived from the same species as the bacterium in which it is expressed. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity is derived from a species different from the one in which it is expressed (i.e. it is heterologous). By way of example, in case that a bacterium does not have an endogenous gene encoding a polypeptide having cytochrome P450 monooxygenase (CYP450) activity, then the polypeptide having cytochrome P450 monooxygenase (CYP450) activity expressed in said bacterium will be heterologous to said bacterium.
According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity is a bacterial polypeptide having cytochrome P450 monooxygenase (CYP450) activity. With “bacterial polypeptide having cytochrome P450 monooxygenase (CYP450) activity” it is meant that the polypeptide having cytochrome P450 monooxygenase (CYP450) activity is naturally derived from a bacterium, such as Rhodococcus fascians. A non-limiting example of a bacterial polypeptide having cytochrome P450 monooxygenase (CYP450) activity is set out in SEQ ID NO: 93.
According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity is a plant polypeptide having cytochrome P450 monooxygenase (CYP450) activity. With “plant polypeptide having cytochrome P450 monooxygenase (CYP450) activity” it is meant that the polypeptide having cytochrome P450 monooxygenase (CYP450) activity is naturally derived from a plant, such as Arabidopsis thaliana. Non-limiting examples of a plant polypeptide having cytochrome P450 monooxygenase (CYP450) activity are set out in SEQ ID NOs: 94 and 95.
A polypeptide having cytochrome P450 monooxygenase (CYP450) activity for use according to the invention may for instance be a polypeptide having cytochrome P450 monooxygenase (CYP450) activity selected from the group consisting of: i) a polypeptide comprising an amino acid sequence of any one of SEQ ID NOs: 93 to 95; and ii) a polypeptide comprising an amino acid sequence, which has at least about 70%, such as at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 93%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, sequence identity to the amino acid sequence of any one of SEQ ID NOs: 93 to 95.
According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 93 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 93. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 93 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 93. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 93 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 93. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 93.
According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 94 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 94. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 94 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 94. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 94 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 94. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 94.
According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 95 or an amino acid sequence having at least 70%, such as at least 75%, sequence identity with SEQ ID NO: 95. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 95 or an amino acid sequence having at least 80%, such as at least 85%, sequence identity with SEQ ID NO: 95. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 95 or an amino acid sequence having at least 90%, such as at least 95%, sequence identity with SEQ ID NO: 95. According to some embodiments, the polypeptide having cytochrome P450 monooxygenase (CYP450) activity comprises the amino acid sequence of SEQ ID NO: 95.
A bacterium according to the present invention can be produced from any suitable bacterium. The bacterium may be Gram-positive or Gram-negative. Non-limiting examples for Gram-negative bacterial host cells include species from the genera Escherichia, Erwinia, Klebsiella, and Citrobacter.
Non-limiting examples of Gram-positive bacterial host cells include species from the genera Bacillus, Lactococcus, Lactobacillus, Clostridium, Corynebacterium, Streptomyces, Streptococcus, and Cellulomonas. According to some embodiments, the bacterium of the present invention is Gram-positive. According to some embodiments, the bacterium of the present invention is Gram-negative.
According to some embodiments, the bacterium of the present invention is a bacterium of the family selected from the group consisting of Enterobacteriaceae, Bacillaceae, Lactobacillaceae, and Corynebacteriaceae. According to some embodiments, the recombinant host cell is a bacterium of the family Enterobacteriaceae. According to some embodiments, the recombinant host cell is a bacterium of the family Bacillaceae. According to some embodiments, the recombinant host cell is a bacterium of the family Corynebacteriaceae.
According to some embodiments, the bacterium of the present invention is a bacterium, which may be a bacterium of the genus Bacillus, Lactococcus, Lactobacillus, Clostridium, Corynebacterium, Geobacillus, Thermoanaerobacterium, Streptococcus, Pseudomonas, Streptomyces, Escherichia, Shigella, Acinetobacter, Citrobacter, Salmonella, Klebsiella, Enterobacter, Erwinia, Kluyvera, Serratia, Cedecea, Morganella, Hafnia, Edwardsiella, Providencia, Proteus, or Yersinia.
According to some embodiments, the bacterium of the present invention is a bacterium of the genus Bacillus. Non-limiting examples of a bacterium of the genus Bacillus are Bacillus subtilis, Bacillus amyloliquefaciens, Bacillus licheniformis, and Bacillus mojavensis. According to some embodiments, the bacterium of the present invention is Bacillus subtilis.
According to some embodiments, the bacterium of the present invention is a bacterium of the genus Corynebacterium. Non-limiting examples of a bacterium of the genus Corynebacterium are Corynebacterium glutamicum and Corynebacterium stationis. According to some, the bacterium of the present invention is Corynebacterium glutamicum. According to some, the bacterium of the present invention is Corynebacterium stationis. Within the context of the present invention, Corynebacterium stationis and Corynebacterium ammoniagenes refer to the same species and may be used interchangeably.
According to some embodiments, the bacterium of the present invention is a bacterium of the genus Escherichia. A non-limiting example of a bacterium of the genus Escherichia is Escherichia coli. According to some, the bacterium of the present invention is Escherichia coli.
As noted above, a bacterium of the invention is modified to express one or more polypeptides as detailed herein, which may mean that an exogenous nucleic acid molecule, such as a DNA molecule, which comprises a nucleotide sequence encoding said polypeptide has been introduced in the bacterium. Thus, a bacterium of the present invention may comprise an exogenous nucleic acid molecule, such as a DNA molecule, which comprises a nucleotide sequence encoding the polypeptide in question. Techniques for introducing an exogenous nucleic acid molecule, such as a DNA molecule, into a bacterial cell are well-known to those of skill in the art and include transformation (e.g., heat shock or natural transformation) among others.
In order to facilitate (over)expression of a polypeptide in the bacterium, the exogenous nucleic acid molecule may comprise suitable regulatory elements such as a promoter that is functional in the bacterial cell to cause the production of an mRNA molecule and that is operably linked to the nucleotide sequence encoding said polypeptide.
Promoters useful in accordance with the invention are any known promoters that are functional in a given host cell to cause the production of an mRNA molecule. Many such promoters are known to the skilled person. Such promoters include promoters normally associated with other genes, and/or promoters isolated from any bacteria. The use of promoters for protein expression is generally known to those skilled in the art of molecular biology, for example, see Sambrook et al., Molecular cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989. The promoter employed may be inducible, such as a temperature-inducible promoter (e.g., a pL or pR phage lambda promoters, each of which can be controlled by the temperature-sensitive lambda repressor c1857). The term “inducible” used in the context of a promoter means that the promoter only directs transcription of an operably linked nucleotide sequence if a stimulus is present, such as a change in temperature or the presence of a chemical substance (“chemical inducer”). As used herein, “chemical induction” according to the present invention refers to the physical application of an exogenous or endogenous substance (incl. macromolecules, e.g., proteins or nucleic acids) to a host cell. This has the effect of causing the target promoter present in the host cell to increase the rate of transcription. Alternatively, the promoter employed may be constitutive. The term “constitutive” used in the context of a promoter means that the promoter is capable of directing transcription of an operably linked nucleotide sequence in the absence of a stimulus (such as heat shock, chemicals, etc.).
Temperature induction systems work, for example, by employing promoters that are repressed by thermolabile repressors. These repressors are active at lower temperatures for example at 30° C., while unable to fold correctly at 37° C. and are therefore inactive. Such circuits therefore can be used to directly regulate the genes of interest also by genome integration of the genes along with the repressors. Examples of such a temperature-inducible expression system are based on the pL and/or pR X phage promoters, which are regulated by the thermolabile c1857 repressor. Similar to the genome integrated DE3 system, the expression of the T7 RNA polymerase gene may also be controlled using a temperature-controlled promoter system, while the expression of the genes of interest can be controlled using a T7 promoter.
Non-limiting examples of promoters functional in bacteria include both constitutive and inducible promoters such as T7 promoter, the beta-lactamase and lactose promoter systems; alkaline phosphatase (phoA) promoter, a tryptophan (trp) promoter system, tetracycline promoter, lambda-phage promoter, ribosomal protein promoters; and hybrid promoters such as the tac promoter. Other bacterial and synthetic promoters are also suitable.
Besides a promoter, the exogenous nucleic acid molecule may further comprise at least one regulatory element selected from a 5′ untranslated region (5′UTR) and 3′ untranslated region (3′ UTR). Many such 5′ UTRs and 3′ UTRs derived from prokaryotes and eukaryotes are well known to the skilled person. Such regulatory elements include 5′ UTRs and 3′ UTRs normally associated with other genes, and/or 5′ UTRs and 3′ UTRs isolated from any bacteria.
Usually, the 5′ UTR contains a ribosome binding site (RBS), also known as the Shine-Dalgarno sequence, which is usually 3-10 base pairs upstream from the initiation codon.
The exogenous nucleic acid molecule may be a vector or part of a vector, such as an expression vector. Normally, such a vector remains extrachromosomal within the bacterial cell, which means that it is found outside of the nucleus or nucleoid region of the bacterium.
It is also contemplated by the present invention that the exogenous nucleic acid molecule is stably integrated into the genome of the bacterium. Means for stable integration into the genome of a host cell, e.g., by homologous recombination, are well known to the skilled person.
Method of the Invention
The present invention also provides methods for producing an isoprenoid cytokinin or riboside derivative thereof, comprising cultivating a bacterium according to the present invention under suitable culture conditions in a suitable culture medium.
The method may further comprise collecting the isoprenoid cytokinin or riboside derivative thereof from the culture medium.
According to some embodiments, the isoprenoid cytokinin or riboside derivative thereof is selected from the group consisting of trans-zeatin (tZ), trans-zeatin riboside (tZR), N6-(D2-isopentenyl)adenine (iP), N6-(dimethylallyl)adenosine (iPR), dihydrozeatin (DZ), ribosyl dihydrozeatin (DZR), and combinations thereof.
According to some embodiments, the isoprenoid cytokinin or riboside derivative thereof is trans-zeatin (tZ) and trans-zeatin riboside (tZR), respectively.
The culture medium employed may be any conventional medium suitable for culturing a bacterium cell in question, and may be composed according to the principles of the prior art. The medium will usually contain all nutrients necessary for the growth and survival of the respective bacterium, such as carbon and nitrogen sources and other inorganic salts. Suitable media, e.g. minimal or complex media, are available from commercial suppliers or may be prepared according to published receipts, e.g. the American Type Culture Collection (ATCC) Catalogue of strains. Non-limiting standard media well known to the skilled person include Luria Bertani (LB) broth, Sabouraud Dextrose (SD) broth, MS broth, Yeast Peptone Dextrose, BMMY, GMMY, or Yeast Malt Extract (YM) broth, which are all commercially available. A non-limiting example of suitable media for culturing bacterial cells, such as B. subtilis, L. lactis or E. coli cells, including minimal media and rich media such as Luria Broth (LB), M9 media, M17 media, SA media, MOPS media, Terrific Broth, YT and others.
The carbon source may be any suitable carbon substrate known in the art, and particularly any carbon substrate commonly used in the cultivation of microorganisms and/or fermentation. Non-limiting examples of suitable fermentable carbon substrates include carbohydrates (e.g., C5 sugars such as arabinose or xylose, or C6 sugars such as glucose), glycerol, glycerine, acetate, dihydroxyacetone, one-carbon source, methanol, methane, oils, animal fats, animal oils, plant oils, fatty acids, lipids, phospholipids, glycerolipids, monoglycerides, diglycerides, triglycerides, renewable carbon sources, polypeptides (e.g., a microbial or plant protein or peptide), yeast extract, component from a yeast extract, peptone, casaminoacids or any combination of two or more of the foregoing.
As the nitrogen source, various ammonium salts such as ammonia and ammonium sulfate, other nitrogen compounds such as amines, a natural nitrogen source such as peptone, soybean-hydrolysate, and digested fermentative microorganism can be used. As minerals, potassium monophosphate, magnesium sulfate, sodium chloride, ferrous sulfate, manganese sulfate, calcium chloride, and the like can be used.
In order to further improve the production of the isoprenoid cytokinin or riboside derivative thereof, such as is trans-zeatin (tZ) and trans-zeatin riboside (tZR), respectively, the culture medium may be supplemented with a source of adenine or adenosine, such as adenine sulfate. Thus, according to some embodiments, the culture medium comprises adenine sulfate. The concentration of adenine sulfate in the culture medium may generally be in the range from about 0.1 g/L to about 4 g/L, such as from about 1 g/L to about 3.5 g/L. According to some embodiments, the concentration of adenine sulfate in the culture medium is in the range from about 2.5 g/L to about 3.5 g/L. However, other sources of adenine or adenosine, such as yeast extract, are also contemplated for use in accordance with the invention.
The cultivation can be preferably performed under aerobic conditions, such as by a shaking culture, and by a stirring culture with aeration, at a temperature of about 20 to about 45° C., such as about 30 to 38° C., such as at about 37° C. According to some embodiments, the cultivation is performed at a temperature of about 30 to 38° C., such as at about 37° C. The pH of the culture is usually above 5, such as in a range from about 6 to about 8, preferably from about 6.5 to about 7.5, more preferably from about 6.8 to about 7.2. According to some embodiments, the cultivation is performed at a pH from about 6 to about 8. The pH of the culture can be adjusted with ammonia, calcium carbonate, various acids, various bases, and buffers. The cultivation may be carried out for a period in the range from 10 to 70 h, such as in the range of 10 to 24 h or 10 to 48 h.
After cultivation, solids such as cells can be removed from the culture medium by centrifugation or membrane filtration. The isoprenoid cytokinin or riboside derivative thereof can be collected by a conventional method for the isolation and purification of chemical compounds from a medium.
Well-known purification procedures include, but are not limited to, centrifugation or filtration, precipitation, ion exchange, chromatographic methods such as e.g. ion-exchange chromatography or gel filtration chromatography, and crystallization methods.
The present invention thus provides an isoprenoid cytokinin or riboside derivative thereof obtainable by a method as detailed herein.
Abbreviations
As used herein, a “polypeptide having adenylate isopentenyltransferase activity” or a “polypeptide, which has adenylate isopentenyltransferase activity” means a polypeptide that catalyzes the reactions: Dimethylallyl diphosphate+AMP<=>diphosphate+N(6)-(dimethylallyl)adenosine 5-phosphate (EC 2.5.1.27) and optionally 1-hydroxy-2-methyl-2-butenyl 4-diphosphate+AMP<=>diphosphate+trans-zeatin riboside 5-phosphate. Non-limiting examples of such polypeptides are provided in SEQ ID NOs: 1 to 33.
As used herein, a “polypeptide having cytokinin riboside 5-monophosphate phosphoribohydrolase activity” or a “polypeptide which has cytokinin riboside 5-monophosphate phosphoribohydrolase activity” means a polypeptide that catalyzes the reaction: N(6)-(Delta(2)-isopentenyl)-adenosine 5-phosphate+H(2)O<=>N(6)-(dimethylallyl)adenine+D-ribose 5-phosphate (EC 3.2.2.n1). Non-limiting examples of such polypeptides are provided in SEQ ID NOs: 34 to 62.
As used herein, a “polypeptide having 1-deoxy-D-xylulose-5-phosphate synthase activity” or a “polypeptide, which has 1-deoxy-D-xylulose-5-phosphate synthase activity” means a polypeptide that catalyzes the reaction: Pyruvate+D-glyceraldehyde 3-phosphate<=>1-deoxy-D-xylulose 5-phosphate+CO(2) (EC 2.2.1.7). Non-limiting examples of such polypeptides are provided in SEQ ID NOs: 63 to 70.
As used herein, an “enzyme having ribose-phosphate diphosphokinase activity” or an “enzyme, which has ribose-phosphate diphosphokinase activity” means an enzyme that catalyzes the reaction: ATP+D-ribose 5-phosphate<=>AMP+5-phospho-alpha-D-ribose 1-diphosphate (EC 2.7.6.1). An enzyme having ribose-phosphate diphosphokinase activity is encoded by the bacterial gene prs or an ortholog thereof.
As used herein, an “enzyme having amidophosphoribosyltransferase activity” or an “enzyme, which has amidophosphoribosyltransferase activity” means an enzyme that catalyzes the reaction: 5-phospho-beta-D-ribosylamine+diphosphate+L-glutamate<=>L-glutamine+5-phospho-alpha-D-ribose 1-diphosphate+H2O (EC 2.4.2.14). An enzyme having amidophosphoribosyltransferase activity is encoded by the bacterial gene purF or an ortholog thereof.
As used herein, an “enzyme having formyltetrahydrofolate deformylase activity” or an “enzyme, which has formyltetrahydrofolate deformylase activity” means an enzyme that catalyzes the reaction: 10-formyltetrahydrofolate+H2O<=>formate+tetrahydrofolate (EC 3.5.1.10). An enzyme having amidophosphoribosyltransferase activity is encoded by the bacterial gene purU or an ortholog thereof.
As used herein, an “enzyme having phosphoribosylamine-glycine ligase activity” or an “enzyme, which has phosphoribosylamine-glycine ligase activity” means an enzyme that catalyzes the reaction: ATP+5-phospho-beta-D-ribosylamine+glycine<=>ADP+phosphate+N1-(5-phospho-beta-D-ribosyl)glycinamide (EC 6.3.4.13). An enzyme having phosphoribosylamine-glycine ligase activity is encoded by the bacterial gene purD or an ortholog thereof.
As used herein, an “enzyme having phosphoribosylglycineamide formyltransferase activity” or an “enzyme, which has phosphoribosylglycineamide formyltransferase activity” means an enzyme that catalyzes the reaction: 10-formyltetrahydrofolate+N1-(5-phospho-beta-D-ribosyl)glycinamide<=>tetrahydrofolate+N2-formyl-N1-(5-phospho-beta-D-ribosyl)glycinamide (EC 2.1.2.2). An enzyme having phosphoribosylglycineamide formyltransferase activity is encoded by the bacterial gene purT or an ortholog thereof.
As used herein, an “enzyme having phosphoribosylformylglycinamidine synthase activity” or an “enzyme, which has phosphoribosylformylglycinamidine synthase activity” means an enzyme that catalyzes the reaction: ATP+N2-formyl-N1-(5-phospho-beta-D-ribosyl)glycinamide+L-glutamine+H2O<=>ADP+phosphate+2-(formamido)-N1-(5-phospho-beta-D-ribosyl)acetamidine+L-glutamate (EC 6.3.5.3). An enzyme having phosphoribosylformylglycinamidine synthase activity is encoded by the bacterial gene purL or an ortholog thereof.
As used herein, an “enzyme having phosphoribosylformylglycineamidine cyclo-ligase activity” or an “enzyme, which has phosphoribosylformylglycineamidine cyclo-ligase activity” means an enzyme that catalyzes the reaction: ATP+2-(formamido)-N1-(5-phospho-beta-D-ribosyl)acetamidine<=>ADP+phosphate+5-amino-1-(5-phospho-beta-D-ribosyl)imidazole (EC 6.3.3.1). An enzyme having phosphoribosylformylglycineamidine cyclo-ligase activity is encoded by the bacterial gene purM or an ortholog thereof.
As used herein, an “enzyme having N5-carboxyaminoimidazole ribonucleotide synthetase activity” or an “enzyme, which has N5-carboxyaminoimidazole ribonucleotide synthetase activity” means an enzyme that catalyzes the reaction: ATP+5-amino-1-(5-phospho-D-ribosyl)imidazole+HCO3−<=>ADP+phosphate+5-carboxyamino-1-(5-phospho-D-ribosyl)imidazole (EC 6.3.4.18). An enzyme having N5-carboxyaminoimidazole ribonucleotide synthetase activity is encoded by the bacterial gene purK or an ortholog thereof.
As used herein, an “enzyme having N5-carboxyaminoimidazole ribonucleotide mutase activity” or an “enzyme, which has N5-carboxyaminoimidazole ribonucleotide mutase activity” means an enzyme that catalyzes the reaction: 5-carboxyamino-1-(5-phospho-D-ribosyl)imidazole<=>5-amino-1-(5-phospho-D-ribosyl)imidazole-4-carboxylate (EC 5.4.99.18). An enzyme having N5-carboxyaminoimidazole ribonucleotide mutase activity is encoded by the bacterial gene purE or an ortholog thereof.
As used herein, an “enzyme having phosphoribosylaminoimidazolesuccinocarboxamide synthase activity” or an “enzyme, which has phosphoribosylaminoimidazolesuccinocarboxamide synthase activity” means an enzyme that catalyzes the reaction: ATP+5-amino-1-(5-phospho-D-ribosyl)imidazole-4-carboxylate+L-aspartate<=>ADP+phosphate+(S)-2-(5-amino-1-(5-phospho-D-ribosyl)imidazole-4-carboxamido)succinate (EC 6.3.2.6). An enzyme having phosphoribosylaminoimidazolesuccinocarboxamide synthase activity is encoded by the bacterial gene purC or an ortholog thereof.
As used herein, an “enzyme having adenylosuccinate lyase activity” or an “enzyme, which has adenylosuccinate lyase activity” means an enzyme that catalyzes the reaction: (S)-2-(5-amino-1-(5-phospho-D-ribosyl)imidazole-4-carboxamido)succinate<=>fumarate+5-amino-1-(5-phospho-D-ribosyl)imidazole-4-carboxamide (EC 4.3.2.2). An enzyme having adenylosuccinate lyase activity is encoded by the bacterial gene purB or an ortholog thereof.
As used herein, an “enzyme having phosphoribosylaminoimidazole-carboxamide formyltransferase activity” or an “enzyme, which has phosphoribosylaminoimidazole-carboxamide formyltransferase activity” means an enzyme that catalyzes the reaction: 10-formyltetrahydrofolate+5-amino-1-(5-phospho-D-ribosyl)imidazole-4-carboxamide<=>tetrahydrofolate+5-formamido-1-(5-phospho-D-ribosyl)imidazole-4-carboxamide (EC 2.1.2.3). An enzyme having phosphoribosylaminoimidazole-carboxamide formyltransferase activity is encoded by the bacterial gene purH or an ortholog thereof.
As used herein, an “enzyme having IMP cyclohydrolase activity” or an “enzyme, which has IMP cyclohydrolase activity” means an enzyme that catalyzes the reaction: IMP+H2O<=>5-formamido-1-(5-phospho-D-ribosyl)imidazole-4-carboxamide (EC 3.5.4.10). An enzyme having IMP cyclohydrolase activity is, for example, encoded by the bacterial gene purH or an ortholog thereof.
As used herein, an “enzyme having adenylosuccinate synthase activity” or an “enzyme, which has adenylosuccinate synthase activity” means an enzyme that catalyzes the reaction: GTP+IMP+L-aspartate<=>GDP+phosphate+N6-(1,2-dicarboxyethyl)-AMP (EC 6.3.4.4). An enzyme having adenylosuccinate synthase activity is encoded by the bacterial gene purA or an ortholog thereof.
As used herein, an “enzyme having adenylate kinase activity” or an “enzyme, which has adenylate kinase activity” means an enzyme that catalyzes the reaction: ATP+AMP<=>2 ADP (EC 2.7.4.3). An enzyme having adenylate kinase activity is encoded by the bacterial gene adk or an ortholog thereof.
As used herein, an “enzyme having ATP synthase activity” or an “enzyme, which has ATP synthase activity” means an enzyme that catalyzes the reaction: ATP+H2O+H+ (cytosol)<=>ADP+phosphate+H+ (periplasm) (EC 3.6.3.14). An enzyme having ATP synthase activity is, for example, the ATP synthase F0 or F1 complex encoded by the bacterial atp operon (including the genes atpB, atpF, atpE, atpD, atpG, atpA, atpH and atpC) or orthologs thereof.
As used herein, an “enzyme having adenosine kinase activity” or an “enzyme, which has adenosine kinase activity” means an enzyme that catalyzes the reaction: ATP+adenosine<=>ADP+AMP (EC 2.7.1.20). An enzyme having adenosine kinase activity is encoded by the bacterial gene adk or orthologs thereof.
As used herein, an “enzyme having IMP dehydrogenase activity” or an “enzyme, which has IMP dehydrogenase activity” means an enzyme that catalyzes the reaction: Inosine 5-phosphate+NAD++H2O<=>xanthosine 5-phosphate+NADH (EC 1.1.1.205). An enzyme having IMP dehydrogenase activity is encoded by the bacterial gene guaB or an ortholog thereof.
As used herein, an “enzyme having GMP synthase activity” or an “enzyme, which has GMP synthase activity” means an enzyme that catalyzes the reaction: ATP+XMP+L-glutamine+H2O<=>AMP+diphosphate+GMP+L-glutamate (EC 6.3.5.2). An enzyme having GMP synthase activity is encoded by the bacterial gene guaA or an ortholog thereof.
As used herein, an “enzyme having purine nucleoside phosphorylase activity” or an “enzyme, which has purine nucleoside phosphorylase activity” means an enzyme that catalyzes the reaction: Purine nucleoside+phosphate<=>purine+alpha-D-ribose 1-phosphate (EC 2.4.2.1). An enzyme having purine nucleoside phosphorylase activity is, for example, encoded by the bacterial gene deoD or an ortholog thereof.
As used herein, an “enzyme having adenosine phosphoribosyltransferase activity” or an “enzyme, which has adenosine phosphoribosyltransferase activity” means an enzyme that catalyzes the reaction: AMP+diphosphate<=>adenine+5-phospho-alpha-D-ribose 1-diphosphate (EC 2.4.2.7). An enzyme having adenosine phosphoribosyltransferase activity is, for example, encoded by the bacterial gene apt or an ortholog thereof.
As used herein, “purine nucleotide biosynthesis pathway” is understood to include both the de novo biosynthesis pathway and the salvage pathway by which nucleotides are synthesized.
As used herein, “adenosine monophosphate biosynthesis pathway” is understood to include both the de novo biosynthesis pathway and the salvage pathway by which adenosine monophosphate is synthesized.
As used herein, “guanosine monophosphate biosynthesis pathway” is understood to include both the de novo biosynthesis pathway and the salvage pathway by which guanosine monophosphate is synthesized.
As used herein, a “polypeptide having cytochrome P450 monooxygenase (CYP450) activity” or a “polypeptide, which has cytochrome P450 monooxygenase (CYP450) activity” means a polypeptide that catalyzes the trans-hydroxylation reactions: 1) N6-(Δ2-isopentenyl)-adenosine 5-monophosphate+a reduced [NADPH-hemoprotein reductase]+oxygen→trans-zeatin riboside monophosphate+an oxidized [NADPH-hemoprotein reductase]+H2O; 2) N6-(Δ2-isopentenyl)-adenosine 5-triphosphate+a reduced [NADPH-hemoprotein reductase]+oxygen→trans-zeatin riboside triphosphate+an oxidized [NADPH-hemoprotein reductase]+H2O; 3) N6-(Δ2-isopentenyl)-adenosine 5-diphosphate+a reduced [NADPH-hemoprotein reductase]+oxygen→trans-zeatin riboside diphosphate+an oxidized [NADPH-hemoprotein reductase]+H2O (EC 1.14.14.-). A polypeptide having cytochrome P450 monooxygenase (CYP450) activity is, for example, encoded by the bacterial gene FAS1, or Arabidopsis thaliana (plant) genes CYP735A1 and CYP735A2, or an ortholog thereof. Non-limiting examples include SEQ ID 93 to 95.
“Polypeptide” and “protein” are used interchangeably herein to denote a polymer of at least two amino acids covalently linked by an amide bond, regardless of length or post-translational modification (e.g., glycosylation, phosphorylation, lipidation, myristoylation, ubiquitination, etc.). Included within this definition are D- and L-amino acids, and mixtures of D- and L-amino acids.
“Nucleic acid” or “polynucleotide” are used interchangeably herein to denote a polymer of at least two nucleic acid monomer units or bases (e.g., adenine, cytosine, guanine, thymine) covalently linked by a phosphodiester bond, regardless of length or base modification.
“Recombinant” or “non-naturally occurring” when used with reference to, e.g., a bacterium, nucleic acid, or polypeptide, refers to a material, or a material corresponding to the natural or native form of the material, that has been modified in a manner that would not otherwise exist in nature, or is identical thereto but produced or derived from synthetic materials and/or by manipulation using recombinant techniques. Non-limiting examples include, among others, recombinant bacterial cells expressing genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise expressed at a different level.
“Heterologous” or “exogenous” as used herein in the context of a gene or nucleic acid molecule refer to a gene or nucleic acid molecule (i.e. DNA or RNA molecule) that does not occur naturally as part of the genome of the bacterium in which it is present or which is found in a location or locations in the genome that differ from that in which it occurs in nature. Thus, a “heterologous” or “exogenous” gene or nucleic acid molecule is not endogenous to the bacterium and has been exogenously introduced into the microorganism. A “heterologous” gene or nucleic acid molecule DNA molecule may be from a different organism, a different species, a different genus, or a different kingdom, as the host DNA.
“Heterologous” as used herein in the context of a polypeptide means that a polypeptide is normally not found in or made (i.e. expressed) by the host microorganism, but derived from a different organism, a different species, a different genus, or a different kingdom.
As used herein, the term “ortholog” or “orthologs” refers to genes, nucleic acid molecules encoded thereby, i.e., mRNA, or proteins encoded thereby that are derived from a common ancestor gene but are present in different species.
By “decreased expression level” of a gene it is meant that the amount of the transcription product, respectively the amount of the polypeptide encoded by said gene produced by the modified bacterium is decreased compared to an otherwise identical bacterium that does not carry said modification. More particularly, by “decreased expression level” of a gene it is meant that the amount of the transcription product, respectively the amount of the polypeptide encoded by said gene produced by the modified bacterium is decreased by at least 10%, such as at least 20%, at least 30%, at least 40%, at least 50% at least 60%, at least 70%, at least 80%, at least 90% or at least 100%, compared to an otherwise identical bacterium that does not carry said modification. The level of expression of a gene can be determined by well-known methods, including PCR, Southern blotting, and the like. In addition, the level of gene expression can be estimated by measuring the amount of mRNA transcribed from the gene using various well-known methods, including Northern blotting, quantitative RT-PCR, and the like. The amount of the polypeptide encoded by the gene can be measured by well-known methods, including ELISA, Immunohistochemistry, or Western Blotting, and the like.
Expression of a gene can be decreased by introducing a mutation into the gene in the genome of the modified bacterium so that the intracellular activity of the polypeptide encoded by the gene is decreased as compared to an otherwise identical bacterium that does not carry the said mutation. Mutations, which result in a decreased expression of the gene include the replacement of one nucleotide or more to cause an amino acid substitution in the polypeptide encoded by the gene (missense mutation), the introduction of a stop codon (nonsense mutation), deletion, or insertion of nucleotides to cause a frameshift, insertion of a drug-resistance gene, or deletion of a part of the gene or the entire gene (Qiu and Goodman, 1997; Kwon et al., 2000). The expression can also be decreased by modifying an expression regulating sequence such as the promoter, the Shine-Dalgarno (SD) sequence, etc. Expression of the gene can also be decreased by gene replacement (Datsenko and Wanner, 2000), such as the “lambda-red mediated gene replacement”. The lambda-red mediated gene replacement is a particularly suitable method to inactive one or more genes as described herein.
“Inactivating”, “inactivation” and “inactivated”, when used in the context of a gene, means that the gene in question no longer expresses a functional protein. It is possible that the modified DNA region is unable to naturally express the gene due to the deletion of a part of or the entire gene sequence, the shifting of the reading frame of the gene, the introduction of missense/nonsense mutation(s), or the modification of an adjacent region of the gene, including sequences controlling gene expression, such as a promoter, enhancer, attenuator, ribosome-binding site, etc. Preferably, a gene of interest is inactivated by the deletion of a part of or the entire gene sequence, such as by gene replacement. Inactivation may also be accomplished by introducing or expressing a rare-cutting endonuclease able to selectively inactivate by DNA cleavage, preferably by the double-strand break, the gene of interest. A “rare-cutting endonuclease” within the context of the present invention includes transcription activator-like effector (TALE) nucleases, meganucleases, zinc-finger nucleases (ZFN), and RNA-guided endonucleases.
The presence or absence of a gene in the genome of a bacterium can be detected by well-known methods, including PCR, Southern blotting, and the like. In addition, the level of gene expression can be estimated by measuring the amount of mRNA transcribed from the gene using various well-known methods, including Northern blotting, quantitative RT-PCR, and the like. The amount of the protein encoded by the gene can be measured by well-known methods, including SDS-PAGE followed by an immunoblotting assay (Western blotting analysis), and the like.
By “increased expression level” of a gene it is meant that the amount of the transcription product, respectively the amount of the polypeptide encoded by said gene produced by the modified bacterium is increased compared to an otherwise identical bacterium that does not carry said modification. More particularly, by “increased expression level” of a gene it is meant that the amount of the transcription product, respectively the amount of the polypeptide encoded by said gene produced by the modified bacterium is increased by at least 10%, such as at least 20%, at least 30%, at least 40%, at least 50% at least 60%, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200%, at least 300%, at least 400%, at least 500%, at least 600%, at least 700% at least 800%, at least about 900%, at least about 1000%, at least about 2000%, at least about 3000%, at least about 4000%, at least about 5000%, at least about 6000%, at least about 7000%, at least about 8000% at least about 9000% or at least about 10000%, compared to an otherwise identical bacterium that does not carry said modification. The level of expression of a gene can be determined by well-known methods, including PCR, Southern blotting, and the like. In addition, the level of gene expression can be estimated by measuring the amount of mRNA transcribed from the gene using various well-known methods, including Northern blotting, quantitative RT-PCR, and the like. The amount of the polypeptide encoded by the gene can be measured by well-known methods, including ELISA, Immunohistochemistry, or Western Blotting, and the like.
By “increased expression level” of a polypeptide it is meant that the amount of the polypeptide in question produced by the modified microorganism is increased compared to an otherwise identical bacterium that does not carry said modification. More particularly, by “increased expression level” of a polypeptide it is meant that the amount of the polypeptide in question produced by the modified bacterium is increased by at least 10%, such as at least 20%, at least 30%, at least 40%, at least 50% at least 60%, at least 70%, at least 80%, at least 90%, at least 100%, at least 150%, at least 200%, at least 300%, at least 400%, at least 500%, at least 600%, at least 700% at least 800%, at least about 900%, at least about 1000%, at least about 2000%, at least about 3000%, at least about 4000%, at least about 5000%, at least about 6000%, at least about 7000%, at least about 8000% at least about 9000% or at least about 10000%, compared an otherwise identical bacterium that does not carry said modification. The amount of a polypeptide produced in a given cell can be determined by any suitable quantification technique known in the art, such as ELISA, Immunohistochemistry, or Western Blotting.
An increase in polypeptide expression may be achieved by any suitable means well-known to those skilled in the art. For example, an increase in polypeptide expression may be achieved by increasing the number of copies of the gene or genes encoding the polypeptide in the microorganism, such as by introducing into the microorganism an exogenous nucleic acid, such as a vector, comprising the gene or genes encoding the polypeptide operably linked to a promoter that is functional in the microorganism to cause the production of an mRNA molecule. An increase in polypeptide expression may also be achieved by the integration of at least a second copy of the gene or genes encoding the polypeptide into the genome of the microorganism. An increase in polypeptide expression may also be achieved by increasing the strength of the promoter(s) operably linked to the gene or genes encoding the polypeptide. An increase in polypeptide expression may also be achieved by modifying the ribosome binding site on the mRNA molecule encoding the polypeptide. By modifying the sequence of the ribosome binding site, the translation initiation rate may be increased, thus increasing translation efficiency.
As used herein, “decreased”, “decreasing” or “decrease of” expression of a polypeptide (e.g. an enzyme involved in the purine nucleotide degradation pathway) means that the expression of said polypeptide in a modified bacterium is reduced compared to the expression of said polypeptide in an otherwise identical bacterium that does not carry said modification (control). The expression of a polypeptide in a modified bacterium may be reduced by at least about 10%, and preferably by at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 99% or 100%, or any percentage, in whole integers between 10% and 100% (e.g., 6%, 7%, 8%, etc.), compared to the expression of said polypeptide in an otherwise identical bacterium that does not carry said modification (control). More particularly, “decreased”, “decreasing” or “decrease of” expression of a polypeptide means that the amount of the polypeptide in the modified bacterium is reduced by at least about 10%, and preferably by at least about 20%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 99% or 100%, or any percentage, in whole integers between 10% and 100% (e.g., 6%, 7%, 8%, etc.), compared to the amount of said polypeptide in an otherwise identical bacterium that does not carry said modification (control). The expression or amount of a polypeptide in a microorganism can be determined by any suitable means known in the art, including techniques such as ELISA, Immunohistochemistry, Western Blotting, or Flow Cytometry.
As used herein, “decreased”, “decreasing” or “decrease of” activity of a polypeptide (e.g. an enzyme involved in the purine nucleotide degradation pathway) means that the catalytic activity of said polypeptide in a modified bacterium is reduced compared to the catalytic activity of said polypeptide in an otherwise identical bacterium that does not carry said modification (control). The activity of a polypeptide in a modified bacterium may be reduced by at least about 10%, and preferably by at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 99% or 100%, or any percentage, in whole integers between 10% and 100% (e.g., 6%, 7%, 8%, etc.), compared to the expression of said polypeptide in an otherwise identical bacterium that does not carry said modification (control). The activity of a polypeptide in a microorganism can be determined by any suitable protein and enzyme activity assay.
As used herein, “inhibitor of the enzyme” refers to any chemical compound, natural or synthetic, that inhibits the catalytic activity of the enzyme. An inhibitor of the enzyme does not necessarily need to achieve 100% or complete inhibition. In this regard, an inhibitor of the enzyme can induce any level of inhibition.
“Substitution” or “substituted” refers to the modification of the polypeptide by replacing one amino acid residue with another, for instance, the replacement of a Serine residue with a Glycine or Alanine residue in a polypeptide sequence is an amino acid substitution. When used with reference to a polynucleotide, “substitution” or “substituted” refers to a modification of the polynucleotide by replacing one nucleotide with another, for instance, the replacement of cytosine with thymine in a polynucleotide sequence is a nucleotide substitution.
“Conservative substitution”, when used with reference to a polypeptide, refers to a substitution of an amino acid residue with a different residue having a similar side chain, and thus typically involves the substitution of the amino acid in the polypeptide with amino acids within the same or similar class of amino acids. By way of example and not limitation, an amino acid with an aliphatic side chain may be substituted with another aliphatic amino acid, e.g., alanine, valine, leucine, and isoleucine; an amino acid with a hydroxyl side chain is substituted with another amino acid with a hydroxyl side chain, e.g., serine and threonine; an amino acid having an aromatic side chain is substituted with another amino acid having an aromatic side chain, e.g., phenylalanine, tyrosine, tryptophan, and histidine; an amino acid with a basic side chain is substituted with another amino acid with a basic side chain, e.g., lysine and arginine; an amino acid with an acidic side chain is substituted with another amino acid with an acidic side chain, e.g., aspartic acid or glutamic acid; and a hydrophobic or hydrophilic amino acid is replaced with another hydrophobic or hydrophilic amino acid, respectively.
“Non-conservative substitution”, when used with reference to a polypeptide, refers to a substitution of an amino acid in a polypeptide with an amino acid with significantly differing side chain properties. Non-conservative substitutions may use amino acids between, rather than within, the defined groups and affects (a) the structure of the peptide backbone in the area of the substitution (e.g., serine for glycine), (b) the charge or hydrophobicity, or (c) the bulk of the side chain. By way of example and not limitation, an exemplary non-conservative substitution can be an acidic amino acid substituted with a basic or aliphatic amino acid; an aromatic amino acid substituted with a small amino acid, and a hydrophilic amino acid substituted with a hydrophobic amino acid.
“Expression” includes any step involved in the production of a polypeptide (e.g., encoded enzyme) including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
As used herein, the “regulatory region” of a gene refers to a nucleic acid sequence that affects the expression of a coding sequence. Regulatory regions are known in the art and include, but are not limited to, promoters, enhancers, transcription terminators, polyadenylation sites, matrix attachment regions, and/or other elements that regulate the expression of a coding sequence.
As used herein, “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid molecule to which it has been linked. One type of vector is a “plasmid”, which refers to a circular double-stranded nucleic acid loop into which additional nucleic acid segments can be ligated. Certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as “expression vectors”. Certain other vectors are capable of facilitating the insertion of an exogenous nucleic acid molecule into a genome of a bacterium. Such vectors are referred to herein as “transformation vectors”. In general, vectors of utility in recombinant nucleic acid techniques are often in the form of plasmids. In the present specification, “plasmid” and “vector” can be used interchangeably as the plasmid is the most commonly used form of a vector. Large numbers of suitable vectors are known to those of skill in the art and commercially available.
As used herein, “promoter” refers to a sequence of DNA, usually upstream (5) of the coding region of a structural gene, which controls the expression of the coding region by providing recognition and binding sites for RNA polymerase and other factors, which may be required for initiation of transcription. The selection of the promoter will depend upon the nucleic acid sequence of interest. A suitable “promoter” is generally one, which is capable of supporting the initiation of transcription in a bacterium of the invention, causing the production of an mRNA molecule. “Strong” promoters include, for example, natural promoters from Bacillus subtilis, Bacillus amyloliquefaciens or similar, such as P43, P15, Pveg, Pylb, PgroES, PsigX, PtrnQ, Ppst, PsodA, PrpsF, PlepA, PliaG, PrpsF, Ppst, PfusA, PsodA, Phag as well as artificial promoters active in Bacillus subtilis or inducible Bacillus subtilis promoters, such as PmtlA, Pspac, PxylA, PsacB, or similar, that enable efficient expression and overproduction of proteins. Further examples of “strong” promoters include natural promoters from Corynebacterium, such as P CP_2454, Ptuf and Psod, natural promoters from E. coli, such as T7, and the promoter P F1 derived from the corynephage BFK20.
As used herein, “operably linked” refers to a juxtaposition wherein the components described are in a relationship permitting them to function in their intended manner. A control sequence “operably linked” to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequence. A promoter sequence is “operably-linked” to a gene when it is in sufficient proximity to the transcription start site of a gene to regulate transcription of the gene.
“Percentage of sequence identity,” “% sequence identity” and “percent identity” are used herein to refer to comparisons between an amino acid sequence and a reference amino acid sequence. The “% sequence identity”, as used herein, is calculated from the two amino acid sequences as follows: The sequences are aligned using Version 9 of the Genetic Computing Group's GAP (global alignment program), using the default BLOSUM62 matrix with a gap open penalty of −12 (for the first null of a gap) and a gap extension penalty of −4 (for each additional null in the gap). After alignment, percentage identity is calculated by expressing the number of matches as a percentage of the number of amino acids in the reference amino acid sequence.
“Reference sequence” or “reference amino acid sequence” refers to a defined sequence to which another sequence is compared. In the context of the present invention, a reference amino acid sequence may, for example, be an amino acid sequence set forth in SEQ ID NO: 1.
As used herein, the term “about” means plus or minus 10% of the numerical value of the number with which it is being used.
Where a numerical limit or range is stated herein, the endpoints are included. Also, all values and sub-ranges within a numerical limit or range are specifically included as if explicitly written out.
As used herein, the indefinite articles “a” and “an” mean “at least one” or “one or more” unless the context clearly dictates otherwise.
As used herein, the terms “comprising”, “including”, “having” and grammatical variants thereof are to be taken as specifying the stated features, steps, or components but do not preclude the addition of one or more additional features, steps, components or groups thereof.
Having generally described this invention, a further understanding can be obtained by reference to certain specific examples, which are provided herein for purposes of illustration only, and are not intended to be limiting unless otherwise specified.
Different Bacillus strains can be used as starting strains for the engineering of isoprenoid cytokinin production (Table 2). Bacillus spp. strains can be isolated from nature or obtained from culture collections. Among others, starting strains for isoprenoid cytokinin production can be selected among Bacillus subtilis strains that have already been subjected to classical methods of mutagenesis and selection to overproduce metabolites related to the purine nucleotide biosynthetic pathway. For example, strains overproducing riboflavin, inosine, guanosine or adenosine may be selected. Strains subjected to random mutagenesis and toxic metabolic inhibitors from purine nucleotide and riboflavin pathway are preferred and are included in Table 3.
B. subtilis
B. subtilis subsp.
spizizenii
B. subtilis
B. subtilis
B. subtilis
B. subtilis
B. subtilis
B. subtilis
B. subtilis
B. subtilis
B. subtilis
B. subtilis
Bacillus sp.
Bacillus sp.
Bacillus subtilis VKPM B2116 is a hybrid strain of B. subtilis 168 (most common B. subtilis host strain with approx. 4 Mbp genome) with a 6.4 kbp island of DNA from the strain B. subtilis W23. Such architecture is common for most B. subtilis industrial strains and was obtained by transforming B. subtilis 168 (tryptophan auxotroph trpC) with W23 (prototrophic TrpC+) DNA. The 6.4 kbp W23 island in the genome of VKPM B21216 is the same as in B. subtilis SMY, which is one of the B. subtilis legacy strains with genome publicly available (Zeigler et al. 2008). B. subtilis VKPM B2116 is a direct descendant strain of B. subtilis SMY, obtained by classical mutagenesis and selection. Another name for this strain is B. subtilis VNII Genetika 304. The description of the construction of the strain is in Soviet Union patent SU908092, filed in 1980. The mutations were obtained by subsequent mutagenesis and selection of metabolic inhibitors. The strain VKPM B2116 is resistant to roseoflavin, a toxic analogue of vitamin B2, due to a mutation in the ribC gene, encoding a flavin kinase. This strain is also resistant to 8-azaguanine, a toxic purine analogue.
The amino acid sequences of adenylate dimethylallyltransferase (IPT) Tzs from Agrobacterium tumefaciens (syn. Agrobacterium fabrum (strain C58/ATCC 33970), gene tzs, E.C.2.5.1.27, UniProt: P58758) (SEQ ID NO: 1) and of cytokinin riboside 5′-monophosphate phosphoribohydrolase (LOG) from Corynebacterium glutamicum (strain ATCC 13032/DSM 20300/NCIMB 10025, gene Cg/2379, E.C. 3.2.2.n1, UniProt: Q8NN34) (SEQ ID NO: 34) were used for generating codon-optimized nucleotide sequences for gene expression in B. subtilis by using GENEius (Eurofins). The synthetic DNA fragment IPT-LOG (SEQ ID NO: 71) was designed with the addition of an RBS sequence, and with short adapter sequences at both ends of the synthetic fragment for further assembly of the synthetic isoprenoid cytokinin operon expression cassette.
The amino acid sequences of adenylate dimethylallyltransferase (IPT) Tmr from Agrobacterium tumefaciens (syn. Agrobacterium fabrum (strain C58/ATCC 33970), gene izt, E.C.2.5.1.27, Uniprot: P0A3L5) (SEQ ID NO: 2) and of cytokinin riboside 5′-monophosphate phosphoribohydrolase (LOG) from Corynebacterium glutamicum (gene Cg/2379, E.C. 3.2.2.n1, UniProt: Q8NN34) (SEQ ID NO: 34) were used for generating codon-optimized nucleotide sequences for gene expression in B. subtilis using GENEius (Eurofins). The synthetic DNA fragment IPT-LOG (SEQ ID NO: 72) was designed with the addition of an RBS sequence, and with short adapter sequences at both ends of the synthetic fragment for further assembly of the synthetic isoprenoid cytokinin operon expression cassette.
The synthetic fragments containing synthetic genes IPT-LOG (SEQ ID NO: 71 and SEQ ID NO: 72) for isoprenoid cytokinin biosynthesis were assembled in artificial isoprenoid cytokinin operons. The initial and end fragments containing gene integration homology, the promoter sequence, and the erythromycin selectable marker (SEQ ID NO: 73) were designed and synthesized for stable genome integration into the amyE locus in the genome of B. subtilis.
The first step of the artificial operon assembly was PCR amplification of separate DNA fragments performed using primer pair SEQ ID NO: 74 and SEQ ID 75 for the initial fragment SEQ ID NO: 75, and for synthetic fragments IPT-LOG SEQ ID NO: 71 and SEQ ID NO: 72 containing genes for isoprenoid-cytokinin biosynthesis. The primer set SEQ ID NO: 77 and SEQ ID NO: 78 was used for amplification of the end fragment SEQ ID NO: 79.
The fragments were amplified using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles using the PCR cycling conditions: 98° C. 30 s, 30 cycles of (98° C. 30 s, 68.5° C. 25 s, 72° C. 23/25 s), 72° C. 5 min, 10° C. hold.
PCR reaction products of each fragment were run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer. The fragments were assembled into an artificial operon by repetitive steps of restriction and ligation. A combination of SpeI (BcuI) and XbaI restriction sites was used to provide compatible restriction ends for a successful ligation. After each step of ligation, the combined fragments were used as a new template for the next PCR amplification. The restriction was done in 50 μl volume with the addition of 5 μl FD green buffer (Thermo Fisher Scientific), 2-3 μl of the selected enzyme (SpeI (BcuI), and XbaI, Thermo Fisher), and up to app. 1500 ng of PCR fragment. After restriction digest, the digested DNA fragments were cleaned with Wizard SV Gel and PCR Clean-up system according to the protocol provided by the manufacturer. The first two fragments were used in the ligation reaction with 2.5 U of T4 DNA ligase (Thermo Fisher) in the buffer provided by the manufacturer and the addition of 5% PEG 4000 and both fragments in a 1:1 molar ratio to the final volume of 15 μl. In the next step, 1 μl of inactivated ligation was used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 80 and SEQ ID NO: 81, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. The restriction digest, cleaning, and ligation steps were repeated for ligation of the end fragment. PCR was run on 0.8% agarose gel, the fragment was excised from the gel, digested and cleaned as before, and ligated as before. The final operon containing the amy E homology, promoter with RBS sequence, IPT and LOG genes, and erythromycin resistance cassette was amplified using the primer pair SEQ ID NO: 76 and SEQ ID NO: 77, cleaned and ligated as described before. The constructed synthetic trans-zeatin operons containing IPT-LOG SEQ ID NO: 82 and SEQ ID NO: 83 were used for the transformation of Bacillus subtilis VKPM B2116. The transformation with IPT-LOG operon SEQ ID NO: 83 resulted in the transformed strains TZAB1, TZAB2, TZAB3, and TZAB4. The transformation with IPT-LOG operon SEQ ID NO: 82 resulted in the transformed strains TZAB14 and TZAB15. The correct integration of the artificial operons at amyE integration site was confirmed by cPCR.
Bacillus subtilis strains obtained
The isoprenoid side chain for the biosynthesis of isoprenoid cytokinins is synthesized by the MEP pathway in B. subtilis. The protein sequence of 1-deoxyxylulose-5-phosphate synthase (DXS, E.C. 2.2.1.7) from B. subtilis was used for generation of the synthetic nucleotide sequence of dxs by using a codon-optimizing feature GENEius (Eurofins) for gene expression in B. subtilis. The synthetic DNA fragments of dxs SEQ ID NO: 84 and SEQ ID NO: 85 were designed in two overlapping parts, which were joined by overlap PCR. The entire joined synthetic gene dxs SEQ ID NO: 86 was assembled in an artificial operon. The initial and end fragments containing lacA homology, the promoter sequence, and the spectinomycin selectable marker SEQ ID NO: 87 were designed and synthesized for stable genome integration into the genome of B. subtilis. The fragments were assembled as described in Example 4.
The first step of the artificial operon assembly was PCR amplification of separate DNA fragments performed with a set of primers SEQ ID NO: 74 and SEQ ID NO: 75 for the initial fragment SEQ ID NO: 88, the end fragment SEQ ID 89, and the joined synthetic gene dxs SEQ ID NO: 86.
The fragments were amplified using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles. The PCR cycling conditions used were 98° C. 30 s, 30 cycles of (98° C. 30 s, 71° C. 25 s, 72° C. 23/25 s), 72° C. 5 min, 10° C. hold.
PCR reaction products of each fragment were run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer. The fragments were assembled into an artificial operon by repetitive steps of restriction and ligation. A combination of SpeI (BcuI) and XbaI restriction sites was used to provide compatible restriction ends for a successful ligation. After each step of ligation, the combined fragments were used as a new template for the next PCR amplification. The restriction was done in 50 μl volume with the addition of 5 μl FD green buffer (Thermo Fisher Scientific), 2-3 μl of the selected enzyme (SpeI (BcuI), and XbaI, Thermo Fisher), and up to app. 1500 ng of PCR fragment. After restriction digest, the digested DNA fragments were cleaned with Wizard SV Gel and PCR Clean-up system according to the protocol provided by the manufacturer. The first two fragments were used in the ligation reaction with 2.5 U of T4 DNA ligase (Thermo Fisher) in the buffer provided by the manufacturer and both fragments in a 1:1 molar ratio to the final volume of 15 μl. In the next step, 1 μl of inactivated ligation was used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 74 and SEQ ID NO: 75, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. The restriction digest, cleaning and ligation steps were repeated for ligation of the end fragment. In the last step, 1 μl of inactivated ligation was used as a template for complete synthetic dxs operon SEQ ID NO: 90 in a new 50 μL PCR using the primers SEQ ID NO: 91 and SEQ ID NO: 92, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. PCR was run on 0.8% agarose gel, the fragment was excised from the gel, cleaned as before, and ligated as before. The constructed synthetic dxs operon was used for the transformation of Bacillus subtilis TZAB15 with IPT-LOG operon at amyE (SEQ ID NO: 80). The transformation with the artificial DXS operon (SEQ ID NO: 90) resulted in the transformed strain TZAB43. The correct integration of the artificial operon at lacA integration site was confirmed by cPCR.
Bacillus subtilis strains obtained by transformation
All of the constructed strains and control strain have been cultivated following this procedure. The frozen stocks of the strains VKPM B2116, TZAB1, TZAB2, TZAB3, TZAB4, TZAB14, TZAB15 and TZAB43, preserved in 20% glycerol at −80° C. were spread onto solid seed medium containing erythromycin and lincomycin in the appropriate concentration and incubated for approximately 1 day at 37° C. For further testing, a vegetative-stage medium was inoculated with 1 to 5 plugs of the culture on solid seed plates per baffled 250-mL Erlenmeyer flask containing 25 mL of vegetative medium and appropriate amounts of antibiotics. The cultures were incubated at 37° C. for 18-20 h at 220 RPM. The culture in the vegetative medium was used as seed culture, and it was inoculated into the production medium. A 10-% inoculum was used (2.5 mL per 25 mL of production medium in 250-mL Erlenmeyer flask). The production medium was used as described or it was amended with adenine sulfate (end concentration 3 g/L). The cultures were incubated at 30° C. or 37° C. for up to 48 h at 220 RPM. The fermented cultures were sampled and analyzed as described in the Example 7. The titer of trans-zeatin, trans-zeatin riboside, and isopentenyl adenine was measured using the LC/MS as described in the Example 7.
The ingredients of the vegetative medium (Table 7) were mixed and the pH was set to 7.2-7.4. KH2PO4—K2HPO4 solution was then added in the final concentration for KH2PO4 1.5 g/L and K2HPO4 3.5 g/L. The medium was distributed into Erlenmeyer flasks (25 ml/250 ml-baffled Erlenmeyer flask) and autoclaved 30 min, 121° C. Sterile glucose was added after autoclaving in the final concentration of 7.5 g/L. Antibiotics were added before inoculation.
The ingredients of the production medium (Table 8) were mixed and the pH was set to 7.2-7.4. KH2PO4—K2HPO4 solution was then added in the final concentration for KH2PO4 1.5 g/L and K2HPO4 3.5 g/L. The medium was autoclaved at 121° C. for 30 min. Sterile urea solution (20 ml of stock solution, final concentration is 6 g/L), sterile glucose solution (500 ml of stock solution, final concentration is 100 g/L glucose) were added after autoclaving to obtain 1 L of production medium.
Appropriate antibiotics were added before inoculation. The medium was then distributed into sterile Erlenmeyer flasks (25 ml/250 ml-baffled Erlenmeyer flask).
All of the strains that had been cultivated following the procedure described in the Example 6 were analysed according to this procedure. The fermented production medium was sampled and immediately frozen at −20° C. For extraction of the metabolites, the fermented production medium was diluted 1:1 with the extraction buffer composed of a 1:1 mixture of methanol and 100 mM ammonium acetate pH 4. The samples were extracted for 1 h at room temperature with constant agitation, centrifuged at 4000-4500 RPM for 15 min, and filtered (0.22 um). The samples were immediately analyzed by LC/MS or stored at −20° C. until analysis.
The samples were analyzed on Thermo Accela 1250 HPLC instrument coupled to Thermo TSQ Quantum Access MAX, MS/MS capable mass spectrometer. The method is based on Thermo Accucore C30, 150×4.6 mm, 2.6 um particle size column, kept at 60° C., with mobile phase A—0.1% formic acid in water and mobile phase B—methanol, in gradient program, with starting conditions: 95% A, linear gradient increase in B % to 50% at 10 min, and 5 min stabilization to initial conditions, at 1 ml/min flow rate. Mass spectrometer was equipped with hESI ion source, operated in positive (+) mode, with spray voltage set at 4600 V, vaporizer temperature at 350° C., collision pressure at 1.0 torr and 10 V collision energy. Trans-zeatin was observed in MRM mode with transitions from parent 219.9 m/z to daughter ions: 185.2, 148.0 and 136.0.
Trans-zeatin riboside (tZR) was observed in MRM mode with transitions from parent 352.5 m/z to daughter ions: 220.1, 202.1, 148.0 and 136.1. Isopentenyl adenine (iP) was observed in MRM mode with transitions from parent 204.4 m/z to daughter ions: 148.4, 136.3, and 119.2. Isopentenyl adenine riboside (iPR) was observed in MRM mode with transitions from parent 336.5 m/z to daughter ions: 204.1, 148.0, and 113.1.
The cultivation was performed as described in the Example 6. The extraction and analysis were performed as described in Example 7. The yields of isoprenoid cytokinins detected are shown in
The cultivation was performed as described in the Example 6. The production medium was composed as described in the Example 6, or it was amended with adenine sulfate in the final concentration 3 g/L. The fermentation broth was extracted and analyzed as described in the Example 7. The results are shown in
The cultivation was performed as described in the Example 6. The extraction and analysis were performed as described in the Example 7. The yields are shown in
The cultivation was performed as described in the Example 6. The extraction and analysis were performed as described in the Example 7. The strains TZAB1, TZAB2, TZAB3 and TZAB4 with IPT-LOG SEQ ID NO: 83 and the strains TZAB14 and TZAB 15 with IPT-LOG SEQ ID NO: 82 produced isoprenoid cytokinins. The results are shown in
The synthetic DNA fragments containing genes IPT1-LOG 1 (SEQ ID NO: 71) for isoprenoid cytokinin biosynthesis was assembled in artificial isoprenoid cytokinin operons as described in Example 4.
The IPT1 expression cassette containing only IPT1 (SEQ ID NO: 1) was assembled by PCR amplification of first part of fragments IPT1-LOG 1 (SEQ ID NO: 82), using primers SEQ ID NO: 123 and SEQ ID NO: 183 and separately final part of IPT1-LOG 1 (SEQ ID NO: 82) using primers SEQ ID NO: 123 and SEQ ID NO: 128, with Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles using the PCR cycling conditions: 98° C. 30 s, 30 cycles of (98° C. 30 s, 68.5° C. 25 s, 72° C., 25 s), 72° C. 5 min, 10° C. hold. The PCR reaction products was run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer. The fragments were assembled into an artificial IPT1 expression cassette by restriction and ligation. A XbaI restriction sites was used to provide compatible restriction ends for a successful ligation. After ligation, the combined fragments were used as a new template for the next PCR amplification. The restriction was done in 50 μl volume with the addition of 5 μl FD green buffer (Thermo Fisher Scientific), 2-3 μl of the selected XbaI (Thermo Fisher) restriction enzyme, and up to app. 1500 ng of PCR fragment. After restriction digest, the digested DNA fragments were cleaned with Wizard SV Gel and PCR Clean-up system according to the protocol provided by the manufacturer. The first two fragments were used in the ligation reaction with 2.5 U of T4 DNA ligase (Thermo Fisher) in the buffer provided by the manufacturer and the addition of 5% PEG 4000 and both fragments in a 1:1 molar ratio to the final volume of 15 μl. In the next step, 1 μl of inactivated ligation was used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 123 and SEQ ID NO: 128, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. Generating the final expression cassette with IPT fragment (SEQ ID NO: 184).
The final expression cassettes constructed for trans-zeatin operon containing IPT1-LOG 1 (SEQ ID NO: 82) or IPT fragment (SEQ ID NO: 184) were then used for the transformation of Bacillus subtilis VKPM B2116, Bacillus subtilis 168 and Bacillus subtilis RB50 strains and resulted in the development of new transformed strains presented in Table 9. The correct integration of the artificial operons at amyE integration site was confirmed by cPCR.
The synthetic fragments containing dxs gene (SEQ ID NO: 86) were assembled in expression cassette as described in Example 5. The constructed DXS expression cassette (SEQ ID NO: 90) was used for the transformation of Bacillus subtilis VKPM B2116, Bacillus subtilis 168 and Bacillus subtilis RB50 derived strains and resulted in the development of new transformed strains presented in Table 9.
The correct integration of the artificial operon at lacA integration site was confirmed by cPCR.
Bacillus subtilis strains obtained by transformation
All of the constructed strains and control strain have been cultivated following the procedure described int the Example 6. The frozen stocks of the strains preserved in 20% glycerol at −80° C. were spread onto solid seed medium containing erythromycin and lincomycin in the appropriate concentration and incubated for approximately 1 day at 37° C. For further testing, a vegetative-stage medium was inoculated with 1 to 5 plugs of the culture on solid seed plates per baffled 250-mL Erlenmeyer flask containing 25 mL of vegetative medium and appropriate amounts of antibiotics.
The cultures were incubated at 37° C. for 8-20 h at 220 RPM. The culture in the vegetative medium was used as seed culture, and it was inoculated into the production medium. A 10-% inoculum was used (2.5 mL per 25 mL of production medium in 250-mL Erlenmeyer flask). The production medium was amended with tryptophan (final concentration 50 mg/L) for evaluation of Bacillus subtilis 168 derived strains. The cultures were incubated at 30° C., 34° C. or 37° C. for up to 48 h at 220 RPM. The fermented cultures were sampled and analyzed as described in the Example 7. The titters of trans-zeatin (tZ), trans-zeatin riboside (tZR), isopentenyl adenine riboside (iPR) and isopentenyl adenine (iP) were measured using the LC/MS as described in the Example 7.
The extraction and analysis were performed as described in Example 7. The yields of isoprenoid cytokinins detected are shown in Table 17,
To evaluate Escherichia coli as a possible isoprenoid cytokinins producing strain, two sets of expression plasmids were constructed.
First set was the plasmid for heterologous cytokinin gene/operon expression of synthetic gene IPT (SEQ ID NO: 179) and IPT-LOG operon (SEQ ID NO: 82). IPT gene or IPT-LOG operon were assembled into pBBR1 plasmid vector. pBBR1 vector has heterologous expression system based on promoter inducible with isopropyl-β-D-1-thiogalactopyranoside (IPTG): LaclQ/Placuvs-T7 and contains chloramphenicol resistance cassette as selection marker.
The second plasmid was cloned for heterologous expression of DXS (SEQ ID NO: 90) to increase the isoprenoid precursor supply. DXS gene was assembled into p15A plasmid vector. p15A vector has expression system which is based on the XyIS/Pm: where XyIS in the positive regulator of m-toluate inducible Pm promoter and contains kanamycin resistance cassette as selection marker.
The first step of the expression plasmids assembly was PCR amplification of insert fragments: IPT-LOG, DXS and vector backbones: pBBR1 and p15A. IPT DNA fragment was amplified with primer set SEQ ID NO: 163 and SEQ ID NO: 182, while IPT-LOG DNA fragment was amplified with primer set SEQ ID NO: 163 and SEQ ID NO: 164 for IPT and SEQ ID NO: 165 and SEQ ID NO: 166 for LOG amplification. The primer set SEQ ID NO: 167 and SEQ ID NO: 168 was used for amplification the pBBR1 vector backbone. DXS fragment was amplified with primer set: SEQ ID NO: 169 and SEQ ID NO: 170; and the primer set SEQ ID NO: 171 and SEQ ID NO: 172 was used for amplification the p15A vector backbone. The fragments were amplified using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles. PCR reaction products of each fragment were run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer. The mass of each fragment was measured using the NanoDrop instrument, absorbance at 260 nm. The fragments were assembled into final expression construct by 10 μl HIFI assembly reaction (NEBuilder HiFi DNA Assembly Master Mix). The reaction was set up on ice: to 5 μl of HiFi DNA Assembly Master Mix fragments were added in DNA molar ratio vector: insert=1:2 (total amount of fragments was up to 0.2 pmols). The reaction was filled to the final volume of 10 μl with nuclease free water. Samples were incubated in a thermocycler at 50° C. for 60 minutes. In the next step, 1 μl of the chilled assembled product was used for transformation of competent E. coli BL21(DE3) cells. Plasmid DNA was isolated from the obtained strains and correct assembly was confirmed by sequencing.
The transformation of the p15A plasmid for expression of DXS gene resulted in the transformed strain TZ3077. The transformation of the pBBR1 plasmid for expression of IPT gene resulted in the transformed strain TZAB3079. The transformation of pBBR1 plasmid for expression of IPT-LOG operon resulted in the transformed strains TZ3082. Additionally, IPT gene and IPT-LOG operon expression plasmid was transformed into TZ3077 (E. coli BL21(DE3) strain with DXS expression plasmid). Obtained strain with two plasmids: one for DXS gene and the other for IPT gene expression was saved as TZAB3087. Obtained strain with two plasmids: one for DXS gene and the other for IPT-LOG operon expression was saved as TZ3091. All transformants were confirmed by cPCR and are listed in the Table 10 below.
Escherichia coli BL21(DE3) strains obtained by transformation
Escherichia coli
Constructed strains (TZ3077 to TZ3091) and control strain Escherichia coli BL21(DE3) are preserved in 20% glycerol at −80° C. Strains were always cultivated in the presence of appropriate concentration of selection antibiotic (kanamycin/chloramphenicol). First, they were spread onto solid seed medium—2YT agar plates (Table 11) and incubated for approximately 17 h at 37° C. For further testing, a vegetative-stage medium—2YT was inoculated with one single colony of the culture on solid seed plates per 250-mL Erlenmeyer flask containing 50 mL of vegetative medium and appropriate amounts of antibiotics. The cultures were incubated at 37° C. for 17 h at 220 RPM. The culture in the vegetative medium (Table 12) was used as seed culture: 2.5% inoculum was used for inoculation of 50 mL of production medium in 250-mL Erlenmeyer flask. As production media (Table 13), 2YT supplemented with glucose was used (sterile glucose was added after autoclaving in the final concentration of 25 g/L). The cultures were incubated at 37° C. for 10 h at 220 RPM. After 2 h of fermentation, strains with expression plasmids were induced. Strains with IPT/IPT-LOG expression plasmid were induced with IPTG in the final concentration 150 μM. Strains with DXS expression plasmid were induced with m-toluate in the final concentration 1 mM. Strains with both expression systems (DXS and IPT/IPT-LOG) were induced with both inductors (150 μM IPTG and 1 mM m-toluate). After 10 hours of fermentation, cultures were sampled for analysis of trans-zeatin and related isoprenoid cytokinins. Extraction protocol used was as described in the Example 7. The titers of trans-zeatin, trans-zeatin riboside, isopentenyl adenine and isopentenyl adenine riboside were measured using the LC/MS as described in the Example 7.
The yields of isoprenoid cytokinins detected are shown in
The strains expressing IPT produce isoprenoid cytokinins in the amounts up to 2.8 mg/L (see
The synthetic fragments containing genes IPT1-LOG 1 (SEQ ID NO: 178) for isoprenoid cytokinin biosynthesis were amplified from previously constructed synthetic trans-zeatin operon SEQ ID NO: 82 using primer set SEQ ID NO: 173 and SEQ ID NO: 174 for fragment SEQ ID NO: 178 and for expressing gene IPT1 (SEQ ID NO: 180) primer set SEQ IN NO: 173 and SEQ ID NO: 177 were used for construction of fragment SEQ ID NO: 180. Fragments were amplified using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles using the PCR cycling conditions: 98° C. 30 s, 30 cycles of (98° C. 30 s, 63.3° C. 25 s, 72° C. 45 s), 72° C. 5 min, 10° C. hold.
The synthetic dxs gene (SEQ ID NO: 181) was amplified from previously constructed synthetic dxs operon SEQ ID NO: 90 using primer set SEQ ID NO: 175 and SEQ ID NO: 176 using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles using the PCR cycling conditions: 98° C. 30 s, 30 cycles of (98° C. 30 s, 67.6° C. 25 s, 72° C. 60 s), 72° C. 5 min, 10° C. hold. PCR reaction products of each fragment were run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer.
The IPT1 gene and IPT1-LOG 1 isoprenoid cytokinin operons were further cloned into the pVWEx6 plasmid (Henke et al. 2021) and dxs gene expression cassette into the pECXT99APsyn plasmid by HiFi assembly reaction using NEBuilder HiFi DNA Assembly Master Mix according to the protocol provided by the manufacturer. Both plasmids were previously linearized by digestion with BamH/FD restriction enzyme. HiFi reaction mixture was further used for electroporation of electrocompetent DH10β E. coli cells. Kanamycin selection marker enabled selection of pVWEx6+IPT1-LOG 1 and pVWEx6+IPT1 transformants while tetracycline was used for selection of pECXT99APsyn+dxs transformants. Transformants were confirmed by colony PCR. For plasmid isolation colonies were inoculated into 2TY medium with appropriate antibiotics and incubated at 37° C. overnight. Plasmids were isolated from overnight cultures by GeneJET Plasmid Miniprep Kit (Thermo Fisher) plasmid extraction kit according to the protocol provided by the manufacturer. Isolated plasmids were further analyzed by digestion with KpnI and XbaI FD restriction enzymes.
For transformation of generated plasmids into the Corynebacterium stationis DSM 20305 electrocompetent cells were prepared according to Yili et al. 2015. Transformation with isolated plasmids was performed by electroporator BioRad using 2 mm cuvettes. Approximately 250 ng of plasmid was introduced to 50 μL aliquot of previously prepared electrocompetent cells, transferred into 2 mm cuvette and exposed to electrical pulse. Cells were immediately transferred into 2 mL Eppendorf tube containing 1 mL of regeneration medium. After 3 hours of incubation at shaker set to 30° C. and 200 rpm, transformants were plated onto recovery agar plates LBHIS with selection markers in appropriate concentrations. Obtained transformants (presented in Table 14) were verified by colony PCR.
For transformants with both, pVWEx6+ipt1-log 1 and pECXT99APsyn+dxs as well as pVWEx6+ipt1 and pECXT99APsyn+dxs constructs, confirmed transformants were used for preparation of new electrocompetent cells for second generation of strains. Electrocompetent cells were transformed with additional plasmid and selected on plates with both kanamycin and tetracycline. Colony PCR verification of transformants was made for both plasmids.
Corynebacterium stationis strains obtained
Corynebacterium
stationis
All of the constructed strains and control strain have been cultivated following this procedure. The frozen stocks of the strains Corynebacterium stationis DSM 20305, TZ3136, TZ3138, TZ3139, TZ3142 and TZ3146, preserved in 20% glycerol and stored at −80° C., were spread onto solid seed medium containing tetracycline or/and kanamycin in the appropriate concentration and incubated for approximately 1 day at 30° C. For further testing, a vegetative-stage medium was inoculated with 5 plugs of the culture on solid seed plates per baffled 250-mL Erlenmeyer flask containing 25 mL of vegetative medium and appropriate amounts of antibiotics. The cultures were incubated at 30° C. for 18 h at 200 RPM. The culture in the vegetative medium was used as seed culture, and it was inoculated into the production medium. A 10-% inoculum was used (2.5 mL per 25 mL of production medium in baffled 250-mL Erlenmeyer flask). The cultures were incubated at 30° C. for up to 48 h at 200 RPM. The fermented cultures were sampled and analyzed as described in the Example 7. Titres of trans-zeatin, trans-zeatin riboside, isopentenyl adenine and isopentenyl adenine riboside were measured using the LC/MS as described in the Example 7.
Isoprenoid cytokinin production of Corynebacterium stationis transformants was tested in fermentation process. The cultivation was performed as described in the Example 15. The extraction and analysis were performed according to procedure described in the Example 7. The results are shown in
Bacillus subtilis 168 and Corynebacterium stationis DSM 20305 with genetic modification IPT1,
E. coli BL21 (DE3)
Bacillus subtilis 168
Corynebacterium
stationis
The synthetic fragment containing purA gene (SEQ ID NO: 98) used for enhancement of purine nucleotide biosynthesis pathway was assembled in the artificial gene expression cassette. The initial (SEQ ID NO: 96) and end fragment (SEQ ID NO: 97) containing gene integration homology, the promoter sequence, and the zeocin selectable marker were designed and synthesized for stable genome integration into the yybN locus in the genome of B. subtilis.
The first step of the artificial operon assembly was PCR amplification of separate DNA fragments performed using primer pair SEQ ID NO: 99 and SEQ ID NO: 100 for the initial fragment SEQ ID NO: 96, and primer set SEQ ID NO: 101 and SEQ ID NO: 102 for end fragment SEQ ID NO: 97. The primer set SEQ ID NO: 103 and SEQ ID NO: 104 was used for amplification of fragment containing genes for purA overexpression SEQ ID NO: 98. The fragments were amplified using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles using the PCR cycling conditions: 98° C. 30 s, 30 cycles of (98° C. 30 s, 65° C. 25 s, 72° C. 23/25 s), 72° C. 5 min, 10° C. hold. PCR reaction products of each fragment were run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer. The fragments were assembled into an artificial operon by repetitive steps of restriction and ligation. A combination of SpeI (BcuI) and XbaI restriction sites was used to provide compatible restriction ends for a successful ligation. After each step of ligation, the combined fragments were used as a new template for the next PCR amplification. The restriction was done in 50 μl volume with the addition of 5 μl FD green buffer (Thermo Fisher Scientific), 2-3 μl of the selected enzyme (SpeI (BcuI), and XbaI, Thermo Fisher), and up to app. 1500 ng of PCR fragment. After restriction digest, the digested DNA fragments were cleaned with Wizard SV Gel and PCR Clean-up system according to the protocol provided by the manufacturer. The first two fragments were used in the ligation reaction with 2.5 U of T4 DNA ligase (Thermo Fisher) in the buffer provided by the manufacturer and the addition of 5% PEG 4000 and both fragments in a 1:1 molar ratio to the final volume of 15 μl. In the next step, 1 μl of inactivated ligation was used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 102 and SEQ ID NO: 105, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. The restriction digest, cleaning, and ligation steps were repeated for ligation of the end fragment. PCR was run on 0.8% agarose gel, the fragment was excised from the gel, digested and cleaned as before, and ligated as before. The final operon containing the yybN homology, promoter with RBS sequence, purA genes, and zeocin resistance cassette was amplified using the primer pair SEQ ID NO: 106 and SEQ ID NO: 107, cleaned and ligated as described before. The constructed synthetic operon was used for the transformation of Bacillus subtilis VKPM B2116. The transformation with purA operon SEQ ID NO: 108 resulted in the transformed strain BS19. The correct integration of the artificial operons at yybN integration site was confirmed by cPCR.
The synthetic fragments containing synthetic LOG gene (LOG 8—SEQ ID NO: 116) and various IPT genes (SEQ ID NOs: 129, 154-156) for isoprenoid cytokinin biosynthesis were assembled in artificial isoprenoid cytokinin operons. The initial and end fragments containing gene integration homology, the promoter sequence, and the erythromycin selectable marker (SEQ ID NO: 73) were designed and synthesized for stable genome integration into the amyE locus in the genome of B. subtilis.
The first step of the artificial operon assembly was PCR amplification of separate DNA fragments performed using primer pair SEQ ID NO: 74 and SEQ ID: 75 for the initial fragment SEQ ID NO: 76, and for synthetic fragment LOG 8 (SEQ ID NO: 116) and fragments IPT1, IPT6, IPT7 and IPT9 (SEQ ID NOs: 179, 151, 152 and 153, respectively) containing genes for isoprenoid-cytokinin biosynthesis. The primer set SEQ ID NO: 77 and SEQ ID NO: 78 was used for amplification of the end fragment SEQ ID NO: 79. The fragments were amplified using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles using the PCR cycling conditions: 98° C. 30 s, 30 cycles of (98° C. 30 s, 65° C. 25 s, 72° C. 30 s), 72° C. 5 min, 10° C. hold. PCR reaction products of each fragment were run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer. The fragments were assembled into an artificial operon by repetitive steps of restriction and ligation. A combination of SpeI (BcuI), VspI, NdeI and SnaBI restriction sites was used to provide compatible restriction ends for a successful ligation. After each step of ligation, the combined fragments were used as a new template for the next PCR amplification. The restriction was done in 50 μl volume with the addition of 5 μl FD green buffer (Thermo Fisher Scientific), 2-3 μl of the selected enzyme (SpeI (BcuI), VspI, NdeI and SnaBI, Thermo Fisher), and up to app. 1500 ng of PCR fragment. After restriction digest, the digested DNA fragments were cleaned with Wizard SV Gel and PCR Clean-up system according to the protocol provided by the manufacturer. Initial fragment and various IPT fragments (SEQ ID NO: 179, 151, 152 and 153) were used in ligation reactions with 2.5 U of T4 DNA ligase (Thermo Fisher) in the buffer provided by the manufacturer and the addition of 5% PEG 4000 and both fragments in a 1:1 molar ratio to the final volume of 15 μl. In the next step, 1 μl of each inactivated ligation was used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 120 and SEQ ID NO: 121, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. DNA fragments were extracted from agarose gel as described before. The restriction digest (VspI and NdeI), cleaning, and ligation steps were repeated for ligation of AmyE+IPT1 operon, AmyE+IPT6 operon, AmyE+IPT7 operon and AmyE+IPT9 operon (SEQ ID NO: 129, 154, 155 and 156, respectively) and fragment containing LOG 8 gene (SEQ ID NO: 116). 1 μl of inactivated ligation was further used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 122 and SEQ ID NO: 123, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. DNA fragments were extracted from agarose gel as described before. XbaI enzymatic digestion was performed to ligate operons AmyE 0+IPT1/6/7/9+LOG 8 (SEQ ID NO: 136, 157, 158 and 159, respectively) and operon EryR+AmyE END (SEQ ID NO: 140), previously ligated by SnaBI restriction site and PCR amplified using primer set SEQ ID NO: 126 and SEQ ID NO: 127. The final operons containing the amy E homology, promoter with RBS sequence, various IPT genes and gene LOG 8, and erythromycin resistance cassette were amplified using the primer pair SEQ ID NO: 128 and SEQ ID NO: 123, cleaned and ligated as described before. The constructed synthetic trans-zeatin operons containing IPT1-LOG 8, IPT6-LOG 8, IPT7-LOG 8 and IPT9-LOG 8 (SEQ ID NOs: 147, 160, 161 and 162, respectively) were used for the transformation of Bacillus subtilis BS19 (described in Example 17). All transformants were confirmed by cPCR and are listed in the Table 18 below. All of the constructed strains have been cultivated as described in the Example 6. The extraction and analysis were performed as described in the Example 7 and the yields of cytokinins are shown in
Bacillus subtilis strains obtained
The synthetic fragments containing synthetic gene IPT1 (SEQ ID NO: 71) and various LOG genes (SEQ ID NO: 110-119) for isoprenoid cytokinin biosynthesis were assembled in artificial isoprenoid cytokinin operons. The initial and end fragments containing gene integration homology, the promoter sequence, and the erythromycin selectable marker were designed and synthesized for stable genome integration into the amyE locus in the genome of B. subtilis.
The first step of the artificial operon assembly was PCR amplification of separate DNA fragments performed using primer pair SEQ ID NO: 74 and SEQ ID NO: 75 for the initial fragment SEQ ID NO: 76, and for synthetic fragment IPT1 SEQ ID NO: 71 and fragments LOG 2-LOG 11 (SEQ ID NO: 110-119) containing genes for isoprenoid-cytokinin biosynthesis. The primer set SEQ ID NO: 126 and SEQ ID NO: 127 was used for amplification of the end fragment containing erythromycin selectable marker (SEQ ID NO: 140). The fragments were amplified using Eppendorf cycler and Phusion polymerase (Thermo Fisher) in the buffer provided by the manufacturer and with the addition of 200 μM dNTPs, 0.5 μM of each primer, and approximately 10 ng of template in a final volume of 50 μl for 30 cycles using the PCR cycling conditions: 98° C. 30 s, 30 cycles of (98° C. 30 s, 65° C. 25 s, 72° C. 30 s), 72° C. 5 min, 10° C. hold. PCR reaction products of each fragment were run on 0.8% agarose gel, excised from the gel, and extracted from the gel by GeneJET Gel Extraction Kit (Thermo Fisher) according to the protocol provided by the manufacturer. The fragments were assembled into an artificial operon by repetitive steps of restriction and ligation. A combination of SpeI (BcuI), VspI, NdeI XbaI and SnaBI restriction sites was used to provide compatible restriction ends for a successful ligation. After each step of ligation, the combined fragments were used as a new template for the next PCR amplification. The restriction was done in 50 μl volume with the addition of 5 μl FD green buffer (Thermo Fisher Scientific), 2-3 μl of the selected enzyme (SpeI (BcuI), VspI, NdeI, XbaI and SnaBI, Thermo Fisher), and up to app. 1500 ng of PCR fragment. After restriction digest, the digested DNA fragments were cleaned with Wizard SV Gel and PCR Clean-up system according to the protocol provided by the manufacturer.
The first two fragments were used in the ligation reaction with 2.5 U of T4 DNA ligase (Thermo Fisher) in the buffer provided by the manufacturer and the addition of 5% PEG 4000 and both fragments in a 1:1 molar ratio to the final volume of 15 μl. In the next step, 1 μl of inactivated ligation was used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 120 and SEQ ID NO: 121, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. DNA fragments were extracted from agarose gel as described before. The restriction digest (VspI and NdeI), cleaning, and ligation steps were repeated for ligation of fragments containing various LOG genes (SEQ ID NO: 110-119). 1 μl of inactivated ligation was further used as a template in a new 50 μL PCR with a primer set SEQ ID NO: 122 and SEQ ID NO: 123, and the same PCR mix and PCR cycling conditions as previously but with longer elongation time. DNA fragments were extracted from agarose gel as described before. XbaI enzymatic digestion was performed to ligate operons AmyE 0+IPT1+LOG 2-11 (SEQ ID NO: 130-139) and operon EryR+AmyE END (SEQ ID NO: 140), previously ligated by SnaBI restriction site and PCR amplified using primer set SEQ ID NO: 126 and SEQ ID NO: 127.
The final operons containing the amyE homology, promoter with RBS sequence, IPT1 and various LOG genes, and erythromycin resistance cassette were amplified using the primer pair SEQ ID NO: 128 and SEQ ID NO: 123, cleaned and ligated as described before. The constructed synthetic trans-zeatin operons containing IPT1-LOG 2-11 (SEQ ID NO: 141-150) were used for the transformation of Bacillus subtilis BS19 (described in Example 17). All transformants were confirmed by cPCR and are listed in the Table 19 below. All of the constructed strains have been cultivated as described in the Example 6. The extraction and analysis were performed as described in the Example 7 and the yields of cytokinins are shown in
Bacillus subtilis strains obtained
Number | Date | Country | Kind |
---|---|---|---|
20075015.6 | Nov 2020 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/083188 | 11/26/2021 | WO |