The present invention relates to bacterial cells genetically modified to improve their tolerance to certain commodity chemicals, such as dicarboxylic acids (herein referred to as “diacids”) and other polycarboxylic acids, and to methods of preparing and using such bacterial cells for production of diacids and other compounds.
Aliphatic diacids are commonly used as precursors for nylon polymers (polyamides), typically prepared by condensing diamines with diacids. Diacids are also used as monomers for various other polymers and copolymers including polyurethanes. Different chain lengths and the presence of unsaturated bonds or branched chains within the constituent diacids imparts different physical properties to the polymer.
There has been significant recent interest in producing diacids biologically, i.e., in microbial cells. For example, as reported on their respective websites (accessed in October 2016), Myriant Corporation and BioAmber Inc. have both begun biological production of succinic acid, as a replacement molecule for petrochemical-derived adipic acid, Verdezyne Inc. is developing a process to produce adipic acid in yeast, and one of the major worldwide manufacturers of nylons, INVISTA™, is actively seeking the development of biologically produced precursors through collaborations with external parties. The production of diacids in metabolically engineered microbial cells have been reviewed and described in several publications such as, e.g., Polen et al., 2013; Adkins et al., 2013; Park et al., 2013; Yu et al., 2014; Cheong et al., 2016; Deng and Mao, 2015; WO 2011/003034 A2 (Verdezyne); Curran et al., 2013; Sengupta et al., 2015; and Zhang et al., 2015.
For production of bulk chemicals from renewable plant-based carbon feedstocks, high product titers are essential in order to minimize capital equipment and downstream separations costs for product purification. At the high titers required for economical fermentation processes, however, most chemicals exhibit significant toxicity that reduces yields and productivities by negatively affecting microbial growth (Van Dien, 2013; Zingaro et al., 2013). Escherichia coli being a suitable host for industrial applications, there has been some interest in developing E. coli strains with improved tolerance to chemicals of interest for production, such as, e.g., n-butanol, ethanol and isobutanol, or to stress conditions present during fermentation (see, e.g., Haft et al, 2014; Sandberg et al., 2014; Lennen and Herrgard, 2014; Tenaillon et al., 2012; Minty et al., 2011; Dragosits et al., 2013a,b; Winkler et al., 2014; Wu et al., 2014; LaCroix et al., 2015; Jensen et al., 2015 and 2016; Doukyu et al., 2012; Shenhar et al., 2012; and Rath and Jawali, 2006).
In addition, Byrne et al., 2012, describes computational modelling of microorganisms such as E. coli, proposing combinations of medium compositions and gene-deletion strains for six industrially important byproducts, e.g., succinate. WO 01/05959 (Ajinomoto K K) relates to production of a target substance such as glutamic acid in, e.g., E. coli strains. Finally, WO 2016/162442 (Metabolic Explorer) relates to a recombinant microorganism capable of producing 2,4-dihydroxybutyrate, which is characterized by an increased cellular export, and preferably by a decreased cellular import, of 2,4-dihydroxybutyrate.
Despite these and other advances in the art, there is still a need for bacterial cells with improved tolerance to chemicals of interest for bio-based production, such as aliphatic diacids and other compounds. It is an object of the invention to provide such bacterial cells.
It has been found by the present inventors that certain genetic modifications unexpectedly improve the tolerance of bacterial cells, such as those of, e.g., the Escherichia genera, to certain chemical compounds, particularly aliphatic diacids (herein also referred to as “aliphatic dicarboxylic acids”).
Accordingly, the invention provides bacterial cells with improved tolerance to at least one aliphatic diacid, as well as bacterial cells which are capable of producing an aliphatic diacid and have improved tolerance to the aliphatic diacid. Particularly contemplated are glutaric acid, adipic acid, succinic acid, muconic acid, fumaric acid, itaconic acid, malic acid, malonic acid, maleic acid, glucaric acid, pimelic acid, suberic acid, sebacic acid, 2,5-furandicarboxylic acid, terephthalic acid, azelaic acid, mesaconic acid, citraconic acid, tartaric acid, tartronic acid, diaminopimelic acid, and glutaconic acid.
The invention also relates to compositions comprising such bacterial cells and one or more aliphatic diacids, methods of preparing or screening for such bacterial cells, and methods of producing aliphatic diacids using such bacterial cells.
These and other aspects and embodiments are described in more detail below.
In the present work, glutaric acid and adipic acid were selected for performing adaptive laboratory evolutions. Based on the findings reported herein, various aspects of the invention provide for genetically modified bacterial host cells with a higher tolerance to one or more diacids or other compounds. When transformed with a recombinant biosynthetic pathway for producing the diacid from a carbon source, the genetically modified bacterial host cells of the invention result in improved production of the diacid from carbon feedstock, since they maintain robust metabolic activity in the presence of higher concentrations of the diacid than the parent cells. For example, it was found that a reduced expression of kgtP improved tolerance to glutaric acid, implicating KgtP, an α-ketoglutarate importer, as being a direct importer for glutarate.
So, in a first aspect, a bacterial cell is provided, comprising a biosynthetic pathway for producing an aliphatic dicarboxylic acid and at least one genetic modification which reduces expression of an endogenous gene selected from the group consisting of kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR, or a combination of any thereof. In one preferred embodiment, the genetic modification reduces the expression of kgtP. In another preferred embodiment, the at least one genetic modification reduces the expression of ybjL, proV, proW, proX, sspA or a combination of any thereof. For example, the bacterial cell may comprise genetic modifications which reduce the expression of kgtP, proV and ybjL; kgtP and proV; kgtP and ybjL; or kgtP and sspA. Non-limiting examples of genetic modifications include a knock-down or knock-out of the endogenous gene. In a particular embodiment, the genetic modification is a knock-out. The genetic modification may, for example, provide for an increased growth rate, a reduced lag time, or both, of the cell in the presence of at least one of glutaric acid and adipic acid as compared to a control, e.g., the bacterial cell without the genetic modification.
In a second aspect, a bacterial cell is provided, genetically modified from a parent bacterial cell so as to comprise one or more of
The bacterial cell, may, for example, comprise (a) at least one mutant protein selected from the group consisting of SpoT-V422A, SpoT-A451D, SpoT-A451V, SpoT-W457C, SpoT-N454H, SpoT-D580Y, SpoT-R236L, SpoT-R236S, SpoT-M247K, SpoT-NIR(601-603)S, SpoT-T442I, SpoT-S434L, PolB-R477G, RpoC-H419P, RpoC-P64L, RpoB-K203T, Rnt-Q179P, Rnt-A27T, Rnt-F194L, Rnt-A180T, SapC-G79W; and/or (b) a mutation in rph or the pyrE/rph intergenic region which increases the expression of PyrE.
The bacterial cell of any aspect or embodiment may further comprise a recombinant biosynthetic pathway for producing at least one of glutaric acid, adipic acid, succinic acid, muconic acid, fumaric acid, itaconic acid, malic acid, malonic acid, maleic acid, glucaric acid, pimelic acid, suberic acid, sebacic acid, 2,5-furandicarboxylic acid, terephthalic acid, or azelaic acid, mesaconic acid, citraconic acid, tartaric acid, tartronic acid, diaminopimelic acid and glutaconic acid.
Also provided is a process for preparing a recombinant bacterial cell for producing an aliphatic dicarboxylic acid, the process comprising genetically modifying an E. coli cell to (a) introduce a recombinant biosynthetic pathway for producing the aliphatic dicarboxylic acid, and (b) knock-down or knock-out at least one endogenous gene selected from the group consisting of kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR; such as a combination of two or more endogenous genes selected from kgtP, proV, ybjL and sspA; such as a combination selected from kgtP, proV and ybjL; kgtP and proV; kgtP and ybjL and kgtP and sspA, and/or (c) express a mutant of at least one of SpoT, PolB, RpoC, RpoB, Rnt and SapC and/or increase the expression of PyrE; wherein steps (a), (b) and (c) can be performed in any order.
Also provided is a process for improving the tolerance of a bacterial cell to an aliphatic dicarboxylic acid comprising genetically modifying the bacterial cell to (a) knock-down or knock-out at least one endogenous gene selected from the group consisting of kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR; such as a combination of two or more endogenous genes selected from kgtP, proV, ybjL and sspA; such as a combination selected from kgtP, proV and ybjL; kgtP and proV; kgtP and ybjL and kgtP and sspA; and/or (b) express a mutant of at least one of SpoT, PolB, RpoC, RpoB, Rnt and SapC and/or increase the expression of PyrE, wherein steps (a) and (b) are performed in any order.
The bacterial cell may, for example, be derived from the Escherichia, Lactobacillus, Lactococcus, Bacillus, Pseudomonas, Corynebacterium, Deinococcus or Ralstonia species, such as the Escherichia coli species.
Also provided is a method for producing an aliphatic dicarboxylic acid, comprising culturing such genetically modified bacterial cells in the presence of a carbon source, and, optionally, isolating the aliphatic dicarboxylic acid.
Also provided is a composition comprising glutaric acid or adipic acid at a concentration of at least 5 g/L and a plurality of bacterial cells of the Escherichia genus genetically modified to (a) knock-down or knock-out at least one endogenous gene selected from the group consisting of kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR; such as a combination of two or more endogenous genes selected from kgtP, proV, ybjL and sspA; such as a combination selected from kgtP, proV and ybjL; kgtP and proV; kgtP and ybjL and kgtP and sspA; and/or (b) express a mutant of at least one of SpoT, PolB, RpoC, RpoB, Rnt and SapC and/or increase expression of PyrE.
In embodiments where the bacterial cell comprises a biosynthetic pathway for producing an aliphatic dicarboxylic acid, the pathway may, for example, comprise
Unless otherwise indicated or contradicted by context, a “diacid” as used herein is an aliphatic dicarboxylic acid of the general formula COOH—R—COOH (I), where R is an alkyl chain. An “aliphatic diacid” or “aliphatic dicarboxylic acid” herein refers to an organic compound comprising an aliphatic carbon chain to which two or more carboxyl (—COOH) groups are attached, and includes linear aliphatic diacids, as well as derivatives thereof. Aliphatic diacids suitable for production in bacteria typically comprise from 3 to 12 carbon atoms, preferably 3 to 10 carbon atoms, more preferably 3 to 8 carbon atoms, even more preferably, 4 to 7 or 5 to 8 carbon atoms, and, most preferably, 5 to 7 carbon atoms, and optionally comprises one or more heteroatoms or other substituents. Examples of heteroatoms include oxygen (e.g., in the form of an oxo group, a.k.a. keto group), nitrogen, sulphur and halogens. Examples of other substituents include hydroxyl groups, amino groups, carboxyl groups, and alkyl groups. Preferred aliphatic diacids include, but are not limited to, glutaric acid, adipic acid, succinic acid, muconic acid, fumaric acid, itaconic acid, malic acid, malonic acid, maleic acid, glucaric acid, pimelic acid, suberic acid, sebacic acid, 2,5-furandicarboxylic acid, terephthalic acid, or azelaic acid, mesaconic acid, citraconic acid, tartaric acid, tartronic acid, diaminopimelic acid and glutaconic acid. In some embodiments, the aliphatic diacid does not comprise any heteroatom substituents. In some embodiments, the aliphatic diacid does not comprise any substituents. Glutaric, adipic, pimelic and sebacic acid are particularly preferred.
As used herein, a “recombinant biosynthetic pathway” for a compound of interest refers to an enzymatic pathway resulting in the production of a compound of interest in a host cell, wherein at least one of the enzymes is expressed from a transgene, i.e., a gene added to the host cell genome by transformation. In some cases, the recombinant biosynthetic pathway also comprises a deletion of one or more native genes in the host cell. The compound of interest is typically a diacid, and may be the actual end product or a precursor or intermediate in the production of another end product.
The terms “tolerant” or “improved tolerance”, when used to describe a genetically modified bacterial cell of the invention or a strain derived therefrom, refers to a genetically modified bacterial cell or strain that shows a reduced lag time, an improved growth rate, or both, in the presence of a diacid than the parent bacterial cell or strain from which it is derived, typically at concentrations of 1 g/L, such as at least 2 g/L or higher, such as at least 5 g/L or higher, such as at least 10 g/L or higher, such as at least 20 g/L or higher, such as at least 40 g/L or higher, such as at least 75 g/L or higher, such as at least 100 g/L or higher. An improved growth rate is at least 5%, such as at least 10%, such as at least 20%, such as at least 50%, such as at least 75% higher than that of a control, typically the parent cell or strain. A reduced lag time is at least 10%, such as at least 20%, such as at least 50%, such as at least 75%, such as at least 90% shorter than that of a control, typically the parent cell or strain.
The term “gene” refers to a nucleic acid sequence that encodes a cellular function, such as a protein, optionally including regulatory sequences preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence. An “endogenous gene” refers to a native gene in its natural location in the genome of an organism. A “transgene” is a gene, native or heterologous, that has been introduced into the genome by a transformation procedure. Genes names are herein set forth in italicised text with a lower-case first letter (e.g., metJ) whereas protein names are set forth in normal text with a capital first letter (e.g., MetJ).
As used herein the term “coding sequence” refers to a DNA sequence that encodes a specific amino acid sequence.
The term “native”, when used to characterize a gene or a protein herein with respect to a host cell, refers to a gene or protein having the nucleic acid or amino acid sequence as found in the host cell.
The term “heterologous”, when used to characterize a gene or protein with respect to a host cell, refers to a gene or protein which has a nucleic acid or amino acid sequence not normally found in the host cell.
As used herein the term “transformation” refers to the transfer of a nucleic acid fragment, such as a gene, into a host cell. Host cells containing a gene introduced by transformation or a “transgene” are referred to as “transgenic” or “recombinant” or “transformed” cells.
As used herein, a “genetic modification” or “genetically modified” refers to the introduction a genetically inherited change in the host cell genome. Examples of changes include mutations in genes and regulatory sequences, coding and non-coding DNA sequences. “Mutations” include deletions, substitutions and insertions of one or more nucleotides or nucleic acid sequences in the genome. Other genetic modifications include the introduction of heterologous genes or coding DNA sequences on a plasmid and/or into a chromosome by recombinant techniques. In one embodiment, the genetic modification is in a chromosome.
The term “expression”, as used herein, refers to the process in which a gene is transcribed into mRNA, and may optionally include the subsequent translation of the mRNA into an amino acid sequence, i.e., a protein or polypeptide.
As used herein, “reduced expression” or “downregulation” of an endogenous gene in a host cell means that the levels of the mRNA, protein and/or protein activity encoded by the gene are significantly reduced in the host cell, typically by at least 25%, such as at least 50%, such as at least 75%, such as at least 90%, such as at least 95%, as compared to a control. Typically, when the reduced expression is obtained by a genetic modification in the host cell, the control is the unmodified host cell. Sometimes, e.g., in the case of gene knock-out, the reduction of native rnRNA and functional protein encoded by the gene is higher, such as 99% or greater.
“Increased expression”, “upregulation”, “overexpressing” or the like, when used in the context of a protein or activity described herein, means increasing the protein level or activity within a bacterial cell. An up-regulation of an activity can occur through, e.g., increased activity of a protein, increased potency of a protein or increased expression of a protein. The protein with increased activity, potency or expression can be encoded by genes disclosed herein.
Genetic modifications resulting in a reduced expression of a target gene/protein can include, e.g., knock-down of the gene (e.g., a mutation in a promoter or other expression control sequence that results in decreased gene expression), a knock-out or disruption of the gene (e.g., a mutation or deletion of the gene that results in 99 percent or greater decrease in gene expression), a mutation or deletion in the coding sequence which results in the expression of non-functional protein, and/or the introduction of a nucleic acid sequence that reduces the expression of the target gene, e.g. a repressor that inhibits expression of the target or inhibitory nucleic acids (e.g. CRISPR/dCas9, antisense RNA, etc.) that reduces the expression of the target gene.
Standard recombinant DNA and molecular cloning techniques used here are well known in the art and are described by Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual, 4th ed.; Cold Spring Harbor Laboratory: Cold Spring Harbor, N.Y., 2012; and by Silhavy, T. J., Berman, M. L. and Enquist, L. W. Experiments with Gene Fusions; Cold Spring Harbor Laboratory: Cold Spring Harbor, N.Y., 1984; and by Ausubel, F. M. et al., In Current Protocols in Molecular Biology, published by John Wiley & Sons (1995); and by Datsenko and Wanner, 2000; and by Baba et al., 2006; and by Thomason et al., 2007.
A “conservative” amino acid substitution in a protein is one that does not negatively influence protein activity. Typically, a conservative substitution can be made within groups of amino acids sharing physicochemical properties, such as, e.g., basic amino acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino acids (glutamine and asparagines), hydrophobic amino acids (leucine, isoleucine, valine and methionine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small amino acids (glycine, alanine, serine, and threonine). Most commonly, substitutions can be made between Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, Ala/Glu, Asp/Gly. Other preferred substitutions are set out in Table 1 below.
As described herein, the invention provides bacterial cells with improved tolerance to one or more diacids, as well as related processes and materials for producing and using such bacterial cells.
The genetic modifications according to the invention include those resulting in reduced expression of genes, e.g., by gene knock-down or knock-out, herein referred to as “Group 1 modifications”; as well as silent mutations in coding or non-coding regions and non-silent (i.e., coding) mutations in coding regions, herein referred to as “Group 2 modifications”; and combinations thereof.
In a preferred embodiment, the one or more genetic modifications provide for an increased growth rate, a reduced lag time, or both, of the bacterial cell in the presence of at least one of glutaric and adipic acid as compared to the bacterial cell without the genetic modification, e.g., the parent or wild-type bacterial cell. The glutaric and/or adipic acid may be present in the growth medium at, e.g., a concentration of at least about 1 g/L, such as at least about 2 g/L, such as at least about 5 g/L, such as at least about 10 g/L, such as at least about 20 g/L.
a) Group 1 Modifications
In one aspect, the bacterial cell has at least one genetic modification which reduces expression an endogenous gene selected from the group consisting of kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR.
For example, in one embodiment, the expression of one or more of kgtP, ybjL, proV, proW, proX, proQ and sspA, such as kgtP, ybjL, proV or sspA, is reduced. In one specific embodiment, the expression of kgtP is reduced, optionally wherein the expression of lysP is not reduced. In one specific embodiment, the expression of ybjL is reduced. In one specific embodiment, the expression of sspA is reduced. In one specific embodiment, the expression of proV, proW, proX or proQ, such as e.g. proV, is reduced, optionally wherein the expression of marR is not reduced. In another specific embodiment, the expression of cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, or yeaR is reduced.
In another aspect, there is provided a bacterial cell which comprises genetic modifications reducing the expression of two or more endogenous genes, wherein at least one gene is selected from the group consisting of kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR. In one embodiment, the bacterial cell comprises a genetic modification reducing the expression of kgtP but no genetic modification which reduces the expression of lysP. In one embodiment, the bacterial cell comprises a genetic modification reducing the expression of proV but no genetic modification which reduces the expression of marR. In one embodiment, the bacterial cell comprises genetic modifications which reduce the expression of at least two genes selected from the group consisting of kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR.
In one embodiment, the genetic modifications reduce the expression of kgtP and one or more other endogenous genes, optionally selected from ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG and yeaR, such as from ybjL, proV, proW, proX, and sspA, optionally wherein the other endogenous genes do not comprise lysP. In separate and specific embodiments, the bacterial cell comprises genetic modifications which reduce the expression of kgtP and ybjL, kgtP and proV, proV and ybjL, or kgtP and sspA.
In one embodiment, the genetic modifications reduce the expression of kgtP and two or more endogenous genes selected from ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG and yeaR, wherein at least one endogenous gene is selected from ybjL, proV, proW, proX, and sspA. In one specific embodiment, the bacterial cell comprises genetic modifications which reduce the expression of kgtP, proV and ybjL.
In other separate and specific embodiments, the bacterial cell comprises:
In one specific embodiment, either one or both of the first and second genetic modifications is a knock-out of the gene, optionally a deletion. In an alternative embodiment at least one of the first and second genetic modifications is a knock-down of the gene.
In one aspect, there is provided a bacterial cell according to any one of the preceding aspects and embodiments, wherein the genetic modification is a knock-down of the one or more endogenous genes, resulting in at least 25%, such as at least 50%, such as at least 75%, such as at least 90%, such as at least 95%, reduction in the level of mRNA encoded by the gene.
In one aspect, there is provided a bacterial cell according to any one of the preceding aspects and embodiments, wherein the genetic modification is a knock-down of the one or more endogenous genes, resulting in at least 25%, such as at least 50%, such as at least 75%, such as at least 90%, such as at least 95%, reduction in the level of protein encoded by the gene.
In one aspect, there is provided a bacterial cell according to any one of the preceding aspects and embodiments, wherein the genetic modification is a knock-out of the one or more endogenous genes.
Knock-down or knock-out of a gene can be accomplished by any method known in the art for bacterial cells, and include, e.g., lambda Red mediated recombination, P1 phage transduction, and single-stranded oligonucleotide recombineering/MAGE technologies (see, e.g., Datsenko and Wanner, 2000; Thomason et al., 2007a,b; Wang et al., 2009). Typically, a knock-down of a gene can be accomplished by, for example, a mutation in the promoter region resulting in decreased transcription, a deletion or mutation in the coding region of the gene resulting in a reduced or fully or substantially eliminated activity of the protein, or by the presence of antisense sequences that interfere with transcription or translation of the gene, resulting in reduced expression of the protein. Preferably, the knocking-down of a gene results in at least 20% reduction in the expression level of the gene product in the bacterial cell, such as at least 30%, such as at least 40%, such as at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95% or higher.
A knock-out of a gene includes elimination of a gene's expression, such as by introducing a mutation in the coding sequence and/or promoter so that at least a portion (up to and including all) of the coding sequence and/or promoter is disrupted, shifted or deleted, resulting in loss of expression of the protein, or expression only of a non-functional mutant or non-functional fragment of the endogenous protein. As used herein, the symbol “A”, i.e., the greek uppercase letter for “delta”, denotes a deletion of an endogenous gene. Preferably, a knock-out of a gene results in 1% or less of the native gene product being detectable, such as no detectable gene product.
b) Group 2 Modifications
In certain embodiments, a mutant protein is expressed in the bacterial cell, e.g., from a mutated version of an endogenous gene, or from a transgene encoding the mutant protein.
In one aspect, the bacterial cell comprises a Group 2 modification, e.g., a mutation in one or more of SpoT, PolB, RpoC, RpoB, Rnt and SapC; an increased expression of one or more of SpoT, PolB, RpoC, RpoB, Rnt and SapC; and/or a mutation in rph or the pyrE/rph intergenic region which increases the expression of pyrE, wherein the one or more mutations improve tolerance to at least one aliphatic diacid such as, e.g. glutaric or adipic acid. Preferably, the bacterial cell further comprises a Group 1 modification according to any aspect or embodiment herein.
In one embodiment, the Group 2 modification comprises a mutant SpoT, comprising one or more mutations. The mutations may be located, for example, in the threonyl-tRNA synthetase GTPase and SpoT (TGS) domain corresponding to amino acid residues 1388 to T447, or the linker region between the TGS domain and the aspartokinase, chorismate mutase and TyrA (ACT) domain corresponding to A448 to T621 in E. coli SpoT. Without being limited to theory, since the mutations identified are near the TGS domain which is involved in nucleotide binding, at least some of the mutations may decrease the ppGpp synthetase activity of SpoT, e.g., by decreasing its binding affinity to substrates such as ATP or (p)ppGpp, or increasing its binding affinity to products such as GTP, AMP, or GDP. This may, in turn, reduce sensitivity of the cells to accumulating ppGpp and delay the onset of the stringent response. The stringent response may be activated under general stress conditions such as in high concentrations of diacids and might prevent growth under such conditions. In one embodiment, the SpoT mutant comprises a mutation in one or more amino acid residues selected from those corresponding to R236, M247, V422, S434, T442, A451, N454, W457, D580, N601, 1602 and R603 in E. coli SpoT. In one embodiment, the mutant SpoT comprises at least one amino acid substitution selected from V422A, A451D, A451V, W457C, N454H, D580Y, R236L, R236S, M247K, NIR(601-603)S, T442I, and S434L or a conservative substitution of any thereof. In a specific embodiment, the mutant SpoT comprises a mutation in A451 or R236, e.g., an amino acid substitution selected from A451D, A451V, R236L and R236S, or a conservative substitution thereof, e.g., selected from A451E, A451N, A451G, A451A, A451L, A451I, R236I, R236V, R236T, R236A, R236N and R236G.
In one embodiment, the Group 2 modification comprises a mutant PolB comprising one or more mutations. The mutation may be located, e.g., in the residue corresponding to R477 in E. coli PolB, and may be an amino acid substitution such as R477G or a conservative substitution thereof, e.g., R477A, R477D or R477S.
In one embodiment, the Group 2 modification comprises a mutant RpoC comprising one or more mutations. The mutation may be located in, e.g., the residue corresponding to H419 and/or P64 in E. coli RpoC, and may be an amino acid substitution such as H419P, P64L, or a conservative substitution thereof, e.g., H419A, P64I, P64V, P64M, P64A or P64F. Without being limited to theory, since some of these residues (e.g., H419) are close to residues involved in ppGpp-binding, at least some of them may decrease ppGpp binding to RpoC which in turn may reduce sensitivity of the cells to accumulating ppGpp and delay the onset of the stringent response.
In one embodiment, the Group 2 modification comprises a mutant RpoB comprising one or more mutations. The mutation may be located in, e.g., the residue corresponding to K203 in E. coli RpoB, and may be an amino acid substitution such as K203T or a conservative substitution thereof, e.g., K203S or K203A. Without being limited to theory, since residue K203 is near the entrance for dsDNA, it may interact with phosphate on dsDNA, possibly reducing premature transcription termination.
In one embodiment, the Group 2 modification comprises a mutant Rnt comprising one or more mutations. The mutations may be located, for example, in or close to catalytic residues of conserved exonuclease motifs corresponding to positions 23, 25, 181, and 186 in E. coli Rnt. In one embodiment, the Rnt mutant comprises a mutation in one or more amino acid residues selected from those corresponding to Q179, A27, F194 and A180. In one embodiment, the mutant Rnt comprises at least one amino acid substitution selected from Q179P, A27T, F194L and A180T or a conservative substitution of any thereof, e.g., Q179A, A27S, A27G, F1951, F195V, F195T, F195A, A180S and A180G.
In one embodiment, the Group 2 modification comprises a mutant SapC comprising one or more mutations. The mutation may be located in, e.g., the residue corresponding to G79 in E. coli SapC, and may be an amino acid substitution such as G79W or a conservative substitution thereof, e.g., G79Y or G79F.
In one embodiment, the bacterial cell comprises one or more mutations which increase(s) the expression level or activity of PyrE, optionally in combination with a Group 1 modification. E. coli K-12 MG1655 and W3110, plus their common ancestor strain W1485, are known to exhibit pyrimidine starvation in minimal media due to the presence a frameshift mutation occurring in rph relative to other E. coli strains (Jensen et al., 1993). This mutation disrupts the transcriptional/translational coupling required for efficient translation of pyrE, encoding orotate phosphoribosyltransferase in the pyrimidine biosynthesis pathway. Compensatory mutations that correct this deficiency are well-known in the art. One of these mutations is an 82 bp deletion near the 3′ terminus of rph, due to presence of two homologous GCAGAAGGC sequences flanking this 82 bp region (Conrad et al., 2009). In addition to the 82 bp deletion, a 1 bp deletion at coordinate 3815809 in the pyrE/rph intergenic region has previously been encountered in strains evolved for growth on a minimal glucose medium (LaCroix et al., 2015), and a wide array of other frameshift mutations, substitutions, and coding mutations near the 3′ terminus of rph were encountered in a short-term selection/evolution of combinatorial mutant libraries in minimal medium at an elevated temperature of 42° C. (Sandberg et al., 2014). Without being limited to theory, all of these mutations can serve the same function of increasing expression of PyrE, with the selective pressure for these mutations being even stronger in minimal media with particular imposed stresses (certain chemicals or heat) than in minimal media alone. In one embodiment, the bacterial cell comprises mutations in rph or the pyrE/rph intergenic region, such as, e.g., the 82 bp deletion near the 3′ terminus of rph, the 1 bp deletion in the intergenic region between pyrE and rph, or both. In one embodiment, increased expression of PyrE is achieved by transforming the bacterial cell with a transgene expressing the endogenous protein. Increased expression may be obtained by causing an up-regulation through increased expression of a protein, the copy number of a gene or genes encoding the protein may be increased. Alternatively, a strong and/or inducible promoter can be used to direct the expression of the gene, the gene being expressed either as a transient expression vehicle or homologously or heterologously incorporated into the bacterial genome. In another embodiment, the promoter, regulatory region and/or the ribosome binding site upstream of the gene can be altered to achieve the over-expression. The expression can also be enhanced by increasing the relative half-life of the messenger or other forms of RNA. Any one or a combination of these approaches can be used to effect upregulation of a desired target protein as needed.
In a specific embodiment, the bacterial cell comprises at least one Group 1 modification and at least one Group 2 modification. Non-limiting examples of Group 1 modifications for combination with any one or more of the overexpressed or mutant SpoT, PolB, RpoC, RpoB, Rnt, SapC and pyrE/rph include knock-down or knock-out of
In separate and specific embodiments, the bacterial cell comprises:
In other separate and specific embodiments, the bacterial cell comprises:
In some aspects, the bacterial cell comprises a recombinant pathway for producing an aliphatic diacid of interest, optionally providing for a production level of at least about 5 g/L of the aliphatic diacid over a predetermined period of time, e.g., about 200h, about 100h, about 72h, about 48h or about 24h. A recombinant pathway can, for example, be added to introduce the capability to produce the diacid in a bacterial cell which does not have a native pathway to do so, typically by transforming the cell with one or more heterologous enzymes catalyzing the desired reaction(s). Alternatively, in cases where the bacterial cell has a native pathway for production of the diacid of interest, a recombinant pathway can nonetheless be introduced in order to increase the production yield, e.g., by overexpressing one or more native enzymes or transforming the cell with heterologous enzymes. In separate and specific embodiments, the recombinant pathway provides for a production level of at least 5 g/L, such as at least 10 g/L or higher, such as at least 20 g/L or higher, such as at least 40 g/L or higher, such as at least 45 g/L or higher, such as at least 75 g/L or higher, such as at least 100 g/L or higher.
So, in one aspect, there is provided a bacterial cell with improved tolerance to at least one aliphatic diacid according to any aspect or embodiment described herein, wherein the bacterial cell further comprises a recombinant biosynthetic pathway for producing an aliphatic diacid of interest, such as, e.g., glutaric acid, adipic acid, succinic acid, muconic acid, fumaric acid, itaconic acid, malic acid, malonic acid, maleic acid, glucaric acid, pimelic acid, suberic acid, sebacic acid, 2,5-furandicarboxylic acid, terephthalic acid, or azelaic acid, mesaconic acid, citraconic acid, tartaric acid, tartronic acid, diaminopimelic acid or glutaconic acid. In principle, any such recombinant biosynthetic pathway which is known in the art can be introduced into the cell by standard recombinant technologies. Some specific, preferred pathways are, however, exemplified below and in Example 1—see the section entitled “Biological production of diacids” and references cited therein.
It is to be understood that, when a specific enzyme of these biosynthetic pathways is mentioned by name such as, e.g., “lysine monooxygenase”, the enzyme may be any characterized and sequenced enzyme, from any species, that have been reported in the literature so long as it provides the desired activity. In some embodiments, the enzyme is an overexpressed gene which is native to the host cell used. In some embodiments, the enzyme is a functionally active fragment or variant of an enzyme which is heterologous or native to the host cell. Also, in some embodiments, the recombinant biosynthetic pathway comprises a knock-down or a knock-out of one or more genes, typically for the purpose of avoiding competing reactions reducing the yield of the desired aliphatic diacid.
So, in one embodiment, the biosynthetic pathway is for producing glutaric acid from glucose, and comprises genes, optionally overexpressed and/or heterologous, encoding:
The bacterial cell may further be modified by one or more of
In one embodiment, the biosynthetic pathway is for producing adipic acid and comprises
The bacterial cell may further be modified to knock-down or knock-out one or more native genes corresponding to ptsG, poxB, pta, sdhA, and iclR.
In one embodiment, the biosynthetic pathway is for producing adipic acid and comprises
The bacterial cell may further be modified to knock-down or knock-out one or more native genes corresponding to pta, poxB, ldhA, and adhE.
In one embodiment, the biosynthetic pathway is for producing pimelic acid and comprises feeding glutaric acid and
The bacterial cell may further be modified to knock-down or knock-out one or more native genes corresponding to pta, poxB, IdhA, adhE, and frdA.
In one embodiment, the biosynthetic pathway is for producing sebacic acid and comprises
The bacterial cell may further be modified to knock-down or knock-out one or more native genes corresponding to pta, poxB, ldhA, adhE, frdA, and native acyl-CoA thioesterases including yciA, ybgC, ydiI, tesA, fadM, and tesB.
In one embodiment, the biosynthetic pathway is for producing adipic acid and comprises
The bacterial cell may further be modified to knock-down or knock-out one or more native genes corresponding to pta, poxB, ldhA, adhE, ptsG, sdhA, and iclR.
In one embodiment, the biosynthetic pathway is for producing adipic acid via whole cell bioconversion from supplied medium to long chain free fatty acids (C12-C16) and comprises combinations of
In one embodiment, the biosynthetic pathway is for producing muconic acid and comprises
The bacterial cell may further be modified to increase the flux toward precursors for DAHP (erythrose 4-phosphate and phosphoenolpyruvate), such as by knock-down or knock-out of genes corresponding to E. coli ptsH, ptsI, crr, and pykF; by overexpressing genes corresponding to ubiC, aroF, aroE, and aroL (or feedback-resistant mutants thereof), or combinations thereof.
Some bacteria contain a native pathway for production of a diacid, avoiding the necessity for a recombinant pathway. These include, for example, Actinobacillus succinogenes (succinic acid), Mannheimia succiniproducens (succinic acid), Thermobifida fusca (malic acid), and Escherichia coli (fumaric and malic acids), among numerous others.
3) Processes
In one aspect, there is provided a process for preparing a recombinant bacterial cell, e.g., an E. coli cell. Also provided is a process for improving the tolerance of a bacterial cell, e.g., an E. coli cell, to a diacid. Also provided is a method of identifying a bacterial cell which is tolerant to at least one diacid. Also provided is a process for preparing a recombinant bacterial cell, e.g., an E. coli cell, for producing a diacid.
These processes may comprise one or more steps of genetically modifying a bacterial cell to knock-down or knock-out one or more endogenous genes of any aspect or embodiment of the Group 1 modifications and/or introducing one or more mutations in the endogenous protein(s) or gene(s) of any Group 2 aspect or embodiment. This can be achieved by, e.g., transforming the bacterial cell with genetic constructs, e.g., vectors, antisense nucleic acids or siRNA, which result, e.g., in the knock-out or knock-down of a gene, introduce a mutation into an endogenous gene, or which encode the mutated protein from a transgene.
The genetic constructs, particularly vectors, can also comprise suitable regulatory sequences, typically nucleotide sequences located upstream (5′ non-coding sequences), within, or downstream (3′ non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include promoters (e.g., constitutive promoters or inducible promoters), translation leader sequences, introns, polyadenylation recognition sequences, RNA processing sites, effector binding sites and stem-loop structures.
Alternatively, bacterial cells can be exposed to selection pressure (as described in the Examples) or to conditions which introduce random mutations in endogenous genes, and bacterial cells which comprise one or more Group 1 and/or Group 2 modifications according to any preceding aspects and embodiments can then be identified. Typically, this involves preparing a population of the genetically modified bacterial cell, having different Group 1 and/or Group 2 modifications, and then selecting from this population any bacterial cell which has an improved tolerance to a diacid at a predetermined concentration.
In one specific embodiment, the Group 1 modification is a knock-down or knock-out of one or more endogenous genes selected from kgtP, ybjL, proV, proW, proX, proQ, cspE, rfaE, yfbP, yfjM, pstS, pstA, pstB, pstC, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR or, e.g., a knock-down or knock-out of kgtP in combination one or more other genes, e.g., ybjL, proV and/or sspA. In one specific embodiment, the Group 2 modification is a mutation in at least one endogenous protein or gene selected from SpoT, PolB, RpoC, RpoB, Rnt or SapC, such as e.g., at least one mutant protein selected from the group consisting of SpoT-V422A, SpoT-A451D, SpoT-A451V, SpoT-W457C, SpoT-N454H, SpoT-D580Y, SpoT-R236L, SpoT-R236S, SpoT-M247K, SpoT-NIR(601-603)S, SpoT-T442I, SpoT-S434L, PolB-R477G, RpoC-H419P, RpoC-P64L, RpoB-K203T, Rnt-Q179P, Rnt-A27T, Rnt-F194L, Rnt-A180T, SapC-G79W; and/or a mutation which increases the expression of PyrE, such as, e.g. a mutation in rph or the pyrE/rph intergenic region.
The processes may further comprise
In one embodiment, the diacid is glutaric acid, and the predetermined concentration is at least 2 g/L or higher, such as at least 5 g/L or higher, such as at least 10 g/L or higher, such as at least 20 g/L or higher, such as at least 40 g/L or higher, such as at least 75 g/L or higher, such as at least 100 g/L or higher. In one embodiment, the diacid is adipic acid, and the predetermined concentration is at least 2 g/L or higher, such as at least 5 g/L or higher, such as at least 10 g/L or higher, such as at least 20 g/L or higher, such as at least 40 g/L or higher, such as at least 75 g/L or higher, such as at least 100 g/L or higher. In one embodiment, the diacid is pimelic acid, and the predetermined concentration is at least 2 g/L or higher, such as at least 5 g/L or higher, such as at least 10 g/L or higher, such as at least 20 g/L or higher, such as at least 30 g/L or higher, such as at least 45 g/L or higher, such as at least 75 g/L or higher. In one embodiment, the diacid is sebacic acid, and the predetermined concentration is at least 2 g/L or higher, such as at least 5 g/L or higher, such as at least 10 g/L or higher, such as at least 20 g/L or higher, such as at least 30 g/L or higher, such as at least 45 g/L or higher, such as at least 75 g/L or higher.
In a particular embodiment, the predetermined concentration is at most 20 g/L, such as at most 30 g/L, such as at most 50 g/L, such as at most 75 g/L, such as at most 100 g/L, such as at most 150 g/L.
Assays for assessing the tolerance of a modified bacterial cell to a diacid typically evaluate the growth rate, lag time, or both, of the bacterial cell at predetermined concentrations for the diacid in question, typically as compared to a control. Preferably, the control is the native or unmodified parent cell or strain, and an improved tolerance is identified as an improved growth rate, a reduced lag-time or both. For example, an improved growth rate can be at least 5%, such as at least 10%, such as at least 20%, such as at least 50%, such as at least 75% higher than that of the control, while a reduced lag time can be at least 10%, such as at least 20%, such as at least 50%, such as at least 75%, such as at least 90% shorter than that of the control. Specific assays are described, in detail, in the Examples.
Also provided is a method of producing a diacd, comprising culturing the bacterial cell obtained by any one of these methods, or the bacterial cell of any preceding aspect or embodiment, under conditions where the diacid is produced. Typically, these conditions include the presence of a suitable carbon source or mixes of different suitable carbon sources. Non-limiting examples of suitable carbon sources include, e.g., sucrose, D-glucose, D-xylose, L-arabinose, glycerol; raw carbon feedstocks such as crude glycerol and cane syrup; as well as hydrolysates produced from cellulosic or lignocellulosic materials. For further details see, e.g., Adkins et al., 2013; Park et al., 2013; Yu et al., 2014; Cheong et al., 2016; Deng and Mao, 2015; WO 2011/003034 A2 (Verdezyne); Curran et al., 2013; Sengupta et al., 2015; and Zhang et al., 2015.
4) Compositions
A bacterial cell which has an increased tolerance to a diacid can be useful as a production host for the diacid. Bacterial cells according to the invention may have an increased growth rate, a decreased lag time, or both, in the diacid. For example, the bacterial cell may have Group 1 and/or Group 2 modifications providing for an increased growth rate, a reduced lag time, or both, of the cell in at least one diacid, e.g., in glutaric acid, adipic acid, succinic acid, muconic acid, fumaric acid, itaconic acid, malic acid, malonic acid, maleic acid, glucaric acid, pimelic acid, suberic acid, sebacic acid, 2,5-furandicarboxylic acid, terephthalic acid, or azelaic acid, mesaconic acid, citraconic acid, tartaric acid, tartronic acid, diaminopimelic acid and/or glutaconic acid.
In one aspect, there is provided a composition comprising a plurality of bacterial cells according to any aspect or embodiment described herein, e.g., an in vitro culture of such bacterial cells, optionally in a suitable culture medium and/or a chemically-defined medium comprising a carbon source. In one embodiment, the composition is substantially homogenous with respect to the bacterial cells.
In one aspect, there is provided a composition comprising a plurality of bacterial cells according to any preceding aspect or embodiment and a diacid. In one embodiment, the diacid is present at a concentration at which the genetic modification(s) and/or mutant(s) comprised in the bacterial cells results in an improved tolerance as compared to the parent bacterial cells, e.g., wild-type or native bacterial cells. The concentrations at which bacterial cells according to the invention have improved tolerance are shown in Example 1, e.g., in “Cross-compound tolerance testing”. Typically, the concentration of the diacid is at least 1 g/L, such as at least 2 g/L, or higher, such as at least 5 g/L or higher, such as at least 10 g/L or higher, such as at least 20 g/L or higher, such as at least 40 g/L or higher, such as at least 45 g/L or higher, such as at least 75 g/L or higher, such as at least 100 g/L or higher.
In one aspect, there is provided a composition comprising
In separate and specific embodiments, the diacid is glutaric acid, adipic acid, pimelic acid and sebacic acid, respectively. In other specific embodiments, the diacid present in the composition is at least 45 g/L fumaric acid, at least 45 g/L itaconic acid, at least 55 g/L malic acid, at least 50 g/L succinic acid, at least 45 g/L pimelic acid, and at least 38 g/L sebacic acid, respectively.
As described in Example 1; “Cross-compound tolerance testing,” genetic modifications according to the invention also conferred tolerance to other chemicals, such as to other carboxylic acids (glutarate and adipate; hexanoate, octanoate, isobutyrate, glutarate and p-coumarate), to diamines (e.g., HMDA, putrescine) and diols (2,3-butanediol, 1,2-propanediol). Accordingly, in one embodiment, there is provided a composition comprising a plurality of bacterial cells according to any preceding aspect or embodiment, and a chemical selected from the following, at at least the indicated concentration:
Preferably, the bacterial cells are of the Escherichia, Lactobacillus, Lactococcus, Bacillus, Pseudomonas, Corynebacterium, Deinococcus or Ralstonia genera, such as, e.g., E. coli cells, and comprise
Assays for assessing the tolerance of a modified bacterial cell to a selected diacid typically evaluate the growth rate, lag time, or both, of the bacterial cell at one or more predetermined concentrations of the compound, typically as compared to a control (e.g., no compound). The predetermined concentrations(s) could be, for example, 1, 2, 5, 10, 20, 40, 45, 75, or 100 g/L. Preferably, the control is the native or unmodified parent cell or strain, and an improved tolerance is identified as an improved growth rate, a reduced lag-time or both. For example, an improved growth rate can be at least 5%, such as at least 10%, such as at least 20%, such as at least 50%, such as at least 75%, such as at least 100%, such as at least 200%, such as at least 300%, such as at least 500%, such as at least 1000%, such as at least 10000% higher than that of the control; while a reduced lag time can be at least 10%, such as at least 20%, such as at least 50%, such as at least 75%, such as at least 90% shorter than that of the control. Indeed, in some cases the native or unmodified parent cell cannot grow at all in a concentration of the diacid that the modified bacterial cell can grow in. Specific assays are described, in detail, in the Examples.
5) Bacterial Cells
Also provided are strains, clones and other progeny of the bacterial cells of these and other aspects and embodiments, as well as cell cultures of such bacterial cells or strains. Typically, as used herein, a “strain” typically refers to a group of cells which are descendants of a initial single colony of parent cells whereas a “clone” is a group of cells which are the descendants of an initial genetically modified single parent cell.
Non-limiting examples of bacterial cells suitable for modification according to any one of the aspects and embodiments described herein include bacteria of the Escherichia, Lactobaccillus, Lactococcus, Corynebacterium, Bacillus, Ralstonia, Clostridia, Deinococcus or Pseudomonas genera, such as from the Escherichia, Lactobacillus, Lactococcus, Bacillus, Pseudomonas, Corynebacterium, Deinococcus or Ralstonia genera. In one embodiment, the bacterial cell is an E. coli cell, such as a cell of the commercially available and/or fully characterized strains K-12 MG1655, BW25113, BL21, BL21(DE3), K-12 W3110, W, JM109, or Crooks (ATCC 8739). In a specific embodiment, the bacterial cell is derived from an E. coli K12 strain. In another embodiment, the bacterial cell is a Lactobacillus cell, such as a cell of the commercially available and/or fully characterized strains Lactobacillus plantarum JDM1, Lactobacillus plantarum WCFS1, and Lactobacillus plantarum NCIMB 8826. In another embodiment, the bacterial cell is a Lactococcus cell, such as a cell of the commercially available and/or fully characterized strains Lactococcus lactis lactis CV56, Lactococcus lactis lactis NIZO B40, and Lactococcus lactis cremoris NZ9000. In another embodiment, the bacterial cell is a Bacillus cell, such as a cell of the commercially available and/or fully characterized strains Bacillus subtilis 168 and Bacillus subtilis PY79. In one embodiment, the bacterial cell is a Pseudomonas cell, such as a cell of the commercially available and/or fully characterized strain Pseudomonas putida KT2440. In another embodiment, the bacterial cell is a Ralstonia cell, such as a cell of the commercially available and/or fully characterized strains Ralstonia eutropha H16 and Ralstonia eutropha JMP134. In another embodiment, the bacterial cell is a Corynebacterium cell, such as a cell of the commercially available and/or fully characterized strains 534 (ATCC 13032), K051, MB001, R, SCgG1, and SCgG2. In another embodiment, the bacterial cell is a Deinococcus cell, such as a D. radiodurans or D. geothermalis cell, such as a cell of the commercially available and/or fully characterized strain D. radiodurans R1.
While aspect and embodiments relating to bacterial cells herein typically refer to genes or proteins according to their designation in E. coli, for bacterial cells of another family or species, it is within the level of skill in the art to identify the corresponding gene or protein, i.e., the ortholog and/or paralog, in the other family or species, typically by identifying sequences having moderate or high homology to the E. coli sequence, optionally taking the function of the protein expressed by the gene and/or the locus of the gene in the genome into account, Table 2 sets out the function of the protein encoded by each specific gene, the corresponding E.C. number (if applicable), its locus in the E. coli K-12 MG1655 genome and the SEQ ID number of the coding or non-coding sequence and, where applicable, the encoded amino acid sequence.
Tables 3 and 4 set out some examples of homologs or orthologs in selected organisms, identified in a preliminary and non-limiting analysis. Indeed, homologs or orthologs of these proteins exist also in other bacteria, and other homologs or orthologs not identified in this preliminary search can exist in the species listed in Table 3. The skilled person is well-familiar with different searching and/or screening methods for identifying homologs or orthologs across different species. To briefly summarize some of the preliminary findings in Table 3:
E. coli gene
Ralstonia
Corynebacterium
B. subtilis
P. putida
L. plantarum
L. lactis
eutropha
glutamicum
28% identity
34% identity
25% identity
(116 αα)
“sulfate
“sugar
“chloride
transporter
transport protein”
channel protein”
29% identity
“RND
superfamily
exporter”
24% identity
29% identity
33% identity
“hypothetical
“dehydrogenase”
“elongation
protein
factor Ts”
BSU32070”
28% identity
24% identity
24% identity
29% identity
“sensor
“tryptophan
“multimodular
“O-linked N-
histidine
synthase subunit
transpeptidase-
acetylglucosamine
kinase”
alpha”
transglycosylase
transferase OGT”
Pbp2A”
30% identity
26% identity
“ABC transporter
“small heat
substrate-
shock protein”
binding protein”
45% identity
56% identity
“hypothetical
“hypothetical
protein
protein
JDM1
—
0823”
NCgl2333”
25% identity
“prolyl-tRNA
synthetase”
31% identity
“phosphopyruvate
hydratase”
26% identity
24% identity
“hypothetical
“30S ribosomal
protein
protein S12”
BSU13060”
32% identity
35% identity
“O-succinylbenzoate
“rhodanese-
synthase”
related
sulfurtransferase”
D. radiodurans R1
D. radiodurans R1
D. radiodurans R1
D. radiodurans R1
So, in one aspect, there is provided a bacterial cell according to any one of the preceding aspects and embodiments, wherein each recited gene is instead (i) a gene encoding the corresponding (homolog or ortholog) protein in Table 3 or 4 (ii) a gene located at the corresponding locus, or (iii) both.
Escherichia coli K-12 MG1655 was grown overnight in M9 minimal medium+1% glucose and subcultured the following morning to an initial OD600 of 0.05 in M9+1% glucose. Cells were grown to mid-exponential phase (OD600 0.7-1.0) and were back-diluted with fresh medium to an OD600 of 0.7. The diluted cells were used to inoculate M9+1% glucose containing varying concentrations of glutaric acid or adipic acid which were neutralized to pH 7.0 with sodium hydroxide, and growth was measured in FlowerPlates in a Biolector microbioreactor system (m2p-labs) at 37° C. with 1000 rpm shaking. The culture volume in each well was 1.4 mL.
Adaptive Laboratory Evolution of Tolerant Strains
Based on the screening results, E. coli K-12 MG1655 was grown overnight in M9 minimal medium and 150 μL was transferred the next day into 8 tubes containing 15 mL of M9+1% glucose+20 g/L glutaric acid or 25 g/L adipic acid on a Tecan Evo robotic platform custom-designed for performing adaptive laboratory evolutions (ALE). Cells were cultured on a 37° C. heat block with stirring by magnetic stir bars. Culture OD600 was monitored at times determined by a predictive custom script, and when the OD600 reached approximately 0.3, 150 μL of culture was inoculated into a new tube with the same media concentration. Instrument downtime would occasionally result in cells overgrowing to saturation or an OD600 greater than 0.3, and reinoculations were occasionally performed from cryogenic stocks of the population. When the growth rate was observed to substantially increase, the media concentration was changed. These concentration changes for glutaric acid were to 30 g/L, 40 g/L, and 45 g/L, and 47.5 g/L, while the changes adipic acid were to 35 g/L, 40 g/L, 45 g/L and 50 g/L. Approximately 100 μL of each population (8 per chemical) were plated on LB agar and incubated at 37° C. overnight.
Primary Screening of ALE Isolates
Five colonies from wild-type K-12 MG1655 and 10 individual colonies deriving from each population were inoculated into 300 μL M9+1% glucose in 96 well deepwell plates and incubated in a 300 rpm plate shaker at 37° C. The next day, cells were diluted 10× in M9+1% glucose and 30 μL was transferred into clear-bottomed 96 well half-deepwell plates (with rectangular wells) containing M9+1% glucose and M9+1% glucose+52.78 g/L glutaric acid or 55.56 g/L adipic acid, such that the final concentration of glutaric acid or adipic acid was 47.5 g/L or 50 g/L, respectively. In addition, cryogenic glycerol stocks of the overnight culture were saved in a 96 well plate format. Half deepwell plates were incubated at 37° C. with 225 rpm shaking in a Growth Profiler (Enzyscreen), with optical scans of the plates taken at 15 minute intervals. Green pixel values integrated over a 1 mm diameter circular area in each well were converted to OD600 values using a previously determined calibration between OD600 and green pixel values. Resulting growth curves were visually inspected for isolates exhibiting the most robust or unique growth patterns within each population. In general, it was attempted to select three isolates per population for further analysis, and all populations were represented in the resequenced isolates.
Secondary Screening of ALE Isolates
Selected isolates from the primary screen were restruck onto LB agar from the cryogenic stock made from the overnight culture plate for the primary screen. Five K-12 MG1655 colonies and three individual colonies from each isolate were inoculated as biological replicates into a new 96 well deepwell plate containing 300 μL of M9+1% glucose, and grown overnight as for the primary screen. The next day, a cryogenic stock and half deepwell plates containing M9+1% glucose with or without glutaric acid or adipic acid were inoculated using the plate of overnight cultures, and growth was measured as described for the primary screen. Resulting growth curves were visually inspected for isolates exhibiting robust and reproducible growth between replicates in high concentrations of glutaric acid or adipic acid.
Re-Sequencing of ALE Isolates
A total of 20 isolates were selected from the secondary screen for whole-genome resequencing. An individual colony was taken from the LB agar plates prepared following the primary screen, inoculated into 2 mL LB, and grown overnight at 37° C. in a 250 rpm shaker. The following morning, 0.5 mL of cells were transferred to microcentrifuge tubes and centrifuged at 16000×g for 2 minutes. The supernatant was removed and pellets were stored at −20° C. until further processing. Genomic DNA was extracted from thawed cell pellets using a PureLink genomic DNA extraction kit, with further concentration and purification performed by ethanol precipitation. To generate libraries for sequencing, the Illumina TruSeq Nano kit was used according to the manufacturers' directions using an input quantity of 200 ng of genomic DNA from each isolate. Sequencing was performed on an Illumina MiSeq sequencer, with a minimum 20×average genomic coverage ensured for each isolate based on the number of reads. Fastq output files were analyzed for variants compared to the K-12 MG1655 reference genome (accession number NC_000913.3) using breseq (www-adress barricklab.org/twiki/bin/view/Lab/ToolsBacterialGenomeResequencing).
Sole Carbon Source Plate Growth Assay
M9 agar plates lacking glucose and instead containing 10 g/L of glutaric acid or 10 g/L of adipic acid (both neutralized to pH 7.0 with sodium hydroxide) were prepared, and strains were struck onto wedges of the plate from a colony on an LB plate. Plates were incubated for up to 4 weeks at 37° C.
Construction of Gene Knockouts
Probable important losses-of-function were determined by identifying genes across all isolates that harboured mutations, especially those occurring in multiple populations, and by the presence of at least one mutation that either generated a premature stop codon, a frameshift mutation, or the presence of an insertion element sequence within the gene. For those genes, the corresponding knockout strain from the Keio collection of single knockout mutants (where each gene is replaced with a cassette consisting of a kanamycin resistance gene flanked by FRT sites) was used as a donor strain for P1vir phage transduction. Briefly, the Keio strain was grown to early exponential phase in LB+5 mM CaCl2 and 80 μL of a P1vir stock raised on K-12 MG1655 was added. After significant lysis was observed after 1.5 to 2 hours, the lysate was filter-sterilized to remove cells and stored at 4° C. Strain K-12 MG1655 was grown overnight in LB+5 mM CaCl2) and 100 μL of the overnight culture was mixed with 100 μL of the P1vir lysate of the Keio collection mutant, and the mixture was incubated at 37° C. without shaking for 20 minutes. The entire mixture was then plated on LB agar containing 1.25 mM sodium pyrophosphate as a chelating agent and 25 μg/mL kanamycin. One colony was then restruck on LB+1.25 mM Na2P4O7+25 μg/mL kanamycin plate and analyzed for presence of the Keio cassette in place of the wild-type gene by colony PCR. When further knockouts were constructed in the same strain, the Keio cassette was flipped out to generate a scar sequence such that KanR marker could be recycled. This was performed by transforming with pCP20, which constitutively expresses a flippase recombinase, and plating cells on LB agar+100 μg/mL ampicillin and incubating at 30° C. The next day, one or more colonies was tested by colony PCR for loss of the Keio cassette, and successful mutants were then cured of pCP20 by elevated temperature curing at 40° C. Strains were verified to be cured of plasmid by plating on LB agar+100 μg/mL ampicillin and incubation at 30° C. P1vir transductions were then performed using these mutant strains as recipients.
Biological triplicate cultures of each strain were grown to saturation overnight in 96 well deepwell plates containing 300 μL M9+1% glucose. The next day, cells were diluted 1:10 in deionized water in a clear 96 well plate and the OD600 was measured on a BioTek plate reader. 48 well FlowerPlates containing a final volume of 1.4 mL of M9+1/o glucose (plus relevant chemical) were inoculated to OD600 0.03 (with plate reader pathlength, 200 μL volume) with the overnight culture and sealed with Breathseal film. Light backscatter intensity was monitored in a Biolector microbioreactor system at 37° C. with 1000 rpm shaking.
Keio Collection Screening for Loss-of-Function Mutations
For primary screening, Keio collection mutants were inoculated directly from a cryogenic stock of the Keio collection into 300 μL LB medium containing 25 μg/mL kanamycin in 96 well deepwell plates and grown at 37° C. with 300 rpm shaking overnight. The Keio background strain, BW25113, was also inoculated into wells of this plate as a control. A cryogenic stock was made from each plate, and the cryogenic stock was replica plated into another 96 well deepwell plate containing 300 μL M9+1% glucose and grown overnight. The next day, cells were inoculated 1:100 into clear bottomed 96 well half-deepwell plates containing M9+1% glucose plus 40 g/L and 47.5 g/L putrescine, or 45 and 50 g/L adipic acid, and cultivated in a Growth Profiler as previously described for screening of ALE isolates.
As a secondary screen, promising Keio collection mutants were struck on LB+25 μg/mL kanamycin from the cryogenic stock plate prepared during primary screening above and biological triplicate colonies were inoculated into a 96 well deepwell plate containing 300 μL M9+1% glucose. The next day, cells were inoculated into plates for cultivation on the Growth Profiler as described above.
96 well deepwell plates containing 300 μL of M9+1% glucose were inoculated directly from cryogenic stocks made from precultures for the secondary screening of ALE isolates and were grown overnight at 37° C. with 300 rpm shaking. The next day, cells were diluted 1:100 into 96 well half-deepwell plates containing the following final concentrations of each chemical in M9+1% glucose:
Plates were cultivated in a Growth Profiler for 48 hours as described for screening of ALE isolates. Green pixel integrated values from each well were converted to OD600 values using a calibration curve and the resulting OD600 vs. elapsed time data was processed using custom scripts to determine the time required for each culture to reach an OD of 1.0 (tOD1). This value is a combined measure of growth rate and lag time in each culture. The median value was taken for biological triplicates of each isolate and was normalized to the median tOD1 for K-12 MG1655 controls (5 replicates). The ratio of tOD1(evolved)/tOD1(wild-type) is presented.
The maximum measured concentration of glutaric acid at which exponentially growing K-12 MG1655 can grow was found to be 50 g/L with severe inhibition (Table 5). Increasing inhibition of growth was observed from 10 to 50 g/L.
The maximum measured concentration of adipic acid at which exponentially growing K-12 MG1655 can grow was found to be 75 g/L with an extensive lag phase of 27 hours (Table 6). Growth rates dropped sharply as a function of concentration between 10 and 50 g/L.
Growth was also tested in pimelic acid (C7) and sebacic acid (C10) (Table 7). Robust growth was still observed in pimelic acid at 45 g/L, however inhibition was observed as a function of increasing concentration. Sebacic acid was more toxic, with nearly no growth detected above 40 g/L concentration.
Aiming for a starting growth rate between 0.3 to 0.4 h−1, it was decided to begin evolutions at a concentration of 20 g/L glutaric acid and 25 g/L adipic acid.
Variants detected in glutaric and adipic acid evolved strains are presented in Tables 8 and 9. Each strain name corresponds to the chemical the strain was isolated from, the population the strain was isolated from, and the original number of the strain assigned during primary screening (e.g. GLUT1-3 is a glutaric acid-evolved strain isolated from population 1). In each table, strains are arranged such that all that were isolated from the same population are presented in the same rows. Strains with an asterisk (*) following their name are hypermutator strains, and only the mutation identified that can be associated with generating the hypermutator phenotype (here only in mutS or mutT in 1 isolate from 1 glutaric acid population and all isolates from 1 adipic acid population) and those mutations that are shared with other mutations in the same gene in other strains are shown.
Mutations that occur independently across multiple populations, or that appear fixed in a highly variable population are likely causative and of highest interest. For glutaric acid, these include mutations in kgtP or its promoter region (24 out of 24 isolates), spoT (all isolates except in population GLUT8), rpoC (9 isolates in 5 populations), proV (6 isolates in 3 populations) and proX (2 isolates in 1 population), rnt (5 isolates in 3 populations), nagC and nagA (4 isolates in 2 populations). In place of mutations in spoT, a coding mutation in polB was found in all 3 isolates of population GLUT8. Mutations in rpoB, encoding another subunit of RNA polymerase in addition to rpoC, were found in all 3 isolates from population GLUT5. Of these mutations, those of kgtP, proV and proX, and nagC are likely loss-of-function mutations, due to the presence of frameshift mutations, premature stop codons, or IS element insertions in at least one population of individual isolate that possesses mutations in that gene. Other mutations are likely gain-of-function or weakening of function, for example coding mutations in genes encoding subunits of RNA polymerase (RpoC and RpoB), SpoT (plus an in-frame deletion in one population), and PolB.
For adipic acid, mutated genes that occurred across multiple populations included those in kgtP (19 out of 19 isolates), ybjL (12 isolates in 6 populations), proV or its promoter region (11 isolates in 5 populations; plus 3 isolates in 1 population that possessed a large deletion spanning proV, proX, and proW plus other neighboring genes), sspA (7 isolates in 4 populations; also found in 1 isolate from glutaric acid), the intergenic region between pyrE and rph (6 isolates in 3 populations), nagC (5 isolates in 2 populations), yicC (5 isolates in 2 populations), spoT (4 isolates in 2 populations), and pstS or its promoter region (4 isolates in 2 populations). Notably lacking were mutations in any subunit of RNA polymerase. Of these mutations, those of kgtP, ybjL, proV, sspA, and nagC are likely loss-of-function mutations, due to the presence of frameshift mutations, premature stop codons, or IS element insertions in at least one population of individual isolate that possesses mutations in that gene. Coding mutations in SpoT are likely gain-of-function or weakening of function, as for glutaric acid. Mutations in sspA are either coding SNPs or an in-frame (21 bp) deletion, therefore it is unclear whether this mutation is a loss-of-function. PstS is one subunit of a transporter complex complex PstBACS which is involved in the import of inorganic phosphate under phosphate starvation conditions.
Each re-sequenced isolate was characterized using the Biolector system for growth at the screening concentration of chemical (47.5 g/L glutaric acid or 50 g/L adipic acid) in biological triplicates. The average growth rates with standard errors for the three replicates are shown in Tables 10 and 11.
Variations in growth behavior amongst evolved isolates can be noted. Better growing strains are defined by both the slope of the curve (higher growth rate) and at what time the cultures begin growing (reduced lag time). Some isolates exhibit poorer improvements in growth rates (e.g. ADIP7-2 and ADIP8 isolates) but especially reduced lag times. The phenotype to genotype relationship infers mutations that are of highest interest and those that are not of interest. For example, GLUT2-10 was the best performing isolate from population GLUT2, indicating that either the RpoC-H419P and/or RpsA-T358S mutations are causative for higher growth rate, or the lack of other mutations found in the other two isolates is beneficial. Another example of this would, for example, be when multiple isolates from one population are growing nearly identically (e.g. GLUT4-1 and GLUT4-4). This indicates that any differences in mutations between these two isolates are not important for tolerance, in this case the intergenic mutations between yfbP and nuoN, and between ibsE and rfaE, found in GLUT4-4.
Sole Carbon Source Plate Growth Assay Wild-type, glutaric acid, and adipic acid evolved strains were struck on M9 agar containing glutarate or adipate as a sole carbon source. No growth was observed on adipic acid plates, indicating that E. coli cannot utilize adipic acid as a sole carbon source. Robust but very slow growth of wild-type K-12 MG1655 was observed after a few weeks (Table 12), indicating the ability of E. coli to use glutarate as a sole carbon source, almost certainly through promiscuous activity of pathway enzymes (because glutarate is not a natural metabolite in E. coli). Growth on this compound as a sole carbon source has not been previously reported in the literature. Evolved isolates generally could not grow on glutaric acid, with the exception of weak growth exhibited by GLUT8-9. The only common genetic feature between all evolved isolates relative to the wild-type are probable loss-of-function mutations in kgtP, implicating KgtP, an α-ketoglutarate importer, as being a direct importer for glutarate. GLUT8-9 notably features an in-frame 9 bp deletion in kgtP that is unique among all the isolates. Without being limited to theory, this mutation may result in a reduced activity of KgtP, rather than a full loss-of-function.
Probable loss-of-function mutations were identified from re-sequencing results as described in methods and in the results of the resequencing analysis. Because there was probable loss-of-function of kgtP in all resequenced isolates, this was the only single knockout tested, and additional knockouts were selected to be tested as double combinations together with kgtP (Table 13). Only the triple knockout in kgtP, proV, and nagC was tested initially due to one isolate (GLUT8-6) possessing probable loss-of-function mutations in kgtP, proV, and nagA, and due to previous studies indicating similar phenotypes and likely the same mechanism of action for improved tolerance in high osmotic pressures (or due to high Na+ concentrations) from both nagC and nagA knockouts (Lennen and Herrgard, 2014). All strains with kgtP knockouts exhibited higher growth rates than the wild-type in 23.8 g/L and 47.5 g/L glutaric acid, however none of the multiple knockout strains exhibited significantly improved growth relative to the single kgtP knockout strain alone. K-12 MG1655 ΔkgtP nagC::kan exhibited reduced growth relative to K-12 MG1655 kgtP::kan in 47.5 g/L glutaric acid.
A second group of selected single, double, and triple knockout mutants was tested in the Biolector testing format in M9+47.5 g/L glutaric acid (Table 14). The ybjL loss-of-function which had been identified from resequencing of adipic acid evolved isolates was included, to determine if that mutation would also confer tolerance toward glutaric acid. The proV and ybjL mutations did not increase growth rates alone, but in double combinations with the kgtP mutation, were found to increase the growth rate over that of the kgtP single mutant. K-12 MG1655 ΔkgtP sspA::kan additionally exhibited an increased growth rate over that of the kgtP single knockout mutant. The triple knockout mutant in kgtP, proV, and ybjL exhibited a growth rate higher than that of the tested double knockout combinations, with a growth rate nearly equivalent to two of the evolved isolates tested alongside in the same experiment (GLUT1-3 and GLUT4-1), and exceeding the growth rate of many other evolved isolates in Table 6.
Probable loss-of-function mutations were identified as previously described. As for glutarate, probable loss-of-function of kgtP was identified in all resequenced isolates, therefore this single knockout was tested with additional double and triple combinations all containing the kgtP knockout. Of the tested strains, K-12 MG1655 kgtP::kan exhibited slightly improved tolerance in 25 g/L adipate, and a much larger improvement in growth in 50 g/L adipate (Table 15). The only tested combinatorial knockout with a higher growth rate than the kgtP single knockout strain was MG1655 ΔkgtP proV::kan. K-12 MG1655 ΔkgtP sspA::kan exhibited greatly reduced tolerance relative to the wild-type, indicating that the sspA mutations isolated in resequenced mutants are likely either gain-of-function mutations or that they only result in weakened activity of the gene product.
In a second experiment, additional single, double, and triple knockout combinations were tested in 25 g/L and 50 g/L adipate (Table 16).
The ybjL single knockout strain was found to moderately improve tolerance, but below the levels conferred by deletion of kgtP. K-12 MG1655 ΔkgtP ybjL::kan exhibited growth rates similar to K-12 MG1655 ΔkgtP proV::kan, and the triple knockout combination found in K-12 MG1655 ΔkgtP ΔproV ybjL::kan exhibited an improved growth rate over either double knockout combination. Selected single, double, and triple knockout strains were then tested in the Biolector testing format (Table 17). Results were similar to that observed in Table 16, with a higher growth rate observed for K-12 MG1655 ΔkgtP ΔproV ybjL::kan than for some ALE isolates. K-12 MG1655 ΔkgtP sspA::kan was additionally tested, however growth rates were not improved for media containing adipate as they were for glutarate.
To determine if any additional single gene deletion candidates were overlooked, screening on elevated glutarate and adipate concentrations was also conducted using the Keio collection of gene knockouts, which is a commercial collection of knockouts in nearly all non-essential genes and ORFs in E. coli strain BW25113. This strain is a K-12 derivative and possesses known mutations relative to the K-12 MG1655 background. All Keio collection strains with knockouts in genes that were found to be mutated in Tables 8 and 9 were screened for growth against the BW25113 control in M9+40 g/L or 47.5 g/L glutaric acid, or M9+45 g/L or 50 g/L adipic acid (neutralized with sodium hydroxide) in the Growth Profiler screening format. Primary screening hits were measured again in a secondary screen in biological replicates, with averaged growth curves for 3 biological replicate cultures shown individually for each strain in Tables 18 and 19, For glutaric acid (Table 18), BW25113 cspE::kan and BW25113 proX::kan exhibited the largest increases in growth rate at 47.5 g/L glutarate, with small improvements also seen with 40 g/L glutarate. ProX is a subunit with ProV in the ProVWX ABC transporter. In M9+47.5 g/L glutarate, additional knockout strains with smaller improvements in growth rates were the rfaE, yfbP, and yfjM knockout strains.
For adipic acid (Table 19), a number of single deletion mutants exhibited moderate increases in growth rate in 50 g/L adipate. These were knockouts in proQ, pstS, rph, rpoS, sspA, tdk, uvrB, ycjG, and yeaR. In 47.5 g/L adipate, smaller percentage improvements in growth rate were observed, however all of these mutants similar exhibited significant increases in growth rate.
A list of all gene disruption mutants in both the K-12 MG1655 and BW25113 background strains that exhibited increased tolerance to glutaric acid is shown in Table 20. A similar table for adipic acid is shown in Table 21.
The single knockout strains in kgtP, proV, and ybjL, plus the double and triple combination knockout strains, were struck as previously described on glutarate as a sole carbon source, together with wild-type K-12 MG1655 and a selection of ALE evolved isolates as controls. Robust growth was again observed from K-12 MG1655 after a few weeks incubation, with reduced growth of GLUT8-9 and greatly reduced or no growth in other evolved isolates (Table 22). A larger inoculum was spread on the plates, which likely explains why very weak growth was observed for GLUT8-6 and GLUT1-10. K-12 MG1655 kgtP::kan exhibited no growth, indicating that loss-of-function of kgtP is explicitly responsible for the weak or absent growth of evolved strains on glutarate, and suggesting that KgtP is indeed a direct importer of glutrate. The proV and ybjL single knockout strains did not exhibit reduced growth relative to K-12 MG1655, and double and triple knockout combination strains with kgtP did not exhibit any growth, as would be expected from the loss of kgtP.
Every secondary screened evolved isolate from the glutaric acid and adipic acid evolutions was grown in the presence of every other compound in the study as indicated in the Methods. The normalized tOD1(evolved strain)/tOD1(wild-type) are shown in Tables 23 and 24 for the glutaric acid and adipic acid evolved isolates, respectively. Lower values are indicative a larger improvement in growth of the evolved isolate (left column) in that chemical condition (top row), whereas higher values are indicative of a lower improvement or decrease in growth compared to the wild-type. Averaged ratios across conditions and strains shown at the right and bottom of the plot allow for overall by-chemical and by-strain trends to be observed. Strain names that are followed by an asterisk (*) were not re-sequenced, and strain names in italics were found to be hypermutator strains.
The majority of glutaric acid-evolved isolates exhibit cross-tolerance to adipic acid (notable exceptions were isolates from the GLUT5 population, GLUT1-3, GLUT1-9, and GLUT2-10). Likewise, the majority of adipic acid evolved isolates exhibit cross-tolerance to glutaric acid (notable exceptions were most isolates in the ADIP1, ADIP2, and ADIP3 populations, plus a couple isolates from other populations (ADIP4-4, ADIP5-5, neither of which were resequenced). Isolates with the highest degree of cross-tolerance were GLUT4-10, GLUT8-6, and GLUT8-9. The GLUT8 population was notable for possessing coding mutations in polB and lacking coding mutations in spoT. Adipic acid evolved isolates exhibited a lower overall degree of cross tolerance, with the best performing isolate being ADIP6-9. This isolate most likely has loss-of-function of kgtP, proV, and ybjL, plus coding mutations in proQ (suggested to be loss-of-function from Keio mutant screen) and spoT. The ADIP6 population specifically exhibited a high level of cross-tolerance toward all other acid salts in the study (hexanoate, octanoate, isobutyrate, glutarate, and p-coumarate). Acid cross-tolerance was also evident from many glutaric acid evolved isolates, however high cross-tolerance toward the diamine HMDA and the diols 2,3-butanediol and 1,2-propanediol was evident in a number of isolates. Cross-tolerance toward HMDA in population GLUT8 can be inferred to be due to a mutation present in GLUT8-6 and GLUT8-9 that is not found in GLUT8-5, which in this case is a coding mutation in sapC that is likely causative for HMDA cross-tolerance.
GLUT3-5
1.31
0.58
1.05
0.98
0.90
1.00
0.53
1.21
0.70
0.52
0.81
2.28
0.99
ADIP3-2
1.27
0.90
1.42
0.51
3.22
3.51
0.94
3.67
0.74
1.38
0.68
2.64
1.74
Additionally, each evolved isolate was tested for cross-tolerance toward other dicarboxylic acids of interest. First, K-12 MG1655 was tested in the Growth Profiler screening format for growth in the presence of a range of concentrations of each compound (note that this had been done in the Biolector format previously for pimelic acid and sebacic acid (Table 7) thus was not repeated here): fumaric acid, itaconic acid, malic acid, and succinic acid. All diacids tested were either the neutral sodium salts, or the free diacid was neutralized with sodium hydroxide to pH 7.0 for testing. Variable concentrations of these compounds elicited growth inhibition in E. coli K-12 MG1655 (Table 25). Based on these results, a screening concentration was selected for the evolved isolates for which wild-type cells could achieve at a growth rate of 0.15-0.3 h−1 (versus uninhibited growth at 0.7-0.9 h−1 in M9 glucose minimal medium). These concentrations were: 45 g/L fumaric acid, 45 g/L itaconic acid, 55 g/L malic acid, 50 g/L succinic acid, 45 g/L pimelic acid, and 38 g/L sebacic acid. The results of glutaric acid and adipic acid-evolved isolates grown in these concentrations of fumaric, itaconic, and malic acid are shown in are shown in Tables 26 and 27, and the same isolates grown at the selected concentration of succinic, pimelic, and sebacic acid (linear aliphatic diacids) are shown in Tables 28 and 29. A majority of evolved isolates exhibited increased growth rates and/or reductions in lag time in all tested diacids. These in particular included isolates from the ADIP1, ADIP2, ADIP3, and ADIP7 populations, ADIP5-6 (a hypermutator strain) for all diacids generally. ADIP7-2 and ADIP7-5 exhibited the highest growth rates in sebacic acid, with the ADIP2 population also exhibiting significantly improved growth. ADIP7-2 and ADIP7-5 possessed different sets of mutations but notably only possessed loss-of-function mutations in kgtP plus distinct modulatory mutations (probable reduction-of-function of rpoS based on the Keio collection screen and loss-of-function of pstS plus an intergenic insertion between hns and tdk in ADIP7-2, and loss-of-function of ybjL, proV, and probable reduction-of-function of sspA based on the Keio collection screen in ADIP7.5). The common features of ADIP2 isolates were possessing only loss-of-function mutations in kgtP, deletion of proVWX, and mutations that restore expression of PyrE. Glutaric acid evolved isolates tended to exhibit a specificity toward tolerance to particular diacids. Isolates from the GLUT5 population exhibited significantly improved growth rates in fumaric acid (these isolates exhibited probable reduction-of-function of sspA and the RpoB-K203T and SpoT-R236L mutations), whereas the GLUT1 and GLUT2 populations had some of the most improved growth rates and reduced lag times in itaconic acid (these isolates featured loss-of-function of kgtP and the SpoT-V422A and RpoC-H419P mutations in GLUT1-3 and GLUT1-9; loss-of-function of kgtP, proV, probable reduction-of-function of sspA, and the SpoT-V422A mutation in GLUT1-10; loss-of-function of kgtP and optionally proV, and the SpoT-A451D mutation in GLUT2-1 and GLUT2-9; and loss-of-function of kgtP and proV plus the SpoT-A451D and RpoC-H419P mutations in GLUT2-10). Malic acid cross-tolerance was weak across all glutaric acid evolved isolates. Isolates from the GLUT8 population had dramatically improved growth rates (as well as moderate reductions in lag time) toward sebacic acid (these isolates commonly featured loss-of-function of kgtP and proV, and the RpoC-P64L and PolB-R477G mutations, with the best performing isolates GLUT8-6 and GLUT8-9 additionally possessing the SapC-G79W mutation), with GLUT1-10 and GLUT2-9 also exhibiting significantly enhanced growth rates. Notably, the GLUT8 population had relatively poor cross-tolerance for most diacids, and all tested isolates featured the PolB-R477G mutation, whereas all other populations featured isolates with mutations in SpoT.
Glutaric acid has been the target of two studies, both in engineered E. coli. The highest reported titer of 0.82 g/L from glucose was achieved via native L-lysine production (Adkins et al., 2013) (see FIG. 1 of Adkins et al., 2013, hereby incorporated by reference). A heterologous pathway composed of genes from Pseudomonas putida KT2440 was expressed from plasmids and consisted of a lysine monooxygenase to convert lysine to 5-aminovaleramide, a 5-aminovaleramidase to convert 5-aminovaleramide to 5-aminovaleric acid, a 5-aminovalerate transaminase to convert 5-aminovaleric acid and α-ketoglutarate to glutarate semialdehyde and L-glutamate, and a glutarate semialdehyde dehydrogenase to convert glutarate semialdehyde to glutaric acid. To improve flux toward L-lysine, previously known feedback resistance mutations were made in DapA (4-hydroxytetrahydrodipicolinate synthase) and LysC (asparate kinase III), and these modified proteins were additionally overexpressed from plasmids. Finally, cadA and IdcC, encoding two lysine decarboxylases, were deleted to prevent side conversion of L-lysine into cadaverine. In a second study (Park et al., 2013), glutarate was not able to be produced Prom glucose, however 1.7 g/L glutarate could be produced by feeding both L-lysine and α-ketoglutarate, with only 5-aminovalerate able to be produced from glucose without supplementation. This appeared to use the same or similar heterologous genes from P. putida as in the previous paper, only expressed together as an artificial operon on one plasmid instead of in two operons on two plasmids. The strain additionally contained a dapA promoter replacement to allow constitutive expression of lysine biosynthesis, and deletion of speE, speG, patA, and puuPA (which would prevent production of spermidine, acetylspermidine, putrescine degradation, and putrescine import, although likely not for any targeted purpose here; use of this background strain for production of other compounds).
Overproduction of adipic acid, as well as other diacids that can be readily converted chemically into adipic acid, has been more heavily pursued due to the use of adipic acid in existing commercial polyamides. A wide variety of routes have been explored, with the first reported direct route in E. coli being a proof-of-concept demonstration, with a maximum titer of 639 μg/mL adipic acid (Yu et al., 2014). In the best-performing strain, acetyl-CoA and succinyl-CoA were condensed by a reversible 3-oxoadipyl-CoA thiolase (PaaJ from E. coli), 3-oxoadipyl-CoA was reduced to 3-hydroxyadipyl-CoA by a 3-hydroxyacyl-CoA dehydrogenase (PaaH1 from Ralstonia eutropha), 3-hydroxyadipyl-CoA was dehydrated to 2,3-dehydroadipyl-CoA by a putative enoyl-CoA hydratase (h16_AA307 gene product from Ralstonia eutropha H16), 2,3-dehydroadipyl-CoA was reduced to adipyl-CoA by a trans-enoyl-CoA reductase (Ter from Euglena gracilis), adipyl-CoA was converted to adipyl-phosphate by a phosphate butyryltransferase (Ptb from Clostridium acetobutylicum), and adipyl-phosphate was finally dephosphorylated to adipic acid using a butyryl kinase (Buk1 from Clostridium acetobutylicum) (see FIG. 1 of Yu et al., 2014, hereby incorporated by reference). The genes encoding these enzymes were heterologously expressed on two different plasmids, and additional modifications were made to the genome to improve the succinyl-CoA supply using modifications that were previously employed for succinic acid production via succinyl-CoA (Liu et al., Process Biochem. 47:1532, 2012). These were deletions of ptsG, poxB, pta, sdhA, and iciR.
Very recently, production of 2.5 g/L adipic acid in bioreactors, as well as smaller quantities of suberic acid (C8) and sebacic acid (C10), or pimelic acid (C7) alone, was demonstrated in E. coli from glycerol using a relatively similar modular pathway (Cheong et al., 2016). It was composed of a thiolase capable of condensing a primer and extender unit (e.g. succinyl-CoA and acetyl-CoA for adipic acid), plus a hydroxyacyl-CoA hydrogenase, an enoyl-CoA hydratase, an enoyl-CoA reductase to generate the fully reduced product (for adipic acid, this would be adipyl-CoA) (see FIG. 1a of Cheong et al., 2016, hereby incorporated by reference). The major difference from previous work was the use of an acyl-CoA thioesterase to liberate the final diacid from CoA. For production of adipic acid production, a CoA transferase (Cat1 from Clostridium kluyveri) was expressed for activation of succinic acid to succinyl-CoA, with native sucD encoding a subunit of native E. coli succinyl-CoA synthetase deleted. The thiolase was E. coli PaaJ, the hydroxyacyl-CoA hydrogenase was E. coli PaaH, the enoyl-CoA hydratase was E. coli PaaF, the enoyl-CoA reductase was Ter from Treponema denticola, and the acyl-CoA thioesterase was the dicarboxylic acyl-CoA thioesterase Acot8 from Mus musculus. Additionally, fermentative pathways leading to production of acetate (pta and poxB), lactate (IdhA), and ethanol (adhE) were deleted from the background strain. To produce approximately 25 mg/L pimelic acid, glutaric acid was fed to generate glutaryl-CoA as a primer unit via the action of Cat1 in a background strain deficient in ldhA, poxB, pta, adhE, and the gene encoding fumarate reductase, frdA. To generate a mixture containing predominantly adipic acid at 95 mg/L, but also 34 mg/L suberic acid and 13 mg/L sebacic acid, an alternative thiolase (DcaF), hydroxyacyl-CoA dehydrogenase (DcaH), and enoyl-CoA hydratase (DcaE) from Acinetobacter sp. ADP1, with other enzymes the same as for producing adipic acid. In this case, the background strain was the same as that used for pimelic acid, with additional deletions in a number of other native E. coli acyl-CoA thioesterases (yciA, ybgC, ydiI, tesA, fadM, and tesB).
An alternative native adipate production pathway (reverse adipate degradation) has been reported in the bacterium Thermobifida fusca B6, where succinyl-CoA and acetyl-CoA are condensed by a p-ketothioase (EC 2.3.1.174) to form 3-oxoadipyl-CoA, followed by a series of reactions that are the same as those shown in FIG. 1 of Yu et al., 2014, to form adipyl-CoA. Adipyl-CoA is subsequently converted to adipic acid by a succinyl-CoA synthetase (Tfu_2577, Tfu_2576). A titer of over 2 g/L of adipic acid was obtained by fermentation of T. fusca B6 on glucose and milled corncob (Deng and Mao, 2015). The pathway has not yet been heterologously expressed in other organisms.
Adipic acid production from S. cerevisiae has been described by Verdezyne (e.g. WO 2011/003034 A2), however the starting substrates are fatty acids and the pathway for adipic acid production is therefore very different. The engineered microorganisms described have genetic modifications that add or increase the 6-oxohexanoic acid dehydrogenase, omega oxo fatty acid dehydrogenase, 6-hydroxyhexanoic acid dehydrogenase, omega hydroxyl fatty acid dehydrogenase, hexanoate synthase, monooxygenase, monooxygenase reductase, fatty alcohol oxidase, acyl-CoA ligase, acyl-CoA oxidase, enoyl-CoA hydratase, 3-L-hydroxyacyl-CoA dehydrogenase, and/or acetyl-CoA C-acetyltransferase activities. These modifications suggest a pathway where a fatty acid is broken down into multiple shorter chain diacids, derivatives are condensed with acetyl-CoA to increase the chain length where necessary, and C6 diacid products are reduced to adipic acid following some similar steps to those shown in FIG. 1a of Cheong et al., supra).
In addition to directly producing the final diacid products, other groups have developed alternative pathways to cis,cis-muconic acid, which can be chemically or enzymatically reduced to adipic acid, or glucaric acid, which can be produced in few steps from glucose and can be chemically reduced to adipic acid. Basic pathway schematics are shown in FIG. 4 of Polen et al. (2013) for muconic acid, and FIG. 5 of Polen et al. (2013) for glucaric acid. FIGS. 4 and 5 of Polen et al. (2013) are hereby incorporated by reference.
Muconic acid has been produced at 141 mg/L from glucose in S. cerevisiae (Curran et al., 2013) in a strain possessing a deletion in ARO3 (a 3-deoxy-D-arabino-heptulosonate-7-phosphate synthase isoform), expression of a feedback resistant version of ARO4 (another 3-deoxy-D-arabino-heptulosonate-7-phosphate synthase isoform; the modifications included both a feedback resistant coding mutation and a constitutive promoter replacement), and a deletion in ZWF1 (glucose-6-phosphate dehydrogenase). The heterologous pathway was expressed on 3 plasmids containing codon-optimized DHS from Podospora anserina, catechol 1,2-dioxygenase from Candida albicans, overexpressed TKL1 (transketolase) from S. cerevisiae, and protocatechuate decarboxylase from Enterobacter cloacae. To obtain higher expression of protocatechuate decarboxylase, another copy of the gene was also integrated onto the chromosome. The combination of non-pathway mutations (deletion of ZWF1 and ARO3; expression of feedback-resistant ARO4; overexpression of TKL1) served to relieve feedback inhibition of the shikimate pathway that is ordinarily employed for aromatic amino acid biosynthesis, and to direct flux into the pentose phosphate pathway via transketolase instead of glucose-6-phosphate dehydrogenase, increasing the supply of the erythrose-4-phosphate precursor.
Muconic acid has additionally been produced (170 mg/L of muconic acid) from glucose in E. coli possessing deletions in ptsH, ptsI, crr, and pykF and overexpressing ubiC, a feedback resistant version of aroF, aroE, and aroL (Sengupta et al., 2015). These mutations increase the supply of the precursors erythrose-4-phosphate and phosphoenolpyruvate. The heterologous pathway was composed of pobA from Pseudomonas putida KT2440, aroY from Klebsiella pneumoniae, and catA from Acinetobacter sp. strain ADP1. E. coli co-cultures consisting of strains engineered to overproduce DHS, and to convert DHS to muconic acid, have additionally been engineered to produce muconic acid from glycerol at a final titer of 2 g/L (Zhang et al., Microb. Cell Factories 14:134, 2015).
Number | Date | Country | Kind |
---|---|---|---|
16201310.6 | Nov 2016 | WO | international |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2017/079313 | 11/15/2017 | WO |
Number | Date | Country | |
---|---|---|---|
62422457 | Nov 2016 | US |