The present disclosure relates to recombinant host cells comprising one or more recombinant polynucleotides encoding enzymes in select pathways that provide the ability to use the cells to produce 1,3-butadiene, and the methods of manufacture of the cells, and methods of use of the cells for the production of 1,3-butadiene.
The official copy of the Sequence Listing is submitted concurrently with the specification as an ASCII formatted text file via EFS-Web, with a file name of “CX5-115USP1.txt”, a creation date of Mar. 1, 2012, and a size of 17,649 bytes. The Sequence Listing filed via EFS-Web is part of the specification and is incorporated in its entirety by reference herein.
1,3-butadiene (also referred to herein as “butadiene”) is a feedstock chemical used in the production synthetic rubbers, polymer resins, and other industrially important chemicals such as hexamethylenediamine, and adipidonitrile. Currently, nearly all of the 25 billion pounds of 1,3-butadiene produced annually is made by steam-cracking of non-renewable petroleum feedstock chemicals. Accordingly, there is a need for alternative processes that could produce 1,3-butadiene from renewable non-petroleum feedstock chemicals such as sugars (e.g., molasses, sugar cane juice), and particularly, from sugar compositions obtained from non-food cellulosic biomass sources (e.g., sugar cane bagasse, corn stover, wheat straw).
US2011/0300597A1 discloses non-naturally occurring microbial organisms containing butadiene pathways comprising at least one exogenous nucleic acid encoding a butadiene pathway enzyme expressed in a sufficient amount to produce butadiene. US2011/0300597A1 proposes, among other pathways, an engineered butadiene pathway that includes a crotonol intermediate which is formed through a 2-step reduction of crotonyl-CoA to crotonol through crotonaldehyde using a supposed crotonaldehyde reductase enzyme (see e.g. at
US2012/0021478A1 discloses non-naturally occurring microbial organisms containing butadiene pathways comprising at least one exogenous nucleic acid encoding a butadiene pathway enzyme expressed in a sufficient amount to produce butadiene. US201210021478A1 proposes, among other pathways, an engineered butadiene pathway in which a 3,5-dihydroxypentanoate and/or a 5-hydroxypent-2-enoate intermediate is formed and then decarboxylated by a supposed 3-hydroxyacid decarboxylase to form 3-butene-1-ol. The 3-butene-1-ol is subsequently dehydrated by a supposed 3-butene-1-ol dehydrogenase or a chemical catalyst to provide butadiene (see e.g.,
The present disclosure fulfills a need in the art by providing recombinant host cells that comprise an engineered pathway of enzymes as depicted in
In some embodiments, the present disclosure provides a recombinant host cell capable of producing crotonol, the host cell comprising: (a) a recombinant polynucleotide encoding a FAR enzyme capable of converting crotonyl-CoA and/or crotonyl-ACP to crotonol. In certain embodiments, the host cell further is capable of producing 1,3-butadiene and further comprises: (b) a recombinant polynucleotide encoding an enzyme capable of converting crotonol to but-2-enyl phosphate: and (c) a recombinant polynucleotide encoding an enzyme capable of converting but-2-enyl phosphate to 1,3-butadiene.
In some embodiments of the recombinant host cell, the recombinant polynucleotide encoding the FAR enzyme comprises one or more nucleotide sequence differences relative to the corresponding naturally occurring polynucleotide, which result in an improved property selected from: (a) increased activity of the FAR enzyme in the conversion of crotonyl-CoA (or ACP) to crotonol; (b) increased expression of the FAR enzyme; (c) increased host cell tolerance of acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP, 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, but-2-enyl phosphate, or 1,3-butadiene; or (d) altered host cell concentration of acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP, 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, but-2-enyl phosphate, or 1,3-butadiene.
In further embodiments of the recombinant host cell, the recombinant polynucleotide encoding an FAR enzyme comprises a polynucleotide sequence that has at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, identity to a sequence encoding any one of SEQ ID NO: 1, 2, 3, and 4, or which hybridizes under stringent conditions to a polynucleotide sequence encoding any one of SEQ ID NO: 1, 2, 3, and 4. In some embodiments, the FAR enzyme comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, identity to an amino acid sequence of any one of SEQ ID NO: 1, 2, 3, and 4.
In some embodiments of the recombinant host cell, the FAR enzyme capable of converting crotonyl-CoA to crotonol is the next enzyme in a pathway of enzymes catalyzing a series of conversions: (i) acetyl-CoA to acetoacetyl-CoA; (ii) acetoacetyl-CoA to 3-hydroxybutyryl-CoA; and (iii) 3-hydroxybutyryl-CoA to crotonyl-CoA. In some embodiments of the recombinant host cell, the FAR enzyme capable of converting crotonyl-CoA to crotonol is the next enzyme in a pathway of enzymes catalyzing the series of conversions: (i) acetyl-CoA to malonyl-CoA; (ii) malonyl-CoA to malonyl-ACP; (iii) malonyl-ACP to acetoacetyl-ACP; (iv) acetoacetyl-ACP to 3-hydroxybutyryl-ACP; and (v) 3-hydroxybutyryl-ACP to crotonyl-ACP.
In some embodiments of the recombinant host cell, the FAR enzyme capable of converting crotonyl-CoA to crotonol is the next enzyme in a pathway comprising the series of enzymes: (i) acetoacyl-CoA thiolase; (ii) acetoacetyl-CoA reductase; and (iii) a crotonase or dehydratase having activity on longer chain f-keto-acyl-CoA. In some embodiments of the recombinant host cell, the FAR enzyme capable of converting crotonyl-CoA to crotonol is the next enzyme in a pathway comprising the series of enzymes: (i) acetyl-CoA carboxylase; (ii) ACP-malonyl transferase; (iii) β-keto-acyl-ACP synthase; (iv) acetoacetyl-ACP reductase; and (v) 3-hydroxybutyryl-ACP dehydratase.
In some embodiments, the recombinant host cell comprises an alcohol kinase enzyme capable of converting crotonol to but-2-enyl phosphate, wherein the recombinant polynucleotide encoding the enzyme capable of converting crotonol to but-2-enyl phosphate comprises one or more nucleotide sequence differences relative to the corresponding naturally occurring polynucleotide, which result in an improved property selected from: (a) increased activity in the conversion of crotonol to but-2-enyl phosphate: (b) increased expression; (c) increased host cell tolerance of acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP, 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, but-2-enyl phosphate, or 1,3-butadiene; or (d) altered host cell concentration of acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP, 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, but-2-enyl phosphate, or 1,3-butadiene.
In some embodiments, the recombinant host cell comprises a terpene synthase enzyme capable of converting but-2-enyl phosphate to 1,3-butadiene, wherein the recombinant polynucleotide encoding the enzyme capable of converting but-2-enyl phosphate to 1,3-butadiene comprises one or more nucleotide sequence differences relative to the corresponding naturally occurring polynucleotide, which result in an improved property selected from: (a) increased activity in the conversion of but-2-enyl phosphate to 1,3-butadiene: (b) increased expression; (c) increased host cell tolerance of acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP, 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, but-2-enyl phosphate, or 1,3-butadiene; or (d) altered host cell concentration of acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP. 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, but-2-enyl phosphate, or 1,3-butadiene.
In some embodiments, the recombinant host cell further comprises one or more recombinant polynucleotides encoding one or more enzymes selected from: (i) acetoacyl-CoA thiolase; (ii) acetyl-CoA carboxylase: (iii) ACP-malonyl transferase; (iv) 1-keto-acyl-ACP synthase: (v) acetoacetyl-CoA reductase; (vii) acetoacetyl-ACP reductase; (viii) crotonase or other dehydratase: or (viii) 3-hydroxybutyryl-ACP dehydratase. In some embodiments, any one of these one or more recombinant polynucleotides comprise one or more nucleotide sequence differences relative to the corresponding naturally occurring polynucleotide, which result in an improved property selected from: (a) altered activity of an encoded enzyme; (b) altered expression of an encoded enzyme; (c) increased host cell tolerance of a compound selected from: acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP, 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, or 1,3-butadiene; and (d) altered host cell concentration of a compound selected from: acetyl-CoA, acetoacetyl-CoA, malonyl-CoA, malonyl-ACP, 3-hydroxybutyryl-CoA, acetoacetyl-ACP, crotonyl-CoA, crotonyl-ACP, crotonol, or 1,3-butadiene.
In some embodiments of the recombinant host cell, the host cell is capable of producing crotonol and/or 1,3-butadiene by fermentation of a carbon source, wherein the carbon source is a fermentable sugar. In some embodiments, the fermentable sugar is glucose. In some embodiments, the fermentable is obtained from a cellulosic biomass, such as sugar cane bagasse, corn stover, or wheat straw.
In some embodiments of the recombinant host cell, the host cell is from a strain of microorganism derived from any one of: Escherichia coli, Bacillus, Saccharomyces, Streptomyces and Yarrowia. In some embodiments, the host cell is from a microorganism selected from E. coli, S. cerevisiae, and Y. lipolytica.
The present disclosure also provides methods of manufacturing the recombinant host cells of the disclosure (i.e., recombinant host cells comprising an engineered pathway of
The present disclosure also provides methods of using the recombinant host cells disclosed herein in processes for making crotonol and/or 1,3-butadiene. In some embodiments, the disclosure provides a method of producing crotonol or 1,3-butadiene comprising contacting a recombinant host cell of the disclosure (i.e., a recombinant host cell comprising an engineered pathway of
In some embodiments, the present disclosure provides a method of producing 1,3-butadiene that includes a chemo-catalytic dehydration step, the method comprising (i) contacting the recombinant host cell of the disclosure which is capable of producing crotonol (e.g., via the engineered pathway of
The present disclosure addresses the need in the art for biological compositions and associated methods to produce 1,3-butadiene from cheap, renewable carbon sources, such as fermentable sugars obtained from plant biomass.
The present disclosure provides recombinant host cells that are capable of producing crotonol and/or 1,3-butadiene, and associated compositions, processes, techniques, and methods of manufacture, that can provide for large scale production of 1,3-butadiene. The recombinant host cells of the disclosure comprise one or more recombinant polynucleotides that encode one or more enzymes in select pathways of enzymes, which are depicted schematically in
In particular embodiments, the recombinant host cells comprise a polynucleotide encoding a fatty acyl reductase (FAR) enzyme which is capable of directly converting crotonyl-CoA to crotonol as a single enzyme and/or crotonyl-ACP to crotonol as a single enzyme. In some embodiments, the FAR enzyme is an engineered enzyme derived from a fatty acyl reductase gene found in a species of Marinobacter or Oceanobacter, and in particular embodiments the gene found in Marinobacter algicola strain DG893 or Marinobacter aquaeolei VT8.
In some embodiments of the disclosure, the host cells further comprise an engineered pathway of enzymes that carries out the further conversion of crotonol to 1,3-butadiene, thereby providing for fully biosynthetic route for the production 1,3-butadiene. This engineered pathway proceeds from the intermediate compound, crotonol, through a kinase mediated phosphorylation to give the corresponding phosphate ester, but-2-enyl phosphate. This phosphate can be eliminated in a step akin to isoprene synthesis to provide the desired product 1,3-butadiene. This further engineered pathway is depicted in
In some embodiments, the recombinant host cells comprise one or more recombinant polynucleotides encoding an engineered variant of an enzyme described herein and in the engineered pathways of
The recombinant host cells, engineered pathways, and specific recombinant polynucleotides and encoded enzymes that make up the pathways and carry out the substrate-to-product conversions are described in greater detail below. Additionally, the following sections describe methods for using the recombinant host cells for the production of crotonol and/or 1,3-butadiene from fermentable sugars.
The technical and scientific terms used in the descriptions herein will have the meanings commonly understood by one of ordinary skill in the art, unless specifically defined otherwise. Accordingly, the following terms are intended to have the following meanings.
“Protein”, “polypeptide,” and “peptide” are used interchangeably herein to denote a polymer of at least two amino acids covalently linked by an amide bond, regardless of length or post-translational modification (e.g., glycosylation, phosphorylation, lipidation, myristilation, ubiquitination, etc.). Included within this definition are D- and L-amino acids, and mixtures of D- and L-amino acids.
“Enzyme” as used herein refers to a polypeptide or protein having capable of catalyzing the conversion of substrate molecule to a product molecule.
“Nucleic acid” or “polynucleotide” are used interchangeably herein to denote a polymer of at least two nucleic acid monomer units or bases (e.g., adenine, cytosine, guanine, thymine) covalently linked by a phosphodiester bond, regardless of length or base modification
“Naturally occurring” or “wild-type” refers to the form found in nature. For example, a naturally occurring or wild-type polypeptide or polynucleotide sequence is a sequence present in an organism that can be isolated from a source in nature and which has not been intentionally modified by human manipulation.
“Recombinant” or “engineered” or “non-naturally occurring” when used with reference to, e.g., a cell, nucleic acid, or polypeptide, refers to a material, or a material corresponding to the natural or native form of the material, that has been modified in a manner that would not otherwise exist in nature, or is identical thereto but produced or derived from synthetic materials and/or by manipulation using recombinant techniques. Non-limiting examples include, among others, recombinant cells expressing genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise expressed at a different level.
“Percentage of sequence identity,” “percent identity,” and “percent identical” are used herein to refer to comparisons between polynucleotide sequences or polypeptide sequences, and are determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which either the identical nucleic acid base or amino acid residue occurs in both sequences or a nucleic acid base or amino acid residue is aligned with a gap to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity. Determination of optimal alignment and percent sequence identity is performed using the BLAST and BLAST 2.0 algorithms (see e.g., Altschul et al., 1990, J. Mol. Biol. 215: 403-410 and Altschul et al., 1977, Nucleic Acids Res. 3389-3402). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information website.
Briefly, the BLAST analyses involve first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as, the neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5. N=−4, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff 1989, Proc Natl Acad Sci USA 89:10915).
Numerous other algorithms are available that function similarly to BLAST in providing percent identity for two sequences. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman, 1981. Adv. Appl. Math. 2:482, by the homology alignment algorithm of Needleman and Wunsch, 1970, J. Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85:2444, by computerized implementations of these algorithmns (GAP, BESTFIT, FASTA, and TFASTA in the GCG Wisconsin Software Package), or by visual inspection (see generally, Current Protocols in Molecular Biology, F. M. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1995 Supplement) (Ausubel)). Additionally, determination of sequence alignment and percent sequence identity can employ the BESTFIT or GAP programs in the GCG Wisconsin Software package (Accelrys, Madison W1), using default parameters provided.
“Reference sequence” refers to a defined sequence to which another sequence is compared. A reference sequence is not limited to wild-type sequences, and can include engineered or altered sequences. For example, a reference sequence can be a previously engineered or altered amino acid sequence. A reference sequence also may be a subset of a larger sequence, for example, a segment of a full-length gene or polypeptide sequence. Generally, a reference sequence is at least 20 nucleotide or amino acid residues in length, at least 25 residues in length, at least 50 residues in length, or the full length of the nucleic acid or polypeptide. Since two polynucleotides or polypeptides may each (1) comprise a sequence (i.e., a portion of the complete sequence) that is similar between the two sequences, and (2) may further comprise a sequence that is divergent between the two sequences, sequence comparisons between two (or more) polynucleotides or polypeptide are typically performed by comparing sequences of the two polynucleotides over a comparison window to identify and compare local regions of sequence similarity.
“Comparison window” refers to a conceptual segment of at least about 20 contiguous nucleotide positions or amino acids residues wherein a sequence may be compared to a reference sequence of at least 20 contiguous nucleotides or amino acids and wherein the portion of the sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The comparison window can be longer than 20 contiguous residues, and includes, optionally 30, 40, 50, 100, or longer windows.
“Corresponding to”, “reference to” or “relative to” when used in the context of the numbering of a given amino acid or polynucleotide sequence refers to the numbering of the residues of a specified reference sequence when the given amino acid or polynucleotide sequence is compared to the reference sequence. In other words, the residue number or residue position of a given polymer is designated with respect to the reference sequence rather than by the actual numerical position of the residue within the given amino acid or polynucleotide sequence. For example, a given amino acid sequence, such as that of an engineered enzyme, can be aligned to a reference sequence by introducing gaps to optimize residue matches between the two sequences. In these cases, although the gaps are present, the numbering of the residue in the given amino acid or polynucleotide sequence is made with respect to the reference sequence to which it has been aligned.
“Different from” or “differs from” with respect to a designated reference sequence refers to difference of a given amino acid or polynucleotide sequence when aligned to the reference sequence. Generally, the differences can be determined when the two sequences are optimally aligned. Differences include insertions, deletions, or substitutions of amino acid residues in comparison to the reference sequence. Typically, the reference sequence is a naturally occurring sequence from which the sequence with the differences is derived. The present disclosure provides engineered pathways of enzymes, wherein the enzymes are encoded by one or more recombinant polynucleotides having one or more nucleotide sequence differences relative to a reference polynucleotide sequence, which is typically the corresponding naturally occurring polynucleotide from which the recombinant polynucleotide is derived. Further, the nucleotide differences may encode one or more amino acid residue differences in the enzymes, where the encoded amino acid differences, which can include either/or both conservative and non-conservative amino acid substitutions.
“Derived from” as used herein in the context of engineered enzymes, identifies the originating enzyme, and/or the gene encoding such enzyme, upon which the engineering was based.
“Amino acid residue” or “amino acid” or “residue” as used herein refers to the specific monomer at a sequence position of a polypeptide, such as an enzyme.
“Amino acid difference” or “residue difference” refers to a change in the amino acid residue at a position of a polypeptide sequence relative to the amino acid residue at a corresponding position in a reference sequence.
“Conservative amino acid substitution” refers to a substitution of a residue with a different residue having a similar side chain, and thus typically involves substitution of the amino acid in the polypeptide with amino acids within the same or similar defined class of amino acids. By way of example and not limitation, an amino acid with an aliphatic side chain may be substituted with another aliphatic amino acid, e.g. alanine, valine, leucine, and isoleucine; an amino acid with hydroxyl side chain is substituted with another amino acid with a hydroxyl side chain, e.g., serine and threonine; an amino acids having aromatic side chains is substituted with another amino acid having an aromatic side chain, e.g., phenylalanine, tyrosine, tryptophan, and histidine; an amino acid with a basic side chain is substituted with another amino acid with a basic side chain, e.g., lysine and arginine; an amino acid with an acidic side chain is substituted with another amino acid with an acidic side chain, e.g. aspartic acid or glutamic acid: and a hydrophobic or hydrophilic amino acid is replaced with another hydrophobic or hydrophilic amino acid, respectively.
“Non-conservative substitution” refers to substitution of an amino acid in a polypeptide with an amino acid with significantly differing side chain properties. Non-conservative substitutions may use amino acids between, rather than within, the defined groups and affects (a) the structure of the peptide backbone in the area of the substitution (e.g. proline for glycine) (b) the charge or hydrophobicity, or (c) the bulk of the side chain. By way of example and not limitation, an exemplary non-conservative substitution can be an acidic amino acid substituted with a basic or aliphatic amino acid: an aromatic amino acid substituted with a small amino acid; and a hydrophilic amino acid substituted with a hydrophobic amino acid.
“Deletion” refers to modification of the polypeptide by removal of one or more amino acids from the reference polypeptide. Deletions can comprise removal of 1 or more amino acids, 2 or more amino acids, 5 or more amino acids, 10 or more amino acids, 15 or more amino acids, or 20 or more amino acids, up to 10% of the total number of amino acids, or up to 20% of the total number of amino acids making up the polypeptide while retaining enzymatic activity and/or retaining the improved properties of an engineered enzyme. Deletions can be directed to the internal portions and/or terminal portions of the polypeptide. In various embodiments, the deletion can comprise a continuous segment or can be discontinuous.
“Insertion” refers to modification of the polypeptide by addition of one or more amino acids to the reference polypeptide. In some embodiments, the improved engineered enzymes comprise insertions of one or more amino acids relative to the corresponding naturally occurring polypeptide as well as insertions of one or more amino acids to other improved polypeptides. Insertions can be in the internal portions of the polypeptide, or to the carboxy or amino terminus. Insertions as used herein include fusion proteins as is known in the art. The insertion can be a contiguous segment of amino acids or separated by one or more of the amino acids in the naturally occurring polypeptide.
“Fragment” as used herein refers to a polypeptide that has an amino-terminal and/or carboxy-terminal deletion, but where the remaining amino acid sequence is identical to the corresponding positions in the sequence. Fragments can typically have about 80%, 90%. 95%, 98%, and 99% of the full-length polypeptide, for example the FAR enzyme polypeptide of SEQ ID NO: 1. The amino acid sequences of the specific recombinant polypeptides included in the Sequence Listing of the present disclosure include an initiating methionine (M) residue (i.e., M represents residue position 1). The skilled artisan, however, understands that this initiating methionine residue can be removed by biological processing machinery, such as in a host cell or in vitro translation system, to generate a mature protein lacking the initiating methionine residue, but otherwise retaining the enzyme's properties. Consequently, the term “amino acid residue difference relative to SEQ ID NO: 1 at position n” as used herein may refer to position “n” or to the corresponding position (e.g., position (n−1) in a reference sequence that has been processed so as to lack the starting methionine.
“Improved property” as used herein refers to a functional characteristic of an enzyme or host cell that is improved relative to the same functional characteristic of a reference enzyme or reference host cell. Improved properties of the engineered enzymes or host cells comprising engineered pathways disclosed herein can include but are not limited to: increased thermostability, increased solvent stability, increased pH stability, altered pH activity profile, increased activity (including increased rate conversion of substrate to product, or increased percentage conversion in a period of time), increased and/or altered stereoselectivity, altered substrate specificity and/or preference, decreased substrate, product, and side-product inhibition, decreased inhibition by a component of a feedstock, decreased side-product or impurity production, altered cofactor preference, increased expression, increased secretion, as well as increased stability and/or activity in the presence of additional compounds reagents useful for the production of 1,3-butadiene or crotonol.
“Stability in the presence of” as used in the context of improved enzyme properties disclosed herein refers to stability of the enzyme measured during or after exposure of the enzyme to certain compounds/reagents/ions in the same solution with the enzyme. It is intended to encompass challenge assays of stability where the enzyme is first exposed to the compounds/reagents/ions for some period of time then assayed in a solution under different conditions.
“Solution” as used herein refers to any medium, phase, or mixture of phases, in which the recombinant host cells and/or enzymes of the present disclosure are active. It is intended to include purely liquid phase solutions (e.g., aqueous, or aqueous mixtures with co-solvents, including emulsions and separated liquid phases), as well as slurries and other forms of solutions having mixed liquid-solid phases.
“Thermostability” refers to the functional characteristic of retaining activity (e.g. more than 60% to 80%) in the presence of, or after exposure to for a period of time (e.g. 0.5-72 hrs), elevated temperatures (e.g. 30-60° C.) compared to the activity of an untreated enzyme.
“Solvent stability” refers to the functional characteristic of retaining activity (e.g., more than 60% to 80%) in the presence of, or after exposure to for a period of time (e.g. 0.5-72 hrs), increased concentrations (e.g., 5-99%) of solvent compared to the activity of an untreated enzyme.
“pH stability” refers to the functional characteristic of retaining activity (e.g., more than 60% to 80%) in the presence of, or after exposure to for a period of time (e.g. 0.5-72 hrs), conditions of high or low pH (e.g., pH 2 to 12) compared to the activity of an untreated enzyme.
“Increased activity” or “increased enzymatic activity” refers to an improved property of an enzyme (e.g., FAR enzyme), which can be represented by an increase in specific activity (e.g., product produced/time/weight protein) or an increase in percent conversion of the substrate to the product (e.g. percent conversion of crotonyl-CoA to crotonol in a specified time period using a specified amount of a FAR enzyme) as compared to a reference enzyme under suitable reaction conditions. Any property relating to enzyme activity may be altered, including the classical enzyme properties of Km, Vmax or kcat, changes of which can lead to increased enzymatic activity. Improvements in enzyme activity can be from about 1.1-times the enzymatic activity of the corresponding naturally occurring enzyme, to as much as 1.2-times, 1.5-times, 2-times, 3-times. 4-times, 5-times, 6-times, 7-times, or more than 8-times the enzymatic activity than the naturally occurring parent enzyme. It is understood by the skilled artisan that the activity of any enzyme is diffusion limited and hence, any improvements in the enzyme activity of the enzyme will have an upper limit related to the diffusion rate of the substrates acted on by the enzyme. Methods to determine enzyme activity can depend on the particular enzyme, substrate, and product, and are well-known in the art. Comparisons of enzyme activities are made, e.g., using a defined preparation of enzyme, a defined assay under a set of conditions, as further described in detail herein. Generally, when lysates are compared, the numbers of cells and the amount of protein assayed are determined as well as use of identical expression systems and identical host cells to minimize variations in amount of enzyme produced by the host cells and present in the lysates.
“Conversion” refers to the enzymatic conversion of the substrate to the corresponding product. “Percent conversion” refers to the percent of the substrate that is reduced to the product within a period of time under specified conditions. Thus, the “enzymatic activity” or “activity” of an enzyme can be expressed as “percent conversion” of the substrate to the product.
“Isolated” as used herein in the context of enzymes or compounds such as “isolated crotonol” refers to a molecule which is substantially separated from other contaminants that naturally accompany it. The term embraces isolated compounds, such as isolated crotonol, which have been made biosynthetically in a recombinant host cell and then are removed or purified from the cellular environment or expression system.
“Coding sequence” refers to that portion of a polynucleotide that encodes an amino acid sequence of a protein (e.g., a gene).
“Heterologous” polynucleotide refers to any polynucleotide that is introduced into a host cell by laboratory techniques, and includes polynucleotides that are removed from a host cell, subjected to laboratory manipulation, and then reintroduced into a host cell.
“Codon optimized” refers to changes in the codons of the polynucleotide encoding a protein to those preferentially used in a particular organism such that the encoded protein is efficiently expressed in the organism of interest. In some embodiments, the polynucleotides encoding the enzymes used in the engineered pathways of the present disclosure may be codon optimized for optimal production from the host organism selected for expression.
“Control sequence” is defined herein to include all components, which are necessary or advantageous for the expression of a polynucleotide and/or polypeptide of the present disclosure. Each control sequence may be native or foreign to the polynucleotide of interest. Such control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator.
“Operably linked” is defined herein as a configuration in which a control sequence is appropriately placed (i.e., in a functional relationship) at a position relative to a polynucleotide of interest such that the control sequence directs or regulates the expression of the polynucleotide and/or polypeptide of interest.
“Expression” includes any step involved in the production of a polypeptide (e.g. encoded enzyme) including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
“Transform” or “transformation,” as used in reference to a host cell, means a host cell has a non-native nucleic acid sequence integrated into its genome or as an episome (e.g. plasmid) that is maintained through multiple generations of the host cell.
“Culturing” refers to growing a population of host cells under suitable conditions in a liquid or solid medium. In particular embodiments, culturing refers to the fermentative bioconversion of a carbon source (e.g., sugar) to an end product (e.g. butadiene).
“Recoverable” as used in reference to producing a composition (e.g. crotonol) by a method of the present invention, refers to the amount of composition which can be isolated from the reaction mixture yielding the composition according to methods known in the art.
“Enzyme class” as used herein refers to the numerical classification scheme for enzymes based on the reaction catalyzed by the enzyme. The enzyme class is designated by the Enzyme Commission (“EC”) number. The EC number classification scheme is well-known in the art and published by International Union of Biochemistry and Molecular Biology (IUBMB) (see at e.g. www.chem.qmul.ac.uk/iubmb enzyme).
“Pathway of enzymes” or “enzyme pathway” refers to a group of enzymes expressed in a host cell that catalyze a series of conversions of substrate to product that are linked together, e.g., the product of the first enzyme is the substrate for the second enzyme, and the product of the second enzyme is the substrate of the third enzyme, and so on. As used herein, the term enzyme pathway may refer to a naturally occurring or an engineered pathway. Further, as used herein, an enzyme pathway may be part of a larger pathway in a cell (i.e., a sub-pathway).
“Host cell” as used herein refers to a living cell or microorganism that is capable of reproducing its genetic material and along with it recombinant genetic material that has been introduced into it e.g., via heterologous transformation.
“Recombinant host cell” as used herein refers to a host cell that has been transformed with recombinant genetic material e.g., one or more recombinant polynucleotides.
“Sugar” as used herein refers to carbohydrate compounds and compositions made up of monosaccharides, disaccharides, trisaccharides, oligosaccharides, and polysaccharides, e.g., fructose, galactose, glucose, ribose, xylose, sucrose, lactose, maltose, maltotriose, starch, cellulose.
“Fermentable sugar” as used herein refers to sugar compounds and compositions that can be metabolized by a recombinant host cell. Exemplary fermentable sugars include sugars from sugar cane, starch from feedstock such as corn, from lignocellulosic feedstocks where the cellulose part of a plant is broken down to sugars (e.g. in a saccharitication process) glucose and xylose.
“1,3-Butadiene” or “butadiene” as used herein refers to the diene compound of molecular formula C4H6 having CAS number 106-99-0. IUPAC name: buta-1,3-diene.
“CoA” as used herein refers to coenzyme A, the naturally occurring thiol compound having CAS number 85-61-0.
“ACP” as used herein refers to the acyl carrier protein, the naturally occurring polypeptide that comprises 4′-phosphopantethiene moiety which can forms a thioester linkage with the growing fatty acid chain during the biosynthesis of fatty acids.
“Crotonyl-CoA” or “crotonoyl-CoA” as used herein refers to the thioester compound of crotonyl (either the trans- or the cis-isomer or a mixture thereof) and coenzyme A which has the CAS number 992-67-6. IUPAC name: S-[2-[3-[[4-[[[5-(6-aminopurin-9-yl)-4-hydroxy-3-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl]oxy-2-hydroxy-3,3-dimethylbutanoyl]amino]propanoylamino]ethyl]but-2-enethioate.
“Crotonyl-ACP” or “crotonoyl-ACP” as used herein refers to the compound of a crotonyl moiety (either the trans- or the cis-isomer or a mixture thereof) attached through a thioester linkage to the acyl-carrier protein.
“Crotonol” or “crotyl alcohol” (IUPAC name but-2-en-1-ol) as used herein refers to the unsaturated alcohol compound which may be present as either the (E)-isomer (“trans”, CAS 504-61-0), the (Z)-isomer (“cis”, CAS 4088-60-2) or a mixture of (E) and (Z) in any combination which has the CAS number 6117-91-5.
“FAR enzyme” or “fatty acyl reductase” refers to an enzyme that catalyzes reduction of a fatty acyl-CoA, a fatty acyl-ACP, or other fatty acyl thioester substrate directly to its corresponding fatty alcohol with the reducing equivalents provided by the oxidation of NAD(P)H to NAD(P)+. (EC 1.1.1*) The enzymatic reaction catalyzed by a FAR enzyme on fatty acyl-CoA can be represented by:
fatty acyl-CoA+2NAD(P)H→fatty alcohol+2NAD(P)+
In contrast to the FAR enzyme, where a single enzyme is capable of catalyzing this reduction to the fatty alcohol, typically, the enzymatic reduction of fatty acyl-CoA molecules to fatty alcohols is catalyzed two distinct reductase enzymes: (1) an “acyl-CoA reductase” which reduces the acyl-CoA substrate to its corresponding fatty aldehyde (e.g., enzyme of class EC 1.2.1.50); and (2) an “fatty aldehyde reductase” (e.g. an oxidoreductase) reduces the fatty aldehyde to the fatty alcohol (e.g., an enzyme of class EC 1.1.1.1). Such a two-enzyme reduction can be represented by:
fatty acyl-CoA+NAD(P)H→fatty aldehyde+NAD(P)+
fatty aldehyde+NAD(P)H→fatty alcohol+NAD(P)+
The present disclosure provides recombinant host cells that comprise engineered pathways of enzymes that are useful for the production of 1,3-butadiene. Generally, the engineered pathways introduced into the host cells by transforming the host cells with one or more recombinant polynucleotides encoding one or more of the enzymes in the pathway. The recombinant host cells thereby produced are capable of expressing the encoded enzyme(s) such that the substrate-to-product conversions of the engineered pathway are carried out biosynthetically and host cell produces the desired product compound of the pathway. The relevant portions of the engineered pathways are illustrated schematically in
In some embodiments, the recombinant host cells that comprise an engineered pathway of enzymes are capable of producing crotonol. In such embodiments, the recombinant host cell comprises a recombinant polynucleotide encoding an enzyme capable of converting crotonyl-CoA and/or crotonyl-ACP to crotonol (see conversion of
In some embodiments, the recombinant host cells that comprise an engineered pathway of enzymes are capable of producing the compound 1,3-butadiene biosynthetically. In such embodiments, the recombinant host cells comprise: (a) a recombinant polynucleotide encoding a FAR enzyme capable of converting crotonyl-CoA and/or crotonyl-ACP to crotonol (see conversion of
The present disclosure contemplates that any of the exemplary enzymes disclosed herein may be engineered using methods known in the art (e.g. random PCR, gene shuffling, directed evolution, etc.) to provide variant engineered enzymes having improved properties. Specific improved properties of engineered enzymes useful for the recombinant host cells of the present disclosure can include altered (i.e., increased or decreased) enzyme activity or enzyme expression. For example, decreased enzyme activity or expression may be desirable in many situations, particularly to prevent the detrimental build-up in concentration of product which can be a substrate for another slower downstream enzyme in the pathway.
The engineered enzymes of the present disclosure can be obtained by subjecting the polynucleotide encoding the naturally occurring enzyme (or one or more homologous naturally occurring enzymes) to mutagenesis and/or directed evolution methods. Exemplary techniques for engineering enzymes of the present disclosure can include directed evolution techniques such as mutagenesis and/or DNA shuffling as described in Stemmer, 1994, Proc Natl Acad Sci USA 91:10747-10751; WO 95/22625; WO 97/0078; WO 97/35966; WO 98/27230; WO 00/42651; WO 01/75767 and U.S. Pat. No. 6,537,746. Other directed evolution procedures that can be used include, among others, staggered extension process (StEP), in vitro recombination (Zhao et al., 1998, Nat. Biotechnol. 16:258-261), mutagenic PCR (Caldwell et al., 1994, PCR Methods Appl. 3:S136-S140), and cassette mutagenesis (Black et al. 1996. Proc Natl Acad Sci USA 93:3525-3529). Mutagenesis and directed evolution techniques useful for the purposes herein are also described in e.g., Ling, et al., 1997, Anal. Biochem. 254(2):157-78; Dale et al., 1996, “Oligonucleotide-directed random mutagenesis using the phosphorothioate method,” in Methods Mol. Biol. 57:369-74; Smith, 1985, Ann. Rev. Genet. 19:423-462; Botstein et al., 1985, Science 229:1193-1201; Carter, 1986, Biochem. J. 237:1-7; Kramer et al., 1984, Cell, 38:879-887; Wells et al., 1985, Gene 34:315-323; Minshull et al., 1999, Curr Opin Chem Biol 3:284-290; Christians et al., 1999, Nature Biotech 17:259-264; Crameri et al., 1998, Nature 391:288-291; Crameri et al., 1997, Nature Biotech 15:436-438; Zhang et al., 1997, Proc Natl Acad Sci USA 94:45-4-4509; Crameri et al., 1996, Nature Biotech 14:315-319; Stemmer, 1994, Nature 370:389-391; Stemmer, 1994, Proc Natl Acad Sci USA 91:10747-10751; PCT Publ. Nos. WO 95/22625, WO 97/0078. WO 97/35966, WO 98/27230, WO 00/42651, and WO 01/75767; and U.S. Pat. No. 6,537,746. All publications and patent are hereby incorporated by reference herein.
In some embodiments, it is contemplated that the enzymes disclosed herein are encoded by recombinant polynucleotides having sequences that have been codon optimized for expression in the recombinant host cell. In some embodiments, it is contemplated that the enzymes disclosed herein are encoded by recombinant polynucleotides having sequences that also include control sequences that can increase expression and/or secretion of the enzymes. The control sequences may be ones associated with the enzyme gene in its host organism or associated with the host cell. In some embodiments, it is contemplated that the recombinant polynucleotides that can further comprise a sequence encoding a signal peptide. In such embodiments, the signal peptide may be one that is associated with the enzyme in its naturally occurring organism. In other embodiments, the signal peptide can be one that is associated with a gene found in the recombinant host cell, thereby providing for the improved expression of the enzyme in the host cell.
Exemplary enzymes that can be used in the various substrate-to-product conversion steps of the engineered pathways of the present disclosure are described in greater detail below and in the Examples.
Pathway of
Acetoacetyl-CoA is a naturally occurring metabolic intermediate formed in most host cells by condensation of two acetyl-CoA which is catalyzed by naturally occurring thiolase enzymes (e.g., enzymes of class EC 2.3.1.9 or EC 2.3.1.16). Thiolase enzymes of class EC 2.3.1.9 include the gene products of atoB from E. coli (MetaCyc Accession Number EGI 1672; Nat. Biotechnol. 2003, 21, 796) and ERG 10 from S. cerevisiae (MetaCyc Accession Number YPL028W; J. Biol. Chem. 1994, 269, 1381). In addition to these, other exemplary thiolases useful in the engineered pathways of the recombinant host cells of the present disclosure are shown in Table 1.
Escherichia coli
Escherichia coli
Homo sapiens
Saccharomyces
cerevisiae
Zoogloea
ramigera
Clostridium
acetobutylicum
Escherichia coli
Saccharomyces
cerevisiae
In some embodiments of the present disclosure, an enzyme of Table 1 naturally occurs in the host cell used to prepare the recombinant host cell capable of producing crotonol or 1,3-butadiene. In such an embodiment, no further modification of the host cell is needed to provide expression of an enzyme capable of the conversion of substrate to product of
Pathway of
Alternatively, an engineered pathway through the intermediate acetoacetyl-ACP can be used in the production of crotonol or 1,3-butadiene. Acetoacetyl-ACP is formed in three steps from acetyl-CoA via the intermediacy of malonyl-CoA (acetyl-CoA carboxylase; EC 6.4.1.2), malonyl-ACP (ACP-malonyl transferase; EC 2.3.1.39) with the final step catalyzed by beta-keto-acyl-ACP synthase (EC 2.3.1.41). Each of these enzymes is well known and exemplary enzymes of these classes are shown in Table 2.
Saccharomyces
cerevisiae
Escherichia coli
Saccharomyces
cerevisiae
Escherichia coli
Saccharomyces
cerevisiae
Escherichia coli
In some embodiments of the present disclosure, an enzyme of Table 2 naturally occurs in the host cell used to prepare the recombinant host cell capable of producing crotonol or 1,3-butadiene. In such an embodiment, no further modification of the host cell is needed to provide expression of an enzyme capable of the conversion of substrate to product of
Pathway of
Reduction of acetoacetyl-CoA (or -ACP) to the (R)- or (S)-3-hydroxybutryl-CoA (or -ACP) is an established reaction in cellular metabolism catalyzed by reductase enzymes in the EC 1.1.1.35, EC 1.1.1.36, EC 1.1.1.157 and EC 1.1.100 class. Useful reductases in these classes include gene products of phaB from R. sphaeroides (MetaCyc Accession Number G-10357; Mol. Microbiol, 2006, 61, 297), phbB from Z. ramigera (MetaCyc Accession Number G-9969; Mol. Microbiol, 1989, 3, 349) and phbB from C. necator (MetaCyc Accession Number G-14621; J. Biol. Chem. 1999, 264, 15293). These and other exemplary reductases of these enzyme classes useful in the recombinant host cells and methods of the present disclosure are shown in Table 3.
Escherichia coli
Arabidopsis
thaliana
Burkholderia
psuedomallei
Burkholderia
psuedomallei
Escherichia coli
Saccharomyces
cerevisiae
Escherichia coli
In some embodiments of the present disclosure, a reductase enzyme of Table 3 naturally occurs in the host cell used to prepare the recombinant host cell capable of producing crotonol or 1,3-butadiene. In such an embodiment no further modification of the host cell is needed to provide expression of an enzyme capable of the conversion of substrate to product of
Pathway of
(R)- or (S)-3-hydroxybutyryl-CoA (or -ACP) is dehydrated by the action of a dehydratase enzyme of classes EC 4.2.1.55 (for CoA) or EC 4.2.1.17 (for ACP) to the corresponding cis-, or trans-enoyl-CoA (or -ACP). Exemplary enzymes in these classes useful in the engineered pathways of the present disclosure include crotonase from R. rubrum (Biochem. 1969, 8, 2748) and others exhibiting crotonase activity from C. acelobutylicum (Meta. Engin. 2008, 10, 305) and C. kluyveri (FEBS Lett. 1972, 21, 351). These and other exemplary enzymes of these classes useful in the engineered pathways are shown in Table 4.
Clostridium
acetobutylicum
Bacillus cereus
Clostridium kluyveri
Homo sapiens
Rattus norvegicus
Mus Musculus
In some embodiments of the present disclosure, a dehydratase enzyme of Table 4 naturally occurs in the host cell used to prepare the recombinant host cell capable of producing crotonol or 1,3-butadiene. In such an embodiment, no further modification of the host cell is needed to provide expression of an enzyme capable of the conversion of substrate to product of
Pathway of
In some embodiments, the conversion of crotonyl-CoA (or crotonyl-ACP) to crotonol at Step D of the pathway of
A number of FAR enzymes obtained from marine bacteria, and engineered enzyme variants thereof, which are useful in preparing the recombinant host cells and methods of the present disclosure are disclosed in International patent publication WO2012/006114, which is hereby incorporated by reference herein. Further detailed description of useful FAR enzymes is provided below.
In certain embodiments, the FAR enzyme and/or functional fragment can be derived or obtained from a γ proteobacterium of the order Alteromonadales. In some embodiments, the FAR enzyme and/or functional fragment can be derived from or obtained from the Alteromonadales family Alteromonadaceae. In certain embodiments, the FAR enzyme and/or functional fragment can be derived from or obtained from an Alteromonadaceae genus such as but not limited to the Alteromonadaceae genus Marinobacter. In certain specific embodiments, the FAR enzyme and/or functional fragment can be derived from the Marinobacter species algicola. In a particular embodiment, the FAR enzyme and/or functional fragment can be derived from or obtained from the M. algicola species strain DG893. In some specific embodiments, the FAR enzyme for use in the methods disclosed herein is from the marine bacterium Marinobacter algicola DG893 (SEQ ID NO: 1) (“FAR_Maa”).
In some embodiments, the FAR enzyme and/or functional fragment is derived or obtained from a species of Marinobacter including, but not limited to, a species selected from M. algicola, M. alkaliphilus, M. aquaeolei, M. arcticus, M. bryozoorum, M. daepoensis, M. excellens, M. flavimaris, M. guadonensis, M. hydrocarhonoclasticus, A. koreenis, M. lipolyticus, M. litoralis, M. lutaoensis, M. maritimus. M. sediminum, M. squalenivirans and M. vinifirmus and equivalent and synonymous species thereof.
In one specific embodiment, the FAR enzyme is derived or obtained from M. algicola strain DGX893 and has an amino acid sequence that is at least 70% identical. at least 75%, at least 80% identical, at least 85% identical, at least 90% identical, at least 93% identical at least 95% identical, at least 97% identical and/or at least 98% identical to SEQ ID NO: 1 or a functional fragment thereof. In another specific embodiment, the isolated FAR enzyme has an amino acid sequence that is identical to SEQ ID NO: 1.
In one specific embodiment, the FAR enzyme is derived or obtained from Marinhbacter aquaeolei (e.g., M. aquaeolei VT8) and has an amino acid sequence that is at least at least 70% identical, at least 75%, at least 80% identical, at least 85% identical, at least 90% identical, at least 93% identical, at least 95% identical, at least 97% identical and/or at least 98% identical to SEQ ID NO: Y or a functional fragment thereof. In another specific embodiment, the isolated FAR enzyme has an amino acid sequence that is identical to SEQ ID NO: 2.
In various embodiments, the isolated FAR enzyme and/or functional fragment is obtained or derived from a marine bacterium selected from the group of Meptuniibacter caesariensis species strain MED92, Reinekea sp. strain MED297, Marinomonas sp. strain MED121, unnamed gammaproteobacterium strain HTCC2207 and Marinobacter sp. strain ELB 17 and equivalents and synonymous species thereof.
In various embodiments, the FAR enzyme and/or functional fragment can be derived or obtained from a γ proteobacterium of the order Oceanospirillilales. In some embodiments, the FAR enzyme and/or functional fragment can be derived from or obtained from the Oceanospirillilales family Oceanospirillaceae. In certain embodiments, the FAR enzyme and/or functional fragment can be derived from or obtained from an Oceanospirillaceae genus, such as but not limited to Oceanobacter. In a particular embodiment, the FAR enzyme and/or functional fragment can be derived from or obtained from the Oceanobacter species strain RED65 and has an amino acid sequence that is at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, at least 90% identical, at least 93% identical, at least 95% identical, at least 97% identical and/or at least 98% identical to SEQ ID NO: 3 or a functional fragment thereof. In another specific embodiment, the FAR enzyme for use in the methods disclosed herein comprises or consists of a sequence having 100% identity to the sequence of SED ID NO: 3 (“FAR_Ocs”). In other specific embodiments, the isolated FAR enzyme or functional fragment is obtained or derived from Oceanobacter kriegii. In still other specific embodiments, the isolated FAR enzyme or functional fragment is obtained or derived from Oceanobacter strain WH099.
In various embodiments, the FAR enzyme is from a marine bacterium and is selected from the group consisting of FAR_Hch (Hahella chejuensis KCTC 2396 GenBank YP—436183.1); FAR_Mac (from marine Actinobacterium strain PHSC20C1), FAR_JVC (JCVI_ORF—1096697648832, GenBank Accession No. EDD40059.1; from a marine metagenome), FAR_Fer (JCVI_SCAF—1101670217388; from a marine bacterium found at a depth of 12m in an upwelling in the area of Fernandina Island, the Galapagos Islands, Ecuador), FAR_Key (JCVI_SCAF—1097205236585, from a marine bacterium found at a depth of 1.7m off the coast of Key West Florida), and FAR_Gal (JCVI_SCAF—1101670289386, at a depth of 0.1 m at Isabella Island, Galapagos Islands, Ecuador). Approximate sequence identity to M. algicola DG893 (FAR_Maa) and Oceanobacter sp. RED65 (FAR_Ocs) is given in Table 5.
In one particular embodiment, the FAR enzyme is isolated or derived from the marine bacterium FAR_Gal. In other embodiments, the FAR enzyme or functional fragment is isolated or derived from an organism selected from the group consisting of Vitis vinifera (GenBank Accession No. CA022305.1 or CAO67776.1), Desulfatibacillum alkenivorans (GenBank Accession No. NZ_ABII01000018.1), Stigmatella aurantiaca (NZ_AAMD01000005.1) and Phytophthora ramorum (GenBank Accession No.: AAQXO 1001105.1). Also included are bfar from Bombyx mori (which encodes FAR enzyme polypeptide of SEQ ID NO: 4); hfar from H. sapiens, jjfar from Simmondsia chinensis, MS2 from Zea mays, MS2, FAR4, FAR6, or FER4 from Arabidopsis thaliana (e.g. FAR6 having Accession NP—115529); mfar1 and mfar2 from Atus musculzs; or Adhe2 (AFG40749.1 GI:383103240) from E. coli P12B.
In certain embodiments, a FAR enzyme or functional fragment thereof that is especially suitable for the production of fatty alcohols is identified by the presence of one or more domains, which are found in proteins with FAR activity. In various embodiments, the one or more domains is identified by multiple sequence alignments using hidden Markov models (“HMMs”) to search large collections of protein families, for example, the Pfam collection available at http://pfam.sanger.ac.uk/. See R. D. Finn et al. (2008) Nucl. Acids Res. Database Issue 36:D281-D288.
In certain embodiments, the one or more protein domains by which FAR enzymes are identified belongs to a family of NAD binding domains found in the male sterility proteins of arabidopsis and drosophila, as well as in the fatty acyl reductase enzyme from the jojoba plant (JJFAR). See Aarts M G et al. (1997) Plant J. 12:615-623. This family of binding domains is designated “NAD_binding—4” (PF07993; see http://pfam.sanger.ac.uk/family?acc=PF07993). In various embodiments, the NAD_binding—4 domain is found near the N-terminus of the putative FAR enzyme. In various embodiments, the one or more protein domains by which enzymes with FAR activity are identified belongs to a family of domains known as a “sterile” domain (PF03015; see hup:/ipfam.sanger.ac.uk/family?acc=PF03015), which are also found in the male sterility proteins of Arabidopsis species and a number of other organisms. See Aarts M G et al. (1997) Plant J. 12:615-623. In particular embodiments, the sterile domain is found near the C-terminus of the putative FAR enzyme. In certain specific embodiments, a FAR enzyme for use in the methods described herein is identified by the presence of at least one NAD_binding—4 domain near the N-terminus and the presence of at least one sterile domain near the C-terminus.
In certain embodiments, the NAD_binding—4 domain of the putative FAR enzyme has an amino acid sequence that is at least 10%, such as at least 15%, such as at least 20%, such as at least 25%, such as at least 30%, such as at least 35%, such as at least 40%, such as at least 45%, such as at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 85%, such as at least 90% or more identical to the amino acid sequence of a known NAD_binding—4 domain. See, e.g., Aarts M G et al. (1997) Plant J. 12:615-623. In various embodiments, the sterile domain of the putative FAR enzyme has an amino acid sequence that is at least 10%, such as at least 15%, such as at least 20%, such as at least 25%, such as at least 30%, such as at least 35%, such as at least 40%, such as at least 45%, such as at least 50% or more identical to the amino acid sequence of a known sterile domain. See id.
In some embodiments, the NAD_binding—4 domain of the putative FAR enzyme has an amino acid sequence that is at least 10%, such as at least 15%, such as at least 20%, such as at least 25%, such as at least 30%, such as at least 35%, such as at least 40%, such as at least 45%, such as at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 85%, such as at least 90% or more identical to the amino acid sequence of one or more example polypeptides that form the definition of the NAD_binding—4 Pfam domain (PF07993). In certain embodiments, the sterile domain of the putative FAR enzyme has an amino acid sequence that is at least 10%, such as at least 15%, such as at least 20%, such as at least 25%, such as at least 30%, such as at least 35%, such as at least 40%, such as at least 45%, such as at least 50% or more identical to the amino acid sequence of one or more example polypeptides that form the definition of the sterile Pfam domain (PF03015). In various embodiments, the NAD_binding—4 domain or the sterile domain of the putative FAR enzyme is identified by an E-value of 1×104 or less, such as an E-value of 1×10−5, such as an E-value of 1×0−10, such as an E-value of 1×10−15, such as an E-value of 1×10−20, such as an E-value of 1×10−25, such as an E-value of 1×10−30 or lower. As used herein, the term E-value (expectation value) is the number of hits that would be expected to have a score equal or better than a particular hit by chance alone. Accordingly, the E-value is a criterion by which the significance of a database search hit can be evaluated. See, e.g., http://pfam.sanger.ac.uk/help; http://www.csb.yale.edu/userguides/seq/hmmer/docs/node5.html.
The FAR enzymes described herein have not previously been recognized as FAR enzymes because of the low homology of the FAR coding sequences to the sequences coding for proteins with known FAR activity, such as the FAR enzymes from S. chinensis ((FAR Sim); GenBank Accession no. AAD38039.1; gi|5020215|gb|AAD38039.1|AF149917—1 acyl CoA reductase [Simmondsia chinensis]—Plant Physiol. 2000 March; 122(3):635-44. Purification of a jojoba embryo fatty acyl-coenzyme A reductase and expression of its cDNA in high erucic acid rapeseed; Metz J G, Pollard M R, Anderson L, Hayes T R, Lassner M W. PMID: 10712526), B. mori ((FAR Bom); GenBank Accession no. BAC79425.1; gi|33146307|dbj|BAC79425.1| fatty-acyl reductase [Bombyx mori]; Proc Natl Acad Sci USA 2003 Aug. 5; 100(16):9156-61. Epub 2003 Jul. 18. Pheromone gland-specific fatty-acyl reductase of the silkmoth, Bombyx mori. Moto K, Yoshiga T, Yamamoto M, Takahashi S, Okano K. Ando T, Nakata T, Matsumoto S. PMID: 12871998), Arabidopsis thaliana (GenBank Accession no. DQ446732.1 or NM—115529.1; gi|91806527|gb|DQ446732.1|Arabidopsis thaliana clone pENTR221-At3g44560; gi|18410556|ref|NM—115529.1| Arabidopsis thaliana male sterility protein, putative (AT3G56700); Plant Physiol. 2009 May 15; 166(8):787-96. Epub 2008 Dec. 4. Functional expression of five Arabidopsis fatty acyl-CoA reductase genes in Escherichia coli. Doan T T, Carlsson A S, Hamberg M, Billow L, Stymne S, Olsson P. PMID: 19062129) or Ostninia scapulalis (GenBank Accession no. EU817405.1; gi|210063138|gb|EU817405.1| Ostrinia scapulalis FAR-like protein XIII; Insect Biochem. Mol. Biol. 2009 February; 39(2):90-5. Epub 2008 Oct. 26 Pheromone-gland-specific fatty-acyl reductase in the adzuki bean borer, Ostrinia scapulalis (Lepidoptera: Crambidae) Antony B, Fujii T, Moto K, Matsumoto S, Fukuzawa M, Nakano R, Tatsuki S, Ishikawa Y.).
Pathway of
As an alternative to the pathway of
A number of acyl-CoA (or -ACP) reductase enzymes in class 1.2.1 are known to have the ability to reduce fatty acyl-CoA compounds to the corresponding fatty aldehydes, and are provided in Table 6.
Specific exemplary fatty acyl-CoA reductase enzymes classes EC 1.2.1.50, EC 1.2.1.76 and EC1.2.1.n2 that could be used in the engineered pathway of
Photobacterium
leiognathi
Clostridium kluyveri
Acinetobacter sp.
Gallus gallus
Arabidopsis
thaliana
Arabidopsis
thaliana
Arabidopsis
thaliana
Arabidopsis
thaliana
Arabidopsis
thaliana
There are numerous alcohol dehydrogenasesketoreductase that have been well-studied functionally and structurally, including extensive engineering to provide enzymes having improved properties. Engineered ketoreductases having improved properties (e.g., increased activity, enantioselectivity, and/or thermostability) are described in the patent publications US 20080318295A1; US 20090093031 A1; US 20090155863A1; US 20090162909A1; US 20090191605A1; US 20100055751A ; WO/2010/025238A2; WOi/2010/025287A2; and US 20100062499A1; each of which are incorporated by reference herein. Exemplary enzymes of this class, either as the wild type or after enzyme engineering/evolution, which are capable of reducing fatty aldehydes to the corresponding alcohol are shown in Table 8:
Thermoanaerobacter
brockii
Rhodococcus ruber
Lactobacillus brevis
Lactobacillus kefir
Kluyveromyces lactis
Candida boidinii
Saccharomyces
cerevisiae
Bacillus
stearothermophilus
Escherichia coli
In some embodiments of the present disclosure, a reductase enzyme of Table 7 or Table 8 naturally occurs in the host cell used to prepare the recombinant host cell capable of producing crotonol or 1,3-butadiene. In such an embodiment, no further modification of the host cell is needed to provide expression of an enzyme capable of the conversion of substrate to product of
Pathway of
The conversion of a hydroxyl group (e.g., as in an alcohol) to the corresponding phosphate ester is an ubiquitous reaction found in all organisms. Accordingly, there are a large number of alcohol kinase enzymes in class EC 2.7.1.x that are known to catalyze conversion of an alcohol to a phosphate as shown in Table 9.
In particular, based on their known activity and structure, the alcohol kinase enzymes in classes EC 2.7.1.30, EC 2.7.1.32, EC 2.7.1.36, EC 2.7.1.39 and EC 2.7.1.82 are well-suited for converting crotonol to the corresponding phosphate compound, but-2-enyl phosphate. Some exemplary alcohol kinases include glycerol kinase (EC 2.7.1.30; J. Biol. Chem. 1955, 211, 951), choline kinase (EC 2.7.1.32; J. Biol. Chem. 1953, 202, 431), mevalonate kinase (EC 2.7.1.36; J. Biol. Chem. 1958, 233, 1100), homoserine kinase (EC 2.7.1.39; J. Biochem. 1957, 44, 299), ethanolamine kinase (EC 2.7.1.82; Biochim. Biophys. Acta. 1972, 276, 143). Additionally, phosphorylation of simple alcohols by bacterial (S. felxneri and S. enterica) non-specific acid phosphatases (UniProt Q71EB8) has been demonstrated (Adv. Synth. Catal. 2005, 347, 1155). Also, it has been reported that isopentyl phosphate kinase from peppermint (Mentha×piperita) which normally phosphorylates isopentyl phosphate to the corresponding pyrophosphate also has activity on converting isopentenol and dimethylallyl alcohol to the corresponding phosphate (PNAS 1999, 96, 13714). These and other exemplary alcohol kinase enzymes from these classes that could be used in preparing an engineered pathway of
Saccharomyces cerevisiae
Escherichia coli (strain K12)
Homo sapiens
Mus musculus
Mus musculus
Caenorhabditis elegans
Saccharomyces cerevisiae
Homo sapiens
Dictyostelium discoideum
Methanocaldococcus jannaschii
Rattus norvegicus
Saccharomyces cerevisiae
Arabidopsis thaliana
Saccharomyces cerevisiae
Escherichia coli (strain K12)
Methanocaldococcus jannaschii
In some embodiments of the present disclosure, an alcohol kinase enzyme of Table 10 naturally occurs in the host cell used to prepare the recombinant host cell capable of producing crotonol or 1,3-butadiene. In such an embodiment, no further modification of the host cell is needed to provide expression of an enzyme capable of the conversion of substrate to product of
Pathway of
The phosphate product of
The vast majority of the phosphate lyase enzymes in Table 11 can be broadly described as “terpene synthases” which are known to eliminate a pyrophosphate and form a tertiary carbocation intermediate. The conversion of but-2-enyl-phosphate to 1,3-butadiene in the engineered pathway of
Exemplary isoprene synthases that can be engineered to carry out the desired transformation include isoprene synthase from P. alba (FEBS Lett. 2005, 579, 2514), P. Montana (Metabol. Engin. 2010, 12, 70), P. tremula×P. alba (Planta 2001, 213, 483). Alternatively, butadiene is produced by the action of monoterpene synthases (e.g. from S. officinalis in J. Biol. Chem. 1998, 273, 14891; M. alternifolia in Plant Pysio. Biochem. 2004, 42, 875: O. basilicum in Plant Physio. 2004, 136, 3724; A. annua in Plant Physiol. 2002, 130, 477). These and other exemplary terpene synthase enzymes that can be engineered to provide butadiene synthase activity are shown in Table 12.
Populus alba
Papulus tremula x
P. alba
Populus tremuloides
Pueraria lobata
Populus nigra
Populus nigra
Salvia officinalis
Salvia officinalis
Melaleuca
alternifolia
Ocimum basilicum
Ocimum basilicum
Ocimum basilicum
Artemisia annua
In some embodiments, a naturally occurring gene, such as a homolog of a gene in Table 12, having the butadiene synthase activity can be identified. Such a gene can then be used to heterologously transform a host cell which lacks this gene, and/or has such a gene but the native activity is not sufficient. Accordingly, heterologous transformation with a homolog of a gene of Table 12 can provide a recombinant host cell with an improved property (e.g., altered expression of a gene, altered concentration of a substrate and/or product due to use of a non-native gene in the pathway).
In some embodiments, an engineered version of a gene of Table 12, or a engineered version of a homolog of a gene of Table 12, can be used to transform a host cell to provide an enzyme capable of the conversion of substrate to product of
In some embodiments, the present disclosure provides a method for producing a recombinant host cell, wherein the method comprises transforming a suitable host cell with one or more nucleic acid constructs encoding: (a) a FAR enzyme, wherein the enzyme is capable of converting crotonyl-CoA and/or crotonyl-ACP to crotonol; (b) an enzyme capable of converting crotonol to but-2-enyl phosphate; and (c) a enzyme capable of converting but-2-enyl phosphate to 1,3-butadiene.
In some embodiments, the host cell is a bacterial cell. In some embodiments, the host cell is a yeast cell. The transformed or transfected host cell is cultured in a suitable nutrient medium under conditions permitting the expression of the FAR enzyme, the alcohol kinase enzyme, and/or the terpene synthase enzyme. The medium used to culture the cells may be any conventional medium suitable for growing the host cells, such as minimal or complex media containing appropriate supplements. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g. in catalogues of the American Type Culture Collection).
A. Host Cells
The recombinant host cells of the present invention generally comprise a recombinant polynucleotide encoding an enzyme, such as a FAR enzyme. Suitable host cells include, but are not limited to microorganisms including bacteria, yeast, filamentous fungi and algae. In certain embodiments, microorganisms useful as recombinant host cells are wild-type microorganisms. In certain embodiments, host cell is the bacteria Escherichia coli. In some embodiments, the host is a the yeast, and in particular embodiments, an oleaginous yeast.
In various embodiments, microorganisms useful as recombinant host cells are genetically modified. As used herein, “genetically modified” microorganisms include microorganisms having one or more endogenous genes removed, microorganisms having one or more endogenous genes with reduced expression compared to the parent or wild-type microorganism, or microorganisms having one or more genes overexpressed compared to the parent or wild-type microorganism. In certain embodiments, the one or more genes that are overexpressed are endogenous to the microorganism. In some embodiments, the one or more genes that are overexpressed are heterologous to the microorganism.
In certain embodiments, the genetically modified microorganism comprises an inactivated or silenced endogenous gene that codes for a protein involved in the biosynthesis of fatty acyl-CoA substrates. In particular embodiments, the inactive or silenced gene encodes a fatty acyl-ACP thioesterase or a fatty acyl-CoA synthetase (FACS).
In certain embodiments, the genetically modified microorganism alters (i.e., increases or decreases) the expression a gene that encodes one or more of the enzymes in the pathway of enzymes of
B. Prokaryotc Host Cells
In some embodiments, the host cell is a prokaryotic cell. Suitable prokaryotic cells include gram positive, gram negative and gram-variable bacterial cells. In certain embodiments, host cells include, but are not limited to, species of a genus selected from the group consisting of Agrobacterium, Alicyclobacillus. Anabaena, Anacystis, Acinetobacter, Acidothermus, Arthrobacter, Azobacter. Bacillus. Bifidobacterium, Brevibacterium, Butvrivibrio. Buchnera, Campestris, Camplyobacter, Clostridium, Corynehacterium, Chromatium, Coprococcus, Cyanohacteria, Escherichia, Enlerococcus. Enterohacter, Erwinia, Fusobacterium, Faecalihacterium, Francisella, Flavobacterium, Geobacillus, Haemophilus, Helicobacter, Klebsiella, Lactobacillus, Lactococcus, Ilyobacter, Aficrococcus, Microbacterium, Mesorhizohium, Methylobacterium, Methylobacterium, Mycobacterium, Neisseria, Pantoea, Pseudomonas, Prochlorococcus, Rhodobacter, Rhodopseudomonas, Rhodopseudomonas, Roseburia, Rhodospirillum, Rhodococcus, Scenedesmun, Streplomyces, Streptococcus, Svnnecoccus, Siccharomonospora, Staphylococcus, Serratia, Salmonella, Shigella, Thermoanaerobacterium, Tropheryma, Tularensis, Temecula, Thermosynechococcus, Thermococcus, Ureaplasma, Xanthomonas, Xylella, Yersinia and Zymomonas. In particular embodiments, the host cell is a species of a genus selected from the group consisting of Agrobacterium, Arthrobacter, Bacillus, Clostridium, Corynebacterium, Escherichia, Erwinia, Geobacillus, Klebsiella, Lactobacillus, Mrobacterium, Pantoea Rhodococcus, Streptomyces and Zymomonas.
In particular embodiments, the bacterial host cell is a species of the genus Escherichia, e.g., E. coli. E. coli provides a good prokaryotic microorganism for producing a recombinant host cell capable of producing a chemical such as crotonol or 1,3-butadiene under aerobic, anaerobic or microaerobic conditions. Examples of chemicals produced by recombinant E. coli host cells include ethanol, lactic acid, succinic acid, and the like. In certain embodiments, the E. coli is a wild-type bacterium. In various embodiments, the wild-type E. coli bacterial strain useful in the processes described herein is selected from, but not limited to, strain W3110, strain MG1655 and strain BW25113. In other embodiments, the E. coli is genetically modified. Examples of genetically modified E. coli useful as recombinant host cells include, but are not limited to, genetically modified E. coli found in the Keio Collection, available from the National BioResource Project at NBRP E. coli, Microbial Genetics Laboratory, National Institute of Genetics 111 Yata, Mishima, Shizuoka, 411-8540.
In particular embodiments, the genetically modified E. coli comprises an inactivated or silenced endogenous fadD gene, which codes for an acyl-CoA synthetase protein. In other embodiments the genetically modified E. coli comprises an inactivated of silenced endogenous fadK gene, which codes for an endogenous short-chain acyl-CoA synthetase. In still other embodiments, the genetically modified E. coli comprises an inactivated or silenced endogenous fadD gene and an inactivated or silenced endogenous fadK gene. In other embodiments, the genetically modified E. coli comprises an endogenous fadD gene that has reduced expression compared to the parent or wild-type strain. In various embodiments, the genetically modified E. coli comprises an endogenous fadK gene that has reduced expression compared to the parent or wild-type strain.
In certain embodiments, the recombinant host cell is an industrial bacterial strain. Numerous bacterial industrial strains are known and suitable for use in the methods disclosed herein. In some embodiments, the bacterial host cell is a species of the genus Bacillus, e.g., B. thuringiensis, B. anthracis, B. megaterium. B. subtilis, B. lentus, B. circulans, B. pumilus, B. lautus, B. coagulans, B. brevis, B. firmus, B. alkaophius, B. licheniformis, B. clausii, B. stearothermophilus, B. halodurans and B. amnyloliquebciens. In particular embodiments, the host cell is a species of the genus Bacillus and is selected from the group consisting of B. subtilis, B. pwuilus, B. licheniformis, B. clausii, B. stearothernophilus, B. megaterinum and B. amnyloliquefaciens.
In some embodiments the bacterial host cell is a species of the genus Erwinia, e.g. E. uredovwra, E. carotvora, E. ananas, E. herbicola, E. punctata or E. terreus.
In other embodiments the bacterial host cell is a species of the genus Pantoea, e.g., P. citrea or P. agglomerans.
In still other embodiments, the bacterial host cell is a species of the genus Streptomnyces, e.g., S. ambofaciens, S. achromogenes, S. avermitilis, S. coelicolor, S. aureofaciens, S. aureus, S. fungicidicus, S. griseus or S. lividans.
In further embodiments, the bacterial host cell is a species of the genus Zymomonas, e.g., Z. mobilis or Z. lipolytica.
In further embodiments, the bacterial host cell is a species of the genus Rhodococcus, e.g. R. opacus.
C. Yeast Host Cells
In certain embodiments, the recombinant host cell is a yeast. In various embodiments, the yeast host cell is a species of a genus selected from the group consisting of Candida, Hansenula, Saccharomyvces, Schizosaccharomyces, Pichia, Kluyverornmyces, and Yarrowia. In particular embodiments, the yeast host cell is a species of a genus selected from the group consisting of Saccharomyces, Candida, Pichia and Yarrowia.
In various embodiments, the yeast host cell is selected from the group consisting of Hansenula polymorpha, Saccharomyces cerevisiae, Saccaromyces carlsbergensis, Saccharornmces diastaticus, Saccharomces norbensis, Saccharomnces kluyveri, Schizosaccharomyces pombe, Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia ferniemtans, Issatchenkia orientalis, Pichia kodamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia quercuum, Pichia pijperi, Pichia stipitis, Pichia methanolica, Pichia angusta, Kluyveromyces lactis, Candida albicans, Candida krusei, Candida ethanolic and Yarrowia lipolytica and synonyms or taxonomic equivalents thereof.
In certain embodiments, the yeast host cell is a wild-type cell. In various embodiments, the wild-type yeast cell strain is selected from, but not limited to, strain BY4741, strain FL100a, strain INVSC1, strain NRRL Y-390, strain NRRL Y-1438, strain NRRL YB-1952, strain NRRL Y-5997, strain NRRL Y-7567, strain NRRL Y-1532, strain NRRL YB-4149 and strain NRRL Y-567. In other embodiments, the yeast host cell is genetically modified. Examples of genetically modified yeast useful as recombinant host cells include, but are not limited to, genetically modified yeast found in the Open Biosystems collection found at http://www.openbiosystems.com/GeneExpression/YeastYKO/. See Winzeler et al. (1999) Science 285:901-906.
In other embodiments, the recombinant host cell is an oleaginous yeast. Oleaginous yeast are organisms that accumulate lipids such as tri-acylglycerols. Examples of oleaginous yeast include, but are not limited to, organisms selected from the group consisting of Yarrowia lipolytica, Yarrowia paralipolytica, Candida revkaufi, Candida pulcherrima, Candida tropicalis, Candida utilis, Candida curvata D, Candida curvala R, Candida diddensiae, Candida holdinii, Rhodotorula glutinous, Rhodotorula graminis, Rhodotorula mucilaginosa, Rhodotorula minula, Rhodotorula bacarum, Rhodosporidium onruloides, Cryptococcus (terricolus) albidus var. albidus, Cryplococcus laurentii, Trichosporon pullans, Trichosporon cutaneum, Trichosporon cutancum, Trichosporon pullulans, Lipomyces starkeyii, Lipomyces lipoferus, Lipomyces letrasponrus, Endomy opsis vernalis, Hansenula cierri, Hlansenula saturnus, and Trigonopsis variables. In particular embodiments, the oleaginous yeast is Y. lipolytica. In certain embodiments, Yarrowia lipolytica strains include, but are not limited to, DSMZ 1345, DSMZ 3286, DSMZ 8218, DSMZ 70561, DSMZ 70562, and DSMZ 21175.
In certain embodiments, the oleaginous yeast is a wild-type organism. In other embodiments, the oleaginous yeast is genetically modified.
In yet other embodiments, the recombinant host cell is a filamentous fungus. In certain embodiments, the filamentous fungal host cell is a species of a genus selected from the group consisting of Achlya, Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Cephalosporium, Chrysosporium, Cochliobolus, Conynascus, Cryphonectria, Cryptococcus, Coprinus, Coriolus, Diplodia, Endothis, Fusarium, Gibberella, Gliocladium, Humicola, Hypocrea, Myceliophthora, Mucor, Neurospora, Penicillium, Podospora, Phlebia, Piromyces, Pyricularia, Rhizomucor, Rhizopus. Schizophyllum, Scytalidium, Sporotrichum, Talaromyces, Thermoascus, Thielavia, Trametes, Tolypocladium, Trichoderma, Verticillium, Volvariella, and teleomorphs, synonyms or taxonomic equivalents thereof.
In some embodiments, the filamentous fungal host cell is an Aspergillus species, a Chrysosporium species, a Corynascus species, a Fusarium species, a Humicola species, a Myceliophthora species, a Neurospora species, a Penicillum species, a Tolypocladium species, a Tramates species, or Trichoderma species. In other embodiments, the Trichoderma species is selected from T. longibrachiatum, T. viride, Hypocrea jecorina and T. reesei; the Aspergillus species is selected from A. awamori, A. jirnigatus, A. japonicus, A. nidulans, A. niger, A. aculeatus. A. foetidus, A. oryzae, A. sojae, and A. kawachi; the Chrsosporium species is C. lucknowense; the Fusarium species is selected from F. graminum, F. oxysporum and F. venenatum; the Myceliophihora species is M. thermophilia; the Neurospora species is N. crassa; the Humicola species is selected from H. insolens, H. grisea, and H. lanuginosa; the Penicillum species is selected from P. purpurogenum, P. chrysogenum, and P. verruculosum; the Thielavia species is T. terrestris; and the Trametes species is selected from T. villosa and T. versicolor.
In some embodiments, the filamentous fungal host is a wild-type organism. In other embodiments, the tilamentous fungal host is genetically modified.
In certain particular embodiments, recombinant host cells for use in the methods described herein are derived from strains of Escherichia coli, Bacillus, Saccharomyces, Streptomyces and Yarrowia.
In certain embodiments the host cell is a Yarrowia cell, such as a Y. lipolytica cell.
Cells which are useful in the practice of the present disclosure include prokaryotic and eukaryotic cells which are readily accessible from a number of culture collections and other sources, e.g., the American Type Culture Collection (ATCC), Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ) (German Collection of Microorganisms and Cell Culture), Centraalbureau Voor Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional Research Center (NRRL). Yarrowia lipolytica is available, as a non-limiting example, from the ATCC under accession numbers 20362, 18944, and 76982.
In some embodiments, the recombinant host cell comprising a polynucleotide encoding a FAR enzyme described herein, further lacks a gene encoding a fatty acyl-CoA synthetase (FACS) and/or a gene encoding a fatty acyl-ACP thiocsterase (TE). Without being bound to a particular theory, crotonol, and subsequent 1,3-butadiene, production may be increased in a recombinant host cell lacking a gene encoding a FACS and/or a TE because silencing or inactivating the FACS and/or TE gene may inactivate a competing biosynthetic pathways. Accordingly, in some embodiments, the recombinant E. coli host cells of the present disclosure can further comprise a silenced or inactivated fatty acyl-CoA synthetase fadD gene and/or silenced or inactivated short chain fatty acyl-CA synthetase fdK gene. The recombinant E. coli host can be genetically modified to be silenced or inactivated in one or more of the additional genes described above.
D. Host Cell Transformation and Culture
Recombinant polynucleotides of the disclosure, e.g. polynucleotides encoding FAR enzyme, may be introduced into host cells for expression of the FAR enzyme in the engineered pathway of
In some embodiments, a host cell is transformed with a recombinant polynucleotide encoding a enzyme in an engineered pathway of
Methods, reagents and tools for transforming host cells described herein, such as bacteria (include E. coli), yeast (including oleaginous yeast) and filamentous fungi are known in the art. General methods, reagents and tools for transforming, e.g., bacteria can be found, for example, in Sambrook et al (2001) Molecular Cloning: A Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, New York. Methods, reagents and tools for transforming yeast are described in “Guide to Yeast Genetics and Molecular Biology,” C. Guthrie and G. Fink, Eds., Methods in Enzymology 350 (Academic Press, San Diego, 2002). Methods, reagents and tools for transforming, culturing, and manipulating Y. lipolytica are found in “Yarrowia lipolytica.” C. Madzak, J. M. Nicaud and C. Gaillardin in “Production of Recombinant Proteins. Novel Microbial and Eucaryotic Expression Systems,” G. Gellissen, Ed. 2005, which is incorporated herein by reference for all purposes. In some embodiments, introduction of the DNA construct or vector of the present disclosure into a host cell can be effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, PEG-mediated transformation, electroporation, or other common techniques (See Davis et al., 1986, Basic Methods in Molecular Biology, which is incorporated herein by reference).
The recombinant host cells can be cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants, or amplifying the expression of certain pathway enzymes (e.g., the FAR enzyme of
A. Biosynthetic Production and Isolation of Crotonol or 1,3-butadiene
The present disclosure also provides methods for producing crotonol or 1,3-butadiene by fermentation of the recombinant host cells comprising one or more recombinant polynucleotides as described herein. As noted elsewhere herein, in some embodiments, the recombinant host cells comprise an engineered pathway of enzymes that provide for the ability to produce crotonol biosynthetically (see e.g.,
Generally, in the embodiments of the methods for producing the fermentation products describe above and else herein, the fermentable sugar may comprise products of a cellulosic saccharification process, including, for example, mono-, di-, and trisaccharides (e.g. glucose, xylose, sucrose, maltose, and the like), and more complex polysaccharide carbohydrates (e.g. lignocellulose, xylans, cellulose, starch, and the like), and the like. Compositions of fermentation media suitable for the growth of recombinant host cells such as E. coli, yeast, and filamentous fungi are well known in the art. See, for example, Yeast Protocols (1st and 2nd edition), Hahan-Hagerdal Microbial Cell Factories 2005, Walker Adv. In Applied Microbiology (2004), which is incorporated herein by reference.
Fermentation conditions suitable for generating the desired fermentation product, crotonol, are well known in the art. The suitable conditions can comprise aerobic, microaerobic or anaerobic conditions. In some embodiments, the suitable conditions for fermentation can comprise anaerobic conditions. Typical anaerobic conditions are the absence of oxygen (i.e., no detectable oxygen), or less than about 5, about 2.5, or about 1 mmol/L/h oxygen. In the absence of oxygen, the NADH produced in glycolysis cannot be oxidized by oxidative phosphorylation. Under anaerobic conditions, pyruvate or a derivative thereof may be utilized by the host cell as an electron and hydrogen acceptor in order to generate NAD+. In certain embodiments of the present disclosure, when the fermentation process is carried out under anaerobic conditions, pyruvate may be reduced to a fermentation product such as ethanol, butanol, or lactic acid.
Typically, the suitable conditions comprise running the fermentation at a temperature that is optimal for the recombinant host cell. For example, the fermentation process may be performed at a temperature in the range of from about 25° C. to about 42° C. Typically the process is carried out a temperature that is less than about 38° C. less than about 35° C., less than about 33° C., less than about 38° C., but at least about 20° C., 22° C. or 25° C.
In some embodiments of the methods, the recombinant host cells of the present disclosure are grown under batch or continuous fermentation conditions. Classical batch fermentation is a closed system, wherein the composition of the medium is set at the beginning of the fermentation and is not subject to artificial alterations during the fermentation. A variation of the batch system is a fed-batch fermentation which also finds use in the present disclosure. In this variation, the substrate is added in increments as the fermentation progresses. Fed-batch systems are useful when catabolite repression is likely to inhibit the metabolism of the cells and where it is desirable to have limited amounts of substrate in the medium. Batch and fed-batch fermentations are common and well known in the art.
Continuous fermentation is carried out using an open system where a defined fermentation generally maintains the culture at a constant high density where cells are primarily in log phase growth. Continuous fermentation systems strive to maintain steady state growth conditions. Methods for modulating nutrients and growth factors for continuous fermentation processes as well as techniques for modulating nutrients and growth factors for continuous fermentation processes as well as techniques for maximizing the rate of product formation are well known in the art of industrial microbiology.
B. Chemo-catalytic Dehydration of Crotonol to 1,3-butadiene
As described above, the present disclosure provides recombinant host cells and associated fermentation methods using the cells to produce the compound, crotonol. Crotonol is an alcohol compound with a density of 0.8454 g/cm3 which has a melting point <25° C. and a boiling point of 121.2° C. In some embodiments of the methods, the crotonol is recovered from the fermentation medium. Recovery of crotonol can be carried out using well-known bioindustrial and/or chemical techniques, e.g. extraction, or distillation. Crotonol produced biosynthetically and thereafter recovered from the medium can then be further converted to 1,3-butadiene via a chemo-catalytic dehydration step.
The efficient chemo-catalytic conversion of crotonol to 1,3-butadiene using a solid-acid catalyst, e.g., aluminosilicate, is known in the art (see e.g., Ichikawa et al., J. Mol. Cat. A 2006, 256, 106-112). Other chemo-catalytic dehydration techniques and suitable conditions for the conversion of alcohols to olefins are well known in the art. Typical dehydration catalysts that convert alcohols such as butanols and pentanols into olefins include various acid treated and untreated alumina (e.g., γ-alumina) and silica catalysts and clays including zeolites (e.g. β-type zeolites, ZSM-5 or Y-type zeolites, fluoride-treated β-zeolite catalysts, fluoride-treated clay catalysts, etc.), sulfonic acid resins (e.g., sulfonated styrenic resins such as Amberlyst 15), strong acids such as phosphoric acid and sulfuric acid, Lewis acids such boron trifluoride, and many different types of metal salts including metal oxides (e.g. zirconium oxide or titanium dioxide) and metal chlorides (see e.g. Latshaw B E, Dehydration of Isobutanol to Isobutylene in a Slurry Reactor, Department of Energy Topical Report, February 1994).
Generally, dehydration reactions can be carried out in gas or liquid phase with heterogeneous or homogeneous catalyst systems in many different reactor configurations. Typically, the catalysts used are stable to the water that is generated by the reaction. The water is usually removed from the reaction zone with the product. The resulting alkene(s) either exit the reactor in the gas or liquid phase (e.g., depending upon the specific alkene and reactor conditions) and is (are) captured by a downstream purification process. The water generated by the dehydration reaction exits the reactor with unreacted alcohol and alkene product(s) and is separated by distillation or phase separation. Because water is generated in large quantities in the dehydration step, the dehydration catalysts used are generally tolerant to water and a process for removing the water from substrate and product may be part of any process that contains a dehydration step. For this reason, it is possible to use wet (i.e., up to about 95% or 98% water by weight) alcohol as a substrate for a dehydration reaction and remove this water with the water generated by the dehydration reaction (e.g., using a zeolite catalyst as described U.S. Pat. Nos. 4,698,452 and 4,873.392). Additionally, neutral alumina and zeolites will dehydrate alcohols to alkenes but generally at higher temperatures and pressures than the acidic versions of these catalysts.
Various features and embodiments of the disclosure are illustrated in the following representative examples, which are intended to be illustrative, and not limiting.
This Example illustrates the preparation of a recombinant E. coli host cell that expresses the genes in the engineered pathways of
The following genes of the engineered pathway of
The resulting plasmid containing the genes is used to transform E. coli strain K12 using routine transformation methods. Transformed E. coli cells are pre-cultured in LB medium (Difco) with 0.4% glucose and 30 μg/ml chloramphenicol, incubated at 37° C. and 200 rpm with a 2″ throw for 18 hours. Growth is monitored by measuring the optical density at 600 nm. Fresh LB liquid medium including 0.2% glucose and 30 μg/ml chloramphenicol is inoculated with sufficient cells from the pre-culture to obtain a starting optical density of 0.1. After approximately 2 to 3 hours of growth at 37° C. and 250 rpm with a 2″ throw, an optical density of approximately 0.6 is obtained. Isopropylthioglycoside (IPTG) is added to the cells to a final concentration of 1 mM and the cells are incubated at 30° C. and 200 rpm with a 2″ throw for 30-90 minutes until an OD of approximately 1.2 is obtained. Glucose is added to the cells to a final concentration of 2%, the containers are sealed and crotonol production is monitored by extraction of samples from the fermentation medium into organic solvent followed by crotonol analysis of the organic extract using HPLC or a comparable analysis technique.
The resulting recombinant host cell comprises an engineered pathway of
This isolated crotonol product recovered from the bioreactor is converted to 1,3-butadiene by dehydration over a solid acid chemical catalyst (
This Example illustrates the preparation of a recombinant E. coli host cell that expresses the genes in the engineered pathways of
The following genes of the engineered pathway of
The resulting plasmid containing the genes is used to transform E. coli strain K12 using routine transformation methods. Transformed E. coli cells are pre-cultured in LB medium (Difco) with 0.4% glucose and 30 μg/ml chloramphenicol, incubated at 37° C. and 200 rpm with a 2″ throw for 18 hours. Growth is monitored by measuring the optical density at 600 nm. Fresh LB liquid medium including 0.2% glucose and 30 μg/ml chloramphenicol is inoculated with sufficient cells from the pre-culture to obtain a starting optical density of 0.1. After approximately 2 to 3 hours of growth at 37° C. and 250 rpm with a 2″ throw, an optical density of approximately 0.6 is obtained. Isopropylthioglycoside (IPTG) is added to the cells to a final concentration of 1 mM and the cells are incubated at 30° C. and 200 rpm with a 2″ throw for 30-90 minutes until an OD of approximately 1.2 is obtained. Glucose is added to the cells to a final concentration of 2%, the containers are sealed and 1,3-butadiene production is monitored using GC-FID (Agilent GC-GasPro column, 1 ml head space injection, split 10; Method-203° C. for 2.5 min. 250° C. for 2.5 min (ramp 50° C./min). 203° C. for 2 min) with butadiene eluting at 1.9 minutes.
This Example illustrates methods and conditions for the large scale production of 1,3-butadiene using a recombinant E. coli host cell of Example 2 comprising an engineered pathway of
The 1,3-butadiene is isolated by directing the fermentation off-gas using a gentle nitrogen sweep, first through a chilled scrubber at 0° C. to condense by-products, primarily water vapor, and then to a cryogenic condenser/trap at −20° C. to collect the 1,3-butadiene as a liquid. The remaining by-product gases, primarily nitrogen and CO2, then are vented into the atmosphere.
This Example illustrates how a recombinant E. coli host cell of Example 1 or 2 comprising an engineered pathway of
Briefly, the engineered strain is analyzed as to determine which recombinant gene's expression and/or which enzyme's activity is limiting the production of crotonol and/or 1,3-butadiene. A limiting gene's expression can be increased by increasing the copy number in the host cell. If enzyme activity is limiting, it can also be increased by increased copy number of the gene encoding it. Alternatively, the enzyme's gene is engineered via directed evolution to provide a gene encoding an enzyme having increased activity and the host cell is transformed with that recombinant gene. This general process of identifying the limiting gene and/or enzyme followed by increasing copy number and/or enzyme engineering is iterated until the desired amount of production is achieved from the E. coli host cell.
Additionally, metabolic modeling (Biotechnol. Bioengin 2003, 84, 647-657) is utilized to optimize the recombinant E. coli host cell's growth conditions and to knock out genes in the recombinant host cell that are responsible for metabolic leakage/inefficiencies in the engineered pathways of
This Example illustrates how a recombinant E. coli host cell expressing a FAR enzyme construct is capable of converting the non-natural substrate crotonyl-CoA to crotonol.
Preparation of E. coli Construct for Adhe2 Overexpression
The wild-type gene adhe2 encodes the enzyme Adhe2 reported as an aldehyde alcohol dehydrogenase from E. coli P12B (GenBank access. AFG40749.1 GI:383103240). The wild-type gene adhe2 was cloned in a pCK-900 vector and transformed into E. coli. The recombinant E. coli containing adhe2 (or E. coli transformed with empty pCK-900 vector) were grown for 16 h in 2xYT with 30 μg/mL chloramphenicol, then diluted in a 50 mL conical tube to OD 0.2 in 2xYT with 30 μg/mL chloramphenicol, 20 mM MgCl2, and 0.25% glucose, (50 mL total volume). The tubes were sealed and shaken at 250 rpm, 30° C. for 2 h, then 1 mM IPTG was added and the E. coli grown for an additional 2 hours under the same conditions. The cells were centrifuged (2800×g, 10 min), the supernatant discarded, and the cell pellets were lysed via addition of 2 mL of 100 mM Tris pH 7.5 with 1 mM DTT. 1 mM MgCl2, 0.5 mg/mL polymixin B sulfate, and 0.5 mg/mL lysozyme, with shaking at room temperature for 2 h. The resulting lysates were centrifuged for 10 min at 12.800×g, and the pellets were discarded. The supernatants were analyzed by SDS-PAGE revealing an overexpressed band at ˜92 kDa consistent with the expected molecular weight of the enzyme, Adhe2.
Enzymatic Activity Assay
2-fold serial dilutions of lysate (30 μL) from E. coli expressing Adhe2 or E. coli control (pCK-900 empty vector) were incubated with 1 mM NADH (20 μL), buffer (100 mM Tris pH 7.5, 1 mM DTT, 20 μL), and 1 mM crotonyl-CoA, 1 mM butyryl-CoA (a natural substrate), or water (as control). Adhe2 enzymatic activity was quantified as the rate of NADH consumption by measuring the UV absorbance at 340 nm.
As shown by the activity assay results in Table 13 below, the consumption of NADH was significantly increased in the solutions containing the Adhe2 expressing lysate and either the natural substrate butyryl-CoA, or the unnatural substrate, crotonyl-CoA, relative to the consumption in the presence of water (i.e., no substrate).
Further, the amount of NADH consumption increased with the amount of Adhe2 expressing lysate added. Additionally, there was no difference in the consumption of NADH in the solutions containing the empty vector lysates with any of butyryl-CoA, crotonyl-CoA, or water. (The very small increases in activity with the empty vector likely correspond to the expression of endogenous Adhe2 enzyme that is present in E. coli). Thus, the results shown in Table 13 indicate that the wild-type Adhe2 is capable of reducing the non-natural substrate crotonyl-CoA, although with slightly less activity than it reduces the natural substrate, butyryl-CoA.
Each publication, patent, patent application, or other document cited in this application is hereby incorporated by reference in its entirety for all purposes to the same extent as if each were individually indicated to be incorporated by reference for all purposes in the specification directly adjacent the citation.
While various specific embodiments have been illustrated and described, it will be appreciated that various changes can be made without departing from the spirit and scope of the invention(s).
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2013/027820 | 2/26/2013 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
61606030 | Mar 2012 | US |