Methods and microorganisms for producing flavors and fragrance chemicals

Information

  • Patent Grant
  • 11060079
  • Patent Number
    11,060,079
  • Date Filed
    Wednesday, December 19, 2018
    6 years ago
  • Date Issued
    Tuesday, July 13, 2021
    3 years ago
  • Inventors
  • Original Assignees
    • ARDRA INC.
  • Examiners
    • Holland; Paul J
    Agents
    • Wilson Sonsini Goodrich & Rosati
Abstract
The present disclosure relates to biosynthetic pathways for producing flavor and fragrance chemicals, such as green notes including trans-2-unsaturated aldehydes and lactones. The present disclosure provides methods for producing trans-2-unsaturated aldehydes, delta-lactones, and gamma-lactones. The present disclosure provides pathways for the preparation of trans-2-unsaturated aldehydes, delta-lactones, and gamma-lactones by reacting aldehydes in the presence of aldolases.
Description
BACKGROUND

Trans-2-unsaturated aldehydes such as trans-2-hexenal, gamma-lactones such as gamma-deltalactone, and delta-lactones such as delta-decalactone are important flavor and aroma constituents and are found in many natural products. These compounds can be used either directly or as intermediates in the flavor, fragrance, pharmaceutical, and cosmetic industries. Most flavors are provided by extraction from natural sources or by traditional methods as chemical synthesis. The problem with chemical synthesis is that the resulting compounds are non-natural and products containing them need to be labelled accordingly. On the other hand, extracting the compounds from natural sources is complicated, expensive, non-sustainable and the supply is often limited. For example, natural trans-2-hexenal, which is used in apple flavors, is currently produced using a fatty acid lipoxygenase pathway with very low yields and massoia lactone ((R)-5,6-dihydro-6-pentyl-2H-pyran-2-one) is extracted from the bark of the Massoia tree at great expense for its conversion to delta-decalactone.


Chemical synthesis of food aroma starts from petroleum derived chemicals and the resulting products are considered non-natural. Driven by consumer preference for products without artificial flavors, demand for alternative, sustainable ways to produce these compounds has increased over the last years. Chemical synthesis also has drawbacks such as poor reactions selectivity leading to undesirable side reactions, low yields, pollution, and, in some cases, high manufacturing costs. Additionally, resolution of racemic mixtures is usually difficult to achieve chemically. This leads to the reduction of process efficiency and to the increase of downstream costs. Extraction of natural products is also subject to various problems. These raw materials may contain low concentrations of the desired compounds and are subject to weather and plant diseases, making the extraction expensive. Natural sources become insufficient to cover the increasing demand of the market.


SUMMARY

A large interest has been focused on producing flavor compounds from biological origin. The present disclosure relates to high-yield, cost-effective methods of producing trans-2-unsaturated aldehydes, gamma-lactones, and delta-lactones and applications thereof in the industry of, e.g., flavors, fragrances, pharmaceuticals and/or cosmetics.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism having a trans-2-unsaturated aldehyde pathway, wherein the non-naturally occurring microorganism comprises at least one of the following trans-2-unsaturated aldehyde pathway enzymes: an aldolase that catalyzes condensation of two aldehydes to produce 3-hydroxy aldehyde; and a dehydratase that dehydrates the 3-hydroxy aldehyde to trans-2-unsaturated aldehyde; wherein the non-naturally occurring microorganism comprises at least one exogenous nucleic acid encoding an enzyme from the trans-2-unsaturated aldehyde pathway; and optionally wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism having a delta-lactone pathway, wherein the non-naturally occurring microorganism comprises at least one of the following delta-lactone pathway enzymes:

    • i. an aldolase (a first aldolase) that catalyzes condensation of two aldehydes to produce 3-hydroxy aldehyde;
    • ii. an aldolase (a second aldolase) that catalyzes condensation of the 3-hydroxy aldehyde and an aldehyde to a 5,3-dihydroxy aldehyde, which undergoes cyclization to form tetrahydro-2H-pyran-2,4-diol;
    • iii. an oxidase enzyme that transforms the tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one;
    • iv. a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one; and
    • v. a reductase that reduces the 5,6-dihydro-2H-pyran-2-one to a delta-lactone;


      wherein the non-naturally occurring microorganism comprises at least one exogenous nucleic acid encoding an enzyme from the delta-lactone pathway; optionally wherein the aldolase in i. is different from the aldolase in ii.; and optionally wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism having a delta-lactone pathway, wherein the non-naturally occurring microorganism comprises at least one of the following delta-lactone pathway enzymes:

    • i. an aldolase (a first aldolase) that catalyzes condensation of two aldehydes to produce 3-hydroxy aldehyde;
    • ii. an aldolase (a second aldolase) that catalyzes condensation of the 3-hydroxy aldehyde and an aldehyde to a 5,3-dihydroxy aldehyde, which undergoes cyclization to form tetrahydro-2H-pyran-2,4-diol;
    • iii. an oxidase enzyme that transforms the tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one;
    • iv. a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one; and
    • v. a reductase that reduces the 5,6-dihydro-2H-pyran-2-one to a delta-lactone,


      wherein the non-naturally occurring microorganism comprises at least one exogenous nucleic acid encoding an enzyme from the delta-lactone pathway; optionally wherein the aldolase in i. is different from the aldolase in ii.; and optionally wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism having a gamma-lactone pathway, wherein the non-naturally occurring microorganism comprises at least one of the following gamma-lactone pathway enzymes:

    • i. an aldolase that catalyzes condensation of an aldehyde molecule and a pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid;
    • ii. a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid;
    • iii. a lactonization enzyme that transforms the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2-(3H)-furanone;
    • iv. a dehydratase that dehydrates the 3-hydroxydihydro-2-(3H)-furanone to 2-(5H)-furanone; and
    • v. a reductase that reduces the 2-(5H)-furanone to a gamma-lactone:


      wherein the non-naturally occurring microorganism comprises at least one exogenous nucleic acid encoding an enzyme from the gamma-lactone pathway; and optionally wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the gamma-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism having a delta-lactone pathway, wherein the non-naturally occurring microorganism comprises at least one of the following delta-lactone pathway enzymes:

    • i. an aldolase (a first aldolase) that catalyzes condensation of an acetaldehyde and an aldehyde to 3-hydroxy aldehyde;
    • ii. an aldolase (a second aldolase) that catalyzes condensation of the 3-hydroxy aldehyde and an acetaldehyde to 5,3-dihydroxy aldehyde, which undergoes cyclization to form tetrahydro-2H-pyran-2,4-diol;
    • iii. an oxidase enzyme that transforms the tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one; and
    • iv. a dehydratase enzyme that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one;


      wherein the non-naturally occurring microorganism comprises at least one exogenous nucleic acid encoding an enzyme from the delta-lactone pathway; optionally wherein the aldolase in i. is different from the aldolase in ii.; and optionally wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism having a gamma-lactone pathway, wherein the non-naturally occurring microorganism comprises at least one of the following gamma-lactone pathway enzymes:

    • i. an aldolase that catalyzes condensation of an aldehyde molecule and a pyruvate to a 4-hydroxy-2-oxo carboxylic acid;
    • ii. a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy acid;
    • iii. dehydration of the 2,4-dihydroxy acid to 4-hydroxy-2-ene-acid;
    • iv. a lactonization enzyme that transforms the 4-hydroxy-2-ene acid to 2(5H)-furanone; and
    • v. a reductase that reduces the 2(5H)-furanone to a gamma-lactone:


      wherein the non-naturally occurring microorganism comprises at least one exogenous nucleic acid encoding an enzyme from the gamma-lactone pathway; and optionally wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the gamma-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a process comprising reacting acetaldehyde and an aldehyde of formula R—CHO, wherein R is hydrogen or CnH2+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10, in the presence of an aldolase enzyme, to form the 3-hydroxy aldehyde compound of formula II




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; wherein, the aldolase enzyme is specific for the condensation of an acetaldehyde and an aldehyde molecule; and optionally wherein the aldolase enzyme is a modified aldolase enzyme comprising an increased enzymatic activity in comparison with a corresponding unmodified or wild-type aldolase enzyme.


In some embodiments, the process further comprises a dehydration step to dehydrate the compound of formula II to form a trans-2-unsaturated aldehyde compound of formula III




embedded image



wherein R is hydrogen or CnH2+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the process further comprises reacting the trans-2-unsaturated aldehyde compound of formula III with an alcohol dehydrogenase or an enzyme having alcohol dehydrogenase activity to form trans-2-unsaturated alcohols. In some embodiments, the process further comprises reacting the trans-2-unsaturated aldehyde compound of formula III with oxidoreductases to form trans-2-unsaturated carboxylic acids.


In some embodiments, the trans-2-unsaturated aldehyde comprises 2-propenal, trans-2-butenal, trans-2-pentenal, trans-2-hexenal, trans-2-heptenal, trans-2-octenal, trans-2-nonenal, trans-2-decenal, trans-2-undecenal, trans-2-dodecenal, trans-2-tridecenal, and any combination thereof. In some embodiments, the trans-2-unsaturated alcohol comprises prop-2-enol, trans-2-butenol, trans-2-pentenol, trans-2-hexenol, trans-2-heptenol, trans-2-octenol, trans-2-nonenol, trans-2-decenol, trans-2-undecenol, trans-2-dodecenol, trans-2-tridecenol, and any combination thereof. In some embodiments, the trans-2-unsaturated carboxylic acid comprises prop-2-enoic acid, trans-2-butenoic acid, trans-2-pentenoic acid, trans-2-hexenoic acid, trans-2-heptenoic acid, trans-2-octenoic acid, trans-2-nonenoic acid, trans-2-decenoic acid, trans-2-undecenoic acid, trans-2-dodecenoic acid, trans-2-tridecenoic acid, and any combination thereof.


In some embodiments, the process further comprises:


(a) condensation of the 3-hydroxy aldehyde compound of formula II and an acetaldehyde molecule in the presence of an aldolase enzyme to form a 5,3-dihydroxy aldehyde compound of formula IV




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; wherein the aldolase enzyme is specific for the condensation of the compound of formula II and an acetaldehyde molecule; optionally wherein the aldolase enzyme specific for the condensation of the compound of formula II and an acetaldehyde molecule is different from the aldolase enzyme that is specific for the condensation of an acetaldehyde and an aldehyde molecule; and optionally wherein the aldolase enzyme is a modified aldolase enzyme comprising an increased enzymatic activity in comparison with a corresponding unmodified or wild-type aldolase enzyme;


(b) cyclization of the 5,3-dihydroxyaldehyde compound of formula IV to a tetrahydro-2H-pyran-2,4-diol compound of formula IV(b)




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10;


(c) oxidation of the tetrahydro-2H-pyran-2,4-diol compound of formula IV(b) to a tetrahydro-4-hydroxy-2H-pyran-2-one compound of formula V




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; optionally wherein the oxidation comprises an enzymatic oxidation;


(d) dehydration of the tetrahydro-4-hydroxy-2H-pyran-2-one compound of formula V to a 5,6-dihydro-2H-pyran-2-one compound of formula VI




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; optionally wherein the dehydration comprises dehydratase; and optionally,


(e) reduction of the 5,6-dihydro-2H-pyran-2-one compound of formula VI in the presence of a reductase to form a delta-lactone compound of formula VII




embedded image



wherein R is hydrogen or CnH2+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; and optionally wherein at least one enzyme of any one of (a)-(e) is partially purified, purified, or is a modified enzyme.


In some embodiments, the compound of formula VI comprises massoia lactone, C-10 massoia lactone (5,6-dihydro-6-pentyl-2H-pyran-2-one), C-12 massoia lactone (5,6-dihydro-6-heptyl-2H-pyran-2-one), C-14 massoia lactone (5,6-dihydro-6-nonyl-2H-pyran-2-one), 5,6-dihydro-2H-pyran-2-one, 5,6-dihydro-6-methyl-2H-pyran-2-one, 5,6-dihydro-6-hexyl-2H-pyran-2-one, 5,6-dihydro-6-ethyl-2H-pyran-2-one, 5,6-dihydro-6-propyl-2H-pyran-2-one, 5,6-dihydro-6-butyl-2H-pyran-2-one, 5,6-dihydro-6-decyl-2H-pyran-2-one, 5,6-dihydro-6-octyl-2H-pyran-2-one, and any combination thereof.


In some embodiments, the compound of formula VII comprises delta-valerolactone, delta-hexalactone, delta-heptalactone, delta-octalactone, delta-nonalactone, delta-decalactone, delta-undecalactone, delta-dodecalactone, delta-tridecalactone, delta-tetradecalactone, delta-pentadecalactone, and any combination thereof.


In one embodiment, the present disclosure relates to a process of producing a gamma-lactone compound, comprising:


(a) condensing an aldehyde molecule and a pyruvic acid in the presence of an aldolase enzyme to form a 4-hydroxy-2-oxo carboxylic acid compound of formula VIII




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; wherein the aldolase enzyme is specific for the condensation of the pyruvic acid and an aldehyde molecule; and optionally wherein the aldolase enzyme is a modified aldolase enzyme comprising an increased enzymatic activity in comparison with a corresponding unmodified or wild-type aldolase enzyme;


(b) reduction of the 4-hydroxy-2-oxo carboxylic acid compound of formula VIII in the presence of a dehydrogenase or a keto-reductase to form a 2,4-dihydroxy carboxylic acid compound of formula IX




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10;


(c) lactonization of the 2,4-dihydroxy carboxylic acid compound of formula IX to a 3-hydroxydihydro-2-(3H)-furanone compound of formula X




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10;


(d) dehydration of the 3-hydroxydihydro-2-(3H)-furanone compound of formula X to a 2(5H)-furanone compound of formula XI




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; optionally wherein the dehydration comprises dehydratase; and


(e) reduction of 2(5H)-furanone compound of formula XI in the presence of reductase to a gamma-lactone compound of formula XII




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10; optionally wherein the reduction comprises reductase.


In some embodiments, the gamma-lactone compound of formula XII comprises gamma-valerolactone, gamma-hexalactone, gamma-heptalactone, gamma-octalactone, gamma-nonalactone, gamma-decalactone, gamma-undecalactone, gamma-dodecalactone, gamma-tridecalactone, gamma-tetradecalactone, gamma-butyrolactone, and any combination thereof. In some embodiments, the lactone is in enantiomeric excess. In some embodiments, the gamma-lactone compound of formula XII comprises (R)-gamma-valerolactone, (R)-gamma-hexalactone, (R)-gamma-heptalactone, (R)-gamma-octalactone, (R)-gamma-nonalactone, (R)-gamma-decalactone, (R)-gamma-undecalactone, (R)-gamma-dodecalactone, (R)-gamma-tridecalactone, (R)-gamma-tetradecalactone, (R)-gamma-butyrolactone, and any combination thereof. In some embodiments, the compound of formula VII comprises (R)-delta-hexalactone, (R)-delta-heptalactone, (R)-delta-octalactone, (R)-delta-nonalactone, (R)-delta-decalactone, (R)-delta-undecalactone, (R)-delta-dodecalactone, (R)-delta-tridecalactone, (R)-delta-tetradecalactone, (R)-delta-pentadecalactone, and any combination thereof. In some embodiments, the compound of formula VI comprises (R)-massoia lactone, (R)-5,6-dihydro-6-pentyl-2H-pyran-2-one, (R)-5,6-dihydro-6-heptyl-2H-pyran-2-one, (R)-5,6-dihydro-6-nonyl-2H-pyran-2-one, (R)-5,6-dihydro-2H-pyran-2-one, (R)-5,6-dihydro-6-methyl-2H-pyran-2-one, (R)-5,6-dihydro-6-hexyl-2H-pyran-2-one, (R)-5,6-dihydro-6-ethyl-2H-pyran-2-one, (R)-5,6-dihydro-6-propyl-2H-pyran-2-one, (R)-5,6-dihydro-6-butyl-2H-pyran-2-one, (R)-5,6-dihydro-6-decyl-2H-pyran-2-one, (R)-5,6-dihydro-6-octyl-2H-pyran-2-one, and any combination thereof.


In some embodiments, the dehydration is spontaneous and/or chemically catalyzed. In some embodiments, the dehydration comprises dehydratase. In some embodiments, the lactonization is spontaneous, chemical, enzymatic, non-enzymatic, or any combination thereof.


In some embodiments, the 3-hydroxy aldehyde compound of formula I comprises 3-hydroxy-propanal, 3-hydroxy-butanal, 3-hydroxy-pentanal, 3-hydroxy-hexanal, 3-hydroxy-heptanal, 3-hydroxy-octanal, 3-hydroxy-nonanal, 3-hydroxy-decanal, 3-hydroxy-undecanal, 3-hydroxy-dodecanal, 3-hydroxy-tridecanal, and any combination thereof. In some embodiments, the compound of formula II is in enantiomeric excess. In some embodiments, the compound of formula II comprises (R)-3-hydroxy-propanal, (R)-3-hydroxy-butanal, (R)-3-hydroxy-pentanal, (R)-3-hydroxy-hexanal, (R)-3-hydroxy-heptanal, (R)-3-hydroxy-octanal, (R)-3-hydroxy-nonanal, (R)-3-hydroxy-decanal, (R)-3-hydroxy-undecanal, (R)-3-hydroxy-dodecanal, (R)-3-hydroxy-tridecanal, and any combination thereof.


In some embodiments, the compound of formula III comprises 2-propenal, trans-2-butenal, trans-2-pentenal, trans-2-hexenal, trans-2-heptenal, trans-2-octenal, trans-2-nonenal, trans-2-decenal, trans-2-undecenal, trans-2-dodecenal, trans-2-tridecenal, or any combination thereof.


In some embodiments, the non-naturally occurring microorganism comprises an exogenous nucleic acid that encodes an aldolase. In some embodiments, the aldolase is a pyruvate dependent aldolase. In some embodiments, the aldolase is a pyruvate class II aldolase. In some embodiments, the pyruvate class II aldolase is HpaI, BphI, Eda or a homolog, or a mutant, or any combination thereof. In some embodiments, the aldolase is or comprises deoxyribose-5-phosphate aldolase (DERA), or a variant, or a homologue thereof. In some embodiments, the aldolase comprises an amino acid sequence of SEQ ID NO: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20, or an active fragment, or a homologue thereof. In some embodiments, the aldolase comprises at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% amino acid sequence identity to any one of SEQ ID NOs: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.


In some embodiments, the aldolase enzyme comprises the following conserved amino acid residues in the active site of the enzyme: lysine167, lysine 201, aspartic acid 16 and aspartic acid 102, where the number associated with each residue refers to the residue number in the amino acid sequence of E. coli DERA of SEQ ID NO: 16.


In some embodiments, the exogenous nucleic acid comprises a nucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, the exogenous nucleic acid comprises a nucleotide sequence that is at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, or at least about 99% nucleotide sequence identity to any one of SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the non-naturally occurring microorganism comprises an exogenous nucleic acid that encodes a ketoreductase, oxidoreductase, aldehyde reductase, enoate reductase, dehydratase, lactonization enzyme, or alcohol dehydrogenase. In some embodiments, the non-naturally occurring microorganism further comprises a decarboxylase capable of the decarboxylation of pyruvate to yield acetaldehyde and carbon dioxide.


In some embodiments, the decarboxylase is pyruvate decarboxylase (PDC), benzoylformate decarboxylase (BFD), or alpha-ketoacid decarboxylase (KDC). In some embodiments, the decarboxylase comprises an amino acid sequence of SEQ ID NO: 71 or an active fragment or homologue thereof. In some embodiments, the BFD comprises an amino acid sequence of SEQ ID NO: 76 or an active fragment or homologue thereof. In some embodiments, the KDC comprises an amino acid sequence of SEQ ID NO: 68 or 77 or an active fragment or homologue thereof.


In some embodiments, the non-naturally occurring microorganism expresses an enzyme identified in table 10 or table 9. In some embodiments, one or more genes encoding an enzyme that utilizes pyruvate is deleted from the non-naturally occurring microorganism as compared to wild-type. In some embodiments, one or more genes encoding an enzyme capable of catalyzing a reaction involving an acetaldehyde molecule, a pyruvate, or an aldehyde precursor is deleted from the non-naturally occurring microorganism as compared to wild-type. In some embodiments, the one or more genes that is deleted from the non-naturally occurring microorganism as compared to wild-type comprises one or more genes encoding an alcohol dehydrogenase, a lactate dehydrogenase, or a pyruvate formate lyase.


In some embodiments, the process comprises fermentation. In some embodiments, the fermentation is partially anaerobic, substantially, or fully anaerobic. In some embodiments, the process is a sugar based fermentation process. In some embodiments, the sugar based fermentation process comprises feeding sugar, or feeding at least one aldehyde precursor, or feeding sugar and an aldehyde precursor, or feeding only aldehyde precursors, or feeding pyruvate and/or pyruvic acid, or any combination thereof. In some embodiments, the sugar based fermentation process provides a higher yield of the trans-2-aldehyde than a fatty acid based process. In some embodiments, the fermentation is carried out in a single vessel. In some embodiments, the fermentation is a two-phase process; optionally wherein the two-phase process comprises a strain growth phase and/or a production phase. In some embodiments, the sugar based fermentation process comprises glucose, xylose, arabinose, glycerol, galactose, mannose, fructose, starch, and any combination thereof.


In some embodiments, the non-naturally occurring microorganism or the process of the present disclosure comprises a host microorganism, wherein the host is selected from a bacteria, yeast, algae, cyanobacteria, fungi, or a plant cell, or any combination thereof.


In some embodiments, the compound of formula III, and/or the compound of formula VII, and/or the compound of formula VI, and/or the compound of formula XII is at least about 50% pure. In some embodiments, the compound of formula II, and/or the compound of formula VI, and/or the compound of formula VII, and/or the compound of formula XII is in enantiomeric excess of the (R)-configuration.


In one embodiment, the present disclosure relates to method of producing trans-2-unsaturated aldehyde from aldehydes and aldolases comprising at least one of the following trans-2-unsaturated aldehyde pathway enzymes:

    • i. an aldolase enzyme that catalyzes condensation of two aldehyde molecules to produce 3-hydroxy aldehyde; and
    • ii. a dehydratase that dehydrates the 3-hydroxy aldehyde to trans-2-unsaturated aldehyde;


      wherein at least one of the trans-2-unsaturated aldehyde pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In one embodiment, the present disclosure relates to a method of producing trans-2-unsaturated aldehyde from aldehydes and aldolases comprising an aldolase enzyme that catalyzes condensation of two aldehyde molecules to produce 3-hydroxy aldehyde; and optionally, a dehydratase that dehydrates the 3-hydroxy aldehyde to trans-2-unsaturated aldehyde: optionally wherein the aldolase enzyme and/or the dehydratase is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In one embodiment, the present disclosure relates to a method of producing delta-lactones from aldehydes and aldolases comprising at least one of the following delta-lactone pathway enzymes

    • i. an aldolase enzyme that catalyzes condensation of two aldehyde molecules to produce a 3-hydroxy aldehyde;
    • ii. an aldolase enzyme that catalyzes condensation of the 3-hydroxy aldehyde and an aldehyde molecule to a 5,3-dihydroxy aldehyde;
    • iii. an oxidase that oxidizes of tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one;
    • iv. a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one; and
    • v. a reductase that reduces the 5,6-dihydro-2H-pyran-2-one to a delta-lactone;


      optionally wherein the aldolase in i. is different from the aldolase in ii; and wherein at least one of the delta-lactone pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In one embodiment, the present disclosure relates to method of producing delta-lactones from aldehydes and aldolases comprising the following delta-lactone pathway enzymes:

    • i. an aldolase (first aldolase) enzyme that catalyzes condensation of two aldehyde molecules to produce a 3-hydroxy aldehyde;
    • ii. an aldolase (second aldolase) enzyme that catalyzes condensation of the 3-hydroxy aldehyde and an aldehyde molecule to a 5,3-dihydroxy aldehyde; optionally wherein the aldolase in i. is different from the aldolase in ii.; and optionally wherein the method further comprises at least one of the following delta-lactone pathway enzymes.
    • iii. an oxidase that oxidizes tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one;
    • iv. a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one; and
    • v. a reductase that reduces the 5,6-dihydro-2H-pyran-2-one to a delta-lactone:


      optionally wherein at least one of the delta-lactone pathway enzymes of i. to v. is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In one embodiment, the present disclosure relates to a method of producing gamma-lactones from an aldehyde molecule, a carboxylic acid, and aldolase comprising at least one of the following gamma-lactone pathway enzymes:

    • i. an aldolase enzyme that catalyzes condensation of an aldehyde molecule and a pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid;
    • ii. a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid;
    • iii. a lactonization enzyme that transforms the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2-(3H)-furanone;
    • iv. a dehydratase that dehydrates the 3-hydroxydihydro-2-(3H)-furanone to 2(5H)-furanone; and
    • v. a reductase that reduces the 2(5H)-furanone to a gamma-lactone;


      wherein at least one of the gamma-lactone pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In one embodiment, the present disclosure relates to a method of producing gamma-lactones from an aldehyde molecule, a carboxylic acid, and aldolase comprising an aldolase enzyme that catalyzes condensation of an aldehyde molecule and a pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid; and optionally wherein the method further comprises at least one of the following gamma-lactone pathway enzymes:

    • i. a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid;
    • ii. a lactonization enzyme that transforms the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2-(3H)-furanone;
    • iii. a dehydratase that dehydrates the 3-hydroxydihydro-2-(3H)-furanone to 2(5H)-furanone; and
    • iv. a reductase that reduces the 2(5H)-furanone to a gamma-lactone;


      wherein at least one of the enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In one embodiment, the present disclosure relates to a method of producing gamma-lactones from an aldehyde, a pyruvate, and aldolase comprising at least one of the following gamma-lactone pathway enzymes:

    • i. an aldolase that catalyzes condensation of an aldehyde molecule and a pyruvate to a 4-hydroxy-2-oxo carboxylic acid;
    • ii. a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid;
    • iii. a dehydratase that dehydrates 2,4-dihydroxy carboxylic acid to 4-hydroxy-2-ene acid;
    • iv. a lactonization enzyme that transforms the 4-hydroxy-2-ene acid to 2(5H)-furanone; and
    • v. a reductase that reduces the 2(5H)-furanone to a gamma-lactone;


      wherein at least one of the gamma-lactone pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In one embodiment, the present disclosure relates to a method of producing gamma-lactones from an aldehyde, a pyruvate, and aldolase comprising an aldolase that catalyzes condensation of an aldehyde molecule and a pyruvate to a 4-hydroxy-2-oxo carboxylic acid; and optionally wherein the method further comprises at least one of the following gamma-lactone pathway enzymes:

    • i. a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid;
    • ii. a dehydratase that dehydrates 2,4-dihydroxy carboxylic acid to 4-hydroxy-2-ene acid;
    • iii. a lactonization enzyme that transforms the 4-hydroxy-2-ene acid to 2(5H)-furanone; and
    • iv. a reductase that reduces the 2(5H)-furanone to a gamma-lactone;


      wherein at least one of the enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the method, or the non-naturally occurring microorganism, or the process of the present disclosure comprises an aldolase wherein the aldolase is a deoxyribose-5-phosphate aldolase DERA enzyme. In some embodiments, the DERA enzyme is modified. In some embodiments, the aldolase comprises an amino acid sequence of SEQ ID NO: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20, or an active fragment, or a homologue thereof. In some embodiments, the aldolase comprises at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% amino acid sequence identity to any one of SEQ ID NOs: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.


In some embodiments, the present disclosure provides for an exogenous nucleic acid encoding the aldolase comprising a nucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, the exogenous nucleic acid encoding the aldolase comprises a nucleotide sequence that is at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, or at least about 99% nucleotide sequence identity to any one of SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the modified DERA enzyme comprises at least one mutation, wherein the at least one mutation corresponds to or is at an amino acid position selected from R190, T12, and S238 of SEQ ID NO: 18. In some embodiments, the DERA enzyme comprises at least one mutation selected from R190K, R190I, R190F, R190A, R190L, R190M, T12A, and S238A of SEQ ID NO: 18. In some embodiments, the modified DERA enzyme comprises mutations corresponding to each or at each amino acid position selected from R190, T12, and S238 of SEQ ID NO: 18. In some embodiments, the modified DERA enzyme comprises mutations at each amino acid position corresponding to or selected from R190, T12, and S238 of SEQ ID NO: 18. In some embodiments, the modified DERA enzyme comprises mutations at all of the amino acid positions corresponding to R190, T12, and S238 of SEQ ID NO: 18.


In some embodiments, the method or the process of the present disclosure is performed in a culture, wherein the culture is a microbial culture. In some embodiments, the aldehyde molecules are fed to the culture. In some embodiments, the aldehyde molecules are produced from glucose. In some embodiments, at least one aldehyde molecule is produced from glucose and the other aldehyde molecule is fed to the culture.


In some embodiments, at least one enzyme is in the form of and/or is included in and/or is derived from a microbial cell, a permeabilized microbial cell, a microbial cell extract, a partially purified enzyme, or a purified enzyme; optionally wherein the at least one enzyme is aldolase; optionally wherein the aldolase is DERA, or a variant, or homologue thereof.


In some embodiments, the method comprises expressing at least one of the enzymes in a host cell. In some embodiments, the host cell is a microbial cell. In some embodiments, the host cell is a fungal or bacterial cell. In some embodiments, the microbial host cell is a transformed microbial host cell selected from the group consisting of Comamonas sp., Corynebacterium sp., Brevibacterium sp., Rhodococcus sp., Azotobacter sp., Citrobacter sp., Enterobacter sp., Clostridium sp., Klebsiella sp., Salmonella sp., Lactobacillus sp., Aspergillus sp., Saccharomyces sp., Yarrowia sp., Zygosaccharomyces sp., Pichia sp., Kluyveromyces sp., Candida sp., Hansenula sp., Dunaliella Debaryonyes sp., Mucor sp., Torulopsis sp., Methylobacteria sp., Bacillus sp., Escherichia sp., Pseudomonas sp., Rhizobium sp., and Streptomyces sp. In some embodiments, the transformed microbial host cell is selected from the group consisting of Bacillus sp., Pseudomonas sp., and Escherichia sp. In some embodiments, the transformed microbial host cell is Escherichia coli.


In some embodiments, the microbial cell is a viable cell. In some embodiments, the microbial cell produces a crude cell lysate. In some embodiments, the crude cell lysate is purified.


In some embodiments, the microbial cell and/or the non-naturally occurring microorganism comprises one or more gene deletions to increase the availability of aldehyde and/or pyruvate. In some embodiments, the microbial cell and/or the non-naturally occurring microorganism comprises one or more gene deletions encoding an alcohol dehydrogenase gene, a lactate dehydrogenase gene, or a pyruvate formate lyase gene. In some embodiments, the non-naturally occurring microorganism comprises one or more gene deletions compared to a corresponding unmodified or wild-type microorganism. In some embodiments, the one or more deleted genes encode enzyme(s) that consume aldehyde and/or pyruvate.


In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from pflB, IdhA, adhE, yqhD, eutG, adhP, and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from aldB, poxB, ybbO, yahK, deoC, paoAB/C (yagT/S/R), adhE, yqhD, eutG adhP, and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from ilvD, rhtA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB, ldhA, adhE, yqhD, eutG, adhP, and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from dkgA aldB poxB ybbO yahK deoC paoA/B/C (yagT/S/R) adhE yqhD eutG adhP and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from ilvD, rhtA, dkgA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB, ldhA, adhE, yqhD, eutG, adhP, and yjgB in E col. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from dkgA, aldB, ybbO, yahK, deoC, paoAB/C (yagTS/R), adhE, yqhD, eutG, adhP, and, yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from ilvD, rhtA, dkgA, a1 dB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB IdhA, adhE, yqhD, eutG, adhP and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from pflB, ldhA, adhE in E. coli. In some embodiments, the one or more gene(s) that is deleted increase the availability of aldehyde and/or pyruvate.


In one embodiment, the present disclosure relates to a method of producing delta-lactone comprising adding or mixing at least one aldolase and/or at least one (precursor) aldehyde, with a non-naturally occurring microorganism to produce a delta-lactone from precursor aldehyde molecules, wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a method of producing gamma-lactone comprising adding or mixing at least one aldolase and/or at least one (precursor) aldehyde, with a non-naturally occurring microorganism to produce gamma-lactone from a precursor aldehyde molecule, wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the gamma-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In one embodiment, the present disclosure relates to a method of producing trans-2-unsaturated aldehyde comprising adding or mixing at least one aldolase and/or at least one (precursor) aldehyde, with a non-naturally occurring microorganism to produce trans-2-unsaturated aldehyde from precursor aldehyde molecules, wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In some embodiments, the aldolases are two aldolases, wherein one aldolase is specific for the condensation of an acetaldehyde and an aldehyde molecule: optionally wherein the other aldolase is specific for the condensation of 3-hydroxy aldehyde and an acetaldehyde molecule, and optionally wherein the two aldolases are two different aldolases.


In some embodiments, the precursor aldehyde molecules are two aldehyde molecules. In some embodiments, the two aldehyde molecules are fed to a microbial culture. In some embodiments, at least one of the aldehyde molecule, the pyruvate, or pyruvic acid is fed to a microbial culture. In some embodiments, the aldehyde molecule is produced from glucose via an aldehyde producing pathway. In some embodiments, at least one of the two aldehyde molecules is produced from glucose and at least one of the two aldehyde molecules is fed to a microbial culture. In some embodiments, at least one of the two aldehyde molecules is acetaldehyde. In some embodiments, the acetaldehyde is produced from glucose via decarboxylation of pyruvate by pyruvate decarboxylase.


In some embodiments, the non-naturally occurring microorganism comprises at least one modified aldolase enzyme, wherein the modified aldolase enzyme comprises an increased activity towards an aldehyde condensation reaction in comparison to an unmodified aldolase enzyme. In some embodiments, the method or process of the present disclosure relates to a modified enzyme comprising at least one modified aldolase enzyme. In some embodiments, the modified aldolase enzyme comprises an increased activity towards an aldehyde condensation reaction in comparison to an unmodified aldolase enzyme. In some embodiments, the modified aldolase enzyme is a modified deoxyribose-5-phosphate aldolase (DERA) enzyme. In some embodiments, the modified DERA enzyme comprises at least one mutation corresponding to or at an amino acid position selected from R190, T12, and S238 of SEQ ID NO: 18.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism capable of synthesizing delta-lactone from aldehydes and aldolases, wherein the non-naturally occurring microorganism is derived from a wild-type or unmodified microorganism and the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta-lactone pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type microorganism


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism capable of synthesizing gamma-lactone from an aldehyde, a pyruvate, and aldolases, wherein the non-naturally occurring microorganism is derived from a wild-type or unmodified microorganism and the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the gamma-lactone pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type microorganism.


In one embodiment, the present disclosure relates to a non-naturally occurring microorganism capable of synthesizing trans-2-unsaturated aldehyde from aldehydes and aldolases, wherein the non-naturally occurring microorganism is derived from a wild-type or unmodified microorganism and the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type microorganism.


In some embodiments, the present disclosure relates to a recombinant vector comprising a gene encoding any DERA enzyme, and/or encoding any KDC, and/or encoding any BFD, and/or encoding any decarboxylase of the present disclosure.


In some embodiments, the present disclosure relates to an expression vector comprising any nucleic acid sequence disclosed in the present disclosure. In some embodiments, a host cell comprises the expression vector. In some embodiments, the cell is a microbial cell.


In some embodiments, the present disclosure relates to an isolated non-naturally occurring microorganism comprising a gene encoding any DERA enzyme, and/or encoding any KDC, and/or encoding any BFD, and/or encoding any decarboxylase of the present disclosure.


In some embodiments, a non-naturally occurring microorganism is obtained by introducing a recombinant vector into a host microorganism.


In one embodiment, the present disclosure relates to a method of producing an aldolase, comprising: a) culturing a host cell that expresses a modified aldolase enzyme under conditions that allow for aldolase production, wherein the modified aldolase enzyme is encoded by a nucleic acid molecule with a sequence at least about 85%, or at least about 90% identical to any one of SEQ ID NOs: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20; and b) isolating the aldolase from the culture: optionally wherein the aldolase is DERA.


In one embodiment, the present disclosure relates to a method of preparing a modified enzyme, comprising: (a) subjecting a deoxyribonucleic acid (DNA) sequence encoding the enzyme to random or site directed mutagenesis: (b) expressing the modified DNA sequence obtained in (a) in a host cell; and (c) screening for host cells expressing the modified enzyme wherein the enzyme has improved aldehyde condensation activity; optionally wherein the enzyme is an aldolase; optionally wherein the aldolase is DERA; and optionally wherein DERA comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, the host cells screened in (c) are subjected to a second mutagenesis treatment, to rescreening, to reisolation and to recloning.


In some embodiments, the modified DNA sequence is expressed by transforming a suitable host cell with the modified DNA sequence and culturing the host cell obtained in (b) under suitable conditions for expressing the modified DNA sequence. In some embodiments, the host cell is a microbial cell. In some embodiments, the host cell is a fungal or bacterial cell.


Advantages of the present disclosure include a high yield single-step fermentation process, as opposed to a multistep enzyme catalysis process, otherwise used for bio-based production of chemicals. The pathways disclosed herein are based on sugars and/or precursor aldehydes, which provides a higher yield than currently used pathways for the production of trans-2-unsaturated aldehydes, delta-lactones and gamma-lactones. Fatty acids may pose problems of insolubility, toxicity, and contacts with enzymes. Current pathways used in the industry for making trans-2-unsaturated aldehydes, delta-lactones, and gamma-lactones involve fatty acid-based pathways such as lipoxygenase pathway and beta-oxidation pathway. Fatty acids used in lactone processes are very expensive. The present disclosure uses glucose instead, which is a cheap precursor, thus making the production of the trans-2-unsaturated aldehydes, the delta-lactones, and the gamma-lactones more cost effective.


Current methods, for example for the production of trans-2-hexenal, involve expression of plant-based enzymes such as 13-hydroperoxide lyase (HPL), which is not well expressed in industrial hosts as it is a membrane-bound enzyme found in chloroplasts. The pathways of the present disclosure are based on microbial enzymes that are easily expressed in industrial hosts, such as in E. coli and S. cerevisiae. The pathways disclosed herein use carbon-carbon bond forming aldolase enzymes. Aldolases may have broad substrate specificity thus, the same pathway may be extended to make different products, by changing the chain length of the precursor molecules. For example, the same pathway may be used to make trans-2-pental, trans-2-hexenal, and trans-2-heptenal, trans-2-octenal, or trans-2-nonenal, all of which are aroma chemicals.


Additional aspects and advantages of the present disclosure will become readily apparent to those skilled in this art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be realized, the present disclosure is capable of other and different embodiments, and its several details are capable of modifications in various obvious respects, all without departing from the disclosure. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.


INCORPORATION BY REFERENCE

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.





BRIEF DESCRIPTION OF THE DRAWINGS

The novel features of the disclosure are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present disclosure will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the disclosure are utilized, and the accompanying drawings (also “FIGURE” and “FIG.” herein), of which:



FIG. 1 exemplifies the fatty acid lipoxygenase pathway of trans-2-hexenal production;



FIG. 2 exemplifies the fatty acid oxidation pathway of gamma-lactone production:



FIG. 3 exemplifies the fatty acid oxidation pathway of delta-lactone production;



FIG. 4 exemplifies the current industrial process for making trans-2-hexenal and cis-3-hexenol:



FIG. 5 exemplifies the general pathway for producing trans-2-unsaturated aldehydes using aldolase for aldehyde condensation:



FIG. 6 exemplifies the general pathway for producing delta-lactones using aldolase for aldehyde condensation:



FIG. 7 exemplifies the general pathway for producing gamma-lactones using pyruvate dependent aldolase;



FIG. 8 exemplifies the pathway for producing trans-2-hexenal:



FIG. 9 exemplifies the pathway for producing massoia lactone:



FIG. 10 exemplifies the pathway for producing gamma-lactone:



FIG. 11 exemplifies production of trans-2-hexenal by acetaldehyde and butyraldehyde.





DETAILED DESCRIPTION

While various embodiments of the disclosure have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions may occur to those skilled in the art without departing from the present disclosure. It should be understood that various alternatives to the embodiments of the disclosure described herein may be employed.


Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the present disclosure belongs. In case of conflict, the present application including the definitions will control. Also, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. All publications, patents and other references mentioned herein are incorporated by reference in their entireties for all purposes as if each individual publication or patent application were specifically and individually indicated to be incorporated by reference, unless only specific sections of patents or patent publications are indicated to be incorporated by reference.


The present disclosure describes non-naturally occurring microorganisms that are engineered by expressing genes encoding enzymes involved in biochemical pathways for producing trans-2-unsaturated aldehydes or lactones, such as delta-lactones and gamma-lactones, by using aldolases. The present disclosure also describes additional genetic modifications that may be used to improve the performance of the trans-2-unsaturated aldehyde pathway and the lactones (e.g, delta-lactone or gamma-lactone) production pathways. The genetic modifications may be towards optimizing the expression system or to the non-natural organism for improvement of production metrics including yield, titre, and productivity. Additionally, genetic modifications may be aimed at improving the non-natural microorganism's characteristics including but not limited to tolerance to inhibitors found in the feedstocks, product tolerance, osmotolerance, and efficient product secretion.


In order to further define the present disclosure, the following terms, abbreviations and definitions are provided.


As used herein, the term “about” modifying the quantity of an ingredient or reactant of the present disclosure employed refers to variations in the numerical quantity that may occur, for example, through typical measuring and liquid handling procedures used for making concentrates or solutions in the real world; through inadvertent error in these procedures; through differences in the manufacture, source, or purity of the ingredients employed to make the compositions or to carry out the methods; and the like. The term “about” also encompasses amounts that differ due to different equilibrium conditions for a composition resulting from a particular initial mixture. Whether or not modified by the term “about”, the claims include equivalents to the quantities. In some embodiments, the term “about” means within 10% of the reported numerical value, or within 5% of the reported numerical value, or within 20% of the reported numerical value.


The indefinite articles “a” and “an” preceding an element or component of the present disclosure are intended to be nonrestrictive regarding the number of instances, i.e., occurrences of the element or component. Therefore “a” or “an” should be read to include one or at least one, and the singular word form of the element or component also includes the plural unless the number is obviously meant to be singular.


As used herein, “synergistic” refers to a greater-than-additive effect produced by a combination (i.e., an effect that is greater than the sum of individual effects) or an additive effect when the individual effects are not expected to be additive. The term also refers to the addition of one compound which results in less of another compound being required.


The term “polynucleotide” is intended to encompass a singular nucleic acid as well as plural nucleic acids, and may refer to a nucleic acid molecule or construct, e.g., messenger RNA (mRNA) or plasmid DNA (pDNA). A polynucleotide may contain the nucleotide sequence of the full-length cDNA sequence, or a fragment thereof, including the untranslated 5′ and 3′ sequences and the coding sequences. The polynucleotide may be composed of any polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. For example, polynucleotides may be composed of single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions. “Polynucleotide” embraces chemically, enzymatically, or metabolically modified forms.


The term “amino acid” refers to the basic chemical structural unit of a protein or polypeptide. The following abbreviations are used herein to identify specific amino acids:


















Three-Letter
One-Letter



Amino Acid
Abbreviation
Abbreviation









Alanine
Ala
A



Arginine
Arg
R



Asparagine
Asn
N



Aspartic acid
Asp
D



Cysteine
Cys
C



Glutamine
Gln
Q



Glutamic acid
Glu
E



Glycine
Gly
G



Histidine
His
H



Leucine
Leu
L



Lysine
Lys
K



Methionine
Met
M



Phenylalanine
Phe
F



Proline
Pro
P



Serine
Ser
S



Threonine
Thr
T



Tryptophan
Trp
W



Tyrosine
Tyr
Y



Valine
Val
V










The term “gene” refers to a nucleic acid fragment that is capable of being expressed as a specific protein, optionally including regulatory sequences preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence. The term “gene” refers to a linear sequence of nucleotides along a segment of DNA that provides the coded instructions for synthesis of RNA. In some instances, a gene provides coded instructions for translation into protein. In some instances, a gene provides coded instructions for the synthesis of non-coding RNA, which does not lead to translation into protein.


As used herein, the term “protein” comprises at least one polypeptide comprising amino acid residues bound together by peptide bonds.


As used herein, the terms “variant,” “modified,” and “mutant” are synonymous and refer to a polypeptide differing from a specifically recited polypeptide by one or more amino acid insertions, deletions, mutations, and substitutions, created using, e.g., recombinant DNA techniques, such as mutagenesis. Guidance in determining which amino acid residues may be replaced, added, or deleted without abolishing activities of interest, may be found by comparing the sequence of the particular polypeptide with that of homologous polypeptides, e.g., yeast or bacterial, and minimizing the number of amino acid sequence changes made in regions of high homology (conserved regions) or by replacing amino acids with consensus sequences.


The mutants of the present disclosure may be generated in accordance with any suitable method, including, but not limited to, methods described and exemplified herein. Mutations, such as substitutions, insertions, deletions, and/or side chain modifications, may be introduced into the nucleotide and amino acid sequences of the gene of interest using any suitable technique, including site-directed mutagenesis (Wu, ed., Meth. Enzymol. 217, Academic Press (1993)) or random mutagenesis, or a combination. The lambda red recombinase method may be used to “knock out” genes (Datsenko et al., PNAS USA 97: 6640-6645 (2000)). Permanent, marker-free, multiple gene disruptions may be created. Non-naturally occurring nucleotides and amino acids also may be used.


Random Mutagenesis


The random mutagenesis of the DNA sequence encoding an enzyme of the present disclosure may conveniently be performed by using any method known in the art. For instance, the random mutagenesis may be performed by using a suitable physical or chemical mutagenizing agent, by using a suitable oligonucleotide, or by subjecting the DNA sequence to PCR generated mutagenesis. In some embodiments, the random mutagenesis may be performed by using any combination of these mutagenizing agents.


In some embodiments, the mutagenizing agent may, e.g., be one which induces transitions, transversions, inversions, scrambling, deletions, and/or insertions. Examples of a physical or chemical mutagenizing agent suitable for the present purpose include, but it is not limited to, ultraviolet (UV) irradiation, hydroxylamine, N-methyl-N′-nitro-N-nitrosoguanidine (MNNG), O-methyl hydroxylamine, nitrous acid, ethyl methane sulphonate (EMS), sodium bisulphite, formic acid, and nucleotide analogues. When such agents are used the mutagenesis is typically performed by incubating the DNA sequence encoding the unmodified or wild-type enzyme to be mutagenized in the presence of the mutagenizing agent of choice under suitable conditions for the mutagenesis to take place, and selecting for mutated DNA having the desired properties. When the mutagenesis is performed by using an oligonucleotide, the oligonucleotide may be doped or spiked with the non-parent or non wild-type nucleotides during the synthesis of the oligonucleotide at the positions wanted to be changed. The doping or spiking may be done so that codons for unwanted amino acids are avoided. The doped or spiked oligonucleotide can be incorporated into the DNA encoding the enzyme (e.g., DERA) by any published technique using e.g., PCR. LCR or any DNA polymerase and ligase. When PCR generated mutagenesis is used either a chemically treated or non-treated gene encoding a parent lipolytic enzyme is subjected to PCR under conditions that increases the misincorporation of nucleotides (Deshler 1992, Leung et al. 1989).


A mutator strain of E. coli (Fowler et al. 1974), S. cereviciae or any other microbial organism may be used for the random mutagenesis of the DNA encoding the enzyme (e.g., DERA) by e.g., transforming a plasmid containing the parent enzyme into the mutator strain, growing the mutator strain with the plasmid and isolating the mutated plasmid from the mutator strain. The mutated plasmid may subsequently be transformed into the expression organism.


The DNA sequence to be mutagenized may conveniently be present in a genomic or cDNA library prepared from an organism expressing the parent or wild-type enzyme (e.g., DERA). Alternatively, the DNA sequence may be present on a suitable vector such as a plasmid or a bacteriophage, which as such may be incubated with or otherwise exposed to the mutagenizing agent. The DNA to be mutagenized may also be present in a host cell either by being integrated in the genome of said cell or by being present on a vector harboured in the cell. Finally, the DNA to be mutagenized may be in isolated form. It will be understood that the DNA sequence to be subjected to random mutagenesis is preferably a cDNA or a genomic DNA sequence.


In some embodiments, it may be convenient to amplify the mutated DNA sequence prior to expression in a host cell or screening. Such amplification may be performed in accordance with methods known in the art. In some embodiments, the method comprises PCR generated amplification using oligonucleotide primers prepared on the basis of the DNA or amino acid sequence of the parent or wild type enzyme.


Subsequent to the incubation with or exposure to the mutagenizing agent, the mutated DNA is expressed by culturing a suitable host cell carrying the DNA sequence under conditions allowing expression to take place. The host cell may be one which has been transformed with the mutated DNA sequence, optionally present on a vector, or one which carried the DNA sequence encoding the parent enzyme during the mutagenesis treatment. The mutated DNA sequence may further comprise a DNA sequence encoding functions permitting expression of the mutated DNA sequence.


As used herein, “homologue” refers to a protein that is functionally equivalent i.e. has the same enzymatic activity as an enzyme having an amino acid sequence of the specified sequence identification number, but may have a limited number of amino acid substitutions, deletions, insertions or additions in the amino acid sequence. In order to maintain the function of the protein, the substitutions may be conservative substitutions, replacing an amino acid with one having similar properties.


In various aspects, a homologue of each enzyme refers to a protein which has an identity of at least 25%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 99% with the amino acid sequence of SEQ ID NO corresponding to the enzyme and retains enzymatic activity. Algorithms for determining sequence identity are publicly available and include e.g. BLAST available through the National Center for Biotechnology Information (NCBI). One skilled in the art may determine if the sequences are similar to a degree that indicates homology and thus similar or identical function.


A polynucleotide encoding a homologue of each enzyme may be obtained by appropriately introducing substitution, deletion, insertion, and/or addition to the DNA of the enzyme which is composed of a nucleotide sequence disclosed herein, using methods such as site-specific mutagenesis (Nucleic Acid Res. 10, pp. 6487 (1982). Methods in Enzymol. 100, pp. 448 (1983). Molecular Cloning 2nd Edt., Cold Spring Harbor Laboratory Press (1989). PCR A Practical Approach IRL Press pp. 200 (1991)). The polynucleotide encoding a homologue of each enzyme may be introduced and expressed in a host to obtain the homologue.


As used herein, term “microbial cell” is intended to include whole microorganism cells, and/or fragmented cells, and/or homogenized cells. The term microbial cell preparation may include whole microorganism cells as well as cell mass particulate forms of immobilized intracellular enzymes. In some embodiments, the microbial cell is a viable cell. In some embodiments, the microbial cell produces a crude cell lysate. In some embodiments, the crude cell lysate is purified. In some embodiments, the crude cell lysate is unpurified. In some embodiments, the crude cell lysate is a combination of purified and unpurified crude cell lysate. In some embodiments, the microbial cell comprises one or more gene deletions to increase the availability of aldehyde and/or pyruvate. In some embodiments, the microbial cell comprises one or more gene deletions encoding an alcohol dehydrogenase, a lactate dehydrogenase, and/or a pyruvate formate lyase. In some embodiments, a deleted gene may be an alcohol dehydrogenase gene, a lactate dehydrogenase gene, and/or a pyruvate formate lyase gene.


As used herein, “enzyme” includes proteins produced by a cell capable of catalyzing biochemical reactions. Further, unless context dictates otherwise, as used herein “enzyme” includes protein fragments that retain the relevant catalytic activity, and may include artificial enzymes synthesized to retain the relevant catalytic activity. In some embodiments, at least one enzyme (e.g., aldolase, DERA) of the present disclosure may be in the form of a whole microbial cell, a microbial cell, a permeabilized microbial cell, a microbial cell extract, one or more cell components of a microbial cell extract, a partially purified enzyme, a purified enzyme, or combinations thereof.


Each of the enzymes described herein may be attached to an additional amino acid sequence as long as it retains an activity functionally equivalent to that of the enzyme. As mentioned above, it is understood that each enzyme or a homologue thereof may be a (poly)peptide fragment as long as it retains an activity functionally equivalent to that of the enzyme.


In some embodiments, at least one enzyme (e.g., aldolase, DERA) is expressed in a host cell. In some embodiments, the at least one enzyme is an unmodified or wild type enzyme. In some embodiments, the at least one enzyme is a modified or a variant enzyme. In some embodiments, the host cell is a microbial cell. In some embodiments, the host cell is a fungal or a bacteria cell. In some embodiments, the microbial host cell is a transformed microbial host cell. In some embodiments, the microbial or transformed microbial host cell may be selected from Comamonas sp., Corynebacterium sp., Brevibacterium sp., Rhodococcus sp., Azotobacter sp., Citrobacter sp., Enterobacter sp., Clostridium sp., Klebsiella sp., Salmonella sp., Lactobacillus sp., Aspergillus sp., Saccharomyces sp., Yarrowia sp., Zygosaccharomyes sp., Pichia sp., Kluveromyces sp., Candida sp., Hansenula sp., Dunaliella Debaryomyces sp., Mucor sp., Torulopsis sp., Methylobacteria sp., Bacillus sp., Escherichia sp., Pseudomonas sp., Rhizobium sp., and Streptomyces sp. In some embodiments, the microbial host cell may be Bacillus sp., Pseudomonas sp., or Escherichia sp.


As used herein, the term “enzymatic activity” describes the ability of an enzyme to convert a substrate (e.g., aldehyde molecule) into a product or an intermediate. For example, an enzymatic activity may be described as the ability of an enzyme to catalyze condensation of aldehyde molecules, such as the ability of an aldolase (e.g., DERA or a modified DERA) to catalyze condensation of two aldehyde molecules or one aldehyde and pyruvic acid to produce 3-hydroxy aldehyde or 4-hydroxy-2-oxo carboxylic acid, respectively. In some embodiments, both the natural substrate of the enzyme and a synthetic modified analog of the natural substrate can be used. The enzymatic activity can be determined in what is known as an activity assay via the increase in the product or intermediate, the decrease in the starting material, the decrease or increase in a specific cofactor, or a combination of at least two of the aforementioned parameters as a function of a defined period of time. If the enzyme catalyzes a reversible reaction, both the starting material and the product may be employed as substrate in the activity assay in question.


As used herein, the term “microorganism” is intended to mean any organism that exists as a microscopic cell and encompasses prokaryotic or eukaryotic cells or organisms having a microscopic size and includes bacteria, archaea and eubacteria of all species as well as eukaryotic microorganisms such as yeast and fungi. The term also includes cell cultures of any species (including e.g, plant and mammalian cells) that may be cultured for the production of a biochemical.


As used herein, the term “non-naturally occurring” when used in reference to a microorganism refers to a microorganism that has at least one genetic alteration not normally found in a naturally occurring strain of the referenced species, including wild-type strains of the referenced species. Genetic alterations include, for example, modifications introducing expressible nucleic acids encoding metabolic polypeptides, other nucleic acid additions, nucleic acid deletions and/or other functional disruption of the microbial genetic material. Such modifications include, for example, coding regions and functional fragments thereof, for heterologous, homologous or both heterologous and homologous polypeptides for the referenced species. Additional modifications include, for example, non-coding regulatory regions in which the modifications alter expression of a gene or operon. In some embodiments, the non-naturally occurring microorganism comprises an exogenous nucleic acid that encodes an aldolase enzyme. In some embodiments, the non-naturally occurring microorganism comprises an exogenous nucleic acid that encodes a pyruvate dependent aldolase. In some embodiments, the non-naturally occurring microorganism comprises an exogenous nucleic acid that encodes a pyruvate class II aldolase (e.g., HpaI, BphI, Eda, or a homologue, or a mutant, or any combination thereof). In some embodiments, the non-naturally occurring microorganism comprises an exogenous nucleic acid that encodes a ketoreductase, oxidoreductase, aldehyde reductase, enoate reductase, dehydratase, lactonization enzyme, or alcohol dehydrogenase, or any combination thereof.


The term “endogenous” refers to a referenced molecule or activity that originates in a host microorganism. Similarly, the term when used in reference to expression of an encoding nucleic acid refers to expression of an encoding nucleic acid contained within the microorganism.


As used herein the term “exogenous” refers to molecules or activity that is introduced into a host microorganism. The molecule may be introduced, for example, by introduction of an encoding nucleic acid into the host genetic material such as by integration into a host chromosome or as non-chromosomal genetic material such as a plasmid. In reference to expression of an encoding nucleic acid the term refers to introduction of the encoding nucleic acid in an expressible form into the microorganism. When used in reference to a biosynthetic activity, the term refers to an activity that is introduced into a reference host organism. The source may be, for example, an encoding nucleic acid that expresses the activity following introduction into the host microorganism.


The term “heterologous” refers to a molecule or activity derived from a source other than the referenced species whereas “homologous” refers to a molecule or activity derived from the host microbial organism. Accordingly, exogenous expression of an encoding nucleic acid of the present disclosure may use either or both a heterologous or homologous encoding nucleic acid.


The term “oxidoreductase” is defined as any enzyme that facilitates the oxidation or reduction of a substrate. The term oxidoreductase includes “oxidases,” which facilitate oxidation reactions in which molecular oxygen is the electron acceptor; “reductases,” which facilitate reduction reactions in which the analyte is reduced and molecular oxygen is not the analyte; and “dehydrogenases.” which facilitate oxidation reactions in which molecular oxygen is not the electron acceptor. See, for example, Oxford Dictionary of Biochemistry and Molecular Biology, Revised Edition, A. D. Smith, Ed., New York: Oxford University Press (1997) pp. 161, 476, 477, and 560. Non-limiting examples of oxidoreductases include a reductase, an oxidase, a dehydrogenase, a ketoreductase, an alcohol dehydrogenase, a carbonyl reductase, an aldehyde dehydrogenase, an amino acid dehydrogenase, an amine oxidase, a disulfide reductase, an enoate reductase, and a mixed function oxidase. A listing of such enzymes may be found in Enzyme Nomenclature: Webb, E. C., Ed. Academic: Orlando 1984; pp 20-141.


As used herein, the term “operably linked” refers to a linkage between one or more expression control sequences and the coding region in a polynucleotide to be expressed in such a way that expression is achieved under conditions compatible with the expression control sequence.


The expression “derived from” in relation to an enzyme or (poly)peptide denotes that the enzyme or poly(peptide) was isolated from a (micro)organism or that it includes all or a biologically active part of the amino acid sequence of an enzyme or (poly)peptide isolated or characterized from such a (micro)organism.


In some embodiments, the processes/pathways disclosed herein is a process comprising, consisting of, or consisting essentially of condensing two aldehyde molecules using an aldolase enzyme as described herein.


The aldolases catalyze aldol condensation by stereo-controlled addition of a nucleophilic donor onto an electrophilic aldehyde acceptor. Due to the mechanistic requirements aldolases are quite specific for the nucleophilic donor component but show large flexibility in the acceptor range. Hence aldolases are categorized based on their nucleophilic donors. Different classes of aldolases are 1) acetaldehyde dependent aldolase, 2) pyruvate/phosphoenolpyruvate-dependent aldolases, 3) dihydroxyacetone phosphate/dihydroxyacetone-dependent aldolases, and 4) glycine dependent aldolases.


In some embodiments, aldolases may be acetaldehyde dependent aldolases. In some embodiments, aldolases may be pyruvate dependent aldolases. In some embodiments, the aldehydes may be donors or acceptors.


In some embodiments, the donors may include one or more of acetaldehyde (ethanal), propanal, 2-methylpropanal, methylglyoxal, lactaldehyde, glycolaldehyde, acrolein, formaldehyde, acetaldehyde, propionaldehyde, n-butyraldehyde, iso-butyraldehyde, hexanal, n-pentanal (valeraldehyde), caproaldehyde, crotonaldehyde, n-heptaldhyde, n-octanal, n-nonylaldehyde, phenylacetaldehyde, benzaldehyde, p-tolualdehyde, salicylaldehyde, vanillin, piperonal, 2-ethyl-2-hex-enal and 2-ethylhexaldehyde, undecaldehyde, decaldehyde, nonaldehyde, octaldehyde, heptaldehyde, hexaldehyde, pentaldehyde, or butyraldehyde.


In some embodiments, the donors may be non-aldehydes including pyruvate, propanone (acetone), glyoxylic acid, or 3-propenol.


In some embodiments, the acceptors may include one or more of acetaldehyde (ethanal), propanal, butanal, isobutanal, 2-methyl-1-butanal, 3-methyl-1-butanal, pentanal, hexanal, 3-methyl-1-pentanal, 4-methyl-1-pentanal, succinate semialdehyde, lactaldehyde, glycoldehyde, glyceraldehyde, 2-phenylacetaldehyde, cinnamaldehyde, glyoxal, glyoxylic acid, methyl glyoxal, acrolein, succindialdehyde, glutaraldehyde, adipaldehyde, malondialdehyde, malonic semialdehyde (3-oxopropionic acid), muconate semialdehyde, 2-hydroxymuconate semialdehyde, formaldehyde, acetaldehyde, propionaldehyde, n-butyraldehyde (butanal), iso-butyraldehyde, tert-butanal, hexanal (n-hexaldehyde), n-pentanal (valeraldehyde, pentaldehyde), caproaldehyde, crotonaldehyde, n-heptaldhyde (heptaldehyde, heptanal), n-octanal (octaldehyde), n-nonylaldehyde (nonaldehyde), n-decaldehyde, 3-hydroxypropanal, 3-hydroxybutanal, 3-hydroxypentanal, 3-hydroxyhexanal, 3-hydroxyheptanal, 3-hydroxy-octanal, 3-hydroxynonanal, 3-hydroxydecanal, phenylacetaldehyde, benzaldehyde, p-tolualdehyde, salicylaldehyde, vanillin, piperonal, 2-ethyl-2-hex-enal and 2-ethylhexaldehyde, undecaldehyde.


As used herein, the term DERA refers to the enzyme deoxyribose-5-phosphate aldolase belonging to the class aldolases, the term AKR refers to the class aldo-ketoreductase, the term ADH refers to the enzyme alcohol dehydrogenase, the term PDC refers to the enzyme pyruvate decarboxylase, the term BFD refers to the enzyme benzoylformate decarboxylase, the term KDC refers to the enzyme alpha-ketoacid decarboxylase, the term KRED refers to ketoreductase, the term G6PD refers to the enzyme glucose-6-phosphate 1-dehydrogenase (hemi-acetal oxidase), the term AKHD refers to the enzyme aspartokinase/homoserine dehydrogenase, the term HSK refers to the enzyme homoserine kinase, the term TS refers to the enzyme threonine synthase, the term IMDH refers to the enzyme isopropylmalate dehydrogenase, the term IPMI refers to the enzyme isopropylmalate dehydratase, the term NADH refers to reduced nicotinamide adenine dinucleotide, the term NADPH refers to reduced nicotinamide adenine dinucleotide phosphate, the term NAD refers to nicotinamide adenine dinucleotide, and the term NADP refers to nicotinamide adenine dinucleotide phosphate.


In some embodiments, the trans-2-unsaturated aldehyde pathway or the delta-lactone pathway disclosed herein comprise the condensation of two aldehyde molecules to 3-hydroxy acetaldehyde using enzyme from class aldolases. In some embodiments, the condensation of an aldehyde and a 3-hydroxy acetaldehyde molecule may be performed using an enzyme from class aldolases. In some embodiments, the enzyme from the class aldolases is a deoxyribose-5-phosphate aldolase (DERA) (EC 4.1.2.4.).


In some embodiments, DERA enzymes may be described as class I aldolases that form covalent Schiff base intermediates. In all studied structures, DERA adopts the classical eight-bladed TIM barrel fold. The oligomerisation state of DERA seems to depend on the temperature of the organism. For example, DERA from E. coli is a homodimer, whereas DERA from Thermotoga maritima is a homotetramer. The degree of oligomerization does not seem to affect catalysis but may affect stability under various conditions.


In some embodiments, the DERA is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, the aldolase (e.g., DERA) is an enzyme comprising an amino acid sequence encoded by a DNA which comprises at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% nucleotide sequence identity to any one of SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, DERAs as described herein are derived from microorganisms of the genus Bacillus, Escherichia, Thermotoga Deinococcus, Listeria, Staphylococcus, Streptococcus, and Methanothermobacter. In some embodiments, the DERA is derived from Bacillus halodurans, Bacillus cereus. Bacillus subtilis, Escherichia coli, Thermotoga maritima, Deinococcus radiodurans, Listeria monocytogenes, Staphylococcus aureus, Streptococcus pneumonia, and Methanothermobacter thermautotrophicus. In some embodiments, a DERA as used in a process described herein comprises an amino acid sequence of SEQ ID NOs: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 or active fragments or homologues thereof. In some embodiments, the DERA is an enzyme comprising an amino acid sequence which comprises at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% sequence identity to any one of SEQ ID NOs: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.


In some embodiments, the aldolase enzyme comprises the following conserved amino acid residues in the active site of the enzyme: lysine 167, lysine 201, aspartic acid 16 and aspartic acid 102, where the number associated with each residue refers to the residue number in the amino acid sequence of E. coli DERA of the SEQ ID NO: 16 and corresponding codons in the nucleotide sequence of SEQ ID NO: 2.


In some embodiments, the DERA enzyme is specifically modified to increase activity towards an alkyl aldehyde. In some embodiments, the DERA enzyme is specifically modified to increase activity towards hexanal. In some embodiments, the DERA enzyme is modified to increase activity towards propanal, butanal, pentanal, hexanal, heptanal, octanal, nonanal, decanal, undecanal, dodecanal, tridecanal, and any combination thereof. In some embodiments, the DERA enzyme is modified to increase activity towards condensation of an acetaldehyde and an acetaldehyde, propanal, butanal, pentanal, hexanal, heptanal, octanal, nonanal, decanal, undecanal, dodecanal, or a tridecanal. In some embodiments, the DERA enzyme is modified to increase activity towards condensation of an acetaldehyde and a 3-hydroxy aldehyde (e.g., 3-hydroxy-propanal, 3-hydroxy-butanal, 3-hydroxy-pentanal, 3-hydroxy-hexanal, 3-hydroxy-heptanal, 3-hydroxy-octanal, 3-hydroxy-nonanal, 3-hydroxy-decanal, 3-hydroxy-undecanal, 3-hydroxy-dodecanal, 3-hydroxy-tridecanal). In some embodiments, the DERA enzyme used for the condensation of an acetaldehyde and an aldehyde is different from the DERA enzyme used for the condensation of an acetaldehyde and a 3-hydroxy aldehyde. In some embodiments, an aldolase enzyme (e.g., pyruvate-dependent aldolase) is modified to increase activity towards condensation of a pyruvic acid and an aldehyde (e.g., acetaldehyde, propanal, butanal, pentanal, hexanal, heptanal, octanal, nonanal, decanal, undecanal, dodecanal, or tridecanal).


In some embodiments, the DERA enzyme is modified to increase activity towards a 3-hydroxy aldehyde. In some embodiments, the DERA enzyme is modified to increase activity towards 3-hydroxy-propanal, 3-hydroxy-butanal, 3-hydroxy-pentanal, 3-hydroxy-hexanal, 3-hydroxy-heptanal, 3-hydroxy-octanal, 3-hydroxy-nonanal, 3-hydroxy-decanal, 3-hydroxy-undecanal, 3-hydroxy-dodecanal, 3-hydroxy-tridecanal, and any combination thereof. In some embodiments, the DERA enzyme is specifically modified to increase activity towards 3-hydroxy octanal.


In some embodiments, the DERA enzyme comprises one or more of the following mutations: R190K, R190I, R190F, R190A, R190L, R190M, T12A, and S238A. In some embodiments, the DERA enzyme comprises, but is not limited to, one or more of the following mutations: RI90K, R190I, R190F, TI2A, and S238A. In some embodiments, the DERA enzyme comprises, but is not limited to, one or more of the following mutations: R190A or R90L or RI90M, T12A and S238A. In some embodiments, the number associated with amino acids R190, T12, and S238 corresponds to the amino acid residue numbers in BH1352 DERA, represented in SEQ ID NO: 18. In some embodiments, all residues R190, T12, and S238 are mutated or modified. In some embodiments, only one of R190, T12, and S238 is mutated or modified. In some embodiments, the amino acid R190 is mutated or modified. In some embodiments, the DERA enzyme from another organism may comprise one or more amino acid mutations that correspond to the amino acid mutations R190K, R190I, R190F, R190A, R190L, R190M, T12A, and S238A (e.g., of SEQ ID NO: 18), wherein the amino acids R190, T12 and S238 correspond to the amino acid residue numbers in SEQ ID NO: 18. In some embodiments, one or more amino acid(s) is mutated to a hydrophobic amino acid. In some embodiments, one or more amino acid(s) is mutated to a nonpolar amino acid. In some embodiments, one or more amino acid(s) is mutated to a charged amino acid. In some embodiments, one or more amino acid(s) is mutated to a basic amino acid.


In some embodiments, enzymes belong to the Pfam database [Finn R. D. et al., Pfam: the protein families database Nucl. Acids Res. (1 Jan. 2014) 42 (Dl): D222-D230] group PF01791 (DeoC/LacD family aldolase) include deoxyribose-5-phosphate aldolases, which also belong to the InterPro family IPR002915, IPR013785, IPRO11343, and IPR028581. Protein sequences that belong to the InterPro and Pfam family of proteins may be obtained, such that they are homologues of DERA described herein.


In some embodiments, the 5,3-dihydroxy aldehyde product of the DERA catalyzed reaction spontaneously forms a hemi-acetal.


In some embodiments the hemi-acetal is oxidized to form a lactone by a hemi-acetal specific oxidase. TABLE 1 lists hemi-acetal specific oxidases.













TABLE 1





Protein
Genbank ID
GI number
Organism
SEQ ID NO







Zwf
NP_416366.1
16129805

Escherichia coli K-12

21





MG1655



G6PD
P37986.1
585164

Dickeya dadantii 3937

22


G6PD
Q9Z8U6.1
6225396

Chlamydia pneumoniae

23









In some embodiments, hemi-acetal oxidases as described herein are derived from microorganisms of the genus Escherichia In some embodiments, the hemi-acetal oxidase is derived from Escherichia coli K-12 MG1655. In some embodiments, a hemi-acetal oxidase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 21 or active fragments or homologues thereof.


In some embodiments, the hemi-acetal oxidase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 24.


In some embodiments, the gamma-lactone pathway disclosed herein comprises the condensation of one aldehyde molecule to pyruvate using enzyme from class aldolases. In some embodiments, the condensation of an aldehyde and a pyruvate molecule may be performed using an enzyme from class aldolases. In some embodiments, the enzyme from the class aldolases is a pyruvate dependent aldolase (PDA).


In some embodiments, sources of encoding nucleic acids for the pathway enzymes described herein are not particularly restricted and may include any species where the encoded gene product may catalyze the relevant reaction. The enzymes may be derived from, but not limited to, the following species: Paraburkholderia xenovorans LB400, Agrobacterium tumefaciens, Bacillus cereus, Bacillus halodurans, Bacillus subtilis, Helicobacter pylori, Lactobacillus brevis, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas synringae, Rhodopseudomonas palustris, Salmonella typhimurium, Saccharomyces cerevisiae, Clostridium acetobutylicum. TABLE 2 illustrates exemplary pyruvate dependent aldolase enzymes.













TABLE 2









SEQ


Protein
Genbank ID
GI number
Organism
ID NO







EDA
NP_416364
16129803

Escherichia coli K-12 MG1655

25


MphE
NP_414886
16128337

Escherichia coli K-12 MG1655

26


BphI
ABE37049
91693852

Paraburkholderia xenovorans

27





LB400









In some embodiments, enzymes described herein may belong to the InterPro superfamily family IPR000887 and IPRO17629, which describe the pyruvate dependent aldolase family of enzymes.


In some embodiments, pyruvate dependent aldolase as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the pyruvate dependent aldolase is derived from Escherichia coli K-12 MG1655. In some embodiments, a pyruvate dependent aldolase as used in a process described herein comprises an amino acid sequence of SEQ ID Nos: 25 or 26, or active fragments or homologues thereof.


In some embodiments, the pyruvate dependent aldolase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 28 or 29.


In some embodiments, sources of encoding nucleic acids for the pathway enzymes described herein are not particularly restricted and may include any species where the encoded gene product may catalyze the relevant reaction. The enzymes may be derived from, but not limited to, the following species: Agrobacterium tumefaciens, Bacillus cereus, Bacillus halodurans, Bacillus subtilis. Helicobacter pylori. Lactobacillus brevis. Pseudomonas aeruginosa. Pseudomonas putida, Pseudomonas synringae, Rhodopseudomonas palustris. Salmonella typhimurium, Saccharomyces cerevisiae, Clostridium acetobutylicum. TABLE 3 illustrates exemplary aldo-keto reductase enzymes.













TABLE 3









SEQ


Protein
Genebank ID
GI number
Organism
ID NO



















AKR11B
CAB12792.1
2633288

Bacillus subtilis

30


PA1127
NP_249818.1
15596324

Pseudomonas aeruginosa

31


AKR
WP_
499200775

Bacillus halodurans

32



010898315





AKR
NP_790200
28867581

Pseudomonas syringae

33









In some embodiments, enzymes described herein may belong to the InterPro superfamily family IPR023210, IPR001395, IPR018170, and IPRO20471, which describes the aldo-keto reductase family of enzymes that possess a beta-alpha-beta fold which comprises a parallel 8 beta/alpha barrel which contains the NADP binding motif.


In some embodiments, there is provided an alcohol dehydrogenase, aldo-keto reductases (AKR), oxidoreductase, or aldehyde reductase capable of selectively reducing trans-2-unsaturated aldehydes to trans-2-unsaturated alcohols or to trans-2-unsaturated carboxylic acids. In some embodiments, there is provided alcohol dehydrogenases, aldo-keto reductases (AKR), oxidoreductases, or aldehyde reductases capable of selectively reducing 2-keto-4-hydroxy aldehydes to 2,4-dihydroxy aldehydes. In some embodiments, there is provided alcohol dehydrogenases, aldo-keto reductases (AKR), oxidoreductases, or aldehyde reductases capable of selectively reducing 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid. In some embodiments, the source of this enzyme is not particularly restricted.


In some embodiments, aldo-keto reductases (AKRs) as described herein are derived from microoganisms of the genus Pseudomonas. In some embodiments, an AKR as provided herein is derived from Pseudomonas aeruginosa. In some embodiments, an AKR as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 31 or active fragments or homologues thereof.


In some embodiments, the AKR is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 34.


In some embodiments, L-threonine is the starting metabolite for the production of 2-ketobutyrate by pathways disclosed herein. It is a common metabolite but it may be produced by a sequence of reactions starting from L-aspartate.


In some embodiments, L-threonine is produced from L-aspartate by sequential reactions catalyzed by aspartate kinase/homoserine dehydrogenase (ThrA) (EC 2.7.2.4), homoserine kinase (ThrB) (EC 2.7.1.39) and threonine synthase (ThrC) (EC 4.2.3.1). The reaction proceeds via L-aspatyl-4-phosphate, L-aspartate semialdehyde, L-homoserine and O-phospho-L-homoserine. Over expression of aspartate kinase/homoserine dehydrogenase, homoserine kinase and threonine synthase leads to increased levels of L-threonine [Mak. W. S., et al., Integrative genomic mining for enzyme function to enable engineering of a non-natural biosynthetic pathway. Nat Commun, 2015, 6: p. 10005.; Rodriguez, G. M, and S. Atsumi, Toward aldehdle and alkane production by removing aldehyde reductase activity in Escherichia coli. Metab Eng, 2014, 25: p. blast.ncbi.nlm.nih.gov/Blast.cgi227-37.; Zhang, K., et al., Expanding metabolism for biosynthesis of nonnatural alcohols. Proc Natl Acad Sci USA, 2008, 105(52): p. 20653-8.1. TABLE 4 illustrates examples of isopropylmalate synthases, isopropylmalate dehydratases and 3-isopropylmalate dehydrogenases.













TABLE 4









SEQ


Protein
Genbank ID
GI number
Organism
ID NO



















ThrA
NP_414543.1
16127996

Escherichia coli K-12

35





MG1655



ThrA
P27725.1
113540

Serratia marcescens

36


ThrA
Q89AR4.1
38372169

Buchnera aphidicola str.

37





Bp (Baizongia pistaciae)



ThrB
NP_414544.1
16127997

Escherichia coli K-12

38





MG1655



ThrB
Q8DEP3.1
32129677

Vibrio vulnificus

39





CMCP6



ThrB
Q057U7.1
122285538

Buchnera aphidicola

40





BCc



ThrC
NP_414545.1
16127998

Escherichia coli K-12

41





MG1655



ThrC
P44503.1
1174691

Haemophilus influenzae

42





Rd KW20



ThrC
P29363.3
12231046

Pseudomonas

43






aeruginosa PAO1










In some embodiments, aspartate kinases/homoserine dehydrogenases as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the aspartate kinase/homoserine dehydrogenase is derived from Escherichia coli K-12 MG655. In some embodiments, an aspartate kinase/homoserine dehydrogenase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 35 or active fragments or homologues thereof.


In some embodiments, the aspartate kinase/homoserine dehydrogenase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 44.


In some embodiments, homoserine kinases as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the homoserine kinase is derived from Escherichia coli K-12MG1655. In some embodiments, a homoserine kinase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 38 or active fragments or homologues thereof.


In some embodiments, the homoserine kinase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 45.


In some embodiments, threonine synthases as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the threonine synthase is derived from Escherichia coli K-12MG1655. In some embodiments, a threonine synthase used in a process as described herein comprises an amino acid sequence of SEQ ID NO: 41 or active fragments or homologues thereof.


In some embodiments, the threonine synthase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 46.


In some embodiments, 2-ketobutyrate is the starting metabolite for the production of longer chain precursor 2-ketoacids by pathways disclosed herein. In some embodiments, 2-ketobutyrate is produced by dehydration of L-threonine.


In some embodiments, 2-ketobutyrate is produced by dehydration of L-threonine by threonine dehydratase (IlvA) (EC 4.3.1.19) to yield water and 2-aminobut-2-enoate, which spontaneously converts into 2-ketobutyrate. Over expression of threonine dehydratase leads to increased levels of 2-ketobutyrate [Mak, W. S., et al., Integrative genomic mining for enzyme function to enable engineering of a non-natural biosynthetic pathway. Nat Commun. 2015, 6: p. 10005.; Rodriguez, G. M, and S. Atsumi, Toward aldehyde and alkane production by removing aldehyde reductase activity in Escherichia coli. Metab Eng, 2014, 25: p. blast.ncbi.nlm.nih.gov/Blast.cgi227-37.; Zhang, K., et al., Expanding metabolism for biosynthesis of nonnatural alcohols. Proc Natl Acad Sci USA, 2008, 105(52): p. 20653-8.]. TABLE 5 illustrates exemplary threonine dehydratases.













TABLE 5





Protein
Genebank ID
GI number
Organism
SEQ ID NO



















IlvA
NP_418220
16131630

Escherichia coli K-12 MG1655

47


IlvA
Q9CKJ2.1
20140770

Pasteurella multocida subsp. multocida

48





str. Pm70



IlvA
P53607.1
1729930

Burkholderia multivorans ATCC

49





17616










In some embodiments, threonine dehydratase as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the threonine dehydratase is derived from Escherichia coli K-12 MGI655. In some embodiments, a threonine dehydratase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 47 or active fragments or homologues thereof.


In some embodiments, the threonine dehydratase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 50.


In some embodiments, 2-ketoacids of different chain length are the starting metabolites for the production of longer chain precursor aldehydes by pathways disclosed herein. In some embodiments, 2-ketoacids are produced by multiple reactions starting from 2-ketobutyrate.


In some embodiments, 2-ketoacids such as 2-ketovalerate, 2-ketoheptanoic acid and 2-ketooctanoic acid are produced from 2-ketobutyrate by sequential reactions catalyzed by 2-isopropylmalate synthase (LeuA) (EC 2.3.3.13), isopropylmalate dehydratase (LeuC) (EC 4.2.1.33; 4.2.1.35) and/or 3-isopropylmalate dehydrogenase (LeuB) (EC1.1.1.85). Each cycle requires one acetyl-CoA and leads to chain elongation. In some embodiments, longer chains are produced by passing the cycle several times. Over expression of the enzymes (e.g., 2-isopropylmalate synthase, isopropylmalate dehydratase, and/or 3-isopropylmalate) dehydrogenase can lead to increased levels of 2-ketoacid production [Mak, W. S., et al., Integrative genomic mining for enzyme function to enable engineering of a non-natural biosynthetic pathway. Nat Commun, 2015, 6: p. 10005.; Rodriguez. G. M, and S. Atsumi, Toward aldehyde and alkane production by removing aldehyde reductase activity in Escherichia coli. Metab Eng, 2014, 25: p. blast.ncbi.nlm.nih.gov/Blast.cgi 227-37.; Zhang, K., et al., Expanding metabolism for biosynthesis of nonnatural alcohols. Proc Natl Acad Sci USA, 2008, 105(52): p. 20653-8.]. TABLE 6 illustrates examples of isopropylmalate synthases, isopropylmalate dehydratases and 3-isopropylmalate dehydrogenases.













TABLE 6






Genebank
GI

SEQ


Protein
ID
number
Organism
ID NO



















LeuA
NP_414616.1
16128068

Escherichia coli K-12

51





MG1655



LeuA
Q9ZEY8.3
11182435

Buchnera aphidicola

52





str. APS



LeuA
Q47BI0.1
123626799

Dechloromonas

53






aromatica RCB




LeuB
NP_414615.4
90111082

Escherichia coli K-12

54





MG1655



LeuB
Q4QLS3.1
81335999

Haemophilus influenzae

55





86-028NP



LeuB
Q8PH05.1
24211902

Xanthomonas axonopodis

56





pv. citri str. 306



LeuC
NP_414614.1
16128066

Escherichia coli K-12

57





MG1655



LeuC
B1KKZ2.1
226736001

Shewanella woodyi

58





ATCC 51908



LeuC
A6WXG4.1
166989645

Ochrobactrum anthropi

59





ATCC 49188



LeuD
NP_414613.1
16128065

Escherichia coli K-12

60





MG1655



LeuD
C4LAV4.1
259494466

Tolumonas auensis

61





DSM 9187



LeuD
C0ZCK0.1
254809060

Brevibacillus brevis

62





NBRC 100599









In some embodiments, 2-isopropylmalate synthases as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the 2-isopropylmalate synthase is derived from Escherichia coli K-12 MG1655. In some embodiments, a 2-isopropylmalate synthase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 51 or active fragments or homologues thereof.


In some embodiments, the 2-isopropylmalate synthase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 63.


In some embodiments, 3-isopropylmalate dehydrogenases as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the 3-isopropylmalate dehydrogenase is derived from Escherichia coli K-12 MG1655. In some embodiments, a 3-isopropylmalate dehydrogenase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 54 or active fragments or homologues thereof.


In some embodiments, the 3-isopropylmalate dehydrogenase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 64.


In some embodiments, 3-isopropylmalate dehydratases as described herein are derived from microorganisms of the genus Escherichia. In some embodiments, the 3-isopropylmalate dehydratase is derived from Escherichia coli K-12 MG655. In some embodiments, a 3-isopropylmalate dehydratase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 57 or active fragments or homologues thereof. In some embodiments, a isopropylmalate dehydratase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 60 or active fragments or homologues thereof.


In some embodiments, the isopropylmalate dehydratases is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 65.


In some embodiments, the starting aldehyde of any one of the pathways disclosed herein is butyraldehyde. In some embodiments, the butyraldehyde is a natural metabolite. In some embodiments, the butyraldehyde is produced by decarboxylation of 2-ketovalerate.


In some embodiments, butyraldehyde is produced by the decarboxylation of 2-ketovalerate by alpha-ketoacid decarboxylase (KDC) (EC 4.1.1.74) to yield butyraldehyde and carbon dioxide. KDC (KivD) from Lactococcus lactis lactis KF147 has broad substrate specificity, ranging from keto-propionic acid to keto-octonoic acid, with highest activity observed for keto-valerate. [Mak, W. S., et al., Integrative genomic mining for enzyme function to enable engineering of a non-natural biosynthetic pathway. Nat Commun. 2015, 6: p. 10005.]. TABLE 7 illustrates examples of alpha-ketoacid decarboxylases.













TABLE 7






Genebank


SEQ


Protein
ID
GI number
Organism
ID NO



















KivD
ADA65057.1
281375551

Lactococcus lactis

66






lactis KF147




Indolepyruvate
P23234.1
118333

Enterobacter cloacae

67


decarboxylase






KDC
A0R480.1
189028401

Mycobacterium

68






smegmatis







str. MC2 155









In some embodiments, alpha-ketoacid decarboxylases as described herein are derived from microorganisms of the genus Lactococcus. In some embodiments, the alpha-ketoacid decarboxylase is derived from Lactococcus lactis lactis KF147. In some embodiments, a alpha-ketoacid decarboxylase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 66 or active fragments or homologues thereof.


In some embodiments, the alpha-ketoacid decarboxylase a is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 69.


In some embodiments, butyraldehyde is obtained by the decarboxylation of 2-ketovalerate by KDC. In some embodiments, butyraldehyde is (alternatively or additionally) obtained by one or more of the reaction pathways identified in TABLE 8.












TABLE 8





Kegg reaction ID
Enzyme Name
EC number
Reaction







R01172
butanal: NAD+
1.2.1.10;
Butanal + CoA +



oxidoreductase
1.2.1.57;
NAD+ <=>



(CoA-acylating)
1.2.1.87
Butanoyl-CoA +





NADH + H+


R01173
butanal: NADP+
1.2.1.57
Butanal + CoA +



oxidoreductase

NADP+ <=>



(CoA-acylating

Butanoyl-CoA +





NADPH + H+


R01172
butanal: NAD+
1.2.1.10;
Butanal + CoA +



oxidoreductase
1.2.1.57;
NAD+ <=>



(CoA-acylating)
1.2.1.87
Butanoyl-CoA +





NADH + H+


R03544
butanol
1.1.1.-
Butanal + NADH +



dehydrogenase

H+ <=> 1-Butanol +





NAD+


R03545
butanol
1.1.1.-
Butanal +



dehydrogenase

NADPH + H+ <=>





1-Butanol + NADP+









In some embodiments, hexanal is produced by the decarboxylation of 2-ketoheptanal by a mutated alpha-ketoacid decarboxylase (KDC) (EC 4.1.1.74) to yield hexanal and carbon dioxide. Three mutations G402V. M538L and F542V in KDC (KivD) from Lactococcus lactis lactis KF147 changes substrate specificity towards longer chain alpha-ketoacids, ranging from keto-hexanoic acid to keto-octonoic acid, with highest activity observed for keto-heptanoic acid. [Mak, W. S., et al., Integrative genomic mining for enzyme function to enable engineering of a non-natural biosynthetic pathway. Nat Commun. 2015, 6: p. 10005.].


In some embodiments, mutated KDCs showing high homology to KivD from Lactococcus lactis lactis KF147 might be used. Mutations include the following: G402V. M538L and F542. Amino acid positions are based on SEQ ID NO: 66.


In some embodiments, mutated alpha-ketoacid decarboxylases as described herein are derived from microorganisms of the genus Lactococcus. In some embodiments, the mutated alpha-ketoacid decarboxylase is derived from Lactococcus lactis lactis KF147. In some embodiments, a mutated alpha-ketoacid decarboxylase as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 68 or active fragments or homologues thereof.


In some embodiments, the mutated alpha-ketoacid decarboxylase is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 70 or 79.


In some embodiments, the starting aldehyde of any one of the pathways disclosed herein is acetaldehyde which is a common central metabolite, or may be produced by decarboxylation of pyruvate.


In some embodiments, acetaldehyde is produced by the decarboxylation of pyruvate by pyruvate decarboxylase (PDC) (EC 4.1.1.1) to yield acetaldehyde and carbon dioxide. In some embodiments, the non-naturally occurring microorganism comprises a decarboxylase capable of the decarboxylaion of pyruvate to yield acetaldehyde and carbon dioxide. PDC from S. cerevisiae has a broad substrate range for aliphatic 2-keto acids. It has been extensively studied, engineered, and expressed in E. coli [Candy, J. M., Duggleby, R. G., & Mattick. J. S. (1991). Expression of active yeast pyruvate decarboxylase has also been studied. Journal of General Microbiology, (137), 5-9; Killenberg-Jabs, M., König, S., Hohmann, S., & Hübner, G. (1996). Purification and characterization of the pyruvate decarboxylase from a haploid strain of S. cerevisiae has been reported. [Biological Chemistry Hoppe-Seyler, 377(5), 313-7]. PDC from Zymomonas mobilis also has a broad substrate range for 2-keto acids, and has been extensively studied and expressed in Escherichia coli [Pohl, M., Siegert, P., Mesch, K., Bruhn, H., & Grotzinger, J. (1998). The role of residues glutamate-50 and phenylalanine-496 in Zynomonas mobilis pyruvate decarboxylase, the promoter and nucleotide sequences of the Zymomonas mobilis pyruvate decarboxylase, and the active site mutants of pyruvate decarboxylase from Zymomonas mobilis have been studied. [The Biochemical Journal, 315, Pt 3, 745-51. Conway. T., Osman, Y. a, Konnan, J. I., Hoffmann, E. M., & Ingram, L. O. (1987); Journal of Bacteriology, 169(3), 949-54. Siegert, P., Mesch, K., & Bruhn. H. (1998). Eur. J. Biochem., 257, 538-546; Candy, J. M., Koga, J., Nixon, P. F., & Duggleby, R. G. (19%)]. The sequence identifiers for the exemplary PDC described herein may be found in TABLE 9 and searched for using the GenBank accession number (GI number).













TABLE 9





Protein
Genebank ID
GI number
Organism
SEQ ID NO



















PDC
WP_011241152
499560369

Zymomonas mobilis

71


PDC
P06169.7
30923172

Saccharomyces

72






cerevisiae




PDC
AEE86169
332660769

Arabidopsis thaliana

73


PDC
KLA18896
821638028

Bacillus cereus

74









In some embodiments, PDCs as described herein are derived from microorganisms of the genus Zymomonas. In some embodiments, the PDC is derived from Zymomonas mobilis. In some embodiments, a PDC as used in a process described herein comprises an amino acid sequence of SEQ ID NO: 71 or active fragments or homologues thereof.


In some embodiments, the PDC is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 75.


Pyruvate decarboxylases have also been shown to act on pyruvate for the production of acetaldehyde may include, but are not limited to, benzoylformate decarboxylase (BFD) (EC 4.1.1.7) derived from Pseudomonas putida and branched chain alpha-ketoacid decarboxylase (KDC) derived from Lactococcus lactis [Gocke, D., Graf, T., Brosi, H., Frindi-Wosch, I., Walter, L., Möller, M., & Pohl, M. (2009). Comparative characterisation of thiamine diphosphate-dependent decarboxylases. Journal of Molecular Catalysis B: Enzymatic, 61(1-2), 30-35]. In addition, mutants of PDC and BFD that have been generated by site-directed mutagenesis including but not limited to: PDC 1472A, PDC I476F, PDC I472A/I476F, BFD A460I, BFD F464I, and BFD A460I/F464I, have also shown activity on pyruvate towards acetaldehyde formation [Siegert, P., McLeish, M. J., Baumann, M., Iding, H., Kneen, M. M., Kenyon, G. L., & Pohl, M. (2005). Exchanging the substrate specificities of pyruvate decarboxylase from Zymomonas mobilis and benzoylformate decarboxylase from Pseudomonas putida. Protein Engineering. Design & Selection: PEDS, 18(7), 345-57].


In some embodiments, the non-naturally occurring microorganism further includes: a decarboxylase capable of the decarboxylation of pyruvate to yield acetaldehyde and carbon dioxide. In some embodiments, the decarboxylase comprises a pyruvate decarboxylase (PDC), which may comprise an amino acid sequence of SEQ ID NO: 71 or an active fragment or homologue thereof benzoylformate decarboxylase (BFD), which may comprise an amino acid sequence or SEQ ID NO: 76 or active fragment or homologue thereof, or alpha-detoacid decarboxylase (KDC), which may comprise an amino acid sequence of SEQ ID NO: 68 or 77 or active fragment or homologue thereof. The microorganism may alternatively or further express an enzyme identified in TABLE 9, and/or TABLE 10.


In some embodiments, the BFD described herein comprises an amino acid sequence of SEQ ID NO: 76 or active fragment or homologue thereof, and that of the KDC of SEQ ID NO: 68 or 77 or active fragments or homologues thereof.


In some embodiments, the BFD is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 78. In some embodiments, the KDC is an enzyme comprising an amino acid sequence encoded by a DNA which comprises the nucleotide sequence of SEQ ID NO: 79 or 70. In some embodiments, the processes as described herein are carried out with live cells. In some embodiments, the processes are carried out in vitro with lysed cells or with partially, or with substantially completely purified enzyme, or with purified enzyme. In some embodiments, the processes are carried out with permeabilized cells. In some embodiments, methods are carried out in vitro and the enzyme is immobilized. For example, the enzyme may be immobilized on a support (e.g., solid array) using a linker, such as using a biotin-(strept)avidin complex.


In some embodiments, the non-naturally occurring microorganism expresses at least one enzyme identified in TABLE 10. In some embodiments, acetaldehyde is obtained by the decarboxylation of pyruvate by PDC In some embodiments, acetaldehyde is (alternatively or additionally) obtained by one or more of the reaction pathways identified in TABLE 10 below.












TABLE 10





KEGG





REACTION

EC



ID
ENZYME NAME
NUMBER
REACTION







R00025
ethylnitronate: oxygen 2-
1.13.12.16
Ethylnitronate + Oxygen +



oxidoreductase (nitrite-

Reduced FMN <=>



forming)

Acetaldehyde + Nitrite + FMN +





H2O


R00224
pyruvate carboxy-lyase
4.1.1.1
Pyruvate <=> Acetaldehyde +



(acetaldehyde-forming)

CO2


R00228
acetaldehyde: NAD+
1.2.1.10
Acetaldehyde + CoA + NAD+ <=>



oxidoreductase (CoA-

Acetyl-CoA + NADH + H+



acetylating)




R00326
acetaldehyde: acceptor
1.2.99.6
Acetaldehyde + Acceptor + H2O <=>



oxidoreductase

Acetate + Reduced acceptor


R00710
Acetaldehyde: NAD+
1.2.1.3,
Acetaldehyde + NAD+ + H2O <=>



oxidoreductase
1.2.1.5
Acetate + NADH + H+


R00711
Acetaldehyde: NADP+
1.2.1.4,
Acetaldehyde + NADP+ + H2O <=>



oxidoreductase
1.2.1.5,
Acetate + NADPH + H+




1.2.1.-



R00746
Ethanol: NADP+
1.1.1.2,
Ethanol + NADP+ <=>



oxidoreductase
1.1.1.71
Acetaldehyde + NADPH + H+


R00747
2-Phosphonoacetaldehyde
3.11.1.1
Phosphonoacetaldehyde + H2O <=>



phosphonohydrolase

Acetaldehyde +





Orthophosphate


R00748
ethanolamine-phosphate
4.2.3.2
Ethanolamine phosphate + H2O <=>



phosphate-lyase (deaminating;

Acetaldehyde + Ammonia +



acetaldehyde-forming)

Orthophosphate


R00749
ethanolamine ammonia-lyase
4.3.1.7
Ethanolamine <=> Acetaldehyde +



(acetaldehyde-forming)

Ammonia


R00750
4-hydroxy-2-oxopentanoate
4.1.3.39
Acetaldehyde + Pyruvate <=> 4-



pyruvate-lyase (acetaldehyde-

Hydroxy-2-oxopentanoate



forming)




R00751
L-threonine acetaldehyde-lyase
4.1.2.5
L-Threonine <=> Glycine +



(glycine-forming)

Acetaldehyde


R00753
(S)-lactate acetaldehyde-lyase
4.1.2.36
(S)-Lactate <=> Formate +



(formate-forming)

Acetaldehyde


R00754
ethanol: NAD+ oxidoreductase
1.1.1.1,
Ethanol + NAD+ <=>




1.1.1.71
Acetaldehyde + NADH + H+


R00755
Pyruvate decarboxylase, TPP
4.1.1.1
Acetaldehyde + Thiamin



dependent reaction

diphosphate <=> 2-(alpha-





Hydroxyethyl)thiamine





diphosphate


R00799
Nitroethane: oxygen
1.7.3.1
Nitrite + Acetaldehyde +



oxidoreductase

Hydrogen peroxide <=>





Nitroethane + Oxygen + H2O


R01019
acetaldehyde: pyrroloquinoline-
1.2.99.3
PQQ + Acetaldehyde + H2O <=>



quinone oxidoreductase

PQQH2 + Acetate


R01066
2-deoxy-D-ribose-5-phosphate
4.1.2.4
2-Deoxy-D-ribose 5-phosphate <=>



acetaldehyde-lyase (D-

D-Glyceraldehyde 3-



glyceraldehyde-3-phosphate-

phosphate + Acetaldehyde



forming)




R01410


Hydrogen cyanide +





Acetaldehyde + Ammonia <=>





alpha-Aminopropiononitrile +





H2O


R01841
17alpha-Hydroxyprogesterone
4.1.2.30
17alpha-Hydroxyprogesterone <=>



acetaldehyde-lyase

Androstenedione +





Acetaldehyde


R02345
3-Hydroxybutan-2-one: D-
2.2.1.4
Acetoin + D-Ribose 5-phosphate <=>



ribose-5-phosphate

Acetaldehyde + 1-Deoxy-



aldehydetransferase

D-altro-heptulose 7-phosphate


R03723
24R,24(1)R)-fucosterol-
4.1.2.33
((24R,24(1)R)-Fucosterol



epoxide acetaldehyde-lyase

epoxide <=> Desmosterol +



(desmosterol-forming)

Acetaldehyde


R05198
ethanol: cytochrome c
1.1.2.8
Ethanol + 2 Ferricytochrome c <=>



oxidoreductase

2 Ferrocytochrome c +





Acetaldehyde + 2H+


R05380
acetaldehyde hydro-lyase
4.2.1.112
Acetaldehyde <=> Acetylene +





H2O


R05381
diethanolamine ethanolamine-
4.3.3.-
Ethanolamine + Acetaldehyde <=>



lyase (acetaldehyde-forming)

Diethanolamine


R05382
triethanolamine
4.3.3.-
Triethanolamine <=>



diethanolamine-lyase

Diethanolamine + Acetaldehyde



(acetaldehyde-forming)




R05565

1.14.15.-
2 Atrazine + Oxygen <=> 2





Deethylatrazine + 2





Acetaldehyde


R05567

1.14.15.-
2 Deisopropylatrazine +





Oxygen <=> 2





Deisopropyldeethylatrazine + 2





Acetaldehyde


R05811

2.1.1.-
Cobalt-precorrin 5 + S-





Adenosyl-L-methionine + H2O <=>





Cobalt-precorrin 6 + S-





Adenosyl-L-homocysteine +





Acetaldehyde


R06171
L-allo-threonine acetaldehyde-
4.1.2.5,
L-Allothreonine <=> Glycine +



lyase (glycine-forming)
4.1.2.49
Acetaldehyde


R06973

4.1.1.-
3-Oxopropanoate <=>





Acetaldehyde + CO2


R07247
fluoroacetaldehyde: L-threonine
2.2.1.8
L-Threonine +



aldehydetransferase

Fluoroacetaldehyde <=>





Acetaldehyde + 4-Fluoro-L-





threonine


R07772
cobalt-precorrin 5A
3.7.1.12
Cobalt-precorrin 5A + H2O <=>



acylhydrolase

Cobalt-precorrin 5B +





Acetaldehyde


R08195
D-threonine acetaldehyde-lyase
4.1.2.42
D-Threonine <=> Glycine +



(glycine-forming)

Acetaldehyde


R08196


D-Allothreonine <=> Glycine +





Acetaldehyde


R08516
17alpha-Hydroxypregnenolone
4.1.2.30
17alpha-Hydroxypregnenolone ⇔



acetaldehyde-lyase

Dehydroepiandrosterone +





Acetaldehyde


R09127
ethanol: cytochrome c
1.1.2.7
Ethanol + 2 Ferricytochrome cL <=>



oxidoreductase

Acetaldehyde + 2





Ferrocytochrome cL + 2H+


R09156
chloroethane, donor: oxygen
1.13.12.-,
Chloroethane + Oxygen +



oxidoreductase (dechlorinating,
1.14.99.39
Reduced acceptor <=>



acetaldehyde-forming)

Acetaldehyde + Hydrochloric





acid + Acceptor + H2O


R09479
ethanol: quinone oxidoreductase
1.1.5.5
Ethanol + Ubiquinone <=>





Acetaldehyde + Ubiquinol


R09524
acetyl-CoA: acetoin O-
2.3.1.190
Acetoin + CoA + NAD+ <=>



acetyltransferase

Acetaldehyde + Acetyl-CoA +





NADH + H+


R09552
ethanol: N,N-dimethyl-4-
1.1.99.36
Ethanol + N,N-Dimethyl-4-



nitrosoaniline oxidoreductase

nitrosoaniline <=> Acetaldehyde +





4-(Hydroxylamino)-N,N-





dimethylaniline


R09959
7,8-dihydroneopterin 3′-
4.1.2.50
7,8-Dihydroneopterin 3′-



triphosphate acetaldehyde-

triphosphate + H2O <=> 6-



lyase (6-carboxy-5,6,7,8-

Carboxy-5,6,7,8-tetrahydropterin +



tetrahydropterin and

Acetaldehyde + Triphosphate



triphosphate-forming)




R10285
choline trimethylamine-lyase
4.3.99.4
Choline <=> Trimethylamine +



(acetaldehyde-forming)

Acetaldehyde + H+









In some embodiments, pyruvate is obtained by one or more of the reaction pathways identified in TABLE 11 below.












TABLE 11





Kegg





reaction ID
Enzyme Name
EC number
Reaction







R00006
acetolactate synthase
2.2.1.6
2-Acetolactate + CO2 <=> 2





Pyruvate


R00008
4-hydroxy-4-methyl-
4.1.3.17
Parapyruvate <=> 2 Pyruvate



2-oxoglutarate





aldolase




R00014
pyruvate
1.2.4.1 2.2.1.6
Pyruvate + Thiamin diphosphate <=>



dehydrogenase
4.1.1.1
2-(alpha-





Hydroxyethyl)thiamine





diphosphate + CO2


R00195
diaminopropionate
4.3.1.15
2,3-Diaminopropanoate + H2O <=>



ammonia-lyase

Pyruvate + 2 Ammonia


R00196
L-lactate
1.1.2.3
(S)-Lactate + 2 Ferricytochrome c <=>



dehydrogenase

Pyruvate + 2 Ferrocytochrome





c + 2H+


R00197
D-lactate
1.1.2.4
(R)-Lactate + 2 Ferricytochrome c <=>



dehydrogenase

Pyruvate + 2 Ferrocytochrome





c + 2H+


R00198
D-lactate
1.1.2.5
(R)-Lactate + 2 Ferricytochrome c-



dehydrogenase

553 <=> Pyruvate + 2





Ferrocytochrome c-553 + 2H+


R00199
pyruvate, water
2.7.9.2
ATP + Pyruvate + H2O <=> AMP +



dikinase

Phosphoenolpyruvate +





Orthophosphate


R00200
pyruvate kinase
2.7.1.40
ATP + Pyruvate <=> ADP +





Phosphoenolpyruvate


R00203
lactaldehyde
1.2.1.22
Methylglyoxal + NAD+ + H2O <=>



dehydrogenase
1.2.1.23
Pyruvate + NADH + H+


R00205
2-oxoaldehyde
1.2.1.49
Methylglyoxal + NADP+ + H2O <=>



dehydrogenase

Pyruvate + NADPH + H+



(NADP+)




R00206
pyruvate, phosphate
2.7.9.1
ATP + Pyruvate + Orthophosphate <=>



dikinase

AMP + Phosphoenolpyruvate +





Diphosphate


R00207
pyruvate oxidase
1.2.3.3
Pyruvate + Orthophosphate +





Oxygen <=> Acetyl phosphate +





Hydrogen peroxide + CO2


R00208
phosphoenolpyruvate
3.1.3.60
Phosphoenolpyruvate + H2O <=>



phosphatase

Pyruvate + Orthophosphate


R00209
pyruvate
1.2.4.1 1.8.1.4
Pyruvate + CoA + NAD+ <=>



dehydrogenase
2.3.1.12
Acetyl-CoA + CO2 + NADH + H+



(acetyl-transferring)




R00210
pyruvate
1.2.1.51
Pyruvate + CoA + NADP+ <=>



dehydrogenase

Acetyl-CoA + CO2 + NADPH +



(NADP+)

H+


R00211
pyruvate oxidase
1.2.3.6
Pyruvate + CoA + Oxygen <=>



(CoA-acetylating)

Hydrogen peroxide + Acetyl-CoA





CO2


R00212
formate C-
2.3.1.54
Acetyl-CoA + Formate <=> CoA +



acetyltransferase

Pyruvate


R00213
carbamoyl-serine
4.3.1.13
O-Carbamoyl-L-serine + H2O <=>



ammonia-lyase

Pyruvate + 2 Ammonia + CO2


R00214
malate
1.1.1.38
S)-Malate + NAD+ <=> Pyruvate +



dehydrogenase
1.1.1.39
CO2 + NADH + H+



(oxaloacetate-





decarboxylating)




R00215
D-malate
1.1.1.83
(R)-Malate + NAD+ <=> Pyruvate +



dehydrogenase

CO2 + NADH + H+



(decarboxylating)




R00216
malate
1.1.1.40
(S)-Malate + NADP+ <=>



dehydrogenase

Pyruvate + CO2 + NADPH + H+



(oxaloacetate-





decarboxylating)





(NADP+)




R00217
malate
1.1.1.38
Oxaloacetate <=> Pyruvate + CO2



dehydrogenase
1.1.1.40




(oxaloacetate-
4.1.1.3




decarboxylating)




R00218
acetylenedicarboxylate
4.1.1.78
Acetylenedicarboxylate + H2O <=>



decarboxylase

Pyruvate + CO2


R00219

4.1.1.-
2-Hydroxyethylenedicarboxylate





carboxy-lyase (pyruvate-forming


R00220
L-serine ammonia-
4.3.1.17
L-Serine <=> Pyruvate +



lyase
4.3.1.19
Ammonia


R00221
D-serine ammonia-
4.3.1.18
D-Serine <=> Pyruvate +



lyase

Ammonia


R00223
diaminopropionate
4.3.1.15
Serine <=> Pyruvate + Ammonia



ammonia-lyase




R00224
pyruvate
4.1.1.1
Pyruvate <=> Acetaldehyde + CO2



decarboxylase




R00226
acetolactate synthase
2.2.1.6
(S)-2-Acetolactate + CO2 <=> 2





Pyruvate


R00237
(S)-citramalyl-CoA
4.1.3.25
(3S)-Citramalyl-CoA <=> Acetyl-



lyase

CoA + Pyruvate


R00258
alanine transaminase
2.6.1.2
L-Alanine + 2-Oxoglutarate <=>





Pyruvate + L-Glutamate


R00297
D-lactate
1.1.99.6
(R)-Lactate + Acceptor <=>



dehydrogenase

Pyruvate + Reduced acceptor



(acceptor)




R00324
acetylpyruvate
3.7.1.6
Acetylpyruvate + H2O <=>



hydrolase

Acetate + Pyruvate


R00325
citramalate lyase
4.1.3.22
(S)-2-Methylmalate <=> Acetate +





Pyruvate


R00344
pyruvate
6.4.1.1
ATP + Pyruvate + HCO3− <=>



carboxylase

ADP + Orthophosphate +





Oxaloacetate


R00350
4-hydroxy-4-methyl-
4.1.3.17
4-Carboxy-4-hydroxy-2-



2-oxoglutarate

oxoadipate <=> Oxaloacetate +



aldolase

Pyruvate


R00353
methylmalonyl-CoA
2.1.3.1
Malonyl-CoA + Pyruvate <=>



carboxytransferase

Acetyl-CoA + Oxaloacetate


R00368
strombine
1.5.1.22
N-(Carboxymethyl)-D-alanine +



dehydrogenase

NAD+ + H2O <=> Glycine +





Pyruvate + NADH + H+


R00369
alanine-glyoxylate
2.6.1.44
L-Alanine + Glyoxylate <=>



transaminase

Pyruvate + Glycine


R00396
alanine
1.4.1.1
L-Alanine + NAD+ + H2O <=>



dehydrogenase

Pyruvate + Ammonia + NADH +





H+


R00398
alanopine
1.5.1.17
Alanopine + NAD+ + H2O <=> L-



dehydrogenase

Alanine + Pyruvate + NADH + H+


R00400
alanine-oxo-acid
2.6.1.12
L-Alanine + Oxaloacetate <=>



transaminase

Pyruvate + L-Aspartate


R00409
methylisocitrate
4.1.3.30
(2S,3R)-3-Hydroxybutane-1,2,3-



lyase

tricarboxylate <=> Pyruvate +





Succinate


R00430
pyruvate kinase
2.7.1.40
GTP + Pyruvate <=> GDP +





Phosphoenolpyruvate


R00452
D-lysopine
1.5.1.16
D-Lysopine + NADP+ + H2O <=>



dehydrogenase

L-Lysine + Pyruvate + NADPH +





H+


R00453
lysine-pyruvate 6-
2.6.1.71
L-Lysine + Pyruvate <=> L-2-



transaminase

Aminoadipate 6-semialdehyde +





L-Alanine


R00470
4-hydroxy-2-
4.1.3.16
4-Hydroxy-2-oxoglutarate <=>



oxoglutarate
4.1.3.42
Pyruvate + Glyoxylate



aldolase




R00471
4.1.3.16
4.1.3.16
(4R)-4-Hydroxy-2-oxoglutarate <=>





Pyruvate + Glyoxylate


R00532
serine-sulfate
4.3.1.10
L-Serine O-sulfate + H2O <=>



ammonia-lyase

Pyruvate + Ammonia + Sulfate


R00543
acylpyruvate
3.7.1.5
3-Acylpyruvate + H2O <=>



hydrolase

Carboxylate + Pyruvate


R00562
D-octopine
1.5.1.11
D-Octopine + NAD+ + H2O <=>



dehydrogenase

L-Arginine + Pyruvate + NADH +





H+


R00572
pyruvate kinase
2.7.1.40
CTP + Pyruvate <=> CDP +





Phosphoenolpyruvate


R00576
glutamine-
2.6.1.15
L-Glutamine + Pyruvate <=> 2-



pyruvate

Oxoglutaramate + L-Alanine



transaminase




R00585
serine-pyruvate
2.6.1.51
L-Serine + Pyruvate <=>



transaminase

Hydroxypyruvate + L-Alanine


R00659
pyruvate kinase
2.7.1.40
UTP + Pyruvate <=> UDP +





Phosphoenolpyruvate


R00666
N5-
1.5.1.24
N5-(L-1-Carboxyethyl)-L-



(carboxyethyl)ornithine

ornithine + NADP+ + H2O <=> L-



synthase

Ornithine + Pyruvate + NADPH +





H+


R00673
Tryptophanase
4.1.99.1
L-Tryptophan + H2O <=> Indole +





Pyruvate + Ammonia


R00692
phenylalanine(histidine)
2.6.1.58
L-Phenylalanine + Pyruvate <=>



transaminase

Phenylpyruvate + L-Alanine


R00703
L-lactate
1.1.1.27
(S)-Lactate + NAD+ <=> Pyruvate +



dehydrogenase

NADH + H+


R00704
D-lactate
1.1.1.28
(R)-Lactate + NAD+ <=> Pyruvate +



dehydrogenase

NADH + H+


R00724
pyruvate kinase
2.7.1.40
ITP + Pyruvate <=> IDP +





Phosphoenolpyruvate


R00728
tyrosine phenol-
4.1.99.2
L-Tyrosine + H2O <=> Phenol +



lyase

Pyruvate + Ammonia


R00750
4-hydroxy-2-
4.1.3.39
Acetaldehyde + Pyruvate <=> 4-



oxovalerate aldolase

Hydroxy-2-oxopentanoate


R00782
cystathionine
4.4.1.1 4.4.1.8
L-Cysteine +H2O <=> Hydrogen



gamma-lyase
4.4.1.28
sulfide + Pyruvate + Ammonia


R00906
beta-alanopine
1.5.1.26
beta-Alanopine + NAD+ + H2O <=>



dehydrogenase

beta-Alanine + Pyruvate +





NADH + H+


R00907
beta-alanine-
2.6.1.18
L-Alanine + 3-Oxopropanoate <=>



pyruvate

Pyruvate + beta-Alanine



transaminase




R00930
methylmalonyl-CoA
2.1.3.1
(S)-Methylmalonyl-CoA +



carboxytransferase

Pyruvate <=> Propanoyl-CoA +





Oxaloacetate


R00985
anthranilate synthase
4.1.3.27
Chorismate + Ammonia <=>





Anthranilate + Pyruvate + H2O


R00986
anthranilate synthase
4.1.3.27
Chorismate + L-Glutamine <=>





Anthranilate + Pyruvate + L-





Glutamate


R01012
phosphoenolpyruvate-
2.7.1.121
Phosphoenolpyruvate + Glycerone <=>



glycerone

Pyruvate + Glycerone



phosphotransferase

phosphate


R01031
3-chloro-D-alanine
4.5.1.2
3-Chloro-D-alanine + H2O <=>



dehydrochlorinase

Pyruvate + Hydrochloric acid +





Ammonia


R01032

unclear
Cl− + Pyruvate + Ammonia <=> 3-




reaction
Chloro-L-alanine + H2O


R01064
2-dehydro-3-deoxy-
4.1.2.21
2-Dehydro-3-deoxy-6-phospho-D-



6-
4.1.2.55
galactonate <=> Pyruvate + D-



phosphogalactonate

Glyceraldehyde 3-phosphate



aldolase




R01085
acylpyruvate
3.7.1.5
3-Fumarylpyruvate + H2O <=>



hydrolase
3.7.1.20
Fumarate + Pyruvate


R01138
pyruvate kinase
2.7.1.40
dATP + Pyruvate <=> dADP +





Phosphoenolpyruvate


R01147
Pyridoxamine-5′-

Pyridoxamine phosphate +



phosphate: 2-

Pyruvate <=> Pyridoxal phosphate +



oxoglutarate

D-Alanine



aminotransferase




R01148
D-amino-acid
2.6.1.21
D-Alanine + 2-Oxoglutarate <=>



transaminase

Pyruvate + D-Glutamate


R01196
pyruvate synthase
1.2.7.1
2 Reduced ferredoxin + Acetyl-




1.2.7.11
CoA + CO2 + 2H+ <=> 2





Oxidized ferredoxin + Pyruvate +





CoA


R01215
valine-pyruvate
2.6.1.66
L-Valine + Pyruvate <=> 3-



transaminase

Methyl-2-oxobutanoic acid + L-





Alanine


R01261
alanine-oxo-acid
2.6.1.12
L-Alanine + 2-Oxo acid <=>



transaminase

Pyruvate + L-Amino acid


R01286
cystathionine beta-
4.4.1.8
L-Cystathionine + H2O <=> L-



lyase

Homocysteine + Ammonia +





Pyruvate


R01302
chorismate lyase
4.1.3.40
4-Hydroxybenzoate + Pyruvate <=>





Chorismate


R01335

3.7.1.-
Glycolate + Pyruvate <=> 5-





Hydroxy-2,4-dioxopentanoate +





H2O


R01344
D-amino-acid
2.6.1.21
D-Amino acid + Pyruvate <=> 2-



transaminase

Oxo acid + D-Alanine


R01355
2,3-dimethylmalate
4.1.3.32
(2R,3S)-2,3-Dimethylmalate <=>



lyase

Propanoate + Pyruvate


R01447
lactate-malate
1.1.99.7
(S)-Lactate + Oxaloacetate <=>



transhydrogenase

(S)-Malate + Pyruvate


R01576
3-deoxy-D-manno-
4.1.2.23
3-Deoxy-D-manno-octulosonate <=>



octulosonate

Pyruvate + D-Arabinose



aldolase




R01584
N-methylalanine
1.4.1.17
N-Methyl-L-alanine + H2O +



dehydrogenase

NADP+ <=> Pyruvate +





Methylamine + NADPH + H+


R01645
4-hydroxy-2-
4.1.2.52
4-Hydroxy-2-oxo-heptanedioate <=>



oxoheptanedioate

Succinate semialdehyde +



aldolase

Pyruvate


R01647
4-hydroxy-2-
4.1.2.52
Succinate semialdehyde +



oxoheptanedioate

Pyruvate <=> 2,4-Dihydroxyhept-



aldolase

2-enedioate


R01683
tauropine
1.5.1.23
Tauropine + NAD+ + H2O <=>



dehydrogenase

Taurine + Pyruvate + NADH + H+


R01699
pyruvate
1.2.4.1
Pyruvate + Enzyme N6-



dehydrogenase

(lipoyl)lysine <=>



(acetyl-transferring)

[Dihydrolipoyllysine-residue





acetyltransferase] S-





acetyldihydrolipoyllysine + CO2


R01712
pyridoxamine-
2.6.1.30
Pyridoxamine + Pyruvate <=>



pyruvate

Pyridoxal + L-Alanine



transaminase




R01782
2-dehydro-3-deoxy-
4.1.2.28
2-Dehydro-3-deoxy-D-xylonate <=>



D-pentonate aldolase

Pyruvate + Glycolaldehyde


R01784
2-dehydro-3-deoxy-
4.1.2.18
2-Dehydro-3-deoxy-L-arabinonate <=>



L-pentonate aldolase

Pyruvate + Glycolaldehyde


R01811
N-acetylneuraminate
4.1.3.3
N-Acetylneuraminate <=> N-



lyase

Acetyl-D-mannosamine +





Pyruvate


R01858
pyruvate kinase
2.7.1.40
dGTP + Pyruvate <=> dGDP +





Phosphoenolpyruvate


R01874
D-cysteine
4.4.1.15
D-Cysteine + H2O <=> Hydrogen



desulfhydrase

sulfide + Ammonia + Pyruvate


R02050
(R)-3-amino-2-
2.6.1.40
(R)-3-Amino-2-methylpropanoate +



methylpropionate-

Pyruvate <=> 2-Methyl-3-



pyruvate

oxopropanoate + L-Alanine



transaminase




R02261
2-keto-3-deoxy-L-
4.1.2.53
2-Dehydro-3-deoxy-L-rhamnonate <=>



rhamnonate aldolase

(S)-Lactaldehyde + Pyruvate


R02271
aminolevulinate
2.6.1.43
5-Aminolevulinate + Pyruvate <=>



transaminase

4,5-Dioxopentanoate + L-Alanine


R02293
diaminobutyrate-
2.6.1.46
L-2,4-Diaminobutanoate +



pyruvate

Pyruvate <=> L-Aspartate 4-



transaminase

semialdehyde + L-Alanine


R02320
pyruvate kinase
2.7.1.40
Nucleoside triphosphate +





Pyruvate <=> NDP +





Phosphoenolpyruvate


R02408
cystathionine
4.4.1.1 4.4.1.8
L-Cystine + H2O <=> Pyruvate +



gamma-lyase

Ammonia + Thiocysteine


R02628
phosphoenolpyruvate-
2.7.3.9
Phosphoenolpyruvate + Protein



protein

histidine <=> Pyruvate + Protein



phosphotransferase

N(pi)-phospho-L-histidine


R02721

unclear
D-Glyceraldehyde 3-phosphate +




reaction
Pyruvate <=> 5-(2-Hydroxyethyl)-





4-methylthiazole


R02754
2-dehydro-3-
4.1.2.20
5-Dehydro-4-deoxy-D-glucarate <=>



deoxyglucarate

Pyruvate + 2-Hydroxy-3-



aldolase

oxopropanoate


R02953
S-alkylcysteine lyase
4.4.1.6
S-Alkyl-L-cysteine + H2O <=>





Alkyl thiol + Ammonia + Pyruvate


R02970
alanine-
2.6.1.47
Pyruvate + Aminomalonate <=> L-



oxomalonate

Alanine + Oxomalonate



transaminase




R03001
D-methionine-
2.6.1.41
D-Methionine + Pyruvate <=> 4-



pyruvate

Methylthio-2-oxobutanoic acid +



transaminase

L-Alanine


R03037
Isochorismatase
3.3.2.1
Isochorismate + H2O <=> (2S,3S)-





2,3-Dihydro-2,3-





dihydroxybenzoate + Pyruvate


R03050
acetolactate synthase
2.2.1.6
2-Acetolactate + Thiamin





diphosphate <=> 2-(alpha-





Hydroxyethyl)thiamine





diphosphate + Pyruvate


R03081
2-dehydro-3-deoxy-
4.1.2.18
2-Dehydro-3-deoxy-D-fuconate <=>



L-pentonate aldolase

Pyruvate + (R)-Lactaldehyde


R03105
3-mercaptopyruvate
2.8.1.2
Mercaptopyruvate + Sulfite <=>



sulfurtransferase

Thiosulfate + Pyruvate


R03106
3-mercaptopyruvate
2.8.1.2
Hydrogen cyanide +



sulfurtransferase

Mercaptopyruvate <=>





Thiocyanate + Pyruvate


R03145
pyruvate
1.2.5.1
Pyruvate + Ubiquinone + H2O <=>



dehydrogenase

Acetate + Ubiquinol + CO2



(quinone)




R03277
2-dehydro-3-
4.1.2.20
2-Hydroxy-3-oxopropanoate +



deoxyglucarate

Pyruvate <=> 2-Dehydro-3-deoxy-



aldolase

D-glucarate


R03465
4-(2-
4.1.2.34
(3E)-4-(2-Carboxyphenyl)-2-



carboxyphenyl)-2-

oxobut-3-enoate + H2O <=> 2-



oxobut-3-enoate

Carboxybenzaldehyde + Pyruvate



aldolase




R03502
1D-1-guanidino-3-
2.6.1.56
1D-1-Guanidino-3-amino-1,3-



amino-1,3-dideoxy-

dideoxy-scyllo-inositol + Pyruvate <=>



scyllo-inositol

1D-1-Guanidino-1-deoxy-3-



transaminase

dehydro-scyllo-inositol + L-Alanine


R03528
cysteine-S-conjugate
4.4.1.13
L-Cysteine-S-conjugate + H2O <=>



beta-lyase

Aryl thiol + Ammonia +





Pyruvate


R03732
opine dehydrogenase
1.5.1.28
(2S)-2-{[1-(R)-





Carboxyethyl]amino}pentanoate +





NAD+ + H2O <=> L-Norvaline +





Pyruvate + NADH + H+


R03796
oximinotransferase
2.6.3.1
Pyruvate oxime + Acetone <=>





Pyruvate + Acetone oxime


R03854
2,2-dialkylglycine
4.1.1.64
2,2-Dialkylglycine + Pyruvate <=>



decarboxylase

Dialkyl ketone + CO2 + L-Alanine



(pyruvate)




R04152
2-
2.6.1.37
2-Aminoethylphosphonate +



aminoethylphosphonate-

Pyruvate <=>



pyruvate

Phosphonoacetaldehyde + L-Alanine



transaminase




R04187
beta-alanine-
2.6.1.18
L-Alanine + (S)-Methylmalonate



pyruvate

semialdehyde <=> Pyruvate + L-3-



transaminase

Aminoisobutanoate


R04672
acetolactate synthase
2.2.1.6
(S)-2-Acetolactate + Thiamin





diphosphate <=> 2-(alpha-





Hydroxyethyl)thiamine





diphosphate + Pyruvate


R04861

non-enzymatic
3-Sulfinylpyruvate + H2O <=>





Sulfite + Pyruvate


R04941
cystathionine beta-
4.4.1.8
L-Selenocystathionine + H2O <=>



lyase

Selenohomocysteine + Ammonia +





Pyruvate


R05136
trans-o-
4.1.2.45
Salicylaldehyde + Pyruvate <=>



hydroxybenzylidene

trans-o-



pyruvate hydratase-

Hydroxybenzylidenepyruvate +



aldolase

H2O


R05298
4-hydroxy-2-
4.1.3.43
Propanal + Pyruvate <=> 4-



oxohexanoate

Hydroxy-2-oxohexanoic acid



aldolase




R05538

aldorase
2-Hydroxy-4-(1-oxo-1,3-dihydro-




reaction
2H-inden-2-ylidene)-but-2-enoic





acid + H2O <=> 2-Formyl-1-





indanone + Pyruvate


R05544


2-Hydroxy-4-(2-oxo-1,3-dihydro-





2H-inden-1-ylidene)but-2-enoic





acid + H2O <=> 1-Formyl-2-





indanone + Pyruvate


R05553
aminodeoxychorismate
4.1.3.38
4-Amino-4-deoxychorismate <=>



lyase

4-Aminobenzoate + Pyruvate


R05605
2-dehydro-3-deoxy-
4.1.2.14
2-Dehydro-3-deoxy-6-phospho-D-



phosphogluconate
4.1.2.55
gluconate <=> D-Glyceraldehyde



aldolase

3-phosphate + Pyruvate


R05636
1-deoxy-D-xylulose-
2.2.1.7
Pyruvate + D-Glyceraldehyde 3-



5-phosphate

phosphate <=> 1-Deoxy-D-



synthase

xylulose 5-phosphate + CO2


R05648

4.1.2.-
cis-4-(1′-Hydroxynaphth-2′-yl)-2-





oxobut-3-enoate + H2O <=> 1-





Hydroxy-2-naphthaldehyde +





Pyruvate


R05652
taurine-pyruvate
2.6.1.77
Taurine + Pyruvate <=>



aminotransferase

Sulfoacetaldehyde + L-Alanine


R05861
D-amino-acid
1.4.3.3
D-Alanine + H2O + Oxygen <=>



oxidase
1.4.3.19
Pyruvate + Ammonia + Hydrogen





peroxide


R06602
isochorismate lyase
4.2.99.21
Isochorismate <=> Salicylate +





Pyruvate


R06603


Chorismate <=> Salicylate +





Pyruvate


R06913

4.2.1.-
2-Hydroxy-3-





methylbenzalpyruvate + H2O <=>





3-Methylsalicylaldehyde +





Pyruvate


R06923

4.2.1.-
2-Hydroxy-3-





carboxybenzalpyruvate + H2O <=>





3-Formylsalicylic acid +





Pyruvate


R06934

4.2.1.-
2-Hydroxy-4-





hydroxymethylbenzalpyruvate +





H2O <=> 4-





Hydroxymethylsalicylaldehyde +





Pyruvate


R07399
(R)-citramalate
2.3.1.182
Acetyl-CoA + Pyruvate + H2O <=>



synthase

(R)-2-Methylmalate + CoA


R07633
(2R)-sulfolactate
4.4.1.24
3-Sulfolactate <=> Pyruvate +



sulfo-lyase

HSO3−


R07634
L-cysteate sulfo-
4.4.1.25
L-Cysteate + H2O <=> Pyruvate +



lyase

HSO3− + Ammonia


R07685


(Z)-4-(2-Hydroxy-5-





sulfonatophenyl)-2-oxo-3-





butenoate + 2H2O <=> 5-





Sulfosalicylate + Pyruvate


R07692


4-(3-Hydroxy-2-naphthyl)-2-





oxobut-3-enoic acid + 2H2O <=>





3-Hydroxy-2-naphthoate +





Pyruvate


R07713
4-(2-
4.1.2.34
(3Z)-4-(2-Carboxyphenyl)-2-



carboxyphenyl)-2-

oxobut-3-enoate + H2O <=> 2-



oxobut-3-enoate

Carboxybenzaldehyde + Pyruvate



aldolase




R08166
2-succinyl-6-
4.2.99.20
2-Succinyl-5-enolpyruvyl-6-



hydroxy-2,4-

hydroxy-3-cyclohexene-1-



cyclohexadiene-1-

carboxylate <=> (1R,6R)-6-



carboxylate synthase

Hydroxy-2-succinylcyclohexa-2,4-





diene-1-carboxylate + Pyruvate


R08170

4.4.1.-
S-





(Indolylmethylthiohydroximoyl)-





L-cysteine + H2O <=>





Indolylmethylthiohydroximate +





Pyruvate + Ammonia


R08197
arginine-pyruvate
2.6.1.84
L-Arginine + Pyruvate <=> 5-



transaminase

Guanidino-2-oxopentanoate + L-





Alanine


R08200
phosphonopyruvate
3.11.1.3
3-Phosphonopyruvate + H2O <=>



hydrolase

Pyruvate + Orthophosphate


R08253

unclear
Isoniazid + Pyruvate <=> Isoniazid




reaction
pyruvate + H2O


R08570
2-dehydro-3-deoxy-
4.1.2.51
2-Dehydro-3-deoxy-D-gluconate <=>



D-gluconate aldolase
4.1.2.55
D-Glyceraldehyde + Pyruvate


R08648
acetolactate synthase
2.2.1.6
Pyruvate + 2-Oxobutanoate <=>





(S)-2-Aceto-2-hydroxybutanoate +





CO2


R08654

4.4.1.-
S-(Phenylacetothiohydroximoyl)-





L-cysteine + H2O <=>





Phenylacetothiohydroximate +





Pyruvate + Ammonia


R08660

4.4.1.-
S-





(Hydroxyphenylacetothiohydroximoyl)-





L-cysteine + H2O <=> p-





Hydroxyphenylacetothiohydroximate +





Pyruvate + Ammonia


R08667

4.4.1.-
S-(4-





Methylthiobutylthiohydroximoyl)-





L-cysteine + H2O <=> 4-





Methylthiobutylthiohydroximate +





Pyruvate + Ammonia


R08686

4.4.1.-
S-Alkyl-thiohydroximate + H2O <=>





Thiohydroximic acid +





Pyruvate + Ammonia


R08698

spontaneous
Dehydroalanine + H2O <=>





Pyruvate + Ammonia


R08714

2.6.1.-
Putrescine + Pyruvate <=> 4-





Aminobutyraldehyde + L-Alanine


R09048


3 Cadaverine + 4 Pyruvate <=> 17-





Oxosparteine + 4 L-Alanine + 3





H2O


R09088


Benzoate + Pyruvate <=>





Pyruvophenone + CO2


R09238

3.7.1.-
cis,cis-2,4-Dihydroxy-5-methyl-6-





oxo-2,4-hexadienoate + H2O <=>





(S)-Methylmalonate semialdehyde +





Pyruvate


R09254
phenylalanine(histidine)
2.6.1.58
L-Tyrosine + Pyruvate <=> 3-(4-



transaminase

Hydroxyphenyl)pyruvate + L-





Alanine


R09366
cystathionine
4.4.1.1
Se-Methyl-L-selenocysteine +



gamma-lyase
4.4.1.13
H2O <=> Pyruvate + Ammonia +





Methaneselenol


R10147
4-hydroxy-
4.3.3.7
L-Aspartate 4-semialdehyde +



tetrahydrodipicolinate

Pyruvate <=> (2S,4S)-4-Hydroxy-



synthase

2,3,4,5-tetrahydrodipicolinate +





H2O


R10178
4-aminobutyrate-
2.6.1.96
4-Aminobutanoate + Pyruvate <=>



pyruvate

Succinate semialdehyde + L-



transaminase

Alanine


R10180
L-tryptophan-
2.6.1.99
L-Tryptophan + Pyruvate <=>



pyruvate

Indolepyruvate + L-Alanine



aminotransferase




R10283
4-hydroxy-2-
4.1.3.16
(4S)-4-Hydroxy-2-oxoglutarate <=>



oxoglutarate
4.1.3.42
Pyruvate + Glyoxylate



aldolase




R10487
(S)-dichlorprop
1.14.11.43
(S)-(2,4-



dioxygenase (2-

Dichlorophenoxy)propanoate + 2-



oxoglutarate)

Oxoglutarate + Oxygen <=> 2,4-





Dichlorophenol + Pyruvate +





Succinate + CO2


R10527
(S)-dichlorprop
1.14.11.43
(S)-2-(4-Chloro-2-



dioxygenase (2-

methylphenoxy)propanoate + 2-



oxoglutarate)

Oxoglutarate + Oxygen <=> 4-





Chloro-2-methylphenol + Pyruvate +





Succinate + CO2


R10539
(R)-dichlorprop
1.14.11.44
Mecoprop-P + 2-Oxoglutarate +



dioxygenase (2-

Oxygen <=> 4-Chloro-2-



oxoglutarate)

methylphenol + Pyruvate +





Succinate + CO2


R10542
(R)-dichlorprop
1.14.11.44
(R)-(2,4-



dioxygenase (2-

Dichlorophenoxy)propanoate + 2-



oxoglutarate)

Oxoglutarate + Oxygen <=> 2,4-





Dichlorophenol + Pyruvate +





Succinate + CO2


R10550
L-threo-3-deoxy-
4.1.2.54
2-Dehydro-3-deoxy-L-galactonate <=>



hexylosonate

Pyruvate + L-Glyceraldehyde



aldolase




R10552
tRNA 4-
4.1.3.44
N1-Methylguanine in tRNA(Phe) +



demethylwyosine

Pyruvate + S-Adenosyl-L-



synthase (AdoMet-

methionine <=> 4-



dependent

Demethylwyosine in tRNA(Phe) +





L-Methionine + 5'-





Deoxyadenosine + CO2 + H2O


R10583
Chorismatase
3.3.2.13
Chorismate + H2O <=> (4R,5R)-





4,5-Dihydroxycyclohexa-1(6),2-





diene-1-carboxylate + Pyruvate


R10597
3-hydroxybenzoate
4.1.3.45
Chorismate <=> 3-



synthase

Hydroxybenzoate + Pyruvate


R10616
2-dehydro-3-deoxy-
4.1.2.51
2-Dehydro-3-deoxy-D-galactonate <=>



D-gluconate aldolase
4.1.2.55
Pyruvate + D-Glyceraldehyde


R10674
(R)-citramalyl-CoA
4.1.3.46
(3R)-Citramalyl-CoA <=> Acetyl-



lyase

CoA + Pyruvate


R10691

3.7.1.-
2,4-Diketo-3-deoxy-L-fuconate +





H2O <=> (S)-Lactate + Pyruvate


R10866

1.2.7.-
Pyruvate + CoA + Oxidized





flavodoxin <=> Acetyl-CoA +





CO2 + Reduced flavodoxin


R10992
alanine-glyoxylate
2.6.1.44
L-Alanine + 2-Oxobutanoate <=>



transaminase

Pyruvate + (S)-2-Aminobutanoate


R11023

4.4.1.-
S-(Hercyn-2-yl)-L-cysteine





S-oxide <=> Ergothioneine +





Pyruvate + Ammonia


R11038
(5-formylfuran-3-
2.6.1.108
(5-Formylfuran-3-yl)methyl



yl)methyl phosphate

phosphate + L-Alanine <=> [5-



transaminase

(Aminomethyl)furan-3-yl]methyl





phosphate + Pyruvate


R11099


2-Iminopropanoate + H2O <=>





Pyruvate + Ammonia


R11141
3-acetyloctanal
2.2.1.12
Pyruvate + trans-2-Octenal <=>



synthase

(S)-3-Acetyloctanal + CO2


R11213
2-dehydro-3,6-
4.1.2.58
2-Dehydro-3,6-dideoxy-6-sulfo-D-



dideoxy-6-

gluconate <=> (2S)-3-



sulfogluconate

Sulfolactaldehyde + Pyruvate



aldolase




R11257
maleylpyruvate
3.7.1.23
Maleylpyruvate + H2O <=>



hydrolase

Maleic acid + Pyruvate









Microorganisms expressing enzyme(s) as described herein may be provided in various forms, including live forms e.g. in an aqueous solution or in culture medium or in “resting” forms such as in a freeze dried or tablet form.


In some embodiments, the method is carried out in culture, with one or more host microorganisms, producing the pathway enzyme(s).


In some embodiments, there is provided a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase as described herein capable of condensing two aldehyde molecules.


In some embodiments, there is provided a trans-2-unsaturated aldehyde pathway comprising a condensation step and a dehydration step. In some embodiments, the condensation step comprises the condensation of two aldehyde molecules to a 3-hydroxy aldehyde by using an aldolase. In some embodiments, the dehydration step comprises dehydration of the 3-hydroxy aldehyde to a trans-2-unsaturated aldehyde. In some embodiments, the dehydration step is via spontaneous dehydration. In some embodiments, the dehydration step comprises a dehydratase. In some embodiments, dehydration is aided by dehydratase.


In some embodiments, there is provided a non-naturally occurring microorganism having a trans-2-unsaturated aldehyde pathway, wherein the microorganism comprises at least one of the following trans-2-unsaturated aldehyde pathway enzymes: an aldolase that catalyzes condensation of two aldehydes to produce 3-hydroxy aldehyde; and a dehydratase that dehydrates the 3-hydroxy aldehyde to trans-2-unsaturated aldehyde (the dehydration step). In some embodiments, the dehydration step is via spontaneous dehydration. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified, or naturally-occurring microorganism, or wild-type microorganism. In some embodiments, the trans-2-unsaturated aldehyde is converted to trans-2-unsaturated alcohol or trans-2-unsaturated carboxylic acids by using aldoketoreductases, oxidoreductases, aldehyde reductases or alcohol dehydrogenases. In some embodiments, the microorganism includes at least one exogenous nucleic acid encoding an enzyme from the trans-2-unsaturated aldehyde pathway. In some embodiments, the microorganism further includes a PDC for decarboxylation of pyruvate to yield acetaldehyde and carbon dioxide. In some embodiments, the microorganism expresses an enzyme identified in TABLE 9 and/or TABLE 10 or an active fragment or homologue thereof for producing acetaldehyde. In some embodiments, at least one or all the nucleic acids are exogenous to the host microorganism. In some embodiments, the aldolase is DERA. In some embodiments, the aldolase enzyme is a modified aldolase enzyme comprising an increased enzymatic activity in comparison with a corresponding unmodified or wild-type aldolase enzyme. In some embodiments, the non-naturally occurring microorganism expresses a modified aldolase. In some embodiments, the modified aldolase is a modified DERA. In some embodiments, the microorganism expressing a modified aldolase comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type or naturally occurring, or unmodified microorganism. In some embodiments, the non-naturally occurring microorganism is derived from a wild-type or unmodified microorganism and the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type microorganism.


In some embodiments, trans-2-unsaturated aldehyde is produced by direct feeding of precursor aldehydes to cells expressing only aldolase (e.g., DERA or a similar aldolase). In some embodiments, trans-2-unsaturated aldehyde is produced from glucose. In some embodiments, precursor aldehydes are produced by the cell (e.g., via engineered pathways). In some embodiments, the engineered pathways require modification (e.g., expression or overexpression) of at least one gene. In some embodiments, strains (e.g., 2 different strains) are optimized to include at least one gene deletion(s). In some embodiments, acetaldehyde is produced in a cell (e.g., from glucose). In some embodiments, precursor aldehyde molecules (e.g., two precursor aldehydes) are produced from glucose. In some embodiments, precursor aldehyde molecules (e.g., two precursor aldehydes) are fed to the cell. In some embodiments, at least one precursor aldehyde is produced from glucose and at least one precursor aldehyde is fed to the cell. In some embodiments, at least one precursor aldehyde is fed to a cell following production of acetaldehyde from glucose.


In some embodiments, the trans-2-unsaturated aldehyde is produced by feeding precursor aldehydes to a microbial culture.


In some embodiments, the aldehyde precursors used for the formation of trans-2-unsaturated aldehyde are produced from glucose by an aldehyde producing pathway.


In some embodiments, there is provided a pathway for the production of acetaldehyde from glucose. In some embodiments, production of acetaldehyde from glucose is via decarboxylation of pyruvate by pyruvate decarboxylase.


In some embodiments, there is provided a non-naturally occurring microorganism having a trans-2-unsaturated aldehyde pathway and/or having the ability to produce precursor aldehydes (e.g., aldehydes, acetaldehydes, and butyraldehydes) leading to the production of trans-2-unsaturated aldehyde(s) (e.g., trans-2-hexenal). In some embodiments, the non-naturally occurring microorganism comprises: (a) at least one of the following trans-2-unsaturated aldehyde pathway enzymes: a pyruvate decarboxylase that catalyzes decarboxylation of pyruvate to acetealdehyde, an aspartokinase that catalyzes phosphorylation of L-asparate, a homoserine kinase that catalyzes phosphorylation of L-homoserine to O-phospho-Lhomoserine, a threonine synthase that catalyzes the hydration of O-phospho-Lhomoserine to L-threonine, a threonine desaminase that catalyzes the desamination of L-threonine into 2-ketobutyrate, an isopropylmalate synthase that catalyzes the acetylation of 2-keto butyrate into 2-ethyl malate, an acetohydroxy acid isomeroreductase that catalyzes the formation of 3-ethyl malate from 2-ethylmalate via a 2-ethyl maleate intermediate, an isopropylmalate dehydrogenase that catalyzes the oxidation of 3-ethyl malate into 2-ethyl-3-oxo succinate, and an alphaketoisovalerate decarboxylase that catalyzes the decarboxylation of 2-keto acid (e.g., 2-keto-valerate) into an aldehyde (e.g., butyraldehyde); and (b) at least one of the following enzymes: an aldolase that catalyzes condensation of acetaldehyde and aldehyde (e.g., butyraldehyde) to produce 3-hydroxy aldehyde (e.g., 3-hydroxy hexanal), and a dehydratase that dehydrates the 3-hydroxy aldehyde (e.g., 3-hydroxy hexanal) to trans-2-unsaturated aldehyde (e.g., trans-2-hexenal). In some embodiments, the microorganism comprises an aldolase that is specific for condensation of acetaldehyde and aldehyde to produce 3-hydroxy aldehyde. In some embodiments, the microorganism comprises at least one exogenous nucleic acid encoding an enzyme from said trans-2-unsaturated aldehyde pathway. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In some embodiments, there is provided a pathway for the production of butyraldehyde from glucose. This can be achieved by over production of L-threonin by overexpression of aspartokinase (thrA), overexpression of homoserine kinase (thrB), and/or overexpression of threonin synthase (thrC). Subsequently threonin can be transformed into butyraldehyde using threonin desaminase (ilvA), isoproylmalate synthase (leuA), isopropylmalate dehydrogenase (leuB), acetohydroxy acid isomeroreductase (leuC, leuD) and alpha-ketoisovalerate decarboxylase (kivD). [Atsumi et al. “Non-fermentative pathways for synthesis of branched-chain higher alcohols as biofuels” NATURE Vol 451 (2008).]


In some embodiments, the microorganism further includes a pathway for producing butyraldehyde, wherein the microorganism comprises at least one of the following butyraldehyde pathway enzymes: an aspartokinase that catalyzes phosphorylation of L-asparate, a homoserine kinase that catalyzes phosphorylation of L-homoserine to O-phospho-Lhomoserine, a threonine synthase that catalyzes the hydration of O-phospho-Lhomoserine to L-threonine, a threonine desaminase that catalyzes the desamination of L-threonine into 2-ketobutyrate, an isopropylmalate synthase that catalyzes the acetylation of 2-keto butyrate into 2-ethyl malate, an acetohydroxy acid isomeroreductase that catalyzes the formation of 3-ethyl malate from 2-ethylmalate via a 2-ethyl maleate intermediate, an isopropylmalate dehydrogenase that catalyzes the oxidation of 3-ethyl malate into 2-ethyl-3-oxo succinate, and an alphaketoisovalerate decarboxylase that catalyzes the decarboxylation of 2-keto-valerate into butyraldehyde. In some embodiments, the microorganism expresses an enzyme identified in TABLE 7 or an active fragment or homologue thereof for producing butyraldehyde.


In some embodiments, the non-naturally occurring microorganism includes at least one modification (e.g., deletion) to an endogenous nucleic acid encoding an enzyme from the trans-2-unsaturated pathway or at least one modification that affects the expression of an enzyme from the trans-2-unsaturated pathway.


In some embodiments, the non-naturally occurring microorganism includes at least one modification (e.g., deletion) to an endogenous nucleic acid encoding an enzyme effecting production of the trans-2-unsaturated aldehyde or any pathway intermediate.


In some embodiments, the trans-2-unsaturated aldehyde is converted to trans-2-unsaturated alcohol or trans-2-unsaturated carboxylic acids by using aldoketoreductase, oxidoreductase, aldehyde reductase, or alcohol dehydrogenase, wherein the microorganism includes at least one exogenous nucleic acid encoding an enzyme from said trans-2-unsaturated aldehyde pathway.


In some embodiments, the microoganism is genetically modified to contain a nucleic acid encoding an alcohol dehydrogenase capable of reducing trans-2-unsaturated aldehydes to trans-2-unsaturated alcohols. In some embodiments, the microoganism is genetically modified to contain a nucleic acid encoding an oxidoreductase capable of oxidizing trans-2-unsaturated aldehydes to trans-2-unsaturated carboxylic acids.


In some embodiments, a delta lactone pathway is disclosed that comprises: 1) condensation of two aldehyde molecules to a 3-hydroxy aldehyde intermediate using an aldolase enzyme; 2) condensation of the 3-hydroxy aldehyde intermediate and an aldehyde (e.g. acetaldehyde) to a 5,3-dihydroxy aldehyde intermediate using an aldolase enzyme 3) cyclization of 5,3-dihydroxy aldehyde to form a tetrahydro-2H-pyran-2,4-diol (compound of formula IV(b)); 4) oxidation of tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one (compound of formula V): 5) dehydration of the tetrahydro-4-hydroxy-2H-pyran-2-one (compound of formula V) to a 5,6-dihydro-2H-pyran-2-one (compound of formula VI); and 6) reduction of the 5,6-dihydro-2H-pyran-2-one (compound of formula VI) to a delta-lactone (compound of formula VII) using reductase.


In some embodiments, there is provided a non-naturally occurring microorganism having a delta lactone pathway. The microorganism includes at least one of the following delta lactone pathway enzymes: an aldolase that catalyzes the condensation of two aldehyde molecules to produce 3-hydroxy aldehyde (i.e., step 1): an aldolase that catalyzes condensation of the 3-hydroxy aldehyde intermediate and an aldehyde (e.g. acetaldehyde) to a 5,3-dihydroxy aldehyde intermediate (i.e., step 2); a lactonization enzyme that transforms a 5,3-dihydroxy aldehyde intermediate into a tetrahydro-4-hydroxy-2H-pyran-2-one: a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one; and a reductase that reduces the 5,6-dihydro-2H-pyran-2-one to a delta-lactone. In some embodiments, the microorganism has at least one exogenous nucleic acid encoding an enzyme from said delta-lactone pathway. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified, or a naturally occurring microorganism, or wild-type microorganism. In some embodiments, the microorganism further includes a PDC for decarboxylation of pyruvate to yield acetaldehyde and carbon dioxide. In some embodiments, the microorganism expresses an enzyme identified in TABLE 9 and/or TABLE 10, or an active fragment or homologue thereof for producing acetaldehyde. In some embodiments, at least one or all the nucleic acids are exogenous to the host microorganism. In some embodiments, the aldolase is DERA.


In some embodiments, there is provided a non-naturally occurring microorganism having a delta-lactone pathway. The microorganism includes at least one of the following delta-lactone pathway enzymes: an aldolase that catalyzes condensation of an acetaldehyde and an aldehyde to 3-hydroxy aldehyde (i.e., step 1); an aldolase that catalyzes condensation of the 3-hydroxy aldehyde and an acetaldehyde to 5,3-dihydroxy aldehyde, which undergoes cyclization to form tetrahydro-2H-pyran-2,4-diol (i.e., step 2); an oxidase enzyme that transforms the tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one: a dehydratase enzyme that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one. In some embodiments, the microorganism has at least one exogenous nucleic acid encoding an enzyme from said delta-lactone pathway. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified, or a naturally occurring microorganism, or wild-type microorganism. In some embodiments, the 5,6-dihydro-2H-pyran-2-one is massoia lactone (e.g., (R)-5,6-Dihydro-6-pentyl-2H-pyran-2-one). In some embodiments the massoia lactone is C-10 massoia lactone. In some embodiments, the massoia lactone is C-12 massoia lactone. In some embodiments, the massoia lactone is C-14 massoia lactone. In some embodiments, the microorganism further includes a PDC for decarboxylation of pyruvate to yield acetaldehyde and carbon dioxide. In some embodiments, the microorganism expresses an enzyme identified in table 3 or an active fragment or homologue thereof for producing acetaldehyde. In some embodiments, at least one or all the nucleic acids are exogenous to the host microorganism. In some embodiments, the aldolase is DERA.


In some embodiments, the aldolase in step 1 is different from the aldolase in step 2. In some embodiments, the aldolase that catalyzes the condensation of two aldehyde molecules to produce 3-hydroxy aldehyde is different from the aldolase that catalyzes condensation of a 3-hydroxy aldehyde intermediate and an aldehyde (e.g. acetaldehyde) to a 5,3-dihydroxy aldehyde intermediate. In some embodiments, the aldolase in step 1 is the same as the aldolase in step 2. In some embodiments, the aldolase in step 1 and the aldolase in step 2 are both DERA.


In some embodiments, the delta-lactone is produced by feeding the aldehyde molecules (i.e., precursor aldehydes) to a microbial culture. In some embodiments, the delta-lactone is produced by feeding both aldehyde molecules (i.e., precursor aldehydes) to a microbial culture. In some embodiments, the delta-lactone is produced by feeding at least one aldehyde molecules (i.e., precursor aldehydes) to a microbial culture. In some embodiments, the delta-lactone is produced by feeding one aldehyde molecule (i.e., precursor aldehydes) to a microbial culture.


In some embodiments, the aldehyde precursors for the formation of delta-lactone are produced from glucose, for example by aldehyde producing pathways. In some embodiments, both aldehyde precursors for the formation of delta-lactone are produced from glucose. In some embodiments, at least one aldehyde precursor for the formation of delta-lactone is produced from glucose. In some embodiments, one aldehyde precursor for the formation of delta-lactone is produced from glucose.


In some embodiments, there is provided a pathway for the production of acetaldehyde from glucose. In some embodiments, this is achieved by decarboxylation of pyruvate by pyruvate decarboxylase.


In some embodiments, two aldehyde precursors are fed to the cells (e.g., microbial culture). In some embodiments, two aldehyde precursors are produced from glucose. In some embodiments, at least one aldehyde precursor is produced from glucose and at least one aldehyde precursor is fed to the cell. In some embodiments, one aldehyde precursor is produced from glucose and one aldehyde precursor is fed to the cell.


In some embodiments, a delta-lactone is produced by feeding precursor aldehydes (e.g., two precursor aldehydes) to cells. In some embodiments, a delta-lactone is produced by feeding precursor aldehydes to cells expressing two engineered aldolases. In some embodiments, the two engineered aldolases are different aldolases. In some embodiments, an aldolase is specific for the first condensation reaction. In some embodiments, the first condensation reaction includes the condensation of two aldehydes. In some embodiments, an aldolase is specific for the condensation of 3-hydroxyaldehyde and aldehyde (i.e., second condensation reaction). In some embodiments, the aldolase specific for the first condensation reaction is different from the aldolase specific for the second condensation reaction. In some embodiments, the aldolases are DERA or derived from DERA. In some embodiments, delta-lactone is produced from glucose. In some embodiments, delta-lactone is produced by producing acetaldehyde in a cell (e.g., from glucose) and feeding at least one aldehyde. In some embodiments, precursor aldehydes are produced by the cell via engineered pathways. In some embodiments, the engineered pathways require modification (e.g., expression or overexpression) of at least one gene. In some embodiments, strains (e.g., 2 different strains) are optimized to include at least one gene deletion(s). In some embodiments, acetaldehyde is produced in a cell (e.g., from glucose). In some embodiments, precursor aldehyde molecules (e.g., two precursor aldehydes) are produced from glucose. In some embodiments, precursor aldehyde molecules (e.g., two precursor aldehydes) are fed to the cell. In some embodiments, at least one precursor aldehyde is produced from glucose and at least one precursor aldehyde is fed to the cell. In some embodiments, at least one precursor aldehyde is fed to a cell following production of acetaldehyde from glucose.


In some embodiments, the microorganism further includes a pathway for producing hexanal, wherein the microorganism comprises at least one of the hexanal pathway enzymes: an aspartokinase that catalyzes phosphorylation of L-asparate, a homoserine kinase that catalyzes phosphorylation of L-homoserine to O-phospho-Lhomoserine, a threonine synthase that catalyzes the hydration of O-phospho-Lhomoserine to L-threonine, a threonine desaminase that catalyzes the desamination of L-threonine into 2-ketobutyrate, an isopropylmalate synthase that catalyzes the acetylation of 2-keto butyrate into 2-ethyl malate, an acetohydroxy acid isomeroreductase that catalyzes the formation of 3-ethyl malate from 2-ethylmalate via a 2-ethyl maleate intermediate, an isopropylmalate dehydrogenase that catalyzes the oxidation of 3-ethyl malate into 2-ethyl-3-oxo succinate, and a ketoisovalerate decarboxylase that catalyzes decarboxylation of 2-keto-heptanoic acid into hexanal. Decarboxylation of 2-ethyl-3-oxo succinate to 2-ketovalerate may occur spontaneously. In some embodiments, 2-ketovalerate is converted into 2-keto-caproic acid and further into 2-keto-heptanoic acid by the same enzymes (e.g., a hexanal pathway enzyme).


In some embodiments, there is provided a non-naturally occurring microorganism having a delta-lactone pathway. The microorganism includes at least one of the following delta-lactone pathway enzymes: an aldolase that catalyzes condensation of an acetaldehyde and a hexanal to 3-hydroxy-octanal; an aldolase that catalyzes condensation of the 3-hydroxy-octanal and an acetaldehyde to 5,3-dihydroxy-decanal: a lactonization enzyme and/or a dehydratase enzyme to transform 5,3-dihydroxy-decanal to massoia lactone. In some embodiments, the microorganism has at least one exogenous nucleic acid encoding an enzyme from said delta-lactone pathway. In some embodiments, the microorganism further includes a PDC for decarboxylation of pyruvate to yield acetaldehyde and carbon dioxide. In some embodiments, the microorganism expresses an enzyme identified in TABLE 9 and/or TABLE 10, or an active fragment or homologue thereof for producing acetaldehyde. In some embodiments, at least one or all the nucleic acids are exogenous to the host microorganism. In some embodiments, the aldolase is DERA.


In some embodiments, the delta-lactone pathway comprises a dehydration step that is spontaneous dehydration. In some embodiments, the dehydration step comprises a dehydratase. In some embodiments, dehydration is a chemically catalyzed dehydration reaction. In some embodiments, dehydration is aided by dehydratase. In some embodiments, the lactonization is spontaneous. In some embodiments, the lactonization is enzymatic. In some embodiments, the lactonization is a non-enzymatic lactonization reaction. In some embodiments, the lactonization is a chemical lactonization reaction.


In some embodiments, there is provided a pathway for the production of hexanal from glucose. In some embodiments, production of hexanal from glucose can be by at least one of the following: over production of L-threonin, overexpression of aspartokinase (thrA), overexpression of homoserine kinase (thrB), and/or overexpression of threonin synthase (thrC). In some embodiments, threonin can be transformated into hexanal. In some embodiments, threonine is transformed into hexanal using at least one of threonin desaminase (ilvA), isoproylmalate synthase (leuA), isopropylmalate dehydrogenase (leuB), acetohydroxy acid isomeroreductase (leuC, leuD), and alpha-ketoisovalerate decarboxylase (kivD_VLV).


In some embodiments, there is provided a non-naturally occurring microorganism capable of producing delta-lactone by producing acetaldehyde and other aldehydes from glucose. In some embodiments, the non-naturally occurring microorganism converts the acetaldehyde and aldehyde molecules to delta-lactone using a delta-lactone pathway. In some embodiments, there is provided a non-naturally occurring microorganism capable of producing delta-lactone (e.g., delta-decalactone) by producing acetaldehyde and aldehyde (e.g., hexanal) from glucose and/or converting the acetaldehyde and aldehyde (e.g., hexanal) to delta-lactone (delta-decalactone) using a delta-lactone pathway. The microorganism includes at least one of the following delta-lactone pathway enzymes: a pyruvate decarboxylase that catalyzes decarboxylation of pyruvate to acetealdehyde, an aspartokinase that catalyzes phosphorylation of L-asparate, a homoserine kinase that catalyzes phosphorylation of L-homoserine to O-phospho-Lhomoserine, a threonine synthase that catalyzes the hydration of O-phospho-Lhomoserine to L-threonine, a threonine desaminase that catalyzes the desamination of L-threonine to 2-ketobutyrate, an isopropylmalate synthase that catalyzes the acetylation of 2-keto butyrate to 2-ethyl malate, an acetohydroxy acid isomeroreductase that catalyzes the formation of 3-ethyl malate from 2-ethylmalate via a 2-ethyl maleate intermediate, and an isopropylmalate dehydrogenase that catalyzes the oxidation of 3-ethyl malate to 2-ethyl-3-oxo succinate, and a ketoisovalerate decarboxylase (e.g., engineered enzyme) that catalyzes decarboxylation of 2-keto-acid to aldehyde (e.g., 2-keto-heptanoic acid to hexanal); and (b) at least one of the following enzymes: an aldolase that catalyzes condensation of acetaldehyde and an aldehyde (e.g., hexanal) to produce 3-hydroxy aldehyde (e.g., 3-hydroxyoctanal), an aldolase that catalyzes condensation of the 3-hydroxy aldehyde (e.g., 3-hydroxyoctanal) and acetaldehyde to a 5,3-dihydroxy aldehyde (e.g., a 5,3-dihydroxy decanal intermediate), which undergoes a spontaneous cyclization to form tetrahydro-2H-pyran-2,4-diol (e.g., 6-pentyltetrahydro-2H-pyran-2,4-diol), an oxidase that oxidizes tetrahydro-2H-pyran-2,4-diol (e.g., 6-pentyltetrahydro-2H-pyran-2,4-diol) into tetrahydro-4-hydroxy-2H-pyran-2-one (e.g., 4-hydroxy-6-pentyltetrahydro-2H-pyran-2-one), a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one (e.g., 4-hydroxy-6-pentyltetrahydro-2H-pyran-2-one) to 5,6-dihydro-2H-pyran-2-one (e.g., 6-pentyl-5,6-dihydro-2H-pyran-2-one), and a reductase that reduces the 5,6-dihydro-2H-pyran-2-one (e.g., 6-pentyl-5,6-dihydro-2H-pyran-2-one) to a delta-lactone (e.g., delta-decalactone). In some embodiments, decarboxylation of 2-ethyl-3-oxo succinate to 2-ketovalerate occurs spontaneously. In some embodiments, 2-ketovalerate is converted to 2-keto-caproic acid and further to 2-keto-heptanoic acid. In some embodiments, the microorganism has at least one exogenous nucleic acid encoding an enzyme from said delta-lactone pathway.


In some embodiments, the pathway for aldehyde production is the same, thus involves the same pathway enzymes. For longer aldehydes the product for a first reaction cycle undergoes a second reaction cycle and so forth. Every cycle adds on to the chain length. In some embodiments, the resulting aldehyde is controlled by the co-expressed ketoaldehyde decarboxylase (e.g., GEO175).


In some embodiments, the non-naturally occurring microorganism includes at least one modification to an endogenous nucleic acid encoding an enzyme from the delta-lactone pathway or affecting the expression of an enzyme from this delta-lactone pathway.


In some embodiments, the non-naturally occurring microorganism includes at least one modification (e.g., deletion) to an endogenous nucleic acid encoding an enzyme effecting production of the delta lactone or any pathway intermediate.


In some embodiments, a microorganism is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an aldehyde molecule and a 3-hydroxy-aldehyde molecule to a 5,3-dihydroxyaldehyde.


In some embodiments, a gamma lactone pathway is disclosed that comprises: 1) condensation of an aldehyde molecule and a pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid using aldolase (e.g, pyruvate-dependent aldolase): 2) reduction of the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid using dehydrogenase or keto-reductase; 3) lactonization of the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2-(3H)-furanone: 4) dehydration of the 3-hydroxydihydro-2-(3H)-furanone to 2(5H)-furanone (compound of formula XI); and 5) reduction of 2(5H)-furanone (compound of formula XI) to a gamma-lactone (compound of formula XII) using reductase.


In some embodiments, there is provided a non-naturally occurring microorganism having a gamma-lactone pathway. The microorganism includes at least one of the following gamma-lactone pathway enzymes: an aldolase (e.g, pyruvate-dependent aldolase) that catalyzes condensation of an aldehyde molecule and a pyruvic acid (or pyruvate) to a 4-hydroxy-2-oxo carboxylic acid; a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid; a lactonization enzyme that transforms the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2-(3H)-furanone; a dehydratase that dehydrates the 3-hydroxydihydro-2-(3H)-furanone to 2(5H)-furanone; and a reductase that reduces the 2(5H)-furanone to a gamma-lactone.


In some embodiments, there is provided a non-naturally occurring microorganism having a gamma-lactone pathway, wherein the microorganism comprises at least one of the following gamma-lactone pathway enzymes: an aldolase that catalyzes condensation of an aldehyde molecule and a pyruvic acid (or a pyruvate) to a 4-hydroxy-2-oxo carboxylic acid; a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy acid; dehydration (e.g., a dehydration enzyme) of the 2,4-dihydroxy acid to 4-hydroxy-2-ene-acid: a lactonization enzyme that transforms the 4-hydroxy-2-ene acid to 2(5H)-furanone; and a reductase that reduces the 2(5H)-furanone to a gamma-lactone.


In some embodiments, there is provided a non-naturally occurring microorganism having a gamma-lactone pathway, wherein the microorganism comprises at least one of the following gamma-lactone pathway enzymes: an aldolase (e.g., pyruvate dependent aldolase) that catalyzes condensation of an aldehyde molecule and a pyruvic acid (or pyruvate) to a 4-hydroxy-2-oxo acid: an oxidoreductase that transforms the 4-hydroxy-2-oxo acid to a 2,4-dihydroxy acid; a lactonization enzyme that transforms the 2,4-dihydroxy acid to a 2-hydroxy-gamma-lactone; a dehydratase that dehydrates the 2-hydroxy-gamma-lactone to 5-furan-2(5H)-one; and a reductase that reduces the 5-furan-2(5H)-one to a gamma-lactone (FIG. 10).


In some embodiments, the reductase is enoate reductase. In some embodiments, the microorganism comprises at least one exogenous nucleic acid encoding an enzyme from said gamma-lactone pathway. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the gamma lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified, or a naturally occurring microorganism, or wild-type microorganism.


In some embodiments, the microorganism further comprises modification to overproduce pyruvic acid. In some embodiments, the microorganism further includes a pathway for producing heptanal, wherein the microorganism comprises at least one of the following heptanal pathway enzymes: an aspartokinase that catalyzes phosphorylation of L-asparate, a homoserine kinase that catalyzes phosphorylation of L-homoserine to O-phospho-Lhomoserine, a threonine synthase that catalyzes the hydration of O-phospho-Lhomoserine to L-threonine, a threonine desaminase that catalyzes the desamination of L-threonine into 2-ketobutyrate, an isopropylmalate synthase that catalyzes the acetylation of 2-keto butyrate into 2-ethyl malate, an acetohydroxy acid isomeroreductase that catalyzes the formation of 3-ethyl malate from 2-ethylmalate via a 2-ethyl maleate intermediate, and an isopropylmalate dehydrogenase that catalyzes the oxidation of 3-ethyl malate into 2-ethyl-3-oxo succinate. Decarboxylation of 2-ethyl-3-oxo succinate to 2-ketovalerate may occur spontaneously, 2-ketovalerate is converted into 2-keto-caproic acid, into 2-keto-heptanoic acid, and further into 2-keto-octanoic acid by the same enzymes (e.g., a heptanal pathway enzyme), and an engineered ketoisovalerate decarboxylase that catalyzes decarboxylation of 2-keto-octanoic acid into heptanal.


In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase (e.g, pyruvate-dependent aldolase) capable of condensing an aldehyde molecule and a pyruvic acid (or pyruvate) to 4-hydroxy-2-oxo carboxylic acid.


In some embodiments, the gamma-lactone pathway comprises a dehydration step that is spontaneous dehydration. In some embodiments, the dehydration step comprises a dehydratase. In some embodiments, dehydration is a chemically catalyzed dehydration reaction. In some embodiments, the lactonization is spontaneous. In some embodiments, the lactonization is enzymatic. In some embodiments, the lactonization is a non-enzymatic lactonization reaction. In some embodiments, the lactonization is a chemically catalyzed lactonization reaction.


In some embodiments, the gamma-lactone is produced by feeding the aldehyde molecule (i.e., precursor aldehyde) to a microbial culture (e.g., fed to the cells). In some embodiments, the precursor aldehyde is produced from glucose. In some embodiments, the precursor aldehyde is fed to the cell. In some embodiments, the pyruvate and/or pyruvic acid is produced from glucose. In some embodiments, the pyruvate and/or pyruvic acid is fed to the cell. In some embodiments, both the aldehyde molecule and the pyruvate and/or pyruvic acid is produced from glucose. In some embodiments, both the aldehyde molecule and the pyruvate and/or pyruvic acid is fed to the cell. In some embodiments, at least one of the aldehyde molecule and/or at least one of the pyruvate or pyruvic acid is fed to the cell. In some embodiments, at least one of the aldehyde molecule and/or at least one of the pyruvate or pyruvic acid is produced from glucose.


In some embodiments, the aldehyde molecule (i.e., aldehyde precursor) for the formation of the gamma-lactone are produced from glucose by aldehyde producing pathways. In some embodiments, there is provided a pathway for the production of heptanal from glucose. In some embodiments, production of heptanal from glucose comprises at least one of over production of L-threonin, overexpression of aspartokinase (thrA), overexpression of homoserine kinase (thrB), and overexpression of threonin synthase (thrC). In some embodiments, threonin is transformated into hexanal by at least one of threonin desaminase (ilvA), isoproylmalate synthase (leuA), isopropylmalate dehydrogenase (leuB), acetohydroxy acid isomeroreductase (leuC, leuD), and alpha-ketoisovalerate decarboxylase (geo175).


In some embodiments, there is provided a non-naturally occurring microorganism capable of producing pyruvate and aldehyde in a cell. In some embodiments, the non-naturally occurring microorganism produces pyruvate and aldehyde and has a gamma-lactone pathway.


In some embodiments, a microorganism strain is capable of producing pyruvate and aldehyde precursor in a cell (e.g., from glucose). In some embodiments, there is provided a non-naturally occurring microorganism capable of producing pyruvate and aldehyde (e.g., heptanal and/or having a gamma-lactone pathway for production of gamma-lactone (e.g., gamma-decalactone). The microorganism comprises at least one of the following gamma-lactone pathway enzymes: a) enzymes for production of aldehyde (e.g., heptanal), an aspartokinase that catalyzes phosphorylation of L-asparate, a homoserine kinase that catalyzes phosphorylation of L-homoserine to O-phospho-Lhomoserine, a threonine synthase that catalyzes the hydration of O-phospho-L-homoserine to L-threonine, a threonine desaminase that catalyzes the desamination of L-threonine into 2-ketobutyrate, an isopropylmalate synthase that catalyzes the acetylation of 2-keto butyrate into 2-ethyl malate, an acetohydroxy acid isomeroreductase that catalyzes the formation of 3-ethyl malate from 2-ethylmalate via a 2-ethyl maleate intermediate, an isopropylmalate dehydrogenase that catalyzes the oxidation of 3-ethyl malate into 2-ethyl-3-oxo succinate, and a ketoisovalerate decarboxylase (e.g., engineered enzyme) that catalyzes decarboxylation of 2-keto-acid to aldehyde (e.g., 2-keto-octanoic acid to heptanal); and (b) at least one of the following enzymes: b) enzymes for production of gamma-lactone (e.g., gamma-decalactone); an aldolase (e.g, pyruvate-dependent aldolase) that catalyzes condensation of an aldehyde molecule (e.g., heptanal) and pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid (e.g., 4-hydroxy-2-oxo decanoic acid): a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid (e.g., 4-hydroxy-2-oxo decanoic acid to 2,4-dihydroxy decanoic acid): a lactonization enzyme that transforms the 2,4-dihydroxy carboxylic acid (e.g., 2,4-dihydroxy decanoic acid) to a 3-hydroxydihydro-2-(3H)-furanone (e.g., 5-hexyl-3-hydroxydihydrofuran-2(3H)-one): a dehydratase that dehydrates the 3-hydroxydihydro-2-(3H)-furanone to 2(5H)-furanone (e.g., 5-hexy-3-hydroxydihydrofuran-2(3H)-one to 5-hexylfuran-2(5H)-one); and a reductase that reduces the 2(5H)-furanone to a gamma-lactone (e.g., gamma-decalactone). In some embodiments, the microorganism has at least one exogenous nucleic acid encoding an enzyme from said gamma-lactone pathway. In some embodiments, decarboxylation of 2-ethyl-3-oxo succinate to 2-ketovalerate happens spontaneously. In some embodiments, 2-ketovalerate is converted into 2-keto-caproic acid, 2-keto acid (e.g., 2-keto-heptanoic acid and further into 2-keto-octanoic acid) by the same enzymes. In some embodiments, an engineered ketoisovalerate decarboxylase catalyzes decarboxylation of 2-keto acid to aldehyde (e.g., 2-keto-octanoic acid into heptanal).


In some embodiments, the non-naturally occurring microorganism includes at least one modification to an endogenous nucleic acid encoding an enzyme from the gamma-lactone pathway or affecting the expression of an enzyme from this gamma-lactone pathway.


In some embodiments, the non-naturally occurring microorganism includes at least one modification/deletion to an endogenous nucleic acid encoding an enzyme that has an effect on production of the gamma lactone or any pathway intermediate.


Also provided herein is a method of producing enantiopure (R)-configuration of the delta-lactone, the gamma-lactone, and/or the compound of formula II comprising culturing a non-naturally occurring microorganism as described herein or performing the biosynthetic process as described herein.


In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing two aldehyde molecules to 3-hydroxy aldehyde. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing two acetaldehyde molecules to 3-hydroxybutanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and formaldehyde to 3-hydroxy-propanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a propionaldehyde to 3-hydroxy-pentanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a butyraldehyde to 3-hydroxy-hexanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a pentaldehyde to 3-hydroxy-heptanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a hexaldehyde to 3-hydroxy-octanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a heptaldehyde to 3-hydroxy-nonanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a octaldehyde to 3-hydroxy-decanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a nonaldehyde to 3-hydroxy-undecanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a decaldehyde to 3-hydroxy-dodecanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a undecaldehyde to 3-hydroxy-tridecanal.


In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an aldehyde molecule and a 3-hydroxy aldehyde molecule to 5,3-dihydroxy aldehyde. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde molecule and a 3-hydroxybutanal molecule to 5,3-dihydroxyhexanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxy-propanal molecule to 5,3-dihydroxypentanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxypentanal molecule to 5,3-dihydroxyheptanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxy-hexanal to 5,3-dihydroxy-octanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxy-heptanal molecule to 5,3-dihydroxynonanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxy-octanal molecule to 5,3-dihydroxydecanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxynonanal molecule to 5,3-dihydroxyundecanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxydecanal molecule to 5,3-dihydroxydodecanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxy-undecanal molecule to 5,3-dihydroxytridecanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxydodecanal molecule to 5,3-dihydroxytetradecanal. In some embodiments, a microorganism used in a method according to the present disclosure is a microorganism which is genetically modified so as to contain a nucleic acid molecule encoding an aldolase capable of condensing an acetaldehyde and a 3-hydroxytridecanal molecule to 5,3-dihydroxypentadecanal.


In some embodiments, the microoganism is genetically modified to contain a nucleic acid encoding an alcohol dehydrogenase capable of reducing trans-2-unsaturated aldehydes to trans-2-unsaturated alcohols. In some embodiments, the microoganism is genetically modified to contain a nucleic acid encoding an oxidoreductase capable of oxidizing trans-2-unsaturated aldehydes to trans-2-unsaturated carboxylic acids (e.g., gene ydcW, GI: 1742352; aldB, GI: 85676455). (Gruez et al. “Crystal structure and kinetics identify Escherichia coli YdcW gene product as a medium-chain aldehyde dehydrogenase” J Mol Biol. 2004, Oct. 8; 343(1):29-41]. In some embodiments, the microorganism is genetically modified to contain a nucleic acid encoding a PDC capable of decarboxylating pyruvate to yield acetaldehyde and carbon dioxide. See, TABLE 9.


When reference is made to at least one exogenous nucleic acid being included in a microorganism, it is to be understood that this refers to the referenced encoding nucleic acids or biochemical activities and not the number of separate nucleic acids introduced into the host organism. Such exogenous nucleic acids may be introduced into the host organism on separate nucleic acid molecules, on polycistronic nucleic acid molecules, or a combination thereof. For example, where two or more exogenous nucleic acids encoding different enzymatic activities are introduced into a host organism, the two or more exogenous nucleic acids maybe introduced as a single nucleic acid, for example, on a single plasmid, on separate plasmids, maybe integrated into the host chromosome at a single site or multiple sites, and still be considered as two or more exogenous nucleic acids.


Depending on the host microorganism selected, nucleic acids for some or all of the trans-2-unsaturated aldehyde pathway enzymes or the delta-lactone pathway enzymes or the gamma-lactone pathway enzymes described herein may be introduced into the host organism. If the host microorganism endogenously expresses one or more of the pathway genes then it may not be necessary to introduce these genes, but only those nucleic acids encoding enzyme(s) in the pathway for which the microorganism is deficient. Where a host microorganism is selected that expresses one or more of the pathway genes, the microorganism may be engineered such that the gene encoding the enzyme is overexpressed and/or genes encoding enzymes or proteins of competing pathways may be deleted.


The host microorganism may be engineered to increase co-factor pools of NADH and/or NADPH to improve metabolic flux. In some embodiments, if E. coli is to be used as the host organism, glucosephosphate isomerase (pgi) gene may be deleted to divert flux towards the pentose phosphate pathway to increase NADPH pools. Other strategies may involve switching the endogenous NADH-dependent glyceraldehyde-3-phosphate dehydrogenase (GAPDH) to the host E. coli strain with an exogenous NADPH-dependent glyceraldehyde-3-phosphate dehydrogenase derived from Clostridium acetobutylicum. In another method, an NADH kinase (Pos5P) maybe introduced from S. cerevisiae into the host E. coli strain. The latter was successfully used to increase several products that are produced through NADPH-dependent pathways [Lee, W.-H., Kim. M.-D., Jin. Y.-S., & Seo, J.-H. (2013). Engineering of NADPH regenerators in Escherichia coli for enhanced biotransformation. Applied Microbiology and Biotechnology, 97(7):2761-72].


If E. coli is chosen as the host organism. NADH pools may be increased by limiting competing pathways though the deletion of genes encoding NADH-dependent enzymes, including but not limited to: alcohol dehydrogenases (adhE; eutG; adhP; yjgB; yqhD), lactate dehydrogenase (idhA), and pyruvate-formate lyase (pflB).


Host microorganisms may be selected from, and the non-naturally occurring microorganisms generated in, for example, bacteria, yeast, fungus or any of a variety of other microorganisms may be used as a host organism.


In some embodiments, bacterial species may include: Escherichia coli, Bacillus subtilis, Bacillus licheniformis, Bacillus cereus, Bacillus megaterium, Bacillus brevis, Bacillus pumilus, Corynebacterium glutamicum, Zymomonas mobilis. Clostridium acetobutylicum. Clostridium butylicum, Clostridium kluyveri, Clostridium autoethanogenum, Moorella thermoacetica, Clostridium aceticum, Clostridium beijerinckii, Clostridium saccharoperhulacetonicum, Clostridium perfringens, Clostridium difficile, Clostridium botulinum, Clostridium tyrobutyricum, Clostridium tetanomorphum, Clostridium tetani, Clostridium propionicum, Clostridium aminobutyricum, Clostridium subterminale. Clostridium sticklandii. Ralstonia eutropha, Mycobacterium bovis, Mycobacterium tuberculosis, Porphyromonas gingivalis, Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas aeruginosa, Pseudomonas carboxidovorans (Oligotropha carboxidovorans), Pseudomonas stutzeri, Klebsiella pneumonia. Kebsiella oxytoca, Anaerobiospirillum succiniciproducens, Actinohacillus succinogenes, Mannheimia succiniciproducens, Rhizobium etli, Gluconobacter oxydans, Lactococcus lactis, Lactobacillus plantarum, Streptomyces coelicolor, Citrobacter freundii, Citrobacter amalonaticus, Acinetobacter calcoaceticus, Acinetobacter baylyi, Thermotoga maritima. Halobacterium salinarum, Serratia marcescens, Rhodospirillum rubrum, Ideonella sp., Rhodobacter capsulatus, Methyococcus capsulatus, Methylosinus trichosporium, Methylubacterium extorquens, Methylocystis GB2.5, Methylotrophus capsulatus, Methylomonas sp. I6a, and Pyrococcus furiosus.


In some embodiments, yeasts or fungi may include; Saccharomyces cerevisvae, Schizosaccharomyces pombe, Saccharomycopsis crataegensis, Kluyeromyces lactis, Kluyveromyces marxianus, Aspergillus terreus, Aspergillus niger, Pichia stipitis, Pichia pastoris, Rhizopus arrhizus, Rhizopus oryzae, Yarrowia lipoiytica, Issatchenkia orientalis, Issatchenkia occidentalis, Candida lambica, Candida sorboxylosa, Candida zempinina, Candida geochares, Pichia membranifaciens, Zygosaccharomyces kombuchaensis, Candida sorbosivorans, Candida vanderwaltii, Candida sorbophila, Zygosaccharomyces bisporus, Zygosaccharomyces lentus, Saccharomyces bayanus, Saccharomyces bulderi, Debaryomyces castellii, Candida boidinii Candida etchellsii, Pichia jadini, Pichia anomala, and Penicillium chrysogenun.


In some embodiments, cyanobacteria may include: Acaryochloris marina MBIC11017, Anabaena sp. PCC 7120. Anabaena variabilis ATCC 29413. Agmenellum quadruplicatum, Chlorobion tepidum TLS, Cyanothece sp. ATCC 51142, Gloeobacter violaceus PCC 7421, Microcystis aeruginosa NIES-843, Nostuc punctiforme ATCC 29133, Prochlorococcus marinus MED4, Prochlorococcus marinus MiT9313, Prochlorococcus marinus SS120, Prochlorococcus marinus str. AS601, Prochlorococcus marinus str. MIT 9211. Prochlorococcus marinus str. MIT 9215, Prochlorococcus marinus sir. MIT 9301, Prochlorococcus marinus str. MIT 9303, Prochlorococcus marinus str. MIT 9312, Prochlorococcus marinus str. MIT 9515, Prochlorococcus marinus str. NATL1A, Prochlorococcus marinus str. NATL2A, Rhodopseudomonas palustris CGA009, Synechococcus elongatus PCC 6301, Synechococcus elongatus PCC 7942, Synechococcus sp. CC9311, Synechococcus sp. CC9605, Synechococcus sp. CC9902, Synechococcus sp. JA-2-3B, Synechococcus sp. JA-3-3Ab. Synechococcus sp. PCC 7002, Synechococcus sp. RCC307, Synechococcus sp. WH 7803, Synechococcus sp. WH8102, Synechocystis sp. PCC 6803. Thermosynechococcus elongatus BP-1, and Trichodesmium erythraeum IMS101.


In some embodiments, algae may include: Botryococcus braunii, Chlamydomonas reinhardii, Chlorella sp., Crypthecodinium cohnii, Cylindrotheca sp., Dunaliella primolecta, Isochrvsis sp., Monallanthus salina, Nannochloris sp., Nannochloropsis sp., Neochloris oleoabundans, Nitzschia sp., Phaeodactylum tricornutum, Schizochytrium sp., and Tetraselmis sueica.


In some embodiments, the host microorganism is not particularly restricted and the enzymatic activity or activities may be incorporated into any suitable host organism using methods, for example, as described herein. In some embodiments, the host is selected from bacteria, yeast, algae, cyanobacteria, fungi, or a plant cell, or any combination thereof. In some embodiments, the non-naturally occurring microorganism of the present disclosure or the process according to the present disclosure comprises a host microorganism. In some embodiments, the host microorganism is selected from a bacteria, yeast, algae, cyanobacteria, fungi, or a plant cell, or any combination thereof.



E. coli and S. cerevisiae are particularly useful host organisms since they are well characterized microorganisms suitable for genetic engineering. Further, acetaldehyde is a natural metabolite of both E. coli and S. cerevisiae present in the central carbon metabolism of both species.


A nucleic acid molecule encoding enzymes as described herein may be used alone or as part of a vector. The nucleic acid molecules may include expression control sequences operably linked to the polynucleotide comprised in the nucleic acid molecule. These expression control sequences may be suited to ensure transcription and synthesis of a translatable RNA in bacteria or fungi. Expression refers to the transcription of the heterologous DNA sequence, preferably into a translatable mRNA. Regulatory elements ensuring expression in fungi and bacteria encompass promoters, enhancers, termination signals, targeting signals and the like. Promoters for use in connection with the nucleic acid molecule may be homologous or heterologous with regard to its origin and/or with regard to the gene to be expressed. Suitable promoters are for instance promoters which lend themselves to constitutive expression. Promoters which are only activated at a point in time determined by external influences may also be used. Artificial and/or chemically inducible promoters may be used. Chemically inducible promoters may include but is not limited to: IPTG-inducible promoters such as T7 or Ptrc, or tetracycline-inducible promoters such as PLtetO-1.


An overview of different expression systems is for instance contained in Bitter et al. (Methods in Enzymology 153 (1987), 516-544) and in Sawers et al. (Applied Microbiology and Biotechnology 46 (1996), 1-9), Billman-Jacobe (Current Opinion in Biotechnology 7 (1996), 500-4), Hockney (Trends in Biotechnology 12 (1994), 456-463), Griffiths et al., (Methods in Molecular Biology 75 (1997), 427-440). An overview of yeast expression systems is for instance given by Hensing et al. (Antonie van Leuwenhoek 67 (1995), 261-279), Bussineau et al. (Developments in Biological Standardization 83 (1994), 13-19), Gellissen et al. (Antonie van Leuwenhoek 62 (1992), 79-93, Fleer (Current Opinion in Biotechnology 3 (1992), 486-496), Vedvick (Current Opinion in Biotechnology 2 (1991), 742-745) and Buckholz (Bio/Technology 9 (1991), 1067-1072).


Expression vectors have been widely described in the literature. As a rule, they contain not only a selection marker gene and a replication-origin ensuring replication in the host selected, but also a bacterial or viral promoter, and in most cases a termination signal for transcription. Between the promoter and the termination signal there is in general at least one restriction site or a polylinker which enables the insertion of a coding DNA sequence. The DNA sequence naturally controlling the transcription of the corresponding gene may be used as the promoter sequence, if it is active in the selected host organism. However, this sequence may also be exchanged for other promoter sequences. It is possible to use promoters ensuring constitutive expression of the gene and inducible promoters which permit a deliberate control of the expression of the gene. Bacterial and viral promoter sequences possessing these properties are described in detail in the literature. Regulatory sequences for the expression in microorganisms, including E. coli and S. cerevisiae, are described in the literature. Promoters permitting a particularly high expression of a downstream sequence are for instance the T7 promoter [Studier et al., Methods in Enzymology 185 (1990), 60-89], lacUV5, trp, trp-lacUV5 IDeBoer et al., in Rodriguez and Chamberlin (Eds). Promoters. Structure and Function: Praeger. New York, (1982), 462-481; DeBoer et al., Proc. Natl. Acad. Sci. USA (1983), 21-25], Ipi, rac [Boros et al., Gene 42 (1986), 97-100]. Termination signals for transcription are also described in the literature.


Inducible promoters which may provide higher polypeptide yields than constitutive promoters may be used. Suitably, in certain embodiments, a two-stage process may be used: the host cells are first cultured under optimum conditions up to a relatively high cell density; and transcription is then induced.


When two or more exogenous encoding nucleic acids are to be co-expressed, both nucleic acids may be inserted, for example, into one expression vector or into separate expression vectors. For single vector expression, the encoding nucleic acids may be operationally linked to a common expression control sequence or linked to different expression control sequences, such as one inducible promoter and one constitutive promoter.


In some embodiments, the present disclosure is directed to a gene encoding at least one aldolase (e.g., DERA) or at least one enzyme of the present disclosure, a recombinant vector containing the gene, and/or a recombinant microorganism containing said gene or recombinant vector.


As used herein, the term “vector” means a DNA construct containing a DNA sequence operably linked to a suitable control sequence capable of effecting the expression of the DNA in a suitable host. The vector may be a plasmid, a phage particle, or simply a potential genomic insert. Once incorporated into a suitable host, the vector may replicate and function independently of the host genome, or may in some instances, integrate into the genome itself. In the present specification, “plasmid” and “vector” are sometimes used interchangeably because most vector is a type of plasmid which is replicated independently.


It is to be understood that in some embodiments, a non-naturally occurring microorganism that produces a pathway intermediate or product, may be used in combination with another organism (or other organisms) expressing downstream or upstream pathway enzyme(s) to produce a desired product. For example, a wild-type or engineered organism may be used to produce and accumulate pyruvate, aldehydes, and/or 3-hydroxyaldehydes. These intermediates may then be used as a substrate for another engineered organism expressing one or more of the trans-2-unsaturated aldehyde pathway genes and/or the delta-lactone pathway genes and/or the gamma-lactone pathway genes.


In some embodiments, a microorganism as provided herein may optionally be engineered to delete one or more byproduct or alternative pathways. In some embodiments, the non-naturally occurring microorganism may be engineered to delete one or more genes encoding an enzyme that utilizes pyruvate. In some embodiments, the non-naturally occurring microorganism comprises deletion of one or more genes encoding an enzyme that utilizes pyruvate. In some embodiments, the non-naturally occurring microorganism comprises deletion of one or more genes encoding an enzyme capable of reacting with an acetaldehyde molecule, a pyruvate, or an aldehyde precursor. In some embodiments, the non-naturally occurring microorganism comprises deletion of one or more genes encoding an alcohol dehydrogenase, a lactate dehydrogenase, or a pyruvate formate lyase.


In some embodiments, one or more genes encoding an alcohol dehydrogenase, dihydroxy acid dehydratase, threonine transporter, aldehyde dehydrogenase, a lactate dehydrogenase or a pyruvate formate lyase are deleted from a host microorganism. In some embodiments, a deleted gene may be an alcohol dehydrogenase gene, a lactate dehydrogenase gene, and/or a pyruvate formate lyase gene. In some embodiments, the host microorganism is E col and one or more genes that are deleted is selected from aldB, poxB, yahK, deoC, paoA, paoB, paoC, ilvD, rhtA, dkgA pflB, ldhA, adhE, yqhD, eutG, adhP, and yjgB. Other genes native to E. coli and homologous to one or more of aldB, poxB, yahK, deoC, paoA, paoB, paoC, ilvD, rhtA, dkgA, pflB, IdhA, adhE, yqhD, eutG, adhP, and yjgB may also be deleted. In some embodiments, the host microorganism is E. coli and the one or more genes that is deleted is selected from pflB, IdhA, adhE, yqhD, eutG, adhP, and yjgB. In some embodiments, the host microorganism is E. coli and the one or more genes that is deleted is selected from aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), adhE, yqhD, eutG adhP, and yjgB. In some embodiments the microorganism comprising one or more gene deletions selected from aldB, poxB, ybbO, yahK, deoC, paoAJB/C (yagT/S/R), adhE, yqhD, eutG adhP, and yjgB can be used for producing trans-2-unsaturated aldehydes by feeding precursors (e.g., trans-2-hexenal production by feeding acetaldehyde and butyraldehyde). In some embodiments, the host microorganism is E. coli and the one or more genes that is deleted is selected from ilvD, rhtA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB, IdhA, adhE, vqhD, eutG, adhP, and vjgB. In some embodiments the microorganism comprising one or more gene deletions selected from ilvD, rhtA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB, ldhA, adhE, yqhD, eutG, adhP, and yjgB can be used for producing trans-2-unsaturated aldehydes from glucose (e.g., trans-2-hexenal production from glucose (i.e., internal production of the precursors)).


In some embodiments, the host microorganism is E. coli and the one or more genes that is deleted is selected from dkgA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R,) adhE, yqhD, eutG, adhP, and yjgB. In some embodiments the microorganism comprising one or more gene deletions selected from dkgA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), adhE, yqhD, eutG, adhP, and yjgB, can be used for producing delta-lactone by feeding aldehyde precursors (e.g., delta-decalatone production by feeding acetaldehyde and hexanal). In some embodiments, the host microorganism is Eco/i and the one or more genes that is deleted is selected from ilvD, rhtA, dkgA, aldB, poxB, ybbO, ya, deoC, paoA/B/C (yagT/S/R), pflB, ldhA, adhE, yqhD, eutG, adhP, and yjgB. In some embodiments the microorganism comprising one or more gene deletions selected from ilvD, rhtA, dkgA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB, IdhA, adhE, yqhD, eutG, adhP, and yjgB can be used for producing delta-lactones from glucose (e.g., delta-decalactone production from glucose (i.e., internal production of the precursors)).


In some embodiments, the host microorganism is E. coli and the one or more genes that is deleted is selected from dkgA, aldB, vbbO, yahK, deoC, paoA/B/C (yagT/S/R), adhE, yqhD, eutG, adhP, and, yjgB. In some embodiments the microorganism comprising one or more gene deletions selected from dkgA, aldB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), adhE, yqhD, eutG, adhP, and, yjgB can be used for producing gamma-lactone by feeding precursor(s) (e.g., gamma-decalatone production by feeding heptanal (pyruvate can be produced by the cells and/or be additionally fed)). In some embodiments, the host microorganism is E. coli and the one or more genes that is deleted is selected from ilvD, rhtA, dkgA, aldB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB IdhA, adhE, yqhD, eutG, adhP, and yjgB. In some embodiments the microorganism comprising one or more gene deletions selected from ilvD, rhtA, dkgA, aldB, ybbO, yahK, deoC, paoA/BC (yagT/S/R), pflB IdhA, adhE, yqhD, eutG, adhP, and yjgB can be used for producing gamma-lactones from glucose (e.g., gamma-decalactone production from glucose (i.e., internal production of the precursors). In some embodiments, the host microorganism is E. coli and the one or more genes that is deleted is selected from pflB, ldhA, adhE. In some embodiments the microorganism comprising one or more gene deletions selected from pflB, ldhA, adhE can be used for producing gamma-lactones by feeding only the precursor alkyl aldehyde (e.g., heptanal) and internal production of pyruvate from glucose.


Multiple sequence alignment algorithms (such as ClustalW) may be used to identify aldehyde reductases and alcohol dehydrogenase that share similar function to adhE which catalyzes reduction of acetaldehyde to ethanol. Aldehyde reductases or alcohol dehydrogenases that are native to E. coli that show activity on aldehydes include but are not limited to sequence data found in the TABLE 12 below.














TABLE 12







Gene
GenBank





name
Accession
GI
Organism





















yahK
P75691.1
2492774

Escherichia coli




yqhD
Q46856.1
3025295

Escherichia coli




yjgB
AAA97166.1
537111

Escherichia coli




gldA
BAE77365.1
85676115

Escherichia coli




ybbO
BAE76272.1
85674632

Escherichia coil




yghA
BAE77062.1
85675809

Escherichia coli




adhP
BAA15126.1
1742410

Escherichia coli




fucO
BAE76871.1
85675618

Escherichia coil




eutG
BAA16331.1
1799879

Escherichia coli




yiaY
YP_026233.1
49176377

Escherichia coli




eutE
NP_416950.1
16130380

Escherichia coil




betA
NP_414845.1
16128296

Escherichia coli











The activity of the aldehyde reductases and alcohol dehydrogenases described herein on the trans-2-unsaturated aldehydes may be determined. In some embodiments, one or more of the aldehyde reductases and alcohol dehydrogenases described herein that show substrate preference and activity towards trans-2-unsaturated aldehydes may be overexpressed in the host organism to improve production of trans-2-unsaturated alcohols. Sequence similarity search to identify homologues derived from other organisms to the native aldehyde reductases and alcohol dehydrogenases in E. coli that show activity on trans-2-unsaturated aldehydes may be performed.


In some embodiments, pyruvate used according to embodiments of the present disclosure is produced from renewable feedstock (such as glucose). In some embodiments, the host organism is provided with a feedstock of sugars. Such sources include, for example, sugars such as glucose, xylose, arabinose, glycerol, galactose, mannose, fructose, starch, and any combination thereof. Glucose may be obtained from various carbohydrate-containing sources including conventional bio-renewable sources such as corn (maize), wheat, potato, cassava and rice as well as alternative sources such as energy crops, plant biomass, agricultural wastes, forestry residues, sugar processing residues and plant-derived household wastes. Sources of carbohydrate include renewable feedstocks and biomass, e.g. cellulosic biomass, hemicellulosic biomass, and lignin feedstocks.


In some embodiments, the trans-2-unsaturated aldehyde process, or the delta-lactone process, or the gamma-lactone process comprises a fermentation process. In some embodiments, the fermentation process comprises a sugar-based fermentation process. In some embodiments, the sugar-based fermentation process comprises feeding sugar, feeding precursors, feeding sugar and precursors, feeding at least one aldehyde molecule (i.e., aldehyde precursor), feeding sugar and an aldehyde precursor, feeding only aldehyde precursors, feeding pyruvate and/or pyruvic acid, or any combination thereof. In some embodiments, the fermentation is partially anaerobic, substantially, or fully anaerobic.


In some embodiments, the fermentation is carried out in a single vessel. In some embodiments, the fermentation is a two-phase process. In some embodiments, the two-phase process comprises a strain growth phase and/or a production phase.


Bio-renewable feedstock sources that may be used in accordance with the present disclosure include any renewable organic matter that includes a source of carbohydrates. These include, for example, grasses, trees (hardwood and softwood), vegetation and crop residues. Other sources may include, for example, waste materials (e.g., spent paper, green waste, municipal waste, etc.). Suitable carbohydrates, including glucose, may be isolated from biorenewable materials. See, for example. Centi and van Santen, Catalysis for Renewables, Wiley-VCH, Weinheim 2007; Kamm, Gruber and Kamm, Biorefineries-Industrial Processes and Products, Wiley-VCH, Weinheim 2006; Shang-Tian Yang, Bioprocessing for Value-Added Products from Renewable Resources New Technologies and Applications, Elsevier B. V. 2007; Furia, Starch in the Food Industry. Chapter 8, CRC Handbook of Food Additives 2nd Edition CRC Press, 1973. See also chapters devoted to Starch, Sugar and Syrups within Kirk-Othmer Encyclopedia of Chemical Technology, 5th Edition. John Wiley and Sons 2001. Processes to convert starch to glucose have been reported. See, for example, Schenck, Glucose and Glucose containing Syrups in Ullmann's Encyclopedia of Industrial Chemistry, Wiley-VCH 2009. Furthermore, methods to convert cellulose to glucose have been described. See, for example, Centi and van Santen, Catalysis for Renewables, Wiley-VCH, Weinheim 2007; Kamm, Gruber and Kamm, Biorefineries-Industrial Processes and Products, Wiley-VCH, Weinheim 2006: Shang-Tian Yang, Bioprocessing for Value-Added Products from Renewable Resources New Technologies and Applications, Elsevier B. V. 2007.


Alternative carbon sources may be crude glycerol obtained from biodiesel production plants, lactic acid obtained from degradation of waste poly-lactic acid, lactose or cheese whey permeate obtained from dairy industry, glucosamine obtained from chitin rich waste. The carbon sources may also be fatty acids and their esters (e.g., monoglycerides, diglycerides and triglycerides) obtained from plants or plant products such as canola oil, coconut oil, corn oil, olive oil, palm oil, safflower oil, peanut oil, soybean oil, sesame oil, sunflower oil and any combination thereof.


In some embodiments, the processes as provided may be carried out in a fermenter.


The engineered organism may be cultivated in a variety of reactor systems, and the process may be carried out in different modes of operations. The most commonly used bioreactor is a stirred tank bioreactor or aerated fermenter. The fermenter is equipped with sterile air supply, the mixing of bubble dispersion is achieved by mechanical agitation, and the temperature may be maintained using a jacket or coil that circulates steam or cooling water. For aerated vessels, high height/diameter ratio (>3) may be chosen to increase the contact time between the bubbles and liquid phase. Other variations of bioreactors are airlift bioreactor where mixing is achieved without mechanical agitation, and packed bed or fluidized bed bioreactors which are used when the biocatalyst is immobilized.


The fermentation may be carried out in three different modes: batch, fed-batch and continuous mode. A standard batch bioreactor is considered a “closed” system. In batch mode, all the media components are added to bioreactor while ensuring the sterility. Once the medium has been prepared, the bioreactor is inoculated with an appropriate inoculum and the fermentation is allowed to proceed until the end without any changes to the medium, i.e., without feeding of any additional components. Components such as acid and/or base can, however, be added to maintain the pH, and air/oxygen may be added to maintain the dissolved oxygen levels. In batch fermentation biomass and product concentration change over time until the fermentation is complete. The cells undergo classical lag-phase, exponential growth-phase, stationary phase growth, followed by death phase.


A variation of the batch mode is fed-batch mode where the nutrients including the carbon source is added to the fermenter as the process progresses.


In addition to batch or fed-batch mode, continuous mode of fermentation may also be used. A continuous system is considered to be “open” system in contrast to the batch mode. In continuous mode, defined production medium is added continuously to the bioreactor and equal amount of bioreactor contents are removed at the same rate. Continuous operation may be carried out in a chemostat where the vessel contents, including the cells are removed, or in a bioreactor that uses perfusion culture, which allows recycling of the viable cells back to the bioreactor, allowing high cell densities to be achieved.


The commonly used fermenter designs and different operation modes are very well-established in the literature [Biochemical Engineering Fundamentals, 2nd Ed. J. E. Bailey and D. F. Ollis, McGraw Hill, New York, 1986: Development of Sustainable Bioprocesses: Modeling and Assessment, E. Heinzle, A. P. Biwer and C. L. Cooney. John Wiley & Sons, Ltd., 2006; Bioprocess Engineering. Basic Concepts, 2nd Ed., M. L. Shuler and F. Kargi, Prentice Hall, 20011. Batch, fed-batch or continuous fermentation procedures may be employed.


In some embodiments, processes as provided herein are carried out in substantially anaerobic conditions. “Substantially anaerobic” when used in reference to a culture or growth condition means that, in some embodiments, the amount of oxygen is less than about 10% of saturation for dissolved oxygen in liquid media. In some embodiments, the term includes sealed chambers of liquid or solid medium maintained with an atmosphere less than about 1% oxygen. In some embodiments, the processes are conducted under substantially aerobic conditions. As used herein the term “substantially aerobic” when used in reference to a culture or growth condition means that, in some embodiments, that the amount of oxygen is equal to or greater than about 10% of saturation for dissolved oxygen in liquid media. In some embodiments, the term includes sealed chambers of liquid or solid medium maintained with an atmosphere greater than about 1% oxygen.


Various components may be added to the culture medium to support growth of the microorganism and/or the metabolic processes described herein, including, for example, nutrients, pH modifiers, osmoprotectants.


The organisms may be grown in any suitable medium for growth such as Luria-Bertani broth, Terrific broth or yeast extract-peptone-dextrose (YPD) medium. For production, depending up on the choice of the host, synthetic minimal media such as M9 minimal medium, yeast synthetic minimal medium, yeast nitrogen base, BG-11, or variations thereof may be used. A suitable minimal medium must contain at least one carbon source, at least one nitrogen source, salts, cofactors, buffers, and other components required to grow and maintain the recombinant microorganism. The carbon source may be one or more of the carbon sources described previously, the nitrogen source may be an ammonium salt or nitrate salt including but not limited to (NH4)2SO4, NH4Cl, (NH4)2HPO4, NH4OH, KNO3, NaNO3. The medium may be supplemented with complex or organic nitrogen sources such as urea, yeast extract, casamino acids, peptone, tryptone, soy flour, corn steep liquor, or casein hydrolysate. The minimal medium may be supplied with trace metals including but not limited to H3BO3, MnCl2, ZnSO4, Na2MoO4, CuSO4, Co(NO3)2, CuCl2, ZnCl2, CoCl2, FeCl3, KI. The minimal medium may be supplemented with vitamins and/or non-vitamin compounds including but not limited to biotin, pantothanate, folic acid, inositol, nicotinic acid, p-aminobenzoic acid, pyridoxine, ribolavin, thiamine, cyanocobalamin, citric acid, ethylenediamine tetraacetic acid (EDTA), ferric ammonium citrate. The medium may be supplied by carbon dioxide either by direct sparging or in the form of NaHCO3, or Na2CO3.


Depending upon the host organism used the minimal medium may suitably have a pH range between about pH 2.0 to about pH 10.0, between about pH 4.0 to about pH 10.0, between about pH 2.0 to about pH 8.0, between about pH 4.0 to about pH 8.0, between about pH 5.0 to about pH 8.0, between about pH 5.0 to about pH 7.0. Depending upon the host organism used the minimal medium may suitably have a pH of about 2.0, a pH of about 2.5, a pH of about 3.0, a pH of about 3.5, a pH of about 4.0, a pH of about 4.5, a pH of about 5.0, a pH of about 5.5, a pH of about 6.0, a pH of about 6.5, a pH of about 7.0, a pH of about 7.5, a pH of about 8.0, a pH of about 8.5, a pH of about 9.0, a pH of about 9.5, or a pH of about 10.0.


The fermentation may be carried out in temperature ranging from about 25° C., to about 42° C. The fermentation may be carried out in temperature ranging from about 30° C., to about 40° C. from about 33° C., to about 38° C., from about 35° C., to about 38° C., from about 35° C., to about 37° C., from about 36° C., to about 38° C., from about 36° C., to about 37° C., from about 35° C., to about 42° C., from about 25° C., to about 38° C., from about 27° C., to about 38° C., from about 29° C. to about 38° C. In some embodiments, the fermentation may be carried out at about 37° C. In some embodiments, the fermentation may be carried out at about 38° C. In some embodiments, the fermentation may be carried out at about 36° C. In some embodiments, the fermentation may be carried out at about 25° C., at about 28° C., at about 30° C., at about 32° C., at about 34° C., at about 35° C., at about 39° C., at about 40° C., at about 41° C., at about 42° C. Higher temperatures may be used if the host organism chosen is thermophilic where the cultivation temperature may be as high as about 80° C. In some embodiments, the fermentation may be carried out at least about 42° C. In some embodiments, the fermentation may be carried out in temperature ranging from about 42° C., to about 80° C.


The fermentation may be carried out under aerobic, microaerobic, or anaerobic conditions. It may also be carried out under two different phases involving aerobic growth-phase and a microaerobic or anaerobic production phase. In some embodiments, the fermentation is carried out under partially anaerobic conditions. In some embodiments, the fermentation is carried out under fully anaerobic conditions. Sterile air or oxygen may be introduced to maintain the desired dissolved oxygen levels in the medium.


Product, intermediate and byproduct formation may be analyzed by methods such as HPLC (High Performance Liquid Chromatography) equipped with a refractive index and/or photodiode array detector(s), GC-MS (Gas Chromatography-Mass Spectroscopy), GC-FID (Gas Chromatography-Flame Ionization Detector) and LC-MS (Liquid Chromatography-Mass Spectroscopy). Individual enzymatic activities from the exogenous DNA sequences may also be assayed.


The amount of product in the medium may be determined by using, for example, High Performance Liquid Chromatography (HPLC), Gas Chromatography (GC), Liquid Chromatography-Mass Spectrometry (LC-MS), and Gas Chromatography-Mass Spectrometry (GC-MS).


In some embodiments, processes as disclosed herein further include purifying the product of the processes. Such methods of purification may include, for example, liquid extraction, filtration, distillation or evaporation. Isolation of compound from the fermentation broth depends on the final purity of the compound required. The separation techniques may include: centrifugation, microfiltration, ultrafiltration, nano-filtration, evaporation, crystallization, distillation, and ion-exchange. Typical downstream processing operation may include a series of processes including separation of cells using centrifugation or microfiltration, removal of additional solids in the broth using ultrafiltration, removal of salts from the broth using nanofiltration, ion-exchange, or evaporative crystallization, and purification using distillation.


Microorganisms as described herein may be produced to secrete the resulting product, whether by choosing a host organism with a secretory signal corresponding to the product or by engineering the host organism to provide for the same. For example, membrane-bound transporter proteins may be overexpressed in the host organism to improve the secretion of the products to the fermentation broth, including but not limited to yhjX gene encoding a pyruvate-inducible inner membrane protein and putative transporter which belongs to the major facilitator superfamily of proteins, or phoE, phoF and phoC genes encoding porin E, porin F and porin C, respectively, which are responsible for membrane transport of aldehydes. The product may be recovered from the culture medium. In some embodiments, the product may be extracted from the microorganism. In some embodiments, the microorganisms may be ruptured and the culture medium or lysate may be centrifuged to remove particulate cell debris, and the membrane and soluble protein fractions may be separated if necessary.


In some embodiments, the present disclosure provides a pathway for producing trans-2-unsaturated aldehydes using aldehyde as a precursor. A general pathway is shown in FIG. 5. Trans-2-unsaturated aldehydes, such as hexenal, are currently produced using a fatty acid based pathway. A fatty acid based pathway is shown in FIG. 1. A fatty acid based pathway is a multistep enzymatic process that may use soybean flour and crushed leaves as the source of the enzymes 13-lipoxygenase and 13-hydroperoxide lyase [Gigot et al., “The lipoxygenase metabolic pathway in plants: potential for industrial production of natural green leaf volatiles,” Biotechnol Agron soc Environ., 2010, 14(3), 451-460]. In contrast to multistep pathways for the production of trans-2-unsaturated aldehydes, such as for trans-2-hexenal (FIG. 4 shows multistep industrial process for making trans-2-hexenal), the pathway of the present disclosure is a shorter, single fermentation process. The pathways described herein, are based on sugars, giving a higher yield than fatty acid based pathways. The pathways disclosed herein, are based on microbial enzymes that are easily expressed in industrial hosts such as in E. coli and S. cerevisiae. In some embodiments, the pathways disclosed herein uses carbon-carbon bond forming aldolase enzymes. Aldolases may have broad substrate specificity, so the same pathway may be extended to make many products by changing chain length of the precursor molecules.


In some embodiments, the present disclosure provides a pathway for producing delta-lactones using aldehydes as a precursor. A general pathway is shown in FIG. 6.


In some embodiments, the present disclosure provides a pathway for producing gamma-lactones using aldehyde and pyruvate as precursors. A general pathway is shown in FIG. 7.


In some embodiments, the aldehyde is selected from formaldehyde, acetaldehyde, propionaldehyde, n-butyraldehyde, iso-butyraldehyde, hexanal, n-pentanal (valeraldehyde), caproaldehyde, crotonaldehyde, n-heptaldhyde, n-octanal, n-nonylaldehyde, phenylacetaldehyde, benzaldehyde, p-tolualdehyde, salicylaldehyde, vanillin, piperonal, 2-ethyl-2-hex-enal, 2-ethylhexaldehyde, dodecyl aldehyde, undecaldehyde, decaldehyde, nonaldehyde, octaldehyde, heptaldehyde, hexaldehyde, pentaldehyde, butyraldehyde, or any combination thereof.


In some embodiments, the aldehyde used in the methods described herein is a compound represented by formula I:




embedded image



wherein, R is hydrogen or CnH2n+1, and each n is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is an integer. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 1. In some embodiments, the present disclosure relates to any one of compounds disclosed herein, wherein n is 2. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 3. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 4. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 5. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 6. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 7. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 8. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 9. In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein n is 10.


In some embodiments, the intermediate in the trans-2-unsaturated aldehyde pathway or the lactones pathways is a 3-hydroxy aldehyde compound represented by formula II:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, there is provided a process comprising reacting acetaldehyde and an aldehyde of formula R—CHO, wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10, in the presence of an aldolase enzyme, to form the 3-hydroxy aldehyde compound of formula II, wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, the aldolase enzyme is specific for the condensation of an acetaldehyde and an aldehyde molecule. In some embodiments, the aldolase enzyme is a modified aldolase enzyme comprising an increased enzymatic activity in comparison with a corresponding unmodified or wild-type aldolase enzyme.


In some embodiments, the 3-hydroxy aldehyde is 3-hydroxy-propanal, 3-hydroxy-butanal, 3-hydroxy-pentanal, 3-hydroxy-hexanal, 3-hydroxy-heptanal, 3-hydroxy-octanal, 3-hydroxy-nonanal, 3-hydroxy-decanal, 3-hydroxy-undecanal, 3-hydroxy-dodecanal, 3-hydroxy-tridecanal, or any combination thereof.


In some embodiments, the 3-hydroxy aldehyde comprises 3-hydroxy-propanal, 3-hydroxy-butanal, 3-hydroxy-pentanal, 3-hydroxy-hexanal, 3-hydroxy-heptanal, 3-hydroxy-octanal, 3-hydroxy-nonanal, 3-hydroxy-decanal, 3-hydroxy-undecanal, 3-hydroxy-dodecanal, 3-hydroxy-tridecanal, and any combination thereof.


In some embodiments, the 3-hydroxy aldehyde compound of formula II is in enantiomeric excess. In some embodiments, the enantiomer of 3-hydroxy aldehyde produced in excess is (R)-3-hydroxy aldehyde. In some embodiments, the enantiomer of 3-hydroxy aldehyde produced in excess is (S)-3-hydroxy aldehyde. In some embodiments, the 3-hydroxy aldehyde compound of formula II comprises (R)-3-hydroxy-propanal, (R)-3-hydroxy-butanal, (R)-3-hydroxy-pentanal, (R)-3-hydroxy-hexanal, (R)-3-hydroxy-heptanal, (R)-3-hydroxy-octanal, (R)-3-hydroxy-nonanal, (R)-3-hydroxy-decanal, (R)-3-hydroxy-undecanal, (R)-3-hydroxy-dodecanal, (R)-3-hydroxy-tridecanal, and any combination thereof.


In some embodiments, the 3-hydroxy aldehyde compound of formula II contains at least about 50% of a compound of formula II in the (R)-configuration. In some embodiments, the product of the method of the present disclosure contains at least about 51%, at least about 52%, at least about 53%, at least about 54%, at least about 55%, at least about 56%, at least about 57%, at least about 58%, at least about 59%, at least about 60%, at least about 61%, at least about 62%, at least about 63%, at least about 64%, at least about 65%, at least about 66%, at least about 67%, at least about 68%, at least about 69%, at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula II in the (R)-configuration. For example, the product is free of undesired byproduct or starting material. For example, the impurities (e.g., undesired byproduct, starting material, or the (S)-configuration of compound of formula II) present in the (R)-configuration of the compound of formula II is less than about 50%, less than about 45%, less than about 40%, less than about 35%, less than about 30%, less than about 25%, or less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


In some embodiments, a trans-2-unsaturated aldehyde pathway is disclosed herein that comprises condensation of two aldehyde molecules to a 3-hydroxy aldehyde intermediate using an enzyme from class aldolases, and spontaneous dehydration of the 3-hydroxy aldehyde intermediate to alpha-beta unsaturated aldehyde. FIG. 5 shows the schematic of this pathway. In some embodiments, a pathway is disclosed that comprises condensation of two aldehyde molecules to a 3-hydroxy aldehyde intermediate using an enzyme from class aldolase, and dehydration of the 3-hydroxy aldehyde intermediate to alpha-beta unsaturated aldehyde by using dehydratase. In some embodiments, the alpha-beta unsaturated aldehydes comprise trans-2-unsaturated aldehydes. In some embodiments, the alpha-beta unsaturated aldehyde is a trans-2-unsaturated aldehyde. In some embodiments, a pathway is disclosed that comprises condensation of two aldehyde molecules to a beta-hydroxy aldehyde intermediate using the aldolase enzyme deoxyribosephosphate (DERA), and dehydration of the beta-hydroxy aldehyde intermediate to alpha-beta unsaturated aldehyde. In some embodiments, the dehydration is spontaneous. In some embodiments the dehydration is via a dehydratase. In some embodiments the aldolase comprises DERA. In some embodiments, the aldolase enzyme is DERA.


In some embodiments, the present disclosure provides a process comprising a dehydration step to dehydrate the compound of formula II to form a trans-2-unsaturated aldehyde compound.


In some embodiments, the trans-2-unsaturated aldehyde described herein is a compound represented by formula III:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the trans-2-unsaturated aldehyde is selected from trans-prop-2-enal, trans-2-butenal, trans-2-pentenal, trans-2-hexenal, trans-2-heptenal, trans-2-octenal, trans-2-nonenal, trans-2-decenal, trans-2-undecenal, trans-2-dodecenal, trans-2-tridecenal, and any combination thereof.


In some embodiments, the trans-2-unsaturated aldehyde comprises trans-prop-2-enal, trans-2-butenal, trans-2-pentenal, trans-2-hexenal, trans-2-heptenal, trans-2-octenal, trans-2-nonenal, trans-2-decenal, trans-2-undecenal, trans-2-dodecenal, trans-2-tridecenal, and any combination thereof.


In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is 2-propenal, wherein R is hydrogen. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-butenal, wherein n is 1. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-pentenal, wherein n is 2. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-hexenal, wherein n is 3. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-heptenal, wherein n is 4. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-octenal, wherein n is 5. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-nonenal, wherein n is 6. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-decenal, wherein n is 7. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-undecenal, wherein n is 8. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-dodecenal, wherein n is 9. In some embodiments, the trans-2-unsaturated aldehyde compound of formula III is trans-2-tridecenal, wherein n is 10.


In some embodiments, the trans-2-unsaturated aldehydes may be converted to trans-2-unsaturated alcohols. In some embodiments, the trans-2-unsaturated aldehydes may be converted to trans-2-unsaturated alcohols by using alcohol dehydrogenase or an enzyme having alcohol dehydrogenase activity. In some embodiments, trans-prop-2-enal is converted to trans-prop-2-enol. In some embodiments, trans-2-butenal is converted to trans-2-butenol. In some embodiments, trans-2-pentenal is converted to trans-2-pentenol. In some embodiments, trans-2-hexenal is converted to trans-2-hexenol. In some embodiments, trans-2-heptenal is converted to trans-2-heptenol. In some embodiments, trans-2-octenal is converted to trans-2-octenol. In some embodiments, trans-2-nonenal is converted to trans-2-nonenol. In some embodiments, trans-2-decenal is converted to trans-2-decenol. In some embodiments, trans-2-undecenal is converted to trans-2-undecenol. In some embodiments, trans-2-dodecenal is converted to trans-2-dodecenol. In some embodiments, trans-2-tridecenal is converted to trans-2-tridecenol.


Alcohol dehydrogenase (ADH) is an enzyme that catalyzes reduction of aldehydes and/or ketones into alcohols. Alcohol dehydrogenase is also known as aldehyde reductase, ADH, NAD-dependent alcohol dehydrogenase, NADH-alcohol dehydrogenase, primary alcohol dehydrogenase, aldehyde reductase (NADPH), NADP-alcohol dehydrogenase. NADP-aldehyde reductase, NADP-dependent aldehyde reductase, NADPH-aldehyde reductase. NADPH-dependent aldehyde reductase, and alcohol dehydrogenase (NADP).


In some embodiments, the trans-2-unsaturated aldehydes may be converted to trans-2-unsaturated carboxylic acids. In some embodiments, the trans-2-unsaturated aldehydes may be converted to trans-2-unsaturated carboxylic acids by using oxidoreductase. In some embodiments, trans-prop-2-enal is converted to trans-prop-2-enoic acid. In some embodiments, trans-2-butenal is converted to trans-2-butenoic acid. In some embodiments, trans-2-pentenal is converted to trans-2-pentenoic acid. In some embodiments, trans-2-hexenal is converted to trans-2-hexenoic acid. In some embodiments, trans-2-heptenal is converted to trans-2-heptenoic acid. In some embodiments, trans-2-octenal is converted to trans-2-octenoic acid. In some embodiments, trans-2-nonenal is converted to trans-2-nonenoic acid. In some embodiments, trans-2-decenal is converted to trans-2-decenoic acid. In some embodiments, trans-2-undecenal is converted to trans-2-undecenoic acid. In some embodiments, trans-2-dodecenal is converted to trans-2-dodecenoic acid. In some embodiments, trans-2-tridecenal is converted to trans-2-tridecenoic acid.


The reduction of trans-2-unsaturated aldehydes to trans-2-unsaturated alcohols or trans-2-unsaturated carboxylic acids may be carried out by using appropriate alcohol dehydrogenase (ADH), aldo-ketoreductases, oxidoreductase, or aldehyde reductase using a reducing equivalent as cofactor, which, in some embodiments, may be NADH or NADPH. In some embodiments, the ADH, AKR, oxidoreductase, or aldehyde reductase is substantially specific towards the trans-2-unsaturated aldehydes and does not act on acetaldehyde, thereby substantially avoiding or eliminating the production of ethanol as a side product.


In some embodiments, an acetaldehyde and a butyraldehyde undergo condensation to a 3-hydroxy-hexanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-hexanal intermediate is dehydrated to trans-2-hexenal. In some embodiments, an acetaldehyde and a formaldehyde undergo condensation to a 3-hydroxy-propanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-propanal intermediate is dehydrated to trans-prop-2-enal. In some embodiments, two acetaldehyde molecules undergo condensation to a 3-hydroxy-butanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-butanal intermediate is dehydrated to trans-2-butenal. In some embodiments, an acetaldehyde and a propionaldehyde undergo condensation to a 3-hydroxy-pentanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-pentanal intermediate is dehydrated to trans-2-pentenal. In some embodiments, an acetaldehyde and a pentaldehyde undergo condensation to a 3-hydroxy-heptanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-heptanal intermediate is dehydrated to trans-2-heptenal. In some embodiments, an acetaldehyde and a hexaldehyde undergo condensation to a 3-hydroxy-octanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-octanal intermediate is dehydrated to trans-2-octenal. In some embodiments, an acetaldehyde and a heptaldehyde undergo condensation to a 3-hydroxy-nonanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-nonanal intermediate is dehydrated to trans-2-nonenal. In some embodiments, an acetaldehyde and an octaldehyde undergo condensation to a 3-hydroxy-decanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-decanal intermediate is dehydrated to trans-2-decenal. In some embodiments, an acetaldehyde and a nonaldehyde undergo condensation to a 3-hydroxy-undecanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-undecanal intermediate is dehydrated to trans-2-undecenal. In some embodiments, an acetaldehyde and a decaldehyde undergo condensation to a 3-hydroxy-dodecanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-dodecanal intermediate is dehydrated to trans-2-dodecenal. In some embodiments, an acetaldehyde and an undecaldehyde undergo condensation to a 3-hydroxy-tridecanal intermediate in the presence of an enzyme aldolase. In some embodiments, the 3-hydroxy-tridecanal intermediate is dehydrated to trans-2-tridecenal.


In some embodiments, any of the pathways disclosed herein may further comprise addition of alcohol dehydrogenase to the alpha-beta unsaturated aldehyde to generate alpha-beta unsaturated alcohols. In some embodiments, any of the aforementioned pathways may further comprise addition of oxidoreductase to the alpha-beta unsaturated aldehyde to generate alpha-beta unsaturated carboxylic acid. In some embodiments, the aldolase is DERA


In some embodiments, the intermediate 3-hydroxy aldehyde produced by any one of the pathway disclosed herein is in enantiomeric excess. In some embodiments, the enantiomer of 3-hydroxy aldehyde produced in excess is (R)-3-hydroxy aldehyde. In some embodiments, the enantiomer of 3-hydroxy aldehyde produced in excess is (S)-3-hydroxy aldehyde.


In some embodiments, the present disclosure relates to the production of delta-lactones by using aldolases, as illustrated in FIG. 6. Delta-lactones may be produced via chemical synthesis as described in Ramachandran et al. (Tetrahedron Letters 41 (2000), 583-586) and in Nobuhara (Agr. Biol. Chem., (1968) vol. 32, no. 8, p 1016-1020). Delta-lactones may also be isolated from natural sources, such as from extracts of roots (Ricardo et al., ARKIVOC (2004) (vi), 127-136). WO2007068498 discloses production of delta-lactones by reacting acetaldehyde and a substituted acetaldehyde. WO2007068498 discloses the production of delta-lactones by sequential reactions of the same aldolase. Many of these processes have disadvantages. Disadvantages include the number of required steps, low yield, expensive reagents, hazardous reagents, scarcity of natural sources, and cost. In addition, sequential reactions are known to be poor.


In some embodiments, it is the object of the present disclosure to provide with improved processes that provide higher specificity and higher activity. The present disclosure differs from WO2007068498 at least in that two different aldolases are used to produce delta-lactones. Advantages of the present disclosure include, but it is not limited to, the use of renewable resources, clean production, less pollution and less energy intensive processes in biological systems. In some embodiments, enzyme screening or protein engineering may be used to identify DERA or DERA like enzymes that are highly specific, for example for hexenal and acetaldehyde condensation to produce 3-hydroxy octanal (See, FIG. 6 (DERA 1 or aldolase 1)). In some embodiments, another or a second aldolase may be DERA or a DERA like enzyme which is specific for a reaction of, for example 3-hydroxy-octanal with acetaldehyde (See, FIG. 6 (DERA2 or aldolase 2)). In some embodiments, the second enzyme (e.g., DERA 2) may be identified through screening or protein engineering. In some embodiments, both enzymes may be derived from the same enzyme but may differ in specific mutations to improve efficiency for a desired reaction.


In some embodiments, a delta lactone pathway is disclosed that comprises: 1) condensation of two aldehyde molecules to a 3-hydroxy aldehyde intermediate using an aldolase enzyme; 2) condensation of the 3-hydroxy aldehyde intermediate and an aldehyde (e.g. acetaldehyde) to a 5,3-dihydroxy aldehyde intermediate using an aldolase enzyme; 3) cyclization of 5,3-dihydroxy aldehyde to form a tetrahydro-2H-pyran-2,4-diol (compound of formula IV(b)); 4) oxidation of tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one (compound of formula V); 5) dehydration of the tetrahydro-4-hydroxy-2H-pyran-2-one (compound of formula V) to a 5,6-dihydro-2H-pyran-2-one (compound of formula VI); and 6) reduction of the 5,6-dihydro-2H-pyran-2-one (compound of formula VI) to a delta-lactone (compound of formula VII) using reductase. FIG. 6 shows the schematic of this pathway.


In some embodiments, the dehydration step is spontaneous. In some embodiments, the dehydration step is by using dehydratase. In some embodiments, the lactonization is spontaneous. In some embodiments, the lactonization is enzymatic. In some embodiments, the lactonization is a non-enzymatic lactonization reaction. In some embodiments, the lactonization is a chemical lactonization reaction. In some embodiments, the reduction step comprises NADH or NADPH. In some embodiments, the aldolase enzyme is deoxyribosephosphate (DERA). In some embodiments, the aldolase in step 1 is different from the aldolase in step 2 (FIG. 6). In some embodiments, the aldolase in step 1 is the same as the aldolase in step 2. In some embodiments, step 1-5 is in sequential order. In some embodiments, the pathway comprises steps 1-4.


In some embodiments, the present disclosure provides a process of making a 5, 3-dihydroxy aldehyde compound by condensing a 3-hydroxy aldehyde compound of formula II and an acetaldehyde in the presence of an aldolase.


In some embodiments, the intermediate 5,3-dihydroxy aldehyde compound is represented by formula IV:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the aldolase enzyme is specific for the condensation of the compound of formula II and an acetaldehyde molecule to form a compound of formula IV. In some embodiments, the aldolase enzyme that is specific for the condensation of the compound of formula II and an acetaldehyde molecule to form a compound of formula IV is different from the aldolase enzyme that is specific for the condensation of an acetaldehyde and an aldehyde molecule. In some embodiments, the aldolase enzyme that is specific for the condensation of the compound of formula II and an acetaldehyde molecule to form a compound of formula IV is the same as the aldolase enzyme that is specific for the condensation of an acetaldehyde and an aldehyde molecule. In some embodiments, the aldolase enzyme that is specific for the condensation of the compound of formula II and an acetaldehyde molecule to form a compound of formula IV is a modified aldolase enzyme comprising an increased enzymatic activity in comparison with a corresponding unmodified or naturally occurring or wild-type aldolase enzyme.


In some embodiments, the 5,3-dihydroxy aldehyde is 5,3-dihydroxypentanal, 5,3-dihydroxyhexanal, 5,3-dihydroxyheptanal, 5,3-dihydroxyoctanal, 5,3-dihydroxynonanal, 5,3-dihydroxydecanal, 5,3-dihydroxyundecanal, 5,3-dihydroxydodecanal, 5,3-dihydroxytridecanal, 5,3-dihydroxytetradecanal, 5,3-dihydroxypentadecanal, or any combination thereof.


In some embodiments, the 5,3-dihydroxy aldehyde comprises 5,3-dihydroxypentanal, 5,3-dihydroxyhexanal, 5,3-dihydroxyheptanal, 5,3-dihydroxyoctanal, 5,3-dihydroxynonanal, 5,3-dihydroxydecanal, 5,3-dihydroxyundecanal, 5,3-dihydroxydodecanal, 5,3-dihydroxytridecanal, 5,3-dihydroxytetradecanal, 5,3-dihydroxypentadecanal, and any combination thereof.


In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxypentanal, wherein R is hydrogen. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxyhexanal, wherein n is 1. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxyheptanal, wherein n is 2. In some embodiments, the trans-2-unsaturated aldehyde compound of formula IV is 5,3-dihydroxyoctanal, wherein n is 3. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxynonanal, wherein n is 4. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxydecanal, wherein n is 5. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxyundecanal, wherein n is 6. In some embodiments, the 53-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxydodecanal, wherein n is 7. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxytridecanal, wherein n is 8. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxytetradecanal, wherein n is 9. In some embodiments, the 5,3-dihydroxy aldehyde compound of formula IV is 5,3-dihydroxypentadecanal, wherein n is 10.


In some embodiments, the compound of formula IV (i.e., 5,3-dihydroxy aldehyde) undergoes cyclization to a tetrahydro-2H-pyran-2,4-diol compound of formula IV(b). In some embodiments, the hemi-acetal intermediate, tetrahydro-2H-pyran-2,4-diol, is a compound represented by formula IV(b),




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4.5, 6, 7, 8, 9, or 10.


In some embodiments, the compound of formula IV(b) undergoes oxidation to a tetrahydro-4-hydroxy-2H-pyran-2-one. In some embodiments, the oxidation comprises an enzymatic oxidation. In some embodiments, the oxidation is an enzymatic oxidation. In some embodiments, the delta lactone intermediate, tetrahydro-4-hydroxy-2H-pyran-2-one, is a compound represented by formula V:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the compound of formula V undergoes dehydration to form a compound of formula VI. In some embodiments, the dehydration comprises dehydratase. In some embodiments, the delta lactone is a compound represented by formula VI:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the delta lactone disclosed herein is in enantiomeric excess. In some embodiments, the enantiomer of delta lactone produced in excess is (R)-delta lactone. In some embodiments, the enantiomer of delta lactone produced in excess is (S)-delta lactone. In some embodiments, the compound of formula VI is in enantiomeric excess.


In some embodiments, the delta-lactone of formula VI is massoia lactone, C-10 massoia lactone (5,6-dihydro-6-pentyl-2H-pyran-2-one), C-12 massoia lactone (5,6-dihydro-6-heptyl-2H-pyran-2-one), C-14 massoia lactone (5,6-dihydro-6-nonyl-2H-pyran-2-one), 5,6-dihydro-2H-pyran-2-one, 5,6-dihydro-6-methyl-2H-pyran-2-one, 5,6-dihydro-6-hexyl-2H-pyran-2-one, 5,6-dihydro-6-ethyl-2H-pyran-2-one, 5,6-dihydro-6-propyl-2H-pyran-2-one, 5,6-dihydro-6-butyl-2H-pyran-2-one, 5,6-dihydro-6-decyl-2H-pyran-2-one, 5,6-dihydro-6-octyl-2H-pyran-2-one, or any combination thereof.


In some embodiments, the delta-lactone of formula VI comprises massoia lactone, C-10 massoia lactone (5,6-dihydro-6-pentyl-2H-pyran-2-one), C-12 massoia lactone (5,6-dihydro-6-heptyl-2H-pyran-2-one), C-14 massoia lactone (5,6-dihydro-6-nonyl-2H-pyran-2-one), 5,6-dihydro-2H-pyran-2-one, 5,6-dihydro-6-methyl-2H-pyran-2-one, 5,6-dihydro-6-hexyl-2H-pyran-2-one, 5,6-dihydro-6-ethyl-2H-pyran-2-one, 5,6-dihydro-6-propyl-2H-pyran-2-one, 5,6-dihydro-6-butyl-2H-pyran-2-one, 5,6-dihydro-6-decyl-2H-pyran-2-one, 5,6-dihydro-6-octyl-2H-pyran-2-one, and any combination thereof.


In some embodiments, the delta-lactone of formula VI is (R)-5,6-dihydro-2H-pyran-2-one, (R)-massoia lactone, (R)-massoia lactone, (R)-5,6-dihydro-6-pentyl-2H-pyran-2-one, (R)-5,6-dihydro-6-heptyl-2H-pyran-2-one, (R)-5,6-dihydro-6-nonyl-2H-pyran-2-one, (R)-5,6-dihydro-2H-pyran-2-one, (R)-5,6-dihydro-6-methyl-2H-pyran-2-one, (R)-5,6-dihydro-6-hexyl-2H-pyran-2-one, (R)-5,6-dihydro-6-ethyl-2H-pyran-2-one, (R)-5,6-dihydro-6-propyl-2H-pyran-2-one, (R)-5,6-dihydro-6-butyl-2H-pyran-2-one, (R)-5,6-dihydro-6-decyl-2H-pyran-2-one, (R)-5,6-dihydro-6-octyl-2H-pyran-2-one, or any combination thereof.


In some embodiments, the delta-lactone of formula VI comprises (R)-5,6-dihydro-2H-pyran-2-one, (R)-massoia lactone, (R)-massoia lactone, (R)-5,6-dihydro-6-pentyl-2H-pyran-2-one, (R)-5,6-dihydro-6-heptyl-2H-pyran-2-one, (R)-5,6-dihydro-6-nonyl-2H-pyran-2-one, (R)-5,6-dihydro-2H-pyran-2-one, (R)-5,6-dihydro-6-methyl-2H-pyran-2-one, (R)-5,6-dihydro-6-hexyl-2H-pyran-2-one, (R)-5,6-dihydro-6-ethyl-2H-pyran-2-one, (R)-5,6-dihydro-6-propyl-2H-pyran-2-one, (R)-5,6-dihydro-6-butyl-2H-pyran-2-one, (R)-5,6-dihydro-6-decyl-2H-pyran-2-one, (R)-5,6-dihydro-6-octyl-2H-pyran-2-one, and any combination thereof.


Massoia lactones are 10, 12 and 14 carbon chain compounds, also referred to as the C-10, C-12 and C-14 massoia lactones, respectively, that possess characteristic α,β-unsaturated δ-lactone moieties. Massoia lactones may have substitution at the C6 position of the α,β-unsaturated δ-lactone structures with chains of variable length containing five, seven or nine carbons.


In some embodiments, the 5,6-dihydro-2H-pyran-2-one, compound of formula VI undergoes reduction to a compound of formula VII. In some embodiments, the reduction comprises a reductase. In some embodiments, the reduction is by using a reductase. In some embodiments, the delta-lactone is a compound represented by formula VII:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the delta-lactone compound of formula VII is delta-valerolactone, wherein R is hydrogen. In some embodiments, delta-lactone compound of formula VII is delta-hexalactone, wherein n is 1. In some embodiments, the delta-lactone compound of formula VII is delta-heptalactone, wherein n is 2. In some embodiments, the delta-lactone compound of formula VII is delta-octalactone, wherein n is 3. In some embodiments, the delta-lactone compound of formula VII is delta-nonalactone, wherein n is 4. In some embodiments, the delta-lactone compound of formula VII is delta-decalactone, wherein n is 5. In some embodiments, the delta-lactone compound of formula VII is delta-undecalactone, wherein n is 6. In some embodiments, the delta-lactone compound of formula VII is delta-dodecalactone, wherein n is 7. In some embodiments, the delta-lactone compound of formula VII is delta-tridecalactone, wherein n is 8. In some embodiments, the delta-lactone compound of formula VII is delta-tetradecalactone, wherein n is 9. In some embodiments, the delta-lactone compound of formula VII is delta-pentadecalactone, wherein n is 10.


In some embodiments, the delta-lactone of formula VII is delta-valerolactone, delta-hexalactone, delta-heptalactone, delta-octalactone, delta-nonalactone, delta-decalactone, delta-undecalactone, delta-dodecalactone, delta-tridecalactone, delta-tetradecalactone, delta-pentadecalactone, or any combination thereof.


In some embodiments, the delta-lactone of formula VII comprises delta-valerolactone, delta-hexalactone, delta-heptalactone, delta-octalactone, delta-nonalactone, delta-decalactone, delta-undecalactone, delta-dodecalactone, delta-tridecalactone, delta-tetradecalactone, delta-pentadecalactone, and any combination thereof.


In some embodiments, the delta lactone disclosed herein is in enantiomeric excess. In some embodiments, the enantiomer of delta lactone produced in excess is (R)-delta lactone. In some embodiments, the enantiomer of delta lactone produced in excess is (S)-delta lactone. In some embodiments, the compound of formula VII is in enantiomeric excess.


In some embodiments, the delta-lactone of formula VII is (R)-delta-hexalactone, (R)-delta-heptalactone, (R)-delta-octalactone, (R)-delta-nonalactone, (R)-delta-decalactone, (R)-delta-undecalactone, (R)-delta-dodecalactone, (R)-delta-tridecalactone, (R)-delta-tetradecalactone, (R)-delta-pentadecalactone, or any combination thereof.


In some embodiments, the delta-lactone of formula VII comprises (R)-delta-hexalactone, (R)-delta-heptalactone, (R)-delta-octalactone, (R)-delta-nonalactone, (R)-delta-decalactone, (R)-delta-undecalactone, (R)-delta-dodecalactone, (R)-delta-tridecalactone, (R)-delta-tetradecalactone, (R)-delta-pentadecalactone, and any combination thereof.


In some embodiments, the delta-lactone of formula VII comprises (S)-delta-hexalactone, (S)-delta-heptalactone, (S)-delta-octalactone, (S)-delta-nonalactone, (S)-delta-decalactone, (S)-delta-undecalactone, (S)-delta-dodecalactone, (S)-delta-tridecalactone. (S)-delta-tetradecalactone. (S)-delta-pentadecalactone, and any combination thereof.


In some embodiments, a massoia lactone pathway comprises: condensation of an acetaldehyde and a hexanal to 3-hydroxy-octanal by using aldolase DERA; condensation of the 3-hydroxy-octanal and an acetaldehyde to 5,3-dihydroxy-decanal by using aldolase DERA; and cyclization of 5,3-dihydroxy-decanal followed by oxidation and dehydration to form massoia lactone. In some embodiments, the massoia lactone is (R)-massoia lactone. In some embodiments the dehydration is spontaneous. In some embodiments, the dehydration is via a dehydratase. In some embodiments, the oxidation is spontaneous. In some embodiments, the oxidation is via an enzymatic oxidation. FIG. 9 shows the schematic of this pathway.


In some embodiments, a gamma lactone pathway is disclosed that comprises: 1) condensation of an aldehyde molecule and a pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid (compound of formula VIII) using aldolase (e.g. pyruvate-dependent aldolase); 2) reduction of the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid (compound of formula IX) using dehydrogenase or keto-reductase; 3) lactonization of the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2(3H)-furanone (compound of formula X); 4) dehydration (e.g. spontaneous dehydration or dehydratase) of the 3-hydroxydihydro-2(3H)-furanone (compound of formula X) to 2(5H)-furanone (compound of formula XI); and 5) reduction of 2(5H)-furanone (compound of formula XI) to a gamma-lactone (compound of formula XII) using reductase. FIG. 7 shows the schematic of this pathway.


In some embodiments, the present disclosure discloses a process of producing gamma-lactone. In some embodiments, condensation of an aldehyde molecule and a pyruvic acid in the presence of an aldolase enzyme results in a 4-hydroxy-2-oxo carboxylic acid compound. In some embodiments, the aldolase enzyme is specific for the condensation of the pyruvic acid and an aldehyde molecule. In some embodiments, the aldolase enzyme (e.g., DERA) is a modified aldolase enzyme. In some embodiments, the modified aldolase enzyme comprises an increased enzymatic activity in comparison with a corresponding unmodified, naturally occurring, or wild-type aldolase enzyme. In some embodiments, a gamma-lactone intermediate, 4-hydroxy-2-oxo carboxylic acid, is a compound represented by formula VIII:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the compound of formula VIII is reduced to 2,4-dihydroxy carboxylic acid. In some embodiments, the compound of formula VIII is reduced to 2,4-dihydroxy carboxylic acid in the presence of dehydrogenase and/or a keto-reductase. In some embodiments, a gamma-lactone intermediate, 2,4-dihydroxy carboxylic acid, is a compound represented by formula IX:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the 2,4-dihydroxy carboxylic acid compound of formula IX undergoes lactonization to form a 3-hydroxydihydro-2(3H)-furanone (e.g., a compound of formula X). In some embodiments, a gamma-lactone intermediate, 3-hydroxydihydro-2(3H)-furanone, is a compound represented by formula X:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the 3-hydroxydihydro-2(3H)-furanone compound of formula X undergoes dehydration to form 2(5H)-furanone (e.g., compound of formula XI). In some embodiments, a gamma-lactone intermediate, 2(5H)-furanone, is a compound represented by formula XI:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, dehydration occurs spontaneously. In some embodiments, dehydration comprises dehydratase.


In some embodiments, 2(5H)-furanone compound of formula XI is reduced to a gamma-lactone. In some embodiments, the reduction of 2(5H)-furanone compound of formula XI to gamma-lactone is in the presence of a reductase. In some embodiments, the gamma-lactone is a compound represented by formula XII:




embedded image



wherein R is hydrogen or CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the gamma-lactone compound of formula XII is gamma-butyrolactone, wherein R is hydrogen. In some embodiments, the gamma-lactone compound of formula XII is gamma-valerolactone, wherein n is 1. In some embodiments, gamma-lactone compound of formula XII is gamma-hexalactone, wherein n is 2. In some embodiments, the gamma-lactone compound of formula XII is gamma-heptalactone, wherein n is 3. In some embodiments, the gamma-lactone compound of formula XII is gamma-octalactone, wherein n is 4. In some embodiments, the gamma-lactone compound of formula XII is gamma-nonalactone, wherein n is 5. In some embodiments, the gamma-lactone compound of formula XII is gamma-decalactone, wherein n is 6. In some embodiments, the gamma-lactone compound of formula XII is gamma-undecalactone, wherein n is 7. In some embodiments, the gamma-lactone compound of formula XII is gamma-dodecalactone, wherein n is 8. In some embodiments, the gamma-lactone compound of formula XII is gamma-tridecalactone, wherein n is 9. In some embodiments, the gamma-lactone compound of formula XII is gamma-tetradecalactone, wherein n is 10.


In some embodiments, the gamma lactone disclosed herein is in enantiomeric excess. In some embodiments, the enantiomer of gamma lactone produced in excess is (R)-gamma lactone. In some embodiments, the enantiomer of gamma lactone produced in excess is (S)-gamma lactone. In some embodiments, the compound of formula XII is in enantiomeric excess.


In some embodiments, the gamma-lactone of formula XII is (R)-gamma-hexalactone, (R)-gamma-heptalactone, (R)-gamma-octalactone, (R)-gamma-nonalactone, (R)-gamma-decalactone, (R)-gamma-undecalactone, (R)-gamma-dodecalactone, (R)-gamma-tridecalactone, (R)-gamma-tetradecalactone, (R)-gamma-valerolactone, (R)-gamma-butyrolactone, or any combination thereof.


In some embodiments, the gamma-lactone of formula XII comprises (R)-gamma-hexalactone, (R)-gamma-heptalactone, (R)-gamma-octalactone, (R)-gamma-nonalactone, (R)-gamma-decalactone, (R)-gamma-undecalactone, (R)-gamma-dodecalactone, (R)-gamma-tridecalactone, (R)-gamma-tetradecalactone, (R)-gamma-butyrolactone, (R)-gamma-valerolactone, and any combination thereof.


In some embodiments, a gamma lactone pathway is disclosed that comprises, but not necessarily in this order: condensation of an aldehyde molecule and a pyruvate to a 2-keto-4-hydroxy acid using aldolase (e.g. pyruvate class II aldolase, Bphl, Hpal); reduction of the 2-keto-4-hydroxy acid to 2,4-dihydroxy acid using dehydrogenase or keto-reductase; dehydration of the 2,4-dihydroxy acid to 4-hydroxy-2-ene-acid; lactonization of the 4-hydroxy-2-ene acid to 2(5H)-furanone (compound of formula XI); and reduction of 2(5H)-furanone (compound of formula XI) to a gamma-lactone (compound of formula XII) using reductase. FIG. 10 shows the schematic of this pathway.


In some embodiments, a gamma lactone pathway is disclosed that comprises: 1) condensation of a heptanal molecule and a pyruvate (or pyruvic acid) to a 2-keto-4-hydroxydecanoic acid using pyruvate dependent aldolase (e.g., Bphl/Hpal aldolase); 2) reduction of the 2-keto-4-hydroxydecanoic acid to 2,4-dihydroxy-decanoic acid using keto-reductase and NADH/NADPH; 3) dehydration of the 2,4-dihydroxy-decanoic acid to 4-hydroxy-2-decenoic acid 4) lactonization of the 4-hydroxy-2-decenoic acid to 5-hexyl-furanone; and 5) reduction of 5-hexyl-furanone to a gamma-decalactone using enoate reductase.


In some embodiments, the lactonization may be spontaneous, chemical, enzymatic, non-enzymatic, or any combination thereof.


In some embodiments, the dehydration step is spontaneous. In some embodiments, the dehydration step is In some embodiments, the dehydration step is by using dehydratase. In some embodiments, the lactonization is spontaneous. In some embodiments, the lactonization is enzymatic. In some embodiments, the lactonization is a non-enzymatic lactonization reaction. In some embodiments, the lactonization is a chemical lactonization reaction. In some embodiments, the reduction step comprises NADH or NADPH. In some embodiments, the aldolase enzyme is deoxyribosephosphate (DERA).


In some embodiments, the gamma lactone disclosed herein is in enantiomeric excess. In some embodiments, the enantiomer of gamma lactone produced in excess is (R)-gamma lactone. In some embodiments, the enantiomer of gamma lactone produced in excess is (S)-gamma lactone.


An example of a reductase enzyme is an enoate-reductase capable of hydrogenating the alpha, beta carbon-carbon double bond at the alpha, beta position next to a carboxylic acid group into a carbon-carbon single bond. Example of enzymes having alpha, beta enoate reductase activity may be found in US applications 20070117191 and 20070254341. In some embodiments, the enoate reductase is a 2-enoate reductase, a NADH-dependent fumarate reductase, a succinate dehydrogenase, a coumarate reductase, a beta-nitroacrylate reductase, a methylacetate reductase, a NADPH 2-enoyl CoA reductase, enoyl-[acyl-carrier protein] reductase, N-ethylmaleimide reductase, crotonobetaine reductase, or a gamma-butyrobetaine reductase.


Enzymes belonging to the ketoreductase (KRED) or carbonyl reductase class (EC1.1.1.184) are useful for the synthesis of optically active alcohols from the corresponding prostereoisomeric ketone substrates and by stereospecific reduction of corresponding racemic aldehyde and ketone substrates. KREDs typically convert a ketone or aldehyde substrate to the corresponding alcohol product, but may also catalyze the reverse reaction, oxidation of an alcohol substrate to the corresponding ketone/aldehyde product. The reduction of ketones and aldehydes and the oxidation of alcohols by enzymes such as KRED may require a co-factor, most commonly reduced nicotinamide adenine dinucleotide (NADH) or reduced nicotinamide adenine dinucleotide phosphate (NADPH), and nicotinamide adenine dinucleotide (NAD) or nicotinamide adenine dinucleotide phosphate (NADP) for the oxidation reaction. NADH and NADPH serve as electron donors, while NAD and NADP serve as electron acceptors. It is frequently observed that ketoreductases and alcohol dehydrogenases accept either the phosphorylated or the non-phosphorylated co-factor (in its oxidized and reduced state).


KRED enzymes may be found in a wide range of bacteria and yeasts (for reviews: Kraus and Waldman. Enzyme catalysis in organic synthesis Vols. 1&2.VCH Weinheim 1995; Faber, K., Biotransformations in organic chemistry, 4th Ed. Springer, Berlin Heidelberg New York. 200 Hummel and Kula Eur. J. Biochem. 1989 184:1-13). Several KRED gene and enzyme sequences have been reported, e.g., Candida magnoliae (Genbank Acc. No. JC7338; GI: 11360538) Candida parapsilosis (Genbank Acc. No. BAA24528.1; GI: 2815409), Sporobolomyces salmonicolor (Genbank Acc. No. AF160799: GI: 6539734).


The reduction of 2-keto-4-hydroxy aldehyde to 2,4-dihydroxy aldehyde may be carried out by using appropriate alcohol dehydrogenase (ADH), aldo-ketoreductases, oxidoreductase, or aldehyde reductase using a reducing equivalent as cofactor, which in some embodiments, may be NADH or NADPH. In some embodiments, the ADH AKR, oxidoreductase, or aldehyde reductase, however, is substantially specific towards 2-keto-4-hydroxy aldehyde and does not act on acetaldehyde, thereby substantially avoiding or eliminating the production of ethanol as a side product.


In some embodiments, the present disclosure relates to a method of producing trans-2-unsaturated aldehyde from aldehydes and aldolases comprising at least one of the following trans-2-unsaturated aldehyde pathway enzymes: an aldolase enzyme that catalyzes condensation of two aldehyde molecules to produce 3-hydroxy aldehyde; and a dehydratase that dehydrates the 3-hydroxy aldehyde to trans-2-unsaturated aldehyde. In some embodiments, the at least one of the trans-2-unsaturated aldehyde pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the present disclosure relates to a method of producing trans-2-unsaturated aldehyde from aldehydes and aldolases comprising an aldolase enzyme that catalyzes condensation of two aldehyde molecules to produce 3-hydroxy aldehyde. In some embodiments the method of producing trans-2-unsaturated aldehyde comprises a dehydratase that dehydrates the 3-hydroxy aldehyde to trans-2-unsaturated aldehyde. In some embodiments, the aldolase enzyme is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the present disclosure relates to a method of producing delta-lactones from aldehydes and aldolases comprising at least one of the following delta-lactone pathway enzymes: an aldolase enzyme that catalyzes condensation of two aldehyde molecules to produce a 3-hydroxy aldehyde (i.e., step 1): an aldolase enzyme that catalyzes condensation of the 3-hydroxy aldehyde and an aldehyde molecule to a 5,3-dihydroxy aldehyde (i.e., step 2); an oxidase that oxidizes of tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one; and a reductase that reduces the 5,6-dihydro-2H-pyran-2-one to a delta-lactone. In some embodiments, the aldolase in step 1 is different from the aldolase in step 2. In some embodiments, the at least one of the delta-lactone pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the present disclosure relates to a method of producing delta-lactones from aldehydes and aldolases comprising the following delta-lactone pathway enzymes: an aldolase enzyme that catalyzes condensation of two aldehyde molecules to produce a 3-hydroxy aldehyde (i.e., step 1); and, an aldolase enzyme that catalyzes condensation of the 3-hydroxy aldehyde and an aldehyde molecule to a 5,3-dihydroxy aldehyde (i.e., step 2). In some embodiments, the aldolase in step 1 is different from the aldolase in step 2. In some embodiments, the method further comprises at least one of the following delta-lactone pathway enzymes: an oxidase that oxidizes tetrahydro-2H-pyran-2,4-diol to a tetrahydro-4-hydroxy-2H-pyran-2-one; a dehydratase that dehydrates the tetrahydro-4-hydroxy-2H-pyran-2-one to a 5,6-dihydro-2H-pyran-2-one; and a reductase that reduces the 5,6-dihydro-2H-pyran-2-one to a delta-lactone. In some embodiments, the at least one of the delta-lactone pathway enzymes of any one of the abovementioned steps is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the present disclosure relates to a method of producing gamma-lactones from an aldehyde molecule, a carboxylic acid, and aldolase comprising at least one of the following gamma-lactone pathway enzymes: an aldolase enzyme that catalyzes condensation of an aldehyde molecule and a pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid; a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid; a lactonization enzyme that transforms the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2-(3H)-furanone; a dehydratase that dehydrates the 3-hydroxydihydro-2-(3H)-furanone to 2(5H)-furanone; and a reductase that reduces the 2(5H)-furanone to a gamma-lactone. In some embodiments, the at least one of the gamma-lactone pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the present disclosure relates to a method of producing gamma-lactones from an aldehyde molecule, a carboxylic acid, and aldolase comprising an aldolase enzyme that catalyzes condensation of an aldehyde molecule and a pyruvic acid to a 4-hydroxy-2-oxo carboxylic acid. In some embodiments, the method further comprises at least one of the following gamma-lactone pathway enzymes: a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid; a lactonization enzyme that transforms the 2,4-dihydroxy carboxylic acid to a 3-hydroxydihydro-2-(3H)-furanone: a dehydratase that dehydrates the 3-hydroxydihydro-2-(3H)-furanone to 2(5H)-furanone; and a reductase that reduces the 2(5H)-furanone to a gamma-lactone. In some embodiments, the at least one of the enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the present disclosure relates to a method of producing gamma-lactones from an aldehyde, a pyruvate, and aldolase comprising at least one of the following gamma-lactone pathway enzymes: an aldolase that catalyzes condensation of an aldehyde molecule and a pyruvate to a 4-hydroxy-2-oxo carboxylic acid; a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid; a dehydratase that dehydrates 2,4-dihydroxy carboxylic acid to 4-hydroxy-2-ene acid; a lactonization enzyme that transforms the 4-hydroxy-2-ene acid to 2(5H)-furanone; and a reductase that reduces the 2(5H)-furanone to a gamma-lactone. In some embodiments, the at least one of the gamma-lactone pathway enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the present disclosure relates to a method of producing gamma-lactones from an aldehyde, a pyruvate, and an aldolase comprising an aldolase that catalyzes condensation of an aldehyde molecule and a pyruvate to a 4-hydroxy-2-oxo carboxylic acid. In some embodiments, the method further comprises at least one of the following gamma-lactone pathway enzymes: a dehydrogenase or a keto-reductase that reduces the 4-hydroxy-2-oxo carboxylic acid to 2,4-dihydroxy carboxylic acid; a dehydratase that dehydrates 2,4-dihydroxy carboxylic acid to 4-hydroxy-2-ene acid; a lactonization enzyme that transforms the 4-hydroxy-2-ene acid to 2(5H)-furanone; and a reductase that reduces the 2(5H)-furanone to a gamma-lactone. In some embodiments, the at least one of the enzymes is partially purified, is purified, is a modified enzyme, or any combinations thereof.


In some embodiments, the aldolase is a deoxyribose-5-phosphate aldolase DERA enzyme. In some embodiments, the DERA enzyme is modified. In some embodiments, the aldolase comprises an amino acid sequence of SEQ ID NO: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20, or an active fragment, or a homologue thereof. In some embodiments, the aldolase comprises at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% amino acid sequence identity to any one of SEQ ID NOs: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.


In some embodiments, an exogenous nucleic acid encoding the aldolase comprises a nucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, the exogenous nucleic acid encoding the aldolase comprises a nucleotide sequence that is at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, or at least about 99% nucleotide sequence identity to any one of SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the modified DERA enzyme comprises at least one mutation, wherein the at least one mutation corresponds to or is at an amino acid position selected from R190, T12, and S238 of SEQ ID NO: 18. In some embodiments, the DERA enzyme comprises at least one mutation selected from R190K, R190I, R190F, R190A, R190L, R190M, T12A, and S238A of SEQ ID NO: 18. In some embodiments, the modified DERA enzyme comprises mutations corresponding to each or at each amino acid position selected from R190, T12, and S238 of SEQ ID NO: 18. In some embodiments, the modified DERA enzyme comprises mutations at each amino acid position corresponding to or selected from R190, T12, and S238 of SEQ ID NO: 18


In some embodiments, the method or the process of the present disclosure is performed in a culture. In some embodiments, the culture is a microbial culture. In some embodiments, the aldehyde molecules according to the method or the process of the present disclosure are fed to the culture. In some embodiments, the aldehyde molecules according to the method or the process of the present disclosure are produced from glucose. In some embodiments, at least one aldehyde molecule is produced from glucose and the other aldehyde molecule is fed to the culture.


In some embodiments, the method or the process of the present disclosure comprises expressing at least one of the enzymes in a host cell. In some embodiments, the host cell is a microbial cell. In some embodiments, the host cell is a fungal or bacterial cell. In some embodiments, the microbial host cell is a transformed microbial host cell selected from the group consisting of Comamonas sp., Corynebacterium sp., Brevibacterium sp. Rhodococcus sp., Azotobacter sp., Citrobacter sp., Enterobacter sp., Clostridium sp., Klebsiella sp., Salmonella sp., Lactobacillus sp., Aspergillus sp., Saccharomyces sp., Yarrowia sp., Zygosaccharomyces sp., Pichia sp., Kluyveromyces sp., Candida sp., Hansenula sp., Dunaliella Debaryomyces sp., Mucor sp., Torulopsis sp., Methylobacteria sp., Bacillus sp., Escherichia sp., Pseudomonas sp., Rhizobium sp., and Streptomyces sp. In some embodiments, the microbial host cell is Escherichia coli. In some embodiments, the microbial cell is a viable cell. In some embodiments, the microbial cell produces a crude cell lysate. In some embodiments, crude cell lysate is purified. In some embodiments, the microbial cell comprises one or more gene deletions to increase the availability of aldehyde and/or pyruvate. In some embodiments, the microbial cell comprises one or more gene deletions encoding an alcohol dehydrogenase, a lactate dehydrogenase, or a pyruvate formate lyase.


In some embodiments, the present disclosure relates to a method of producing delta-lactone comprising a non-naturally occurring microorganism for producing delta-lactone from precursor aldehyde molecules in the presence of aldolases. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified, naturally occurring, or wild-type microorganism. In some embodiments, the aldolases are two aldolases. In some embodiments, one aldolase is specific for the condensation of an acetaldehyde and an aldehyde molecule and the other aldolase is specific for the condensation of 3-hydroxy aldehyde and an acetaldehyde molecule. In some embodiments, the two aldolases are two different aldolases. In some embodiments, the two aldolases are the same aldolase. In some embodiments, the two aldolases are DERA, or a variant, or homologue thereof.


In some embodiments, the present disclosure relates to a method of producing gamma-lactone comprising a non-naturally occurring microorganism for producing gamma-lactone from a precursor aldehyde molecule in the presence of aldolases. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the gamma-lactone pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified, naturally occurring, or wild-type microorganism.


In some embodiments, the present disclosure relates to a method of producing trans-2-unsaturated aldehyde comprising a non-naturally occurring microorganism for producing trans-2-unsaturated aldehyde from precursor aldehyde molecules in the presence of aldolases. In some embodiments, the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.


In some embodiments, the precursor aldehyde molecules are two aldehyde molecules. In some embodiments, the precursor aldehyde molecule is one aldehyde molecule. In some embodiments, the two aldehyde molecules are fed to a microbial culture. In some embodiments, the one aldehyde molecule is fed to a microbial culture. In some embodiments, at least one of the aldehyde molecule, the pyruvate, or pyruvic acid is fed to a microbial culture. In some embodiments, the aldehyde molecule is produced from glucose via an aldehyde producing pathway. In some embodiments, at least one of the two aldehyde molecules is produced from glucose and at least one of the two aldehyde molecules is fed to a microbial culture. In some embodiments, at least one of the two aldehyde molecules is acetaldehyde. In some embodiments, the acetaldehyde is produced from glucose via decarboxylation of pyruvate by pyruvate decarboxylase.


In some embodiments, the method or the process of the present disclosure comprises a non-naturally occurring enzyme. In some embodiments, the non-naturally occurring enzyme comprises at least one modified aldolase enzyme. In some embodiments, the modified aldolase enzyme comprises an increased activity towards an aldehyde condensation reaction in comparison to an unmodified aldolase enzyme. In some embodiments, the modified aldolase enzyme is a modified deoxyribose-5-phosphate aldolase (DERA) enzyme.


In some embodiments, the present disclosure relates to a non-naturally occurring microorganism capable of synthesizing delta-lactone from aldehydes and aldolases. In some embodiments, the non-naturally occurring microorganism is derived from a wild-type or unmodified microorganism and the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the delta-lactone pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type microorganism.


In some embodiments, the present disclosure relates to a non-naturally occurring microorganism capable of synthesizing gamma-lactone from an aldehyde, a pyruvate, and aldolases. In some embodiments, the non-naturally occurring microorganism is derived from a wild-type or unmodified microorganism and the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the gamma-lactone pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type microorganism.


In some embodiments, the present disclosure relates to a non-naturally occurring microorganism capable of synthesizing trans-2-unsaturated aldehyde from aldehydes and aldolases. In some embodiments, the non-naturally occurring microorganism is derived from a wild-type or unmodified microorganism and the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in the trans-2-unsaturated aldehyde pathway, in comparison with the enzymatic activity of the same enzyme in the wild-type microorganism.


In some embodiments, the non-naturally occurring microorganism comprises one or more gene deletions encoding enzymes that consume aldehyde and/or pyruvate. In some embodiments, the one or more gene deletions is selected from an alcohol dehydrogenase, a lactate dehydrogenase, or a pyruvate formate lyase.


In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from pflB, IdhA, adhE, yqhD, eutG, adhP, and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), adhE, yqhD, eutG adhP, and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from ilvD, rhtA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB, IdhA, adhE, yqhD, eutG, adhP, and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from dkgA aldB poxB ybbO yahK deoC paoAB/C (yagT/S/R) adhE yqhD eutG adhP and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from ilvD, rhtA, dkgA, aldB, poxB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), pflB, IdhA, adhE, yqhD, eutG, adhP, and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from dkgA, aldB, ybbO, yahK, deoC, paoA/B/C (yagT/S/R), adhE, yqhD, eutG, adhP, and, yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from ilvD, rhtA, dkgA, aldB, ybbO, yahK, deoC, paoAB/C (yagT/S/R), pflB ldhA, adhE, yqhD, eutG, adhP and yjgB in E. coli. In some embodiments, the one or more genes that is deleted corresponds to one or more genes selected from pflB, ldhA, adhE in E. coli.


In some embodiments, the present disclosure relates to a recombinant vector comprising a gene encoding DERA enzyme, and/or encoding KDC, and/or encoding BFD, and/or encoding decarboxylase. In some embodiments, the present disclosure relates to an expression vector comprising a nucleic acid sequence corresponding to any SEQ ID NO disclosed in the present disclosure or corresponding to any enzyme disclosed in the present disclosure. In some embodiments, a host cell comprises the expression vector. In some embodiments, the host cell is a microbial cell.


In some embodiments, the present disclosure relates to an isolated non-naturally occurring microorganism comprising a gene encoding a DERA enzyme, and/or an aldolase, and/or any enzyme disclosed in the present disclosure, and/or encoding a KDC, and/or encoding a BFD, and/or encoding a decarboxylase.


In some embodiments, a non-naturally occurring microorganism is obtained by introducing a recombinant vector into a host microorganism.


In some embodiments, the present disclosure relates to a method of preparing a modified enzyme, comprising: (a) subjecting a deoxyribonucleic acid (DNA) sequence encoding the enzyme to random or site directed mutagenesis; (b) expressing the modified DNA sequence obtained in (a) in a host cell; and (c) screening for host cells expressing the modified enzyme. In some embodiments, the enzyme has improved aldehyde condensation activity. In some embodiments, the enzyme is an aldolase. In some embodiments, the aldolase is DERA. In some embodiments, DERA comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. In some embodiments, the modified DNA sequence is expressed by transforming a suitable host cell with the modified DNA sequence and culturing the host cell obtained in (b) under suitable conditions for expressing the modified DNA sequence. In some embodiments, host cells screened in (c) may be subjected to a second mutagenesis treatment, and/or to rescreening, and/or to reisolation, and/or to recloning.


In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein R is hydrogen.


In some embodiments, the present disclosure relates to any one of the compounds disclosed herein, wherein R is CnH2n+1, and each ‘n’ is independently 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.


In some embodiments, the trans-2-unsaturated aldehyde of the methods of the present disclosure has a percent of theoretical yield of not less than about 3%, not less than about 4%, not less than about 5%, not less than about 6%, not less than about 7%, not less than about 8%, not less than about 9%, not less than about 10%, not less than about 11%, not less than about 12%, not less than about 13%, not less than about 14%, not less than about 15%, not less than about 16%, not less than about 17%, not less than about 18%, not less than about 19%, not less than about 20%, not less than about 22%, not less than about 25%, not less than about 27%, not less than about 30%, not less than about 32%, not less than about 35%, not less than about 37%, not less than about 40%, not less than about 42%, not less than about 45%, not less than about 47%, not less than about 50%, not less than about 52%, not less than about 55%, not less than about 57%, not less than about 60%, not less than about 65%, not less than about 70%, not less than about 75%, not less than about 80%, not less than about 85%, not less than about 90%, or not less than about 95%.


In some embodiments, the delta-lactone of the methods of the present disclosure has a percent of theoretical yield of not less than about 1.5%, not less than about 2%, not less than about 2.5%, not less than about 3%, not less than about 4%, not less than about 5%, not less than about 6%, not less than about 7%, not less than about 8%, not less than about 9%, not less than about 10%, not less than about 11%, not less than about 12%, not less than about 13%, not less than about 14%, not less than about 15%, not less than about 16%, not less than about 17%, not less than about 18%, not less than about 19%, not less than about 20%, not less than about 22%, not less than about 25%, not less than about 27%, not less than about 30%, not less than about 32%, not less than about 35%, not less than about 37%, not less than about 40%, not less than about 42%, not less than about 45%, not less than about 47%, not less than about 50%, not less than about 52%, not less than about 55%, not less than about 57%, not less than about 60%, not less than about 65%, not less than about 70%, not less than about 75%, not less than about 80%, not less than about 85%, not less than about 90%, or not less than about 95%.


In some embodiments, the gamma-lactone of the methods of the present disclosure has a percent of theoretical yield of not less than about 1.5%, not less than about 2%, not less than about 2.5%, not less than about 3%, not less than about 4%, not less than about 5%, not less than about 6%, not less than about 7%, not less than about 8%, not less than about 9%, not less than about 10%, not less than about 11%, not less than about 12%, not less than about 13%, not less than about 14%, not less than about 15%, not less than about 16%, not less than about 17%, not less than about 18%, not less than about 19%, not less than about 20%, not less than about 22%, not less than about 25%, not less than about 27%, not less than about 30%, not less than about 32%, not less than about 35%, not less than about 37%, not less than about 40%, not less than about 42%, not less than about 45%, not less than about 47%, not less than about 50%, not less than about 52%, not less than about 55%, not less than about 57%, not less than about 60%, not less than about 65%, not less than about 70%, not less than about 75%, not less than about 80%, not less than about 85%, not less than about 90%, or not less than about 95%.


In some embodiments, the trans-2-unsaturated aldehyde from the methods of the present disclosure is at least about 45%, at least about 47%, at least about 50%, at least about 52%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% pure, 1H NMR and gas chromatography may be used to characterize the desired trans-2-unsaturated aldehyde product. For example, the trans-2-unsaturated aldehyde is free of undesired byproduct or starting material. For example, the impurities (e.g., byproduct and starting material) in the trans-2-unsaturated aldehyde product is less than about 25% (e.g., less than about 20%, less than about 18%, less than about 15%, less than about 13%, less than about 10%, less than about 8%, less than about 5%, less than about 3%, less than about 2%, less than about 1.5%, or less than about 1%).


In some embodiments, the delta-lactone from the method of the present disclosure is at least about 45%, at least about 47%, at least about 50%, at least about 52%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% pure, 1H NMR and gas chromatography may be used to characterize the desired delta-lactone product. For example, the delta-lactone is free of undesired byproduct or starting material. For example, the impurities (e.g., byproduct and starting material) in the delta-lactone product is less than about 25% (e.g., less than about 20%, less than about 18%, less than about 15%, less than about 13%, less than about 10%, less than about 8%, less than about 5%, less than about 3%, less than about 2%, less than about 1.5%, or less than about 1%).


In some embodiments, the gamma-lactone from the methods of the present disclosure is at least about 45%, at least about 47%, at least about 50%, at least about 52%, at least about 55%, at least about 60% at least about 65%, at least about 70%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% pure, 1H NMR and gas chromatography may be used to characterize the desired delta-lactone product. For example, the gamma-lactone is free of undesired byproduct or starting material. For example, the impurities (e.g., byproduct and starting material) in the gamma-lactone product is less than about 25% (e.g., less than about 20%, less than about 18%, less than about 15%, less than about 13%, less than about 10%, less than about 8%, less than about 5%, less than about 3%, less than about 2%, less than about 1.5%, or less than about 1%).


In some embodiments, the compound of formula III of the present disclosure is at least about 45%, at least about 47%, at least about 50%, at least about 52%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% pure.


In some embodiments, the compound of formula VI of the present disclosure is at least about 45%, at least about 47%, at least about 50%, at least about 52%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about %%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% pure.


In some embodiments, the compound of formula VII of the present disclosure is at least about 45%, at least about 47%, at least about 50%, at least about 52%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82% at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% pure.


In some embodiments, the compound of formula XII of the present disclosure is at least about 45%, at least about 47%, at least about 50%, at least about 52%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about %%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% pure.


In some embodiments, the trans-2-unsaturated aldehyde product of the method of the present disclosure contains at least about 80% of a compound of formula Ill. In some embodiments, the product of the methods of the present disclosure contains at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula III. For example, the product is free of undesired byproduct or starting material. For example, the impurities in the trans-2-unsaturated aldehyde compound of formula III is less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


In some embodiments, the delta-lactone product of the method of the present disclosure contains at least about 80% of a compound of formula VI. In some embodiments, the product of the method of the present disclosure contains at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula VI. For example, the product is free of undesired byproduct or starting material. For example, the impurities in the delta-lactone compound of formula VI is less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


In some embodiments, the delta-lactone product of the method of the present disclosure contains at least about 80% of a compound of formula VII. In some embodiments, the product of the method of the present disclosure contains at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula VII. For example, the product is free of undesired byproduct or starting material. For example, the impurities in the delta-lactone compound of formula VII is less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


In some embodiments, the gamma-lactone product of the method of the present disclosure contains at least about 80% of a compound of formula XII. In some embodiments, the product of the method of the present disclosure contains at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula XII. For example, the product is free of undesired byproduct or starting material. For example, the impurities in the gamma-lactone compound of formula XII is less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


In some embodiments, the delta-lactone product of the method of the present disclosure contains at least about 50% of a compound of formula VI in the (R)-configuration. In some embodiments, the product of the method of the present disclosure contains at least about 51%, at least about 52%, at least about 53%, at least about 54%, at least about 55%, at least about 56%, at least about 57%, at least about 58%, at least about 59%, at least about 60%, at least about 61%, at least about 62%, at least about 63%, at least about 64%, at least about 65%, at least about 66%, at least about 67%, at least about 68%, at least about 69%, at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula VI in the (R)-configuration. For example, the product is free of undesired byproduct or starting material. For example, the impurities (e.g., undesired byproduct, starting material, or the (S)-configuration of compound of formula VI) in the delta-lactone compound of formula VI in the (R)-configuration is less than about 50%, less than about 45%, less than about 40%, less than about 35%, less than about 30%, less than about 25%, or less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


In some embodiments, the delta-lactone product of the method of the present disclosure contains at least about 50% of a compound of formula VII in the (R)-configuration. In some embodiments, the product of the method of the present disclosure contains at least about 51%, at least about 52%, at least about 53%, at least about 54%, at least about 55%, at least about 56%, at least about 57%, at least about 58%, at least about 59%, at least about 60%, at least about 61%, at least about 62%, at least about 63%, at least about 64%, at least about 65%, at least about 66%, at least about 67%, at least about 68%, at least about 69%, at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula VII in the (R)-configuration. For example, the product is free of undesired byproduct or starting material. For example, the impurities (e.g., undesired byproduct, starting material, or the (S)-configuration of compound of formula VII) in the delta-lactone compound of formula VII in the (R)-configuration is less than about 50%, less than about 45%, less than about 40%, less than about 35%, less than about 30%, less than about 25%, or less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


In some embodiments, the gamma-lactone product of the method of the present disclosure contains at least about 50% of a compound of formula XII in the (R)-configuration. In some embodiments, the product of the method of the present disclosure contains at least about 51%, at least about 52%, at least about 53%, at least about 54%, at least about 55%, at least about 56%, at least about 57%, at least about 58%, at least about 59%, at least about 60%, at least about 61%, at least about 62%, at least about 63%, at least about 64%, at least about 65%, at least about 66%, at least about 67%, at least about 68%, at least about 69%, at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 98.5%, at least about 99%, or at least about 99.5% of a compound of formula X in the (R)-configuration. For example, the product is free of undesired byproduct or starting material. For example, the impurities (e.g., undesired byproduct, starting material, or the (S)-configuration of compound of formula XII) in the gamma-lactone compound of formula XII in the (R)-configuration is less than about 50%, less than about 45%, less than about 40%, less than about 35%, less than about 30%, less than about 25%, or less than about 20% (e.g., less than about 19%, less than about 18%, less than about 17%, less than about 16%, less than about 15%, less than about 14%, less than about 13%, less than about 12%, less than about 11%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1.5%, less than about 1%, or less than about 0.5%).


It will be appreciated that the methods disclosed herein are suitable for both large-scale and small-scale preparations of the desired compounds. In some embodiments of the methods described herein, compounds of formula III, or compounds of formula VI, or compounds of formula VII, or compounds of formula XII may be prepared on a large scale, for example on an industrial production scale rather than on an experimental/laboratory scale. For example, the methods described herein allow for the preparation of batches of at least about 1 g, or at least about 5 g, or at least about 10 g, or at least about 100 g, or at least about 1 kg, or at least about 10 kg, or at least about 100 kg, or at least about 500 kg, or at least about 700 kg, or at least about 1000 kg, or at least about 2000 kg, or at least about 2500 kg, or at least about 5000 kg, or at least about 10000 kg of product.


In some embodiments, the overall yield of the compounds produced by the pathway of the present disclosure is equal to or higher than the overall yield of the compounds produced via a fatty acid oxidation pathway. For example, in some embodiments, the overall yield of a trans-2-unsaturated aldehyde produced via the pathway of the present disclosure is equal to or higher than the overall yield of the trans-2-unsaturated aldehyde produced via a fatty acid oxidation pathway. In some embodiments, the sugar based fermentation process provides a higher yield of the trans-2-aldehyde than a fatty acid based process.


In some embodiments, the overall yield of a trans-2-unsaturated aldehyde compound of formula III according to the method of the present disclosure is equal to or higher than the overall yield of the trans-2-unsaturated aldehyde compound of formula Ill produced via a fatty acid oxidation pathway. In some embodiments, the overall yield of the trans-2-unsaturated aldehyde according to the present disclosure is about or at least about 1.1 times, about or at least about 1.2 times, about or at least about 1.3 times, about or at least about 1.4 times, about or at least about 1.5 times, about or at least about 1.6 times, about or at least about 1.7 times, about or at least about 1.8 times, about or at least about 1.9 times, about or at least about 2 times, about or at least about 2.1 times, about or at least about 3 times, about or at least about 4 times, about or at least about 5 times the overall yield of the trans-2-unsaturated aldehyde produced via a fatty acid oxidation pathway. In some embodiments, the overall yield of the trans-2-unsaturated aldehyde according to the present disclosure is between about 1.1 and about 2 times, between about 1.5 and about 2.5 times, between about 1.2 to about 2.1 times, between about 1.8 to about 2.2 times, between about 1.1 and about 2.5 times, or between about 1.1 and about 3 times the overall yield of the trans-2-unsaturated aldehyde produced via a fatty acid oxidation pathway.


In some embodiments, the theoretical yield of a trans-2-unsaturated aldehyde compound of formula III according to the method of the present disclosure is equal to or at least about 0.045 moles, equal to or at least about 0.050 moles, equal to or at least about 0.055 moles, equal to or at least about 0.056 moles, equal to or at least about 0.057 moles, equal to or at least about 0.058 moles, equal to or at least about 0.059 moles, equal to or at least about 0.060 moles, equal to or at least about 0.065 moles, equal to or at least about 0.070 moles, equal to or at least about 0.075 moles, equal to or at least about 0.080 moles, equal to or at least about 0.085 moles, equal to or at least about 0.090 moles, equal to or at least about 0.095 moles, equal to or at least about 0.096 moles, equal to or at least about 0.097 moles, equal to or at least about 0.098 moles, equal to or at least about 0.099 moles, equal to or at least about 0.1 moles, equal to or at least about 0.11 moles, equal to or at least about 0.12 moles, or equal to or at least about 0.13 moles of trans-2-unsaturated aldehyde per mole of carbon. In some embodiments, the theoretical yield of a trans-2-unsaturated aldehyde compound of formula III according to the method of the present disclosure is between about 0.055 and about 0.12 moles, between about 0.056 and about 0.11 moles, between about 0.060 and about 0.11 moles, between about 0.070 and about 0.12 moles, or between about 0.075 and about 0.11 moles of trans-2-unsaturated aldehyde per mole of carbon.


In some embodiments, the overall yield of a delta-lactone compound of formula VI and/or of formula VII according to the methods of the present disclosure is equal to or at least about 0.25 times, equal to or at least about 0.28 times, equal to or at least about 0.30 times, equal to or at least about 0.35 times, equal to or at least about 0.40 times, equal to or at least about 0.45 times, equal to or at least about 0.50 times, equal to or at least about 0.55 times, equal to or at least about 0.60 times, equal to or at least about 0.65 times, equal to or at least about 0.70 times, equal to or at least about 0.75 times, equal to or at least about 0.80 times, equal to or at least about 0.81 times, equal to or at least about 0.82 times, equal to or at least about 0.83 times, equal to or at least about 0.84 times, equal to or at least about 0.85 times, equal to or at least about 0.86 times, equal to or at least about 0.87 times, equal to or at least about 0.88 times, equal to or at least about 0.90 times, equal to or at least about 1 time, or equal to or at least about two times the overall yield of the delta-lactone produced via a fatty acid oxidation pathway.


In some embodiments, the theoretical yield of a delta-lactone compound of formula VI and/or of formula VII according to the methods of the present disclosure is equal to or at least about 0.030 moles, equal or at least about 0.035 moles, equal to or at least about 0.040 moles, equal to or at least about 0.041 moles, equal to or at least about 0.042 moles, equal to or at least about 0.043 moles, equal to or at least about 0.044 moles, equal to or at least about 0.045 moles, equal to or at least about 0.046 moles, equal to or at least about 0.047 moles, equal to or at least about 0.048 moles, equal to or at least about 0.049 moles, equal to or at least about 0.050 moles, equal to or at least about 0.055 moles, equal to or at least about 0.060 moles, equal to or at least about 0.065 moles, equal to or at least about 0.070 moles, equal to or at least about 0.080 moles, equal to or at least about 0.090 moles, or equal to or at least about 0.1 moles of delta-lactone per mole of carbon. In some embodiments, the theoretical yield of a delta-lactone compound of formula VI and/or of formula VII according to the methods of the present disclosure is between about 0.035 and about 0.048 moles, between about 0.030 and about 0.050 moles, between about 0.040 and about 0.048 moles, between about 0.045 and about 0.055 moles, or between about 0.020 and about 0.060 moles of delta-lactone per mole of carbon.


In some embodiments, the overall yield of a gamma-lactone compound of formula XII according to the methods of the present disclosure is equal to or at least about 0.25 times, equal to or at least about 0.28 times, equal to or at least about 0.30 times, equal to or at least about 0.35 times, equal to or at least about 0.40 times, equal to or at least about 0.45 times, equal to or at least about 0.50 times, equal to or at least about 0.55 times, equal to or at least about 0.60 times, equal to or at least about 0.65 times, equal to or at least about 0.70 times, equal to or at least about 0.75 times, equal to or at least about 0.80 times, equal to or at least about 0.81 times, equal to or at least about 0.82 times, equal to or at least about 0.83 times, equal to or at least about 0.84 times, equal to or at least about 0.85 times, equal to or at least about 0.86 times, equal to or at least about 0.87 times, equal to or at least about 0.88 times, equal to or at least about 0.90 times, equal to or at least about 1 time, or equal to or at least about two times the gamma-lactone produced via a fatty acid oxidation pathway.


In some embodiments, the theoretical yield of a gamma-lactone compound of formula XII according to the methods of the present disclosure is equal to or at least about 0.030 moles, equal or at least about 0.035 moles, equal to or at least about 0.040 moles, equal to or at least about 0.041 moles, equal to or at least about 0.042 moles, equal to or at least about 0.043 moles, equal to or at least about 0.044 moles, equal to or at least about 0.045 moles, equal to or at least about 0.046 moles, equal to or at least about 0.047 moles, equal to or at least about 0.048 moles, equal to or at least about 0.049 moles, equal to or at least about 0.050 moles, equal to or at least about 0.055 moles, equal to or at least about 0.060 moles, equal to or at least about 0.065 moles, equal to or at least about 0.070 moles, equal to or at least about 0.080 moles, equal to or at least about 0.090 moles, or equal to or at least about 0.1 moles of delta-lactone per mole of carbon. In some embodiments, the theoretical yield of a gamma-lactone compound of formula XII according to the methods of the present disclosure is between about 0.035 and about 0.048 moles, between about 0.030 and about 0.050 moles, between about 0.040 and about 0.048 moles, between about 0.045 and about 0.055 moles, or between about 0.020 and about 0.060 moles of gamma-lactone per mole of carbon.


In some embodiments, the trans-2-unsaturated aldehyde process, or the delta-lactone process, or the gamma-lactone process is cell-free. In some embodiments, the trans-2-unsaturated aldehyde process, or the delta-lactone process, or the gamma-lactone process is a biotransformation process.


Genetic Engineering Technologies


The technologies for deriving the genetically engineered (e.g., non-naturally occurring) microorganism from a wild-type microorganism may be used to genetically modify any enzymes in the wild-type microorganism that may be involved in the trans-2-unsaturated aldehyde pathway, the delta-lactone pathway, and/or the gamma-lactone pathway, and thereby derive the genetically engineered microorganism therefrom.


Increase or decrease an enzymatic activity: the target gene, which is an enzyme that directly or indirectly affects the yield and/or selectivity of the trans-2-unsaturated aldehyde pathway, the delta-lactone pathway, and/or the gamma-lactone pathway, may have its enzymatic activity increased or decreased. When the target encodes an enzyme that contributes to the biosynthesis of the trans-2-unsaturated aldehyde, the delta-lactone, and/or the gamma-lactone, the present disclosure uses genetic engineering technologies to increase the enzymatic activity of the enzyme. The increase of an enzymatic activity of a target gene (encoding the enzyme) in a genetically engineered microorganism may be achieved in one of three ways: by either replacing the native enzyme with an exogenous enzyme which is more active than the native enzyme encoded by the target gene, by enhancing the expression level of the target gene, or both. The exogenous enzyme may be an ortholog, a homolog, or an up-mutant of the target gene. When the target gene is overexpressed in the genetically engineered microorganism, the enzyme encoded by the target gene may be produced in a larger quantity. Thus the genetically engineered microorganism can have an increased enzymatic activity of the target gene. The up-mutant can have point mutations, deletions or insertions.


Expression


In some embodiments of the present disclosure, a DNA sequence encoding an enzyme of the present disclosure, or a mutated DNA sequence encoding a variant or modified enzyme (e.g., aldolase. DERA) prepared by methods described herein, or any alternative methods known in the art, can be expressed, in enzyme form, using an expression vector which typically includes control sequences encoding a promoter, operator, ribosome binding site, translation initiation signal, and, optionally, a repressor gene or various activator genes.


In some embodiments, the recombinant expression vector carrying the DNA sequence encoding an enzyme of the present disclosure may be any vector which may conveniently be subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced. Thus, the vector may be an autonomously replicating vector, i.e., a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, a bacteriophage or an extrachromosomal element, minichromosome or an artificial chromosome. Alternatively, the vector may be one which, when introduced into a host cell, is integrated into the host cell genome and replicated together with the chromosome(s) into which it has been integrated.


In some embodiments, in the vector, the DNA sequence should be operably connected to a suitable promoter sequence. The promoter may be any DNA sequence which shows transcriptional activity in the host cell of choice and may be derived from genes encoding proteins either homologous or heterologous to the host cell.


The expression vector of the present disclosure may also comprise a suitable transcription terminator and, in eukaryotes, polyadenylation sequences operably connected to the DNA sequence encoding a variant of the present disclosure. Termination and polyadenylation sequences may suitably be derived from the same sources as the promoter.


The vector may further comprise a DNA sequence enabling the vector to replicate in a host cell. Examples of such sequences are the origins of replication of plasmids pUC19, pACYC177, pUBI10, pE194, pAMB1 and pIJ702.


The vector may also comprise a selectable marker, e.g., a gene the product of which complements a defect in the host cell, such as the dal genes from B. subtilis or B. licheniformis, or one which confers antibiotic resistance such as ampicillin, kanamycin, chloramphenicol or tetracyclin resistance.


In some embodiments, intracellular expression may be advantageous in some respects, e.g., when using certain bacteria as host cells. In some embodiments, the expression is extracellular. The parent enzyme (e.g., aldolase, DERA) may in itself comprise a preregion permitting secretion of the expressed enzyme into the culture medium. In some embodiments, this preregion may be replaced by a different preregion or signal sequence, convenient accomplished by substitution of the DNA sequences encoding the respective preregions.


The procedures used to ligate the DNA construct of the present disclosure encoding an enzyme of the present disclosure, the promoter, terminator and other elements, respectively, and to insert them into suitable vectors containing the information necessary for replication, are well known to persons skilled in the art (Sambrook et al. (1989)).


A cell of the present disclosure either comprising a DNA construct or an expression vector of the disclosure as defined herein is advantageously used as a host cell in the production of an enzyme of the present disclosure. The cell may be transformed with the DNA construct of the present disclosure encoding a variant, or a wild-type, conveniently by integrating the DNA construct in the host chromosome. This integration is generally considered to be an advantage as the DNA sequence is more likely to be stably maintained in the cell. Integration of the DNA constructs into the host chromosome may be performed according to conventional methods, e.g., by homologous or heterologous recombination. Alternatively, the cell may be transformed with an expression vector as described herein in connection with the different types of host cells.


The cell of the present disclosure may be a cell of a higher organism such as a mammal or an insect. In some embodiments the cell is a microbial cell. e.g., a bacterial or a fungal (including yeast) cell.


Examples of suitable bacteria are gram-positive bacteria such as Bacillus subtilis. Bacillus licheniformis, Bacillus lentus, Bacillus brevis. Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans, Bacillus circulars, Bacillus lautus, Bacillus megaterium, Bacillus thuringaensis, or Streptomyces lividans, or Streptomyces murinus, or gram-negative bacteria such as E. coli. The transformation of the bacteria may for instance be effected by protoplast transformation or by using competent cells in a manner known in the art.


The yeast organism may be selected from a species of Saccharomyces or Schizosaccharomyces, e.g., Saccharomyces cerevisiae. The filamentous fungus may belong to a species of Aspergillus, e.g., Aspergillus oryzae, Aspergillus niger or Aspergillus nidulans. Fungal cells may be transformed by a process involving protoplast formation and transformation of the protoplasts followed by regeneration of the cell wall in a manner known in the art.


In some embodiments, the present disclosure relates to a method of producing an enzyme of the present disclosure, or a variant thereof, which method comprises cultivating a host cell as described above under conditions conducive to the production of the variant and recovering the variant from the cells and/or culture medium.


The medium used to cultivate the cells may be any conventional medium suitable for growing a host cell and obtaining expression of the enzyme of the disclosure or a variant thereof. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g., in catalogues of the American Type Culture Collection).


The enzyme of the present disclosure, or a variant thereof secreted from the host cells may conveniently be recovered from the culture medium by well-known procedures including separating the cells from the medium by centrifugation or filtration, and precipitating proteinaceous components of the medium by means of a salt such as ammonium sulphate, followed by chromatographic procedures such as ion exchange chromatography, affinity chromatography, or the like.


Methods


Growth of E. Coli according to the methods of the present disclosure may include at least two phases: seed growth and fermentation. The seed growth phase, as described further below, may proceed in one or more seed culture stages (e.g., two stages or three stages).


A seed culture is first grown by inoculating seed medium with a sample from a stock culture (e.g., a working cell bank (WCB)). A sample of this seed culture is used either to inoculate a second seed culture or to inoculate a relatively large fermentation culture. Such seed cultures are typically carried out to allow the quantity of the microorganism from a stored culture (e.g., WCB) to be exponentially increased (scaled-up). Seed cultures may also be used to rejuvenate relatively dormant microbes in stored cultures. As is well understood in the art, at least one seed culture (e.g., two or three cultures or stages) may be used to scale-up the quantity of bacteria (e.g., E. coli) for inoculation into the fermentation medium.


The number of seed cultures (or stages) used depends on, for example, the size and volume of the fermentation step. For example, the culture process may involve two seed cultures: a first seed culture is grown from an inoculation of a stock (stage one seed culture), a sample of this seed culture is used to inoculate a second seed culture (stage two seed culture), and a sample from this second culture is used to inoculate a fermentation culture (fermentation stage). In some embodiments, the first and/or second seed cultures are grown in M9 media. In some embodiments, the cultures are cultured under pH controlled conditions. In some embodiments, a pH 7 is maintained with 5% ammonium hydroxide. In some embodiments, the cultures are cultured under uncontrolled pH conditions.


In stage one, a culture of E Cob is suspended in seed medium and is incubated at a temperature between about 30-40° C., preferably at 37±1 deg. C. In stage two, a sample of the stage one seed medium is used to inoculate a stage two seed medium for further growth. After inoculation, the stage two medium is incubated at a temperature between about 30-40° C., preferably at 37±1° C. Preferably, growth in seed media at any stage does not result in cell lysis before inoculation of fermentation media. Additional growth in a third (fourth, etc.) stage seed culture may also be carried out.


All references cited within this disclosure are hereby incorporated by reference in their entirety. It should be appreciated that any patent, publication, or other disclosure material, in whole or in part, that is incorporated by reference herein is incorporated only to the extent that the incorporated material does not conflict with definitions, statements, or other disclosure material set forth in this disclosure. As such, and to the extent necessary, the disclosure as explicitly set forth herein supersedes any conflicting material incorporated herein by reference. Certain embodiments are further described in the following examples. These embodiments are provided as examples only and are not intended to limit the scope of the claims in any way.


It will be understood that numerous modifications thereto will appear to those skilled in the art. Accordingly, the above description and accompanying drawings should be taken as illustrative of the present disclosure and not in a limiting sense. It will further be understood that it is intended to cover any variations, uses, or adaptations of the present disclosure following, in general, the principles of the present disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the present disclosure pertains and as may be applied to the essential features herein before set forth, and as follows in the scope of the appended claims.


The following examples are provided to further illustrate the advantages and features of the present disclosure, but are not intended to limit the scope of the disclosure. While they are typical of those that may be used, other procedures, methodologies, or techniques may alternatively be used.


EXAMPLES

The following specific examples are illustrative and non-limiting. The examples described herein reference and provide non-limiting support to the various embodiments described in the preceding sections.


The strains and plasmids constructed and used in the examples herein include those listed in TABLE 13.












TABLE 13







Strain designation
Genotype









Wild type (K-12 MG1655)

E. coli MG1655




Ardra008

E. coli MG1655 ΔadhE,





ΔldhA, ΔpflB ΔyqhD,




ΔeutG, ΔadhP ΔyjgB::




cmR + pArdra002
























DNA parts and plasmids
Description









Empty vector
pBluescript II KS(+)



pArdra002
pBluescript II KS(+) expressing




PDC, BH1352, PA1127




each under own promoter










Gene deletions were preformed using the lamda-red protocol and deletion primers provided within the publication (Baba, T., et al., Construction of Escherichia coli K-12 in-frame single-gene knockout mutants: the Keio collection. Mol Syst Biol, 2006, 2: p. 2006 0008).


The gene an dprotein sequences of BH1352(DERA Bacillus halodurans). PA127 (Akr Pseudomonas aeruginosa PAO1), PDC from Zymomonas mobilis, pyruate dependent aldolase (Eda Escherichia coli K-12MG1655), 4-hydroxy-2-oxovalerate aldolase (MphE Escherichia coli K-12 MGg1655), 4-hydroxy-2-oxovalerate aldolase (Paraburkholderia xenovorans LB4g), 4-hydroxy-2-oxoheptanedioate aldolase (HpcH/HpaI Escherichia coli ATCC8739). L-Threonine dehydratase ilvA (Escherichia coli K-12 MG1655), alpha-ketoisovalerate decarboxylase (kivD Lactococcus lactis lactis KF147), 2-isopropylmalate synthase (leuA Escherichia coli K-13 MG1655), 3-isopropylmalate dehydrogenase (leuB Escherichia coli K-12 MG1655), Isopropylmalate dehydratase (leuC Escherichia coli K-12Mg655), Isopropylmalate dehydratase (leuD Escherichia coli K-12 Mg1655), aspartate kinase homoserine dehydrogenase (thra Escherichia coli MG1655), homoserine kinase (thrB Escherichia coli MG1655), threonine synthase (thrC Escherichia coli MG1655). Themi-acetal oxidase (zwf Escherichia coli), the DERA sequences, the BFD and KDC sequences are shown in TABLE 14.











TABLE 14







SEQ


Name

ID NO


















Gene sequence






PA1127
atgagcgttgaaagcattcgcatcgagggtatcgacacgccggtctcgcg
34


(Akr
catcggcctcggcacctgggccatcggcggctggatgtggggcggcgctg




Pseudomonas

acgacgcgacgtcggtggaaaccatccggcgtgcggtggaatccgggatc




aeruginosa

aacctgatcgacaccgcgccggtctatggcttcggccattccgaagaggt



PAO1)
cgtcggcaaggccttgcagggcctgcgcgacaaggcggtgatcgccacca




aggcggcgctggagtggagcgacgcgggcatccaccgcaacgcctccgcc




gcacgcatccgccgggaggtcgaggactcgctgcggcggctgaagaccga




tcgtatcgacctgtaccagattcactggccggacccgctggtggcgcacg




aggaaaccgccggcgaactggagcgcctgcgccgcgacggcaagatcctc




gccatcggcgtgagcaactattcgccggaacagatggacgggttccgcca




gttcgctccgctggccagcgtgcagccaccctacaacctgttcgagcgcg




ccatcgacgccgacgtgctgccctacgccgagcgtaacggcatcgtcgtg




ctggcctacggagcgctgtgccgcggcctgctttccggacggatgaacgc




cgagacccgcttcgatggcgacgacctgcgcaagtccgacccgaagttcc




agcagccccgcttcgcccagtacctggcagcggtcgcgcaactggaggaa




ctggctcgcgagcgctatggcaagtcggtgctggccctggccatccgctg




gattctcgatcgcggccctacggtggcgctgtggggggcgcgcaagccgg




agcagttgaacggcatcgccgacgccttcggctggcgcctggacgacgag




gccatggcccgcatcgagcggatcctcgccgagaccatccaggatccggt




cggtcccgagttcatggcgccgcccagccgcaacgcctaa






PDC
atgagttatactgtcggtacctatttagcggagcggcttgtccagattgg
75



Z. mobilis

tctcaagcatcacttcgcagtcgcgggcgactacaacctcgtccttcttg



ZM4 = 
acaacctgcttttgaacaaaaacatggagcaggtttattgctgtaacgaa



ATCC 31821
ctgaactgcggtttcagtgcagaaggttatgctcgtgccaaaggcgcagc




agcagccgtcgttacctacagcgtcggtgcgctttccgcatttgatgcta




tcggtggcgcctatgcagaaaaccttccggttatcctgatctccggtgct




ccgaacaacaatgatcacgctgctggtcacgtgttgcatcacgctcttgg




caaaaccgactatcactatcagttggaaatggccaagaacatcacggccg




ccgctgaagcgatttacaccccggaagaagctccggctaaaatcgatcac




gtgattaaaactgctcttcgtgagaagaagccggtttatctcgaaatcgc




ttgcaacattgcttccatgccctgcgccgctcctggaccggcaagcgcat




tgttcaatgacgaagccagcgacgaagcttctttgaatgcagcggttgaa




gaaaccctgaaattcatcgccaaccgcgacaaagttgccgtcctcgtcgg




cagcaagctgcgcgcagctggtgctgaagaagctgctgtcaaatttgctg




atgctctcggtggcgcagttgctaccatggctgctgcaaaaagcttcttc




ccagaagaaaacccgcattacatcggcacctcatggggtgaagtcagcta




tccgggcgttgaaaagacgatgaaagaagccgatgcggttatcgctctgg




ctcctgtcttcaacgactactccaccactggttggacggatattcctgat




cctaagaaactggttctcgctgaaccgcgttctgtcgtcgttaacggcat




tcgcttccccagcgtccatctgaaagactatctgacccgtttggctcaga




aagtttccaagaaaaccggtgcattggacttcttcaaatccctcaatgca




ggtgaactgaagaaagccgctccggctgatccgagtgctccgttggtcaa




cgcagaaatcgcccgtcaggtcgaagctcttctgaccccgaacacgacgg




ttattgctgaaaccggtgactcttggttcaatgctcagcgcatgaagctc




ccgaacggtgctcgcgttgaatatgaaatgcagtggggtcacattggttg




gtccgttcctgccgccttcggttatgccgtcggtgctccggaacgtcgca




acatcctcatggttggtgatggttccttccagctgacggctcaggaagtc




gctcagatggttcgcctgaaactgccggttatcatcttcttgatcaataa




ctatggttacaccatcgaagttatgatccatgatggtccgtacaacaaca




tcaagaactgggattatgccggtctgatggaagtgttcaacggtaacggt




ggttatgacagcggtgctggtaaaggcctgaaggctaaaaccggtggcga




actggcagaagctatcaaggttgctctggcaaacaccgacggcccaaccc




tgatcgaatgcttcatcggtcgtgaagactgcactgaagaattggtcaaa




tggggtaagcgcgttgctgccgccaacagccgtaagcctgttaacaagct




cctctag






Pyruvate
atgaaaaactggaaaacaagtgcagaatcaatcctgaccaccggcccggt
28


dependent
tgtaccggttatcgtggtaaaaaaactggaacacgcggtgccgatggcaa



aldolase
aagcgttggttgctggtggggtgcgcgttctggaagtgactctgcgtacc



(Eda
gagtgtgcagttgacgctatccgtgctatcgccaaagaagtgcctgaagc




Escherichia

gattgtgggtgccggtacggtgctgaatccacagcagctggcagaagtca




coli K-12

ctgaagcgggtgcacagttcgcaattagcccgggtctgaccgagccgctg



MG1655)
ctgaaagctgctaccgaagggactattcctctgattccggggatcagcac




tgtttccgaactgatgctgggtatggactacggtttgaaagagttcaaat




tcttcccggctgaagctaacggcggcgtgaaagccctgcaggcgatcgcg




ggtccgttctcccaggtccgtttctgcccgacgggtggtatttctccggc




taactaccgtgactacctggcgctgaaaagcgtgctgtgcatcggtggtt




cctggctggttccggcagatgcgctggaagcgggcgattacgaccgcatt




actaagctggcgcgtgaagctgtagaaggcgctaagctgtaa






4-hydroxy-
atgaacggtaaaaaactttatatctcggacgtcacattgcgtgacggtat
29


2-oxovalerate
gcacgccattcgtcatcagtattcgctggaaaacgttcgccagattgcca



aldolase
aagcactggacgatgcccgcgtggattcgattgaagtggcccacggcgac



(MphE
ggtttgcaaggttccagctttaactatggtttcggcgcacatagcgacct




Escherichia

tgaatggattgaagcggcggcggatgtggtgaagcacgccaaaatcgcga




coli K-12

cgttgttgctgccaggaatcggcactattcacgatctgaaaaatgcctgg



MG1655)
caggctggcgcgcgggtggttcgtgtggcaacgcactgtaccgaagctga




tgtttccgcccagcatattcagtatgcccgcgagctcggaatggacaccg




ttggttttctgatgatgagccatatgaccacgccggagaatctcgccaag




caggcaaagctgatggaaggctacggtgcgacctgtatttatgtggtgga




ttctggcggtgcgatgaacatgagcgatatccgtgaccgtttccgcgccc




tgaaagcagagctgaaaccagaaacgcaaactggcatgcacgctcaccat




aacctgagtcttggcgtggcgaactctatcgcggcggtggaagagggctg




cgaccgaatcgacgccagcctcgcgggaatgggcgcgggcgcaggtaacg




caccgctggaagtgtttattgccgccgcggataaactgggctggcagcat




gggaccgatctctatgcgttaatggatgccgccgacgacctggtgcgtcc




gttgcaggatcgaccggtacgagtcgatcgcgaaacgctggcgctgggat




acgctggtgtttactcgagcttcctgcgtcactgtgaaacggcggcggcg




cgttatggcttaagtgcggtggatattctcgttgagctgggcaaacgccg




gatggttggcggccaggaggatatgatcgttgacgtggcgctggatctgc




gcaacaacaaataa






4-hydroxy-
atgaagctagaaggcaaaaaagtcaccgtccacgacatgacgctgcgcga
80


2-oxovalerate
cggcatgcaccccaagcgccaccagatgacgctggagcaaatgaaatcca



aldolase
tcgcctgcggcctggatgccgcgggcatcccgctgatcgaagtcacccac



(Paraburkholderia
ggcgacggcctgggcggctcctccgtcaactacggctttccggcgcacag




xenovorans

cgacgaggaatacctgggcgccgtgattccgttgatgaagcaggccaagg



LB400)
tcagtgccctgctcttgcccggcatcggcaccgtcgaacacctgaagatg




gccaaggacctgggcgtgaacaccatccgcgtggccacccactgcaccga




agccgatgtgtcggagcagcacatcacccaatcgcgcaagctgggtctgg




acaccgtgggctttttgatgatggcgcacatggccagcccagaaaagctg




gtcagccaggcactcttgatgcaaggctacggcgccaactgcatctacgt




caccgactcggccggctacatgctgcctgacgacgtgaaagcgcgcctga




gtgccgtgcgtgccgcgctcaaacccgaaaccgagttgggctttcacggc




catcacaacctggccatgggcgtcgccaactcgatcgccgcgatcgaggc




cggggccacccgcatcgatgccgctgccgctggcctgggtgccggcgccg




gcaacacgccgatggaggtgttcatcgccgtatgcgcgcgcatggggatc




gaaaccggagtcgatgtgttcaagatccaggacgtggccgaagacctggt




ggtgccgatcatggaccatgtcatccgcatcgaccgcgactc






4-hydroxy-
atggaaaacagttttaaagcggcgctgaaagcaggccgcccgcagatcgg
81


2-oxoheptane
attatggctggggctgagtagcagttacagcgcagagttactggccggag



dioate
caggattcgactggttgttgatcgacggtgaacacgcgccgaataacgtg



aldolase
caaaccgtgctcacccagctacaggcgattgcgccctaccccagtcagcc



(HpcH/HpaI
agtggtacgcccatcgtggaacgatccggtgcaaatcaaacaactgctgg




Escherichia

acgtcggcacacaaaccttgctggtgccgatggtacaaaacgctgacgaa




coli

gcccgtgaagcggtacgcgccacccgttatccccccgccggtattcgcgg



ATCC8739)
tgtgggcagtgcgctggcccgcgcctcgcgctggaatcgtattcctgatt




acctgcaaaaagccaacgatcaaatgtgcgtgctggtgcagatcgaaacg




cgtgaggcaatgaagaacttaccgcagattctggacgtggaaggcgtcga




cggcgtgtttatcggcccggcggatctgagcgccgatatgggttatgccg




gtaatccgcagcacccggaagtacaggccgccattgagcaggcgatcgtg




cagatccgtgaatcgggcaaagcgccggggatcctgatcgccaatgagca




actggcaaaacgctatctggaactgggcgcgctgtttgtcgccgtcggcg




ttgacaccaccctgctcgcccgcgccgctgaagcgctggcagcacggttt




ggtgcgcaggccaccgccgtgaagcccggcgtgtattaa






Threonine
atggctgactcgcaacccctgtccggtgctccggaaggtgccgaatattt
50


dehydratase
aagagcagtgctgcgcgcgccggtttacgaggcggcgcaggttacgccgc



ilvA
tacaaaaaatggaaaaactgtcgtcgcgtcttgataacgtcattctggtg



(Escherichia
aagcgcgaagatcgccagccagtgcacagctttaagctgcgcggcgcata




coli K-12

cgccatgatggcgggcctgacggaagaacagaaagcgcacggcgtgatca



MG1655)
ctgcttctgcgggtaaccacgcgcagggcgtcgcgttttcttctgcgcgg




ttaggcgtgaaggccctgatcgttatgccaaccgccaccgccgacatcaa




agtcgacgcggtgcgcggcttcggcggcgaagtgctgctccacggcgcga




actttgatgaagcgaaagccaaagcgatcgaactgtcacagcagcagggg




ttcacctgggtgccgccgttcgaccatccgatggtgattgccgggcaagg




cacgctggcgctggaactgctccagcaggacgcccatctcgaccgcgtat




ttgtgccagtcggcggcggcggtctggctgctggcgtggcggtgctgatc




aaacaactgatgccgcaaatcaaagtgatcgccgtagaagcggaagactc




cgcctgcctgaaagcagcgctggatgcgggtcatccggttgatctgccgc




gcgtagggctatttgctgaaggcgtagcggtaaaacgcatcggtgacgaa




accttccgtttatgccaggagtatctcgacgacatcatcaccgtcgatag




cgatgcgatctgtgcggcgatgaaggatttattcgaagatgtgcgcgcgg




tggcggaaccctctggcgcgctggcgctggcgggaatgaaaaaatatatc




gccctgcacaacattcgcggcgaacggctggcgcatattctttccggtgc




caacgtgaacttccacggcctgcgctacgtctcagaacgctgcgaactgg




gcgaacagcgtgaagcgttgttggcggtgaccattccggaagaaaaaggc




agcttcctcaaattctgccaactgcttggcgggcgttcggtcaccgagtt




caactaccgttttgccgatgccaaaaacgcctgcatctttgtcggtgtgc




gcctgagccgcggcctcgaagagcgcaaagaaattttgcagatgctcaac




gacggcggctacagcgtggttgatctctccgacgacgaaatggcgaagct




acacgtgcgctatatggtcggcggacgtccatcgcatccgttgcaggaac




gcctctacagcttcgaattcccggaatcaccgggcgcgctgctgcgcttc




ctcaacacgctgggtacgtactggaacatttctttgttccactatcgcag




ccatggcaccgactacgggcgcgtactggcggcgttcgaacttggcgacc




atgaaccggatttcgaaacccggctgaatgagctgggctacgattgccac




gacgaaaccaataacccggcgttcaggttctttttggcgggttag






Alpha-
atgtatacagtaggagattacctattagaccgattacacgagttaggaat
69


ketoisovalerate
tgaagaaatttttggagtccctggagactataacttacaatttttagatc



decarboxylase
aaattatttcccgcaaggatatgaaatgggtcggaaatgctaatgaatta



(kivD
aatgcttcttatatggctgatggctatgctcgtactaaaaaagctgccgc




Lactococcus

atttcttacaacctttggagtaggtgaattgagtgcagttaatggattag




lactis

caggaagttacgccgaaaatttaccagtagtagaaatagtgggatcacct




lactis

acatcaaaagtccaaaatgaaggaaaatttgttcatcatacgctggctga



KF147)
cggtgattttaaacactttatgaaaatgcacgaacctgttacagcagctc




gaactttactgacagcagaaaatgcaaccgttgaaattgaccgagtactt




tctgcactactaaaagaaagaaaacctgtctatatcaacttaccagttga




tgttgctgctgcaaaagcagagaaaccctcactccctttgaaaaaagaaa




atccaacttcaaatacaagtgaccaagagattttgaataaaattcaagaa




agcttgaaaaatgccaaaaaaccaatcgtgattacaggacatgaaataat




tagctttggcttagaaaatacagtcactcaatttatttcaaagacaaaac




tccctattacgacattaaactttggaaaaagttcagttgatgaaactctc




ccttcatttttaggaatctataatggtaaactctcagagcctaatcttaa




agaattcgtggaatcagccgacttcatcctgatgcttggagttaaactca




cagactcttcaacaggagcatttacccatcatttaaatgaaaataaaatg




atttcactgaacatagacgaaggaaaaatatttaacgaaagcatccaaaa




ttttgattttgaatccctcatctcctctctcttagacctaagcggaatag




aatacaaaggaaaatatatcgataaaaagcaagaagactttgttccatca




aatgcgcttttatcacaagaccgcctatggcaagcagttgaaaacctaac




tcaaagcaatgaaacaatcgttgctgaacaagggacatcattctttggcg




cttcatcaattttcttaaaaccaaagagtcattttattggtcaaccctta




tggggatcaattggatatacattcccagcagcattaggaagccaaattgc




agataaagaaagcagacaccttttatttattggtgatggttcacttcaac




ttacagtgcaagaattaggattagcaatcagagaaaaaattaatccaatt




tgctttattatcaataatgatggttatacagtcgaaagagaaattcatgg




accaaatcaaagctacaatgatattccaatgtggaattactcaaaattac




cagaatcatttggagcaacagaagaacgagtagtctcgaaaatcgttaga




actgaaaatgaatttgtgtctgtcatgaaagaagctcaagcagatccaaa




tagaatgtactggattgagttagttttggcaaaagaagatgcaccaaaag




tactgaaaaaaatgggtaaactatttgctgaacaaaataaatcataa






2-
atgagccagcaagtcattattttcgataccacattgcgcgacggtgaaca
63


isopropylmalate
ggcgttacaggcaagcttgagtgtgaaagaaaaactgcaaattgcgctgg



synthase
cccttgagcgtatgggtgttgacgtgatggaagtcggtttccccgtctct



(leuA
tcgccgggcgattttgaatcggtgcaaaccatcgcccgccaggttaaaaa




Escherichia

cagccgcgtatgtgcgttagctcgctgcgtggaaaaagatatcgacgtgg




coli K-12

cggccgaatccctgaaagtcgccgaagccttccgtattcatacctttatt



MG1655)
gccacttcgccaatgcacatcgccaccaagctgcgcagcacgctggacga




ggtgatcgaacgcgctatctatatggtgaaacgcgcccgtaattacaccg




atgatgttgaattttcttgcgaagatgccgggcgtacacccattgccgat




ctggcgcgagtggtcgaagcggcgattaatgccggtgccaccaccatcaa




cattccggacaccgtgggctacaccatgccgtttgagttcgccggaatca




tcagcggcctgtatgaacgcgtgcctaacatcgacaaagccattatctcc




gtacatacccacgacgatttgggcctggcggtcggaaactcactggcggc




ggtacatgccggtgcacgccaggtggaaggcgcaatgaacgggatcggcg




agcgtgccggaaactgttccctggaagaagtcatcatggcgatcaaagtt




cgtaaggatattctcaacgtccacaccgccattaatcaccaggagatatg




gcgcaccagccagttagttagccagatttgtaatatgccgatcccggcaa




acaaagccattgttggcagcggcgcattcgcacactcctccggtatacac




caggatggcgtgctgaaaaaccgcgaaaactacgaaatcatgacaccaga




atctattggtctgaaccaaatccagctgaatctgacctctcgttcggggc




gtgcggcggtgaaacatcgcatggatgagatggggtataaagaaagtgaa




tataatttagacaatttgtacgatgctttcctgaagctggcggacaaaaa




aggtcaggtgtttgattacgatctggaggcgctggccttcatcggtaagc




agcaagaagagccggagcatttccgtctggattacttcagcgtgcagtct




ggctctaacgatatcgccaccgccgccgtcaaactggcctgtggcgaaga




agtcaaagcagaagccgccaacggtaacggtccggtcgatgccgtctatc




aggcaattaaccgcatcactgaatataacgtcgaactggtgaaatacagc




ctgaccgccaaaggccacggtaaagatgcgctgggtcaggtggatatcgt




cgctaactacaacggtcgccgcttccacggcgtcggcctggctaccgata




ttgtcgagtcatctgccaaagccatggtgcacgttctgaacaatatctgg




cgtgccgcagaagtcgaaaaagagttgcaacgcaaagctcaacacaacga




aaacaacaaggaaaccgtgtga






3-
Atgtcgaagaattaccatattgccgtattgccgggggacggtattggtcc
64


isopropylmalate
ggaagtgatgacccaggcgctgaaagtgctggatgccgtgcgcaaccgct



dehydrogenase
ttgcgatgcgcatcaccaccagccattacgatgtaggcggcgcagccatt



(leuB
gataaccacgggcaaccactgccgcctgcgacggttgaaggttgtgagca




Escherichia

agccgatgccgtgctgtttggctcggtaggcggcccgaagtgggaacatt




coli K-12

taccaccagaccagcaaccagaacgcggcgcgctgctgcctctgcgtaag



MG1655)
cacttcaaattattcagcaacctgcgcccggcaaaactgtatcaggggct




ggaagcattctgtccgctgcgtgcagacattgccgcaaacggcttcgaca




tcctgtgtgtgcgcgaactgaccggcggcatctatttcggtcagccaaaa




ggccgcgaaggtagcggacaatatgaaaaagcctttgataccgaggtgta




tcaccgttttgagatcgaacgtatcgcccgcatcgcgtttgaatctgctc




gcaagcgtcgccacaaagtgacgtcgatcgataaagccaacgtgctgcaa




tcctctattttatggcgggagatcgttaacgagatcgccacggaataccc




ggatgtcgaactggcgcatatgtacatcgacaacgccaccatgcagctga




ttaaagatccatcacagtttgacgttctgctgtgctccaacctgtttggc




gacattctgtctgacgagtgcgcaatgatcactggctcgatggggatgtt




gccttccgccagcctgaacgagcaaggttttggactgtatgaaccggcgg




gcggctcggcaccagatatcgcaggcaaaaacatcgccaacccgattgca




caaatcctttcgctggcactgctgctgcgttacagcctggatgccgatga




tgcggcttgcgccattgaacgcgccattaaccgcgcattagaagaaggca




ttcgcaccggggatttagcccgtggcgctgccgccgttagtaccgatgaa




atgggcgatatcattgcccgctatgtagcagaaggggtgtaa






Isopropylmalate
atggctaagacgttatacgaaaaattgttcgacgctcacgttgtgtacga
65


dehydratase
agccgaaaacgaaaccccactgttatatatcgaccgccacctggtgcatg



(leuC
aagtgacctcaccgcaggcgttcgatggtctgcgcgcccacggtcgcccg




Escherichia

gtacgtcagccgggcaaaaccttcgctaccatggatcacaacgtctctac




coli K-12

ccagaccaaagacattaatgcctgcggtgaaatggcgcgtatccagatgc



Mg1655)
aggaactgatcaaaaactgcaaagaatttggcgtcgaactgtatgacctg




aatcacccgtatcaggggatcgtccacgtaatggggccggaacagggcgt




caccttgccggggatgaccattgtctgcggcgactcgcataccgccaccc




acggcgcgtttggcgcactggcctttggtatcggcacttccgaagttgaa




cacgtactggcaacgcaaaccctgaaacagggccgcgcaaaaaccatgaa




aattgaagtccagggcaaagccgcgccgggcattaccgcaaaagatatcg




tgctggcaattatcggtaaaaccggtagcgcaggcggcaccgggcatgtg




gtggagttttgcggcgaagcaatccgtgatttaagcatggaaggtcgtat




gaccctgtgcaatatggcaatcgaaatgggcgcaaaagccggtctggttg




caccggacgaaaccacctttaactatgtcaaaggccgtctgcatgcgccg




aaaggcaaagatttcgacgacgccgttgcctactggaaaaccctgcaaac




cgacgaaggcgcaactttcgataccgttgtcactctgcaagcagaagaaa




tttcaccgcaggtcacctggggcaccaatcccggccaggtgatttccgtg




aacgacaatattcccgatccggcttcgtttgccgatccggttgaacgcgc




gtcggcagaaaaagcgctggcctatatggggctgaaaccgggtattccgc




tgaccgaagtggctatcgacaaagtgtttatcggttcctgtaccaactcg




cgcattgaagatttacgcgcggcagcggagatcgccaaagggcgaaaagt




cgcgccaggcgtgcaggcactggtggttcccggctctggcccggtaaaag




cccaggcggaagcggaaggtctggataaaatctttattgaagccggtttt




gaatggcgcttgcctggctgctcaatgtgtctggcgatgaacaacgaccg




tctgaatccgggcgaacgttgtgcctccaccagcaaccgtaactttgaag




gccgccaggggcgcggcgggcgcacgcatctggtcagcccggcaatggct




gccgctgctgctgtgaccggacatttcgccgacattcgcaacattaaata




a






Isopropylmalate
Atggcagagaaatttatcaaacacacaggcctggtggttccgctggatgc
82


dehydratase
cgccaatgtcgataccgatgcaatcatcccgaaacagtttttgcagaaag



(leuD
tgacccgtacgggttttggcgcgcatctgtttaacgactggcgttttctg




Escherichia

gatgaaaaaggccaacagccaaacccggacttcgtgctgaacttcccgca




coli K-12

gtatcagggcgcttccattttgctggcacgagaaaacttcggctgtggct



Mg1655)
cttcgcgtgagcacgcgccctgggcattgaccgactacggttttaaagtg




gtgattgcgccgagttttgctgacatcttctacggcaatagctttaacaa




ccagctgctgccggtgaaattaagcgatgcagaagtggacgaactgtttg




cgctggtgaaagctaatccggggatccatttcgacgtggatctggaagcg




caagaggtgaaagcgggagagaaaacctatcgctttaccatcgatgcctt




ccgccgccactgcatgatgaacggtctggacagtattgggcttaccttgc




agcacgacgacgccattgccgcttatgaagcaaaacaacctgcgtttatg




aattaa






aspartate
atgcgagtgttgaagttcggcggtacatcagtggcaaatgcagaacgttt
44


kinase/
tctgcgtgttgccgatattctggaaagcaatgccaggcaggggcaggtgg



homoserine
ccaccgtcctctctgcccccgccaaaatcaccaaccacctggtggcgatg



dehydrogenase
attgaaaaaaccattagcggccaggatgctttacccaatatcagcgatgc



(thrA
cgaacgtatttttgccgaacttttgacgggactcgccgccgcccagccgg




Eschrichia

ggttcccgctggcgcaattgaaaactttcgtcgatcaggaatttgcccaa




coli

ataaaacatgtcctgcatggcattagtttgttggggcagtgcccggatag



MG1655)
catcaacgctgcgctgatttgccgtggcgagaaaatgtcgatcgccatta




tggccggcgtattagaagcgcgcggtcacaacgttactgttatcgatccg




gtcgaaaaactgctggcagtggggcattacctcgaatctaccgtcgatat




tgctgagtccacccgccgtattgcggcaagccgcattccggctgatcaca




tggtgctgatggcaggtttcaccgccggtaatgaaaaaggcgaactggtg




gtgcttggacgcaacggttccgactactctgctgcggtgctggctgcctg




tttacgcgccgattgttgcgagatttggacggacgttgacggggtctata




cctgcgacccgcgtcaggtgcccgatgcgaggttgttgaagtcgatgtcc




taccaggaagcgatggagctttcctacttcggcgctaaagttcttcaccc




ccgcaccattacccccatcgcccagttccagatcccttgcctgattaaaa




ataccggaaatcctcaagcaccaggtacgctcattggtgccagccgtgat




gaagacgaattaccggtcaagggcatttccaatctgaataacatggcaat




gttcagcgtttctggtccggggatgaaagggatggtcggcatggcggcgc




gcgtctttgcagcgatgtcacgcgcccgtatttccgtggtgctgattacg




caatcatcttccgaatacagcatcagtttctgcgttccacaaagcgactg




tgtgcgagctgaacgggcaatgcaggaagagttctacctggaactgaaag




aaggcttactggagccgctggcagtgacggaacggctggccattatctcg




gtggtaggtgatggtatgcgcaccttgcgtgggatctcggcgaaattctt




tgccgcactggcccgcgccaatatcaacattgtcgccattgctcagggat




cttctgaacgctcaatctctgtcgtggtaaataacgatgatgcgaccact




ggcgtgcgcgttactcatcagatgctgttcaataccgatcaggttatcga




agtgtttgtgattggcgtcggtggcgttggcggtgcgctgctggagcaac




tgaagcgtcagcaaagctggctgaagaataaacatatcgacttacgtgtc




tgcggtgttgccaactcgaaggctctgctcaccaatgtacatggccttaa




tctggaaaactggcaggaagaactggcgcaagccaaagagccgtttaatc




tcgggcgcttaattcgcctcgtgaaagaatatcatctgctgaacccggtc




attgttgactgcacttccagccaggcagtggcggatcaatatgccgactt




cctgcgcgaaggtttccacgttgtcacgccgaacaaaaaggccaacacct




cgtcgatggattactaccatcagttgcgttatgcggcggaaaaatcgcgg




cgtaaattcctctatgacaccaacgttggggctggattaccggttattga




gaacctgcaaaatctgctcaatgcaggtgatgaattgatgaagttctccg




gcattctttctggttcgctttcttatatcttcggcaagttagacgaaggc




atgagtttctccgaggcgaccacgctggcgcgggaaatgggttataccga




accggacccgcgagatgatctttctggtatggatgtggcgcgtaaactat




tgattctcgctcgtgaaacgggacgtgaactggagctggcggatattgaa




attgaacctgtgctgcccgcagagtttaacgccgagggtgatgttgccgc




ttttatggcgaatctgtcacaactcgacgatctctttgccgcgcgcgtgg




cgaaggcccgtgatgaaggaaaagttttgcgctatgttggcaatattgat




gaagatggcgtctgccgcgtgaagattgccgaagtggatggtaatgatcc




gctgttcaaagtgaaaaatggcgaaaacgccctggccttctatagccact




attatcagccgctgccgttggtactgcgcggatatggtgcgggcaatgac




gttacagctgccggtgtctttgctgatctgctacgtaccctctcatggaa




gttaggagtctga






homoserine
atggttaaagtttatgccccggcttccagtgccaatatgagcgtcgggtt
45


kinase
tgatgtgctcggggcggcggtgacacctgttgatggtgcattgctcggag



(thrB
atgtagtcacggttgaggcggcagagacattcagtctcaacaacctcgga




Escherichia

cgctttgccgataagctgccgtcagaaccacgggaaaatatcgtttatca




coli

gtgctgggagcgtttttgccaggaactgggtaagcaaattccagtggcga



MG1655)
tgaccctggaaaagaatatgccgatcggttcgggcttaggctccagtgcc




tgttcggtggtcgcggcgctgatggcgatgaatgaacactgcggcaagcc




gcttaatgacactcgtttgctggctttgatgggcgagctggaaggccgta




tctccggcagcattcattacgacaacgtggcaccgtgttttctcggtggt




atgcagttgatgatcgaagaaaacgacatcatcagccagcaagtgccagg




gtttgatgagtggctgtgggtgctggcgtatccggggattaaagtctcga




cggcagaagccagggctattttaccggcgcagtatcgccgccaggattgc




attgcgcacgggcgacatctggcaggcttcattcacgcctgctattcccg




tcagcctgagcttgccgcgaagctgatgaaagatgttatcgctgaaccct




accgtgaacggttactgccaggcttccggcaggcgcggcaggcggtcgcg




gaaatcggcgcggtagcgagcggtatctccggctccggcccgaccttgtt




cgctctgtgtgacaagccggaaaccgcccagcgcgttgccgactggttgg




gtaagaactacctgcaaaatcaggaaggttttgttcatatttgccggctg




gatacggcgggcgcacgagtactggaaaactaa






threonine
atgaaactctacaatctgaaagatcacaacgagcaggtcagctttgcgca
46


synthase
agccgtaacccaggggttgggcaaaaatcaggggctgttttttccgcacg



(thrC
acctgccggaattcagcctgactgaaattgatgagatgctgaagctggat




Escherichia

tttgtcacccgcagtgcgaagatcctctcggcgtttattggtgatgaaat




coli

cccacaggaaatcctggaagagcgcgtgcgcgcggcgtttgccttcccgg



MG1655)
ctccggtcgccaatgttgaaagcgatgtcggttgtctggaattgttccac




gggccaacgctggcatttaaagatttcggcggtcgctttatggcacaaat




gctgacccatattgcgggtgataagccagtgaccattctgaccgcgacct




ccggtgataccggagcggcagtggctcatgctttctacggtttaccgaat




gtgaaagtggttatcctctatccacgaggcaaaatcagtccactgcaaga




aaaactgttctgtacattgggcggcaatatcgaaactgttgccatcgacg




gcgatttcgatgcctgtcaggcgctggtgaagcaggcgtttgatgatgaa




gaactgaaagtggcgctagggttaaactcggctaactcgattaacatcag




ccgtttgctggcgcagatttgctactactttgaagctgttgcgcagctgc




cgcaggagacgcgcaaccagctggttgtctcggtgccaagcggaaacttc




ggcgatttgacggcgggtctgctggcgaagtcactcggtctgccggtgaa




acgttttattgctgcgaccaacgtgaacgataccgtgccacgtttcctgc




acgacggtcagtggtcacccaaagcgactcaggcgacgttatccaacgcg




atggacgtgagtcagccgaacaactggccgcgtgtggaagagttgttccg




ccgcaaaatctggcaactgaaagagctgggttatgcagccgtggatgatg




aaaccacgcaacagacaatgcgtgagttaaaagaactgggctacacttcg




gagccgcacgctgccgtagcttatcgtgcgctgcgtgatcagttgaatcc




aggcgaatatggcttgttcctcggcaccgcgcatccggcgaaatttaaag




agagcgtggaagcgattctcggtgaaacgttggatctgccaaaagagctg




gcagaacgtgctgatttacccttgctttcacataatctgcccgccgattt




tgctgcgttgcgtaaattgatgatgaatcatcagtaa






glucose-6-
atggcggtaacgcaaacagcccaggcctgtgacctggtcattttcggcgc
24


phosphate
gaaaggcgaccttgcgcgtcgtaaattgctgccttccctgtatcaactgg



1-
aaaaagccggtcagctcaacccggacacccggattatcggcgtagggcgt



dehydrogenase
gctgactgggataaagcggcatataccaaagttgtccgcgaggcgctcga



(zwf
aactttcatgaaagaaaccattgatgaaggtttatgggacaccctgagtg




Escherichia

cacgtctggatttttgtaatctcgatgtcaatgacactgctgcattcagc




coli K-12

cgtctcggcgcgatgctggatcaaaaaaatcgtatcaccattaactactt



MG1655)
tgccatgccgcccagcacttttggcgcaatttgcaaagggcttggcgagg




caaaactgaatgctaaaccggcacgcgtagtcatggagaaaccgctgggg




acgtcgctggcgacctcgcaggaaatcaatgatcaggttggcgaatactt




cgaggagtgccaggtttaccgtatcgaccactatcttggtaaagaaacgg




tgctgaacctgttggcgctgcgttttgctaactccctgtttgtgaataac




tgggacaatcgcaccattgatcatgttgagattaccgtggcagaagaagt




ggggatcgaagggcgctggggctattttgataaagccggtcagatgcgcg




acatgatccagaaccacctgctgcaaattctttgcatgattgcgatgtct




ccgccgtctgacctgagcgcagacagcatccgcgatgaaaaagtgaaagt




actgaagtctctgcgccgcatcgaccgctccaacgtacgcgaaaaaaccg




tacgcgggcaatatactgcgggcttcgcccagggcaaaaaagtgccggga




tatctggaagaagagggcgcgaacaagagcagcaatacagaaactttcgt




ggcgatccgcgtcgacattgataactggcgctgggccggtgtgccattct




acctgcgtactggtaaacgtctgccgaccaaatgttctgaagtcgtggtc




tatttcaaaacacctgaactgaatctgtttaaagaatcgtggcaggatct




gccgcagaataaactgactatccgtctgcaacctgatgaaggcgtggata




tccaggtactgaataaagttcctggccttgaccacaaacataacctgcaa




atcaccaagctggatctgagctattcagaaacctttaatcagacgcatct




ggcggatgcctatgaacgtttgctgctggaaaccatgcgtggtattcagg




cactgtttgtacgtcgcgacgaagtggaagaagcctggaaatgggtagac




tccattactgaggcgtgggcgatggacaatgatgcgccgaaaccgtatca




ggccggaacctggggacccgttgcctcggtggcgatgattacccgtgatg




gtcgttcctggaatgagtttgagtaa






DERA
atgaacattgcaaagttaattgaccatacaattttaaaagctaatactac
1


(Bacillus
taaagaagatgttatgaaagtaatcgaagaagcaaaggaatataaattcg




cereus

cttctgtttgtattaatcctacatgggtaaagctagctgctgaggaatta



ATCC
gctggacatgatgtagatgtttgtactgtaatcggtttcccattaggcgc



10987;
aagtactactgaaacaaaagctttcgaaacaaaagatgctatcgcaaaag



AE017194.1)
gtgcaactgaagttgacatggtaatcaacgtaggcgctttaaaagatggc




gacgacgaacttgttgaaaaagacatttatgaagtagtacaagcagcaaa




aggaaaagctcttgtaaaagtaatcattgaaacttgcctattaacagatg




aagagaaagtacgcgcttgtgaattatcagtaaaagctggggctgatttc




gtaaaaacttcaactggattctcaactggcggagcaactgctgaagatat




cgcattaatgcgtaaaacagttggaccaaacgttggtgtaaaagcatctg




gtggcgttcgtacacgtgaagatgcagaaaaaatggtagctgctggagct




tctcgcgttggagcaagtgctagtgttgcaatcgtattaaatgatgcaaa




aggtgctacagataactactaa






DERA
atgactgatctgaaagcaagcagcctgcgtgcactgaaattgatggacct
2


(Escherichia
gaccaccctgaatgacgacgacaccgacgagaaagtgatcgccctgtgtc




coli K12;

atcaggccaaaactccggtcggcaataccgccgctatctgtatctatcct



CP009685)
cgctttatcccgattgctcgcaaaactctgaaagagcagggcaccccgga




aatccgtatcgctacggtaaccaacttcccacacggtaacgacgacatcg




acatcgcgctggcagaaacccgtgcggcaatcgcctacggtgctgatgaa




gttgacgttgtgttcccgtaccgcgcgctgatggcgggtaacgagcaggt




tggttttgacctggtgaaagcctgtaaagaggcttgcgcggcagcgaatg




tactgctgaaagtgatcatcgaaaccggcgaactgaaagacgaagcgctg




atccgtaaagcgtctgaaatctccatcaaagcgggtgcggacttcatcaa




aacctctaccggtaaagtggctgtgaacgcgacgccggaaagcgcgcgca




tcatgatggaagtgatccgtgatatgggcgtagaaaaaaccgttggtttc




aaaccggcgggcggcgtgcgtactgcggaagatgcgcagaaatatctcgc




cattgcagatgaactgttcggtgctgactgggcagatgcgcgtcactacc




gctttggcgcttccagcctgctggcaagcctgctgaaagcgctgggtcac




ggcgacggtaagagcgccagcagctactaa






DERA
atgtcacgttcgattgcacaaatgattgatcatacgctacttaaaccaaa
3


(Bacillus
tacaacagaagaccaaattgtaaagctctgtgaggaagcaaaggaatatt




halodurans

catttgcatctgtttgtgtgaatcctacttgggtcgctcttgctgcgcag



C-125;
ttgctaaaagatgcacctgatgtgaaagtatgtacagttatcggctttcc



BA000004)
gttaggggcaacgactccggaagtgaaagcgtttgaaacgactaatgcca




ttgaaaatggagcgacagaagtggacatggtcattaacattggagcgtta




aaagataaacaatacgagcttgttggacgcgacattcaagcggttgttaa




agcagcagaagggaaagcattaacgaaagtaatcattgaaacatcgttat




taacggaggaagagaagaaggctgcgtgtgagcttgccgtaaaagcagga




gccgactttgtcaaaacgtcgactggattctctggcggaggtgctacggc




tgaggatatcgcgctcatgcgaaaagtggtcggaccaaatttaggagtca




aagcttctggaggtgttagagatctgtccgacgcgaaagcgatgattgat




gctggtgctactcggattggtgcgagtgctggggtggcgattgttaacgg




ggagcgtagcgaagggagttattaa






DERA
atgtcattagccaacataattgatcatacagctttgaaaccgcatacaca
4


(Bacillus
aaaagcggacattctaaaactaattgaagaagcgaaaacatacaaatttg




subtilis

cttcagtatgtgtcaatccgacatgggtggagcttgctgcaaaagagctt



subsp.
aagggaactggagtcgacgtttgtacggtcatcggcttcccgctcggtgc



subtilis str.
caatacaactgaaacaaaagcgttcgaaacaaaagacgccatttcaaaag



168;
gcgccactgaagtggatatggtcattaatattgccgctttaaaagacaag



AL009126.3)
gaagacgatgtggtggaagctgatatccgcggtgtagtggaagctgtagc




cggaaaagcgcttgtcaaagtcattatcgaaacgtgccttctgactgatg




aagaaaaagaacgtgcatgccgtttagcggtgtctgcgggagcggatttc




gtaaaaacatcaacaggcttttctacaggcggcgcaacgaaggaagatat




cgccttaatgcgcaaaacagtagggcctgatatcggcgtgaaagcatctg




gcggcgtcagaacgaaagaagatgtagacacaatggtagaggccggagca




agccgaattggcgccagcgcaggcgtttctatcgtaaaaggagaaaatgc




atcaggcggagacaactattaa






DERA
atgatagagtacaggattgaggaggcagtagcgaagtacagagagttcta
5


(Thermotoga
cgaattcaagcccgtcagagaaagcgcaggtattgaagatgtgaaaagtg




maritima;

ctatagagcacacgaatctgaaaccgtttgccacaccagacgatataaaa



CP011108)
aaactctgtcttgaagcaagggaaaatcgtttccatggagtctgtgtgaa




tccgtgttatgtgaaactggctcgtgaagaactcgaaggaaccgatgtga




aagtcgtcaccgttgttggttttccactgggagcgaacgaaactcggacg




aaagcccatgaggcgattttcgctgttgagagtggagccgatgagatcga




tatggtcatcaacgttggcatgctcaaggcaaaggagtgggagtacgttt




acgaggatataagaagtgttgtcgaatcggtgaaaggaaaagttgtgaag




gtgatcatcgaaacgtgctatctggatacggaagagaagatagcggcgtg




tgtcatttccaaacttgctggagctcatttcgtgaagacttccacgggat




ttggaacaggaggggcgaccgcagaagacgttcatctcatgaaatggatc




gtgggagatgagatgggtgtaaaagcttccggagggatcagaaccttcga




ggacgctgttaaaatgatcatgtacggtgctgatagaataggaacgagtt




cgggagttaagatcgttcaggggggagaagagagatatggaggttaa






DERA
gtggttaaaatgaatgtggagacaagggaggaacttgcatcacttataga
6


(Methano-
ccacaccaatgtgagggctgatgcaacagaaaatgatattgagaggctat




thermobacter

gcagggaggcggtcagctacggcttcaggtgcgcggtggtcacacccacc




thermau-

aatgtcaggctggcggctgaactccttgaggggaccgatgtgacggtctg




trophicus;

ctcagttgttggtttcccggcaggcgtcagtacaccccgcgttaaggccc



AP011952)
ttgaagcctctgaggccgttgagaacggggccggtgaggtggacatggtc




atgaatatcggggccatgaagtcaggcaatagggagctcgtatacaggga




tatcagcggcgttgttgatgccgccggcgtccccgtcaaggttatacttg




aaacagcctatctcacagacaaggagaaggttgaagcctgccttataagt




aaagaggccggtgcggcatttgttaaaacatcaacagcctatggtggact




agccggcgccacagttgaggatgtgatgctcatgcggaaaacggtgggtg




atgagatgggagtcaaggcatctgggggaataagggatcttgaaacagcc




cttgcgatgatagatgctggggcagacaggatcgggacatcaaccggtgt




acagataatcgagggatggaggtaa






DERA
atgtcactcgcctcctacatcgaccacacgctgcttaaggccaccgccac
7


(Deinococcus
gctcgccgacatccgcacgctgtgtgaggaagcccgcgagcactcgttct




radiodurans;

acgcggtgtgcatcaacccggtctttattccccacgcccgcgcctggctc



AE000513)
gaaggcagcgacgtgaaggtcgccaccgtctgcggctttcccctcggcgc




catcagctccgagcagaaagctctggaagcccgcctgagcgccgaaacgg




gcgccgacgaaatcgatatggtcatccacatcggctcggcgcttgccggc




gactgggacgcggtggaagccgacgtgcgggcagtgcgccgcgcggtgcc




cgagcaggtgctcaaggtgattatcgaaacctgctacctgaccgacgagc




aaaagcgcttggcgactgaggtcgccgtacagggcggcgccgacttcgtg




aagacgagcacaggcttcggcaccggcggcgccaccgtggacgacgtgcg




cctgatggcggaagtgatcgggggccgcgccggactcaaggcggcgggcg




gcgtccgcactcctgccgacgcgcaagccatgatcgaggcgggcgcgacc




cggctgggcacctcgggcggcgtgggtctggtgtcgggcggcgaaaacgg




agccggctactaa






DERA
atgaatagtgcaaaattgattgatcacactttattgaagcctgagtcaac
8


(Staphylococcus
acgtacgcaaatcgatcaaatcatcgatgaagcgaaagcataccatttta




aureus

aatctgtatgtgtgaatccaacgcatgttaaatatgcagcagagcgacta



subsp.
gctgattcagaggtgttagtttgtacggtaataggattcccattaggtgc



aureus
atcgacaactgcgacgaaagcatttgaaacagaagatgcgattcaaaatg



Mu50;
gtgcagatgaaattgacatggtcatcaacatcggcgcattaaaagatgga



BA000017)
cgttttgatgatgtacaacaagacattgaagcagtggtgaaagctgcgaa




aggtcacacagtaaaagtgattattgagacggtattgttggaccatgacg




aaatcgtaaaagcgagtgaattaacaaaagtggctggtgcggacttcgtt




aaaacttcaacaggttttgcaggtggcggtgcgactgcagaagacgttaa




attaatgaaagatacagtaggtgctgatgtagaagtaaaagcatcaggtg




gcgtacgtaatttagaagatttcaataaaatggttgaagcaggtgcgaca




cgtattggtgcgagcgcaggcgttcaaattatgcaaggtttagaagcaga




ttcagattactaa






DERA
atgacaattgctaaaatgatcgatcatactgctttaaaaccagacacaac
9


(Listeria
gaaagaacaaattttaaccctaacaaaagaagcaagagaatacggctttg




monocytogenes

catccgtatgtgtgaacccaacttgggtaaaactatccgctgaacaactt



EGD-e;
gctggagcagaatccgtagtatgtactgttatcggtttcccactaggagc



HG421741)
gaatacccctgaagtaaaagcatttgaagtgaaagatgccatccaaaacg




gcgcgaaagaagtcgatatggttatcaatatcggtgcacttaaagacaag




gacgacgaattagtagaacgcgatattcgcgctgttgtcgatgttgctaa




aggcaaagcattagtaaaagtaattatcgaaacttgcctattaacagacg




aagaaaaagtgcgcgcatgcgaaatcgctgtaaaagcaggaacagacttc




gttaaaacatctacaggattttcaacaggtggcgcaactgccgaagatat




cgccttgatgcgtaaaacagttggaccgaacatcggtgtaaaagcatctg




gtggggttcgtacgaaagaagacgtagaaaaaatgatcgaagcaggcgca




actcgtatcggcgcaagtgcaggcgttgcaattgtttccggcgaaaaacc




agctaaacctgataattactaa






DERA
atgaaattaaataaatatatagatcatacgcttttaaaacaagatgcaaa
10


(Streptococcus
gaaaaaacaaattgatagtttgttgtctgaggctagagagtatgactttg




pneumoniae

ccagtgtttgcgttaatccgacctgggttgaacatgctaaaaaaggactt



TIGR4;
gaaggcacagatgttaaggtttgcacagtagtaggtttccctttgggagc



AE005672)
aacaacttcagccgtgaaagcatttgagacaaaagaagctatccaaaatg




gtgcagatgagattgatatggtgatcaatgttggagctctcaaatcaggt




aatttagccttggttgagtcagatattcgcgcagtagtggaagcaagtgg




tgataagttagtgaaagtcattattgaagcttgccttctgacagaccaag




aaaaagttgttgtttgccaattggcccaaaaagctggggctgactttgtc




aaaacatctactggcttttcaactggtggtgctacgatagcagatgttac




attaatgcgtgaaacagttggatctgatatgggtgtcaaggccgccggtg




gagctcgttcttatgcagatgctcttgcctttgtcgaagcaggtgcgacc




cgtatcggaacgtcagctggggtagctattttaaaaggagaattggcaga




tggcgactactaa






BFD
atggcttcggtacacggcaccacatacgaactcttgcgacgtcaaggcat
78


(Pseudomonas
cgatacggtcttcggcaatcctggctcgaacgagctcccgtttttgaagg




putida;

actttccagaggactttcgatacatcctggctttgcaggaagcgtgtgtg



Ap013070.1)
gtgggcattgcagacggctatgcgcaagccagtcggaagccggctttcat




taacctgcattctgctgctggtaccggcaatgctatgggtgcactcagta




acgcctggaactcacattccccgctgatcgtcactgccggccagcagacc




agggcgatgattggcgttgaagctctgctgaccaacgtcgatgccgccaa




cctgccacgaccacttgtcaaatggagctacgagcccgcaagcgcagcag




aagtccctcatgcgatgagcagggctatccatatggcaagcatggcgcca




caaggccctgtctatctttcggtgccatatgacgattgggataaggatgc




tgatcctcagtcccaccacctttttgatcgccatgtcagttcatcagtac




gcctgaacgaccaggatctcgatattctggtgaaagctctcaacagcgca




tccaacccggcgatcgtcctgggcccggacgtcgacgcagcaaatgcgaa




cgcagactgcgtcatgttggccgaacgcctcaaagctccggtttgggttg




cgccatccgctccacgctgcccattccctacccgtcatccttgcttccgt




ggattgatgccagctggcatcgcagcgatttctcagctgctcgaaggtca




cgatgtggttttggtaatcggcgctccagtgttccgttaccaccaatacg




acccaggtcaatatctcaaacctggcacgcgattgatttcggtgacctgc




gacccgctcgaagctgcacgcgcgccaatgggcgatgcgatcgtggcaga




cattggtgcgatggctagcgctcttgccaacttggttgaagagagcagcc




gccagctcccaactgcagctccggaacccgcgaaggttgaccaagacgct




ggccgacttcacccagagacagtgttcgacacactgaacgacatggcccc




ggagaatgcgatttacctgaacgagtcgacttcaacgaccgcccaaatgt




ggcagcgcctgaacatgcgcaaccctggtagctactacttctgtgcagct




ggcggactgggcttcgccctgcctgcagcaattggcgttcaactcgcaga




acccgagcgacaagtcatcgccgtcattggcgacggatcggcgaactaca




gcattagtgcgttgtggactgcagctcagtacaacatccccactatcttc




gtgatcatgaacaacggcacctacggtgcgttgcgatggtttgccggcgt




tctcgaagcagaaaacgttcctgggctggatgtgccagggatcgacttcc




gcgcactcgccaagggctatggtgtccaagcgctgaaagccgacaacctt




gagcagctcaagggttcgctacaagaagcgctttctgccaaaggcccggt




acttatcgaagtaagcaccgtaagcccggtgaagtga






KDC
atgtatacagtaggagattacctgttagaccgattacacgagttgggaat
79


(Lactococcus
tgaagaaatttttggagttcctggtgactataacttacaatttttagatc




lactis;

aaattatttcacgcgaagatatgaaatggattggaaatgctaatgaatta



AY548760.1)
aatgcttcttatatggctgatggttatgctcgtactaaaaaagctgccgc




atttctcaccacatttggagtcggcgaattgagtgcgatcaatggactgg




caggaagttatgccgaaaatttaccagtagtagaaattgttggttcacca




acttcaaaagtacaaaatgacggaaaatttgtccatcatacactagcaga




tggtgattttaaacactttatgaagatgcatgaacctgttacagcagcgc




ggactttactgacagcagaaaatgccacatatgaaattgaccgagtactt




tctcaattactaaaagaaagaaaaccagtctatattaacttaccagtcga




tgttgctgcagcaaaagcagagaagcctgcattatctttagaaaaagaaa




gctctacaacaaatacaactgaacaagtgattttgagtaagattgaagaa




agtttgaaaaatgcccaaaaaccagtagtgattgcaggacacgaagtaat




tagttttggtttagaaaaaacggtaactcagtttgtttcagaaacaaaac




taccgattacgacactaaattttggtaaaagtgctgttgatgaatctttg




ccctcatttttaggaatatataacgggaaactttcagaaatcagtcttaa




aaattttgtggagtccgcagactttatcctaatgcttggagtgaagctta




cggactcctcaacaggtgcattcacacatcatttagatgaaaataaaatg




atttcactaaacatagatgaaggaataattttcaataaagtggtagaaga




ttttgattttagagcagtggtttcttctttatcagaattaaaaggaatag




aatatgaaggacaatatattgataagcaatatgaagaatttattccatca




agtgctcccttatcacaagaccgtctatggcaggcagttgaaagtttgac




tcaaagcaatgaaacaatcgttgctgaacaaggaacctcattttttggag




cttcaacaattttcttaaaatcaaatagtcgttttattggacaaccttta




tggggttctattggatatacttttccagcggctttaggaagccaaattgc




ggataaagagagcagacaccttttatttattggtgatggttcacttcaac




ttaccgtacaagaattaggactatcaatcagagaaaaactcaatccaatt




tgttttatcataaataatgatggttatacagttgaaagagaaatccacgg




acctactcaaagttataacgacattccaatgtggaattactcgaaattac




cagaaacatttggagcaacagaagatcgtgtagtatcaaaaattgttaga




acagagaatgaatttgtgtctgtcatgaaagaagcccaagcagatgtcaa




tagaatgtattggatagaactagttttggaaaaagaagatgcgccaaaat




tactgaaaaaaatgggtaaattatttgctgagcaaaataaatag






KDC
atgaccgatgacggctacacagtcggtgactacctcctggaccggctcgc
70


(Mycobacterium
cgaactcggcgtgaccgaggtcttcggcgtccccggcgactatcagctgg




smegmatis

agttcctcgaccacgtcgtggcgcacccgcgcatcacgtgggtcggcggc



(strain
gcgaacgaactcaacgcgggctacgcggccgacggctacggccggctgcg



ATCC
gggcatggcggcgttggtcaccacgttcggcgtcggtgagctctcggcgg



700084/
ccaacgccatcgcaggcagctacgccgagcacgtcccggtggtgcacatc



mc(2)155); 
gtcggcgcgccgtcgaaggattcgcaggccgcgcggcgcatcgtgcacca



A0R480)
cacgctgggtgacggcgatttcgagcacttcctgcgcatgagccgcgaga




tcacctgcgcgcaggccaatctggtgcccgccacggcgacccgcgagatc




gaccgcgtgctctcggaggtgcacgagcagaagcggcccggctatctgct




gatcgcgaccgacgtcgcccgcttccccaccgaaccaccgaccgccccgc




tgccgcgccacagcgggggcaccagcccacgggcgctgtcgctgttcacc




gaggccgcaacgcaactcatcggcgagcaccggttgaccgtgctggccga




tttcctggtgcaccgcatgggatgcgtcgaggcgctcaacaagctgctga




ccgccgacaccgtgccgcacgccacgctgatgtggggcaagagcctggtc




gacgagagttcgccgaacttcctgggcatctacgccggtgcggccagtga




gggctcggtgcgcgaggtgatcgaggacgcgcctgtgctggtgaccgcag




gtgtgctgttcaccgacatggtcagcgggttcttcagccagcgcatcgac




ccggcacgcacgatcgacatcggggtcaaccagagcgtcgtcgccgggca




ggtgttcgcgccgctggacatggctgccgcgctcgacgcgctcgcgtcga




tcctcgccgaacgcggcatcgagtcgcccgcactgccgccggcccccgca




ccgcagcggcccgcagcgccgccgcgtgacgcggtgcttacacaggaagc




gttgtgggacaggctcgccgaggcgttgacgccgggcaacgtggtgctcg




ccgaccagggcacgtctttctacggcctggccgggcaccggctggcgtcg




ggcgtgacattcatcggtcagccgttgtgggcgtcgatcggctacacact




gcccgccgcgctcggcgcgggtctggccgaccgcgaccggcgcacggtgc




tgctgatcggcgacggcgcagcgcagttgacggtgcaggaactcggcgcg




ttcggccgcgaggggctcacgcccgtggtggtggtcgtcaacaacaacgg




ctacaccgtggagcgcgcgatccacggtgtcacggcccgctacaacgaca




tcacggcgtggcggtggaccgagttgccggccgcactcggtgtgcccgat




gcgctgacgttccgctgcgccacctacggcgaactcgacgacgcgctgac




cgtcgccgccgagacgcaggaccgcatggtgttcgtcgaggtgatgcttg




agcgcatggacattccgccgctgctgggcgagctcgcacagtcggcgtcg




gccgccaacgccaaatag






Protein sequence





PA1127
MSVESIRIEGIDTPVSRIGLGTWAIGGWMWGGADDAT
31


(Akr
SVETIRRAVESGINLIDTAPVYGFGHSEEVVGKALQG




Pseudomonas

LRDKAVIATKAALEWSDAGIHRNASAARIRREVEDSL




aeruginosa 

RRLKTDRIDLYQIHWPDPLVAHEETAGELERLRRDGK



PAO1)
ILAIGVSNYSPEQMDGFRQFAPLASVQPPYNLFERAI




DADVLPYAERNGIVVLAYGALCRGLLSGRMNAETRFD




GDDLRKSDPKFQQPRFAQYLAAVAQLEELARERYGKS




VLALAIRWILDRGPTVALWGARKPEQLNGIADAFGWR




LDDEAMARIERILAETIQDPVGPEFMAPPSRNA






PDC Z.
MSYTVGTYLAERLVQIGLKHHFAVAGDYNLVLLDNLL
71



mobilis

LNKNMEQVYCCNELNCGFSAEGYARAKGAAAAVVTYS



ZM4 =
VGALSAFDAIGGAYAENLPVILISGAPNNNDHAAGHV



ATCC 31821
LHHALGKTDYHYQLEMAKNITAAAEAIYTPEEAPAKI




DHVIKTALREKKPVYLEIACNIASMPCAAPGPASALF




NDEASDEASLNAAVEETLKFIANRDKVAVLVGSKLRA




AGAEEAAVKFADALGGAVATMAAAKSFFPEENPHYIG




TSWGEVSYPGVEKTMKEADAVIALAPVFNDYSTTGWT




DIPDPKKLVLAEPRSVVVNGIRFPSVHLKDYLTRLAQ




KVSKKTGALDFFKSLNAGELKKAAPADPSAPLVNAEI




ARQVEALLTPNTTVIAETGDSWFNAQRMKLPNGARVE




YEMQWGHIGWSVPAAFGYAVGAPERRNILMVGDGSFQ




LTAQEVAQMVRLKLPVIIFLINNYGYTIEVMIHDGPY




NNIKNWDYAGLMEVFNGNGGYDSGAGKGLKAKTGGEL




AEAIKVALANTDGPTLIECFIGREDCTEELVKWGKRV




AAANSRKPVNKLL*






Pyruvate
MKNWKTSAESILTTGPVVPVIVVKKLEHAVPMAKALV
25


dependent 
AGGVRVLEVTLRTECAVDAIRAIAKEVPEAIVGAGTV



aldolase
LNPQQLAEVTEAGAQFAISPGLTEPLLKAATEGTIPL



(Eda
IPGISTVSELMLGMDYGLKEFKFFPAEANGGVKALQA




Escherichia

IAGPFSQVRFCPTGGISPANYRDYLALKSVLCIGGSW




coli K-12

LVPADALEAGDYDRITKLAREAVEGAKL



MG1655)







4-hydroxy-
MNGKKLYISDVTLRDGMHAIRHQYSLENVRQIAKALD
26


2-oxovalerate
DARVDSIEVAHGDGLQGSSFNYGFGAHSDLEWIEAAA



aldolase
DVVKHAKIATLLLPGIGTIHDLKNAWQAGARVVRVAT



(MphE
HCTEADVSAQHIQYARELGMDTVGFLMMSHMTTPENL




Escherichia

AKQAKLMEGYGATCIYVVDSGGAMNMSDIRDRFRALK




coli K-12

AELKPETQTGMHAHHNLSLGVANSIAAVEEGCDRIDA



MG1655)
SLAGMGAGAGNAPLEVFIAAADKLGWQHGTDLYALMD




AADDLVRPLQDRPVRVDRETLALGYAGVYSSFLRHCE




TAAARYGLSAVDILVELGKRRMVGGQEDMIVDVALDL




RNNK






4-hydroxy-
MKLEGKKVTVHDMTLRDGMHPKRHQMTLEQMKSIACG
27


2-oxovalerate
LDAAGIPLIEVTHGDGLGGSSVNYGFPAHSDEEYLGA



aldolase
VIPLMKQAKVSALLLPGIGTVEHLKMAKDLGVNTIRV



(Paraburkholderia
ATHCTEADVSEQHITQSRKLGLDTVGFLMMAHMASPE



xenovorans
KLVSQALLMQGYGANCIYVTDSAGYMLPDDVKARLSA



LB400)
VRAALKPETELGFHGHHNLAMGVANSIAAIEAGATRI




DAAAAGLGAGAGNTPMEVFIAVCARMGIETGVDVFKI




QDVAEDLVVPIMDHVIRIDRDSLTLGYAGVYSSFLLF




AKRASAKYGVPARDILVELGRRGMVGGQEDMIEDTAM




TMARERGLTLTAA






4-hydroxy-
MENSFKAALKAGRPQIGLWLGLSSSYSAELLAGAGFD
83


2-oxoheptane
WLLIDGEHAPNNVQTVLTQLQAIAPYPSQPVVRPSWN



dioate
DPVQIKQLLDVGTQTLLVPMVQNADEAREAVRATRYP



aldolase
PAGIRGVGSALARASRWNRIPDYLQKANDQMCVLVQI



(HpcH/HpaI
ETREAMKNLPQILDVEGVDGVFIGPADLSADMGYAGN




Escherichia 

PQHPEVQAAIEQAIVQIRESGKAPGILIANEQLAKRY




coli

LELGALFVAVGVDTTLLARAAEALAARFGAQATAVKP



ATCC8739)
GVY






Threonine 
MADSQPLSGAPEGAEYLRAVLRAPVYEAAQVTPLQKM
47


dehydratase
EKLSSRLDNVILVKREDRQPVHSFKLRGAYAMMAGLT



ilvA
EEQKAHGVITASAGNHAQGVAFSSARLGVKALIVMPT



(Escherichia
ATADIKVDAVRGFGGEVLLHGANFDEAKAKAIELSQQ




coli K-12

QGFTWVPPFDHPMVIAGQGTLALELLQQDAHLDRVFV



MG1655)
PVGGGGLAAGVAVLIKQLMPQIKVIAVEAEDSACLKA




ALDAGHPVDLPRVGLFAEGVAVKRIGDETFRLCQEYL




DDIITVDSDAICAAMKDLFEDVRAVAEPSGALALAGM




KKYIALHNIRGERLAHILSGANVNFHGLRYVSERCEL




GEQREALLAVTIPEEKGSFLKFCQLLGGRSVTEFNYR




FADAKNACIFVGVRLSRGLEERKEILQMLNDGGYSVV




DLSDDEMAKLHVRYMVGGRPSHPLQERLYSFEFPESP




GALLRFLNTLGTYWNISLFHYRSHGTDYGRVLAAFEL




GDHEPDFETRLNELGYDCHDETNNPAFRFFLAG






Alpha-
MYTVGDYLLDRLHELGIEEIFGVPGDYNLQFLDQIIS
66


ketoisovalerate
RKDMKWVGNANELNASYMADGYARTKKAAAFLTTFGV



decarboxlase
GELSAVNGLAGSYAENLPVVEIVGSPTSKVQNEGKFV



kivD
HHTLADGDFKHFMKMHEPVTAARTLLTAENATVEIDR



(Lactococcus
VLSALLKERKPVYINLPVDVAAAKAEKPSLPLKKENP




lactis

TSNTSDQEILNKIQESLKNAKKPIVITGHEIISFGLE




lactis

NTVTQFISKTKLPITTLNFGKSSVDETLPSFLGIYNG



KF147)
KLSEPNLKEFVESADFILMLGVKLTDSSTGAFTHHLN




ENKMISLNIDEGKIFNESIQNFDFESLISSLLDLSGI




EYKGKYIDKKQEDFVPSNALLSQDRLWQAVENLTQSN




ETIVAEQGTSFFGASSIFLKPKSHFIGQPLWGSIGYT




FPAALGSQIADKESRHLLFIGDGSLQLTVQELGLAIR




EKINPICFIINNDGYTVEREIHGPNQSYNDIPMWNYS




KLPESFGATEERVVSKIVRTENEFVSVMKEAQADPNR




MYWIELVLAKEDAPKVLKKMGKLFAEQNKS






2-
MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALER
51


isopro-
MGVDVMEVGFPVSSPGDFESVQTIARQVKNSRVCALA



pylmalate 
RCVEKDIDVAAESLKVAEAFRIHTFIATSPMHIATKL



synthase
RSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPI



(leuA
ADLARVVEAAINAGATTINIPDTVGYTMPFEFAGIIS




Escherichia

GLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGA




coli K-13

RQVEGAMNGIGERAGNCSLEEVIMAIKVRKDILNVHT



MG1655)
AINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHS




SGIHQDGVLKNRENYEIMTPESIGLNQIQLNLTSRSG




RAAVKHRMDEMGYKESEYNLDNLYDAFLKLADKKGQV




FDYDLEALAFIGKQQEEPEHFRLDYFSVQSGSNDIAT




AAVKLACGEEVKAEAANGNGPVDAVYQAINRITEYNV




ELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGLA




TDIVESSAKAMVHVLNNIWRAAEVEKELQRKAQHNEN




NKETV






3-
MSKNYHIAVLPGDGIGPEVMTQALKVLDAVRNRFAMR
54


isopropylmalate
ITTSHYDVGGAAIDNHGQPLPPATVEGCEQADAVLFG



dehydrogenase
SVGGPKWEHLPPDQQPERGALLPLRKHFKLFSNLRPA



(leuB
KLYQGLEAFCPLRADIAANGFDILCVRELTGGIYFGQ




Escherichia

PKGREGSGQYEKAFDTEVYHRFEIERIARIAFESARK




coli K-12

RRHKVTSIDKANVLQSSILWREIVNEIATEYPDVELA



MG1655)
HMYIDNATMQLIKDPSQFDVLLCSNLFGDILSDECAM




ITGSMGMLPSASLNEQGFGLYEPAGGSAPDIAGKNIA




NPIAQILSLALLLRYSLDADDAACAIERAINRALEEG




IRTGDLARGAAAVSTDEMGDIIARYVAEGV






Isopropylmalate
MAKTLYEKLFDAHVVYEAENETPLLYIDRHLVHEVTS
57


dehydratase
PQAFDGLRAHGRPVRQPGKTFATMDHNVSTQTKDINA



(leuC
CGEMARIQMQELIKNCKEFGVELYDLNHPYQGIVHVM




Escherichia

GPEQGVTLPGMTIVCGDSHTATHGAFGALAFGIGTSE




coli K-12

VEHVLATQTLKQGRAKTMKIEVQGKAAPGITAKDIVL



Mg1655
AIIGKTGSAGGTGHVVEFCGEAIRDLSMEGRMTLCNM




AIEMGAKAGLVAPDETTFNYVKGRLHAPKGKDFDDAV




AYWKTLQTDEGATFDTVVTLQAEEISPQVTWGTNPGQ




VISVNDNIPDPASFADPVERASAEKALAYMGLKPGIP




LTEVAIDKVFIGSCTNSRIEDLRAAAEIAKGRKVAPG




VQALVVPGSGPVKAQAEAEGLDKIFIEAGFEWRLPGC




SMCLAMNNDRLNPGERCASTSNRNFEGRQGRGGRTHL




VSPAMAAAAAVTGHFADIRNIK






Isopropylmalate
MAEKFIKHTGLVVPLDAANVDTDAIIPKQFLQKVTRT
60


dehydratase
GFGAHLFNDWRFLDEKGQQPNPDFVLNFPQYQGASIL



(leuD
LARENFGCGSSREHAPWALTDYGFKVVIAPSFADIFY




Escherichia

GNSFNNQLLPVKLSDAEVDELFALVKANPGIHFDVDL




coli K-12

EAQEVKAGEKTYRFTIDAFRRHCMMNGLDSIGLTLQH



Mg1655)
DDAIAAYEAKQPAFMN






aspartate
MRVLKFGGTSVANAERFLRVADILESNARQGQVATVL
35


kinase/
SAPAKITNHLVAMIEKTISGQDALPNISDAERIFAEL



homoserine
LTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLL



dehydrogenase 
GQCPDSINAALICRGEKMSIAIMAGVLEARGHNVTVI



(thra
DPVEKLLAVGHYLESTVDIAESTRRIAASRIPADHMV




Escherichia

LMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADC




coli

CEIWTDVDGVYTCDPRQVPDARLLKSMSYQEAMELSY



MG1655)
FGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIG




ASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAA




RVFAAMSRARISVVLITQSSSEYSISFCVPQSDCVRA




ERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMR




TLRGISAKFFAALARANINIVAIAQGSSERSISVVVN




NDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGALL




EQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNL




ENWQEELAQAKEPFNLGRLIRLVKEYHLLNPVIVDCT




SSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQL




RYAAEKSRRKFLYDTNVGAGLPVIENLQNLLNAGDEL




MKFSGILSGSLSYIFGKLDEGMSFSEATTLAREMGYT




EPDPRDDLSGMDVARKLLILARETGRELELADIEIEP




VLPAEFNAEGDVAAFMANLSQLDDLFAARVAKARDEG




KVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENA




LAFYSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTL




SWKLGV






homoserine 
MVKVYAPASSANMSVGFDVLGAAVTPVDGALLGDVVT
38


kinase
VEAAETFSLNNLGRFADKLPSEPRENIVYQCWERFCQ



(thrB
ELGKQIPVAMTLEKNMPIGSGLGSSACSVVAALMAMN




Escherichia

EHCGKPLNDTRLLALMGELEGRISGSIHYDNVAPCFL




coli

GGMQLMIEENDIISQQVPGFDEWLWVLAYPGIKVSTA



MG1655)
EARAILPAQYRRQDCIAHGRHLAGFIHACYSRQPELA




AKLMKDVIAEPYRERLLPGFRQARQAVAEIGAVASGI




SGSGPTLFALCDKPETAQRVADWLGKNYLQNQEGFVH




ICRLDTAGARVLEN






threonine
MKLYNLKDHNEQVSFAQAVTQGLGKNQGLFFPHDLPE
41


synthase
FSLTEIDEMLKLDFVTRSAKILSAFIGDEIPQEILEE



(thrC
RVRAAFAFPAPVANVESDVGCLELFHGPTLAFKDFGG




Escherichia

RFMAQMLTHIAGDKPVTILTATSGDTGAAVAHAFYGL




coli

PNVKVVILYPRGKISPLQEKLFCTLGGNIETVAIDGD



MG1655)
FDACQALVKQAFDDEELKVALGLNSANSINISRLLAQ




ICYYFEAVAQLPQETRNQLVVSVPSGNFGDLTAGLLA




KSLGLPVKRFIAATNVNDTVPRFLHDGQWSPKATQAT




LSNAMDVSQPNNWPRVEELFRRKIWQLKELGYAAVDD




ETTQQTMRELKELGYTSEPHAAVAYRALRDQLNPGEY




GLFLGTAHPAKFKESVEAILGETLDLPKELAERADLP




LLSHNLPADFAALRKLMMNHQ






glucose-6- 
MAVTQTAQACDLVIFGAKGDLARRKLLPSLYQLEKAG
21


phosphate
QLNPDTRIIGVGRADWDKAAYTKVVREALETFMKETI



1-
DEGLWDTLSARLDFCNLDVNDTAAFSRLGAMLDQKNR



dehydrogenase
ITINYFAMPPSTFGAICKGLGEAKLNAKPARVVMEKP



(zwf
LGTSLATSQEINDQVGEYFEECQVYRIDHYLGKETVL




Escherichia

NLLALRFANSLFVNNWDNRTIDHVEITVAEEVGIEGR




coli K-12 

WGYFDKAGQMRDMIQNHLLQILCMIAMSPPSDLSADS



MG1655)
IRDEKVKVLKSLRRIDRSNVREKTVRGQYTAGFAQGK




KVPGYLEEEGANKSSNTETFVAIRVDIDNWRWAGVPF




YLRTGKRLPTKCSEVVVYFKTPELNLFKESWQDLPQN




KLTIRLQPDEGVDIQVLNKVPGLDHKHNLQITKLDLS




YSETFNQTHLADAYERLLLETMRGIQALFVRRDEVEE




AWKWVDSITEAWAMDNDAPKPYQAGTWGPVASVAMIT




RDGRSWNEFE






DERA
MIEYRIEEAVAKYREFYEFKPVRESAGIEDVKSAIEH
11


(Thermotoga
TNLKPFATPDDIKKLCLEARENRFHGVCVNPCYVKLA




maritima;

REELEGTDVKVVTVVGFPLGANETRTKAHEAIFAVES



AAD36625.1)
GADEIDMVINVGMLKAKEWEYVYEDIRSVVESVKGKV




VKVIIETCYLDTEEKIAACVISKLAGAHFVKTSTGFG




TGGATAEDVHLMKWIVGDEMGVKASGGIRTFEDAVKM




IMYGADRIGTSSGVKIVQGGEERYGG






DERA
MVKMNVETREELASLIDHTNVRADATENDIERLCREA
12


(Methano-
VSYGFRCAVVTPTNVRLAAELLEGTDVTVCSVVGFPA




thermobacter

GVSTPRVKALEASEAVENGAGEVDMVMNIGAMKSGNR




thermauto- 

ELVYRDISGVVDAAGVPVKVILETAYLTDKEKVEACL




trophicus;

ISKEAGAAFVKTSTAYGGLAGATVEDVMLMRKTVGDE



AAB85318.1)
MGVKASGGIRDLETALAMIDAGADRIGTSTGVQIIEG




WR






DERA
MSLASYIDHTLLKATATLADIRTLCEEAREHSFYAVC
13


(Deinococcus
INPVFIPHARAWLEGSDVKVATVCGFPLGAISSEQKA




radiodurans;

LEARLSAETGADEIDMVIHIGSALAGDWDAVEADVRA



AAF10775.1)
VRRAVPEQVLKVIIETCYLTDEQKRLATEVAVQGGAD




FVKTSTGFGTGGATVDDVRLMAEVIGGRAGLKAAGGV




RTPADAQAMIEAGATRLGTSGGVGLVSGGENGAGY






DERA
MNSAKLIDHTLLKPESTRTQIDQIIDEAKAYHFKSVC
14


(Staphylococcus
VNPTHVKYAAERLADSEVLVCTVIGFPLGASTTATKA




aureus;

FETEDAIQNGADEIDMVINIGALKDGRFDDVQQDIEA



strain
VVKAAKGHTVKVIIETVLLDHDEIVKASELTKVAGAD



Mu50;
FVKTSTGFAGGGATAEDVKLMKDTVGADVEVKASGGV



WP_001083318.1)
RNLEDFNKMVEAGATRIGASAGVQIMQGLEADSDY






DERA
MKLNKYIDHTLLKQDAKKKQIDSLLSEAREYDFASVC
15


(Streptococcus
VNPTWVEHAKKGLEGTDVKVCTVVGFPLGATTSAVKA




pneumoniae

FETKEAIQNGADEIDMVINVGALKSGNLALVESDIRA



serotype 
VVEASGDKLVKVIIEACLLTDQEKVVVCQLAQKAGAD



4 (strain
FVKTSTGFSTGGATIADVTLMRETVGSDMGVKAAGGA



ATCC
RSYADALAFVEAGATRIGTSAGVAILKGELADGDY



BAA-334/




TIGR4);




WP_000773677.1)







DERA
MTDLKASSLRALKLMDLTTLNDDDTDEKVIALCHQAK
16


(Escherichia
TPVGNTAAICIYPRFIPIARKTLKEQGTPEIRIATVT




coli

NFPHGNDDIDIALAETRAAIAYGADEVDVVFPYRALM



(strain
AGNEQVGFDLVKACKEACAAANVLLKVIIETGELKDE



K12);
ALIRKASEISIKAGADFIKTSTGKVAVNATPESARIM



NP_418798.1)
MEVIRDMGVEKTVGFKPAGGVRTAEDAQKYLAIADEL




FGADWADARHYRFGASSLLASLLKALGHGDGKSASSY






DERA
MTIAKMIDHTALKPDTTKEQILTLTKEAREYGFASVC
17


(Listeria
VNPTWVKLSAEQLAGAESVVCTVIGFPLGANTPEVKA




monocytogenes

FEVKDAIQNGAKEVDMVINIGALKDKDDELVERDIRA




serovar

VVDVAKGKALVKVIIETCLLTDEEKVRACEIAVKAGT



1/2a (strain
DFVKTSTGFSTGGATAEDIALMRKTVGPNIGVKASGG



ATCC
VRTKEDVEKMIEAGATRIGASAGVAIVSGEKPAKPDN



BAA-679/
Y



EGD-e);




NP_465519.1)







DERA
MSRSIAQMIDHTLLKPNTTEDQIVKLCEEAKEYSFAS
18


(Bacillus
VCVNPTWVALAAQLLKDAPDVKVCTVIGFPLGATTPE



halodurans
VKAFETTNAIENGATEVDMVINIGALKDKQYELVGRD



(strain
IQAVVKAAEGKALTKVIIETSLLTEEEKKAACELAVK



ATCC
AGADFVKTSTGFSGGGATAEDIALMRKVVGPNLGVKA



BAA-125/
SGGVRDLSDAKAMIDAGATRIGASAGVAIVNGERSEG



DSM 18197/
SY



FERM 7344/




JCM 9153/




C-125);




WP_010897517.1)







DERA
MNIAKLIDHTILKANTTKEDVMKVIEEAKEYKFASVC
19


(Bacillus
INPTWVKLAAEELAGHDVDVCTVIGFPLGASTTETKA




cereus

FETKDAIAKGATEVDMVINVGALKDGDDELVEKDIYE



(strain
VVQAAKGKALVKVIIETCLLTDEEKVRACELSVKAGA



ATCC10987/
DFVKTSTGFSTGGATAEDIALMRKTVGPNVGVKASGG



NRS 248);
VRTREDAEKMVAAGASRVGASASVAIVLNDAKGATDN



WP_001017443.1)
Y






DERA
MSLANIIDHTALKPHTQKADILKLIEEAKTYKFASVC
20


(Bacillus
VNPTWVELAAKELKGTGVDVCTVIGFPLGANTTETKA




subtilis

FETKDAISKGATEVDMVINIAALKDKEDDVVEADIRG



(strain
VVEAVAGKALVKVIIETCLLTDEEKERACRLAVSAGA



168);
DFVKTSTGFSTGGATKEDIALMRKTVGPDIGVKASGG



CAB15978.2)
VRTKEDVDTMVEAGASRIGASAGVSIVKGENASGGDN




Y






BFD
MASVHGTTYELLRRQGIDTVFGNPGSNELPFLKDFPE
76


(Benzoylformate
DFRYILALQEACVVGIADGYAQASRKPAFINLHSAAG



decarboxylase;
TGNAMGALSNAWNSHSPLIVTAGQQTRAMIGVEALLT




Pseudomonas

NVDAANLPRPLVKWSYEPASAAEVPHAMSRAIHMASM




putida

APQGPVYLSVPYDDWDKDADPQSHHLFDRHVSSSVRL



NBRC 14164;
NDQDLDILVKALNSASNPAIVLGPDVDAANANADCVM



WP_016501746.1)
LAERLKAPVWVAPSAPRCPFPTRHPCFRGLMPAGIAA




ISQLLEGHDVVLVIGAPVFRYHQYDPGQYLKPGTRLI




SVTCDPLEAARAPMGDAIVADIGAMASALANLVEESS




RQLPTAAPEPAKVDQDAGRLHPETVFDTLNDMAPENA




IYLNESTSTTAQMWQRLNMRNPGSYYFCAAGGLGFAL




PAAIGVQLAEPERQVIAVIGDGSANYSISALWTAAQY




NIPTIFVIMNNGTYGALRWFAGVLEAENVPGLDVPGI




DFRALAKGYGVQALKADNLEQLKGSLQEALSAKGPVL




IEVSTVSPVK






KDC
MYTVGDYLLDRLHELGIEEIFGVPGDYNLQFLDQIIS
77


(indole-3- 
REDMKWIGNANELNASYMADGYARTKKAAAFLTTFGV



pyruvate
GELSAINGAGSYAENLPVVEIVGSPTSKVQNDGKFVH



decarboxylase;
HTLADGDFKHFMKMHEPVTAARTLLTAENATYEIDRV




Lactococcus

LSQLLKERKPVYINLPVVAAAKAEKPALSLEKESSTT




lactis

NTTEQVILSKIEESLKNAQKPVVIAGHEVISFGLEKT



WP_046124870.1)
VTQFVSETKLPITTLNFGKSAVDESLPSFLGIYNGKL




SEISLKNFVESADFILMLGVKLTDSSTGAFTHHLDEN




KMISLNIDEGIIFNKVVEDFDFRAVVSSLSELKGIEY




EGQYIKQYEEFIPSSAPLSQDRLWQAVESLTQSNETI




VAEQGTSFFGASTIFLKSNSRFIGQPLWGSIGYTFPA




ALGSQIADKESRHLLFIGDGSLQLTVQELGLSIREKL




NPICFIINNDGYTVEREIHGPTQSYNDIPMWNYSKLP




ETFGATEDRVVSKIVRTENEFVSVMKEAQADVNRMYW




IELVLEKEDAPKLLKKMGKLFAEQNK






KDC
MTDDGYTVGDYLLDRLAELGVTEVFGVPGDYQLEFLD
68


(Mycobacterium
HVVAHPRITWVGGANELNAGYAADGYGRLRGMAALVT




smegmatis

TFGVGELSAANAIAGSYAEHVPVVHIVGAPSKDSQAA



(strain
RRIVHHTLGDGDFEHFLRMSREITCAQANLVPATATR



ATCC 700084/
EIDRVLSEVHEQKRPGYLLIATDVARFPTEPPTAPLP



mc(2)155)
RHSGGTSPRALSLFTEAATQLIGEHRLTVLADFLVHR



WP_011730751.1)
MGCVEALNKLLTADTVPHATLMWGKSLVDESSPNFLG




IYAGAASEGSVREVIEDAPVLVTAGVLFTDMVSGFFS




QRIDPARTIDIGVNQSVVAGQVFAPLDMAAALDALAS




ILAERGIESPALPPAPAPQRPAAPPRDAVLTQEALWD




RLAEALTPGNVVLADQGTSFYGLAGHRLASGVTFIGQ




PLWASIGYTLPAALGAGLADRDRRTVLLIGDGAAQLT




VQELGAFGREGLTPVVVVVNNNGYTVERAIHGVTARY




NDITAWRWTELPAALGVPDALTFRCATYGELDDALTV




AAETQDRMVFVEVMLERMDIPPLLGELAQSASAANAK









Example 1: Selection of Adolase Enzymes for the Trans-2-Unsaturated Aldehyde and Delta-Lactone Pathways

The first operation in the trans-2-unsaturated aldehyde pathway and the delta-lactone pathway is the condensation of two aldehyde molecules by analdolase enzyme. To identify the aldolase enzymes that may catalyze the aldehyde condensation reaction, a list of various aldolase homologues or putative aldolases was identified based on sequence similarity to E. coli deoxyribose-5-phosphate aldolase (DERA). These various aldolases are listed in TABLE 15.













TABLE 15









% Identity


Name
Organism
PDB No.
GI
to BH1352



















Aldolase
Thermus
1J2W

45




thermophilus HB8






deoxyribose-5-phosphate

Escherichia coli

1JCJ
16974931
35


aldolase
(K201L mutant)





deoxyribose-5-phosphate

Escherichia coli

1KTN
28372821
36


aldolase






Aldolase

Aquifex aeolicus

1MZH
29726561
43


Aldolase

Aeropyrum pernix

1N7K
29726561
37


deoxyribose-5-phosphate

Pyrobaculum

1VCV
73535346
36


aldolase

aerophilum






deoxyribose-5-phosphate

Plasmodium yoelii

2A4A
75766024
35


aldolase






Hydrolase

Streptomyces

3I78
297342939
39




griseus






deoxyribose-5-phosphate

Mycobacterium

3NDO
30101668
44


aldolase

smegmatis






deoxyribose-5-phosphate

Mycobacterium

3NG3
299689368
46


aldolase

avium 104






deoxyribose-5-phosphate

Entamoeba

3NGJ
301016075
69


aldolase

histolydca






Putative deoxyribose-5-

Coccidioides

3OA3
303325189
51


phosphate aldolase
immitis





deoxyribose-5-phosphate

Thermotoga

3R12
329666268
55


aldolase

maritima






deoxyribose-5-phosphate

Lactobacilus brevis

4XBK
946926237
53


aldolase






deoxyribose-5-phosphate

Lactobacilus brevis

4XBS
946926241
53


aldolase
(E78K mutant)





deoxyribose-5-phosphate

Colwelia

5C2X
985483947
37


aldolase

psychereythraea






deoxyribose-5-phosphate

Shewanella

5C6M
985483957
34


aldolase

halifaxensis






deoxyribose-5-phosphate

Streptococcus suis

5DBT
1018192340
69


aldolase









Example 2: Selection of Aldolase Enzymes for the Gamma-Lactone Pathway

The first step in the gamma-lactone pathway is the condensation of analdehyde and a pyruvate or pyruvic acid by analdolase enzyme. To identify the aldolase enzymes that may catalyze the condensation reaction, a list of various aldolase homologues or putative aldolases was identified based on sequence similarity to E. coli pyruvate dependent aldolase Eda (listed in TABLE 16), E. coli 4-hydroxy-2-oxoheptanedioate aldolase HpcH (listed in TABLE 17). E. coli 4-hydroxy-2-oxovalerate aldolase MphE (listed in TABLE 18), and Paraburkholderia xenovorans LB400 4-hydroxy-2-oxovalerate aldolase BphI (listed in TABLE 19).













TABLE 16









% Ident


Name
Organism
PDB No.
GI
to Eda



















KDPG aldolase

Escherichia coli

1FQ0
10835436
100


KDPG aldolase

E. coil (K133Q/T161K

1FWR
10835451
99



mutant)





KDPG aldolase

Pseudomonas putida

1MXS
37926857
46


Putatve KDPG aldolase

Haemophilus influenza

1VHC
40889919
37


KDPG aldolase

Thermotoga maritima MSB8

1VLW
52696233
33


KDPG aldolase

Thermotoga maritima

1WA3
60593837
33


KDPG aldolase

E. coli

2C0A
75766436
99


KDPG aldolase

E. coli (E45N mutant)

1WAU
88191724
99


KDGgal

E. coli

2V81
157836718
25


Oxoglutarate Aldolase

Thermus thermophilus HB8

2YW3
159795598
40


KDPG aldolasc

Oleispira antarctica

3VCR
372467175
50


Keto-hydroxyglutarte-

Vibrionales bacterium

4E38
380765206
40


aldolase
SWAT-3





KDPG Aldolase

Zymomonas mobilis

4BK9
635576158
54


Unknown protein

E. coli

4QCC
723586857
26




















TABLE 17









% Ident to


Name
Organism
PDB No.
GI
HpcH



















Apo Class II Aldolase

Escherichia coli C

pdb|2V5J|A
158430996
100


Hpch






Pyruvate Aldolase

Escherichia coli ATCC

pdb|4B5X|A
402550309
99


(HpaI), Mutant D42a
8739





Pyruvate Aldolase R70a

Escherichia coli ATCC

pdb|4B5W|A
402550303
99


Mutant, HpaI
8739





Pyruvate Aldolase, HpaI,

Escherichia coli ATCC

pdb|4B5S|A
402550295
10



8739





Class II Aldolase

Escherichia coli K-12

pdb|2VWS|A
198443128
56


2-Dehydro-3-Deoxy-

Escherichia coli

pdb|1DXE|A
10120730
48


Galactarate Aldolase






Hpch/hpal

Burkholderia cenocepacia

pdb|4MF4|A
538261383
42


Aldolase/citrate Lyase
J2315





Family Protein






HpchHPAI ALDOLASE

Desulfitobacterium

pdb|3QZ6|A
326328104
33




hafniense DCB-2






Citrate Synthase Sbng

Staphylococcus aureus

pdb|4TV5|A
700588452
28



subsp. aureus str. Newman





Citrate Synthase Variant

Staphylococcus aureus

pdb|4TV6|A
700588453
27


Sbng E151q
subsp. aureus str. Newman





Macrophomate Synthase

Macrophoma commelinae

pdb|1IZC|A
29726304
25




















TABLE 18









% Ident to


Name
Organism
PDB No.
GI
MphE



















Bifunctional Aldolase-

Pseudomonas sp. CF600

pdb|1NVM|A
33357505
83


Dehydrogenase






Aldolase/aldehyde

Thermomonospora curvata

pdb|4LRS|A
538261320
53


Dehydrogenas
DSM 43183





Aldolase-dehydrogenase

Mycobacterium

pdb|4JN6|A
491668629
52




tuberculosis






Putative Aldolase

Bacteroides vulgatus

pdb|3DXI|A
197305157
25


(Bvu_2661)
ATCC 8482





Truncated Alpha-

Neisseria meningitidis

pdb|3RMJ|A
380258925
26


isopropylmalate Synthase
serogroup B





2-Isopropylmalate

Cytophaga hutchinsonii

pdb|3EEG|A
203282565
23


Synthase
ATCC 33406





Isopropylmalate Synthase

Leptospira biflexa serovar

pdb|4OV4|A
673541345
22



Patoc strain ‘Patoc 1 (Paris)’




















TABLE 19









% Ident to


Name
Organism
PDB No.
GI
BphI



















Bifunctional Aldolase-

Pseudomonas sp. CF600

pdb|1NVM|A
33357505
57


Dehydrogenase






Aldolase-dehydrogenase

Mycobacterium

pdb|4JN6|A
491668629
51




tuberculosis






The Bifunctional Enzyme

Thermomonospora curvata

pdb|4LRS|A
538261320
52


(aldolase/aldehyde
DSM 43183





Dehydrogenase)






2-Isopropylmalate

Cytophaga hutchinsonii

pdb|3EEG|A
203282565
25


Synthase
ATCC 33406





Putative 2-

Listeria monocytogenes

pdb|3EWB|X
218766747
27


Isopropylmalate Synthase
serotype 4b str. F2365





Putative Aldolase

Bacteroides vulgatus

pdb|3DXI|A
197305157
40


(Bvu_2661)
ATCC 8482





Homocitrate Synthase

Thermus thermophilus

pdb|2ZTJ|A
261278579
37



HB27





Oxaloacetate

Vibrio cholerae

pdb|2NX9|A
122921155
24


Decarboxylase






Isopropylmalate Synthase

Leptospira biflexa serovar

pdb|4OV4|A
673541345
29



Patoc strain ‘Patoc 1 (Paris)’





Carboxyl Transferase

Rhizobium etli CFN 42

pdb|4JX6|A
508123917
35


Multifunctional Pyruvate

Rhizobium etli CFN 42

pdb|2QF7|A
158430175
35


Carboxylase













Example 3: Strain for Producing Trans-2-Hexenal Using Aldolase-Based Pathway

Strain E. coli MG1655 ΔpflB ΔldhA ΔadhE ΔyqhD ΔeutG ΔadhP ΔyjgB::cmR containing pArdra002 plasmid expressing pyruvate decarboxylase from Zymomonas mobilis (PDC), deoxyribose-5-phosphate aldolase from Bacillus halodurans (BH1352), and aldo-keto reductase from Pseudomonas aeruginosa (PA1127) was examined for the production of trans-2-hexenal. The strain was prepared by deleting seven genes, pflB, ldhA, adhE, yqhD, eutG, adhP, and yjgB using the lambda red recombinase method described in Datsenko and Wanner (Datsenko et al., PNAS USA 97: 6640-6645 (2000)).


Seed Preparation: Two seed cultures were prepared. The first seed culture was prepared by inoculating a glycerol stock in 5 mL medium and grown overnight. A second seed culture was prepared in 50 mL falcon tubes by inoculating 100 μL of the first seed culture in 10 mL media and grown overnight. The cultures were used to inoculate the fermenter to an optical density at 600 nm (OD600) of approximately 0.2 (range from about 0.15 to about 0.25).


Growth Phase: Cells were grown in a 2 L fermenter with an agitation set to 1000 RPM and DO over 50% until an OD600 of approximately 18 was reached. Protein expression was then induced with 0.1 M IPTG (1.5 mL of 1 M stock). Protein expression was carried out for about 7 hours. At about 7 hours or when the glucose from the medium was completely consumed, additional glucose was fed to the cells.


Production Phase: The production phase started after about 7 hours of protein expression by reducing the agitation to 500 RPM and by shutting down the air-flow. In addition to the 3.3 g/L glucose (10 mL of 50% glucose stock), 3 g/L of both acetaldehyde and butyraldehyde were added to the culture at the start and after 2 hours from the start of the production phase. Additional 3 g/L of both acetaldehyde and butyraldehyde were added to the culture after about 15 hours. The total production time was approximately 20 hours.


Sampling: 1 mL samples were taken throughout and especially before and after addition of glucose.


Results: 2.5 g/L of trans-2-hexenal was produced in about 20 hours. The progression of the trans-2-hexenal production during the production phase is shown in FIG. 11.


Example 4: Theoretical Yield Calculation of Trans-2-Hexenal


FIG. 1 describes a fatty acid oxidation pathway used in the industry to produce natural trans-2-hexenal. As shown in FIG. 1, one mole of linolenic acid provides 1 mole of trans-2-hexenal. The theoretical yield of trans-2-hexenal per mole of carbon according to the pathway of FIG. 1 can be calculated as follows:













Mole amount
Theoretical yield of trans-2-hexenal







 1 mole of linolenic acid
   1 mole of trans-2-hexenal


18 moles of carbon
   1 mole of trans-2-hexenal


 1 mole of carbon
0.055 mole of trans-2-hexenal









The theoretical yield of trans-2-hexenal according to the present disclosure is about twice as much as the theoretical yield of natural trans-2-hexenal. According to the present disclosure:


(1) 0.5 mole of glucose=1 mole of acetaldehyde


(2) 1 mole of glucose=1 mole of butyraldehyde


(3) 1 mole of acetaldehyde+1 mole of butyraldehyde=1 mole of trans-2-hexenal


Thus, the theoretical yield of trans-2-hexenal according to the present disclosure can be determined as follow:
















Mole amount
Theoretical yield of trans-2-hexenal









1.5 moles of glucose
  1 mole of trans-2-hexenal



  9 moles of carbon
  1 mole of trans-2-hexenal



  1 mole of carbon
0.11 mole of trans-2-hexenal










Example 5: Theoretical Yield Calculation of Delta-Decalactone

The theoretical yield of delta decalactone according to the present disclosure:


(1) 1 mole of glucose=2 moles of acetaldehyde


(2) 2.5 moles of glucose=1 mole of hexanal


(3) 2 moles of acetaldehyde+1 mole of hexanal=1 mole of delta-decalactone


Thus, the theoretical yield of delta-decalactone according to the present disclosure can be determined as follow:
















Mole amount
Theoretical yield of delta-decalactone









3.5 moles of glucose
   1 mole of delta-decalactone



 21 moles of carbon
   1 mole of delta-decalactone



  1 mole of carbon
0.048 mole of delta-decalactone










Example 6: Theoretical Yield Calculation of Gamma-Decalactone

The theoretical yield of gamma-decalactone according to the present disclosure:


(1) 1 mole of glucose=2 moles of pyruvate


(2) 3 moles of glucose=1 mole of heptanal


(3) 1 mole of pyruvate+1 mole of heptanal=1 mole of gamma-decalactone


Thus, the theoretical yield of gamma-decalactone according to the present disclosure can be determined as follow:













Mole amount
Theoretical yield of gamma-decalactone







3.5 moles of glucose
   1 mole of gamma-decalactone


 21 moles of carbon
   1 mole of gamma-decalactone


  1 mole of carbon
0.048 mole of gamma-decalactone









Example 7: Construction Of Pyruvate-Accumulating Strain

As pyruvate is the first metabolite in the trans-2-unsaturated aldehydes, delta-lactones, and gamma-lactones pathways, increasing production requires a host organism that can accumulate pyruvate. The latter represents a key metabolite in the central carbon metabolism of most common microorganisms. In E. coli, pyruvate is the main precursor to several native fermentative by-products. To produce high titres of pyruvate in E. coli, pathways draining pyruvate to fermentative products were deleted. Seven genes, adhE (encoding alcohol dehydrogenase), ldhA (encoding lactate dehydrogenase), pflB (encoding pyruvate formate lyase), yqhD (encoding NADP-dependent alcohol dehydrogenase), eutG (encoding alcohol dehydrogenase), adhP (encoding alcohol dehydrogenase), and yjgB (encoding ZN-dependent alcohol dehydrogenase) were deleted from E. coli. The gene deletions were sequentially transferred from the corresponding single gene deletion mutants from KEIO collection to the wild type strain using P1 transduction method. The gene deletions were confirmed using PCR using at least the primers provided in the sequences: F-adhE-check (ggtcaactaa tccttaactg atcg) and R-adhEcheck (aagcaaatca tcaccgcact gac), F-ldhA-check (caggcttagc gcaacaaacg) and R-ldhA-check (ggttgcgcct acactaagc), F-pflBcheck (tccacttaag aaggtaggtg ttac) and R-pflA-check (aaagttgccg ctttacgggg aaa). The resultant strain with seven gene deletions was designated as LMSE-40C.

Claims
  • 1. A non-naturally occurring microorganism having a trans-2-unsaturated aldehyde pathway, wherein the non-naturally occurring microorganism is transformed with an exogenous nucleic acid encoding an aldolase comprising the amino acid sequence of SEQ ID NOs: 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20, an exogenous nucleic acid encoding a decarboxylase comprising the amino acid sequence of SEQ ID NOs: 68, 71, 76, or 77 and an exogenous nucleic acid encoding an aldo-ketoreductase comprising the amino acid sequence of SEQ ID NOs: 30, 31, 32, or 33, wherein the non-naturally occurring microorganism produces a trans-2-unsaturated aldehyde.
  • 2. The non-naturally occurring microorganism of claim 1, wherein the non-naturally occurring microorganism comprises an increased enzymatic activity of at least one enzyme in said trans-2-unsaturated aldehyde pathway in comparison with the enzymatic activity of the same enzyme in a corresponding unmodified or wild-type microorganism.
  • 3. The non-naturally occurring microorganism of claim 1, wherein the non-naturally occurring microorganism further comprises an exogenous nucleic acid encoding a ketoreductase, oxidoreductase, aldehyde reductase, enoate reductase, dehydratase, lactonization enzyme, or alcohol dehydrogenase.
  • 4. The non-naturally occurring microorganism of claim 1, wherein the decarboxylase comprises the amino acid sequence of SEQ ID NO: 71.
  • 5. The non-naturally occurring microorganism of claim 1, wherein the decarboxylase comprises the amino acid sequence of SEQ ID NO: 76.
  • 6. The non-naturally occurring microorganism of claim 1, wherein the decarboxylase comprises the amino acid sequence of SEQ ID NO: 68 or 77.
  • 7. The non-naturally occurring microorganism of claim 1, wherein the non-naturally occurring microorganism further comprises an exogenous nucleic acid encoding an, ethylnitronate:oxygen 2-oxidoreductase (nitriteforming), pyruvate carboxy-lyase (acetaldehyde-forming), acetaldehyde:NAD+ oxidoreductase (CoA-acetylating), acetaldehyde:acceptor oxidoreductase, Acetaldehyde:NAD+ oxidoreductase, Acetaldehyde:NADP+ oxidoreductase, Ethanol:NADP+ oxidoreductase, 2-Phosphonoacetaldehyde phosphonohydrolase, ethanolamine-phosphate phosphate-lyase (deaminating; acetaldehyde-forming), ethanolamine ammonia-lyase (acetaldehyde-forming), 4-hydroxy-2-oxopentanoate pyruvate-lyase (acetaldehyde-forming), L-threonine acetaldehyde-lyase (glycine-forming), (S)-lactate acetaldehyde-lyase (formate-forming), ethanol:NAD+ oxidoreductase, Pyruvate decarboxylase, TPP dependent reaction, Nitroethane:oxygen oxidoreductase, acetaldehyde:pyrroloquinoline-quinone oxidoreductase, 2-deoxy-D-ribose-S-phosphate acetaldehyde-lyase (D-glyceraldehyde-3-phosphate-forming), 17alpha-Hydroxyprogesterone acetaldehyde-lyase, 3-Hydroxybutan-2-one:D-ribose-5-phosphate aldehydetransferase, 24R,24(1)R)-fucosterol-epoxide acetaldehyde-lyase (desmosterol-forming), ethanol:cytochrome c oxidoreductase, acetaldehyde hydro-lyase, diethanolamine ethanolamine-lyase (acetaldehyde-forming), triethanolamine diethanolamine-lyase (acetaldehyde-forming), L-allo-threonine acetaldehyde-lyase (glycine-forming), fluoroacetaldehyde:L-threonine aldehydetransferase, cobalt-precorrin 5A acylhydrolase, D-threonine acetaldehyde-lyase (glycine-forming), 17alpha-Hydroxypregnenolone acetaldehyde-lyase, ethanol:cytochrome c oxidoreductase, chloroethane, donor:oxygen oxidoreductase (dechlorinating, acetaldehydeforming), ethanol:quinone oxidoreductase, acetyl-CoA:acetoin O-acetyltransferase, ethanol:N,N-dimethyl-4-nitrosoaniline oxidoreductase, 7,8-dihydroneopterin 3′-triphosphate acetaldehyde-lyase (6-carboxy-5,6,7,8-tetrahydropterin and triphosphate-forming), or choline trimethylamine-lyase (acetaldehyde-forming).
  • 8. The non-naturally occurring microorganism of claim 1, wherein one or more genes encoding an enzyme that utilizes pyruvate are deleted from the non-naturally occurring microorganism as compared to a wild-type microorganism.
  • 9. The non-naturally occurring microorganism of claim 1, wherein one or more genes encoding an enzyme capable of catalyzing a reaction involving an acetaldehyde molecule, a pyruvate, or an aldehyde precursor are deleted from the non-naturally occurring microorganism as compared to a wild-type microorganism.
  • 10. The non-naturally occurring microorganism of claim 8, wherein the trans-2-unsaturated aldehyde is trans-2-hexenal.
CROSS-REFERENCE

This application is a continuation of International Application No. PCT/US2017/040019, filed Jun. 29, 2017, which claims the benefit of U.S. Provisional Application No. 62/357,337, filed Jun. 30, 2016 and U.S. Provisional application No. 62/415,363, filed Oct. 31, 2016, each of which is entirely incorporated herein by reference.

US Referenced Citations (6)
Number Name Date Kind
20070117191 Kamachi et al. May 2007 A1
20070254341 Raemakers-Franken et al. Nov 2007 A1
20080289056 Greenberg et al. Nov 2008 A1
20110014669 Madden et al. Jan 2011 A1
20120142081 Yoshikuni et al. Jun 2012 A1
20160060635 Liao et al. Mar 2016 A1
Foreign Referenced Citations (6)
Number Date Country
WO-2005118794 Dec 2005 WO
WO-2006134482 Dec 2006 WO
WO-2007068498 Jun 2007 WO
WO-2015042201 Mar 2015 WO
WO-2017011815 Jan 2017 WO
WO-2018005806 Jan 2018 WO
Non-Patent Literature Citations (53)
Entry
Genbank Q9X1P5.1.2014. Genbank. p. 1-8 (Year: 2014).
Genbank Q6QBS4. 2006. Genbank. p. 1-2 (Year: 2006).
Kizer L et al. Application of Functional Genomics to Pathway Optimization for Increased Isoprenoid Production. Applied and Environmental Microbiology, May 2008, p. 3229-3241. (Year: 2008).
Prather KLJ et al. De novo biosynthetic pathways: rational design of microbial chemical factories. Current Opinion in Biotechnology 2008, 19:468-474. (Year: 2008).
Dobritzsch D et al. High Resolution Crystal Structure of Pyruvate Decarboxylase from Zymomonas mobilis. 1998. The Journal of Biological Chemistry. vol. 273, No. 32, Issue 7. p. 20196-20204. (Year: 1998).
Hasson MS et al. Purification and crystallization of benzoylformate decarboxylase. 1995. Protein Science. 955-959. (Year: 1995).
Atsumi et al. Non-fermentative pathways for synthesis of branched-chain higher alcohols as biofuels. Nature 451:86-89 (2008).
Baba et al. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol 2006.0008 (2006).
Baker et al. Rational approaches for engineering novel functionalities in carbon-carbon bond forming enzymes. Comput Struct Biotechnol J 2:e201209003 (2012).
Billman-Jacobe. Expression in bacteria other than Escherichia coli. Current Opinion in Biotechnology 7:500-4 (1996).
Bitter et al. Expression and secretion vectors for yeast. Methods Enzymol 153:516-544 (1987).
Boros et al. Expression vectors based on the rac fusion promoter. Gene 42:97-100 (1986).
Buckholz. Yeast systems for the commercial production of heterologous proteins. BioTechnology 9:1067-1072 (1991).
Bussineau et al. Genetic stability of protein expression systems in yeast. Developments in Biological Standardization 83:13-19 (1994).
Candy et al. Expression of active yeast pyruvate decarboxylase has also been studied. Journal of General Microbiology 137:2811-2815 (1991).
Candy et al. The role of residues glutamate-50 and phenylalanine-496 in Zymomonas mobilis pyruvate decarboxylase, the promoter and nucleotide sequences of the Zymomonas mobilis pyruvate decarboxylase, and the active site mutants of pyruvate decarboxylase from Zymomonas mobilis have been studied. The Biochemical Journal 315(Pt 3):745-51 (1998).
Conway et al. Promoter and nucleotide sequences of the Zymomonas mobilis pyruvate decarboxylase. Journal of Bacteriology 169(3):949-54 (1987).
Datsenko et al. One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. PNAS USA 97(12):6640-6645 (2000).
Deboer et al. Promoters, Structure and Function. Edited by Rodriguez and Chamberlin (Eds). Praeger, New York, pp. 462-481 (1982).
Deboer et al. The tac promoter: a functional hybrid derived from the trp and lac promoters PNAS USA 80:21-25 (1983).
Diamond et al. Method of RNA sequence analysis. Methods in Enzymology 100:431-453 (1983).
Finn et al. Pfam: the protein families database. Nucl Acids Res 42(D1):D222-D230 (Jan. 1, 2014).
Fleer. Engineering yeast for high level expression. Current Opinion in Biotechnology 3:486-496 (1992).
Gellissen et al. Heterologous protein production in yeast. Antonie van Leuwenhoek 62:79-93 (1992).
Gigot et al. The lipoxygenase metabolic pathway in plants: potential for industrial production of natural green leaf volatiles. Biotechnol Agron soc Environ 14(3):451-460 (2010) (English Abstract).
Gijsen et al. Unprecedented Asymmetric Aldol Reactions with Three Aldehyde Substrates Catalyzed by 2-Deoxyribose-5-phosphate Aldolase. J Am Chem Soc 116(18):8422-8423 (1994).
Gocke et al. Comparative characterisation of thiamine diphosphate-dependent decarboxylases. Journal of Molecular Catalysis B: Enzymatic 61(1-2):30-35 (2009).
Griffiths et al. Production of heterologous proteins using the baculovirus/insect expression system. Methods Mol Biol 75:427-440 (1997).
Gruez et al. Crystal structure and kinetics identify Escherichia coli YdcW gene product as a medium-chain aldehyde dehydrogenase J Mol Biol 343(1):29-41 (2004).
Hensing et al. Physiological and technological aspects of large-scale heterologous-protein production with yeasts. Antonie van Leuwenhoek 67:261-279 (1995).
Hockney. Recent developments in heterologous protein production in Escherichia coli. Trends in Biotechnology 12:456-463 (1994).
Hummel et al. Dehydrogenases for the synthesis of chiral compounds. Eur J Biochem 184:1-13 (1989).
Killenberg-Jabs et al. Purification and characterisation of the pyruvate decarboxylase from a haploid strain of Saccharomyces cerevisiae. Biological Chemistry Hoppe-Seyler 377(5):313-317 (1996).
Lee et al. Engineering of NADPH regenerators in Escherichia coli for enhanced biotransformation. Applied Microbiology and Biotechnology 97(7):2761-72 (2013).
Mak et al. Integrative genomic mining for enzyme function to enable engineering of a non-natural biosynthetic pathway. Nat Commun 6:10005 (2015).
McPherson et al. PCR a Practical Approach. Oxford University Press pp. 200-209 (1991).
Nobuhara. Syntheses of Unsaturated Lactones: Part I. Some Lactones of 5-Substituted-5-hydroxy-2-enoic Acids as a Synthetic Butter or Butter Cake Flavor Agr. Biol. Chem 32(8):1016-1020 (1968).
Oxford Dictionary of Biochemistry and Molecular Biology, Revised Edition, A D. Smith, Ed., New York: Oxford University Press (1997) pp. 161, 476, 477, and 560.
PCT/US2017/040019 International Preliminary Report on Patentability dated Jan. 10, 2019.
PCT/US2017/040019 International Search Report and Written Opinion dated Dec. 21, 2017.
PCT/US2017/040019 Invitation to Pay Additional Fees dated Oct. 3, 2017.
Pohl et al. Active site mutants of pyruvate decarboxylase from Zymomonas mobilis—a site-directed mutagenesis study of L112,I472,I476, E473, and N482. Eur J Biochem 257:538-546 (1998).
Ramachandran et al. Asymmetric synthesis of goniothalamin, hexadecanolide, massoia lactone, and parasorbic acid via sequential allylboration-esterification ring-closing metathesis reactions. Tetrahedron Letters 41:583-586 (2000).
Ricardo et al. Bioactive pyrones and flavonoids from Cryptocarya ashersoniana seedlings. ARKIVOC Available at< http://hdl.handle.net/11449/32714> pp. 127-136 (2004).
Rodriguez et al. Toward aldehyde and alkane production by removing aldehyde reductase activity in Escherichia coli. Metab Eng 25:227-237 (2014).
Sawers et al. Alternative regulation principles for the production of recombinant proteins in Escherichia coli. Applied Microbiology and Biotechnology 46:1-9 (1996).
Siegert et al. Exchanging the substrate specificities of pyruvate decarboxylase from Zymomonas mobilis and benzoylformate decarboxylase from Pseudomonas putida. Protein Engineering, Design & Selection 18(7):345-57 (2005).
Studier et al. Use of T3 RNA polymerase to direct expression of cloned genes. Methods Enzymol. 185:60-89 (1990).
Vedvick. Gene expression in yeast: Pichia pastoris. Current Opinion in Biotechnology 2:742-745 (1991).
Webb. Enzyme Nomenclature. Orlando, Fl. pp. 20-141 (1984).
Yu et al. Cosmid cloning and walking to map human CD1 leukocyte differentiation antigen genes. Meth. Enzymol. 217:378-398 (1993).
Zhang et al. Expanding metabolism for biosynthesis of nonnatural alcohols. PNAS USA 105(52):20653-20658 (2008).
Zoller et al. Oligonucleotide-directed mutagenesis using M13-derived vectors: an efficient and general procedure for the production of point mutations in any fragment of DNA. Nucleic Acid Res. 10:6487-6500 (1982).
Related Publications (1)
Number Date Country
20190177713 A1 Jun 2019 US
Provisional Applications (2)
Number Date Country
62415363 Oct 2016 US
62357337 Jun 2016 US
Continuations (1)
Number Date Country
Parent PCT/US2017/040019 Jun 2017 US
Child 16225611 US