Protopanaxatriol (C30H52O4) is a triterpenoid with a dammarane-type tetracyclic scaffold that is derived from Dammarenediol-II (“dammarenediol”). Protopanaxatriol can be used as a pesticide, for example, against sucking and chewing insects. In addition, protopanaxadiol and protopanaxatriol are known to have various biological properties, including anti-inflammatory, anxiolytic, anti-stress, and anti-tumor effects. Oh et al., Anti-stress effects of 20(S)-protopanaxadiol and 20(S)-protopanaxatriol in immobilized mice, Biol Pharm Bull 38(2):331-335 (2015). Protopanaxatriol, which naturally occurs in Panax ginseng (ginseng) and Panax pseudoginseng (notoginseng), is conventionally isolated from ginseng root. Natural ginseng is very rare. Cultivated ginseng is very slow growing, requiring about seven to eleven years of growth before harvest. Therefore, there is a need for biotechnology processes that produce dammarenediol and its derivatives, including protopanaxadiol, protopanaxatriol, and including glycosylated forms or ginsenosides. Ginsenosides have a long history of use to boost immunity, and have a wide spectrum of pharmacological activities. Zhou et al., The Synergistic Effects of Polysaccharides and Ginsenosides From American Ginseng (Panax quinquefolius L.) Ameliorating Cyclophosphamide-Induced Intestinal Immune Disorders and Gut Barrier Dysfunctions Based on Microbiome-Metabolomics Analysis, Frontiers in Immunology 12: 1273 (2021); Leung and Wong Pharmacology of ginsenosides: a literature review. Chin Med 5: 1-7 (2010).
In accordance with various embodiments, the invention provides engineered microbial cells, enzymes, and methods for producing dammarenediol-II (“dammarenediol”) as well as compounds derived from dammarenediol, such as but not limited to protopanaxadiol and protopanaxatriol, and glycosylated forms thereof (“e.g., ginsenosides”). In accordance with the disclosure, microbial host cells are engineered to express a heterologous biosynthetic pathway that produces dammarenediol (or a derivative thereof). The heterologous pathway will generally comprise a dammarenediol synthase (DDS) enzyme (such as an engineered DDS described herein) which acts on 2,3-oxidosqualene substrate, and in various embodiments further comprises a protopanaxadiol synthase (PPDS) enzyme for production of protopanaxadiol (which can be an engineered PPDS described herein), and optionally a protopanaxatriol synthase (PPTS) enzyme for production of protopanaxatriol (which can be an engineered PPTS described herein). In some embodiments, the host cell can further express a heterologous uridine diphosphate-dependent glycosyltransferase (UGT) enzyme producing natural or non-natural glycosylated forms of dammarenediol, protopanaxadiol or protopanaxatriol, generally referred to as ginsenosides.
Accordingly, in various aspects, the present disclosure provides microbial host cells and methods for producing dammarenediol or a derivative thereof (i.e., a compound derived from dammarenediol), and which involves expressing a heterologous biosynthetic pathway.
The heterologous biosynthetic pathway comprises one or more of: a dammarenediol synthase (DDS) enzyme comprising an amino acid sequence that has at least about 70% sequence identity to the amino acid sequence of SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 9; a protopanaxadiol synthase (PPDS) enzyme comprising an amino acid sequence that has at least about 70% sequence identity to the amino acid sequence of SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 16; and a protopanaxatriol synthase (PPTS) enzyme comprising an amino acid sequence that has at least about 70% sequence identity to the amino acid sequence of SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, and SEQ ID NO: 21.
In some embodiments, the DDS enzyme comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 3. Exemplary engineered derivatives of SEQ ID NO: 3 include enzymes having one or more mutations shown in Tables 1 and 2, and the enzymes represented by SEQ ID NOS: 81, 82, and 85.
In various aspects the heterologous biosynthetic pathway comprises a PPDS enzyme comprising an amino acid sequence that has at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 10. Exemplary engineered PPDS enzymes of the disclosure can have one or more mutations shown in Table 3. In exemplary embodiments, the PPDS enzyme comprises at least one, two, three, four, five, six, seven, eight, or all of the following mutations with respect to SEQ ID NO: 10: T108N, I212F, K338G, D135E, S68P, V150P, F167H, L283M, H482R, R347Q, M390L, R243K, L292I, V329M, Q278E, and N58E. An exemplary engineered derivative of SEQ ID NO: 10 includes enzymes comprising the amino acid sequence of SEQ ID NO: 83.
In various aspects, the heterologous biosynthetic pathway comprises a PPTS enzyme comprising an amino acid sequence that has at least about 70% sequence identity to the amino acid sequence of SEQ ID NO: 17. Exemplary engineered PPTS enzymes of the disclosure include enzymes having one or more mutations shown in Table 4. In some embodiments, the PPTS enzyme comprises at least 2, or at least 3, or at least 4, or at least 5, or at least 8, or at least 10 amino acid substitutions with respect to SEQ ID NO: 17 as listed in Table 4. Exemplary PPTS enzymes comprise at least two, at least 3, at least 4, at least 5, or all amino acid substitutions, with respect to SEQ ID NO: 17, selected from: G294T, S166K, C472H, K252Q, V239I, A323P, I412V, I369T, K362D, and T250P. An exemplary engineered derivative of SEQ ID NO: 17 includes enzymes comprising the amino acid sequence of SEQ ID NO: 84.
In some embodiments, the heterologous enzyme pathway further comprises one or more uridine diphosphate-dependent glycosyltransferase (UGT) enzymes, thereby producing one or more glycosides of dammarenediol, protopanaxadiol and/or protopanaxatriol.
In some embodiments, the heterologous biosynthetic pathway further comprises a squalene synthase (SQS) enzyme as well as a squalene monooxygenase (SQE) producing 2,3-oxidosqualene from farnesyl diphosphate. The DDS enzyme acts on the 2,3-oxidosqualene substrate to produce the cyclic dammarenediol product.
In some embodiments, the biosynthetic pathway is expressed in a microbial host cell (such as a bacterium or yeast) that expresses an enzymatic pathway that produces iso-pentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). IPP and DMAPP substrates are used to produce farnesyl diphosphate. In some embodiments, the enzymatic pathway is a methylerythritol phosphate (MEP) pathway and/or a mevalonic acid (MVA) pathway.
In various embodiments, the microbial host cells are further suitable for production at commercial scale, which can include culturing in batch culture, continuous culture, or semi-continuous culture.
In still other aspects, the invention provides engineered DDS, PPDS, and PPTS enzymes providing improved productivities or stability in host cells, including for microbial production of dammarenediol, protopanaxadiol, and protopanaxatriol, and glycosylated forms thereof.
The dammarenediol, protopanaxadiol, protopanaxatriol, or ginsenoside derived therefrom obtained according to this disclosure may be incorporated into pesticide or insecticide composition. In some embodiments, the product is incorporated into a pharmaceutical composition for use as an active pharmaceutical agent having anti-inflammatory, anxiolytic, anti-stress, and anti-tumor activity. In some embodiments, the product is incorporate into food products (including beverages) and nutraceutical products.
Other aspects and embodiments of this disclosure will be apparent from the following Drawings and Detailed Description.
In accordance with various embodiments, the invention provides engineered microbial cells, enzymes, and methods for producing dammarenediol-II (“dammarenediol”) as well as compounds derived from dammarenediol, such as but not limited to protopanaxadiol and protopanaxatriol, and glycosylated forms thereof (“e.g., ginsenosides”). In accordance with the disclosure, microbial host cells are engineered to express a heterologous biosynthetic pathway that produces dammarenediol (or a derivative thereof). The heterologous pathway will generally comprise a dammarenediol synthase (DDS) enzyme (such as an engineered DDS described herein) which acts on 2,3-oxidosqualene substrate, and in various embodiments further comprises a protopanaxadiol synthase (PPDS) enzyme for production of protopanaxadiol (which can be an engineered PPDS described herein), and optionally a protopanaxatriol synthase (PPTS) enzyme for production of protopanaxatriol (which can be an engineered PPTS described herein). In some embodiments, the host cell can further express a heterologous uridine diphosphate-dependent glycosyltransferase (UGT) enzyme producing natural or non-natural glycosylated forms of dammarenediol, protopanaxadiol or protopanaxatriol, generally referred to as ginsenosides.
The biosynthetic pathways for dammarenediol, protopanaxadiol, and protopanaxatriol are illustrated in
Accordingly, in one aspect, the present disclosure provides a method for producing dammarenediol or a derivative thereof. The method comprises providing a microbial host cell expressing a heterologous biosynthetic pathway producing dammarenediol or a derivative thereof. The heterologous biosynthetic pathway in various embodiments comprises one or more of: a dammarenediol synthase (DDS) enzyme comprising an amino acid sequence that has at least about 70% sequence identity to the amino acid sequence of SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 9; a protopanaxadiol synthase (PPDS) enzyme comprising an amino acid sequence that has at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 16; and a protopanaxatriol synthase (PPTS) enzyme comprising an amino acid sequence that has at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, and SEQ ID NO: 21.
In another aspect, the present disclosure provides a microbial host cell producing dammarenediol or a derivative thereof. The microbial host cell expresses a heterologous biosynthetic pathway producing dammarenediol or a derivative thereof. In certain embodiments, the heterologous biosynthetic pathway comprises one or more of: a Dammarenediol Synthase (DDS) enzyme comprising an amino acid sequence that has at least about 70% sequence identity to the amino acid sequence of SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 9; a protopanaxadiol synthase (PPDS) enzyme comprising an amino acid sequence that has at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 16; and a protopanaxatriol synthase (PPTS) enzyme comprising an amino acid sequence that has at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, and SEQ ID NO: 21.
In still other aspects, the invention provides engineered DDS, PPDS, and PPTS enzymes providing improved productivities or stabilities in microbial host cells, including for microbial production of dammarenediol, protopanaxadiol, and protopanaxatriol, and glycosylated forms thereof.
DDS is a component of the biosynthetic pathway for dammarane-type triterpene saponins (e.g. ginsenosides or panaxosides), which is an oxidosqualene cyclase that produces specifically the 20S isomer of the triterpene dammarenediol II shown in
In some embodiments, the DDS enzyme comprises an amino acid sequence having at least about 80% sequence identity, or at least about 85% sequence identity, or at least about 90% sequence identity, or at least about 95% sequence identity, or at least 97% sequence identity, or at least 98% sequence identity, or at least 99% sequence identity to the amino acid sequence of SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 9.
In some embodiments, the DDS enzyme comprises one or more mutations that are designed to improve the stability, activity, expression and/or temperature resistance of the DDS enzyme in a microbial strain. In some embodiments, the DDS enzyme comprises one or more mutations that are designed to improve stability, activity, expression and/or temperature resistance of the DDS enzyme in a bacterial host (e.g., E. coli). In some embodiments, the DDS enzyme comprises one or more mutations that are designed to improve stability, activity, expression and/or temperature resistance DDS enzyme in a yeast host.
In some embodiments, the DDS enzyme comprises an amino acid sequence having at least about 70% sequence identity, or at least about 80% sequence identity, or at least about 85% sequence identity, or at least about 90% sequence identity, or at least 95% sequence identity, or at least 97% sequence identity, or at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises from 1 to 30, or from 1 to 20, or from 1 to 10, or from 1 to 5 amino acid modifications independently selected from substitutions, deletions, and insertions with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions at positions corresponding to the following positions of SEQ ID NO: 3: 30, 64, 68, 365, 369, 425, 461, 465, 468, 606, 628, and 632. In some embodiments, the DDS enzyme comprises at least 2, or at least 3, or at least 4, or more substitutions at positions selected from 30, 64, 68, 365, 369, 425, 461, 465, 468, 606, 628, and 632 with respect to SEQ ID NO: 3.
In some embodiments, the DDS enzyme comprises one or more substitutions at positions corresponding to positions selected from 606, 628, and 632 of SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from N606I, N606L, N606V, T628A, T628V, T628G, F632L, F632I, F632V, and F632A with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from N606I, T628A, and F632L with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises an amino acid sequence of SEQ ID NO: 5.
In some embodiments, the DDS enzyme comprising the substitutions selected from N606I, T628A, and F632L (with respect to SEQ ID NO: 3) comprises an amino acid sequence that otherwise has at least 95% sequence identity, or at least 97% sequence identity, or at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 5.
In some embodiments, the DDS enzyme comprises one or more substitutions at positions corresponding to positions selected from 365, 369, and 461 of SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from T365E, T365D, F369Y, F369W, R461T, and R461S with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises a substitution selected from T365E and T365D with respect to SEQ ID NO: 3. Additionally, or alternatively, the DDS enzyme comprises a substitution selected from F369Y and F369W with respect to SEQ ID NO: 3. Additionally, or alternatively, the DDS enzyme comprises a substitution selected from R461T and R461S with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from T365E, F369Y, and R461S with respect to SEQ ID NO: 3. For example, the DDS enzyme may comprise the amino acid sequence of SEQ ID NO: 6.
In some embodiments, the DDS enzyme comprises one or more substitutions at positions selected from 30, 64, and 68 with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from Q30D, Q30E, M64L, M64I, M64V, M64A, R68M, and R68T with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises a substitutions selected from Q30D and Q30E with respect to SEQ ID NO: 3. Additionally or alternatively, the DDS enzyme comprises a substitution selected from M64L, M64I, M64V, and M64A with respect to SEQ ID NO: 3. Additionally or alternatively, the DDS enzyme comprises a substitution selected from R68M and R68T with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from Q30D, M64L, and R68M with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises the amino acid sequence of SEQ ID NO: 7.
In some embodiments, the DDS enzyme comprises one or more substitutions at positions selected from 425, 465, and 467 with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from L465K, L465R, L465H, C468Y, C468F, C468W, I425G, I425V, and I425A with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises a substitution selected from L465K, L465R, and L465H with respect to SEQ ID NO: 3. Additionally, or alternatively, the DDS enzyme comprises a substitution selected from C468Y, C468F, and C468W with respect to SEQ ID NO: 3. Additionally, or alternatively, the DDS enzyme comprises a substitution selected from I425G, I425V, and I425A with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises one or more substitutions selected from L465K, C468Y, and I425A with respect to SEQ ID NO: 3. In some embodiments, the DDS enzyme comprises the amino acid sequence of SEQ ID NO: 8.
In some embodiments, the DDS enzyme comprises an amino acid sequence that has at least 90% sequence identity to SEQ ID NO: 7, and has one or more substitutions with respect to SEQ ID NO: 7 selected from: T364E, T364D, F368Y, R460S, R460T, L464K, L464R, C467Y, and I424A. For example, the DDS enzyme may have two, three, or more substitutions with respect to SEQ ID NO: 7 selected from: T364E, F368Y, R460S, L464K, C467Y, and I424A. In some embodiments, the DDS enzyme has at least the following substitutions with respect to SEQ ID NO: 7: T364E, F368Y, R460S, L464K, C467Y, and I424A. An exemplary engineered DDS enzyme has the amino acid sequence of SEQ ID NO: 81.
Accordingly, the present disclosure provides a DDS enzyme (including for use in the microbial host cells and methods of the disclosure), and which comprises an amino acid sequence that is at least 90% identical, or at least 95% identical, or at least 97% identical, or at least 98% identical to SEQ ID NO: 81. In various embodiments, the DDS enzyme comprises one or more mutations with respect to SEQ ID NO: 81 listed in Table 1. For example, the DDS enzyme may comprise two, three, four, five, or more mutations with respect to SEQ ID NO: 81 listed in Table 1. In exemplary embodiments, the DDS enzyme has one or more of the following mutations with respect to SEQ ID NO: 81: Y49F, S181T, deletion of amino acids L195-E197, S198P, E238S, I407V, D507E, R637K, and M695I. An exemplary engineered DDS enzyme comprises the amino acid sequence of SEQ ID NO: 82.
Accordingly, in some embodiments the DDS enzyme comprises an amino acid sequence that is at least 90% identical, or at least 95% identical, or at least 97% identical, or at least 98% identical to SEQ ID NO: 82. In some embodiments, the DDS enzyme comprises one or more mutations with respect to SEQ ID NO: 82 that are listed in Table 2. For example, the DDS enzyme may comprise two, three, four, five, or more mutations with respect to SEQ ID NO: 82 listed in Table 2. In various embodiments, the DDS enzyme comprises one or more mutations with respect to SEQ ID NO: 82 selected from: F649L, F649V, F649I, F649A, L548F, Q149E, Q149D, A120S, A120T, G573A, G573L, S380A, S380G, and A256G. Exemplary engineered DDS enzymes comprise two, three, four, five, or all mutations with respect to SEQ ID NO: 82 selected from: F649L, L548F, Q149E, A120S, G573A, S380A, and A256G.
An exemplary engineered DDS enzyme is represented by SEQ ID NO: 85. Thus, in some embodiments, the heterologous biosynthetic pathway comprises a DDS enzyme comprising an amino acid sequence that is at least 90%, or at least 95%, or at least 97%, or at least 98%, or at least 99% identical to SEQ ID NO: 85. For example, the DDS enzyme may have from 1 to 10 or from 1 to 5 amino acid modifications independently selected from substitutions, insertions, and deletions with respect to SEQ ID NO: 85.
Protopanaxadiol synthase (PPDS) is an oxidoreductase enzyme that converts dammarenediol to protopanaxadiol. Specifically, PPDS catalyzes the hydroxylation of dammarenediol at the C-12 position to yield protopanaxadiol as shown in
In some embodiments, the heterologous biosynthetic pathway comprises a PPDS enzyme comprising an amino acid sequence that has at least about 70% sequence identity to the amino acid sequence of SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 16. In some embodiments, the PPDS enzyme comprises an amino acid sequence having at least about 70%, or at least about 80%, or at least about 85%, or at least about 90%, or at least 95%, or at least 98%, or at least 99% sequence identity to an amino acid sequence selected from SEQ ID NOS: 10, 11, 12 and 16. In some embodiments, the PPDS enzyme comprises an amino acid sequence having at least about 70%, or at least about 80%, or at least about 85%, or at least about 90%, or at least 95%, or at least 98%, or at least 99% sequence identity to the amino acid sequence of SEQ ID NO: 10. In some embodiments, the PPDS enzyme comprises from 1 to 30, or from 1 to 20, or from 1 to 10, or from 1 to 5 amino acid modifications independently selected from substitutions, deletions, and insertions with respect to SEQ ID NO: 10.
In various embodiments, the PPDS enzyme comprises one or more substitutions (e.g., at least 2, 3, 4, or 5 substitutions) selected from N58D, N58E, S68P, R85K, I95V, L96F, L96W, L96Y, T108N, T108Q, D135E, M144L, M144V, M144I, V150P, G152A, G152L, G152I, G152V, M153L, M153I, M153V, M153A, F167H, S192A, S192G, E202P, I212F, R243K, R243H, V248I, V248L,N277D, N277E, Q278E, Q278D, L283M, L292I, L292V, L292A, F317L, F317I, F317V, F317A, V329M, N333K, N333R, K338G, K338A, L346I, R347Q, R347N, I362L, I362V, I362A, M390L, M390I, M390V, M390A, H482R, and H482K with respect to SEQ ID NO: 10. In various embodiments, the PPDS enzyme comprises at least 2, or at least 3, or at least 4, or at least 5, or at least 8, or at least 10 amino acid substitutions shown in Table 3.
In exemplary embodiments, the PPDS enzyme comprises at least one, two, three, four, five, six, seven, eight, or all of the following mutations with respect to SEQ ID NO: 10: T108N, I212F, K338G, D135E, S68P, V150P, F167H, L283M, H482R, R347Q, M390L, R243K, L292I, V329M, Q278E, and N58E.
An exemplary engineered PPDS enzyme is represented by SEQ ID NO: 83. Thus, in some aspects and embodiments the heterologous biosynthetic pathway comprises a PPDS enzyme that comprises the amino acid sequence of SEQ ID NO: 83, or comprises a PPTS enzyme comprising an amino acid sequence that is at least 90%, or at least 95%, or at least 97%, or at least 98%, or at least 99% identical to SEQ ID NO: 83. In some embodiments, the PPDS enzyme comprises an amino acid sequence that has from 1 to 10 or from 1 to 5 amino acid modifications from SEQ ID NO: 83, the modification being independently selected from substitutions, insertions, and deletions.
Protopanaxatriol synthase (PPTS) catalyzes the formation of protopanaxatriol from protopanaxadiol. PPTS is an oxidoreductase enzyme that catalyzes the hydroxylation of protopanaxadiol at the C-6 position to yield protopanaxatriol as shown in
In some embodiments, the heterologous biosynthetic pathway comprises a PPTS enzyme comprising an amino acid sequence that has at least about 70% sequence identity to an amino acid sequence selected from SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20 and SEQ ID NO: 21. In some embodiments, the heterologous biosynthetic pathway comprises a PPTS enzyme comprising an amino acid sequence that has at least about 70% sequence identity, or at least about 80% sequence identity, or at least about 85% sequence identity, or at least about 90% sequence identity, or at least 95% sequence identity, or at least 97% sequence identity, or at least 98% sequence identity to an amino acid sequence selected from SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20 and SEQ ID NO: 21. In some embodiments, the PPTS enzyme comprises an amino acid sequence having at least about 70%, or at least about 80%, or at least about 85%, or at least about 90%, or at least 95%, or at least 98%, or at least 99% sequence identity to the amino acid sequence of SEQ ID NO: 17. In some embodiments, the PPTS enzyme comprises from 1 to 30, or 1 to 20, or from 1 to 10, or from 1 to 5 amino acid modifications independently selected from substitutions, deletions, and insertions with respect to SEQ ID NO: 17.
In some embodiments, the PPTS enzyme comprises one or more substitutions (e.g., at least 2, 3, 4, or 5 substitutions) with respect to SEQ ID NO: 17 selected from: I98L, I98V, I98A, A113S, A120S, A120T, K146R, F147Y, S166K, S166R, E176K, E176R, W185R, W185K, L187F, L187Y, L187W, L215I, F217L, F217I, F217V, F217A, V239I, V239L, V239A, R244K, K247L, K247V, K247I, K247A, Q249E, Q249D, T250P, K252Q, K252N, M259L, M259I, M259V, V278I, G294T, G294S, A323P, E324G, E324A, S328N, S328Q, R334K, V358E, V358D, V359A, V359G, K362D, K362E, S364T, N367G, N367A, I369T, I369S, K391P, M407A, M407G, F409Y, I412V, F426Y, V431I, V431L, N463K, N463R, and C472H.
An exemplary engineered PPTS enzyme is represented by SEQ ID NO: 84. Thus, in some aspects and embodiments the heterologous biosynthetic pathway comprises a PPTS enzyme that comprises the amino acid sequence of SEQ ID NO: 84, or comprises a PPTS enzyme comprising an amino acid sequence that is at least 90%, or at least 95%, or at least 97%, or at least 98%, or at least 99% identical to SEQ ID NO: 84. In some embodiments, the PPDS enzyme comprises an amino acid sequence that has from 1 to 10 or from 1 to 5 amino acid modifications from SEQ ID NO: 84, the modification being independently selected from substitutions, insertions, and deletions.
In some embodiments, the PPTS enzyme comprises at least 2, or at least 3, or at least 4, or at least 5, or at least 8, or at least 10 amino acid substitutions with respect to SEQ ID NO: 17 listed in Table 4. Exemplary PPTS enzymes comprise at least 2, at least 3, at least 4, at least 5, or all amino acid substitutions (with respect to SEQ ID NO: 17) selected from: G294T, S166K, C472H, K252Q, V239I, A323P, I412V, I369T, K362D, and T250P.
In some embodiments, the heterologous enzyme pathway further comprises one or more uridine diphosphate-dependent glycosyltransferase (UGT) enzymes, thereby producing one or more glycosides of dammarenediol, protopanaxadiol or protopanaxatriol that are shown in
In some embodiments, dammarenediol is monoglycosylated. In some embodiments, the dammarenediol is diglycosylated. In some embodiments, dammarenediol is glycosylated at C3-OH and/or C20-OH. In some embodiments, the dammarenediol glycosides comprise 33-O glucosylation and/or 20S—O glucosylation. In some embodiments, the dammarenediol glycosides comprise one or more branching glycosylations.
In some embodiments, protopanaxadiol is monoglycosylated. In some embodiments, the protopanaxadiol is diglycosylated. In some embodiments, protopanaxadiol is glycosylated at C3-OH and/or C20-OH. In some embodiments, the protopanaxadiol glycosides comprise one or more branching glycosylations.
In some embodiments, protopanaxatriol is monoglycosylated. In some embodiments, the protopanaxatriol is diglycosylated. In some embodiments, the protopanaxatriol is triglycosylated. In some embodiments, protopanaxatriol is glycosylated at C3-OH, C6-OH, and/or C20-OH. In some embodiments, the protopanaxatriol glycosides comprise one or more branching glycosylations.
In some embodiments, the microbial host cell is capable of producing dammarenediol, protopanaxadiol or protopanaxatriol as a substrate for glycosylation by one or more UGT enzymes. In some embodiments, the UGT enzyme(s) are capable of catalyzing glycosylation of C3-OH and/or C20-OH of dammarenediol. In some embodiments, the UGT enzyme(s) are capable of catalyzing glycosylation of C3-OH and/or C20-OH of protopanaxadiol. In some embodiments, the UGT enzyme(s) are capable of catalyzing glycosylation of C3-OH, C6-OH, and/or C20-OH of protopanaxatriol. For example, in some embodiments, the microbial cell expresses at least one, or at least two, or at least three UGT enzymes, resulting in glucosylation of dammarenediol, protopanaxadiol or protopanaxatriol. Exemplary UGT enzymes that can glycosylate a triterpenoid core include those described in WO 2021/126960, which is hereby incorporated by reference in its entirety. In some embodiments, the UGT enzyme(s) further catalyze one or more branching glycosylations, such 1-2, 1-3, and 1-6 branching glycosylations. In various embodiments, the glycosylation reaction transfers monosaccharide units selected from glucosyl, arabinosyl, furanosyl, rhamnosyl, and xylosyl.
In some embodiments, the heterologous biosynthetic pathway further comprises a squalene synthase (SQS) enzyme, catalyzing synthesis of squalene from farnesyl diphosphate. In some embodiments, the SQS comprises an amino acid sequence having at least 70%, or at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98%, or at least 99% sequence identity to an amino acid sequence selected from SEQ ID NOs: 1 and 23-38.
In some embodiments, the SQS comprises an amino acid sequence that is at least 70% identical to Artemisia annua SQS (SEQ ID NO: 1). For example, the SQS may comprise an amino acid sequence that is at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98%, or at least 99% identical to SEQ ID NO: 1. In some embodiments, the SQS comprises an amino acid sequence having from 1 to 20 amino acid modifications or from 1 to 10 amino acid modifications with respect to SEQ ID NO: 1, the amino acid modifications being independently selected from amino acid substitutions, deletions, and insertions. Amino acid modifications may be made to increase expression or stability of the enzyme in the microbial cell, or to increase productivity of the enzyme. AaSQS has high activity in E. coli.
In some embodiments, the SQS comprises an amino acid sequence that is at least 70% identical to Siraitia grosvenorii SQSa (SEQ ID NO: 23). For example, the SQS may comprise an amino acid sequence that is at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98%, or at least 99% identical to SEQ ID NO: 23. In some embodiments, the SQS comprises an amino acid sequence having from 1 to 20 amino acid modifications or from 1 to 10 amino acid modifications with respect to SEQ ID NO: 23, the amino acid modifications being independently selected from amino acid substitutions, deletions, and insertions. Amino acid modifications may be made to increase expression or stability of the enzyme in the microbial cell, or to increase productivity of the enzyme. SgSQSa has high activity in E. coli.
In some embodiments, the SQS comprises an amino acid sequence that is at least 70% identical to Siraitia grosvenorii SQSb (SEQ ID NO: 24). For example, the SQS may comprise an amino acid sequence that is at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98%, or at least 99% identical to SEQ ID NO: 24. In some embodiments, the SQS comprises an amino acid sequence having from 1 to 20 amino acid modifications or from 1 to 10 amino acid modifications with respect to SEQ ID NO: 24, the amino acid modifications being independently selected from amino acid substitutions, deletions, and insertions. Amino acid modifications may be made to increase expression or stability of the enzyme in the microbial cell, or to increase productivity of the enzyme. SgSQSb has high activity in E. coli.
Amino acid modifications to the SQS enzyme can be guided by available enzyme structures and homology models, including those described in Aminfar and Tohidfar, In silico analysis of squalene synthase in Fabaceae family using bioinformatics tools, J. Genetic Engineer. and Biotech. 16 (2018) 739-747. The publicly available crystal structure for HsSQE (PDB entry: 6C6N) may be used to inform amino acid modifications.
In some embodiments, the heterologous biosynthetic pathway further comprises a squalene epoxidase (SQE) producing 2,3-oxidosqualene. In some embodiments, the SQE comprises an amino acid sequence having at least 70%, or at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98%, or at least 99% sequence identity to an amino acid sequence selected from SEQ ID NOs: 2 and 39-70. In some embodiments, the SQE enzyme comprises an amino acid sequence having at least 70%, or at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98%, or at least 99% sequence identity to the amino acid sequence of SEQ ID NO: 2.
Amino acid modifications can be guided by available enzyme structures and homology models, including those described in Padyana A K, et al., Structure and inhibition mechanism of the catalytic domain of human squalene epoxidase, Nat. Comm. (2019) Vol. 10(97): 1-10; or Ruckenstulh et al., Structure-Function Correlations of Two Highly Conserved Motifs in Saccharomyces cerevisiae Squalene Epoxidase, Antimicrob. Agents and Chemo. (2008) Vol. 52(4): 1496-1499.
In some embodiments, the microbial host cell expresses an enzymatic pathway that produces iso-pentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). In some embodiments, the enzymatic pathway is a methylerythritol phosphate (MEP) pathway and/or a mevalonic acid (MVA) pathway.
In some embodiments, the host cell is a bacterial host cell engineered to increase production of IPP and DMAPP from glucose as described in U.S. Pat. Nos. 10,480,015 and 10,662,442, the contents of which are hereby incorporated by reference in their entireties. For example, in some embodiments the host cell overexpresses MEP pathway enzymes, with balanced expression to push/pull carbon flux to IPP and DMAPP. In some embodiments, the host cell is engineered to increase the availability or activity of Fe—S cluster proteins, so as to support higher activity of IspG and IspH, which are Fe—S enzymes. In some embodiments, the host cell is engineered to overexpress IspG and IspH, so as to provide increased carbon flux to 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate (HMBPP) intermediate, but with balanced expression to prevent accumulation of HMBPP at an amount that reduces cell growth or viability, or at an amount that inhibits MEP pathway flux and/or terpenoid production. In some embodiments, the host cell exhibits higher activity of IspH relative to IspG. In some embodiments, the host cell is engineered to downregulate the ubiquinone biosynthesis pathway, e.g., by reducing the expression or activity of IspB, which uses IPP and FPP substrate.
The microbial cell will produce MEP or MVA products, which act as substrates for the heterologous enzyme pathway. The MEP (2-C-methyl-D-erythritol 4-phosphate) pathway, also called the MEP/DOXP (2-C-methyl-D-erythritol 4-phosphate/1-deoxy-D-xylulose 5-phosphate) pathway or the non-mevalonate pathway or the mevalonic acid-independent pathway refers to the pathway that converts glyceraldehyde-3-phosphate and pyruvate to IPP and DMAPP. The pathway, which is present in bacteria, typically involves action of the following enzymes: 1-deoxy-D-xylulose-5-phosphate synthase (Dxs), 1-deoxy-D-xylulose-5-phosphate reductoisomerase (IspC), 4-diphosphocytidyl-2-C-methyl-D-erythritol synthase (IspD), 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase (IspE), 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase (IspF), 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase (IspG), and isopentenyl diphosphate isomerase (IspH). The MEP pathway, and the genes and enzymes that make up the MEP pathway, are described in U.S. Pat. No. 8,512,988, which is hereby incorporated by reference in its entirety. For example, genes that make up the MEP pathway include dxs, ispC, ispD, ispE, ispF, ispG, ispH, idi, and ispA. In some embodiments, the host cell expresses or overexpresses one or more of dxs, ispC, ispD, ispE, ispF, ispG, ispH, idi, ispA, or modified variants thereof, which results in the increased production of IPP and DMAPP. In some embodiments, the FPP substrate is produced at least in part by metabolic flux through an MEP pathway, and wherein the host cell has at least one additional gene copy of one or more of dxs, ispC, ispD, ispE, ispF, ispG, ispH, idi, ispA, or modified variants thereof.
The MVA pathway refers to the biosynthetic pathway that converts acetyl-CoA to IPP. The mevalonate pathway, which will be present in yeast, typically comprises enzymes that catalyze the following steps: (a) condensing two molecules of acetyl-CoA to acetoacetyl-CoA (e.g., by action of acetoacetyl-CoA thiolase); (b) condensing acetoacetyl-CoA with acetyl-CoA to form hydroxymethylglutaryl-CoenzymeA (HMG-CoA) (e.g., by action of HMG-CoA synthase (HMGS)); (c) converting HMG-CoA to mevalonate (e.g., by action of HMG-CoA reductase (HMGR)); (d) phosphorylating mevalonate to mevalonate 5-phosphate (e.g., by action of mevalonate kinase (MK)); (e) converting mevalonate 5-phosphate to mevalonate 5-pyrophosphate (e.g., by action of phosphomevalonate kinase (PMK)); and (f) converting mevalonate 5-pyrophosphate to isopentenyl pyrophosphate (e.g., by action of mevalonate pyrophosphate decarboxylase (MPD)). The MVA pathway, and the genes and enzymes that make up the MVA pathway, are described in U.S. Pat. No. 7,667,017, which is hereby incorporated by reference in its entirety. In some embodiments, the host cell expresses or overexpresses one or more of acetoacetyl-CoA thiolase, HMGS, HMGR, MK, PMK, and MPD or modified variants thereof, which results in the increased production of IPP and DMAPP. In some embodiments, FPP substrate is produced at least in part by metabolic flux through an MVA pathway, and wherein the host cell has at least one additional gene copy of one or more of acetoacetyl-CoA thiolase, HMGS, HMGR, MK, PMK, MPD, or modified variants thereof.
In various embodiments, the microbial cells further express one or more farnesyl diphosphate synthase (FPPS) enzymes. An exemplary enzyme is shown herein as SEQ ID NO: 80. Numerous other FPPS enzymes are well known in the art and the selection of which is not critical.
In still other embodiments, microbial cells expressing the heterologous biosynthesis pathway co-express an isoprenol utilization pathway as described in US 2019/0367950, which is hereby incorporated by reference in its entirety. Such cells can produce IPP and DMAPP precursors from prenol and/or isoprenol substrate provided to the culture.
The microbial host cell in various embodiments may be prokaryotic or eukaryotic. In some embodiments, the microbial host cell is a bacterium, and which can be optionally selected from Escherichia spp., Bacillus spp., Corynebacterium spp., Rhodobacter spp., Zymomonas spp., Vibrio spp., and Pseudomonas spp. For example, in some embodiments, the bacterial host cell is a species selected from Escherichia coli, Bacillus subtilis, Corynebacterium glutamicum, Rhodobacter capsulatus, Rhodobacter sphaeroides, Zymomonas mobilis, Vibrio natriegens, or Pseudomonas putida. In some embodiments, the bacterial host cell is E. coli. Alternatively, the microbial cell may be a yeast cell, such as but not limited to a species of Saccharomyces, Pichia, or Yarrowia, including Saccharomyces cerevisiae, Pichia pastoris, and Yarrowia lipolytica.
In some embodiments, the microbial host cell is cultured in a carbon source comprising glucose, sucrose, fructose, xylose, and/or glycerol. In some embodiments, culture conditions are selected from aerobic, microaerobic, and anaerobic. In some embodiments, the microbial host cell is cultured at a temperature in the range of about 22° C. to about 37° C., or about 27° C. to about 37° C., or about 30° C. to about 37° C. In some embodiments, dammarenediol, protopanaxadiol, protopanaxatriol or glycosylated derivatives thereof are recovered from the culture.
In various embodiments, the microbial host cell may be cultured at a temperature between 22° C. and 37° C. While commercial biosynthesis in bacteria such as E. coli can be limited by the temperature at which overexpressed and/or foreign enzymes (e.g., enzymes derived from plants) are stable, recombinant enzymes may be engineered to allow for cultures to be maintained at higher temperatures, resulting in higher yields and higher overall productivity. In some embodiments, the culturing is conducted at about 22° C. or greater, about 23° C. or greater, about 24° C. or greater, about 25° C. or greater, about 26° C. or greater, about 27° C. or greater, about 28° C. or greater, about 29° C. or greater, about 30° C. or greater, about 31° C. or greater, about 32° C. or greater, about 33° C. or greater, about 34° C. or greater, about 35° C. or greater, about 36° C. or greater, or about 37° C.
In some embodiments, the microbial host cells are further suitable for commercial production, at commercial scale. In some embodiments, the size of the culture is at least about 100 L, at least about 200 L, at least about 500 L, at least about 1,000 L, or at least about 10,000 L, or at least about 100,000 L, or at least about 500,000 L, or at least about 600,000 L. In an embodiment, the culturing may be conducted in batch culture, continuous culture, or semi-continuous culture.
In various embodiments, methods further include recovering the product from the cell culture or from cell lysates. In some embodiments, the culture produces at least about 100 mg/L, or at least about 200 mg/L, or at least about 500 mg/L, or at least about 1 g/L, or at least about 2 g/L, or at least about 5 g/L, or at least about 10 g/L, or at least about 20 g/L, or at least about 30 g/L, or at least about 40 g/L of the terpenoid or terpenoid glycoside product.
In some embodiments, the production of indole (including prenylated indole) is used as a surrogate marker for terpenoid production, and/or the accumulation of indole in the culture is controlled to increase production. For example, in various embodiments, accumulation of indole in the culture is controlled to below about 100 mg/L, or below about 75 mg/L, or below about 50 mg/L, or below about 25 mg/L, or below about 10 mg/L. The accumulation of indole can be controlled by balancing protein expression and activity using the multivariate modular approach as described in U.S. Pat. No. 8,927,241 (which is hereby incorporated by reference), and/or is controlled by chemical means.
Manipulation of the expression of genes and/or proteins, including gene modules, can be achieved through various methods. For example, expression of the genes or operons can be regulated through selection of promoters, such as inducible or constitutive promoters, with different strengths (e.g., strong, intermediate, or weak). Several non-limiting examples of promoters of different strengths include Trc, T5 and T7. Additionally, expression of genes or operons can be regulated through manipulation of the copy number of the gene or operon in the cell. In some embodiments, expression of genes or operons can be regulated through manipulating the order of the genes within a module, where the genes transcribed first are generally expressed at a higher level. In some embodiments, expression of genes or operons is regulated through integration of one or more genes or operons into the chromosome.
Optimization of protein expression can also be achieved through selection of appropriate promoters and ribosomal binding sites. In some embodiments, this may include the selection of high-copy number plasmids, or single-, low- or medium-copy number plasmids. The step of transcription termination can also be targeted for regulation of gene expression, through the introduction or elimination of structures such as stem-loops.
Expression vectors containing all the necessary elements for expression are commercially available and known to those skilled in the art. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989. Cells are genetically engineered by the introduction into the cells of heterologous DNA. The heterologous DNA is placed under operable control of transcriptional elements to permit the expression of the heterologous DNA in the host cell.
In some embodiments, endogenous genes are edited, as opposed to gene complementation. Editing can modify endogenous promoters, ribosomal binding sequences, or other expression control sequences, and/or in some embodiments modifies trans-acting and/or cis-acting factors in gene regulation. Genome editing can take place using CRISPR/Cas genome editing techniques, or similar techniques employing zinc finger nucleases and TALENs. In some embodiments, the endogenous genes are replaced by homologous recombination.
In some embodiments, genes are overexpressed at least in part by controlling gene copy number. While gene copy number can be conveniently controlled using plasmids with varying copy number, gene duplication and chromosomal integration can also be employed. For example, a process for genetically stable tandem gene duplication is described in US 2011/0236927, which is hereby incorporated by reference in its entirety.
The terpene or terpenoid product can be recovered by any suitable process. For example, the aqueous phase can be recovered, and/or the whole cell biomass can be recovered, for further processing. The desired product can be produced in batch or continuous bioreactor systems.
For example, products may be recovered from the reaction or culture, which can include adjusting the pH and/or temperature of the reaction or culture, and optionally adding one or more solubilizers, followed by enzyme or biomass removal. Biomass and/or enzymes can be removed by centrifugation, thereby preparing a clarified broth. An exemplary process for biomass removal employs a disc stack centrifuge to separate liquid and solid phases. The clarified broth (liquid phase) is recovered for further processing. In some embodiments, products are crystallized from the clarified broth, and/or may be purified from the clarified broth using one or more processes selected from filtration, ion exchange, activated charcoal, bentonite, affinity chromatography, and digestion, which can optionally be conducted prior to crystallization and/or prior to recrystallization. In some embodiments, the recovery process can include one or more steps of tangential flow filtration (TFF). Exemplary processes for recovery of glycosylated products are described in WO 2022/115527, which is hereby incorporated by reference in its entirety. Other process for product recovery, including for recovery of triterpenoids (such as squalene derivatives) is described in US 2021/0207078, which is hereby incorporated by reference in its entirety.
The similarity of nucleotide and amino acid sequences, i.e. the percentage of sequence identity, can be determined via sequence alignments. Such alignments can be carried out with several art-known algorithms, such as with the mathematical algorithm of Karlin and Altschul (Karlin & Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877), with hmmalign (HMMER package, http://hmmer.wustl.edu/) or with the CLUSTAL algorithm (Thompson, J. D., Higgins, D. G. & Gibson, T. J. (1994) Nucleic Acids Res. 22, 4673-80). The grade of sequence identity (sequence matching) may be calculated using e.g. BLAST, BLAT or BlastZ (or BlastX). A similar algorithm is incorporated into the BLASTN and BLASTP programs of Altschul et al (1990) J. Mol. Biol. 215: 403-410. BLAST polynucleotide searches can be performed with the BLASTN program, score=100, word length=12.
BLAST protein searches may be performed with the BLASTP program, score=50, word length=3. To obtain gapped alignments for comparative purposes, Gapped BLAST is utilized as described in Altschul et al (1997) Nucleic Acids Res. 25: 3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs are used. Sequence matching analysis may be supplemented by established homology mapping techniques like Shuffle-LAGAN (Brudno M., Bioinformatics 2003b, 19 Suppl 1:154-162) or Markov random fields.
“Conservative substitutions” may be made, for instance, on the basis of similarity in polarity, charge, size, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the amino acid residues involved. The 20 naturally occurring amino acids can be grouped into the following six standard amino acid groups:
As used herein, “conservative substitutions” are defined as exchanges of an amino acid by another amino acid listed within the same group of the six standard amino acid groups shown above. For example, the exchange of Asp by Glu retains one negative charge in the so modified polypeptide. In addition, glycine and proline may be substituted for one another based on their ability to disrupt α-helices. Some preferred conservative substitutions within the above six groups are exchanges within the following sub-groups: (i) Ala, Val, Leu and Ile; (ii) Ser and Thr; (ii) Asn and Gln; (iv) Lys and Arg; and (v) Tyr and Phe.
As used herein, “non-conservative substitutions” are defined as exchanges of an amino acid by another amino acid listed in a different group of the six standard amino acid groups (1) to (6) shown above.
Modifications of enzymes as described herein can include conservative and/or non-conservative mutations. In some embodiments, an Alanine is substituted or inserted at position 2, to increase stability.
In some embodiments “rational design” is involved in constructing specific mutations in enzymes. Rational design refers to incorporating knowledge of the enzyme, or related enzymes, such as its reaction thermodynamics and kinetics, its three dimensional structure, its active site(s), its substrate(s) and/or the interaction between the enzyme and substrate, into the design of the specific mutation. Based on a rational design approach, mutations can be created in an enzyme which can then be screened for increased production of a terpene or terpenoid relative to control levels. In some embodiments, mutations can be rationally designed based on homology modeling. As used herein, “homology modeling” refers to the process of constructing an atomic resolution model of one protein from its amino acid sequence and a three-dimensional structure of a related homologous protein.
The dammarenediol, protopanaxadiol, protopanaxatriol, or ginsenoside derived therefrom obtained according to this disclosure can be incorporated into pesticide or insecticide compositions. In some embodiments, the product is incorporated into a pharmaceutical composition for use as an active pharmaceutical agent having anti-inflammatory, anxiolytic, anti-stress, and anti-tumor activity. In some embodiments, the product is incorporate into food products (including beverages) and nutraceutical products.
As used in this specification and the appended claims, the singular forms “a”, “an” and “the” include plural referents unless the content clearly dictates otherwise. For example, reference to “a cell” includes a combination of two or more cells, and the like.
As used herein, the term “about” in reference to a number is generally taken to include numbers that fall within a range of 10% in either direction (greater than or less than) of the number.
Protopanaxatriol can be produced by biosynthetic fermentation processes using microbial strains that produce high levels of MVA or MEP pathway products, along with heterologous expression of the biosynthesis enzymes. For example, in bacteria such as E. coli, isopentenyl pyrophosphate (IPP) and dimethylallyl pyrophosphate (DMAPP) can be produced from glucose or other carbon substrates, and converted to farnesyl diphosphate (FPP) by recombinant farnesyl diphosphate synthase (FPPS).
An E. coli strain that has high MEP pathway flux may be used (see U.S. Pat. Nos. 10,662,442 and 10,480,015, which are hereby incorporated by reference), to direct the MEP pathway products to protopanaxatriol.
In these experiments, an SQS enzyme and a SQE enzyme were co-expressed with a DDS enzyme in an E coli strain producing farnesyl pyrophosphate (FPP), and the production of squalene, 2,3-oxidosqualene and dammarenediol was quantified by GC-FID chromatography using authentic standards of each compound. Dammarenediol productions were verified by GC-MS spectrum analysis. The SQS enzyme (designated SQS1) is shown herein as SEQ ID NO: 1. The SQE enzyme (designated SQE1) is shown herein as SEQ ID NO: 2. The FPPS enzyme is shown herein as SEQ ID NO: 80. Candidate DDS enzymes include those designated as DDS1 (SEQ ID NO: 3) and DDS7 (SEQ ID NO: 9).
E. coli strains were incubated at 30° C. for 72 hr. The titers of squalene, oxidosqualene, dammarenediol and total triterpenoids were plotted. Dammarenediol-II productions were verified by GC-MS spectrum analysis. As shown in
Two DDS enzymes, DDS2 (SEQ ID NO: 4) and DDS1 (SEQ ID NO: 3), were expressed in the E. coli strain and levels of the dammarenediol and intermediates were compared. These strains were incubated at 30° C. for 72 hr, and squalene, oxidosqualene and dammarenediol were quantified. The titers of squalene, oxidosqualene, dammarenediol and total triterpenoids were plotted. As shown in
These results demonstrate that a bacterial strain co-expressing FPPS, a squalene synthase (SQS), a squalene epoxidase (SQE) and a dammarenediol synthase (DDS) can produce dammarenediol.
To improve the production of dammarenediol, DDS1 enzyme was engineered for improved production of dammarenediol. Specifically, the following DDS derivatives were constructed:
These derivatives were expressed in MEP-pathway engineered E. coli expressing FPPS, SQS, and SQE enzymes. These strains were incubated at 30° C. for 72 h, and squalene, oxidosqualene and dammarenediol were quantified by GC-FID chromatography using authentic standards. As shown in
The DDS1 derivative (Pq.DDS1) (SEQ ID NO: 80) was tested alongside DDS5. Pq.DDS1 incorporates the mutations T364E, F368Y, R460S, L464K, C467Y, and I424A relative to DDS5. E. coli strains expressing SQS1, SQE1 and Pq.DDS1 (or DDS5) were incubated at 37° C. for 72 hrs. The relative titer of strains expressing SQS1-SQE1-Pq.DDS1 is shown relative to strains expressing SQS1-SQE1-DDS5 (
Derivatives of Pq.DDS1 were created. Strains expressing SQS1, SQE1, and Pq.DDS1 derivatives were incubated at 37° C. for 72 hours. Dammarendiol levels were quantified by GC-FID chromatography using authentic standards. The fold improvement relative to Pq.DDS1 for each derivative is shown in Table 1. L195Del3 refers to the deletion of 3 residues (L195-E197) in Pq.DDS1.
The DDS1 derivative (Pq.DDS2) (SEQ ID NO: 81) was tested alongside Pq.DDS1. Pq.DDS2 incorporates the mutations Y49F, S181T, L195Del3, S198P, E238S, I407V, D507E, R637K, and M695I relative to Pq.DDS1. E. coli strains expressing SQS1, SQE1 and Pq.DDS2 (or Pq.DDS1) were incubated at 37° C. for 72 hrs. The relative titer of strains expressing SQS1-SQE1-Pq.DDS2 is shown relative to strains expressing SQS1-SQE1-Pq.DDS1 (
Derivatives of Pq.DDS2 were created. Strains expressing SQS1, SQE1, and Pq.DDS2 derivatives were incubated at 37° C. for 72 hours. Dammarenediol levels were quantified by GC-FID chromatography using authentic standards. The fold improvement relative to Pq.DDS2 for each derivative is shown in Table 2.
The DDS1 derivative (Pq.DDS3) was tested alongside Pq.DDS2. Pq.DDS3 incorporates the mutations F649L, L548F, Q149E, A120S, G573A, S380A, and A256G relative to Pq.DDS2. E. coli strains expressing SQS1, SQE1 and Pq.DDS3 (or Pq.DDS2) were incubated at 37° C. for 72 hrs. The relative titer of strains expressing SQS1-SQE1-Pq.DDS3 is shown relative to strains expressing SQS1-SQE1-Pq.DDS2 (
The strain producing dammarenediol (expressing DDS2) was further engineered to express a protopanaxadiol synthase (PPDS). Seven different PPDS enzymes PPDS1 to PPDS7 (SEQ ID NOs: 10-16) were expressed along with a cytochrome P450 reductase (CPR1). PPDS enzymes contained a truncation of the native N-terminus, which was replaced by an E. coli membrane anchor as described in U.S. Pat. No. 10,774,314, which is hereby incorporated by reference. The strains were incubated at 30° C. for 72 hr, and dammarenediol and protopanaxadiol were quantified by GC-FID chromatography using authentic standards of each compound. Titers of dammarenediol and protopanaxadiol produced by each strain were plotted. As shown in
To improve the production of protopanaxadiol, PPDS1 enzyme substitutions were screened, as shown in Table 3 (“PPDS1 derivatives”). Strains were incubated at 37° C. for 72 h, and protopanaxadiol levels were quantified by GC-FID chromatography using authentic standards. The fold improvement relative to PPDS1 (wild type) is shown in Table 3. A PPDS1 variant was produced (Pg.PPDS1) which incorporated the mutations T108N, I212F, K338G, D135E, S68P, V150P, F167H, L283M, H482R, R347Q, M390L, R243K, L292I, V329M, Q278E, and N58E relative to PPDS1. Strains expressing SQS1, SQE1, Pq.DDS3, CPR1, Pg.PPDS1 (or PPDS1) were incubated at 37° C. for 72 hrs. As shown in
An E. coli strain producing protopanaxadiol was further engineered to express a protopanaxatriol synthase (PPTS). Like with PPDS, PPTS was engineered to include an N-terminal membrane anchor from an E. coli inner membrane protein. Two different PPTS enzymes PPTS1 (SEQ ID NO: 17) and PPTS2 (SEQ ID NO: 18) were expressed along with a cytochrome P450 reductase (CPR1). The strains expressing (1) PPDS1, CPR1 and PPTS1 and (2) PPDS1, CPR1 and PPTS2 were incubated at 30° C. for 72 hr. Dammarenediol, protopanaxadiol and protopanaxatriol were quantified by GC-FID chromatography using authentic standards of each compound and plotted. As shown in
To improve the production of protopanaxatriol, PPTS1 enzyme was engineered by screening amino acid substitutions as shown in Table 4 (“PPTS1 derivatives”). Strains were incubated at 37° C. for 72 h, and protopanaxatriol levels were quantified by GC-FID chromatography using authentic standards. The fold improvements relative to PPTS1 (wild type) are shown in Table 4. A Pg.PPTS2 was created incorporated the mutations G294T, S166K, C472H, K252Q, V239I, A323P, I412V, I369T, K362D, and T250P relative to PPTS1. Strains expressing SQS1, SQE1, Pq.DDS3, CPR1, Pg.PPDS1, and Pg.PPTS2 (or PPTS1) were incubated at 37° C. for 72 hours. As shown in
Saccharomyces cerevisiae FPPS
Artemisia annua squalene synthase (SQS1)
Siraitia grosvenorii SQSa
Siraitia grosvenorii SQSb
Cucumis sativus
Cucumis melo
Cucumis melo
Cucurbita moschata
Sechium edule
Panax quinquefolius
Malus domestica
Glycine soja
Diospyros kaki
Camellia oleifera
Eleutherococcus senticosus
Flavobacteriales bacterium
Bacteroidetes bacterium
Methylomonas lenta (SQE1)
Siraitia grosvenorii SQE1
Siraitia grosvenorii SQE2
Cucurbita maxima
Cucurbita moschata
Cucurbita moschata
Cucumis sativus
Cucumis melo
Cucurbita maxima
Ziziphus jujube
Morus alba
Juglans regia (JrSQE1)
Cucumis melo
Cucumis sativus
Juglans regia (JrSQE2)
Theobroma cacao
Cucurbita moschata
Phaseolus vulgaris
Hevea brasiliensis
Zea mays
Medicago sativa
Bathymodiolus azoricus Endosymbiont
Methyloprofundus sediment
Methylomicrobium buryatense
Ononis spinosa SQEL
Ononis spinosa SQE2
Medicago truncatula SQE1
Medicago truncatula SQE2
Hypholoma sublateritium SQE
Hypholoma sublateritium SQE2
Hypholoma sublateritium SQE3
Panax quinquefolius DDS (DDS1)
Panax ginseng DDS (DDS2)
Panax vietnamensis DDS (DDS7)
Panax quinquefolius DDS (Pq.DDS1)
Panax ginseng PPDS (PPDS1)
Panax notoginseng PPDS (PPDS2)
Panax notoginseng PPDS (PPDS3)
Kalopanax septemlobus PPDS (PPDS4)
Eleutherococcus senticosus PPDS (PPDS5)
Camellia sinensis PPDS (PPDS6)
Panax quinquefolius PPDS (PPDS7)
Panax ginseng PPTS (PPTS1)
Panax quinquefolius PPTS (PPTS2)
Panax notoginseng PPTS (PPTS3)
Eleutherococcus senticosus PPTS (PPTS4)
Panax notoginseng PPTS (PPTS5)
Camptotheca acuminate Cytochrome
Stevia rebaudiana (SrCPR1)
Arabidopsis thaliana CPR1 (AtCPR1)
Arabidopsis thaliana CPR2 (AtCPR2)
Arabidopsis thaliana (AtCPR3)
Stevia rebaudiana CPR2 (SrCPR2)
Stevia rebaudiana CPR3 (SrCPR3)
Artemisia annua CPR
Camptotheca acuminate CaCPR
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US22/76194 | 9/9/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63242212 | Sep 2021 | US |