The Cannabaceae family of plants produces numerous different cannabinoids (>=120) in variable, relative quantities over a 7-10 week flowering period. Many of these cannabinoids have been and are currently being explored as therapeutics in chordates (e.g., mammals), and as a result, they are largely approved for medical and/or recreational use in the United States (Abrams D I Eur J Int Med 2018, 49, 7-11). Specifically, the most sought after (phyto)cannabinoids are (i) tetrahydrocannabinolic acid (THCA); (ii) cannabidiolic acid (CBDA), and cannabichromenic acid (CBCA). These phytocannabinoids as well as their associated chemical analogs (e.g., THCVA, CBDVA, and CBCVA) are all biosynthesized in various quantities from the same pre-cursor in the cannabinoid biosynthetic pathway, which is cannabigerolic acid and cannabigerovarinic acid (i.e., CBGA and CBGVA, respectively). Thus, to mass produce any specific phytocannabinoid (e.g., THC(V)A/CBD(V)A/CBC(V)A/etc), both the rate and total quantity of biosynthesized CBG(V)A must be increased. The biosynthesis of these hydrophobic compounds and their on-pathway intermediates creates limitations with regards to production. Specifically, cells are limited by the following on-pathway pre-cursor molecules: (i) the amount of available pre-cursor molecules fluxing through the pathway to the terminal phytocannabinoids, such as C3-CoA/C4-CoA/C6-CoA, OA/DVA, and CBGA/CBGVA. Further limitations to mass producing these phytocannabinoids is the intracellular availability of available geranyl pyrophosphate (GPP) and the pre-cursors that lead to the native synthesis of this compound (e.g., MEV pathway). To sustainably meet the demands of both the consumer and medicinal market for these valuable terminal phytocannabinoids (e.g., THC(V)A, CBD(V)A, CBC(V)A) there is a need for the scaled production of cannabinoid biosynthesis in non-native hosts. Thus, the total production capacity of cannabinoids is accelerated by the engineering of enzymes in the cannabinoid biosynthesis pathway within a non-native host.
Described herein is the discovery and/or optimization of enzymes involved in the biosynthesis of Olivetolic acid (OA) from hexanoic acid. Specifically, disclosed are engineered enzymes, cells, and methods that significantly enhance both the rate and total quantity at which cannabinoid precursors can be biosynthesized. This is accomplished by increasing the flux of required precursors for CBGAS activity, olivetolic acid (OA) and geranyl pyrophosphate (GPP) (
HCS: Hexanoyl-CoA synthetases: a variety of natural enzymes were found to catalyze hexanoyl-CoA and butyryl-CoA from hexanoic acid and butyric acid, respectively.
PKS: polyketide synthase (Type III): natural enzymes were identified, and their activity improved via protein engineering
PKC: polyketide cyclase: new, nonnaturally occurring enzymes were developed and improved by engineering.
Fusions of PKS & PKC: showed improved OA titer and OA/OL ratio.
Fusions of HCS & PKS: may increase the flux to tetraketide (and ultimately to OA and analogs) from carboxylic acid
CHIL proteins: may increase enzyme's activity and reduce byproduct formation of HTAL and PDAL.
Some aspects of the present disclosure are directed to a polyketide synthase comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, or 68 wherein the polyketide synthase has polyketide synthase (PKS) activity. In some embodiments, the amino acid sequence of the polyketide synthase comprises at least one amino acid modification as compared to SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 2, wherein the amino acid substitution is located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 3, wherein the amino acid substitution is located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 4, wherein the amino acid substitution is located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 5, wherein the amino acid substitution is located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 6, wherein the amino acid substitution is located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 7, wherein the amino acid substitution is located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 8, wherein the amino acid substitution is located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 68, wherein the amino acid substitution is located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase further comprises a cleavage sequence, a linker sequence, a solubility tag, scaffolding tag, dimer-association small peptide extension off the termini, or affinity tag sequence.
In some embodiments, the polyketide synthase produces tetraketides with a variety of alkyl chain lengths from the condensation of one or more acyl-CoA substrates, all with varying alkyl chain lengths. In some embodiments, the tetraketides, are condensed from one or more acyl-CoA substrates selected from the group consisting of Acetyl-CoA, Butyryl-CoA, Hexanoyl-CoA, Octanoyl-CoA, Decanoyl-CoA, Dodecanoyl-CoA, Myristoyl-CoA, Palmitoleyl-CoA, Linoleyl-CoA, Palmityl-CoA, Malonyl-CoA, and Oleyl-CoA. In some embodiments, the polyketide synthase comprises a polypeptide sequence with at least 70% identity with SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, or 68 and produces a corresponding di-, tri-, or tetraketide of various alkyl chain-lengths from one or more acyl-CoA substrates shown in
Some aspects of the present disclosure are directed to a cell comprising the polyketide synthase disclosed herein. In some embodiments, the cell is a bacteria cell or a yeast cell.
Some aspects of the present disclosure are directed to a polynucleotide coding for a polyketide synthase described herein.
Some aspects of the present disclosure are directed to a polyketide cyclase comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 9, 10, 11, 12, 69, 71, 72, 73, 74, 75, 76, 77, 78, 79 or 80, wherein the polyketide cyclase has polyketide cyclase (PKC) activity. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least one amino acid modification as compared to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises a chimeric amino acid sequence comprising portions of two or more of SEQ ID NOs 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, and/or 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a chimeric amino acid sequence comprising portions of SEQ ID NO: 9 and SEQ ID NO: 10, portions of SEQ ID NO: 71 and SEQ ID NO: 72, portions of SEQ ID NO: 76 and SEQ ID NO: 72, portions of SEQ ID NO: 69 and SEQ ID NO: 71 or portions of SEQ ID NO: 69 and SEQ ID NO: 76.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid modification as compared to SEQ ID NO: 10, wherein the amino acid substitution is located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with a 1-20 C-terminus or N-terminus truncation as compared to SEQ ID NO: 10.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 11, wherein the amino acid substitution is located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with a 1-20 C-terminus or N-terminus truncation as compared to SEQ ID NO: 11. In some embodiments, the polyketide cyclase further comprises a cleavage sequence, a linker sequence, a solubility tag, a scaffolding tag, dimer-promoting small peptide terminal extension, or affinity tag sequence.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 69, wherein the amino acid substitution is located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with a 1-20 C-terminus or N-terminus truncation as compared to SEQ ID NO: 69. In some embodiments, the polyketide cyclase further comprises a cleavage sequence, a linker sequence, a solubility tag, a scaffolding tag, dimer-promoting small peptide terminal extension, or affinity tag sequence.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 71, wherein the amino acid substitution is located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with a 1-20 C-terminus or N-terminus truncation as compared to SEQ ID NO: 71. In some embodiments, the polyketide cyclase further comprises a cleavage sequence, a linker sequence, a solubility tag, a scaffolding tag, dimer-promoting small peptide terminal extension, or affinity tag sequence.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 72, wherein the amino acid substitution is located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with a 1-20 C-terminus or N-terminus truncation as compared to SEQ ID NO: 72. In some embodiments, the polyketide cyclase further comprises a cleavage sequence, a linker sequence, a solubility tag, a scaffolding tag, dimer-promoting small peptide terminal extension, or affinity tag sequence.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 76, wherein the amino acid substitution is located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with a 1-20 C-terminus or N-terminus truncation as compared to SEQ ID NO: 76. In some embodiments, the polyketide cyclase further comprises a cleavage sequence, a linker sequence, a solubility tag, a scaffolding tag, dimer-promoting small peptide terminal extension, or affinity tag sequence.
In some embodiments, the polyketide cyclase comprises a C-terminal and N-terminal small peptide that can dimerize. In some embodiments, the polyketide cyclase comprises a ubiquitin at the N-terminal. In some embodiments, the polyketide cyclase comprises a C-terminal and/or an N-terminal scaffolding tag capable of forming a homodimer and/or heterodimer.
In some embodiments, the polyketide cyclase is capable of cyclizing the PKS-produced tetraketide to the corresponding 6-alkyl-2,4-dihydroxy benzoic acid as shown in
Some aspects of the present disclosure are directed to a cell comprising a polyketide cyclase described herein. In some embodiments, the cell is a bacteria cell or a yeast cell.
Some aspects of the present disclosure are directed to a polynucleotide coding for a polyketide cyclase described herein.
Some aspects of the present disclosure are directed to a fusion protein comprising a polypeptide having polyketide synthase activity and a polypeptide having polyketide cyclase activity. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide cyclase activity comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79 or 80. In some embodiments, the fusion protein further comprises a linker between the polypeptide having polyketide synthase activity and the polypeptide having polyketide cyclase activity. In some embodiments, the linker is between 5 and 52 amino acids in length. In some embodiments, the linker has an amino acid sequence selected from SEQ ID NO: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, or 27.
In some embodiments, the fusion protein is a bi-function fusion protein, a tri-functional fusion protein or a tetra-functional fusion protein. In some embodiments, the fusion protein comprises an amino acid sequence having at least 90% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68 and comprises an amino acid sequence having at least 90% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80 and forms a bi-functional fusion protein. In some embodiments, the fusion protein comprises the amino acid sequence of SEQ ID NO: 34, 35, 36, 37, or 38.
In some embodiments, the fusion protein is capable of producing olivetolic acid from Hexanoyl-CoA and/or divarinic acid from butyryl-CoA. In some embodiments, the fusion protein is capable of producing a ratio of olivetolic acid to olivetol from Hexanoyl-CoA at a ratio of greater than 0.1.
In some embodiments, the polypeptide having polyketide synthase activity is located at the N-terminus. In some embodiments, the polypeptide having polyketide cyclase activity is located at the N-terminus.
In some embodiments, the fusion protein comprises a C-terminal and N-terminal small peptide that can dimerize. In some embodiments, the fusion protein comprises a ubiquitin at the N-terminal. In some embodiments, the fusion protein comprises a ubiquitin at the C-terminal. In some embodiments, the fusion protein comprises a C-terminal and/or an N-terminal scaffolding tag capable of forming a homodimer and/or heterodimer.
Some aspects of the present disclosure are directed to a cell comprising a fusion protein described herein. In some embodiments, the cell is a bacteria cell or a yeast cell.
Some aspects of the present disclosure are directed to a polynucleotide coding for a fusion protein described herein.
Some aspects of the present disclosure are directed to a cell comprising an exogenous nucleotide sequence coding for at least one of the following: (a) a polyketide synthase comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68, wherein the polyketide synthase has polyketide synthase (PKS) activity; (b) a polyketide cyclase comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 8, 9, 10, 11, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79 or 80, wherein the polyketide cyclase has polyketide cyclase (PKC) activity; (c) a fusion protein comprising a polypeptide having polyketide synthase activity and a polypeptide having polyketide cyclase activity; and (d) an enzyme having acyl-CoA activity.
In some embodiments, the cell comprises (a) the polyketide synthase, wherein the polyketide synthase is capable of producing a tetraketide from acyl-CoA substrates selected from carboxylic acids with two to twenty-two carbons, such as for example Acetyl-CoA, Butyryl-CoA, Hexanoyl-CoA, Octanoyl-CoA, Decanoyl-CoA, Dodecanoyl-CoA, Myristoyl-CoA Palmitoleyl-CoA, Linoleyl-CoA, Palmityl-CoA, and Oleyl-CoA or acyl-CoA substrates shown in
In some embodiments, the cell comprises a polyketide synthase as described herein (e.g., an amino acid sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68) and a polyketide cyclase described herein (e.g., an amino acid sequence of SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80). In some embodiments, the cell comprises a fusion protein of SEQ ID NO: 34, 35, 36, 37, or 38.
Some aspects of the present disclosure are directed to a fusion protein comprising a polypeptide having polyketide synthase activity and a polypeptide having acyl-CoA synthetase activity (e.g., HCS enzymes). In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having acyl-CoA synthetase activity comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the fusion protein further comprises a linker between the polypeptide having polyketide synthase activity and the polypeptide having acyl-CoA synthetase activity. In some embodiments, the linker is between 5 and 52 amino acids in length. In some embodiments, the linker has an amino acid sequence selected from SEQ ID NOs. 60, 61, or 62. In some embodiments, the fusion protein comprises the amino acid sequence of SEQ ID NO: 63, 64, 65
In some embodiments, the fusion protein is capable of producing a tetraketide-CoA from Hexanoic acid and/or Butyric acid.
In some embodiments, the acyl-CoA synthetase peptide is located at the N-terminus of the polyketide synthase peptide. In some embodiments, the acyl-CoA synthetase peptide is located at the C-terminus of the polyketide synthase.
In some embodiments, the cell comprises a acyl-CoA synthetase as described herein (e.g., an amino acid sequence of SEQ ID NO: 28, 29, 30, 31, 32 or 33) and a polyketide synthase as described herein (e.g., an amino acid sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68) and a polyketide cyclase as described herein (e.g., an amino acid sequence of SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80). In other embodiments the cell comprises a fusion between an acyl-CoA synthetase and a polyketide synthase as described herein (e.g., an amino acid sequence of SEQ ID NO: 63, 64, 65) and a separate polyketide cyclase as described herein (e.g., an amino acid sequence of SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80).
In some embodiments, the cell further comprises an exogenous polynucleotide coding for a CHIL protein. In some embodiments, the cell is capable of utilizing hexanoic acid to produce olivetolic acid. In some embodiments, the cell is capable of utilizing butyric acid to produce divarinic acid. In some embodiments, the cell is capable of utilizing octanoic acid, decanoic acid, dodecanoic acid, oleic acid, palmitic acid, myristic acid or stearic acid to produce one or more olivetolic acid analogs.
In some embodiments, the cell is a bacteria cell or a yeast cell. In some embodiments, the cell is a Yarrowia strain.
Some aspects of the present disclosure are directed to a method of producing one or more 6-alkyl-2,4-dihydroxy benzoic acid(s), e.g., as shown in
Some aspects of the present disclosure are directed to a cell comprising a fusion protein as disclosed herein. Some aspects of the present disclosure are directed to a cell comprising a fusion protein as disclosed herein and an enzyme with polyketide cyclase activity. In some embodiments, the enzyme with polyketide cyclase activity is a polyketide cyclase disclosed herein. In some embodiments, the cell is capable of utilizing hexanoic acid to produce olivetolic acid. In some embodiments, the cell is capable of utilizing butyric acid to produce divarinic acid. In some embodiments, the cell is capable of utilizing octanoic acid, decanoic acid, dodecanoic acid, oleic acid, palmitic acid, myristic acid or stearic acid to produce one or more olivetolic acid analogs. In some embodiments, the cell is a bacteria cell or a yeast cell. In some embodiments, the cell is a Yarrowia strain
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
Some aspects of the present disclosure are directed to a polyketide synthase (type III) comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, or 68 wherein the polyketide synthase has polyketide synthase (PKS) activity. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, or 68.
Polyketide synthases catalyze the sequential condensation of acetate units to an acceptor molecule to produce a large number of natural products through the intermediacy of a polyketide. Three different classes of polyketide synthases are known, Type I, II and III (See. E.g., US20190078098; Austin, M. B. and J. P. Noel. Natural Product Reports, 2002. 20(1): p. 79-110; Lim, Y., et al. Molecules, 2016. 21(6): p. 806; Yu, D., et al. IUBMB Life, 2012. 64(4): p. 285-295). A type III polyketide synthase (PKS1) that was identified in C. sativa condenses hexanoyl-CoA with three malonyl-CoAs to produce dodecanoyl-tetraketide-CoA. Unlike other Type III polyketide synthases, PKS1 was not able to cyclize the dodecanoyl-tetraketide to olivetolic acid (OA), instead, the decarboxylated product Olivetol (OL) was formed (Taura, F, Tanaka, S, Tagichi, C, Fukumizu, T, Tanaka, H, Shoyame Y, Morimnoto, S. FEBS Lett, 2009, 583, 2061). Further structural and biochemical characterization of PKS1 confirmed the enzyme being a homodimer and suggested that OL is produced through a non-enzymatic chemical aldol condensation of the dodecanoyl-tetraketide (Kearsey, LJ, Prandi, N, Karuppiah, V, Yan, C, Leys D, Toogood, H, Takano, E, Scrutton N S FEBS J. 2020, 287(8) 1511-1524). It was later shown that in Cannabis, dodecanoyl-tetraketide-CoA is condensed to OA by the action of a separate enzyme, polyketide cyclase PKC1 (Gagne S J, Stout, J M, Liu E, Boubakir Z, Clark S M, Page J E PNAS, 2012, 109(31), 12811-12816).
Type III PKSs are able to produce a wide diversity of polyketide products by using a variety of CoA-containing precursors as a starting unit. These starters range from small aliphatic molecules, such as acetyl-CoA, to larger ring-containing compounds derived from the phenylpropanoid pathway, such as 4-coumaroyl-CoA. Often, these CoA molecules are formed through the function of acid CoA ligases (or synthase) that convert carboxylic acids into corresponding CoA thioesters (Shimizu Y, Ogata, H, Goto, S ChemBioChem 2017, 18, 50-65)
As used herein, “polyketide synthase (PKS) activity” refers to the ability of an enzyme to produce a polyketide from an acyl-CoA precursor. In some embodiments, the polyketide synthase has at least 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or substantially 100% of the polyketide synthase (PKS) activity of a naturally occurring PKS (e.g., PKS1 from C. sativa). In some embodiments, the polyketide synthase has at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, 10-fold, or more polyketide synthase (PKS) activity as compared to a naturally occurring PKS (e.g., PKS1 from Cannabis).
Amino acid modifications may be amino acid substitutions, amino acid deletions and/or amino acid insertions. Amino acid substitutions may be conservative amino acid substitutions or non-conservative amino acid substitutions. A conservative replacement (i.e., also referred to as a conservative mutation, a conservative substitution, or a conservative variation) is an amino acid replacement in a protein that changes a given amino acid to a different amino acid with similar biochemical properties (e.g., charge, hydrophobicity and side-chain size). As used herein, “conservative variations” refer to the replacement of an amino acid residue by another, biologically similar residue. Examples of conservative variations include the substitution of one hydrophobic residue such as isoleucine, valine, leucine or methionine for another; or the substitution of one polar residue for another, such as the substitution of arginine for lysine, glutamic for aspartic acids, or glutamine for asparagine, and the like. Other illustrative examples of conservative substitutions include the changes of: alanine to serine; arginine to lysine; asparagine to glutamine or histidine; aspartate to glutamate; cysteine to serine; glutamine to asparagine; glutamate to aspartate; glycine to proline; histidine to asparagine or glutamine; isoleucine to leucine or valine; leucine to valine or isoleucine; lysine to arginine, glutamine, or glutamate; methionine to leucine or isoleucine; phenylalanine to tyrosine, leucine or methionine; serine to threonine; threonine to serine; tryptophan to tyrosine; tyrosine to tryptophan or phenylalanine; valine to isoleucine or leucine, and the like.
“Identity” refers to the extent to which the sequence of two or more nucleic acids or polypeptides is the same. In some embodiments, percent identity between a sequence of interest and a second sequence over a window of evaluation, e.g., over the length of the sequence of interest, may be computed by aligning the sequences, determining the number of residues (nucleotides or amino acids) within the window of evaluation that are opposite an identical residue allowing the introduction of gaps to maximize identity, dividing by the total number of residues of the sequence of interest or the second sequence (whichever is greater) that fall within the window, and multiplying by 100. When computing the number of identical residues needed to achieve a particular percent identity, fractions are to be rounded to the nearest whole number. Percent identity can be calculated with the use of a variety of computer programs known in the art. For example, computer programs such as BLAST2, BLASTN, BLASTP, Gapped BLAST, etc., generate alignments and provide percent identity between sequences of interest. The algorithm of Karlin and Altschul (Karlin and Altschul, Proc. Natl. Acad. Sci. USA 87:22264-2268, 1990) modified as in Karlin and Altschul, Proc. Natl. Acad. Sci. USA 90:5873-5877, 1993 is incorporated into the NBLAST and XBLAST programs of Altschul et al. (Altschul, et al., J. Mol. Biol. 215:403-410, 1990). To obtain gapped alignments for comparison purposes, Gapped BLAST is utilized as described in Altschul et al. (Altschul, et al. Nucleic Acids Res. 25: 3389-3402, 1997). When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs may be used. A PAM250 or BLOSUM62 matrix may be used. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (NCBI). See the Web site having URL ncbi.nlm.nih.gov for these programs. In a specific embodiment, percent identity is calculated using BLAST2 with default parameters as provided by the NCBI.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO:2. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO:2. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO:2. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 2. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 2.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 3. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 3.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 4. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 4.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 5. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 5.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 6. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 6.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 7. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 7.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 8. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 8.
In some embodiments, the amino acid sequence of the polyketide synthase comprises one amino acid modification as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises two amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises three amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises four amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises five amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises six amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises seven amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises eight amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises nine amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises 1-10 amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises 10-20 amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises 20-30 amino acid modifications as compared to SEQ ID NO: 68. In some embodiments, the amino acid sequence of the polyketide synthase comprises 30-40 amino acid modifications as compared to SEQ ID NO: 68.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 2, wherein the amino acid substitution is located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least two amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least three amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least four amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least five amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least six amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least seven amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least eight amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least nine amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least ten amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least eleven amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, 1263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twelve amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least thirteen amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least fourteen amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least fifteen amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least sixteen amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least seventeen amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least eighteen amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least nineteen amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty-one amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty-two amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, 1204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty-three amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty-four amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty-five amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty-six amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, 1263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least twenty-seven amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384. In some embodiments, the polyketide synthase comprises an amino acid sequence with twenty-eight amino acid substitutions as compared to SEQ ID NO: 2, wherein the amino acid substitution are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 3, wherein the amino acid substitution is located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least two amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least three amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least four amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 5 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 6 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 7 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 8 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 9 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 10 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 11 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 12 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 13 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 14 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 15 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 16 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 17 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 18 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 19 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 20 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 21 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 22 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 23 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 24 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 25 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 26 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 27 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with 28 amino acid substitutions as compared to SEQ ID NO: 3, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 4, wherein the amino acid substitution is located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 2 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 3 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 4 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 5 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 6 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 7 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 8 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 9 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 10 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 11 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 12 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 13 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 14 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 15 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 16 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 17 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 18 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 19 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 20 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 21 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 22 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 23 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 24 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 25 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 26 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 27 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379. In some embodiments, the polyketide synthase comprises an amino acid sequence with 28 amino acid substitutions as compared to SEQ ID NO: 4, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 5, wherein the amino acid substitution is located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 2 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 3 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 4 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 5 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 6 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, 1264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 7 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 8 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 9 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 10 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 11 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, 1264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 12 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 13 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 14 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 15 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 16 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, 1264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 17 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 18 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 19 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 20 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 21 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, 1264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 22 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 23 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 24 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 25 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 26 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, 1264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 27 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385. In some embodiments, the polyketide synthase comprises an amino acid sequence with 28 amino acid substitutions as compared to SEQ ID NO: 5, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 6, wherein the amino acid substitution is located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 2 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 3 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 4 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 5 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 6 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 7 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 8 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 9 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 10 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 11 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 12 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 13 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 14 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 15 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 16 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 17 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 18 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 19 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 20 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 21 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 22 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 23 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 24 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 25 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 26 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 27 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with 28 amino acid substitutions as compared to SEQ ID NO: 6, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 7, wherein the amino acid substitution is located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 2 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 3 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 4 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 5 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 6 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 7 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 8 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 9 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 10 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 11 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 12 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 13 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 14 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 15 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 16 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 17 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 18 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 19 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 20 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 21 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 22 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 23 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 24 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 25 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 26 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, 1258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 27 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381. In some embodiments, the polyketide synthase comprises an amino acid sequence with 28 amino acid substitutions as compared to SEQ ID NO: 7, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 8, wherein the amino acid substitution is located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 2 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 3 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 4 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 5 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 6 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, 1259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 7 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 8 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 9 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 10 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 11 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, 1259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 12 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 13 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 14 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 15 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 16 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, 1259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 17 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 18 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 19 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 20 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 21 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, 1259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 22 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 23 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 24 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 25 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 26 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, 1259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 27 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378. In some embodiments, the polyketide synthase comprises an amino acid sequence with 28 amino acid substitutions as compared to SEQ ID NO: 8, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378.
In some embodiments, the polyketide synthase comprises an amino acid sequence with at least one amino acid substitution as compared to SEQ ID NO: 68, wherein the amino acid substitution is located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 2 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 3 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 4 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 5 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 6 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 7 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 8 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 9 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 10 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 11 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 12 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 13 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 14 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 15 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 16 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 17 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 18 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 19 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 20 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 21 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 22 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 23 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 24 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 25 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 26 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with at least 27 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369. In some embodiments, the polyketide synthase comprises an amino acid sequence with 28 amino acid substitutions as compared to SEQ ID NO: 68, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369.
In some embodiments, the polyketide synthase further comprises a tag or other sequence. In some embodiments, the polyketide synthase further comprises a cleavage sequence, a linker sequence, a solubility tag, a scaffolding tag, a dimerizable small peptide, or an affinity tag sequence. For example, the tag can be an affinity tag (e.g., HA, TAP, Myc, 6×his, Flag, GST), fluorescent or luminescent protein (e.g., EGFP, ECFP, EYFP, Cerulean, DsRed, mCherry), solubility-enhancing tag (e.g., Ubiquitin tag, a SUMO tag, NUS A tag, SNUT tag, or a monomeric mutant of the Ocr protein of bacteriophage T7). See, e.g., Esposito D and Chatterjee D K. Curr Opin Biotechnol.; 17(4):353-8 (2006) or Varshavsky A. Methods Enzymol. 326: 578-593 (2000). In some embodiments, a tag can serve multiple functions. A tag is often relatively small, e.g., ranging from a few amino acids up to about 100 amino acids long. In some embodiments a tag is more than 100 amino acids long, e.g., up to about 500 amino acids long, or more. In some embodiments, a tag is located at the N- or C-terminus, e.g., as an N- or C-terminal fusion. The polypeptide could comprise multiple tags. In some embodiments, a tag is cleavable, so that it can be removed from the polypeptide, e.g., by a protease. Exemplary proteases include, e.g., thrombin, TEV protease, Factor Xa, PreScission protease, etc. In some embodiments, a “self-cleaving” tag is used. See, e.g., PCT/US05/05763. In some embodiments a tag or other heterologous sequence is separated from the rest of the protein by a polypeptide linker. For example, a linker can be a short polypeptide (e.g., 15-25 amino acids). Often a linker is composed of small amino acid residues such as serine, glycine, and/or alanine. A heterologous domain could comprise a transmembrane domain, a secretion signal domain, etc. A scaffolding tag refers to a peptide that can interact with itself or another peptide or protein. Numerous small peptides that can form homo or hetero dimers have been described and have been used to co-localize enzymes (e.g Park, WM Int. J Mol Sci (2000) 21 3584; Anderson G P, Shriver-Lake L C, Liu, J L, Goldman E R, ACS Omega, 2018, 3, 4810-4815). Small tag peptides can interact with proteins to form tight complexes and have also been used to create multi-protein scaffolds (Vanderstraeten J, Briers, Y Biotechnol. Advances 2020, 44, 107627, Keasling J D et al Nature Biotechnol 2009, 27(8), 753).
In some embodiments, the polyketide synthase is capable of producing a tetraketide from one or more acyl-CoA substrates (e.g., acyl-CoA consisting of an acid with C2 to C22 carbons). In some embodiments, the acyl-CoA substrate is selected from acetyl-CoA, butyryl-CoA, Hexanoyl-CoA, Octanoyl-CoA, decanoyl-CoA, dodecanoyl-CoA, Myristoyl-CoA, Palmitoleyl-CoA, Linoleyl-CoA, Palmitoyl-CoA, and Oleyl-CoA. In some embodiments, the acyl-CoA substrate is Hexanoyl-CoA. In some embodiments, the acyl-CoA substrate is butyryl-CoA.
In some embodiments, the polyketide synthase is capable of producing a tetraketide from the acyl-CoA substrate at a higher rate than PKS1 from Cannabis sativa. In some embodiments, the polyketide synthase is capable of producing a tetraketide from the acyl-CoA substrate at a rate that is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, 10-fold, or more of the tetraketide synthesis rate of PKS-1 from Cannabis sativa. In some embodiments, the polyketide synthase is capable of producing a tetraketide from the acyl-CoA substrate at a rate that is at least 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or substantially 100% of the tetraketide synthesis rate of PKS-1 from Cannabis sativa.
Some aspects of the present disclosure are directed to a cell comprising a polyketide synthase disclosed herein. In some embodiments, the cell is a transgenic cell (e.g., the polyketide synthase is coded by a heterologous sequence). In some embodiments, the cell is a yeast cell, in a bacterial cell, in an algae cell, or in a plant cell. In some embodiments, the cell is a yeast cell. In some embodiments, the yeast is an oleaginous yeast (e.g., a Yarrowia lipolytica strain). In some embodiments, the bacteria is Escherichia coli.
Suitable cells may include, but are not limited to, Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomyces cerevisiae, Saccharomyces sp., Hansenula polymorpha (now known as Pichia angusta), Kluyveromyces sp., Kluyveromyces lactis, Kluyveromyces marxianus, Schizosaccharomyces pompe, Dekkera bruxellensis, Arxula adeninivorans, Candida albicans, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, Chrysosporium lucknowense, Fusarium sp., Fusarium gramineum, Fusarium venenatum, Neurospora crassa, Chlamydomonas reinhardtii, Scizichytrim, sp, Aurantiochytrium sp, Yarrowia lipolytica and the like. In some embodiments, the cell is a protease-deficient strain of Saccharomyces cerevisiae. In some embodiments, the cell is a eukaryotic cell other than a plant cell. In some embodiments, the cell is a plant cell. In some embodiments, the cell is a plant cell, where the plant cell is one that does not normally produce a cannabinoid, a cannabinoid derivative or analogue, a cannabinoid precursor, or a cannabinoid precursor derivative or analogue. In some embodiments, the cell is Saccharomyces cerevisiae.
In some embodiments, the cell is a prokaryotic cell. Suitable prokaryotic cells may include, but are not limited to, any of a variety of laboratory strains of Escherichia coli, Lactobacillus sp., Salmonella sp., Shigella sp., and the like. See, e.g., Carrier et al, (1992) J. Immunol. 148:1176-1181; U.S. Pat. No. 6,447,784; and Sizemore et al. (1995) Science 270:299-302. Examples of Salmonella strains which can be employed may include, but are not limited to, Salmonella typhi and S. typhimurium. Suitable Shigella strains may include, but are not limited to, Shigella flexneri, Shigella sonnei, and Shigella disenteriae. Typically, the laboratory strain is one that is non-pathogenic. Non-limiting examples of other suitable bacteria may include, but are not limited to, Bacillus subtilis, Pseudomonas putida, Pseudomonas aeruginosa, Pseudomonas mevalonii, Rhodobacter sphaeroides, Rhodobacter capsulatus, Rhodospirillum rubrum, Rhodococcus sp., and the like.
Some aspects of the present disclosure are directed to a polynucleotide coding for a polyketide synthase disclosed herein.
An expression vector or vectors can be constructed to include exogenous nucleotide sequences coding for the recombinant polypeptides described herein operably linked to expression control sequences functional in the cell. Expression vectors applicable include, for example, plasmids, phage vectors, viral vectors, episomes and artificial chromosomes, including vectors and selection sequences or markers operable for stable integration into a host chromosome. Additionally, the expression vectors can include one or more selectable marker genes and appropriate expression control sequences. Selectable marker genes also can be included that, for example, provide resistance to antibiotics or toxins, complement auxotrophic deficiencies, or supply critical nutrients not in the culture media. Expression control sequences can include constitutive and inducible promoters, transcription enhancers, transcription terminators, and the like which are well known in the art. When two or more exogenous encoding nucleic acids are to be co-expressed, both nucleic acids can be inserted, for example, into a single expression vector or in separate expression vectors. For single vector expression, the encoding nucleic acids can be operationally linked to one common expression control sequence or linked to different expression control sequences, such as one inducible promoter and one constitutive promoter. The transformation of exogenous nucleic acid sequences can be confirmed using methods well known in the art. Such methods include, for example, nucleic acid analysis such as Northern blots or polymerase chain reaction (PCR) amplification of mRNA, or immunoblotting for expression of gene products, or other suitable analytical methods to test the expression of an introduced nucleic acid sequence or its corresponding gene product. It is understood by those skilled in the art that the exogenous nucleic acid is expressed in a sufficient amount to produce the desired product, and it is further understood that expression levels can be optimized to obtain sufficient expression using methods well known in the art and as disclosed herein.
The term “exogenous” is intended to mean that the referenced molecule or the referenced activity is introduced into the cell. The molecule can be introduced, for example, by introduction of an encoding nucleic acid into the host genetic material such as by integration into a host chromosome or as non-chromosomal genetic material such as a plasmid. Therefore, the term as it is used in reference to expression of an encoding nucleic acid refers to introduction of the encoding nucleic acid in an expressible form into the cell. When used in reference to a biosynthetic activity, the term refers to an activity that is introduced into the host. The source can be, for example, a homologous or heterologous encoding nucleic acid that expresses the referenced activity following introduction into the cell. Therefore, the term “endogenous” refers to a referenced molecule or activity that is present in the cell. Similarly, the term when used in reference to expression of an encoding nucleic acid refers to expression of an encoding nucleic acid contained within the microbial organism. The term “heterologous” refers to a molecule or activity derived from a source other than the referenced species whereas “homologous” refers to a molecule or activity derived from the host microbial organism. Accordingly, exogenous expression of an encoding nucleic acid can utilize either or both a heterologous or homologous encoding nucleic acid.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence having at least 70% identity to SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence having at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence with 1-40 amino acid modifications as compared to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68 and, optionally, one to twenty amino acids deleted from the C-terminus or N-terminus.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 2 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 2 at positions selected from A106, Y140, S141, A145, L169, G171, C172, E200, T202, I204, A205, G208, G219, F223, G224, D225, G226, I263, M265, M272, Y274, H313, G315, N346, S348, F382, G383, and P384.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 3 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 3 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 4 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 4 at positions selected from A102, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H308, G310, N341, S343, F377, G378, and P379.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 5 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 5 at positions selected from A108, Y142, S143, A147, L171, G173, C174, E202, T204, I206, A207, G210, G221, F225, G226, D227, G228, I264, M266, M273, Y275, H314, G316, N347, S349, F383, G384, and P385.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 6 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 6 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 7 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 7 at positions selected from A103, Y136, S137, A141, L165, G167, C168, E196, T198, I200, A201, G204, G215, F219, G220, D221, G222, I258, M260, M267, Y269, H309, G311, N342, S344, F379, G380, and P381.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 8 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 8 at positions selected from A103, Y137, S138, A142, L166, G168, C169, E197, T199, I201, A202, G205, G216, F220, G221, D222, G223, I259, M261, M268, Y270, H309, G311, N342, S344, F376, G377, and P378.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence identical to SEQ ID NO: 68 with one to twenty-eight amino acid substitutions and, optionally, one to twenty amino acids deleted from the C-terminus and/or N-terminus, wherein the amino acid substitutions are located in SEQ ID NO: 68 at positions selected from S126, G156, C157, G193, G204, F208, G209, D210, I248, H297, G299, N330, S332, F367, G368, and P369.
In some embodiments, the recombinant polypeptide further comprises a fusion domain. The fusion domain is not limited and may be any fusion domain disclosed herein. In some embodiments, the fusion domain is a domain useful for affinity chromatography. In some embodiments, the fusion domain targets the protein to a specific compartment of the cell such as the ER, vacuole, Golgi, peroxisome, lipid body (e.g., oleosome), or targets secretion of the protein from the cell into the outer membrane, periplasmic space or the culture media. In other embodiments the recombinant polypeptide or its mutants described herein is fused to an acyl-CoA synthase.
Some aspects of the present disclosure are directed to a polyketide cyclase (PKC) comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80, wherein the polyketide cyclase has polyketide cyclase (PKC) activity. In some embodiments, the polyketide cyclase comprises and amino acid sequence with at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80.
As used herein, “PKC activity” refers to ability to cyclize a polyketide (i.e tetraketide) to an aromatic hydroxy acid (e.g., olivetolic acid or divarinic acid). In some embodiments, the PKC has at least 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or substantially 100% of the PKC activity of a naturally occurring PKC (e.g., PKC1 from Cannabis, PKC4 from Cannabis). In some embodiments, the PKC has at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, 10-fold, or more PKC activity as compared to a naturally occurring PKC (e.g., PKC from Cannabis).
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least one amino acid modification as compared to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80. As used herein, an amino acid modification may be an insertion, deletion, or substitution.
In some embodiments, the amino acid sequence of the polyketide cyclase has at least 70% identity to SEQ ID NO: 40, 41, 42, 43, 44, or 45. In some embodiments, the amino acid sequence of the polyketide cyclase has at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 40, 41, 42, 43, 44, or 45. In some embodiments, the amino acid sequence of the polyketide cyclase comprises SEQ ID NO: 40, 41, 42, 43, 44, 45 or 46.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 10. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 10.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 11. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 11.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 69. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 69.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 70. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 70.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 71. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 71.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 72. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 72.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 73. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 73.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 74. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 74.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 75. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 75.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 76. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 76.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 77. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 77.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 78. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 78.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 79. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 79.
In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 1 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 2 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 3 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 4 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 5 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 6 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 7 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 8 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 9 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 10-20 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises at least 20-30 amino acid modifications as compared to SEQ ID NO: 80. In some embodiments, the amino acid sequence of the polyketide cyclase comprises a 1-30, 1-20, 1-10, or 1-5 amino acid C-terminus or N-terminus truncation as compared to SEQ ID NO: 80.
In some embodiments, the polyketide cyclase comprises a chimeric amino acid sequence comprising portions having at least 70% sequence identity to portions of 2 or more sequences selected from SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80 (e.g., a portion with 70% identity to a portion of SEQ ID NO: 9 and another portion having at least 70% identity to a portion of SEQ ID NO: 10). In some embodiments, the PKC comprises a chimeric amino acid sequence comprising portions having at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% sequence identity to portions of 2 or more sequences selected from SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80 (e.g., a portion with 70% identity to a portion of SEQ ID NO: 9 and another portion having at least 70% identity to a portion of SEQ ID NO: 10). As used herein, a portion of a sequence can be at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, or more contiguous amino acids.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid modification as compared to SEQ ID NO: 10, wherein the amino acid modification is located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 2 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 3 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 4 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 5 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 6 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 7 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 8 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 9 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 10 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 11 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, 176, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 12 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, 114, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, 169, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 13 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 14 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 15 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 16 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, 176, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 17 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, 114, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, 169, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 18 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 19 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 20 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 21 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 22 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, 176, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 23 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, 114, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, 169, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 24 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 25 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 26 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 27 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, 176, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 28 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, 114, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, 169, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 29 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 30 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 31 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 32 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, 176, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 33 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, 114, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, 169, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with 34 amino acid modifications as compared to SEQ ID NO: 10, wherein the amino acid modifications are located in SEQ ID NO: 10 at positions selected from V9, H11 V12, F13, I14, L15, M17, M29, N30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, F66, E67, S68, I69, F70, M73, I76, Y79, I80, L86, L88, R89, Y92, F93, L96, F99, L100, V101, and F102, D103 and K105.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid modification as compared to SEQ ID NO: 11, wherein the amino acid modification is located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 2 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 3 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, 165, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 4 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 5 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 6 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 7 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 8 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 9 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 10 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 11 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, 169, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 12 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 13 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, 165, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 14 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 15 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 16 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 17 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, 169, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 18 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 19 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, 165, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 20 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 21 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 22 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 23 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, 169, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 24 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 25 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, 165, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 26 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 27 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 28 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 29 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, 169, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 30 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 31 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, 165, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 32 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 33 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 34 amino acid modifications as compared to SEQ ID NO: 11, wherein the amino acid modifications are located in SEQ ID NO: 11 at positions selected from V9, H11, V12, I13 I14, L15, F17, F29, F30, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, V66, E67, S68, I69, F70, V73, I76, Y79, I80, V86, F88, G89, Y92, R93, W96, L99, L100, I101, and F102, D103, and T105.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid modification as compared to SEQ ID NO: 69, wherein the amino acid modification is located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 2 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 3 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 4 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 5 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 6 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 7 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 8 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 9 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 10 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 11 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 12 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 13 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 14 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 15 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 16 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 17 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 18 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 19 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 20 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 21 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 22 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 23 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 24 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 25 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 26 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 27 amino acid modifications as compared to SEQ ID NO: 69, wherein the amino acid modifications are located in SEQ ID NO: 69 at positions selected from V3, H5, L9, Y27, L45, E46, N48-Y56, H58, I59, E61, T63, F64, I70, Y73, I74, Y86, L94, F96, and D97.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid modification as compared to SEQ ID NO: 71, wherein the amino acid modification is located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 2 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 3 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 4 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 5 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 6 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 7 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 8 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 9 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 10 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 11 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 12 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 13 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 14 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 15 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 16 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 17 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 18 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 19 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 20 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 21 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 22 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 23 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 24 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 25 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 26 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 27 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 28 amino acid modifications as compared to SEQ ID NO: 71, wherein the amino acid modifications are located in SEQ ID NO: 71 at positions selected from V9, H11, L15, Y33, L51, E52, N54-Y62, H64, I65, E67-F70, I76, Y79, I80, Y92, L100, F102 and D103.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid modification as compared to SEQ ID NO: 72, wherein the amino acid modification is located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 2 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 3 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 4 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 5 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 6 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 7 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 8 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 9 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 10 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 11 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 12 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 13 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 14 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 15 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 16 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 17 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 18 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 19 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 20 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 21 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 22 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 23 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 24 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 25 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 26 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 27 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 28 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 29 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 30 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 31 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 32 amino acid modifications as compared to SEQ ID NO: 72, wherein the amino acid modifications are located in SEQ ID NO: 72 at positions selected from V3, H5-L9, Y27, A39, Q41, L45, E46, N48-Y56, H58, I59, E61, S62, F64, I70, Y73, I74, Y86, L94, F96, and D97.
In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least one amino acid modification as compared to SEQ ID NO: 76, wherein the amino acid modification is located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 2 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 3 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 4 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, 114, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 5 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 6 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 7 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 8 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, 114, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 9 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 10 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 11 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 12 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 13 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 14 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 15 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 16 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 17 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 18 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 19 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 20 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 21 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 22 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 23 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 24 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 25 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 26 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 27 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 28 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 29 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 30 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 31 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103. In some embodiments, the polyketide cyclase comprises an amino acid sequence with at least 32 amino acid modifications as compared to SEQ ID NO: 76, wherein the amino acid modifications are located in SEQ ID NO: 76 at positions selected from V9, H11 V12, I14, L15, M29, Y33, A45, Q47, L51, E52, N54-Y62, H64, I65, E67, S68, F70, I76, Y79, I80, Y92, L100, F102, and D103.
In some embodiments, the PKC further comprises a tag or other sequence. In some embodiments, the PKC further comprises a cleavage sequence, a linker sequence, a solubility tag, a scaffolding tag, a dimerizable small peptide, and/or an affinity tag sequence. For example, the tag can be an affinity tag (e.g., HA, TAP, Myc, 6×His, Flag, GST), fluorescent or luminescent protein (e.g., EGFP, ECFP, EYFP, Cerulean, DsRed, mCherry), solubility or expression-enhancing tag (e.g., Ubiquitin, SUMO tag, NUS A tag, SNUT tag, or a monomeric mutant of the Ocr protein of bacteriophage T7). See, e.g., Esposito D and Chatterjee D K. Curr Opin Biotechnol.; 17(4):353-8 (2006) or Varshavsky A. Methods Enzymol. 326: 578-593 (2000). In some embodiments, a tag can serve multiple functions. A tag is often relatively small, e.g., ranging from a few amino acids up to about 100 amino acids long. In some embodiments a tag is more than 100 amino acids long, e.g., up to about 500 amino acids long, or more. In some embodiments, a tag is located at the N- or C-terminus, e.g., as an N- or C-terminal fusion. The polypeptide could comprise multiple tags. In some embodiments, a tag is cleavable, so that it can be removed from the polypeptide, e.g., by a protease. Exemplary proteases include, e.g., thrombin, TEV protease, Factor Xa, PreScission protease, etc. In some embodiments, a “self-cleaving” tag is used. See, e.g., PCT/US05/05763. In some embodiments a tag or other heterologous sequence is separated from the rest of the protein by a polypeptide linker. For example, a linker can be a short polypeptide (e.g., 15-25 amino acids). Often a linker is composed of small amino acid residues such as serine, glycine, and/or alanine. A heterologous domain could comprise a transmembrane domain, a secretion signal domain, etc. A scaffolding tag refers to a peptide that can interact with itself or another peptide or protein. Numerous small peptides that can form homo or hetero dimers have been described and have been used to co-localize enzymes (e.g Park, WM Int. J Mol Sci (2000) 21 3584; Anderson G P, Shriver-Lake L C, Liu, J L, Goldman E R, ACS Omega, 2018, 3, 4810-4815). Similarly, scaffolding tag peptides can interact with proteins to form tight complexes and have also been used to create multi-protein scaffolds (Vanderstraeten J, Briers, Y Biotechnol. Advances 2020, 44, 107627, Keasling J D et al Nature Biotechnol 2009, 27(8), 753).
In some embodiments, a PKC described herein (e.g., PKC1, PKC1.1, PKC4, PKC4.8 and functional fragments, variants, and derivatives thereof) comprises a C-terminal and N-terminal small peptide that can facilitate dimerization of the enzyme (i.e., dimerizable small peptide). This modification can increase stability of the PKC by zipping the N- and C-terminus of the protein. In some embodiments, the PKC comprising a C-terminal and N-Terminal small peptide that can dimerize is selected from SEQ ID NO: 40, 41, or 42. In some embodiments, the PKC comprising a C-terminal and N-terminal small peptide that can dimerize has at least about 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity to SEQ ID NO: 40, 41, or 42. In some embodiments, a PKC described herein is expressed with an N-terminal small peptide of SEQ ID NO: 47 and a C-terminal small peptide selected from SEQ ID NO: 48, 49 or 68. In some embodiments, a PKC having a C-terminal and N-terminal small peptide that can dimerize is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, 10-fold, or more stable than a PKC having the same amino acid sequence except for the C-terminal and N-terminal small peptide. In some embodiments, a PKC having a C-terminal and N-terminal small peptide that can dimerize is present in a cell at a level that is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold higher than the level of a PKC having the same amino acid sequence except for the C-terminal and N-terminal small peptide.
In some embodiments, a PKC described herein (e.g., PKC1, PKC1.1, PKC4, PKC4.8 and functional fragments, variants, and derivatives thereof) comprises a ubiquitin at the N-terminal that increases solubility and expression of the PKC. In some embodiments, the PKC having ubiquitin at the N-terminal comprises the amino acid sequence of SEQ ID NO: 43, 58, or 59. In some embodiments, the PKC having ubiquitin at the N-terminal has at least about 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity to SEQ ID NO: 43, 58, or 59. In some embodiments, a PKC having ubiquitin at the N-terminal is present in a cell at a level that is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold higher than the level of a PKC having the same amino acid sequence but not having ubiquitin at the N-terminal. In some embodiments, a PKC having ubiquitin at the N-terminal is expressed in a cell at a level that is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold higher than the level of a PKC having the same amino acid sequence but not having ubiquitin at the N-terminal. In some embodiments, a PKC having ubiquitin at the N-terminal is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold more soluble than a PKC having the same amino acid sequence but not having ubiquitin at the N-terminal.
In some embodiments, a PKC described herein (e.g., PKC1, PKC1.1, PKC4, PKC4.8 and functional fragments, variants, and derivatives thereof) comprises a C-terminal and/or an N-terminal scaffolding tag. In some embodiments, the scaffolding tag is capable of forming homodimers. In some embodiments, the scaffolding tag is capable of forming heterodimers. In some embodiments, the scaffolding tag is capable of forming both homodimers and heterodimers. In some embodiments, the scaffolding tag has the amino acid sequence of SEQ ID NO: 66 (P3) or 67 (P4). In some embodiments, the PKC having a scaffolding tag comprises the amino acid sequence of SEQ ID NO: 44 or 45. In some embodiments, the PKC having a scaffolding tag comprises an amino acid sequence having at least about 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity to SEQ ID NO: 44 or 45. In some embodiments, a PKC having a scaffolding tag is expressed in a cell at a level that is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold higher than the level of a PKC having the same amino acid sequence but not having a scaffolding tag. In some embodiments, a PKC having a scaffolding tag is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold more soluble than a PKC having the same amino acid sequence but not having a scaffolding tag. In some embodiments, a PKC having a scaffolding tag is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold more active than a PKC having the same amino acid sequence but not having a scaffolding tag.
In some embodiments, the polyketide cyclase is capable of producing 6-alkyl-2,4-dihydroxy benzoic acid from a tetraketide as shown in
In some embodiments, the PKC is capable of producing a 6-alkyl-2,4-dihydroxy benzoic acid, and specifically OA, an OA analog, DVA, or a DVA analog at a higher rate than PKC4.8 (SEQ ID NO: 11). In some embodiments, the PKC is capable of producing OA, OA analog, DVA, or a DVA analog at a rate that is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, 10-fold, or higher than the rate of PKC4.8. In some embodiments, the PKC is capable of producing OA, OA analog, DVA, or a DVA analog at a rate that is at least 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or substantially 100% of the rate of PKC4.8.
Some aspects of the present disclosure are directed to a cell comprising a PKC disclosed herein. In some embodiments, the cell is a transgenic cell (e.g., the polyketide cyclase is coded by a heterologous sequence). In some embodiments, the cell is a yeast cell, in a bacterial cell, in an algae cell, or in a plant cell. In some embodiments, the cell is a yeast cell. In some embodiments, the yeast is an oleaginous yeast (e.g., a Yarrowia lipolytica strain). The cell is not limited and may be any suitable cell disclosed herein.
Some aspects of the present disclosure are directed to a polynucleotide coding for a PKC disclosed herein.
An expression vector or vectors can be constructed to include exogenous nucleotide sequences coding for the recombinant polypeptides described herein operably linked to expression control sequences functional in the cell. Any suitable expression vector disclosed herein may be used.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence having at least 70% identity to SEQ ID NO: 9, 10, 11 or 12. In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence having at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 9, 10, 11 or 12.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence with 1-40 amino acid modifications as compared to SEQ ID NO: 9, 10, 11 or 12 and, optionally, one to twenty amino acids deleted from the C-terminus or N-terminus.
Some aspects of the present disclosure are directed to a fusion protein comprising a polypeptide having polyketide synthase activity and a polypeptide having acyl-CoA synthetase activity (e.g., HCS enzymes).
In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 75% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 80% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 85% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 90% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8 or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 95% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 99% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 99.5% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity is any suitable polypeptide disclosed herein.
In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 75% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 80% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 85% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 90% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 95% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 99% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33. In some embodiments, the polypeptide having acyl-CoA synthetase activity (e.g., an HCS) comprises an amino acid sequence with at least 99.5% identity to SEQ ID NO: 28, 29, 30, 31, 32 or 33.
In some embodiments, the fusion protein further comprises a linker between the polypeptide having polyketide synthase activity and the polypeptide having polyketide cyclase activity. In some embodiments, the linker is between 5 and 52 amino acids in length. In some embodiments, the linker is 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, or 52. In some embodiments, the linker has an amino acid sequence selected from SEQ ID NO: 60, 61, or 62.
In some embodiments, the fusion protein comprises the amino acid sequence of SEQ ID NO: 63, 64, or 65. In some embodiments, the fusion protein comprises an amino acid sequence with at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 63, 64, or 65 or a fragment thereof.
In some embodiments, the fusion protein is capable of producing a tetraketide-CoA from Hexanoic acid and/or butyric acid.
In some embodiments, the acyl-CoA synthetase peptide is located at the N-terminus of the polyketide synthase peptide or is connected by a linker to the N-terminus of the polyketide synthase peptide. In some embodiments, the acyl-CoA synthetase peptide is located at the N-terminus of the polyketide synthase or is connected by a linker to the N-terminus of the polyketide synthase.
In some embodiments, the cell comprises a acyl-CoA synthetase as described herein (e.g an amino acid sequence of SEQ ID NO: 28, 29, 30, 31, 32 or 33) and a polyketide synthase as described herein (e.g., an amino acid sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, or 8) and a polyketide cyclase as described herein (e.g., an amino acid sequence of SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80). In other embodiments the cell comprises a fusion between an acyl-CoA synthetase and a polyketide synthase as described herein (e.g., an amino acid sequence of SEQ ID NO: 63, 64, 65) and a separate polyketide cyclase as described herein (e.g., an amino acid sequence of SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80).
Some aspects of the present disclosure are related to a fusion protein comprising a polypeptide having polyketide synthase activity and a polypeptide having polyketide cyclase activity. In some embodiments, the polypeptide having polyketide synthase activity is a polyketide synthase as described herein. In some embodiments, the polypeptide having polyketide synthase activity is a polyketide cyclase as described herein. In some embodiments, the fusion protein comprises a polyketide synthase and polyketide cyclase as described herein.
In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the polypeptide having polyketide synthase activity comprises an amino acid sequence with at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68.
In some embodiments, the polypeptide having polyketide cyclase activity comprises an amino acid sequence with at least 70% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80. In some embodiments, the polypeptide having polyketide cyclase activity comprises an amino acid sequence with at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80.
In some embodiments, the fusion protein further comprises a linker between the polypeptide having polyketide synthase activity and the polypeptide having polyketide cyclase activity. Any suitable linker may be used. In some embodiments, the linker is a polypeptide. In some embodiments, the linker is between 5 and 52 amino acids in length. In some embodiments, the linker comprises, consists of, or consists essentially of the amino acid sequence selected from SEQ ID NOs. 13-27.
In some embodiments, the fusion protein comprises, consists of, or consists essentially the amino acid sequence of SEQ ID NO: 34, 35, 36, 37, 38, or 39. In some embodiments, the fusion protein comprises, consists of, or consists essentially an amino acid sequence that has 70%, 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% sequence identity to SEQ ID NO: 34. In some embodiments, the fusion protein comprises, consists of, or consists essentially an amino acid sequence that has 70%, 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% sequence identity to SEQ ID NO: 35. In some embodiments, the fusion protein comprises, consists of, or consists essentially an amino acid sequence that has 70%, 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% sequence identity to SEQ ID NO: 36. In some embodiments, the fusion protein comprises, consists of, or consists essentially an amino acid sequence that has 70%, 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% sequence identity to SEQ ID NO: 37. In some embodiments, the fusion protein comprises, consists of, or consists essentially an amino acid sequence that has 70%, 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% sequence identity to SEQ ID NO: 38. In some embodiments, the fusion protein comprises, consists of, or consists essentially an amino acid sequence that has 70%, 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% sequence identity to SEQ ID NO: 39.
In some embodiments, the fusion protein is capable of producing a cyclized polyketide from an acyl-CoA substrate. In some embodiments, the acyl-CoA substrate contains a carboxylic acid that contains two to twenty-two carbons. In other embodiments the Acyl-CoA is selected from the group consisting of acetyl-CoA, Hexanoyl-CoA, octanoyl-CoA, decanoyl-CoA, dodecanoyl-CoA, Palmitoleyl-CoA, Linoleyl-CoA, Palmitoyl-CoA, butyryl-CoA, and Oleyl-CoA. In some embodiments, the fusion protein is capable of producing olivetolic acid from Hexanoyl-CoA. In some embodiments, the fusion protein is capable of producing divarinic acid from Butyryl-CoA. In some embodiments, the fusion protein is capable of producing an OA analog or a DVA analog,
In some embodiments, the fusion protein is capable of producing a ratio of olivetolic acid to olivetol from Hexanoyl-CoA at a ratio of greater than 0.05, 0.06, 0.07, 0.08, 0.09, 0.1, 0.2, 0.3, 0.4, or 0.5. In some embodiments, the fusion protein is capable of producing a ratio of olivetolic acid to olivetol from Hexanoyl-CoA at a ratio of greater than 0.1. In some embodiments, the fusion protein is capable of producing a ratio of divarinic acid to divarinol from Butyryl-CoA at a ratio of greater than 0.05, 0.06, 0.07, 0.08, 0.09, 0.1, 0.2, 0.3, 0.4, or 0.5. In some embodiments, the fusion protein is capable of producing a ratio of divarinic acid to divarinol from Butyryl-CoA at a ratio of greater than 0.1.
In some embodiments, the polypeptide having polyketide synthase activity is located at the N-terminus. In some embodiments, the polypeptide having polyketide cyclase activity is located at the N-terminus.
In some embodiments, a fusion protein described herein comprises a C-terminal and/or an N-terminal scaffolding tag. In some embodiments, the scaffolding tag is capable of forming homodimers. In some embodiments, the scaffolding tag is capable of forming heterodimers. In some embodiments, the scaffolding tag is capable of forming both homodimers and homodimers. In some embodiments, the scaffolding tag has the amino acid sequence of SEQ ID NO: 66 (P3) or 67 (P4). In some embodiments, the fusion protein having a scaffolding tag comprises the amino acid sequence of SEQ ID NO: 46. In some embodiments, the PKC having a scaffolding tag comprises an amino acid sequence having at least about 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity to SEQ ID NO: 46. In some embodiments, a fusion protein having a scaffolding tag is expressed in a cell at a level that is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold higher than the level of a fusion protein having the same amino acid sequence but not having a scaffolding tag. In some embodiments, a fusion protein having a scaffolding tag is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold more soluble than a fusion protein having the same amino acid sequence but not having a scaffolding tag. In some embodiments, a fusion protein having a scaffolding tag is at least 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.5-fold, 5-fold, or at least 10-fold more active than a fusion protein having the same amino acid sequence but not having a scaffolding tag.
Some aspects of the present disclosure are directed to a cell comprising a fusion protein disclosed herein. In some embodiments, the cell is a transgenic cell (e.g., the polyketide synthase is coded by a heterologous sequence). In some embodiments, the cell is a yeast cell, in a bacterial cell, in an algae cell, or in a plant cell. In some embodiments, the cell is a yeast cell. In some embodiments, the yeast is an oleaginous yeast (e.g., a Yarrowia lipolytica strain). The cell is not limited and may be any cell disclosed herein.
Some aspects of the present disclosure are directed to a polynucleotide coding for a fusion protein disclosed herein.
An expression vector or vectors can be constructed to include exogenous nucleotide sequences coding for the recombinant polypeptides described herein operably linked to expression control sequences functional in the cell. Any suitable expression vector disclosed herein may be used.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence having at least 70% identity to SEQ ID NO: 34, 35, 36, 37, 38, or 39. In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence having at least 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 34, 35, 36, 37, 38, or 39.
In some embodiments, the cell comprises an exogenous nucleotide sequence coding for a recombinant polypeptide comprising an amino acid sequence with 1-40 amino acid modifications as compared to SEQ ID NO: 34, 35, 36, 37, 38, or 39 and, optionally, one to twenty amino acids deleted from the C-terminus or N-terminus.
Cells with Improved Cannabinoid and Cannabinoid Precursor Production
Some aspects of the present disclosure are directed to a cell comprising an exogenous nucleotide sequence coding for at least one of the following: (a) a polyketide synthase (e.g., a polyketide synthase described herein) comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 68, wherein the polyketide synthase has polyketide synthase (PKS) activity; (b) a polyketide cyclase (e.g., a polyketide cyclase described herein) comprising an amino acid sequence with at least 70% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80, wherein the polyketide cyclase has polyketide cyclase (PKS) activity; (c) a fusion protein (e.g., a fusion protein described herein) comprising a polypeptide having polyketide synthase activity and a polypeptide having polyketide cyclase activity; (d) an enzyme having acyl-CoA synthetase activity and (e) a fusion protein (e.g., a fusion protein described herein) comprising a polypeptide having polyketide synthase activity and a polypeptide having acyl-CoA synthetase activity. In some embodiments, the cell comprises an exogenous nucleotide sequence coding for at least two of (a)-(e). In some embodiments, the cell comprises an exogenous nucleotide sequence coding for at least three of (a)-(e). In some embodiments, the cell comprises an exogenous nucleotide sequence coding for all four of (a)-(d). In some embodiments, the cell comprises (a), (b), and (d). In some embodiments, the cell comprises (a), (c), and (d). In some embodiments, the cell comprises (b), (c), and (d). In some embodiments, the cell comprises (c) and (e), In some embodiments the cell comprises (b) and (e).
The polyketide synthase of (a) is not limited and may be any suitable polyketide synthase described herein. In some embodiments, the polyketide synthase of (a) comprises the amino acid sequence of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, or 68. In some embodiments, the cell comprises (a) a polyketide synthase, wherein the polyketide synthase is capable of producing a tetraketide from one or more acyl-CoA substrates selected from carboxylic acids with two to twenty-two carbons, such as for example Acetyl-CoA, Butyryl-CoA, Hexanoyl-CoA, octanoyl-CoA, decanoyl-CoA, dodecanoyl-CoA, myristoyl-CoA Palmitoleyl-CoA, Linoleyl-CoA, Palmityl-CoA, and Oleyl-CoA.
The polyketide cyclase of (b) is not limited and may be any suitable polyketide cyclase described herein. In some embodiments, the cell comprises (b) a polyketide cyclase, wherein the polyketide cyclase is capable of producing olivetolic acid (OA), an OA analog, divarinic acid (DVA), or a DVA analog from a tetraketide.
The fusion protein of (c) is not limited and may be any suitable fusion protein described herein. In some embodiments, the cell comprises (c) a fusion protein capable of producing olivetolic acid from Hexanoyl-CoA and/or divarinic acid from Butyryl-CoA.
The enzyme having acyl-CoA synthetase activity is not limited and may be any suitable enzyme. In some embodiments, the cell comprises (d) an enzyme having acyl-CoA synthetase activity. In some embodiments, the enzyme has hexanoyl-CoA synthetase (HCS) activity or butyryl-CoA synthetase activity. In some embodiments, the enzyme having HCS activity comprises an amino acid sequence selected from SEQ ID NO: 28, 29, 30, 31, 32, and 33 or an amino acid sequence with at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, 99.5%, or 99.9% identity to SEQ ID NO: 28, 29, 30, 31, 32, or 33.
In some embodiments, the cell comprises a polyketide synthase described herein and a polyketide cyclase described herein. In some embodiments, the cell comprises a polyketide synthase comprising an amino acid sequence with at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, 995, 99.5%, or 99.9% identity to SEQ ID NO:1, 2, 3, 4, 5, 6, 7, or 8 and a polyketide cyclase comprising an amino acid sequence with at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, 99.5% or 99.9% identity to SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80. In some embodiments, the cell comprises a polyketide synthase comprising an amino acid sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, or 8 and a polyketide cyclase comprising an amino acid sequence of SEQ ID NO: 9, 10, 11, 12, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, or 80. In some embodiments, the cell comprises a fusion protein of SEQ ID NO: 34, 35, 36, 37, 38, or 46.
In some embodiments, the cell further comprises an exogenous polynucleotide coding for a chalcone isomerase-like (CHIL) protein heterologous to the cell. CHIL are non-catalytic proteins that are ubiquitous in plant genomes and in this case are thought to interact with CHS to increase its activity and selectivity (Waki T, et al Nature Communications 2020, 11, 870). The CHIL protein is not limited and may be any suitable CHIL protein. In some embodiments, the exogenous CHIL protein increases OA or CBGA titers and reduces the byproducts OL, HTAL, and PDAL. In some embodiment, the CHIL protein is selected from SEQ ID NOs. 49-56.
In some embodiments, the cell is capable of utilizing hexanoic acid to produce olivetolic acid. In some embodiments, the cell is capable of utilizing butyric acid to produce divarinic acid. In some embodiments, the cell is capable of utilizing octanoic acid, decanoic acid, dodecanoic acid, oleic acid, palmitic acid, myristic acid or stearic acid to produce one or more olivetolic acid analogs.
In some embodiments, the cell comprises an upregulated MVA pathway.
In some embodiments, the cell expresses a CBGA synthase, a CBGVA synthase or a synthase that can condense 6-alkyl-2,4-dihydroxy benzoate (
In some embodiments, the cell is a yeast cell, in a bacterial cell, in an algae cell, or in a plant cell. In some embodiments, the cell is a yeast cell. In some embodiments, the yeast is an oleaginous yeast (e.g., a Yarrowia lipolytica strain). The cell is not limited and may be any suitable cell disclosed herein.
In some embodiments, the cell described herein comprises one or more additional metabolic pathway transgene(s). In some embodiments, the cell comprises an olivetolic acid pathway. In some embodiments, the olivetolic acid pathway comprises a polyketide cyclase. In some embodiments, an exogenous nucleotide codes for the polyketide cyclase. In some embodiments, the olivetolic acid pathway comprises polyketide synthase/olivetol synthase (condensation of hexanoyl coenzyme A (CoA) and 3× malonyl CoAs). In some embodiments, the cell comprises a geranyl pyrophosphate (GPP) pathway. In some embodiments, the GPP pathway comprises geranyl pyrophosphate synthase. In some embodiments, an exogenous nucleotide codes for the geranyl pyrophosphate synthase. In some embodiments, the cell comprises a farnesyl pyrophosphate (FPP) pathway. In some embodiments, the FPP pathway comprises a farnesyl pyrophosphate synthase. In some embodiments, the farnesyl pyrophosphate synthase is a mutant form. In some embodiments, the mutant farnesyl pyrophosphate synthase is described in (Jian G-Z, et al Metabolic Engineering, 2017, 41, 57, incorporated herein). In some embodiments, an exogenous nucleotide codes for the farnesyl pyrophosphate synthase. In some embodiments, the cell comprises a divarinic acid (DVA) pathway. In some embodiments, the DVA pathway comprises divarinic acid synthase. In some embodiments, an exogenous nucleotide codes for the divarinic acid synthase. In some embodiments, the cell comprises a mevalonate pathway. In some embodiments, the cell expresses HMG-CoA reductase. In some embodiments, an endogenous mevalonate pathway of the cell has been manipulated to reduce or increase production of mevalonate, isopentyl pyrophosphate (IPP) or dimethylallyl pyrophosphate (DMAP), geranyl pyrophosphate (GPP) or farnesyl pyrophosphate (FPP). In some embodiments, the cell comprises a polyketide cyclase that produces OA, DVA, and/or derivatives thereof. In some embodiments, the cell comprises a polyketide synthase that produces a tetraketide substrate of the polyketide cyclase. In some embodiments, the cell comprises a polyketide synthase that can directly form OA and derivatives from acetyl-CoA or butyryl-CoA or hexanoyl-CoA and malonyl-CoA.
In some embodiments, the cell is capable of producing a cannabinoid, a cannabinoid derivative, or cannabinoid analogue. The cannabinoids are not limited and may be any cannabinoid described herein. In some embodiments, the cannabinoid is selected from tetrahydrocannabinolic acid, cannabidiolic acid, cannabigerolic acid, or analogue thereof.
In some embodiments, production of the cannabinoid by the cell is under control of a constitutional or inducible promoter. The promoter is not limited and may be any suitable promoter known in the art.
Some aspects of the present disclosure are directed to a composition comprising a cannabinoid, cannabinoid derivative, or cannabinoid analogue produced by a cell disclosed herein. In some embodiments, the composition further comprises a cell as described herein. In some embodiments, the composition comprises purified or isolated cannabinoid, cannabinoid derivative, or cannabinoid analogue produced by a cell disclosed herein. In some embodiments, the composition comprises cannabigerolic acid, tetrahydrocannabinolic acid, cannabidiolic acid, cannabigerolic acid, or derivative, or an analogue thereof.
Some aspects of the present disclosure are directed to methods of producing olivetolic acid (OA), OA analogs, divarinic acid (DVA), or DVA analogs comprising contacting a transgenic cell as described herein with a fatty acid under suitable conditions to produce the olivetolic acid (OA), OA analogs, divarinic acid (DVA), or DVA analogs. The OA and DVA analogs are not limited. In some embodiments, the OA analogs and DVA analogs are 6-alkyl-2,4-dihydroxy benzoic acid provided in
Some aspects of the present disclosure are directed to contacting a cell comprising a polyketide synthase as described herein with a fatty acid under suitable conditions to produce a cannabinoid or cannabinoid precursor as described herein. Some aspects of the present disclosure are directed to contacting a cell comprising a polyketide cyclase as described herein with a fatty acid under suitable conditions to produce a cannabinoid or cannabinoid precursor as described herein. Some aspects of the present disclosure are directed to contacting a cell comprising a fusion protein as described herein with a fatty acid under suitable conditions to produce a cannabinoid or cannabinoid precursor as described herein.
In some embodiments, the cell contacted with a fatty acid has an upregulated MVA pathway and/or expresses CBGA synthase. In some embodiments, the cell contacted with a fatty acid has a synthase that can condense 6-alkyl-2,4-dihydroxy benzoic acid with GPP to produce a CBGA analog (e.g., as shown in
In some embodiments, the cell produces one or more of OA, GPP, FPP, and mevalonate (MVA). In some embodiments, the cell produces OA and FPP. In some embodiments, the cell produces OA and GPP. In some embodiments, the cell produces MVA and GPP. In some embodiments, one or more of OA, geraniol, farnesol, prenol, isoprenol, and MVA is provided in a culture medium for use by the cell.
Depending on the cell, the appropriate culture medium may be used. For example, descriptions of various culture media may be found in “Manual of Methods for General Bacteriology” of the American Society for Bacteriology (Washington D.C., USA, 1981). As used here, “medium” as it relates to the growth source refers to the starting medium be it in a solid or liquid form. “Cultured medium”, on the other hand and as used here refers to medium (e.g. liquid medium) containing microbes that have been fermentatively grown and can include other cellular biomass. The medium generally includes one or more carbon sources, nitrogen sources, inorganic salts, vitamins and/or trace elements.
Exemplary carbon sources include sugar carbons such as sucrose, glucose, galactose, fructose, mannose, mannitol, isomaltose, xylose, pannose, maltose, arabinose, cellobiose and 3-, 4-, or 5-oligomers thereof. Other carbon sources include alcohol carbon sources such as methanol, ethanol, glycerol. Other carbon sources include acid and esters such as acetate, formate, fatty acids having four to twenty-two carbon atoms or fatty acid esters thereof. Other carbon sources can include renewal feedstocks and biomass. Exemplary renewal feedstocks include cellulosic biomass, hemicellulosic biomass and lignin feedstocks. Mixed carbon sources can also be used, such as a fatty acid and a sugar as described herein.
The culture conditions can include, for example, liquid culture procedures as well as fermentation and other large-scale culture procedures. Useful yields of the products can be obtained under aerobic culture conditions. An exemplary growth condition for achieving, one or more cannabinoid products includes aerobic culture or fermentation conditions. In certain embodiments, the microbial organism can be sustained, cultured or fermented under aerobic conditions.
Substantially aerobic conditions include, for example, a culture, batch fermentation or continuous fermentation such that the dissolved oxygen concentration in the medium remains between 5% and 100% of saturation. The percent of dissolved oxygen can be maintained by, for example, sparging air, pure oxygen or a mixture of air and oxygen.
The culture conditions can be scaled up and grown continuously for manufacturing cannabinoid product. Exemplary growth procedures include, for example, fed-batch fermentation and batch separation; fed-batch fermentation and continuous separation, or continuous fermentation and continuous separation. All of these processes are well known in the art. Fermentation procedures are particularly useful for the biosynthetic production of commercial quantities of cannabinoid product. Generally, and as with non-continuous culture procedures, the continuous and/or near-continuous production of cannabinoid product will include culturing a cannabinoid producing organism on sufficient nutrients and medium to sustain and/or nearly sustain growth in an exponential phase. Continuous culture under such conditions can include, for example, 1 day, 2, 3, 4, 5, 6 or 7 days or more. Additionally, continuous culture can include 1 week, 2, 3, 4 or 5 or more weeks and up to several months. Alternatively, the desired microorganism can be cultured for hours, if suitable for a particular application. It is to be understood that the continuous and/or near-continuous culture conditions also can include all time intervals in between these exemplary periods. It is further understood that the time of culturing the microbial organism is for a sufficient period of time to produce a sufficient amount of product for a desired purpose.
Fermentation procedures are well known in the art. Briefly, fermentation for the biosynthetic production of cannabinoid product can be utilized in, for example, fed-batch fermentation and batch separation; fed-batch fermentation and continuous separation, or continuous fermentation and continuous separation. Examples of batch and continuous fermentation procedures are well known in the art.
In some embodiments, the methods further comprise a step of purifying or isolating the cannabinoids, derivatives or analogues thereof from the culture. Methods of isolation are not limited and may be any suitable method known in the art. Purification methods include, for example, extraction procedures as well as methods that include continuous liquid-liquid extraction, pervaporation, evaporation, filtration, membrane filtration (including reverse osmosis, nanofiltration, ultrafiltration, and microfiltration), membrane filtration with diafiltration, membrane separation, reverse osmosis, electrodialysis, distillation, extractive distillation, reactive distillation, azeotropic distillation, crystallization and recrystallization, centrifugation, extractive filtration, ion exchange chromatography, size exclusion chromatography, adsorption chromatography, carbon adsorption, hydrogenation, and ultrafiltration or centrifugal partition chromatography (CPC).
In some embodiments, the cells are grown in stirred tank fermenters with feed supplementation (sugars with or without organic acids) where the dissolved oxygen, temperature, and pH are be controlled according to the optimal growth and production process. In some embodiments, aqueous non-miscible organic solvents are supplemented to dissolve added organic acids or extract the cannabinoid products as they are being synthesized. In some embodiments, these solvents may include, but are not limited to, isopropyl myristate (IPM), diisobutyl adipate, Bis(2-ethylhexyl) adipate, decane, dodecane, hexadecane or anther organic solvent with log P>5. The later number (log P) is defined as the log of a compound's partition between water and octanol and is a standard parameter of a compound's hydrophobicity (the larger the log P the less soluble in water). Depending on the fermentation process, the products can be isolated and purified using different methods.
If no organic cosolvent is used different methods can be applied. In one embodiment, insoluble products (CBGA, CBDA, THCA and similar compounds) precipitate together with the cell biomass and the solids are isolated from the liquid supernatant using centrifugation (ultra)filtration or spray drying. The cannabinoid containing cell pellet is then washed with an aqueous miscible organic solvent (ethanol, acetonitrile, etc.) that dissolves the cannabinoid. The soluble cannabinoid solution is then separated by the insoluble cells by filtration. Evaporation of the organic solvent produces an oil that contains the cannabinoid and other oil extracts. In some embodiments the dried cannabinoid cells biomass (produced by spray drying or drying wet cell pellet) can be extracted using supercritical carbon dioxide. Evaporation of the CO2 will produce an oil that contains the extracted cannabinoid in the form of an oil. The cannabinoids from the oils obtained from ethanol extraction or CO2 extraction can be further purified using methods known to the cannabis industry. These can include fractional distillation, crystallization, chromatography or a combination of these techniques. Alternatively, the cell supernatant can be extracted with an aqueous immiscible organic solvent (ethyl acetate, heptane, decane, etc.) to extract the cannabinoids. Evaporation of the organic solvent and produces a cannabis containing oil that can be further purified as described above.
In some embodiments, an organic solvent is required during growth that is separated at the end of the fermentation. Back extraction with basic aqueous solvent or a different organic solvent with low boiling point and high polarity (ethanol, acetonitrile, etc.) will remove the cannabinoids. Isolation can then involve a simple pH shift if water is used, or an evaporation if organic solvents are used. In both cases, a recrystallization step may be required at the end to improve purity of the product.
Specific examples of certain aspects of the inventions disclosed herein are set forth below in the Examples.
One skilled in the art readily appreciates that the present invention is well adapted to carry out the objects and obtain the ends and advantages mentioned, as well as those inherent therein. The details of the description and the examples herein are representative of certain embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Modifications therein and other uses will occur to those skilled in the art. These modifications are encompassed within the spirit of the invention. It will be readily apparent to a person skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention.
The articles “a” and “an” as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to include the plural referents. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The invention also includes embodiments in which more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process. Furthermore, it is to be understood that the invention provides all variations, combinations, and permutations in which one or more limitations, elements, clauses, descriptive terms, etc., from one or more of the listed claims is introduced into another claim dependent on the same base claim (or, as relevant, any other claim) unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise. It is contemplated that all embodiments described herein are applicable to all different aspects of the invention where appropriate. It is also contemplated that any of the embodiments or aspects can be freely combined with one or more other such embodiments or aspects whenever appropriate. Where elements are presented as lists, e.g., in Markush group or similar format, it is to be understood that each subgroup of the elements is also disclosed, and any element(s) can be removed from the group. It should be understood that, in general, where the invention, or aspects of the invention, is/are referred to as comprising particular elements, features, etc., certain embodiments of the invention or aspects of the invention consist, or consist essentially of, such elements, features, etc. For purposes of simplicity those embodiments have not in every case been specifically set forth in so many words herein. It should also be understood that any embodiment or aspect of the invention can be explicitly excluded from the claims, regardless of whether the specific exclusion is recited in the specification. For example, any one or more nucleic acids, polypeptides, cells, species or types of organism, disorders, subjects, or combinations thereof, can be excluded.
Where the claims or description relate to a composition of matter, e.g., a nucleic acid, polypeptide, or cell, it is to be understood that methods of making or using the composition of matter according to any of the methods disclosed herein, and methods of using the composition of matter for any of the purposes disclosed herein are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise. Where the claims or description relate to a method, e.g., it is to be understood that methods of making compositions useful for performing the method, and products produced according to the method, are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise.
Where ranges are given herein, the invention includes embodiments in which the endpoints are included, embodiments in which both endpoints are excluded, and embodiments in which one endpoint is included and the other is excluded. It should be assumed that both endpoints are included unless indicated otherwise. Furthermore, it is to be understood that unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or subrange within the stated ranges in different embodiments of the invention, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise. It is also understood that where a series of numerical values is stated herein, the invention includes embodiments that relate analogously to any intervening value or range defined by any two values in the series, and that the lowest value may be taken as a minimum and the greatest value may be taken as a maximum. Numerical values, as used herein, include values expressed as percentages. For any embodiment of the invention in which a numerical value is prefaced by “about” or “approximately”, the invention includes an embodiment in which the exact value is recited. For any embodiment of the invention in which a numerical value is not prefaced by “about” or “approximately”, the invention includes an embodiment in which the value is prefaced by “about” or “approximately”. “Approximately” or “about” generally includes numbers that fall within a range of 1% or in some embodiments within a range of 5% of a number or in some embodiments within a range of 10% of a number in either direction (greater than or less than the number) unless otherwise stated or otherwise evident from the context (except where such number would impermissibly exceed 100% of a possible value). It should be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one act, the order of the acts of the method is not necessarily limited to the order in which the acts of the method are recited, but the invention includes embodiments in which the order is so limited. It should also be understood that unless otherwise indicated or evident from the context, any product or composition described herein may be considered “isolated”.
A detailed description of the biosynthesis pathways to all common cannabinoids is shown in
In contrast to MVA pathway, enzymes for the biosynthesis of olivetolic acid and some of its analogs are only present in a small number of organisms, mainly plants (T Gulk, BL Møller Trends in Plant Science 2020, 25, 985), while enzymes for the last steps of cannabinoid formation (CBGA, CBGVA etc) have so far only been identified in cannabis and related plants. Consequently, most academic or industrial groups that are developing recombinant organisms (most commonly Saccharomyces cerevisiae yeast and E. coli) for the synthesis of cannabinoids are utilizing the cannabis derived enzymes for the synthesis of OA and CBGA (JD Keasling, et al, Nature 2019, 567, 123-126). For the synthesis of OA, these enzymes are hexanoyl-CoA synthetase (HCS), tetraketide synthase or polyketide synthase (PKS) and olivetolic acid cyclase or polyketide cyclase (PKC) from C. Sativa.
As described in the general methods below, a number of natural sequences (>40) with potential activity towards the formation of OA or OA analogs were identified, cloned in expression vectors and screened for activity.
Key findings: (1) New non-cannabis polyketide synthases, PKSs, with activity towards producing OA and DVA (when co-expressed with PKC) were identified. (2) Enzymes with higher activity for producing OA and OL were identified. (3) In addition to OA, OA or OL analogs with acyl chains of different lengths can be accessed using these enzymes (
The strategy taken to identify polyketide synthases (PKS) with putative OA/OL (with/without PKC present) production activity relied on three general approaches. The first approach involved identifying and selecting sequence homologs of PKS1 (the native Cannabis enzyme with known OA/OL production activity). The second relied on a literature search of enzymes that produce similar products using analogous substrates. The third utilized artificial intelligence methods to identify potential enzymes with activity to produce OA/OL. After this analysis, a total of 33 new enzymes were identified and cloned including PKS1. Each of these was expressed in E. coli, purified, and screened for OL production from hexanoyl-CoA and malonyl-CoA. Based on these results, a subset was screened for activity in Yarrowia whole cells with fed hexanoic acid with PKS23 producing the highest concentration of OL (besides PKS1). Eleven sequence homologs of PKS23 were then identified and screened in Yarrowia whole cells with fed hexanoic acid. A table listing the relative sequence identities shared by the enzymes that produced OL is shown below.
As described below, plasmids pCL-SE-0332.PKS1, pCL-SE-0332.PKS34, pCL-SE-0332.PKS35, pCL-SE-0332.PKS36, pCL-SE-0332.PKS37, pCL-SE-0332.PKS40, and pCL-SE-0332.PKS41 were transformed into strain sCL-SE-0041 and three separate colonies each were patched and precultured for 48 h. Assay cultures consisted of YPD medium with 1 mg/mL hygromycin and 2.5 mM butyric, hexanoic, octanoic, lauric, or myristic acid. After 24 h, an additional 10 mM fatty acid was added to each assay culture. Cultures were quenched with equal volume of Ethanol after 48 h total growth and were analyzed by HPLC for product formation. PKS1 produced divarinol or olivetol from fed butyric or hexanoic acid, respectively. PKS34 and PKS37 produced divarinol from fed butyric acid. PKS34-37,40-41 all produced olivetol or olivetol analogs (no PKC is present in the strains) from fed hexanoic, octanoic, lauric, or myristic acid, as well as high concentrations of olivetol analogs with alkyl side chains of different length than the fed fatty acids, likely using fatty acyl-CoAs produced natively in the host. Corresponding PDAL and olivetolic acid analogs were also detected at lower concentrations than the olivetol analogs. Averages and standard deviations were calculated from replicates.
sCL-SE-0041 does not have polyketide synthase (PKS) activity. However. the plasmids in this example contain expression cassettes that encode for different PKS genes and the HCS2 gene. HCS2 improves activation of supplemented hexanoic acid to hexanoyl-CoA which is a substrate for some OL producing PKSs.
In the above table, all PKSs examined enabled OL production from hexanoic acid supplementation. In addition, the PKS 34-37 and 40-41 produced polyketides with different length alkyl chains that correspond to the native fatty acids reported to be produced by Yarrowia species (C2, C16, C18, C18:1, C18:2; P. Xu, K. Qiao, W. S Ahn, and G Stephanopoulos PNAS 2016 113 (39) 10848-10853). These PKSs can therefore be used for the synthesis of OL analogs (Compound A,
Only olivetol analogs are synthesized in this experiment because the Yarrowia strain (sCL-SE-0041) used for this screen does not contain polyketide cyclase activity (PKC) (
This example shows that strains expressing some of the PKSs discovered herein produce higher amounts of OL (compared to PKS1 from Cannabis) in the presence of hexanoic acid, which should translate to higher amounts of OA in the presence of a PKC. An example of testing their activity in vivo with PKC1.1 is shown below Furthermore, these enzymes enable the production of OL analogs, which can enable the production of cannabinoid analogs, from a diverse set of acyl-CoAs including long chain fatty acids and other fatty acid analogs as shown in Example 4.
As described below, plasmids pCL-SE-0332.PKS1, pCL-SE-0332.PKS23, pCL-SE-0332.PKS34, pCL-SE-0332.PKS35, pCL-SE-0332.PKS36, pCL-SE-0332.PKS37, pCL-SE-0332.PKS40, and pCL-SE-0332.PKS41 were transformed into strain SB-0665, which expresses PKC1.1 and HCS2, and four separate colonies each were patched and precultured for 48 h. Assay cultures consisted of YPD media with 1 mg/mL hygromycin and 2.5 mM hexanoic acid. After 24 h, an additional 10 mM hexanoic acid was added to each assay culture. Cultures were quenched after 48 h total growth. Enzymes produced olivetolic acid, olivetol, and PDAL.
When a PKC was present, all PKSs produced olivetolic acid in addition to olivetol, showing that the PKSs' product is the linear tetraketide molecule that can then be cyclized by the PKC. In addition, PKS34 and PKS37 produced significantly higher amounts of OA compared to PKS1.
Mutagenesis of the discovered PKSs can be performed to improve selectivity towards specific OL and OA analogs, as well as to improve the enzyme's kinetic properties (KM & kcat) for a specified substrate. For example, engineering can be performed to reduce the PKSs selectivity towards larger fatty acids and increase hexanoyl-CoA activity. Engineering of these enzyme begins by creating structural models as described in Section F of the general techniques (
Because of the high homology between all of these PKSs (Table 1) the same mutations that improve activity and selectivity should be transferable to all proteins.
The final amount of OA (or OA analogs—Compound B
As described in the general methods below, 37 natural sequences with potential activity towards the formation of OA (or OA analogs) were identified. The enzymes were first expressed in E. coli, purified, and were assayed in vitro for OA formation in the presence of PKS1 and hexanoyl-CoA. Only one enzyme, PKC4, formed detectable amounts of OA. In a first step, a library of chimeric sequences was made by transferring sequence elements of PKC1 to PKC4 as described in the following examples. This method created one chimera, PKC4.8, that was used as a template for further mutagenesis.
(1) A natural, PKC4 with activity towards OA formation using hexanoic and fatty acids with different carbon chains (where PKS1 was not active) was discovered. Mutagenesis positions to tune the enzyme towards certain OA analogs are also disclosed.
(2) A novel non-natural sequence, PKC4.8 (derived from PKC4), with good activity towards OA formation was developed. Mutagenesis of this enzyme to improve its activity further towards OA and DVA is ongoing, yet enzymes with improved activity have already been discovered and are identified as PKC4.11, PKC4.15, PKC4.17, PKC4.19 PKC4.30-4.33, and PKC4.35-38 (Example 7c) (i.e. SEQ ID NOs 69-80)
(3) Fusions of PKS-PKC were successful in improving OA titers and OA/OL ratio.
(4) Fusion with ubiquitin of PKC1/4.8 at N-terminal OR “zipping” N- and C-terminal to improve enzyme expression and stability have produced enzymes with better activity and OA/OL selectivity
(5) Combination of PKS discovered herein (such as PKS23), with PKC4 and PKC4.8 show that all OA analogs shown in
All enzymes described in this experiment were recombinantly expressed in E. coli and then purified as described in Experimental. In vitro reactions were performed by mixing purified selected PKSs and PKCs (final concentration 2.5 μM and 25 μM respectively) with selected acyl-CoAs (250 μM) and malonyl-CoA (750 μM). In this assay the addition of PKSs was required to synthesize the tetraketide substrates that were then converted to OA and OL products. The reactions were incubated for 60 or 100 min at 30 C before the samples were quenched with equal volume of ethanol and were analyzed for products using HPLC/MS as described in the analytical methods.
The above results show that PKC4 and PKC4.8 can synthesize OA with either PKS1 or PKS23. The combination of PKS and PKC affects product ratio as can be seen by the difference in the OA titers between PKS1/PKC4.8 and PKS23/PKC4.8.
Screening of PKC4 and PKC4.8 with different fatty acids: The possibility of making a large variety of OA analogs that are shown in
By looking at the total products formed in each reaction (OA+OL) it appears that PKS23 is more selective for C6-C14 acyl-CoAs, with strong preference for octanoic (C8) and decanoic (C10). On the other hand, PKC4 shows preference for tetraketides from C8 to C14 fatty acids, while PKC4.8 prefers C6-C8 (and C4 as shown in the in vivo assays of Example 5) fatty acids. In conclusion, the above results show that all OA analogs described in
As described in detail below, plasmids pCL-SE-0331, pCL-SE-0331.PKC1, pCL-SE-0331.PKC4, and pCL-SE-0331.PKC4.8 were transformed into strain SB-0109, which expresses PKS1 and HCS2, and four separate colonies from each transformation were patched and precultured for 48 h. Assay cultures consisted of YPD media with 1 mg/mL hygromycin and 2.5 mM hexanoic acid. After 24 h, an additional 10 mM hexanoic acid was added to each assay culture. Cultures were quenched with equal volume of ethanol after 48 h total growth and were analyzed by HPLC-MS. Enzymes produced olivetol and olivetolic acid. Averages and standard deviations were calculated from replicates. This experiment can be done with alternative PKSs expressed in the strain and/or with fatty acids of different chain lengths (butyric acid, octanoic acid, decanoic acid, etc.).
In agreement with the in vitro results, PKC4 has very low activity for hexanoyl-CoA in vivo. Similarly, PKC4.8, produces roughly 30-fold more olivetolic acid compared to PKC4 and almost two thirds that of PKC1 in the presence of PKS1.
The novel PKC4.8 discovered herein shows good activity for OA and Divarinic acid (DVA). Further mutagenesis (described below) will increase its activity towards OA and DVA. Mutants with selectivity for other OA analogs (N=6-16,
This example shows that fusion proteins comprised of PKS and PKC can improve both the titers of OA and the OA/OL ratio. To optimize the activity of fused enzymes, numerous linker sequences were evaluated based on length and flexibility.
As described below, plasmids pCL-SE-0476 to 0481 were transformed into strain SB-0109. For each transformation, four separate colonies were patched and precultured for 24 h. Assay cultures consisted of 500 μL YPD buffered with 100 mM MES pH 6.5 containing 1 mg/mL hygromycin and 10 mM hexanoic acid. After 24 h the cultures were quenched with equal volume of ethanol and were analyzed for olivetol and olivetolic acid by HPLC-MS. Averages and standard deviations were calculated from replicates. This experiment can be done with alternative PKSs expressed in the strain. The results are shown below (Table 8):
The above table show that when SB-0109 is transformed with the empty vector it cannot produce olivetolic acid and as expected, accumulates olivetol. The PKC-PKS fusions enable SB-0109 to produce olivetolic acid (pCL-SE-0476, 0477). The fusions with PKS at the N-terminus (pCL-SE-0487-0481) were much less active but still produced some OA product. The fusion with linker F2 (pCL-SE-0477) produced the highest levels of olivetolic acid and the best OA to OL ratio.
As described Experimental Section, the fusion construct, pCL-SE-0477 and the vector control, pCL-SE-0016 were transformed into SB-0264 to see if it can improve both olivetolic acid production and the ratio of olivetolic acid to olivetol. For each transformation, eight separate colonies were patched and precultured for 24 h. Assay cultures consisted of YPD buffered with 100 mM MES pH 6.5 containing 1 mg/ml hygromycin and 10 mM hexanoic acid. After 24 h the cultures were quenched and analyzed for olivetol and olivetolic acid. Averages and standard deviations were calculated from replicates. This experiment can be done with alternative PKSs expressed in the strain. The results are shown below (Table 9):
These data show that when strain SB-0264 is transformed with the empty vector it produces slightly more olivetol than olivetolic acid. However, when it is transformed with the PKC-PKS fusion there is a marked improvement in olivetolic acid formation and an improvement in the ratio of olivetolic acid to olivetol. Both results are advantageous for producing cannabinoids since olivetolic acid is the desired product of this reaction as it, not olivetol, is a key precursor for cannabinoid formation.
This example shows that all PKS-PKC fusions were active in Yarrowia, and fusions with linker F2 produced both the highest amount of OA and the best OA/OL ratio.
Low expression of most polyketide cyclases (PKCs) was observed in E. coli (most PKCs produced inclusion bodies) and low soluble expression was also observed in Yarrowia as evaluated by western blots. To increase the solubility and therefore the expression of PKC1, PKC4 and PKC4.8 the enzymes were fused at the N-terminal to ubiquitin (See, e.g., SEQ ID NOs. 43, 58, and 59). In one alternative strategy both the N- and C-terminus were fused to a small peptide that can dimerize. The later method may provide further stabilization of the PKCs by dimerizing or “zipping” the N- and C-terminus of the protein as shown in
To examine if a ubiquitin-tag would improve expression of PKC, a sequence coding for ubiquitin was introduced in front of the PKC1.1 sequence. This sequence is found in construct pCL-SE-0641 which was introduced into SB-0109. Construct pCL-SE-0640 (PKC1.1 without ubiquitin tag) was also introduced into SB-0109 as control. 7-8 separate colonies from each transformation were patched and precultured for 48 h. Assay cultures consisted of YNBD+CAA media with 2.5 mM hexanoic acid. After 24 h, an additional 5 mM hexanoic acid was added to each assay culture. Cultures were quenched after 48 h total growth. Averages and standard deviations were calculated from replicates.
This data shows that the ubiquitin-tagged version of PKC1.1 resulted in a reduction of OL production and significant improvement of the OA/OL ratio.
Plasmids pCL-SE-0676 and -0677 were linearized with AsiSI and transformed into SB-0109-10. Multiple clones per transformation were pre-cultured for 48 hours in 0.5 mL of YNBD (2%)+0.5% casamino acids+100 mM MES pH 6.5. 2 μL of the pre-culture was used to inoculate 0.5 mL of the same media with the addition of 2.5 mM hexanoic acid. At 24 hours, 25 μL of 40% glucose and 100 mM hexanoic acid stock was fed to the cultures for a final concentration of 2% glucose and 5 mM hexanoic acid. At 48 hours, the cultures were quenched and evaluated for olivetolic acid (OA) and olivetol (OL). These data are presented in the table below.
These data clearly show that adding dimerization sequences at the N- and C-terminus of PKC4.8 increased both the OA titer and the OA/OL ratio. This is most likely due to the increased expression and/or stability of the “zipped” construct.
As described in detail below, plasmids pCL-SE-0802, pCL-SE-0802.PKS1.1.PKC4.8, pCL-SE-0802.PKS1.1.PKC4.30, pCL-SE-0802.PKS1.1.PKC4.31, pCL-SE-0802.PKS1.1.PKC4.19, pCL-SE-0802.PKS1.1.PKC4.32, pCL-SE-0802.PKS1.1.PKC4.33, pCL-SE-0802.PKS1.1.PKC4.35, pCL-SE-0802.PKS1.1.PKC4.36, pCL-SE-0802.PKS1.1.PKC4.37, pCL-SE-0802.PKS1.1.PKC4.38, pCL-SE-0802.PKS1.1, were transformed into strain SB-0109, which expresses PKS1 and HCS2, and four separate colonies from each transformation were patched and precultured for 48 h. Assay cultures consisted of YPD media with 100 mM MES pH 6.5, 1 mg/mL hygromycin, and 2.5 mM hexanoic acid. After 24 h, an additional 5 mM hexanoic acid and 2% glucose was added to each assay culture. Cultures were quenched with equal volume of ethanol after 48 h total growth and were analyzed by HPLC-MS. Enzymes produced olivetol and olivetolic acid. Averages and standard deviations were calculated from replicates.
These data show that altering the N- and/or C-terminus by truncation or by adding dimerization domains can increase the OA titer and/or OA/OL ratio.
Improving the activity of PKC4 and PKC4.8 will eliminate HTAL and PDAL by-products that are forming from inadequate PKC activity and tetraketide accumulation (
In a first step of improving the PKC4/4.8 activity was the creation of a crystal structure model for each enzyme as described in the general methods. The active site was then identified and the olivetolic acid product was docked. All amino acids around the active site that may play a role in the substrate binding, activity and selectivity are identified and will be targeted for mutagenesis. In addition, amino acids in the dimer interface will also be targeted for mutagenesis in order to improve the dimer formation affinity, which may translate in better stability for the complex and may increase expression and tolerance to mutagenesis. Enzymes with good activity and selectivity for OA formation will be selected first using PKC4.8 as template. In subsequent screenings, mutants of PKC4.8 or PKC4 with improved activity cyclizing tetraketides with shorter or longer chain as shown in
Mutagenesis strategy will involve two parallel approaches. In the first approach, saturation mutagenesis will be performed in all amino acids in the active site. The best mutant from this screen will be selected as a template and additional saturation mutagenesis will be performed in the remaining amino acids (or a subset depending on results of the first screen). This process will be repeated multiple times until satisfactory activity is observed. The second approach will involve parallel mutagenesis of 2-5 amino acids but instead of changing each one with all 20 amino acids, only a subset will be used based on natural variation from homologs and/or amino acids identified in the previous SSM screen. Mutants will be screened as described in Example 5 in in vivo assays expressed in appropriate Yarrowia strains.
As described in detail below, plasmids expressing variants of PKC4.8 were transformed into strain SB-0109, which expresses PKS1 and HCS2, and colonies from each transformation were precultured for 48 h. Assay cultures consisted of YPD media with 100 mM MES pH 6.5, 1 mg/mL hygromycin, and 2.5 mM hexanoic acid. After 24 h, an additional 5 mM hexanoic acid and 2% glucose was added to each assay culture. Cultures were quenched with equal volume of ethanol after 48 h total growth and were analyzed by HPLC-MS. Enzymes produced olivetol and olivetolic acid. For the controls, averages and standard deviations were calculated from replicates.
To verify PKC4.33 activity, plasmids pCL-SE-0331.PKC4.8, pCL-SE-0331.PKC4.33, and pCL-SE-0331 were transformed into strain SB-0109, which expresses PKS1 and HCS2, and four colonies from each transformation were precultured for 48 h. Assay cultures consisted of YPD media with 100 mM MES pH 6.5, 1 mg/mL hygromycin, and 2.5 mM hexanoic acid. After 24 h, an additional 5 mM hexanoic acid and 2% glucose was added to each assay culture. Cultures were quenched with equal volume of ethanol after 48 h total growth and were analyzed by HPLC-MS. Enzymes produced olivetol and olivetolic acid. Averages and standard deviations were calculated from replicates.
Screening of further mutant and recombined enzymes are described below. The genes were cloned into the plasmid pCL-SE-0802.PKS1.1 and used to transform SB-0741 strain, which expresses HCS2. Six colonies from each transformation were used to inoculate pre-cultures for 48 hours in YDCM media containing 1 mg/mL hygromycin. Assay cultures were performed in the same media with 3 mM hexanoic or butanoic acid added 24 and 30 hours post inoculation. Cultures were quenched with equal volume of ethanol after 48 h total growth and were analyzed by HPLC-MS. Enzymes produced olivetol and olivetolic acid from hexanoic acid and divarinol and divarinic acid from butanoic acid. Averages and standard deviations were calculated from replicates.
Results for the screening are shown in Table 16 and 17
The formation of OL, HTAL and PDAL during the synthesis of OA by PKS and PKC may be due to a number of reasons. One likely explanation for HTAL and OL accumulation is that these compounds are formed when tetraketide accumulates due to inadequate PKC cyclization activity (
It is possible that PKS (and PKC) may require similar proteins for optimal activity and elimination of byproducts. We therefore identified and cloned putative CHIL proteins, that will be tested with our PKSs for improved activity and specificity.
Expression of HCSs improved the OA and DVA titers in cells grown in the presence of hexanoic and butyric acid respectively.
Expression of HCS improved OA/CBGA when hexanoic was made in vivo from sugar feed.
The presence of OA titers were also improved when Hexanoyl-CoA was produced through modification of fatty acid biosynthesis/oxidation.
The presence of HCS improved the CBGA & CBGVA titers in cells grown in the presence of hexanoic and butyric acids.
Plasmids pCL-SE-0539, and -0558 to -0561 were linearized with AsiSI and transformed into SB-0491 as described in the experimental section. Multiple clones per transformation were pre-cultured for 24 h in YNBD containing 0.5% casamino acids and 100 mM MES pH 6.5. 2 μl the preculture was used to inoculate 500 μl of the same medium described above that was supplemented with either 5 mM hexanoic acid or 5 mM butyric acid. After 24 h, the cultures were supplemented with additional glucose (2%) and hexanoic acid (10 mM) or butyric acid (10 mM). After another 24 hours, the cultures were quenched and evaluated for divarinic acid (DVA) and CBGVA production. The results are shown in the tables below:
SB-0491 contains prenyl-transferase activity that enable it to convert divarinic and olivetolic acid to CBGVA and CBGA, respectively. A detailed description of the prenyl transferase that is present in this strain (a fusion of membrane prenyl transferase MPT4 and a mutant geranyl phosphate synthase GPS) is described in a separate filing (Attorney Docket No: CELB-003-WO1, filed on the same day as the present application). However, SB-0491 does not contain polyketide synthase (PKS) and cyclase (PKC) activities that are necessary to produce divarinic and olivetolic acid from butryl-CoA and hexanoyl-CoA, respectively. The PKS and PKC activities are present and identical on the plasmids (pCL-SE-0539, and -0558 to -0561) used in this example. However, the plasmids used in this example contain different HCS sequences. Clearly the amount of final cannabinoid products was related to the acyl-CoA synthetase, with HCS2 having the biggest effect.
The results show that all the HCSs tested enable cells to produce divarinic acid from butyric acid. In addition, strains with HCS2, HCS6, and HCS7 produce enough divarinic acid that some is converted to CBGVA by the prenlytransferase activity in SB-0491. The results also show that all the HCSs tested enable the cells to produce olivetolic acid at sufficient levels that some is converted to CBGA by the prenyl-transferase activity in SB-0491. Another important result to highlight from this example is that the engineered cells described produce CBGA and CBGVA when grown in a medium that is supplemented with hexanoic or butyric acid, respectively, which is a commercially viable approach to producing cannabinoids at scale.
The flux towards OA, DVA and their analogs can be further increased by fusing the HCS and PKS. There are multiple examples of protein fusions that increase the flux and titers of a product compared to expressing the same proteins separately. Similar to the work described herein, the effect of linker sequences between an acyl-CoA synthetase (coumaroyl-CoA ligase) fused to a stilbene synthase in the final product titer was evaluated (Guo, H et al Mol. Biosyst. 2017, 13, 598-606). The authors showed that linker length and consistency was important and certain linkers improved substantially the kinetic properties of each enzyme in the fusion proteins. Similarly, acyl-CoA synthetases (e.g. the enzymes described herein) will be fused with the PKSs described herein (and their improved mutants that will be identified). Some examples of linker sequences that will be tested are disclosed (SEQ ID NOs 60-62) and some of the fusions that will be tested are also described (SEQ ID NOs 63-65). The fusion proteins will be evaluated for OA (or other cannabinoid formation) when expressed in a cell as described in earlier Examples.
The approach taken to identify new enzymes for each step relied on three general methods. The first involved identifying sequence homologs to known enzymes with the desired activity. The second method relied on literature searches for enzymes that perform similar reactions using the same substrates or enzymes that perform the same reaction with similar substrates. The third method utilized artificial intelligence algorithms to identify potential enzymes based on predicted activities. These methods identified many candidate sequences that were then manually curated and the selected sequences were cloned and characterized.
E. coli Expression Plasmids
Genes for each enzyme were optimized for expression in E. coli, synthesized (Codex DNA), and cloned into the pM264-c vector (ATUM). Genes were sequenced verified and then subcloned into the pD441-NHT expression vector (ATUM) with an N-terminal His tag and TEV protease cleavage site under control of the T5 promoter. Plasmids were transformed into chemically competent E. coli BL21(DE3) cells (NEB), plated on LB agar plates with 50 μg/mL kanamycin, and grown overnight at 37° C. Colony PCR was used to verify gene fragment insertion and positive colonies were inoculated into liquid LB media with 50 μg/mL kanamycin and grown overnight at 37° C. and then diluted with glycerol to create stocks containing 25% glycerol which were stored at −80° C.
Genes for each enzyme were optimized for expression in Yarrowia, synthesized (Codex DNA), and cloned into the pM264-c vector (ATUM). Genes were sequenced verified and then subcloned into the SapI sites of pCL-SE-0331, pCL-SE-0332, or pCL-SE-0337. Plasmids were transformed into chemically competent E. coli NEB 10-beta cells (NEB), plated on LB agar plates with 50 μg/mL kanamycin or 100 μg/ml carbenicillin, and grown overnight at 37° C. Colony PCR was used to verify gene fragment insertion and positive colonies were inoculated into liquid LB media with the appropriate antibiotic. Cultures were grown overnight at 33° C. and then used for isolating plasmid DNA (Qiagen).
E. coli
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
Yarrowia
The description of strain SB-0491 is provided in a patent application filed concurrently by applicants (Attorney Docket CELB-003-001)
E. coli Expression and Purification
To compare the enzyme's activities more accurately, larger cultures of the best hits and controls from the first screen were grown and the enzymes were purified according to the following protocol(s):
Glycerol stocks of each recombinant strain were used to inoculate 2 mL of LB with 50 μg/mL kanamycin. After overnight growth at 37° C., 0.1-0.5 mL were used to inoculate 100-500 mL of TB media (supplemented with [50 μg/mL kanamycin final concentration in the culture]). The cultures were then grown at 37° C. with 250 rpm shaking until an OD600 of approximately 0.8-1.2. At this point, the cultures were transferred to a shaker at room temperature for 30 min (RPM=100-125), after which they were induced with 0.25 mM IPTG. After 16 h at 150 rpm shaking, the cells were pelleted by centrifugation. Cell pellets were frozen at −80 C or immediately taken into the purification procedure(s) described below.
On the date of purification, the cell pellets were thawed and/or immediately resuspended in 10-20 mL of lysis buffer [B-PER© supplemented with 5 mM MgCl2, 100 μg/mL lysozyme, and 2 μL/mL DNaseI (TURBO DNase ThermoFisher)]. After incubation at room temperature or on ice (enzyme contingent) for ˜10 min shaking at low speed (100 rpm), the cell debris were removed by centrifugation at 4750 rpm for 20 min in a pre-chilled rotor at 4° C. Lysates were loaded in pre-equilibrated cobalt spin columns (ThermoFisher, TALON HisPur Cobalt spin column, with 1 or 5 mL resin depending on the initial culture size [100 mL vs 500 mL, respectively]) and 6×His-tagged proteins that were recombinantly overexpressed in E. coli were purified according to manufacturer's protocol, with only minor adjustments that were contingent on the enzyme(s) of interest being purified. The protein(s) of interest were eluted in 50-100 mM HEPES/TRIS pH=7.8/8.0 with 5 mM MgCl2, 50-150 mM NaCl, and 300 mM imidazole. The elution fractions were pooled, and the buffer was exchanged during the concentration steps. The final storage buffer used was, in some cases, enzyme contingent and adjusted accordingly. In most cases, the final buffer the purified enzymes were exchanged into was 50 mM HEPES/TRIS pH=7.5-8.0 with 25-100 mM NaCl/KCl, 5 mM MgCl2, and 10% v/v glycerol. An appropriately sized AMICON Ultra-15 centrifugal unit (new) was used for each enzyme and each enzyme prep. The size for the filter used for this buffer exchange & concentration process always had a MWCO that was three times smaller than the protein of interest (e.g., so the protein would concentrate and not pass through the filter). The centrifugal units were spun at 3500 rpm for 20 mins at 4° C. (3-4 times, refilling with the final dialysis buffer for a complete exchange of buffers [dialysis=final storage buffer). The final spin resulted in anywhere from 0.5-1 mL of dialyzed, purified protein that was carefully titrated out and immediately quantified while on kept on ice. Proteins were quantified using a SpectraMax M2E using the Abs@280 nm (buffer background subtraction & replicate measurements to account for any error via dilution or pipette; N>=6 in almost all cases where total yield was not limiting). The Abs@280 nm was used in combination with the theoretical extinction coefficient of the protein of interest @ 280 nm (https://web.expasy.org/protparam/protparam-doc.html), which has ˜3% error compared to other methods of protein quantification and has fewer pitfalls with regards to interference from any buffer components (e.g., Bradford assay). Proteins were then aliquoted into several 1.5 mL microcentrifuge tubes 50-100 μL in each and then immediately flash frozen in liquid nitrogen to avoid denaturation upon thawing for later use (i.e., slow freezing proteins results in a salt concentration gradient build up, which can be problematic for enzymes prone to aggregation during the thawing process when the enzyme is to be assayed). Enzymes were stored at −80° C. immediately after they were frozen in the labeled microcentrifuge tubes at the volumes. Yields for the enzyme(s) purified in this way sometimes varied; however, in most cases the total yields of purified protein were quite often more than sufficient for the number of assays that they were utilized in (e.g., 500 mL cultures with recombinant expression in E. coli yielded >=5-10 mg of total purified protein, which is approximately 2-5% of the total protein in the cells given a density of 0.8-1.2 (OD600) at the time of induction). A subset of purified enzyme, cell lysate, and the cell pellet from the above steps were always saved and run on an SDS-PAGE gel to assess purity (normalizing load concentrations beforehand). Thus, purity was always ensured (>=˜98%) before assaying any given enzyme; thereby, always obeying Efraim Racker's tenet of protein biochemistry, “Don't waste clean thinking on dirty enzymes.”.
In many cases the AKTA PURE 150 FPLC equipped with a fraction collector & a HisTalon Crude Cobalt Column (5 mL) was utilized to purify proteins. The protocol followed was nearly identical to the above, with only one exception regarding the elution volumes (1 mL into a 96 well plate during elution), which were pooled after the fact by demarcated fractions on the instrument software—that indicated which wells contained the protein of interest based on those wells where protein was titrated by the instrument during elution (e.g., rapid spike in the Abs@280 nm). Proteins were dialyzed & concentrated and then quantified and flash frozen in liquid nitrogen as described above. All enzyme aliquots were stored at −80 C after flash freezing in liquid nitrogen. Purity was ensured prior to any activity assays with the purified protein as described above (e.g., via SDS-PAGE). Noteworthy: All solutions were kept cold on ice or at 4° C. during the purification procedures that were described above.
Overnight YPD (10 g/L yeast extract, 20 g/L peptone, 2% dextrose) cultures were inoculated from glycerol stocks of the appropriate strain and grown at 30° C. with 250 rpm shaking. Once cultures had reached an OD600 of 4-6, cultures were centrifuged at 500×g for 5 min, supernatants were discarded, and cell pellets were resuspended in equal volume of water. Resuspended cells were centrifuged at 500×g for 5 min, supernatants were discarded, and cells were resuspended in a volume (75 μL×OD×Vculture) of transformation cocktail (45% PEG-400, 0.1 M LiAc, 0.1 M DTT, and 25 ug/100 μL SS Salmon Sperm DNA). For each transformation, >1 μg of plasmid DNA was added to 55 μL cells/transformation cocktail and vortexed for 2 s. Transformations were incubated at 39° C. for 1 h with 250 rpm shaking. Transformations were resuspended in 750 μL YPD with 1 M sorbitol and recovered at 30° C. overnight with 250 rpm shaking. The next day, transformations were centrifuged at 500×g for 5 min, supernatants were discarded, and cell pellets were resuspended in 750 YPD. Resuspended transformations were plated on YPD with appropriate selection or YNBD (6.71 g/L yeast nitrogen base+nitrogen, 0.5% casamino acids, 2% dextrose) agar plates and grown at 30° C. for 2 days. Individual colonies were patched onto YPD plates with appropriate selection or YNBD plates and grown at 30° C. overnight. Patches were used to inoculate 0.5 mL YPD with appropriate selection or YNBD precultures in 96w blocks and grown at 30° C. for 24-48 h with 1000 rpm shaking. For assays, 0.5 mL YPD with appropriate selection or YNBD cultures containing substrate were inoculated with 2 μL from precultures and grown at 30° C. for 2-4 days with 1000 rpm shaking with 2% glucose added every 24 h. Assay cultures were quenched by addition of 0.5 mL ethanol with 0.2% formic acid and 0.5 mg/mL pentyl-benzoic acid. Precipitates were pelleted by centrifuging at 4600×g for 10 min and then 200 μL was transferred to fresh plates, sealed, and analyzed via HPLC.
All samples were quenched with equal volume of EtOH containing 0.2 mg/mL internal standard (3,5-Diisopropyl-2-hydroxybenzoic acid CAS #2215-21-6) centrifuged, and clarified solutions were analyzed by HPLC-MS
Column: 2.1×50 mm COSMOCORE PBr (Nacalai USA, Inc.)
Mobile Phase: A; 0.1% formic acid in water, B; 0.1% formic acid in acetonitrile
Flow: 0.45 mL/min
Temp: %50 Celsius
Gradient: 20% B at 0 min, 70% B at 2.3 min, 89% B at 4.2 min, 20% B at 4.3 min, 20% B at 6 min
Detection: UV DAD and QToF MS
All compounds except PDAL-C6, were confirmed and quantified based on authentic standards. For PDAL-C6 and PDAL-C4 (if present) the quantification was made from authentic PDAL-C2 (4-hydroxy-6-methyl-2-pyrone, CAS #675-10-5)
Column: 2.1×50 mm COSMOCORE PBr (Nacalai USA, Inc.)
Mobile Phase: A; 0.1% formic acid in water, B; 0.1% formic acid in acetonitrile
Flow: 0.45 mL/min
Temp: %50 Celsius
Gradient: 20% B at 0 min, 35% B at 1 min, 40% B at 2.5 min, 70% B at 3 min, 90% B at 5 min, 20% B at 5.5 min, 20% B at 8 min
Detection: UV DAD and QToF MS
All olivetol derivatives were identified a) from their UV spectra that was identical to olivetol and divarinol and b) from their MS analysis confirming the molecular mass. The quantification was based on UV at either 210 nm or 275 nm using OL absorbance coefficient for these wavelengths.
As described in all engineering projects in this work, prior to any mutagenesis approach structural models of the proteins were created. For this, a variety of commercial and free software packages are available that were used to make structure models using crystal structures of homologous proteins as templates. The selection of the template structures used in the homology modelling process considered three important factors: i) sequence identity between the template enzyme(s) and the target enzyme(s) [only those with >30% sequence identity were used]; ii) the atomic resolution at which the template enzyme(s) were solved; and iii) The percent of sequence coverage between the target enzyme and the template enzyme(s) (i.e., differences in the length of the enzymes). Using this approach 8 to 10 templates were used to generate the homology models. The homology models were evaluated for accuracy using specific software (MolProbity) and if necessary, further refinement and correction of the structure models was achieved using secondary software. Refinement of models entailed rotamer optimization and then the use of GROMACS and energy minimization. Specifically, the top model from multi-template-based modelling was placed in a cubic box with edges 2 nm from any part of the protein being modelled. Periodic boundary conditions were defined, the system was solvated (TIP5P water model; current updated version; gold standard for MD), and the charge of the system neutralized with Na2+ or Cl2− contingent on the protein and overall charge of the system. Models were then refined using the amber99sb-ildn force-field (widely used force-field for MD), and the simulation was conducted until the potential energy of the entire system converged. The energy minimized PDB was extracted without the neutralizing ions and explicit water molecules, and then subjected to quality improvement using MolProbity. In all cases, refinement improved the overall quality of the initial model significantly.
Finally, the appropriate substrates were docked in the active site using a AutoDock Vina software package and iterative changes to the grid search size. The top two (of a number of possible orientations) docking poses for substrates were selected based on calculated binding energy and the orientation in the active site that brings substrates at the right position for reaction. After this modeling exercise was completed, amino acids in the active site that are 5 Å from each substrate were identified and were selected for mutagenesis.
This application is a U.S. national stage filing under 35 U.S.C. 371 of International Application No. PCT/US2022/029327, filed May 13, 2022, which claims the benefit of U.S. Provisional Application No. 63/188,645, filed May 14, 2021. The entire teachings of the above applications are hereby incorporated by reference in their entirety. International Application No. PCT/US2022/029327 was published under PCT Article 21(2) in English.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/029327 | 5/13/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63188645 | May 2021 | US |