The Sequence Listing, which is a part of the present disclosure, includes a computer readable form and a written sequence listing comprising nucleotide and/or amino acid sequences of the present invention. The sequence listing information recorded in computer readable form is identical to the written sequence listing. The subject matter of the Sequence Listing is incorporated herein by reference in its entirety.
The present application generally relates to recombinant enzymes and genes encoding those enzymes. More specifically, the application provides recombinant prenyltransferase genes and enzymes that function in microorganisms and plants.
Cannabinoids are a class of organic small molecules of meroterpenoid structures found in the plant genus Cannabis. The small molecules are currently under investigation as therapeutic agents for a wide variety of health issues, including epilepsy, pain, and other neurological problems, and mental health conditions such as depression, PTSD, opioid addiction, and alcoholism (Committee on the Health Effects of Marijuana, 2017).
While it is known that cannabinoids may be obtained via biosynthesis in plant species, there are many problems associated with the synthesis of such molecules which need to be overcome, including problems with large-scale manufacturing, purification, and heterologous expression for biosynthesis.
Producing cannabinoids, in recombinant microorganisms such as yeast is a promising solution to the above problems. See, e.g., U.S. Pat. Applications 16/553103, 16/553120, 17/068636 and 63/053539; U.S. Pats. 10,435,727 and 11,041,002; and U.S. Pat. Publications 2020/0063170 and 2020/0063171, all incorporated by reference.
One way to improve biosynthetic cannabinoid production in microorganisms is by the discovery and use of new enzymes that catalyze the same reactions as plant derived enzymes but with improved parameters. The present invention provides such enzymes with prenyltransferase activity.
Provided is a nucleic acid comprising a sequence encoding a prenyltransferase (PT) gene or its complement, codon optimized for production in a microorganism or a plant.
Also provided is a yeast expression cassette comprising the above nucleic acid.
Additionally provided is a non-naturally occurring prenyltransferase (PT) comprising an amino acid sequence having at least 90% amino acid sequence identity or conservative amino acid substitutions to the amino acid sequences encoded by the above nucleic acid.
Further provided is a recombinant microorganism or plant expressing a PT encoded by the above nucleic acid.
The present invention is also directed to a method of catalyzing the condensation of a polyprenol diphosphate and an alkylresorcinol or alkylresorcyclic acid to yield a cannabinoid. The method comprises combining the polyprenol diphosphate and the alkylresorcinol or alkylresorcyclic acid with the above PT.
Those of skill in the art will understand that the drawings, described below, are for illustrative purposes only. The drawings are not intended to limit the scope of the present teachings in any way.
To facilitate understanding of the invention, a number of terms and abbreviations as used herein are defined below as follows:
Conservative amino acid substitutions: As used herein, when referring to mutations in a protein, “conservative amino acid substitutions” are those in which at least one amino acid of the polypeptide encoded by the nucleic acid sequence is substituted with another amino acid having similar characteristics. Examples of conservative amino acid substitutions are ser for ala, thr, or cys; lys for arg; gln for asn, his, or lys; his for asn; glu for asp or lys; asn for his or gln; asp for glu; pro for gly; leu for ile, phe, met, or val; val for ile or leu; ile for leu, met, or val; arg for lys; met for phe; tyr for phe or trp; thr for ser; trp for tyr; and phe for tyr.
Functional variant: The term “functional variant,” as used herein, refers to a recombinant enzyme such as a fungal prenyltransferase that comprises a nucleotide and/or amino acid sequence that is altered by one or more nucleotides and/or amino acids compared to the nucleotide and/or amino acid sequences of the parent protein and that is still capable of performing an enzymatic function (e.g., synthesis of a cannabinoid) of the parent enzyme. In other words, the modifications in the amino acid and/or nucleotide sequence of the parent enzyme may cause desirable changes in reaction parameters without altering fundamental enzymatic function encoded by the nucleotide sequence or containing the amino acid sequence. The functional variant may have conservative change including nucleotide and amino acid substitutions, additions and deletions. These modifications can be introduced by standard techniques known in the art, such as site-directed mutagenesis and random PCR-mediated mutagenesis, and may comprise natural as well as non-natural nucleotides and amino acids. Also envisioned is the use of amino acid analogs, e.g. amino acids not DNA or RNA encoded in biological systems, and labels such as fluorescent dyes, radioactive elements, electron dense agents, or any other protein modification, now known or later discovered.
Recombinant nucleic acid and recombinant protein: As used herein, a recombinant nucleic acid or protein is a nucleic acid or protein produced by recombinant DNA technology, e.g., as described in Green and Sambrook (2012).
Polypeptide, protein, and peptide: The terms “polypeptide,” “protein,” and “peptide” are used herein interchangeably to refer to amino acid chains in which the amino acid residues are linked by peptide bonds or modified peptide bonds. The amino acid chains can be of any length of greater than two amino acids. Unless otherwise specified, the terms “polypeptide,” “protein,” and “peptide” also encompass various modified forms thereof. Such modified forms may be naturally occurring modified forms or chemically modified forms. Examples of modified forms include, but are not limited to, glycosylated forms, phosphorylated forms, myristoylated forms, palmitoylated forms, ribosylated forms, acetylated forms, and the like. Modifications also include intra-molecular crosslinking and covalent attachment of various moieties such as lipids, flavin, biotin, polyethylene glycol or derivatives thereof, and the like. In addition, modifications may also include protein cyclization, branching of the amino acid chain, and cross-linking of the protein. Further, amino acids other than the conventional twenty amino acids encoded by genes may also be included in a polypeptide.
The term “protein” or “polypeptide” may also encompass a “purified” polypeptide that is substantially separated from other polypeptides in a cell or organism in which the polypeptide naturally occurs (e.g., 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 100% free of contaminants).
Primer, probe and oligonucleotide: The terms “primer,” “probe,” and “oligonucleotide” may be used herein interchangeably to refer to a relatively short nucleic acid fragment or sequence. They can be DNA, RNA, or a hybrid thereof, or chemically modified analogs or derivatives thereof. Typically, they are single-stranded. However, they can also be double-stranded having two complementing strands that can be separated apart by denaturation. In certain aspects, they are of a length of from about 8 nucleotides to about 200 nucleotides. In other aspects, they are from about 12 nucleotides to about 100 nucleotides. In additional aspects, they are about 18 to about 50 nucleotides. They can be labeled with detectable markers or modified in any conventional manners for various molecular biological applications.
Vector: As used herein, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Various vectors are those capable of autonomous replication and/expression of nucleic acids to which they are linked. Vectors capable of directing the expression of genes to which they are operatively linked are referred to herein as “expression vectors.”
Linker: The term “linker” refers to a short amino acid sequence that separates multiple domains of a polypeptide. In some embodiments, the linker prohibits energetically or structurally unfavorable interactions between the discrete domains.
Cannabinoid: As used herein, the term “cannabinoid” refers to a family of structurally related aromatic meroterpenoid molecules. Cannabinoids are generally formed by the enzymatic fusion, by a cannabinoid synthase (having geranylpyrophosphate:olivetolate geranyltransferase activity), of an alkylresorcylic acid
where R1 = CH3, (CH2)2CH3 (divarinolic acid), (CH2)4CH3 (olivetolic acid), or (CH2)6CH3, with a polyprenyl pyrophosphate such as geranyl pyrophosphate, neryl pyrophosphate, geranylgeranyl pyrophosphate, of farnesyl pyrophosphate (
Codon optimized: As used herein, a recombinant gene is “codon optimized” when its nucleotide sequence is modified to accommodate codon bias of the host organism to improve gene expression and increase translational efficiency of the gene.
Expression cassette: As used herein, an “expression cassette” is a nucleic acid that comprises a gene and a regulatory sequence operatively coupled to the gene such that the promoter drives the expression of the gene in a cell. An example is a gene for an enzyme with a promoter functional in yeast, where the promoter is situated such that the promoter drives the expression of the enzyme in a yeast cell.
The present teachings relate to prenyltransferase (PT) enzymes, for example fungal PT enzymes, for biosynthesis of cannabinoids, nucleic acids encoding the PTs, host organisms for generating cannabinoids, and methods of generating cannabinoids in those host organisms.
PTs transfer a prenyl group, a product of the mevalonate pathway, onto a prenyl acceptor, usually a phenolic molecule, in this case, an alkylresorcylic acid, such as olivetolic acid. PTs are found in fungi, bacteria, plants, archaea, and animals and are often but not exclusively involved in secondary metabolism. PTs are also involved in basic cellular metabolism such as the biosynthesis of the cofactor Coenzyme Q.
As depicted in
The PTs and nucleic acids encoding the PTs are useful for producing cannabinoids genetically modified microorganisms or plants, since the PTs catalyze the condensation of olivetol (OL) or olivetolic acid (OA) and its derivatives with geranyl pyrophosphate (GPP) or other related isoprenes to yield cannabigerolic acid (CBGA), cannabigerol (CBG), cannabigerovarinic acid (CBGVA), or other cannabinoid derivatives. This engineered prenyltransferase pathway can be used in conjunction with other recombinant enzymes to make a variety of cannabinoids such as cannabidiol (CBD), tetrahydrocannabivarin (THCV), cannabichromene (CBC), and other common or novel cannabinoids.
Referring to
A specific PT is the CBGA synthase/geranyl-olivetol transferase (GOT) enzyme.
Non-Cannabis derived PTs can: (i) catalyze reverse prenylations, or (ii) have O-specificity or N-specificity. Some PTs are specific for prenylating proteins, attaching the prenyl group to a cysteine residue in the C-terminus of the target protein. O-specificity is where the prenyl group is attached to an oxygen in the prenyl acceptor instead of a carbon. N-specificity is when the prenyl group is attached to a nitrogen in the prenyl acceptor instead of a carbon. The nitrogen of the indole ring on a tryptophan residue is another possible target for protein prenylation. Some PTs are specific in their requirements for the prenyl donor.
Cannabis-derived PTs may use: (i) GPP (10 carbon prenyl donor); (ii) DMAPP (5 carbon prenyl donor); (iii) FPP (15 carbon prenyl donor); or (iv) GGPP (20 carbon prenyl donor). Only a limited subset of Cannabis-derived PTs are able to produce cannabinoids.
It should be noted that not all PTs can synthesize cannabinoids. Many PTs have loose substrate specificity and can accept a wide variety of substrates, but cannabinoid biosynthesis activity is rare in nature. Cannabis spp. are the primary organisms known to produce cannabinoids in nature.
Precursors of cannabinoids include alkylrescorcinol and alkylrescorcyclic acid compounds such as olivetol (OL) and olivetolic acid (OA) that are produced from common cellular metabolites hexanoyl-CoA and malonyl CoA.
In plants, olivetol is produced from hexanoyl-CoA and malonyl-CoA by the action of the polyketide synthase olivetol synthase; and olivetolic acid is produced from olivetol by the action of the enzyme olivetolic acid cyclase. See, e.g., US Provisional Patent Application 63/194,121.
Another precursor compound is a polyprenol compound, geranyl pyrophosphate (GPP) (also referred to as geranyl diphosphate), which is a product of the mevalonate pathway (see
In plants, cannabigerolic acid (CBGA) is produced from OA and GPP by the action of the enzyme cannabigerolic acid synthase (CBGAS), also called geranyl-olivetol transferase (GOT).
Downstream cannabinoids are produced by the action of downstream cannabinoid synthases on cannabigerolic acid. For example, THCA synthase converts CBGA to THCA. Cannabidiolic acid synthase converts CBGA to CBDA. Cannabichromenic acid synthase converts CBGA to CBCA (see
Referring to
The activity of the putative PTs can be determined by any method known in the art. In some embodiments, each candidate polypeptide is introduced into a host cell genetically modified to contain the components for CBGA biosynthesis using standard yeast cell transformation techniques (see, e.g., Green and Sambrook, 2012). Cells are subjected to fermentation, under conditions that activate the promoter controlling the candidate polypeptide (see Example 2). The fermentation broth is then subjected to HPLC analysis (see Example 5). As an example of GOT function determination of the resulting polypeptide, if CBGA is detected by HPLC analysis relative to an analytical standard, then the candidate polypeptide is determined to be a functional CBGA synthase in this context.
In some embodiments, a nucleic acid comprising a sequence encoding a prenyltransferase (PT) gene or its complement is provided. In these embodiments, the PT gene is codon optimized for production in a microorganism or a plant.
As provided in US Pat. No. 10,435,727, nucleotide sequences are codon optimized for expression of heterologous polypeptides in a host by utilizing a number of considerations:
Enzyme variations may also result from mutations introduced into the DNA and amino acid sequences to prevent or promote post translational modifications of the protein. Nonlimiting examples of post translational modifications include phosphorylation, acetylation, methylation, SUMOylation, ubiquitination, proteolytic cleavage, lipidation, including prenylation such as farnesylation or myristoylation, glycosylation, nitrosylation and biotinylation.
In various embodiments, the gene for the PT enzyme is derived from a fungus. It is envisioned that a PT from any fungus now known or later discovered can be utilized in the present invention. This includes but is not limited to the phyla Chytridiomycota, Basidiomycota, Ascomycota, Blastocladiomycota, Ascomycota, Microsporidia, Basidiomycota, Glomeromycota, Symbiomycota, and Neocallimastigomycota. For example, the fungus can be from the phylum Ascomycota, including classes, orders, families and genera Pezizomycotina, Arthoniomycetes, Colletotrichum, Coniocybomycetes, Clavicipitaceae, Dothideomycetes, Eurotiomycetes, Geoglossomycetes, Glomerellales, Hypocreales Laboulbeniomycetes, Lecanoromycetes, Leotiomycetes, Lichinomycetes, Orbiliomycetes, Pezizomycetes, Sordariomycetes, Xylonomycetes, Lahmiales, Itchiclahmadion, Triblidiales, Saccharomycotina, Saccharomycetes, Stachybotryaceae, Taphrinomycotina, Archaeorhizomyces, Neolectomycetes, Pneumocystidomycetes, Schizosaccharomycetes, and Taphrinomycetes; phylum Basidiomycota including subphyla or classes Pucciniomycotina, Ustilaginomycotina, Wallemiomycetes, and Entorrhizomycetes; subphylum Agaricomycotina including classes Tremellomycetes, Dacrymycetes, and Agaricomycetes; phylum Symbiomycota, including class Entorrhizomycota; subphylum Ustilaginomycotina including classes Ustilaginomycetes and Exobasidiomycetes; phylum Glomeromycota including classes Archaeosporomycetes, Glomeromycetes, and Paraglomeromycetes; subphylum Pucciniomycotina including orders and classes: Pucciniomycotina, Cystobasidiomycetes, Agaricostilbomycetes, Microbotryomycetes, Atractiellomycetes, Classiculomycetes, Mixiomycetes, and Cryptomycocolacomycetes; subphylum incertaesedis Mucoromyceta including orders Calcarisporiellomycota and Mucoromycota; phylum Mortierellomyceta including class Mortierellomycota; subphylum incertaesedis Entomophthoromycotina including order Entomophthorales; phylum Zoopagomyceta including classes Basidiobolomycota, Entomophthoromycota, Kickxellomycota, and Zoopagomycotina; subphylum incertaesedis Mucoromycotina including orders Mucorales, Endogonales, and Mortierellales; phylum Neocallimastigomycota including class Neocallimastigomycetes; phylum Blastocladiomycota including classes Physodermatomycetes and Blastocladiomycetes; phylum Rozellomyceta including classes Rozellomycota and Microsporidia; phylum Aphelidiomyceta including class Aphelidiomycota; Chytridiomyceta including classes Chytridiomycetes and Monoblepharidomycetes; and phylum Oomycota including classes or orders Leptomitales, Myzocytiopsidales, Olpidiopsidales, Peronosporales, Pythiales, Rhipidiales, Salilagenidiales, Saprolegniales, Sclerosporales, Anisolpidiales, Lagenismatales, Rozellopsidales, and Haptoglossales.
Specific modified fungal PT enzymes presented herein have the capacity to synthesize cannabinoids such as CBGA in a recombinant host. SEQ ID NOs: 1-135 are codon optimized nucleic acid sequences encoding modified fungal PTs having amino acid sequences SEQ ID NOs: 136-270 (Table 1).
Homologous polynucleotide sequences, or complements thereof, having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identity to the polynucleotide sequences listed sequences SEQ. ID NOs— 1-135 are also provided, including those that encode corresponding amino acid sequences SEQ ID NOs: 136-270, respectively, or having conservative amino acid substitutions. In some embodiments, the encoded PT comprises one or more conservative amino acid substitution that prevents or promotes post-translational modification of the PT. Examples of such post-translational modification is phosphorylation, acetylation, methylation, SUMOylation, ubiquitination, proteolytic cleavage, lipidation, prenylation such as farnesylation or myristoylation, glycosylation, nitrosylation, biotinylation, or any combination thereof.
In various embodiments, the nucleic acids further comprise additional nucleic acids encoding amino acids that are not part of the PT enzyme. In some of these embodiments, the additional sequences encode additional amino acids present when the nucleic acid is translated, encoding, for example, an additional protein domain, with or without a linker sequence, creating a fusion protein. Other examples are localization sequences, i.e., signals directing the localization of the folded protein to a specific subcellular compartment or membrane.
In some embodiments, the nucleic acids have, at the 5′ end, a nucleic acid encoding codon optimized cofolding peptides to create a fusion protein, e.g., having SEQ ID NOs:271-275 (Table 2), joining the sequences together to form a fusion polypeptide, e.g., having the amino acid sequence of SEQ ID NO:244-248 fused at the N terminus of the enzyme polypeptide, generating recombinant fusion polypeptides.
In some embodiments, the nucleic acid comprises additional nucleotide sequences that are not translated. Nonlimiting examples include promoters, terminators, barcodes, Kozak sequences, targeting sequences, and enhancer elements.
Expression of a gene encoding an enzyme is determined by the promoter controlling the gene. In order for a gene to be expressed, a promoter must be present within 1,000 nucleotides upstream of the gene. A gene is generally cloned under the control of a desired promoter. The promoter regulates the amount of enzyme expressed in the cell and also the timing of expression, or expression in response to external factors such as sugar source. Any promoter now known or later discovered can be utilized to drive the expression of the PT described herein.
In various embodiments, the nucleic acid encoding the PT is codon optimized for expression in yeast, for example a species of Saccharomyces, Candida, Pichia, Schizosaccharomyces, Scheffersomyces, Blakeslea, Rhodotorula, or Yarrowia. Such nucleic acids can usefully comprise a promoter functional in yeast. See e.g. http://parts.igem.org/Yeast for a listing of various yeast promoters. Exemplary yeast promoters known in the art are listed in Table 3 below drive strong expression, constant gene expression, medium or weak gene expression, or inducible gene expression. Inducible or repressible gene expression is dependent on the presence or absence of a certain molecule. For example, the GAL1, GAL7, and GAL10 promoters are activated by the presence of the sugar galactose and repressed by the presence of the sugar glucose. The HO promoter is active and drives gene expression only in the presence of the alpha factor peptide. The HXT1 promoter is activated by the presence of glucose while the ADH2 promoter is repressed by the presence of glucose.
In various embodiments, the nucleic acid is in an expression cassette, e.g., a yeast expression cassette. Any yeast expression cassette capable of expressing the enzyme in a yeast cell can be utilized. In some embodiments, the expression cassette consists of a nucleic acid encoding a PT with a promoter.
Additional regulatory elements can also be present in the expression cassette, including restriction enzyme cleavage sites, antibiotic resistance genes, integration sites, auxotrophic selection markers, origins of replication, and degrons.
The expression cassette can be present in a vector that, when transformed into a host cell, either integrates into chromosomal DNA or remains episomal in the host cell. Such vectors are well-known in the art. See e.g. http://parts.igem.org/Yeast for a listing of various yeast vectors.
A nonlimiting example of a yeast vector is a yeast episomal plasmid (YEp) that contains the pBluescript II SK(+) phagemid backbone, an auxotrophic selectable marker, yeast and bacterial origins of replication and multiple cloning sites enabling gene cloning under a suitable promoter. Other exemplary vectors include pRS series plasmids.
Additionally provided is a non-naturally occurring prenyltransferase (PT) having GOT PT activity. In various embodiments, the GOT PT comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity or conservative amino acid substitutions to SEQ ID NOs: 136-270.
In some embodiments, the GOT PT is provided in vitro. In other embodiments, the GOT PT is expressed in a recombinant host cell, e.g., a microorganism or a plant such as tobacco or Arabidopsis thaliana. Nonlimiting examples of host cell microorganisms include any species of bacteria, including but not limited to Escherichia, Corynebacterium, Caulobacter, Pseudomonas, Streptomyces, Bacillus or Lactobacillus; any species of filamentous fungus, including but not limited to any species of Aspergillus, and a yeast, e.g., any species of Saccharomyces, Candida, Pichia, Schizosaccharomyces, Scheffersomyces, Blakeslea, Rhodotorula or Yarrowia
Further provided is a recombinant host cell of a microorganism or plant expressing a GOT PT, e.g., encoded by any of the above-described nucleic acids.
Referring to
Examples of cannabinoids that may be produced in these host cells include cannabigerol (CBG), cannabigerolic acid (CBGA), cannabidiolic acid (CBDA), cannabichromene (CBC), cannabidivarin (CBCV), cannabichromenic acid (CBCA), cannabichromevarinic acid (CBCVA), cannabinol (CBN), 11-hydroxycannabinol (11-OH-CBN), cannabinerolic acid (CBNA), cannabivarin (CBV), 11-hydroxycannabivarin (11-OH-CBV), cannabinerovarinic acid (CBNVA), cannabigerophorolic acid (CBGPA), cannabigerovarinic acid (CBGVA), cannabigerogerovarinic acid (CBGGVA), tetrahydrocannabinol (THC), tetrahydrocannabinolic acid (THCA), 11-hydroxy tetrahydrocannabinol (11-OH-THC), tetrahydrocannabivarin (THCV), 11-hydroxy tetrahydrocannabivarin (11-OH-THCV), tetrahydrocannabivarin acid (THCVA), cannabinerovarinic acid (CBNVA), sesquicannabigerol (CBF), cannabigerogerol (CBGG), sesquicannabigerolic acid (CBFA), cannabigerogerolic acid (CBGGA), sesquicannabidiolic acid (CBDFA), sesquitetrahydrocannabinolic acid (THCFA), sesquicannabigerovarinic acid (CBFVA), sesquicannabichromenic acid (CBCFA), sesquicannabigerophorolic acid (CBFPA), cannabidiol (CBD), cannabidiolic acid (CBDA), cannabidivarinic acid (CBDVA), cannabidivarin (CBDV) or any combination thereof.
Cannabinoids can be produced in a heterologous cell along the pathway depicted in
Referring to
Thus, the host cell for producing cannabinoids can comprise one or more PT genes; a prenyl group acceptor; a prenyl group donor; and a reaction pathway for converting the prenyl group donor and the prenyl group acceptor to a first cannabinoid catalyzed by PT enzymes encoded by the PT genes.
The one or more PT genes may be inserted into and thereby integrated into the host cell and one or more PT genes are expressed. In some embodiments, the PT enzymes are at least 80% homologous to isolated amino acid sequences selected from SEQ. ID NOs: 136-270 which may be encoded by the PT genes having SEQ ID NOs:1-135, respectively.
In some embodiments, genetic alteration is achieved by: gene expression controlled by inducible promoter systems; natural or induced mutagenesis, recombination, and/or shuffling of genes, pathways, and whole cells performed sequentially or in cycles; overexpression and/or deletion of single or multiple genes; knockout mutations/alterations which reduce or eliminate parasitic pathways that reduce precursor concentration; and integrating vectors and PCR fragments into the host genome.
In other embodiments, the recombinant host cells of the recombinant organism are engineered to produce all precursor molecules necessary for the biosynthesis of cannabinoids, including but not limited to OA, OL and GPP, hexanoic acid, hexanoyl-CoA, malonic acid and malonyl-CoA, e.g., as disclosed in U.S. Pat. No. 10,435,727.
As discussed above, in some embodiments, the recombinant host cell is a yeast cell, e.g., a species of Saccharomyces, Candida, Pichia, Schizosaccharomyces, Scheffersomyces, Blakeslea, Rhodotorula, or Yarrowia.
Using the systems and methods herein, the genes which can be expressed to encode for a corresponding enzyme or other type of proteins include but are not limited to the fungal prenyltransferases (Fungal PT). For example, the Fungal PT gene is expressed, or overexpressed, to encode for the Fungal PT enzyme. Gene sequences can be determined using the techniques disclosed in U.S. Pat. 10,671,632.
In various embodiments, samples from fermentations of recombinant hosts expressing the cannabinoid pathway with the GOT PTs described above are: (i) prepared and extracted using a combination of fermentation, dissolution, and purification steps; and (ii) analyzed by HPLC for the presence of directing molecules, precursor molecules, intermediate molecules, and target molecules such as CBGA, CBGVA, THCA, and THCVA.
In specific embodiments, (a) the host cell produces a meroterpenoid; (b) a second reaction pathway converts the first cannabinoid to a second cannabinoid, as catalyzed by downstream enzymes, which in some embodiments are cannabinoid synthases; (c) the second cannabinoid is THCA, THCVA, CBDA, CBDVA, CBCA, CBCVA, CBGVA or CBGOA; (d) the prenyl donor is polyprenol diphosphate; (e) the prenyl acceptor is a 1,3-diphenol or diphenolic acid; (f) two or more fungal GOT PTs, or a mixture of fungal PTs and plant PTs are present in the host cell; enzymes which catalyze synthesis of the first cannabinoid.
The present invention is also directed to methods of catalyzing the condensation of a polyprenol diphosphate and an alkylresorcinol or alkylresorcyclic acid to yield a cannabinoid. The methods comprises combining the polyprenol diphosphate and the alkylresorcinol or alkylresorcyclic acid with any of the GOT PTs described above.
In some of these embodiments, the polyprenol diphosphate is geranyl pyrophosphate (GPP) or farnesyl pyrophosphate (FPP) or geranylgeranyl pyrophosphate (GGPP). In other embodiments, the alkylresorcinol or alkylresorcyclic acid is olivetol (OL) or olivetolic acid (OA), divarinolic acid, divarinol, orcinol, orsellinic acid, sphaerolic acid, sphaerophorol. In additional embodiments, the cannabinoid is cannabigerolic acid (CBGA), cannabigerol (CBG), or cannabigerovarinic acid (CBGVA).
The catalysis of these methods may be performed in vitro or in a recombinant microorganism or plant expressing the PT, as described above. In some of these embodiments, the microorganism is a yeast cell, e.g., a species of Saccharomyces, Candida, Pichia, Schizosaccharomyces, Scheffersomyces, Blakeslea, Rhodotorula, or Yarrowia.
In additional embodiments, the recombinant microorganism or plant also expresses a second PT, e.g., a fungal PT and a plant PT, as discussed above.
Also as discussed above, in some embodiments the recombinant microorganism or plant also expresses at least one other recombinant enzyme in a cannabinoid biosynthetic pathway. Nonlimiting examples of the at least one other recombinant enzyme is an olivetolic acid cyclase (OAC), a polyketide synthase, an olivetol synthase, a geranyl pyrophosphate synthase (GPPS), a malonate transporter, an aromatase, a dehydrogenase, an oxidase, a desaturase, a decarboxylase, a cannabinoid synthase including a THCA, a CBCA or a CBDA synthase, a cytochrome P450 (CYP-450), a cytochrome P450 reductase (CPR), or any combination thereof.
Preferred embodiments are described in the following examples. Other embodiments within the scope of the claims herein will be apparent to one skilled in the art from consideration of the specification or practice of the invention as disclosed herein. It is intended that the specification, together with the examples, be considered exemplary only, with the scope and spirit of the invention being indicated by the claims, which follow the examples.
Construction of Saccharomycescerevisiae strains expressing a PT with GOT activity derived from a fungal PT for meroterpenoid compound production, such as CBGA, is carried out by cloning the fungal PT into vectors with the proper regulatory elements for gene expression (e.g. promoter, terminator). As an alternative to expression from an episomal plasmid, the PT gene is inserted into the recombinant host genome. Integration is achieved by a single cross-over insertion event of the plasmid. Strains with the integrated gene can be screened by rescue of auxotrophy and genome sequencing.
A recombinant Saccharomycescerevisiae host is modified to express multiple PT genes, as illustrated in
Construction of a modified Saccharomycescerevisiae host is carried out by co-expressing downstream cannabinoid synthases with (i) a fungal-derived PT enzyme, (ii) a mixture of fungal-derived PT enzymes, or (iii) a mixture of fungal derived PT enzymes and plant-derived PT enzymes, as shown in
Modified host cells which yield cannabinoids, such as the cannabinoids produced by the fungal and mixed PT enzymes described herein, express engineered (i) a fungal-derived PT, (ii) a mixture of fungal-derived PTs, or (iii) a mixture of fungal-derived PTs and plant derived PTs. More specifically, the cannabinoid-producing strain herein is grown in a feedstock as described in U.S. Pat. Application No. 17/068636. An example feedstock use for a modified host expressing the fungal cannabinoid pathway would be growing the strain in a minimal-complete or rich culture media containing yeast nitrogen base, amino acids, vitamins, ammonium sulfate, and a carbon source, such as glucose or molasses. The feedstock is consumed by the modified host which expresses the fungal cannabinoid pathway to convert the feedstock into (i) biomass, (ii) cannabinoids, and (iii) biomass and cannabinoids. Strains expressing the fungal cannabinoid pathway can be grown on feedstock for 12 to 160 hours at 25-37° C. for isolation of the cannabinoid products.
To identify fermentation-derived cannabinoids, such as CBGA, CBGVA, THCVA, THCA, THCV etc. and all other products of a recombinant host expressing an engineered biosynthetic pathway for cannabinoids (see
For example,
As another example, in
In view of the above, it will be seen that several objectives of the invention are achieved and other advantages attained.
As various changes could be made in the above methods and compositions without departing from the scope of the invention, it is intended that all matter contained in the above description and shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.
All references cited in this specification, including but not limited to patent publications and non-patent literature, and references cited therein, are hereby incorporated by reference. The discussion of the references herein is intended merely to summarize the assertions made by the authors and no admission is made that any reference constitutes prior art. Applicants reserve the right to challenge the accuracy and pertinence of the cited references.
As used herein, in particular embodiments, the terms “about” or “approximately” when preceding a numerical value indicates the value plus or minus a range of 10%. Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range is encompassed within the disclosure. That the upper and lower limits of these smaller ranges can independently be included in the smaller ranges is also encompassed within the disclosure, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the disclosure.
The indefinite articles “a” and “an,” as used herein in the specification and in the embodiments, unless clearly indicated to the contrary, should be understood to mean “at least one.”
The phrase “and/or,” as used herein in the specification and in the embodiments, should be understood to mean “either or both” of the elements so conjoined, i.e., elements that are conjunctively present in some cases and disjunctively present in other cases. Multiple elements listed with “and/or” should be construed in the same fashion, i.e., “one or more” of the elements so conjoined. Other elements can optionally be present other than the elements specifically identified by the “and/or” clause, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, a reference to “A and/or B”, when used in conjunction with open-ended language such as “comprising” can refer, in one embodiment, to A only (optionally including elements other than B); in another embodiment, to B only (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
As used herein in the specification and in the embodiments, “or” should be understood to have the same meaning as “and/or” as defined above. For example, when separating items in a list, “or” or “and/or” shall be interpreted as being inclusive, i.e., the inclusion of at least one, but also including more than one, of a number or list of elements, and, optionally, additional unlisted items. Only terms clearly indicated to the contrary, such as “only one of” or “exactly one of,” or, when used in the embodiments, “consisting of,” will refer to the inclusion of exactly one element of a number or list of elements. In general, the term “or” as used herein shall only be interpreted as indicating exclusive alternatives (i.e. “one or the other but not both”) when preceded by terms of exclusivity, such as “either,” “one of,” “only one of,” or “exactly one of.” “Consisting essentially of,” when used in the embodiments, shall have its ordinary meaning as used in the field of patent law.
As used herein in the specification and in the embodiments, the phrase “at least one,” in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements. This definition also allows that elements can optionally be present other than the elements specifically identified within the list of elements to which the phrase “at least one” refers, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, “at least one of A and B” (or, equivalently, “at least one of A or B,” or, equivalently “at least one of A and/or B”) can refer, in one embodiment, to at least one, optionally including more than one, A, with no B present (and optionally including elements other than B); in another embodiment, to at least one, optionally including more than one, B, with no A present (and optionally including elements other than A); in yet another embodiment, to at least one, optionally including more than one, A, and at least one, optionally including more than one, B (and optionally including other elements); etc.
This application claims the benefit of U.S. Provisional Application No. 63/053,539, filed Jul. 17, 2020, and incorporated by reference herein in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/042090 | 7/16/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63053539 | Jul 2020 | US |