The present invention relates to genetically modified microorganisms producing bibenzylic acids or derivatives thereof using an operative metabolic pathway, comprising a double bond reductase (DBR), capable of reducing the C2-C3 alkene double bond of phenylpropanoid precursors, a polyketide synthase (PKS) capable of producing a tetraketide derived from the reduced phenylpropanoid, a polyketide cyclase (PKC), capable of producing a bibenzylic acid (BBA) from said tetraketide, a prenyltransferase (PT) capable of transferring a geranyl to the BBA, producing a bibenzylgerolic acid (BBGA), and a cyclase capable of cyclizing said geranylated BBGA to yield perrottetitenoic acid (PETA), which spontaneously or after induction by heat, yields perrottetinene (PET). It includes methods of producing BBAs using such microorganisms, comprising the biosynthetic pathway enzymes, and to compositions, comprising BBA and its derivatives, resulting from such methods. It further includes the use of BBA and derivatives as modulators of the endocannabinoid system, including receptors 1 and 2 (CB1 and CB2).
Bibenzylic acids (BBA) form a distinct group of plant metabolites. The core structure is related to the better-known stilbenes and dihydrostilbenes, but as the name BBA suggests, they comprise a carboxylic acid group which, in the case of stilbenes, is normally lost during biosynthesis and ring closure of the second ring. BBA derivatives are found in various plants, e.g. Pidgeon pea (Cajanus cajan) and Hortensia (Hydrangea macrophylla), and the corresponding dihydro-versions have been found in several liverworts, including the genus Radula (Chicca 2019), as well as in the two legumes Glycyrrhiza foetida and Amorpha fruticosa (Weidner 2012) the latter of which has given name to the BBA group of amorfrutins. A geranylated BBA was recently discovered in Radula perrottetii and was named perrottetinene (PET). Interestingly, the chemical structure of PET and its precursors share a high similarity to structures of the cannabinoid biosynthetic pathway (
Whereas some cannabinoids, e.g. THCA and CBDA, can be extracted from plants, in particular Cannabis sativa, there is an increasing interest in producing these molecules, and derivatives thereof by microbial fermentation. This will allow the study of their physiological effects, with the aim of identifying compounds with positive effects regarding human health. The many biological activities of cannabinoids has been recently reviewed, e.g. by Russo 2011 and Carvalho et al. 2017. Hence, due to the close structural relationship between cannabinoids and bibenzylic acids and derivatives, it is of major interest to study whether BBA derived molecules will have benefits and applications in the same or related pharmaceutical areas as for cannabinoids. Some indications of this are the fact that BBAs (aka amorfrutins) are already known to interact with the PPAR gamma receptor (Sauer et al. 2014), and that molecules like perrottetinene (see
After the early discoveries of BBAs, also known as stilbene carboxylates (STC), in certain plants, it was hypothesised that these structures were the products, intermediates, or derailment products of a single polyketide Type III synthase (PKS). However, Shibuya et al. (2002) demonstrated that STC is not a direct intermediate in the pathway to stilbenes, and Eckermann et al. (2003) reported having identified an enzyme responsible for making the related dihydro-stilbene carboxylate (DSTC), in this case a 5-hydroxy lunularic acid. Hence, they named the enzyme stilbene carboxylate synthase (STCS). However, activity of STCS was only demonstrated in vitro with a purified, recombinant enzyme and with dihydro-coumaroyl-CoA as substrate and has never been confirmed in vivo. Later, Austin et al. (2004) briefly mentioned the possibility of an unknown, auxiliary protein being responsible for the aldol cyclization of STCs. However, the authors proposed that the STC is likely derived from a rearrangement of the coumaroyl-triacetic acid lactone (CTAL), a derailment product of STS and similar PKS enzymes. In 2012 Gagne and co-workers (Gagne et al. 2012) demonstrared that olivetolic acid is made by a PKS and a tetraketide cyclase (TKC). They identified the polyketide cyclase OAC (olivetolic acid cyclase) to be responsible for the Aldol C2-C7 cyclization of a tetraketide precursor, with retention of the carboxyl group, to form olivetolic acid. The production of olivetolic acid was proposed to proceed from a tetraketide-CoA, which is released by the PKS (renamed to TKS for tetraketide synthase). They suggested that there may be an overlooked class of polyketide cyclases, structurally related to DABB enzymes from bacteria and stress proteins in plants. However, no new such cyclases have been reported to date. In summary, Gagne et al. (2012) showed that some polyketides (PKs) are formed by the concerted action of a PKS and a cyclase. Despite speculations that other PKs could be formed by such a two-step pathway, they did not mention that this could be the case for STCs or DSTCs.
Mechanistic studies of OAC were done by Yang et al. 2016. They refer to OAC as the only known plant PK cyclase, and the only characterized DABB protein from plants. The crystal structure of OAC was presented and the authors stated that it has a unique active site, including a pentyl binding pocket, responsible for binding the linear pentyl-tetra-β-ketide-CoA, the natural product of the C. sativa TKS. A number of proteins with homology to OAC, e.g. from A. thaliana, did not catalyse the cyclization (as already shown by Gagne), and they concluded that this is due to differences in size and shape of the active site.
Another study was done by Taura et al. (2016) in which they demonstrated that orselinic acid can be produced by the PKS orselinic acid synthase (ORS) and OAC, in a reaction similar to what was shown for the PKS (TKS) and OAC in the production of olivetolic acid. This showed that, in addition to the natural pentyl-tetra-β-ketide-CoA, which is used for production of olivetolic acid, the OAC can accept other substrates, however in this case the much smaller methyl-tetra-β-ketide-CoA produced by ORS.
Most recently it was shown that various other short fatty acids, in addition to hexanoic acid, are accepted as substrates by the TKC (OAC) responsible for producing OA, and yields novel OA-like compounds (Luo et al. 2019)
In two recent publications (Weidner et al. 2012, de Groot et al. 2013) two bioactive BBA derived molecules were reported, amorfrutin A1 and amorfrutin B, respectively, which the authors refer to as dihydrostilbene carboxylates (
Despite being natural compounds, no larger scale production of specific amorfrutins or other BBA derivatives from plant extracts has been developed, and instead a chemical synthesis route was developed in order to test the two compounds (de Groot et al. 2013).
The biosynthetic pathway of THCA and CBDA, starting from hexanoyl-CoA, have been described (Schachtsiek et al. 2018) and starts with the precursor hexanoyl-CoA, which is extended by the Cannabis sativa tetraketide synthase (TKS), a type III polyketide synthase (PKS), with 3 units of malonyl-CoA into a tetraketide, which is subsequently cyclized into olivetolic acid (OA) by the C. sativa olivetolic acid cyclase (OAC). OAC is, to date, the only known polyketide cyclase (PKC). Similar biosynthetic pathways, depending on PKS enzymes, are well known from plants, such as those involved in the production of stilbenes and chalcones. These pathways start from phenylpropanoyl-CoA precursors, and are extended, as for OA, with 3 units of malonyl-CoA. However, unlike olivetolic acid, the stilbenes and chalcones are made by a single PKS enzyme, the stilbene synthase (STS) and the chalcone synthase (CHS), respectively, without the need for an accessory cyclase.
Nevertheless, it is known that these PKS enzymes, in particular of the STS type, also produces non-cyclized polyketides, either tri-ketides or tetraketides (Austin et al. 2004) which can spontaneously and non-enzymatically cyclize by lactonization or aldol cyclization. It is also known that PKS enzymes are able to accept phenylpropanoyl-CoA substrates in which the C2-C3 alkene bond has been reduced, e.g. by the yeast double bond reductase (DBR) ScTSC13 (Eichenberger et al. 2017). In that case, using CHS-type PKS enzymes, the production of dihydrochalcones, e.g. phloretin, was reported.
In 2012 the cyclase CsOAC was identified (Gagne 2012) and the authors reported that this cyclase did not need to physically interact with the PKS (in this case the CsTKS) in order to produce OA, and the authors could not detect any physical interaction. The study showed, however, that when CsOAC and CsTKS were assayed together in the same chamber, thus allowing physical interaction, the production of OA was considerably higher.
Providing technical improvement, advantages and/or advancements over the background art the present invention provides solutions to technical problems identified in the art. In particular in view of the background art the present inventor has surprisingly found that the OAC is able to accept the much larger dihydro-cinnamoyl-tetra-β-ketide-CoA, and use this substrate to produce a BBA, i.e. dihydro stilbene carboxylate and that for studying the potential health benefits of these compounds, and for sustainable commercial production, a process involving fermentation from genetically modified microorganisms e.g. yeast will be attractive. It was further found by the present inventor that STS-type PKS enzymes are also able to use reduced phenylpropanoyl-CoA precursors as substrate. Together with 3 units of malonyl-CoA this would result in the formation of dihydro-tetraketide intermediates, and it has been found that the further co-expression of a PKC, leads to formation of bibenzylic acids (BBA) (
Accordingly, through the discovery that (i) the polyketide synthases (PKS), preferably of the STS type, when using a reduced dihydro-phenylpropanoid-CoA precursor, release substantial amounts of free tetraketide, as compared to the normal production of flavonoid compounds such as dihydropinosylvin, in the case of CHS, or stilbenoid compounds such as dihydroresveratrol, in the case of STS, and that (ii) the free tetraketides produced by the PKS are substrates of cyclases, such as the OAC, despite the fact that these phenylpropanoid derived tetraketides are much more bulky than other known substrates of polyketide cyclases, a first aspect of the present invention provides a genetically modified microbial host cell capable of producing bibenzylic acids or derivatives thereof wherein the genetically modified host cell expresses:
In further aspects the invention provides a cell culture, comprising the cell of the invention and a growth medium as well as a method of producing the BBA or a derivative thereof, comprising:
In a still further aspect, the invention provides a fermentation composition comprising the cell culture of the invention and the BBA or a derivative thereof, optionally in the form of a dimer, as well as compositions comprising the fermentation composition of the invention and one or more carriers, agents, adjuvants, additives and/or excipients; and the use of said composition of the invention for use as a medicament. This aspect also includes a method for treating a disease in a mammal, comprising administering a therapeutically effective amount of the composition of the invention to the mammal.
The features and advantages of the present invention is readily apparent to those skilled in the art by the below detailed description of embodiments and examples of the invention with reference to the figures and drawings included herein where:
All publications, patents, and patent applications referred to herein are incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. In the event of a conflict between a term herein and a term in an incorporated reference, the term herein prevails and controls.
The term “PKS” or “polyketide synthase” or “polyketide type III synthase” as used herein refers to an enzyme catalyzing the extension of a CoA-activated substrate with one or more malonyl-CoA units. Chalcone synthase or CHS is one example of a type 3 polyketide synthase enzyme capable of synthesizing a chalcone by condensing 3 molecules of malonyl-CoA with a phenylpropanoyl-CoA (aka (hydroxy)-cinnamoyl-CoA), such as a naringenin chalcone from one molecule of p-coumaroyl-CoA and three molecules of malonyl-CoA. Another example of a PKS is orselinic acid synthase (ORS) capable of catalyzing conversion of acetyl-CoA plus three molecules of malonyl-COA into orselinic acid. Another example of a PKS is tetraketide synthase or TKS capable of catalyzing conversion of hexanoyl-CoA plus three molecules of malonyl-CoA into a tetraketide, the normal substrate of CsOAC. Another example of a PKS is stilbene synthase or STS, a type 3 polyketide synthase enzyme capable of catalyzing the formation of a stilbene or dihydrostilbene from one molecule of (dihydro-)cinnamoyl-CoA or (dihydro-) p-coumaroyl-CoA and three molecules of malonyl-CoA.
The term “CHIL” as used herein refers to chalcone isomerase-like protein, a polypeptide also known as the non-catalytic CHI types III and IV.
The term “BBA” or “Bibenzylic acid” or derivatives thereof as used herein refers to a compound in which two phenyl groups are linked via a 2-carbon bridge, and one phenyl group carries a carboxyl group, such a compound also known as a stilbene carboxylate (STC) or a dihydro-stilbene carboxylate (DSTC) (
The term “STC” or “stilbene carboxylate” as used herein refers to a stilbene in which one of the phenyl groups carries a carboxyl group. The carbons linking the phenyl groups are themselves connected by a double bond, unlike the DSTC in which they are connected by a single bond.
The term “STCS” or “stilbene carboxylate synthase” as used herein refers to an enzyme catalysing conversion of a (dihydro-)-(hydroxy-)-cinnamoyl-CoA starter molecule plus three molecules of malonyl-CoA into an STC or an DSTC in the genetically modified microorganism.
The term “DSTC” or “dihydro-stilbene carboxylate” as used herein refers to a BBA in which the carbons linking the two phenyl groups are connected via a single bond
The term “coumaroyl-triacetic acid lactone” (CTAL) as used herein refers to a compound formed by spontaneous lactonization of a free p-coumaroyl-tetraketide (aka p-coumaroyl-triacetic acid) released from a PKS type III enzyme such as CHS, TKS, or STS
The term “PAL” as used herein refers to phenylalanine ammonia lyase, an enzyme catalyzing conversion of phenylalanine to cinnamic acid.
The term “4CL” as used herein refers to 4-coumarate-CoA-ligase, an enzyme catalyzing conversion of the ligation of CoA to various phenylpropanoic acids.
The term “tetraketide cyclase” or “TKC” or “polyketide cyclase” or “PKC” as used herein refers to an enzyme catalyzing conversion of a free tetraketide to a phenolic acid, such as orselinic acid, oliveetolic acid, or bibenzylic acid. These terms are used interchangeably. One example of a tetraketide cyclase is olivetolic acid cyclase or OAC.
The term “OA as used herein refers to olivetolic acid (
The term “double bond reductase” or “DBR” refers to an enzyme catalyzing reduction of the C2-C3 double bond of one or more substrates selected from cinnamoyl-CoA, p-Coumaroyl-CoA, Caffeoyl-CoA, Feruoyl-CoA into the respective dihydrocinnamoyl-CoA, p-dihydrocoumaroyl-CoA, dihydrocaffeoyl-CoA and dihydroferuloyl-CoA (
The term “cinnamate 4-hydroxylase” or “C4H” as used herein refers to a CYP450 trans-cinnamate 4-monooxygenase enzyme also known as cinnamate 4-hydroxylase, catalyzing conversion of cinnamic acid to p-coumaric acid.
The term “CYP450 as used herein” refers to an enzyme of the Cytochrome P450 family, catalysing oxidation of a range of substrates. Upon acting on a substrate, CYP450 must be reduced by its cognate reductase (CPR) to regain catalytic capacity.
The term “cytochrome p450 reductase” or “CPR” as used herein refers to an enzyme catalyzing the reduction of CYP450 enzymes.
The term “tyrosine ammonia lyase” or “TAL” as used herein refers to an enzyme catalyzing conversion of tyrosine to p-coumaric acid.
The term “4-coumarate-CoA ligase” or “4CL” as used herein refers to an enzyme catalyzing conversion of the ligation of CoA to various phenylpropanoic acids.
The term “chalcone isomerase type” or “CHI” as used herein refers to an enzyme catalysing stereospecifical isomerization of a chalcone to a (2S)-flavanone.
The terms “heterologous” or “recombinant” or “genetically modified” and their grammatical equivalents as used herein interchangeably refers to entities “derived from a different species or cell”. For example, a heterologous or recombinant polynucleotide gene is a gene in a host cell not naturally containing that gene, i.e. the gene is from a different species or cell type than the host cell. The terms as used herein about host cells refers to host cells comprising and expressing heterologous or recombinant polynucleotide genes.
The term “pathway” or “biosynthetic pathway” or “metabolic pathway” as used herein is intended to mean an enzyme acting in a live cell to convert a chemical substrate into a chemical product. A pathway may include one enzyme or multiple enzymes acting in sequence. A pathway including only one enzyme may also herein be referred to as “bioconversion” in particular relevant for embodiments where the cell of the invention is fed with a precursor or substrate to be converted by the enzyme into a desired product. Enzymes are characterized by having catalytic activity, which can change the chemical structure of the substrate(s). An enzyme may have more than one substrate and produce more than one product. The enzyme may also depend on cofactors, which can be inorganic chemical compounds or organic compounds (co-factor and/or co-enzymes or non-catalytic polypeptides). The NADPH-dependent cytochrome P450 reductase (CPR) is an electron donor to cytochromes P450 (CYPs). CPR shuttles electrons from NADPH through the Flavin Adenine Dinucleotide (FAD) and Flavin Mononucleotide (FMN) coenzymes into the iron of the prosthetic heme-group of the CYP. The term “operative biosynthetic metabolic pathway” refers to a metabolic pathway that occurs in a live recombinant host, as described herein.
The term “in vivo”, as used herein refers to within a living cell or organism, including, for example animal, a plant or a microorganism.
The term “in vitro”, as used herein refers to outside a living cell or organism, including, without limitation, for example, in a microwell plate, a tube, a flask, a beaker, a tank, a reactor and the like.
The term “substrate” or “precursor”, as used herein refers to any compound that can be converted into a different compound. For clarity, substrates and/or precursors include both compounds generated in situ by an enzymatic reaction in a cell or exogenously provided compounds, such as exogenously provided organic molecules which the host cell can metabolize into a desired compound.
Term “endogenous” or “native” as used herein refers to a gene or a polypepetide in a host cell which originates from the same host cell.
The term “deletion” as used herein refers to manipulation of a gene so that it is no longer expressed in a host cell.
The term “disruption” as used herein refers to manipulation of a gene or any of the machinery participating in the expression the gene, so that it is no longer expressed in a host cell.
The term “attenuation” as used herein refers to manipulation of a gene or any of the machinery participating in the expression the gene, so that the expression of the gene is reduced as compared to expression without the manipulation.
The terms “substantially” or “approximately” or “about”, as used herein refers to a reasonable deviation around a value or parameter such that the value or parameter is not significantly changed. These terms of deviation from a value should be construed as including a deviation of the value where the deviation would not negate the meaning of the value deviated from. For example, in relation to a reference numerical value the terms of degree can include a range of values plus or minus 10% from that value. For example, deviation from a value can include a specified value plus or minus a certain percentage from that value, such as plus or minus 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1% from the specified value.
The term “and/or” as used herein is intended to represent an inclusive “or”. The wording X and/or Y is meant to mean both X or Y and X and Y. Further the wording X, Y and/or Z is intended to mean X, Y and Z alone or any combination of X, Y, and Z.
The term “isolated” as used herein about a compound, refers to any compound, which by means of human intervention, has been put in a form or environment that differs from the form or environment in which it is found in nature. Isolated compounds include but is not limited to compounds of the invention for which the ratio of the compounds relative to other constituents with which they are associated in nature is increased or decreased. In an important embodiment the amount of compound is increased relative to other constituents with which the compound is associated in nature. In an embodiment the compound of the invention may be isolated into a pure or substantially pure form. In this context a substantially pure compound means that the compound is separated from other extraneous or unwanted material present from the onset of producing the compound or generated in the manufacturing process. Such a substantially pure compound preparation contains less than 10%, such as less than 8%, such as less than 6%, such as less than 5%, such as less than 4%, such as less than 3%, such as less than 2%, such as less than 1%, such as less than 0.5% by weight of other extraneous or unwanted material usually associated with the compound when expressed natively or recombinantly. In an embodiment the isolated compound is at least 90% pure, such as at least 91% pure, such as at least 92% pure, such as at least 93% pure, such as at least 94% pure, such as at least 95% pure, such as at least 96% pure, such as at least 97% pure, such as at least 98% pure, such as at least 99% pure, such as at least 99.5% pure, such as 100% pure by weight.
The term “% identity” is used herein about the relatedness between two amino acid sequences or between two nucleotide sequences. “% identity” when used herein about amino acid or nucleotide sequences refers to the degree of identity in percent between two amino acid sequences obtained when using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 5.0.0 or later. The parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. The output of Needle labeled “longest identity” (obtained using the -nobrief option) is used as the percent identity and is calculated as follows:
“% identity” when used herein about nucleotide sequences refers to the degree of identity in percent between two nucleotide sequences obtained when using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 5.0.0 or later. The parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix. The output of Needle labeled “longest identity” (obtained using the -nobrief option) is used as the percent identity and is calculated as follows:
The protein sequences of the present invention can further be used as a “query sequence” to perform a search against sequence databases, for example to identify other family members or related sequences. Such searches can be performed using the BLAST programs. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov). BLASTP is used for amino acid sequences and BLASTN for nucleotide sequences. The BLAST program uses as defaults:
Cost to open gap: default=5 for nucleotides/11 for proteins
Cost to extend gap: default=2 for nucleotides/1 for proteins
Penalty for nucleotide mismatch: default=−3
Reward for nucleotide match: default=1
Expect value: default=10
Wordsize: default=11 for nucleotides/28 for megablast/3 for proteins.
Furthermore, the degree of local identity between the amino acid sequence query or nucleic acid sequence query and the retrieved homologous sequences is determined by the BLAST program. However only those sequence segments are compared that give a match above a certain threshold. Accordingly, the program calculates the identity only for these matching segments. Therefore, the identity calculated in this way is referred to as local identity. Alternatively, % identity for any candidate nucleic acid or amino acid sequence relative to a reference sequence can be determined as follows. A reference sequence (e.g., a nucleic acid sequence or an amino acid sequence described herein) is aligned to one or more candidate sequences using the computer program Clustal Omega (version 1.2.1, default parameters), which allows alignments of nucleic acid or polypeptide sequences to be carried out across their entire length (global alignment). Chenna et al., 2003, Nucleic Acids Res. 31(13):3497-500. Clustal Omega calculates the best match between a reference and one or more candidate sequences, and aligns them so that identities, similarities and differences can be determined. Gaps of one or more residues can be inserted into a reference sequence, a candidate sequence, or both, to maximize sequence alignments. For fast pairwise alignment of nucleic acid sequences, the following default parameters are used: word size: 2; window size: 4; scoring method: % age; number of top diagonals: 4; and gap penalty: 5. For multiple alignment of nucleic acid sequences, the following parameters are used: gap opening penalty: 10.0; gap extension penalty: 5.0; and weight transitions: yes. For fast pairwise alignment of protein sequences, the following parameters are used: word size: 1; window size: 5; scoring method:% age; number of top diagonals: 5; gap penalty: 3. For multiple alignment of protein sequences, the following parameters are used: weight matrix: blosum; gap opening penalty: 10.0; gap extension penalty: 0.05; hydrophilic gaps: on; hydrophilic residues: Gly, Pro, Ser, Asn, Asp, Gln, Glu, Arg, and Lys; residue-specific gap penalties: on. The Clustal Omega output is a sequence alignment that reflects the relationship between sequences. Clustal Omega can be run, for example, at the Baylor College of Medicine Search Launcher site on the World Wide Web (searchlauncher.bcm.tmc.edu/multi-align/multi-align.html) and at the European Bioinformatics Institute site at http://www.ebi.ac.uk/Tools/msa/clustalo/. To determine a % identity of a candidate nucleic acid or amino acid sequence to a reference sequence, the sequences are aligned using Clustal Omega, the number of identical matches in the alignment is divided by the length of the reference sequence, and the result is multiplied by 100. It is noted that the % identity value can be rounded to the nearest tenth. For example, 78.11, 78.12, 78.13, and 78.14 are rounded down to 78.1, while 78.15, 78.16, 78.17, 78.18, and 78.19 are rounded up to 78.2.
The term “coding sequence” refers to a nucleotide sequence, which directly specifies the amino acid sequence of a polypeptide. The boundaries of the coding sequence are generally determined by an open reading frame, which begins with a start codon such as ATG, GTG, or TTG and ends with a stop codon such as TAA, TAG, or TGA. The coding sequence may be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.
The term “control sequence” or “promoter” as used herein refers to a nucleotide sequence necessary for expression of a polynucleotide encoding a polypeptide. A control sequence may be native (i.e., from the same gene) or heterologous or foreign (i.e., from a different gene) to the polynucleotide encoding the polypeptide. Control sequences include, but are not limited to leader sequences, polyadenylation sequence, pro-peptide coding sequence, promoter sequences, signal peptide coding sequence, translation terminator (stop) sequences and transcription terminator (stop) sequences. To be operational control sequences usually must include promoter sequences, transcriptional and translational stop signals. Control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with a coding region of a polynucleotide encoding a polypeptide.
The term “expression” includes any step involved in the production of a polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
The term “expression vector” refers to a DNA molecule, either single- or double stranded, either linear or circular, which comprises a polynucleotide encoding a polypeptide and is operably linked to control sequences that provide for its expression. Expression vectors include expression cassettes for the integration of genes into a host cell as well as plasmids and/or chromosomes comprising such genes.
The term “host cell” refers to any cell type that is susceptible to transformation, transfection, transduction, or the like with a nucleic acid construct or expression vector comprising a polynucleotide of the present invention. Host cell encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication.
The term “polynucleotide construct” refers to a polynucleotide, either single- or double stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature or which is synthetic, and which comprises a polynucleotide encoding a polypeptide and one or more control sequences.
The term “operably linked” refers to a configuration in which a control sequence is placed at an appropriate position relative to the coding polynucleotide such that the control sequence directs expression of the coding polynucleotide.
The terms “nucleotide sequence and “polynucleotide” are used herein interchangeably.
The term “comprise” and “include” as used throughout the specification and the accompanying items as well as variations such as “comprises”, “comprising”, “includes” and “including” are to be interpreted inclusively. These words are intended to convey the possible inclusion of other elements or integers not specifically recited, where the context allows.
The term “express” as used herein, refers to a gene which is transcribed and translated in a cell to produce a peptide or polypeptide.
The articles “a” and “an” are used herein refers to one or to more than one (i.e. to one or at least one) of the grammatical object of the article. By way of example, “an element” may mean one element or more than one element.
Terms like “preferably”, “commonly”, “particularly”, and “typically” are not utilized herein to limit the scope of the itemed invention or to imply that certain features are critical, essential, or even important to the structure or function of the itemed invention. Rather, these terms are merely intended to highlight alternative or additional features that can or cannot be utilized in a particular embodiment of the present invention.
The term “cell culture” as used herein refers to a culture medium comprising a plurality of host cells of the invention. A cell culture may comprise a single strain of host cells or may comprise two or more distinct host cell strains. The culture medium may be any medium that may comprise a recombinant host, e.g., a liquid medium (i.e., a culture broth) or a semi-solid medium, and may comprise additional components, e.g., a carbon source such as dextrose, sucrose, glycerol, or acetate; a nitrogen source such as ammonium sulfate, urea, or amino acids; a phosphate source; vitamins; trace elements; salts; amino acids; nucleobases; yeast extract; aminoglycoside antibiotics such as G418 and hygromycin B.
The genetically modified host cell of the invention produces bibenzylic acids or a derivative thereof due to the operation of a metabolic pathway expressing:
In some embodiments the double bond reductase is a native enoyl-reductase, and optionally it is overexpressed at least two-fold compared to the native expression level. In other embodiments the double bond reductase is a heterologous reductase capable of reducing the alkene C2-C3 double bond of a phenylpropanoid or phenylpropanoyl-CoA precursor. The double bond reductase of the invention can be one which comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the double bond reductase comprised in SEQ ID NO: 3.
In further embodiments a polyketide synthase which produces a linear tetraketide, and/or a free activated linear tetraketide-CoA, optionally from a dihydrophenylpropanoid, is selected. Such polyketide synthases include polyketide synthases which comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the stilbene type polyketide synthases comprised in SEQ ID NO: 4, 5, 6, 7, 8, 9, or 10. In a particular embodiment the polyketide synthase has little or no activity towards C2-C7 aldol condensation on a tetraketide substrate, so that or example less than 70% of tetraketide substrate conversion produces a C2-C7 aldol condensation, such as less than 50%, such as less than 30%, such as less than 10% of, such as less that 5%. Reduction or elimination of activity towards C2-C7 aldol condensation can optionally be optimized by mutating the polyketide synthase to achieve an inactivating mutation. The active sites of both CHS and STS type enzymes have been elucidated, including specific residues involved in the so-called aldol switch in STS (Austin et al. 2004). Targeting these residues can disrupt the cyclization, as has been shown for the CHS T197L mutation, which releases free tetraketides.
In some embodiments, the polyketide cyclase comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the polyketide cyclase comprised in SEQ ID NO: 11. More specifically the polyketide cyclase is the olivetolic acid cyclase comprised in SEQ ID NO. 11.
The tyrosine ammonia lyase is suitably one which comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the tyrosine ammonia lyase comprised in SEQ ID NO: 25.
The phenylalanine ammonia lyase is suitably one which comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the phenylalanine ammonia lyase comprised in SEQ ID NO: 1.
The cinnamate 4-hydroxylase is suitably one which comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the cinnamate 4-hydroxylase comprised in SEQ ID NO: 26.
The cytochrome p450 reductase is suitably one which comprises amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the cytochrome p450 reductase comprised in SEQ ID NO: 27.
The BBAs can be further modified, e.g. by prenylation, to eventually yield derivatives, such as perrottetinoic acid (PETA,
In further embodiments the cell of the invention further expresses a gene encoding a non-catalytic chalcone isomerase like polypeptide (CHIL), enhancing the production of BBA or derivatives thereof. The non-catalytic chalcone isomerase like polypeptide is suitably one which comprises an amino acid sequence which is at least 65%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the non-catalytic chalcone isomerase like polypeptide comprised in SEQ ID NO: 23 or 24. Moreover, in one embodiment of the invention the OAC is replaced by a non-catalytic chalcone isomerase like (preferably type 4) protein, which via physical interaction with the PKS enzyme controls the reaction of the latter, to cyclize the formed tetraketide with retention of the C1 carboxy group.
The bibenzylic acid (BBA) of the invention is particularly BBA's as defined by formula (I):
wherein R1-R7 can be either —H, —OH, —OCH3, —COOH, an acyl group, or defined by formula (II):
wherein R1, and R3-R7 can be either —H, —OH, —OCH3, —COOH, an acyl group, or a prenyl group, or defined by Formula (III):
wherein R1, and R3-R7 can be either —H, —OH, —OCH3, —COOH, an acyl group, or a prenyl group, or defined by Formula (IV):
wherein R1, and R4-R7 can be either —H, —OH, —OCH3, —COOH, an acyl group, or a prenyl group.
Accordingly, the cell of the invention expresses the enzymes to synthesize the phenylpropanoyl-CoA precursor, a double bond reductase such as ScTSC13 to reduce the C2-C3 double bond of said precursor, a PKS enzyme, and a CHIL type 4 protein, capable of interacting with the PKS, resulting in the formation of a BBA as defined by formula (I).
Any of the groups R1-R7 can subsequently be rearranged or be further modified, spontaneously, biosynthetically, or synthetically. Accordingly, in further embodiments the cell of the invention expresses a gene encoding a synthase converting a compound of Formula (II) into a compound of formula (III). In particular such synthase comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the synthase comprised in SEQ ID NO: 22.
In further embodiments, the cell of the invention may express a gene encoding a synthase converting a compound of Formula (II) into a compound as defined by Formula (IV). In particular such synthase comprises an amino acid sequence which is at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% identical to the synthase comprised in SEQ ID NO: 21.
It is also contemplated that the CHIL type 4 interacts with the PKS to enhance the production of free polyketide. The PKS in this embodiment can be any PKS, such as an STS or mutated CHS, capable of releasing the free tetraketide. Hence, in another embodiment the cell of the invention expresses the enzymes to synthesize the phenylpropanoyl-CoA precursor, a double bond reductase such as ScTSC13 to reduce the C2-C3 double bond of said precursor, a PKS enzyme, a CHIL type 4 protein, capable of interacting with the PKS, resulting in the increased formation of a free tetraketide, and a PKC such as CsOAC, capable of cyclizing said tetraketide to form a BBA as defined by Formula (I), above.
The co-expression of a CHIL type 4 protein, capable of positively interacting with the relevant PKS, has positive implications for the production of dihydrochalcones, such as those described by Eichenberger (2017). Further, if the PKS, such as the one described by Pluskal (2019), is efficiently releasing triketides or dihydro-triketides, the co-expression of a CHIL type 4 protein has a beneficial effect on the production of kavalactones including bisnoryangonin, yangonin, or its corresponding dihydro-versions.
The full biosynthetic pathway from OA to some of the most important cannabinoids THCA (tetrahydrocannabinolic acid) and CBDA (cannabidiolic acid) has recently been expressed in a microbial host (Saccharomyces cerevisiae) (Luo et al. 2019). The pathway expresses the prenyl transferase (CsPT4) from Cannabis sativa that transfers a geranyl group to OA to produce cannabigerolic acid (CBGA), followed by the cyclization of the geranyl side group by two different synthases to form either THCA (by THCA synthase) or CBDA (by CBDA synthases). Heat treatment is known to convert THCA and CBDA into THC (tetrahydrocannabinol) and CBD (cannabidiol), respectively (
In certain embodiments it is advantageous to change the cellular localization of proteins or enzymes of the invention. For example, the expression of THCA synthase (THCAS) and CBDA synthase (CBDAS) may advantageously be expressed in the vacuole of the host cell. This can be achieved by changing the localization signal of the native protein with one that directs the transport to the vacuole. An example of this is described by Zirpel et al. 2015, in which the use of a vacuolar localization signal (MIFDGTTMSIAIGLLSTLGIGAEA) from the protease proteinase A (Acc. No. F2QUG8) allows expression of THCAS in a yeast host cell. Accordingly, the THCAS and CBDAS, as used in the current invention, were both synthesized in such a way as to remove the N-terminal signal sequence of 28 amino acids, which were then replaced by the vacuolar N-terminal signal, comprising 24 amino acids, of the proteinase A (Zirpel et al. 2015; Lou et al. 2019).
Similarly, certain embodiments may benefit from relocalization of prenyltransferases. Hence, the cellular localization of prenyltransferases, used for the current invention, were N-terminally truncated to remove the plastid targeting signal. Amino acid sequences of full-length proteins were aligned, and signal sequences were putatively identified at the N-terminal end. Synthetic genes, encoding these prenyltransferases, were then designed and synthesized, excluding between 62 and 88 amino acids at the N-terminal end, resulting in the truncated prenyltransferases (SEQ ID NOS: 13-20) used for the current invention.
In certain embodiments it is advantageous to co-express a non-catalytic chalcone isomerase type III or IV protein (CHIL) in the host cell producing bibenzylic acids, bibenzylgerolic acid, or derivatives thereof. It appears that such CHIL proteins can promote polyketide formation. Alternatively, CHIL proteins may stabilize intermediates or end products by directly binding to them. The current invention demonstrates increased production of THCA when HICHIL2 (SEQ ID NO: 24) is co-expressed together with the full length THCA biosynthetic pathway. It is contemplated that different CHIL proteins have different function as accessory protein for various reactions during polyketide biosynthesis, and that specific CHILs can be found to support specific functions. In certain embodiments it is advantageous to engineer the host cell to produce more of the precursors used for the desired product. For example, it will be advantageous for production of bibenzylic acid (BBA) to optimize the host for production of the natural precursor molecules such as the aromatic amino acids phenylalanine and/or tyrosine, as well as the precursors acetyl-CoA and malonyl-CoA.
The cell of the invention is suitably selected from the genera consisting of Saccharomyces, Schizosaccharomyces, Yarrowia, Candida, Ashbya, Cyberlindnera, Pichia, Kluyveromyces, Hansenula, Arxula, and Xanthophyllomyces, optionally from the species Saccharomyces cerevisiae, Schizosaccharomyces pombe, Yarrowia lipolytica, Candida glabrata, Ashbya gossypii, Cyberlindnera jadinii, Pichia pastoris, Kluyveromyces lactis, Hansenula polymorpha, Candida boidinii, Arxula adeninivorans, Xanthophyllomyces dendrorhous, and Candida albicans.
Further provided for herein are cell cultures comprising the genetically modified host cells of the invention and a growth medium. Suitable growth mediums for relevant prokaryotic or eukaryotic host cells are videly known in the art.
The invention also provides a method for producing BBA or derivatives thereof comprising
The cell culture can be cultivated in a nutrient medium and at conditions suitable for production of the BBA or derivatives thereof of the invention and/or for propagating cell count using methods known in the art. For example, the culture may be cultivated by shake flask cultivation, or small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid-state fermentations) in laboratory or industrial fermenters in a suitable medium and under conditions allowing the host cells to grow and/or propagate, optionally to be recovered and/or isolated. The cultivation can take place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g. from catalogues of the American Type Culture Collection). The selection of the appropriate medium may be based on the choice of host cell and/or based on the regulatory requirements for the host cell. Such media are available in the art. The medium may, if desired, contain additional components favoring the transformed expression hosts over other potentially contaminating microorganisms. Accordingly, in an embodiment a suitable nutrient medium can include one or more of (i) trace metals; (ii) vitamins; (iii) salts (such as salts of phosphate, magnesium, potassium, zinc, iron); (iv) nitrogen sources (such as YNB, ammonium sulfate, urea, yeast extracts, ammonium nitrate, ammonium chloride, malt extract, peptone and/or amino acids); (v) carbon source (such as dextrose, sucrose, glycerol, glucose, maltose, molasses, starch, cellulose, xylan, pectin, lignocellolytic biomass hydrolysate, and/or acetate); (vi) nucleobases; (vii) aminoglycosides; and/or (viii) antibiotics (such as G418 and hygromycin B).
The cultivation of the host cell may be performed over a period of from about 0.5 to about 30 days. The cultivation process may be a batch process, continuous or fed-batch process, suitably performed at a temperature in the range of 0 to 100° C. or 0 to 80° C., for example, from about 0° C. to about 50° C. and/or at a pH, for example, from about 2 to about 10. Preferred fermentation conditions are a temperature in the range of from about 25° C. to about 55° C. and at a pH of from about 3 to about 9. The appropriate conditions are usually selected based on the choice of host cell. Accordingly, in some embodiments the method of the invention comprising one or more elements selected from:
The cell culture of the invention may be recovered and or isolated using methods known in the art. For example, the cells or compound(s) may be recovered from the nutrient medium by conventional procedures including, but not limited to, centrifugation, filtration, spray-drying, or lyophilization. In a particular embodiment the method includes a recovery and/or isolation step comprising separating a liquid phase of the cell culture from a solid phase of the cell culture to obtain a supernatant comprising the BBA or derivatives thereof and subjecting the supernatant to one or more steps selected from:
The method of the invention may further comprise one or more steps of mixing the BBA or derivatives thereof with one or more carriers, agents, adjuvants, additives and/or excipients, optionally pharmaceutical grade carriers, agents, adjuvants, additives and/or excipients.
The method of the invention may further comprise one or more in vitro steps in the process of producing the BBA or derivatives thereof. It may also comprise one or more in vivo steps performed in another cell than the host cell of the invention. Accordingly, in one embodiment the method of the invention further comprises feeding one or more exogenous BBA precursors to the host cell culture.
The invention also provides a fermentation composition comprising the cell culture of the invention and the BBA or derivatives thereof—either comprised in the cells or in the medium. In the fermentation composition the genetically modified host cells may be wholly or partially lysed and/or disintegrated. In some embodiments at least 50%, such as at least 75%, such as at least 95%, such as at least 99% of the genetically modified host cells in the fermentation composition are lysed and/or disintegrated. Further, in the fermentation composition of the invention at least 50%, such as at least 75%, such as at least 95%, such as at least 99% of solid cellular material may have been separated and/or removed from a liquid phase of the fermentation composition.
Moreover, in addition to BBA or derivatives thereof the fermentation composition of the invention may comprise one or more compounds selected from precursor or products of the pathway producing the BBA or derivatives thereof, trace metals, vitamins, salts, yeast nitrogen base, carbon source, YNB, and/or amino acids of the fermentation. In particular the fermentation composition of the invention comprise a concentration of BBA or derivatives thereof of at least 1 mg/kg composition, such as at least 5 mg/kg, such as at least 10 mg/kg, such as at least 20 mg/kg, such as at least 50 mg/kg, such as at least 100 mg/kg, such as at least 500 mg/kg, such as at least 1000 mg/kg, such as at least 5000 mg/kg, such as at least 10000 mg/kg, such as at least 50000 mg/kg.
Suitable supplemental nutrients can include one or more of (i) trace metals; (ii) vitamins; (iii) salts (such as salts of phosphate, magnesium, potassium, zinc, iron); (iv) nitrogen sources (such as YNB, ammonium sulfate, urea, yeast extracts, ammonium nitrate, ammonium chloride, malt extract, peptone and/or amino acids); (v) carbon source (such as dextrose, sucrose, glycerol, glucose, maltose, molasses, starch, cellulose, xylan, pectin, lignocellolytic biomass hydrolysate, and/or acetate); (vi) nucleobases; (vii) aminoglycosides; and/or (viii) antibiotics (such as G418 and hygromycin B).
The invention also provides a composition comprising the fermentation composition of the invention and one or more carriers, agents, adjuvants, additives and/or excipients. Suitable carriers, agents, adjuvants, additives and/or excipients includes formulation additives, stabilising agent, fillers and the like. The composition and the one or more carriers, agents, adjuvants, additives and/or excipients can suitably be formulated into in a dry solid form e.g by using methods known in the art, such as spray drying, spray cooling, lyophilization, flash freezing, granulation, microgranulation, encapsulation or microencapsulation. The composition and the one or more carriers, agents, adjuvants, additives and/or excipients can also be formulated into a liquid stabilized form using methods known in the art, such as adding to the fermentation composition one or more stabilizers such as sugars and/or polyols (e.g. sugar alcohols) and/or organic acids (e.g. lactic acid).
The composition of the invention may be further refined into a dietary supplement, a cosmetic, a food preparation, a feed preparation and/or an analytical or diagnostic reagent optionally using one or more steps of the methods described herein for producing the BBA or derivatives thereof. In one embodiment, BBA or derivative thereof and/or the composition comprising it can be used as a signal modulator of the cannabinoid receptor 1 (CB1), the cannabinoid receptor 2 (CB2) and/or the PPARgamma receptor.
The following scientific papers were cited in the current application:
The present application contains a Sequence Listing prepared in PatentIn ver 3.5 submitted electronically in ST25 format which is hereby incorporated by reference in its entirety. The following sequences are included:
Arabidopsis thaliana
Arabidopsis thaliana
Saccharomyces cerevisiae
Cannabis sativa
Vitis vinifera
Vitis pseudoreticulata
Arachis hypogaea
Pinus densiflora
Pinus strobus
Pinus sylvestris
Cannabis sativa
Saccharomyces cerevisiae
Cannabis sativa
Cannabis sativa
Cannabis sativa
Cannabis sativa
Cannabis sativa
Humulus lupulus
Humulus lupulus
Humulus lupulus
Cannabis sativa
Cannabis sativa
Cannabis sativa
Humulus lupulus
Zea mays
Ammi majus
Saccharomyces cerevisiae
Chemicals used in the examples herein e.g. for buffers and substrates are commercial products of at least reagent grade.
The Saccharomyces cerevisiae S288C, strain NCYC 3608, is available from the National Collection of Yeast Cultures (NCYC), Norwich, U.K. The LEU2 and HIS3 open reading frames were deleted to create two additional auxotrophies for leucine and histidine, respectively, and the KanMX cassette was excised by Cre-Lox recombination (Eichenberger et al., Met. Eng., 2017, 39: 80-89). The host ARO3 gene was then replaced by feedback insensitive mutants of ARO4 and ARO7 to increase the pool of aromatic amino acids (Luttik et al. 2008; Kallscheuer et al. 2018). The resulting strain, named BBA1, was used as the basic strain in the following examples.
All pathway genes/polynucleotides referred to and disclosed herein, encoding the enzymes and proteins used (SEQ ID NOS: 1-23), were manufactured synthetically by a commercial supplier using codons optimized for expression in yeast, S. cerevisiae, except for ScTSC13 (SEQ ID NO: 3), which was amplified by PCR from yeast genomic DNA. During synthesis, all genes were appended with the DNA sequence AAGCTTAAA at the 5′-end, including a Hind III restriction recognition site and a Kozak sequence, and with the DNA sequence CCGCGG at the 3′-end, including a Sac II recognition site. The gene sequences were maintained and amplified in a multicopy plasmid (pUC18) in E. coli. These plasmids already comprised expression cassettes, including promoters and terminators, previously described by Shao et al. (Nucl. Acids Res. 2009, 37(2):e16), separated by a multiple cloning site, comprising Hind III and Sac II, for easy cloning of the DNA encoding the relevant enzymes.
Bibenzyls (BB), tetraketides (TK), bibenzylic acids (BBA), and all other derivatives were analyzed using liquid-chromatography coupled to mass spectrometry (LC/MS). An HSS T3 column (Waters AG, Baden-Dättwil, Switzerland), 130 Å, 1.7 μm, 2.1 mm×100 mm was employed using the conditions indicated in table 2 below. The following solution were used:
Solution A=0.1% aqueous solution of formic acid,
Solution B=0.1% solution of formic acid in acetonitrile.
For mass spectrum (MS) analysis, full scan spectrum data were recorded using a Xevo® G2-XS Mass spectrometer (Waters Cooperation, Milford, US) with the parameters indicated in table 3, below.
For each compound, an extracted ion chromatogram within a mass window of 0.01 Da was calculated. Peak areas and compound quantities were calculated according to the retention time and linear calibration curve of the respective standard compounds (obtained from Sigma-Aldrich, Switzerland and/or Extrasynthese, Genay, France) wherever available.
For demonstrating successful production in an engineered yeast via the various heterologous biosynthetic pathways, genes encoding recombinant enzymes were integrated and expressed in the yeast strain. Further, the yeast strain had been engineered to improve the precursor pool of aromatic amino acids, and in some cases the pool of isoprenoids IPP and DMAPP, according to state of the art as described in the scientific literature.
For culturing the engineered yeast strain, cultures of the strain were grown in 96 well, deep well plates (DWP) at 30° C., using 5 cm shaking diameter, and 300 rpm. Pre-cultures were grown for 24 hours from single colonies in 300 μL SC dropout medium (Formedium, Hunstanton, UK), as required for selection of plasmids. The SC dropout medium contained:
6.7 g/L Yeast Nitrogen Base without Amino Acids,
76 mg/L histidine, 380 mg/L leucine, and
76 mg/L uracil depending on the auxotrophies of the strains.
pH of the medium was adjusted to 5.8 with hydrochloric acid.
Main cultures were inoculated in 300 μL of the same medium to a 1:100 dilution of the pre-culture and cultured for 72 hours at 30° C., in 96 well, 1.1 mL deep well plates (DWP) as described by Eichenberger et al. (FEMS Yeast Res. 2018, 1: 18(4)). After 72 hours all cultures had reached essentially the same final optical density (OD) measured at 600 nm. It was contemplated that all compounds of interest were located both intra- and extracellularly and estimates of product titers was based on extraction of total culture volumes.
For extraction of the polyketides and derivatives, 150 μL culture broth was mixed with 150 μL acidified methanol (1% hydrochloric acid) and incubated for 30 min in a 96 well DWP at 30° C., 5 cm shaking diameter, and 300 rpm and subsequently clarified by centrifugation at 4000 g for 5 min. The clarified lysates were analyzed by LC-MS.
It was tested if a stilbene synthase (STS) type PKS enzyme was able to convert dihydro-cinnamoyl-CoA into a bibenzyl (BB). The genes needed to produce the dihydro-cinnamoyl-CoA precursors, i.e. the Arabidopsis thaliana genes AtPAL2 (SEQ ID NO: 1) and At4CL2 (SEQ ID NO: 2), and the native double bond reductase (DBR) gene ScTSC13 (SEQ ID NO: 3), each under the control of strong glycolytic promoter were first integrated, into the BBA1 host strain. The genes were cloned using homologous recombination technology (HRT) plasmids as described by Eichenberger et al. 2017, thus providing them with recombination tags, promoter and terminator sequences, and the appropriate restriction sites for excision of the expression cassettes (Garcia-Vanegas et al. 2018). The three gene constructs, i.e. the expression cassettes, were integrated into the site XI-3 (Mikkelsen et al. 2012) of strain BBA1 by in vivo homologous recombination as described by Eichenberger et al. 2018, and the production of dihydrocinnamoyl-CoA was confirmed.
A selection of 7 polyketide synthase (PKS) enzymes (See Table 4) were cloned into a yeast expression vector, based on the pRS series of plasmids (Sikorski & Hieter 1989; Christianson et al., Gene 110 (1992) 119-122; Mumberg et al. Gene. 1995 Apr. 14; 156(1):119-22.). Included was the PKS from Cannabis sativa, CsTKS, reported to be a tetraketide synthase (Gagne et al. 2012), whereas the other six were previously reported to be stilbene synthases (STS), from various plants. Stilbene synthases are normally involved in producing bibenzyls with a non-saturated double bond, as known from the compounds pinosylvin and resveratrol, but are known to also accept the reduced dihydro-phenylpropanoic precursors (Eichenberger et al. 2017). The plasmids containing the PKS enzymes, and an empty plasmid as control, were introduced into BBA1, and grown as described above.
Cannabis
sativa
Vitis
vinifera
Vitis
pseudoreticulata
Arachis
hypogaea
Pinus
densiflora
Pinus
strobus
Pinus
sylvestris
After 72 hours of growth the total culture medium was extracted and analysed for new products. Both pinosylvin and dihydropinosylvin were detected in various amounts. The occurrence of both the saturated and non-saturated bibenzyl is a reflection of the incomplete reduction, by ScTSC13, of cinnamoyl-CoA to dihydro-cinnamoyl-CoA, leaving both substrates available for the PKS enzymes. The ratio between the two products is a reflection of enzyme substrate specificity, and the total amount of product is a reflection of their efficiency of converting the substrates to bibenzyls.
In all samples an additional compound was detected which had the mass corresponding to the calculated formula weight (276.28) of a free dihydro-cinnamoyl-tetraketide. It was concluded that this compound was the result of dihydro-cinnamoyl-tetraketide-CoA being released from the PKS enzyme, and then hydrolysed to yield the non-activated (no CoA) free tetraketide. Partial hydrolysis could possibly have happened during extraction.
It was tested if olivetolic acid cyclase (CsOAC; SEQ ID NO: 11) was able of accepting tetraketides released from STS-type PKS enzymes, and derived from dihydro-phenylpropanoids, despite these substrates being very bulky. The CsOAC gene was cloned into a second pRS vector (Mumberg et al. 1995). When the CsOAC was co-expressed with STS-type PKS enzymes (see Table 4) in the BBA1 strain, a compound with a mass corresponding to the calculated formula weight (258.27) of the cyclized bibenzylic acid (BBA) was detected, in this case the dihydro-pinosylvin-2-carboxylate. All tested PKS enzymes, including the CsTKS, allowed production of this BBA molecule and, interestingly, the relative amounts produced by the various PKS enzymes corresponded reasonably well with the amount of dihydro-pinosylvin produced in the absence of CsOAC. The highest amount of BBA was achieved when CsOAC was co-expressed, in strain BBA1, with the STS-type PKS enzyme PstSTS2 from Pinus strobus (SEQ ID NO: 9).
For testing if the newly formed BBA could be prenylated (
A biosynthetic pathway to prenylated BBAs was then assembled in BBA2, using homologous recombination technology (HRT) plasmids as described in Eichenberger et al. 2017. The plasmids comprised the PstSTS2 (SEQ ID NO: 9) and the CsOAC (SEQ ID NO: 11), as well as a truncated version (Sc-tHMGR; SEQ ID NO: 12) of the native ScHMGR1 gene (Rico et al. 2010). Further, the plasmids comprised one of 8 different prenyltransferases (PT)—(see Table 5). The selection of prenyl transferases was based on sequence homology to the CsPT4 which had previously been shown to transfer geranyl to olivetolic acid, a key step towards cannabinoid production (Luo et al. 2019). After in vivo assembly of 9 plasmids, comprising PstSTS2, ScOAC, Sc-tHMGR, and either a prenyltransferase or an empty cassette, single recombinant colonies were selected and grown for 72 hours with appropriate auxotrophic selection, before being analysed for production of prenylated BBAs.
Cannabis
sativa
Cannabis
sativa
Cannabis
sativa
Cannabis
sativa
Cannabis
sativa
Humulus
lupulus
Humulus
lupulus
Humulus
lupulus
The strain comprising the full-length pathway, and including the CsPT4 (SEQ ID NO: 15), produced a new compound with a mass corresponding to the calculated formula weight (394.503) of geranylated BBA, i.e. the bibenzylgerolic acid (BBGA,
In order to test whether the BBGA could be further modified by genes of the cannabinoid pathway, expression of CsTHCAS (SEQ ID NO: 21) and CsCBDAS (SEQ ID NO: 22) was tested in yeast. In the strain BBA2-PT4 (see example 3), comprising the PstSTS2 (SEQ ID NO: 9), CsOAC (SEQ ID NO: 11), the truncated HMGR (Sc-tHMGR; SEQ ID NO: 12), and the CsPT4 (SEQ ID NO: 15), a pRS series plasmid (Mumberg et al. 1995) was introduced comprising one of the genes from Cannabis sativa, the CsTHCAS (SEQ ID NO: 21) or the CsCBDAS (SEQ ID NO: 22). In the strain, co-expressing the CsTHCAS a new compound was detected with a mass corresponding to the calculated formula weight (392.487) of perrottetinoic acid (PETA,
In the strain BBA2-PT4 (see example 3), comprising the PstSTS2 (SEQ ID NO: 9), CsOAC (SEQ ID NO: 11), the truncated HMGR (Sc-tHMGR; SEQ ID NO: 12), and the CsPT4 (SEQ ID NO: 15), a pRS series plasmid (Mumberg et al. 1995) comprising the CsCHIL (SEQ ID NO: 23) from Cannabis sativa or the HICHIL2 (SEQ ID NO: 24) from Humulus lupulus was further introduced. In the strain, co-expressing the HICHIL2 (SEQ ID NO: 24) the production of prenylated bibenzylgerolic acid (BBGA) was higher than in a control strain, with no CHIL protein. It was concluded that some CHIL proteins, including HICHIL2 (SEQ ID NO: 24) are able to assist and improve the production of BBA and BBGA. It is contemplated that other CHIL proteins have similar effects on production of BBA and its derivatives, as well as of related bibenzylic compounds. This includes CHIL proteins derived from plants known to produce prenylated bibenzyls, such as CHIL proteins derived from liverworts, in particular those of the genera Radula and Marchantia. Several Radula species are known to produce perrottetinene (PET,
Number | Date | Country | Kind |
---|---|---|---|
20151327.2 | Jan 2020 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/050318 | 1/9/2021 | WO |