The present invention generally relates to Cannabis plants with altered expression of cannabinoids and/or altered expression of cannabinoid synthesizing enzymes. More specifically, the present disclosure relates to methods for controlling genes associated with cannabinoids synthesis in Cannabis plants.
Cannabis has been cultivated throughout human history as a source of fiber, oil, food, and for its medicinal properties. Selective breeding has produced Cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored.
There is therefore an urgent need to develop innovative approaches to accelerate Cannabis improvement and make its outcomes more predictable. Significant obstacles in plant breeding include limited sources of genetic variation underlying quantitative traits and the time-consuming and labor-intensive phenotypic and molecular evaluation of breeding germplasm required to select plants with desirable properties.
There are over 113 known cannabinoids, but the two most abundant natural derivatives are tetrahydrocannabinol (THC) and cannabidiol (CBD). THC is responsible for the well-known psychoactive effects of Cannabis consumption, but CBD, while nonpsycoative, also has therapeutic properties, for example it is investigated as a treatment for both schizophrenia and Alzheimer's disease. Cannabis has traditionally been classified as having a drug (“marijuana”) or hemp chemotype based on the relative proportion of THC to CBD; types grown for recreational use produce relatively large amounts of both. Cannabis containing high levels of CBD is increasingly grown for medical use.
Heat converts the cannabinoid acids (e.g., tetrahydrocannabinolic acid [THCA]) to neutral molecules (e.g., (−)-trans-Δ9-tetrahydrocannabinol [THC]) that bind to endocannabinoid receptors found in the vertebrate nervous system. The cannabinoid acids THCA and CBDA are both synthesized from cannabigerolic acid (CBGA) by the related enzymes THCA synthase (THCAS) and CBDA synthase (CBDAS), respectively. Expression of THCAS and CBDAS appear to be the major factor determining cannabinoid content, but the mechanisms that underlie the expression of these enzymes remain unresolved.
As the cultivation of Cannabis intensifies, breeding and farming techniques fail to provide the level of control of cannabinoid production and yield needed.
Selective breeding has been used to control the genetics of plants and modify the cannabinoid profile. For example, strains that are used as fiber (commonly called hemp) are usually bred such that they are low in psychoactive chemicals like THC. Strains used in medicine are often bred for high CBD content, and strains used for recreational purposes are usually bred for high THC content or for a specific chemical balance.
Enhancing genetic and phenotypic variation in crops has relied on intercrossing with wild relatives to introduce allelic diversity, creating novel alleles by random mutagenesis, and genetic engineering. However, these approaches are inefficient, particularly for providing variants that cause changes in complex traits that are most desired by breeders.
A powerful approach to create novel allelic variation is through genome editing. In plants, this technology has primarily been used to engineer mutations in coding sequences, with the goal of creating null alleles for functional studies.
Compared to mutations in coding sequences that alter protein structure, cis-regulatory variants are frequently less pleiotropic and often cause subtle phenotypic change by modifying the timing, pattern, or level of gene expression. A major explanation for this is the complexity of transcriptional control, which includes redundancy and modular organization of the many cis-regulatory elements (CREs) in promoters and other regulatory regions, the majority of which remain poorly characterized.
In view of the above, there is an unmet and long felt need for an approach that allows efficient and rapid generation of novel alleles in non-GMO Cannabis plants. In particular, there is a need for manipulating Cannabis plants for selectively producing predetermined ratios and/or concentrations of cannabinoids for medical use.
It is one object of the present invention to disclose a Cannabis plant exhibiting altered tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content, wherein said plant comprises a modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression, said genomic locus comprises at least one targeted nucleotide modification within a regulatory region modulating the expression of at least one allele of said THCAS and/or CBDAS genes.
It is a further object of the present invention to disclose a Cannabis plant or a cell thereof comprising a genetically modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression, wherein said genomic locus comprises at least one targeted nucleotide modification within a regulatory region modulating the expression of at least one allele of said THCAS and/or CBDAS genes.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said targeted nucleotide modification confers altered tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content as compared to a comparable control Cannabis plant lacking said at least one targeted nucleotide modification.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant comprises elevated THCA, CBDA and/or Cannabigerolic acid (CBGA) content relative to a comparable control Cannabis plant.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said comparable control Cannabis plant is of a similar genotype and/or chemotype and/or genetic background and is lacking said at least one targeted nucleotide modification.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said targeted nucleotide modification is introduced through a genome editing at said regulatory region of said THCAS and/or CBDAS genomic locus.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant comprises an expression cassette encoding a RNA-guided endonuclease and at least one guide RNA (gRNA) having a sequence that is complementary to a target sequence within said regulatory region of said at least one THCAS and/or CBDAS allele.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said regulatory region is a promoter region or terminator region, operably linked to the coding region of the at least one THCAS and/or CBDAS allele.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the regulatory region is upstream of the 5′ end of the coding sequence of the THCAS and/or CBDAS allele or is downstream of the 3′ end of the coding sequence of the THCAS and/or CBDAS allele.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the regulatory region comprises a transcription factor binding site, an RNA polymerase binding site, a TATA box, or a combination of structural variations thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said targeted nucleotide modification interrupts or interferes or down regulate or silence transcription and/or translation of the Cannabis allele sequence encoding THCAS and/or CBDAS enzyme.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said targeted nucleotide modification enhance or induce or increase transcription and/or translation of the Cannabis allele sequence encoding THCAS and/or CBDAS enzyme.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the THCAS allele is selected from a THCAS variant or homologue of ‘Finola’ (FN) Cannabis strain (FNTHCAS), or a THCAS variant or homologue of Purple Kush (PK) Cannabis strain (PKTHCAS).
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the CBDAS allele is selected from a cannabidiolic acid synthase-like 1 (CBDAS2), CBDAS variant or homologue of ‘Finola’ (FN) Cannabis strain (FNCBDAS), or a CBDAS variant or homologue of Purple Kush (PK) Cannabis strain (PKCBDAS).
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the FNTHCAS is selected from FNTHCAS-1 and FNTHCAS-2 alleles, and the PKTHCAS is selected from PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4 alleles.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the PKCBDAS is selected from PKCBDAS and PKCBDAS1 alleles.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the regulatory region comprises a sequence of the promoter region of a THCAS allele selected from: promoter of FNTHCAS-1 (pFNTHCAS-1) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:1 or a functional variant thereof, promoter of FNTHCAS-2 (pFNTHCAS-2) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:2 or a functional variant thereof, promoter of PKTHCAS-1 (pPKTHCAS-1) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:3 or a functional variant thereof, promoter of PKTHCAS-2 (pPKTHCAS-2) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:4 or a functional variant thereof, promoter of PKTHCAS-3 (pPKTHCAS-3) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:5 or a functional variant thereof, promoter of PKTHCAS-4 (pPKTHCAS-4) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:6 or a functional variant thereof, a promoter sequence of any other THCAS allele, and any combination thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the regulatory region comprises a sequence of the promoter region of a CBDAS allele selected from: promoter of CBDAS2 (pCBDAS2) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:1150 or a functional variant thereof, promoter of FNCBDAS (pFNCBDAS) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:1151 or a functional variant thereof, promoter of PKCBDAS (pPKCBDAS) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:1152 or a functional variant thereof, promoter of PKCBDAS1 (pPKCBDAS1) having at least 70% sequence identity or similarity to a nucleic acid sequence as set forth in SEQ ID NO:1153 or a functional variant thereof, a promoter sequence of any other CBDAS allele, and any combination thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the targeted nucleotide modification is induced by a gRNA sequence comprising at least 70% sequence identity or similarity to a target sequence within said regulatory region sequence or a codon degenerate nucleotide sequence thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the guide RNA (gRNA) nucleotide sequence is selected from the group consisting of: (a) a gRNA targeting pFNTHCAS-1 and comprising a nucleotide sequence that is at least 80% identical to a nucleotide sequence selected from the group consisting of SEQ ID NO: 7-91 and any combination thereof; (b) a gRNA targeting pFNTHCAS-2 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 92-176 and any combination thereof; (c) a gRNA targeting pPKTHCAS-1 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 177-278 and any combination thereof; (d) a gRNA targeting pPKTHCAS-2 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 279-419 and any combination thereof; (e) a gRNA targeting pPKTHCAS-3 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 420-560 and any combination thereof; (f) a gRNA targeting pPKTHCAS-4 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 561-681 and any combination thereof; (g) a gRNA targeting pCBDAS2 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 682-794 and any combination thereof; (h) a gRNA targeting pFNCBDAS and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 795-895 and any combination thereof; (i) a gRNA targeting pPKCBDAS and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 896-1016 and any combination thereof; and (j) a gRNA targeting pPKCBDAS1 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 1017-1149 and any combination thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the targeted nucleotide modification is selected from the group consisting of insertion, deletion, single nucleotide polymorphism (SNP), and a polynucleotide modification, such that the expression of the THCAS and/or CBDAS polynucleotide is reduced or affected.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said nucleotide modification is a missense mutation, nonsense mutation, insertion, deletion, indel, substitution or duplication.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the insertion or the deletion produces a gene comprising a frameshift.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said nucleotide modification is a silencing mutation, a knockdown mutation, a knockout mutation, a loss of function mutation, or a mutation causing enhanced allele expression.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant is heterozygous or homozygous for the at least one nucleotide modification.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the targeted nucleotide modification targets the genomic locus of at least one THCAS and/or CBDAS polynucleotide allele such that the one or more nucleotide modifications are present within (a) the coding region; (b) non-coding region; (c) regulatory sequence; or (d) untranslated region, of an endogenous polynucleotide encoding THCAS and/or CBDAS enzyme.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the at least one targeted nucleotide modification results in one or more of the following: (a) reduced or elevated expression of the THCAS and/or CBDAS polynucleotide; (b) reduced or elevated enzymatic activity of the protein encoded by the THCAS and/or CBDAS polynucleotide; (c) generation of one or more non-functional alternative spliced transcripts of the THCAS and/or CBDAS polynucleotide; (d) deletion of a substantial portion or of the full-length open reading frame of the THCAS and/or CBDAS polynucleotide; (e) repression or enhancement of an enhancer motif present within the regulatory region controlling the expression of said THCAS and/or CBDAS polynucleotide; (f) repression or enhancement of a repressor motif present within a regulatory region controlling the expression of the THCAS and/or CBDAS polynucleotide; (g) modification of one or more nucleotides or deletion of a regulatory element operably linked to the expression of the THCAS and/or CBDAS allele polynucleotide, wherein the regulatory element is present within a promoter, intron, 3′UTR, terminator or a combination thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the targeted DNA modification is introduced through a genome modification technique selected from the group consisting polynucleotide-guided endonuclease, CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and CRISPR-associated (Cas) (CRISPR-Cas) endonucleases, base editing deaminases, zinc finger nuclease (ZFN), a transcription activator-like effector nuclease (TALEN), engineered site-specific meganucleases, Argonaute or any combination thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said Cas genes or proteins are selected from the group consisting of Cas3, Cas4, Cas5, Cas5e (or CasD), Cas6, Cas6e, Cas6f, Cas7, Cas8a1, Cas8a2, Cas8b, Cas8c, Cas9, Cas10, Cast10d, Cas12, Cas13, Cas14, CasX, CasF, CasG, CasH, Csy1, Csy2, Csy3, Cse1 (or CasA), Cse2 (or CasB), Cse3 (or CasE), Cse4 (or CasC), Csc1, Csc2, Csa5, Csn1, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Cpf1, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csz1, Csx15, Csf1, Csf2, Csf3, Csf4, Cu1966, bacteriophages Cas such as CasΦ (to (Cas-phi), and any combination thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant has THCA (and/or THC) and/or CBDA (and/or CBD) content of up to 30% by weight, particularly between about 0% to about 30% by weight, more particularly between about 0.3% to about 30%, even more particularly between about 0.1% to about 10% by weight.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant has a THCA (and/or THC) and/or CBDA (and/or CBD) content of not more than about 0.3% by weight.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant has a THCA (and/or THC) and/or CBDA (and/or CBD) content of at least 20% by weight.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant is THCA (or THC) and/or CBDA (or CBD) free.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein the plant is a hybrid or an inbred line.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said nucleotide modification is generated in planta via introduction of a construct comprising (a) Cas DNA and gRNA sequence having at least 80% sequence similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:7-SEQ ID NO:1149 and any combination thereof, or (b) a ribonucleoprotein (RNP) complex comprising Cas protein and gRNA sequence having at least 80% sequence similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:7-1149 and any combination thereof.
It is a further object of the present invention to disclose the Cannabis plant or a cell thereof as defined in any of the above, wherein said plant comprises at least one targeted genome modification in a polynucleotide selected from the group consisting of pFNTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1, pFNTHCAS-2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:2, pPKTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:3, pPKTHCAS-2 comprising a nucleic acid sequence having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:4, pPKTHCAS-3 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:5, pPKTHCAS-4 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:6, pCBDAS2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1150, pFNCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1151, pPKCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1152, pPKCBDAS1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1153 and any combination thereof, and said plant exhibits reduced expression of THCA, THC, CBDA and/or CBD relative to a Cannabis plant of a similar genotype or genetic background lacking said targeted genome modification.
It is a further object of the present invention to disclose a recombinant DNA construct comprising a polynucleotide sequence encoding a RNA-guided endonuclease and at least one guide RNA (gRNA) having a sequence that is complementary to a target sequence within a regulatory region operably linked to the expression of at least one tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) allele.
It is a further object of the present invention to disclose the recombinant DNA construct as defined in any of the above, wherein the gRNA sequence comprising a polynucleotide sequence complementary to at least one of pFNTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1, pFNTHCAS-2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:2, pPKTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:3, pPKTHCAS-2 comprising a nucleic acid sequence having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:4, pPKTHCAS-3 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:5, pPKTHCAS-4 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:6, pCBDAS2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1150, pFNCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1151, pPKCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1152, pPKCBDAS1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1153.
It is a further object of the present invention to disclose the recombinant DNA construct as defined in any of the above, wherein the gRNA sequence comprising a polynucleotide sequence having at least 80% sequence similarity to a nucleotide sequence selected from SEQ ID NO:7-SEQ ID NO:1149 and any combination thereof.
It is a further object of the present invention to disclose the recombinant DNA construct as defined in any of the above, wherein said gRNA sequence comprises a 3′ Protospacer Adjacent Motif (PAM) selected from the group consisting of NGG (SpCas), NNNNGATT (NmeCas9), NNAGAAW (StCas9), NAAAAC (TdCas9), NNGRRT (SaCas9) and TBN (Cas-phi).
It is a further object of the present invention to disclose a plant cell comprising the recombinant construct as defined in any of the above.
It is a further object of the present invention to disclose a guide RNA (gRNA) sequence that targets a tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) genomic locus of a plant cell, wherein the gRNA is complementary to at least one target sequence of a regulatory region within said genomic locus, said regulatory region operably linked to the expression of at least one THCAS and/or CBDAS allele.
It is a further object of the present invention to disclose the gRNA sequence as defined in any of the above, wherein said gRNA comprises a polynucleotide sequence selected from the group consisting of pFNTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1, pFNTHCAS-2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:2, pPKTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:3, pPKTHCAS-2 comprising a nucleic acid sequence having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:4, pPKTHCAS-3 having at least 75% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:5, pPKTHCAS-4 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:6, pCBDAS2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1150, pFNCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1151, pPKCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1152, and pPKCBDAS1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1153.
It is a further object of the present invention to disclose the gRNA sequence as defined in any of the above, wherein the nucleotide sequence of said gRNA is selected from the group consisting of a nucleotide sequence that is at least 80% identical to the nucleotide sequence as set forth in SEQ ID NO.: 7-1149 and any combination thereof.
It is a further object of the present invention to disclose a recombinant DNA construct that expresses the guide RNA as defined in any of the above.
It is a further object of the present invention to disclose a plant cell or host cell comprising the guide RNA as defined in any of the above.
It is a further object of the present invention to disclose a plant cell or a host cell comprising the recombinant DNA construct as defined in any of the above.
It is a further object of the present invention to disclose a plant having stably incorporated into its genome the recombinant DNA construct as defined in any of the above.
It is a further object of the present invention to disclose a transgenic Cannabis plant comprising an endonuclease-mediated stably inherited genomic modification of regulatory region of a tetrahydrocannabinol acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene allele, said modification resulting in altered tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content as compared to a comparable control Cannabis plant lacking said at least one targeted nucleotide modification.
It is a further object of the present invention to disclose a seed produced by the plant as defined in any of the above.
It is a further object of the present invention to disclose the Cannabis plant as defined in any of the above, further comprising a heterologous nucleic acid sequence selected from the group consisting of: a reporter gene, a selection marker, a disease resistance gene, a herbicide resistance gene, an insect resistance gene; a gene involved in carbohydrate metabolism, a gene involved in fatty acid metabolism, a gene involved in amino acid metabolism, a gene involved in plant development, a gene involved in plant growth regulation, a gene involved in yield improvement, a gene involved in drought resistance, a gene involved in increasing nutrient utilization efficiency, a gene involved in cold resistance, a gene involved in heat resistance, a gene involved in salt resistance in plants and any combination thereof.
It is a further object of the present invention to disclose the Cannabis plant as defined in any of the above, wherein said plant does not comprise within its genome exogenous genetic material and said plant is a non-naturally occurring Cannabis plant or cell thereof.
It is a further object of the present invention to disclose a plant part, plant cell or plant seed of a plant as defined in any of the above.
It is a further object of the present invention to disclose a tissue culture of regenerable cells, protoplasts or callus obtained from the Cannabis plant or a cell thereof as defined in any of the above.
It is a further object of the present invention to disclose a non-living product or medical composition derived from the Cannabis plant as defined in any of the above.
It is a further object of the present invention to disclose the non-living product or medical composition as defined in any of the above, comprising a combined delta-9-tetrahydrocannabinol and tetrahydrocannabinolic acid concentration of between about 0% to about 30% by weight and/or cannabidiol and cannabidiolic acid concentration of between about 0% to about 30% by weight.
It is a further object of the present invention to disclose the non-living product or medical composition as defined in any of the above, comprising a THCA (or THC) and/or CBDA (or CBD) content of not more than about 0.3% by weight.
It is a further object of the present invention to disclose the non-living product or medical composition as defined in any of the above, comprising a THCA (and/or THC) and/or CBDA (or CBD) content of at least 20% by weight.
It is a further object of the present invention to disclose the non-living product or medical composition as defined in any of the above, comprising Cannabis oil, Cannabis tincture, dried Cannabis flowers, and/or dried Cannabis leaves for medical use.
It is a further object of the present invention to disclose the non-living product or medical composition as defined in any of the above, formulated for inhalation, oral consumption, sublingual consumption, or topical consumption.
It is a further object of the present invention to disclose a method for producing a Cannabis plant or a cell thereof as defined in any of the above, comprising steps of introducing one or more nucleotide modifications through a targeted genome modification at a regulatory region modulating the expression of at least one THCAS and/or CBDAS allele.
It is a further object of the present invention to disclose the method as defined above, wherein the regulatory region comprises a nucleotide sequence selected from the group consisting of pFNTHCAS-1 having a nucleic acid sequence as set forth in SEQ ID NO:1 or a functional variant thereof, pFNTHCAS-2 having a nucleic acid sequence as set forth in SEQ ID NO:2 or a functional variant thereof, pPKTHCAS-1 having a nucleic acid sequence as set forth in SEQ ID NO:3 or a functional variant thereof, pPKTHCAS-2 having a nucleic acid sequence as set forth in SEQ ID NO:4 or a functional variant thereof, pPKTHCAS-3 having a nucleic acid sequence as set forth in SEQ ID NO:5 or a functional variant thereof, pPKTHCAS-4 having a nucleic acid sequence as set forth in SEQ ID NO:6 or a functional variant thereof, pCBDAS2 having a nucleic acid sequence as set forth in SEQ ID NO:1150 or a functional variant thereof, pFNCBDAS having a nucleic acid sequence as set forth in SEQ ID NO:1151 or a functional variant thereof, pPKCBDAS having a nucleic acid sequence as set forth in SEQ ID NO:1152 or a functional variant thereof, pPKCBDAS1 having a nucleic acid sequence as set forth in SEQ ID NO:1153 or a functional variant thereof, and any combination thereof.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein said functional variant has at least 70% sequence identity or similarity to the nucleic acid sequence of said regulatory region sequence or a codon degenerate nucleotide sequence thereof.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein the targeted nucleotide modification is induced by a guide RNA that corresponds to a target sequence comprising a polynucleotide selected from the group consisting of pFNTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1, pFNTHCAS-2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:2, pPKTHCAS-1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:3, pPKTHCAS-2 comprising a nucleic acid sequence having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:4, pPKTHCAS-3 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:5, pPKTHCAS-4 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:6, pCBDAS2 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1150, pFNCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1151, pPKCBDAS having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1152, pPKCBDAS1 having at least 70% sequence similarity to the nucleotide sequence as set forth in SEQ ID NO:1153, and any combination thereof.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein the guide RNA (gRNA) nucleotide sequence is selected from the group consisting of: (a) a gRNA targeting pFNTHCAS-1 and comprising a nucleotide sequence that is at least 80% identical to a nucleotide sequence selected from the group consisting of SEQ ID NO: 7-91 and any combination thereof; (b) a gRNA targeting pFNTHCAS-2 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 92-176 and any combination thereof; (c) a gRNA targeting pPKTHCAS-1 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 177-278 and any combination thereof; (d) a gRNA targeting pPKTHCAS-2 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 279-419 and any combination thereof; (e) a gRNA targeting pPKTHCAS-3 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 420-560 and any combination thereof; (f) a gRNA targeting pPKTHCAS-4 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 561-681 and any combination thereof; (g) a gRNA targeting pCBDAS2 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 682-794 and any combination thereof; (h) a gRNA targeting pFNCBDAS and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 795-895 and any combination thereof; (i) a gRNA targeting pPKCBDAS and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 896-1016 and any combination thereof; and (j) a gRNA targeting pPKCBDAS1 and comprising a nucleotide sequence that is at least 80% identical to the nucleotide sequence selected from the group consisting of SEQ ID NO.: 1017-1149 and any combination thereof.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein the targeted nucleotide modification is selected from the group consisting of insertion, deletion, single nucleotide polymorphism (SNP), a missense mutation, nonsense mutation, indel, substitution or duplication and a polynucleotide modification, such that the expression of the THCAS and/or CDBAS polynucleotide is reduced or affected.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein the Cannabis plant exhibits reduced THCA and/or CDBA content when the targeted DNA modification within the regulatory region results in reduced expression or activity of protein encoded by the at least one THCAS and/or CDBAS allele polynucleotide.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein the Cannabis plant exhibits elevated THCA and/or CDBA content when the targeted DNA modification within the regulatory region results in increased expression or activity of protein encoded by the at least one THCAS and/or CDBA allele polynucleotide, respectively.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein the targeted DNA modification is through a genome modification technique selected from the group consisting of polynucleotide-guided endonuclease, CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and CRISPR-associated (Cas) (CRISPR-Cas) endonucleases, base editing deaminases, zinc finger nuclease (ZFN), a transcription activator-like effector nuclease (TALEN), engineered site-specific meganucleases, Argonaute or any combination thereof.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein said plant has THCA (or THC) and/or CBDA (or CBD) content of up to 30% by weight, particularly between about 0% to about 30% by weight, more particularly between about 0.3% to about 30%, even more particularly between about 0.1% to about 10% by weight.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein said plant has a THCA (or THC) and/or CBDA (or CBD) content of not more than about 0.3% by weight.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein said plant has a THCA (or THC) and/or CBDA (or CBD) content of at least 20% by weight.
It is a further object of the present invention to disclose the method as defined in any of the above, wherein said plant is THCA or THC and/or CBDA or CBD free.
It is a further object of the present invention to disclose a method for altering tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content in a Cannabis plant as defined in any of the above, by modifying a genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression relative to a comparable control Cannabis plant, wherein said method comprises introducing one or more nucleotide modifications through a targeted DNA modification at a regulatory region modulating expression of at least one THCAS and/or CBDAS allele.
It is a further object of the present invention to disclose a plant part, plant cell or plant seed produced by the method as defined in any of the above.
It is a further object of the present invention to disclose a method for producing a non-living product or medical Cannabis composition, the method comprising: (a) obtaining the Cannabis plant as defined in any of the above; and (b) formulating a non-ling product or medical Cannabis composition from said plant.
It is a further object of the present invention to disclose an isolated nucleotide sequence having at least 70% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:6 and SEQ ID NO:1150 to SEQ ID NO:1153.
It is a further object of the present invention to disclose an isolated nucleotide sequence having at least 70% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:7 to SEQ ID NO:1149.
It is a further object of the present invention to disclose a vector, construct or expression system or cassette comprising a nucleic acid sequence having at least 70% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:1153.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 7-91 and any combination thereof for targeted genome modification of pFNTHCAS-1 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 92-176 and any combination thereof for targeted genome modification of pFNTHCAS-2 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 177-278 and any combination thereof for targeted genome modification of pPKTHCAS-1 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 279-419 and any combination thereof for targeted genome modification of pPKTHCAS-2 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 420-560 and any combination thereof for targeted genome modification of pPKTHCAS-3 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 561-681 and any combination thereof for targeted genome modification of pPKTHCAS-4 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 682-794 and any combination thereof for targeted genome modification of pCBDAS2 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 795-895 and any combination thereof for targeted genome modification of pFNCBDAS gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 896-1016 and any combination thereof for targeted genome modification of pPKCBDAS gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 1017-1149 and any combination thereof for targeted genome modification of pPKCBDAS1 gene.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 7-681 and any combination thereof for reducing THCA content in a Cannabis plant.
It is a further object of the present invention to disclose use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 682-1149 and any combination thereof for reducing CBDA content in a Cannabis plant.
It is a further object of the present invention to disclose the Cannabis plant as defined in any of the above wherein said plant genotype is obtainable by deposit under accession number with NCIMB Aberdeen AB21 9YA, Scotland, UK.
It is a further object of the present invention to disclose a method for producing a Cannabis plant exhibiting altered tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content and comprising a modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression as compared to a control Cannabis plant, said method comprises steps of introducing one or more nucleotide modifications through a targeted genome modification at a regulatory region modulating at least one THCAS and/or CBDAS allele expression.
In order to understand the invention and to see how it may be implemented in practice, a plurality of embodiments is adapted to now be described, by way of non-limiting example only, with reference to the accompanying drawings; wherein:
It will be appreciated that, for simplicity and clarity of illustration, elements shown in the figures have not necessarily been drawn accurately or to scale. For example, the dimensions of some of the elements may be exaggerated relative to other elements for clarity, or several physical components may be included in one functional block or element. Further, where considered appropriate, reference numerals may be repeated among the figures to indicate corresponding or analogous elements.
In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these specific details. In other instances, well-known methods, procedures, and components, modules, units and/or circuits have not been described in detail so as not to obscure the invention.
Some features or elements described with respect to one embodiment may be combined with features or elements described with respect to other embodiments. For the sake of clarity, discussion of same or similar features or elements may not be repeated.
Plant breeding is currently limited by improvements in quantitative traits that often rely on laborious selection of rare naturally occurring mutations in gene-regulatory regions.
By the current invention, CRISPR/Cas9 genome editing of promoters of genes encoding cannabinoid synthesis proteins (e.g. THCAS and/or CBDAS) generate diverse cis-regulatory alleles that provide beneficial quantitative variation for breeding. This approach allows immediate selection and fixation of novel alleles in transgene-free plants and manipulation of cannabinoid (e.g. THCA, CBDA and CBGA) content in the Cannabis plant.
The present invention thus provides a platform to enhance variation and control of THC, CBD and/or other cannabinoids expression in the Cannabis plant.
The present invention discloses manipulation of the biosynthesis pathways of a Cannabis plant of genus Cannabis. Accordingly, Cannabis plants of the present invention having a modified therapeutic component(s) profile may be useful in the production of medical Cannabis and/or may also be useful in the production of specific components or therapeutic formulations derived therefrom.
According to one embodiment, a Cannabis plant exhibiting altered tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content is provided. The aforementioned plant comprises a modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression, said genomic locus comprises at least one targeted nucleotide modification within a regulatory region modulating the expression of at least one allele of said THCAS and/or CBDAS genes.
According to further embodiment, the present invention provides a recombinant DNA construct comprising a polynucleotide sequence encoding a RNA-guided endonuclease and at least one guide RNA (gRNA) having a sequence that is complementary to a target sequence within a regulatory region operably linked to the expression of at least one tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) allele.
According to further embodiment, a guide RNA (gRNA) sequence that targets a tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) genomic locus of a plant cell is herein disclosed. The gRNA is complementary to at least one target sequence of a regulatory region within said genomic locus, said regulatory region operably linked to the expression of at least one THCAS and/or CBDAS allele.
It is further within the scope of the current invention to provide a recombinant DNA construct that expresses the guide RNA as defined in any of the above.
It is further within the scope to provide a plant cell comprising the guide RNA as defined in any of the above.
According to further embodiments, a plant cell comprising the recombinant DNA construct as defined above is provided.
According to other embodiments, a plant having stably incorporated into its genome the recombinant DNA construct as defined in any of the above is disclosed.
It is further within the scope to provide a seed produced by the plant of the present invention.
It is further within the scope to provide a plant part, plant cell or plant seed of a plant as defined above. The plant part may include a tissue culture of regenerable cells, protoplasts or callus obtained from the Cannabis plant provided by the present invention.
According to an embodiment of the present invention, the genotype of the Cannabis plant comprising at least one targeted nucleotide modification within a regulatory region modulating the expression of at least one allele of said THCAS and/or CBDAS genes is obtainable by seed deposit under accession number with NCIMB Aberdeen AB21 9YA, Scotland, UK.
According to some further embodiments of the present invention, a product such as a medical composition, derived from the Cannabis plant as defined in any of the above is provided.
The present invention further provides a method for producing a Cannabis plant exhibiting altered tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content and comprising a modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression as compared to a control Cannabis plant, said method comprises steps of introducing one or more nucleotide modifications through a targeted genome modification at a regulatory region modulating at least one THCAS and/or CBDAS allele expression.
According to a further aspect of the present invention, a method for altering tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content in a Cannabis plant by modifying a genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression as compared to a control Cannabis plant is provided. The method comprises introducing one or more nucleotide modifications through a targeted DNA modification at a regulatory region modulating expression of at least one THCAS and/or CBDAS allele.
The present invention further provides a plant part, plant cell or plant seed produced by the method as defined in any of the above.
According to some other embodiments, the present invention provides a method for producing a medical Cannabis composition. The method comprises (a) obtaining the Cannabis plant of the present invention comprising at least one targeted nucleotide modification within a regulatory region modulating the expression of at least one allele of said THCAS and/or CBDAS genes; and (b) formulating a medical Cannabis composition from said plant.
According to further embodiments, the present invention provides an isolated nucleotide sequence having at least 70% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:1153.
According to further embodiments of the present invention, a vector, construct or expression system or cassette comprising a nucleic acid sequence having at least 70% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:1153 is herein provided.
Other embodiments of the present invention include the use of a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 7-681 and any combination thereof and/or at least one of SEQ ID NO: 682-1149 and any combination thereof, for reducing THCA content and/or CBDA content, respectively, in a Cannabis plant.
According to further main aspects of the present invention, THC free Cannabis plants for seeds, fiber and/or medical use are provided. This may be achieved by using genome editing techniques to target regulatory regions controlling expression of THCAS alleles, and selecting for mutations significantly reducing THCAS content within the plant. In this way, any high level THC variety (e.g. Purple Kush) can be converted into a low level THC (below 0.3% THC by weight) or even THC free Cannabis variety.
According to other main aspects of the invention, Cannabis plants with reduced CBDA or CBD content are produced by genome modification targeted to regulatory regions, i.e. promoter regions of genes or alleles encoding CBDAS enzyme.
According to other main aspects of the invention, Cannabis plants comprising silencing mutation conferring significantly reduced CBDA (or CBD) and THCA (or THC) content are produced by genome modification targeted to regulatory regions, i.e. promoter regions of genes or alleles encoding CBDAS and THCAS enzymes. Such plants may be essentially absent of both enzymes and may have elevated or enhanced content of CBGA, which is a substrate for CBDAS and THCAS enzymes and a precursor for CBDA and THCA cannabinoids.
It is acknowledged that regulatory regions of genes may have relatively less conserved among different alleles or homologues or varieties of the same gene. Thus specific sequences should be designed for targeting regulatory regions of different alleles of a gene of interest, i.e. THCAS and CBDAS gene alleles of different Cannabis varieties.
It is herein acknowledged that though widely favored in plant and animal evolution and domestication, cis-regulatory variants are far from being saturated and thus represent an unexplored resource for expanding allelic diversity and for controlling and/or altering gene expression.
The present invention uses genome editing techniques at target sites located outside the coding region, i.e. cis-regulatory elements, of Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or Cannabis cannabidiolic acid synthase (CsCBDAS) gene variants, to delete the full-length gene and/or silence its expression, resulting in significantly reduced THC and/or CBD content in the modified plant.
It is emphasized that alterations in gene-regulatory elements may result in linear or non-linear effect between transcriptional and phenotypic change, and it is unexpected how such responses vary for different genes. Thus, by exploring targeted cis-regulatory variation, new opportunities for altering gene expression to obtain desirable phenotypes and traits could be achieved by the current invention.
It is also disclosed that up until now, in plants, and especially in Cannabis plants, genome editing technology was mainly used to engineer mutations in coding sequences, with the goal of creating null alleles for functional studies.
The current invention hypothesized that elements of CRISPR/Cas9 technology could be integrated to engineer cis-regulatory mutations targeted to modify, and more specifically silence THCAS and/or CBDAS expression. In this way Cannabis plants or strains with reduced content, or even substantially free of, THCA and/or THC and/or CBDA and/or CBD are generated, useful for medical purposes.
According to main embodiments, the present invention provides a Cannabis plant exhibiting reduced tetrahydrocannabinolic acid (THCA) and/or cannabidiolic acid (CBDA) content, wherein said plant comprises a modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) and/or cannabidiolic acid synthase (CBDAS) gene expression, said genomic locus comprises at least one targeted nucleotide modification as compared to a control Cannabis plant, within a regulatory region controlling said THCAS and/or CBDAS expression. The present invention further provides methods for production the aforementioned Cannabis plant and polynucleotide sequences for generating the targeted mutations.
Reference is now made to
As described in
Cannabigerolic acid (CBGA), the precursor to all natural cannabinoids, is cyclized into tetrahydrocannabinolic acid (THCA) and cannabidiolic acid (CBDA) by THCA and CBDA synthase (THCAS and CBDAS in
In
Here, we describe regulating THCAS and/or CBDAS cannabinoid biosynthesis protein expression by targeted genome editing at the promoter regions of THCAS and/or CBDAS gene alleles, for example, of the drug-type strain Purple Kush′ and the hemp variety “Finola”
It is within the scope that a system that exploits heritability of CRISPR/Cas9 transgenes carrying gRNAs in F1 populations, to rapidly and efficiently generate novel cis-regulatory alleles for THCAS and/or CBDAS gene variants in Cannabis is provided.
In this way, stabilized promoter alleles that provides a reduced THCAS and/or CBDAS expression is generated. It should be noted that transcriptional change may be a poor predictor of phenotypic effect, showing the complexity in how regulatory variation impacts quantitative traits.
By targeting gene promoter regions with guide RNAs (gRNAs) corresponding to the regulatory regions of various THCAS homologue sequences from the ‘drug-type’ strain Purple Kush (PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4) and the hemp variety “Finola” (FNTHCAS-1 and FNTHCAS-2), a range of lower THCA-content phenotypic changes could be obtained in Cannabis.
In addition, by targeting gene promoter regions with guide RNAs (gRNAs) corresponding to the regulatory regions of various CBDAS homologue sequences such as of cannabidiolic acid synthase-like 1 (CBDAS2) homologue, CBDAS variant or homologue of ‘Finola’ (FN) Cannabis strain (FNCBDAS) or a CBDAS variant or homologue of Purple Kush (PK) Cannabis strain (PKCBDAS) or PKCBDAS1, a range of lower CBDA-content phenotypic changes could be obtained in Cannabis.
Through Cas9-gRNA directed cleavage and imprecise repair at each target site, an array of mutation types could be induced in the promoter sequences of THCAS gene variants (pFNTHCAS-1, pFNTHCAS-2, pPKTHCAS-1, pPKTHCAS-2, pPKTHCAS-3 and pPKTHCAS-4) and/or CBDAS gene variant (pCBDAS2, pFNCBDAS, pPKCBDAS and pPKCBDAS1), including deletions of various sizes and small indels at target sites.
The resulting alleles, having mutations that might impact cis-regulatory elements (CREs) or cis-regulatory modules, could then be evaluated for phenotypic changes by generating stable homozygous mutants in subsequent generations.
According to one embodiment, a CRISPR/Cas9 construct with gRNA designed to target the promoter region upstream of at least one of FNTHCAS-1, FNTHCAS-2, PKTHCAS-1, PKTHCAS-2, PKTHCAS-3, PKTHCAS-4, CBDAS2, FNCBDAS, PKCBDAS and PKCBDAS1 coding sequence was generated, without considering any predicted cis-regulatory element or module in the promoter sequence. First-generation transgenic plants (TO) were generated and PCR genotyping was performed to reveal mutations/deletions of various sizes in the target region, causing reduced or knockdown of THCAS and/or CBDAS expression and/or function.
It is herein acknowledged that as the pharma industry is interested in extracting the cannabinoids from the Cannabis plant, individual Cannabis plants or strains or varieties containing modulated levels of such cannabinoids can be developed, tailored to the specific needs of the pharma industry thereby increasing the cost effectiveness and attractiveness of this crop.
The solution proposed by the current invention is using genome editing such as the CRISPR/Cas system in order to create cultivated Cannabis plants with modulated levels or ratios of cannabinoids. More specifically alternation of specific cannabinoids, i.e. THCA or THC, CBDA or CBD is achieved by using genome editing targeted to the regulatory regions of these genes to reduce their expression at the transcription (RNA) and/or translation (protein) levels of the cannabinoid biosynthesis pathway.
Breeding using genome editing allows a precise and significantly shorter breeding process in order to achieve these goals with a much higher success rate. Thus genome editing, has the potential to generate improved varieties faster and at a lower cost.
The current invention discloses the generation of non GMO Cannabis plants with manipulated and controlled cannabinoid content, using the genome editing technology, e.g., the CRISPR/Cas9 highly precise tool targeted to regulatory regions of genes of interest. The generated mutations can be introduced into elite or locally adapted Cannabis lines rapidly, with relatively minimal effort and investment.
Genome editing is an efficient and useful tool for increasing crop productivity traits, and there is particular interest in advancing manipulation of genes controlling cannabinoids biosynthesis in Cannabis species, to produce strains which are adapted to specific therapeutic or regulatory needs.
A major obstacle for CRISPR-Cas9 plant genome editing is lack of efficient tissue culture and transformation methodologies. The present invention achieves these aims and surprisingly provides transformed and regenerated Cannabis plants with modified desirable cannabinoids content.
To that end, guide RNAs (gRNAs) were designed for each of the target gene promoters herein identified in Cannabis to induce mutations in at least one Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS)) and/or cannabidiolic acid synthase (CBDAS) gene variant or homologue.
The present invention shows that Cannabis plants which contain genome editing events with at least one promoter of THCAS allele, express not more than 0.5% THC (or THCA) by weight. In specific embodiments, such plants express less than 0.5%, preferably less than 0.4%, less than 0.3%, less than 0.2%, less than 0.1% or 0% THC and/or THCAS by weight (e.g. by dry weight).
In further embodiments, the present invention shows that Cannabis plants which contain genome editing events with at least one promoter of CBDAS allele, express not more than 0.5% CBD (or CBDA) by weight. In specific embodiments, such plants express less than 0.5%, preferably less than 0.4%, less than 0.3%, less than 0.2%, less than 0.1% or 0% CBD and/or CBDAS by weight (e.g. by dry weight).
The present invention further shows that Cannabis plants containing genome editing events within the THCAS genomic locus express higher levels of CBD (or CBDA) compared to non-edited plants. In a further embodiment, the CsTHCAS edited plants contain very low levels of THCA (or THC), preferably not more than 0.5% by weight.
The present invention further shows that Cannabis plants containing genome editing events within the CBDAS genomic locus express higher levels of THC (or THCA) compared to non-edited plants. In a further embodiment, the CsCBDAS edited plants contain very low levels of CBDA (or CBD), preferably not more than 0.5% by weight.
The present invention further shows that Cannabis plants containing knockdown genome editing events within THCAS and CBDAS genomic locus express higher levels of CBG (or CBGA) compared to non-edited plants. In a further embodiment, the edited plants contain very low levels of CBDA (or CBD) and THCA (or THC), preferably not more than 0.5% by weight.
It is a further aspect of the present invention to provide the Cannabis plant as defined in any of the above, wherein said plant has a THC and/or CBD content of up to 30% by weight, particularly between about 0% to about 30% by weight, more particularly between about 0.3% to about 30%, even more particularly between about 0.3% to about 10% by weight.
According to a further embodiment, the Cannabis plant of the present invention has a THCA (and/or THC) and/or CBDA (and/or CBD) content of not more than about 0.3% by weight.
According to a further embodiment, the Cannabis plant of the present invention has a THCA (and/or THC) and/or CBDA (and/or CBD) content of at least 20% by weight.
According to a further embodiment, the Cannabis plant of the present invention is THCA (or THC) and/or CBDA (or CBD) free.
As used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to “a plant” includes a plurality of such plants, reference to “a cell” includes one or more cells and equivalents thereof known to those skilled in the art, and so forth.
As used herein the term “about” denotes ±25% of the defined amount or measure or value.
As used herein the term “similar” denotes a correspondence or resemblance range of about ±20%, particularly ±15%, more particularly about ±10% and even more particularly about ±5%.
As used herein the term “corresponding” generally means similar, analogous, like, alike, akin, parallel, identical, resembling or comparable. In further aspects it means having or participating in the same relationship (such as type or species, kind, degree, position, correspondence, or function). It further means related or accompanying. In some embodiments of the present invention it refers to plants of the same Cannabis species or strain or variety or to sibling plant, or one or more individuals having one or both parents in common.
As used herein, the phrase “consisting essentially of” or “essentially” generally refers to a composition, method that includes materials, steps, features, components, or elements, in addition to those literally disclosed, provided that these additional materials, steps, features, components, or elements do not materially affect the basic and novel characteristic(s) of the claimed subject matter.
A “plant” as used herein refers to any plant at any stage of development, particularly a seed plant. The term “plant” includes the whole plant or any parts or derivatives thereof, such as plant cells, seeds, plant protoplasts, plant cell tissue culture from which tomato plants can be regenerated, plant callus or calli, meristematic cells, microspores, embryos, immature embryos, pollen, ovules, anthers, fruit, flowers, leaves, cotyledons, pistil, seeds, seed coat, roots, root tips and the like.
The term “plant cell” used herein refers to a structural and physiological unit of a plant, comprising a protoplast and a cell wall. The plant cell may be in a form of an isolated single cell or a cultured cell, or as a part of higher organized unit such as, for example, plant tissue, a plant organ, or a whole plant.
The term “plant cell culture” as used herein means cultures of plant units such as, for example, protoplasts, regenerable cells, cell culture, cells, cells in plant tissues, pollen, pollen tubes, ovules, embryo sacs, zygotes and embryos at various stages of development, leaves, roots, root tips, anthers, meristematic cells, microspores, flowers, cotyledons, pistil, fruit, seeds, seed coat or any combination thereof.
The term “plant material” or “plant part” used herein refers to leaves, stems, roots, root tips, flowers or flower parts, fruits, pollen, egg cells, zygotes, seeds, seed coat, cuttings, cell or tissue cultures, or any other part or product of a plant or a combination thereof.
A “plant organ” as used herein means a distinct and visibly structured and differentiated part of a plant such as a root, stem, leaf, flower, flower bud, or embryo.
The term “Plant tissue” as used herein means a group of plant cells organized into a structural and functional unit. Any tissue of a plant in planta or in culture is included. This term includes, but is not limited to, whole plants, plant organs, plant seeds, tissue culture, protoplasts, meristematic cells, calli and any group of plant cells organized into structural and/or functional units. The use of this term in conjunction with, or in the absence of, any specific type of plant tissue as listed above or otherwise embraced by this definition is not intended to be exclusive of any other type of plant tissue.
As used herein, the term “progeny” or “progenies” refers in a non limiting manner to offspring or descendant plants. According to certain embodiments, the term “progeny” or “progenies” refers to plants developed or grown or produced from the disclosed or deposited seeds as detailed inter alia. The grown plants preferably have the desired traits of the disclosed or deposited seeds.
The term “Cannabis” refers hereinafter to a genus of flowering plants in the family Cannabaceae. Cannabis is an annual, dioecious, flowering herb that includes, but is not limited to three different species, Cannabis sativa, Cannabis indica and Cannabis ruderalis. The term also refers to hemp. Cannabis plants produce a group of chemicals called cannabinoids. Cannabinoids, terpenoids, and other compounds are secreted by glandular trichomes that occur most abundantly on the floral calyxes and bracts of female Cannabis plants.
It is herein acknowledge that Cannabis has traditionally been classified as having a drug-type (“marijuana”) or hemp-type chemotype based on the relative proportion of THC to CBD.
The term “chemotype” also sometime used as “chemovar” refers herein to a chemically distinct entity in a plant, with differences in the composition of the secondary metabolites. It is herein acknowledged that minor genetic and epigenetic changes with little or no effect on morphology or anatomy may produce large changes in the chemical phenotype. Chemotypes are often defined by the most abundant chemical produced by that individual.
According to further aspects, a chemotype describes the subspecies of a plant that have the same morphological characteristics (relating to form and structure) but produce different quantities of chemical components in their essential oils.
In certain embodiments of the invention, chemical phenotypes (chemotypes) can be useful to classify C. sativa as drug- or fiber-type varieties. It is suggested that drug-type or chemotype I C. sativa cultivars contain high levels of 49-THC, while CBD-rich cultivars containing low levels of THC are regarded as fiber-type or chemotype III cultivars. Hemp- and drug-type cultivars are members of both C. sativa sativa and C. sativa indica subspecies. Chemotaxonomic analysis is useful to differentiate hemp from drug-type C. sativa based on acceptable levels of 49-THC established by regulating bodies.
The term “Finola” or “FN” as used herein refers to a hemp-type Cannabis Sativa L. strain or cultivar. It is known for its high CBD content and low THC levels (e.g. low THC<0.2%).
The term “Purple Kush” or “PK” herein refers to a marijuana or drug or potent-type C. sativa strain. It is known for its relatively high THC content.
It is within the scope of the present invention that THCAS and CBDAS (which determine the drug vs hemp chemotype) are highly non-homologous between drug- and hemp-type alleles. The current invention offers for the first time THCAS and/or CBDAS alleles derived from FN and PK strains that are modified at their promoter region by targeted genome modification to down regulate or silence their expression. The THCAS genes modified at their promoter-site include FNTHCAS-1, FNTHCAS-2, PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4. The sequences of the THCAS promoter-sites targeted by genome modification include sequence corresponding to pFNTHCAS-1 having a polynucleotide sequence as set forth in SEQ ID NO: 1, pFNTHCAS-2 having a polynucleotide sequence as set forth in SEQ ID NO: 2, pPKTHCAS-1 having a polynucleotide sequence as set forth in SEQ ID NO: 3, pPKTHCAS-2 having a polynucleotide sequence as set forth in SEQ ID NO: 4, pPKTHCAS-3 having a polynucleotide sequence as set forth in SEQ ID NO: 5 and pPKTHCAS-4 having a polynucleotide sequence as set forth in SEQ ID NO: 6.
The CBDAS genes modified at their promoter-site include cannabidiolic acid synthase-like 1 (CBDAS2), FNCBDAS, PKCBDAS and PKCBDAS1. The sequences of the CBDAS promoter-sites targeted by genome modification include sequence corresponding to pCBDAS2 having a polynucleotide sequence as set forth in SEQ ID NO: 1150, pFNCBDAS having a polynucleotide sequence as set forth in SEQ ID NO:
1151, pPKCBDAS having a polynucleotide sequence as set forth in SEQ ID NO: 1152 and pPKCBDAS1 having a polynucleotide sequence as set forth in SEQ ID NO: 1153.
The term “nonpsychoactive” refers hereinafter to products or compositions or elements or components of Cannabis not significantly affecting the mind or mental processes.
The term “cannabinoid” refers hereinafter to a class of diverse chemical compounds that act on cannabinoid receptors on cells that repress neurotransmitter release in the brain. These receptor proteins include the endocannabinoids (produced naturally in the body by humans and animals), the phytocannabinoids (found in Cannabis and some other plants), and synthetic cannabinoids.
The main cannabinoids are concentrated in a viscous resin produced in structures known as glandular trichomes. Up until now, at least 113 different cannabinoids have been isolated from the Cannabis plant. The main classes of cannabinoids from Cannabis are THC (tetrahydrocannabinol), THCA (tetrahydrocannabinolic acid), CBD (cannabidiol), CBDA (cannabidiolic acid), CBN (cannabinol), CBG (cannabigerol), CBC (cannabichromene), CBL (cannabicyclol), CBV (cannabivarin), THCV (tetrahydrocannabivarin), CBDV (cannabidivarin), CBCV (cannabichromevarin), CBGV (cannabigerovarin), CBGM (cannabigerol monomethyl ether), CBE (cannabielsoin), CBT (cannabicitran) and any combination thereof.
The best studied cannabinoids include tetrahydrocannabinol (THC), cannabidiol (CBD) and cannabinol (CBN).
Reference is now made to Tetrahydrocannabinol (THC), the primary psychoactive component of the Cannabis plant. Delta-9-tetrahydrocannabinol (Δ9-THC, THC) and delta-8-tetrahydrocannabinol (Δ8-THC), through intracellular CB1 activation, induce anandamide and 2-arachidonoylglycerol synthesis produced naturally in the body and brain. These cannabinoids produce the effects associated with Cannabis by binding to the CB1 cannabinoid receptors in the brain. Tetrahydrocannabinolic acid (THCA, 2-COOH-THC; conjugate base tetrahydrocannabinolate) is a precursor of tetrahydrocannabinol (THC), the active component of cannabis. THCA is found in variable quantities in fresh, undried cannabis, but is progressively decarboxylated to THC with drying, and especially under intense heating such as when cannabis is smoked or cooked into cannabis edibles. In the context of the present invention, the term THC also refers to THCA and vice versa.
Reference is now made to Cannabidiol (CBD) which is considered as non-psychotropic. Cannabidiol has little affinity for CB1 and CB2 receptors but acts as an indirect antagonist of cannabinoid agonists. It is further acknowledged herein that it is an antagonist at the putative cannabinoid receptor, GPR55, a GPCR expressed in the caudate nucleus and putamen. Cannabidiol has also been shown to act as a 5-HT1A receptor agonist. Cannabis produces CBD-carboxylic acid through the same metabolic pathway as THC, until the next to last step, where CBDA synthase performs catalysis instead of THCA synthase. CBDA is converted into CBD by decarboxylation. In the context of the present invention, the term CBD also refers to CBDA and vice versa.
CBD shares a precursor with THC and is the main cannabinoid in CBD-dominant Cannabis strains.
In the context of the current invention, enzymes within the biosynthetic pathway of THC and/or CBD, especially THCAS and/or CBDAS (depicted in
Provided herein are methods for modifying the content of THC and/or THCA and/or CBDA and/or CBD compound of a Cannabis plant, comprising, consisting essentially of, or consisting of introducing one or more nucleotide modifications, through targeted DNA modification at a genomic locus of the plant, preferably at the regulatory region operably linked to at least one THCAS and/or CBDAS gene variant coding region.
As used herein, the term “modifying” or “modulation,” or the like, refers to any detectable change in the genotype and/or phenotype of a plant, as compared to a control plant (e.g., a wild-type plant that does not comprise the DNA modification).
The term “altered” as used herein generally means to become different, changed or modified in some particular or trait, or in other words, to cause the characteristics of something to change. In the context of the present invention it means to reduce (decrease) or increase (elevate) THC or THCA and/or CBD or CBDA content in a Cannabis plant by targeted genome modification, as compared to a control Cannabis plant lacking the genomic modification and having a similar genotype or genetic background or chemotype.
As used herein the term “genetic modification” or “genome modification” or “genomic modification” or “nucleotide modification” refers hereinafter to genetic manipulation or modulation, which is the direct manipulation of an organism's genes using biotechnology. It also refers to a set of technologies used to change the genetic makeup of cells, including the transfer of genes within and across species, targeted mutagenesis and genome editing technologies to produce improved organisms. According to main embodiments of the present invention, modified Cannabis plants with altered cannabinoid content traits are generated using genome editing mechanism. This technique enables to achieve in planta modification of specific genes that control the biosynthesis of main cannabinoids, namely, THC and/or CBD in the Cannabis plant.
The term “genome editing”, or “genome/genetic modification” or “genome engineering” generally refers hereinafter to a type of genetic engineering in which DNA is inserted, deleted, modified or replaced in the genome of a living organism. Unlike previous genetic engineering techniques that randomly insert genetic material into a host genome, genome editing targets the insertions to site specific locations.
It is within the scope of the present invention that the common methods for such editing use engineered nucleases, or “molecular scissors”. These nucleases create site-specific double-strand breaks (DSBs) at desired locations in the genome. The induced double-strand breaks are repaired through nonhomologous end-joining (NHEJ) or homologous recombination (HR), resulting in targeted mutations (‘edits’). Families of engineered nucleases used by the current invention include, but are not limited to: meganucleases, zinc finger nucleases (ZFNs), transcription activator-like effector-based nucleases (TALEN), and the clustered regularly interspaced short palindromic repeats (CRISPR/Cas9) system.
Reference is now made to exemplary genome editing terms used by the current disclosure:
According to aspects of the present invention, the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and CRISPR-associated (Cas) genes are used for the first time for generating genome modification targeted at regulatory sites, e.g. promoter, non-coding sites of genes in the Cannabis plant.
According to further aspects of the invention, Cas protein, such as Cas9 (also known as Csn1) is required for gene silencing. Cas9 participates in the processing of CRISPR RNA (crRNAs), and is responsible for the destruction of the target DNA. To achieve site-specific DNA recognition and cleavage, Cas9 is complexed with both a crRNA and a separate trans-activating crRNA (tracrRNA or trRNA), that is partially complementary to the crRNA.
Without wishing to be bound by theory, it is herein acknowledged that during the destruction of target DNA, nuclease domains cut both DNA strands, generating double-stranded breaks (DSBs) at sites defined by a 20-nucleotide target sequence within an associated crRNA transcript. It is further noted that the double-stranded endonuclease activity of Cas9 also requires that a short conserved sequence, (2-5 nts) known as protospacer-associated motif (PAM), follows immediately 3′-of the crRNA complementary sequence.
20 According to further aspects of the invention, a two-component system may be used by the current invention, combining trRNA and crRNA into a single synthetic single guide RNA (sgRNA) for guiding targeted gene or genomic locus alterations.
It is further within the scope that Cas9 nuclease variants include wild-type Cas9, Cas9D10A and nuclease-deficient Cas9 (dCas9).
According to further aspects of the present invention, non-limiting examples of Cas genes or proteins are selected from the group consisting of Cas3, Cas4, Cas5, Cas5e (or CasD), Cas6, Cas6e, Cas6f, Cas7, Cas8a1, Cas8a2, Cas8b, Cas8c, Cas9, Cas10, Cast10d, Cas12, Cas13, Cas14, CasX, CasF, CasG, CasH, Csy1, Csy2, Csy3, Cse1 (or CasA), Cse2 (or CasB), Cse3 (or CasE), Cse4 (or CasC), Csc1, Csc2, Csa5, Csn1, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Cpf1, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csz1, Csx15, Csf1, Csf2, Csf3, Csf4, Cu1966, bacteriophages Cas such as CasΦ (to (Cas-phi), and any combination thereof.
Reference is now made to
TAL effector nucleases (TALEN) are a class of sequence-specific nucleases that can be used to make double-strand breaks at specific target sequences in the genome of a plant or other organism.
Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain. Endonucleases include restriction endonucleases, which cleave DNA at specific sites without damaging the bases, and meganucleases, also known as homing endonucleases, which like restriction endonucleases, bind and cut at a specific recognition site, however the recognition sites for meganucleases are typically longer, about 18 bp or more.
The term “meganucleases” as used herein refers hereinafter to endodeoxyribonucleases characterized by a large recognition site (double-stranded DNA sequences of 12 to 40 base pairs); as a result this site generally occurs only once in any given genome. Meganucleases are therefore considered to be the most specific naturally occurring restriction enzymes. The naming convention for meganuclease is similar to the convention for other restriction endonuclease.
One step in the recombination process involves polynucleotide cleavage at or near the recognition site. The cleaving activity can be used to produce a double-strand break.
Zinc finger nucleases (ZFNs) are engineered double-strand break inducing agents comprised of a zinc finger DNA binding domain and a double-strand-break-inducing agent domain. Recognition site specificity is conferred by the zinc finger domain, which typically comprising two, three, or four zinc fingers, for example having a C2H2 structure, however other zinc finger structures are known and have been engineered. ZFNs include an engineered DNA-binding zinc finger domain linked to a non-specific endonuclease domain. Additional functionalities can be fused to the zinc-finger binding domain, including transcriptional activator domains, transcription repressor domains, and methylases. In some examples, dimerization of nuclease domain is required for cleavage activity. Each zinc finger recognizes three consecutive base pairs in the target DNA.
The term “Cas gene” herein refers to a gene that is generally coupled, associated or close to, or in the vicinity of flanking CRISPR loci in bacterial systems. The terms “Cas gene”, “CRISPR-associated (Cas) gene” are used interchangeably herein. The term “Cas endonuclease” herein refers to a protein encoded by a Cas gene. A Cas endonuclease herein, when in complex with a suitable polynucleotide component, is capable of recognizing, binding to, and optionally nicking or cleaving all or part of a specific DNA target sequence. A Cas endonuclease described herein comprises one or more nuclease domains. Cas endonucleases of the disclosure includes those having a HNH or HNH-like nuclease domain and/or a RuvC or RuvC-like nuclease domain. Examples of a Cas endonuclease of the disclosure includes a Cas9 protein, a Cpf1 protein, a C2c1 protein, a C2c2 protein, a C2c3 protein, Cas3, Cas 5, Cas7, Cas8, Casio, or complexes of these.
The commonly-used Cas9 from Streptococcus pyogenes (SpCas9) recognizes the PAM sequence 5′-NGG-3′ (where “N” can be any nucleotide base).
Other Cas variants and their PAM sequences (5′ to 3′) within the scope of the current invention include NmeCas9 (isolated from Neisseria meningitides) recognizing NNNNGATT, StCas9 (isolated from Streptococcus thermophiles) recognizing NNAGAAW, TdCas9 (isolated from Treponema denticola) recognizing NAAAAC and SaCas9 (isolated from Staphylococcus aureus) recognizing NNGRRT or NGRRT or NGRRN.
As used herein, the terms “guide polynucleotide/Cas endonuclease complex”, “guide polynucleotide/Cas endonuclease system”, “guide polynucleotide/Cas complex”, “guide polynucleotide/Cas system”, “guided Cas system” are used interchangeably herein and refer to at least one guide polynucleotide and at least one Cas endonuclease that are capable of forming a complex, wherein said guide polynucleotide/Cas endonuclease complex can direct the Cas endonuclease to a DNA target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the DNA target site. A guide polynucleotide/Cas endonuclease complex herein can comprise Cas protein(s) and suitable polynucleotide component(s) of any CRISPR system such as a type I, II, or III CRISPR system. A Cas endonuclease unwinds the DNA duplex at the target sequence and optionally cleaves at least one DNA strand, as mediated by recognition of the target sequence by a polynucleotide (such as, but not limited to, a crRNA or guide RNA) that is in complex with the Cas protein. Such recognition and cutting of a target sequence by a Cas endonuclease typically occurs if the correct protospacer-adjacent motif (PAM) is located at or adjacent to the 3′ end of the DNA target sequence. Alternatively, a Cas protein herein may lack DNA cleavage or nicking activity, but can still specifically bind to a DNA target sequence when complexed with a suitable RNA component.
A guide polynucleotide/Cas endonuclease complex can cleave one or both strands of a DNA target sequence.
“Cas9” (formerly referred to as Cas5, Csn1, or Csx12) herein refers to a Cas endonuclease of a type II CRISPR system that forms a complex with a crNucleotide and a tracrNucleotide, or with a single guide polynucleotide, for specifically recognizing and cleaving all or part of a DNA target sequence. A type II CRISPR system includes a DNA cleavage system utilizing a Cas9 endonuclease in complex with at least one polynucleotide component. For example, a Cas9 can be in complex with a CRISPR RNA (crRNA) and a trans-activating CRISPR RNA (tracrRNA). In another example, a Cas9 can be in complex with a single guide RNA.
The guide polynucleotide can also be a single molecule (also referred to as single guide polynucleotide) comprising a crNucleotide sequence linked to a tracrNucleotide sequence. The single guide polynucleotide being comprised of sequences from the crNucleotide and the tracrNucleotide may be referred to as “single guide RNA” (when composed of a contiguous stretch of RNA nucleotides) or “single guide DNA” (when composed of a contiguous stretch of DNA nucleotides) or “single guide RNA-DNA” (when composed of a combination of RNA and DNA nucleotides).
The single guide polynucleotide can form a complex with a Cas endonuclease, wherein said guide polynucleotide/Cas endonuclease complex (also referred to as a guide polynucleotide/Cas endonuclease system) can direct the Cas endonuclease to a genomic target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the target site.
The terms “single guide RNA” or “sgRNA” are used interchangeably herein and relate to a synthetic fusion of two RNA molecules, a crRNA (CRISPR RNA) comprising a variable targeting domain (linked to a tracr mate sequence that hybridizes to a tracrRNA), fused to a tracrRNA (trans-activating CRISPR RNA). The single guide RNA can comprise a crRNA or crRNA fragment and a tracrRNA or tracrRNA fragment of the type II CRISPR/Cas system that can form a complex with a type II Cas endonuclease, wherein said guide RNA/Cas endonuclease complex can direct the Cas endonuclease to a DNA target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the DNA target site.
The terms “guide RNA” or “guide RNA/Cas endonuclease complex”, “guide RNA/Cas endonuclease system”, “guide RNA/Cas complex”, “guide RNA/Cas system”, “gRNA/Cas complex”, “gRNA/Cas system”, “RNA-guided endonuclease” are used interchangeably herein and refer to at least one RNA component preferably with at least one Cas endonuclease that are capable of forming a complex, wherein said guide RNA/Cas endonuclease complex can direct the Cas endonuclease to a DNA target site, enabling the Cas endonuclease to recognize, bind to, and optionally nick or cleave (introduce a single or double strand break) the DNA target site. A guide RNA/Cas endonuclease complex herein can comprise Cas protein(s) and suitable RNA component(s) of any of the known CRISPR systems such as a type I, II, or III CRISPR system. A guide RNA/Cas endonuclease complex can comprise a Type II Cas9 endonuclease and at least one RNA component (e.g., a crRNA and tracrRNA, or a gRNA).
The guide polynucleotide of the methods and compositions described herein may be any polynucleotide sequence that targets the genomic loci of a plant cell comprising a polynucleotide having a nucleic acid sequence that is at least 75% (e.g., 80%, 85% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%) similar or identical to a sequence selected from the group consisting of SEQ ID NOs: 1-6 and SEQ ID NOs: 1150-1153.
In certain embodiments, the guide polynucleotide is a guide RNA (gRNA). The gRNA that targets SEQ ID NO: 1 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:7-91.
According to further embodiments, the gRNA that targets SEQ ID NO: 2 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:92-176.
According to further embodiments, the gRNA that targets SEQ ID NO: 3 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:177-278.
According to further embodiments, the gRNA that targets SEQ ID NO: 4 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:279-419.
According to further embodiments, the gRNA that targets SEQ ID NO: 5 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:420-560.
According to further embodiments, the gRNA that targets SEQ ID NO: 6 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:561-681.
According to further embodiments, the gRNA that targets SEQ ID NO: 1150 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:682-794.
According to further embodiments, the gRNA that targets SEQ ID NO: 1151 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:795-895.
According to further embodiments, the gRNA that targets SEQ ID NO: 1152 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:896-1016.
According to further embodiments, the gRNA that targets SEQ ID NO: 1153 comprises at least 90% similarity to a nucleotide sequence selected from the group consisting of SEQ ID NO:1017-1149.
The guide polynucleotide can be introduced into a cell transiently, as single stranded polynucleotide or a double stranded polynucleotide, using any method known in the art such as, but not limited to, particle bombardment or Agrobacterium transformation. The guide polynucleotide can also be introduced indirectly into a cell by introducing a recombinant DNA molecule (via methods such as, but not limited to, particle bombardment or Agro bacterium transformation) comprising a heterologous nucleic acid fragment encoding a guide polynucleotide, operably linked to a specific promoter that is capable of transcribing the guide RNA in said cell. The specific promoter can be, but is not limited to, a RNA polymerase III promoter.
The terms “target site”, “target sequence”, “target site sequence”, target DNA″, “target locus”, “genomic target site”, “genomic target sequence”, “genomic target locus” and “protospacer”, are used interchangeably herein and refer to a polynucleotide sequence such as, but not limited to, a nucleotide sequence on a chromosome, episome, or any other DNA molecule in the genome (including chromosomal, choloroplastic, mitochondrial DNA, plasmid DNA) of a cell, at which a guide polynucleotide/Cas endonuclease complex can recognize, bind to, and optionally nick or cleave. The target site can be an endogenous site in the genome of a cell, or alternatively, the target site can be heterologous to the cell and thereby not be naturally occurring in the genome of the cell, or the target site can be found in a heterologous genomic location compared to where it occurs in nature.
The length of the target DNA sequence (target site) can vary, and includes, for example, target sites that are at least 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more nucleotides in length.
Active variants of genomic target sites can also be used. Such active variants can comprise at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to the given target site, wherein the active variants retain biological activity and hence are capable of being recognized and cleaved by an endonuclease (e.g. Cas). Assays to measure the single or double-strand break of a target site by an endonuclease are known in the art and generally measure the overall activity and specificity of the agent on DNA substrates containing recognition sites.
The term “protospacer adjacent motif” or “PAM” as used herein refers hereinafter to a short (e.g. 2-6 base pair) adjacent to a target sequence (protospacer) that is recognized (targeted) by a guide polynucleotide/Cas endonuclease system described herein. The Cas endonuclease may not successfully recognize a target DNA sequence if the target DNA sequence is not followed by a PAM sequence. The sequence and length of a PAM herein can differ depending on the Cas protein or Cas protein complex used. The PAM sequence can be of any length but is typically 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 nucleotides long.
PAM is a component of the invading virus or plasmid, but is not a component of the bacterial CRISPR locus. PAM is an essential targeting component which distinguishes bacterial self from non-self DNA, thereby preventing the CRISPR locus from being targeted and destroyed by nuclease.
It is within the scope of the present invention that gRNA sequences complementary or corresponding to the target sequence to be modified comprises a 3′ Protospacer Adjacent Motif (PAM) selected from the group consisting of NGG (SpCas), NNNNGATT (NmeCas9), NNAGAAW (StCas9), NAAAAC (TdCas9) and NNGRRT (SaCas9).
The term “Next-generation sequencing” or “NGS” as used herein refers hereinafter to massively, parallel, high-throughput or deep sequencing technology platforms that perform sequencing of millions of small fragments of DNA in parallel. Bioinformatics analyses are used to piece together these fragments by mapping the individual reads to the reference genome.
A “genomic locus” of a plant as used herein, generally refers to the location on a chromosome of the plant where a gene, such as a polynucleotide involved in THCA synthesis (THCAS) and/or CBDA synthesis (CBDAS), is found. It is within the scope of the current invention that the genetic locus includes the coding sequence of a gene and the regulatory regions (non-coding sequences) located upstream (e.g. promoter sequences or elements) and/or downstream (e.g. terminator sequences or elements) to the coding region and controlling its expression, namely transcription and/or translation. As used herein, “gene” includes a nucleic acid fragment that expresses or encodes a functional molecule such as, but not limited to, a specific protein coding sequence, and regulatory elements, regions or sequences, such as those preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence.
In general, a locus is the specific physical location of a gene or other DNA sequence on a chromosome. The plural of locus is “loci”.
According to some aspects of the invention, a functional locus, as used herein means the whole set of genomic regions that are alternatively used to carry out the same function. Regulatory regions may be also an example of different DNA regions belonging to the same functional locus. Different elements such as promoters and enhancers regulate gene expression, and a given gene may be under the control of multiple of such regulatory elements.
According to further aspects of the present invention, the promoter sequence includes the region upstream to 5′UTR, preceding the first exon of the open reading frame (ORF) or CDS.
A “regulatory region” or “regulatory element” or “regulatory sequence” generally refers to a transcriptional regulatory element involved in regulating the transcription of a nucleic acid molecule such as a gene or a target gene. The regulatory region or element comprises nucleic acids and may include a promoter, an enhancer, an intron, a 5′-untranslated region (5′-UTR, also known as a leader sequence), or a 3′-UTR or a combination thereof. A regulatory element may act in “cis” or “trans”, and generally it acts in “cis”, i.e. it activates expression of genes located on the same nucleic acid molecule, e.g. a chromosome, where the regulatory element is located. According to further aspects, a regulatory region is operably linked to the coding region of a gene and controlling its expression (transcription and/or translation).
A regulatory sequence is a segment of a nucleic acid molecule which is capable of increasing or decreasing the expression of specific genes within an organism.
An “enhancer” element is any nucleic acid molecule that increases transcription of a nucleic acid molecule when functionally linked to a promoter regardless of its relative position.
It is noted that the sequence of a regulatory region or element may be less conserved among different variants or alleles of the same gene.
A “repressor” (also herein referred to as silencer) is defined as any nucleic acid molecule which inhibits the transcription when functionally linked to a promoter regardless of relative position.
A “promoter” generally refers to a nucleic acid fragment capable of controlling transcription of another nucleic acid fragment. A promoter generally includes a core promoter (also known as minimal promoter) sequence that includes a minimal regulatory region to initiate transcription that is a transcription start site. Generally, a core promoter includes a TATA box and a GC rich region associated with a CAAT box or a CCAAT box. These elements act to bind RNA polymerase II to the promoter and assist the polymerase in locating the RNA initiation site. Some promoters may not have a TATA box or CAAT box or a CCAAT box, but instead may contain an initiator element for the transcription initiation site. A core promoter is a minimal sequence required to direct transcription initiation and generally may not include enhancers or other UTRs (e.g. 5′ untranslated region, also known as a leader sequence or leader RNA, directly upstream from the initiation codon.). According to some aspects of the invention, a regulatory region comprises a transcription factor binding site, an RNA polymerase binding site, a TATA box, or a combination of structural variations thereof.
The term “cis-element” or “cis-regulatory region” or “cis-regulatory element (CRE)” generally refers to transcriptional regulatory element that affects or modulates expression of an operably linked transcribable polynucleotide, where the transcribable polynucleotide is present in the same DNA sequence. A cis-element may function to bind transcription factors, which are trans-acting polypeptides that regulate transcription. Cis-regulatory elements (CREs) are regions of non-coding DNA which regulate the transcription of neighboring genes.
An “intron” is an intervening sequence in a gene that is transcribed into RNA but is then excised in the process of generating the mature mRNA. The term is also used for the excised RNA sequences.
An “exon” is a portion of the sequence of a gene that is transcribed and is found in the mature messenger RNA derived from the gene but is not necessarily a part of the sequence that encodes the final gene product.
The 5′ untranslated region (5′UTR) (also known as a translational leader sequence or leader RNA) is the region of an mRNA that is directly upstream from the initiation codon. This region is involved in the regulation of translation of a transcript by differing mechanisms in viruses, prokaryotes and eukaryotes.
The “3′ non-coding sequences” refer to DNA sequences located downstream of a coding sequence and include polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3′ end of the mRNA precursor.
“RNA transcript” generally refers to a product resulting from RNA polymerase-catalyzed transcription of a DNA sequence. When an RNA transcript is a perfect complimentary copy of a DNA sequence, it is referred to as a primary transcript or it may be a RNA sequence derived from posttranscriptional processing of a primary transcript and is referred to as a mature RNA. “Messenger RNA” (“mRNA”) generally refers to RNA that is without introns and that can be translated into protein by the cell. “cDNA” generally refers to a DNA that is complementary to and synthesized from an mRNA template using the enzyme reverse transcriptase. The cDNA can be single-stranded or converted into the double-stranded by using the Klenow fragment of DNA polymerase I. “Sense” RNA generally refers to RNA transcript that includes mRNA and so can be translated into protein within a cell or in vitro. “Antisense RNA” generally refers to a RNA transcript that is complementary to all or part of a target primary transcript or mRNA and that blocks expression or transcripts accumulation of a target gene. The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e. at the 5′ non-coding sequence, 3′ non-coding sequence, introns, or the coding sequence. “Functional RNA” generally refers to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect on cellular processes.
“Targeted DNA modification” can be used synonymously with targeted DNA mutation or targeted nucleotide modification and refers to the introduction of a site specific modification that alters or changes the nucleotide sequence at a specific genomic locus of the plant (e.g., Cannabis).
In certain embodiments, the targeted DNA modification occurs at a genomic locus that comprises a regulatory region (e.g. promoter region) involved in controlling expression of THCAS and/or CBDAS polynucleotide variant or homologue and thus affecting THCA and/or CBDA content or concentration in the Cannabis plant.
According to main embodiments, the THCAS polynucleotide variant or homologue comprises “Finola” (FN) variety FNTHCAS-1 and FNTHCAS-2 genes, and Purple Kush (PK) strain PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4 genes.
In certain embodiments, the polynucleotide sequence of the regulatory regions operably linked to the THCAS genes FNTHCAS-1, FNTHCAS-2, PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4 comprising a nucleic acid sequence that is at least 75% (e.g., 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%) similar or identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-6, respectively.
According to other main embodiments, the CBDAS polynucleotide variant or homologue comprises Cannabidiolic acid synthase-like 1 (CBDAS2), “Finola” (FN) variety FNCBDAS gene, and Purple Kush (PK) strain PKCBDAS and PKCBDAS1 genes.
In certain embodiments, the polynucleotide sequence of the regulatory regions operably linked to the CBDAS genes selected from Cannabidiolic acid synthase-like 1 (CBDAS2), “Finola” (FN) variety FNCBDAS gene, and Purple Kush (PK) strain PKCBDAS and PKCBDAS1, comprising a nucleic acid sequence that is at least 75% (e.g., 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%) similar or identical to a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1150-1153, respectively.
Sequence alignments and percent identity or similarity calculations may be determined using a variety of comparison methods and bioinformatics computing methods designed to detect similar or identical sequences, known to the skilled person in the relevant field.
In one embodiment the sequence identity or similarity percentage (%) is determined over the entire length of the molecule (nucleotide or amino acid).
The targeted DNA or genome modification described herein may be any modification known in the art such as, for example, insertion, deletion or indel, single nucleotide polymorphism (SNP), and or a polynucleotide modification. Additionally, the targeted DNA modification in the genomic locus may be located anywhere in the genomic locus, such as, for example, a coding region of the encoded polypeptide (e.g., exon), a non-coding region (e.g., intron), a regulatory element, or untranslated region. In preferred embodiments, the modification is in a regulatory element or region controlling a gene targeted for down regulation, knockdown or silencing, knockout mutation, a loss of function mutation or any combination thereof.
The specific location of the targeted DNA modification within the regulatory region polynucleotide of the THCAS and/or CBDAS gene is not particularly limited, as long as the targeted DNA modification results in reduced expression or activity of the protein encoded by the THCAS and/or CBDAS gene. In certain embodiments the targeted DNA modification is a deletion of one or more nucleotides, preferably contiguous, of the genomic locus.
As used herein “reduced”, “reduction”, “decrease” or the like refers to any detectable decrease in an experimental group (e.g., Cannabis plant with a targeted DNA modification described herein) as compared to a control group (e.g., wild-type Cannabis plant, preferably of similar genetic background, that does not comprise the targeted DNA modification).
Accordingly, reduced expression of a gene or protein comprises any detectable decrease in the total level of the RNA or protein in a sample and can be determined using routine methods in the art such as, for example, PCR, Northern blot, Western blotting, ELISA as well as methods described in Rodriguez-Leal, Daniel, et al. “Engineering quantitative trait variation for crop improvement by genome editing.” Cell 171.2 (2017): 470-480, which is incorporated herein by reference.
In certain embodiments, a reduction in the expression or activity of the THCAS and/or CBDAS polynucleotide is due to a targeted DNA modification at regulatory region operably linked to the THCAS and/or CBDAS coding region that results in one or more of the following:
In certain embodiments, the targeted DNA modification at a genomic locus involved in THCAS and/or CBDAS expression results in Cannabis plants that exhibit reduced THCA or THC and/or CBDA or CBD content, or free of THCA and/or THC and/or CBDA and/or CBD compared to or relative to comparable control plants lacking the targeted DNA modification. Such comparable control plants may be of a similar genotype or chemotype or genetic background, but lacking the at least one targeted nucleotide modification.
For example, in certain embodiments, the modified Cannabis plant has THCA and/or THC and/or CBDA and/or CBD content of up to 30% by weight, particularly between about 0% to about 30% by weight, more particularly between about 0.3% to about 30%, even more particularly between about 0.3% to about 10% by weight.
In other embodiments, the targeted DNA modification results in Cannabis plants that exhibit a THCA and/or THC and/or CBDA and/or CBD content of not more than about 0.3% by weight.
In yet other embodiments, the targeted DNA modification results in THCAS and/or CBDAS promoter region results in THCA and/or CBDA free Cannabis plant.
In certain embodiments, the regulatory region within THCAS and/or CBDAS genomic locus has more than one (e.g., 2, 3, 4, 5, 6, 7, 8, 9, or 10) targeted DNA modification.
In certain embodiments, the plant may have targeted DNA modifications at more than one (e.g., 2, 3, 4, 5, 6, 7, 8, 9, or 10) genomic loci that are involved in regulation of THCAS and/or CBDAS expression in the plant (e.g., Cannabis).
The targeted DNA modification of the genomic locus may be done using any genome modification technique known in the art. In certain embodiments the targeted DNA modification is through a genome modification technique selected from the group consisting of a polynucleotide-guided endonuclease, CRISPR-Cas endonucleases, base editing deaminases, zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), engineered site-specific meganuclease, or Argonaute.
In some embodiments, the genome modification may be facilitated through the induction of a double-stranded break (DSB) or single-strand break, in a defined position in the genome near the desired alteration. DSBs can be induced using any DSB-inducing agent available, including, but not limited to, TALENs, meganucleases, zinc finger nucleases, Cas9-gRNA systems (based on bacterial CRISPR-Cas systems), guided cpf1 endonuclease systems, and the like. In some embodiments, the introduction of a DSB can be combined with the introduction of a polynucleotide modification template.
A polynucleotide modification template can be introduced into a cell by any method known in the art, such as, but not limited to, transient introduction methods, bioloistics and transformation by Agrobacterium and viral based vectors.
The polynucleotide modification template can be introduced into a cell as a single stranded polynucleotide molecule, a double stranded polynucleotide molecule, or as part of a circular DNA (vector DNA). The polynucleotide modification template can also be tethered to the guide RNA and/or the Cas endonuclease. Tethered DNAs can allow for co-localizing target and template DNA, useful in genome editing and targeted genome regulation, and can also be useful in targeting post-mitotic cells where function of endogenous HR machinery is expected to be highly diminished.
The polynucleotide modification template may be present transiently in the cell or it can be introduced via a viral replicon.
A “modified nucleotide” or “edited nucleotide” or “genome modification” refers to a nucleotide sequence of interest that comprises at least one alteration when compared to its non-modified nucleotide sequence form, for example in a control Cannabis plant which does not comprise the alternation. Such “alterations” include, for example: (i) replacement of at least one nucleotide, (ii) a deletion of at least one nucleotide, (iii) an insertion of at least one nucleotide, or (iv) any combination of (i)-(iii).
The term “polynucleotide modification template” includes a polynucleotide that comprises at least one nucleotide modification when compared to the nucleotide sequence to be edited. A nucleotide modification can be at least one nucleotide substitution, addition or deletion. Optionally, the polynucleotide modification template can further comprise homologous nucleotide sequences flanking the at least one nucleotide modification, wherein the flanking homologous nucleotide sequences provide sufficient homology to the desired nucleotide sequence to be edited.
The process for editing a genomic sequence combining DSB and modification templates generally comprises: providing to a host cell, a DSB-inducing agent, or a nucleic acid encoding a DSB-inducing agent, that recognizes a target sequence in the chromosomal sequence and is able to induce a DSB in the genomic sequence, and at least one polynucleotide modification template comprising at least one nucleotide alteration when compared to the nucleotide sequence to be edited. The polynucleotide modification template can further comprise nucleotide sequences flanking the at least one nucleotide alteration, in which the flanking sequences are substantially homologous to the chromosomal region flanking the DSB.
The endonuclease can be provided to a cell by any method known in the art, for example, but not limited to, transient introduction methods, transfection, microinjection, and/or topical application or indirectly via recombination constructs. The endonuclease can be provided as a protein or as a guided polynucleotide complex directly to a cell or indirectly via recombination constructs. The endonuclease can be introduced into a cell transiently or can be incorporated into the genome of the host cell using any method known in the art. In the case of a CRISPR-Cas system, uptake of the endonuclease and/or the guided polynucleotide into the cell can be facilitated with a Cell Penetrating Peptide (CPP). Transformation can be performed using a DNA plasmid such as a plant codon optimized Streptococcus pyogenes Cas9 (pcoSpCas9) plasmid. The plasmid contains the plant codon optimized SpCas9 and the above mentioned at least one sgRNA. DNA introduction into the plant cells can be done by Agrobacterium infiltration, virus based plasmids for delivery of the genome editing molecules and mechanical insertion of DNA (PEG mediated DNA transformation, biolistics, etc.).
In addition, it is within the scope of the present invention that the Cas9 protein is directly inserted together with a gRNA (ribonucleoprotein-RNP's) in order to bypass the need for in vivo transcription and translation of the Cas9+gRNA plasmid in planta to achieve gene editing. Insertion of the aforementioned plasmid DNA can be done, but is not limited to, using different delivery systems, biological and/or mechanical, e.g. Agrobacterium infiltration, virus based plasmids for delivery of the genome editing molecules and mechanical insertion of DNA (PEG mediated DNA transformation, biolistics, etc.).
As used herein, a “genomic region” is a segment of a chromosome in the genome of a cell that is present on either side of the target site or, alternatively, also comprises a portion of the target site. The genomic region can comprise at least 5-10, 5-15, 5-20, 5-25, 5-30, 5-35, 5-40, 5-45, 5-50, 5-55, 5-60, 5-65, 5-70, 5-75, 5-80, 5-85, 5-90, 5-95, 5-100, 5-200, 5-300, 5-400, 5-500, 5-600, 5-700, 5-800, 5-900, 5-1000, 5-1100, 5-1200, 5-1300, 5-1400, 5-1500, 5-1600, 5-1700, 5-1800, 5-1900, 5-2000, 5-2100, 5-2200, 5-2300, 5-2400, 5-2500, 5-2600, 5-2700, 5-2800. 5-2900, 5-3000, 5-3100 or more bases such that the genomic region has sufficient homology to undergo homologous recombination with the corresponding region of homology.
The terms “targeting”, “gene targeting” and “DNA targeting” are used interchangeably herein. DNA targeting herein may be the specific introduction of a knock out, knock down, edit, or knock-in at a particular DNA sequence, such as in a chromosome or plasmid of a cell. In general, DNA targeting can be performed herein by cleaving one or both strands at a specific DNA sequence in a cell with an endonuclease associated with a suitable polynucleotide component. Such DNA cleavage, if a double-strand break (DSB), can lead to modifications at the target site.
A targeting method herein can be performed in such a way that two or more DNA target sites are targeted in the method, for example. Such a method can optionally be characterized as a multiplex method. Two, three, four, five, six, seven, eight, nine, ten, or more target sites can be targeted at the same time in certain embodiments. A multiplex method is typically performed by a targeting method in which multiple different RNA components are provided, each designed to guide a guide polynucleotide/Cas endonuclease complex to a unique DNA target site.
The terms “knock-out”, “gene knock-out” and “genetic knock-out” are used interchangeably herein. A knock-out represents a DNA sequence of a cell that has been rendered partially or completely inoperative by targeting with a guide polynucleotide/endonuclease complex such as Cas protein; such a DNA sequence prior to knock-out could have encoded an amino acid sequence, or could have had a regulatory function (e.g., promoter), for example. A knock-out may be produced by an indel (insertion or deletion of nucleotide bases in a target DNA sequence e.g. through NHEJ), or by specific removal of sequence that reduces or completely destroys the function of sequence at or near the targeting site.
The guide polynucleotide/Cas endonuclease system can be used in combination with a co-delivered polynucleotide modification template to allow for editing (modification) of a genomic nucleotide sequence of interest.
The terms “knock-in”, “gene knock-in, “gene insertion” and “genetic knock-in” are used interchangeably herein. A knock-in represents the replacement or insertion of a DNA sequence at a specific DNA sequence in cell by targeting with a Cas protein (by HR, wherein a suitable donor DNA polynucleotide is also used). Examples of knock-ins are a specific insertion of a heterologous amino acid coding sequence in a coding region of a gene, or a specific insertion of a transcriptional regulatory element in a genetic locus.
The term “gene knockdown” as used herein refers hereinafter to an experimental technique by which the expression of one or more of an organism's genes is reduced. The reduction can occur through genetic modification, i.e. targeted genome editing or by treatment with a reagent such as a short DNA or RNA oligonucleotide that has a sequence complementary to either gene or an mRNA transcript. The reduced expression can be at the level of RNA or at the level of protein. It is within the scope of the present invention that the term gene knockdown also refers to a loss of function mutation and/or gene knockout mutation in which an organism's genes is made inoperative or nonfunctional.
The term “gene silencing” as used herein refers hereinafter to the regulation of gene expression in a cell to prevent the expression of a certain gene. Gene silencing can occur during either transcription or translation. In certain aspects of the invention, gene silencing is considered to have a similar meaning as gene knockdown. When genes are silenced, their expression is reduced. In contrast, when genes are knocked out, they are completely not expressed. Gene silencing may be considered a gene knockdown mechanism since the methods used to silence genes, such as RNAi, CRISPR, or siRNA, generally reduce the expression of a gene by at least 70% but do not completely eliminate it.
The term “loss of function mutation” as used herein refers to a type of mutation in which the altered gene product lacks the function of the wild-type gene. A synonyms of the term included within the scope of the present invention is null mutation.
The term “microRNAs” or “miRNAs” refers hereinafter to small non-coding RNAs that have been found in most of the eukaryotic organisms. They are involved in the regulation of gene expression at the post-transcriptional level in a sequence specific manner. MiRNAs are produced from their precursors by Dicer-dependent small RNA biogenesis pathway. MiRNAs are candidates for studying gene function using different RNA-based gene silencing techniques. For example, artificial miRNAs (amiRNAs) targeting one or several genes of interest is a potential tool in functional genomics.
The term “in planta” means in the context of the present invention within the plant or plant cells. More specifically, it means introducing CRISPR/Cas complex into plant material comprising a tissue culture of several cells, a whole plant, or into a single plant cell, without introducing a foreign gene or a mutated gene. It also used to describe conditions present in a non-laboratory environment (e.g. in vivo).
The term “genotype” or “genetic background” refers hereinafter to the genetic constitution of a cell or organism. An individual's genotype includes the specific alleles, for one or more genetic marker loci, present in the individual's haplotype. As is known in the art, a genotype can relate to a single locus or to multiple loci, whether the loci are related or unrelated and/or are linked or unlinked. In some embodiments, an individual's genotype relates to one or more genes that are related in that the one or more of the genes are involved in the expression of a phenotype of interest. Thus, in some embodiments a genotype comprises a summary of one or more alleles present within an individual at one or more genetic loci. In some embodiments, a genotype is expressed in terms of a haplotype. It further refers to any inbreeding group, including taxonomic subgroups such as subspecies, taxonomically subordinate to species and superordinate to a race or subrace and marked by a pre-determined profile of latent factors of hereditary traits.
The term “orthologue” as used herein refers hereinafter to one of two or more homologous gene sequences found in different species.
The term “functional variant” or “functional variant of a nucleic acid or amino acid sequence” as used herein, for example with reference to SEQ ID NOs: 1-6 and 1150-1153 refers to a variant of a sequence or part of a sequence which retains the biological function of the full non-variant allele and hence has the activity of the expressed gene or protein. A functional variant also comprises a variant of the gene of interest encoding a polypeptide which has sequence alterations that do not affect function of the resulting protein, for example, in non-conserved residues. Also encompassed is a variant that is substantially identical, i.e. has only some sequence variations, for example, in non-conserved residues, to the wild type nucleic acid or amino acid sequences of the alleles as shown herein, and is biologically active.
The term “variety” or “cultivar” used herein means a group of similar plants that by structural features and performance can be identified from other varieties within the same species.
The term “allele” used herein means any of one or more alternative or variant forms of a gene or a genetic unit at a particular locus, all of which alleles relate to one trait or characteristic at a specific locus. In a diploid cell of an organism, alleles of a given gene are located at a specific location, or locus (loci plural) on a chromosome. Alternative or variant forms of alleles may be the result of single nucleotide polymorphisms, insertions, inversions, translocations or deletions, or the consequence of gene regulation caused by, for example, by chemical or structural modification, transcription regulation or post-translational modification/regulation. An allele associated with a qualitative trait may comprise alternative or variant forms of various genetic units including those mat are identical or associated with a single gene or multiple genes or their products or even a gene disrupting or controlled by a genetic factor contributing to the phenotype represented by the locus. According to further embodiments, the term “allele” designates any of one or more alternative forms of a gene at a particular locus. Heterozygous alleles are two different alleles at the same locus. Homozygous alleles are two identical alleles at a particular locus. A wild type allele is a naturally occurring allele.
As used herein, the term “locus” (loci plural) means a specific place or places or region or a site on a chromosome where for example a gene or genetic marker element or factor is found. In specific embodiments, such a genetic element is contributing to a trait.
As used herein, the term “homozygous” refers to a genetic condition or configuration existing when two identical or like alleles reside at a specific locus, but are positioned individually on corresponding pairs of homologous chromosomes in the cell of a diploid organism.
Conversely, as used herein, the term “heterozygous” means a genetic condition or configuration existing when two different or unlike alleles reside at a specific locus, but are positioned individually on corresponding pairs of homologous chromosomes in the cell of a diploid organism.
As used herein, the phrase “genetic marker” or “molecular marker” or “biomarker” refers to a feature in an individual's genome e.g., a nucleotide or a polynucleotide sequence that is associated with one or more loci or trait of interest In some embodiments, a genetic marker is polymorphic in a population of interest, or the locus occupied by the polymorphism, depending on context. Genetic markers or molecular markers include, for example, single nucleotide polymorphisms (SNPs), indels (i.e. insertions deletions), simple sequence repeats (SSRs), restriction fragment length polymorphisms (RFLPs), random amplified polymorphic DNAs (RAFDs), cleaved amplified polymorphic sequence (CAPS) markers, Diversity Arrays Technology (DArT) markers, and amplified fragment length polymorphisms (AFLPs) or combinations thereof, among many other examples such as the DNA sequence per se. Genetic markers can, for example, be used to locate genetic loci containing alleles on a chromosome that contribute to variability of phenotypic traits. The phrase “genetic marker” or “molecular marker” or “biomarker” can also refer to a polynucleotide sequence complementary or corresponding to a genomic sequence, such as a sequence of a nucleic acid used as a probe or primer.
As used herein, the term “germplasm” refers to the totality of the genotypes of a population or other group of individuals (e.g., a species). The term “germplasm” can also refer to plant material; e.g., a group of plants that act as a repository for various alleles. Such germplasm genotypes or populations include plant materials of proven genetic superiority; e.g., for a given environment or geographical area, and plant materials of unknown or unproven genetic value; that are not part of an established breeding population and that do not have a known relationship to a member of the established breeding population.
The terms “hybrid”, “hybrid plant” and “hybrid progeny” used herein refers to an individual produced from genetically different parents (e.g., a genetically heterozygous or mostly heterozygous individual).
The term “homology” is meant DNA sequences that are similar. For example, a “region of homology to a genomic region” is a region of DNA that has a similar sequence to a given “genomic region” in the cell or organism genome. A region of homology can be of any length that is sufficient to promote homologous recombination at the cleaved target site, such that the region of homology has sufficient homology to undergo homologous recombination with the corresponding genomic region. “Sufficient homology” indicates that two polynucleotide sequences have sufficient structural similarity to act as substrates for a homologous recombination reaction or o have the same function. The structural similarity includes overall length of each polynucleotide fragment, as well as the sequence similarity of the polynucleotides.
“Sequence similarity” can be described by the percent sequence identity over the whole length of the sequences, and/or by conserved regions comprising localized similarities such as contiguous nucleotides having 100% sequence identity, and percent sequence identity over a portion of the length of the sequences.
The amount of sequence identity shared by a target and a donor polynucleotide can vary and includes total lengths and/or regions having unit integral values in the ranges of about 1-20 bp, 20-50 bp, 50-100 bp, 75-150 bp, 100-250 bp, 150-300 bp, 200-400 bp, 250-500 bp, 300-600 bp, 350-750 bp, 400-800 bp, 450-900 bp, 500-1000 bp, 600-1250 bp, 700-1500 bp, 800-1750 bp, 900-2000 bp, 1-2.5 kb, 1.5-3 kb, 2-4 kb, 2.5-5 kb, 3-6 kb, 3.5-7 kb, 4-8 kb, 5-10 kb, or up to and including the total length of the target site. These ranges include every integer within the range. The amount of homology can also be described by percent sequence identity over the full aligned length of the two polynucleotides which includes percent sequence identity (or similarity) of about at least 50%, 55%, 60%, 65%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%. Sufficient homology includes any combination of polynucleotide length, global percent sequence identity, and optionally conserved regions of contiguous nucleotides or local percent sequence identity, for example sufficient homology can be described as a region of 75-150 bp having at least 80% sequence identity to a region of the target locus. Sufficient homology can also be described by the predicted ability of two polynucleotides to specifically hybridize under high stringency conditions, see, for example, Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual, (Cold Spring Harbor Laboratory Press, NY); Current Protocols in Molecular Biology, Ausubel et al., Eds (1994) Current Protocols, (Greene Publishing Associates, Inc. and John Wiley & Sons, Inc.); and, Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, (Elsevier, New York).
The structural similarity between a given genomic region and the corresponding region of homology (e.g. found on the donor DNA) can be any degree of sequence identity that allows for homologous recombination to occur. For example, the amount of homology or sequence identity shared by the “region of homology” a corresponding DNA (e.g. of the donor DNA) and the “genomic region” of the organism genome can be at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity, such that the sequences undergo homologous recombination
As used herein, “homologous recombination” includes the exchange of DNA fragments between two DNA molecules at the sites of homology.
As used herein, “sequence identity” or “identity” in the context of two nucleic acid or polypeptide sequences makes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins, it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule. The term further refers hereinafter to the amount of characters which match exactly between two different sequences. Hereby, gaps are not counted and the measurement is relational to the shorter of the two sequences.
It is further within the scope that the terms “similarity” and “identity” additionally refer to local homology, identifying domains that are homologous or similar (in nucleotide and/or amino acid sequence). It is acknowledged that bioinformatics tools such as BLAST, SSEARCH, FASTA, and HMMER calculate local sequence alignments which identify the most similar region between two sequences. For domains that are found in different sequence contexts in different proteins, the alignment should be limited to the homologous domain, since the domain homology is providing the sequence similarity captured in the score. According to some aspects the term similarity or identity further includes a sequence motif, which is a nucleotide or amino-acid sequence pattern that is widespread and has, or is conjectured to have, a biological significance. Proteins may have a sequence motif and/or a structural motif, a motif formed by the three-dimensional arrangement of amino acids which may not be adjacent.
As used herein, the terms “nucleic acid”, “nucleic acid sequence”, “nucleotide”, “nucleic acid molecule” or “polynucleotide” are intended to include DNA molecules (e.g., cDNA or genomic DNA), RNA molecules (e.g., mRNA), natural occurring, mutated, synthetic DNA or RNA molecules, and analogs of the DNA or RNA generated using nucleotide analogs. It can be single-stranded or double-stranded. Such nucleic acids or polynucleotides include, but are not limited to, coding sequences of structural genes, anti-sense sequences, and non-coding regulatory sequences that do not encode mRNAs or protein products. These terms also encompass a gene. The term “gene”, “allele” or “gene sequence” is used broadly to refer to a DNA nucleic acid associated with a biological function. Thus, genes may include introns and exons as in the genomic sequence, or may comprise only a coding sequence as in cDNAs, and/or may include cDNAs in combination with regulatory sequences. Thus, according to the various aspects of the invention, genomic DNA, cDNA or coding DNA may be used. In one embodiment, the nucleic acid is cDNA or coding DNA.
The terms “peptide”, “polypeptide” and “protein” are used interchangeably herein and refer to amino acids in a polymeric form of any length, linked together by peptide bonds.
According to other aspects of the invention, a “modified” or a “mutant” plant is a plant that has been altered compared to the naturally occurring wild type (WT) plant. Specifically, the endogenous nucleic acid sequences of the promoter or regulatory regions of one or more of Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) homologues or variants, particularly pFNTHCAS-1 having a nucleic acid sequence as set forth in SEQ ID NO:1 or a functional variant thereof, pFNTHCAS-2 having a nucleic acid sequence as set forth in SEQ ID NO:2 or a functional variant thereof, pPKTHCAS-1 having a nucleic acid sequence as set forth in SEQ ID NO:3 or a functional variant thereof, pPKTHCAS-2 having a nucleic acid sequence as set forth in SEQ ID NO:4 or a functional variant thereof, pPKTHCAS-3 having a nucleic acid sequence as set forth in SEQ ID NO:5 or a functional variant thereof, pPKTHCAS-4 having a nucleic acid sequence as set forth in SEQ ID NO:6 or a functional variant thereof and any combination thereof, have been altered compared to wild type sequences using mutagenesis and/or genome editing methods as described herein. This causes inactivation of at least one of these endogenous genes and thus downregulate their function in production of THCA and/or THC.
In other embodiments of the present invention, the endogenous nucleic acid sequences of the promoter or regulatory regions of one or more of Cannabis cannabidiolic acid synthase (CsCBDAS) homologues or variants, particularly pCBDAS2 having a nucleic acid sequence as set forth in SEQ ID NO:1150 or a functional variant thereof, pFNCBDAS having a nucleic acid sequence as set forth in SEQ ID NO:1151 or a functional variant thereof, pPKCBDAS having a nucleic acid sequence as set forth in SEQ ID NO:1152 or a functional variant thereof and pPKCBDAS1 having a nucleic acid sequence as set forth in SEQ ID NO:1153 or a functional variant thereof, have been altered compared to wild type sequences using mutagenesis and/or genome editing methods as described herein. This causes inactivation of at least one of these endogenous genes and thus downregulate their function in production of CBDA and/or CBD.
Such plants have an altered cannabinoid profile which may be suitable for treatment of different medical conditions or diseases. Therefore, the cannabinoid profile is affected by the presence of at least one mutated endogenous cannabinoid biosynthesis enzyme gene (e.g. THCAS and/or CBDAS) in the Cannabis plant genome which has been specifically targeted using genome editing techniques.
As used herein, the term “cannabinoid biosynthesis enzyme” refers to a protein acting as a catalyst for producing one or more cannabinoids in a plant of genus Cannabis.
Examples of cannabinoid biosynthesis enzymes within the context of this disclosure include, but are not limited to: tetrahydrocannabinolic acid synthase (THCAS), cannabidiolic acid synthase (CBDAS), aromatic prenyltransferase (PT), olivetol synthase (OLS), acyl-activating enzyme 1 (AAE1), polyketide synthase (PKS), olivetolic acid cyclase (OAC), tetraketide synthase (TKS), type III PKS, chalcone synthase (CHS), prenyltransferase, CBCA synthase, GPP synthase, FPP synthase, Limonene synthase, aromatic prenyltransferase, and geranylphosphate: olivetolate geranyltrasferase.
In certain embodiments of the present invention, the promoter region controlling expression of THCAS variants of “Finola” cultivar, i.e. FNTHCAS-1 and FNTHCAS-2, as well as of “Purple Kush” strain, i.e. PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4 have been modified by targeted genome modification to downregulate their expression.
In certain other embodiments of the present invention, the promoter region controlling expression of CBDAS variants of cannabidiolic acid synthase-like 1 (CBDAS2), “Finola” cultivar, i.e. FNCBDAS, as well as of “Purple Kush” strain, i.e. PKCBDAS and PKCBDAS1 have been modified by targeted genome modification to downregulate their expression.
Disclosed herein, is a method of controlling THCA and/or CBDA synthesis in a plant of genus Cannabis. In some embodiments, the method comprising: Manipulating expression of a gene coding for the cannabinoid biosynthesis enzyme THCAS and/or CBDAS by targeting its promoter region controlling its expression using CRISPR/cas system with suitable corresponding gRNA sequences.
As used herein, the term “controlling” refers to directing, governing, steering, and/or manipulating, specifically reducing, decreasing or down regulating or silencing the amount of a cannabinoid or cannabinoids produced in a plant of genus Cannabis. In one embodiment, controlling means affecting the expression of a coding region of a gene using cis and/or trans genomic elements, particularly, causing loss of function or down regulation of the gene's function. In other embodiments, controlling means inducing, increasing, enhancing or elevating the amount or expression of genes encoding a cannabinoid or cannabinoids produced in a plant of genus Cannabis.
According to further aspects, controlling comprises modifying a plant of genus Cannabis to produce an unnaturally occurring concentration of a first cannabinoid, e.g. THCA and/or CBDA. In one embodiment, controlling comprises modifying a plant of genus Cannabis to produce an unnaturally occurring ratio of a first cannabinoid, e.g. THCA and/or CBDA to the other cannabinoids.
As used herein, the term “expression of a gene” or “gene expression” refers hereinafter to a plant's ability to utilize information from genetic material for producing functional gene products. Within the context of this disclosure, expression is meant to encompass the plant's ability to produce proteins, such as enzymes, and various other molecules from the plant's genetic material. In one embodiment, the plant expresses mutated or modified cannabinoid biosynthesis enzymes for cannabinoid biosynthesis. It is further within the scope that it refers to transcription (RNA) or translation (protein) levels of gene expression.
In the context of the present invention, a regulatory sequence (e.g. promoter region), targeted by genome modification, is capable of increasing or decreasing the expression of specific genes, e.g. THCAS and/or CBDAS gene expression.
As used herein, the term “manipulating expression of a gene” refers in the context of the present invention, to intentionally changing the genome of a plant of genus Cannabis, within regulatory regions of certain genes, to control the expression of certain features or characteristics of the plant.
In one embodiment, the plant's genome is manipulated to express less THCA synthase and/or CBDA synthase by mutating the promoter region operably linked to the gene using genome modification.
As used herein, the term “coding” refers to storing genetic information and accessing the genetic information for producing functional gene products.
According to further aspects of the present invention, the altered THC and/or CBD content trait is not conferred by the presence of transgenes expressed in Cannabis.
Cannabis plants of the invention are modified plants, compared to wild type plants, which comprise and express at least one mutant Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or cannabidiolic acid synthase (CBDAS) allele.
There are a variety of methods for the regeneration of plants from plant tissues. The particular method of regeneration will depend on the starting plant tissue and the particular plant species to be regenerated. The regeneration, development and cultivation of plants from single plant protoplast transformants or from various transformed explants is well known in the art.
Recombinant DNA constructs comprising one or more of the polynucleotide sequences set forth in SEQ ID NOs: 1-6, SEQ ID NOs: 1150-1153 and SEQ ID NOs: 7-1149 are provided herein.
Also provided are plants, plant cells, and/or seeds introduced with a polynucleotide described herein. In certain embodiments the plant, plant cell, or seed comprises a recombinant DNA construct comprising one or more of the polynucleotide sequences set forth in SEQ ID NOs: 1-1153.
In certain embodiments, the plant, plant cell, or seed comprises a recombinant DNA construct comprising one or more guide polynucleotides comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 7-1149, that target the genomic loci of a plant cell comprising a polynucleotide sequence that is at least 75% (e.g., 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%) identical to a sequence selected from the group consisting of SEQ ID NOs: 1-6 and SEQ ID NOs: 1150-1153.
The polynucleotide of the plant, plant cell, or seed can be stably introduced or can be transiently expressed by the plant, plant cell, or seed. In certain embodiments, the polynucleotide is stably introduced into the plant, plant cell, or seed.
The terms “polynucleotide”, “polynucleotide sequence”, “nucleic acid sequence”, “nucleotide sequence”, “nucleic acid fragment”, and “isolated nucleic acid fragment” are used interchangeably herein. These terms encompass nucleotide sequences and the like. A polynucleotide may be a polymer of RNA or DNA that is single- or double-stranded, that optionally contains synthetic, non-natural or altered nucleotide bases. A polynucleotide in the form of a polymer of DNA may be comprised of one or more segments of cDNA, genomic DNA, synthetic DNA, or mixtures thereof.
The term “recombinant DNA construct” or “recombinant expression construct” is used interchangeably and generally refers to a discrete polynucleotide into which a nucleic acid sequence or fragment can be moved. Preferably, it is a plasmid vector or a fragment thereof comprising the promoters of the present disclosure. The choice of plasmid vector is dependent upon the method that will be used to transform host plants. The skilled artisan is well aware of the genetic elements that must be present on the plasmid vector in order to successfully transform, select and propagate host cells containing the chimeric gene. The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by PCR and Southern analysis of DNA, RT-PCR and Northern analysis of mRNA expression, Western analysis of protein expression, or phenotypic analysis.
The terms “plasmid”, “vector” and “cassette” refer to an extra chromosomal element often carrying genes that are not part of the central metabolism of the cell, and usually in the form of circular double-stranded DNA fragments. Such elements may be autonomously replicating sequences, genome integrating sequences, phage or viral nucleotide sequences, linear or circular, of a single- or double-stranded DNA or RNA, derived from any source, in which a number of nucleotide sequences have been joined or recombined into a unique construction which is capable of introducing a promoter fragment and DNA sequence for a selected gene product along with appropriate 3′ untranslated sequence into a cell.
The promoters for use in the vector may be derived in their entirety from a native gene or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. Core promoters are often modified to produce artificial, chimeric, or hybrid promoters, and can further be used in combination with other regulatory elements, such as cis-elements, 5′UTRs, enhancers, or introns, that are either heterologous to an active core promoter or combined with its own partial or complete regulatory elements. In certain embodiments the promoter of the recombinant DNA construct may be a tissue-specific promoter, developmental regulated promoter, or a constitutive promoter.
“Tissue-specific promoter” and “tissue-preferred promoter” are used interchangeably to refer to a promoter that is expressed predominantly but not necessarily exclusively in one tissue or organ, but that may also be expressed in one specific cell. “Developmentally regulated promoter” generally refers to a promoter whose activity is determined by developmental events. “Constitutive promoter” generally refers to promoters active in all or most tissues or cell types of a plant at all or most developing stages. As with other promoters classified as “constitutive”, some variation in absolute levels of expression can exist among different tissues or stages. The term “constitutive promoter” or “tissue-independent” are used interchangeably herein.
In certain embodiments the promoter of the recombinant DNA construct is heterologous to the expressed nucleotide sequence. A “heterologous nucleotide sequence” generally refers to a sequence that is not naturally occurring with the sequence of the disclosure. While this nucleotide sequence is heterologous to the sequence, it may be homologous, or native, or heterologous, or foreign, to the plant host. However, it is recognized that the instant sequences may be used with their native coding sequences to increase or decrease expression resulting in a change in phenotype in the transformed seed.
The terms “heterologous nucleotide sequence”, “heterologous sequence”, “heterologous nucleic acid fragment”, and “heterologous nucleic acid sequence” are used interchangeably herein.
The term “operably linked” or “functionally linked” generally refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other. In other aspects, the ability of a regulatory nucleic acid sequence to drive the expression of a coding sequence. For example, a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e. that the coding sequence is under the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in sense or antisense orientation.
The terms “initiate transcription”, “initiate expression”, “control expression”, “drive transcription”, and “drive expression” are used interchangeably herein and all refer to the primary function of a promoter. As detailed in this disclosure, a promoter is a non-coding genomic DNA sequence, usually upstream (5′) to the relevant coding sequence, and its primary function is to act as a binding site for RNA polymerase and initiate transcription by the RNA polymerase. Additionally, there is “expression” of RNA, including functional RNA, or the expression of polypeptide for operably linked encoding nucleotide sequences, as the transcribed RNA ultimately is translated into the corresponding polypeptide. Thus the term “expression”, as used herein, generally refers to the production of a functional end-product e.g., an mRNA or a protein (precursor or mature).
The term “expression cassette” as used herein, generally refers to a discrete nucleic acid fragment into which a nucleic acid sequence or fragment can be cloned or synthesized through molecular biology techniques.
The expression cassette (e.g., CRISPR/RNA-guided endonuclease expression cassette such as a CRISPR/Cas9 expression cassette or a CRISPR/Cpf 1 expression cassette) may be introduced into a plant using any method known in the art or described herein, e.g., by such as Agrobacterium-mediated recombination, viral-vector mediated recombination, microinjection, gene gun bombardment/biolistic particle delivery, or electroporation of plant protoplasts. The expression cassette (e.g., CRISPR/RNA-guided endonuclease expression cassette such as a CRISPR/Cas9 expression cassette or a CRISPR/Cpf1 expression cassette) may be integrated onto the same chromosome or a different chromosome than the gene of interest. In some embodiments, integration of the expression cassette (CRISPR/RNA-guided endonuclease expression cassette such as a CRISPR/Cas9 expression cassette or a CRISPR/Cpf1 expression cassette) onto a different chromosome than the gene of interest is preferable so that the expression cassette can later be removed through a self-cross or a cross with another plant without having to undergo homologous recombination to separate the expression cassette from the gene of interest.
In some embodiments, the allele of the gene of interest contains the target region against which the gRNAs (e.g., sgRNAs) are designed such that mutations can be introduced into the target region of the allele using the RNA-guided endonuclease (e.g., Cas9, Cpf1, or Csm1 endonuclease). In some embodiments, the target region or a portion thereof, is part of a regulatory, non-coding region of the allele.
In some embodiments, the target region is 0 to 5000 base pairs (e.g., 0 to 5000, 0 to 4000, 0 to 3000, 0 to 2000, 0 to 1000, 100 to 5000, 100 to 4000, 100 to 3000, 100 to 2000, 100 to 1000, 500 to 5000, 500 to 4000, 500 to 3000, 500 to 2000, 500 to 1000, 1000 to 5000, 1000 to 4000, 1000 to 3000, or 1000 to 2000 base pairs) upstream of the 5′ end of the coding sequence of the gene or allele of interest.
In some embodiments, the target region is 0 to 5000 base pairs (e.g., 0 to 5000, 0 to 4000, 0 to 3000, 0 to 2000, 0 to 1000, 100 to 5000, 100 to 4000, 100 to 3000, 100 to 2000, 100 to 1000, 500 to 5000, 500 to 4000, 500 to 3000, 500 to 2000, 500 to 1000, 1000 to 5000, 1000 to 4000, 1000 to 3000, or 1000 to 2000 base pairs) downstream of the 3′ end of the coding sequence of the gene or allele of interest.
In some embodiments, the target region comprises a regulatory region of the gene or allele of interest, i.e. THCAS and/or CBDAS. As used herein, a “regulatory region” of a gene of interest contains one or more nucleotide sequences that, alone or in combination, are capable of modulating expression of the gene of interest. Regulatory regions include, for example, promoters, enhancers, terminators and introns. In some embodiments, the regulatory region comprises a transcription factor binding site, an RNA polymerase binding site, a TATA box, or a combination thereof. In some embodiments, the regulatory region is within a certain distance of the gene of interest, e.g., 0 to 5000 base pairs (e.g., 0 to 5000, 0 to 4000, 0 to 3000, 0 to 2000, 0 to 1000, 100 to 5000, 100 to 4000, 100 to 3000, 100 to 2000, 100 to 1000, 500 to 5000, 500 to 4000, 500 to 3000, 500 to 2000, 500 to 1000, 1000 to 5000, 1000 to 4000, 1000 to 3000, or 1000 to 2000 base pairs) upstream of the 5′ end of the coding sequence of the gene of interest, or 0 to 5000 base pairs (e.g., 0 to 5000, 0 to 4000, 0 to 3000, 0 to 2000, 0 to 1000, 100 to 5000, 100 to 4000, 100 to 3000, 100 to 2000, 100 to 1000, 500 to 5000, 500 to 4000, 500 to 3000, 500 to 2000, 500 to 1000, 1000 to 5000, 1000 to 4000, 1000 to 3000, or 1000 to 2000 base pairs) downstream of the 3′ end of the coding sequence of the gene of interest.
In some embodiments, a regulatory region may be identified using databases or other information available in the art.
In some embodiments, a regulatory region can be identified, e.g., by analyzing the sequences within a certain distance of the gene of interest (e.g., within 2-5 kilobases) for one or more of transcription factor binding sites, RNA polymerase binding sites, TATA boxes, reduced SNP density or conserved non-coding sequences.
In some embodiments, the target region may be larger, e.g., 0 to 100 kilobases (e.g., 0 to 100, 0 to 90, 0 to 80, 0 to 70, 0 to 60, 0 to 50, 0 to 40, 0 to 30, 0 to 20 or 0 to 10 kilobases) upstream of the 5′ end of the coding sequence of the gene of interest, or 0 to 60 kilobases (e.g., 0 to 60, 0 to 50, 0 to 40, 0 to 30, 0 to 20 or 0 to 10 kilobases) base pairs downstream of the 3′ end of the coding sequence of the gene of interest. Such larger regions may include both proximal promoter regions (e.g., within 1 to 3 Kb of the 5′ end of the coding sequence) and distal enhancer regions.
“Transformation” as used herein generally refers to both stable transformation and transient transformation. “Stable transformation” generally refers to the introduction of a nucleic acid fragment into a genome of a host organism resulting in genetically stable inheritance. Once stably transformed, the nucleic acid fragment is stably integrated in the genome of the host organism and any subsequent generation. Host organisms containing the transformed nucleic acid fragments are referred to as “transgenic” organisms.
“Transient transformation” generally refers to the introduction of a nucleic acid fragment into the nucleus, or DNA-containing organelle, of a host organism resulting in gene expression without genetically stable inheritance.
The term “introduced” means providing a nucleic acid (e.g., expression construct) or protein into a cell. Introduced includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell, and includes reference to the transient provision of a nucleic acid or protein to the cell. Introduced includes reference to stable or transient transformation methods, as well as sexually crossing. Thus, “introduced” in the context of inserting a nucleic acid fragment (e.g., a recombinant DNA construct/expression construct) into a cell, means “transfection” or “transformation” or “transduction” and includes reference to the incorporation of a nucleic acid fragment into a eukaryotic or prokaryotic cell where the nucleic acid fragment may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
The heterologous polynucleotide can be stably integrated within the genome such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of a recombinant DNA construct.
“Transient expression” generally refers to the temporary expression of often reporter genes such as b-glucuronidase (GUS), fluorescent protein genes ZS-GREEN1, ZS-YELLOW1 N1, AM-CYAN 1, DS-RED in selected certain cell types of the host organism in which the transgenic gene is introduced temporally by a transformation method. The transformed materials of the host organism are subsequently discarded after the transient gene expression assay.
“Plant” includes reference to whole plants, plant organs, plant tissues, seeds and plant cells and progeny of same. Plant cells include, without limitation, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores.
The plant, plant cell, and seed, of the compositions and methods described herein may further comprise a heterologous nucleic acid sequence that confers advantageous properties, such as improved agronomics, to the plant, plant cell, and/or seed. The heterologous nucleic acid sequences are known to those of ordinary skill in the art, and can be routinely incorporated in the plant, plant cell, and/or seeds described herein using routine methods in the art, such as those described herein.
In certain embodiments the heterologous nucleic acid sequence is selected from the group consisting of a reporter gene, a selection marker, a disease resistance gene, a herbicide resistance gene, an insect resistance gene, a gene involved in carbohydrate metabolism, a gene involved in fatty acid metabolism, a gene involved in amino acid metabolism, a gene involved in plant development, a gene involved in plant growth regulation, a gene involved in yield improvement, a gene involved in drought resistance, a gene involved in increasing nutrient utilization efficiency, a gene involved in cold resistance, a gene involved in heat resistance and a gene involved in salt resistance in plants.
In certain embodiments, the present disclosure contemplates the transformation of a recipient cell with more than one advantageous gene. Two or more genes can be supplied in a single transformation event using either distinct gene-encoding vectors, or a single vector incorporating two or more gene coding sequences. Any two or more genes of any description, such as those conferring herbicide, insect, disease (viral, bacterial, fungal, and nematode), or drought resistance, oil quantity and quality, or those increasing yield or nutritional quality may be employed as desired.
Main aspects of the invention involve targeted mutagenesis methods, specifically genome editing, and exclude embodiments that are solely based on generating plants by traditional breeding methods.
Described are polynucleotides as well as methods for modifying metabolite biosynthesis pathways in Cannabis plants and/or Cannabis plant cells, Cannabis plants and/or plant cells exhibiting modified metabolite biosynthesis pathways. In particular, described are methods for modifying production of THC and/or THCA, CBD and/or CBDA in Cannabis plants by modulating the expression and/or activity of at least one Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or cannabidiolic acid synthase (CsCBDAS) gene and Cannabis plants having modified expression and/or activity of at least one of these genes/proteins.
Accordingly, in certain embodiments, the present invention provides methods of downregulating production of THC and/or THCA, CBD and/or CBDA. In particular embodiments, there is provided methods of downregulating expression and/or activity of Cannabis THCA synthase and/or CBDA synthase by CRISPR/Cas9 assisted targeted genome editing within the regulatory region controlling the expression of the gene. This strategy enables to knock-down expression of the target genes and to generate knockout allele variations directly in the elite target germplasm with minimal genetic drag associated with conventional breeding material.
Down regulation of key steps in metabolic pathway re-directs intermediates and energy to alternative metabolic pathways and results in increased production and accumulation or reduced production and elimination of other end products. THC, CBD and other Cannabis metabolites share a biosynthetic pathway; that cannabigerolic acid (CBGA) is a precursor of THC, CBD and Cannabichromene. In particular, THCA synthase catalyzes the production of delta-9-tetrahydrocannabinolic acid from cannabigerolic acid; delta-9-tetrahydrocannabinolic undergoes thermal conversion to form THC. CRISPR/Cas9 genome editing has been widely adopted in plants as a tool for understanding fundamental biological processes. With this disclosure it is demonstrated that genome editing is useful for agricultural applications. It is herein shown that by targeting the promoters of genes involved in cannabinoid biosynthesis, e.g. THCA synthase and/or CBDA synthase, Cannabis plants with altered (e.g. reduced) THCA and/or CBDA content could be achieved.
The approach of the current invention enable production by CRISPR/Cas9 gene editing regulatory mutations associated with phenotypic variation. Such streamlined trait improvement is expected for the gene targeted. The phenotypic variation achieved by engineering regulatory alleles for a single gene (i.e. THCAS and/or CBDAS) previously required multiple natural and induced mutations within the gene or several genes.
The effects may be dominant or semi dominant or co-dominant. There is also potential for engineering, not only loss of function mutations, but also gain-of-function alleles.
It is further acknowledged that breeders expend great time and effort to adapt beneficial allelic variants to diverse breeding germplasm. By the current invention this constraint is bypassed by directly generating and selecting for the most desirable regulatory variant in the context of modified THCA and/or CBDA content in the Cannabis plant.
As the medical Cannabis pharmaceutical industry is focusing on developing new cannabinoid based drugs, and these are mostly extracted from the Cannabis plant, there is a growing need for Cannabis plants bred for producing high levels of specific cannabinoids. In addition, there is a need for advanced breeding programs for food and fiber (Hemp) as well.
The present invention is aimed at enhancing cannabinoid breeding capabilities by using advanced molecular genome editing technologies in order to maximize the plants' phyto-chemical molecules production potential.
According to a further aspect of the present invention, a method or a tool is provided that enables the regulation in planta or the production of specific cannabinoid molecules.
It is further within the scope of the present invention to provide means and methods for in planta modification of specific genes' regulatory regions that relate to and/or control the cannabinoid biosynthesis pathways (as indicated in
According to a main embodiment, the present invention provides a Cannabis plant exhibiting reduced tetrahydrocannabinolic acid (THCA) content, wherein said plant comprises a modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) gene expression, said genomic locus comprises at least one targeted nucleotide modification as compared to a control Cannabis plant, within the regulatory region controlling said THCAS expression.
According to a further main embodiment, the present invention provides a Cannabis plant exhibiting reduced cannabidiolic acid (CBDA) content, wherein said plant comprises a modified genomic locus involved in cannabidiolic acid synthase (CBDAS) gene expression, said genomic locus comprises at least one targeted nucleotide modification as compared to a control Cannabis plant, within the regulatory region controlling said CBDAS expression.
According to a further embodiment of the present invention, cannabinoid biosynthesis enzyme Δ9-tetrahydrocannabinolic acid (THCA) synthase (THCAS) is down regulated using targeted genome modification in a regulatory region operably linked to the coding region of the gene (e.g. gene editing techniques as inter alia presented). As a result, the production of the major cannabinoid THCA (converted into THC by decarboxylation) is significantly reduced and/or totally abolished.
According to a further embodiment of the present invention, cannabinoid biosynthesis enzyme cannabidiolic acid (CBDA) synthase (CBDAS) is down regulated using targeted genome modification in a regulatory region operably linked to the coding region of the gene (e.g. gene editing techniques as inter alia presented). As a result, the production of the major cannabinoid CBDA (converted into CBD by decarboxylation) is significantly reduced and/or totally abolished.
According to a further embodiment of the present invention, targeted genome modification of the herein identified Cannabis gene encoding cannabinoid synthesis enzyme CsTHCAS results in reduced or no production of THCA and thus Cannabis plants with reduced content, or free of, THCA and/or THC are provided by the present invention.
According to a further embodiment of the present invention, the THCAS gene is a THCAS variant or homologue of ‘Finola’ (FN) Cannabis strain (FNTHCAS) or a THCAS variant or homologue of Purple Kush (PK) Cannabis strain (PKTHCAS).
According to a further embodiment of the present invention, the FNTHCAS homologue is selected from the group consisting of FNTHCAS-1 and FNTHCAS-2 genes and the PKTHCAS homologue is selected from the group consisting of PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4 genes.
According to a further embodiment of the present invention, the regulatory region comprises a nucleotide sequence selected from the group consisting of pFNTHCAS-1 having a nucleic acid sequence as set forth in SEQ ID NO:1 or a functional variant thereof, pFNTHCAS-2 having a nucleic acid sequence as set forth in SEQ ID NO:2 or a functional variant thereof, pPKTHCAS-1 having a nucleic acid sequence as set forth in SEQ ID NO:3 or a functional variant thereof, pPKTHCAS-2 having a nucleic acid sequence as set forth in SEQ ID NO:4 or a functional variant thereof, pPKTHCAS-3 having a nucleic acid sequence as set forth in SEQ ID NO:5 or a functional variant thereof, pPKTHCAS-4 having a nucleic acid sequence as set forth in SEQ ID NO:6 or a functional variant thereof and any combination thereof.
According to a further embodiment of the present invention, the functional variant has at least 75% sequence identity or similarity to the nucleic acid sequence of said regulatory region sequence or a codon degenerate nucleotide sequence thereof.
According to a further embodiment of the present invention, targeted genome modification of the herein identified Cannabis gene encoding cannabinoid synthesis enzyme CsCBDAS results in reduced or no production of CBDA and thus Cannabis plants with reduced content, or free of, CBDA and/or CBD are provided by the present invention.
According to a further embodiment of the present invention, the CBDAS gene is a cannabidiolic acid synthase-like 1 (CBDAS2) variant or homologue, CBDAS variant or homologue of ‘Finola’ (FN) Cannabis strain (FNCBDAS) or a CBDAS variant or homologue of Purple Kush (PK) Cannabis strain (PKCBDAS or PKCBDAS1).
According to a further embodiment of the present invention, the regulatory region comprises a nucleotide sequence selected from the group consisting of pCBDAS2 having a nucleic acid sequence as set forth in SEQ ID NO:1150 or a functional variant thereof, pFNCBDAS having a nucleic acid sequence as set forth in SEQ ID NO:1151 or a functional variant thereof, pPKCBDAS having a nucleic acid sequence as set forth in SEQ ID NO:1152 or a functional variant thereof and pPKCBDAS1 having a nucleic acid sequence as set forth in SEQ ID NO:1153 or a functional variant thereof and any combination thereof.
According to a further embodiment of the present invention, the functional variant has at least 75% sequence identity or similarity to the nucleic acid sequence of said regulatory region sequence or a codon degenerate nucleotide sequence thereof.
According to a further embodiment of the present invention, the targeted nucleotide modification is selected from the group consisting of insertion, deletion, single nucleotide polymorphism (SNP), and a polynucleotide modification, such that the expression of the THCAS and/or CBDAS polynucleotide is reduced or affected.
According to a further embodiment of the present invention, the nucleotide modification is a missense mutation, nonsense mutation, insertion, deletion, indel, substitution or duplication.
According to a further embodiment of the present invention, the insertion or the deletion produces a gene comprising a frameshift.
According to a further embodiment of the present invention, the nucleotide modification is a silencing mutation, a knockdown mutation, a knockout mutation, a loss of function mutation or any combination thereof.
It is further within the scope of the current invention to provide a method for producing a Cannabis plant exhibiting reduced tetrahydrocannabinolic acid (THCA) content and comprising a modified genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) gene expression as compared to a control Cannabis plant, said method comprises steps of introducing one or more nucleotide modifications through a targeted DNA modification at a regulatory region controlling THCAS expression.
According to a further aspect, the present invention provides a method for reducing tetrahydrocannabinolic acid (THCA) content in a Cannabis plant by modifying genomic locus involved in tetrahydrocannabinolic acid synthase (THCAS) gene expression as compared to a control Cannabis plant, wherein the method comprises introducing one or more nucleotide modifications through a targeted DNA modification at a regulatory region controlling THCAS expression.
It is further within the scope of the current invention to provide a method for producing a Cannabis plant exhibiting reduced cannabidiolic acid (CBDA) content and comprising a modified genomic locus involved in cannabidiolic acid synthase (CBDAS) gene expression as compared to a control Cannabis plant, said method comprises steps of introducing one or more nucleotide modifications through a targeted DNA modification at a regulatory region controlling CBDAS expression.
According to a further aspect, the present invention provides a method for reducing cannabidiolic acid (CBDA) content in a Cannabis plant by modifying genomic locus involved in cannabidiolic acid synthase (CBDAS) gene expression as compared to a control Cannabis plant, wherein the method comprises introducing one or more nucleotide modifications through a targeted DNA modification at a regulatory region controlling CBDAS expression.
According to a further aspect, the present invention provides a method for producing a medical Cannabis composition, the method comprising: (a) obtaining the Cannabis plant as defined in any of the above; and (b) formulating a medical Cannabis composition from said plant.
The present invention further provides an isolated nucleotide sequence having at least 75% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:6 and SEQ ID NO:1150 to SEQ ID NO:1153.
According to further aspects, the present invention provides an isolated nucleotide sequence having at least 75% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:7 to SEQ ID NO:1149.
According to further aspects, the present invention provides a vector, construct or expression system or cassette comprising a nucleic acid sequence having at least 75% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO:1 to SEQ ID NO:1153.
Other embodiments of the present invention include the use of a nucleotide sequence having at least 75% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 7-91, 92-176, 177-278, 279-419, 420-560 and 561-681 for targeted genome modification of pFNTHCAS-1, pFNTHCAS-2, pPKTHCAS-1, pPKTHCAS-2, pPKTHCAS-3 and pPKTHCAS-4 gene, respectively.
Yet other embodiments of the present invention include the use of a nucleotide sequence having at least 75% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 682-794, 795-895, 896-1016 and 1017-1149 for targeted genome modification of pCBDAS2, pFNCBDAS, pPKCBDAS and pPKCBDAS 1 gene, respectively.
It is also within the scope of the present invention to provide the use of a nucleotide sequence having at least 75% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 7-681 and any combination thereof for reducing THCA content in a Cannabis plant.
It is also within the scope of the present invention to provide the use of a nucleotide sequence having at least 75% sequence identity to a nucleotide sequence as set forth in at least one of SEQ ID NO: 682-1149 and any combination thereof for reducing CBDA content in a Cannabis plant.
According to some embodiments of the present invention, the above in planta modification can be based on alternative gene silencing technologies such as Zinc Finger Nucleases (ZFN's), Transcription activator-like effector nucleases (TALEN's), RNA silencing, amiRNA or any other gene silencing technique known in the art.
According to some other embodiments of the present invention, DNA introduction into the plant cells can be done by Agrobacterium infiltration, viral based plasmids for virus induced gene silencing (VIGS) and by mechanical insertion of DNA (PEG mediated DNA transformation, biolistics, etc.).
In addition, it is within the scope of the present invention that the Cas9 protein is directly inserted together with a gRNA (ribonucleoprotein-RNP's) in order to bypass the need for in vivo transcription and translation of the Cas9+gRNA plasmid in planta to achieve gene editing.
It is a core aspect of the present invention that the above CRISPR/Cas system allows the modification of specific DNA sequences. This is achieved by combining the Cas nuclease (Cas9, Cpf1 or the like) with a guide RNA molecule (gRNA). The gRNA is designed such that it should be complementary to a specific DNA sequence targeted for editing in the plant genome and which guides the Cas nuclease to a specific nucleotide sequence (see
Without wishing to be bound by theory, according to further specific aspects of the present invention, upon reaching the specific DNA sequence, the Cas9 nuclease cleaves both DNA strands to create double stranded breaks leaving blunt ends. This cleavage site is then repaired by the cellular non homologous end joining DNA repair mechanism resulting in insertions or deletions which eventually creates a mutation around the cleavage site. The deletion form of the mutation consists of at least 1 base pair deletion. As a result of this base pair deletion the gene coding sequence is disrupted and the translation of the encoded protein is compromised either by a premature stop codon or disruption of a functional or structural property of the protein.
It is further within the scope that by introducing a gRNA with homology to a specific site of a gene described in
A reduction in the production of THC, CBD, or Cannabichromene will enhance production of the remaining metabolites in this shared pathway. For example, production of CBD and/or Cannabichromene is enhanced by inhibiting production of THC. THC production may be inhibited by inhibiting expression and/or activity of tetrahydrocannabinolic acid (THCA) synthase enzyme.
Described are certain embodiments of enhancing production of one or more secondary metabolites by downregulation of the production of one or more metabolites having a shared biosynthetic pathway.
The plant of the invention includes plants wherein the plant is heterozygous for the each of the mutations. In other embodiment however, the plant is homozygous for the mutations. Progeny that is also homozygous can be generated from these plants according to methods known in the art.
It is further within the scope that variants of a particular nucleotide or amino acid sequence according to the various aspects of the invention will have at least about 50%-99%, for example at least 75%, for example at least 85%, 86%, 87%, 88%, 89%, 90%, 92%, 94%, 95%, 96%, 97%, 98% or 99% or more sequence identity to that particular non-variant nucleotide sequence of the Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) allele as shown in any one of SEQ ID NO 1-6. Sequence alignment programs to determine sequence identity are well known in the art.
It is further within the scope that variants of a particular nucleotide or amino acid sequence according to the various aspects of the invention will have at least about 50%-99%, for example at least 75%, for example at least 85%, 86%, 87%, 88%, 89%, 90%, 92%, 94%, 95%, 96%, 97%, 98% or 99% or more sequence identity to that particular non-variant nucleotide sequence of the Cannabis cannabidiolic acid synthase (CsCBDAS) allele as shown in any one of SEQ ID NO 1150-1153. Sequence alignment programs to determine sequence identity are well known in the art.
Also, the various aspects of the invention encompass not only a Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or cannabidiolic acid synthase (CsCBDAS) promoter sequence, but also fragments thereof. By “fragment” is intended a portion of the nucleotide sequence or a portion of the amino acid sequence and hence of the protein encoded thereby. Fragments of a nucleotide sequence may encode protein fragments that retain the biological activity of the native protein, in this case cannabinoid biosynthesis enzymes.
It is further within the scope that manipulation of cannabinoid biosynthesis enzymes in Cannabis plants is herein achieved by generating gRNA with homology to a specific site of predetermined gene promoters in the Cannabis genome i.e. Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or cannabidiolic acid synthase (CsCBDAS) homologue or variant, sub cloning this gRNA into a plasmid containing the Cas9 gene, and insertion of the plasmid into the Cannabis plant cells. In this way site specific mutations in the promoter regions controlling the aforementioned genes are generated thus effectively creating non-active molecules, resulting in loss of function of at least one of the THCAS and/or CBDAS enzymes and reduced content of THCA (and/or THC) and/or CBDA (and/or CBD) in the genome edited plant.
In order to understand the invention and to see how it may be implemented in practice, a plurality of preferred embodiments will now be described, by way of non-limiting example only, with reference to the following examples.
Identifying Cannabis THCA Synthase (THCAS) and CBDA Synthase (CBDAS) Alleles for Targeted Genome Editing
This example demonstrates the identification of Cannabis genomic loci, particularly, promoter regions, targets for modulating THCA and/or CBDA content in the plant.
In order to downregulate the production of THCA and/or CBDA in the plant, Cannabis THCA synthesis (THCAS) and CBDA synthase (CBDAS) gene alleles or homologues/variants were identified. Particularly, the promoter regions of Cannabis THCA synthesis (THCAS) and CBDA synthase (CBDAS) gene alleles or homologues/variants were identified. The used strategy was to modify the expression of the identified genes by introducing targeted mutations at the promoter regions of THCAS and/or CBDAS variants by CRISPR/Cas technology. In this way, knock-down or loss-of-function mutations in regulatory sequences or elements necessary for the expression of one or more of the identified THCAS and/or CBDAS genes or alleles could be achieved, to substantially reduce the cannabinoid biosynthesis enzymes production in the plant, specifically in high THC or low THC Cannabis varieties.
The promoter regions (about 2 Kb upstream to the first ATG within the gene or allele of interest) of various THCAS and/or CBDAS gene variants have been identified in different Cannabis sativa (C. sativa) strains or varieties. The promoter regions were cloned and sequenced.
The following promoter sequences are herein exemplified:
pFNTHCAS-1: The promoter region of THCAS-1 variant of the hemp-type “Finola” cultivar was mapped to Chromosome 6 CM011610.1:22241180-22244169 and has a nucleic acid sequence as set forth in SEQ ID NO:1.
pFNTHCAS-2: The promoter region of THCAS-2 variant of “Finola” cultivar was mapped to Chromosome 3 CM011607.1:10147821-10150821 and has a nucleic acid sequence as set forth in SEQ ID NO:2.
pPKTHCAS-1: The promoter region of THCAS-1 variant of “Purple Kush” cultivar was mapped to Chromosome 8 CM010797.2:28650050-28653053 and has a nucleic acid sequence as set forth in SEQ ID NO:3.
pPKTHCAS-2: The promoter region of THCAS-2 variant of “Purple Kush” cultivar was mapped to Chromosome 7 CM010796.2:62086454-62089454 and has a nucleic acid sequence as set forth in SEQ ID NO:4.
pPKTHCAS-3: The promoter region of THCAS-3 variant of “Purple Kush” cultivar was mapped to Chromosome 4 CM010793.2:49188167-49191167 and has a nucleic acid sequence as set forth in SEQ ID NO:5.
pPKTHCAS-4: The promoter region of THCAS-4 variant of “Purple Kush” cultivar was mapped to Chromosome 3 CM010792.2:58197740-58200740 and has a nucleic acid sequence as set forth in SEQ ID NO:6.
pCBDAS2: The promoter region of Cannabidiolic acid synthase-like 1 (CBDAS2) variant was mapped to CM010797.2_52807720-52810660 and has a nucleic acid sequence as set forth in SEQ ID NO:1150.
pFNCBDAS: The promoter region of CBDAS variant of “Finola” cultivar was mapped to CM011610.1_21834038-21837038 and has a nucleic acid sequence as set forth in SEQ ID NO:1151.
pPKCBDAS: The promoter region of CBDAS variant of “Purple Kush” cultivar was mapped to CM010792.2_58197740-58200740 and has a nucleic acid sequence as set forth in SEQ ID NO:1152.
pPKCBDAS1: The promoter region of CBDAS1 variant of “Purple Kush” cultivar was mapped to CM010796.2_62086535-62089454 and has a nucleic acid sequence as set forth in SEQ ID NO:1153.
Targeted DNA Modification of Genomic Loci Involved in THCA and/or CBDA Synthesis
Targeted DNA modification of promoter regions encoding identified THCAS and/or CBDAS alleles was performed to affect cannabinoid profile and specifically THCA and/or CBDA content in Cannabis.
Toward this end, CRISPR/Cas9 assisted targeted genome editing in gene regulatory regions was used to alter the expression, or more specifically, knock-out any gene sequence of interest and generate knock-outs through small deletions, internal small fragment deletions within the target genetic locus, or full-length gene deletion.
The Cannabis plants of the present invention comprise CRISPR/Cas9 expression cassette that encodes a Cas9 endonuclease and at least one guide RNA (gRNA), each gRNA containing a sequence that is complementary to a target sequence within a regulatory region of an allele of a gene of interest, wherein the target region is 0 to 2000 base pairs upstream of the 5′ end of the coding sequence of the gene of interest or wherein the target region is 0 to 2000 base pairs downstream of the 3′ end of the coding sequence of the gene of interest. Through CRISPR-Cas genome editing targeted to the regulatory sequences of a gene of interest, new variations of the identified alleles are introduced directly in the elite target germplasm with minimal genetic drag associated with conventional breeding material. Examples of target THCAS gene alleles within the scope of the current invention include the “Finola” THCAS-1 and THCAS-2 variants (FNTHCAS-1 and FNTHCAS-2, respectively), and the Purple Kush THCAS-1, THCAS-2, THCAS-3 and THCAS-4 variants (PKTHCAS-1, PKTHCAS-2, PKTHCAS-3 and PKTHCAS-4, respectively).
Examples of target CBDAS gene alleles within the scope of the current invention include Cannabidiolic acid synthase-like 1 (CBDAS2) variant, the “Finola” CBDAS variant (FNCBDAS), and the Purple Kush CBDAS and CBDAS1 variants (PKCBDAS, and PKCBDAS1, respectively). Tables 1-10 below provide the guide RNA (gRNA) polynucleotide sequences and targeting strategies to knock-down expression of the target genes, and more specifically the promoter regions of the genes.
Production of Cannabis lines with mutated Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or mutated Cannabidiolic acid synthase (CsCBDAS) may be achieved by at least one of the following breeding/cultivation schemes:
Scheme 1:
Scheme 2:
It is noted that line stabilization may be performed by the following:
According to certain embodiments of the present invention, line stabilization requires about 6 self-crossing (6 generations) and done through a single seed descent (SSD) approach.
F1 hybrid seed production: Novel hybrids are produced by crosses between different Cannabis strains.
According to further aspects of the current invention, shortening line stabilization period is performed by Doubled Haploids (DH). More specifically, the CRISPR-Cas9 system is transformed into microspores to achieve DH homozygous parental lines. It is noted that a doubled haploid (DH) is a genotype formed when haploid cells undergo chromosome doubling. Artificial production of doubled haploids is important in plant breeding. It is herein acknowledged that conventional inbreeding procedures take about six generations to achieve approximately complete homozygosity, whereas doubled haploidy achieves it in one generation.
It is within the scope of the current invention that genetic markers specific for Cannabis are developed and provided by the current invention:
It is further within the scope of the current invention that allele and genetic variation is analyzed for the Cannabis strains or cultivars used.
Reference is now made to optional stages that have been used for the production of Cannabis plants with mutated tetrahydrocannabinolic acid synthase (CsTHCAS) and/or mutated Cannabidiolic acid synthase (CsCBDAS) by genome editing targeted to the promoter region of the gene:
Stage 1: Identifying Cannabis sativa (C. sativa) tetrahydrocannabinolic acid synthase (THCAS) and Cannabidiolic acid synthase (CBDAS) orthologues/homologs or alleles from various cultivars/strains characterized by different THCA and or CBDA content or profile, for example “Finola” (FN) hemp-type cultivar with relatively low THCA level and “Purple Kush” (PK) potent-type strain with relatively high THCA level.
The following homologs have herein been identified in Cannabis sativa (C. sativa). These genes have been sequenced and mapped.
pFNTHCAS-1: The promoter region of THCAS-1 variant or allele of the hemp-type “Finola” cultivar was mapped to Chromosome 6 CM011610.1:22241180-22244169 and has a nucleic acid sequence as set forth in SEQ ID NO:1.
pFNTHCAS-2: The promoter region of THCAS-2 variant of “Finola” cultivar was mapped to Chromosome 3 CM011607.1:10147821-10150821 and has a nucleic acid sequence as set forth in SEQ ID NO:2.
pPKTHCAS-1: The promoter region of THCAS-1 variant of “Purple Kush” cultivar was mapped to Chromosome 8 CM010797.2:28650050-28653053 and has a nucleic acid sequence as set forth in SEQ ID NO:3.
pPKTHCAS-2: The promoter region of THCAS-2 variant of “Purple Kush” cultivar was mapped to Chromosome 7 CM010796.2:62086454-62089454 and has a nucleic acid sequence as set forth in SEQ ID NO:4.
pPKTHCAS-3: The promoter region of THCAS-3 variant of “Purple Kush” cultivar was mapped to Chromosome 4 CM010793.2:49188167-49191167 and has a nucleic acid sequence as set forth in SEQ ID NO:5.
pPKTHCAS-4: The promoter region of THCAS-4 variant of “Purple Kush” cultivar was mapped to Chromosome 3 CM010792.2:58197740-58200740 and has a nucleic acid sequence as set forth in SEQ ID NO:6.
pCBDAS2: The promoter region of Cannabidiolic acid synthase-like 1 (CBDAS2) variant was mapped to CM010797.2_52807720-52810660 and has a nucleic acid sequence as set forth in SEQ ID NO:1150.
pFNCBDAS: The promoter region of CBDAS variant of “Finola” cultivar was mapped to CM011610.1_21834038-21837038 and has a nucleic acid sequence as set forth in SEQ ID NO:1151.
pPKCBDAS: The promoter region of CBDAS variant of “Purple Kush” cultivar was mapped to CM010792.2_58197740-58200740 and has a nucleic acid sequence as set forth in SEQ ID NO:1152.
pPKCBDAS1: The promoter region of CBDAS1 variant of “Purple Kush” cultivar was mapped to CM010796.2_62086535-62089454 and has a nucleic acid sequence as set forth in SEQ ID NO:1153.
Stage 2: Designing and synthesizing gRNA molecules corresponding to the sequences targeted for editing, i.e. sequences of the promoter region of each of the Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or Cannabidiolic acid synthase (CsCBDAS) gene variants or homologues. It is noted that the editing event is preferably targeted to a unique restriction site sequence to allow easier screening for plants carrying an editing event within their genome.
According to some aspects of the invention, the nucleotide sequence of the gRNAs should be compatible with the genomic sequence of the target genomic loci sequence. Therefore, for example, suitable gRNA molecules should be constructed for different Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or Cannabidiolic acid synthase (CsCBDAS) homologues of different Cannabis strains.
Reference is now made to Tables 1-10 presenting gRNA nucleic acid sequences targeted to the promoter region of various herein identified Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or Cannabidiolic acid synthase (CsCBDAS) homologues. In the aforementioned tables, the term ‘PAM’ refers to ‘protospacer adjacent motif’, which is a 2-6 base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The CsTHCAS and/or CsCBDAS genomic DNA sense strand is marked as “1”, and the antisense strand is marked as “−1”.
Reference is made to Table 11 presenting a summary of the sequences within the scope of the current invention.
The above gRNA polynucleotides have been cloned into suitable vectors and their sequence has been verified. In addition, different Cas9 versions have been analyzed for optimal compatibility between the Cas9 protein activity and the gRNA polynucleotide in the Cannabis plant.
The efficiency of the designed gRNA polynucleotides may be validated by transiently transforming Cannabis tissue culture. A plasmid carrying a predetermined gRNA sequence together with the Cas9 gene has been transformed into Cannabis cells/protoplasts. The protoplast cells have been grown for a short period of time and then were analyzed for existence of genome editing events. The positive constructs were subjected to the herein established stable transformation protocol into Cannabis plant tissue for producing genome edited Cannabis plants within the promoter region of tetrahydrocannabinolic acid synthase (CsTHCAS) and/or Cannabidiolic acid synthase (CsCBDAS) genes.
Stage 3: Transforming Cannabis plants using Agrobacterium or biolistics (gene gun) methods. For Agrobacterium and bioloistics, a DNA plasmid carrying (Cas9+gene specific gRNA) can be used. A vector containing a selection marker, Cas9 gene and relevant gene/promoter specific gRNA's is constructed. For biolistics, Ribonucleoprotein (RNP) complexes carrying (Cas9 protein+gene/promoter specific gRNA) are used. RNP complexes are created by mixing the Cas9 protein with relevant gene specific gRNA's.
According to some embodiments of the present invention, transformation of various Cannabis tissues was performed using particle bombardment of:
According to further embodiments of the present invention, transformation of various Cannabis tissues was performed using Agrobacterium (Agrobacterium tumefaciens) by:
Transformation efficiency by A. tumefaciens has been compared to the bombardment (biolistics system) method by transient GUS transformation experiment. After transformation, GUS staining of the transformants has been performed.
According to further embodiments of the present invention, additional transformation tools were used in Cannabis, including, but not limited to:
Stage 4: Regeneration in tissue-culture. When transforming DNA constructs into the plant, antibiotics is used for selection of positive transformed plants. An improved regeneration protocol was herein established for the Cannabis plant.
Reference is now made to
Stage 5: Selection of positive transformants. Once regenerated plants appear in tissue culture, DNA is extracted from leaf sample of the transformed plant and PCR is performed using primers flanking the edited region. PCR products are then digested with enzymes recognizing the restriction site near the original gRNA sequence. If editing event occurred, the restriction site will be disrupted and the PCR product will not be cleaved. No editing event will result in a cleaved PCR product.
Reference is now made to
Screening for CRISPR/Cas9 gene editing events has been performed by at least one of the following analysis methods:
Reference is now made to
Stage 6: Selection of transformed Cannabis plants presenting reduced expression of at least one of Cannabis tetrahydrocannabinolic acid synthase (CsTHCAS) and/or Cannabidiolic acid synthase (CsCBDAS) homologue or variant as described above. It is within the scope that different gRNA promoters were tested in order to maximize editing efficiency.
The present invention offers targeted genome editing within promoter regions of genes or alleles encoding THCAS and/or CBDAS cannabinoid synthesis enzyme that can be used to modulate the cannabinoid profile, particularly, THCA and/or CBDA content in the Cannabis plant. Upon modifying the promoter region of THCAS and/or CBDAS genes by the method of the present invention, a Cannabis plant with desirable phenotype of altered, and more specifically reduced THCA (or THC) and/or CBDA (or CBD) levels, with no known pleotropic effects as compared to a wild-type control Cannabis plant, can be achieved. For example, downregulation of THCAS expression by a knock down mutation in its promoter region, reduce THCA level and may also enhance the level of CBDA synthesis. Downregulation of CBDAS expression by a knock down mutation in its promoter region, reduce CBDA level and may also enhance the level of THCA synthesis. Downregulation of both THCAS and CBDAS expression by a knock down mutation at their promoter region, reduce both THCA and CBDA levels and may also enhance the level of their substrate CBGA in the plant. In this way, the concentration level of predetermined cannabinoids such as THCA, CBDA and/or CBGA in a Cannabis variety can be adjusted (reduced or elevated) as desired. For example, any Cannabis variety or cultivar with a high THCA or THC level can be converted into a low level THCA plant variety.
The guide RNA/Cas9 endonuclease system is used to target and induce a double strand break at a Cas9 endonuclease target site located within the regulatory regions operably linked to the herein identified genes encoding THCA and/or CBDA synthase homologues or variants or alleles. Cannabis cultivars or varieties comprising targeted mutations within the promoter region of THCAS and/or CBDAS gene alleles were selected and evaluated for their phenotypic effect on cannabinoid expression profiled in the plant.
Various embodiments have been presented. Each of these embodiments may of course include features from other embodiments presented, and embodiments not specifically described may include various features described herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IL2021/050847 | 7/11/2021 | WO |
Number | Date | Country | |
---|---|---|---|
62705719 | Jul 2020 | US |