The present invention, in some embodiments thereof, relates to DICER-liker knock-out plant cells.
Production of proteins in plants for human health applications offers some advantages, including ease of scaling and lack of human and animal pathogens (Donini and Marusic, 2019; Loh et al., 2017; Tekoah et al., 2015). The first plant-derived pharmaceutical recombinant protein was approved in 2012 for commercial use in humans (Mor, 2016) and the second enzyme alpha-galactosidase-A for the treatment of the Fabry disease that is currently tested in phase III clinical trials (Kizhner et al., 2015; Ruderfer et al., 2018). Despite the success, plant expression systems are characterized by low yields as a result of gene silencing that is one of the fundamental regulatory and defense mechanism controlling gene expression, developmental regulation and epigenetic modifications (Baulcombe, 2004; Moazed, 2009).
RNA interference occurs either by transcriptionally gene silencing (TGS) related to DNA modification by RNA-directed DNA methylation (RdDM) or post-transcriptional gene silencing (PTGS) related to RNA modification by degradation of mRNA or blocking the translation of RNA transcripts. In both situations, small RNA (siRNA) molecule, 21-24 nt in length, are produced (Ghildiyal and Zamore, 2009). Several proteins are known to be involved in these processes. RNA-dependent RNA polymerases (RDRs) are a family of proteins that produce double-stranded RNA (dsRNA) molecules using aberrant single-stranded RNAs, such as viral RNAs or transgene-derived mRNAs, as templates. The endoribonuclease (RNase III) enzymes known in plants as DICER like (DCL), produce the siRNAs or miRNAs (21-24 nt) from long double-stranded RNA or hairpin-loop-structured RNAs (Ji, 2008). A dsRNA-binding protein recruits a Dicer-siRNA complex to Argonaute (Ago) family of proteins, and the Ago cleaves the anti-guide strand of the siRNA duplex (Hammond et al., 2001). The remaining single-stranded (ss) antisense siRNA along with the Ago associates with other RNA-binding proteins that assemble into RNA-induced silencing complex (RISC) (Martinez et al., 2002). In the cytoplasm, mRNA targets can be cleaved via RISC slicer activity by Ago to silence targeted genes, known as PTGS (Rand et al., 2005). In the nucleus, RISC can take the form of an RNA-induced transcriptional silencing (RITS) complex, which interacts with RNA polymerase II (PoIII) and nascent RNA transcripts and directs chromatin remodeling to achieve epigenetic silencing through RdDM (Pratt and MacRae, 2009; Verdel et al., 2004).
Plant genomes contain at least four distinct classes of DCL family proteins (DCL1-4) that play a central role in gene silencing. Like their animal counterparts, each class of DCL participates in a specific function (Fukudome and Fukuhara, 2016). DCL1 gives rise to 21 nucleotides (nt) long siRNAs from miRNA precursors, which are transcribed from non-coding genes (Choudhary et al., 2019). The main activity of DCL3 is to process long dsRNA derived from heterochromatic regions of the genome into 24-nt siRNAs. These molecules then direct DNA methylation of target sequences (Blevins et al., 2015). DCL2 and DCL4 are involved in virus induced RNA silencing (VIGS) and transgene silencing. DCL2 gives rise to 22 (nt) siRNAs and DCL4 is responsible for the production of 21-nt siRNAs (Chen et al., 2018; Parent et al., 2015).
Plants genomes contain six RDRs, divided into two groups: RDRα consisting of RDR1, RDR2 and RDR6 and RDR7 consisting of RDR3, RDR4 and RDR5. RDR1, RDR2, and RDR6 genes share the C-terminal canonical catalytic DLDGD (SEQ ID NO: 32) motif of eukaryotic RDRs, which are involved in plant antiviral and transgene silencing (Wassenegger and Krczal, 2006).
Induced mutations and RNAi technology were implemented in plants to knock-down DCL or RDR genes involved in silencing (e.g. Daxinger et al., 2008; Parent et al., 2015; Seta et al., 2017; Yoshikawa et al., 2005, Konstantina Katsarou et al.Mol Plant Pathol. 2019 March; 20(3): 432-446; Matsuo and Matsumura, 2017; Qin et al., 2017; Suzuki et al., 2019, Deleris et al. Science. 2006 Jul. 7; 313(5783):68-71; Xie et al. Proc Natl Acad Sci USA. 2005 Sep. 6; 102(36): 12984-12989; Hernan Garcia-Ruiz et al. Plant Cell. 2010 February; 22(2): 481-496; Polydore S and Axtell M J (2018) The Plant Journal 94:1051-1063; Matsuo and Atsumi G (2019) Planta 250:463-473; International Patent Application Publication Nos. WO2015054602 and WO2019222379; and US Patent Application Publication Nos. US20120240288 and US20100050294). Notably, few of these studies show that DCL suppression results in distinct developmental defects and higher susceptibility to certain RNA viruses.
According to an aspect of some embodiments of the present invention there is provided an isolated plant cell in suspension comprising loss of function mutations in all alleles of at least two genes selected from the group consisting of DCL2, DCL4, RDR1, RDR2 and RDR6 in the plant cell.
According to some embodiments of the invention, the loss of function mutations in the DCL2 are in a region shared by all alleles of the DCL2 in the plant cell.
According to some embodiments of the invention, the loss of function mutations in the DCL2 are located within nucleic acid residues 288-307 and/or 881-900 corresponding to SEQ ID NO: 74; and/or nucleic acid residues 288-307 and/or 881-900 corresponding to SEQ ID NO: 75.
According to some embodiments of the invention, the region shared by all alleles of the DCL2 gene comprises a sequence selected from the group consisting of SEQ ID NO: 1-2.
According to some embodiments of the invention, the loss of function mutations in the DCL4 are in a region shared by all alleles of the DCL4 in the plant cell.
According to some embodiments of the invention, the loss of function mutations in the DCL4 are located within nucleic acid residues 967-986 and/or 1407-1426 corresponding to SEQ ID NO: 76; and/or nucleic acid residues 931-950 and/or 1371-1390 corresponding to SEQ ID NO: 77.
According to some embodiments of the invention, the region shared by all alleles of the DCL4 gene comprises a sequence a sequence selected from the group consisting of SEQ ID NO: 3-4.
According to some embodiments of the invention, the loss of function mutations in the RDR1 are in a region shared by all alleles of the RDR1 in the plant cell.
According to some embodiments of the invention, the loss of function mutations in the RDR1 are located within nucleic acid residues 916-937 and/or 962-984 corresponding to SEQ ID NO: 78; and/or nucleic acid residues 921-937 and/or 962-984 corresponding to SEQ ID NO: 79.
According to some embodiments of the invention, the region shared by all alleles of the RDR1 gene comprises a sequence a sequence selected from the group consisting of SEQ ID NO: 26-27.
According to some embodiments of the invention, the loss of function mutations in the RDR2 are in a region shared by all alleles of the RDR2 in the plant cell.
According to some embodiments of the invention, the loss of function mutations in the RDR2 are located within nucleic acid residues 575-553 and/or 589-612 corresponding to SEQ ID NO: 80; and/or nucleic acid residues 575-553 and/or 589-612 corresponding to SEQ ID NO: 81.
According to some embodiments of the invention, the region shared by all alleles of the RDR2 gene comprises a sequence a sequence selected from the group consisting of SEQ ID NO: 28-29.
According to some embodiments of the invention, the loss of function mutations in the RDR6 are in a region shared by all alleles of the RDR6 in the plant cell.
According to some embodiments of the invention, the loss of function mutations in the RDR6 are located within nucleic acid residues 2767-2785 and/or 2820-2839 corresponding to SEQ ID NO: 82; nucleic acid residues 49-67 and/or 102-121 corresponding to SEQ ID NO: 83; and/or nucleic acid residues 49-67 and/or 102-121 corresponding to SEQ ID NO: 84.
According to some embodiments of the invention, the region shared by all alleles of the RDR6 gene comprises a sequence a sequence selected from the group consisting of SEQ ID NO: 30-31.
According to some embodiments of the invention, the plant cell has reduced expression and/or activity of a glycosylation enzyme as compared to a control plant cell of the same genetic background not subjected to an agent which downregulates expression and/or activity of the glycosylation enzyme.
According to some embodiments of the invention, the isolated plant cell comprising a heterologous nucleic acid sequence for expressing an expression product of interest.
According to an aspect of some embodiments of the present invention there is provided a method of expressing a recombinant expression product of interest in a plant cell, the method comprising culturing the cell under condition which allow expression of the expression product of interest.
According to an aspect of some embodiments of the present invention there is provided a method of abolishing expression and/or activity of at least two genes selected from the group consisting of DCL2, DCL4, RDR1, RDR2 and RDR6 in a plant cell, the method comprising introducing into an isolated plant cell in suspension an agent capable of introducing loss of function mutations in all alleles of at least two genes selected from the group consisting of DCL2, DCL4, RDR1, RDR2 and RDR6 in the plant cell.
According to some embodiments of the invention, the method comprises introducing into the isolated plant cell an agent capable of downregulating expression and/or activity of a glycosylation enzyme.
According to some embodiments of the invention, the plant cell has reduced expression and/or activity of a glycosylation enzyme as compared to a control plant cell of the same genetic background not subjected to an agent which downregulates expression and/or activity of the glycosylation enzyme.
According to some embodiments of the invention, the glycosylating enzyme comprises xylosyltransferase and/or fucosyltransferase.
According to some embodiments of the invention, the agent is a genome editing agent.
According to some embodiments of the invention, the genome editing agent is selected from the group consisting of CRISPR/Cas system, Zinc finger nuclease (ZFN), transcription-activator like effector nuclease (TALEN) or meganuclease.
According to some embodiments of the invention, the agent comprises a DCL2 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1-2.
According to some embodiments of the invention, the agent comprises a DCL4 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 3-4.
According to some embodiments of the invention, the agent comprises a RDR1 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 26-27.
According to some embodiments of the invention, the agent comprises a RDR2 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 28-29.
According to some embodiments of the invention, the agent comprises a RDR6 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 30-31.
According to some embodiments of the invention, the method further comprising expressing in the isolated plant cell a recombinant expression product of interest other than the agent.
According to some embodiments of the invention, the loss of function mutations abolish expression of the at least two genes, as determined by RT-PCR.
According to some embodiments of the invention, the loss of function mutations abolish expression and/or activity of the DCL2 and DCL4, as determined by no expression of transgene specific 21-nt and 22-nt siRNAs in the plant cell following expression of the transgene in the plant cell.
According to some embodiments of the invention, the plant is selected from the group consisting of Tobacco, Arabidopsis, Aloe Vera, grape seeds, oil palm, plantain, pine, banana, date, eggplant, jojoba, pineapple, rubber tree, cassava, yam, sweet potato and tomato.
According to some embodiments of the invention, the plant is a Tobacco plant.
According to some embodiments of the invention, the Tobacco is Nicotiana tabacum.
According to some embodiments of the invention, the plant cell is a BY-2 line cell.
Unless otherwise defined, all technical and/or scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of the invention, exemplary methods and/or materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and are not intended to be necessarily limiting.
Some embodiments of the invention are herein described, by way of example only, with reference to the accompanying drawings. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of embodiments of the invention. In this regard, the description taken with the drawings makes apparent to those skilled in the art how embodiments of the invention may be practiced.
In the drawings:
The present invention, in some embodiments thereof, relates to DICER-liker knock-out plant cells.
Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not necessarily limited in its application to the details set forth in the following description or exemplified by the Examples. The invention is capable of other embodiments or of being practiced or carried out in various ways.
The production of recombinant proteins in plants has many advantages. However, plant expression systems are characterized by low yields as a result of gene silencing. Plants DICER like (DCL) and RNA-dependent RNA polymerases (RDRs) proteins are key components participating in gene silencing.
While reducing specific embodiments of the present invention to practice, the present inventors have now generated DCL2- and DCL4 knockout plant cells. Furthermore, these cells expressed higher amounts of a recombinant protein as compared to their wild type counterparts.
Specifically, the present inventors used CRISPR/Cas9 technology to develop DCL2- and DCL4 knockout Nicotiana tabacum BY2 cells which are fully mutated in all gene alleles (Examples 1-3 of the Examples section which follows). These knocked-out BY2 cells were viable, and their growth rate was similar to native BY2 cells. Furthermore, the expression level of a recombinant protein (Rasburicase) in the DCL2- and DCL4 knockout cell lines was up to 10 fold higher compared to the expression level in the wild-type cells. Corroborating the correlation between DCL2- and DCL4 knock-out and the increased expression of a recombinant protein, the present inventors show that the knock-out lines do not produce transgene specific 21 bp and 22 bp siRNA.
Consequently, specific embodiments of the present invention propose isolated DCL2-and DCL4 knock-out plant cells and methods of generating same. Following, these novel plant cells may be used for e.g. expression of recombinant proteins.
Thus, according to one aspect of the present invention, there is provided a method of abolishing expression and/or activity of at least two genes selected from the group consisting of DCL2, DCL4, RDR1, RDR2 and RDR6in a plant cell, the method comprising introducing into an isolated plant cell in suspension an agent capable of introducing loss of function mutations in all alleles of at least two genes selected from the group consisting of DCL2, DCL4, RDR1, RDR2 and RDR6in said plant cell.
According to specific embodiments, the at least two genes comprise at least DCL2 and DCL4, DCL2 and RDR1, DCL2 and RDR2, DCL2 and RDR6, DCL4 and RDR1, DCL4 and RDR2, DCL4 and RDR6, RDR1 and RDR2, RDR1 and RDR6, or RDR2 and RDR6, each possibility represents a separate embodiments of the invention.
According to specific embodiments, the at least two genes comprise at least three genes, at least four genes or all of the 5 genes.
According to specific embodiments, the at least three genes comprise at least DCL2+DCL4+RDR1, DCL2+DCL4+RDR2, DCL2+DCL4+RDR6, DCL2+RDR1+RDR2, DCL2+RDR1+RDR6, DCL2+RDR2+RDR6, DCL4+RDR1+RDR2, DCL4+RDR1+RDR6, DCL4+RDR2+RDR6, RDR1+RDR2+RDR6, each possibility represents a separate embodiments of the invention.
The agent may be a single agent targeting the different genes or several distinct agents each targeting at least one gene.
Thus, according to specific embodiments, the agent comprises at least two, at least three, at least four or at least 5 distinct agents.
DCL (DICER like) are endoribonuclease enzymes belonging to the RNase III family. DCL enzymes cleave double-stranded RNA (dsRNA) or hairpin-loop-structured RNAs and pre-microRNA (pre-miRNA) into short single-stranded RNA fragments called small interfering RNA and microRNA, respectively. Plant genomes contain at least four distinct classes of DCL family proteins.
As used herein, the term “DCL2” refers to a gene encoding endoribonuclease Dicer like 2. DCL2 creates 22-nt long siRNA from cis-acting antisense transcripts which aid in viral immunity and defense.
The Nicotiana tabacum, TN90 for example, comprises two DCL2 genes, their sequences can be obtained from known databases such as solgenomics (www(dot)solgenomics(dot)net). Exemplary contigs include:
Exemplary cDNA sequences of Nicotiana tabacum DCL2A and DCL2B are provided in SEQ ID NOs: 74 and 75, respectively.
As used herein, the term “DCL4” refers to a gene encoding endoribonuclease Dicer like 4. DCL4 is involved in trans-acting siRNA metabolism and transcript silencing at the post-transcriptional level and creates 21-nt long siRNA.
The Nicotiana tabacum, TN90 for example, comprises two DCL4 genes, their sequences can be obtained from known databases such as solgenomics (www(dot)solgenomics(dot)net). Exemplary contigs include:
Exemplary cDNA sequences of Nicotiana tabacum DCL2A and DCL2B are provided in SEQ ID NOs: 76 and 77, respectively.
RNA-dependent RNA polymerase (RDR), also known as RNA replicase, EC NO: 2.7.7.48, is an enzyme that catalyzes the replication of RNA from an RNA template. There are three major clades of eukaryotic RDRs-RDRα, RDRβ, and RDRγ, while RDRβwas likely lost in the plant lineage. Plant genomes contain at least 6 RDR family proteins. RDRα genes have duplicated in plants to yield separate RDR1, RDR2, and RDR6 subgroups and RDRγ genes have duplicated to yield RDR3, RDR4, and RDR5 subgroups [Zong et al., (2009), Gene 447: 29-39].
As used herein, the term “RDR1” refers to a gene encoding RNA-dependent RNA polymerase 1. RDR1 is involved in pathogen resistance and stress response. RDR1 contributes to the production of vsRNAs (21-nt, 22-nt) and the antiviral defense conferred through these vsRNAs.
The Nicotiana tabacum, TN90 for example, comprises 2 RDR1 genes, their sequences can be obtained from known databases such as solgenomics (www(dot)solgenomics(dot)net). Exemplary contigs include:
Exemplary cDNA sequences of Nicotiana tabacum RDR1A and RDR1B are provided in SEQ ID NOs: 78 and 79, respectively.
As used herein, the term “RDR2” refers to a gene encoding RNA-dependent RNA polymerase 2. RDR2 is involved in transgene silencing and essential for the biogenesis of endogenous siRNA (female gamete formation, genome maintenance etc.). It leads to the formation of 24-nt and cytosine methylation, histone modification and RNA-directed DNA methylation (RdDM).
The Nicotiana tabacum, TN90 for example, comprises 2 RDR2 genes, their sequences can be obtained from known databases such as solgenomics (www(dot)solgenomics(dot)net). Exemplary contigs include:
Exemplary cDNA sequences of Nicotiana tabacum RDR2A and RDR2B are provided in SEQ ID NOs: 80 and 81, respectively.
As used herein, the term “RDR6” refers to a gene encoding RNA-dependent RNA polymerase 6. RDR6 participates in the production of transacting tasiRNAs and natural antisense siRNAs and was also found to be essential for sense transgene-induced posttranscriptional gene silencing (S-PTGS). RDR6 contributes to the antiviral defense conferred through these vsRNAs (21-nt, 24-nt).
The Nicotiana tabacum, TN90 for example, comprises 3 RDR6 genes, their sequences can be obtained from known databases such as solgenomics (www(dot)solgenomics(dot)net). Exemplary contigs include:
Exemplary cDNA sequences of Nicotiana tabacum RDR6A, RDR6B and RDR6C are provided in SEQ ID NOs: 82, 83 and 84, respectively.
As used herein, the term “isolated plant cell” refers to a plant cell at least partially separated from the natural plant (or part thereof).
In some embodiments, the isolated cell is a plant cell in a suspension culture.
Suspension culture means that the cells are not part of a tissue, but are rather floating as single cells or clusters of not more than 100 cells in a culture medium.
Suitable devices and methods for culturing plant cells in suspension are known in the art, for example, as described in International Patent Application PCT IL2008/000614.
Non-limiting examples of plants useful in the methods of the invention include all plants which belong to the superfamily Viridiplantae, in particular monocotyledonous and dicotyledonous plants including a fodder or forage legume, ornamental plant, food crop, tree, or shrub selected from the list comprising Acacia spp., Acer spp., Actinidia spp., Aesculus spp., Agathis australis, Albizia amara, Alsophila tricolor, Andropogon spp., Arachis spp, Areca catechu, Astelia fragrans, Astragalus cicer, Baikiaea plurijuga, Betula spp., Brassica spp., Bruguiera gymnorrhiza, Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spp, Camellia sinensis, Cannabaceae, Cannabis indica, Cannabis, Cannabis sativa, Hemp, industrial Hemp, Capsicum spp., Cassia spp., Centroema pubescens, Chacoomeles spp., Cinnamomum cassia, Coffea arabica, Colophospermum mopane, Coronillia varia, Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., Cyathea dealbata, Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea dealbata, Cydonia oblonga, Dalbergia monetaria, Davallia divaricata, Desmodium spp., Dicksonia squarosa, Dibeteropogon amplectens, Dioclea spp, Dolichos spp., Dorycnium rectum, Echinochloa pyramidalis, Ehraffia spp., Eleusine coracana, Eragrestis spp., Erythrina spp., Eucalypfus spp., Euclea schimperi, Eulalia vi/losa, Pagopyrum spp., Feijoa sellowlana, Fragaria spp., Flemingia spp, Freycinetia banksli, Geranium thunbergii, GinAgo biloba, Glycine javanica, Gliricidia spp, Gossypium hirsutum, Grevillea spp., Guibourtia coleosperma, Hedysarum spp., Hemaffhia altissima, Heteropogon contoffus, Hordeum vulgare, Hyparrhenia rufa, Hypericum erectum, Hypeffhelia dissolute, Indigo incamata, Iris spp., Leptarrhena pyrolifolia, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetia simplex, Lotonus bainesli, Lotus spp., Macrotyloma axillare, Malus spp., Manihot esculenta, Medicago saliva, Metasequoia glyptostroboides, Musa sapientum, Nicotianum spp., Onobrychis spp., Ornithopus spp., Oryza spp., Peltophorum africanum, Pennisetum spp., Persea gratissima, Petunia spp., Phaseolus spp., Phoenix canariensis, Phormium cookianum, Photinia spp., Picea glauca, Pinus spp., Pisum sativam, Podocarpus totara, Pogonarthria fleckii, Pogonaffhria squarrosa, Populus spp., Prosopis cineraria, Pseudotsuga menziesii, Pterolobium stellatum, Pyrus communis, Quercus spp., Rhaphiolepsis umbellata, Rhopalostylis sapida, Rhus natalensis, Ribes grossularia, Ribes spp., Robinia pseudoacacia, Rosa spp., Rubus spp., Salix spp., Schyzachyrium sanguineum, Sciadopitys vefficillata, Sequoia sempervirens, Sequoiadendron giganteum, Sorghum bicolor, Spinacia spp., Sporobolus fimbriatus, Stiburus alopecuroides, Stylosanthos humilis, Tadehagi spp, Taxodium distichum, Themeda triandra, Trifolium spp., Triticum spp., Tsuga heterophylla, Vaccinium spp., Vicia spp., Vitis vinifera, Watsonia pyramidata, Zantedeschia aethiopica, Zea mays, amaranth, artichoke, asparagus, broccoli, Brussels sprouts, cabbage, canola, carrot, cauliflower, celery, collard greens, flax, kale, lentil, oilseed rape, okra, onion, potato, rice, soybean, straw, sugar beet, sugar cane, sunflower, tomato, squash tea, trees. Alternatively algae and other non-Viridiplantae can be used for the methods of some embodiments of the invention.
According to some embodiments of the invention, the plant is an edible and/or non-toxic plant, which is amenable to genetic modification so as to bring about expression from the nucleic acid construct.
According to some embodiments of the invention, the plant is a crop plant such as, but not limited to, rice, maize, wheat, barley, peanut, potato, sesame, olive tree, palm oil, banana, soybean, sunflower, canola, sugarcane, alfalfa, millet, leguminosae (bean, pea), flax, lupinus, rapeseed, tobacco, tomato, carrot, cucumber, melon, grapes, while clover, celery, ginger, horseradish, poplar and cotton.
According to a specific embodiment, the plant is a carrot plant.
According to specific embodiments, the plant is selected from the group consisting of Tobacco, Arabidopsis, Aloe Vera, grape seeds, oil palm, plantain, pine, banana, date, eggplant, jojoba, pineapple, rubber tree, cassava, yam, sweet potato and tomato.
According to a specific embodiment, the plant is a Tobacco plant.
According to a specific embodiment, the Tobacco plant is Nicotiana tabacum.
According to a specific embodiment, the Tobacco plant is Nicotiana benthamiana.
In one embodiment the Tobacco cells are from a Tobacco cell line, such as, but not limited to Nicotiana tabacum L. cv Bright Yellow (BY-2) cells.
The isolated plant cell of some embodiments of the present invention is introduced with an agent capable of introducing loss of function mutations.
As used herein, the phrase “loss of function mutations” refers to any mutation in the DNA sequence of a gene (e.g., DCL2, DCL4, RDR1, RDR2, RDR6) which results in downregulation of the expression level and/or activity of the expressed product, i.e., the mRNA transcript and/or the translated protein. Non-limiting examples of such loss of function mutations include a missense mutation, i.e., a mutation which changes an amino acid residue in the protein with another amino acid residue and thereby abolishes the enzymatic activity of the protein; a nonsense mutation, i.e., a mutation which introduces a stop codon in a protein, e.g., an early stop codon which results in a shorter protein devoid of the enzymatic activity; a frame-shift mutation, i.e., a mutation, usually, deletion or insertion of nucleic acid(s) which changes the reading frame of the protein, and may result in an early termination by introducing a stop codon into a reading frame (e.g., a truncated protein, devoid of the enzymatic activity), or in a longer amino acid sequence (e.g., a readthrough protein) which affects the secondary or tertiary structure of the protein and results in a non-functional protein, devoid of the enzymatic activity of the non-mutated polypeptide; a readthrough mutation due to a frame-shift mutation or a modified stop codon mutation (i.e., when the stop codon is mutated into an amino acid codon), with an abolished enzymatic activity; a promoter mutation, i.e., a mutation in a promoter sequence, usually 5′ to the transcription start site of a gene, which results in down-regulation of a specific gene product; a regulatory mutation, i.e., a mutation in a region upstream or downstream, or within a gene, which affects the expression of the gene product; a deletion mutation, i.e., a mutation which deletes coding nucleic acids in a gene sequence and which may result in a frame-shift mutation or an in-frame mutation (within the coding sequence, deletion of one or more amino acid codons); an insertion mutation, i.e., a mutation which inserts coding or non-coding nucleic acids into a gene sequence, and which may result in a frame-shift mutation or an in-frame insertion of one or more amino acid codons; an inversion, i.e., a mutation which results in an inverted coding or non-coding sequence; a splice mutation i.e., a mutation which results in abnormal splicing or poor splicing; and a duplication mutation, i.e., a mutation which results in a duplicated coding or non-coding sequence, which can be in-frame or can cause a frame-shift.
According to specific embodiments loss of function mutation of a gene may comprise at least one allele of the gene.
According to specific embodiments loss of function mutation of a gene comprises more than one allele of the gene.
According to specific embodiments loss of function mutation of a gene comprises all alleles of the gene.
The term “allele” as used herein, refers to any of one or more alternative forms of a gene locus, all of which alleles relate to a trait or characteristic. In a diploid cell or organism, the two alleles of a given gene occupy corresponding loci on a pair of homologous chromosomes.
As DCL2, DCL4, RDR1 and RDR2 are encoded by two distinct genes in Nicotinana plants e.g. Nicotiana tabacum, according to specific embodiments, the loss of function mutation comprises all four alleles of the gene.
As RDR6 is encoded by three distinct genes in Nicotinana plants e.g. Nicotiana tabacum, according to specific embodiments, the loss of function mutation comprises all six alleles of the gene.
In such instances the agent may be a single agent targeting the different alleles of the gene or several distinct agents each targeting at least one allele of the gene.
Thus, according to specific embodiments, the loss of function mutations are in a region shared by all alleles of the gene.
According to other specific embodiments, the loss of function mutations are in region shared by at least 2 alleles of the gene.
According to other specific embodiments, the loss of function mutations are in regions not shared by all allele of the gene.
A shared region, as used herein, refers to a region in a wild type non-mutated allele of a gene having at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to another distinct wild type non-mutated allele of the gene.
According to specific embodiments, the shared region has 100% sequence identity Sequence identity can be determined using any nucleic acid sequence alignment algorithm such as CLC, Vector NTI, Blast, ClustalW, and MUSCLE.
According to specific embodiments, the shared region is at least 10, at least 15, at least 20, at least 25 nucleic acids long.
According to specific embodiments, the loss of function mutations in the DCL2 are located within nucleic acid residues 288-307 and/or 881-900 corresponding to SEQ ID NO: 74 and/or nucleic acid residues 288-307 and/or 881-900 corresponding to SEQ ID NO: 75.
According to specific embodiments, the region shared by all alleles of the DCL2 gene comprises a sequence selected from the group consisting of SEQ ID NO: 1-2.
As the agent may comprise the sequence of the region shared by all alleles of the DCL2 gene (see
According to specific embodiments, the loss of function mutations in the DCL4 are located within nucleic acid residues 967-986 and/or 1407-1426 corresponding to SEQ ID NO: 76 and/or nucleic acid residues 931-950 and/or 1371-1390 corresponding to SEQ ID NO: 77.
According to specific embodiments, the region shared by all alleles of the DCL4 gene comprises a sequence selected from the group consisting of SEQ ID NO: 3-4.
According to specific embodiments, the agent comprises a DCL4 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 3-4.
According to specific embodiments, the loss of function mutations in the RDR1 are located within nucleic acid residues 916-937 and/or 962-984 corresponding to SEQ ID NO: 78 and/or nucleic acid residues 921-937 and/or 962-984 corresponding to SEQ ID NO: 79.
According to specific embodiments, the region shared by all alleles of the RDR1 gene comprises a sequence selected from the group consisting of SEQ ID NO: 26-27.
According to specific embodiments, the agent comprises a RDR1 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 26-27.
According to specific embodiments, the loss of function mutations in the RDR2 are located within nucleic acid residues 575-553 and/or 589-612 corresponding to SEQ ID NO: 80; and/or nucleic acid residues 575-553 and/or 589-612 corresponding to SEQ ID NO: 81.
According to specific embodiments, the region shared by all alleles of the RDR2 gene comprises a sequence selected from the group consisting of SEQ ID NO: 28-29.
According to specific embodiments, the agent comprises a RDR2 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 28-29.
According to specific embodiments, the loss of function mutations in the RDR6 are located within nucleic acid residues 2767-2785 and/or 2820-2839 corresponding to SEQ ID NO: 82, nucleic acid residues 49-67 and/or 102-121 corresponding to SEQ ID NO: 83; and/or nucleic acid residues 49-67 and/or 102-121 corresponding to SEQ ID NO: 84.
According to specific embodiments, the region shared by all alleles of the RDR6 gene comprises a sequence selected from the group consisting of SEQ ID NO: 30-31.
According to specific embodiments, the agent comprises a RDR6 DNA targeting sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 30-31.
As used herein, the phrase “corresponding to SEQ ID NO:” intends to include the nucleic acid or a homolog thereof as defined by its location in the sequence of the recited SEQ ID NO relative to any other sequence encoding the recited enzyme.
Methods of introducing loss of function mutations to a gene of interest are well known in the art [see for example Menke D. Genesis (2013) 51:-618; Capecchi, Science (1989) 244:1288-1292; Santiago et al. Proc Natl Acad Sci USA (2008) 105:5809-5814; International Patent Application Nos. WO 2014085593, WO 2009071334 and WO 2011146121; U.S. Pat. Nos. 8,771,945, 8,586,526, 6,774,279 and UP Patent Application Publication Nos. 20030232410, 20050026157, US20060014264; the contents of which are incorporated by reference in their entireties] and include targeted homologous recombination, site specific recombinases, PB transposases and genome editing by engineered nucleases. Agents for introducing nucleic acid alterations to a gene of interest can be designed publically available sources or obtained commercially from Transposagen, Addgene and Sangamo Biosciences.
Following is a description of various exemplary methods used to introduce loss of function mutations to a gene of interest and agents for implementing same that can be used according to specific embodiments of the present invention.
According to specific embodiments, the agent is a genome editing agent.
Genome Editing using engineered endonucleases—this approach refers to a reverse genetics method using artificially engineered nucleases to cut and create specific double-stranded breaks at a desired location(s) in the genome, which are then repaired by cellular endogenous processes such as, homology directed repair (HDS) and non-homologous end-joining (NHEJF). NHEJF directly joins the DNA ends in a double-stranded break, while HDR utilizes a homologous donor sequence as a template for regenerating the missing DNA sequence at the break point. In order to introduce specific nucleotide modifications to the genomic DNA, a donor DNA repair template containing the desired sequence must be present during HDR.
Genome editing cannot be performed using traditional restriction endonucleases since most restriction enzymes recognize a few base pairs on the DNA as their target and these sequences often will be found in many locations across the genome resulting in multiple cuts which are not limited to a desired location. To overcome this challenge and create site-specific single- or double-stranded breaks, several distinct classes of nucleases have been discovered and bioengineered to date. These include for example the meganucleases, Zinc finger nucleases (ZFNs), transcription-activator like effector nucleases (TALENs) and CRISPR/Cas system.
Meganucleases—Meganucleases are commonly grouped into four families: the LAGLIDADG family, the GIY-YIG family, the His-Cys box family and the HNH family. These families are characterized by structural motifs, which affect catalytic activity and recognition sequence. For instance, members of the LAGLIDADG family are characterized by having either one or two copies of the conserved LAGLIDADG motif. The four families of meganucleases are widely separated from one another with respect to conserved structural elements and, consequently, DNA recognition sequence specificity and catalytic activity. Meganucleases are found commonly in microbial species and have the unique property of having very long recognition sequences (>14 bp) thus making them naturally very specific for cutting at a desired location.
This can be exploited to make site-specific double-stranded breaks in genome editing. One of skill in the art can use these naturally occurring meganucleases, however the number of such naturally occurring meganucleases is limited. To overcome this challenge, mutagenesis and high throughput screening methods have been used to create meganuclease variants that recognize unique sequences. For example, various meganucleases have been fused to create hybrid enzymes that recognize a new sequence.
Alternatively, DNA interacting amino acids of the meganuclease can be altered to design sequence specific meganucleases (see e.g., U.S. Pat. No. 8,021,867). Meganucleases can be designed using the methods described in e.g., Certo, M T et al. Nature Methods (2012) 9:073-975; U.S. Pat. Nos. 8,304,222; 8,021,867; 8,119,381; 8,124,369; 8,129,134; 8,133,697; 8,143,015; 8,143,016; 8,148,098; or 8, 163,514, the contents of each are incorporated herein by reference in their entirety. Alternatively, meganucleases with site specific cutting characteristics can be obtained using commercially available technologies e.g., Precision Biosciences' Directed Nuclease Editor™ genome editing technology.
ZFNs and TALENs—Two distinct classes of engineered nucleases, zinc-finger nucleases (ZFNs) and transcription activator-like effector nucleases (TALENs), have both proven to be effective at producing targeted double-stranded breaks (Christian et al., 2010; Kim et al., 1996; Li et al., 2011; Mahfouz et al., 2011; Miller et al., 2010).
Basically, ZFNs and TALENs restriction endonuclease technology utilizes a non-specific DNA cutting enzyme which is linked to a specific DNA binding domain (either a series of zinc finger domains or TALE repeats, respectively). Typically a restriction enzyme whose DNA recognition site and cleaving site are separate from each other is selected. The cleaving portion is separated and then linked to a DNA binding domain, thereby yielding an endonuclease with very high specificity for a desired sequence. An exemplary restriction enzyme with such properties is Fok1. Additionally Fok1 has the advantage of requiring dimerization to have nuclease activity and this means the specificity increases dramatically as each nuclease partner recognizes a unique DNA sequence. To enhance this effect, Fok1 nucleases have been engineered that can only function as heterodimers and have increased catalytic activity. The heterodimer functioning nucleases avoid the possibility of unwanted homodimer activity and thus increase specificity of the double-stranded break.
Thus, for example to target a specific site, ZFNs and TALENs are constructed as nuclease pairs, with each member of the pair designed to bind adjacent sequences at the targeted site. Upon transient expression in cells, the nucleases bind to their target sites and the FokI domains heterodimerize to create a double-stranded break. Repair of these double-stranded breaks through the non-homologous end-joining (NHEJ) pathway often results in small deletions or small sequence insertions. Since each repair made by NHEJ is unique, the use of a single nuclease pair can produce an allelic series with a range of different deletions at the target site.
The deletions typically range anywhere from a few base pairs to a few hundred base pairs in length, but larger deletions have been successfully generated in cell culture by using two pairs of nucleases simultaneously (Carlson et al., 2012; Lee et al., 2010). In addition, when a fragment of DNA with homology to the targeted region is introduced in conjunction with the nuclease pair, the double-stranded break can be repaired via homology directed repair to generate specific modifications (Li et al., 2011; Miller et al., 2010; Urnov et al., 2005).
Although the nuclease portions of both ZFNs and TALENs have similar properties, the difference between these engineered nucleases is in their DNA recognition peptide. ZFNs rely on Cys2-His2 zinc fingers and TALENs on TALEs. Both of these DNA recognizing peptide domains have the characteristic that they are naturally found in combinations in their proteins. Cys2-His2 Zinc fingers are typically found in repeats that are 3 bp apart and are found in diverse combinations in a variety of nucleic acid interacting proteins. TALEs on the other hand are found in repeats with a one-to-one recognition ratio between the amino acids and the recognized nucleotide pairs. Because both zinc fingers and TALEs happen in repeated patterns, different combinations can be tried to create a wide variety of sequence specificities. Approaches for making site-specific zinc finger endonucleases include, e.g., modular assembly (where Zinc fingers correlated with a triplet sequence are attached in a row to cover the required sequence), OPEN (low-stringency selection of peptide domains vs. triplet nucleotides followed by high-stringency selections of peptide combination vs. the final target in bacterial systems), and bacterial one-hybrid screening of zinc finger libraries, among others. ZFNs can also be designed and obtained commercially from e.g., Sangamo Biosciences™ (Richmond, CA).
Method for designing and obtaining TALENs are described in e.g. Reyon et al. Nature Biotechnology 2012 May; 30(5):460-5; Miller et al. Nat Biotechnol. (2011) 29: 143-148; Cermak et al. Nucleic Acids Research (2011) 39 (12): e82 and Zhang et al. Nature Biotechnology (2011) 29 (2): 149-53. A recently developed web-based program named Mojo Hand was introduced by Mayo Clinic for designing TAL and TALEN constructs for genome editing applications (can be accessed through www(dot)talendesign(dot)org). TALEN can also be designed and obtained commercially from e.g., Sangamo Biosciences™ (Richmond, CA).
CRISPR-Cas system (also referred to herein as “CRISPR”)—Many bacteria and archea contain endogenous RNA-based adaptive immune systems that can degrade nucleic acids of invading phages and plasmids. These systems consist of clustered regularly interspaced short palindromic repeat (CRISPR) nucleotide sequences that produce RNA components and CRISPR associated (Cas) genes that encode protein components. The CRISPR RNAs (crRNAs) contain short stretches of homology to the DNA of specific viruses and plasmids and act as guides to direct Cas nucleases to degrade the complementary nucleic acids of the corresponding pathogen. Studies of the type II CRISPR/Cas system of Streptococcus pyogenes have shown that three components form an RNA/protein complex and together are sufficient for sequence-specific nuclease activity: the Cas9 nuclease, a crRNA containing 20 base pairs of homology to the target sequence, and a trans-activating crRNA (tracrRNA) (Jinek et al. Science (2012) 337: 816-821.). It was further demonstrated that a synthetic chimeric guide RNA (gRNA) composed of a fusion between crRNA and tracrRNA could direct Cas9 to cleave DNA targets that are complementary to the crRNA in vitro. It was also demonstrated that transient expression of Cas9 in conjunction with synthetic gRNAs can be used to produce targeted double-stranded brakes in a variety of different species (Cho et al., 2013; Cong et al., 2013; DiCarlo et al., 2013; Hwang et al., 2013a,b; Jinek et al., 2013; Mali et al., 2013).
The CRIPSR/Cas system for genome editing contains two distinct components: a gRNA and an endonuclease e.g. Cas9.
The gRNA is a combination of the target homologous sequence (crRNA) and the endogenous bacterial RNA that links the crRNA to the Cas e.g. Cas9 nuclease (tracrRNA) in a single chimeric transcript. The gRNA/Cas complex is recruited to the target sequence by the base-pairing between the gRNA sequence and the complement genomic DNA. For successful binding of Cas9, the genomic target sequence must also contain the correct Protospacer Adjacent Motif (PAM) sequence immediately following the target sequence. The binding of the gRNA/Cas9 complex localizes the Cas9 to the genomic target sequence so that the Cas9 can cut both strands of the DNA causing a double-strand break. Just as with ZFNs and TALENs, the double-stranded breaks produced by CRISPR/Cas can undergo homologous recombination or NHEJ and are susceptible to specific sequence modification during DNA repair.
The Cas9 nuclease has two functional domains: RuvC and HNH, each cutting a different DNA strand. When both of these domains are active, the Cas9 causes double strand breaks in the genomic DNA.
A significant advantage of CRISPR/Cas is that the high efficiency of this system coupled with the ability to easily create synthetic gRNAs. This creates a system that can be readily modified to target modifications at different genomic sites and/or to target different modifications at the same site. Additionally, protocols have been established which enable simultaneous targeting of multiple genes. The majority of cells carrying the mutation present biallelic mutations in the targeted genes.
However, apparent flexibility in the base-pairing interactions between the gRNA sequence and the genomic DNA target sequence allows imperfect matches to the target sequence to be cut by the endonuclease e.g. Cas9.
Modified versions of the Cas9 enzyme containing a single inactive catalytic domain, either RuvC- or HNH-, are called ‘nickases’. With only one active nuclease domain, the Cas9 nickase cuts only one strand of the target DNA, creating a single-strand break or ‘nick’. A single-strand break, or nick, is normally quickly repaired through the HDR pathway, using the intact complementary DNA strand as the template. However, two proximal, opposite strand nicks introduced by a Cas9 nickase are treated as a double-strand break, in what is often referred to as a ‘double nick’ CRISPR system. A double-nick can be repaired by either NHEJ or HDR depending on the desired effect on the gene target. Thus, if specificity and reduced off-target effects are crucial, using the Cas9 nickase to create a double-nick by designing two gRNAs with target sequences in close proximity and on opposite strands of the genomic DNA would decrease off-target effect as either gRNA alone will result in nicks that will not change the genomic DNA.
In order to cut DNA at a specific site, Cas9 proteins require the presence of a gRNA and a protospacer adjacent motif (PAM), which immediately follows the gRNA target sequence in the targeted polynucleotide gene sequence. The PAM is located at the 3′ end of the gRNA target sequence but is not part of the gRNA. Different Cas proteins require a different PAM. Accordingly, selection of a specific polynucleotide gRNA target sequence by a gRNA is generally based on the recombinant Cas protein used. Non-limiting examples of PAM sequence include 5-NRG-3′, where R is either A or G, NNGRR (SEQ ID NO: 85), “NGG” sequence, “NAG”, NNNNGATT (SEQ ID NO: 86) and NNNNGNNN (SEQ ID NO: 87) where “N” can be any nucleotide (e.g. T, G, A). According to specific embodiments, the PAM sequence is selected from the group consisting of TGG, GGG and AGG.
The gRNA comprises a “gRNA guide sequence” or “gRNA target sequence” which corresponds to the target sequence on the target polynucleotide gene sequence that is followed by a PAM sequence.
Although a perfect match between the gRNA guide sequence and the DNA sequence on the targeted gene is preferred, a mismatch between a gRNA guide sequence and target sequence on the gene sequence of interest is also permitted as long as it still allows hybridization of the gRNA with the complementary strand of the gRNA target polynucleotide sequence on the targeted gene. A seed sequence of between 8-12 consecutive nucleotides in the gRNA, which perfectly matches a corresponding portion of the gRNA target sequence is preferred for proper recognition of the target sequence. The remainder of the guide sequence may comprise one or more mismatches. In general, gRNA activity is inversely correlated with the number of mismatches. According to specific embodiments, the gRNA of the present invention comprises 7 mismatches, 6 mismatches, 5 mismatches, 4 mismatches, 3 mismatches, 2 mismatches, or less, and even no mismatch, with the corresponding gRNA target gene sequence (less the PAM). According to specific embodiments, the gRNA nucleic acid sequence is at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to the gRNA target polynucleotide sequence in the gene of interest. Of course, the smaller the number of nucleotides in the gRNA guide sequence the smaller the number of mismatches tolerated. The binding affinity is thought to depend on the sum of matching gRNA-DNA combinations.
Any gRNA guide sequence can be selected in the target nucleic acid sequence, as long as it allows introducing at the proper location, the patch/donor sequence of the present invention. Accordingly, the gRNA guide sequence or target sequence of the present invention may be in coding or non-coding regions a gene (i.e., introns or exons).
In one embodiment, the gRNA is a sgRNA.
As used herein, the term “sgRNA” refers to single guide RNA used in conjunction with CRISPR associated systems (Cas). sgRNAs are a fusion of crRNA and tracrRNA and contain nucleotides of sequence complementary to the desired target site. Jinek et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity” Science 337(6096):816-821 (2012) Watson-Crick pairing of the sgRNA with the target site permits R-loop formation, which in conjunction with a functional PAM permits DNA cleavage or in the case of nuclease-deficient Cas9 allows binds to the DNA at that locus.
There are a number of publically available tools to help choose and/or design target sequences as well as lists of bioinformatically determined unique gRNAs for different genes in different species such as the Feng Zhang lab's Target Finder, the Michael Boutros lab's Target Finder (E-CRISP), the RGEN Tools: Cas-OFFinder, the CasFinder: Flexible algorithm for identifying specific Cas9 targets in genomes and the CRISPR Optimal Target Finder.
Non-limiting examples of a gRNA that can be used in the present disclosure include those described in the Example section which follows.
Thus, for example the gRNA sequence that target DCL2 may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1-2, the gRNA sequence that target DCL4 may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NO: 3-4, the gRNA sequence that target RDR1 may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NO: 26-27, the gRNA sequence that target RDR2 may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NO: 28-29, the gRNA sequence that target RDR6 may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NO: 30-31.
In order to use the CRISPR system, both gRNA and a CAS endonuclease (e.g. Cas9) should be expressed in a target cell. The insertion vector can contain both cassettes on a single plasmid or the cassettes are expressed from two separate plasmids. In addition the insertion vector may contain nucleic acid sequences encoding selectable markers, internal ribosome entry site (IRES) and the like. CRISPR plasmids are commercially available such as the px330 plasmid from Addgene (75 Sidney St, Suite 550A•Cambridge, MA 02139) or the pBIN19 vector from ATCC (catalog number 37327). Use of clustered regularly interspaced short palindromic repeats (CRISPR)-associated (Cas)-guide RNA technology and a Cas endonuclease for modifying plant genomes are also at least disclosed by Svitashev et al., 2015, Plant Physiology, 169 (2): 931-945; Kumar and Jain, 2015, J Exp Bot 66: 47-57; and in U.S. Patent Application Publication No. 20150082478, which is specifically incorporated herein by reference in its entirety. CAS endonucleases that can be used to effect DNA editing with gRNA include, but are not limited to, Cas9, CasX, Cpf1 (Zetsche et al., 2015, Cell. 163(3):759-71), C2c1, C2c2, and C2c3 (Shmakov et al., Mol Cell. 2015 Nov. 5; 60(3):385-97).
T-GEE system (TargetGene's Genome Editing Engine)—A programmable nucleoprotein molecular complex containing a polypeptide moiety and a specificity conferring nucleic acid (SCNA) which assembles in-vivo, in a target cell, and is capable of interacting with the predetermined target nucleic acid sequence. The programmable nucleoprotein molecular complex is capable of specifically modifying and/or editing a target site within the target nucleic acid sequence and/or modifying the function of the target nucleic acid sequence. Nucleoprotein composition comprises (a) polynucleotide molecule encoding a chimeric polypeptide and comprising (i) a functional domain capable of modifying the target site, and (ii) a linking domain that is capable of interacting with a specificity conferring nucleic acid, and (b) specificity conferring nucleic acid (SCNA) comprising (i) a nucleotide sequence complementary to a region of the target nucleic acid flanking the target site, and (ii) a recognition region capable of specifically attaching to the linking domain of the polypeptide. The composition enables modifying a predetermined nucleic acid sequence target precisely, reliably and cost-effectively with high specificity and binding capabilities of molecular complex to the target nucleic acid through base-pairing of specificity-conferring nucleic acid and a target nucleic acid. The composition is less genotoxic, modular in their assembly, utilize single platform without customization, practical for independent use outside of specialized core-facilities, and has shorter development time frame and reduced costs.
“Hit and run” or “in-out”—involves a two-step recombination procedure. In the first step, an insertion-type vector containing a dual positive/negative selectable marker cassette is used to introduce the desired sequence alteration. The insertion vector contains a single continuous region of homology to the targeted locus and is modified to carry the mutation of interest. This targeting construct is linearized with a restriction enzyme at a one site within the region of homology, electroporated into the cells, and positive selection is performed to isolate homologous recombinants. These homologous recombinants contain a local duplication that is separated by intervening vector sequence, including the selection cassette. In the second step, targeted clones are subjected to negative selection to identify cells that have lost the selection cassette via intrachromosomal recombination between the duplicated sequences. The local recombination event removes the duplication and, depending on the site of recombination, the allele either retains the introduced mutation or reverts to wild type. The end result is the introduction of the desired modification without the retention of any exogenous sequences.
The “double-replacement” or “tag and exchange” strategy—involves a two-step selection procedure similar to the hit and run approach, but requires the use of two different targeting constructs. In the first step, a standard targeting vector with 3′ and 5′ homology arms is used to insert a dual positive/negative selectable cassette near the location where the mutation is to be introduced. After electroporation and positive selection, homologously targeted clones are identified. Next, a second targeting vector that contains a region of homology with the desired mutation is electroporated into targeted clones, and negative selection is applied to remove the selection cassette and introduce the mutation. The final allele contains the desired mutation while eliminating unwanted exogenous sequences.
Site-Specific Recombinases—The Cre recombinase derived from the P1 bacteriophage and Flp recombinase derived from the yeast Saccharomyces cerevisiae are site-specific DNA recombinases each recognizing a unique 34 base pair DNA sequence (termed “Lox” and “FRT”, respectively) and sequences that are flanked with either Lox sites or FRT sites can be readily removed via site-specific recombination upon expression of Cre or Flp recombinase, respectively. For example, the Lox sequence is composed of an asymmetric eight base pair spacer region flanked by 13 base pair inverted repeats. Cre recombines the 34 base pair lox DNA sequence by binding to the 13 base pair inverted repeats and catalyzing strand cleavage and religation within the spacer region. The staggered DNA cuts made by Cre in the spacer region are separated by 6 base pairs to give an overlap region that acts as a homology sensor to ensure that only recombination sites having the same overlap region recombine.
Basically, the site specific recombinase system offers means for the removal of selection cassettes after homologous recombination. This system also allows for the generation of conditional altered alleles that can be inactivated or activated in a temporal or tissue-specific manner. Of note, the Cre and Flp recombinases leave behind a Lox or FRT “scar” of 34 base pairs. The Lox or FRT sites that remain are typically left behind in an intron or 3′ UTR of the modified locus, and current evidence suggests that these sites usually do not interfere significantly with gene function.
Thus, Cre/Lox and Flp/FRT recombination involves introduction of a targeting vector with 3′ and 5′ homology arms containing the mutation of interest, two Lox or FRT sequences and typically a selectable cassette placed between the two Lox or FRT sequences. Positive selection is applied and homologous recombinants that contain targeted mutation are identified. Transient expression of Cre or Flp in conjunction with negative selection results in the excision of the selection cassette and selects for cells where the cassette has been lost. The final targeted allele contains the Lox or FRT scar of exogenous sequences.
Introducing the agent capable of introducing the loss of function mutations to the plant cell may be effected by any method known in the art.
According to specific embodiments, the cell is introduced with a nucleic acid construct comprising a nucleic acid sequence encoding the agent.
As used herein the term “nucleic acid sequence” or “polynucleotide” refers to a single or double stranded nucleic acid sequence which is isolated and provided in the form of an RNA sequence, a complementary polynucleotide sequence (cDNA), a genomic polynucleotide sequence and/or a composite polynucleotide sequences (e.g., a combination of the above).
Constructs useful in the methods according to some embodiments of the invention may be constructed using recombinant DNA technology well known to persons skilled in the art. The gene constructs may be inserted into vectors, which may be commercially available, suitable for transforming into plants and suitable for expression of the nucleic acid sequence of interest in the transformed cells. The genetic construct can be an expression vector wherein said nucleic acid sequence is operably linked to one or more regulatory sequences allowing expression in the plant cells in a constitutive or inducible manner.
In a particular embodiment of some embodiments of the invention the regulatory sequence is a plant-expressible promoter.
As used herein the phrase “plant-expressible” refers to a promoter sequence, including any additional regulatory elements added thereto or contained therein, is at least capable of inducing, conferring, activating or enhancing expression in a plant cell, preferably a monocotyledonous or dicotyledonous plant cell.
This includes any promoter of plant origin, but also any promoter of non-plant origin which is capable of directing transcription in a plant cell, i.e., certain promoters of viral or bacterial origin such as the CaMV35S (Harpster et al. (1988) Mol Gen Genet. 212(1):182-90, type III RNA polymerase III promoter (U6), the subterranean clover virus promoter No 4 or No 7 (WO9606932), or T-DNA gene promoters but also tissue-specific or organ-specific promoters including but not limited to seed-specific promoters (e.g., WO89/03887), organ-primordia specific promoters (An et al. (1996) Plant Cell 8(1):15-30), stem-specific promoters (Keller et al., (1988) EMBO J. 7(12): 3625-3633), leaf specific promoters (Hudspeth et al. (1989) Plant Mol Biol. 12: 579-589), mesophyl-specific promoters, root-specific promoters (Keller et al. (1989) Genes Dev. 3: 1639-1646), tuber-specific promoters (Keil et al. (1989) EMBO J. 8(5): 1323-1330), vascular tissue specific promoters (Peleman et al. (1989) Gene 84: 359-369), stamen-selective promoters (WO 89/10396, WO 92/13956), dehiscence zone specific promoters (WO 97/13865) and the like.
According to specific embodiments, the promotor is a constitutive promotor.
According to other specific embodiments, the promotor is an inducible promotor.
Nucleic acid sequences of some embodiments of the invention may be optimized for plant expression. Examples of such sequence modifications include, but are not limited to, an altered G/C content to more closely approach that typically found in the plant species of interest, and the removal of codons atypically found in the plant species commonly referred to as codon optimization.
The phrase “codon optimization” refers to the selection of appropriate DNA nucleotides for use within a structural gene or fragment thereof that approaches codon usage within the plant cell of interest. Therefore, an optimized gene or nucleic acid sequence refers to a gene in which the nucleotide sequence of a native or naturally occurring gene has been modified in order to utilize statistically-preferred or statistically-favored codons within the plant cell. The nucleotide sequence typically is examined at the DNA level and the coding region optimized for expression in the plant species determined using any suitable procedure, for example as described in Sardana et al. (1996, Plant Cell Reports 15:677-681). In this method, the standard deviation of codon usage, a measure of codon usage bias, may be calculated by first finding the squared proportional deviation of usage of each codon of the native gene relative to that of highly expressed plant genes, followed by a calculation of the average squared deviation. The formula used is: 1 SDCU=n=1N [(Xn−Yn)/Yn] 2/N, where Xn refers to the frequency of usage of codon n in highly expressed plant genes, where Yn to the frequency of usage of codon n in the gene of interest and N refers to the total number of codons in the gene of interest. A table of codon usage from highly expressed genes of dicotyledonous plants is compiled using the data of Murray et al. (1989, Nuc Acids Res. 17:477-498).
One method of optimizing the nucleic acid sequence in accordance with the preferred codon usage for a particular plant cell type is based on the direct use, without performing any extra statistical calculations, of codon optimization tables such as those provided on-line at the Codon Usage Database through the NIAS (National Institute of Agrobiological Sciences) DNA bank in Japan (www(dot)kazusa(dot)or(dot)jp/codon/). The Codon Usage Database contains codon usage tables for a number of different species, with each codon usage table having been statistically determined based on the data present in Genbank.
By using the above tables to determine the most preferred or most favored codons for each amino acid in a particular species (for example, rice), a naturally-occurring nucleotide sequence encoding a protein of interest can be codon optimized for that particular plant species. This is effected by replacing codons that may have a low statistical incidence in the particular species genome with corresponding codons, in regard to an amino acid, that are statistically more favored. However, one or more less-favored codons may be selected to delete existing restriction sites, to create new ones at potentially useful junctions (5′ and 3′ ends to add signal peptide or termination cassettes, internal sites that might be used to cut and splice segments together to produce a correct full-length sequence), or to eliminate nucleotide sequences that may negatively effect mRNA stability or expression.
The naturally-occurring encoding nucleotide sequence may already, in advance of any modification, contain a number of codons that correspond to a statistically-favored codon in a particular plant species. Therefore, codon optimization of the native nucleotide sequence may comprise determining which codons, within the native nucleotide sequence, are not statistically-favored with regards to a particular plant, and modifying these codons in accordance with a codon usage table of the particular plant to produce a codon optimized derivative. A modified nucleotide sequence may be fully or partially optimized for plant codon usage provided that the protein encoded by the modified nucleotide sequence is produced at a level higher than the protein encoded by the corresponding naturally occurring or native gene. Construction of synthetic genes by altering the codon usage is described in for example PCT Patent Application 93/07278.
The nucleic acid construct of some embodiments of the invention can further include additional polynucleotide sequences that allow, for example, the translation of several proteins from a single mRNA such as an internal ribosome entry site (IRES) and sequences for genomic integration.
Plant cells may be transformed stably or transiently with the nucleic acid constructs of some embodiments. In stable transformation, the nucleic acid molecule of some embodiments is integrated into the plant genome and as such it represents a stable and inherited trait. In transient transformation, the nucleic acid molecule is expressed by the cell transformed but it is not integrated into the genome and as such it represents a transient trait.
According to specific embodiments, the transformation step comprises a stable transformation.
There are various methods of introducing foreign genes into both monocotyledonous and dicotyledonous plants (Potrykus, I., Annu. Rev. Plant. Physiol., Plant. Mol. Biol. (1991) 42:205-225; Shimamoto et al., Nature (1989) 338:274-276).
The principle methods of causing stable integration of exogenous DNA into plant genomic DNA include two main approaches:
The Agrobacterium system includes the use of plasmid vectors that contain defined DNA segments that integrate into the plant genomic DNA. Methods of inoculation of the plant cell vary depending upon the plant species and the Agrobacterium delivery system. A widely used approach is the leaf disc procedure which can be performed with any tissue explant that provides a good source for initiation of whole plant differentiation. Horsch et al. in Plant Molecular Biology Manual A5, Kluwer Academic Publishers, Dordrecht (1988) p. 1-9. A supplementary approach employs the Agrobacterium delivery system in combination with vacuum infiltration. The Agrobacterium system is especially viable in the creation of transgenic dicotyledenous plants.
There are various methods of direct DNA transfer into plant cells. In electroporation, the protoplasts are briefly exposed to a strong electric field. In microinjection, the DNA is mechanically injected directly into the cells using very small micropipettes. In microparticle bombardment, the DNA is adsorbed on microprojectiles such as magnesium sulfate crystals or tungsten particles, and the microprojectiles are physically accelerated into cells or plant tissues.
Although stable transformation is presently preferred, transient transformation of plant cells is also envisaged by some embodiments of the invention.
Transient transformation can be effected by any of the direct DNA transfer methods described above or by viral infection using modified plant viruses.
Viruses that have been shown to be useful for the transformation of plant hosts include CaMV, TMV and BV. Transformation of plant cells using plant viruses is described in U.S. Pat. No. 4,855,237 (BGV), EP-A 67,553 (TMV), Japanese Published Application No. 63-14693 (TMV), EPA 194,809 (BV), EPA 278,667 (BV); and Gluzman, Y. et al., Communications in Molecular Biology: Viral Vectors, Cold Spring Harbor Laboratory, New York, pp. 172-189 (1988). Pseudovirus particles for use in expressing foreign DNA in many hosts, including plants, is described in WO 87/06261.
Construction of plant RNA viruses for the introduction and expression of non-viral exogenous nucleic acid sequences in plants is demonstrated by the above references as well as by Dawson, W. O. et al., Virology (1989) 172:285-292; Takamatsu et al. EMBO J. (1987) 6:307-311; French et al. Science (1986) 231:1294-1297; and Takamatsu et al. FEBS Letters (1990) 269:73-76.
When the virus is a DNA virus, suitable modifications can be made to the virus itself. Alternatively, the virus can first be cloned into a bacterial plasmid for ease of constructing the desired viral vector with the foreign DNA. The virus can then be excised from the plasmid. If the virus is a DNA virus, a bacterial origin of replication can be attached to the viral DNA, which is then replicated by the bacteria. Transcription and translation of this DNA will produce the coat protein which will encapsidate the viral DNA. If the virus is an RNA virus, the virus is generally cloned as a cDNA and inserted into a plasmid. The plasmid is then used to make all of the constructions. The RNA virus is then produced by transcribing the viral sequence of the plasmid and translation of the viral genes to produce the coat protein(s) which encapsidate the viral RNA.
Construction of plant RNA viruses for the introduction and expression in plants of non-viral exogenous nucleic acid sequences such as those included in the construct of some embodiments of the invention is demonstrated by the above references as well as in U.S. Pat. No. 5,316,931.
In one embodiment, a plant viral nucleic acid is provided in which the native coat protein coding sequence has been deleted from a viral nucleic acid, a non-native plant viral coat protein coding sequence and a non-native promoter, preferably the subgenomic promoter of the non-native coat protein coding sequence, capable of expression in the plant cell host, packaging of the recombinant plant viral nucleic acid, and ensuring a systemic infection of the host by the recombinant plant viral nucleic acid, has been inserted. Alternatively, the coat protein gene may be inactivated by insertion of the non-native nucleic acid sequence within it, such that a protein is produced. The recombinant plant viral nucleic acid may contain one or more additional non-native subgenomic promoters. Each non-native subgenomic promoter is capable of transcribing or expressing adjacent genes or nucleic acid sequences in the plant host cell and incapable of recombination with each other and with native subgenomic promoters. Non-native (foreign) nucleic acid sequences may be inserted adjacent the native plant viral subgenomic promoter or the native and a non-native plant viral subgenomic promoters if more than one nucleic acid sequence is included. The non-native nucleic acid sequences are transcribed or expressed in the host plant cell under control of the subgenomic promoter to produce the desired products.
In a second embodiment, a recombinant plant viral nucleic acid is provided as in the first embodiment except that the native coat protein coding sequence is placed adjacent one of the non-native coat protein subgenomic promoters instead of a non-native coat protein coding sequence.
In a third embodiment, a recombinant plant viral nucleic acid is provided in which the native coat protein gene is adjacent its subgenomic promoter and one or more non-native subgenomic promoters have been inserted into the viral nucleic acid. The inserted non-native subgenomic promoters are capable of transcribing or expressing adjacent genes in a plant host cell and are incapable of recombination with each other and with native subgenomic promoters. Non-native nucleic acid sequences may be inserted adjacent the non-native subgenomic plant viral promoters such that said sequences are transcribed or expressed in the host plant under control of the subgenomic promoters to produce the desired product.
In a fourth embodiment, a recombinant plant viral nucleic acid is provided as in the third embodiment except that the native coat protein coding sequence is replaced by a non-native coat protein coding sequence.
The viral vectors are encapsidated by the coat proteins encoded by the recombinant plant viral nucleic acid to produce a recombinant plant virus. The recombinant plant viral nucleic acid or recombinant plant virus is used to infect appropriate host plant cells. The recombinant plant viral nucleic acid is capable of replication in the host cell, systemic spread in the cell culture, and transcription or expression of foreign gene(s) (isolated nucleic acid) in the host cells to produce the desired protein.
In addition to the above, the nucleic acid molecule of some embodiments of the invention can also be introduced into a chloroplast genome thereby enabling chloroplast expression.
A technique for introducing exogenous nucleic acid sequences to the genome of the chloroplasts is known. This technique involves the following procedures. First, plant cells are chemically treated so as to reduce the number of chloroplasts per cell to about one. Then, the exogenous nucleic acid is introduced via particle bombardment into the cells with the aim of introducing at least one exogenous nucleic acid molecule into the chloroplasts. The exogenous nucleic acid is selected such that it is integratable into the chloroplast's genome via homologous recombination which is readily effected by enzymes inherent to the chloroplast. To this end, the exogenous nucleic acid includes, in addition to a gene of interest, at least one nucleic acid stretch which is derived from the chloroplast's genome. In addition, the exogenous nucleic acid includes a selectable marker, which serves by sequential selection procedures to ascertain that all or substantially all of the copies of the chloroplast genomes following such selection will include the exogenous nucleic acid. Further details relating to this technique are found in U.S. Pat. Nos. 4,945,050; and 5,693,507 which are incorporated herein by reference. A polypeptide can thus be produced by the protein expression system of the chloroplast and become integrated into the chloroplast's inner membrane.
Following the transformation step of, the plant cells may be cultured for at least 6 hours, at least 12 hours, at least 1 day, at least two days, at least a week, at least two weeks or at least three weeks, at least one month, at least 2 months, at least 3 months, at least 4 months, at least 6 months, each possibility represented a separate embodiments of the present invention.
The cells may be cultured in a positive selection medium in order to identify cells that have been successfully transformed.
As used herein, the phrase “positive selective medium” refers to the medium or growth conditions which select for cells which contain a positive selectable marker gene. Transformed cells survive and/or grow when exposed to agents or conditions which would, normally, be detrimental to the survival of a cell that did not contain the positive selectable marker gene.
Sequencing analysis may also be effected in order to identify cells that have been successfully transformed.
If the construct encodes an agent capable of introducing the loss of function mutations to the plant cell (e.g. silencing agent), a sequencing analyses may be carried out to confirm presence of mutations.
If the construct encodes an agent capable of introducing the loss of function mutations to the plant cell (e.g. silencing agent), additional analyses may be carried out to confirm down-regulation of expression and/or activity of the gene of interest (e.g. DCL2, DCL4, RDR1, RDR1, RDR6). Method of determining down-regulation of expression and/or activity are well known in the art and include RT-PCR, ELISA, Western blot, IP and the like.
According to specific embodiments, the loss of function mutation abolishes expression and/or activity of the expressed product of the gene.
Thus, according to specific embodiments, the loss of function mutations abolish expression of DCL2, DCL4, RDR1, RDR2 and/or RDR6, as determined by RT-PCR or Western blot.
According to specific embodiments, the loss of function mutations abolish expression and/or activity of DCL2, as determined by no expression of transgene specific 21-nt siRNAs in the plant cell following expression of the transgene in the plant cell.
According to specific embodiments, the loss of function mutations abolish expression and/or activity of DCL4, as determined by no expression of transgene specific 22-nt siRNAs in the plant cell following expression of the transgene in the plant cell.
Methods of determining transgene specific 21-nt and/or 22-nt siRNAs are well known in the art and are also described in details in the Examples section which follows.
If the construct encodes an expression product of interest other an agent capable of introducing the loss of function mutations to the plant cell (e.g. a recombinant protein of interest) as further described hereinbelow, additional analyses may be carried out to confirm expression and/or activity of expression product of interest. Method of determining expression and/or activity are well known in the art and include RT-PCR, ELISA, western blot, IP and the like.
Specific embodiments of the present invention are also directed to plant cells prepared by the methods disclosed herein.
Hence, according to an aspect of the present invention, there is provided a plant cell obtainable by the method.
According to an additional or an alternative aspect of the present invention, there is provided as isolated plant cell in suspension comprising loss of function mutations in all alleles of at least two genes selected from the group consisting of DCL2, DCL4, RDR1, RDR2 and RDR6 in said plant cell.
According to specific embodiments, the loss of function mutations in DCL2 comprise a mutation selected from the group consisting of:
According to specific embodiments, the loss of function mutations in DCL4 comprise a mutation selected from the group consisting of:
According to specific embodiments, the loss of function mutations in DCL2 comprise a mutation selected from the group consisting of:
According to specific embodiments, the loss of function mutations in DCL4 comprise a mutation selected from the group consisting of:
According to specific embodiments, the loss of function mutations in DCL2 comprise a mutation selected from the group consisting of:
According to specific embodiments, the loss of function mutations in DCL4 comprise a mutation selected from the group consisting of:
It will be appreciated that the plant cells described herein may have been further genetically modified to express an expression product of interest.
Thus, according to specific embodiments, the method comprises expressing in the isolated plant cell a recombinant expression product of interest.
According to specific embodiments, the isolated plant cell comprises a heterologous nucleic acid sequence for expressing an expression product of interest.
As used herein, the term “heterologous” refers to a nucleic acid sequence which is not native to the plant cell at least in localization or is completely absent from the native plant cell.
According to an aspect of the present invention, there is provided a method of expressing a recombinant expression product of interest in a plant cell, the method comprising culturing the plant cell disclosed herein which has been transformed with the agent capable of introducing the loss of function mutations disclosed herein and further transformed to express the expression product (e.g. polypeptide) of interest under condition which allow expression of the expression product of interest.
Such conditions may be for example an appropriate temperature (e.g., 37° C.), atmosphere (e.g., air plus 5% CO2), pH, light, medium (e.g. MS-BY-2 medium), carbon source and supplements.
Following, the produced expression product (e.g. polypeptide) may be purified and formulated in accordance with standard procedures.
According to specific embodiments, the expression product of interest is not the agent capable of introducing the loss of function mutations to the plant cell (e.g. silencing agent) described herein.
According to specific embodiments, the expression product of interest is a polypeptide.
According to specific embodiments, the expression product of interest is a mammalian polypeptide.
According to specific embodiments, the expression product of interest is a human polypeptide.
According to some embodiments, the expression product of interest is a pharmaceutical.
Non-limiting Examples of polypeptides of interest that can be expressed by the plant cells and methods disclosed herein include cytokines, cytokine receptors, growth factors (e.g. EGF, HER-2, FGF-alpha, FGF-beta, TGF-alpha, TGF-beta, PDGF, IGF-I, IGF-2, NGF), growth factor receptors, growth hormones (e.g. human growth hormone, bovine growth hormone); insulin (e.g., insulin A chain and insulin B chain), pro-insulin, erythropoietin (EPO), colony stimulating factors (e.g. G-CSF, GM-CSF, M-CSF); interleukins; vascular endothelial growth factor (VEGF) and its receptor (VEGF-R), interferons, tumor necrosis factor and its receptors, thrombopoietin (TPO), thrombin, brain natriuretic peptide (BNP); clotting factors (e.g. Factor VIII, Factor IX, von Willebrands factor and the like), anti-clotting factors; tissue plasminogen activator (TPA), urokinase, follicle stimulating hormone (FSH), luteinizing hormone (LH), calcitonin, CD proteins (e.g., CD2, CD3, CD4, CD5, CD7, CD8, CDI Ia, CDI Ib, CD18, CD19, CD20, CD25, CD33, CD44, CD45, CD71, etc.), CTLA proteins (e.g.CTLA4); T-cell and B-cell receptor proteins, bone morphogenic proteins (BNPs, e.g. BMP-I, BMP-2, BMP-3, etc.), neurotrophic factors, e.g. bone derived neurotrophic factor (BDNF), neurotrophins, e.g. rennin, rheumatoid factor, RANTES, albumin, relaxin, macrophage inhibitory protein (e.g. MIP-I, MIP-2), viral proteins or antigens, surface membrane proteins, ion channel proteins, enzymes, regulatory proteins, immunomodulatory proteins, (e.g. HLA, MHC, the B7 family), homing receptors, transport proteins, superoxide dismutase (SOD), G-protein coupled receptor proteins (GPCRs), neuromodulatory proteins, Alzheimer's Disease associated proteins and peptides. Fusion proteins, chimeric proteins, as well as fragments or portions, or mutants, variants, or analogs of any of the aforementioned proteins and polypeptides are also included among the suitable polypeptides that can be expressed by the plant cells and methods disclosed herein. The polypeptide of interest can be a glycoprotein. One class of glycoproteins are viral glycoproteins, in particular subunits, that can be used to produce for example a vaccine. Some examples of viral proteins comprise proteins from rhinovirus, poliomyelitis virus, herpes virus, bovine herpes virus, influenza virus, newcastle disease virus, respiratory syncitio virus, measles virus, retrovirus, such as human immunodeficiency virus or a parvovirus or a papovavirus, rotavirus or a coronavirus, such as transmissible gastroenteritisvirus or a flavivirus, such as tick-borne encephalitis virus or yellow fever virus, a togavirus, such as rubella virus or eastern-, western-, or venezuelean equine encephalomyelitis virus, a hepatitis causing virus, such as hepatitis A or hepatitis B virus, a pestivirus, such as hog cholera virus or a rhabdovirus, such as rabies virus.
According to a specific embodiment, the polypeptide of interest is an antibody. The term “antibody” as used herein refers to recombinant antibodies (for example of the classes IgD, IgG, IgA, IgM, IgE) and recombinant antibodies such as single-chain antibodies, chimeric and humanized antibodies and multi-specific antibodies. The term “antibody” also refers to fragments and derivatives of all of the foregoing, and may further comprise any modified or derivatised variants thereof that retain the ability to specifically bind an epitope. Antibody derivatives may comprise a protein or chemical moiety conjugated to an antibody. A monoclonal antibody is capable of selectively binding to a target antigen or epitope. Non-limiting examples of antibodies include, monoclonal antibodies (mAbs), humanized or chimeric antibodies, camelized antibodies, camelid antibodies (NanobodiesRTM), single chain antibodies (scFvs), Fab fragments, F(ab′)2 fragments, disulfide-linked Fvs (sdFv) fragments, anti-idiotypic (anti-Id) antibodies, intra-bodies, synthetic antibodies, and epitope-binding fragments of any of the above. The term “antibody” also refers to fusion protein that includes a region equivalent to the Fc region of an immunoglobulin. Also envisaged is the production in the plant cells of so called dual-specificity antibodies (Bostrom J et al (2009) Science 323, 1610-1614).
Non-limiting examples of antibodies within the scope of the present invention include those comprising the amino acid sequences of the following antibodies: anti-TNFalpha antibodies such as Adalinunab (Humira™), anti-HER2 antibodies including antibodies comprising the heavy and light chain variable regions (see U.S. Pat. No. 5,725,856) or Trastuzumab such as HERCEPTIN™; anti-CD20 antibodies such as chimeric anti-CD20 as in U.S. Pat. No. 5,736,137, a chimeric or humanized variant of the 2H7 antibody as in U.S. Pat. No. 5,721,108; anti-VEGF antibodies including humanized and/or affinity matured anti-VEGF antibodies such as the humanized anti-VEGF antibody huA4.6.1 AVASTIN™ (WO 96/30046 and WO 98/45331); anti-EGFR (chimerized or humanized antibody as in WO 96/40210); anti-CD3 antibodies such as OKT3 (U.S. Pat. No. 4,515,893); anti-CD25 or anti-tac antibodies such as CHI-621 (SIMULECT) and (ZENAPAX) (U.S. Pat. No. 5,693,762).
According to a specific embodiment, the polypeptide of interest is an enzyme.
According to specific embodiments, the enzyme is rasburicase.
According to a specific embodiment, the enzyme is a lysosomal enzyme. Examples include, but are not limited to, a cermidase e.g., N-acetylgalactosamine-4-sulphatase (arylsulphatase B), α-glucocerebrosidase, α-L-iduronidase, alpha-galactosidase A, beta-galactosidase.
According to a specific embodiment the polypeptide of interest is a chimeric polypeptide e.g., the polypeptide of interest-attached to a heterologous polypeptide which is not native to the polypeptide of interest, also referred to as a fusion protein. Examples include, but are not limited to, Etanercept (Enbrel™), a chimeric polypeptide that fuses the TNF receptor to the constant end of the IgG1 antibody.
It will be appreciated that the plant cells described herein may have been further modified to have reduced expression and/or activity of a glycosylation enzyme.
Thus, according to specific embodiments, the plant cell has reduced expression and/or activity of a glycosylation enzyme as compared to a control plant cell of the same genetic background not subjected to an agent which downregulates expression and/or activity of said glycosylation enzyme.
According to specific embodiments, the method comprises introducing into the isolated plant cell an agent capable of downregulating expression and/or activity of a glycosylation enzyme Examples of glycosylating enzymes comprises xylosyltransferase and/or fucosyltransferase.
As used herein “Xylosyltransferase” abbreviated as “XylT” refers to an enzyme that catalyzes the transfer of xylose from GDP-xylose to the beta-linked bisecting mannose in the core of N-glycans while linking it with a beta-1,2 glycosidic linkages (EC 2.4.2.38).
As used herein “Fucosyltransferase”, abbreviated as “FucT” refers to an enzyme that catalyses the transfer of fucose from GDP-fucose to the core alpha-linked N-acetyl glucosamine (GlcNAc) of protein-bound N-glycans (EC 2.4.1.214).
The N. tabacum comprises two XylT genes and 5 FucT genes. These include:
As used herein, “reduced expression and/or activity” or “downregulating expression and/or activity” refers to a decrease of at least 10% in the level of expression and/or activity of a glycosylation enzyme in comparison to a control cell of the same genetic background which was not subjected to (or contacted with) an agent which downregulates expression and/or activity of the glycosylation enzyme, as may be determined by e.g. PCR, ELISA, Western blot analysis, immunopercipitation, flow cytometry, immuno-staining or activity assays such as comparison of oligosaccharides obtained from PNGaseA to oligosaccharides obtained from PNGaseF. According to a specific embodiment, the decrease is in at least 20%, 30%, 40% or even higher say, 50%, 60%, 70%, 80%, 90% or even 100%.
Methods and agents for reducing (or downregulating) expression and/or activity of a glycosylation enzyme are well known in the art and may be effected at the genomic (e.g. homologous recombination and site specific endonucleases) and/or the transcript level using a variety of molecules which interfere with transcription and/or translation (e.g., RNA silencing agents e.g. transcriptional gene silencing (TGS), post-transcriptional gene silencing (PTGS), quelling, co-suppression, and translational repression RNA interference (RNAi) e.g. siRNA, miRNA, antisense) or on the protein level (e.g., aptamers, small molecules and inhibitory peptides, antagonists, enzymes that cleave the polypeptide, antibodies and the like).
A detailed description on downregulation at the genomic level is further provided hereinabove. Non-limiting examples of specific methods of reducing expression and/or activity of xylosyltransferase and/or fucosyltransferase at the genomic level are disclosed for example in Hanania U, et al. (2017) Plant Biotechnology Journal 15:1120-1129; Jansing J, Sack M, et al. (2019) Plant biotechnology journal 17:350-361, Mercx Sb, et al. (2017) Frontiers in plant science 8:403-403, the contents of which are fully incorporated herein by reference.
It will be appreciated that the heterologous sequences encoding e.g. the agents disclosed herein and optionally the regulatory sequences and/or selectable markers accompanying them may be removed from the isolated plant cell once they are no longer needed (e.g. following introduction of a loss of function mutation). Various techniques are known in the art for the removal of transgenes and markers while leaving only the required ones in place. Such methods include for example: 1) Transient expression of Cas9 (Chen et al., 2018; Zhang et al., 2016); (2) Transfection of preassembled complexes of purified Cas9 protein and guide RNA (RNP) into plant protoplasts (Woo et al., 2015); (3) Using ‘suicide’ transgenes, such as the BARNASE gene under the control of the rice REG2 promoter, that effectively kill all of the CRISPR-Cas9 containing pollen and embryos, assuring that any viable embryos will be free of foreign DNA (He et al., 2018) or (4) Coupling the CRISPR construct with an RNA interference element, which targets an herbicide resistance enzyme in rice (Lu et al., 2017), resulting in transgene-free mutated plants.
It is expected that during the life of a patent maturing from this application many relevant agents capable of reducing or abolishing expression and/or activity of a target protein and specifically introducing loss of function mutations in a gene will be developed and the scope of the term agent which reduces or abolishes expression and/or activity or agent capable of introducing loss of function mutations is intended to include all such new technologies a priori.
As used herein the term “about” refers to ±10%
The terms “comprises”, “comprising”, “includes”, “including”, “having” and their conjugates mean “including but not limited to”.
The term “consisting of” means “including and limited to”.
The term “consisting essentially of” means that the composition, method or structure may include additional ingredients, steps and/or parts, but only if the additional ingredients, steps and/or parts do not materially alter the basic and novel characteristics of the claimed composition, method or structure.
As used herein, the singular form “a”, “an” and “the” include plural references unless the context clearly dictates otherwise. For example, the term “a compound” or “at least one compound” may include a plurality of compounds, including mixtures thereof.
Throughout this application, various embodiments of this invention may be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6. This applies regardless of the breadth of the range.
Whenever a numerical range is indicated herein, it is meant to include any cited numeral (fractional or integral) within the indicated range. The phrases “ranging/ranges between” a first indicate number and a second indicate number and “ranging/ranges from” a first indicate number “to” a second indicate number are used herein interchangeably and are meant to include the first and second indicated numbers and all the fractional and integral numerals therebetween.
As used herein the term “method” refers to manners, means, techniques and procedures for accomplishing a given task including, but not limited to, those manners, means, techniques and procedures either known to, or readily developed from known manners, means, techniques and procedures by practitioners of the chemical, pharmacological, biological, biochemical and medical arts.
When reference is made to particular sequence listings, such reference is to be understood to also encompass sequences that substantially correspond to its complementary sequence as including minor sequence variations, resulting from, e.g., sequencing errors, cloning errors, or other alterations resulting in base substitution, base deletion or base addition, provided that the frequency of such variations is less than 1 in 50 nucleotides, alternatively, less than 1 in 100 nucleotides, alternatively, less than 1 in 200 nucleotides, alternatively, less than 1 in 500 nucleotides, alternatively, less than 1 in 1000 nucleotides, alternatively, less than 1 in 5,000 nucleotides, alternatively, less than 1 in 10,000 nucleotides.
It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable subcombination or as suitable in any other described embodiment of the invention. Certain features described in the context of various embodiments are not to be considered essential features of those embodiments, unless the embodiment is inoperative without those elements.
Various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below find experimental support in the following examples.
Reference is now made to the following examples, which together with the above descriptions illustrate some embodiments of the invention in a non limiting fashion.
Generally, the nomenclature used herein and the laboratory procedures utilized in the present invention include molecular, biochemical, microbiological and recombinant DNA techniques. Such techniques are thoroughly explained in the literature. See, for example, “Molecular Cloning: A laboratory Manual” Sambrook et al., (1989); “Current Protocols in Molecular Biology” Volumes I-III Ausubel, R. M., ed. (1994); Ausubel et al., “Current Protocols in Molecular Biology”, John Wiley and Sons, Baltimore, Maryland (1989); Perbal, “A Practical Guide to Molecular Cloning”, John Wiley & Sons, New York (1988); Watson et al., “Recombinant DNA”, Scientific American Books, New York; Birren et al. (eds) “Genome Analysis: A Laboratory Manual Series”, Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; “Cell Biology: A Laboratory Handbook”, Volumes I-III Cellis, J. E., ed. (1994); “Culture of Animal Cells—A Manual of Basic Technique” by Freshney, Wiley-Liss, N. Y. (1994), Third Edition; “Current Protocols in Immunology” Volumes I-III Coligan J. E., ed. (1994); Stites et al. (eds), “Basic and Clinical Immunology” (8th Edition), Appleton & Lange, Norwalk, C T (1994); Mishell and Shiigi (eds), “Selected Methods in Cellular Immunology”, W. H. Freeman and Co., New York (1980); available immunoassays are extensively described in the patent and scientific literature, see, for example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 5,281,521; “Oligonucleotide Synthesis” Gait, M. J., ed. (1984); “Nucleic Acid Hybridization” Hames, B. D., and Higgins S. J., eds. (1985); “Transcription and Translation” Hames, B. D., and Higgins S. J., eds. (1984); “Animal Cell Culture” Freshney, R. I., ed. (1986); “Immobilized Cells and Enzymes” IRL Press, (1986); “A Practical Guide to Molecular Cloning” Perbal, B., (1984) and “Methods in Enzymology” Vol. 1-317, Academic Press; “PCR Protocols: A Guide To Methods And Applications”, Academic Press, San Diego, C A (1990); Marshak et al., “Strategies for Protein Purification and Characterization—A Laboratory Course Manual” CSHL Press (1996); all of which are incorporated by reference as if fully set forth herein. Other general references are provided throughout this document. The procedures therein are believed to be well known in the art and are provided for the convenience of the reader. All the information contained therein is incorporated herein by reference.
Plant cell suspensions—Nicotiana tabacum cv. BY-2 cells (Nagata, 2004) were cultured as a suspension culture in liquid MS-BY-2 medium (Nagata and Kumagai, 1999) at 25° C. with constant agitation on an orbital shaker (85 rpm). The suspensions were grown at 50 mL of volume in 250 mL Erlenmeyers and were sub-cultured weekly at 2.5% (v/v) concentration.
Construction of Cas9/sgRNA vectors to knockout DCL2 and DCL4 genes—Based on alignment analysis two pairs of crRNA were designed: one pair targets both genes of DCL2 (SEQ ID NO: 1-2, Table 1A hereinbelow) and one pair targets both genes of DCL4 (SEQ ID NO: 3-4, Table 1A hereinbelow) in a common homological region. Each of the four crRNAs was fused to the tracrRNA backbone sequence (SEQ ID NO: 5, Table 1A hereinbelow) resulting in the construction of four sgRNAs (designated sgRNA1-sgRNA4, respectively). Three binary vectors, i.e. phCas9-DCL2, phCas9-DCL4 and phCas9-DCL2-DCL4 (
Construction of Cas9/sgRNA vectors to knockout RDR1, RDR2, RDR6 genes—Based on alignment analysis three pairs of crRNA were designed: one pair targets both genes of RDR1 (SEQ ID NO: 26-27, Table 1B hereinbelow), one pair targets both genes of RDR2 (SEQ ID NO: 28-29, Table 1B hereinbelow) and one pair targets all three genes of RDR6 (SEQ ID NO: 30-31, Table 1B hereinbelow) in a common homological region. Each of the six crRNAs was fused to the tracrRNA backbone sequence (SEQ ID NO: 5, Table 1A hereinabove) resulting in the construction of six sgRNAs (designated sgRNA11-sgRNA16, respectively). Each of these gRNAs was constructed under the U6 promoter and inserted into a binary pBIN19 backbone vector.
CACGCCT
TGG 3′
AGTGAGG
TGG 3′
ATAGCTT
TGG 3′
GTTCTGAC
CGG 3′
CACATG
TGG 3′
CTAAAAT
TGG 3′
Construction of a vector encoding resburicase—A Bean yellow dwarf virus (BeYDV) expression vector (Chen et al., 2011; Mor et al., 2002) encoding the rasburicase gene of Aspergillus (DB00049, SEQ ID NO: 95) was constructed. Specifically, the vector encodes the replicon initiator protein (Rep) and deletions of the viral coat and movement genes, contains insertion of an expression cassette for raburicase under the control of a CaMV 35S promoter and Octopine synthase terminator.
Transformation of cells and selection of lines—The final vectors were used to transform the tobacco cells via the Agrobacterium plant transformation procedure (An, 1985). Once a stable transgenic cell suspension was established, it was tested for transgene expression as pools or used for isolating and screening individual cell lines (clones). Establishing of individual cell lines was conducted by using highly diluted aliquots of the transgenic cell suspension and spreading them on solid medium. The cells were allowed to grow until small calli developed. Each callus, representing a single clone, was then re-suspended in liquid medium and sampled.
AFLP assay—Genomic DNA was extracted using the DNeasy plant mini kit (Qiagen). PCR amplification was effected using the appropriate forward and reverse primers (Table 2, SEQ ID 6-7 hereinbelow) in 35 cycles according to the following procedure: 95° C. for 1 minute, 60° C. for 20 seconds and 72° C. for 1 minute. Following, 5 μl of each sample was separated on 1X TBE 15% Polyacrylamide Gel Electrophoresis (PAGE) and then stained for 5 minutes with ethidium bromide.
RFLP assay—Genomic DNA was extracted using the DNeasy plant mini kit (Qiagen). PCR amplification was effected using the appropriate forward and reverse primers (Table 2 SEQ ID 8-11 hereinbelow) in 35 cycles according to the following procedure: 95° C. for 1 minute, 60° C. for 20 seconds and 72° C. for 1 minute. Following, digestion of the PCR products using the 5 restriction enzymes E.conI or PspFI was done by using 10 μl of the PCR product 3 al of restriction buffer 2 μl enzyme and 15 al DDW. The digested products were separated by electrophoresis on an ethidium bromide-stained 2% agarose gel.
DNA Sequencing—Genomic DNA was extracted using DNeasy plant mini kit (Qiagen). 35 cycles of PCR amplification was effected using the appropriate forward and reverse primers (Table 2 SEQ ID 12-19 hereinbelow) according to the following procedure: 95° C. for 1 minute, 60° C. for 20 seconds and 72° C. for 1 minute. Following, the PCR products were sub-cloned into the pGEMT vector. Colonies were sequenced by Sanger method and were aligned with the wild-type target sequences to determine mutations.
siRNA detection—Micro RNA (miRNA) was isolated using the mirPreimer—miRNA isolation kit (Sigma SNC50-1kt) according to the manufacturer instructions. To a 5 μl miRNA sample, 5 μl of 2× loading dye were added and the samples were heated at 90° C. for 5 minutes to denature RNA and then placed on ice. Following, the samples were separated on 15% polyacrylamide MOPS 7M-urea gels for 2.5 hours at 100v in 20 mM MOPS. The miRNA was transferred onto positively charged nylon membrane (REF 11417240001 Roche) using the semi dry transferring cell, bio rad instrument followed by crosslinking the membrane with EDC [1-ethyl-3-(3-dimethylaminopropyl) carbodiimide].
For synthesizing the RNA probe, DIG RNA Labeling kit SP6/T7 (Sigma) was used. DNA template for transcription was inserted into the pGEMT vector and the insert orientation was determined. In order to get sense strand RNA probe, SP6 or T7 polymerase was used. Signal intensity was scanned with high resolution chemiluminescence settings using a ChemiDoc Touch Imaging System (Bio Rad) with Image Lab™ Software ver 5.2.1 (Bio-Rad).
Real-time RT-PCR—Total RNA was extracted from cell cultures using RNeasy Plant Mini Kit (Qiagen) and contaminated genomic DNA in the total RNA was degraded with TURBO DNase (Thermo Fisher Scientific). 500 ng of total RNA was used to generate the first-strand cDNA using the iSCRIPT select cDNA synthesis kit (Bio-Rad) with oligo mix/Random (1:3) primers. The qPCR mixtures were prepared using 15 μl TaqMan master mix (Thermo scientific), 3 μl ready mix of primers and probe, 5 μl template cDNA (0.4 ng to 6.4 ng) and 7 μl ddH2O. Amplification was performed using the appropriate forward and reverse primers (Table 3 hereinbelow) in a rotor gene Real-Time PCR system (Corbett). The qPCR reaction conditions were as follows: DNA polymerase activation at 95° C. for 10 minutes was followed by 40 cycles of DNA melting at 95° C. for 15 seconds, annealing at 60° C. for 15 seconds and extension at 72° C. for 15 seconds.
In the Nicotiana tabacum genome, two genes of DCL2 and two genes of DCL4 are publicly known (Table 4 hereinbelow). In order to increase the possibility of producing a mutation two pairs of CRISPR small guide RNAs (sgRNAs) were designed based on an alignment analysis. One pair targets both genes of DCL2 and one pair targets both genes of DCL4 in a common homological region. Accordingly, the following crRNAs were defined: crRNA1, crRNA2—each 20 bp long shared between the N.tab-DCL2A and the N.tab-DCL2B genes (SEQ ID NO: 1-2, Table 1A hereinabove) and crRNA3, crRNA4—each 20 bp long shared between the N.tab-DCL4A and N.tab-DCL4B genes (SEQ ID NO: 3-4, Table 1A hereinabove). Each of the constructed crRNA was designed to hybridize with the strand complementary to the target gene and hence its sequence is of the selected common homological region of the target (
Following, in order to evaluate if knockout of DCL2 and DCL4 genes induces higher expression amounts of a recombinant protein compared to a wild type cell line, two approaches were selected. In the first approach, the three constructed vectors were used in three separate stable transformations of pre-transformed BY2 cells expressing a recombinant protein (Example 2 hereinbelow). In the second approach, the phCas9-DCL2-DCL4 vector was transformed into BY2 cells, knock-out lines were selected and then re-transformed to express a recombinant protein (Example 3 hereinbelow).
BY2 cells were transformed by Agrobacterium for expression of the Aspergillus rasburicase gene (DB00049) using the Bean yellow dwarf virus (BeYDV) expression vector (Chen et al., 2011; Mor et al., 2002). A total of 100 individual transformed cell lines were isolated and screened for expression of the recombinant rasburicase. One of the lines (line 40), which expressed the higher level of rasburicase, was used for further knock-out of DCL2 and/or DCL4 genes. Thus, in a second transformation, Line 40 was transformed by Agrobacterium with the three binary vectors: phCas9-DCL2, phCas9-DCL4 or phCas9-DCL2-DCL4 (
Following, the effects of deletion of DCL2 and/or DCL4 on the expression level of the recombinant rasburicase was tested. Specifically, pools of cells were collected eight weeks post transformation and the expression level of rasburicase was tested by activity. The expression level of rasburicase in ΔD2ΔD4 cells was 5 fold higher compared to the expression level in the wild-type line 40, while the expression level of rasburicase in ΔD2 or ΔD4 cells was approximately 2 fold higher (
In the next step, a total of 72 individual cell lines were isolated from the transformed ΔD2ΔD4 pooled cells and the expression level of rasburicase was tested. Twenty five of these lines (about 35%) expressed 2-9 fold higher levels of rasburicase compared to the original line 40 (
The selected ΔD2ΔD4 lines 3, 18, 34, 58 and 65 were analyzed to detect indels in DCL2 and DCL4 genes. To identify and characterize the targeted mutations in DCL2 genes, an AFLP assay was applied (Liu et al., 2015). PCR primers (SEQ ID NO: 6-7, Table 2 hereinabove) were designed to flank the target site of DCL2-gRNA1, to produce a product of 104 bp of the wild-type genes. Indeed the PCR product obtained from the wild type BY2 cells was 104 bp long. However, the PCR products from ΔD2ΔD4 lines 3, 18, 34, 58 and 65 were shorter and/or longer than the anticipated size (
The Cas9 generated mutations in the DCL2 and the DCL4 genes in cell lines 18 and 65 were further characterized by sequencing. Specifically, to sequence the mutations in the DCL2 genes, a PCR was performed using a set of primers (SEQ ID NO: 12-13, Table 2 hereinabove) flanking the gRNA1 Cas9 target site of both DCL2 genes and primers (SEQ ID NO: 14-15, Table 2 hereinabove) flanking the gRNA1-gRNA2 Cas9 target site of both DCL2 genes. To sequence the mutations in the DCL4 genes, a PCR was performed using a set of primers (SEQ ID NO: 16-17, Table 2 hereinabove) flanking the gRNA3 Cas9 target site of both DCL4 genes and primers (SEQ ID NO: 18-19, Table 2 hereinabove) flanking the gRNA3-gRNA4 Cas9 target site of both DCL4 genes. Following, the obtained PCR products were cloned into a pGEMT vector and the sub-clones were sequenced, revealing the presence of assorted insertions and/or deletions:
Following, to approve the mutations introduced in the DCL2 and DCL4 genes are indeed loss-of-function mutations leading to the enhanced expression of the recombinant rasburicase, the selected ΔD2ΔD4 lines 3, 18, 34, 58 and 65 which express high levels of rasburicase (7, 7.5, 8, 9.5 and 7.5 fold respectively, compared to line 40,
BY2 cells were transformed by Agrobacterium with the binary vector phCas9-DCL2-DCL4 (
The Cas9 generated mutations in the DCL2 and the DCL4 genes in cell line 35 were further characterized by sequencing. Specifically, to sequence the mutations in the DCL2 genes, a PCR was performed using a set of primers (SEQ ID NO: 12-13, Table 2 hereinabove) flanking the gRNA1 Cas9 target site of both DCL2 genes and primers (SEQ ID NO: 14-15, Table 2 hereinabove) flanking the gRNA1-gRNA2 Cas9 target site of both DCL2 genes. To sequence the mutations in the DCL4 genes, a PCR was performed using a set of primers (SEQ ID NO: 16-17, Table 2 hereinabove) flanking the gRNA3 Cas9 target site of both DCL4 genes. Following, the obtained PCR products were cloned into a pGEMT vector and the sub-clones were sequenced, revealing the presence of assorted insertions and/or deletions. No wild type products were detected among any of the tested genes. Three mutations for the DCL2 genes were identified: A mutation of 2,528 bp deletion and a mutation of 34 bp deletion were identified in both alleles of DCL2A and a mutation of 70 bp deletion was identified in one allele of DCL2B. No PCR product of the second allele of DCL2B from all sub-clones was obtained, this can happen in case of a long deletion. (
To study the effects of DCL2 and DCL4 knock-out on expression level and gene silencing against transgenes, wild-type BY2 cells and the resultant six transgenic cells lines were re-transformed by Agrobacterium for expression of the Aspergillus rasburicase gene (DB00049) using the Bean yellow dwarf virus (BeYDV) expression vector (Chen et al., 2011; Mor et al., 2002). Following, pools of each re-transformed cell line were collected eight weeks post transformation and the expression level of rasburicase was tested by activity. A significant increase in the rasburicase expression level was obtained in pools of all lines compared to wild type BY2 ranging from 5 fold in the pool of line 97 to 9.6 fold in the pool of line 97 (
Following, to approve the mutations introduced in the DCL2 and DCL4 genes are indeed loss-of-function mutations leading to the enhanced expression of the recombinant rasburicase, the accumulation of siRNA molecules in the selected ΔD2ΔD4 lines 5, 35, 71, 93 and 103 was examined. Specifically, siRNA were extracted and hybridized with an RNA probe of the rasburicase sense sequence. The rasburicase transformed wild type BY2 cells produced 21-nt, 22-nt and 24-nt siRNAs, while all the tested ΔD2ΔD4 lines produced only the 24-nt class obtained by the DCL3 gene and the 22-nt and 21-nt classes obtained by the DCL2 and DCL4 genes were not detected (
In the Nicotiana tabacum genome, seven genes of the RDRα group are publicly known (Table 5 hereinbelow). Based on alignment analysis, these genes were divided into three homologous groups: Group 1-RDR1-A,B: Group 2-RDR2-A,B; and Group 3-RDR6-A,B,C. In order to increase the possibility of producing a mutation two pairs of CRISPR small guide RNAs (sgRNAs) were designed based on an alignment analysis for each group. Accordingly, the following crRNAs were defined: crRNA11, crRNA12-19 bp and 20 bp long shared between the N.tab-RDR1A and the N.tab-RDR1B genes (SEQ ID NO: 26-27, Table 1B hereinabove), crRNA13, crRNA14-20 bp and 21 bp long shared between the N.tab-RDR2A and the N.tab-RDR2B genes (SEQ ID NO: 28-29, Table 1B hereinabove), and crRNA15, crRNA16-19 bp and 20 bp long shared between the N.tab-RDR6A, N.tab-RDR6B and the N.tab-RDR6C genes (SEQ ID NO: 30-31, Table 1B hereinabove). For the application of the CRISPR/Cas9 technology, the four crRNAs were each fused to the tracrRNA backbone sequence (SEQ ID NO: 5, Table 1A hereinabove) resulting in the construction of four sgRNAs (designated sgRNA11-sgRNA16, respectively). Each of these gRNA is constructed into a binary vector namely.
Following, in order to evaluate if knockout of RDR1, RDR2 and/or RDR6 and possibly in combination with DCL2 and/or DCL4 genes induces higher expression amounts of a recombinant protein compared to a wild type cell line, two approaches were selected as described in Examples 2-3 hereinabove.
Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.
It is the intent of the applicant(s) that all publications, patents and patent applications referred to in this specification are to be incorporated in their entirety by reference into the specification, as if each individual publication, patent or patent application was specifically and individually noted when referenced that it is to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention. To the extent that section headings are used, they should not be construed as necessarily limiting. In addition, any priority document(s) of this application is/are hereby incorporated herein by reference in its/their entirety.
This application claims the benefit of priority of U.S. Provisional Patent Application No. 63/087,366 filed on 5 Oct. 2020 and U.S. Provisional Patent Application No. 63/186,280 filed on 10 May 2021, the contents of which are incorporated herein by reference in their entirety. The ASCII file, entitled 89444SequenceListing.txt, created on 5 Oct. 2021, comprising 558,975 bytes, submitted concurrently with the filing of this application is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IL2021/051194 | 10/5/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63186280 | May 2021 | US | |
63087366 | Oct 2020 | US |