The present disclosure belongs to the field of plant genetic engineering and relates to methods for non-transgenic plant genome editing. More particularly, but not exclusively, the present disclosure relates to methods of generating targeted mutation in plant genome by delivering CRISPR/Cas nuclease to plants using engineered plant negative-stranded RNA viral vector systems, whereby plants with modified genetic material and their progeny are produced.
The development of sequence-specific endonucleases (SSNs) technologies has made targeted genome editing possible. SSNs comprise a sequence-specific DNA-binding domain and a non-specific DNA cleavage domain. Upon introduction into cells, SSNs target specific sites in the genome, creating double-strand breaks (DSBs) in DNA. This activates the cell's non-homologous end joining (NHEJ) mechanism to repair DSBs. This repair process often leads to base insertions or deletions, resulting in mutations in the target sequence. Additionally, when homologous DNA sequences are present within the cell, the homologous recombination (HR) machinery can utilize homologous templates to repair DSBs, enabling genetic modifications, allelic gene replacement, or targeted insertion of exogenous genes into the target sequence. Furthermore, SSN variants that cannot generate DSBs can be fused with different effector proteins, e.g., transcriptional regulatory proteins, cytosine deaminases, adenine deaminases, reverse transcriptases, to enable targeted gene transcription regulation, base editing, insertion, or deletion of single or multiple bases.
Commonly used SSNs include zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR-associated proteins (CRISPR/Cas) system. Among these, CRISPR/Cas technology requires only the introduction of Cas nucleases and target site-specific guide RNA into cells. It is widely utilized in animal and plant genome editing due to its ease of use and high efficiency.
One of the critical steps in genome editing technology is the delivery of sequence-specific nucleases (SSNs), i.e., the methods and means to introduce SSNs into recipient cells. The rigid plant cell walls hinder the direct transfection of SSN molecules, necessitating genetic transformation methods, such as Agrobacterium-mediated transformation or gene gun bombardment, to deliver SSNs in the form of transgenes into plant cells. However, this method presents several challenges when applied in crop genetic improvement: i) Most crop species or varieties exhibit low transformation efficiency; ii) The segregation of SSN transgenes in the progeny of edited plants relies on sexual hybridization, which is time-consuming and labor-intensive, especially for polyploid or asexually reproduced plants, or plants with long breeding cycle; iii) The stable genetic transformation process can cause unpredictable changes and damage to plant chromosomal DNA; iv) Products involving the genetic transformation process may be categorized as genetically modified organisms (GMOs) and face regulatory scrutiny in certain countries.
Viral vectors are natural biomacromolecule delivery devices widely used for introducing and transiently expressing foreign genes in plants (e.g., Donson et al., Proc Natl Acad Sci USA, 1991, 88: 7204-7208; Wang et al., PLoS Pathog, 2015, 11: e1005223). Particularly, RNA viral vectors, not involving a DNA stage during replication, bear negligible risk of integrating foreign nucleic acids into the host genome. Infection of plants with plant viruses is faster and simpler than stable genetic transformation and viral replication can result in high levels of foreign gene expression. Moreover, infectious viral vectors can achieve in planta delivery of foreign genes within the infected plant tissues without the need for isolating specific recipient cells or tissues. Currently, various plant viruses have been engineered as vectors for the direct delivery of nucleic acids or expression of foreign proteins within the plant (Gleba et al., Curr Opin Plant Biol, 2004, 7: 182-188; Cody and Scholthof, Annu Rev Phytopathol, 2019, 57: 211-230).
In plant genome editing, the use of viral vectors as a delivery tool is limited by the large sizes of CRISPR/Cas nucleases. The sizes of commonly used Cas9 and Cas12a nucleases (approximately 4 kb), together with regulatory elements, can reach 5-7 kb, presenting a significant challenge to the packaging capacity of viral vectors. Due to the cargo size limitation, some infectious plant viral vectors have been used to deliver the smaller CRISPR guide RNAs (gRNAs), and the list includes the single-stranded DNA cabbage leaf curl virus (Yin et al., Sci Rep, 2015, 5: 14926), and the positive-sense RNA virus tobacco rattle virus (Ali et al., Mol Plant, 2015, 8: 1288-1291; patents WO 2015189693; WO 2016084084), tobacco mosaic virus (Cody et al., Plant Physiol, 2017, 175: 23-35), pea early browning virus (Ali et al., Virus Res, 2018, 244: 333-337), beet necrotic yellow vein virus (Jiang et al., Plant Biotechnol J, 2019, 17: 1302-1315), and foxtail mosaic virus (Mei et al., Plant Direct, 2019, 3: 1-16). In these cases, the guide RNAs delivered by these viral vectors collaborate with the Cas9 protein provided by an integrated transgene to achieve genome editing. Additionally, some plant viral vectors with relatively larger packaging capacities have been used to deliver other nuclease components. For example, tobacco rattle virus has been used to deliver meganucleases of approximately 1 kb (Honig et al., Mol Plant, 2015, 8: 1292-1294) or zinc finger nucleases of approximately 2 kb (Marton et al., Plant Physiol, 2010, 154: 1079-1087; patent WO 2009130695), or split CRISPR/Cas nuclease components (U.S. Patent Application US 20190359993).
A viral replicon system is a class of vectors with a large cargo capacity that can replicate but cannot infect plants systematically. Baltes et al. (Plant Cell, 2014, 151-163) disclose the delivery of CRISPR/Cas nucleases using a geminivirus replicon derived from bean yellow dwarf virus (BeYDV). Wang et al. (Mol Plant, 2017, 10: 1007-1010; Chinese patent CN 109880846 A) disclose the delivery of CRISPR/Cas nucleases and DNA repair templates using a wheat dwarf virus (WDV) replicon. A technique for delivering CRISPR/Cas nuclease utilizing the PVX vector or tomato mosaic virus vector is described by Airga et al. (Plant Cell Physiol, 2020, 61: 1946-1953) and a US patent (US 20190359993). A technique for delivering CRISPR/Cas nucleases into plant single cells via a barley yellow streak mosaic virus vector is described in Gao et al. (New Phytol, 2019, 223: 2120-2133) and a Chinese patent (CN 110511955 A). The above viral replicon vectors are unable to infect plants systemically and need to be introduced into cells by means of Agrobacterium transformation or biolistic transformation, which can express the complete CRISPR/Cas nuclease and enable gene editing in the inoculated cells.
Plant negative-stranded RNA viruses mainly include the non-segmented negative-stranded RNA viruses in the family Rhabdoviridae and the segmented viruses in the order Bunyaviridae. A few reports documented the development of plant negative-stranded RNA virus vectors. Ma et al. (Nature Plants, 2020, 6: 773-779) disclose a method to systematically deliver CRISPR/Cas nucleases and achieve DNA-free gene editing in N. benthamiana using Sonchus yellow net rhabdovirus (SYNV) vector. However, there is little report on the delivery of Cas nucleases in different crops due to the limitation of the SYNV host range. Feng et al. (Proc Natl Acad Sci USA, 2020, 117: 1181-1190) report a method for reverse genetic manipulation of tomato spotted wilt virus.
To overcome the current technical challenges in achieving non-transgenic targeted genome editing in plants using plant viral vectors, the present invention utilizes a tospovirus, tomato spotted wilt virus (TSWV)-based vector to deliver sequence-specific nucleic acid modifying enzyme into plants, wherein said nucleic acid modifying enzyme targets specific DNA sequences in the plant genome and cleaves or modifies the target sites. Precise modifications at the target sites are achieved through the plant's endogenous DNA repair mechanisms. Furthermore, the TSWV vectors employed in this invention are capable of systemic infection to enable the delivery of sequence-specific nucleases in infected whole plants. The procedure results in gene-edited plant parts without the integration of exogenous nucleotide sequences. Additionally, a method is provided to obtain non-transgenic plants with heritable gene edits from the edited plant parts (such as plant cells) through tissue culture and plant regeneration. Furthermore, the edited plant parts (such as plant cells) without integrated exogenous DNA sequences, when subjected to antiviral drug treatment during the tissue culture process, result in the recovery of non-transgenic, virus-free plants with heritable targeted gene modifications.
In one aspect of the present disclosure, a method is provided to modify the genetic material of a plant cell without the need for introducing exogenous genetic material into the plant genome, comprising: a) providing at least one plant cell to be modified; b) providing a tospovirus vector comprising at least one polynucleotide sequence encoding a sequence-specific nucleic acid modifying enzyme; c) infecting the plant cell with the tospovirus vector to allow transient expression of the sequence-specific nucleic acid modifying enzyme, wherein the nucleic acid modifying enzyme targets and cleaves or modifies a genomic sequence in the infected plant cells, and wherein the plant DNA repair machinery completes the modification of genetic material of the plant cells.
In certain embodiments, the present disclosure also provides a method for generating genetically modified plants without the need for introducing exogenous DNA into the plant genome, comprising: a) providing at least one plant cell to be modified; b) providing a tospovirus vector comprising at least one polynucleotide sequence encoding a sequence-specific nucleic acid modifying enzyme; c) infecting said plant cell with said tospovirus vector to allow transient expression of said nucleic acid modifying enzyme, wherein the nucleic acid modifying enzyme targets and cleaves or modifies a genomic sequence in the infected plant cells, and wherein the plant DNA repair machinery completes the modification of genetic material of the plant cells; d) obtaining a genetically modified plant from the modified plant cell.
In some embodiments, the method further comprises selecting a genetically modified plant cell or plant, wherein the selection does not require a selectable marker.
In some embodiments, said genetic modifications comprise nucleotide deletion, insertion or substitution of one or more nucleotides, or a combination thereof, at the target DNA sequence.
In some embodiments, the sequence-specific nucleic acid modifying enzyme is a nuclease, its mutated forms or derivatives capable of modifying genomic DNA sequence, i.e., CRISPR/Cas nucleases, TALEN nucleases, zinc finger nucleases, meganuclease. In certain embodiments, the nuclease is the Streptococcus pyogenes CRISPR/SpCas9. In certain embodiments, the nuclease is the Lachnospiraceae bacterium CRISPR/LbCas12a (LbCpf1).
In some embodiments, the sequence-specific nucleic acid modifying enzyme is a nucleotide base-modifying enzyme, for example, an adenine base editor comprising a Cas polypeptide fused to an adenine deaminase, a cytidine base editor comprising a Cas polypeptide with a cytidine deaminase. In certain preferred embodiments, the Cas polypeptide is a nickase variant of Streptococcus pyogenes SpCas9, and the adenine deaminase is an artificially evolved E. coli tRNA adenine deaminase A (ecTadA) variant (TadA8e V106W), and the cytidine deaminase is a human-derived Apo B messenger RNA cytidine deaminase catalytic subunit 3A (hAPOBEC3A; hA3A).
In some embodiments, the recombinant tospovirus vector system is the TSWV vector comprising the S, M, and L genome segments having at least 70%, at least 80%, at least 90%, at least 95%, at least 99% or 100% sequence homologies to the nucleic acid sequences as set forth in SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3, respectively.
In some embodiments, the TSWV vectors provided by the present disclosure contain one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) exogenous gene insertion sites. In certain embodiments, the recombinant TSWV vector carries a reporter gene in the S genomic segment and another reporter gene in the M genomic segment, with the reporter gene being operably linked to viral mRNA transcription regulatory cis-elements. Preferably, the reporter genes are the green fluorescent protein gene (GFP) and red fluorescent protein gene (RFP). After introducing the recombinant viral vector constructs into plant cells, virus replication, and systemic infection can be tracked by the expressed fluorescent reporters.
In some embodiments, the TSWV S segment comprises at least one heterologous polynucleotide sequence encoding a sequence-specific nuclease component being operably linked to viral mRNA transcription regulatory cis-elements; preferably, the nuclease component is a chimeric CRISPR guide RNA polynucleotide molecule.
In some embodiments, the TSWV non-structural protein gene in the S segment (NSs) is replaced by heterologous polynucleotide sequences encoding a fusion of the sequence-specific nuclease component to heterologous sequences encoding, for example, a fluorescent reporter. In some embodiments, the sequence-specific nuclease component is a guide RNA. In certain embodiments, the heterologous polynucleotide sequence inserted into said S genome encodes one guide RNA. In other embodiments, the heterologous polynucleotide sequence encodes two or more guide RNAs.
In some embodiments, the TSWV M segment comprises at least one heterologous polynucleotide sequence operably linked to viral transcription regulatory elements and encoding a Cas polypeptide, optionally selected from the group consisting of Streptococcus pyogenes SpCas9, Lachnospiraceae bacterium LbCas12a (LbCpf1), or an adenine base editor or cytosine base editor comprising a Cas9 variant fused to a base deaminase. In some preferred embodiments, the coding sequences of LbCas12a, SpCas9, adenine base editor, and cytosine base editor polypeptides are plant codon-optimized sequences; preferably, as set forth in SEQ ID NO: 10, 11, 84, 85, respectively.
In some embodiments, the viral glycoprotein precursor (GP) gene sequence in the TSWV M segment is replaced by a heterologous polynucleotide sequence encoding a sequence-specific nuclease component.
In some embodiments, the viral RNA-dependent RNA polymerase (RdRp) gene sequence in the TSWV L segment is plant codon-optimized as set forth in SEQ ID NO: 5.
In some embodiments, the tospovirus vector is an infectious TSWV vector devoid of a functional NSs protein. In some embodiments, the TSWV vector is a non-insect-transmissible tospovirus vector devoid of a functional GP protein.
In another aspect, the present disclosure provides methods for introducing an infectious TSWV vector into plant cells for the transient expression of a sequence-specific nucleic acid modifying enzyme. In certain embodiments, the nucleic acid modifying enzyme is a CRISPR/LbCas12a or CRISPR/SpCas9 nuclease. In certain embodiments, the nucleic acid modifying enzyme is an adenine base editor or cytosine base editor.
In some embodiments, the method for infecting a plant is through introducing viral nucleic acid constructs into plant tissues under conditions sufficient to initial recombinant virus infection. In certain embodiments, the viral nucleic acid constructs are Agrobacterium binary plasmids containing transcriptional regulatory cis-elements, including a promoter and a terminator, and the constructs are delivered into plant tissue through agroinfiltration.
In some embodiments, the viral nucleic acid constructs being introduced into the plant tissues further comprise helper constructs for expressing one or more viral suppressors of RNA silencing suppressors (VSRs). In certain embodiments, the VSR is selected from the group consisting of tomato bushy stunt virus (TBSV) p19, tobacco etch virus (TEV) P1/Hc-Pro, barley stripe mosaic virus BSMV yb, or TSWV NSs.
In some embodiments, the binary constructs produce the positive- or negative-sense viral RNA transcripts corresponding to the chimeric sequence of the recombinant TSWV S, M, and L segments. In certain preferred embodiments, the binary constructs produce positive-sense RNA transcripts.
In some embodiments, the polynucleotide constructs comprising the TSWV L, M, and S segments are delivered into a plant cell in relative molar ratios of 1-2 (L):2-10 (M):1-2 (S), preferably in a relative ration of 1:10:2. In certain embodiments, the ratios of the nucleic acid constructs are the ratios of the relative concentration (OD600) of Agrobacterium strains carrying different binary plasmids.
In some embodiments, the method for infecting a plant with the TSWV vector is to use recombinant virus particles through mechanical rubbing, pressurized spraying, and plant grafting.
In some embodiments, the plant to be genetically modified is selected from the group consisting of natural or experimental host plants that can be systemically or locally infected by a tospovirus, including, but are not limited to, Nicotiana benthamiana, common tobacco (N. tabacum), tomato (Solanum lycopersicum), sweet pepper (Capsicum annuum), chili pepper (C. frutescens), habanero pepper (C. chinense), peanut (Arachis hypogaea), potato (Solanum tuberosum), ground cherry (Physalis alkekengi), eggplant (Solanum melongena), soybean (Glycine max), cotton (Gossypium hirsutum), lettuce (Lactuca sativa), spinach (Spinacia oleracea), watermelon (Citrullus lanatus), cucumber (Cucumis sativus), broad bean (Viciafaba), black mung bean (Vigna mungo), green bean (Vigna radiata) or cowpea (Vigna unguiculata).
In some embodiments, the plants to be genetically modified comprise different varieties or genotypes of viral host species. In some preferred embodiments, the plant varieties include but are not limited to, a plurality of peanut varieties, such as Huayu 25, Huayu 32, Huayu 39, Huayu 48, Huayu 9616, Fengyou 68, Honghua 1, Dabaisha, Silihong, and Shuofeng 518. In other preferred embodiments, the plant varieties include but are not limited to, a plurality of sweet pepper varieties, such as Zhoula 2, Zhoula 3, Zhoula 4, Fuxiang Xiuli, Bola 5, Bola 7, Bola 15, Xingshu 201, Xingshu 301, and Xingshu Guiyan.
In some embodiments, the infectious TSWV vectors introduced into plant cells express the LbCas12a polypeptide and a single guide RNA forms a nuclease complex targeting the Nicotiana benthamiana crFucT (SEQ ID NO: 86) or the crDCL2 target sites (SEQ ID NO:87), or the conserved crPDS1 target site in N. benthamiana and N. tabacum (SEQ ID NO: 12), or the tomato crPDS2 target site (SEQ ID NO: 13), or the pepper crPDS3 (SEQ ID NO: 14) or crPDS6 target sites (SEQ ID NO: 99), or the peanut and soybean crPDS4 target site (SEQ ID NO: 15), or the peanut crPDS5 target site (SEQ ID NO: 98), or the ground cherry crER target site (SEQ ID NO: 88).
In some embodiments, the infectious TSWV vectors express the LbCas12a polypeptide and multiple guide RNAs (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) in tandem to form a plurality of nuclease complexes targeting a plurality (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) of target site sequences. In some preferred embodiments, the two guide RNAs are expressed in tandem to target the N. benthamiana crPDS1 and crFucT target site, or the three guide RNAs are expressed in tandem to target the N. benthamiana crPDS1, crFucT, and crDCL2 target sites, respectively.
In some embodiments, the infectious TSWV vectors express the SpCas9 polypeptide and a single guide RNA to form a nuclease complex targeting the gGFP target site (SEQ ID NO: 16) in the GFP-transgenic N. benthamiana or the N. benthamiana gPDS2 (SEQ ID NO: 89), gRDR6 (SEQ ID NO: 90), or gSGS3 (SEQ ID NO: 91) target sites, or the gPDS1 target sequence (SEQ ID NO: 17) conserved in both N. benthamiana and N. tabacum.
In some embodiments, the infectious TSWV vectors express a SpCas9 polypeptide and multiple guide RNAs (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) in tandem, forming a plurality of nuclease complexes targeting a plurality (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) of target sequences. In some preferred embodiments, the two guide RNAs are expressed in tandem targeting the N. benthamiana gPDS2 and gRDR6 targets, respectively, or the three guide RNAs are expressed in tandem targeting the gPDS2, gRDR6, and gSGS3 targets, respectively.
In some embodiments, the infectious TSWV vectors express an adenine base editor polypeptide and a guide RNA, forming an adenine base editor complex targeting the N. benthamiana gPDSa (SEQ ID NO: 92), or gFucTa1 (SEQ ID NO: 93), or gBBLd (SEQ ID NO: 97) target site sequences.
In some embodiments, infectious TSWV vectors express a cytosine base editor polypeptide and a guide RNA forming a base editor nuclease complex targeting the N. benthamiana gFucTa2 (SEQ ID NO: 94) or gRDR6a1 (SEQ ID NO: 95), or gRDR6a2 (SEQ ID NO: 96) target site sequences.
In another aspect, the present disclosure provides methods for generating plants from viral vector-infected plant cells with modified genetic material and for further selecting plants with modified genetic material. The plant cells in the present disclosure may be cells from an intact plant or isolated plant cells, such as callus tissue, suspension cells, and protoplasts. In some preferred embodiments, the plant cells are derived from infected intact plants.
In some embodiments, virus-infected plant cells or tissues are used as explants for tissue culture and plant regeneration on media without a selection marker (e.g., antibiotics, herbicides, etc.) to obtain plants with modified genetic material.
In some embodiments, the method further comprises selecting the plants with modified genetic material from the regenerated plants without using a selection maker (e.g., antibiotics, herbicides). In certain embodiments, the method of selecting plants with modified genetic material involves the use of restriction endonuclease protection analysis, Sanger sequencing analysis, and high-throughput sequencing (HTS) analysis to identify plants with genetic modifications.
In some embodiments, cells from the upper leaf tissues of N. benthamiana plants infected with TSWV vectors are cultured in vitro to regenerate plants with modified genetic material but without exogenous DNA.
In some embodiments, cells from the upper leaf tissues of tobacco plants infected with TSWV vectors are cultured in vitro to regenerate plants with modified genetic material but without exogenous DNA.
In some embodiments, cells from the upper leaf tissues of tomato plants infected with TSWV vectors are cultured in vitro to regenerate plants with modified genetic material but without exogenous DNA.
In some embodiments, the present disclosure further compares the effects of the TSWV vector with or without the NSs gene on plant cell differentiation and regeneration and finds that NSs has an inhibitory effect on plant regeneration. In some preferred embodiments, cells infected with the TSWV vectors deficient in NSs are used to regenerate plants with modified genetic material by tissue culture.
In another aspect of the present invention there is provided a method for generating virus-free plants from cells infected with a tospovirus by tissue culture, comprising: a) providing at least one plant cell infected with a tospovirus; b) in vitro culturing the infected plant cells on a medium containing an antiviral compound; c) obtaining and identifying virus-free regenerated plants.
In some embodiments, treatments with the antiviral compound ribavirin result in a 100% TSWV clearance rate, compared with a 4.4% clearance rate without treatments without an antiviral.
In some embodiments, treatments with the antiviral compound favipiravir result in a 66.7% TSWV clearance rate.
In some embodiments, treatments with the antiviral compound remdesivir result in a 15.6% TSWV clearance rate.
In another aspect of the present invention, there is provided a recombinant tospovirus vector system, comprising: a) a polynucleotide sequence of a viral vector of the tospovirus, and b) a heterologous polynucleotide sequence encoding at least one sequence-specific nucleic acid modifying enzyme being operably linked to a viral transcription regulatory cis-element, wherein the tospovirus vector is infectious and capable of directing transient expression of said sequence-specific nucleic acid modifying enzyme in a plant cell.
In some embodiments, the sequence-specific nucleic acid modifying enzyme is a nuclease, its mutated forms or derivatives capable of modifying genomic DNA sequence, i.e., CRISPR/Cas nucleases, TALEN nucleases, zinc finger nucleases, meganuclease. In certain embodiments, the nuclease is the Streptococcus pyogenes CRISPR/SpCas9. In certain embodiments, the nuclease is the Lachnospiraceae bacterium CRISPR/LbCas12a (LbCpf1).
In some embodiments, the sequence-specific nucleic acid modifying enzyme is an adenine base editor. In some embodiments, the sequence-specific nucleic acid modifying enzyme is a cytidine base editor.
In some embodiments, the recombinant tospovirus vector system is the TSWV vector comprising the S, M, and L genome segments having at least 70%, at least 80%, at least 90%, at least 95%, at least 99% or 100% sequence homologies to the nucleic acid sequences as set forth in SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3, respectively.
In some embodiments, the TSWV S segment comprises at least one heterologous polynucleotide sequence encoding a sequence-specific nuclease component being operably linked to viral mRNA transcription regulatory cis-elements; preferably, the nuclease component is a chimeric CRISPR guide RNA polynucleotide molecule.
In some embodiments, the TSWV non-structural protein gene in the S segment (NSs) is replaced by heterologous polynucleotide sequences encoding a fusion of the sequence-specific nuclease component to heterologous sequences encoding, for example, a fluorescent reporter. In some embodiments, the sequence-specific nuclease component is a guide RNA. In certain embodiments, the heterologous polynucleotide sequence inserted into said S genome encodes one guide RNA. In certain embodiments, the heterologous polynucleotide sequence encodes two or more guide RNAs.
In some embodiments, the TSWV M segment comprises at least one heterologous polynucleotide sequence operably linked to viral transcription regulatory elements and encoding a Cas polypeptide, optionally selected from the group consisting of Streptococcus pyogenes SpCas9, Lachnospiraceae bacterium LbCas12a (LbCpf1), or an adenine base editor or cytosine base editor comprising a Cas9 variant fused to a base deaminase.
In some preferred embodiments, the coding sequences of LbCas12a, SpCas9, adenine base editor, and cytosine base editor polypeptides are plant codon-optimized sequences; preferably, as set forth in SEQ ID NO: 10, 11, 84, 85, respectively.
In some embodiments, the viral glycoprotein precursor (GP) gene sequence in the TSWV M segment is replaced by a heterologous polynucleotide sequence encoding a sequence-specific nuclease component.
In some embodiments, the viral RNA-dependent RNA polymerase (RdRp) gene sequence in the TSWV L segment is plant codon-optimized as set forth in SEQ ID NO: 5.
In some embodiments, the tospovirus vectors are viral nucleic acid constructs capable of initiating infection upon introduction into a plant; preferably, the viral constructs are Agrobacterium binary plasmids containing transcriptional regulatory cis-elements, including a promoter and a terminator, and the constructs are delivered into plant tissue through agroinfiltration.
In some embodiments, the viral nucleic acid constructs being introduced into the plant tissues further comprise helper constructs for the expression of one or more VSRs. In certain embodiments, the VSR is selected from the group consisting of TBSV p19, TEV P1/Hc-Pro, BSMV γb, or TSWV NSs.
In some embodiments, the binary constructs produce the positive- or negative-sense viral RNA transcripts corresponding to the chimeric sequence of the recombinant TSWV S, M, and L segments. In certain preferred embodiments, the binary constructs produce positive-sense RNA transcripts.
In some embodiments, the polynucleotide constructs comprising the TSWV L, M, and S segments are delivered into a plant cell in relative molar ratios of 1-2 (L):2-10 (M):1-2 (S), preferably in a relative ration of 1:10:2. In certain embodiments, the ratios of the nucleic acid constructs are the ratios of the relative concentration (OD600) of Agrobacterium strains carrying different binary plasmids.
In some embodiments, the method for infecting a plant with a TSWV vector is to use recombinant virus particles through mechanical rubbing, pressurized spraying, plant grafting.
In some embodiments, the tospovirus vector is an infectious TSWV vector devoid of a functional NSs protein. In some embodiments, the TSWV vector is non-insect-transmissible tospovirus vector devoid of a functional GP protein.
In another aspect of the present invention, there is also provided a bacterial strain or recombinant viral particle comprising the recombinant tospovirus vector system.
In another aspect, the disclosure also provides a kit of the recombinant tospovirus vector systems, wherein the reagents containing the viral vectors may be provided individually or in combination, including: a) a composition comprising nucleic acid constructs comprising the polynucleotide sequence of the TSWV S, M and L segments and at least one heterologous polynucleotide sequence encoding a sequence-specific nuclease; (b) a helper expression vector encoding one or VSRs selected from the group consisting of TBSV p19, TEV P1/Hc-Pro, BSMV γb, or TSWV NSs. The kit further comprises a user manual or instructions.
In another aspect, the present disclosure also relates to the application of recombinant tospovirus vector systems, strains or recombinant viruses, kits, or methods as described above to the editing of plant genomes.
In another aspect, the present disclosure also provides genetically modified plants and their offspring produced by using the recombinant tospovirus vector system described above or produced according to the method described above.
Before explaining in detail at least one embodiment of the present invention, it should be understood that the present invention is not necessarily limited in its application to the details shown in the following specification or exemplified by the embodiments. The present invention may have other embodiments or may be implemented or accomplished in various ways. It is also to be understood that the phrases and terms used herein are for descriptive purposes and are not to be construed as limiting.
The following definitions and methods are provided to define the present disclosure better and to guide those of ordinary skill in the art in the practice of the present disclosure. Unless specified otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by those skilled in the field. Moreover, the terms and laboratory procedures related to protein and nucleic acid chemistry, molecular biology, cell and tissue culture, microbiology, and virology used herein are widely used terms and routine procedures in the corresponding fields. For example, the standard recombinant DNA and molecular cloning techniques used in the present invention are well-known to those skilled in the art and are more comprehensively described in the following literature: Michael R. Green and Joseph Sambrook, Molecular Cloning: A Laboratory Manual (4th edition); Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 2012. Also, for a better understanding of the present invention, definitions, and explanations of the relevant terms are provided below.
The terms “contains”, “includes”, “has” and their synonyms as used herein refer to “including but not limited to”. The scope of this terminology includes the scope of the terminology “composed of” and “essentially composed of”.
The phrase “essentially composed of” means that the composition and method may include other components and/or steps, as long as the other components and/or steps do not substantially alter the basis and novel features of the required composition or method.
As used herein, the singular forms “one” and “the” include plural referents unless the context clearly indicates otherwise. For example, the terms “one compound” or “at least one compound” may include a number of compounds, including mixtures thereof.
As used herein, the term “plant” refers to whole plants, any progeny, and the parts that make up the plant. As used herein, “plant parts” include but are not limited to plant cells, plant cell cultures, plant tissues, plant tissue cultures, plant cuttings, plant organs (such as fruits, seeds, embryos, meristematic regions, callus tissue, leaves, roots, shoots, flowers, gametophytes, sporophytes, pollen, and microspores). Plant cells can be individual cells or cell aggregates (e.g., callus tissues and cultured cells), or they can be protoplasts, gametophyte-producing cells, or cells or collections of cells capable of regenerating into complete plants. The plant cell cultures or tissue cultures can regenerate plants, and the regenerated plants have the physiological or morphological characteristics and substantially the same genotype as the plant from which the cell or tissue originated. The regenerable cells in the plant cell cultures or tissue cultures may be embryos, protoplasts, meristematic cells, callus tissues, pollens, leaves, anthers, roots, root tips, filaments, flowers, kernels, spikes, rachises, husks, or stems. Plant parts include harvestable parts and parts that can be used to propagate offspring plants. Plant parts that can be used for propagation include, but are not limited to, seeds, fruits, cuttings, seedlings, tubers, and rootstocks. A harvestable part of a plant may be any useful part of a plant, including, but not limited to, flowers, pollen, seedlings, tubers, leaves, stems, fruits, seeds, and roots.
The term “plant cell” refers to the structural and physiological unit of a plant. As used herein, plant cells include protoplasts and protoplasts with partial cell walls. Plant cells can be in the form of isolated individual cells or cell aggregates (e.g., loose callus tissues and cultured cells) and can be part of tissue units (e.g., plant tissues, plant organs, and plants). Thus, a plant cell may be a protoplast, a cell-producing gamete, or a cell or collection of cells capable of regenerating into a complete plant. Therefore, in the embodiments herein, a seed containing a plurality of plant cells and capable of regenerating into a whole plant is considered a “plant part”. Plant “progeny” includes any subsequent generations of plants.
The terms “variety”, “line”, “cultivar”, “germplasm” and “genotype” are used interchangeably in the present invention to refer to plant species that taxonomically belong to the same species and whose genetic combinations and phenotypes have stable characteristics that distinguish them from other plant kinds.
“Gene” is the entire nucleotide sequence required to produce a polypeptide chain or functional RNA.
The term “genome” when used concerning plant cells, covers not only chromosomal DNA present in the nucleus but also organelle DNA present in subcellular components of the cell (e.g., mitochondria, plastids). “Polynucleotides”, “nucleic acid sequence”, “nucleotide sequence” or “nucleic acid fragment” are used interchangeably and refer to a single- or double-stranded RNA or DNA polymer and may optionally contain synthetic, non-natural, or altered nucleotide bases. Nucleotides are referred to by their individual letter names as follows: “A” for adenosine or deoxyadenosine (corresponding to RNA or DNA, respectively), “C” for cytidine or deoxycytidine, “G” denotes guanosine or deoxyguanosine, “U” denotes uridine, “T” denotes thymine, “I” denotes inosine, and “R” indicates purine (A or G), “Y” indicates pyrimidine (C or T), “K” indicates G or T, “H” denotes A or C or T, and “N” denotes any nucleotide.
The term “genetic material” is the DNA that transmits genetic information between the parent and the offspring.
The term “sequence” refers to a nucleotide sequence of any length, which may be DNA or RNA; it may be straight-stranded, circular, or branched, and may be single- or double-stranded.
The term “expression” refers to the conversion of sequence information into the corresponding expression product, including direct transcription products (e.g., mRNA, tRNA, rRNA, antisense RNA, nuclease, structural RNA, or any other type of RNA) or proteins produced by translation of mRNA.
Expression products also include RNA modified by, for example, cap addition, polyadenylation, methylation, and proteins modified by, for example, methylation, acetylation, phosphorylation, ubiquitination, ADP-ribosylation, myristoylation, and glycosylation.
The term “encoding” refers to a DNA polynucleotide sequence that can be transcribed into RNA that translates a protein (mRNA) or a DNA polynucleotide sequence that can be transcribed into RNA that does not translate a protein (non-coding RNA such as tRNA, rRNA, etc.); or an RNA polynucleotide sequence that can be translated into a protein.
The terms “polypeptide”, “peptide”, and “protein” are used interchangeably in the present invention to refer to polymers of amino acid residues.
The term “fusion protein” refers to a polypeptide chain resulting from the expression of two or more genes encoding polypeptides linked together, which may be connected by a specific linker peptide.
“Codon-optimization” refers to replacing one or more codons in the encoding sequence with synonymous codons to enhance the expression of the target protein in a specific host cell. Each organism shows some degree of codon utilization difference or preference, and to improve the expression level of the gene in the target organism, replacing underutilized or rare codons in the target gene with codons preferred by the target organism without changing amino acids. In addition, codon-optimization includes inactivation of suspicious polyadenylation sites, exon-intron splicing sites, sequences that can form secondary structures, and balancing GC content to improve gene expression at the transcriptional level. “Codon-optimization” is a conventional means of gene design, and there are published patents and public computer programs in the field for optimizing codons of genes intended for expression in specific plant species, for example, codon optimization software available from several commercial nucleic acid synthesis companies, for example, codon usage database available at www.kazusa.orjp/codon/, “Method for achieving improved polypeptide expression” (U.S. Pat. No. 8,812,247), and “Method and device for optimizing a nucleotide sequence for the purpose of expression of a protein” (U.S. Pat. No. 8,224,578), Murray et al. (Nucleic Acids Res 1989, 17:477-497) and Nakamura et al. (Nucleic Acids Res, 2000, 28: 292).
The percentage of “identity” describes the extent to which a polynucleotide or peptide segment is identical in a sequence (e.g., nucleotide sequence or amino acid sequence) alignment. Sequence alignment is generated by manually comparing two sequences, such as the sequence shown herein as a reference and another sequence, to produce the highest number of matching elements, such as individual nucleotides or amino acids, while allowing gaps to be introduced into either sequence. The “identity score” of a sequence compared to a reference sequence is the number of matching elements divided by the entire length of the reference sequence, excluding gaps introduced into the reference sequence by the matching process. As used herein, the “percent identity” (“% identity”) is the identity score multiplied by 100.
As used herein, “recombinant” nucleic acids refer to non-naturally occurring nucleic acids, such as two otherwise separate nucleic acid fragments joined together by artificial intervention to produce a combination of nucleic acid fragments with the intended function. This artificial combination is often accomplished by chemical synthesis or by artificial manipulation of separate segments of the nucleic acid (e.g., by genetic engineering techniques).
As used herein, the term “chimeric sequence” as applied to a nucleic acid or polypeptide refers to the combination of two or more sequences of different origins into a single recombinant sequence by manual intervention or a regulatory sequence and nucleotide sequence of interest of the same origin but arranged in a manner different from that typically found in nature.
The terms “heterologous” and “exogenous” are used interchangeably herein and refer to polynucleotide or polypeptide sequences that are not naturally present in a nucleic acid or protein. Heterologous nucleic acid sequences may be linked to naturally occurring nucleic acid sequences (or variants thereof) (e.g., by means of genetic engineering) to produce chimeric nucleic acid sequences.
The term “vector” refers to a self-replicating nucleic acid molecule, such as a plasmid, phage, virus, or cosmid, that is transferred to a recipient cell by genetic engineering techniques.
The terms “nucleic acid construct”, “recombinant vector”, “recombinant nucleic acid construct” or “recombinant expression vector” are used interchangeably herein and refer to DNA molecules comprising a vector and at least one insert, and also include functional RNA molecules produced by transcription of these constructs, or RNA equivalents synthesized chemically according to the sequence of the nucleic acid construct. Nucleic acid constructs are typically recombinant expression vectors generated for the purpose of expressing and/or propagating the insert or for the purpose of constructing other recombinant nucleic acid sequences. The inserted DNA molecule may or may not be operably linked to a promoter sequence and may or may not be operably linked to a DNA regulatory sequence.
The term “regulatory sequence” refers to a specific nucleic acid sequence present in the internal or collateral (5′ or 3′ end) sequence of a coding sequence that regulates transcription, RNA processing or, stability, or translation. Regulatory sequences include but are not limited to, promoters, enhancers, introns, transcriptional regulatory elements, polyadenylation signals, RNA processing signals, translation enhancers, etc.
The term “operably linked” means that the regulatory element, including but not limited to the promoter sequence and transcription termination sequence, is linked to a nucleic acid sequence (e.g., a coding sequence or open reading frame) such that the transcription and translation of the nucleotide sequence are controlled and regulated by said regulatory element.
As used in the present invention, a “promoter” is a DNA regulatory region capable of binding RNA polymerase and initiating transcription of a downstream (3′ direction) coding or non-coding sequence, referring to a nucleic acid fragment capable of controlling the transcription of another nucleic acid fragment. In certain embodiments, the promoter is a promoter capable of controlling downstream gene transcription in a plant cell, whether or not it is of plant cell or viral origin. The promoter may be a constitutive promoter, a tissue-specific promoter, a developmentally regulated promoter, or an inducible promoter.
“Introduced” into a plant as used herein means to transform a nucleic acid molecule (e.g., plasmid, linear nucleic acid fragment, RNA, etc.) or protein into a plant cell by methods including, but not limited to, Agrobacterium-mediated transformation, biolistic transformation, electroporation, PEG transformation, etc., enabling said nucleic acid or protein to function in the plant cell.
“Agroinfiltration” refers to a method in plant biology for the induction of gene expression in plants by injecting a suspension of Agrobacterium tumefaciens into plant tissues, in which Agrobacterium present in the intercellular space transfer target genes located within the binary plasmid transfer DNA (T-DNA) to plant cells to obtain transient expression. Agrobacterium infiltration is applicable to a wide range of plants but is most effective in Nicotiana benthamiana and Nicotiana tabacum.
“Agroinfection” or “agroinfiltration” refers to a method of using Agrobacterium tumefaciens to inoculate infection factors (such as viruses or viroids) into plants. Viral DNA or cDNA sequences are cloned into the T-DNA region of Agrobacterium tumefaciens binary plasmids and transferred into plant cells through Agrobacterium tumefaciens infection. Replication of viruses or viroids within cells usually leads to systemic infection and the generation of symptoms. The term “transient expression” refers to the introduction of exogenous nucleic acid sequences into plant cells, tissues, organs, or individuals through agroinfiltration, biolistic transformation, viral vector infection, PEG-mediated protoplast transformation, etc. The exogenous nucleic acid sequences may or may not be integrated into plant chromosomes, and the exogenous nucleic acid fragments may be maintained at the introduced sites for expression for a period of time.
The term “virus-infectious clone” refers to a nucleic acid construct or plasmid containing a viral sequence that can rescue an active recombinant virus when introduced into a suitable host cell.
The term “viral rescue” refers to the process of introducing a viral infectious DNA clone or cDNA clone into a suitable host cell and producing an infectious virus.
For this document, “recombinant virus” refers to a virus generated using recombinant DNA technology. Unlike naturally occurring wild-type viruses, the sequence of a recombinant virus can be artificially modified through recombinant DNA technology.
The terms “viral vector”, “recombinant viral vector”, “recombinant viral expression vector”, and “recombinant viral vector system” are used interchangeably herein and refer to viruses that have been genetically engineered in an experimental setting for the purpose of introducing a heterologous nucleic acid sequence into a host cell, tissue, organ or individual organism.
In some embodiments, the viral vector system may be a combination of one or more recombinant nucleic acid constructs or recombinant viral vectors, which can replicate, transcribe, and express heterologous sequences when introduced into host cells. In some embodiments, the viral vector system may be a combination of strains comprising different nucleic acid constructs or viral vectors, respectively, preferably said strains being Agrobacterium; in certain embodiments, said viral vector system may also be a genetically engineered viral particle.
The term “virion” refers to an infective viral structural form containing a genomic nucleic acid core and a protein capsid, with or without an outer envelope. For the purposes of this paper, “viral particle” includes a virus nucleocapsid particle without an envelope structure and with infective activity.
The viral “positive-strand” RNA and “negative-strand” RNA are relative to the polarity of messenger RNA (mRNA), with the “positive-strand” RNA having the same polarity as mRNA and the “negative-strand” RNA being complementary to mRNA. For negative-stranded RNA viruses, as the name suggests, the genomic RNA wrapped in the viral particle is the negative-stranded RNA, also called virion RNA (vRNA); conversely, the complementary positive-stranded RNA is called complementary RNA (vRNA) or antigenomic RNA (agRNA).
In the present invention, “genetic modification”, “gene modification”, “nucleic acid modification” and “gene editing” refer to modification of the genome of an organism by means of genetic engineering to alter the structure or composition of the genomic sequence, such as altering the sequence of a gene, including single or multiple deoxyribonucleotide deletions, substitutions, insertions, or a combination of thereof, or to produce heritable epigenetic modifications, such as DNA methylation, histone modifications, etc.
“Genetically modified plant” means a plant that has been genetically engineered to include the introduction of single or multiple deoxyribonucleotide insertions, deletions, substitutions, or combinations thereof into its genome, and in which the modified genetic sequence is stably inherited.
“Nucleic acid modifying enzyme” refers herein to an enzyme that specifically targets and alters the structure or composition of intracellular nucleic acid sequences, such as altering the sequence of a gene, including single or multiple deoxyribonucleotide deletions, substitutions, insertions, or combinations thereof, or producing heritable epigenetic modifications such as DNA methylation, histone modifications, etc.
“Sequence-specific endonuclease”, “endonuclease”, “nuclease”, and “nucleic acid modifying enzyme” are used interchangeably herein to refer to enzymes with activity for cleaving or modifying nucleic acids.
“Cas nuclease”, “CRISPR nuclease”, “CRISPR/Cas protein”, “Cas protein”, and “Cas effector protein” are used interchangeably herein and refer to a nuclease that includes a Cas protein or fragment thereof (e.g., a protein containing the active DNA cleavage domain of Cas and/or the guide RNA binding domain of Cas) and is an RNA-directed nuclease. Cas proteins are protein components of the CRISPR/Cas (clustered regularly interspaced short palindromic repeats and their associated systems) genome editing system that can target and cleave DNA target sequences to form DNA double- or single-stranded breaks under the guidance of a guide RNA.
The terms “guide RNA,” “gRNA,” and “sgRNA” are used interchangeably herein to refer to an RNA molecule that binds and directs nucleases to specific sites in the genome. A guide RNA comprises a nucleotide sequence sufficiently complementary to a target sequence to hybridize to that target sequence and a nucleotide sequence that binds to a Cas protein to form a CRISPR/Cas complex. In the CRISPR/Cas9 nuclease system, the guide RNA is typically an RNA binary complex molecule formed by hybridization of a partially complementary CRISPR RNA (crRNA) and trans-acting CRISPR RNA (tracrRNA), or a gRNA molecule formed by artificially ligating the two RNA molecules. In the CRISPR/Cas12a nuclease system, the guide RNA consists of crRNA only, without the tracrRNA molecule.
“Base editing” is a technique for editing specific bases of intracellular nucleic acid molecules. Sequence-specific nucleic acid modification enzymes deaminate single or multiple bases of intracellular target DNA or RNA molecules, and the cellular DNA replication and repair pathway converts the deaminated bases into other bases, such as cytosine to thymine or adenine to guanine.
A “base editor” is a reagent that specifically targets and modifies bases (e.g., A, C, G, T, or U) of a nucleotide (e.g., DNA or RNA) and causes the modified bases to be converted. Depending on its modified bases, the reagents can be classified as adenine base editor (ABE), cytosine base editor (CBE), etc. The “base editor” usually consists of Cas9 nickase (nCas9) or catalytically inactive dead Cas9 (dCas9) fused with base deaminase and other effector proteins, which can be directed by guide RNA to a target sequence. The deaminated bases are converted to other bases by the cellular DNA replication and repair pathway after “base editor” catalyzing target sequence deamination.
The term “Protospacer adjacent motif sequence (PAM)” refers to a nucleic acid sequence in the region of a DNA target sequence that is recognized by the corresponding Cas nuclease. PAM is essential for the cleavage of the target sequence by the guide RNA-directed Cas nuclease.
As used herein, “target DNA” is a DNA polynucleotide containing a “target site” or “target sequence”. The terms “target site”, “target sequence”, and “protospacer DNA” are used interchangeably herein and, in the context of the present invention, refer to a segment of target DNA containing a nucleic acid sequence that binds to a guide RNA.
The present invention discloses a method for delivering sequence-specific nucleic acid modifying enzymes using a broad host spectrum plant RNA virus vector for plant gene editing, more specifically, including, but not limited to, a method for delivering genome editing elements and modifying plant genetic material through the viral vector system. An engineered infectious viral vector is used to carry a nucleic acid sequence that expresses the sequence-specific nucleic acid modifying enzyme intracellularly after infecting a plant. The nucleic acid modification enzymes target and cleave the target site to produce DNA double-strand breaks or modify specific sites to create base modifications in the plant genome. Then, cellular DNA repair machinery completes the modification of the plant's cellular genetic material, producing one or more types of base deletion, insertion, substitution, or a combination thereof.
Sequence-specific nucleases commonly used for genome editing include meganucleases, zinc finger nucleases, transcription-like activator effector nucleases, and CRISPR/Cas nucleases. The meganucleases, zinc finger nuclease, and transcription activator-like effector nuclease systems specifically target DNA sites through the DNA binding domain of the nuclease protein. The CRISPR/Cas nuclease system relies on a short guide RNA to guide the Cas protein to recognize the target sequence, thus eliminating the need to assemble multiple DNA binding units and requiring only the introduction of the Cas protein and a customized guide RNA into the cell. The CRISPR/Cas nuclease system is the most broadly used genome editing tool.
CRISPR/Cas systems have been classified into two major classes and six types (I-VI) based on the number and function of Cas genes, among which the most facile gene editing tools are Class 2 type II and V systems that require only a single Cas effector protein. Type II CRISPR systems include CRISPR/Cas9 systems such as the Streptococcus pyogenes CRISPR/SpCas9 system and the Staphylococcus aureus CRISPR/SaCas9 system, which consists of the Cas9 protein, crRNA, and tracrRNA. The crRNA hybridizes with a portion of the tracrRNA complementary sequence to form a double-stranded RNA, and the structure is cleaved by endogenous double-stranded RNA enzymes to help the maturation of crRNA processing; meanwhile, crRNA hybridizes with tracrRNA to form a binary complex that binds to the Cas9 protein and directs the latter to target DNA sequences that are paired with the crRNA proto-spacer sequence (protospacer). The CRISPR/Cas9 system engineered for gene editing generally employs a single guide RNA (sgRNA or gRNA) formed by joining the crRNA and tracrRNA together, which contains a sequence structure that binds to Cas9 and a protospacer sequence that pairs with the target DNA. Type V CRISPR/Cas systems include CRISPR/Cas12a (also known as Cpf1), such as Acidaminococcus CRISPR/AsCas12a, Lachnospiraceae bacterium CRISPR/LbCas12a, and Francisella novicida CRISPR/FnCas12a, and other homologous proteins exhibit RNA-directed DNA endonuclease activity. The CRISPR/Cas12a nuclease system requires only a single guide RNA, crRNA, and does not involve an tracrRNA. The crRNA contains sequences that bind to Cas12a, as well as protospacer sequences that pair with the target DNA. In addition, Cas12a has its pre-crRNA processing activity. It can cleave transcripts containing an array of multiple crRNAs to generate multiple guide RNAs, making it easier to achieve precise processing of crRNAs and simultaneous expression of multiple crRNAs for multiplexed gene editing (Fonfara et al., Nature, 2016, 532: 517-521; Zetsche et al. Nat Biotechnol, 2017, 35: 31-34).
Methods for designing CRISPR/Cas9 or CRISPR/Cas12a nucleases for gene editing are publicly available (Jinek et al., Science, 337: 816-821; Cong et al., Science, 2013, 339: 819-823; Zetsche et al., Cell, 2015, 63: 759-771), and SpCas9 and LbCas12a amino acid sequences are available in the SwissProt database search (accession numbers Q99ZW2 and 5ID6_A). In short, the guide RNA sequence determines the targeting specificity, so only the targeting sequence (original spacer sequence) on the gRNA needs to be changed to alter the gene target of the Cas nuclease. Usually, Cas proteins are fused at one or both ends with one or more specific protein localization signals, such as the nuclear localization signal (NLS), which is sufficient to translocate the Cas protein and the guide RNA complex to a specific organelle (e.g., the nucleus) to perform its function. When Cas protein and engineered guide RNA are co-introduced or co-expressed within the cell, Cas protein and guide RNA form a complex and is directed by the guide RNA to bind to a genomic target site, while the Cas protein cleaves one or both DNA strands of the target sequence. Broken DNA strands can be repaired by the intracellular non-homologous end joining (NHEJ) repair machinery, which directly rejoins the double-stranded split ends, but this repair mechanism is prone to base deletions, insertions or substitutions at the break junction.
If an exogenous DNA recombination template is provided to the cell in which a double-stranded DNA break occurs for directing the repair of the broken DNA ends by the mechanism of homologous recombination (HR), desired sequence changes can be introduced into the genome through the repair template. The recombination template can be linear or circular single- or double-stranded DNA containing sequences homologous to both sides of the genomic target sequence being cleaved by the nuclease (homologous arms). The length of the homologous arms can be 1, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 nucleotides or longer. When the genomic target site DNA is broken due to nuclease cleavage, the free DNA ends can be used as a template for DNA synthesis mediated by homologous sequences from exogenously provided DNA, resulting in the incorporation of a predetermined exogenous sequence at the target site.
In some embodiments, the present invention employs the CRISPR/LbCas12a nuclease for plant gene editing; in other embodiments, the CRISPR/SpCas9 nuclease is employed for plant gene editing.
In addition, the Cas fusion protein may also contain one or more heterologous effector protein structural domains, ideally with the two structural domains connected by a linker peptide sequence. Protein structural domains that can be fused to Cas proteins include but are not limited to, antigenic epitope tags (e.g., HA, His, Flag, myc), reporter proteins (green fluorescent protein, β-glucosidase, luciferase), DNA methylation enzymes, transcriptional activation domains, transcriptional repression domains, histone modifying enzymes, DNA topoisomerases, DNA recombinases, cytosine deaminases, uracil DNA glycosylase inhibitor, adenine deaminase, reverse transcriptase, etc. The Cas endonuclease domains can be inactivated by mutations and still maintain the ability to form complexes with guide RNAs and target specific genomic DNA sequences. For example, when amino acid mutations are introduced into the active site of the RuvC or HNH endonuclease domain of the Cas9 protein, such as mutation of the tenth aspartate to alanine (D10A) or mutation of the 840th histidine to alanine (H840A), or mutation of the 854th asparagine to alanine (N854A), or mutation of the 863rd asparagine to alanine (N863A), these mutations produce nCas9 mutant proteins that can only cut one strand and become a nickase. If both catalytical sites of the RuvC and HNH domains are mutated, the resulting dCas9 mutant will lose the ability to cut DNA. dCas9 or nCas9 protein can be fused to the specific effector protein domains mentioned above to target specific genomic sequences under the direction of a guide RNA for DNA modification (cytosine methylation, adenine base editing, cytosine base editing, etc.), transcriptional regulation (activation, repression, histone modification, etc.), DNA recombination, DNA synthesis, or other applications (Anzalone et al., Nat Biotechnol., 2020, 38: 824-844).
The base editor contains nuclease-inactivated Cas proteins (e.g., Cas9 and Cpf1, etc.) and DNA-dependent base deaminases such as cytosine deaminase and adenine deaminase. Cytosine deaminases are naturally occurring or engineered deaminases such as human APOBEC3A (hA3A). Naturally occurring adenine deaminases often use RNA as a substrate to convert adenosine on single-stranded RNA to inosine (I) by deamination. Recently, DNA-dependent adenine deaminases capable of converting deoxyguanosine on single-stranded DNA to inosine (I) using single-stranded DNA as a substrate have been obtained through directed evolutionary approaches based on E. coli tRNA adenine deaminase A (ecTadA) (Gaudelli et al., Nature, 2017, 551: 464-471). The cytosine base editor (CBE) and adenine base editor (ABE) proteins can specifically target intracellular nucleic acid under the direction of a guide RNA and perform a deamination reaction on cytosine bases (C) or adenine bases (A) located within the target site without generating a double-stranded DNA break (DSB). After DNA replication and repair, deaminated cytosine bases or adenine bases are replaced with thymine (T) or guanine (G), respectively, thus forming a targeted substitution from C to T (cytosine base editing) or A to G (adenine base editing).
Methods for designing guide RNAs in ABE and CBE systems for performing base modifications have been published in the literature (Komor et al., Nature, 2016, 533: 420-424; Gaudelli et al., Nature, 2017, 551: 464-471). Briefly, the guide RNA sequence determines the targeting specificity so that the gene target of the base editor can be altered by simply changing the targeting sequence (protospacer sequence) in the gRNA. When the base editor and the guide RNA are co-introduced into the cell, the base editor protein can produce a base conversion within the active window of the target sequence.
In some embodiments, the present invention employs CBE for plant genome base editing; in some embodiments, ABE is used for plant genome base editing.
As described above, the present invention provides viral vector systems and methods for the delivery of CRISPR/Cas elements to plant cells.
Viral vectors are used to express one or more CRISPR elements within plant cells. For example, one Cas nuclease nucleic acid sequence, one or more gRNA nucleic acid sequences are inserted into different viral vectors and operably linked to viral transcriptional regulatory sequences; preferably, two or more CRISPR/Cas elements are inserted into the same viral vector, and operably linked to one or more transcriptional regulatory sequences. Cas nuclease and one or more gRNA nucleic acid sequences can be inserted at the same site on the viral genome and controlled by the same regulatory sequence; the Cas nuclease and one or more gRNA nucleic acid sequences can be inserted at different sites on the viral genome and controlled by different regulatory sequences.
More specifically, considering that most plant DNA viral vectors and positive-stranded RNA viral vectors can only package a limited length of exogenous fragments, making it difficult to express the larger size CRISPR/Cas nuclease genes (>4 kb), the present invention describes the engineering of a plant segmented, negative-stranded RNA virus, tomato spotted wilt virus (TSWV), as a preferred viral vector for delivery of CRISPR/Cas. In certain embodiments, the TSWV vector may comprise one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) insertion sites, said insertion sites being upstream and/or downstream of the viral transcriptional regulatory elements. In a preferred embodiment, the Cas protein coding sequence contained within one genome segment of the recombinant TSWV vector is operably linked to the viral transcriptional regulatory sequence, and one or more gRNA coding sequences contained within the other genome segment are operably linked to the viral transcriptional regulatory sequence. Within TSWV-infected cells, viral transcriptional regulatory sequences drive the expression of Cas proteins and gRNAs; gRNAs direct Cas nucleases to target specific DNA sites in the genome, and Cas nucleases cleave one or both strands of the target sequence.
Tospoviruses refer to virus species taxonomically belonging to the genus Orthotospovirus, family Tospoviridae, order Bunyavirales in the phylum Negarnaviricota. Tospoviruses include Groundnut bud necrosis virus (GBNV), Groundnut ringspot virus (GRSV), Groundnut yellow spot virus (GYSV), Impatiens necrotic spot virus (INSV), tomato chlorotic spot virus (TCSV), Tomato spotted wilt virus (TSWV), Watermelon silver mottle virus (WSMoV), Zucchini lethal chlorosis virus (ZLCV), Groundnut chlorotic fan-spot virus (GCFSV), Watermelon bud necrosis virus, Melon yellow spot virus (MYSV), Iris yellow spot virus (IYSV), Chrysanthemum stem necrosis virus (CSNV), etc.
Tospoviruses have conserved morphological, biological, and molecular characteristics. The particles of tospoviruses are quasi-spherical in shape with a diameter of about 80-110 nanometers (nm) and are wrapped by a membrane whose outer layer consists of approximately 5 nm of glycoprotein spikes. Three single-stranded, negative-sense genomic RNAs of different lengths, i.e., L, M and S genomic fragments, with a total genome length of about 16,600 bases (nt), are wrapped inside the viral particle. Each genomic segment has a highly conserved nine bases at the 3′ end and is complementary to the 5′ end to form a panhandle structure, giving each genomic segment a pseudo-loop shape, and this structure plays an important role in viral transcription and replication (
Tospoviruses are transmitted by the insect vector thrips under natural conditions and affect a variety of food crops, vegetables, flowers, and other plants. Tospoviruses can readily infect their natural or experimental host plants by mechanical rubbing and grafting under laboratory conditions. Tomato spot wilt virus (TSWV) has a vast host range of more than 1090 monocotyledonous and dicotyledonous plants from more than 85 families, including Solanaceae, Cucurbitaceae, Asteraceae, Leguminosae, and Cruciferae (Parrella et al., J Plant Pathol, 2003, 85: 227-264), such as Abelmoschus esculentus, Ageratum houstonianum, Allium cepa, Amaranthus caudatus, Amaranthus hybridus, Amaranthus retroflexus, Amaranthus inosus, Amaranthus viridis, Amaryllis sp., Ananas comosus, Anemone, Apium graveolens var. dulce, Aquilegia vulgaris, Arabidopsis thaliana, Arachis hypogaea, Arctium lappa, Arum palaestinum, Begonia sp., Belamcanda chinensis, Beta vulgaris, Bidens pilosa, Brassica campestris, Brassica oleracea var. botrytis, Brassica pekinensis, Brassica rapa ssp. Chinensis, Calceolaria herbeohybrida, Calendula officinalis, Callistephus chinensis, Campanula glomerata, Campanula isophylla, Campanula latiloba, Campanula pyramidalis, Canavalia gladiata, Canavalia obtusifolia, Canavalia occidentalis, Capsella bursa-pastoris, Capsicum annuum, Capsicum frutescens, Capsicum Chinense, Capsicum Baccatum, Capsicum Pubescens, Carica papaya, Cassia occidentalis, Cassia tora, Catharanthus roseus, Centaurea cyanus, Cheiranthus cheiri, Chenopodium album, Chenopodium ambrosioides, Chenopodium amaranticolor, Chenopodium murale, Chrysanthemun coronarium, Chrysanthemun indicum, Chrysanthemun leucanthemum, Chrysanthemun morifolium, Cicer arietinum, Cichorium endiva, Cichorium intybus, Cineraria, Citrullus lanatus, Coffea arabica, Convolvulus arvensis, Conyza bonariensis, Cordyline terminalis, Coriandrum sativum, Coreopsis drummondii, Coronopus didymus (synonym Youngia japonica), Crotalaria incana, Crotalaria juncea, Crotalaria pallida (synonym Crotalaria mucronata), Cucumis melo, Cucumis sativus, Cucurbita maxima, Cucurbita moschata, Cucurbita pepo, Cyclamen persicum, Cynara scolymus, Dahlia pinnata, Dahlia variabilis, Datura stramonium, Delphinium sp., Desmodium triflorum, Desmodium unicinatum, Duboisia leichhardtii, Emilia sonchifolia, Foeniculum vulgare, Fragaria vesca, Galinsoga parviflora, Galinsoga quadriradiata, Gladiolus sp., Glycine max, Glycine soja, Gomphrena globosa, Gossypium hirsutum, Gossypium barbadense, Helianthus annuus, Hibiscus tiliaceus, Hippeastrum hybridum, Hippeastrum reginae, Hippeastrum rutilum, Hoya carnosa, Hydrangea macrophylla, Hyoscyamus niger, Impatiens holsii, Impatiens sultanii, Impatiens wallerana, Ipomea batatas, Lactuca sativa, Lathyrus odoratus, Lathyrus odoratus, Leonotis nepetaefolia, Limonium latifolium, Lupinus leucophyllus, Lupinus mutabilis, Lupinus polyphyllus, Lycium procissimum, Lycopersicon esculentum, Lycopersicon hirsutum, Lycopersicon pimpinellifolium, Malva parviflora, Matthiola incana, Medicago polymorpha, Melilotus officinalis, Nerium oleander, Nicandra physaloides, Nicotiana acuminata, Nicotiana bigelovii, Nicotiana clevelandii, Nicotiana glutinosa, Nicotiana rustica, Nicotiana sylvestris, Nicotiana tabacum, Papaver nudicaule, Pelargonium hortorum, Petunia hybrida, Phaseolus angularis, Phaseolus vulgaris, Phlox drummondii, Physalis angulate (synonym Physalis minima), Physalis spp., Pisum arvense, Pisum sativum, Plantago major, Plantago rugelii, Polygonum convolvulus, Portulaca oleracea, Primula sp., Primula malacoides, Primula sinensis, Ranunculus sp., Rubus idaeus, Rudbeckia amplexicaulis, Saintpaulia ionantha, Salpiglossis, Scabiosa, Schizanthus, Sechium edule, Senecio cruentus, Senecio jacobea, Sesamum indicum, Sinningia eciosa, Solanum capsicastrum, Solanum carolinense, Solanum laciniatum, Solanum mammosum Solanum melongena, Solanum nigrum, Solanum nodiflorum, Solanum seaforthianum, Solanum triflorum, Solanum tuberosum, Sonchus oleraceus, Spinacia oleracea, Stachys arvensis, Stapellia, Stellaria media, Streptosolen jamesonii, Tagetes patula, Taraxacum, Tephrosia purpurea, Tribulus terrestris, Trifolium repens, Trifolium subterraneum, Tropaeolum majus, Vallota, Verbena brasilliensis, Verbena litoralis, Verbesina enceloides, Vicia faba, Vigna mungo, Vigna radiata, Vigna unguiculata, Xanthium saccharatum, Zantedeschia aethopica, Zantedeschia albo-maculata, Zantedeschia elliottiana, Zantedeschia melanoleuca, Zantedeschia rehmannii, Zinnia elegans, etc.
As described above, one aspect of the present invention provides a TSWV expression vector system. As can be understood by one skilled in the art, genetic manipulation of viral genomes and viral vector development entail the availability of a virus infectious clone, i.e., viral (c)DNA clones that can produce recombinant viruses upon introduction into a suitable host cell. The process, also known as virus rescue, allows for the expression of viral or non-viral sequences carried by the virus in the host cell.
Methods for construction and applications of plant negative-stranded RNA virus infectious cDNA clones and viral expression vectors have been published in related articles or patents, such as those related to Sonchus yellow net virus (SYNV) (Wang et al., PLoS Pathog, 2015, 11: e 1005223; patent CN 105039388 B), Barley yellow striate mosaic virus (BYSMV) (Gao et al., New Phytol, 2019, 223: 2120-2133, patent CN 110511955 A), Tomato spotted wilt virus infectious clone system (Feng et al., PNAS, 2020, 117: 1181-1190). The process of rescuing a recombinant negative-stranded RNA virus from cloned DNA involves: 1) construction of a nucleic acid construct that transcriptionally generates viral genomic or antigenomic RNA and at least one nucleic acid construct that expresses the viral core protein; 2) co-introduction of the above nucleic acid constructs into a suitable host cell in the form of a circular plasmid, or a linear DNA fragment, or a nucleic acid fragment generated by in vitro transcription; 3) intracellular expression of viral RNA and protein that assemble to form an infectious unit to produce a recombinant virus. The above introduction methods are known in the art, such as Agrobacterium transformation, biolistic bombardment, mechanical inoculation with nucleic acid (plasmid DNA or in vitro transcribed RNA), PEG-mediated transformation, electroporation, microinjection, liposome transformation, or nanoparticle-mediated transformation.
After introducing the above nucleic acid constructs into suitable plant host cells, the plant RNA silencing response is an important limiting factor preventing their efficient expression and subsequent generation of recombinant viruses. As an important mechanism to defend against exogenous nucleic acid invasion, the plant RNA silencing response inhibits transient expression of the introduced nucleic acid (Johansen et al., Plant Physiol 2001, 126: 930-938) and also limits replication and infection of nascent recombinant viruses (Kasschau et al., Virology 2001, 285: 71-81). To improve transient expression efficiency and recombinant plant negative-strand RNA virus recovery, the strategy known in the art is to co-express viral suppressors of RNA silencing (Wang et al., PLoS Pathog, 2015, 11: e 1005223; Ma & Li, Viruses, 2020, 12: 1459). A large number of viral RNA silencing suppressors are known, including tomato bushy stunt virus p19, tobacco etch virus P1/Hc-Pro, barley stripe mosaic virus γb, TSWV NSs, turnip crinkle virus p38, flock house virus B2, influenza virus NS1, and Ebola virus VP35, etc. When co-expressed, these silencing suppressors have different modes of action and have a synergistic effect on promoting recombinant negative-stranded virus recovery (Ganesan et al., J Virol 2013, 87:10598-10611; Ma & Li, Viruses 2020, 12: 1459).
Methods for constructing TSWV vector systems and recombinant viral rescue technology are disclosed in relevant embodiments of the present invention. The TSWV genome sequences, i.e., its S, M, and L segment sequences, are disclosed in SEQ ID NO: 1, 2, and 3, respectively. It should be understood in the art that the TSWV genome sequences need not be 100% identical to the disclosed sequences as set forth in SEQ ID NO: 1, 2, 3. The TSWV genome sequences have at least 70%, at least 80%, at least 90%, at least 95%, at least 99% identity with the whole genome sequence disclosed herein; or the amino acid sequence of the RdRp protein encoded therein has at least 70%, at least 80%, at least 90%, at least 95%, at least 99% identity.
In order to generate nucleic acid constructs that can be transcribed to produce viral genomic RNA, in certain embodiments, TSWV S, M, and L genomic nucleic acid sequences are inserted into plant expression vectors and are operably ligated downstream of a promoter sequence. Numerous suitable plant expression vectors are known to those with ordinary skills in the art and are commercially available, such as pCambia, pGreen, pCASS, PBin19, pGD, pCB301, and others. Suitable promoters include constitutive or inducible promoters derived from plant endogenous genes or plant viruses, such as the actin promoter, the ubiquitin promoter, or the cauliflower mosaic virus (CaMV) 35S promoter, etc. A version of the CaMV 35S promoter (SEQ ID NO: 9), which produces viral RNAs with precise 5′ end, is used in certain embodiments.
In the TSWV nucleic acid constructs described above, the 3′ end of the viral sequence is operably attached to a self-cleaving hepatitis D virus ribozyme sequence so that the transcribed viral RNAs can be cleaved by the ribozyme to produce a faithful viral 3′ end sequence. The design of the hepatitis D virus ribozyme is described in papers (Sharmeen et al., 1998, J. Virol. 62, 2674-2679; Feng et al., Proc Natl Acad Sci USA, 2020, 2: 1181-1190) and a U.S. Pat. No. 5,225,337 (Ribozyme compositions and methods for use) and is within the knowledge of those skilled in the art.
In some embodiments, the TSWV L segment nucleic acid construct comprises an L antigenomic cDNA sequence being operably linked to a 35S promoter to transcribe the L antigenomic (positive-sense) RNA; the M segment nucleic acid construct comprises an M genomic or antigenomic cDNA sequence being operably linked to a 35S promoter to transcribe the M genomic (negative-sense) or antigenomic (positive-sense) RNA; the S segment nucleic acid construct contains the S antigenomic cDNA sequence being operably linked to the 35S promoter to transcribe the S antigenomic (positive-sense) RNA.
Rescuing negative-stranded viruses requires intracellular co-expression of viral core protein and viral RNA to assemble viral infectious ribonucleoprotein complexes. In some embodiments, the L and S antigenomic RNA transcripts (positive) can be used as translation templates to express the viral core proteins RdRp (or termed L) and N, respectively, which in turn package viral (anti)genomic RNAs to form a viral nucleocapsid that serves as a template for viral replication to produce viral progeny genome.
Those skilled in the art should understand that the L and S segment constructs can also be made to contain the L and S genomic cDNA sequences being operably ligated to the 35S promoter, but in this case, the L and S genomic RNA transcripts (negative sense) are unable to express the viral core protein and therefore require the RdRp and N provided from helper expression vectors to translate the viral core proteins that function to encapsidate the viral genomic RNA and initiate recombinant viral replication and infection.
The TSWV L genome nucleic acid construct is further modified in embodiments of the present invention by replacing the sequence encoding RdRp in the L genome (SEQ ID NO: 4) with a codon-optimized coding sequence. Preferably, the optimized RdRp sequence is shown in SEQ ID NO: 5. It is understood that gene sequence optimization is a routine operation in the art and different versions of optimized sequences can be obtained for the same gene depending on the optimization method and that the codon-optimized sequences provided in the present invention are only examples. It is conceivable that other optimized sequence versions may also be suitable to improve the expression of TSWV RdRp in plant cells and assist recombinant virus production.
In certain embodiments, the TSWV M genomic segment may be manipulated by recombinant DNA techniques, e.g., insertion of heterologous nucleic acid sequences or partial or complete deletion or replacement of the GP sequence (SEQ ID NO: 6). In some preferred embodiments, all or part of the GP coding sequence is replaced with a heterologous reporter gene nucleic acid sequence.
In certain embodiments, the TSWV S genomic segment may be manipulated by recombinant DNA technology, e.g., by insertion of a heterologous nucleic acid sequence upstream or downstream of the coding sequence of the NSs gene (SEQ ID NO: 7), or by partial or complete deletion or replacement of the NSs gene. In some preferred embodiments, all or part of the sequence of the NSs gene is replaced with a heterologous reporter gene nucleic acid sequence.
In certain embodiments, heterologous nucleic acid sequences contained in the TSWV M and S genome segments are operably linked to viral transcriptional regulatory cis-elements, for instance, the promoter sequences responsible for mRNA transcription within the non-coding regions of the M and S genomes and the cis-acting element sequence responsible for transcription termination within the viral intergenic region.
Further, the above recombinant TSWV S, M, and L genomic nucleic acid constructs are introduced into suitable plant cells under conditions sufficient to achieve recombinant virus rescue. To facilitate recombinant virus production, in certain embodiments, the nucleic acid constructs co-introduced into the cell further comprise helper expression vectors encoding one or more RNA silencing suppressors being operably linked to the promoter, such as the 35S promoter. Preferably, in some embodiments, the RNA silencing suppressor is a tomato bushy stunt virus p19, tobacco etch virus Hc-Pro, barley stripe mosaic virus γb, and TSWV NSs, or a combination of two or more of these suppressors. If desired, the nucleic acid constructs co-introduced into the cells may include TSWV N and RdRp expression vectors.
In some embodiments, the nucleic acid construct compositions introduced into plant cells comprise: a) nucleic acid constructs comprising the recombinant TSWV S, M, and L segment sequences; b) helper expression vectors encoding one or more viral suppressors of RNA silencing; and, if desired, c) one or more helper expression vectors encoding TSWV N and L proteins.
The method of “introduction into plant cells” described in this aspect is a conventional method in the art, such as Agrobacterium-mediated transformation, biolistic bombardment, pollen tube introduction, PEG-mediated fusion, electroporation, microinjection, ultrasonic transformation, liposome transformation, or nanoparticle-mediated transformation, etc., through which viral nucleic acid sequence-containing circular plasmids or linear DNA fragments, or in vitro transcribed viral RNA, etc. are introduced into suitable eukaryotic cells.
In certain embodiments, the method of introduction into plant cells is the Agrobacterium-mediated transformation method, wherein one or more nucleic acid constructs are transformed into suitable plant cells through agroinfiltration. The nucleic acid construct is a plasmid vector suitable for Agrobacterium-mediated transformation, i.e., a binary plasmid vector comprising a transfer DNA (T-DNA) region that is transferred to the plant cell during Agrobacterium infection of the plant. Nucleic acid constructs can also be introduced into cells by other means, such as biolistic transformation, leaf rubbing, PEG fusion, etc. In this case, the nucleic acid constructs can be introduced into the cell in the form of DNA, RNA, or ribonucleoprotein complexes.
In the preferred example of the present invention, the transferred T-DNA fragment within the plasmid is introduced into N. benthamiana cells by agroinfiltration of the leaf tissues to initiate viral transcription, expression, and generation of recombinant virus. N. benthamiana is one of the most suitable plants for Agrobacterium-mediated transformation, but a variety of other plants also have varying degrees of sensitivity to this technique.
In certain embodiments, the recombinant TSWV is devoid of functional GP and can infect plants but cannot be acquired by the insect vector thrips and be transmitted to other plants via the insects. Thus, the recombinant virus and method provided by the present invention is biologically containable.
In certain embodiments of the present invention, the recombinant TSWV lacks functional NSs, is still infective to plants, and is less virulent than the wild-type virus.
The TSWV vector may comprise one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) insertion sites upstream and/or downstream of viral transcriptional regulatory elements. In some embodiments, the recombinant TSWV M and S segments each carry a reporter gene that is operably linked to a viral transcriptional regulatory sequence. Preferably, the reporter genes are the green fluorescent protein gene (GFP) and the red fluorescent protein gene (RFP). After introducing the recombinant nucleic acid constructs into plant cells, replication and systemic infection of the recombinant virus can be tracked by the expressed reporters.
In another aspect, the present invention discloses a TSWV expression vector system for delivering sequence-specific nucleic acid modifying enzymes to plants and a method of constructing the vector system.
In certain embodiments, the recombinant TSWV M segment contains the coding sequence of one component of the nucleic acid modifying enzyme operably linked to the viral transcriptional regulatory sequence, while the S segment contains another component of the nucleic acid modifying enzyme operably linked to the viral transcriptional regulatory sequence. In TSWV-infected cells, viral transcriptional regulatory sequences drive the co-expression of the nucleic acid modifying enzyme components, forming a complex that targets specific DNA sites in the genome and produces modifications.
As described above, the TSWV vectors containing multiple insertion sites and recombinant viruses containing exogenous nucleic acid sequences in the M and/or S segment, respectively, or simultaneously, can systematically infect plants and express one or more exogenous sequences. Thus, the present invention provides viral expression vector systems that can express sequence-specific nucleic acid modifying enzymes.
More specifically, at least two polynucleotide sequences encoding the components of CRISPR/Cas nucleic acid modifying enzymes are inserted into the recombinant TSWV genome and operably linked to viral transcriptional cis-elements. When the recombinant TSWV vector is introduced into plant cells, the nucleic acid modifying enzyme is expressed.
In certain embodiments, the CRISPR/Cas nucleic acid modifying enzyme is the Lachnospiraceae bacterium CRISPR/LbCas12a nuclease.
In certain embodiments, the CRISPR/Cas nucleic acid modifying enzyme is the Streptococcus pyogenes CRISPR/SpCas9 nuclease.
In certain embodiments, the CRISPR/Cas nucleic acid modifying enzyme is an ABE based on the CRISPR/SpCas9 system.
In certain embodiments, said CRISPR/Cas nucleic acid modifying enzyme is a CBE based on the CRISPR/SpCas9 system.
To improve the efficiency of Cas effector protein expression in plant cells, provided in certain embodiments are plant codon-optimized coding sequences, such as the rice codon-optimized LbCas12a (as shown in SEQ ID NO: 10), the N. benthamiana codon-optimized SpCas9 (as shown in SEQ ID NO: 11), the N. benthamiana codon-optimized ABE (as shown in SEQ ID NO: 84), and the N. benthamiana codon-optimized CBE (as shown in SEQ ID NO:85). It will be appreciated by those with ordinary skill in the art that methods of codon-optimization for specific host species are feasible and known.
As shown by the sequences above, the Cas effector proteins of the present invention are fusion proteins containing nuclear localization sequences at their N-terminal and/or C-terminal ends to achieve sufficient accumulation of the CRISPR/Cas complex in the plant cell nucleus. In addition, depending on the DNA location to be edited, the Cas fusion protein may also include other localization sequences, such as cytoplasmic localization sequences, chloroplast localization sequences, mitochondrial localization sequences, etc. In some embodiments, the sequence encoding Cas12a protein replaces the GP coding sequence in the TSWV M genomic segment, and the expression of Cas12a protein is controlled by the viral transcriptional cis-elements. A heterologous nucleic acid sequence encoding a guide RNA (crRNA) for Cas12a is inserted upstream or downstream of the NSs open reading frame in the S genomic segment and controlled by the viral transcriptional cis-element. Preferably, the NSs coding sequence in the S genome is replaced by a reporter gene so that the recombinant virus infection can be tracked by the reporter. In certain embodiments of this aspect, the guide RNA transcript is processed by Cas12a nuclease to trim the terminal polynucleotide sequences derived from the virus.
In certain embodiments, the GP coding sequence in the M genomic segment is replaced by the Cas9 coding sequence, and the expression of the Cas9 protein is controlled by the M transcriptional cis-elements. The heterologous nucleic acid molecule encoding the Cas9 guide RNA is inserted upstream or downstream of the NSs coding sequence in the S genome, and the expression of the guide RNA is controlled by the S genome transcriptional cis-elements. Preferably, the NSs coding sequence is replaced by a reporter gene. In certain embodiments, the guide RNA transcript is trimmed by the Cas9 nuclease and endogenous RNA processing machinery to remove the terminal polynucleotide sequences derived from the virus.
In certain embodiments, the GP coding sequence in the M genomic segment is replaced by the ABE coding sequence, and the expression of the ABE protein is controlled by the M transcriptional cis-elements. The heterologous nucleic acid molecule encoding the guide RNA is inserted upstream or downstream of the NSs coding sequence in the S genome, and the expression of the guide RNA is controlled by the S genome transcriptional cis-elements. Preferably, the NSs coding sequence is replaced by a reporter gene.
In certain embodiments, the GP coding sequence in the M genomic segment is replaced by the CBE coding sequence, and the expression of the CBE protein is controlled by the M transcriptional cis-elements. The heterologous nucleic acid molecule encoding the guide RNA is inserted upstream or downstream of the NSs coding sequence in the S genome, and the expression of the guide RNA is controlled by the S genome transcriptional cis-elements. Preferably, the NSs coding sequence is replaced by a reporter gene.
In certain embodiments, the recombinant TSWV vector contains two heterologous nucleic acid sequences encoding a Cas effector protein and a guide RNA (crRNA or gRNA) inserted in the M and S segments, respectively. Prior solutions in the art have shown that nucleic acid sequences encoding Cas effector proteins and gRNA (or crRNA) elements can be expressed in tandem with a single RNA transcript (Campa et al., Nat Methods, 2019, 16: 887-893; Tang et al., Plant Biotechnol J, 2019, 17: 1431-1445). It is, therefore, conceivable that the two nucleic acid sequences encoding the Cas protein and the gRNA element can be joined to be expressed as a single nucleic acid sequence from a single genome segment of TSWV, such as the M, S, or L segment.
Further, in certain embodiments, the recombinant TSWV vector described above expresses the CRISPR/Cas nucleic acid modifying enzyme in plant cells. The aforementioned nucleic acid constructs containing the TSWV recombinant genomic segments and the helper expression vectors are introduced into suitable plant cells under conditions sufficient to rescue recombinant viruses.
In certain embodiments, the nucleic acid construct composition introduced into a plant cell comprises: a) nucleic acid constructs comprising recombinant TSWV S, M and L genomic sequences, wherein the S segment comprises one or more heterologous nucleic acid sequences encoding a CRISPR guide RNA, the M segment comprises one or more heterologous nucleic acid sequences encoding a Cas nuclease component, and the L genome comprises a codon-optimized viral RdRp sequence (SEQ ID NO: 5); b) helper expression vectors encoding one or more viral suppressors of RNA silencing; and, if desired, c) helper expression vector encoding the TSWV N and L proteins.
The method of “introduction into plant cells” described in the present invention is a conventional method in the field, e.g., by transforming a circular plasmid, linear DNA fragment, or in vitro transcribed RNA containing a viral nucleic acid sequence through Agrobacterium-mediated transformation, particle bombardment, pollen tube introduction, PEG-mediated fusion, electroporation, microinjection, laser transformation, ultrasonic transformation, liposome transformation, or nanoparticle-mediated transformation, and other methods to introduce into suitable eukaryotic cells. The method of introduction into plant cells described in certain embodiments of the present invention is the agroinfiltration method, whereby Agrobacterium tumefaciens carrying the above-mentioned nucleic acid constructs (binary vectors) were infiltrated into the leaf tissues of N. benthamiana. This introduces the DNA fragment within the binary plasmids into N. benthamiana cells to initiate transcription, expression, and generation of recombinant virus.
TSWV is a tri-segmented, negative-stranded RNA virus. When recovering recombinant TSWV from cloned DNAs, the S, M, and L segments need to be present simultaneously in the same cells in order to assemble active infection units and initiate recombinant virus infection. Due to the large differences in the length of the three genomic segments, their expression and assembly efficiencies may differ from one to the others. In addition, the insertion of large heterologous sequences, such as the 3.9 kb LbCas12a coding sequence, the 4.2 kb SpCas9 coding sequence, the 4.9 kb ABE coding sequence, or the 5.2 kb CBE coding sequence, can further affect the rescue efficiency of the recombinant genomic segments. Some embodiments in the present invention show that efficient generation of recombinant viruses requires optimal adjustment of the polarity (positive or negative sense) of the TSWV cDNA in the nucleic acid construct and the ratio of the three genomic nucleic acid constructs.
In certain embodiments, the recombinant L and S segment sequences are operably linked to the 35S promoter in the antigenomic cDNA orientation to produce positive sense viral RNAs that can serve as a template for translation of the N and L proteins and for genomic replication.
In certain embodiments, the recombinant M segment sequence comprising the LbCas12a coding sequence is operably linked to the 35S promoter in an antigenomic cDNA orientation to transcribe a positive sense RNA.
In certain embodiments, the recombinant M segment sequence comprising the SpCas9 coding sequence is operably linked to the 35S promoter in an antigenomic or genomic cDNA orientation, preferably in a genomic cDNA orientation, to transcribe a positive sense RNA.
In certain embodiments, the recombinant M genomic segment comprising the ABE or CBE heterologous coding sequence is operably linked to the 35S promoter in the antigenomic cDNA orientation to transcribe a positive-sense RNA.
When Agrobacterium-infiltration is used to introduce viral expression vectors, the ratio of each recombinant viral construct introduced into the plant can be altered by adjusting the concentration of the Agrobacterium cultures (showed as OD600) to achieve coordinated expression and efficient assembly of the three recombinant TSWV genome segments. It is understood that it is possible to rescue recombinant viruses from cloned viral cDNAs within a certain range of the ratios of viral constructs.
In certain embodiments, recovery of recombinant TSWV is achieved when agrobacterial cultures carrying the recombinant S genomic construct containing crRNA, the recombinant M genomic construct containing LbCas12a, and the recombinant L genomic construct are adjusted at the ratios of 2:2:1, or 1:2:1, or 1:10:2. Preferably, the most efficient virus recovery is achieved when the S:M:L ratio is set to 1:10:2.
In certain embodiments, efficient recovery of recombinant TSWV is achieved when agrobacterial cultures carrying the recombinant S genomic construct containing crRNA, the recombinant M genomic construct containing SpCas9, and the recombinant L genomic construct are adjusted at the ratio of 1:10:2.
In certain embodiments, efficient recovery of recombinant TSWV is achieved when agrobacterial cultures carrying the recombinant S genomic construct containing crRNA, the recombinant M genomic construct containing ABE or CEB, and the recombinant L genomic construct are adjusted at the ratio of 1:10:2.
Limited packaging capacity is the most significant impediment to the use of recombinant viral vectors to deliver sequence-specific nucleases. In some embodiments, the developed TSWV vector has multiple insertion sites and unusual carrying capacity. The TSWV vectors carrying CRISPR/Cas nuclease fragments longer than 5 kb can systematically infect plants and express the exogenous sequences. It is understood that TSWV can also express other CRISPR/Cas nucleases of similar or smaller sizes, including, but not limited to, SaCas9, ST11Cas9, ST12Cas9, St3Cas9, NmCas9, AsCas12a, FnCas12a, CasX, CasY, Cas4D nucleases, their mutants or derivatives, such as the high-fidelity Cas nucleases espCas9, HFCas9, Cas9 nickase (nCas9), dead Cas9 (dCas9), prime editor, etc. TSWV delivery of other small-sized nuclease, such as meganuclease (˜1 kb), zinc finger nucleases (˜2 kb), and transcription activator-like effector nucleases (˜3 kb), is also anticipated.
In some embodiments, The TSWV vector is used as a typical example of the genus Othortospovirus. It is understood that this methodology can be applied to other virus members in this genus or other phylogenetically related segmented plant negative-stranded RNA viruses having similar genomic structure and biological properties, including, but not limited to, Groundnut bud necrosis virus (GBNV), Groundnut ringspot virus (GRSV), Groundnut yellow spot virus (GYSV), Impatiens necrotic spot virus (INSV), Tomato chlorotic spot virus (TCSV), Watermelon silver mottle virus (WSMoV), Zucchini lethal chlorosis virus, ZLCV), Groundnut chlorotic fan-spot virus (GCFSV), Watermelon bud necrosis virus (WBNV), Melon yellow spot virus (MYSV), and Iris yellow spot virus (IYSV), and Chrysanthemum stem necrosis virus (CSNV).
In another aspect, the present invention provides a method for modifying plant genetic material using the TSWV vector to deliver a CRISPR/Cas nucleic acid modifying enzyme complex to plant cells, wherein the Cas effector protein directed by a guide RNA targets cleave or modify specific sequences of the plant genome, and the cellular DNA repair machinery completes the modification of the plant genetic material by producing one or more types of base deletion, insertion, substitution or a combination thereof.
In certain embodiments, the nuclease delivered by the TSWV vector is a CRISPR/LbCas12a nuclease; In certain embodiments, the nuclease is a CRISPR/SpCas9 nuclease.
In certain embodiments, the base editor delivered by the TSWV vector is an ABE; In certain embodiments, the base editor is a CBE.
Methods for designing and constructing a suitable CRISPR guide RNA based on a given target sequence are known to those with ordinary skills in the art. For example, the design of LbCas12a crRNA can be found in Fonfara et al. (Nature, 2016, 532: 517-521). In general, LbCas12a crRNA contains a protospacer sequence of about 23 nucleotides complementary to the target sequence and a direct repeat (DR) sequence that binds to the LbCas12a protein to form a complex, wherein the DR sequence is responsible for binding Cas12a to form the complex, and the spacer sequence targets the complementary target DNA. The target sequence has the following structure: 5′-TTTN-Nx-3′, where N is any one base of A, G, C or T, Nx denotes X consecutive nucleotides, X is an integer of 15≤X≤30, and TTTN is the protospacer adjacent motif (PAM) sequence of LbCas12a. In some specific embodiments of the present invention, X is 23.
In addition, Cas12a has its crRNA processing activity and can cleave transcripts containing multiple crRNA alignments to generate multiple guide RNAs, making it easier to achieve precise processing of crRNAs and simultaneous expression of multiple crRNAs for multiplexed gene editing. (Fonfara et al, Nature, 2016, 532: 517-521; Zetsche et al, Nat Biotechnol, 2017, 35: 31-34).
In certain embodiments, the guide RNA for Cas12a is a single crRNA, with a structure of “direct repeat (DR)-Spacer-direct repeat (DR)”. In these embodiments, the TSWV vectors express a primary transcript containing the “DR-Spacer-DR” guide RNA with virus-derived sequences at both ends, and the Cas12a protein binds to the DR sequence to process the virus-derived terminal polynucleotide through its nuclease activity to form a mature crRNA containing the “DR-Spacer” structure.
In some embodiments, the guide RNA for Cas12a is a tandem of multiple crRNAs, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10 or more crRNA sequences. Specifically, the tandem array of crRNA is expressed in the form of “DR-Spacer 1-DR-Spacer 2-DR . . . ”. In these embodiments, the TSWV vectors express a primary transcript containing a tandem repeat of crRNA with virus-derived sequences at both ends, and the Cas12a protein binds to the DR sequence to release two or more mature crRNA molecules through its nuclease activity. The resulting target gene editing efficiency is similar to that of the corresponding TSWV vector delivering a single crRNA.
In certain embodiments, the LbCas12a and crRNA complex delivered by TSWV vectors target the crPDS1 target sequence 5′-TTTCTGCTGTTAACTTGAGAGTCCAAG, which is conserved in the phytoene desaturase (PDS) gene in N. benthamiana and tobacco (SEQ ID NO: 12) (bold letters are PAM sequences, hereinafter), or the crPDS2 target sequence 5′-TTTCTGCTGTTAACTTGAGAGTCCAAG-3′ of the tomato SIPDS gene (SEQ ID NO: 13), or the crPDS3 target sequence 5′-TTTCTGCTGTCAACTTGAGAGTCCAAG-3′ (SEQ ID NO: 14) in the pepper PDS gene, or the crPDS4 target sequence 5′-TTTGACAGAAAACTGAAGAACACATAT-3′ (SEQ ID NO: 15) conserved in peanut and soybean PDS genes.
In certain embodiments, the LbCas12a and crRNA complex delivered by the TSWV vector targets to the crFucT target sequence 5′-TTTGGAACAGATTCCAATAAGAAGCCT-3′ (SEQ ID NO: 86), which is conserved in the N. benthamiana FucT genes, or the crDCL2 target sequence 5′-TTTGAGAAGCTATGCTTATCTTCTTCG-3′ (SEQ ID NO: 87) in the N. benthamiana DCL2, or the crER target sequence 5′-TTTAATGTTAAACCTTTAGTGTCCTTT-3′ (SEQ ID NO: 88) in the ground cherry ER gene, or the crPDS5 target sequence 5′-AATGCAATGTATATTGATTGCCTTAAA-3′ (SEQ ID NO: 98) conserved in the peanut PDS genes, or the crPDS6 target sequence 5′-TTTCTGCTGTCAACTTGAGAGTCCAAG-3′ (SEQ ID NO: 99) in the pepper PDS gene.
In certain embodiments, the TSWV vector delivers LbCas12a and multiple crRNAs to form multiple LbCas12a/crRNA complexes that can simultaneously target multiple genes for multiplexed gene editing, such as tandem expression of two crRNAs (crPDS1 & crFucT; SEQ ID NO: 135) targeting the N. benthamiana crPDS1 and crFucT loci, or tandem expression of three crRNAs (crPDS1&FucT&crDCL2; SEQ ID NO: 136) targeting the N. benthamiana crPDS, crFucT and crDCL2 loci. The editing efficiency generated by multi-gRNA TSWV vectors is similar to that of the corresponding single-gRNA vector.
The design of SpCas9 gRNA is described by Ran et al. (Nat Protoc, 2013, 11: 2281-2308). SpCas9 gRNA contains a protospacer sequence of approximately 20 nucleotides complementary to the target sequence, as well as a sequence that binds to the SpCas9 protein to form a complex. The target sequence has the following structure: 5′-Nx-NGG-3′, where N is any one base of A, G, C or T, Nx denotes X consecutive nucleotides, X is an integer of 14≤X≤30, and NGG is the PAM sequence of SpCas9. In certain embodiments of the present invention, X is 20.
When a gRNA is expressed by an RNA viral vector, the primary gRNA transcript typically contains a gRNA sequence embedded in virus-derived sequences at both ends. In the presence of co-expressed Cas9 that binds the gRNA sequence, the unbound virus-derived sequences can be trimmed off by cellular RNA processing enzymes to release one or more mature gRNA molecules (Mikami et al., Plant Cell Physiol, 2017, 58: 1857-1867; Ellison et al., Nat Plants, 2020, 6: 620-624).
In certain embodiments, the SpCa9 and gRNA complex delivered by the TSWV vector targets the gGFP target sequence 5′-GATACCCAGATCATATGAAGCGG-3′ (SEQ ID NO: 16) in the N. benthamiana GFP transgene, or the conserved gPDS1 target sequence 5′-GGACTCTTGCCAGCAATGCTTGG-3′ (SEQ ID NO: 17) in the N. benthamiana and common tobacco PDS genes.
In certain embodiments, the SpCas9 and gRNA complex delivered by the TSWV vector target the gPDS2 target sequence 5′-TTGGTAGTAGCGACTCCATGGGG-3′ (SEQ ID NO: 89), which is conserved in the N. benthamiana PDS gene, or the gRDR6 target sequence 5′-CCCCTCCCCTGACTCTTACCCAACT-3′ (SEQ ID NO: 90) in the RDR6 gene, or the gSGS3 target sequence 5′-ACAAGAGTGGAAGCAGTGCTGGG-3′ (SEQ ID NO: 91) in the SGS3 gene.
In certain embodiments, the TSWV vector delivers SpCas9 and multiple tandem gRNAs to form multiple SpCas9/gRNA complexes that can simultaneously target multiple genes for multiplex gene editing, such as tandem expression of two gRNAs (gPDS2 & gRDR6; SEQ ID NO: 137) targeting the gPDS2 and gRDR6 loci, or tandem expression of three gRNAs (gPDS2&gRDR6&gSGS3; SEQ ID NO: 138) targeting the gPDS2, gRDR6 and gSGS3 sites. The editing efficiency generated by multi-gRNA TSWV vectors is similar to that of the corresponding single-gRNA vector.
Designing gRNAs for the base editors (ABE and CBE, etc.) is similar to the design for the SpCas9 nuclease. In certain embodiments, the ABE and gRNA complexes delivered by the TSWV vector are designed to target the gPDSa target sequence 5′-TGCAAATTGAGTTGGGAGTGAGG-3′ (SEQ ID NO: 92) in the N. benthamiana PDSa gene, target sequence 5′-CCTCTTGAGAATGTTGCTATTGG-3′ (gFucTa1, SEQ ID NO: 93) from the FucTa gene of N. benthamiana, or selected from the target sequence 5′-GAAATCAGAGTAAGGTGCGGAGG-3′ (gBBLd, SEQ ID NO: 97) of the N. benthamiana berberine bridging enzyme-like nucleic acid (BBL) d gene.
In certain embodiments, the targets of the CBE and gRNA complexes delivered by the TSWV vectors are selected from the target sequence 5′-CACACCTTCCTCCAGGAGATAGG-3′ of the N. benthamiana alpha-1,3-fucosyltransferase a gene (FucTa) (gFucTa2. SEQ ID NO: 94), the target1 sequence 5′-CCAATATCTAGGCGTGAGGGATT-3′ (gRDR6a1, SEQ ID NO: 95) of the RNA-dependent RNA polymerase 6a gene (RDR6a) and the target2 sequence 5′-CCCGACTTCATGGGGAAGGAGGA-3′ (gRDR6a2, SEQ ID NO: 96).
The guide RNAs described above may target a single gene or multiple genes, such as 2, 3, 4, 5, 6, 7, 8, 9 or 10 and more. In certain embodiments, the guide RNA targets a single gene, such as Cas12a crRNA-targeted the tomato PDS gene, pepper PDS gene, soybean PDS gene, and Cas9 gRNA-targeted the GFP transgene. In certain embodiments, the guide RNA targets two genes, such as the N. benthamiana and tobacco PDSa and PDSb genes, the peanut PDSa and PDSb genes.
For modifying plant genetic material, the above-mentioned TSWV vectors containing Cas effector proteins and guide RNAs targeting different genomic loci can be introduced into plant cells, tissues, or organs by any method known in the art. For example, circular plasmids, linear DNA fragments, or in vitro transcribed RNA containing the nucleic acid sequences of recombinant viral vectors are introduced into suitable plant cells by agroinfection, biolistic bombardment, pollen tube introduction, PEG-mediated fusion, electroporation, microinjection, ultrasonic transformation, liposome transformation, or nanoparticle-mediated transformation. In certain embodiments, the method described for introducing viral vectors into plant cells is the Agrobacterium infection method, whereby Agrobacterium cultures carrying the above-mentioned nucleic acid construct (binary vectors) are infiltrated into benthamiana leaf tissues. The viral nucleic acid sequence introduced into the N. benthamiana cells directs the production of a recombinant virus that expresses the CRISPR nuclease complex in the inoculated leaves and upper uninoculated leaves, which targets and cleaves or modifies the target sequence and induces modification of the genetic material.
TSWV viral vectors can also be introduced into plant cells by any means of plant inoculation with virus particles. For example, plant saps containing recombinant virus particles are used to infect plants through mechanical rubbing, pressurized spraying, needle-pricking, syringe injection, vacuum infiltration, biolistic bombardment, or by grafting of an infected plant with a plant to be infected. In certain embodiments, the method of virus infection described is to inoculate N. benthamiana, tobacco, tomato, sweet pepper, chili pepper, habanero pepper, potato, ground cherry, lettuce, peanut, soybean, and cotton with N. benthamiana saps containing recombinant virus particles through mechanical inoculation and pressurized spraying. Recombinant virus particles introduced into plant cells initiate virus replication and infection and express CRISPR nuclease complexes in the inoculated leaves and/or systemic leaves, which cleave or modify target sequences and induce modifications of genetic material.
After a plant is infected with a viral vector, any method known in the art can be used to identify whether a modification of the target sequence has occurred. Examples include functional assays such as inactivation or activation of reporter gene expression or amplification of the target DNA sequence using polymerase chain reaction for Surveyor and T7EI nuclease assays, restriction endonuclease protection analysis, Sanger sequencing analysis, or high-throughput sequencing analysis.
In certain embodiments, methods for identifying genetic modifications are restriction endonuclease protection analysis and Sanger sequencing analysis. Depending on the target sequence, the frequencies of target gene editing generated by the nuclease expressed in the viral vector in the present invention can range from 45% to 98%.
In certain embodiments, the method for identifying genetic modifications is high-throughput sequencing analysis. High-throughput sequencing is performed by amplifying a sequence, including the target with specific primers with adaptors using total plant DNA as the template, barcoding the amplicons in a secondary round of amplification, and then performing high-throughput sequencing. The amplicon sequencing results are compared with the wild-type sequence, and the percentage of reads with expected mutations at the target to the total reads is calculated to represent the frequency of target gene modification (editing).
Using the method disclosed in the present invention, the CRISPR nucleic acid modification enzyme is transiently expressed locally or systematically in plant tissues by the introduced recombinant TSWV viral vector, achieving modification of specific genomic sequences without the need for transformation and integration of the nuclease-encoding gene in the plant genome.
In the present invention, the target sequence to be modified can be located at any position in the genome, for example, at a functional gene body such as a protein-coding gene exon or intron, or can be located in a gene expression regulatory region such as a promoter region or an enhancer region, thereby achieving functional modification of said gene or modification of gene expression. In certain embodiments, the target sequence modification comprises deletion, insertion, substitution of one or more bases, or a combination thereof, which may result in frameshift and premature termination, amino acid substitution, or deletion.
If an exogenous DNA recombinant template is provided to direct DNA homologous recombination in plant cells infected with a recombinant TSWV virus vector, it is conceivable that the expressed nuclease cleaves the genomic target gene to produce DNA breaks, the free DNA ends can be used for DNA synthesis mediated by homologous sequences using the exogenously provided DNA as a template, thereby integrating the predetermined exogenous sequence at the target gene locus.
Plants that can be genetically modified by the methods provided by the present invention include natural or experimental hosts that can be infected with the viral vectors, such as Abelmoschus esculentus, Ageratum houstonianum, Allium cepa, Amaranthus caudatus, Amaranthus hybridus, Amaranthus retroflexus, Amaranthus inosus, Amaranthus viridis, Amaryllis sp., Ananas comosus, Anemone, Apium graveolens var. dulce, Aquilegia vulgaris, Arabidopsis thaliana, Arachis hypogaea, Arctium lappa, Arum palaestinum, Begonia sp., Belamcanda chinensis, Beta vulgaris, Bidens pilosa, Brassica campestris, Brassica oleracea var. botrytis, Brassica pekinensis, Brassica rapa ssp. Chinensis, Calceolaria herbeohybrida, Calendula officinalis, Callistephus chinensis, Campanula glomerata, Campanula isophylla, Campanula latiloba, Campanula pyramidalis, Canavalia gladiata, Canavalia obtusifolia, Canavalia occidentalis, Capsella bursa-pastoris, Capsicum annuum, Capsicum frutescens, Capsicum Chinense, Capsicum Baccatum, Capsicum Pubescens, Carica papaya, Cassia occidentalis, Cassia tora, Catharanthus roseus, Centaurea cyanus, Cheiranthus cheiri, Chenopodium album, Chenopodium ambrosioides, Chenopodium amaranticolor Chenopodium murale, Chrysanthemun coronarium, Chrysanthemun indicum, Chrysanthemun leucanthemum, Chrysanthemun morifolium, Cicer arietinum, Cichorium endiva, Cichorium intybus, Cineraria, Citrullus lanatus, Coffea arabica, Convolvulus arvensis, Conyza bonariensis, Cordyline terminalis, Coriandrum sativum, Coreopsis drummondii, Coronopus didymus (synonym Youngia japonica), Crotalaria incana, Crotalaria juncea, Crotalaria pallida (synonym Crotalaria mucronata), Cucumis melo, Cucumis sativus, Cucurbita maxima, Cucurbita moschata, Cucurbita pepo, Cyclamen indicum (synonym Cyclamen persicum), Cynara scolymus, Dahlia pinnata, Dahlia variabilis, Datura stramonium, Delphinium sp., Desmodium triflorum, Desmodium unicinatum, Duboisia leichhardtii, Emilia sonchifolia, Foeniculum vulgare, Fragaria vesca, Galinsoga parviflora, Galinsoga quadriradiata, Gladiolus sp., Glycine max, Glycine soja, Gomphrena globosa, Gossypium hirsutum, Gossypium barbadense, Helianthus annuus, Hibiscus tiliaceus, Hippeastrum hybridum, Hippeastrum reginae, Hippeastrum rutilum, Hoya carnosa, Hydrangea macrophylla, Hyoscyamus niger, Impatiens holsii, Impatiens sultanii, Impatiens wallerana, Ipomea batatas, Lactuca sativa, Lathyrus odoratus, Leonotis nepetaefolia, Limonium latifolium, Lupinus leucophyllus, Lupinus mutabilis, Lupinus polyphyllus, Lycium procissimum, Lycopersicon esculentum, Lycopersicon hirsutum, Lycopersicon pimpinellifolium, Malva parviflora, Matthiola incana, Medicago polymorpha, Melilotus officinalis, Nerium oleander, Nicandra physaloides, Nicotiana acuminata, Nicotiana bigelovii, Nicotiana clevelandii, Nicotiana glutinosa, Nicotiana rustica, Nicotiana sylvestris, Nicotiana tabacum, Papaver nudicaule, Pelargonium hortorum, Petunia hybrida, Phaseolus angularis, Phaseolus vulgaris, Phlox drummondii, Physalis angulate (synonym Physalis minima), Physalis spp., Pisum arvense, Pisum sativum, Plantago major, Plantago rugelii, Polygonum convolvulus, Portulaca oleracea, Primula sp., Primula malacoides, Primula sinensis, Ranunculus sp., Rubus idaeus, Rudbeckia amplexicaulis, Saintpaulia ionantha, Salpiglossis, Scabiosa, Schizanthus, Sechium edule, Senecio cruentus, Senecio jacobea, Sesamum indicum, Sinningia eciosa, Solanum capsicastrum, Solanum carolinense, Solanum laciniatum, Solanum mammosum Solanum melongena, Solanum nigrum, Solanum nodiflorum, Solanum seaforthianum, Solanum triflorum, Solanum tuberosum, Sonchus oleraceus, Spinacia oleracea, Stachys arvensis, Stapellia, Stellaria media, Streptosolen jamesonii, Tagetes patula, Tarax patula, Taraxacum, Tephrosia purpurea, Tribulus terrestris, Trifolium repens, Trifolium subterraneum, Tropaeolum majus, Vallota, Verbena brasilliensis, Verbena litoralis, Verbesina enceloides, Vicia faba, Vigna mungo, Vigna radiata, Vigna unguiculata, Xanthium saccharatum, Zantedeschia aethopica, Zantedeschia albo-maculata, Zantedeschia elliottiana, Zantedeschia melanoleuca, Zantedeschia rehmannii, Zinnia elegans, etc.
In certain embodiments, the plants include N. benthamiana, tobacco, tomato, sweet pepper, chili pepper, habanero pepper, ground cherry, potato, lettuce, peanut, soybean, and cotton.
It is well known that plants, especially cultivated crops, contain different varieties, lines, cultivars, germplasms, genotypes, etc. In certain embodiments of the present invention, the TSWV vectors effectively infect different varieties within the same species and express nucleic acid modification enzymes, producing targeted modification efficiencies in target genes at similar frequencies.
In certain embodiments, ten peanut varieties, including the varieties Huayu 25, Huayu 32, Huayu 39, Huayu 48, Huayu 9616, Fengyou 68, Honghua 1, Dabaisha, Sili Hong, and Shuofeng 518, are infected with the TSWV vectors to produce targeted modifications in the target sequences at similar frequencies.
In certain embodiments, ten sweet pepper varieties, including Zhoula 2, Zhoula 3, Zhoula 4, Fuxiang Xiuli, Bola 5, Bola 7, Bola 15, Xingshu 201, Xingshu 301, and Xingshu Guiyan varieties, are infected with the TSWV vectors to produce targeted modifications in the target sequences at similar frequencies.
It is generally believed that virus infection of host plants is less dependent on the plant genotypes. It is conceivable that TSWV vectors could effectively infect different varieties, lines, cultivars, germplasms, or genotypes of other host plants to modify their genetic materials.
Further, the present invention discloses methods for obtaining plants from the plant cells with modified genetic material and selecting plants with modified genetic material.
The plant cells in the present invention can be cells in intact plants infected with viruses or isolated plant cells, such as callus tissues, suspension cells, or protoplasts.
Methods for obtaining plant organs or individual plants from plant cells include propagation by seed, such as plant cells being naturally or artificially induced to develop into adult embryonic stem cells, which in turn develop into germ cells that produce seeds after sexual reproduction. Plants can also be obtained from plant cells through micropropagation, i.e., asexual propagation of plant cells in vitro using plant tissue culture techniques.
In certain embodiments, the method for obtaining plants with modified genetic material is micropropagation. The method comprises using virus-infected plant cells or tissues as explants for tissue culture and plant regeneration on a selection marker-free medium, i.e., no selection agents (e.g., antibiotics, herbicides, etc.) are used in the tissue culture process. Further, the method comprises selecting plants with modified genetic material from the regenerated plants, wherein the selection process does not require selection by any selection marker. Instead, the selection is conducted by screening the plants using restriction endonuclease analysis or DNA sequencing to analyze target gene sequences.
The present invention further compares the effects of TSWV viral vectors with or without the NSs gene on plant cell differentiation and regeneration and finds that NSs inhibits plant regeneration. Thus, in some preferred embodiments of the present invention, plant cells infected with TSWV vectors deficient in functional NSs are used to modify the plants' genetic material.
Plant tissue culture and regeneration methods are known to those skilled in the art, and the requirements for medium composition and growth hormone compositions vary among plant cells.
In certain embodiments, the TSWV vector-infected N. benthamiana leaf cells were used for tissue culture to obtain plants with modified genetic material and without the insertion of exogenous nucleic acid.
In certain embodiments, the TSWV vector-infected tobacco leaf cells were used for tissue culture to obtain plants with modified genetic material and without the insertion of exogenous nucleic acid.
In certain embodiments, the TSWV vector-infected tomato leaf cells were used for tissue culture to obtain plants with modified genetic material and without the insertion of exogenous nucleic acid.
When virus-infected somatic cell tissues were used for tissue culture, the virus often continued to be present in the regenerated plants. Persistently viral infections may result in continuous expression of nucleic acid modifying enzymes in the regenerated plants, which leads to non-heritable editing in the somatic cells of the regenerated plants or increased off-target modifications. In addition, persistent virus infection can interfere with plant physiology and phenotype or even adversely affect plant fertility and seed setting. Therefore, removing viral vectors during plant regeneration can avoid these adverse effects.
Certain compounds can effectively inhibit human or animal RNA virus replication in vivo and in vitro. These compounds include various nucleoside or nucleotide analogs, such as ribavirin, favipiravir, remdesivir, 6-azauridine, oxypurinol, gemcitabine, galidesivir (BCV4430), mizoribine, monupiravir, sofosbuvir, tenofovir, etc. (Geraghty et al., Viruses, 2021, 13: 667; Borbone et al, Molecules, 2021, 26: 986). These nucleoside or nucleotide analogs target the viral RdRP and are mis-incorporated by RdRp into the nascent viral RNA during viral replication, leading to the termination of viral RNA replication or causing the accumulation of numerous deleterious mutations in the viral genomes. Different compounds have significantly different affinities for RdRP of different animal RNA viruses, and their antiviral effects also vary. The present inventors unexpectedly found that when using TSWV-infected somatic cell tissues for tissue culture, the addition of certain nucleoside or nucleotide analogs within the medium greatly facilitated virus clearance from the regenerated plants and significantly increased the proportion of the regenerated plants with genetic modifications.
In certain embodiments, when TSWV CRISPR/Cas12a vector-infected leaf tissues are cultured in a medium containing 40 mg/L ribavirin, 100% of the regenerated plants were free of virus. In contrast, 28.9% of plants have tetra-allelic mutations at the crPDS1 target site. In comparison, only 4.4% of the regenerated plants in the mock were virus-free, and 15.6% contained bi-allelic mutations.
In certain embodiments, when TSWV CRISPR/Cas12a vector-infected leaf tissues are cultured in a medium containing 40 mg/L of favipiravir, 66.7% of the regenerated plants were free of virus.
In certain embodiments, when TSWV CRISPR/Cas12a vector-infected leaf tissues were cultured in a medium containing 1.2 mg/L of remdesivir, 15.6% of the regenerated plants were free of the virus.
In another aspect, the present invention also provides recombinant TSWV particles, as described above, wherein the virus particles comprise chimeric genomic segments containing a heterologous sequence encoding a nucleic acid modifying enzyme. The recombinant TSWV virus particles can be introduced into isolated plant cells or specific organs or tissues of intact plants, such as leaves, leaf veins, roots, etc., through conventional methods in the art, such as mechanical inoculation, pressurized spraying, etc. Virus particles infect plant cells and express the nucleic acid modifying enzymes that cleave or modify plant genomic target sequences and induce the modifications of genetic material.
In another aspect, the present invention further provides kits that contain the viral vector elements mentioned in the above methods and compositions. The kit may comprise a user manual and viral vector system comprising: a) a nucleic acid construct comprising recombinant TSWV S, M, and L chimeric genomic sequences, wherein the S segment comprises one or more heterologous nucleic acid sequences encoding a CRISPR guide RNA component, the M segment comprises one or more heterologous nucleic acid sequences encoding a Cas nucleic acid modifying enzyme component, and the L segment comprises codon-optimized viral RdRp sequence (SEQ ID NO: 5); b) helper expression vectors encoding one or more viral suppressors of RNA silencing. The viral vector is introduced into a plant cell to generate a recombinant virus that transiently expresses CRISPR/Cas nucleic acid modifying enzymes that modify the genetic material in the cell. The viral vector reagents may be provided individually or in combination and may be placed in a suitable container, such as a tube, bottle, or pill bottle.
Compared to the prior art, the benefit offered by the viral delivery system-based plant genome editing technology disclosed in the present invention include:
The present invention is described in further detail hereinafter in connection with the embodiments, which are used only to illustrate the present invention and should not be regarded as limiting the scope of the present invention. Any modifications or improvements made on the basis of the present invention without deviating from the present invention belong to the scope of protection of the present invention.
The experimental methods used in the following embodiments are conventional methods, and the reagents and materials used can be obtained from commercial sources unless otherwise specified.
The recombinant TSWV vectors are described using the TSWV-X-Y nomenclature (or V-X-Y for short) in the following embodiments, where X refers to the heterologous nucleic acid sequence carried by the M genome segment, and Y refers to the heterologous nucleic acid sequence inserted in the S genome. When a genome segment contains two tandem heterologous nucleic acid elements, these two elements are joined by a semicolon, e.g., TSWV-X-Y:Z. For example, TSWV-Cas12a-crPDS1:GFP is a recombinant TSWV vector in which the M segment contains the heterologous Cas12a sequence and the S segment contains the heterologous crRNA and GFP sequences. The pS/M/L(−/+)X is used to describe the constructs containing the TSWV S, M, or L genomic cDNA sequences, in which the (−) indicates genomic cDNA inserted downstream of the promoter in the construct and to produce the negative-sense viral RNA, the (+) indicates antigenomic cDNA inserted downstream of the promoter in the construct to produce transcription to produce the positive-sense viral RNA, and the X indicates the heterologous nucleic acid sequence carried in the genome. For example, pM(+)cas12a is a TSWV M construct containing the antigenomic cDNA sequence and a heterologous Cas12a sequence.
The Agrobacterium binary vector pCB301-2μ-HDV (
The Agrobacterium binary vector pGD (
The viral suppressor of RNA silencing (VSR) expression vectors, pGD-HcPro, pGD-p19, and pGD-γb, are described in “Ganesan U, Bragg J N, Deng M, et al. minireplicon: a step toward reverse genetic analysis of plant negative-strand RNA viruses. J Virol. 2013, 87: 10598-10611”.
The N. benthamiana codon-optimized SpCas9 sequence is described in “Yin K, Han T, Liu G, et al., Geminivirus-based guide RNA delivery system for CRISPR/Cas9 mediated plant genome editing. Scientific Reports, 2015, 5: 14926”.
The plasmid pYQ230 containing the rice codon-optimized LbCas12a sequence is described in “Tang, X., Lowder, L. G., Zhang, T., et al. A CRISPR-Cpf1 system for efficient genome editing and transcriptional repression in plants. Nature Plants, 2017, 3:17018”.
Nicotiana benthamiana LAB accession and the 16C transgenic line are described in “Ruiz M T, Voinnet O, Baulcombe DC. Initiation and maintenance of virus-induced gene silencing. Plant Cell, 1998, 10:937-46”. Tobacco (N. tabacum cv. K326), tomato (Solanum lycopersicum cv. Hongbaoshi), potato (S. tuberosum), sweet pepper (Capsicum annuum), chili pepper (C. frutescens), habanero pepper (C. chinense), ground cherry (Physalis grisea), lettuce (Lactuca sativa), land cotton (Gossypium hirsutum cv. Junmian No. 1), peanut (Arachis hypogaea), and soybean (Glycine max cv. William 82) were the regular plant materials available to the public from the Institute of Biotechnology, Zhejiang University.
The target gene sequences of Solanaceae species (N. benthamiana, tobacco, tomato, potato, and pepper) were retrieved from the Sol Genomics Network (SGN)e (https://solgenomics.net/), while the target gene sequences of legumes (peanut and soybean) were available from the Legume Genome Information System LIS database (https://legumeinfo.org/). The Cas12a target sequences were designed using the CRISPR RGEN Tools (http://www.rgenome.net/) online software, and the target sites for Cas9 and Cas9-derived base editors were designed by using the CRISPR-P (http://cbi.hzau.edu.cn/cgi-bin/CRISPR) online software.
Total RNA Extraction from TSWV-Infected Plant Samples.
0.1 g of TSWV-infected tomato plants grown in the laboratory is collected and quickly ground into powder in liquid nitrogen. Total RNA is extracted from the leaf samples using Ttizol reagent (Invitrogen), dissolved in 50 μL of DEPC-treated pure water after precipitation with isopropanol. The extracted RNA is used immediately for subsequent experiments or stored in −80° C. refrigerator in aliquots.
Reverse Transcription to Obtain Viral Genomic cDNA
Procedure according to TaKaRa AMV reverse transcriptase instructions as follows:
The primers used for reverse transcription were as follows:
The tri-segmented, negative-stranded TSWV RNA genome consists of the large (L), medium (M), and small (S) RNAs (
1) pL(+) and pL(+)opt Vector Construction
pL(+) is a construct containing the TSWV L antigenomic cDNA (
pL(+)opt is a construct derived from pL(+), in which the RdRp coding sequence (SEQ ID NO: 4) is replaced by a codon-optimized sequence (SEQ ID NO: 5) (
2) Construction of pM(−), pM(−) RFP, and pM(+) RFP
pM(−) is a construct containing the TSWV M genomic cDNA (
pM(−)RFP is a construct derived from pM(−) in which the GP gene is replaced by the RFP (
The pM(+)RFP has the same insertion sequence as the pM(−)RFP, but the sequence is inserted into the vector in the opposite direction to produce the M antigenomic RNA (
3) pS(+) and pS(+)GFP Vector Construction
pS(+) is a construct containing the TSWV S antigenomic cDNA (
The pS(+)GFP is a construct derived from pS(+) in which the NSs gene is replaced by the GFP (
4) Construction of the Expression Vectors pGD-NSs and pGD-NSm
Using the S and M cDNAs as templates, the coding regions of NSs (SEQ ID NO: 7) and NSm (SEQ ID NO: 8) were amplified using primer pairs BamHI/NSs/F, and SalI/NSs/R, BamHI/NSm/F and SalI/NSm/, respectively. The above PCR products were inserted into the pGD vectors through BamHI and SalI digestion and ligation to obtain the protein expression vectors pGD-NSs and pGD-NSm, respectively.
Agroinoculation of N. benthamiana
Recombinant binary plasmids were introduced into the Agrobacterium tumefaciens EHA105 strain by electroporation. Single A. tumefaciens colony was grown overnight at 28° C. in Yeast Extract and Peptone (YEP) culture medium containing 50 mg 1−1 kanamycin and 50 mg 1−1 rifampicin. The overnight cultures were transferred to a fresh YEP culture medium and allowed to grow to about 0.8-1.0 optical density (OD600). Cultures were collected by centrifugation and resuspended in infiltration buffer (10 mM MES, 10 mM MgCl2, and 100 μM Acetosyringone, pH 5.6) at 1.0 OD600, followed by incubation for 2-4 h at room temperature.
For analysing the effect of M genome polarity (positive or negative sense) and co-expression of NSm proteins on recombinant TSWV rescue, N. benthamiana leaf tissues were infiltrated with Agrobacterium mixtures harboring various combinations of recombinant viral constructs containing two reporter genes as listed below.
In the mixtures, the final concentration of the bacterial solution containing pL, pM, and pS was adjusted to OD600=0.1. Agrobacterium cultures harboring the pGD plasmids for expressing three VSR (barley stripe mosaic virus γb, tomato bushy stunt virus p19, tobacco etch virus P1/HC-Pro) were mixed in equal proportions and added to the mixtures at a final OD600 of 0.05 to minimize RNA silencing. Agrobacterium culture harboring the pGD-NSm expression vector was adjusted to a final concentration of OD600=0.1. The mixed cultures were infiltrated into the abaxial surfaces of fully expanded N. benthamiana leaves at approximately the 6-leaf stage with a 1-ml needleless syringe, as shown in
Expression of GFP and RFP was visible five days after agroinoculation in all three groups of inoculated leaves of N. benthamiana. Compared with RFP, the area of leaf tissue with GFP expression is smaller, especially in group 3 (
Green and red fluorescence began to appear in the upper non-inoculated leaves 7 days after agroinoculation in both Group 1 and Group 2, and there was widespread fluorescence spread in the systemic leaves at day 10. In contrast, fluorescence in the upper non-inoculated leaves in Group 3 was delayed, and scattered red and green fluorescence cell clusters were seen at day 10 (
By 6-8 days after agroinoculation, all inoculated plants in Groups 1 and 2 successively developed systemic symptoms, such as young leaves curling and chlorosis, while systemic infections in Group 3 were delayed for 2-3 days. Plants were photographed 5 days post systemic infection under visible light and long-wavelength ultraviolet (UV) illumination (
The conclusions from the above experiments are: i) the L and S antigenomic RNA transcripts produced from the viral constructs can direct the translation of the RdRp and N proteins sufficient for genome encapsidation and replication; ii) for the M genome, both genomic cDNA vector (Group 1) and antigenomic cDNA vectors (Groups 2 and 3) can support recombinant virus rescue; iii) Providing additional viral movement protein does not facilitate but in fact delayed generation of recombinant virus (compare Group 2 with 3); iv) TSWV GP and NSs can be replaced by foreign genes without compromising plant systemic infection.
Four binary vectors for expression of viral suppressors of RNA silencing (VSRs), i.e., pGD-p19, pGD-HcPro, pGD-γb, and pGD-NSs, were included in the agrobacterial mixtures in Example 2. This implementation further tested the effect of expressing single or multiple combinations of VSRs on recombinant TSWV rescue. In the Agrobacterium mixtures, the final concentrations of the bacterial solution containing pL(+)opt, pM(−)RFP, and pS(+)GFP plasmids were adjusted to OD600=0.1, respectively, and the final concentrations of the bacterial solution of each of the following VSRs were adjusted to OD600=0.05.
Eight days after agroinoculation, about 16-18 plants of 20 inoculated plants showed systemic symptoms in experimental Group 1 (VSRs), Group 2 (p19), and Group 6 (p19+HcPro), i.e., the rescue efficiencies were 80-90%. There were 7, 12, and 2 plants that developed systemic symptoms in experimental Group 3 (HcPro), Group 4 (γb), and Group 5 (NSs), respectively. According to the time and proportion of plants that showed systemic symptoms, the infection efficiencies from highest to lowest in the experiments were Group 2, 1, 6, 4, 3, and 5. Until 20 days after inoculation, none of the 20 infiltrated plants in experimental Group 7 showed systemic group (
The CRISPR/Cas12a system contains the Cas12a protein and a CRISPR guide RNA (crRNA) that binds to the target. As shown in
1) Construction of the pM(−)Cas12a and pM(+)Cas12a Vectors
The plasmid pYQ230 containing the rice codon-optimized Cas12a sequence flanked by a nuclear localization signal and 3×Flag antigen epitope sequence (SEQ ID NO: 10) was used as a template, and the 3×Flag-NLS-Cas12a-NLS fragment was amplified using the primers M UTR/Cas12a/F and Cas12a/R (Table 1). The M vector fragment, excluding the GP sequence, was amplified from the pM(−) plasmid using primers M 3′UTR/F and Cas12a/M IGR/R. The two fragments described above were ligated using an In-Fusion cloning reagent to generate the pM(−)Cas12a vector.
Using pM(−)Cas12a as a template, the recombinant M genomic cDNA fragment was amplified using the primers 35S/M 3′UTR/F and HDV/M 5′UTR/R and ligated with the StuI and SmaI digested pCB301-2μ-HDV vector fragment by In-Fusion cloning method to generate the M antigenomic cDNA vector pM(+)Cas12a.
2) Construction of the pS(+)crRNA:GFP Series Vectors
To facilitate crRNA insertion, a SpeI digest site was inserted upstream of the GFP start codon of the pS(+)GFP plasmid. The fragment PacI-IGR-GFP and fragment SpeI-5′UTR-HDV-NOS fragment were amplified from the pS(+)GFP vector with primer pairs SpeI/GFP/F and IGR PacI/R, NOS PvuI/F and S 5′UTR/R, respectively. The above fragments were inserted into a PacI/PvuI digested pS(+)GFP vector to obtain pS(+)SpeI GFP intermediate vector with the introduced SpeI restriction site.
According to the CRISPR RGEN Tools (http://www.rgenome.net/), we selected the crPDS1 target site in the N. benthamiana phytoene desaturase gene (NbPDS) and tobacco NtPDS (5′-TTTCTGCCGTTAATTTGAGAGTCCAAG-3′) (SEQ ID NO: 12) (bold letters are PAM same below), the crPDS2 target site in the tomato SIPDS (5′-TTTCTGCTGTTAACTTGAGAGTCCAAG-3′) (SEQ ID NO: 13), the crPDS3 target site in the pepper CaPDS (5′-TTTCTGCTGTCAACTTGAGAGTCCAAG-3′) (SEQ ID NO: 14), the crPDS4 target site in the peanut AhPDS loci and soybean GmPDS loci (5′-TTTGACAGAAAACTGAAGAACACATAT-3′) (SEQ ID NO: 15). The crRNA sequences (as shown in SEQ ID NO: 18-21, respectively) that can target the above target sites have the structure of “direct repeat-protospacer-direct repeat” and contain SpeI cleavage sites on both sides. The above crRNA sequences were inserted into the SpeI site of pS(+)SpeI:GFP vector to generate pS(+)crPDS1:GFP, pS(+)crPDS2:GFP, pS(+)crPDS3:GFP, and pS(+)crPDS4:GFP vectors, respectively.
Example 2 established the conditions for rescuing the TSWV-RFP-GFP vector from cloned cDNAs and found that both M genomic and anti-genomic cDNA constructs could be used to successfully recover the recombinant virus. A preliminary experiment showed that the insertion of Cas12a gene (nearly 4 kb) in the M genome significantly affected the rescue of the TSWV-Cas12a-crRNA vector. Therefore, the present embodiment systematically investigates the experimental conditions, including the polarity of the M genome (pM(−)Cas12a or pM(+)Cas12a) and the relative ratios of each Agrobacterium mixture containing each genomic component. Agrobacterium mixtures of different concentrations (OD600) were formulated according to the eight combinations shown below, each containing L, M, and S cDNA constructs and four VSRs helper vectors, i.e., pGD-p19+ pGD-HcPro+ pGD-γb+ pGD-NSs. Groups 1-4 have the M genomic cDNA construct pM(−)Cas12a, Groups 5-8 contain the M antigenomic cDNA construct pM(+)Cas12a, and Groups 5-7 have the pGD-NSm expression vector.
The above combinations of Agrobacterium mixtures were infiltrated into leaves of N. benthamiana for recovery of recombinant virus. GFP fluorescence was observed to widely distribute in the inoculated leaves in all groups by 5 days post inoculation (dpi) (
The systemically infected plants in the above groups exhibited typical TSWV symptoms (
From the above results, it is clear that unlike the TSWV-RFP-GFP vector rescue system, the M genomic cDNA vector pM(−)Cas12a is not suitable to rescue TSWV-Cas12a-crRNA because it tends to lose most of the Cas12a sequence. In contrast, the M antigenomic cDNA vector pM(+)Cas12a can be used to generate the intact TSWV-Cas12a-crRNA vector. The recombinant virus was most efficiently rescued when the ratio of Agrobacterium carrying pL, pM- and pS- was 2:10:1 (Group 8). In contrast, transient expression of the NSm protein did not contribute to the virus rescue (compare Groups 7 and 8). Therefore, the combination and concentration of Agrobacterium in Group 8 was used for recombinant virus rescue in all relevant embodiments described below.
The crPDS1 sequence (5′-TTTCTGCCGTTAATTTGAGAGTCCAAG-3′; SEQ ID NO: 12) (bold letters indicate PAM and the underlined bases refer to HinfI cleavage site) is conserved in both homoeologs in the allotetraploid genome of N. benthamiana (
The two groups of Agrobacterium mixtures were inoculated into N. benthamiana leaves and both produced TSWV symptoms such as leaf curling and crinkling as well as faint green mottling about 10 dpi and expressed GFP fluorescence in the upper uninoculated leaves (
Polymerase chain reaction/restriction endonuclease protection (PCR/RE) was used to detect gene editing, and if editing occurred at this target site, the restriction endonuclease recognition site that partially overlaps with the Cas12a cutting site is disrupted, and the target DNA PCR fragment is not susceptible to digestion by the corresponding endonuclease. The specific methods are described below:
PCR/RE analysis showed that the mutation frequency in the NbPDS target locus was approximately 50%, whereas no mutation was found in Control group plants (
Example 6 used agroinfiltration to initiate TSWV-Cas12a-crPDS1:GFP infection in N. benthamiana, but most plants were significantly less susceptible to Agrobacterium transformation. This embodiment explored the possibility of using recombinant virus particles to infect other plants through mechanical transmission with the following experimental steps (
Healthy N. benthamiana, potato, ground cherry, cotton, and lettuce seedlings at appropriate ages were inoculated with TSWV-Cas12a-crPDS1:GFP saps according to the mechanical inoculation method described above. After 8-16 days of inoculation, N. benthamiana, potato, cotton, and lettuce plants that were inoculated through mechanical rubbing and pressurized spraying showed infection symptoms. Green fluorescent were detected in the upper uninoculated tissues under fluorescence microscopy (
Since the crPDS1 target site was conserved in the PDS-A and PDS-B homoeologs in the allotetraploid genome of N. benthamiana and tobacco (
These results indicate that the TSWV-based vectors are broadly applicable to diverse crop hosts for somatic genome editing through mechanical inoculation.
The crPDS2 (5′-TTTCTGCTGTTAACTTGAGAGTCCAAG-3′ (SEQ ID NO: 13) (bold letters indicate PAM and the underlined bases refer to HinfI restriction site) targeting the tomato SIPDS gene is shown in
The genus Capsicum includes several species, such as chili pepper (C. frutescens), sweet pepper (C. annuum), and habanero pepper (C. chinense). Further, the crPDS3 target sequence (5′-TTTCTGCTGTCAACTTGAGAGTCCAAG-3′, bold letters indicate PAM, and the underlined bases refer to HinfI restriction site) (SEQ ID NO: 14), which is conserved in all of the above pepper plants, was selected to target the PDS gene (
The above experiment results demonstrated that TSWV-Cas12a-crRNA could infect a variety of Solanaceae crops by mechanical inoculation and produce somatic genome editing.
Ground cherry (Physalis alkekengi L.) is a plant in the genus Physalis, family Solanaceae, and is phylogenetically related to tomato. It has been shown that mutation of ERECTA (ER) genes in both tomato and Physalis alkekengi leads to shorter internodes in plants (Kwon et al. Rapid customization of Solanaceae fruit crops for urban agriculture. Nat Biotechnol, 2020, 38: 182-188). The ground cherry ER gene target crER: 5′-TTTAATGTTAAACCTTTAGTGTCCTTT-3′ (bold letters are PAM, SEQ ID NO: 88) was selected to construct pS(+)crER:GFP according to the method described in Example 4. The pS(+)crER:GFP, pL(+)opt, pM(+)Cas12a, pGD-p19, pGD-HcPro, pGD-γb, and pGD-NSs vectors were delivered into the leaves of N. benthamiana by agroinfiltration as described in Example 6, and the obtained infected leaf saps were used to inoculate ground cherry seedlings at the two true-leaf stage through sap inoculation. The upper uninoculated leaves of the infected ground cherry plants showed leaf chlorosis and slight curling, with GFP expression detected around 10 dpi (
The above experimental results demonstrate that the recombinant TSWV-Cas12a-crRNA vector can infect ground cherry by mechanical inoculation and produce target gene editing.
Peanut (Arachis hypogaea) is an allotetraploid with genomes derived from two ancestral wild species, Arachis duranensis, and Arachis ipaensis. The designed target sequence crPDS4 (5′-TTTGACAGAAAACTGAAGAACACACATAT (G)-3′ (SEQ ID NO: 15) (bold letters are PAM; with the underlined sequence is a NdeI restriction site), is conserved in two homoeologs of the peanut PDS gene (
Since the crPDS4 target locus is also conserved in the soybean PDS gene (
The above experimental results indicate that the recombinant TSWV vector can infect plant hosts belonging to taxonomically diverse families by mechanical inoculation and efficiently produce targeted DNA editing.
The crPDS6 target sequence (5′-TTTCTGCTGTCAACTTGAGAGTCCAAG-3; bold letters are PAM, SEQ ID NO: 99) conserved in the PDS gene of all 10 varieties of sweet pepper (C. annuum) was selected and engineered into pS(+)crPDS6:GFP vector as described in Example 4. V-Cas12a-crPDS6:GFP was recovered and inoculated to pepper plants according to the method described in Example 4 and Example 8. Symptoms, including leaf chlorosis and mottling, were observed in the upper non-inoculated leaves of these plants at approximately 7 dpi. Total proteins were extracted from systemically infected leaf tissues, and the expressions of TSWV N, Cas12a, and GFP proteins were detected by western blot analysis (
The crPDS5 target site sequence (5′-AATGCAATGTATATTGATTGCCTTAAA-3′; PAM sequence in bold, SEQ ID NO: 98) conserved in both homoeologs of PDS in all ten peanut varieties was selected and engineered into pS(+)crPDS5:GFP vector as described in Example 4. Plants of 10 peanut varieties were inoculated by the method described in Example 10. Symptoms of TSWV were observed in non-inoculated peanut leaves approximately 7 days after inoculation (
Examples 6-12 above show that TSWV-Cas12a-crRNA can deliver Cas12a and a single crRNA, and this example provides a method for multiplexed gene editing using the vector delivering Cas12a and multiple crRNAs. The N. benthamiana crPDS1 target site shown in Example 6 was selected, as well as the crFucT (5′-TTTGGAACAGATTCCAATAAGAAGCCT-3′; SEQ ID NO: 86) and crDCL2 (5′-TTTGAGAAGCTATGCTTATCTTCTTCG-3′; SEQ ID NO: 87) target sites (Table 3). According to the method described in Example 4, pS(+)crPDS1:GFP, pS(+)crFucT:GFP and pS(+)crDCL2:GFP were constructed for targeting these individual sequences. Simultaneously, A tandem repeat of crRNA sequence in the configuration “DR-Spacer 1-DR-Spacer 2-DR” for targeting the crPDS1 and crFucT sites (SEQ ID NO: 135) was chemically synthesized. Additionally, a triplet crRNA repeats in the configuration “DR-Spacer 1-DR-Spacer 2-DR-Spacer 3-DR” for targeting the crPDS1, crFucT, and crDCL2 (SEQ ID NO: 136) were also synthesized (
The above plasmids, along with pL(+)opt, pM(+)Cas12a, and binary plasmids encoding the VSRs were transformed into Agrobacterium strain EHA105. Mixtures of bacterial cultures were agroinoculated to N. benthamiana according to the methods described in Example 6. Plants were systemically infected starting at 7 dpi and showed leaf curling, crinkling, mottling, and chlorosis symptoms (
The TSWV-Cas9-gRNA vector construction strategy involves two main steps. Firstly, the GP sequence in the TSWV M genome is replaced by the sequence encoding SpCas9 to produce the genomic cDNA vector pM(−)Cas9 and the antigenomic cDNA vector pM(+)Cas9. Second, the RFP coding sequence and gRNA sequence are inserted into the pS(+) vector in place of the NSs coding frame to generate the pS(+)RFP:sgRNA vector. In this design, the gRNA sequence produced from the S genome mRNA transcripts will be embedded in viral non-coding sequences. It has been previously demonstrated that such gRNA sequences can bind to the Cas9 proteins and then be cleaved by cellular RNases to release the mature gRNA molecules (Mikami et al., Plant Cell Physiol, 2017, 58: 1857-1867; Ma et al., Nat Plants, 2020, 6: 773-779). The methods for specific constructs are detailed below.
1) Construction of the pM(−)Cas9 and pM(+)Cas9 Vectors
Using a plasmid containing the N. benthamiana codon-optimized Cas9 sequence (Yin et al., Scientific Reports, 2015, 5: 14926) as a template, which also includes the nuclear localization signal and the 3×Flag antigen epitope sequence (SEQ ID NO: 11) on each side of the Cas9 sequence, we amplified the 3×Flag-NLS-Cas9-NLS fragment using the primers Cas9/F and Cas9/R. The GP coding sequence in the pM(−) vector was removed by PCR amplification using the primers M 3′UTR/F and Cas9/M IGR/R. The above two fragments having a 15-bp homologous sequence at both ends were ligated to produce the pM(−)Cas9 vector using an In-Fusion cloning reagent.
Using pM(−)Cas9 as a template, the recombinant M genomic cDNA fragment was amplified using primers 35S/M 3′UTR/F and HDV/M 5′UTR/R and ligated with the StuI/SmaI-digested pCB301-2μ-HDV vector fragment by In-Fusion cloning to generate the antigenomic cDNA vector pM(+)Cas9.
2) Construction of the pS(+)RFP:gRNA Series Vectors
First, the S antigenomic cDNA vector pS(+)RFP:gGFP was constructed for the expression of the gRNA molecule targeting the gGFP target loci (5′-GATACCCAGATCATATGAAGCGG-3′; bolded sequence is PAM) (SEQ ID NO: 16) of the mGFP5 in the N. benthamiana 16c line. The pS(+)GFP was used as a template to amplify the fragment excluding the GFP gene using S primers 5′UTR/F and S IGR/R, and the pGD-RFP was used as a template to amplify the RFP fragment using primers S 5′UTR/RFP/F and RFP/R, and the gRNA sequence (SEQ ID NO: 22) targeting the gGFP loci was chemically synthesized, which contains SpeI/BamHI digestion sites on each sides and also contains 15 nt homologous sequence to facilitate In-Fusion cloning. The above three DNA fragments were ligated by an In-Fusion cloning reagent to generate pS(+)RFP:gGFP.
The gPDS1 target sequence (5′-GGACTCTTGCCAGCAATGCTTGG-3′), which is conserved in the PDS genes of several Solanaceae plants, was selected according to the CRISPR-P (http://cbi.hzau.edu.cn/cgi-bin/CRISPR) online software and inserted into pS(+)RFP:gGFP vector using the SpeI and BamHI restriction sites to obtain pS(+)RFP:gPDS1.
The results of Example 4 demonstrate that efficient recovery of recombinant viral vectors necessitates the use of relatively high concentrations of the M antigenomic cDNA plasmids due to the insertion of the large Cas coding sequences that can affect the packaging of the M genome. Therefore, Agrobacterium tumefaciens EHA105 strains carrying pL(+)opt, pM(−)Cas9 or pM(+)Cas9, and pS(+)RFP:gGFP were mixed in a ratio of 2:10:1, plus equal volume mixtures of Agrobacterium tumefaciens carrying the VSR expression vectors (pGD-p19+pGD-HcPro+pGD-γb+pGD-NSs), in the final concentration (OD600) detailed in the table shown below:
At about 10 days post-inoculation, both group 1 and group 2 infiltrated plants began to exhibit symptoms of virus systemic infection, suggesting TSWV infection, including leaf mottling, chlorosis, and curling. The infiltrated plants were monitored for a period of 20 days, and it was observed that the virus infection rate in Group 1 was slightly higher than that of Group 2 (
N. benthamiana 16c plants systemically infected with the recombinant virus displayed red fluorescence in upper leaves, whereas the healthy control plants did not. Consequently, the RFP-expression leaf tissues showed significantly suppressed GFP expression (
To confirm the gene editing at the mGFP5 target site, the genomic DNA region spanning the gGFP target site (5′-GATACCCAGATCATATGAAGCGG-3′; SEQ ID NO: 16; NdeI restriction site underlined) PCR was amplified using the specific primers listed in Table 2. PCR/RE analysis indicated that the TSWV-Cas9-RFP:gGFP-infected plant tissues had a mutation frequency of approximately 56-60%, while healthy control plants showed no editing (
The above results indicate that expression of either M genomic or antigenomic RNAs can support recombinant viral rescue and Cas9 protein expression and that the RFP:gRNA fusion sequence inserted within the S genome expresses a fluorescent reporter gene for tracking viral infection and also produces functional gRNA forming a complex with Cas9 protein to yield gene editing at the target sequence.
As depicted in
The above gPDS1 target was conserved in both homoeologs of tobacco PDSa and PDSb (
Examples 14-16 have demonstrated that the TSWV-Cas9-gRNA system can deliver Cas9 and a single gRNA. In this example, we present a method for multiplexed gene editing using the vector to deliver Cas9 protein and multiple gRNAs. Specifically, we targeted the N. benthamiana gPDS2 (5′-TTGGTAGTAGCGACTCCATGGGG-3′; SEQ ID NO: 89), gRDR6 (5′-CCCCTCCTGACTCTTACCCAACT-3′; SEQ ID NO: 90), and gSGS3 (5′-ACAAGAGTGGAAGCAGTGCTGGG-3′; SEQ ID NO: 91) target loci (Table 3). To this end, we constructed the TSWV S plasmids pS(+)gPDS2:GFP, pS(+)gRDR6:GFP, and pS(+)gsGS3:GFP, containing a single gRNA, as described in Example 14. Additionally, we chemically synthesized a tandem gRNA sequence targeting both gPDS2 and gRDR6 (SEQ ID NO: 137), and a triplet gRNA tandem sequence targeting gPDS2, gRDR6, and gSGS3 (SEQ ID NO: 138) (
The plasmids mentioned above, along with pL(+)opt, pM(+)Cas9, and the VSRs constructs were transformed into Agrobacterium strain EHA105 and inoculated into N. benthamiana using agroinfiltration, as described in Example 14. By 7 dpi, some inoculated plants exhibited systemic infection and displayed symptoms such as leaf curling, crinkling, and mottling (
The adenine base editing system comprises an adenine base editor (ABE) fusion protein and a guide RNA (gRNA) that selectively binds to the target site. As depicted in
1) Construction of pM(+)ABE
The ABE coding sequence (SEQ ID NO: 84) was chemically synthesized by optimizing the TadA8e (V106W) sequence for N. benthamiana codon usage and fused with the nCas9, which includes NLS and 3×Flag antigen epitope sequences at the N-terminal end and 3×SV40 NLS sequence at the C-terminal end. The NLS-3×Flag-TadA8e (V106W)-nCas9-3×NLS fragment was amplified with the primers SpeI/Flag/F (SEQ ID NO: 124) and ABE/R (SEQ ID NO: 125) primers using the synthesized ABE DNA fragment as a template. The pM(+)Cas9 plasmid backbone fragment, excluding the Cas9 sequence, was amplified using primer pair M 3′UTR/F (SEQ ID NO: 65) and Cas9/M IGR/R (SEQ ID NO: 66). The wo fragments having a 15-bp overlapping homologous sequence at each end were circularized using In-Fusion reagent to generate the pM(+)ABE vector.
2) Construction of pS(+)GFP:gRNA Series Vector
We selected the gPDSa target sequence (5′-TGCAAATTGAGTTGGGAGGAGG-3′; SEQ ID NO: 92) in the NbPDSa gene, the gFucTa1 target sequence (5′-CCTCTTGAGAATGTTGCTATTGG-3′; SEQ ID NO: 93) in the NbFucTa gene, and the gBBLd target sequence (5′-GAAATCAGAGTAAGGTGCGGAGG-3′; SEQ ID NO: 97) in the NbBBLd gene, according to CRISPR-P website (http://cbi.hzau.edu.cn/cgi-bin/CRISPR). The above gRNA sequence fragments were then inserted into the SpeI site in the pS(+)SpeI:GFP vector, resulting in the generation of the pS(+)gPDSa:GFP, pS(+)gFucTa1:GFP, and pS(+)gBBLd:GFP vectors.
The pS(+)gPDSa:GFP, pS(+)gFucTa1:GFP, and pS(+)gBBLd:GFP constructs in Example 18 contain sgRNA sequences that target the gPDSa, gFucTa1, and gBBLd loci, respectively. The selected gPDSa targets only the PDSa homoeolog in N. benthamiana. N. benthamiana has four FucT genes, and the gFucTa1 targets only FucTa. N. benthamiana has five BBL genes, and the selected gBBLd is conserved in all five genes. In this Example, only the BBLd gene target was analyzed for base editing. Agrobacterium strains containing the three plasmids were individually mixed with Agrobacterium strains carrying the pL(+)opt, pM(+)ABE, pGD-p19, pGD-HcPro, pGD-γb, and pGD-NSs vectors at suitable ratios and infiltrated into N. benthamiana leaves for recombinant virus rescue, as described in Example 15. Approximately 10 days after agroinfiltration, the upper young leaves of N. benthamiana showed symptoms such as leaf curling and yellowing (
The cytosine base editing system requires a cytosine base editor (CBE) fusion protein and a guide RNA (gRNA) that binds to the target. As depicted in
1) Construction of pM(+)CBE Vector
The nucleotide sequence of the CBE (SEQ ID NO: 85) containing the codon-optimized hA3A sequence with the nCas9 sequence and the UGI sequence was chemically synthesized. The resulting sequence contained NLS and 3×Flag antigen epitope sequence at the N-terminus of the hA3A sequence and NLS at the C-terminus of the UGI sequence. To construct the pM(+)CBE, the NLS-3×Flag-hA3A-nCas9-UGI-NLS fragment was amplified using the primers SpeI/F (SEQ ID NO: 124) and CBE/R (SEQ ID NO: 126). The pM(+)Cas9 vector was used as a template to obtain the fragment excluding the Cas9 sequence via PCR using the primers M 3′UTR/F and Cas9/M IGR/R. The two resulting fragments shared a 15-bp homologous sequence at each end and were ligated using an In-Fusion reagent to generate the pM(+)CBE vector.
2) Construction of the pS(+)GFP:gRNA Vectors
We selected the target sequence for the endogenous genes in N. benthamiana using the CRISPR-P software (http://cbi.hzau.edu.cn/cgi-bin/CRISPR). The selected target sequences for the RDR6a gene were 5′-CCAATATCTAGGCGTGAGGGATT-3′ (gRDR6a1; SEQ ID NO: 95) and 5′-CCCGACTTCATGGGGAAGGAGGA-3′ (gRDR6a2; SEQ ID NO:96). For the FucTa gene, the selected target sequence was 5′-CACACCTTCCTCCAGGAGATAGG-3′ (gFucTa2; SEQ ID NO: 94). The sgRNA sequence fragments were then inserted into the SpeI site of the pS(+)SpeI:GFP vector to generate pS(+)gRDR6a1:GFP, pS(+)gRDR6a2:GFP, and pS(+)gFucTa2:GFP.
The pS(+)gRDR6a1:GFP, pS(+)gRDR6a2:GFP, and pS(+)gFucTa2:GFP vectors described in Example 20 each contain gRNA sequence that can target a single target site. The selected gRDR6a1 and gRDR6a2 target only the N. benthamiana RDR6a. N. benthamiana has four FucT genes, and the selected gFucTa2 only targets FucTa gene. Agrobacterium strains containing the individual plasmids described above, and the pL(+)opt, pM(+)CBE, pGD-p19, pGD-HcPro, pGD-γb, and pGD-NSs vectors were mixed in appropriate ratios and infiltrated to leaves of N. benthamiana for recombinant virus rescue. Approximately 10 days after agroinfiltration, the upper young leaves of N. benthamiana showed symptoms of leaf curling and chlorosis (
To clarify whether the tissues infected with the TSWV-Cas12a-crRNA vector could be regenerated to obtain mutant plants, infected leaves in Example 6 were collected for regeneration through tissue culture. The specific experimental procedure involved the following steps:
After 10 days of culturing of virus-infected leaf tissues in a differentiation medium, calli were induced on some explants and further differentiated into leaves and shoots over time. The regenerated plants were either albino or green seedlings. The green seedlings rooted normally and were transplanted to soil. In contrast, albino seedlings failed to root (
In a similar experiment, leaves were collected from the N. benthamiana plants systematically infected with the TSWV-Cas9-RFP:gGFP vector shown in Example 15 to regenerate mutant plants through tissue culture. The same tissue culture method mentioned above was employed to regenerate plants from the infected cells. Fluorescence imaging analysis revealed that none of the three regenerated plants expressed the mGFP5 transgene (
The tomato leaves systemically infected with V-Cas12a-crPDS2:GFP in Example 8 were collected and rinsed with sterile water. The leaves were then surface disinfected using 70% ethanol for 15 seconds, followed by immersion in 0.1% mercuric chloride (v/v) for 8 minutes. The sterilized leaves were rinsed 3-5 times with sterile water and blotted with sterile filter paper. The main veins and leaf edges were trimmed off, and the remaining leaves were cut into 0.5-cm2 pieces. The leaf pieces were placed in an antibiotic-free MS differentiation medium and cultured for 7 days at 25° C. in darkness. Afterward, they were transferred to a 12-hour light/12-hour dark cycle and tested separately for the four combinations of plant hormones as follows.
After approximately 22 days of culture, the explants in experimental Groups 1 and 2 began to show signs of leaf differentiation. In contrast, in Groups 3 and 4, the edges of the leaves displayed enlarged calli that failed to differentiate into buds or leaves (
Five regenerated tomato seedlings were randomly selected, and their genomic DNA was extracted. Subsequently, PCR/RE was performed to detect mutations at the PDS target loci. PCR products amplified from the M0-1 and M0-4 plants were partially resistant to HinfI digestion, indicating the presence of edited PDS genes. PCR products from the M0-2 plant, however, showed complete resistance to HinfI digestion, which indicated that all four PDS homeoalleles had been successfully edited. The PCR products from M0-3 and M0-5 plants were sensitive to HinfI digestion, indicating that the target genes were not edited (
Plant regeneration through culturing the leaf tissues infected with the TSWV vectors are usually ineffective in eliminating the virus. The embodiment investigated the effect of antiviral treatment during plant regeneration on virus clearance using leaf tissues infected with the TSWV-Cas12a-crRNA vector targeting the N. benthamiana crPDS1 site. As depicted in
Albino plants were observed during tissue culture in the different treatment groups (
Statistical analysis of the mutant genotypes in the PDSa and PDSb in each group of M0 plants showed that the highest proportion of plants with mutations at the target site was observed in the ribavirin treatment group. The remdesivir treatment group had only a slight increase over the mock group, while the favipiravir group had the lowest mutation rates (
To investigate the inheritance pattern of the different mutation genotypes from the M0 generation to next generation (M1), 19 M0 plants covering the four genotypes as previously defined, were selected, and self-pollinated seeds were obtained for each plant. Fourteen to sixteen M1 seedlings from each M0 line were randomly selected and analyzed for mutation genotypes at their respective target loci of PDSa and PDSb genes using HTS. The results are presented in Table 4. For M0 plants with WT and Chi genotypes (0<indels %≤35.0%), their M1 progeny were found to be wild-type. When M0 plants were heterozygous (He), their offspring contained a 3:1 ratio of mutant/wild-type, consistent with Mendel's law. In contrast, when M0 plants were Bi/Ho genotypes, their M1 progeny all contained mutations. These results indicated that the mutation types of He and Ho/Bi in M0 plants were stably inherited by the offspring, while the Chi mutation was not heritable.
Furthermore, 10 M0 plants with He or Ho/Bi genotypes for both PDSa and PDSb genes were selected to set seeds, and the phenotypes of their progeny were counted after sowing the seedlings. M1 offspring plants of some M0 lines showed a phenotype ratio of green seedlings to white seedlings that was approximately 3:1, such as RB-#54, while other lines had ratios of approximately 15:1, such as RD-#76 (
This embodiment further investigates the impact of antiviral drug treatment on virus clearance and mutant plant recovery using tobacco leaf tissues infected with the TSWV vectors for tissue culture and regeneration. We used the recombinant TSWV-Cas12a-crRNA vector described in Example 24 as an example, which targets the crPDS1 target sites of both PDS genes in tobacco. The systemic leaves infected with this virus vector were used as explants for tissue culture and plant regeneration, following the same experimental protocol as described in Example 24. After 10 days of culturing virus-infected tissues in a differentiation medium, some explants showed induction of callus, which further developed into leaves and shoots over time. The regenerated plants exhibited two phenotypes: albino and green seedlings. The green seedlings rooted normally and were successfully transplanted to the soil, while the albino seedlings failed to root (
HTS analysis of the percentage of regenerated plants containing target editing in each drug treatment group indicated that, the percentage of regenerated plants carrying target mutations in the ribavirin treatment group was significantly higher than that in the control treatment and other drug treatment groups (
In the above-described embodiment, the TSWV NSs gene encoding an RNA silencing suppressor (Takeda et al., FEBS Lett, 2002, 532: 75-79) was replaced with a reporter gene. To investigate the effect of NSs on the rescue efficiency, virulence, and gene editing ability of the recombinant virus, the recombinant virus vector TSWV-Cas9-NSs: gPDS1 containing NSs gene was tested in this embodiment. This vector was similar to TSWV-Cas9-RFP: gPDS1, but the NSs gene was retained upstream of the crPDS1 gRNA sequence instead of the RFP reporter gene (
For this purpose, the S antigenomic vector pS(+)NSs:gPDS1 was constructed. The linearized plasmid fragment was obtained by PCR amplification of the pS(+) plasmid using the primers gPDS1/NSs/F and S IGR/R, and the fragment containing the gPDS1 gRNA sequence (SEQ ID NO: 23) was obtained by PCR amplification using the primers gPDS1/F and IGR/scaffold/R. These fragments were then ligated with an In-Fusion reagent to generate the pS(+)NSs:gPDS1 vector.
The following Agrobacterium mixtures were infiltrated into N. benthamiana leaf tissues using the method described in Example 11.
In this particular experiment, the TSWV-Cas9-NSs:gPDS1 vector was able to express the NSs protein, and therefore, the pGD-NSs transient expression vector was not included in the agrobacterial mixtures. The TSWV-Cas9-RFP:gPDS1 began to display systemic infection 8 days after agroinfiltration, while the TSWV-Cas9-NSs:gPDS1 did not show systemic infection until day 10. However, there was no significant difference in the symptoms produced by both recombinant viruses (
Young leaf tissues infected with TSWV-Cas9-NSs:gPDS1 and TSWV-Cas9-RFP:gPDS1 were collected and cultured in media using the method described in Example 13. The TSWV-Cas9-NSs:gPDS1-infected leaf explants exhibited severe necrosis after 7 days of culturing and were essentially completely necrotic after 10 days of culturing. In contrast, leaf explants infected with TSWV-Cas9-RFP:gPDS1 did not exhibit severe necrosis and began to show callus differentiation by day 10 (
Based on the results described above, it can be concluded that the TSWV gene editing vector that retains the virulence factor gene NSs has a reduced rescue efficiency and induces severe necrosis of infected cells during tissue culture, ultimately hindering plant regeneration.
The TSWV glycoprotein (GP) is crucial for virion morphogenesis and insect transmission, as previously reported (Sin et al., Proc Natl Acad Sci USA, 2005, 102: 5168-5173). In the present invention, the GP gene is replaced by a heterologous sequence. To verify that the recombinant virus is unable to be transmitted by the insect vector thrips, N. benthamiana leaf tissues infected with the recombinant TSWV-Cas9-NSs:gPDS1 and wild-type TSWV (TSWV-WT) virus from Example 15 were used to inoculate sweet peppers plants mechanically. Pools of 24-hour-old first-instar thrips were given a 24-hour acquisition access period (AAP) on sweet peppers infected with TSWV or V-Cas12a-crCaPDS. Subsequently, groups of larvae were transferred to French kidney bean (Phaseolus vulgaris) pods to complete their development into adults. Adult thrips were transferred to test tubes containing healthy pepper leaf discs of 1.5 cm in diameter and allowed a 48-h inoculation entry period (IAP). After the IAP, thrips were collected in microcentrifuge tubes (10 thrips per tube) for RNA extraction, and the leaf discs were further incubated on the water surface of a microtiter plate at 27° C. for 4 days before RNA extraction. The presence of TSWV was detected by RT-PCR using the primers listed in Table 1. As shown in the table below, the virus acquisition rate of thrips was 53%, and the transmission rate was 65% on sweet peppers infected with the wild-type virus, whereas the recombinant TSWV-Cas9-NSs:gPDS1 was unable to be acquired and transmitted by thrips.
N. benthamiana
N. tabacum
S. lycopersicum
C. frutescens
C. annuum
A. hypogaea
G. max
N. benthamiana
N. benthamiana
N. tabacum
N. benthamiana
N. benthamiana
P. alkekengi
N. benthamiana
N. benthamiana
N. benthamiana
N. benthamiana
N. benthamiana
N. benthamiana
N. benthamiana
N. benthamiana
N. benthamiana
A. hypogaea
C. frutescens
C. annuum
TTTCTGCCGTTAATTTGAGAGTCCAAG
N. benthamiana
N. tabacum
TTTGGAACAGATTCCAATAAGAAGCCT
N. benthamiana
TTTGAGAAGCTATGCTTATCTTCTTCG
N. benthamiana
TTTCTGCTGTTAACTTGAGAGTCCAAG
S. lycopersicum
TTTCTGCTGTCAACTTGAGAGTCCAAG
C. frutescens
C. annuum
TTTGACAGAAAACTGAAGAACACATAT
A. hypogaea
TTTAATGTTAAACCTTTAGTGTCCTTT
P. alkekengi
N. benthamiana
N. benthamiana
N. tabacum
N. benthamiana
CCCCTCCTGACTCTTACCCAACT
N. benthamiana
N. benthamiana
N. benthamiana
CCTCTTGAGAATGTTGCTATTGG
N. benthamiana
N. benthamiana
CCAATATCTAGGCGTGAGGGATT
N. benthamiana
CCCGACTTCATGGGGAAGGAGGA
N. benthamiana
N. benthamiana
A. hypogaea
TTTCTGCTGTCAACTTGAGAGTCCAAG
C. frutescens
C. annuum
Number | Date | Country | Kind |
---|---|---|---|
202110431464.8 | Apr 2021 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2022/088299 | 4/21/2022 | WO |