The development of CRISPR as a programmable genome-engineering tool provides transformative applications for both medicine and biotechnology. However, much of the recent focus in synthetic biology has been on exploiting CRISPR to target DNA. Improved compositions and methods for utilizing CRISPR to target RNA are therefore needed.
In one aspect, provided herein are nucleic acid molecule comprising: (a) a sequence encoding a Cas13 polypeptide; and (b) a sequence encoding a guide RNA comprising one or more spacers and one or more Cas13-specific direct repeats, wherein the spacers are capable of specifically hybridizing with one or more target RNAs. In some embodiments, the Cas13 is Cas13d. In some embodiments, the Cas13d is RfxCas13d. In some embodiments, the sequence encoding the Cas13 polypeptide further comprises a localization signal. In some embodiments, the localization signal is a nuclear localization signal. In some embodiments, the target RNA is an endogenous RNA or a viral RNA. In some embodiments, the target RNA is an mRNA. In some embodiments, the spacers are positioned between two Cas13-specific direct repeats. In some embodiments, the spacers are 20 to 40 nucleotides in length. In some embodiments, the spacers are 25 to 35 nucleotides in length. The nucleic acid molecule of claim 10, wherein the spacers are about 30 nucleotides in length. In some embodiments, the Cas13-specific direct repeats are 25 to 45 nucleotides in length. The nucleic acid molecule of claim 12, wherein the Cas13-specific direct repeats are 30 to 40 nucleotides in length. The nucleic acid molecule of claim 13, wherein the Cas13-specific direct repeats are about 36 nucleotides in length. In some embodiments, the guide RNA further comprises a AAAAC motif at its 5′ end. In some embodiments, the guide RNA comprises two or more spacers, and wherein the two or more spacers are capable of specifically hybridizing with the same target RNA. In some embodiments, the guide RNA comprises two or more spacers, and wherein the two or more spacers are capable of specifically hybridizing with different target RNAs. In some embodiments, the guide RNA comprises three or more spacers. In some embodiments, the sequence encoding a Cas13 polypeptide is operably linked to a ubiquitous promoter. In some embodiments, the sequence encoding a Cas13 polypeptide is operably linked to an inducible promoter. In some embodiments, the sequence encoding a Cas13 polypeptide is operably linked to a tissue-specific promoter. In another aspect, provided herein are vectors comprising any of the nucleic acid molecules described herein. In some embodiments, the vector is a single vector. In some embodiments, the vector is an Adeno-associated viral vector. Also provided herein are cells comprising any of the nucleic acid molecules described herein. In another aspect, provided herein are methods of modifying a target RNA in a cell, the methods comprising contacting the cell with any of the nucleic acid molecules described herein. Also provided herein are methods of modifying a target RNA in a cell, the method comprising contacting the cell with any of the vectors described herein. In some embodiments, the target RNA is endogenous RNA or viral RNA.
In another aspect, provided herein are methods of modifying a target RNA in a cell, the methods comprising contacting the cell with (a) a nucleic acid molecule comprising a sequence encoding a Cas13 polypeptide, and (b) a sequence encoding a guide RNA comprising one or more spacers and one or more Cas13-specific direct repeats, wherein the one or more spacers are capable of specifically hybridizing with the target RNA. In some embodiments, the Cas13 is Cas13d. In some embodiments, the Cas13d is RfxCas13d. In some embodiments, the sequence encoding the Cas13 polypeptide further comprises a localization signal In some embodiments, the localization signal is a nuclear localization signal. In some embodiments, the spacers are positioned between two Cas13-specific direct repeats. In some embodiments, the spacers are 20 to 40 nucleotides in length. In some embodiments, the spacers are 25 to 35 nucleotides in length. In some embodiments, the spacers are about 30 nucleotides in length. In some embodiments, the Cas13-specific direct repeats are 25 to 45 nucleotides in length. In some embodiments, the Cas13-specific direct repeats are 30 to 40 nucleotides in length. In some embodiments, the Cas13-specific direct repeats are about 36 nucleotides in length. In some embodiments, the guide RNA further comprises a AAAAC motif at its 5′ end. In some embodiments, the guide RNA comprises two or more spacers, and wherein the two or more spacers are capable of specifically hybridizing with the same target RNA. In some embodiments, the guide RNA comprises two or more spacers, and wherein the two or more spacers are capable of specifically hybridizing with different target RNAs. In some embodiments, the guide RNA comprises three or more spacers. In some embodiments, the sequence encoding a Cas13 polypeptide is operably linked to a ubiquitous promoter. In some embodiments, the sequence encoding a Cas13 polypeptide is operably linked to an inducible promoter. In some embodiments, the sequence encoding a Cas13 polypeptide is operably linked to a tissue-specific promoter. In some embodiments, the nucleic acid molecule is comprised within a first vector and the guide RNA is comprised within a second vector. In some embodiments, the first vector and/or the second vector is an AAV vector.
In another aspect, provided herein are transgenic organisms having a recombinant nucleic acid molecule stably integrated into the genome of the organisms, wherein the recombinant nucleic acid molecule comprises a sequence that encodes a Cas13 polypeptide. Also provided are transgenic organisms having two or more recombinant nucleic acid molecules stably integrated into the genome of the organism, comprising at least a first recombinant nucleic acid molecule that comprises a sequence encoding a Cas13 polypeptide, and a second recombinant nucleic acid molecule that comprises a sequence that encodes a guide RNA. In some embodiments, the Cas13 polypeptide is a Cas13d. In some embodiments, the Cas13d polypeptide is RfxCas13d. In some embodiments, the organism is a vertebrate. In some embodiments, the organism is an invertebrate. In some embodiments, the organism is an insect.
All publications, patents, patent applications, and information available on the internet and mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, patent application, or item of information was specifically and individually indicated to be incorporated by reference. To the extent publications, patents, patent applications, and items of information incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
Where values are described in terms of ranges, it should be understood that the description includes the disclosure of all possible sub-ranges within such ranges, as well as specific numerical values that fall within such ranges irrespective of whether a specific numerical value or specific sub-range is expressly stated.
Various embodiments of the features of this disclosure are described herein. However, it should be understood that such embodiments are provided merely by way of example, and numerous variations, changes, and substitutions can occur to those skilled in the art without departing from the scope of this disclosure. It should also be understood that various alternatives to the specific embodiments described herein are also within the scope of this disclosure.
The present disclosure provides nucleic acid molecules comprising (a) a sequence encoding a Cas13 polypeptide; and (b) a sequence encoding a guide RNA comprising one or more spacers and one or more Cas13-specific direct repeats, where the spacers are capable of specifically hybridizing with one or more target RNAs. In some instances, the Cas13 is Cas13d. Also provided are methods of modifying a target RNA in a cell comprising contacting the cell with a nucleic acid molecule comprising a sequence encoding a Cas13 polypeptide and a sequence encoding a guide RNA described herein. The present disclosure further provides transgenic organisms having a recombinant nucleic acid molecule stably integrated into the genome of the organism, where the recombinant nucleic acid molecule comprises a sequence that encodes a Cas13 polypeptide.
Current applications of CRISPR-Cas nucleases in Drosophila melanogaster are limited to DNA-targeting class 2 systems. The present disclosure reports, among other things, a programmable platform for transcript targeting applications utilizing a Type VI-D RNA-targeting Cas ribonuclease, CasRx. The present disclosure provides methods for genetically encoding CasRx allowing for CRISPR-based transcript targeting manifesting as visible phenotypes comparable to previous gene knockdown experiments. Through genetic and bioinformatic analysis, the disclosure demonstrates on-target transcript knockdown capabilities of CasRx. The disclosure also includes description of off-target effects following on-target transcript cleavage by CasRx, providing the first evidence of off-target activity expressing a Type VI ribonuclease in eukaryotes. The disclosure provides the use of a programmable RNA-targeting Cas system in e.g., Drosophila melanogaster, and provides alternative approaches for in vivo gene knockdown studies.
CRISPR functions via the association of CRISPR RNAs (crRNAs) and CRISPR-associated (Cas) proteins to provide adaptive and heritable immunity to protect prokaryotic hosts from foreign genetic elements and invading viruses. Specifically, it acts as a programmable RNA-guided nuclease capable of degrading exogenous nucleic acids (DNA or RNA) by exploiting molecular memory of prior infections archived as heritable DNA sequences in CRISPR arrays. These CRISPR arrays consist of altering repeats and invader-derived (spacer) DNA sequences which get transcribed and then processed into small, mature crRNAs. Mature crRNAs then combine with Cas proteins to form crRNA-Cas complexes, which target and cleave specific nucleic acid sequences. There are several types and subtypes of CRISPR systems found in bacteria that utilize a diversity of proteins and mechanisms to provide immunity. For example, Type I, II, V (and perhaps IV) target DNA, while Type III targets both DNA and RNA, and Type VI targets RNA exclusively.
While much of the recent focus in synthetic biology has been on exploiting CRISPR to target DNA, the recent findings that Type VI CRISPR systems can also be reprogrammed to target RNA has revealed exciting possibilities for transcriptome engineering. For example, one recent discovery was the finding and functional characterization of CasRx as a compact single-effector Cas enzyme that exclusively targets RNA with superior efficiency and specificity as compared to RNA interference (RNAi) (See e.g., Konermann et al. Cell 173:665-676 (2018)). In human cells, CasRx demonstrated highly efficient on-target gene knockdown with limited off-target activity. Given these characteristics, we wanted to test its functionality in Drosophila melanogaster (flies) to enable the exploration of new biological questions in vivo. While CRISPR has been used extensively to generate heritable DNA mutations in flies, RNA-targeting using CRISPR has not been demonstrated and therefore RNA-targeting in flies is restricted to the application of RNAi-based approaches.
The present disclosure provides the first use of a Cas-based RNA-targeting system through CasRx-mediated transcript targeting in vivo, e.g., in flies. In some instances, the methods and compositions provided herein involve CasRx and guide RNA arrays (gRNAarray) that are encoded in the genome to promote robust expression throughout development. Performing bidirectional and binary genetic crosses with ubiquitous and tissue-specific expression of CasRx, the disclosure demonstrates the ability to obtain clear, highly penetrant phenotypes comparable to previously established phenotypes obtained by RNAi. In some instances, transcript knockdown are quantified through RNA sequencing (RNAseq) analysis. CasRx is shown to be capable of targeted knockdown for various genes at numerous stages of fly development, and can be useful for transcript targeting applications and genome editing in vivo.
Unless otherwise indicated “nuclease” can refer to an enzyme which possesses endonucleolytic catalytic activity for polynucleotide cleavage.
A “target RNA” as used herein can include an RNA that can include a “target sequence”. The term “target sequence” can refer to a nucleic acid sequence present in a target RNA to which a spacer of a guide RNA can hybridize, provided sufficient conditions for hybridization exist. Hybridization between the spacer and the target sequence can, for example, be based on Watson-Crick base pairing rules, which enables programmability in the spacer sequence. The spacer sequence can be designed, for instance, to hybridize with any target sequence.
The “spacer” within a guide RNA can include a nucleotide sequence that is complementary to a specific sequence within a target RNA.
“Binding” as used herein can refer to a non-covalent interaction between macromolecules (e.g., between a protein and a nucleic acid). While in a state of non-covalent interaction, the macromolecules are said to be “associated” or “interacting” or “binding” (e.g., when a molecule X is said to interact with a molecule Y, it means that the molecule X binds to molecule Y in a non-covalent manner). Binding interactions are generally characterized by a dissociation constant (Kd) of less than 10−6 M, less than 10−7 M, less than 10−8 M, less than 10−9 M, less than 10−10 M, less than 10−11 M, less than 10−12 M, less than 10−13 M, less than 10−14 M, or less than 10−15 M. Kd is dependent on environmental conditions, e.g., pH and temperature, as is known by those in the art. “Affinity” refers to the strength of binding, and increased binding affinity is correlated with a lower Kd.
The terms “hybridizing” or “hybridize” can refer to the pairing of substantially complementary or complementary nucleic acid sequences within two different molecules. Pairing can be achieved by any process in which a nucleic acid sequence joins with a substantially or fully complementary sequence through base pairing to form a hybridization complex. For purposes of hybridization, two nucleic acid sequences or segments of sequences are “substantially complementary” if at least 80% of their individual bases are complementary to one another.
As used herein, “operably linked” can refer to the situation in which part of a linear DNA sequence can influence the other parts of the same DNA molecule. For example, when a promoter controls the transcription of the coding sequence, it is operatively linked to the coding sequence.
As used herein, a “polypeptide” can include proteins, fragments of proteins, and peptides, whether isolated from natural sources, produced by recombinant techniques, or chemically synthesized. A polypeptide may have one or more modifications, such as a post-translational modification (such as glycosylation, etc.) or any other modification (such as PEGylation, etc.). The polypeptide may contain one or more non-naturally-occurring amino acids (such as an amino acid with a side chain modification). Polypeptides described herein typically comprise at least about 10 amino acids.
As used herein, “contacting” a cell with a nucleic acid molecule can be allowing the nucleic acid molecule to be in sufficient proximity with the cell such that the nucleic acid molecule can be introduced into the cell.
A “promoter” can be a region of DNA that leads to initiation of transcription of a gene.
A “motif” can be a nucleotide or amino acid sequence pattern that is correlated with biological significance or function.
I. Cas 13 Polypeptide
Provided herein are nucleic acid molecules comprising: (a) a sequence encoding a Cas13 polypeptide; and (b) a sequence encoding a guide RNA comprising one or more spacers and one or more Cas13-specific direct repeats, where the spacers are capable of specifically hybridizing with one or more target RNAs.
“Nucleic acid molecules” as used herein can include a DNA sequence or an RNA sequence. The Cas13 polypeptide can be any of the Cas13 polypeptides described herein or known in the art. “Cas13 polypeptides” and “Cas13” are used interchangeably herein. Cas13 are RNA-targeting programmable nucleases associated with Type VI CRISPR-Cas systems. Type VI CRISPR-Cas systems are dedicated RNA-targeting immune systems in prokaryotes. Cas13 family contains at least four known subtypes, including Cas13a (formerly C2c2), Cas13b, Cas13c and Cas13d. Type VI-A and VI-B systems possess the crRNA-dependent target cleavage activity and a non-specific, collateral RNase activity that is stimulated by target recognition and cleavage. Both of these activities are mediated by the two HEPN domains contained in type VI effectors Cas13a and Cas13b (Yan et al. Molecular Cell 70(2):327-339, 2018).
In some instances, the Cas13 is a Cas13d protein. Cas13d are effectors associated with subtype VI-D, a variant of type VI CRISPR-Cas, and have robust target cleavage and collateral RNase activities along with their ability to process pre-crRNA. Cas13d has a smaller size compared to other Cas13s and can be advantageous for RNA targeting applications described herein, such as for packaging into a viral vector for delivery.
Cas13 can be guided by a guide RNA which encodes target specificity. The Cas13 protein complexes with the guide RNA via recognition of a short hairpin in the crRNA and target specificity is encoded by a spacer that is complementary to the target region. In addition to programmable RNase activity, Cas13s exhibit collateral activity after recognition and cleavage of a target transcript, leading to non-specific degradation of any nearby transcripts regardless of complementarity to the spacer. Cas13 can process its own pre-crRNAs, allowing individual short single crRNAs to be customized to target RNA in vitro or to provide Escherichia coli with programmable immunity against the lytic single-stranded RNA MS2 bacteriophage. CRISPR/Cas13 can have broad applicability as an RNAi-like platform for RNA silencing. Compared to small RNAs and RNA interference, which are difficult in design and are limited by high off-target potential, CRISPR/Cas13 can be used to manipulate only the target RNA, with few or no off-target effects in eukaryotes, and multiple crRNAs can be used to eradicate a particular mRNA transcript.
The Cas13 polypeptides can be naturally-occurring or non-naturally occurring. The Cas13 polypeptides can be a mutant Cas13 polypeptide (e.g., a mutant of a naturally occurring Cas13 polypeptide). Mutant Cas13 can have altered activity compared to a naturally occurring Cas13, such as altered nuclease activity without substantially diminished binding affinity to RNA). In some instances, the mutant Cas13 has no nuclease (e.g., ribonuclease) activity. For instance, mutant Cas13 encodes a ribonuclease with the positively charged catalytic residues of the HEPN motifs inactivated thereby eliminating programmable RNA cleavage without affecting guide RNA array processing, or target RNA binding. The Cas13 can have a size of about 700 to about 1200 amino acids (e.g., about 700 to about 1100, about 700 to about 1000, about 700 to about 900, about 700 to about 800, about 800 to about 1200, about 800 to about 1100, about 800 to about 1000, about 800 to about 900, about 900 to about 1200, about 900 to about 1100, about 900 to about 1000, about 1000 to about 1200, about 1000 to about 1100, or about 1100 to about 1200 amino acids). In some instances, the Cas13 has a size of about 930 amino acids. In some instances, the Cas13 is Cas13d. Cas13d derived from a variety of species are contemplated herein, including but not limited to, Ruminococcus sp., Ruminoccocus flavefaciens, Ruminoccocus albus, and Eubacterium siraeum. In some instances, the Cas13d is derived from Ruminococcus flavefaciens strain XPD3002 (e.g., CasRx or RfxCas13d). In some instances, the Cas13d is a catalytically inactive version of CasRx (e.g. dCasRx). An exemplary sequence of CasRx (NLS-RfxCas13d-NLS) can be found at Plasmid #109049 (pXR001: EF1a-CasRx-2A-EGFP, addgene).
The sequence encoding a Cas13 polypeptide described herein can be at least 80% identical (e.g. at least 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, or 99% identical) to the sequence of RfxCas13d. In some instances, the sequence encoding a Cas13 polypeptide is identical to the sequence of Cas13d.
In some embodiments, the nucleic acid molecule provided herein comprises a sequence encoding a Cas13 protein and further comprises one or more localization signals. Localization signals can be an amino acid sequence on a protein that tags the protein for transportation to a particular location in a cell. An exemplary localization signal is nuclear localization signal, which can be an amino acid sequence that tags a protein for import into the cell nucleus by nuclear transport. The localization signals can be operably linked to the sequence encoding a Cas13 protein. In some embodiments, the localization signal is a nuclear localization signal. For example, the sequence encoding Cas13 can encode two nuclear localization signals, where upon translation, the Cas13 is fused to N- and C-terminal nuclear localization signals. An exemplary NLS is SV40 large T antigen NLS (PKKKRRV (SEQ ID NO: 1)) and nucleoplasmin NLS (KRPAATKKAGQAKKKK (SEQ ID NO: 2)). Other NLSs are known in the art; see, e.g., Konermann et al., Cell 173:665-676, 2018; Cokol et al., EMBO Rep. 1(5):411-415 (2000); Freitas and Cunha, Curr Genomics 10(8): 550-557 (2009).
In some instances, the sequence encoding a Cas13 polypeptide is operably linked to a promoter. Suitable promoters include but are not limited to ubiquitous promoters (e.g., ubiquitin promoter), tissue-specific promoters, inducible promoters, and constitutive promoters.
The sequence encoding a Cas13 polypeptide can be further operably linked to a sequence that encodes one or more reporter genes. Any suitable reporter genes are contemplated, including but not limited to, fluorescent reporters.
II. Guide RNA
Provided herein are guide RNAs comprising one or more spacers and one or more Cas13-specific direct repeats, where the spacers are capable of specifically hybridizing with one or more target RNAs. Also provided herein are sequences encoding the guide RNAs provided herein.
The guide RNA can include at least one (e.g., at least two, three, four, five, six, or seven) spacers. The spacers can bind to the same or different target sequences in the same target RNA, or can bind to different target RNAs. The spacers can be designed to target any sequence in a target RNA. In instances where two or more spacers are included in the guide RNA, the spacers can have the same or different length. The spacers can have a length of between 20 to 40 nucleotides (e.g., 20 to 35, 20 to 30, 20 to 25, 25 to 40, 25 to 35, 25 to 30, 30 to 40, 30 to 35, or 35 to 40 nucleotides). In some instances, the spacers can have a length of about 30 nucleotides.
The guide RNA can include at least one (e.g., at least two, three, four, five, six, or seven) direct repeats. A direct repeat can be a repetitive sequence within a CRISPR locus that are interspersed by short spacers. A direct repeat sequence can have homology to a trans-activating CRISPR RNA, and facilitates the formation of a crRNA: tracrRNA duplex. The sequence and secondary structure of Cas13-specific direct repeats can be dependent on the specific Cas13. For instance, Cas13d from different species can have different direct repeat sequences and/or secondary structures. Exemplary direct repeat sequences for Cas13d can be found at e.g. Konnerman et al. Cell 173:665-676 (2018). The Cas13-specific direct repeats in the guide RNA provided herein can be chosen based on the specific Cas13 used. Direct repeat sequences functioning together with Cas13 proteins of various bacterial species may be identified by bioinformatic analysis of sequence repeats occurring in the respective CRISPR/Cas operons and by experimental binding studies of Cas13 protein together with putative DR sequence flanked target sequences. The Cas13-specific direct repeats can be about 30 to about 40 (e.g., about 31, 32, 33, 34, 35, 36, 37, 38, or 39) nucleotides in length. In some instances, the Cas13-specific (e.g., Cas13d-specific) direct repeats are about 36 nucleotides in length. In some instances, the direct repeats form a hairpin structure capable of interacting with the Cas13 polypeptide to form a complex. In some instances, the Cas13-specific direct repeats are Cas13d-specific direct repeats. Exemplary Cas13d-specific direct repeat sequences can be found at Konermann et al. Cell 173:665-676 (2018).
An exemplary sequence of a RfxCas13d-specific direct repeat is shown below (SEQ ID NO: 3): CAAGUAAACCCCUACCAACUGGUCGGGGUUUGAAAC
The direct repeats in the guide RNA described herein can include a sequence that is at least 80% identical (e.g. at least 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99% identical) to SEQ ID NO: 3.
The spacers can be arranged in tandem and interspersed by direct repeats. For example, a spacer can be positioned between two direct repeats. The guide RNA can include, e.g., as part of its sequence, [direct repeat 1-spacer 1-direct repeat 2-spacer 2-direct repeat 3-spacer 3-direct repeat 4-spacer 4-direct repeat 5]. In some instances, the guide RNA includes n spacers and n+1 direct repeats, where n≥1 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10).
Some embodiments disclosed herein provide nucleic acid molecules that encode the guide RNA. Some embodiments provide vectors comprising nucleic acid molecules encoding the guide RNA. Nucleic acid molecules encoding the guide RNA can be operably linked to one or more promoters. Any suitable promoters described herein and known in the art are contemplated, such as but not limited to, a polymerase III promoter, such as a polymerase-3 U6 (U6:3) promoter. Exemplary U6 promoters can be found e.g., in Xia et al. Nucleic Acids Res. 31(17) e100; or at Addgene plasmid #112688 (gRNA[Sxl]0.1026B).
Nucleic acid molecules encoding the guide RNA can be further operably linked to sequences that encode one or more reporter genes. Any suitable reporter genes are contemplated, including but not limited to, fluorescent reporters.
III. Vectors
Some embodiments disclosed herein provide vectors (e.g. viral vectors) that comprise nucleic acid molecules comprising a sequence encoding a Cas13 polypeptide (e.g. any Cas13 polypeptides described herein) and/or a sequence encoding a guide RNA (e.g. any guide RNAs described herein). Any suitable vectors described herein and known in the art are contemplated. In some instances, the viral vector is an Adeno-associated viral vector (AAV). Adeno-associated virus is a naturally occurring defective virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient replication and a productive life cycle. (For a review, see Muzyczka et al., Curr. Topics in Micro and Immunol. 158:97-129 (1992)). AAV vectors efficiently transduce various cell types and can produce long-term expression of transgenes in vivo. AAV vectors have been extensively used for gene augmentation or replacement and have shown therapeutic efficacy in a range of animal models as well as in the clinic; see, e.g., Mingozzi and High, Nature Reviews Genetics 12, 341-355 (2011); Deyle and Russell, Curr Opin Mol Ther. 2009 August; 11(4): 442-447; Asokan et al., Mol Ther. 2012 April; 20(4): 699-708. AAV vectors containing as little as 300 base pairs of AAV can be packaged and can produce recombinant protein expression. For example, AAV2, AAV5, AAV2/5, AAV2/8 and AAV2/7 vectors have been used to introduce DNA into photoreceptor cells (see, e.g., Pang et al., Vision Research 2008, 48(3):377-385; Khani et al., Invest Ophthalmol Vis Sci. 2007 September; 48(9):3954-61; Allocca et al., J. Virol. 2007 81(20):11372-11380). In some embodiments, the AAV vector can include (or include a sequence encoding) an AAV capsid polypeptide described in PCT/US2014/060163; for example, a virus particle comprising an AAV capsid polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, and 17 of PCT/US2014/060163, and a Cas9 sequence and guide RNA sequence as described herein. In some embodiments, the AAV capsid polypeptide is an Anc80 polypeptide, e.g., Anc80L27; Anc80L59; Anc80L60; Anc80L62; Anc80L65; Anc80L33; Anc80L36; or Anc80L44. In some embodiments, the AAV incorporates inverted terminal repeats (ITRs) derived from the AAV2 serotype. Exemplary left and right ITRs are presented in Table 6 of WO 2018/026976. It should be noted, however, that numerous modified versions of the AAV2 ITRs are used in the field, and the ITR sequences shown below are exemplary and are not intended to be limiting. Modifications of these sequences are known in the art, or will be evident to skilled artisans, and are thus included in the scope of this disclosure. Expression of Cas13 and/or guide RNA in the AAV vector can be driven by a promoter described herein or known in the art.
IV. Target RNA and Methods of Modifying a Target RNA in a Cell
The target RNA can be any RNA molecules endogenous or exogenous to a eukaryotic cell, and can be protein-coding or non-protein-coding. A variety of RNA targets are contemplated herein. For example, a target RNA can be messenger RNA (mRNA), ribosomal RNA (rRNA), signal recognition particle RNA (SRP RNA), transfer RNA (tRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), antisense RNA (aRNA), long noncoding RNA (lncRNA), microRNA (miRNA), piwi-interacting RNA (piRNA), small interfering RNA (siRNA), short hairpin RNA (shRNA), retrotransposon RNA, viral genome RNA, viral noncoding RNA, or the like. A guide RNA provided herein can include spacers that are capable of specifically hybridizing with the same target RNA or at least two different target RNAs.
In some instances, the RNA can be a viral RNA (e.g., single stranded viral RNA). The viral RNA can be from an anthropod-borne virus (arboviruse), such as but not limited to tick-borne viruses, midge-borne viruses, and mosquito-borne viruses. Exemplary viruses include, but are not limited to, Zika, Chikungunya, Dengue, Yellow fever, West Nile, Japanese encephalitis, Rift Valley fever, and Eastern equine encephalitis viruses. See, e.g. Reynolds et al. Comp Med 67(3):232-241 (2017). Additional viruses contemplated include, but are not limited to, lymphocytic choriomeningitis virus (LCMV), influenza A virus (IAV), and vesicular stomatitis virus (VSV). Additional viral RNAs that can be targeted by the compositions and methods described here in can be found at e.g., Frejie et al., Molecular Cell, 76(5):826-837. Collateral cleavage and tissue-specific cell death resulting from the use of the systems provided herein can be useful for ssRNA virus targeting in arbovirus vectors.
In some aspects, the present disclosure provides methods of modifying a target RNA in a cell. The methods can include introducing a nucleic acid sequence encoding a Cas13 polypeptide (e.g., any of the Cas13 polypeptides described herein) and a guide RNA (e.g., any of the guide RNAs described herein) into the cell. The sequence encoding a Cas13 protein and the guide RNA can be introduced into the cell in the same nucleic acid molecule or in different nucleic acid molecules. In some instances, the methods include contacting the cell with (a) a nucleic acid molecule comprising a sequence encoding a Cas13 polypeptide, and (b) a sequence encoding a guide RNA comprising one or more spacers and one or more Cas13-specific direct repeats, wherein the one or more spacers are capable of specifically hybridizing with the target RNA. In some instances, the sequence encoding a Cas13 polypeptide is introduced via a first vector (e.g. any suitable vectors described herein) and the guide RNA is introduced via a second vector (e.g. any suitable vectors described herein). Also contemplated are cells comprising the nucleic acid molecule comprising a sequence encoding a Cas13 polypeptide and/or a sequence encoding a guide RNA described herein.
Methods of Monitoring Target RNA Modification
The present disclosure in some instances provides compositions for and methods of monitoring target RNA modification e.g., in a cell, comprising monitoring the presence and/or levels of a target RNA, or monitoring the presence and/or levels of a protein corresponding to a target RNA (e.g. for protein-coding RNA). Any suitable techniques and assays for monitoring RNA and/or protein levels known in the art are contemplated herein. Exemplary methods include in situ hybridization, antibody staining, and RNA sequencing.
V. Transgenic Organisms
As used herein, a “transgenic organism” can include a non-human animal in which one or more of the cells of the organism includes a transgene. The organism can be a vertebrate or an invertebrate, such as an arthropod (e.g., an insect).
In some instances, a transgenic organism provided herein has a recombinant nucleic acid molecule stably integrated into the genome of the organism, wherein the recombinant nucleic acid molecule comprises a sequence that encodes a Cas13 polypeptide (e.g. any of the Cas13 polypeptides described herein). In some instances, a transgenic organism has two or more recombinant nucleic acid molecules stably integrated into the genome of the organism, comprising at least a first recombinant nucleic acid molecule that comprises a sequence encoding a Cas13 polypeptide, and a second recombinant nucleic acid molecule that comprises a sequence that encodes a guide RNA.
A founder animal carrying a recombinant nucleic acid comprising a sequence that encodes a Cas13 polypeptide can be identified based upon the presence of the sequence in its genome and/or expression of Cas13 in tissues or cells of the animal. A founder animal carrying a recombinant nucleic acid comprising a sequence that encodes a guide RNA can be identified based upon the presence of the sequence in its genome. A transgenic founder animal can then be used to breed additional animals carrying the transgene. A transgenic animal can be heterozygous or homozygous for the transgenes.
Methods for making transgenic animals are known in the art; see, e.g., WO2016049024; WO201604925; WO2017124086; WO2018009562; and U.S. Pat. No. 9,901,080. Such techniques include, without limitation, pronuclear microinjection (See, e.g., U.S. Pat. No. 4,873,191), retrovirus mediated gene transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci. USA, 82:6148-1652 (1985)), gene targeting into embryonic stem cells (Thompson et al., Cell 56:313-321 (1989)), electroporation of embryos (Lo, Mol. Cell. Biol., 3:1803-1814 (1983)), and in vitro transformation of somatic cells, such as cumulus or mammary cells, followed by nuclear transplantation (Wilmut et al., Nature, 385:810-813 (1997); and Wakayama et al., Nature, 394:369-374 (1998)); these methods can be modified to use CRISPR as described herein. For example, fetal fibroblasts can be genetically modified using CRISPR as described herein, and then fused with enucleated oocytes. After activation of the oocytes, the eggs are cultured to the blastocyst stage. See, for example, Cibelli et al., Science, 280:1256-1258 (1998).
The present disclosure also provides a population of cells isolated from an organism as described herein, as well as primary or cultured cells, e.g., isolated cells, engineered to include a sequence that encodes a Cas13 protein and/or a guide RNA. The cells can be isolated from any of the transgenic animals described above. Also provided are methods of introducing the transgenes described herein into a cell (e.g., primary cells or cultured cells). Exemplary methods include viral delivery (e.g., using viral vectors) and electroporation.
To determine the efficacy of CRISPR-based programmable RNA-targeting in flies, flies were engineered to encode the CasRx ribonuclease. To do so, two transgenes were generated utilizing a broadly expressing ubiquitin (Ubiq) promoter to drive expression of either CasRx (Ubiq-CasRx), or a catalytically inactive version of the ribonuclease, termed dCasRx (Ubiq-dCasRx), used as a negative control (
To assess the efficacy of programmable RNA-targeting by CasRx, bidirectional genetic crosses were conducted between homozygous gRNAarray (+/+; gRNAarray/gRNAarray) expressing flies crossed to either Ubiq-CasRx (Ubiq-CasRx/CyO; +/+), or Ubiq-dCasRx (Ubiq-dCasRx/CyO; +/+) expressing flies (
However, while the Mendelian transheterozygote inheritance rates were expected to be 50%, the recorded inheritance rates were significantly lower than expected (ranging from 9.6-28.4%) suggesting some possible toxicity leading to lethality (
To further explore the utility of programmable RNA-targeting of CasRx in flies, its efficiency was investigated when expression was restricted to specific cell types and tissues by leveraging the classical binary Gal4/UAS system. To develop this system, two transgenes were generated using the UASt promoter to drive expression of either CasRx (UASt-CasRx), or as a negative control dCasRx (UASt-dCasRx) (
To further explore programmable ribonuclease activity of CasRx and quantify the level of transcript reduction, a dual luciferase reporter assay was developed. This assay comprised of ubiquitously expressed firefly luciferase (Fluc) and a control renilla luciferase (Rluc) (Ubiq-Fluc-Ubiq-Rluc) (
Furthermore, it was confirmed that only the combination of all three transgenes (Ubiq-CasRx/+; gRNAFluc/Ubiq-Fluc-Ubiq-Rluc) resulted in lethality by crossing heterozygous flies expressing Ubiq-CasRx (Ubiq-CasRx/Cyo; +/+) to homozygous flies expressing either gRNAFluc (+/+; gRNAFluc/gRNAFluc) or homozygous flies expressing the dual luciferase reporter transgene (+/+; Ubiq-Fluc-Ubiq-Rluc/Ubiq-Fluc-Ubiq-Rluc). As expected, no distinguishable phenotypes or dramatic influence on inheritance in F1 transheterozygotes (Ubiq-CasRx/+; gRNAFluc/+ or Ubiq-CasRx/+; Ubiq-Fluc-Ubiq-Rluc/+) compared to Ubiq-dCasRx controls (Ubiq-dCasRx/+; gRNAFluc/+ or Ubiq-dCasRx/+; Ubiq-Fluc-Ubiq-Rluc/+) were observed (
Given the inability to generate and measure luciferase expression from Ubiq-CasRx F2 triple transheterozygotes (Ubiq-CasRx/+; gRNAFluc/Ubiq-Fluc-Ubiq-Rluc) in the luciferase crosses described above, a GFP reporter assay was generated to directly visualize CasRx-mediated transcript knockdown. A binary GFP reporter construct was generated, comprised of both a CasRx gRNAarray targeting GFP along with GFP expression driven by the broadly expressing OpIE2 promoter (gRNAGFP) (
Upon obtaining distinct visual phenotypes from Ubiq-CasRx transheterozygotes (Ubiq-CasRx/+; gRNAaarray/+), both the on- and potential off-target transcript reduction rates were quantified. All gRNAarray target genes from our binary crosses producing either highly-penetrant, visible phenotypes (w, cn, and wg) or lethal phenotypes (N, y, and GFP) were analyzed (Table 5). To do so, whole-transcriptome RNAseq analysis was implemented comparing F1 Ubiq-CasRx transheterozygotes (Ubiq-CasRx/+; gRNAaarray/+) compared to control F1 Ubiq-dCasRx transheterozygotes (Ubiq-dCasRx/+; gRNAarray/+) (
In total 34 samples were analyzed (Table 5), and CasRx was found to be capable of consistent on-target transcript reduction based on bioinformatic analysis (
These results demonstrate the use of CasRx for programmable RNA-targeting in flies. Although cellular toxicity from ubiquitous expression of CasRx and dCasRx was observed, as well as unexpected lethality and tissue necrosis in both bidirectional and Gal4/UAS crosses, clear, visible phenotypes as well as quantitative evidence demonstrating on-target transcript cleavage were obtained. This is the first demonstration of a programmable RNA targeting Cas system in Drosophila melanogaster, paving the way to providing an alternative approach for gene knockdown studies in vivo, however with further optimization may be required to increase the CasRx on-target cleavage rates.
Through analysis of RNaseq data, consistent reduction in target gene expression was found, however only 50% of the samples crossed a significance threshold. Since clear phenotypes were observed indicating on-target transcript knockdown for w, cn, and wg targeting, but no significant on-target reduction was found through DESeq2 analysis, it is hypothesized that developmental timing of sample collection is imperative for quantifying transcript knockdown efficiency. Notwithstanding, significant on-target transcript expression reduction were obtained that also corresponded with lethality phenotypes (y, N, and GFP) and resulted in numerous misexpressed genes. Targeting GFP, a non-essential gene, produced the largest quantity of misexpressed genes as well as the most significant fold change compared to all other gene targets analyzed. Interestingly, Gadd45, a gene involved in cellular arrest and apoptosis in Drosophila melanogaster, was found to be significantly misexpressed in 4 samples (w, N, y, and GFP). It is possible that CasRx cleavage may result in a dramatically higher number of misexpressed genes and possible lethality or cellular apoptosis.
Evidence of off-target effects resulting from catalytic activity of CasRx identified through DESeq2 analysis is provided. This is the first report of off-target activity occurring from the application of a Cas13 ribonuclease in eukaryotic cells, and key factors that determine lethality are highlighted. Two main factors contributing to CasRx-mediated lethality were identified: the catalytic activity of the CasRx HEPN domains and the presence of the target transcript resulting in on-target cleavage. For example, lethality and tissue necrosis phenotypes were eliminated comparing dCasRx to CasRx crosses and no lethality was observed when crossing Ubiq-CasRx expressing flies to gRNAFluc expressing flies in the absence of the Fluc transcript. These results recapitulate previous mechanistic analysis of CasRx and other Cas13 ribonucleases demonstrating that off-target activity following targeted transcript cleavage is a native feature of Cas13 ribonuclease applications.
Cas13 enzymes have been proposed to be highly specific ribonucleases with the ability to replace previously developed RNAi technologies. dCas13 enzymes retain efficient RNA binding activity and can be modified to effectively diminish the promiscuous RNase activity of Cas13 ribonucleases. Previous studies have utilized dCas13 enzymes for RNA base editing, dynamic imaging of RNA, and to manipulate pre-mRNA splicing, demonstrating both the specificity and versatility of dCas13 RNA binding. Further modifications to dCasRx may provide viable alternatives for targeted transcript degradation in flies through manipulation of the nonsense mediated mRNA decay (NMD) pathway or through inhibition of proper transcript splicing. However, there remain advantages to the catalytic activity of CasRx and other Cas13 ribonucleases, including the promiscuous RNase activity these enzymes exhibit.
Due to the programmable nature of CRISPR systems, numerous arthropods can theoretically be transgenically engineered and studied applying CasRx. This report provides a preliminary characterization of CasRx function in arthropods and opens up numerous avenues to explore transcript targeting, virus targeting, and technological development of RNA binding applications. One potential application could involve controlling the spread of vector-borne illnesses in arthropods, such as mosquitoes. Recently, in cell culture experiments, a Cas13 ribonuclease was used to directly target a variety of ssRNA viruses known to infect humans. Aedes mosquitoes are primary vectors for ssRNA viruses such as dengue virus, with an estimated 390 million people infected annually. ssRNA viruses transmitted through Aedes mosquitoes rapidly evolve in both vectors and humans, which presents a significant challenge for generating efficient vaccines or biological methodologies for reducing transmission. The CasRx RNA targeting system in arthropods provides a platform to reduce the spread of ssRNA arboviruses by directly targeting ssRNA virus genomes in a programmable manner. In this case, collateral cleavage and tissue-specific cell death may serve as a significant advantage for ssRNA virus targeting in arbovirus vectors.
To select RNA sites for CasRx targeting, target genes were analyzed to identify 30-nucleotide regions that had no poly-U stretches greater than 4 bp, had GC base content between 30% and 70%, and were not predicted to form very strong hairpin structures. Care was also taken to select target sites in RNA regions that were predicted to be accessible, such as regions without predicted RNA secondary or tertiary structure (
Four CasRx- and dCasRx-expressing constructs were assembled under the control of one of two promoters: Ubiquitin-63E (Ubiq) or UASt (Ubiq-CasRx, Ubiq-dCasRx, UASt-CasRx, UASt-dCasRx) using the Gibson enzymatic assembly method. A base vector (Addgene plasmid #112686) containing piggyBac and an attB-docking site, Ubiq promoter fragment, SpCas9-T2A-GFP, and the Opie2-dsRed transformation marker was used as a template to build all four constructs. To assemble constructs OA-1050E (Addgene plasmid #132416, Ubiq-CasRx) and OA-1050R (Addgene plasmid #132417, Ubiq-dCasRx), the SpCas9-T2A-GFP fragment was removed from the base vector by cutting with restriction enzymes SwaI and PacI, and then replaced with CasRx and dCasRx fragments amplified with primers 1050E.C3 and 1050E.C4 (Table 15) from constructs pNLS-RfxCas13d-NLS-HA (pCasRx) and pNLS-dRfxCas13d-NLS-HA (pdCasRx), respectively. To assemble constructs OA-1050L (Addgene plasmid #132418, UASt-CasRx) and OA-1050S (Addgene plasmid #132419, UASt-dCasRx), the base vector described above was digested with restriction enzymes NotI and PacI to remove the Ubiq promoter and SpCas9-T2A-GFP fragments. And then UASt promoter fragment and CasRx or dCasRx fragments, respectively, were cloned in. The UASt promoter fragment was amplified from plasmid pJFRC81, with primers 1041.C9 and 1041.C11 (Table 15). The CasRx and dCasRx fragments were amplified with primers 1050L.C1 and 1050E.C4 (Table 15) from constructs pCasRx and pdCasRx, respectively.
Seven four-gRNA-array constructs were designed, OA-1050G (Addgene plasmid #132420), OA-1050I (Addgene plasmid #132421), OA-1050J (Addgene plasmid #133304), OA-1050K (Addgene plasmid #132422), OA-1050U (Addgene plasmid #132423), OA-1050V (Addgene plasmid #132424), OA-1050Z4 (Addgene plasmid #132425), targeting transcripts of white, Notch, GFP, firefly luciferase, cinnabar, wingless, and yellow, respectively. To generate a base plasmid, OA-1043, which was used to build all the final seven four-gRNA-array constructs, Addgene plasmid #112688 containing the miniwhite gene as a marker, an attB-docking site, a D. melanogaster polymerase-3 U6 (U6:3) promoter fragment, and a guide RNA fragment with a target, scaffold, and terminator sequence (gRNA) was digested with restriction enzymes AscI and XbaI to remove the U6:3 promoter and gRNA fragments. Then the U6:3 promoter fragment amplified from the same Addgene plasmid #112688 with primers 1043.C1 and 1043.C23 (Table S16), was cloned back using Gibson enzymatic assembly method. To generate constructs OA-1050G, OA-1050I, OA-1050K, OA-1050U, OA-1050V, OA-1050Z4, plasmid OA-1043 was digested with restriction enzymes PstI and NotI, a fragment containing arrays of four tandem variable gRNAs (comprised of a 36-nt direct repeat (DR) and a 30-nt spacer) corresponding to different target genes respectively, followed by an extra DR and a 7 thymines terminator was synthesized and subcloned into the digested backbone using Gene Synthesis (GenScript USA Inc., Piscataway, N.J.). To generate constructs OA-1050J, a fragment containing arrays of four tandem variable gRNAs targeting GFP with an extra DR and a 7 thymines terminator, followed by the OpIE2-GFP marker was synthesized and subcloned into the above digested OA-1043 backbone using Gene Synthesis (GenScript USA Inc., Piscataway, N.J.).
To assemble construct OA-1052B (Addgene plasmid #132426), the dual-luciferase expression vector consisted of firefly luciferase linked with T2A-EGFP (Fluc-T2A-EGFP) and renilla luciferase both driven by Ubiq promoter fragment (Ubiq-Fluc-T2A-eGFP-Ubiq-Rluc), Addgene plasmid #112688 containing the white gene as a marker, an attB-docking site as described previously was digested with enzymes AscI and XbaI, and the following components were cloned in using the Gibson enzymatic assembly method: i) a D. melanogaster Ubiq promoter fragment amplified from Addgene plasmid #112686 with primers 1052B.C1 and 1052B.C2; ii) a custom gBlocks® Gene Fragment (Integrated DNA Technologies, Coralville, Iowa) of a firefly luciferase coding sequence; iii) a T2A-eGFP fragment amplified from Addgene plasmid #112686 with primers 908.A1 and 908.A2; iv) a custom gBlocks® Gene Fragment containing a p10 3′UTR fragment, reversed renilla luciferase followed by an SV40 3′UTR fragment; v) another Ubiq promoter fragment as reversed sequence amplified from Addgene plasmid #112686 with primers 908.A3 and 908.A4 (Table 15). All plasmids and sequence maps were made available for download and/or order at Addgene (www.addgene.com) with identification numbers listed in
Flies were maintained under standard conditions at 26° C. Embryo injections were performed at Rainbow Transgenic Flies, Inc. (http://www.rainbowgene.com). All CasRx and dCasRx expressing lines were generated by site-specifically integrating constructs at available ϕC31 integration sites on the 2nd chromosome (site 8621 (UAS/-(d)CasRx) and attp40w (Ubiq-(d)CasRx)). Homozygous lines were established for UASt-CasRx and UASt-dCasRx and heterozygous balanced lines were established for Ubiq-CasRx and Ubiq-dCasRx (over Curly of Oster: CyO). All gRNAarray expressing lines were generated by site-specifically integrating constructs at an available ϕC31 integration site on the 3rd chromosome (site 8622). Homozygous lines were established for all gRNAarray expressing flies. Dual-luciferase reporter expressing lines were generated by site-specifically integrating the constructs at an available ϕC31 integration site on the 3rd chromosome (site 9744). Homozygous lines were established for the dual-luciferase reporter expressing flies.
To genetically assess efficiency of CasRx ribonuclease activity, at 26° C., Ubiq-CasRx and Ubiq-dCasRx expressing lines were bidirectionally crossed to gRNAarray expressing lines and let lay for 4 days before removing parents. F1 transheterozygotes were scored for inheritance and penetrance of observable phenotypes up to 17 days post initial laying (13-17 days). Embryo, larvae, and pupae counts preceded by crossing male Ubiq-CasRx and Ubiq-dCasRx expressing flies to female gRNAarray expressing flies. Flies were incubated at 26° C. for 48 h with yeast to induce embryo laying. Flies were then transferred to embryo collection chambers containing yeast-smeared grape-juice plates and incubated at 26° C. overnight (16 h). The grape-juice plates were then removed, embryos counted, and the grape-juice plates incubated for 24 h at 26° C. Total larvae and transheterozygote larvae were then counted and the grape-juice plates transferred to jars and incubated at 26° C. Once all larvae reached the pupal stage, total and transhet pupae were counted. Finally, total adult flies and total adult transheterozygotes were counted 20 days post initial lay. Each genetic cross was set using 5♂ and 10♀ (paternal CasRx) or 4♂ and 8♀ (maternal CasRx) flies in triplicate.
To investigate the tissue-specific activity of CasRx, a 2-step crossing scheme was designed to generate F2 triple transheterozygotes (
Total RNA was extracted from F1 transheterozygous flies at different developmental stages based on the reported highest expression level available through modENCODE analysis (
For all samples, total RNA was extracted using Qiagen RNeasy Mini Kit (Qiagen 74104). Following extraction, total RNA was treated with Invitrogen Turbo™ DNase (Invitrogen AM2238). RNA concentration was analyzed using Nanodrop OneC UV-vis spectrophotometer (ThermoFisher ND-ONEC-W). RNA integrity was assessed using RNA 6000 Pico Kit for Bioanalyzer (Agilent Technologies #5067-1513). RNA-seq libraries were constructed using NEBNext Ultra II RNA Library Prep Kit for Illumina (NEB #E7770) following the manufacturer's instructions previously. three replicates for all CasRx and dCasRx samples were sequenced and analyzed with the exception of gRNAcn where 2 replicates were analyzed. In total 34 samples, 17 CasRx experimental samples and 17 dCasRx control samples, were sequenced and analyzed.
To further understand CasRx-induced differential gene expression profiles, the raw transcript counts were normalized by transcripts per million (TPM) and maximum a posteriori (MAP) method was used with the original shrinkage estimator in DESeq2 pipeline to estimate transcript logarithmic fold change (LFC) (47). Wald test with Benjamini-Hochberg correction was used for statistical inference. The detailed analysis results are presented in Tables 7-12. Per DESeq2 analysis requirement, some values are shown as NA due to the following reasons: 1) if all samples for a given transcripts have 0 transcript counts, this transcript's baseMean will be 0 and its LFC, p value, and padj will be set to NA; 2) If one replicate of a transcript is an outlier with extreme count (detected by Cook's distance), this transcript's p value and padj will be set to NA. 3) If a transcript is found to have a low mean normalized count after automatic independent filtering, this transcript's padj will be set to NA.
To measure the efficacy of targeted CasRx knockdown a dual Luciferase reporter system comprised of both Firefly and Renilla Luciferase was utilized. A 2-step genetic crossing scheme was performed (
Embodiment 1: A method of modifying a target locus of interest in vivo in an organism, comprising delivering to said locus a Type VI CRISPR-Cas loci effector protein and one or more nucleic acid components, wherein at least the one or more nucleic acid components is engineered and the effector protein forms a complex with the one or more nucleic acid components and upon binding of said complex to the target locus of interest the effector protein induces a modification of the target locus of interest.
Embodiment 2: The method of Embodiment 1, wherein the target locus of interest comprises RNA.
Embodiment 3: The method of Embodiment 2, wherein the target locus of interest comprises endogenous mRNA.
Embodiment 4: The method of any one of Embodiments 1-3, wherein the modification of the target locus of interest comprises a strand break.
Embodiment 5: The method of any one of Embodiments 1-4, wherein the effector protein and one or more nucleic acid components are non-naturally occurring.
Embodiment 6: The method of any one of Embodiments 1-5, wherein the effector protein is encoded by a subtype VI-D CRISPR-Cas loci.
Embodiment 7: The method of Embodiment 6, wherein the effector protein comprises Cas13d.
Embodiment 8: The method of Embodiment 7, wherein the Cas13d is derived from Ruminococcus flavefaciens.
Embodiment 9: The method of any one of Embodiments 1-8, wherein the effector protein is fused to one or more localization signal.
Embodiment 10: The method of Embodiment 9, wherein the one or more localization signal is nuclear localization signal.
Embodiment 11: The method of any one of the preceding Embodiments, wherein when in complex with the effector protein the nucleic acid component(s) is capable of effecting or effects sequence specific binding of the complex to a target sequence of the target locus of interest.
Embodiment 12: The method of any one of the preceding Embodiments, wherein the nucleic acid component(s) comprise one or more CRISPR RNA (crRNA) arrays and/or one or more trans-activating crRNA (tracrRNA).
Embodiment 13: The method of any one of the preceding Embodiments, wherein the nucleic acid component(s) comprise one or more CRISPR RNA (crRNA) arrays and do not comprise any trans-activating crRNA (tracrRNA).
Embodiment 14: The method of Embodiments 12 or 13, wherein the one or more CRISPR RNA (crRNA) arrays are pre-crRNA arrays.
Embodiment 15: The method of any one of the preceding Embodiments, wherein the effector protein and nucleic acid component(s) are provided via one or more polynucleotide molecules encoding the effector protein and/or the nucleic acid component(s), and wherein the one or more polynucleotide molecules are operably configured to express the effector protein and/or the nucleic acid component(s).
Embodiment 16: The method of Embodiment 15, wherein the one or more polynucleotide molecules comprise one or more regulatory elements operably configured to express the effector protein and/or the nucleic acid component(s).
Embodiment 17: The method of Embodiment 16, wherein the one or more regulatory elements are ubiquitous promoters or inducible promotors.
Embodiment 18: The method of Embodiment 17, wherein the one or more regulatory elements comprise one or more inducible UAS promoters.
Embodiment 19: The method of any one of Embodiments 15-18, wherein the one or more polynucleotide molecules are comprised within one or more vectors.
Embodiment 20: The method of any one of Embodiments 15-18, wherein the one or more polynucleotide molecules are comprised in a delivery system, or the method of claim 19 wherein the one or more vectors are comprised in a delivery system.
Embodiment 21: The method of any one of the preceding Embodiments, wherein the effector protein and one or more nucleic acid component(s) are delivered via one or more delivery vehicles comprising liposome(s), particle(s), exosome(s), microvesicle(s), a gene-gun or one or more viral vectors.
Embodiment 22: The method of any one of the preceding Embodiments, wherein the organism is a vertebrate.
Embodiment 23: The method of any one of the preceding Embodiments, wherein the organism is an invertebrate.
Embodiment 24: The method of Embodiment 23, wherein the organism is an insect.
Embodiment 25: An organism comprising a modified target locus of interest, wherein the target locus of interest has been modified according to a method of any one of the preceding Embodiments.
Embodiment 26: The organism of Embodiment 26, wherein the organism is a vertebrate.
Embodiment 27: The organism of Embodiment 26, wherein the organism is an invertebrate.
Embodiment 28: The organism of Embodiment 27, wherein the organism is an insect.
It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
This application claims priority to U.S. Patent Application Ser. No. 62/798,078, filed Jan. 29, 2019, which is incorporated herein by reference in its entirety.
This invention was made with government support under HR0011-17-2-0047 awarded by the Defense Advanced Research Project Agency. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/015680 | 1/29/2020 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62798078 | Jan 2019 | US |