This application contains references to amino acid sequences and/or nucleic acid sequences which have been submitted concurrently herewith as the sequence listing file entitled “000211usnp_SequenceListing.XML”, file size 1,421,263 bytes, created on Sep. 10, 2024. The aforementioned sequence listing is hereby incorporated by reference in its entirety pursuant to 37 C.F.R. § 1.52 (e) (5).
Gene editing technologies have been blooming and are powerful tools to manipulate genetic materials in target cells, tissues, or organisms. Clustered regularly interspaced short palindromic repeats (CRISPR)-related technologies are among the most promising gene editing tools available. However, a low rate of desired homology directed repair precise (HDR)-mediated editing, a less satisfactory off-target rate, and a high translocation rate and its resulting mutagenesis have become the main obstacles in further advancing the technologies and its wider usage.
In one aspect, to address the need for a more effective and specific gene editing outcome, provided herein is a system for altering a target sequence, comprising a modified single-guide RNA (sgRNA) and a donor DNA, wherein the modified sgRNA comprises a CRISPR RNA (crRNA) and a trans-active RNA (tracrRNA), wherein the modified sgRNA comprises one or more internal anchors that are at least 5 nucleotides away from both 3′ and 5′ ends of the modified sgRNA, wherein the donor DNA comprises a first portion and a second portion, wherein the first portion comprises one or more binding segments capable of binding to an internal anchor of the one or more internal anchors via a non-covalent bond and the second portion comprises a sequence of interest (SOI).
In some embodiments, the non-covalent bond is a Watson-Crick interaction.
In some embodiments, the modified sgRNA comprises a nexus, a first hairpin, and a single-stranded region between the tracrRNA and the crRNA. In some embodiments, the modified sgRNA further comprises a bulge region. In some embodiments, the modified sgRNA further comprises a second hairpin.
In some embodiments, the internal anchor of the one or more internal anchors is located in a single-stranded region of the modified sgRNA. In some specific embodiments, the internal anchor of the one or more internal anchors is located in the single-stranded region between the tracrRNA and the crRNA. In other specific embodiments, the internal anchor of the one or more internal anchors is located in a single-stranded region within the first hairpin. In other specific embodiments, the internal anchor of the one or more internal anchors is located in a single-stranded region between the nexus and the first hairpin. In other specific embodiments, the modified sgRNA further comprises a second hairpin, and wherein the single-stranded region is within the second hairpin.
In some embodiments, each of the one or more internal anchors or each of the one or more binding segments is 3-nucleotide to 100-nucleotide long. In other embodiments, each of the one or more internal anchors or each of the one or more binding segments is 3-nucleotide to 20-nucleotide long. In yet other embodiments, each of the one or more internal anchors or each of the one or more binding segments is about 5-nucleotide long.
In some embodiments, each of the one or more internal anchors comprises a sequence from SEQ ID NOs. 1 to 472 from Table 1. In other embodiments, each of the one or more internal anchors comprises a sequence from SEQ ID NOs. 473 to 3056 from Table 2. In other embodiments, each of the one or more binding segments comprises a sequence from SEQ ID NO. 3057 to 3528 from Table 3. In other embodiments, each of the one or more binding segments comprises a sequence from SEQ ID NO. 3529 to 6112 from Table 4.
In some embodiments, the one or more binding segments are linked by a linker. In some specific embodiments, the linker is about 1 to 30-nucleotide long. In other specific embodiments, the linker is about 10 to 25-nucleotide long. In other embodiments, the linker is a sequence of poly-deoxyadenosines.
In some embodiments, the SOI comprises the target sequence with one or more nucleotide substitution, one or more nucleotide insertion, one or more nucleotide deletion, or any combination thereof. In some embodiments, the one or more nucleotide insertion comprises 1 to 100 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 101 to 1,000 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 1,001 to 10,000 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 1,001 to 10,000 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 10,001 to 100,000 nucleotides. In other embodiments, the one or more nucleotide insertion comprises 2 to 10 random nucleotides. In other embodiments, the one or more nucleotide deletion comprises 1 to 50 nucleotides.
In some embodiments, the second portion of the donor DNA further comprises an upstream and/or a downstream homology arm. In specific embodiments, the upstream homology arm is 5 to 1000-nucleotide long. In specific embodiments, the downstream homology arm is about 10 to 1000-nucleotide long. In other embodiments, the upstream homology arm is 100 to 1,000-nucleotide long. In other embodiments, the downstream homology arm is about 41 to 1,000-nucleotide long.
In some embodiments, the first portion of the donor DNA is at 5′ of the second portion of the donor DNA. In other embodiments, the first portion of the donor DNA is at 3′ of the second portion of the donor DNA.
In some embodiments, the donor DNA is single-stranded. In other embodiments, the first portion of the donor DNA is single-stranded and the second portion of the donor DNA is fully or partially double-stranded.
In some embodiments, the donor DNA is close ended on 3′ and/or 5′ end.
In some embodiments, the system further comprises a CRISPR nuclease. In specific embodiments, the CRISPR nuclease is a DNA nuclease. In specific embodiments, the DNA nuclease is a Cas9, a Cas12, a Cas14, or a CasΦ.
In another aspect, provided herein is a system comprising a donor DNA and two modified single-guide RNAs (sgRNAs) for cutting at a first locus on a first chromosome and a second locus on a second chromosome, wherein each of the modified sgRNAs comprises a CRISPR RNA (crRNA) and a trans-active RNA (tracrRNA), wherein each of the modified sgRNAs comprises one or more internal anchors that are at least 5 nucleotides away from both 3′ and 5′ ends of each of the modified sgRNAs, wherein the donor DNA comprises a first portion and a second portion, wherein the first portion comprises one or more binding segments capable of binding to an internal anchor of the one or more internal anchors via a non-covalent bond and the second portion comprises a sequence of interest (SOI), wherein the donor DNA comprises an upstream homology arm and/or a downstream homology arm.
In some embodiments, the first chromosome and the second chromosome are the same. In some specific embodiments, the first locus is at 5′ of the second locus. In other embodiments, the first chromosome and the second chromosome are different. In some embodiments the two modified sgRNAs target different locus on the same gene. In some embodiments, the two modified sgRNAs target different locus on the same strand. In some embodiments, the two modified sgRNAs target different locus on the different strands. In some embodiments, the system comprising a second donor DNA. In some embodiments, the donor DNA and the second donor DNA each comprises an upstream homology arm and/or a downstream homology arm. In some embodiments, the homology arm(s) of the donor DNA and the homology arm(s) of the second donor DNA are complementary to sequences on the same strand. In some embodiments, the homology arm(s) of the donor DNA and the homology arm(s) of the second donor DNA are complementary to sequences on the different strands.
In some embodiments, the first locus and the second locus are at least 50, 100, 1,000, 10,000, or 100,000 nucleotides apart.
In some embodiments, the upstream homology arm flanks 5′ end of the first locus. In other embodiments, the downstream homology arm flanks 3′ end of the second locus.
In some embodiments, the non-covalent bond is a Watson-Crick interaction.
In some embodiments, the modified sgRNA comprises a nexus, a first hairpin, and a single-stranded region between the tracrRNA and the crRNA. In some embodiments, the modified sgRNA further comprises a bulge region. In some embodiments, the modified sgRNA further comprises a second hairpin.
In some embodiments, the internal anchor of the one or more internal anchors is located in a single-stranded region of the modified sgRNA. In some specific embodiments, the internal anchor of the one or more internal anchors is located in the single-stranded region between the tracrRNA and the crRNA. In other specific embodiments, the internal anchor of the one or more internal anchors is located in a single-stranded region within the first hairpin. In other specific embodiments, the internal anchor of the one or more internal anchors is located in a single-stranded region between the nexus and the first hairpin. In other specific embodiments, the modified sgRNA further comprises a second hairpin, and wherein the single-stranded region is within the second hairpin.
In some embodiments, each of the one or more internal anchors or each of the one or more binding segments is 3-nucleotide to 100-nucleotide long. In other embodiments, each of the one or more internal anchors or each of the one or more binding segments is 3-nucleotide to 20-nucleotide long. In yet other embodiments, each of the one or more internal anchors or each of the one or more binding segments is about 5-nucleotide long.
In some embodiments, each of the one or more internal anchors comprises a sequence from SEQ ID NOs. 1 to 472 from Table 1. In other embodiments, each of the one or more internal anchors comprises a sequence from SEQ ID NOs. 473 to 3056 from Table 2. In other embodiments, each of the one or more binding segments comprises a sequence from SEQ ID NO. 3057 to 3528 from Table 3. In other embodiments, each of the one or more binding segments comprises a sequence from SEQ ID NO. 3529 to 6112 from Table 4.
In some embodiments, the one or more binding segments are linked by a linker. In some specific embodiments, the linker is about 1 to 30-nucleotide long. In other specific embodiments, the linker is about 10 to 25-nucleotide long. In other embodiments, the linker is a sequence of poly-deoxyadenosines.
In some embodiments, the SOI comprises a region between the first locus and the second locus with one or more nucleotide substitution, one or more nucleotide insertion, one or more nucleotide deletion, or any combination thereof. In some embodiments, the one or more nucleotide insertion comprises 1 to 100 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 101 to 1,000 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 1,001 to 10,000 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 1,001 to 10,000 nucleotides. In some embodiments, the one or more nucleotide insertion comprises 10,001 to 100,000 nucleotides. In some embodiments, the one or more nucleotide deletion comprises 1 to 100 nucleotides. In some embodiments, the one or more nucleotide deletion comprises 101 to 1,000 nucleotides. In some embodiments, the one or more nucleotide deletion comprises 1,001 to 10,000 nucleotides. In some embodiments, the one or more nucleotide deletion comprises 1,001 to 10,000 nucleotides. In some embodiments, the one or more nucleotide deletion comprises 10,001 to 100,000 nucleotides.
In specific embodiments, the upstream homology arm is 5 to 1000-nucleotide long. In specific embodiments, the downstream homology arm is about 10 to 1000-nucleotide long. In other embodiments, the upstream homology arm is 100 to 1,000-nucleotide long. In other embodiments, the downstream homology arm is about 41 to 1,000-nucleotide long.
In some embodiments, the first portion of the donor DNA is at 5′ of the second portion of the donor DNA. In other embodiments, the first portion of the donor DNA is at 3′ of the second portion of the donor DNA.
In some embodiments, the donor DNA is single-stranded. In other embodiments, the first portion of the donor DNA is single-stranded and the second portion of the donor DNA is fully or partially double-stranded.
In some embodiments, the donor DNA is close ended on 3′ and/or 5′ end.
In some embodiments, the system further comprises a CRISPR nuclease. In specific embodiments, the CRISPR nuclease is a DNA nuclease. In specific embodiments, the DNA nuclease is a Cas9, a Cas12, a Cas14, or a CasΦ.
In another aspect, provided herein is a method of modifying a cell, wherein the method comprises transporting a system as described herein.
In some embodiments, the transporting comprises: (a) incubating the CRISPR nuclease and the modified sgRNA to form a ribonucleoprotein (RNP) complex; (b) applying the donor DNA to the RNP complex; and (c) delivering the RNP complex-donor DNA from (b) to the cell. In some specific embodiments, in step (a) the ratio of the CRISPR nuclease and the modified sgRNA is about 1:0.5 to about 1:10. In other specific embodiments, in step (a) the ratio of the CRISPR nuclease and the modified sgRNA is about 1:1 to 1:2.
In other embodiments, the transporting comprises: (a) providing one or more vectors comprising a nucleotide sequence encoding the CRISPR nuclease and a nucleotide sequence encoding the modified gRNA; (b) delivering the one or more vectors of (a) to the cell; and (c) delivering the donor DNA to the cell. In specific embodiments, step (c) is performed about 6 to 48 hours after step (b).
In some embodiments, the delivering is achieved by viral vectors, liposomes, lipid nanoparticles, or electroporation.
In some embodiments, the cell is an immune cell. In specific embodiments, the immune cell is a T cell, a B cell, an NK cell, or a hematopoietic stem cell.
In some embodiments, the method is performed ex vivo or in vivo the method is performed ex vivo or in vivo.
In some embodiments, a percentage of desired editing is at least 10%, at least 50%, at least 100%, or at least 200% higher than a comparable system without the donor DNA comprising a first portion that binds to the modified sgRNA and/or without the modified sgRNA with the one or more internal anchors. In other embodiments, the method has an off-target rate at least 10%, at least 50%, or at least 100% lower than a comparable system without the donor DNA comprising the first portion that binds to the modified sgRNA and/or without the modified sgRNA with the one or more internal anchors. In other embodiments, the method has a translocation, large insertion, or large deletion rate at least 10%, at least 50%, or at least 100% lower than a comparable system without the donor DNA comprising the first portion that binds to the modified sgRNA and/or without the modified sgRNA with the one or more internal anchors.
In another aspect, provided herein is a method of treating a genetic disorder, wherein the method comprises administering to a subject with an effective amount of the system as described herein. In some embodiments, the SOI comprises a sequence that reverses or alleviate the genetic disorder.
Additional aspects and advantages of the present disclosure will become readily apparent to those skilled in this art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be realized, the present disclosure is capable of other and different embodiments, and its several details are capable of modifications in various obvious respects, all without departing from the disclosure. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
The novel features of the disclosure are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present disclosure will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the disclosure are utilized, and the accompanying drawings (also “Figure” and “FIG.” herein), of which:
The present disclosure is based, in part, on the surprising finding that, when designing a donor DNA to be coupled to an sgRNA, the location of the effective coupling is not random. As shown in figures and examples below, the coupling between the donor DNA and the sgRNA is located at least 5 nucleotides away from both 3′ and 5′ ends of the modified sgRNA.
Provided herein are compositions of the systems as described herein. Various embodiments regarding non-covalent bonds, locations of the internal anchors on the modified sgRNAs, length and sequences of the internal anchors of the modified sgRNAs/binding segments of the donor DNAs, linkers between one or more binding segments on donor DNAs, SOI, other features of the donor DNAs, and Cas proteins are disclosed respectively. Furthermore, provided herein are methods of using the systems described herein.
While various embodiments of the disclosure have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions may occur to those skilled in the art without departing from the disclosure. It should be understood that various alternatives to the embodiments of the disclosure described herein may be employed.
The term “CRISPR/Cas,” as used herein, can refer to a ribonucleoprotein complex, e.g., a two part component ribonucleoprotein complex, with single guide RNA (sgRNA) and a CRISPR-associated (Cas) endonuclease. In some cases, CRISPR/Cas comprises more than two components. The term “CRISPR” can refer to the Clustered Regularly Interspaced Short Palindromic Repeats and the related system thereof. CRISPR can be used as an adaptive defense system that enables bacteria and archaea to detect and silence foreign nucleic acids (e.g., from viruses or plasmids). CRISPR can be adapted for use in a variety of cell types to allow for polynucleotide editing in a sequence-specific manner. In some cases, one or more elements of a CRISPR system can be derived from a type I, type II, type III, type IV, type V, or type VI CRISPR system. In the CRISPR type II system, the guide RNA can interact with Cas and direct the nuclease activity of the Cas enzyme to a target region. The target region can comprise a “protospacer” and a “protospacer adjacent motif” (PAM), and both domains can be used for a Cas enzyme mediated activity (e.g., cleavage). The protospacer can be referred to as a target site (or a genomic target site). The sgRNA can pair with (or hybridize) the opposite strand of the protospacer (binding site) to direct the Cas enzyme to the target region. The PAM site can refer to a short sequence recognized by the Cas enzyme and, in some cases, required for the Cas enzyme activity. The sequence and number of nucleotides for the PAM site can differ depending on the type of the Cas enzyme.
The term “Cas,” as used herein, generally refers to a wild type Cas protein, a fragment thereof, or a mutant or variant thereof. The term “Cas,” “enzyme Cas,” “enzyme CRISPR,” “protein CRISPR,” or “protein Cas” can be used interchangeably throughout the present disclosure.
A Cas protein can comprise a protein of or derived from a CRISPR/Cas a type I, type II, type III, or type IV, which has an RNA-guided polynucleotide-binding or nuclease activity. Examples of suitable Cas proteins include CasX, Cas3, Cas4, Cas5, Cas5e (or CasD), Cas6, Cas6e, Cas6f, Cas7, Cas8a1, Cas8a2, Cas8b, Cas8c, Cas9 (also known as Csn1 and Csx12), Cas10, Cas10d, CasF, CasG, CasH, Csy1, Csy2, Csy3, Cse1 (or CasA), Cse2 (or CasB), Cse3 (or CasE), Cse4 (or CasC), Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Cszl, Csx15, Csf1, Csf2, Csf3, Csf4, Cu1966, homologues thereof, and modified versions thereof. In some cases, a Cas protein can comprise a protein of or derived from a CRISPR/Cas type V or type VI system, and modified versions thereof. In some cases, a Cas protein can be a catalytically dead or inactive Cas (dCas). In some cases, a Cas protein can have reduced or minimal nuclease activity (i.e., deactivated Cas, or dCas). In some cases, a Cas protein can be operatively coupled to one or more additional proteins, such as a nucleic acid polymerase. In an example, a Cas protein can be a dCas that is fused to a reverse transcriptase.
The term “single guide RNA” or “sgRNA,” as used herein, can refer to an RNA molecule (or a group of RNA molecules collectively) that can bind to a Cas protein and aid in targeting the Cas protein to a specific location within a target polynucleotide (e.g., a DNA). A single guide RNA comprises a CRISPR RNA (crRNA) segment and a trans-activating crRNA (tracrRNA) segment. The term “crRNA” or “crRNA segment,” as used herein, refers to an RNA molecule or portion thereof that includes a polynucleotide-targeting guide sequence, a stem sequence, and, optionally, a 5′-overhang sequence. The term “tracrRNA” or “tracrRNA segment,” refers to an RNA molecule or portion thereof that includes a protein-binding segment (e.g., the protein-binding segment can be capable of interacting with a CRISPR-associated protein, such as a Cas9).
The term “polynucleotide” or “nucleic acid,” as used interchangeably herein, can refer to a polymeric form of nucleotides (e.g., ribonucleotides or deoxyribonucleotides) of any length. Thus, this term includes single-, double-, or multi-stranded DNA or RNA, genomic DNA, complementary DNA (cDNA), guide RNA (gRNA), messenger RNA (mRNA), DNA-RNA hybrids, or a polymer comprising purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural, or derivatized nucleotide bases. The term “oligonucleotide,” as used herein, can refer to a polynucleotide of between about 5 and about 100 nucleotides of single- or double-stranded DNA or RNA. The length of a nucleic acid can be referred to in reference of the number of bases in the nucleic acid sequence. For example, a sequence of 100 nucleotides can be referred to as being 100 bases in length. However, for the purposes of this disclosure, there can be no upper limit to the length of an oligonucleotide. In some cases, oligonucleotides can be known as “oligomers” or “oligos” and can be isolated from genes, or chemically synthesized by methods known in the art. The terms “polynucleotide” and “nucleic acid” can include single-stranded (such as sense or antisense) and double-stranded polynucleotides. Examples of nucleotides for DNA can include cytosine (C), guanine (G), adenine (A), thymine (T), or modifications thereof. Examples of nucleotides for RNA can include C, G, A, uracil (U), or modifications thereof.
A “subject” disclosed herein includes any living organism. Thus, in some embodiments, subjects are mammals, avians, reptiles, amphibians, fish, plants, fungi, or bacteria. Mammalian subjects include but are not limited to humans, non-human primates (e.g., gorilla, monkey, baboon, and chimpanzee, etc.), dogs, cats, goats, horses, pigs, cattle, sheep, and the like, and laboratory animals (e.g., rats, guinea pigs, mice, gerbils, hamsters, and the like). Avian subjects include but are not limited to chickens, ducks, turkeys, geese, quail, pheasants, and birds kept as pets. In some embodiments, suitable subjects include both males and females and subjects of any age, including embryonic (e.g., in-utero or in-ovo), infant, juvenile, adolescent, adult and geriatric subjects. In some embodiments, a subject is a human.
“Treating” or “treatment” can refer to both therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down (lessen) a targeted pathologic condition or disorder. Those in need of treatment can include those already with the disorder, as well as those prone to have the disorder, or those in whom the disorder is to be prevented. For example, a subject can be successfully “treated” for a disease caused by a gain-of-function mutation, if, after receiving a therapeutic amount of a composition according to the methods of the present disclosure, the subject shows observable and/or measurable reduction in or absence of one or more of the following: relief to some extent of one or more of the symptoms associated with the specific disease; reduced morbidity and/or mortality, and improvement in quality of life issues.
Certain ranges are presented herein with numerical values being preceded by the term “about.” The term “about” can be used herein to provide literal support for the exact number that it precedes, as well as a number that is near to or approximately the number that the term precedes. In determining whether a number is near to or approximately a specifically recited number, the near or approximating un-recited number can be a number which, in the context in which it is presented, provides the substantial equivalent of the specifically recited number. Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range, can be encompassed within the methods and compositions described herein are. The upper and lower limits of these smaller ranges can independently be included in the smaller ranges and are also encompassed within the methods and compositions described herein, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits can also be included in the methods and compositions described herein.
Whenever the term “at least,” “greater than,” or “greater than or equal to” precedes the first numerical value in a series of two or more numerical values, the term “at least,” “greater than” or “greater than or equal to” applies to each of the numerical values in that series of numerical values. For example, greater than or equal to 1, 2, or 3 is equivalent to greater than or equal to 1, greater than or equal to 2, or greater than or equal to 3.
Whenever the term “no more than,” “less than,” or “less than or equal to” precedes the first numerical value in a series of two or more numerical values, the term “no more than,” “less than,” or “less than or equal to” applies to each of the numerical values in that series of numerical values. For example, less than or equal to 3, 2, or 1 is equivalent to less than or equal to 3, less than or equal to 2, or less than or equal to 1.
The term “desired editing efficiency” or “percentage of desired editing” used herein is referred to an expected editing outcome with the designed sequence and designed location, based on the design of SOI in the donor DNAs described herein. In some embodiments where the SOI comprises the target sequence with one or more nucleotide substitution, one or more nucleotide insertion, one or more nucleotide deletion, or any combination thereof, the desired editing efficiency or the percentage of the desired editing is defined as the proportion of the corresponding gene editing products comprising the target sequence with one or more nucleotide substitution, one or more nucleotide insertion, one or more nucleotide deletion, or any combination thereof, and at the expected loci based on the design of the upstream and/or downstream homology arms if the second portion of the donor DNA has any homology arms.
The term “upstream homology arm” used herein is referred to a segment of target sequence or a region that is between the first and second loci that is at the 5′ side of the cutting site. The term “downstream homology arm” used herein is referred to a segment of target sequence or a region that is between the first and second loci that is at the 3′ side of the cutting site. The term “distal/proximal homology arm” used herein is referred to a segment of target sequence or a region that is between the first and second loci that is distal or proximal relative to protospacer adjacent motif (PAM) site.
In one aspect, provided herein is a system for altering a target sequence, comprising a modified single-guide RNA (sgRNA) and a donor DNA, wherein the modified sgRNA comprises a CRISPR RNA (crRNA) and a trans-active RNA (tracrRNA), wherein the modified sgRNA comprises one or more internal anchors that are at least 5 nucleotides away from both 3′ and 5′ ends of the modified sgRNA, wherein the donor DNA comprises a first portion and a second portion, wherein the first portion comprises one or more binding segments capable of binding to an internal anchor of the one or more internal anchors via a non-covalent bond and the second portion comprises a sequence of interest (SOI). In another aspect, provided here in a system comprising a donor DNA and two modified single-guide RNAs (sgRNAs) for cutting at a first locus on a first chromosome and a second locus on a second chromosome, wherein each of the modified sgRNAs comprises a CRISPR RNA (crRNA) and a trans-active RNA (tracrRNA), wherein each of the modified sgRNAs comprises one or more internal anchors that are at least 5 nucleotides away from both 3′ and 5′ ends of each of the modified sgRNAs, wherein the donor DNA comprises a first portion and a second portion, wherein the first portion comprises one or more binding segments capable of binding to an internal anchor of the one or more internal anchors via a non-covalent bond and the second portion comprises a sequence of interest (SOI), wherein the donor DNA comprises an upstream homology arm and/or a downstream homology arm.
In some embodiments, the non-covalent bond described herein is a hydrogen bond. In specific embodiments, the non-covalent bond described herein is a Watson-Crick interaction. In other embodiments, the non-covalent bond described herein is an ionic interaction. In other embodiments, the non-covalent bond described herein is a Van der Waals interaction. In other embodiments, the non-covalent bond described herein is a hydrophobic bond.
(b) Locations of the Internal Anchors on the Modified sgRNAs
In some embodiments, the modified sgRNA comprises a nexus, a first hairpin, and a single-stranded region between the tracrRNA and the crRNA. Accordingly, in some specific embodiment, an internal anchor of the one or more internal anchors is located in a single-stranded region between the tracrRNA and the crRNA. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region between the nexus and the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the first hairpin, thus resulting an artificial bulge-like structure in the stem portion of the first hairpin. In other specific embodiments, because one or more sequences in the tracrRNA are reverse complementary to counterpart sequences in crRNA, stems are formed between the tracrRNA and the crRNA. Accordingly, an internal anchor of the one or more internal anchors is located in the stem between the tracrRNA and the crRNA, thus resulting an artificial bulge-like structure in the stem.
In some embodiments, the modified sgRNA comprises a nexus, a first hairpin, a second hairpin, an optionally one or more hairpins, and a single-stranded region between the tracrRNA and the crRNA. Accordingly, in some specific embodiment, an internal anchor of the one or more internal anchors is located in a single-stranded region between the tracrRNA and the crRNA. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the second hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the optionally one or more hairpins. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region between the nexus and the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region between the first hairpin and the second hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region between the second hairpin and the third hairpins, or the two neighboring hairpins. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the first hairpin, thus resulting an artificial bulge-like structure in the stem portion of the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the second hairpin, thus resulting an artificial bulge-like structure in the stem portion of the second hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the optionally one or more hairpins, thus resulting an artificial bulge-like structure in the stem portion of the optionally one or more hairpins. In other specific embodiments, because one or more sequences in the tracrRNA are reverse complementary to counterpart sequences in crRNA, stems are formed between the tracrRNA and the crRNA. Accordingly, an internal anchor of the one or more internal anchors is located in the stem between the tracrRNA and the crRNA, thus resulting an artificial bulge-like structure in the stem.
In some embodiments, the modified sgRNA comprises a nexus, a bulge region, a first hairpin, and a single-stranded region between the tracrRNA and the crRNA. Specifically, because one or more sequences in the tracrRNA are reverse complementary to counterpart sequences in crRNA, stems are formed between the tracrRNA and the crRNA. The stems are split by the bulge region into an upper stem and a lower stem. Accordingly, in some specific embodiment, an internal anchor of the one or more internal anchors is located in a single-stranded region between the tracrRNA and the crRNA, which corresponds to the loop on top of the upper stem. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region between the nexus and the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the first hairpin, thus resulting an artificial bulge-like structure in the stem portion of the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located within the upper stem, thus resulting an artificial bulge-like structure in the upper stem. In other specific embodiments, an internal anchor of the one or more internal anchors is located within the lower stem, thus resulting an artificial bulge-like structure in the lower stem.
In some embodiments, the modified sgRNA comprises a nexus, a bulge region, a first hairpin, a second hairpin, optionally one or more hairpins, and a single-stranded region between the tracrRNA and the crRNA. Specifically, because one or more sequences in the tracrRNA are reverse complementary to counterpart sequences in crRNA, stems are formed between the tracrRNA and the crRNA. The stems are split by the bulge region into an upper stem and a lower stem. Accordingly, in some specific embodiment, an internal anchor of the one or more internal anchors is located in a single-stranded region between the tracrRNA and the crRNA, which corresponds to the loop on top of the upper stem. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the second hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region within the optionally one or more hairpins. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region between the nexus and the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in a single-stranded region between the first hairpin and the second hairpin, or any neighboring hairpins. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the first hairpin, thus resulting an artificial bulge-like structure in the stem portion of the first hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the second hairpin, thus resulting an artificial bulge-like structure in the stem portion of the second hairpin. In other specific embodiments, an internal anchor of the one or more internal anchors is located in the stem portion of the optionally one or more hairpins, thus resulting an artificial bulge-like structure in the stem portion of the one or more hairpins. In other specific embodiments, an internal anchor of the one or more internal anchors is located within the upper stem, thus resulting an artificial bulge-like structure in the upper stem. In other specific embodiments, an internal anchor of the one or more internal anchors is located within the lower stem, thus resulting an artificial bulge-like structure in the lower stem.
(c) Length of the Internal Anchors of the Modified sgRNAs/Binding Segments of the Donor DNAs
In some embodiments, each of the one or more internal anchors is 3-nucleotide to 100-nucleotide long. In other embodiments, each of the one or more internal anchors is 3-nucleotide to 20-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 3-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 4-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 5-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 6-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 7-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 8-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 9-nucleotide long. In other embodiments, each of the one or more internal anchors is at least 10-nucleotide long. In other embodiments, each of the one or more internal anchors is 3-nucleotide long. In other embodiments, each of the one or more internal anchors is 4-nucleotide long. In other embodiments, each of the one or more internal anchors is 5-nucleotide long. In other embodiments, each of the one or more internal anchors is 6-nucleotide long. In other embodiments, each of the one or more internal anchors is 7-nucleotide long. In other embodiments, each of the one or more internal anchors is 8-nucleotide long. In other embodiments, each of the one or more internal anchors is 9-nucleotide long. In other embodiments, each of the one or more internal anchors is 10-nucleotide long.
In some embodiments, the binding segments of the donor DNAs bind to the internal anchors of the modified sgRNAs via a Watson-Crick interaction. Therefore, the binding segments of the donor DNAs also share similar length as the internal anchors of the modified sgRNAs. Accordingly, in some embodiments, each of the one or more binding segments is 3-nucleotide to 100-nucleotide long. In other embodiments, each of the one or more binding segments is 3-nucleotide to 20-nucleotide long. In other embodiments, each of the one or more binding segments is at least 3-nucleotide long. In other embodiments, each of the one or more binding segments is at least 4-nucleotide long. In other embodiments, each of the one or more binding segments is at least 5-nucleotide long. In other embodiments, each of the one or more binding segments is at least 6-nucleotide long. In other embodiments, each of the one or more binding segments is at least 7-nucleotide long. In other embodiments, each of the one or more binding segments is at least 8-nucleotide long. In other embodiments, each of the one or more binding segments is at least 9-nucleotide long. In other embodiments, each of the one or more binding segments is at least 10-nucleotide long. In other embodiments, each of the one or more binding segments is 3-nucleotide long. In other embodiments, each of the one or more binding segments is 4-nucleotide long. In other embodiments, each of the one or more binding segments is 5-nucleotide long. In other embodiments, each of the one or more binding segments is 6-nucleotide long. In other embodiments, each of the one or more binding segments is 7-nucleotide long. In other embodiments, each of the one or more binding segments is 8-nucleotide long. In other embodiments, each of the one or more binding segments is 9-nucleotide long. In other embodiments, each of the one or more binding segments is 10-nucleotide long.
(d) Sequence of the Internal Anchors of the Modified sgRNAs/Binding Segments of the Donor DNAs
In some embodiments, each of the internal anchors of the modified sgRNAs comprises a sequence that is an uncommon motif in a host genome. In specific embodiments, in some embodiments, each of the internal anchors of the modified sgRNAs comprises a sequence that is an uncommon motif in human genome. As non-limiting examples, in some embodiments, each of the internal anchors of the modified sgRNAs comprises a sequence from Table 1. In another embodiments, each of the internal anchors of the modified sgRNAs comprises a sequence from Table 2. In some embodiments, the binding segments of the donor DNAs bind to the internal anchors of the modified sgRNAs via a Watson-Crick interaction. Therefore, in some embodiments, each of the binding segments of the donor DNAs comprises a sequence that is reverse complementary to a sequence from Table 1. In other embodiments, each of the binding segments of the donor DNAs comprises a sequence that is reverse complementary to a sequence from Table 2.
In some embodiments, each of the binding segments of the donor DNAs comprises a sequence that is an uncommon motif in a host genome. In specific embodiments, in some embodiments, each of the binding segments of the donor DNAs comprises a sequence that is an uncommon motif in human genome. As non-limiting examples, in some embodiments, each of the binding segments of the donor DNAs comprises a sequence from Table 3. In another embodiments, each of the binding segments of the donor DNAs comprises a sequence from Table 4. In some embodiments, the binding segments of the donor DNAs bind to the internal anchors of the modified sgRNAs via a Watson-Crick interaction. Therefore, in some embodiments, each of the internal anchors of the modified sgRNAs comprises a sequence that is reverse complementary to a sequence from Table 3. In some embodiments, each of the internal anchors of the modified sgRNAs comprises a sequence that is reverse complementary to a sequence from Table 4.
In some embodiments, a nucleotide sequence (a linker) is designed to be in between one or more binding segments on donor DNAs.
In some embodiments, the linker is about 1 to 30-nucleotide long. In other embodiments, the linker is about 10 to 25-nucleotide long. In some embodiments, the linker is at least 5-nucleotide long. In some embodiments, the linker is at least 10-nucleotide long. In some embodiments, the linker is at least 15-nucleotide long. In some embodiments, the linker is at least 20-nucleotide long. In some embodiments, the linker is at least 25-nucleotide long. In some embodiments, the linker is about 5-nucleotide long. In some embodiments, the linker is about 10-nucleotide long. In some embodiments, the linker is about 15-nucleotide long. In some embodiments, the linker is about 20-nucleotide long. In some embodiments, the linker is about 25-nucleotide long. In some embodiments, the linker is about 30-nucleotide long.
In some embodiments, the linker is a sequence of deoxyadenosines, deoxyguanosine, thymidine, deoxycytidines, or any combination thereof. In some specific embodiments, the linker is sequence of poly-deoxyadenosine. In other specific embodiments, the linker is sequence of poly-thymidine. In other specific embodiments, the linker is sequence of deoxyadenosines and thymidines.
In some embodiments, the SOI of the donor DNA as described herein comprises the target sequence with one or more nucleotide substitution, one or more nucleotide insertion, one or more nucleotide deletion, or any combination thereof.
In some specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide insertion of 1 to 100 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide insertion of 1 to 50 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide insertion of 2 to 10 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide insertion of at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide insertion of at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, or at least 20 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide insertion of at least 50, at least 100, at least 1,000, at least 10,000, or at least 100,000 nucleotides.
In some specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide deletion of 1 to 50 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide deletion of 1 to 10 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide deletion of at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide deletion of at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, or at least 20 nucleotides.
In some specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide substitution of 1 to 50 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide substitution of 1 to 10 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide substitution of at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 nucleotides. In other specific embodiments, the SOI of the donor DNA as described herein comprises the target sequence with nucleotide substitution of at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, or at least 20 nucleotides.
Furthermore, the SOI of the donor DNA as described herein serves as a template for homologous recombination after the guide RNA-medicated nuclease cut, so in some embodiments, the second portion of the donor DNA further comprises an upstream and/or a downstream homology arm.
In some embodiments, the upstream homology arm described herein is about 10 to 1,000-nucleotide long. In other embodiments, the upstream homology arm described herein is about 10 to 80-nucleotide long. In some embodiments, the upstream homology arm described herein is at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, or at least 70 base long. In some specific embodiments, the upstream homology arm described herein is about 20-nucleotide long. In some specific embodiments, the upstream homology arm described herein is about 25-nucleotide long. In some specific embodiments, the upstream homology arm described herein is about 30-nucleotide long. In some specific embodiments, the upstream homology arm described herein is about 35-nucleotide long.
In some embodiments, the downstream homology arm described herein is about 10 to 1,000-nucleotide long. In other embodiments, the downstream homology arm described herein is about 10 to 80-nucleotide long. In some embodiments, the downstream homology arm described herein is at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, or at least 70 base long. In some specific embodiments, the downstream homology arm described herein is about 20-nucleotide long. In some specific embodiments, the downstream homology arm described herein is about 25-nucleotide long. In some specific embodiments, the downstream homology arm described herein is about 30-nucleotide long. In some specific embodiments, the downstream homology arm described herein is about 35-nucleotide long.
In some embodiments, the second portion of the donor DNA further comprises a distal and/or a proximal homology arm. In some embodiments, the distal homology arm is about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80-nucleotide long. In some embodiments, the proximal homology arm is about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80-nucleotide long.
In some specific embodiments, the SOI of the donor DNA described herein is optimized to avoid any PAM sequences of the CRISPR nuclease used in the system. For example, in some embodiments, the SOI of the donor DNA described herein is optimized by a silent mutation to avoid any PAM sequences of the CRISPR nuclease used in the system while no change on the level of amino acid is introduced in the SOI.
The donor DNA as described herein, as illustrated in
Furthermore, the first portion of the donor DNAs as described herein, which comprises one or more binding segments, can be designed to be various locations of donor DNAs, as tested in
In some embodiments, the system described herein further comprises a CRISPR nuclease. In some specific embodiments, the CRISPR nuclease is a DNA nuclease.
In some embodiments, the CRISPR nuclease is a class I CRISPR nuclease. In other embodiments, the CRISPR nuclease is a class II CRISPR nuclease.
In some embodiments, the CRISPR nuclease is a type I CRISPR nuclease. In some specific embodiments, the CRISPR nuclease is a type I-A, type I-B, I-C, I-D, I-E, I-F, or I-U CRISPR nuclease. In other embodiments, the CRISPR nuclease is a type II CRISPR nuclease. In some specific embodiments, the CRISPR nuclease is a type II-A, type II-B, or type II-C CRISPR nuclease. In some embodiments, the CRISPR nuclease is a type III CRISPR nuclease. In some specific embodiments, the CRISPR nuclease is a type III-A, type III-B, type III-C, or type III-D CRISPR nuclease. In some embodiments, the CRISPR nuclease is a type IV CRISPR nuclease. In other embodiments, the CRISPR nuclease is a type V CRISPR nuclease. In some specific embodiments, the CRISPR nuclease is a type V-A, V-B, or V-C CRISPR nuclease.
In some embodiments, the DNA nuclease described herein is a Cas1, a Cas2, a Cas3, a Cas4, a Cas5, a Cas6, a Cas7, a Cas8, a Cas9, a Cas10, a Cas12, a Cas14, a CasΦ, a Casm, or a Cmr. In some specific embodiments, when the DNA nuclease is a Cas 9, the DNA nuclease is a high-fidelity Cas9 or an eCas9. In some specific embodiments, when the DNA nuclease is a Cas 12, the DNA nuclease is a Cas12a (Cpf1), a Cas12b, a Cas12c, a Cas12d, a Cas12e, a Cas12g, a Cas12h, a Cas12i, a Cas12j, or a Cas12k.
In one aspect, provided herein is a kit comprising the donor DNA described herein and the modified sgRNA described herein. In some embodiments, the kit comprises the donor DNA described herein, the modified sgRNA described herein, and the CRISPR nuclease described herein.
In one aspect, provided herein is a pharmaceutical composition comprising the donor DNA described herein, the modified sgRNA described herein, and a pharmaceutically acceptable salt or derivative thereof. In some embodiments, the kit comprises the donor
DNA described herein, the modified sgRNA described herein, the CRISPR nuclease described herein, and a pharmaceutically acceptable salt or derivative thereof.
In one aspect, provided herein is a method of modifying a cell, wherein the method comprises transporting the system as described herein into the cell.
In some embodiments, the transporting comprises (a) incubating the CRISPR nuclease described herein and the modified sgRNA described herein to form a ribonucleoprotein (RNP) complex; (b) applying the donor DNA to the RNP complex; and (c) delivering the RNP complex-donor DNA from (b) to the cell. In some specific embodiments, the RNP complex is formed in vitro in step (a). In some embodiments, the CRISPR nuclease described herein is expressed and purified using relevant plasmid. In some embodiments, the modified sgRNA is in vitro transcribed from a corresponding ssDNA. In some non-limiting exemplary embodiments, the ratio of the CRISPR nuclease and the modified sgRNA in step (a) is about 1:0.5 to about 1:10. In other non-limiting exemplary embodiments, the ratio of the CRISPR nuclease and the modified sgRNA in step (a) is about 1:1 to about 1:1.2. In other non-limiting exemplary embodiments, the ratio of the CRISPR nuclease and the modified sgRNA in step (a) is about 1:0.6, about 1:1.2, about 1:2, or about 1:5. In some specific embodiments, applying the donor DNA to the RNP complex is carried out in vitro in step (b). In some specific embodiments, delivering the RNP complex-donor DNA from (b) to the cell is achieved by viral vectors, liposomes, and/or lipid nanoparticles. In other specific embodiments, delivering the RNP complex-donor DNA from (b) to the cell is achieved by electroporation. In one embodiment, delivering the RNP complex-donor DNA from (b) to the cell is achieved by nucleofection (see e.g., Distler et al., Exp Dermatol 2005 April; 14 (4): 315-20). In other specific embodiments, delivering the RNP complex-donor DNA from (b) to the cell is achieved by a polyethylene glycol (PEG) mediated transfection. In other specific embodiments, delivering the RNP complex-donor DNA from (b) to the cell is achieved by a gene gun.
In some embodiments, the transporting comprises (a) providing one or more vectors comprising a nucleotide sequence encoding the CRISPR nuclease described herein and a nucleotide sequence encoding the modified gRNA described herein; (b) delivering the one or more vectors of (a) to the cell; and (c) delivering the donor DNA described herein to the cell. In some preferred embodiments, step (c) is performed about 6 to 48 hours after step (b). In other preferred embodiments, step (c) is performed at least 6 hours after step (b). In other preferred embodiments, step (c) is performed at least 12 hours after step (b). In other preferred embodiments, step (c) is performed at least 18 hours after step (b). In other preferred embodiments, step (c) is performed at least 24 hours after step (b). In other preferred embodiments, step (c) is performed at least 30 hours after step (b). In other preferred embodiments, step (c) is performed at least 36 hours after step (b). In other preferred embodiments, step (c) is performed at least 42 hours after step (b). In other preferred embodiments, step (c) is performed at least 48 hours after step (b). In some specific embodiments, the delivering of the one or more vectors of (a) to the cell is achieved by viral vectors, liposomes, and/or lipid nanoparticles. In some specific embodiments, the delivering of the donor DNA to the cell is achieved by viral vectors, liposomes, and/or lipid nanoparticles.
In some embodiments, the cell being modified herein is an immune cell. In some specific embodiments, the cell being modified herein is a T cell. In some specific embodiments, the cell being modified herein is a B cell. In some specific embodiments, the cell being modified herein is an NK cell. In some specific embodiments, the cell being modified herein is a hematopoietic stem cell.
In some embodiments, method described herein is performed in vitro. In other embodiments, method described herein is performed ex vivo. In other embodiments, method described herein is performed in vivo.
Since the cleavage efficiency of the nuclease described herein is relatively high and the efficiency of homology directed repair (HDR) is relatively low, a large portion of the nuclease-induced double stranded breaks (DSBs) might be repaired via NHEJ. In other words, the resulting population of cells might contain some combination of wild type alleles, NHEJ-repaired alleles, and/or the desired edited allele based on the design of the SOI of the donor DNA described herein. The method of modifying the cell using the system described herein is advantageous, in view of a higher percentage of desired editing, low off-target rate, and/or low translocation large insertion, or large deletion rate.
In some embodiments, a percentage of desired editing using the method described herein is at least 10%, at least 50%, at least 100%, or at least 200% higher than a comparable system without the donor DNA comprising a first portion that binds to the modified sgRNA and/or without the modified sgRNA. In other embodiments, the method described herein has an off-target rate at least 10%, at least 50%, or at least 100% lower than a comparable system without the donor DNA comprising a first portion that binds to the modified sgRNA and/or without the modified sgRNA. In other embodiments, the method has a translocation, large insertion, or large deletion rate at least 10%, at least 50%, or at least 100% lower than a comparable system without the donor DNA comprising a first portion that binds to the modified sgRNA and/or without the modified sgRNA.
In order to further increase the HDR rate, in some embodiments, the method described herein further comprises synchronizing the cell to S phase. In other embodiments, the method described herein further comprises synchronizing the cell to G2 phase. In other embodiments, the method described herein further comprises inhibiting genes involved in NHEJ pathway. In other embodiments, the method described herein further comprises inhibiting genes involved in mismatch-repair pathway. In other embodiments, the method described herein further comprises fusing ctIP, a protein involved in double-stranded break resection, to the CRISPR nuclease. In other embodiments, the method described herein further comprises fusing single-strand annealing protein to the CRISPR nuclease.
In another aspect, provided herein is a method of treating a genetic disorder, wherein the method comprises administering to a subject with an effective amount of the system described herein. In some embodiments, the SOI comprises a sequence that reverses or alleviate the genetic disorder.
The following is a description of various methods and materials used in the studies, and are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the present disclosure, and are not intended to limit the scope of what the inventors regard as their disclosure nor are they intended to represent that the experiments below were performed and are all of the experiments that may be performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, percentages, etc.), but some experimental errors and deviations should be accounted for
The “Fusion-Oligo design” is referred to as a system where a donor DNA is fused to a guide RNA (see
Separately, “Terminal-Anchor design” was also tested. The “Terminal-Anchor design” is referred to as a system which comprises a guide RNA that is partially complementary to a donor DNA, and the complementary sequence is located at a terminus of the guide RNA (see
Based on the above surprising and unexpected observations, it was concluded that the type and the location of the connection between donor DNAs and guide RNAs substantially affect the editing efficiency. Donor DNA being fused with or hybridized to termini of guide RNAs did not result in satisfactory editing efficiency.
The “Internal-Anchor design” is referred to as a system which comprises a guide RNA that is partially complementary to a donor DNA, and the complementary sequence is located at the internal part of the guide RNA (see
With the Internal-Anchor CRISPR system, the effect of different positions of the fragments of donor DNA (“tail”) that was complementary to the guide RNA on the efficiencies of desired gene editing (‘CTT’ insertion at the target locus) in HEK293T cells was first evaluated. Different DNA donor designs included: conventional design where a donor DNA is without tails (indicated as “0” in
Next, the effect of a tail sequence on the donor DNA, matching or non-matching with the internal anchor (IA) sequence of the guide RNA, on editing efficiency was tested. As expected, negative control samples (no Guide RNAs, no Donor DNAs, or no Cas9, shown as “-” under “guide,” “Donor tail,” or “Cas9” categories) showed no observable CTT insertion (see
Different ratios of the two (ranging from 1:0.6 to 1:10) were tested using two tail designs (“L” and “L10aL,” similarly labeled as in Example 1). As shown in
The effect of the tail-tail inner sequence length (0, 5, 10, 15, 20, 25, and 30 deoxyadenosines;
The effect of the length of DNA donor homology arms on HDR editing efficiency was studied. Donors of different lengths at both distal (D) and proximal (P) sides relative to the cutting position were tested as shown in
Genome-wide off-target profiling was evaluated when targeting the HEK3 site using the Internal-Anchor CRISPR system with various elements: (1) the different guides (“WT” denotes guide RNA without internal anchors, and “IA” denotes guide RNA with internal anchors); (2) donor DNA tails (“0” denotes donor DNA without tails/binding segments that are complementary to the guide RNA, and “L10aL” denotes two DNA tails/binding segments are at the 5′ end of the donor DNA, with ten deoxyadenosines in between); (3) homology arms (“D20P16” denotes donor DNAs have a 20-nucleotide long distal homology arm and a 16-nucleotide long proximal arm; “D20P36” denotes donor DNAs have a 20-nucleotide long distal homology arm and a 36-nucleotide long proximal arm); and (4) by three SpCas9 (WT, HiFi and eCas9). As expected, the high-fidelity Cas9s showed reduced off-target site numbers (see
To delete a long-fragment in HEK3, dual modified sgRNAs (i.e., leg1 and leg2 in
LegRNA production. The legRNAs were produced by in vitro transcription (IVT). Briefly, DNA template of the legRNA was constructed and amplified by PCR. Then, T7 polymerase was used to transcribe the DNA to RNA, followed by DNAse I treatment to remove the remaining DNA template in the sample.
Cas9-legRNA ribonucleoprotein (RNP) transfection. HEK293T cells were seeded in a 24-well plate one day before transfection. Firstly, legRNAs were assembled with cas9 protein to form a ribonucleoprotein complex (RNP). Next, asODN were added to form a RNP-asODN complex. The RNP-asODN complex were transfected into the cells using lipofectamine. DNA from the cells were extracted 2-3 days after transfection.
PCR and gel electrophoresis. PCR and gel electrophoresis were used to visualize the 1 kb deletion. 1 forward primer and 2 reverse primers were used in the PCR reaction so that the WT product and the deletion product can be amplified and visualized on a gel simultaneously.
Amplicon sequencing. The amplicon sequencing library was generated with 2-step PCR using target site specific primers and index primers. The library was sequenced by illumine Nextseq500 or iSeq sequencing system.
The results of using different asODN-legRNA designs were shown in
Similarly, as Example 1, a long-fragment (1075-bp) deletion in HPRT1 was tested. A similar dual asODN-legRNA system as Design A1 was used (asODN5: SEQ ID: HPRT_A1_asODN1; asODN6: SEQ ID: HPRT_A1_asODN2; legRNA7: SEQ ID: HPRT_legRNA1; and legRNA8: SEQ ID: HPRT_legRNA2), and the results were shown in
To test large-fragment insertion of 1734 bases into GAPDH, cas9 and legRNA (SEQ ID: GAPDH_legRNA) with a circularized anchored-ssODN (casODN), cas800HA (SEQ ID: casODN_800HA, internal anchor sequence: ACATTGTTCTCACTT (SEQ ID NO: 6123) were used (
While preferred embodiments of the present disclosure have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. It is not intended that the disclosure be limited by the specific examples provided within the specification. While the disclosure has been described with reference to the aforementioned specification, the descriptions and illustrations of the embodiments herein are not meant to be construed in a limiting sense. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the disclosure. Furthermore, it shall be understood that all aspects of the disclosure are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the embodiments of the disclosure described herein may be employed in practicing the disclosure. It is therefore contemplated that the disclosure shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the disclosure and that methods and structures within the scope of these claims and their equivalents be covered thereby.
This application is a U.S. National Phase Application under 35 U.S.C. 371 of International Application No. PCT/CN2023/080356, filed on Mar. 8, 2023, which claims the benefit of U.S. Provisional Application Ser. No. 63/318,362 filed on Mar. 9, 2022, the entirety of which is hereby incorporated by reference herein.
The disclosure was made with government support under Grant No. 2016-02830 awarded by the Swedish Research Council to Zongli Zheng.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2023/080356 | 3/8/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63318362 | Mar 2022 | US |