The present disclosure provides methods and tools with application in the editing of nucleic acid. The methods or tools can be used in genome editing and are efficient and precise.
Over the past decade, genome editing technologies have become one of the essential molecular techniques in biomedical research (1-4). Given that many human diseases have a genetic basis, these genome editing technologies, especially precise insertion, deletion, or replacement of parts of the genome, hold tremendous promise for the treatment of monogenetic disorders (5-8). Since their discovery, CRISPR/Cas systems have quickly become the editing technology of choice for targeted genome manipulation (9-12). CRISPR/Cas systems employ a short RNA molecule, the guide RNA, to lead a CRISPR effector nuclease to the genomic target position of interest, creating a DSB at the genomic target site. Subsequently, DNA damage-induced endogenous DNA repair machineries are recruited towards the cut site and repair the damage, by which the genomic manipulations occur and the ‘editing’ is finalized (13,14).
Cas12a (previously known as Cpf1), like Cas9, is a single-effector CRISPR protein (15,16). Cas12a differs from Cas9 with respect to several important properties (17). Cas12a naturally employs a single CRISPR RNA (crRNA) as guide RNA, which is substantially shorter than the engineered single guide RNA (sgRNA) used for Cas9 (18). In addition, Cas12a recognizes a T-rich protospacer adjacent motif (PAM) sequence, comparing to the G-rich PAM recognized by Cas9 (16). And importantly, Cas12a uses a single RuvC catalytic domain to cleave both the target and non-target strands, creating 5′-overhang sticky ends (19,20), while Cas9 employs two nuclease domains RuvC and HNH generating a blunt-ended DSB at the target locus (21,22).
Some of these unique properties of Cas12a have been exploited in various practical applications, for example, to edit AT-rich sequences that lack adequate Cas9 target sites (23,24). Other unique applications include: Cas12a's crRNA processing ability was utilized to simplify multiplex genome editing (25); DNase-dead Cas12a-carried base-editing domains achieved base-editing without DNA strand breaks (26,27); the on-target binding-induced collateral ssDNA cleavage activity was further developed to a virus detect tool in clinical diagnostics (28); and the ability to generate sticky-ends at the target DNA cleavage site has been employed as a tool for in vitro DNA assembly (29,30). Recently, Li et al developed a new Cas12a-based genome editing method named MITI (microhomology-dependent targeted integration) (31), which utilized the Cas12a-generated compatible sticky ends between transgene and target site termini to direct a site-specific gene insertion. The authors demonstrated how MITI could be applied to insert a gene of interest together with a positive selection cassette into a single Cas12a target-site in the genome. Yet, the need for a selection cassette and the reported inaccurate integration of the targeting construct at the 5′ and 3′ junctions restrain its application, especially in the context of gene therapeutics.
Two AsCas12a cleavage sites on a genome may be used to excise a target sequence and replace it with a dsDNA insert containing compatible sticky ends. This strategy has been termed ‘Cut-And-Paste Repair’ (CAPR), but the repair efficiency is rather low.
There is a need to continue to expand the repertoire of methods and tools for the efficient and precise editing of nuclei acid/genome sequences.
The present invention is based on the finding that nucleic acids (including genomic nucleic acids) can be efficiently altered or edited using methods which exploit both ‘sticky-ended’ ligation (through microhomology-mediated end joining) and homology directed repair (HDR). The invention is further based on the finding that repair templates which comprise a single sticky end compatible with the 5′ overhang created by certain nucleases (including, for example nucleases of the CRISPR-Cas system), allow sticky-ended ligation, and may be sufficient to effect efficient nucleic acid/genome editing.
Throughout this specification, the terms “comprise”, “comprising” and/or “comprises” is/are used to denote aspects and embodiments of this invention that “comprise” a particular feature or features. It should be understood that this/these terms may also encompass aspects and/or embodiments which “consist essentially of” or “consist of” the relevant feature or features.
For convenience, the various methods of this disclosure shall generally be referred to as “methods of editing nucleic acids”.
The term “editing” should be taken to embrace the act of altering a nucleic acid sequence—that is making one or more changes to a nucleic acid sequence. In this regard, a method of editing a nucleic acid sequence may be used to correct or introduce a mutation and or to introduce a particular sequence (to another). A method of editing a nucleic acid sequence may comprise replacing one sequence (or part(s) of a sequence) with another. A method of editing as described herein may focus on making one or more predetermined or defined changes to the nucleic acid sequence of a specific target region. The term ‘target region’ should be taken to mean a part or portion of a nucleic acid sequence which comprises a sequence which is to be edited. The term ‘editing’ as applied to a nucleic acid sequence may embrace any alteration or modification of the sequence of a target region. For example, an editing method of this disclosure may comprise correcting an error within a target region. In another example, an editing method of this disclosure may comprise altering (e.g. changing) one or more of the nucleobases within a target region. In a further example, an editing method of this disclosure may be used to add nucleobases to a target region. Additionally or alternatively, an editing method of this disclosure may be used to delete nucleobases from a target region within a nucleic acid sequence.
Within the context of this disclosure, a nucleic acid to be edited may be a genomic nucleic acid. As such, the phrase “nucleic acid editing” may embrace a method of genome editing. The nucleic acid to be edited may be a synthetic or isolated nucleic acid sequence from any source. The nucleic acid to be edited may comprise an exogenous sequence. The nucleic acid to be edited may comprise an endogenous sequence. The nucleic acid sequence to be edited may be part of a genome. The nucleic acid to be edited may be a double stranded nucleic acid.
As such, the disclosure provides a method of editing a nucleic acid, said method comprising:
The disclosure further provides a repair template for use in a genome editing method or in a method of editing a nucleic acid, said repair template comprising an end which matches a cut end of a cut (genomic) nucleic acid to be edited and a sequence which is homologous to a sequence of the (genomic) nucleic acid to be edited.
The step of cutting the nucleic acid to be edited may use a nuclease which creates staggered cuts, especially staggered cuts in dsDNA. It should be noted that a staggered cut may comprise a break (in the nucleic acid to be edited) characterized by overhanging sequences. These over hangs may be 5′ overhangs. Cuts of this type may be commonly referred to as ‘sticky-end’ or ‘sticky-ended’ cuts. Whatever nuclease is used to cut the nucleic acid, that nuclease may yield a staggered (sticky-end) cut having a set number of nucleotides with the overhang sequence. By way of example a sticky-ended cut may comprise a 5′-overhang, with 2, 3, 4, 5, 6 or 7 nucleotides. In one teaching, a 5′ overhang (sticky-ended cut) may comprise 4 or 5 nucleotides.
The step of cutting the nucleic acid to be edited may use a nuclease of the CRISPR-Cas system.
The step of cutting the nucleic acid may use a Cas12a nuclease. As such, a method of this disclosure may comprise:
Cas12a (previously known as Cpf1) is a single-effector CRISPR protein. Unlike Cas9, Cas12a naturally employs a single CRISPR RNA (crRNA) as guide RNA, which is substantially shorter than the engineered single guide RNA (sgRNA) used for Cas9. Moreover, Cas12a recognizes a T-rich protospacer adjacent motif (PAM) sequence, comparing to the G-rich PAM recognized by Cas9. Cas12a uses a single RuvC catalytic domain to cleave both the target and non-target strands, this creates 5′-overhang sticky ends (in contrast, Cas9 employs two nuclease domains RuvC and HNH generating a blunt-ended double-stranded break at the target locus).
The term Cas12a (as used herein), embraces any of the recognized Cas12a homologs, including, for example FnCas12a (from Francisella novicida), LbCas12a (from Lachnospiraceae bacterium) and AsCas12a (from Acidaminococcus sp.). The term ‘Cas12a’ may also embrace any functional form of Cas12a including, for example, any fragments which retain an ability to cut (or cleave) a nucleic acid to create 5′-overhang sticky ends. In one teaching the Cas12 nuclease is AsCas12a.
In use, a Cas12a nuclease will cut both strands of the nucleic acid to be edited (i.e. both the target and the non-target strand) to yield two nucleic acid fragments, both having 5′-overhang sequences (so called 5′ overhang sticky ends).
A method of this disclosure may use a single Cas12a molecule in order to cut the nucleic acid sequence creating a single double stranded break in the nucleic acid (i.e. the nucleic acid to be edited).
The nucleic acid to be edited may comprise a protospacer adjacent motif (PAM) or PAM sequence. The site at which the nuclease (for example a Cas12a nuclease) cuts, may lie distal to the PAM site. Where the nuclease (for cutting the nucleic acid sequence to be edited) is a Cas12a nuclease, the PAM sequence is a T-rich PAM sequence.
The nucleic acid to be edited may comprise a Cas12a cleavage site.
The nucleic acid to be edited may comprise a guide RNA or CRISPR RNA binding site. Further detail regarding any gRNA and/or crRNA component is provided below.
The target region of the nucleic acid sequence to be edited may lie adjacent (or downstream of) the Cas12a cleavage site. The specific sequence to be altered may lie within a few, for example 1, 2, 3, 4, 5, 10, 15, 20 or more, base pairs of the cleavage site.
A repair template for use in a method of this disclosure may comprise a double stranded nucleic acid sequence. The repair template may comprise a single stranded sequence. The repair template may be a fully or partial double-stranded repair template.
As stated, the repair template comprises an end which matches a cut end of the nucleic acid to be edited and a sequence which is homologous to a part of that nucleic acid.
The end of the repair template which matches a cut end of the nucleic acid to be edited, may be referred to as a ‘cut-end matching overhang sequence’. The cut-end matching overhang sequence may comprise a 5′-overhang sequence (namely a ‘cut end matching 5′-overhang sequence’). The cut-end matching 5′-overhang sequence may match the sequence of the 5′-overhang of one of the nuclease cut ends of the nucleic acid to be edited.
One of skill will appreciate that when a nucleic acid sequence is cut (or cleaved) using a nuclease as described herein (for example Cas12a), two staggered and ‘sticky’ cut ends are generated. Each staggered/sticky end comprises a 5′-overhang sequence. Cutting a nucleic acid sequence as described herein (for example with Cas12a) will generate two cut ends a left-hand, or proximal, cut end and a right-hand, or distal, cut end.
The cut end matching 5′-overhang sequence a repair template for use in a method of this disclosure may match the sequence of a distal 5′-overhang cut end. For example, the cut matching 5′ overhang sequence of a repair template for use in a method of this disclosure may comprise a sequence which matches the sequence of a distal 5′-overhang cut end generated by Cas12a.
The other end of the repair template may also be sticky-ended (i.e. comprise a sequence overhang (e.g. a 5′-overhang)).
Alternatively, the other end may be blunt-ended.
Accordingly a repair template for use in a method of this disclosure may comprise two ends, at least one of which is ‘sticky-end’ with a 5′-overhang sequence, the sequence of which matches the sequence of one of the cut ends of the nucleic acid sequence to be edited and one other sticky end or a blunt end. Where the nucleic acid to be edited is cut using Cas12a, the repair template may comprise at least one end in which the sequence of the 5′-overhang matches the sequence of one of the cut ends generated by the Cas12a nuclease.
The cut end matching 5′-overhang sequence of the repair template may comprise the same or a different (for example a lesser) number of nucleotides as present in the 5′ overhang of the (distal) cut end of the nucleic acid sequence to be edited. Where the repair template comprises fewer nucleotides in its cut-end matching 5′ overhang, those nucleic acids will match (i.e. correspond to) at least some of the nucleotides present in sequence of the (distal) cut-end 5′-overhang.
The cut end matching 5′-overhang sequence (of the repair template) may comprise 2 nucleotides, 3 nucleotides, 4 nucleotides or 5 nucleotides.
In one teaching, the cut end matching 5′-overhang sequence (of the repair template) may comprise 4 nucleotides. The 5′-overhang sequence of the (distal) cut end of the nucleic acid sequence to be edited may also comprise 4 nucleotides. Those 4 nucleotides may match or be the same as 4 of the nucleotides present in the cut matching 5′ overhang sequence of the repair template.
In one teaching, the cut end matching 5′-overhang sequence (of the repair template) may comprise 5 nucleotides. The 5′-overhang sequence of the (distal) cut end of the nucleic acid sequence to be edited may also comprise 5 nucleotides. Those 5 nucleotides may match or be the same as 5 of the nucleotides present in the cut matching 5′ overhang sequence of the repair template.
The part of the repair template which comprises a sequence which is homologous to part of the nucleic acid to be edited, may be homologous to all or part of the target region of that nucleic acid sequence. For convenience, this sequence shall be referred to as the ‘homologous sequence’.
The homologous sequence may comprise the intended sequence edit—that is the sequence that is to replace or modify a sequence of the target region or the sequence which introduces an additional nucleic acid sequence into the target region.
In one teaching, the homologous sequence (of the repair template) may be substantially identical to all or part of the target region.
As stated, the homologous sequence of the repair template may be fully or partially double stranded. For example, the homologous sequence may comprise both double stranded parts and single stranded parts.
Relative to the sequence of the target region, the homologous sequence may comprise one or more nucleobase alterations—for example the inclusion of additional nucleobases (not present in the sequence of the target region) and/or the omission of other nucleobases which are present in the target region. Where the target region comprises a mutation (for example a deleterious mutation, the homologous sequence of the repair template may comprise a correction—i.e. the correct nucleotide base pairs.
The homologous sequence may comprise a sequence which differs from the sequence of all or part of the target region by the presence of:
The homologous sequence may comprise between about 1 and about 300 bases and/or base pairs.
The homologous sequence may comprise between about 1, about 2, about 3, about 4, about 5, about 6, about 7, about 8, about 9 or about 10 and about 250 bases and/or base pairs.
The homologous sequence may comprise between about 20 and about 200 base pairs.
The homologous sequence may comprise about 25, about 30, about 35, about 40, about 45, about 50 about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, about 105, about 110, about 115, about 120, about 125, about 130, about 135, about 140, about 145, about 150, about 155, about 160, about 165, about 170, about 175, about 180, about 185, about 190, or about 195 bases and/or base pairs.
The homologous sequence may comprise about 40 to about 120 bases and/or base pairs.
The homologous sequence may comprise about 60 to about 100 bases and/or base pairs.
The homologous sequence may comprise 80 bases and/or base pairs.
The homologous sequence may comprise, at least a single strand of nucleic acid which is homologous to sequence of the nucleic acid to be edited. All or part of that homologous sequence may further comprise an additional nucleic acid strand forming base pairs with some or all of the bases of the single strand. For example, the homologous sequence may comprise, for example 1-300, for example 20, 40, 60, 80, 100 or 120 bases (which bases are homologous to a sequence of the nucleic acid to be edited). At least some of those bases may be paired to other bases to form a double stranded homologous sequence. For example, 1-60 (for example 2, 4, 6, 8, 10, 15, 20, 25, 30, 35, 40, 45, 50 or 55) bases of the single strand of the homologous sequence may be pair to other bases to form a (partial) double strand).
One of skill will appreciate that the exact size (in terms of number of bases and/or base pairs) of the homologous sequence may vary. The variance may depend on the size of the target region within the nucleic acid sequence to be edited (longer target regions may require longer homologous sequences) and or the number of edits that need to be made to the target region.
As stated, the homologous sequence may match at least part of the sequence of the target region.
The abovementioned homologous sequence may be disposed (in the repair template) between the blunt end and the sticky end.
In view of the above, the repair template may comprise a double stranded nucleic acid sequence comprising a short dsDNA homologous arm (comprising the homologous sequence) and a “sticky” 5′-overhang end which matches an AsCas12a-generated cut end.
The repair template (and methods using the same) functions to deliver a desired sequence to a target region within a nucleic acid sequence to be edited. For example, the repair template may deliver:
In one teaching, a method of editing a nucleic acid sequence may comprise:
In all the disclosed methods the step of contacting may take place under conditions which facilitate editing of the nucleic acid sequence by replacement of a sequence within a target region of the nucleic acid to be edited with a nucleic acid sequence comprised within the repair template (specifically, for example, the nucleic acid sequence provided by the homologous sequence of the repair template). Those conditions may facilitate editing via both homology directed repair (HDR) and microhomology-mediated end joining (MMEJ) mechanisms. This combined approach is advantageous and the inventors have adopted the term ‘Ligation-Assisted Homologous Recombination’ (LAHR) to describe the various novel (HDR/MMEJ) methods described herein.
As compared to prior art methods, the LAHR methods described herein achieve relatively high levels of editing efficiency. Accordingly, the methods of this disclosure may be used to efficiently:
Further, as compared to prior art methods, the LAHR methods described herein can achieve high levels of editing efficiency without the use of one or more DNA repair inhibitors, such as BAY-598 and/or NU7441. Other DNA repair inhibitors may include SB939, A196, KY02111, R-PFI-2-hydrochloride, A395 and/or AT9283. Moreover, in the disclosed methods the AsCas12a-cleaved genomic DSB end and repair template may both contain homologous 5′ overhangs, such that they enter the MMEJ pathway at the level of Polyθ, skipping the need of strand resection. The remaining homologous arm of the template recombines by HDR. As a consequence the disclosed methods provide a novel method capable of precisely editing the genome, which (as stated) does not require the use of DNA repair inhibitors.
The methods of this disclosure may further comprise the use of a guide RNA (gRNA) most commonly known as a CRISPR RNA (crRNA). Where the nuclease is a Cas12a nuclease, the method may use is single crRNA. The role of the crRNA is to guide the nuclease to the cleavage site. The crRNA may be synthetic and one of skill will know that the sequence of any necessary crRNA may vary depending on the sequence of the nucleic acid to be edited.
This disclosure further provides a kit for repairing a nucleic acid sequence, the kit comprising a repair template as described herein.
A kit of this disclosure may further comprise:
The present invention will be described by reference to the following figures which show:
Recombinant SpCas9 and AsCas12a proteins (Table 11), as well as single-copy EGFPΔfluor and EGFPY66S reporter HAP1 cell lines and Beta-2-microglobulin (B2M)-deficient HAP1 cell line (HAP1B2M−/−) are available through Divvly (https://divvly.com/geijsenlab).
HAP1 cells derived from the KBM-7 cell line were a main cell line used in this study (32). All the reporter cell lines based on HAP1 cells were cultured in Iscove's Modified Dulbecco's Medium (IMDM) (Gibco), supplemented with 10% fetal bovine serum and 1% penicillin/streptomycin; HEK293 cells were cultured in Dulbecco's Modified Eagle Medium (DMEM) (Gibco), with 10% fetal bovine serum and 1% penicillin/streptomycin; C2C12 cells were cultured in DMEM, with 15% fetal bovine serum and 1% penicillin/streptomycin; ARPE19 cells were cultured in DMEM/F12 (Gibco), with 20% fetal bovine serum, 56 mM sodium bicarbonate and 2 mM L-glutamine. All cells were grown at 37° C. in a humidified atmosphere containing 5% CO2.
Two targeting constructs, pAAVS1-EGFPΔfluor and pAAVS1-EGFPY66S, were made to generate the single-copy EGFPΔfluor and EGFPY66S reporter HAP1 cell lines (
The expression plasmid pET15B_AsCas12a was constructed using a previously published SpCas9 expression plasmid ‘Sp-Cas9’ (Addgene #62731) as backbone (33). Briefly, E. coli. codon optimized AsCas12a coding sequence (including NLS and 6×HIS tag at C-terminal) was synthesized by GenScript (GenScript). The AsCas12a coding sequence then was amplified by PCR with the primer pair: Fw 5′-AGGAGATATACCATGACCCAGTTTG-3′ (SEQ ID NO: 9), Rv 5′-GTTAGCAGCCGGATCCTTAATG-3′ (SEQ ID NO: 10), and cloned into the backbone plasmid between NcoI and BamHI sites.
All the restriction enzymes used here were products of New England Biolabs (NEB). All PCR fragments were cloned into the backbone plasmids with In-Fusion HD Cloning Plus kit (Takara).
To generate the single-copy EGFPΔfluor and EGFPY66S reporter cell lines, we targeted the reporter genes into the genome of HAP1 cells at the human AAVS1 locus that located in the first intron of human PPP1R12C gene (34). To enhance the efficiency of HDR, we co-transfected the donor plasmid together with recombinant SpCas9 protein and AAVS1-T2 guide RNA (10) into the cells by using Lonza Nucleofection system following the manufacturer's protocol. In brief, 4 μg of donor plasmid, 150 pmol of recombinant SpCa9 protein (75 μM) and 300 pmol of Alt-R 2-part guide RNA (100 μM) (IDT) were added into 100 μL of Lonza Nucleofection buffer for cell line (Lonza). 1×106 HAP1 cells were resuspended with the complete Lonza Nucleofection buffer, and the nucleofection was performed in a Lonza Nucleofector 2b device with the program of ‘Cell-line T-030’. 24 hours after transfection, the addition of puromycin (1:20000) was applied in the cell culture to start the positive selection. The concentration of puromycin was doubled after 3 days. After 10 days of positive selection, survival cells were single-cell sorted onto a 96-well plate. We typically sorted 96 single cells for each targeting. The correctly targeted HAP1 clones were verified by border PCRs (
To express and purify the recombinant AsCas12a protein, we adapted a previously published method (33). In brief, the expression plasmid pET15B_AsCas12a was introduced into the One Shot BL21(DE3) chemically competent E. coli. cells (Invitrogen) that were previously transformed with a chaperone plasmid pG-Tf2 (Takara). A single colony was grown overnight in 50 mL LB medium pre-culture containing 150 μg/mL ampicillin, 34 μg/mL chloramphenicol and 0.1% glucose, at 37° C., with shaking at 225 rpm. 10 mL pre-culture was then added into 400 mL of LB medium (150 μg/mL ampicillin, 34 μg/mL chloramphenicol, 1% glucose, 5 ng/mL tetracycline, and 2.5 mM MgCl2) and cultured at 37° C., with shaking at 225 rpm until OD reached 0.5. After IPTG was added to a final concentration of 1 mM, the culture was incubated overnight at 25° C. with shaking at 225 rpm. Harvested cells were lysed in the lysis buffer (50 mM NaH2PO4, 1 M NaCl, 1 mM MgCl2, 0.2 mM PMSF, 10 mM beta-2-mercaptoethanol and 0.1 mg/mL lysozyme, pH 8.0, supplemented with cOmplete Protease Inhibitor Cocktail Tablets (Roche), 1 tablet/50 mL and Benzonase Nuclease, 25 U/mL) with sonication at 4° C. The sonicated cell lysate was solubilized with the NDSB buffer (50 mM NaH2PO4, 1 M NaCl, 2 M NDSB-201, 2.5 mM MgCl2 and 10 mM beta-2-mercaptoethanol, pH 8.0) at 4° C. with rotation. The solubilized cell lysate was cleared by centrifugation at 10,000×g for 60 minutes at 4° C. The Ni2+ affinity column chromatography was performed using a 5-mL HisTrap™ HP column with an ÄKTA pure 25 FPLC system (GE Healthcare). AsCas12a protein was eluted in the elution buffer (50 mM NaH2PO4, 1 M NaCl, 500 mM GABA, 500 mM imidazole, 2.5 mM MgCl2 and 5 mM beta-2-mercaptoethanol, pH 8.0) with a continuous concentration gradient. The target elution peak was buffer exchanged into the protein storage buffer (25 mM NaH2PO4, 500 mM NaCl, 250 mM, 150 mM glycerol, 75 mM glycine, 1.25 mM MgCl2, 2 mM beta-2-mercaptoethanol, pH 8.0) (33), using a HiLoad 26/600 Superdex 200 gel filtration column (GE Healthcare). The purified AsCas12a protein then was concentrated to 75 μM using Amicon Ultracel Centrifugal Filters (MWCO 100 kDa) (Millipore).
All guide RNAs used in this study are synthetic guide RNAs (IDT, Table 3). The dsDNA repair inserts (used in CAPR), or templates (used in LAHR) were produced by annealing two reverse complement ssDNA oligos. All ssDNA sequences were from Integrated DNA Technologies (IDT) and listed in Tables 4-7. Each ssDNA oligo was dissolved in the oligo annealing buffer (30 mM HEPES, pH 7.5; 100 mM potassium acetate) to reach the concentration of 100 μM. A pair of oligos for annealing was mixed in equal volume, and heat at 95° C. for 5 minutes, and cool down to room temperature.
Induced Transduction by Osmocytosis and Propanebetaine (iTOP)
The recombinant CRISPR nuclease proteins, guide RNAs and repair donors were simultaneously transduced into target cells by using the iTOP method we described previously (33). One day prior to transduction, the reporter cells were plated in the Matrigel-coated wells on 96-well plates at 30-40% confluence, such that on the day of transduction, cells were at 70-80% confluence. Next day, for each well of the 96-well plate, 50 μL of iTOP mixture that contains 20 μL of transduction supplement (Opti-MEM media supplemented with 542 mM NaCl, 333 mM GABA, 1.67×N2, 1.67×B27, 1.67×non-essential amino acids, 3.3 mM Glutamine, 167 ng/mL bFGF2, and 84 ng/mL EGF), 10 μL of CRISPR nuclease protein (75 μM), 10 μL of guide RNA (75 μM) and the excess volume of nuclease-free water to reach the 50-μL total volume, were prepared. For the no-protein control, 10 μL of protein storage buffer was used instead of the CRISPR nuclease protein; and for the no-guide control, the equal volume of nuclease-free water was used to replace the guide RNA. The 50-μL iTOP mixture was added onto the cells immediately after the culture medium was removed. The plate then was incubated in a cell culture incubator for 45 minutes, after which the iTOP mixture was gently removed and exchanged for 200 μL of regular culture medium.
To deliver the LAHR components into reporter cells by electroporation, we used a Lonza Nucleofector system which includes Cell Line Nucleofector Kit V and Nucleofector 2b Device (Lonza), following the manufacturing protocol. In brief, 1 million target cells were re-suspended in 100 μL of supplemented Nucleofector solution V buffer which contains AsCas12a RNP together with repair templates (50-500 pmol of each component in the molar ratio of 1:1 were used in experiments). The electroporation was performed with the program ‘Cell-line T-020’ in the Nucleofector 2b Device. After electroporation, the cells were incubated at 37° C. and the culture medium was changed after 16 hours.
To verify the gene editing efficacies in single-copy EGFPΔfluor and EGFPY66S reporter HAP1 cell lines, FACS analyses were performed 48 hours after iTOP transduction. Cells in each well were trypsinized and resuspended in 200 μL of FACS buffer (5% FBS in 1×DPBS) containing 1:1000 DAPI (4′,6-diamidino-2-phenylindole) DNA dye (Sigma). For the beta-2 microglobulin (B2M)-deficient HAP1 cells, 48 hours after iTOP transduction, cells from each well were firstly trypsinized and then incubated in 50 μL of staining solution (1% FITC-conjugated anti-human HLA-A, B, C antibody (Biolegend) in FACS buffer) for 10 minutes at 4° C. After washing three times with 1×DPBS, cells were resuspended in 150 μL FACS buffer containing 1:1000 DAPI DNA dye. FACS analyses were carried out on a CytoFLEX LX system (Beckman). In all experiments, the total number of 10,000 viable single cells were acquired and were gated based on side and forward light-scatter parameters. Constitutive EGFP-expressing control HAP1 cells were used to adjust the parameters for the identification and gating of EGFP/FITC positive cells. The EGFP/FITC signal was detected using the 488 nm diode laser for excitation and the 525/40 nm filter for emission.
Cell viability was analyzed using an MTS Assay Kit (Abcam) following the manufacturer's instructions. In Brief, cells were seeded on a 96-well plate at 30-40% of confluence, iTOP transduction was performed when the confluence reached 70-80%. 12-24 hours after the iTOP transduction, 5 μg/mL of 3-(4,5-dimethylthiazol-2-yl)-5-(3-carboxymethoxyphenyl)-2-(4-sulfophenyl)-2H-tetrazolium (MTS reagent) was added into each well and incubated at 37° C. for 90 minutes. The absorbance was measured on a BIO-RAD XMark Microplate spectrophotometer at 490 nm (BIO-RAD).
To perform the small interfering RNA (siRNA)-mediated knockdown of the target genes involved in different DNA DSB repair pathways, the EGFPY66S reporter HAP1 cells were plated on 48-well plates and transfected with 3 pmol of either the targeting or control siRNAs (Table 8) using Lipofectamine RNAiMAX Transfection Reagent based on the manufacturer's protocol (Thermo Fisher). All siRNA oligos in this study were ordered from Thermo Fisher (Thermo Fisher).
qPCR Analysis
For siRNA targeted cell samples, total RNA was extracted using Trizol Reagent following the manufacturer's protocol. cDNA was produced using 250 ng of random hexamer primer (Invitrogen) and 5 μg of DNase-free RNase-treated total RNA each sample with SuperScript III Reverse Transcription Kit following the manufacturer's protocol (Invitrogen). qPCR was performed with iQ SYBR Green Supermix (BIO-RAD) in a BIO-RAD CFX96 Real-Time system (BIO-RAD). All the gene specific primers were listed in Table 8.
To assess the AsCas12a cleavage efficiencies of the target sites in the EGFPΔfluor mutant. We applied T7 Endonuclease I (T7E1) assay following the AsCas12a targeting cleavage conducted by iTOP. Three days after iTOP AsCas12 SNP transduction, cells were harvested to isolate genomic DNA using DNeasy Blood & Tissue Kit (Qiagen). Primers used for genomic DNA amplification are listed in Table 9. The gel-purified PCR products were then subjected to T7E1 assay with Alt-R Genome Editing Detection Kit (IDT) following the manufacturing protocol. Briefly, in a thermocycler, 500 ng of purified PCR product were denatured at 95° C. for 5 min and re-annealed at −2° C. per second temperature ramp to 85° C., followed by a −0.1° C. per second ramp to 25° C., and cooled to 4° C. The rehybridized PCR product was incubated with 3 U T7E1 enzyme at 37° C. for 30 min. The enzyme-treated products were resolved on a 2% agarose gel. Densitometry analysis was performed with ImageJ (35).
Amplicon sequencing with Illumina MiSeq platform was performed as previously described (36). In brief, the amplicon libraries were built following a two-round PCR protocol. The first round of PCR (PCR 1) amplified the target genomic loci by using locus-specific primer pairs tailed with Illumina sequencing adapters (Table 10). PCR 1 was performed using a Q5 High-Fidelity PCR Kit (NEB), following the manufacturing protocol. Each PCR 1 reaction (50 μL) contained 50 ng of genomic DNA template, 0.5 μM of each primer, 200 μM of dNTP, 0.02 U/μL of Q5 High-Fidelity DNA polymerase and 1×Q5 reaction buffer. The PCR 1 amplification initiated with a denaturation step at 98° C. for 2 min, followed by 30 cycles of denaturation at 98° C. for 10 s, primer annealing at 61° C. for 30 s, and primer extension at 72° C. for 30 s. Upon completion of the cycling steps, a final extension at 72° C. for 5 min was done and then the reaction was held at 12° C. The gel-purified PCR 1 products were then used as the templates of the second round PCR (PCR 2) where the PCR 1 products were indexed by the amplification using unique illumine barcoding primers. PCR 2 was as well performed with the Q5 High-Fidelity PCR Kit, in a 25-μL setup using 10 ng of purified PCR 1 product as template in each reaction. For the PCR 2 amplification, a denaturation step initiated at 98° C. for 12 s, followed by 12 cycles of denaturation at 98° C. for 10 s, primer annealing at 61° C. for 30 s, and primer extension at 72° C. for 30 s. When the final extension at 72° C. for 5 min was done the reaction was held at 12° C. Next, gel-purified PCR 2 products (pooled amplicons) were sequenced on an Illumina MiSeq platform, by which we generated about 30,000 total reads for each experimental sample. Sequencing reads were demultiplexed using MiSeq Reporter (Illumina). Alignment of amplicon sequences to a reference sequence was performed using CRISPResso2 (37). The editing efficiency was calculated as: the percentage of [the number of reads of edited]/[the number of total reads].
As shown in
We previously reported how a combination of small molecules could trigger the efficient uptake and intracellular release of recombinant protein and small oligonucleotides, a method termed iTOP (33). We employed iTOP to simultaneously deliver all the components of CAPR (recombinant AsCas12a protein, crRNA pair, and the repair insert) into the EGFPΔfluor reporter cells. The repair efficiency was quantified by FACS analysis and Sanger sequencing 48 hours after the iTOP transduction. We observed that CAPR enabled the replacement of mutated region between two cut sites and rescued the EGFP fluorescence, yet at rather low efficiency (<0.5%) (
Taken together, our data suggested that a single sticky end generated by AsCas12a cleavage was able to ligate to the compatible end of the repair insert by end-joining mechanisms, thereby allowing the homologous region of the insert to recombine to the corresponding region on the genome by a homology-directed process (
To explore the LAHR hypothesis further, we built another single copy EGFP mutant reporter (EGFPY66S) cell line (
To evaluate the role of the length of the homologous arm in the LAHR process, we designed a series of eight LAHR templates sharing the same sticky end, and with different lengths of the homologous arms, varying from 20 bp to 200 bp (
Next, we explored whether the presence of a compatible sticky end on the LAHR template was required in the LAHR process. Since the optimal length of the homologous arm has been determined to be 80 bp (
Since the iTOP transduction technology allowed simultaneous delivery of AsCas12a protein, crRNA, and LAHR template, we examined how the quantitative ratio of these components to affect LAHR efficiency. We had previously noticed that editing efficiencies plateaued when the concentration of SpCas9 protein reached 15-20 μM (not shown). We observed that the amount of AsCas12a protein used in LAHR exhibited similar plateau effect when the concentration was reaching 15 μM (
Next, we compared LAHR efficiency with simple HDR in our single-copy EGFPY66S reporter cells. As shown in
The LAHR template featured a rather short single-sided homologous arm carrying an intended nucleotide substitution, we wondered how the location of the nucleotide substitution could influence the LAHR efficiency. To address this question, we designed another experiment based on the same single-copy EGFPY66S reporter cell line used above, in which we performed LAHR with a series of repair templates carrying not only the nucleotide substitution to repair the A200C mutation, but also an additional silent mutation distributed along the homologous arm on each LAHR template (
Previous reports have demonstrated that mutation disrupting the PAM sequence or seed region can avoid ‘re-cutting’ of the edited genome and increase editing efficiency (42,43). We designed a LAHR targeting strategy to test whether introduction of silent mutations disrupting the Cas12a PAM site and/or seed region on the LAHR template could similarly enhance LAHR targeting efficiency (
In addition to using iTOP to deliver the LAHR components, we assessed the applicability of LAHR with a non-iTOP delivery method. With the Lonza Nucleofection system, we applied LAHR to repair the same mutation in EGFPY66S reporter cell line. We observed that nucleofection-mediated delivery of AsCas12a RNP and LAHR template similarly allows LAHR-mediated restoration of EGFP expression (
Our proof-of-concept data and characterization of LAHR in the reporter cell-line demonstrated that LAHR could efficiently repair a point-mutation in an EGFPY66S reporter system. We next compared LAHR gene editing efficiency with AsCas12a or SpCas9-mediated HDR in endogenous genes. Previously, we had introduced a homozygous nonsense mutation G4045T (Glu55-STOP) in exon 2 of the human beta-2-microglobulin (B2M) gene resulting in a B2M knockout phenotype (HAP1B2M−/−, unpublished). In the absence of B2M, the MHC1 complex cannot be presented at the cell surface. In this system, restoration of surface MHC1 expression can be used to quantify the restoration of B2M expression. There is an AsCas12a PAM sequence, TTTC, 14-bp upstream of this G4045T mutation, and the AsCas12a cleavage site is 4 bp downstream from the G4045T mutation (
In addition to repairing a targeted nonsense mutation in the endogenous B2M gene, we also assessed the ability of LAHR to precisely introduce single nucleotide substitutions in other endogenous genes. In a previously published report by Wang et al. (40), point mutations, C698874T and A474580G (
In the LAHR process, the AsCas12a-generated DSB is repaired by using a repair template featuring a sticky end (5′ overhang) and a short double-stranded homologous arm. Since both features are indispensable to accomplish the repair, we therefore assumed that there might be two distinct DSB repair pathways involved in the LAHR process. We hereby hypothesized that LAHR could utilize the 5′ homologous overhangs to ligate the repair template to the AsCas12a-created compatible DSB end via an MMEJ pathway and is subsequently completed by a homology-directed integration of the homologous arm (
As expected, PARP1 knockdown decreased EGFPY66S repair efficiency, as the upstream inhibition DSB detection and repair machinery recruitment could fundamentally restrain all DSB repair. The knockdown of Polyθ, an MMEJ essential gene, also significantly decreased the repair efficiency, which indicated that Polyθ-mediated MMEJ likely plays an important role in the LAHR process. After MMEJ, the ligated repair template could potentially be utilized through SSA, a process coordinated by Rad52. However, RAD52 knockdown did not affect LAHR efficiency, suggesting that LAHR does not involve RAD52-dependent SSA-mediated repair. Knockdown of RAD51 resulted in a significantly decreased repair efficiency, which indicated that besides MMEJ, HDR likely was another essential pathway employed in LAHR. In addition, we also observed that knockdown of either 53BP1 or Ku80, consistently resulted in a slight but significant increase in LAHR-mediated repair, in line with the role of these factors in determining the balance between NHEJ and other resection-dependent repair pathways.
Canonical MMEJ is initiated by strand resection, which creates 3′ overhangs to expose matched microhomologies (51). In LAHR, the AsCas12a-created genomic 5′ overhang and the compatible 5′ overhangs on the repair template seem to bypass the need for resection. To further examine the possibility of a resection-independent MMEJ mechanism employed in LAHR, we designed a LAHR template containing a 4-nt 3′ homologous overhang which could be utilized in MMEJ only after resection exposing the matching homology on the genome (
Together, these results clearly verified our hypothesis that both HDR and MMEJ pathways were essential for LAHR-mediated gene repair, and the MMEJ in LAHR takes place in a resection-independent manner.
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
(SEQ
(SEQ
(SEQ
(SEQ
(SEQ
(SEQ
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
GUUUUAGAGCUAGAAAUAGCAAGUUAAA
C
AGCATTCCTCACTACTGAAATACTCCTGCTCCCCCAAGCAGTGGGGGT
GCCAGGUUCUAGAGGAUGA (SEQ ID NO: 111)
GGAAGAGATAGUUUGAUUU (SEQ ID NO: 113)
GUCUUCAAGGGUGUCUGUC (SEQ ID NO: 116)
CAAACAAACCCUUAUCGUAAA (SEQ ID NO: 118)
AAGCCUCCGCUCCUGAACAAU (SEQ ID NO: 119)
AAGAUAGAGCGUGAAGGCGAA (SEQ ID NO: 120)
AGATGGACCCTACTGGAAGTC (SEQ ID NO: 127)
TGTTCATTGAACCCACTATTACCGTC (SEQ ID NO: 128)
ACCTCTGACCAGAGAGCTGCA (SEQ ID NO: 130)
CAAGAAGAGAACCTTGAAGCAA (SEQ ID NO: 131)
TGGATGCTCATGTCAAAAGGT (SEQ ID NO: 132)
AAGACTTGCGGCAATACATG (SEQ ID NO: 137)
CAGCATATTCCAAATATGCTGC (SEQ ID NO: 138)
CATGACAGAGACAGTGAAGAATTG (SEQ ID NO: 141)
ACTTTGGAGCATACCCTCTC (SEQ ID NO: 142)
ATACTCCATCCTCAGTGAGGTC (SEQ ID NO: 143)
ATGGGATCCTTGCTGCTATC (SEQ ID NO: 144)
GAGATTCTGAAGAAGCCGAGA (SEQ ID NO: 145)
CACCAATTCCAGGTAGAATAGA (SEQ ID NO: 147)
AGTGTGGCATAAATGCCAA (SEQ ID NO: 151)
ACAGCACTCCTGTAACTGTCT (SEQ ID NO: 155)
In the current study we describe a novel method for precise genome editing using an AsCas12a-generated DSB, and a dsDNA repair template containing a matching 5′ overhang and a short double-stranded homologous arm. We called this method LAHR, for ‘Ligation-Assisted Homologous Recombination’. LAHR was the first precise genome editing tool that deployed both HDR and MMEJ mechanisms to repair an AsCas12a-generated DSB and introduced a desired nucleotide substitution. The complementary 5′ overhangs created by AsCas12a at the target site in the genome lock the repair template in place and ligate via a resection-independent MMEJ pathway, while template integration is completed by HDR. As summarized in
In Cas12a-mediated genome editing, a LAHR template is more efficient than a ssODN template in introducing a specific mutation, The comparison between LAHR and SpCas9-mediated HDR (using ssODN templates) is difficult, if not impossible, due to differences in PAM sites, cut-sites and repair template preference between Cas12a and Cas9 gene editing systems. Yet our data demonstrates that LAHR repair efficiency is on par with Cas9-mediated HDR.
We also noticed that the distance between the mutation and the AsCas12a target site affects LAHR editing efficiency, which is consistent with previous reports describing the effect of the distance between the mutation and nuclease target site upon Cas9 targeting and ssODN-mediated HDR. When the cut site is more than 10 bp removed from the Cas9 target site, HDR efficiency was shown to drop sharply (52). A solution to prevent this drop in efficiency is to extend the size of the flanking homologous arms on the ssODN (53). This principle may also apply in LAHR, but using simultaneous transduction of the AsCas12a RNP and the LAHR template DNA, we observed that repair efficiencies drop with LAHR templates over 100 bp in size, likely because these have more trouble passing the nuclear envelope. We determined that with an 80-bp homologous arm, a favorable distance between the mutation and the Cas12a target site is between 0-20 bp.
Taken together, we believe LAHR adds an attractive tool to the CRISPR toolbox and provides an essential alternative to traditional Cas9-mediated HDR particularly in circumstances where the Cas9-mediated editing is impaired by the lack of a suitable PAM site or efficient guide RNA candidates.
Number | Date | Country | Kind |
---|---|---|---|
2117455.2 | Dec 2021 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2022/084232 | 12/2/2022 | WO |