This application contains a Sequence Listing that has been submitted electronically as an XML file named “54397-0009US1_SL_ST26.XML.” The XML file, created on Jul. 16, 2024, is 272,296 bytes in size. The material in the XML file is hereby incorporated by reference in its entirety.
The present disclosure belongs to the field of biotechnology and gene editing technology, and specifically relates to a method for constructing a gRNA mutant, a gRNA mutant, a method for narrowing an editing window of a base editor, a base editor, an isolated polynucleotide, a recombinant expression vector, a recombinant host cell, a base editor, a composition, and use as well as a method for gene editing in a cell or subject and a method for treating or preventing a disease.
CRISPR/Cas (clustered regularly interspaced short palindromic repeats/CRISPR-associated) system is a technique for a specific DNA modification of a targeted gene by an RNA-guided Cas nuclease, which is an adaptive immune defense mechanism evolved from bacteria and archaea in response to constant attacks by phages and foreign plasmids. Since its discovery, the CRISPR/Cas systems have been successfully applied to gene editing in multiple species and have been extensively used in the field of gene editing.
CRISPR/Cas systems discovered so far can be sorted into two classes (Class 1 and Class 2), and further sorted into six types (Type I to Type VI). Of these, Class 1 systems include Type I, Type III, and Type IV, and Class 2 systems include Type II, Type V, and Type VI. In the commonly used CRISPR/Cas systems, guide RNA (gRNA) guides the Cas protein to perform precise cleavages at the targeted sites of the genome, resulting in DNA double strand breaks (DSBs), and the host cell repairs the DSBs through its own non-homologous end-joining (NHEJ) or based on the homologous end recombination repair (HDR), but it is difficult to realize specific editing of a single base. Due to many uncertainties about DNA double strand breaks, HDR occurs with a very low probability, and NHEJ causes random insertions or deletions of bases. Consequently, conventional CRISPR/Cas techniques have certain disadvantages in gene editing of a single base.
The advent of base editors (BEs) remedies the defects of conventional CRISPR/Cas techniques in single-base editing. Without the occurrence of DNA double strand breaks and the participation of a donor DNA, the base editor may achieve precise point mutations at target sites and therefore holds a great application prospect in the treatment of genetic diseases caused by gene mutations. Available base editors mainly include: a cytosine base editor (CBE), which may convert the cytidylate within an editing window of the target sequence into the thymidylate (C>T): an adenine base editor (ABE), which may convert the adenylate within an editing window into the guanylate (A>G); and a novel glycosylase base editor (GBE), which may edit cytidylate into adenylate in Escherichia coli or specifically edit cytidylate into guanylate in mammalian cells.
At present, except for GBE that can precisely edit the C6 site in mammalian cells, the CRISPR/Cas-based base editors all have an editing window involving multiple nucleotides. For example, the editing window of typical CBE involves 4 or 5 bases, and the editing window of ABE also involves up to 4 bases, in which all Cs or As within the editing window may be edited. However, about half of known pathogenic mutations are point mutations (also referred to as single nucleotide polymorphism SNP). For correction of pathogenic point mutations, it is often necessary to precisely correct one base, and additional base modification(s) would cause side effects instead. It is thus particularly necessary to narrow the editing windows of CBE and ABE, or even enable the editing windows to be accurate to one base.
However, there is still a lack of a universal method for narrowing the editing window of bases now, which enables the editing windows of CBE and ABE to be accurate to one base.
In terms of CBE, the base editing window of CBE may be narrowed to some extent by selecting cytosine deaminases from different sources, different CRISPR systems, different linkers, and mutations in cytosine deaminases. In terms of ABE, no other adenine deaminases than the evolved adenine deaminases derived from bacteria have been found to be effective for A-to-G mutations after fused to the CRISPR systems. There is a lack of an effective method for narrowing the editing window of ABE. To date, there is no universal method for narrowing an editing window of a base editor, or a method that enables the editing windows of CBE and ABE to be accurate to one base, which greatly restricts the application of the base editing systems in precise base editing.
In view of the problems existing in the prior art, for example, the lack of a universal method for narrowing the editing window of the base editor and the difficulty of CBE or ABE in achieving the precise editing of a single base, the present disclosure provides a method for constructing a gRNA mutant, in which the gRNA mutant is obtained by mutating the guide sequence region in the gRNA. The gRNA mutant, when applied to the base editor, is capable of universally narrowing the base editing window and realizing the precise editing of one base.
In the first aspect, the present disclosure provides a method for constructing a gRNA mutant, wherein the method comprises:
In some embodiments, the screening step in the method according to the present disclosure comprises:
In some embodiments, in the method according to the present disclosure, the guide sequence region has a first end proximal to a PAM sequence of the nucleic acid of interest and a second end distal to the PAM sequence; and
In some embodiments, in the method according to the present disclosure, m is any integer from 15 to 30, preferably any integer from 15 to 25;
In some embodiments, in the method according to the present disclosure, the mutated nucleotide contains the substitution, deletion or insertion of 1 to 10 bases, preferably the substitution, deletion or insertion of 1 to 5 bases, more preferably the substitution, deletion or insertion of 1 to 3 bases.
In the second aspect, the present disclosure provides a method for narrowing an editing window of a base editor, wherein the method comprises constructing a gRNA mutant by the method according to the first aspect: preferably, the editing window of the base editor is one base.
In the third aspect, the present disclosure provides a gRNA mutant, wherein the gRNA mutant is constructed by the method according to the first aspect: preferably, the gRNA mutant is used for a base editor with an editing window being a single nucleotide site:
In some embodiments, the guide sequence region in the gRNA mutant according to the present disclosure has a first end proximal to a PAM sequence of the nucleic acid of interest and a second end distal to the PAM sequence; and
In some embodiments, the gRNA mutant according to the present disclosure, wherein m is any integer from 15 to 30, preferably any integer from 15 to 25:
In some embodiments, the mutated nucleotide in the gRNA mutant according to the present disclosure contains a substitution, deletion or insertion of 1 to 10 bases, preferably a substitution, deletion or insertion of 1 to 5 bases, more preferably a substitution, deletion or insertion of 1 to 3 bases.
In the fourth aspect, the present disclosure provides an isolated polynucleotide, wherein the isolated polynucleotide encodes the gRNA mutant according to the third aspect.
In the fifth aspect, the present disclosure provides a recombinant expression vector, wherein the recombinant expression vector contains the isolated polynucleotide according to claim 11.
In the sixth aspect, the present disclosure provides a recombinant host cell, wherein the recombinant host cell contains the recombinant expression vector according to claim 12.
In the seventh aspect, the present disclosure provides a base editor, wherein the base editor comprises either of the following (i) and (ii) and either of the following (iii) and (iv):
In some embodiments, the second domain in the base editor according to the present disclosure is a polypeptide having a deaminase activity: optionally, the second domain is an adenine deaminase or a mutant, homologue or polypeptide fragment having or partially having an adenine deaminase activity of the adenine deaminase: optionally, the second domain is a cytosine deaminase or a mutant, homologue or polypeptide fragment having or partially having a cytosine deaminase activity of the cytosine deaminase:
In the eighth aspect, the present disclosure provides a composition, wherein the composition comprises the gRNA mutant according to the third aspect, the isolated polynucleotide according to the fourth aspect, the recombinant expression vector according to the fifth aspect, the recombinant host cell according to the sixth aspect, or the base editor according to the seventh aspect:
In the ninth aspect, the present disclosure provide use of the gRNA mutant according to the third aspect, the isolated polynucleotide according to the fourth aspect, the recombinant expression vector according to the fifth aspect, the recombinant host cell according to the sixth aspect, the base editor according to the seventh aspect, or the composition according to the eighth aspect in at least one of the following (a) and (b):
In the tenth aspect, the present disclosure provides a method for gene editing in a cell or subject, wherein the method comprises bringing the cell or subject into contact with any one of the gRNA mutant according to the third aspect, the isolated polynucleotide according to the fourth aspect, the recombinant expression vector according to the fifth aspect, the recombinant host cell according to the sixth aspect, the base editor according to the seventh aspect, or the composition according to the eighth aspect:
In the eleventh aspect, the present disclosure provides a method for treating or preventing a disease, wherein the method comprises administering to a subject the gRNA mutant according to the third aspect, the isolated polynucleotide according to the fourth aspect, the recombinant expression vector according to the fifth aspect, the recombinant host cell according to the sixth aspect, the base editor according to the seventh aspect, or the composition according to the eighth aspect:
In some embodiments, the method for constructing a gRNA mutant provided in the present disclosure enables an acquisition of the gRNA mutant that does not perfectly match the target sequence of the nucleic acid of interest by mutating the guide sequence region in the gRNA. The gRNA mutant, when applied in a base editor, could significantly narrow the editing window of the base editor, improve the specificity of the gene editing, and has wide application prospects in gene therapy, drug screening, construction of animal and plant models, and other aspects. Besides, the gRNA mutants obtained by the construction method of the present disclosure further have an improved base editing efficiency and an optimized base editing effect.
In some embodiments, the method for constructing a gRNA mutant provided in the present disclosure allows for construction of a gRNA mutant that enables the editing window of the base editor to be accurate to one base, which has a high base editing specificity and editing efficiency, and is particularly useful for the treatment of genetic diseases caused by single-base mutations.
In some embodiments, the method for narrowing an editing window of a base editor provided in the present disclosure enables a base editor with a remarkably narrowed base editing window to be obtained without affecting the gene editing efficiency by modifying the gRNA using the method for constructing a gRNA mutant according to the present application, which has wide application prospects in the fields such as animal model construction, researches on functional genomics, molecular breeding, clinical medicine, and translational medicine.
In some embodiments, the base editor provided in the present disclosure has a narrowed base editing window or may even allow the editing window to be narrowed to one base, which could efficiently and precisely correct a single-site mutation, providing an efficient and precise editing tool for the treatment of genetic diseases, etc.
When used in combination with the term “including” in the claims and/or specification, the word “a” or “an” may refer to “one”, or refer to “one or more”, “at least one”, and “one or more than one”.
As used in the claims and specification, the term “including”, “having”, “comprising”, or “containing” is intended to be inclusive or open-ended, and does not exclude additional or unrecited elements or methods and steps.
Throughout the application document, the term “about” means that a value includes the error or standard deviation caused by the device or method used to measure the value.
It is applicable to the content disclosed herein that the term “or” is defined only as alternatives and “and/or”, but the term “or” used herein refers to “and/or” unless otherwise expressly stated to be only alternatives or mutual exclusion between alternatives.
As used herein, the terms “polypeptide”, “peptide”, and “protein” are used interchangeably herein and are an amino acid polymer of any length. This polymer may be linear or branched. It may contain modified amino acids, and it may be spaced by non-amino acids. This term also includes amino acid polymers that have already been modified (e.g., formation of disulfide bonds, glycosylation, lipidation, acetylation, phosphorylation, or any other operations such as conjugation with a labeled component).
As used herein, the term “editing window” refers to the region covered by editable bases that are located within the guide sequence region of gRNA and have an editing efficiency greater than the average editing efficiency of all editable bases. Exemplarily, the adenine base editor edits the adenines (A) present in the guide sequence region, and the adenines (A) constitute editable bases within the guide sequence region. If there is i (i≥1) A within the guide sequence region, where the base editing efficiency of Ax (x is any integer from 1 to i) is greater than the average editing efficiency of A1 to Ai, then Ax constitutes the coverage region of the editing window. In some expressions of the present disclosure, “narrowing of editing window” and “improvement of single-base editing specificity” have the same meaning.
As used herein, the term “CRISPR” refers to clustered regularly interspaced short palindromic repeats, which comes from the immune system of microorganisms.
As used herein, the term “Cas protein mutant” includes Cas protein mutants, homologues or polypeptide fragments thereof that have a lost or reduced endonuclease activity as compared to a wild-type Cas protein.
As used herein, the term “wild-type Cas protein” refers to a CRISPR-associated protein. The Cas protein and the CRISPR sequence jointly form the CRISPR/Cas system. The Cas protein has a nuclease-associated functional domain, and cleaves a target sequence at a specific position by recognizing PAM (protospacer adjacent motif).
As used herein, the term “fusion protein” refers to a hybrid polypeptide comprising protein domains from at least two different proteins. The fusion protein may be a chimeric protein produced by ligating two or more genes that initially encode the separate proteins. Translation of the fusion gene produces a single polypeptide with functional characteristics derived from each of the original proteins.
As used herein, said “gRNA” is also referred to as a guide RNA, and has the meaning typically understood by a person skilled in the art. In general, the guide RNA may include a direct repeat sequence and a guide sequence, or consists essentially of or consists of a direct repeat sequence and a guide sequence (also referred to as a spacer in the context of the endogenous CRISPR system). In different CRISPR systems, the gRNA may include crRNA and tracrRNA or may contain only crRNA, according to different Cas proteins on which the gRNA depends. The crRNA and tracrRNA may be artificially modified and fused to form single guide RNA (sgRNA).
As used herein, the term “protospacer adjacent motifs (PAMs)” refers to sequences adjacent to the target sequence recognized by the Cas protein, which may be located at the 3′ end (e.g., the CRISPR/Cpf1 system) of the target sequence, or may be located at the 5′ end (e.g., the CRISPR/Cas9 system) of the target sequence.
As used herein, the term “target sequence” refers to the nucleotide sequence in the nucleic acid of interest that is complementary to or at least partially complementary to the gRNA. In the present disclosure, “target sequence” and “target nucleic acid” may be used interchangeably.
As used herein, the term “target strand” refers to the nucleotide strand in the nucleic acid of interest that hybridizes with the gRNA: the term “non-target strand” refers to the nucleotide strand in the nucleic acid of interest that does not hybridize or pair with the gRNA.
As used herein, the term “polynucleotide” refers to a polymer composed of nucleotides. A polynucleotide may be in the form of an individual fragment, or may be a constituent part of a larger nucleotide sequence structure, which is derived from the nucleotide sequence that has been isolated at least once in number or concentration, and could be recognized, operated, and sequence recovered as well as nucleotide sequence recovered by a standard molecular biological method (e.g., using a cloning vector). When a nucleotide sequence is represented by a DNA sequence (i.e., A, T, G, C), it also includes an RNA sequence (i.e., A, U, G, C), where “U” substitutes for “T”. In other words, “polynucleotide” refers to a nucleotide polymer knocked out from an additional nucleotide (an individual fragment or an entire fragment), or may be a constituent part or component of a larger nucleotide structure, such as an expression vector or a polycistronic sequence. Polynucleotide includes DNA, RNA, and cDNA sequences. “Recombinant polynucleotide” or “recombinant nucleic acid molecule” is one of “polynucleotides”.
As used herein, the term “hybridization” refers to any process of pairing complementary nucleic acids by binding a nucleic acid strand to a complementary strand through base pairing to form a hybrid complex.
As used herein, the term “mutant” refers to a polynucleotide or polypeptide that, relative to the “wild-type” or “comparative” polynucleotide or polypeptide, contains alternation(s) (i.e., substitution, insertion and/or deletion of a polynucleotide) at one or more (e.g., several) positions, where the substitution refers to a substitution of a different nucleotide for a nucleotide that occupies one position: the deletion refers to removal of a nucleotide that occupies certain position; and the insertion refers to an addition of a nucleotide after the nucleotide adjacent to and immediately following the occupied position. In the present disclosure, alternations of nucleotides also correspond to alternations of bases, and the substituted, inserted or deleted nucleotides correspond to the substituted, inserted or deleted bases.
As used herein, the term “mutated nucleotide” or “mutation of nucleotide” includes “substitution, deletion or addition of one or more nucleotides”. In the present disclosure, the term “mutation” refers to an alternation of a nucleotide sequence.
As used herein, the terms “sequence identity” and “percent identity” refer to the percentage of nucleotides or amino acids that are the same (i.e., identical) between two or more polynucleotides or polypeptides. The sequence identity between two or more polynucleotides or polypeptides may be determined by aligning the nucleotide sequences of polynucleotides or the amino acid sequences of polypeptides and scoring the number of positions at which nucleotide or amino acid residues are identical in the aligned polynucleotides or polypeptides, and comparing the number of these positions with the number of positions at which nucleotide or amino acid residues are different in the aligned polynucleotides or polypeptides. Polynucleotides may differ at one position by, e.g., containing a different nucleotide (i.e., substitution or mutation) or deleting a nucleotide (i.e., insertion or deletion of a nucleotide in one or two polynucleotides). Polypeptides may differ at one position by, e.g., containing a different amino acid (i.e., substitution or mutation) or deleting an amino acid (i.e., insertion or deletion of an amino acid in one or two polypeptides). The sequence identity may be calculated by dividing the number of positions at which nucleotide or amino acid residues are identical by the total number of nucleotide or amino acid residues in the polynucleotides or polypeptides. For example, the percent identity may be calculated by dividing the number of positions at which nucleotide or amino acid residues are identical by the total number of nucleotide or amino acid residues in the polynucleotides or polypeptides, and multiplying the result by 100.
Exemplarily, two or more sequences or subsequences, when compared and aligned at maximum correspondence by the sequence alignment algorithm or by the visual inspection measurement, have at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% “sequence identity” or “percent identity.” of nucleotides. In some embodiments, the sequences are substantially identical over the full length of either or both of the compared biopolymers (e.g., polynucleotides).
As used herein, the terms “individual” and “subject” may be used interchangeably and refer to mammals. Mammals include, but are not limited to, domesticated animals (e.g., dairy cows, sheep, cats, dogs, and horses), primates (e.g., humans and non-human primates such as monkeys), rabbits, and rodents (e.g., mice and rats). In particular, the individual is a human.
As used herein, the term “vector” refers to a DNA construct containing DNA sequences operably linked to appropriate regulatory sequences, so as to express a gene of interest in a suitable host.
As used herein, the term “recombinant expression vector” refers to a DNA structure for expressing, e.g., a polynucleotide encoding the desired polypeptide. The recombinant expression vector may include, for example, transcription subunits containing i) a set of genetic elements having a regulatory effect on gene expression, such as promoters and enhancers: ii) a structure or coding sequence transcribed into mRNA and translated into a protein; and iii) an appropriate transcriptional and translational initiation and termination sequences. The recombinant expression vector is constructed in any suitable manner. The nature of the vector is not important, and any vector may be used, including plasmids, viruses, phages, and transposons. Possible vectors for use in the present disclosure include, but are not limited to, chromosomes, non-chromosomes, and synthetic DNA sequences, for example, viral plasmids, bacterial plasmids, phage DNAs, yeast plasmids, and vectors derived from combinations of plasmids and phage DNAs, and DNAs from viruses such as lentivirus, adeno-associated viruses, retroviruses, cowpox, adenoviruses, avian pox, baculoviruses, SV40, and Pseudorabies virus.
As used herein, the term “host cell” refers to a cell in which an exogenous polynucleotide has already been introduced, including progenies of such a cell. The host cell includes “transformants” and “transformed cells”, which include cells transformed from primary cells and progenies derived therefrom. The host cell is any type of cell system that can be used to produce the antibody molecules of the present disclosure, including eukaryotic cells, such as mammalian cells, insect cells, and yeast cells; and prokaryotic cells, such as Escherichia coli cells. The host cell includes cultured cells and also includes cells in transgenic animals, transgenic plants, or in cultured plant tissues or animal tissues.
As used herein, “treatment” means that after afflicting from a disease, a subject is brought into contact (e.g., administered) with the cyclic RNA, linear RNA, recombinant nucleic acid molecule, recombinant expression vector, or composition of the present disclosure, thereby alleviating symptoms of the disease as compared to the case of no contact, which does not mean that the symptoms of the disease have to be completely suppressed. Affliction from a disease refers to occurrence of symptoms of the disease in the body.
As used herein, “prevention” means that before afflicting from a disease, a subject is brought into contact (e.g., administered) with the cyclic RNA, linear RNA, recombinant nucleic acid molecule, recombinant expression vector, composition or the like of the present disclosure, thereby alleviating symptoms after affliction from the disease as compared to the case of no contact, which does not mean that illness has to be completely suppressed.
Unless otherwise defined or expressly indicated by the context, all technical and scientific terms used herein have the same meanings as typically understood by one of ordinary skill in the art to which the present disclosure belongs.
Construction Method for gRNA Mutant
The present disclosure provides a method for constructing a gRNA mutant, comprising the following steps:
The method for constructing a gRNA mutant according to the present disclosure enables an acquisition of the gRNA mutant that is incompletely complementary to the target sequence of the nucleic acid of interest by mutating the guide sequence region in the gRNA, and the gRNA mutant is also referred to as an imperfect guide-RNA (igRNA). Compared with the gRNA, the igRNA has a narrowed editing window for the base editor, and could effectively improve the base editing specificity, and improves the base editing efficiency to some extent, resolving the defect that gene editing with high specificity at a specific site is difficult to realize because there are many base editing sites within the editing window of an available base editor.
In some preferred embodiments, the screening step comprises: screening the mutant with an editing window for the base editor being a single base, to obtain the gRNA mutant. The gRNA mutant, after applied to the base editor, is capable of realizing the precise modification of a specific single base, which provides a base editing tool with high specificity for gene therapy of diseases, animal model construction, molecular breeding, etc., and is particularly suitable for correction of pathogenic single-base mutations in genetic diseases.
In some embodiments, the guide sequence region has a first end proximal to a PAM sequence of the nucleic acid of interest and a second end distal to the PAM sequence: the guide sequence region has m nucleotides, and any one of the mutated nucleotides is located at the position of the nth nucleotide starting from the second end, 1≤n≤m, where m and n are positive integers.
The present disclosure has discovered that the position of the base mutation in gRNA has a great influence on the size of the editing window; which is specifically subject to the number of nucleotides spaced between the mutated nucleotide and the PAM sequence in the nucleic acid of interest.
In some preferred embodiments, 2≤n≤m/2. In some more preferred embodiments, 2≤n≤m/3. By introducing mutant nucleotides at the aforementioned positions, the editing windows of the gRNA mutants applied to the base editor can be effectively narrowed, providing unequivocal and implementable modification positions for gRNA modifications.
In the present disclosure, the number of nucleotides in the guide sequence region is not specifically limited, and its concrete number may be designed according to the type of the Cas protein for the base editor and the target sequence bound.
In some alternative embodiments, the number m of nucleotides in the guide sequence region is any integer from 15 to 30. In some preferred embodiments, the number m of nucleotides in the guide sequence region is any integer from 15 to 25. In some preferred embodiments, the gRNA mutant is applied to the CRISPR/Cas9 system, and the number m of nucleotides in the guide sequence region is any integer from 19 to 21. Furthermore, any one of the mutated nucleotides is located at a position of the 1st to 12th nucleotides, preferably at a position of the 2nd to 10th nucleotides, more preferably at a position of the 2nd to 6th nucleotides, more preferably at a position of the 2nd to 6th nucleotides, starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 20 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 10th nucleotides, preferably at a position of the 2nd to 10th nucleotides, more preferably at a position of the 2nd to 6th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th, 6th, 7th, 8th, 9th, or 10th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 21 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 10th nucleotides, preferably at a position of the 2nd to 10th nucleotides, more preferably at a position of the 2nd to 6th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, or 10th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 22 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 11th nucleotides, preferably at a position of the 2nd to 11th nucleotides, more preferably at a position of the 2nd to 6th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th, 6th, 7th, 8th, 9th, 10th, or 11th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 23 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 11th nucleotides, preferably at a position of the 2nd to 11th nucleotides, more preferably at a position of the 2nd to 6th nucleotides, starting from the second end.
Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th. 6th, 7th, 8th, 9th, 10th, or 11th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 24 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 12th nucleotides, preferably at a position of the 2nd to 12th nucleotides, more preferably at a position of the 2nd to 7th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th, 6th, 7th, 8th, 9th, 10th, 11th, or 12th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 25 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 12th nucleotides, preferably at a position of the 2nd to 12th nucleotides, more preferably at a position of the 2nd to 7th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th, 6th, 7th, 8th, 9th, 10th, 11th, or 12th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 19 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 9th nucleotides, preferably at a position of the 2nd to 9th nucleotides, more preferably at a position of the 2nd to 5th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th, 6th, 7th, 8th, or 9th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 18 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 9th nucleotides, preferably at a position of the 2nd to 9th nucleotides, more preferably at a position of the 2nd to 5th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th, 6th, 7th, 8th, or 9th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 17 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 8th nucleotides, preferably at a position of the 2nd to 8th nucleotides, more preferably at a position of the 2nd to 5th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd. 4th 5th, 6th, 7th, or 8th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 16 nucleotides. Accordingly, any one of the corresponding mutated nucleotides is located at a position of the 1st to 8th nucleotides, preferably at a position of the 2nd to 8th nucleotides, more preferably at a position of the 2nd to 5th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th, 5th, 6th, 7th, or 8th nucleotide starting from the second end.
Exemplarily, the gRNA mutant is applied to the CRISPR/Cas9 system, and the guide sequence region has 15 nucleotides. Accordingly, any one of the mutated nucleotides is located at a position of the 1st to 7th nucleotides, preferably at a position of the 2nd to 7th nucleotides, more preferably at a position of the 2nd to 4th nucleotides, starting from the second end. Exemplarily, any one of the mutated nucleotides is located at the position of the 1st, 2nd, 3rd, 4th 5th, 6th, or 7th nucleotide starting from the second end.
In other embodiments, m and n may also be other values, which are not exhausted in the present disclosure.
In some embodiments, the mutated nucleotide comprises a substitution, deletion or insertion of 1 to 10 bases, preferably a substitution, deletion or insertion of 1 to 5 bases, more preferably a substitution, deletion or insertion of 1 to 3 bases. Exemplarily, the mutated nucleotide comprises substitutions, deletions, or insertions of 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10 bases. The present disclosure has found that the number of bases included in the mutated nucleotide directly influences the specificity and efficiency of the gRNA mutant for base editing. When the number of the mutated bases is within the above range, it is possible to obtain a gRNA mutant with high specificity and with an editing window optimally narrowed to one base.
It is to be noted that the mutated bases may be located at any one or more positions of the 1st nucleotide to the nth nucleotide starting from the second end in the guide sequence region. The mode of mutation in a base at any position may be any one independently selected from the substitution, deletion, and insertion. As long as the total number of mutated bases included in the mutated nucleotides falls within the range provided in the present disclosure, the specificity of the gRNA for gene editing may be optimized.
Furthermore, the number of mutated bases included in the mutated nucleotides is further determined by the type of the base editor.
In some preferred embodiments, the base editors are adenine base editors (ABEs). The mutated nucleotides of the gRNA mutant for the adenine base editor contain a substitution, deletion or insertion of 1 to 5 bases, preferably a substitution, deletion or insertion of 1 to 3 bases, more preferably a substitution, deletion or insertion of one base. Exemplarily, the mutated nucleotides of the gRNA mutant for the adenine base editor contain a substitution, deletion or insertion of 1 base, 2 bases, 3 bases, 4 bases, 5 bases and so forth. The present disclosure has found in the experiments that when the number of mutated bases in the gRNA mutants of some genes is set as 1, an adenine base editor capable of mutating adenine at one specific site is obtained.
In some preferred embodiments, the base editors are cytosine base editors (CBEs). The mutated nucleotides of the gRNA mutant for the cytosine base editor contain a substitution, deletion or insertion of 1 to 5 bases, preferably a substitution, deletion or insertion of 1 to 3 bases, more preferably substitutions, deletions or insertions of 2 or 3 bases. Exemplarily, the mutated nucleotides of the gRNA mutant for the cytosine base editor contain a substitution, deletion or insertion of 1 base, 2 bases, 3 bases, 4 bases, 5 bases and so forth. The present disclosure has found in the experiments that when the number of mutated bases in the gRNA mutants of some genes is set as 2 or 3, a cytosine base editor capable of editing cytosine at one specific site is obtained.
gRNA Mutant
The present disclosure provides a gRNA mutant, which is obtained by the construction method according to the present disclosure. The gRNA mutant, when used for the base editor, is capable of effectively improving the specificity of the base editing, narrowing the editing window for the base editor, and finally realizing efficient and specific editing of a particular single base, thereby providing a positive and efficient gene editing strategy for treatment of diseases and construction of animal and plant models.
Furthermore, the gRNA mutant further comprises a repetitive sequence region for binding a Cas protein. The repetitive sequence region may be folded to form a specific structure (such as a stem-loop structure) to be recognized by the Cas protein. After binding to the gRNA mutant, the Cas protein recognizes the target sequence by recognizing the PAM (protospacer adjacent motif) sequence adjacent to the target sequence of the nucleic acid of interest. After the mutation sequence region of the gRNA mutant is hybridized with the target sequence, the double strands of the nucleic acid of interest are unwinded to form an R-loop region, attaining a ternary complex of the Cas protein-gRNA-target sequence, and the single-stranded nucleic acid strand of the R-loop region is further edited. Depending upon the difference in CRISPR/Cas systems to which the gRNA mutants are applied, the repetitive sequence region may be linked to the 5′ end or the 3′ end of the guide sequence region. In some alternative embodiments, the gRNA mutant is applied to the CRISPR/Cas9 system, and the gRNA mutant includes the nucleotide sequence of 5′-guide sequence region-repetitive sequence region-3′. In some alternative embodiments, the gRNA mutant is applied to the CRISPR/Cpf1 system, and the gRNA mutant includes the nucleotide sequence of 5′-repetitive sequence region-guide sequence region-3′.
In the present disclosure, the nucleotide sequence of the mutation sequence region of the gRNA mutant is not specifically limited, so long as the sequence of the mutation region thereof enables the gRNA mutant to be hybridized with the target sequence to form a double strand and to unwind the nucleic acid of interest to form a R-loop region; and enables the editing window of the gRNA mutant for the base editor to be narrowed.
In some alternative embodiments, the gRNA mutant is used to mutate the base C at a specific site of the HIRA locus into base T, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 1. The nucleotide sequence of the mutation sequence region of the gRNA mutant (hereinafter referred to as igRNA) is set forth in any one of SEQ ID NOs: 2 to 5. Of these, the sequence set forth in SEQ ID NO: 2 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 1, the sequence set forth in SEQ ID NO: 3 has a substitution of one base at the position corresponding to the 5th nucleotide of the sequence set forth in SEQ ID NO: 1, the sequence set forth in SEQ ID NO: 4 has a deletion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 1, and the sequence set forth in SEQ ID NO: 5 has an insertion of two bases at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 1. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 1, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the HIRA locus, have a universally improved base editing specificity to the C6 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base C at a specific site of the DNMT3B locus into base T, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 6. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 7 to 10. Of these, the sequence set forth in SEQ ID NO: 7 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 6, the sequence set forth in SEQ ID NO: 8 has substitutions of two bases at the positions corresponding to the 3rd and 4th nucleotides of the sequence set forth in SEQ ID NO: 6, the sequence set forth in SEQ ID NO: 9 has a deletion of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 6, and the sequence set forth in SEQ ID NO: 10 has an insertion of two bases at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 6. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 6, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the DNMT3B locus, have a universally improved base editing specificity to the C8 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base C at a specific site of the RNF2 locus into base T, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 11. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 12 to 14. Of these, the sequence set forth in SEQ ID NO: 12 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 11, the sequence set forth in SEQ ID NO: 13 has a deletion of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 11, and the sequence set forth in SEQ ID NO: 14 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 11. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 11, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the RNF2 locus, have a universally improved base editing specificity to the C6 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base C at a specific site of the NSD1 locus into base T, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 19. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 20 to 22. Of these, the sequence set forth in SEQ ID NO: 20 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 19, the sequence set forth in SEQ ID NO: 21 has deletions of two bases at the positions corresponding to the 2nd and 3rd nucleotides of the sequence set forth in SEQ ID NO: 19, and the sequence set forth in SEQ ID NO: 22 has an insertion of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 19. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 19, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the NSD1 locus, have a universally improved base editing specificity to the C6 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the PSMB2 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 23. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 24 to 27. Of these, the sequence set forth in SEQ ID NO: 24 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 23, the sequence set forth in SEQ ID NO: 25 has a substitution of one base at the position corresponding to the 5th nucleotide of the sequence set forth in SEQ ID NO: 23, the sequence set forth in SEQ ID NO: 26 has a deletion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 23, and the sequence set forth in SEQ ID NO: 27 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 23. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 23, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the PSMB2 locus, have a universally improved base editing specificity to the A5 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the ABCA3 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 28. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 29 to 31. Of these, the sequence set forth in SEQ ID NO: 29 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 28, the sequence set forth in SEQ ID NO: 30 has a deletion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 28, and the sequence set forth in SEQ ID NO: 31 has an insertion of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 28. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 28, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the ABCA3 locus, have a universally improved base editing specificity to the A5 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the EMX1-SITE3 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 32. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 33 to 36. Of these, the sequence set forth in SEQ ID NO: 33 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 32, the sequence set forth in SEQ ID NO: 34 has a substitution of one base at the position corresponding to the 4th nucleotide of the sequence set forth in SEQ ID NO: 32, the sequence set forth in SEQ ID NO: 35 has a deletion of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 32, and the sequence set forth in SEQ ID NO: 36 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 32. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 32, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the EMX1-SITE3 locus, have a universally improved base editing specificity to the A6 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the VISTA HS267 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 37. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 38 to 41. Of these, the sequence set forth in SEQ ID NO: 38 has a substitution of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 37, the sequence set forth in SEQ ID NO: 39 has a substitution of one base at the position corresponding to the 4th nucleotide of the sequence set forth in SEQ ID NO: 37, the sequence set forth in SEQ ID NO: 40 has a deletion of one base at the position corresponding to the 4th nucleotide of the sequence set forth in SEQ ID NO: 37, and the sequence set forth in SEQ ID NO: 41 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 37. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 37, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the VISTA HS267 locus, have a universally improved base editing specificity to the A5 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the SNCA locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 42. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 43 to 46. Of these, the sequence set forth in SEQ ID NO: 43 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 42, the sequence set forth in SEQ ID NO: 44 has a substitution of one base at the position corresponding to the 5th nucleotide of the sequence set forth in SEQ ID NO: 42, the sequence set forth in SEQ ID NO: 45 has a deletion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 42, and the sequence set forth in SEQ ID NO: 46 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 42. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 42, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the SNCA locus, have a universally improved base editing specificity to the A5 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the ANO5 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 47. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 48 to 51. Of these, the sequence set forth in SEQ ID NO: 48 has a substitution of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 47, the sequence set forth in SEQ ID NO: 49 has a substitution of one base at the position corresponding to the 4th nucleotide of the sequence set forth in SEQ ID NO: 47, the sequence set forth in SEQ ID NO: 50 has a deletion of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 47, and the sequence set forth in SEQ ID NO: 51 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 47. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 47, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the ANO5 locus, have a universally improved base editing specificity to the A7 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the KCNQ2 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 52. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 53 to 55. Of these, the sequence set forth in SEQ ID NO: 53 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 52, the sequence set forth in SEQ ID NO: 54 has a deletion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 52, and the sequence set forth in SEQ ID NO: 55 has an insertion of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 52. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 52, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the KCNQ2 locus, have a universally improved base editing specificity to the A5 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the NOTCH2 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 56. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 57 to 59. Of these, the sequence set forth in SEQ ID NO: 57 has a substitution of one base at the position corresponding to the 6th nucleotide of the sequence set forth in SEQ ID NO: 56, the sequence set forth in SEQ ID NO: 58 has a deletion of one base at the position corresponding to the 4th nucleotide of the sequence set forth in SEQ ID NO: 56, and the sequence set forth in SEQ ID NO: 59 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 56. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 56, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the NOTCH2 locus, have a universally improved base editing specificity to the A5 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the GFI1 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 60. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 61 to 63. Of these, the sequence set forth in SEQ ID NO: 61 has a substitution of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 60, the sequence set forth in SEQ ID NO: 62 has a deletion of one base at the position corresponding to the 4th nucleotide of the sequence set forth in SEQ ID NO: 60, and the sequence set forth in SEQ ID NO: 63 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 60. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 60, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the GFI1 locus, have a universally improved base editing specificity to the A5 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the CFAP61 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 71. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 72 to 75. Of these, the sequence set forth in SEQ ID NO: 72 has a substitution of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 71, the sequence set forth in SEQ ID NO: 73 has substitutions of two bases at the positions corresponding to the 2nd and 3rd nucleotides of the sequence set forth in SEQ ID NO: 71, the sequence set forth in SEQ ID NO: 74 has a deletion of one base at the position corresponding to the 6th nucleotide of the sequence set forth in SEQ ID NO: 71, and the sequence set forth in SEQ ID NO: 75 has an insertion of one base at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 71. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 71, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the CFAP61 locus, have a universally improved base editing specificity to the A11 site, and the editing efficiency is also improved.
In some alternative embodiments, the gRNA mutant is used to mutate the base A at a specific site of the Query_55451 locus into base G, and the nucleotide sequence of the guide sequence region of the unmutated gRNA is set forth in SEQ ID NO: 76. The nucleotide sequence of the mutation sequence region of the gRNA mutant is set forth in any one of SEQ ID NOs: 77 to 80. Of these, the sequence set forth in SEQ ID NO: 77 has substitutions of two bases at the positions corresponding to the 2nd and 3rd nucleotides of the sequence set forth in SEQ ID NO: 76, the sequence set forth in SEQ ID NO: 78 has a substitution of one base at the position corresponding to the 2nd nucleotide of the sequence set forth in SEQ ID NO: 76, the sequence set forth in SEQ ID NO: 79 has deletions of two bases at the positions corresponding to the 3rd and 4th nucleotides of the sequence set forth in SEQ ID NO: 76, and the sequence set forth in SEQ ID NO: 80 has an insertion of two bases at the position corresponding to the 3rd nucleotide of the sequence set forth in SEQ ID NO: 76. Compared to the gRNA whose guide sequence region is the sequence set forth in SEQ ID NO: 76, igRNAs with different modes of mutation, when applied to the cytosine base editors for editing the Query_55451 locus, have a universally improved base editing specificity to the A9 site, and the editing efficiency is also improved.
It follows from the above that the gRNA mutants obtained by the construction method according to the present disclosure could universally improve the editing specificity of the cytosine base editor and the adenine base editor, providing a universal and effective optimization strategy for narrowing the editing window of the base editor.
In the present disclosure, the nucleotide sequence of the repetitive sequence region of the gRNA mutant is not specifically limited, so long as the sequence of the mutation region thereof enables the gRNA mutant to bind to the Cas protein and guide the Cas protein to target the target sequence that edits the nucleic acid of interest.
In some alternative embodiments, the gRNA mutant is applied to the CRISPR/spCas9 system, and the repetitive sequence region of the gRNA mutant includes a nucleotide sequence set forth in either of the following (v) and (vi):
In some alternative embodiments, the gRNA mutant is applied to the CRISPR/saCas9 system, and the repetitive sequence region of the gRNA mutant includes a nucleotide sequence set forth in either of the following (vii) and (viii):
In some embodiments, the present disclosure provides an isolated polynucleotide, which encodes the gRNA mutant according to the present disclosure, and could be used for mass production of gRNA mutants.
In some embodiments, the present disclosure provides a recombinant expression vector, comprising the isolated polynucleotide according to the present disclosure. The vectors for linking polynucleotides may be of various types commonly used in the art and suitable for cell transduction in vivo or in vitro.
In some embodiments, the present disclosure provides a recombinant host cell, comprising the recombinant expression vector according to the present disclosure. The gRNA mutant according to the present disclosure could be obtained through replication and expression of the recombinant expression vector in the recombinant host cell.
The method for narrowing the base editor provided in the present disclosure comprises constructing a target gRNA mutant by using the method for constructing a gRNA mutant according to the present disclosure.
The method for narrowing the base editor provided in the present disclosure could improve the base editing specificity without an influence on the base editing efficiency and effectively narrow base editing windows: moreover, the method is universal in narrowing of editing windows, and is suitable for optimizing cytosine base editors and adenine base editors to expand the applications of base editors in the fields such as animal model construction, researches on functional genomics, molecular breeding, clinical medicine, and translational medicine.
In some preferred embodiments, the method for narrowing the base editor narrows the editing window of the base editor to one base. Since most of genetic diseases are caused by single-base mutations, narrowing the window of the base editor to one base by the method according to the present disclosure could specifically correct pathogenic point mutations and avoid unnecessary base editing during the gene editing, which provides a positive and effective therapeutic tool for the treatment of genetic diseases.
The base editor provided in the present disclosure comprises either of the following (i) and (ii) and either of the following (iii) and (iv):
The polynucleotide that expresses the gRNA mutant may be a DNA nucleic acid molecule that is transcribed to the gRNA mutant, or an RNA nucleic acid molecule containing the gRNA mutant. The recombinant expression vector may be formed through recombination by ligating the nucleotide sequence encoding the gRNA mutant with any type of vector. For example, the recombinant expression vector is a viral vector containing the nucleotide sequence that encodes the gRNA mutant, and so forth.
The polynucleotide that expresses the fusion protein may be an RNA molecule that produces the fusion protein by translation, or a DNA molecule that produces the above RNA molecule by transcription. The recombinant expression vector may be formed through recombination by ligating the nucleotide sequence encoding the fusion protein with any type of vector. For example, the recombinant expression vector is a viral vector containing an open reading frame that encodes the fusion protein, and so forth.
In the present disclosure, the first domain and the second domain of the fusion protein may be linked directly or indirectly via a linker peptide.
The first domain is a Cas protein having a lost or reduced nuclease activity. In some embodiments, the first domain is a Cas protein mutant that loses the nuclease activity, or a homologue or polypeptide fragment thereof, and the fusion protein binds to the gRNA via the first domain and recognizes the target sequence adjacent to the PAM sequence of the nucleic acid of interest to form a ternary complex of fusion protein-gRNA mutant-target sequence. Due to loss of the nuclease activity for the Cas protein, the first domain will not cleave the target strand or non-target strand after formation of the ternary complex, and only the base modification activity of the second domain is relied on to mutate the base within the editing window of the target sequence. In some embodiments, the first domain is a Cas protein mutant with a reduced nuclease activity, or a homologue or polypeptide fragment thereof, and the first domain cleaves one of the target strands or non-target strands after formation of the ternary complex. In some embodiments, the first domain is a Cas protein mutant with an expanded PAM frame, or a homologue or polypeptide fragment thereof. Exemplarily,
In the present disclosure, there are many different choices for the Cas protein. Exemplarily, the Cas protein is selected from Cas9 proteins (WP_032462936, WP_165886160, WP 002460848, and WP_002807152), Cas12a proteins (i.e., Cpf1 proteins, PDB: 6KLB_A, PDB: 6KLB_D, and UniProtKB/Swiss-Prot: U2UMQ6), Cas12b proteins (WP_217021837, WP_163299037, and WP_027726362), Cas13a protein (UniProtKB/Swiss-Prot: PODPB8), and so forth.
In some specific embodiments, the Cas protein is Streptococcus pyogenes Cas9 (SpCas9), Staphylococcus aureus Cas9 (SaCas9), Lachnospiraceae bacterium Cpf1 (LbCpf1), etc.
In some alternative embodiments, the first domain is a SpCas9 protein mutant. Furthermore, the SpCas9 protein mutant is selected from SpCas9 protein mutants with a lost nuclease activity (SpdCas9) or SpCas9 protein mutants with a reduced nuclease activity (SpnCas9). Exemplarily, the amino acid sequence of SpdCas9 is set forth in SEQ ID NO: 85, or is a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO: 85. The amino acid sequence of SpnCas9 is set forth in SEQ ID NO: 86, or is a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO: 86.
In some alternative embodiments, the first domain is a SaCas9 protein mutant. Furthermore, the SaCas9 protein mutant is selected from SaCas9 protein mutants with a lost nuclease activity (SadCas9) or SaCas9 protein mutants with a reduced nuclease activity (SanCas9). Exemplarily, the amino acid sequence of SadCas9 is set forth in SEQ ID NO: 87, or is a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO: 87. The amino acid sequence of SanCas9 is set forth in SEQ ID NO: 88, or is a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO: 88.
In some alternative embodiments, the first domain is an LbCpf1 protein mutant. Furthermore, the LbCpf1 protein mutant is an LbCpf1 protein mutant with a lost nuclease activity (LbdCpf1). Exemplarily, the amino acid sequence of LbdCpf1 is set forth in SEQ ID NO: 89, or a sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to the sequence set forth in SEQ ID NO: 89.
In some embodiments, the base editor of the present disclosure enables editing of one base at any site by selecting an appropriate PAM sequence and a Cas protein recognizing the PAM sequence.
The second domain has a base modification activity. In some embodiments, the base modification activity is a deaminase activity, and the second domain is a deaminase, or a mutant, homologue or polypeptide having or at least partially having a deaminase activity. At the time of forming a ternary complex of fusion protein-gRNA mutant-target sequence, the second domain mutates the base within the editing window by deaminizing the base within the editing window.
In some embodiments, the second domain is an adenine deaminase, or a mutant, homologue or polypeptide fragment having or partially having an adenine deaminase activity. The adenine deaminase will mutate A within the editing window into G. Because of having an editing window with optimal one base, the base editor of the present disclosure could be used to realize one specific base mutation from A to G.
The amino acid sequence of the adenine deaminase, or the mutant, polypeptide or homologue thereof is not specifically limited in the present disclosure, so long as the adenine deaminase, or the mutant, polypeptide or homologue thereof could have an enzyme activity of the adenine deaminase and realize the A-to-G mutation.
In some alternative embodiments, the second domain is an enzyme having the adenine deaminase activity that is at least one selected from the group consisting of the following (c1) and (c2):
In some alternative embodiments, the second domain is tRNA adenosine deaminase TadA from Escherichia coli, or a mutant, homologue or polypeptide in which the adenosine deaminase activity is maintained. Exemplarily, the amino acid sequence of TadA may refer to NP_417054.
In some alternative embodiments, the second domain is a mutant TadA* of TadA having an adenosine deaminase activity. Exemplarily, the amino acid sequence of TadA* is set forth in SEQ ID NO: 83 or SEQ ID NO: 84. TadA from Escherichia coli or the mutant TadA* of TadA could realize the mutation from adenine A to guanine G by deaminizing the adenine within the editing window.
In some embodiments, the second domain is a cytosine deaminase, or a mutant, homologue or polypeptide fragment having or partially having the cytosine deaminase activity. The cytosine deaminase will mutate C within the editing window into T. Because of having an editing window with optimal one base, the base editor of the present disclosure could be used to realize a specific base mutation from C to T.
The present disclosure does not specifically limit the amino acid sequence of the cytosine deaminase or the mutant, polypeptide or homologue thereof, so long as it could have an enzyme activity of the cytosine deaminase and realize the C-to-T mutation.
In some alternative embodiments, the second domain is an enzyme having the cytosine deaminase activity that is at least one selected from the group consisting of the following (d1) and (d2):
Exemplarily, the second domain may be any one selected from the following cytosine deaminases, or mutants, homologues or polypeptides thereof: AID (the amino acid sequence may refer to NP_001317272, NP_065712), APOBEC3A (e.g.: AKE33285, AEH96362, ACH92046, CAK54680), APOBEC3G (e.g.: human APOBEC3G, the amino acid sequence may refer to NP_068594, NP_001336365, NP_001336366, NP_001336367), APOBEC1 (e.g.: APOBEC1, the amino acid sequence may refer to NP_001127863, NP_112436; human APOBEC1, the amino acid sequence may refer to NP_001291495, NP_001635, NP_005880), CDA1 (e.g.: Lampetra japonica CDA1, ABO15149, ABO15150).
The composition provided in the present disclosure comprises the gRNA mutant, isolated polynucleotide, recombinant expression vector, recombinant host cell or base editor according to the present disclosure. Due to the inclusion of optimized gRNAs, the base editors of the present disclosure exhibit a high specificity to the gene editing of the nucleic acid of interest, could achieve efficient editing of one base, and hold great potential in gene therapy, construction of animal and plant models, etc.
In some embodiments, the composition further comprises one or more pharmaceutically acceptable carriers. In the present disclosure, the composition is used to facilitate the administration to an organism, facilitate absorption of active ingredients, and thus exert the bioactivity. The composition of the present disclosure may be administered in any form including injection (intra-arterial, intravenous, intramuscular, intraperitoneal, or subcutaneous), mucosa administration, oral administration (an oral solid preparation or an oral liquid preparation), rectal administration, inhalation, implantation, topical administration (e.g., eyes), etc. Non-limiting examples of the oral solid preparations include, but are not limited to, powders, capsules, pastilles, granules, tablets, etc. Non-limiting examples of the liquid preparations for oral or mucosa administration include, but are not limited to, suspensions, tinctures, elixirs, solutions, etc. Non-limiting examples of the preparations for topical administration include, but are not limited to, emulsions, gels, ointments, creams, patches, pastes, foams, lotions, drops, or serum preparations. Non-limiting examples of the preparations for parenteral administration include, but are not limited to, solutions for injection, dry powders for injection, suspensions for injection, emulsions for injection, etc. The composition of the present disclosure may be further prepared into controlled-release or extended-release dosage forms (e.g., liposomes or microspheres).
In the present disclosure, the routes of administration may be varied or adapted in any applicable way to meet the requirements for the properties of drugs, the convenience of the patients and medical personnel, and other related factors.
Use of gRNA Mutant and Base Editor
In some embodiments, the present disclosure provides use of the gRNA mutant or base editor as a reagent or kit or in the preparation of a reagent or kit for single-base editing. The gRNA mutants or base editors of the present disclosure could achieve more controllable and more specific editing of one base, thereby overcoming the defect that conventional base editors cannot edit one specific base because their editing windows include multiple base-editing sites, and they are particularly useful as single-base editing tools in the fields such as gene therapy and construction of animal and plant models.
In some embodiments, the present disclosure provides use of the gRNA mutant or base editor as a medication or in the preparation of a medication for gene therapy. Because of having a narrowed editing window; the gRNA mutant or base editor could realize specific editing of one base and avoid side effects caused by additional gene editing, and it is suitable as a gene therapy medication for diseases and has a great prospect of medical applications.
Other purposes, features, and advantages of the present disclosure will become apparent from the following detailed descriptions. It is understood, however, that the detailed descriptions and specific examples (while indicating the specific embodiments of the present disclosure) are provided by way of illustration only because various changes and modifications made within the spirit and scope of the present disclosure will become obvious to a person skilled in the art after reading the detailed descriptions.
The present disclosure will be illustrated with reference to the specific examples. The reagents, samples, and the like used in the examples are all commercially available or accessible to the public by other approaches; and they are provided only as instances but are not unique to the present disclosure and may be replaced by other appropriate tools and biological materials, respectively. The experimental procedures involved may be carried out according to the conditions and methods described in Molecular Cloning: A Laboratory Manual (Third Edition) (Science Press, 2002) and may be carried out according to the manufacturers' instructions for commercially available enzymes or kits. Other experimental methods not described in detail herein are all conventional methods known to a person skilled in the art unless otherwise specified. The sequencing and gene synthesis described in the following examples have been conducted in GENEWIZ, Inc.
BE4max (Addgene: 112093) and hy BE4max (Addgene: 157942) base editor plasmids
NG-ABEmax (Addgene: 124163) base editor plasmid and SaABEmax (Addgene: 119814)
In mammalian cells, different editing sites were edited by cytosine base editors with igRNAs, and the percentages of base editing of single-window were increased by up to 38.50 folds compared to the case of using unmodified gRNAs.
Experimental process: HEK293T or Hela cells were plated in a 24-well plate at 5×105 cells/well. When the cells in each well grew to 40% to 60% confluency, two representative cytosine base editor plasmids BE4max and hyBE4max, which were commonly used at present, were transfected with Lipofectamine 2000 (Life, Invitrogen, 11668019) reagents into HEK293T or Hela, together with gRNA plasmids targeted to different editing sites and igRNA plasmids modified with insertions, deletions or mismatches at the amounts of 600 ng of editor plasmids and 300 ng of gRNA/igRNA plasmids, respectively. Transfection of each combination of plasmids was performed in triplicate. After 24 h of transfection, 5 μg/ml of puromycin (Merck, USA) was added to the medium. After 120 h of transfection, QuickExtract DNA Extraction Solution (Epicentre, USA) was used to extract genomic DNA. The regions of 200 bp to 300 bp across the edited sites were amplified by PCR using the Taq DNA polymerase (CWBIO, China). The PCR products were subjected to high-throughput sequencing to calculate the editing efficiency (GENEWIZ, China).
Experimental results: Five different genomic loci NSD1, DNMT3B, RNF2, RNF216, and NSD1 were selected. At each locus, gRNA perfectly matching the editing sites and 3 to 5 igRNAs with insertions, deletions or mismatches compared to the editing sites were selected for editing. The results showed that at the NSD1 locus of HEK293T, when BE4max and igRNA (with a deletion of C at the third position) were used, the C-to-T editing efficiency of single C at the sixth position (C6) at the editing sites was increased from 3.34±0.29% to 34.89±0.23%, and the editing specificity (namely, the ratio of editing of the single C at the sixth position to all kinds of editing) was improved from 8.40±0.66% to 71.70±0.85%; and when hyBE4max and igRNA (with a deletion of C at the third position) were used, the C-to-T editing efficiency and editing specificity of single C at the sixth position (C6) at the editing sites were both improved. In case of using different igRNAs, the editing specificities and editing efficiencies of single C at C8 of DNMT3B, C6 of RNF2, C5 of RNF216, and C6 of NSD1 were all improved to some extent. The details were shown in Table 1 to Table 4.
In mammalian cells, different editing sites were edited by adenine base editors using modified gRNA, and the percentages of base editing of single-window were increased by up to 10.15 folds compared to the case of using unmodified gRNAs.
Experimental process: HEK293T or Hela cells were plated in a 24-well plate at 5×105 cells/well. When the cells in each well grew to 40% to 60% confluency, a representative adenine base editor NG-ABEmax plasmid, which was commonly used at present, was transfected with the Lipofectamine 2000 (Life, Invitrogen, 11668019) reagent into HEK293T or Hela, together with gRNA plasmids targeted to different editing sites and igRNA plasmids modified with insertions, deletions or mismatches at the amounts of 600 ng of editor plasmids and 300 ng of gRNA/igRNA plasmids, respectively. Transfection of each combination of plasmids was performed in triplicate. After 24 h of transfection, 5 μg/ml of puromycin (Merck, USA) was added to the medium. After 120 h of transfection, QuickExtract DNA Extraction Solution (Epicentre, USA) was used to extract genomic DNA. The regions of 200 bp to 300 bp across the edited sites were amplified by PCR using the Taq DNA polymerase (CWBIO, China). The PCR products were subjected to high-throughput sequencing to calculate the editing efficiency (GENEWIZ, China).
Experimental results: Nine different genomic loci PSMB2, ABCA3, EMX1-SITE3, VISTA hs267, SNCA, ANO5, KCNQ2, NOTCH2, and GFI1 were selected. At each locus, gRNA perfectly matching the editing sites and 3 to 5 igRNAs with insertions, deletions, mismatches compared to the editing sites were selected for editing.
The results showed that at the ANO5 locus of HEK293T, when NG-ABEmax and igRNA (with base A inserted at the third position) were used, the A-to-G editing specificity of single A at the seventh position (A7) at the editing sites was improved from 27.75±0.03% to 91.34±0.94%, and the editing efficiency was increased from 15.46±0.66% to 24.94±0.59%. In case of using different igRNAs, the editing specificities and editing efficiencies of single A at A5 of PSMB2, A5 of ABCA3, A6 of EMX1-SITE3, A5 of VISTA hs267, A5 of SNCA, A5 of KCNQ2, A5 of NOTCH2, and A5 of GFI1 were all improved accordingly. The details were shown in Table 5 and Table 6.
In mammalian cells, base editors with expanded PAM frames were selected. By changing the PAM frames, it would be possible to realize precise editing at any site. As shown in
Experimental process: The nCas9 in the representative adenine base editor NG-ABEmax plasmid, which was commonly used at present, was substituted with SpRYnCas9 capable of recognizing the PAM sequence as NRN or NYN (R was A/G and Y was C/T) to construct the adenine base editor SpRY-ABEmax that could recognize any PAM frame. HEK293T cells were plated in a 24-well plate at 5×105 cells/well. When the cells in each well grew to 40% to 60% confluency, the SpRY-ABEmax editor plasmids were transfected with the Lipofectamine 2000 (Life, Invitrogen, 11668019) reagents into HEK293T, together with gRNA plasmids targeted to different editing sites and igRNA plasmids modified with insertions, deletions or mismatches at the amounts of 600 ng of editor plasmids and 300 ng of gRNA/igRNA plasmids, respectively. Transfection of each combination of plasmids was performed in triplicate. After 24 h of transfection, 5 μg/ml of puromycin (Merck, USA) was added to the medium. After 120 h of transfection, QuickExtract DNA Extraction Solution (Epicentre, USA) was used to extract genomic DNA. The regions of 200 bp to 300 bp across the edited sites were amplified by PCR using the Taq DNA polymerase (CWBIO, China). The PCR products were subjected to high-throughput sequencing to calculate the editing efficiency (GENEWIZ, China).
Experimental results: Two different genomic loci EMX1-SITE3 and NOTCH2 were selected. At the EMX1-SITE3 locus, if N20 was selected as 5′-GGAGCACACATGCCCAGGTG-3′, the PAM sequence was NGG, and the NG-ABEmax was used for editing, the editing specificity of A at the sixth position (A6) was 48.20±0.31%, and the editing specificity of A at the eighth position (A8) was 0.90±0.01%; after use of the igRNA, the editing specificity of A at the sixth position (A6) was 72.37±0.40%, and the editing specificity of A at the eighth position (A8) was 3.18±0.16% and the percentage of A8 editing was still less: if N20 was selected as 5′-GCACACATGCCCAGGTGTGG-3′, the PAM sequence was NAG, and the SpRY-ABEmax was used for editing, the original A at the sixth position would become A at the third position and the original A at the eighth position would become A at the fifth position: under such circumstances, the editing specificity of A at the third position was 0.91±0.09% and the editing specificity of A at the fifth position was 36.06±0.47% in case of editing by the corresponding gRNA; and after use of igRNA (with the base T inserted at the second position), the editing specificity of A at the third position was 0.67±0.32% and the editing specificity of A at the fifth position was 72.24±1.45%, so the percentage of editing of A at the fifth position (the original A at the seventh position) was remarkably increased. Similar effects were achieved at the NOTCH2 locus. The details were shown in Table 7 and Table 8.
In mammalian cells, different editing sites were edited by SaCas9 adenine base editors using modified gRNAs, and the percentages of base editing of single-window were all somewhat increased compared to the case of using unmodified gRNAs.
Experimental process: HEK293T cells were plated in a 24-well plate at 5×105 cells/well. When the cells in each well grew to 40% to 60% confluency, the adenine base editor SaABEmax plasmid, which was commonly used at present, was transfected with the Lipofectamine 2000 (Life, Invitrogen, 11668019) reagent into HEK293T, together with gRNA plasmids targeted to different editing sites and igRNA plasmids modified with insertions, deletions or mismatches at the amounts of 600 ng of editor plasmids and 300 ng of gRNA/igRNA plasmids, respectively. Transfection of each combination of plasmids was performed in triplicate. After 24 h of transfection, 5 μg/ml of puromycin (Merck, USA) was added to the medium. After 120 h of transfection, QuickExtract DNA Extraction Solution (Epicentre, USA) was used to extract genomic DNA. The regions of 200 bp to 300 bp across the edited sites were amplified by PCR using the Taq DNA polymerase (CWBIO, China). The PCR products were subjected to high-throughput sequencing to calculate the editing efficiency (GENEWIZ, China).
Experimental results: Two different genomic loci CFAP61 and Query_55451 were selected. At each locus, gRNA perfectly matching the editing sites and 3 to 5 igRNAs with insertions, deletions, mismatches or the like compared to the editing sites were selected for editing.
The results showed that at the CFAP61 locus of HEK293T, when SaABEmax and igRNA (with the base G inserted at the third position) were used, the A-to-G editing specificity of single A at the eleventh position (All) at the editing sites was improved from 32.88±0.35% to 54.05±0.46%, and the editing efficiency was increased from 12.83±0.27% to 15.65±0.46%. In case of using different igRNAs, the editing specificities and editing efficiencies of single A at A9 of Query_55451 were improved accordingly. The details were shown in Table 9 and Table 10.
The examples of the present disclosure described above are merely instances provided to explain the present disclosure clearly, rather than limitations on the embodiments of the present disclosure. For a person skilled in the art, other variations or modifications in different forms may be further made on the basis of the above explanations. It is not necessary nor is it possible to exhaust all embodiments. Any modifications, equivalents, improvements and so on made within the spirit and principle of the present disclosure shall all be covered within the scope of protection for the claims of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202110873929.5 | Jul 2021 | CN | national |
This is the U.S. National Phase Application under 35 U.S.C. § 371 of International Patent Application No. PCT/CN2022/107988 filed Jul. 26, 2022, which claims the benefit of Chinese Patent Application No. 202110873929.5, filed on Jul. 30, 2021 with the Chinese Patent Office, entitled “METHOD FOR NARROWING EDITING WINDOW OF BASE EDITOR, BASE EDITOR, AND USE”, both of them are incorporated by reference herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2022/107988 | 7/26/2022 | WO |