The instant application contains a Sequence Listing which is being submitted in ASCII format via EFS-WEB and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Dec. 13, 2022, is named 42YV363668US_ST25.txt and is 20,487 bytes in size.
The present disclosure relates to the technical field of gene editing technology, and relates to a method, composition and application for activating γ-globin gene expression.
Hemoglobin (Hb for short) is a special protein that carries and transports oxygen in erythrocytes. Hemoglobin is composed of globin and heme. Hemoglobin in adults is mainly a tetramer (α2 β2) composed of 2 α-globins and 2 β-globins, which becomes an adult hemoglobin (HbA). Mutations in the β-globin gene (HBB) can cause β-hemoglobin disorders (also known as β-hemoglobinopathies), including sickle cell disease (SCD) and β-thalassemia ((β-thal). Sickle cell anemia is caused by point mutations in the structural gene of β-globin, resulting in the production of abnormal hemoglobins (HbS). β-thalassemia is caused by partial or complete defects in the β-globin gene expression, resulting in deficiency or absence of HbA. The reduction or absence of the globin chains of hemoglobins can lead to abnormal hemoglobin structures, the erythrocytes containing the same have reduced deformability, shortened lifespan, and may initiate in situ hemolysis while in the bone marrow, and may be destroyed in advance by the visceral organs such as spleen after entering the peripheral blood circulation, resulting in anemia, iron accumulation in the body and even dysplasia. Hemoglobinopathies affect millions of people around the world. Currently, approximately 330,000 children are born with hemoglobinopathies each year, which threatens human health and even life seriously. Patients with thalassemia and sickle cell disease are mainly relieved by standardized long-term blood transfusion and iron chelation therapy, which are not only incurable, but also having high safety risks. Allogeneic hematopoietic stem cell transplantation is the only treatment technology that can cure thalassemia and sickle cell disease currently. However, it is not able to be applied clinically in large scale for the limitations brought by factors such as low success rate of bone marrow matching and risk of immune rejection. Therefore, there is an urgent need to develop novel treatments which are safe and effective.
During human development, hemoglobin is not always composed of two α-globins and two β-globins. During embryonic development and shortly after birth, hemoglobin exists in the form of a tetramer (α2γ2) composed of two α-globin chains and two γ-globin chains, called fetal hemoglobin (HbF), which has a stronger oxygen affinity than HbA. With the development of fetus, the γ-globin is gradually silenced and not expressed, while the expression of β-globin gene near the same genomic site is gradually increased. About half a year after birth, the composition and proportion of hemoglobins in the blood gradually become stable, and HbF is replaced by adult hemoglobin (HbA), with only extremely low level of HbF (less than 1% of total hemoglobins) remained.
Studies have found that some populations have mutations at specific sites where the transcription of the γ-globin gene is activated, resulting in an increase in the proportion of HbF in hemoglobins, which is called hereditary persistence of fetal hemoglobin (HPFH). For example, some different types of mutations were found in the γ-globin gene promoter region, such as −114 to −102 13 bp del c., 4 bp del c.−225 to −222, c.−114C>T, c.−117G>A, c.−158C>T, c.−167C>T, c.−170>G>A, c.−175T>G, c.−175T>C, c.−195C>G, c.−196C>T, c.−198T>C, c.−201C>T, etc. These naturally occurring mutations increase the proportion of HbF in total hemoglobins to different extents. Clinical studies have found that in few patients with β-thalassemia or sickle cell anemia, the symptoms of anemia are reduced or relieved to a certain extent or the need for blood transfusions is reduced due to that the expression of HbF makes up the shortage of HbA resulting from the apropos presence of HPFH mutations in their genomes. Inspired by this finding, scientists continue to explore various ways to induce HbF expression to achieve the purpose of treating β-hemoglobinopathies. For example, hydroxyurea and other drugs are currently used clinically to induce HbF expression to treat β-hemoglobinopathies. However, the level of induced HbF in most patients is too low to completely improve the patient's condition.
Gene editing technology brings new hope and new methods for the treatments of genetic diseases such as hemoglobinopathies. In recent years, the breakthrough development of gene editing technology has made it possible and easier to artificially change the base sequence of specific sites in the genome. The gene editing technologies relatively well developed are ZFN (zinc finger nuclease), TALEN (transcription activator-like effector nuclease) and CRISPR (clustered regularly interspaced short palindromic repeats)/Cas system (CRISPR-Cas system). By designing a specific gene editing system, the target genomic DNA sites in a living cell is cleaved to form double-strand breaks (DSBs). Cells can use a non-homologous end joining (NHEJ) repair mechanism to repair DNA nicks, which can generate mutations randomly, such as sequence insertions, deletions, base substitutions, etc. If an artificially designed DNA donor template (donor template) is provided at the same time as gene editing, cells can use homology-directed repair (HDR) mechanism to complete the repair, thereby introducing the expected base mutation form at the target site.
Currently, there are several different approaches internationally for the therapeutic strategy of enhancing HbF expression in erythrocytes by gene editing. (1) Delete or destroy the erythroid enhancer element in the intron of BCL11A gene on human chromosome 2, and the BCL11A expression in erythrocytes is reduced, thereby the transcriptional inhibitory effect of BCL11A on the γ-globin gene is relieved. (2) Delete or destroy some sequences in the γ-globin gene promoter region to prevent the binding of transcriptional repressors, or introduce naturally occurring HPFH mutation types, such as 13 base pairs deletions (13 bp del c.) at site −114 to −102. (3) Generate deletion of a large fragment of DNA of 3.5 kb to 13.6 kb, and delete the sequence containing an unknown repressor between γ-globin gene and β-globin to promote the binding of γ-globin gene to the distal LCR enhancer. The approaches have shown the effect of activating HbF expression, but the long-term efficacy or/and safety of the approaches is unclear. Therefore, new approaches need to be further developed.
The present disclosure provides a method for activating γ-globin gene transcription by gene editing. The present disclosure uses a single-stranded oligonucleotide (ssODN) containing effective active GATA element or corresponding antisense complementary sequence, to generate a few base mutations (deletions, substitutions, insertions, etc.) in the proper position of the γ-globin gene promoter region by gene editing technology (such as CRISPR-Cas gene editing system). After editing, an erythroid enhancer containing the effective active GATA element is formed on the sense or antisense strand of the γ-globin gene promoter region, the effective active GATA element includes the sequence structure of NTG-N(7-8)-WGATAR or NAG-N(7-8)-WGATAR. It can promote the transcription of γ-globin gene in mature erythrocytes and increase HbF expression in erythrocytes. Herein, W is T or A; R is A or G; N is A, G, C or T, preferably C or T; N(7-8) refers to 7 to 8 random bases.
As shown in
The present disclosure provides an ssODN containing NTG-N(7-8)-WGATAR sequence, NAG-N(7-8)-WGATAR sequence or reverse complementary sequence thereof, which can be used for gene editing of γ-globin gene. In the present disclosure, the ssODN structure comprises a 5′ homology arm, a substitution sequence (used to substitute a base sequence of HBG1 or HBG2 gene) and a 3′ homology arm. Herein, the GATA or TATC sequence in NTG-N(7-8)-WGATAR sequence, NAG-N(7-8)-WGATAR sequence or reverse complementary sequence thereof of ssODN structure can be located at any position in the ssODN, including the substitution sequence, the 5′ homology arm, the 3′ homology arm, the junction of the 5′ homology arm and the substitution sequence, and the junction of the 3′ homology arm and the substitution sequence. The length of a unilateral homology arm could be 20 to 300 nt. In the embodiments of the present disclosure, the length of the ssODN homology arm is about 40 nt. In some embodiments of the present disclosure, the ssODN is a sequence selected from the group consisting of SEQ ID NO: 40 to SEQ ID NO: 65. In some embodiments thereof, the 5′ and 3′ of the ssODN are phosphorothioate modified.
The present disclosure discloses a composition for use in γ-globin gene editing, containing ssODN, sgRNA and CRISPR-Cas nuclease. In some embodiments of the present disclosure, the site targeted by sgRNA of the gene editing system is located in the sequences shown in any one of SEQ ID NO: 32 to SEQ ID NO: 39. In some embodiments of the present disclosure, the sgRNA is the sequence shown in any one of SEQ ID NO: 69 to SEQ ID NO: 76. In some embodiments of the present disclosure, the Cas9 protein is a Cas 9 protein derived from Streptococcus pyogenes.
In the present disclosure, gRNA (guide RNA) is a RNA capable of binding and guiding a nuclease, such as sgRNA (single guide RNA).
The disclosure also discloses an electroporation method for use in γ-globin gene editing. The method utilizes electroporation technology to efficiently transfect a human hematopoietic stem cell with the composition formed by ssODN, sgRNA, and CRISPR-Cas nuclease to mediate efficient gene editing.
The disclosure also discloses a hematopoietic stem cell. The composition comprising ssODN, sgRNA and CRISPR-Cas nuclease is transferred into the hematopoietic stem cell by electroporation method. The γ-globin gene promoter region in the cell contains active GATA element, which could increase the γ-globin gene expression during erythroid differentiation.
Compared with the prior art, the present disclosure has the following beneficial effects:
The ssODN of the present disclosure is introduced into cells together with a gene editing system (such as CRISPR-Cas, TALEN, ZFN, etc.), which guides the gene editing of the γ-globin gene promoter region, and mutate at specific sites to form GATA element (such as NTG-N(7-8)-WGATAR sequence or NAG-N(7-8)-WGATAR sequence) with an effective activating effect. In some embodiments of the present disclosure, the ssODN and the CRISPR-Cas gene editing system are introduced into human hematopoietic stem cells by electroporation technology, and efficient gene editing is achieved in the γ-globin gene promoter region. Mutations are introduced at specific sites to form the expected erythroid enhancer element containing NTG-N(7-8)-WGATAR sequence or NAG-N(7-8)-WGATAR sequence. After the hematopoietic stem cells differentiate into erythrocytes, the γ-globin gene expression significantly increases and the proportion of HbF in hemoglobin significantly increases.
In order to facilitate understanding of the present disclosure, the present disclosure will be described more comprehensively with reference to the related drawings as follows. Preferred embodiments of the disclosure are shown in the drawings but the present disclosure may be embodied in many different forms and is not limited thereto. On the contrary, the purpose of providing these embodiments is to understand the present disclosure thoroughly and comprehensively.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by the skilled in the art. The terms used herein in the description of the present disclosure are for the purpose of describing specific embodiments only and are not intended to limit the present disclosure. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
The raw materials used in the following examples are all commercially available.
The disclosure provides by using of gene editing technology (such as CRISPR-Cas-mediated gene editing technology), combined with the single-stranded oligonucleotides (ssODN) designed by the present disclosure, performing efficient gene editing on stem cells and progenitor cells with erythroid differentiation ability such as CD34+ hematopoietic stem cells. Mutations are made in the promoter of HBG1 and HBG2 genes (located on human chromosome 11) encoding human γ-globin, and a sequence structure of NTG-N(7-8)-WGATAR or NAG-N(7-8)-WGATAR is artificially created on the sense or antisense strand of the promoter (
GATA-1 is a transcriptional regulator that is specifically and highly expressed in the erythroid system and plays an important role in the process of erythroid differentiation. Genes that are specifically highly expressed in erythrocytes, such as α- and β-globin and heme biosynthetic enzymes, are directly activated by GATA-1 transcription. As a transcription factor, GATA-1 selectively binds to WGATAR sequence element on chromatin DNA (wherein W represents T or A, R represents A or G, and can also be represented by T/A(GATA)A/G; the antisense complementary sequence is YTATCW, wherein Y represents T or C, and can also be represented by T/C(TATC)T/A). It can be seen that the “antisense complementary” used herein refers to “reverse complementary” known in the art. For example, multiple GATA sites have also been found in the promoter region of the human β-globin gene. Notably, not all genes containing GATA sequence would be activated by GATA-1. This is because GATA sequence is widely distributed in the genome and often act with a need to form a certain form of sequence combination (element combination) with upstream and downstream sequences (cis elements). For example, there are cis-regulatory elements such as CACCC or GC box near the GATA sequence, which usually recruit GATA-1 to bind together to the transcription factor EKLF. For example, there is a E-box element (CAGNTG, wherein N represents any base) or half E-box element (NTG) near GATA sequence, which recruits GATA-1 to bind together to the transcription fact TALE ChIP-seq studies on the binding sites of GATA-1 and TAL1 at the genome level showed that about three-quarters of the GATA-1/TAL1 co-binding sites have a obvious sequence regularity, that there is often a half E-box element (NTG) which is 7-8 bases upstream of the WGATAR sequence, which can be represented by NTG-N(7-8)-WGATAR. Experiments have shown that this sequence is an active GATA element that promotes some gene expression in erythrocytes. For example, the +58 enhancer in the BCL11A intron contains the CTG-N(7)-TGATAA sequence, and the disruption of the GATA motif in this sequence can lead to a significantly reduction of BCL11A expression in erythrocytes.
The sequences of 1400 bases upstream of the transcription start site of HBG1 and HBG2 genes have an extremely high similarity and are considered to be the promoter region. The inventor of the present disclosure has found from studies that the promoter region of the γ-globin itself contains multiple GATA sequence (or TATC) and a large number of TG sequence, but γ-globin is hardly expressed in mature erythrocytes. The inventor of the present disclosure has found through profound research that GATA (or TATC) and TG sequences of the γ-globin gene promoter do not form effective active GATA elements in position and direction (such as TG-N(7-8)-WGATAR), and therefore cannot recruit transcription activators such as GATA-1/TAL1 that are highly expressed in mature erythrocytes to bind (co-bind) to the promoter region of the γ-globin gene, so γ-globin is silenced and is hardly expressed in mature erythrocytes.
Through profound research, the inventor of the present disclosure found that there are some sites in the promoter regions of HBG1 and HBG2, the sequences of the sense or antisense strands of which can form the sequence structure of NTG-N(7-8)-WGATAR or NAG-N(7-8)-WGATAR by changes (such as substitutions, deletions, insertions, etc.) of a few bases. The inventor of the present disclosure consider that it is possible to edit a certain proportion of the genome to an expected sequence structure if a gene editing system is used to cleave the sites to form DSBs, by adding a donor DNA template (such as a single-stranded oligonucleotide ssODN) to guide the sites for HDR repair. Finally, the γ-globin gene promoter region, can act as an enhancer to recruit highly-expressed transcription activators in mature erythrocytes such as GATA-1/TAL1, etc. to bind (co-bind) to the promoter region of the γ-globin gene due to the formation of the sequence such as NTG-N(7-8)-WGATAR or NAG-N(7-8)-WGATAR, thereby promotes the high expression of γ-globin in mature erythrocytes.
Gene editing can be divided into two categories in terms of purposes. One is to destroy the sequence or structure of the target site, which is called knock out. The purpose of another type is to introduce specific sequence mutation at the target site, which is called knock in. To introduce specific sequence mutations, the most common method is to introduce a DNA donor template at the same time, and cells can use the mechanism of homologous recombination to replace the sequence in the template in the genome to achieve knock-in. DNA templates for HDR can be plasmid DNA, linearized double-stranded DNA, AAV-delivered DNA, or single-stranded oligonucleotides (ssODN). ssODN is used more and more in gene editing, as it has high efficiency and is easy to synthesis, which is particularly suitable for gene editing in cultured cells in vitro.
The ssODN structure contains a 5′ homology arm, a substitution sequence, and a 3′ homology arm. Homology arms are sequences that are homologous to the DNA regions flanking the target site intended to be edited, and are used to locate the target site on the chromosome. The corresponding sequences of the 5′ homology arm and the 3′ homology arm on the chromosome are usually discontinuous in position, with a certain space among them, such as an spacer of 1-20 nucleotides, and these spacer sequences are the targets (intended-editing sequence) of gene editing, i.e., the sequence that is desired to be substituted by gene editing. The “substitution sequence” between the 3′ terminus of the 5′ homology arm and the 5′ terminus of the 3′ homology arm on the ssODN is the desired result of gene editing. It is able to perform homologous recombination repair while inducing gene editing by adding ssODN while performing gene editing. As a result of the repair, the intended-editing sequence in the cell genome is repaired into a substitution sequence. The length of the substitution sequence in the ssODN can be shorter than that of the intended-editing sequence, in which case the result of gene editing is the deletion or/and substitution of the intended-editing sequence in the original genome. The length of the substitution sequence in the ssODN can be 0 base, in which case the intended-editing sequence on the genome is deleted. The length of the substitution sequence in the ssODN may be longer than that of the intended-editing sequence, in which case the result of gene editing is that the intended-editing sequence is replaced or/and sequence insertion.
The present disclosure is suitable for gene editing systems mediated by site-specific nucleases such as “CRISPR-Cas”, TALEN and ZFN. “CRISPR-Cas” is a gene editing technology, including but not limited to various natural or artificially designed CRISPR-Cas systems, such as the CRISPR-Cas9 system, CRISPR-Cas12 system, etc. The working principle of CRISPR-Cas9 is that crRNA (CRISPR-derived RNA) combines with tracrRNA (trans-activating RNA) through base pairing to form a tracrRNA/crRNA complex, which guides the nuclease Cas9 protein to cleave double-stranded DNA at the sequence target sites paired with crRNA. The role of tracrRNA and crRNA can also be substituted by an artificially synthesized sgRNA with a guiding role. When using other CRISPR-Cas systems, it is necessary to design the corresponding sgRNA or crRNA. When using systems such as TALEN or ZFN, the corresponding TALEN or ZFN nuclease needs to be designed according to the editing site disclosed in the present disclosure.
By analyzing the sequences (SEQ ID NO: 1 and SEQ ID NO: 2) within about 1.4 kb upstream of the transcription start sites of HBG1 and HBG2, the inventor found that the sequences of many segments satisfy that forming the sequence structure of NTG-N(7-8)-WGATAR or NAG-N(7-8)-WGATAR by changes of a few bases such as substitution, deletion, insertion, etc.. By analyzing whether there is an NGG (PAM recognized by spCas9) sequence near the target mutant base in these segments and whether the cleavage site is on target of the target sequence to be edited, five candidate targeting regions most likely to achieve efficient gene editing were finally obtained (
According to the sequence features of the five candidate regions and the predicted cleavage site position of the CRISPR-Cas9 gene editing system, the suitable mutation target types in each region is obtained by analyzing, as is shown in Table 1 (SEQ ID NO: 3 to SEQ ID NO: 31). The underline herein indicates the base that needs to be substituted, which is the desired result after gene editing. In the present disclosure, the length of the substitution base is 0-6 bases, and the effect achieved is deletion, substitution, insertion, or a combination of deletion, substitution, and insertion.
In order to cleave the DNA double-strand near the editing site of regions shown in Table 1 to form DSB, the spCas9 CRISPR-Cas system with higher cleavage efficiency (derived from Streptococcus pyogenes) is selected in the present disclosure to analyze the target sites with higher cleavage efficiency. Site_1 to site_8 were selected as candidate target sites. The identified target site of DNA sequences and PAM sequences (NGG) are shown in Table 2 (SEQ ID NO: 32 to SEQ ID NO: 39).
According to the expected mutation type shown in Table 1, and the DNA strand (sense strand or antisense strand) recognized by sgRNA in Table 2, the corresponding ssODN is designed to guide gene editing and repair (Table 3, SEQ ID NO: 40 to SEQ ID NO: 65). In the present disclosure, the ssODN structure comprises a 5′ homology arm, a substitution sequence and a 3′ homology arm. The GATA or TATC sequence in NTG-N(7-8)-WGATAR sequence, NAG-N(7-8)-WGATAR sequence, or corresponding reverse complementary sequence thereof of ssODN structure can be located at any position on the ssODN, including the substitution sequence, the 5′ homology arm, the 3′ homology arm, the junction of the 5′ homology arm and the substitution sequence, and the junction of the 3′ homology arm and the substitution sequence. The homology arms on both sides of the ssODN can be symmetric or asymmetric (the lengths on both sides are different). The ssODN with symmetric homology arms is uniformly used for experiments in the present disclosure for the convenience of system comparison. The length of the homology arms at both sides of ssODN is 20 to 300 nt. The ssODN with a homology arm of approximately 40 nt on both sides is used to illustrate in the examples of the present disclosure for the convenience of system comparison. The ssODN can be the sense strand or the antisense strand of the edited region DNA (the homology arm sequence is the same with the corresponding sense strand or antisense strand). In the present disclosure, the DNA strand with the same sgRNA recognition site is uniformly selected by ssODN as the ssODN template sequence. Except for the sequences of homology arms on both sides, the substitution base of the sequence of ssODN can be 0, 1, 2, 3, 4, 5 or 6 bases, and the effect achieved is deletion, substitution, insertion, or a combination of deletion, substitution, and insertion. The effect achieved can be to change more than 1 base on the genome, such as deletion or substitution of 1-20 bases. In the sequences as shown in Table 3 (SEQ ID NO: 40 to SEQ ID NO: 65), the underlined bases are homology arms on both sides with a length of about 40 nt, the non-underlined bases between the homology arms are substitution bases, wherein ssODN_16 has 0 substitution base between the homology arms on both sides. The first three nucleotides at both terminuses of ssODN were modified with phosphorothioate to enhance the stability of ssODN and improve its activity in gene editing.
CCTAGCCAGCCGCCGGCCCCTGGCCTCACTGGATACT
CTATCACTATTGGTCAAGTTTGCCTTGTCAAGGCTAT
TGGTCAAGGC
CCTAGCCAGCCGCCGGCCCCTGGCCTCACTGGATACT
CTATCTCTATTGGTCAAGTTTGCCTTGTCAAGGCTATT
GGTCAAGGC
CCTAGCCAGCCGCCGGCCCCTGGCCTCACTGGATACT
CTATCTTATTGGTCAAGTTTGCCTTGTCAAGGCTATTG
GTCAAGGC
CCTAGCCAGCCGCCGGCCCCTGGCCTCACTGGATACT
CTATCATATTGGTCAAGTTTGCCTTGTCAAGGCTATT
GGTCAAGGC
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCTGTCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCAGTCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCTCTCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCACTCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCTTTCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCATTCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCTATCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATCAATCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATTTATCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATTTTATCAAGGCAAGGCTGGCCAACCCATGGGTGG
AGTTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
ATTATCAAGGCAAGGCTGGCCAACCCATGGGTGGAG
TTTAGCCA
CTCTAAGACTATTGGTCAAGTTTGCCTTGTCAAGGCT
A
TCAAGGCAAGGCTGGCCAACCCATGGGTGGAGTTTA
GCCA
CCTCACTGGATACTCTAAGACTATTGGTCAAGTTTGC
CTTATCAAGGCAAGGCTGGCCAACCCATGGGTGGAG
TTTAGCCA
CCTCACTGGATACTCTAAGACTATTGGTCAAGTTTGC
CTTTATCAAGGCAAGGCTGGCCAACCCATGGGTGGA
GTTTAGCCA
TGGCTAAACTCCACCCATGGGTTGGCCAGCCTTGCCT
TGATAAGGCAAACTTGACCAATAGTCTTAGAGTATCC
AGTGAGG
TGGCTAAACTCCACCCATGGGTTGGCCAGCCTTGCCT
TGATAAAGGCAAACTTGACCAATAGTCTTAGAGTATC
CAGTGAGG
TCCCCACACTATCTCAATGCAAATATCTGTCTGAAAC
GGTAGATAACCCTGGCTAAACTCCACCCATGGGTTGG
CCAGCCTTGCCT
TATCCTCTTGGGGGCCCCTTCCCCACACTATCTCAAT
GCAACAGAAACGGTCCCTGGCTAAACTCCACCCATG
GGTTGGCCAGC
CCTATGCCTAAAACACATTTCACAATCCCTGAACTTT
TCAGATAATTGGTACATGCTTTAACTTTAAACTACAG
GCCTCACTG
CCTATGCCTAAAACACATTTCACAATCCCTGAACTTT
TCAGATAAATTGGTACATGCTTTAACTTTAAACTACA
GGCCTCACTG
CCTATGCCTAAAACACATTTCACAATCCCTGAACTTT
TCAGATAAAATTGGTACATGCTTTAACTTTAAACTAC
AGGCCTCACTG
CCTATGCCTAAAACACATTTCACAATCCCTGAACTTT
TCAGATAGAATTGGTACATGCTTTAACTTTAAACTAC
AGGCCTCACTG
The sgRNA-spCas9 vector (PX459, pSpCas9(BB)-2A-Puro) is used to express the sgRNA corresponding to the target site in Table 2. The vector expresses the spCas9 protein and can express sgRNA with a length of 100 nt. The 5′ terminus of the sgRNA is a guide sequence of 20 nt, and the latter is a general sgRNA backbone sequence of 80 nt (SEQ ID NO: 66, GUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGA AAAA GUGGCACCGAGUCGGUGCUUUU). The method of cloning sgRNA is: synthesize the corresponding guide strand sequence oligo (same as the 20nt target site sequence in Table 2) of the sgRNA and complementary strand oligo (cacc is added to the 5′-terminus of the guide strand sequence, if the first nucleotide at the 5′-terminus of the guide strand is not guanine G, then caccG is added to the 5′-terminus of the guide strand; cacc is added to the 5′-terminus of the complementary strand; if a G is added to the 5′-terminus of the guide strand, then a C is added to the 3′-terminus of the complementary strand for complementary pairing). The two oligos are mixed in equal amounts, subjected to heat deformation and annealed at 95° C., and then ligated into the PX459 vector digested with the restriction enzyme Bbs I. The success of cloning was identified by colony PCR and Sanger sequencing. The PCR was performed using SEQ ID NO: 67 (5′-3′: GGACTATCATATGCTTACCGTAAC) and SEQ ID NO: 68 (5′-3′: GGCGGGCCATTTACCGTAAG) as primers. After successfully cloning, 8 plasmids were obtained, expressing sgRNAs from SEQ ID NO: 69 to SEQ ID NO: 76, respectively (Table 4).
The 8 plasmids expressed were transfected into 293T cells, and the method was referred to the method of Lipofectamine 2000™ kit. The transfected cells were cultured for 4 days, and the cells were collected to extract genomic DNA. SEQ ID NO: 77 and SEQ ID NO: 78, and SEQ ID NO: 79 and SEQ ID NO: 78 were used respectively, to amplify the fragments of HBG1 and HB G2 promoter gene by KOD Plus high-fidelity enzyme PCR method. The resulting PCR products were purified and sequenced using SEQ ID NO: 77 and SEQ ID NO: 79, respectively. The files obtained from the sequencing results were analyzed by synthego software for gene editing efficiency. As shown in
The eight sgRNA-spCas9 vectors were co-transfected with appropriate ssODN into 293T cells to test whether the addition of ssODN could induce gene editing to form the expected mutation type (occurrence of HDR repair). According to the position of the sgRNA cleavage point, the 8 sgRNA-spCas9 vectors were combined according to the sgRNA+ssODN combination method shown in Table 6 to form 30 combinations, respectively. Using Lipofectamine 2000™ reagent, ssODN was added while transfecting the plasmid, and the ssODN and the plasmid were delivered to the cells together. The transfected cells were cultured for 4 days, and the cells were collected to extract genomic DNA. Since the gene editing efficiency of HBG1 and HBG2 is almost positive correlated, and the gene editing efficiency of HBG2 is always slightly higher than that of HBG1 (
Transfection of plasmids into suspension cells using liposomes (e.g. Lipofectamine 2000™ reagent) is relatively inefficient, and transfecting suspension cells are generally introduced by electroporation. Some combinations of sgRNA and ssODN in Example 2 were taken to be introduced into K562 cells by lonza-4D electroporator, with the K562 cell transfection procedure which is built-in. The transfected cells were cultured for 4 days, and the cells were collected to extract genomic DNA. The fragment of HBG2 promoter gene was amplified by PCR to sequencing. The sequencing results analyzed by synthego software for gene editing efficiency, show that the electroporation of combinations sgRNA-1, sgRNA-2, sgRNA-5 and ssODN in K562 can achieve higher editing efficiency (
Compared with the plasmid-expressed CRISPR-Cas system, the delivery of the CRISPR-Cas system in the form of ribonucleoprotein (RNP) has been proved to have a high efficiency in gene editing and it is not easy to induce apoptosis. Commercial spCas9 protein and chemically synthesized sgRNA-1, sgRNA-2, and sgRNA-5 were used (sequences are the same as the corresponding sgRNAs in Table 4, with the terminus chemically modified) to incubate and package RNP-1, RNP-2 and RNP-5 in vitro, respectively. RNPs were delivered in combination with some ssODN into K562 cells by electroporation. The transfected cells were cultured for 4 days, and the cells were collected to extract genomic DNA. The HBG2 promoter gene fragment was amplified by PCR to conduct sequencing. The sequencing results analyzed by synthego software for gene editing efficiency, show that (
After resuscitation, mobilized peripheral blood-derived human CD34-positive cells (mPBSCs) were cultured for two days in StemSpan serum-free expansion medium (SFEM) enriched with human cytokines (SCF, TPO, Flt3L, 100 ng/ml each). ssODN and spCas9/sgRNA RNPs were delivered into mPBSCs by the EO-100 program built in the lonza-4D electroporator. After electroporation, the cells were cultured in SFEM enriched with human cytokines (SCF/TPO/F1t3L, 100 ng/ml each) for one day, and induced to differentiate erythrocytes in three stages. The first stage: culture in IMDM medium containing additives such as EPO, SCF, human AB serum, insulin, transferrin, heparin, IL-3, hydrocortisone, etc. for 7 days. On the 5th day of induced differentiation, 2×10e5 cells were taken out to extract genomic DNA, and then the HBG2 promoter fragment was amplified by PCR and sent to Sanger sequencing. Similar to the results in K562 cells, transduction of ssODN and spCas9/sgRNA RNPs by electroporation in mPBSCs resulted in efficient gene editing, with partial genome editing resulting in 30% of the expected mutation types (
Electroporated mPBSC cells and untreated mPBSC cells (control, NC) were induced for 7 days of the first stage of differentiation, and then were cultured in IMDM medium containing additives such as EPO, SCF, human AB serum, insulin, transferrin, and heparin, etc. for 4 days, and were finally cultured in IMDM medium containing additives such as EPO, human AB serum, insulin, transferrin, and heparin, etc. for 7 days. After 18 days of differentiation induction, both the electroporated mPBSCs and the untreated mPBSCs were observed dark red after centrifugation, which contained a large number of erythrocytes, indicating that the transfected mPBSCs with electroporation could differentiate into erythrocytes normally.
Near the end of differentiation induction, some cells were taken to extract mRNA, and the expression level of HBG mRNA was detected by reverse transcription-real-time quantitative PCR (RT-qPCR). Compared with non-transfected cells (NC), co-transfection of RNP-2 with ssODN_6, ssODN_13, ssODN_14, ssODN_15 and ssODN_17 all increased the expression level of HBG mRNA (
The glycosylated hemoglobin Alc detection system (VARIANT II TURBO HbA1c kit-2.0) was used to detect the hemoglobin of the erythrocytes differentiated from hematopoietic stem cells with electroporation transduction in the RNP-2 group by HPLC. The results showed that, compared with cells without gene editing, cells added with ssODN for gene editing has a significant increase in the content of fetal hemoglobin (HbF), ranging from 27% to 36%, which has more than 2-fold upregulation compared with the control group of 13.3% (
Number | Date | Country | Kind |
---|---|---|---|
202010616648.7 | Jul 2020 | CN | national |
This application is a National Stage entry application of PCT International Application No. PCT/CN2020/101936, filed with the China National Intellectual Property Administration (CNIPA) on Jul. 14, 2020. International Application No. PCT/CN2020/101936 claims priority to and benefits of Chinese Patent Application No. 202010616648.7, filed with the CNIPA on Jul. 1, 2020. The entire contents of all of the above-identified applications are incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2020/101936 | 7/14/2020 | WO |